BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10008
         (214 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
 gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
           scapularis]
          Length = 770

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/172 (37%), Positives = 93/172 (54%), Gaps = 20/172 (11%)

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
           + +P +A G   T+   S  + M  S  KGR ++A  D+  GD +FVE+P A V+LP  +
Sbjct: 225 SQVPDVAYGAHGTVPSCSMAVDMLYSTEKGRFLVANRDLQPGDAIFVERPYASVLLPGHT 284

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
            +NC HC   +   +PC +C    YCS +C +++W  YHRWEC G L LL +VGIAHLA+
Sbjct: 285 KTNCQHCHKRLLNAVPCAQCNQVRYCSFACAKDSWNSYHRWEC-GNLNLLYSVGIAHLAV 343

Query: 181 KLILVSSHS-------------------DRYKEVYHLETHLQDMRPEDLYQY 213
           +++LV+  S                     Y  V+ L TH + M  EDL QY
Sbjct: 344 RVLLVTGLSGLADFCRHLAEGKVDVDKNGGYGSVHELVTHSEKMHVEDLLQY 395


>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 749

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 62/173 (35%), Positives = 101/173 (58%), Gaps = 6/173 (3%)

Query: 47  MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
           M+ ++ S++ ++   SLP +A GE+     AS  ++++ +  KGR+V+A  D+ +G  LF
Sbjct: 198 MEENADSTTDTAEFPSLPDVAFGENPNFRFASAAVEIKCTPKKGRYVVANRDIKRGQILF 257

Query: 107 VEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
           VEK  AFV L      NC +CC +S + P+PC EC+ ++YC+ +C  EA   YHRWEC G
Sbjct: 258 VEKAFAFVPLCHIKSDNCYNCCRSSGNTPVPCTECVDSIYCNITCWDEAHSSYHRWECLG 317

Query: 166 -ALRLLEAVGIAHLALKLILV----SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
             + L   +GIA+LA++++       +  +R  EV  L T+   + P D+  Y
Sbjct: 318 NQMGLWAEIGIAYLAVRMLFKCTTDVTDDNRLDEVQKLVTNFSKLPPGDVISY 370


>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 737

 Score =  119 bits (298), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 61/157 (38%), Positives = 92/157 (58%), Gaps = 6/157 (3%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP-PCSM 121
           LP  A GE+     AS  + ++ S  KGRHV+A  DV KG+ LFVEKP AFV+L    S 
Sbjct: 198 LPDPAFGENPDFPNASAALDLKFSVDKGRHVVANRDVQKGEVLFVEKPFAFVLLDNEYSD 257

Query: 122 SNCNHCCT-SISAPIPCNECILAVYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAHLA 179
           + C +C       P+PC  C   VYC+E CR++AW  YH+WEC G  + + + +GIAHL 
Sbjct: 258 AVCANCLKFRGDVPVPCKFCASTVYCTEQCRKKAWSTYHQWECFGNQIGIWDQIGIAHLT 317

Query: 180 LKLILVSSHSD---RYKEVYHLETHLQDMRPEDLYQY 213
           ++  L   ++D   ++ E+  L T++  +  +D++ Y
Sbjct: 318 VRTFLNCCYTDDTKKFNEIQRLVTNIDKIATQDMFVY 354


>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Metaseiulus occidentalis]
          Length = 775

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/150 (43%), Positives = 82/150 (54%), Gaps = 14/150 (9%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS+ +    S  KGR ++A  DV  GD LFVE P A V+LP  + ++C+HC   I A  P
Sbjct: 244 ASSALDCLYSTEKGRFIVANRDVRPGDNLFVESPYASVLLPAFTKTHCHHCYRRIKAAFP 303

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD------ 190
           C +C    YCS SC  E+W  YH  EC G L LL +VGIAHLA ++ILV+          
Sbjct: 304 CRQCAQVRYCSMSCSGESWNLYHSQEC-GNLDLLISVGIAHLAERVILVTGLGLIKDFMK 362

Query: 191 -------RYKEVYHLETHLQDMRPEDLYQY 213
                   Y  VY L TH +DM   DL+QY
Sbjct: 363 SCNTLECSYLPVYQLVTHEEDMHIADLFQY 392


>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 333

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/165 (36%), Positives = 98/165 (59%), Gaps = 6/165 (3%)

Query: 55  SASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV 114
           + S S   L  L  GE+     AS  I+++ +  KGR+V+A  ++ KG+TLFVE    FV
Sbjct: 28  TESPSQPKLKRLKSGENPNFRSASASIEVKHAPGKGRYVVANRNIKKGETLFVEDAFTFV 87

Query: 115 ILPPC-SMSNCNHCCTS-ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA-LRLLE 171
           ++      ++C++CC S +  P+PC EC+  ++C+  C  EA L YHRW C  + + LL 
Sbjct: 88  LVNNNKDNTHCHNCCKSYLDVPVPCTECVDTLFCNMKCWDEACLSYHRWLCLASQMGLLR 147

Query: 172 AVGIAHLALKLILV---SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            +GI HLALKL+ V   +++++++ +V  L T+   + PED + Y
Sbjct: 148 QIGITHLALKLLCVCATTTNNEKFNDVQQLVTNFNKLTPEDAFSY 192


>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 742

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 85/143 (59%), Gaps = 2/143 (1%)

Query: 73  TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           TL   S K+++  S AKGR+ +A   +  G+ L  E P AFV+LP    S C +CC  + 
Sbjct: 240 TLPFRSPKMELCYSEAKGRYFVASTKIEYGELLIFENPFAFVLLPEYYNSFCYNCCVPLK 299

Query: 133 A-PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD- 190
              IPC+ C   ++C + C +EA   YHRWEC     + + +GIAHLAL+L + +S ++ 
Sbjct: 300 YYSIPCDNCCTILFCGDKCLQEARNSYHRWECKQGTSIFKCIGIAHLALRLTIETSQANS 359

Query: 191 RYKEVYHLETHLQDMRPEDLYQY 213
              ++Y+L TH+ D++  +LYQY
Sbjct: 360 NNDQIYNLLTHIDDLKSLELYQY 382


>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 735

 Score =  110 bits (275), Expect = 4e-22,   Method: Composition-based stats.
 Identities = 77/219 (35%), Positives = 111/219 (50%), Gaps = 11/219 (5%)

Query: 3   KQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQ--VSSTSSSASSSV 60
           +Q  +ETLS    +    D      K+   +IEK   N   A S Q  ++  S   +  +
Sbjct: 152 QQLAKETLSKIQTMIHDSDYIAPSMKD---SIEKKINNITVAESFQQNIAQNSVQDTDDL 208

Query: 61  ASLPSLACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
             L S    E +T    AS  I    S   GRHVIA + + KGD LF+EKPV+FV+L   
Sbjct: 209 LKLKSHIMFEENTNFPNASVAIDRIYSQELGRHVIANKSIKKGDILFLEKPVSFVLLSHD 268

Query: 120 SMSNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAH 177
           + + C HC C++   P+PC  C+   YC+E C  +AW  YH WEC G  + L + +GI H
Sbjct: 269 TYNLCPHCNCSNTDIPVPCTTCLNTFYCNEYCLNKAWSLYHCWECPGNQMELWKEIGIGH 328

Query: 178 LALKLILVSS---HSDRYKEVYHLETHLQDMRPEDLYQY 213
           LALK++L  S      ++ E+ +L T+   +   DL  Y
Sbjct: 329 LALKVLLTCSTMTDETKFNEIQNLVTNFDKLSMGDLTVY 367


>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 669

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 32/199 (16%)

Query: 47  MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
           M  S+ S      +   P L+ G +  + C S+ + +++S  KGR+ +A  DV  GD L 
Sbjct: 210 MMESTESVGLQKKLKIPPKLSYGSNLHVPCMSSAVTIESSPDKGRYAVATRDVKVGDVLI 269

Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA 166
           VE P + V L PC++S+C+HC   + A IPC +C   VYCS+ CR  +W  YH  ECH  
Sbjct: 270 VENPYSSVGLQPCNVSHCHHCYIRVLASIPCLQCAGIVYCSKECRNASWEMYHNLECH-H 328

Query: 167 LRLLE--AVGIAHLALKLILVSS-----------------------------HSDRYKEV 195
           L L++   +G+ HLAL+ I+ +                              +   Y  V
Sbjct: 329 LDLIQELGLGMGHLALRTIIRTGLAFLLKFREQSANVNIPDESFHGCTVDGEYESNYYSV 388

Query: 196 YHLETHLQDMRPEDLYQYV 214
           Y+L  H +D +P DL++ V
Sbjct: 389 YNLVGHSEDRKPGDLFKRV 407


>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 662

 Score =  108 bits (271), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 55/129 (42%), Positives = 82/129 (63%), Gaps = 5/129 (3%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
           GRHVIA + + KGD LF+EKPV+FV+L   + + C HC C +   P+PC  C+   YC+E
Sbjct: 159 GRHVIANKSIKKGDILFLEKPVSFVVLSHDTYNRCPHCNCQNADIPVPCTTCLNTFYCNE 218

Query: 149 SCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV-SSHSD--RYKEVYHLETHLQD 204
            C  +AW  YH WEC G  ++L E +GI HLALK++L  S+ +D  ++ E+ +L T+   
Sbjct: 219 YCSTKAWSLYHCWECPGNQMKLWEEIGIGHLALKVLLTCSTMTDEIKFNEIQNLVTNFDK 278

Query: 205 MRPEDLYQY 213
           +  ++L  Y
Sbjct: 279 LSMDELTVY 287


>gi|449672885|ref|XP_002160254.2| PREDICTED: SET and MYND domain-containing protein 4-like [Hydra
           magnipapillata]
          Length = 729

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 107/233 (45%), Gaps = 43/233 (18%)

Query: 23  SHLDFKE--DVLTIEKSEE-NSVEAMSMQVSSTS---SSASSSVASLPSL--ACGESHTL 74
           S L FK+   VL  E SEE N VE +   V       + A + V  L S     G    L
Sbjct: 157 SQLSFKKCLTVLPTEFSEEKNQVEMLLSSVCKFDLAHTVAVNEVVELDSFIPVNGFDKIL 216

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
              S+ I++   + +GR++ A ED+  G  +  EKP A V+LP    ++C  C   + + 
Sbjct: 217 TRGSSSIEVGYDSKQGRYIFAKEDIPNGSIIISEKPYAAVLLPHWYKTHCQLCFDKVVSL 276

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD---- 190
            PC EC   V+CS SC  +AW  YHR+EC   L L+E VGIAHL+L+++LVS   D    
Sbjct: 277 FPCYECAEVVFCSLSCYNDAWATYHRFECK-KLSLMEKVGIAHLSLRIVLVSDAKDLLRF 335

Query: 191 ------------------------------RYKEVYHLETHLQDMRPEDLYQY 213
                                          Y+ VY L TH   +  EDL+QY
Sbjct: 336 LGSDLNKFTDSPTLPSSKIEGCNDQGIYQGDYESVYFLSTHSDRLPIEDLFQY 388


>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
          Length = 732

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/171 (37%), Positives = 93/171 (54%), Gaps = 27/171 (15%)

Query: 68  CGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
           CGE+ +   AS+ + +  S  KGR+V+A  D+  G+TLFVE+P A V+LP    S C+HC
Sbjct: 219 CGENSSFKGASSALNISHSKEKGRYVVANRDIKAGETLFVEQPNALVVLPDFQTSRCHHC 278

Query: 128 CTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL-- 184
               SA   PC  C    +CS+SCR+E+   YH +EC G   +L +VGIAHL  +++L  
Sbjct: 279 TRHSSAKRYPCLACGKIWFCSDSCRQESSC-YHNFEC-GLEAVLNSVGIAHLGARIVLSH 336

Query: 185 -------VSSHSDRYKEV---------------YHLETHLQDMRPEDLYQY 213
                      +D+ K+V               +HL +H + M PE+LYQY
Sbjct: 337 GLDSVLAFLKDTDKVKKVPGIDGPYDTKSYQVMFHLVSHTERMAPEELYQY 387


>gi|322800492|gb|EFZ21496.1| hypothetical protein SINV_13685 [Solenopsis invicta]
          Length = 677

 Score =  107 bits (268), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 96/172 (55%), Gaps = 5/172 (2%)

Query: 47  MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
           M+  + +++ ++   S P +A GE+     AS  I+++ +  KGR+V+A  D+ +G  L 
Sbjct: 243 MEDDADNTTDTTEFPSQPEVAYGENPNFRFASAAIEVKYTAEKGRYVVANRDIKRGQVLI 302

Query: 107 VEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
           VEK  AFV L    +  C +CC +S   P+PC EC+ + YC+  C  EA   YHRWEC G
Sbjct: 303 VEKAFAFVPLSHVKVDICYNCCRSSGDTPVPCTECVDSTYCNIDCWDEARSSYHRWECLG 362

Query: 166 -ALRLLEAVGIAHLALKLIL---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
             + L   +GI++LA++++     ++  +R  EV  L T+   + P D+  Y
Sbjct: 363 NQMGLWPQIGISYLAIRMLFKCTTATDDNRLNEVQELVTNFSKLPPCDVISY 414


>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 628

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 61/175 (34%), Positives = 94/175 (53%), Gaps = 9/175 (5%)

Query: 48  QVSSTSSSASSSVAS---LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDT 104
           Q+  TS   +    S   LP +  GE+  L  AS  + ++ +  KGRH++A  D+ +   
Sbjct: 79  QMVDTSDDVTGVTESHRILPEILFGENPNLPFASTALDIKYTPQKGRHMVANRDIKRSQI 138

Query: 105 LFVEKPVAFVILPPCSMSN-CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWE 162
           LFVEKP +FV L     S+ C +CC S    + PC+ CI  V+C  +C  EA+  +HRWE
Sbjct: 139 LFVEKPFSFVTLECDGASDLCENCCRSCGKNLTPCSGCIDTVFCDVNCWNEAYSSHHRWE 198

Query: 163 CHG-ALRLLEAVGIAHLALKLIL---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
           C G  +RL + +G+AHL LK +L   +++ +  +  V  L T   ++   DL  Y
Sbjct: 199 CVGNQMRLWQQIGVAHLGLKTLLKCTMTTDNSMFNRVQQLVTGFDNLSANDLIVY 253


>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 691

 Score =  103 bits (257), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/147 (40%), Positives = 88/147 (59%), Gaps = 7/147 (4%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHC- 127
           E+     AS  I ++ +   GRHV+A + ++KG+ LFVEKP++ V++P  S+S+ C++C 
Sbjct: 174 ENKNFPNASASIDIKYNEEVGRHVVANKFINKGEILFVEKPISIVLIPNESISDRCHNCN 233

Query: 128 CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA-LRLLEAVGIAHLALKLIL-V 185
           C     PIPC  C L  YC E C  EAW  YH WEC G+ +RL   VGI HLA K++   
Sbjct: 234 CFIGDIPIPCKTC-LYTYCGEKCLNEAWSLYHCWECPGSQMRLWGQVGITHLASKVLFNC 292

Query: 186 SSHSD--RYKEVYHLETHLQDMRPEDL 210
           S+ SD  R+ ++ +L +H   +   DL
Sbjct: 293 STMSDKIRFNQLQNLVSHFDKIPDADL 319


>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Apis mellifera]
          Length = 444

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 51/129 (39%), Positives = 81/129 (62%), Gaps = 5/129 (3%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSE 148
           GRHVIA + + +GD LF+E+P++FV+L   + S C +C   +   P+PC  C+   YC+E
Sbjct: 155 GRHVIANKFIKEGDILFLEEPISFVLLNHDTYSYCQYCNNLNTDIPVPCRTCLNTFYCNE 214

Query: 149 SCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV-SSHSD--RYKEVYHLETHLQD 204
           +C  +AW  YH WEC G  + L + +GI HLALK++L  S+ +D  ++ E+ +L T+   
Sbjct: 215 NCLTKAWSSYHCWECPGNQMNLWKEIGIGHLALKVLLTCSTITDKIKFNEMQNLITNFDK 274

Query: 205 MRPEDLYQY 213
           +  +DL  Y
Sbjct: 275 LSIDDLTIY 283


>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
 gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
          Length = 634

 Score = 98.2 bits (243), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 106/219 (48%), Gaps = 20/219 (9%)

Query: 11  SSSLK-LCRLQDTSHLDFKEDVLTIE------KSEENSVEAMSMQVSSTSSSASS----- 58
           + SLK LCR++++  + F+E +++ +      K  E+ V+ +  Q+S  S+   +     
Sbjct: 118 ARSLKALCRIEESKKV-FEEAIVSSKTSKLKRKDRESWVKQVEKQISECSNQTENLMKSV 176

Query: 59  -------SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
                  S  S P    G     +  S  + ++ +  KGRH IA  D++ GD L VEKP 
Sbjct: 177 THLEEARSGFSTPVQNYGLHEKFVSISKALDIKYTEEKGRHTIAARDINIGDVLLVEKPF 236

Query: 112 AFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           A V+L   S S+C+ C   I AP+PC+ C    YCSE C +E+W  YH  EC     +  
Sbjct: 237 ASVLLQEQSKSHCHQCFVHILAPLPCSYCTTVRYCSEKCAKESWDAYHYAECMNLEHVYV 296

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHLETHLQDMRPEDL 210
           A    HLAL++++ +        V   E+  +   P +L
Sbjct: 297 AGKYGHLALRVVVKAGFQYLKASVKQFESEEKKCDPAEL 335


>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
          Length = 713

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 52/139 (37%), Positives = 75/139 (53%), Gaps = 9/139 (6%)

Query: 53  SSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA 112
           + S    +  +P    GE+  L  AS K+KM  +  KGRH+IA  ++ +GD LF+EK  A
Sbjct: 186 TDSGQECLHEIPVFPDGENKCLAYASTKVKMSYNKDKGRHIIASTNIQRGDLLFIEK--A 243

Query: 113 FVILPPCSMSN------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG- 165
           FV  P    +       C +C + I + +PCN C + VYC E CR + W + HRWEC G 
Sbjct: 244 FVFAPVFKENKEFYSFKCYNCLSDIVSGVPCNTCTVCVYCDEKCRDQNWTKCHRWECDGM 303

Query: 166 ALRLLEAVGIAHLALKLIL 184
              +   +GIA  A K++L
Sbjct: 304 QANIWFDLGIALPAFKVML 322


>gi|345483949|ref|XP_003424915.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4-like [Nasonia vitripennis]
          Length = 633

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 88/184 (47%), Gaps = 37/184 (20%)

Query: 62  SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
           +LP +   E   +  AS+ I ++ +N  GRH++A  D+  G+ L +EKP A V+  P   
Sbjct: 204 TLPEIIRNE--IIPSASDAISIKYTNVFGRHLVASRDIDVGEILVIEKPYASVLSSPNIY 261

Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLEAVGIAHLA 179
           ++C+ C       IPC++CI A+YCSE CR EAW +YH  EC   G L  L    ++ L+
Sbjct: 262 THCSQCFKRTWDSIPCDKCIYAMYCSEKCRSEAWQQYHDIECSIKGYLIGLRMNDLSGLS 321

Query: 180 LKLILV---------------------------------SSHSDRYKEVYHLETHLQDMR 206
           LKL ++                                 + HSD Y+  Y L TH Q+ +
Sbjct: 322 LKLAILAVRESGSIRKLRKKIRKIDDCEDYLMKSISEDGTYHSDEYRPFYSLVTHEQNRK 381

Query: 207 PEDL 210
             DL
Sbjct: 382 RYDL 385


>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
 gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
           corporis]
          Length = 646

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 57/176 (32%), Positives = 90/176 (51%), Gaps = 30/176 (17%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LP    G S+T   AS+ I +  +   GRHV A+ D+  GD LFVEKP A+++       
Sbjct: 143 LPVPVGGVSNTFPSASSYINVFYNETLGRHVKALHDIEVGDVLFVEKPFAYILFHEV--- 199

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
            C++C     +PIPC++C+  +YCS  C+++     H +EC G  +  +++GIAHL+L+L
Sbjct: 200 -CHNCAKQTISPIPCDKCLDVIYCSVECKKKFEKSVHSFEC-GKGKFFKSIGIAHLSLRL 257

Query: 183 ILVSS-------------------------HSDRYKEVYHLETHLQDMRPEDLYQY 213
           IL +S                          ++ Y  VY L T+L+    + L+QY
Sbjct: 258 ILAASSNYKELKKLISSYTETKYFDGIVPCENNSYHNVYGLITNLEKYEGKHLFQY 313


>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
           magnipapillata]
          Length = 1044

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 43/108 (39%), Positives = 66/108 (61%), Gaps = 1/108 (0%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
            S  +K+ ++N+KGRH IA  D+  G+ + +EKP A + LP C  ++C HC T      P
Sbjct: 226 TSEALKIVSTNSKGRHAIASRDIKAGEVIIIEKPFASLCLPECYNTHCYHCLTRFKINYP 285

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLI 183
           C  C    YCS SC +E+W ++H +EC +  + + + VG+AHLA K+I
Sbjct: 286 CRLCSTVNYCSISCEKESWEKFHCFECEYLGVLINDDVGLAHLAFKII 333


>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
          Length = 776

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/128 (39%), Positives = 74/128 (57%), Gaps = 10/128 (7%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LP    G +   + AS  I+++   ++GRHV+AVE   +GD LF E+P A+V LP    +
Sbjct: 223 LPDCYLGPNPDFVSASKSIELRYKESRGRHVVAVEPAGRGDVLFSEEPYAWVALPS-DDA 281

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL-RLLEAVGIAHLALK 181
            C  CC +   P+P       VYC   C   A + +HRWEC GA   L   +GIAHLAL+
Sbjct: 282 ICEMCCDTDINPVP-------VYCGCECASRA-ISFHRWECVGAQCSLFPTIGIAHLALR 333

Query: 182 LILVSSHS 189
           ++L+S+++
Sbjct: 334 VLLISTNN 341


>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
 gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
          Length = 750

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 87/165 (52%), Gaps = 10/165 (6%)

Query: 27  FKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTS 86
            ++ +L ++KS        S   + +S SA  S    P +A G        S+ +K+   
Sbjct: 115 IRDKLLKLQKSCLRKQARHSQNKNGSSGSACLSGGRHPKMANG--------SSLLKINYD 166

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
             +GR + A  ++  GDTL  E+P + V+LP  + ++C  C  S+ AP+PCN C   +YC
Sbjct: 167 QNQGRFLQASSEIRAGDTLIAEEPYSAVLLPENAKTHCECCYKSLVAPVPCNHCSSVLYC 226

Query: 147 SESCRREAWLRYHRWECHGALRLLEAVG-IAHLALKLILVSSHSD 190
           S +CR +AW +YH  EC     +LE V    HL+L+++L +S  D
Sbjct: 227 SAACRNKAWSQYHHVECE-IFPVLEIVDTFTHLSLRILLTTSAKD 270


>gi|270016446|gb|EFA12892.1| hypothetical protein TcasGA2_TC004406 [Tribolium castaneum]
          Length = 1108

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 52/176 (29%), Positives = 95/176 (53%), Gaps = 21/176 (11%)

Query: 54  SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
           ++A   V + P+    ++ ++ CA N I++  +   GRHV+A  ++  G+ L +EKP A 
Sbjct: 140 NNAYDPVPTFPAFE--KNASIQCARNCIEIMQNERYGRHVVATRNIEMGEILAIEKPYAS 197

Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLE 171
           ++    S+  C+ C       IPC++C  A+YCS++C+ +A+  YH++EC  H +L  L 
Sbjct: 198 IVTDSVSVY-CHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKYECPIHLSLDPLL 256

Query: 172 AVGIAHLALKLILVSSH----------------SDRYKEVYHLETHLQDMRPEDLY 211
                 LAL++ L+S +                SDR+KEV++L+ +++     DL+
Sbjct: 257 IDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDLF 312


>gi|157169545|ref|XP_001657892.1| hypothetical protein AaeL_AAEL001050 [Aedes aegypti]
 gi|108883672|gb|EAT47897.1| AAEL001050-PA [Aedes aegypti]
          Length = 662

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 86/172 (50%), Gaps = 7/172 (4%)

Query: 30  DVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTS--- 86
           D   +++  E+  +  +++    +         LP L+ G +  LM AS KI +  S   
Sbjct: 210 DDYRLQRLREDLEDLRTLKAEDAAMKPKEPEPVLPELSGGVNEELMGASKKISLSGSLED 269

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
           + +GR+++A ED+  G+T+ VE      + P    ++CN C T + A +PC +C    +C
Sbjct: 270 SNRGRYIVAAEDLKSGETVLVEPAYGACLYPKYFGTHCNQCFTRLVASVPCEQCCGVAFC 329

Query: 147 SESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRYKEV 195
           S  CR  A   YHR+EC   L L+   G++   HLA +L+  +   ++  E+
Sbjct: 330 SPECRDRACAGYHRFECQ-YLDLMIGSGMSILCHLAFRLVTQAGSPEKAAEI 380


>gi|321479127|gb|EFX90083.1| hypothetical protein DAPPUDRAFT_309882 [Daphnia pulex]
          Length = 732

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 4/140 (2%)

Query: 54  SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
           S+A+++  +LP L  G +  +  AS  + M+  +  GR+ +A +D+  G TL  EKP A 
Sbjct: 264 STANANTNALPELVSGANSNIPNASKSLAMEVDDNSGRYYVAADDIKPGQTLVCEKPYAA 323

Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
            +LP    S+C+HC   + AP+ C  C    YCS  CR EA   YH++EC G +  + A 
Sbjct: 324 CLLPGKFTSHCHHCFVRLIAPLGCLTCRGVFYCSVECRDEAASTYHQYEC-GIIDYMIAS 382

Query: 174 G---IAHLALKLILVSSHSD 190
           G   ++ +AL+++      D
Sbjct: 383 GSSILSWIALRILTKGKMED 402


>gi|189242482|ref|XP_001810915.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
          Length = 538

 Score = 87.4 bits (215), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 95/176 (53%), Gaps = 21/176 (11%)

Query: 54  SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
           ++A   V + P+    ++ ++ CA N I++  +   GRHV+A  ++  G+ L +EKP A 
Sbjct: 63  NNAYDPVPTFPAFE--KNASIQCARNCIEIMQNERYGRHVVATRNIEMGEILAIEKPYAS 120

Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLE 171
           ++    S+  C+ C       IPC++C  A+YCS++C+ +A+  YH++EC  H +L  L 
Sbjct: 121 IVTDSVSVY-CHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKYECPIHLSLDPLL 179

Query: 172 AVGIAHLALKLILVSSH----------------SDRYKEVYHLETHLQDMRPEDLY 211
                 LAL++ L+S +                SDR+KEV++L+ +++     DL+
Sbjct: 180 IDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDLF 235


>gi|345483845|ref|XP_003424896.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
           [Nasonia vitripennis]
 gi|345483847|ref|XP_001604625.2| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
           [Nasonia vitripennis]
          Length = 632

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 79/155 (50%), Gaps = 3/155 (1%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESH-TLMCASNKIKMQTSNAKGRHV 93
           +K++E   + +      T       ++S  SL   + H  + CAS+ I ++   A GRH 
Sbjct: 171 DKNKEQLEQKIQTATQITELPKVEDISSKKSLLKFKRHERISCASDAIDLKYDEANGRHT 230

Query: 94  IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRRE 153
           +A  D++ G+ L  EKP A ++ P    ++C+HC       IPC  CI A+YCS  CR +
Sbjct: 231 VANRDINVGEILVFEKPYALLLKPERIYTHCSHCFIRAWDSIPCPNCIHAMYCSTKCRDK 290

Query: 154 AWLRYHRWEC--HGALRLLEAVGIAHLALKLILVS 186
           AW +YH  EC   G    L    +A  +LKL +++
Sbjct: 291 AWEQYHDIECPIKGYFLSLTMNDLAPFSLKLAVLA 325


>gi|195376269|ref|XP_002046919.1| GJ12224 [Drosophila virilis]
 gi|194154077|gb|EDW69261.1| GJ12224 [Drosophila virilis]
          Length = 661

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 8/155 (5%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P+LA GE+  L  A+N +K+  +  KGR V+A E +  GD +  E PVA  +LP    +N
Sbjct: 240 PALAHGENPALSGAANVVKLVETKEKGRFVVANEGLKTGDVVLCENPVAACLLPNFFGTN 299

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL---AL 180
           C+HC   +  P+PC  C    +CS  C  EA   YHR+EC   + L    G++ L   AL
Sbjct: 300 CHHCFKRLHIPVPCLHCSGIAFCSAQCMGEACETYHRFECQ-FMDLFIGSGMSILCFVAL 358

Query: 181 KLIL-VSSHSDRYKEVYHLETHL---QDMRPEDLY 211
           ++    +S  D       L  HL   QD+R  D Y
Sbjct: 359 RVFTQAASLEDGLTTANLLFEHLCSHQDVREADDY 393


>gi|195012821|ref|XP_001983754.1| GH16068 [Drosophila grimshawi]
 gi|193897236|gb|EDV96102.1| GH16068 [Drosophila grimshawi]
          Length = 662

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 55/157 (35%), Positives = 82/157 (52%), Gaps = 9/157 (5%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P+LA GE+  L  A+N +K+  +  KGR V+A E +  GD +  E PVA  +LP    +N
Sbjct: 241 PALAHGENPALSGAANVVKLVETKDKGRFVVANEGLKTGDVVLSENPVAACLLPNFFGTN 300

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
           C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + L    G++   ++AL
Sbjct: 301 CHHCFKRLHTPVACLHCSGIAFCSAQCMGEACETYHRFECQ-FMDLFIGSGMSILCYIAL 359

Query: 181 KLIL-VSSHSDRYKEVYHLETHL---QDMR-PEDLYQ 212
           ++    +S  D  +    L  HL   QD+R P+D  Q
Sbjct: 360 RIFTQAASLEDGLQTSNLLFEHLVSHQDVRQPDDYLQ 396


>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 644

 Score = 84.0 bits (206), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/121 (35%), Positives = 65/121 (53%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
           ES  L  AS  +K+     KGR+++A E + +G+ +  EKP   V+LP    + C HC  
Sbjct: 55  ESKLLSNASKAVKLTNHGEKGRYILATETICRGEIIIKEKPYGCVLLPSHYNTRCYHCVR 114

Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
              APIPC  C    YCS  C++E+W  YH  EC     L +A   + L+L+++L +  S
Sbjct: 115 KTVAPIPCCTCTHVRYCSVECQQESWKSYHYIECPLWPFLSQAGNFSQLSLRILLKAGWS 174

Query: 190 D 190
           +
Sbjct: 175 N 175


>gi|117935365|gb|ABK56990.1| hypothetical protein GIP_L1_00040 [Glyptapanteles indiensis]
          Length = 561

 Score = 83.6 bits (205), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 71/124 (57%), Gaps = 3/124 (2%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LPS+   ++  +  A++ + ++ S   G+HV A  D+  G+ L V +  A V++   + +
Sbjct: 213 LPSIIV-DNEEICGATSALAVEFSEQYGKHVRATRDIKVGEVLSVNEGYATVLMLDKTYT 271

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLAL 180
           +C HC     + IPCN CI AV+C E CRREAW  YH  EC   G +  +E   +A +AL
Sbjct: 272 HCAHCLKQTWSAIPCNFCIYAVFCGEDCRREAWKGYHEVECRVTGPMVAMEMNHMALMAL 331

Query: 181 KLIL 184
           +L++
Sbjct: 332 RLLV 335


>gi|190702163|gb|ACE75061.1| conserved hypothetical protein [Glyptapanteles flavicoxis]
          Length = 565

 Score = 82.8 bits (203), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 51/162 (31%), Positives = 85/162 (52%), Gaps = 5/162 (3%)

Query: 25  LDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ 84
           +D KE  + + +   NS+  MS   S  +         LPS+   ++  +  A++ + ++
Sbjct: 177 MDGKESGVKLVEKTLNSLHLMSQ--SEAAFVKLDDKRHLPSIIV-DNEEICGATSALAVE 233

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
            S   G+HV A  D+  G+ L V +  A V++   + ++C HC     + IPCN CI A+
Sbjct: 234 FSEQYGKHVRATRDIKVGEVLSVNEGYATVLMLDKTYTHCAHCLKQTWSAIPCNFCIYAL 293

Query: 145 YCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLIL 184
           +C E CRREAW  YH  EC   G +  +E   +A +AL+L++
Sbjct: 294 FCGEDCRREAWEGYHEVECRVTGPMVAMEMNHMALMALRLLV 335


>gi|194869827|ref|XP_001972529.1| GG13835 [Drosophila erecta]
 gi|190654312|gb|EDV51555.1| GG13835 [Drosophila erecta]
          Length = 665

 Score = 82.8 bits (203), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 9/159 (5%)

Query: 59  SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           S   +P LA GES  L+ AS  +++  +  KGR V+A E +  GD L  E+PVA  + P 
Sbjct: 236 SAEQVPQLAHGESAELIGASKVVRLVETKDKGRFVVANEGLRAGDVLLCEEPVAACLEPT 295

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
              ++C+HC   +S P+ C  C    +CS  C  EA   YHR+EC   + L+   G++ L
Sbjct: 296 YFGTHCHHCFKRLSTPVSCLHCSGVAFCSAQCMGEASSSYHRFECE-YMDLMIGSGMSIL 354

Query: 179 ALKLILVSSHSDRYKE--------VYHLETHLQDMRPED 209
               + V + +   ++          HL +H ++ +P+D
Sbjct: 355 CFIALRVFTQAPSLEQGLATANLLFEHLCSHEEERQPDD 393


>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
 gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
          Length = 660

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 79/160 (49%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  GD L VE+P   V+L   + ++C +C      P+ C  C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGDELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR+EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 301 ADVLYCSEQCRQEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 360 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399


>gi|443689223|gb|ELT91670.1| hypothetical protein CAPTEDRAFT_90747 [Capitella teleta]
          Length = 673

 Score = 82.0 bits (201), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 71/156 (45%), Gaps = 31/156 (19%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGRH +AV+++  GDT+  E P  +V+ P      C HC   ++ P+ C +C+   YCSE
Sbjct: 247 KGRHAMAVQNIKVGDTVVKEDPYVWVLDPSQYGYRCYHCFKVLTYPVGCTQCMRVRYCSE 306

Query: 149 SCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSD--RYKE---------- 194
           +CR  AW  YH+ EC    A+ +     +  L L+ ILV+  S   RYK           
Sbjct: 307 TCRTSAWESYHKTECPYLPAMLMSRTGNLTSLVLRTILVTGMSKIIRYKNNPKSDQHDAQ 366

Query: 195 -----------------VYHLETHLQDMRPEDLYQY 213
                            +Y L TH +   P +L QY
Sbjct: 367 FSLFTDSKGVYMGGFVGLYGLLTHTEHRSPSELLQY 402


>gi|332022516|gb|EGI62819.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 637

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 94/210 (44%), Gaps = 39/210 (18%)

Query: 42  VEAMSMQVSSTSSSASSSVAS--LPSLACGESHT--LMCASNKIKMQTSNAKGRHVIAVE 97
           ++ +  +++ T  + ++ V +   P+L   +S+   + CAS+ + ++ +   GRH++A  
Sbjct: 187 LDVLHNKITQTKVNKNTKVKTEETPALPIIKSYNNEVPCASDAVAIKYNERYGRHIVATR 246

Query: 98  DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLR 157
            +H G+ + VEKP A +++     ++C++C     A IPCN C  A+YCSE CR   W +
Sbjct: 247 KIHPGEIIAVEKPYALILVQENMQTHCSNCLRVCWANIPCNYCTYAMYCSEECRNIEWEK 306

Query: 158 YHRWEC--------------------HGALRLLEAVGIAHLALKLILVSSHSD------- 190
            H  EC                       L L EA GI  L   +  V  H D       
Sbjct: 307 CHNIECVVFPAVLEYSIYNLDLCSIRLAVLALREAGGIEKLKTMVKEVDEHDDPRTKGFS 366

Query: 191 --------RYKEVYHLETHLQDMRPEDLYQ 212
                   RY  VY L T+ +     DL++
Sbjct: 367 QDGKLHSNRYIGVYSLVTNTEKRSVSDLFK 396


>gi|321477118|gb|EFX88077.1| hypothetical protein DAPPUDRAFT_305694 [Daphnia pulex]
          Length = 651

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 80/154 (51%), Gaps = 12/154 (7%)

Query: 10  LSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACG 69
           +S + K+ +L D ++       + + +S E++  ++  QV     +A       P L   
Sbjct: 190 ISINSKIVKLGDDAN-----PTIGVSESNESAAASIERQVIKKRRTA-------PDLNRD 237

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
            +  L  AS  I++  S+ +GR ++A ED+  G T+ VEK +A ++L     S+C+HC  
Sbjct: 238 SNALLPAASTSIELTDSDDRGRCLVATEDIQIGTTVIVEKALASILLEEFKESHCHHCLH 297

Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
               P+PC++C    +CS  CR EAW  YH+ EC
Sbjct: 298 WTPGPVPCHQCSQVGFCSTLCRDEAWASYHQSEC 331


>gi|195493827|ref|XP_002094580.1| GE20125 [Drosophila yakuba]
 gi|194180681|gb|EDW94292.1| GE20125 [Drosophila yakuba]
          Length = 667

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 79/155 (50%), Gaps = 9/155 (5%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P LA GES  L+ AS  +++  +  KGR V+A E +  GD L  E+PVA  + P    +
Sbjct: 242 VPQLAHGESAELIGASKVVRLVETKEKGRFVVANEGLRTGDVLLCEEPVAACLEPTYFGT 301

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
           +C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + L+   G++ L    
Sbjct: 302 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 360

Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
           + V + +   ++          HL +H ++ +PED
Sbjct: 361 LRVFTQASSLEQGLATANLLFEHLCSHEEERQPED 395


>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 673

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)

Query: 61  ASLPSLACGESHTLMCA-SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
           AS+  +  G+ HT++ + S+ +++Q    +GRH+IA  ++  G  L VE P AF      
Sbjct: 204 ASVIPIVDGKQHTILKSCSDAVELQFDEKRGRHLIATRNIKAGSVLIVETPFAFSTNKEA 263

Query: 120 SMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              NC HC  ++    S  IPC+ C    +CSE CRREAW  YH++EC
Sbjct: 264 LDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQMYHQYEC 311


>gi|21355139|ref|NP_648574.1| SET and MYND domain protein 4 [Drosophila melanogaster]
 gi|7294583|gb|AAF49923.1| SET and MYND domain protein 4 [Drosophila melanogaster]
 gi|17945748|gb|AAL48922.1| RE32936p [Drosophila melanogaster]
          Length = 663

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P LA GES  L+ AS  +++  +  KGR V+A E +  GD L  E+PVA  + P    +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLFEEPVAACLEPSYFGT 297

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
           +C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + L+   G++ L    
Sbjct: 298 HCHHCFKRLHTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356

Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
           + + + +   ++          HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391


>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 683

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 62/121 (51%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
           E+  + CAS  +++  S  KGRH IA  D+  G+ L  E+    V+ P     +C+HC +
Sbjct: 174 ENGLISCASPSMRLAFSKDKGRHCIAQTDIVPGEILLKEEAFVAVLCPGEEDIHCHHCLS 233

Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
            + A +PC  C  A YC  +C R AW  YH+ EC     LL      H+AL+ I V+   
Sbjct: 234 PLVASVPCRGCSYAKYCGSACARAAWQSYHQRECPFGGLLLAMGVFCHVALRTIFVAGFE 293

Query: 190 D 190
           +
Sbjct: 294 E 294


>gi|242016107|ref|XP_002428677.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212513348|gb|EEB15939.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 697

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 5/159 (3%)

Query: 29  EDVLTIEKSEENSVEAMS-MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSN 87
           E ++    S  N  E M   QV + + +       LP ++   +     ASN +K++   
Sbjct: 212 EKIIDNLTSSNNDDETMKKQQVENLNFNVPYEGEPLPKVSGEPNEMYPNASNLLKIRQGT 271

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
             GR   A   +  GDT+ VE+P   V+LP  + S+C+HC T + APIPC  C    +CS
Sbjct: 272 DCGRFAAAANKIEIGDTIVVEEPQTSVLLPKMNSSHCHHCFTRLFAPIPCKTCSGVAFCS 331

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLI 183
             CR  A   YHR+EC   L      G   I++L+L+++
Sbjct: 332 VQCRDSASNTYHRYECQ-LLGFFMGSGMSLISYLSLRMV 369


>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 673

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)

Query: 61  ASLPSLACGESHTLMCA-SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
           AS+  +  G+ HT++ + S+ +++Q    +GRH+IA  ++  G  L VE P AF      
Sbjct: 204 ASVIPIVNGKQHTVLKSCSDAVELQFDEKRGRHLIATRNIKAGSVLIVETPFAFSTNKEA 263

Query: 120 SMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              NC HC  ++    S  IPC+ C    +CSE CRREAW  YH++EC
Sbjct: 264 LDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQMYHQYEC 311


>gi|383849683|ref|XP_003700474.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 686

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 49/160 (30%), Positives = 87/160 (54%), Gaps = 4/160 (2%)

Query: 33  TIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRH 92
           ++ ++E + + A  ++     +S  S    LP LA GE+  L  AS+ ++++ ++  G+ 
Sbjct: 182 SMRENENSLISAKELERRRGWTSVESK--PLPRLAAGENPRLPGASSLLEIEETDDAGKR 239

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
            IA +++  GD L VE P+  V+LP    ++C+HC +   AP+ C +C    +C   CR 
Sbjct: 240 AIAAKEIVPGDALVVEAPLGAVLLPDFFGTHCHHCFSRFVAPVGCPDCSGVAFCGIRCRD 299

Query: 153 EAWLRYHRWECHGALRLLEAVGIAHLA-LKLILVSSHSDR 191
            A   YH++EC   L LL   G++ L+ L L +V+ +  R
Sbjct: 300 AAISSYHKYECK-ILALLIGSGMSVLSMLALRMVTQNGPR 338


>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
 gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
          Length = 660

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 79/160 (49%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C +C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPKC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D Y+ V  LE H  + +P + +Q+V
Sbjct: 360 IDEELTPEKLISLPKDDYRRVAQLERHQGERQPSNFFQHV 399


>gi|195589810|ref|XP_002084642.1| GD12724 [Drosophila simulans]
 gi|194196651|gb|EDX10227.1| GD12724 [Drosophila simulans]
          Length = 663

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P LA GES  L+ AS  +++  +  KGR V+A E +  GD L  E+PVA  + P    +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVAACLEPSYFGT 297

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
           +C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + L+   G++ L    
Sbjct: 298 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356

Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
           + + + +   ++          HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391


>gi|195327115|ref|XP_002030267.1| GM24661 [Drosophila sechellia]
 gi|194119210|gb|EDW41253.1| GM24661 [Drosophila sechellia]
          Length = 660

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P LA GES  L+ AS  +++  +  KGR V+A E +  GD L  E+PVA  + P    +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVAACLEPTYFGT 297

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
           +C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + L+   G++ L    
Sbjct: 298 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356

Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
           + + + +   ++          HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391


>gi|194747113|ref|XP_001955997.1| GF24983 [Drosophila ananassae]
 gi|190623279|gb|EDV38803.1| GF24983 [Drosophila ananassae]
          Length = 663

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 68/132 (51%), Gaps = 1/132 (0%)

Query: 59  SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           S   +P L  GE+  L  AS  +K+  +  KGR V+A E V  GD +  E+PVA  ++P 
Sbjct: 237 STEQVPKLTHGENPELSGASKVVKLVETKDKGRFVVASEGVKTGDVVLCEEPVAACLIPS 296

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
              +NC+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + LL   G++ L
Sbjct: 297 FFGTNCHHCFKRLHTPVSCLHCSGVAFCSAQCMGEACSSYHRFECE-FMDLLIGSGMSIL 355

Query: 179 ALKLILVSSHSD 190
               + + + ++
Sbjct: 356 CFIALRIFTQAN 367


>gi|383854636|ref|XP_003702826.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 631

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 72/137 (52%), Gaps = 5/137 (3%)

Query: 32  LTIEKSEENSVEAMSM---QVSSTSSSASSSVASLPSLACGESHT--LMCASNKIKMQTS 86
           +TI+ +    +  M +   +VS +  S   S+    SL   ++H   + CAS+ + ++ +
Sbjct: 173 MTIDNNTRKKMNDMFISAGRVSVSDKSNKQSITKQSSLPELKTHNPEIPCASDAVTLKYN 232

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
              GRH +A  D+H G+ + VEK  + ++ P    ++C++C       IPCN C  A+YC
Sbjct: 233 EHYGRHFVAARDIHPGEIIAVEKAYSVMLDPDNLQTHCSNCLKVSWTNIPCNYCTYAIYC 292

Query: 147 SESCRREAWLRYHRWEC 163
           SE C+   W +YH  EC
Sbjct: 293 SEECKLSDWKKYHDIEC 309


>gi|195126158|ref|XP_002007541.1| GI12333 [Drosophila mojavensis]
 gi|193919150|gb|EDW18017.1| GI12333 [Drosophila mojavensis]
          Length = 661

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 38/102 (37%), Positives = 56/102 (54%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P+L  GE+  L  A+  +K+  +  KGR V+A + +  GD +  E PVA  +LP    +
Sbjct: 239 VPALTHGENPELSGAAKVVKLVETKDKGRFVVADQGLKTGDVVLAEDPVAACLLPNFFGT 298

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           NC+HC   +  P+PC  C    +CS  C  EA   YHR+EC 
Sbjct: 299 NCHHCFKRLHTPVPCLHCSGIAFCSAQCMGEACETYHRFECQ 340


>gi|307195197|gb|EFN77181.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 693

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 65/117 (55%), Gaps = 6/117 (5%)

Query: 52  TSSSASSSVASLPSLACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
           T S     V ++P +  G+ H+ L+   + + ++   A+GRH++A  D+  G  L V++P
Sbjct: 210 TPSFGKYCVTTVPVVD-GKRHSCLVSCPDSVTLRFDVARGRHLVATRDIRPGAVLIVDRP 268

Query: 111 VAFVILPPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            ++       + NC HC  ++    S  IPC  C    +C+E+CR+EAW RYHR+EC
Sbjct: 269 FSYSTDASALIRNCLHCHATLKLENSVRIPCRNCQTVSFCTETCRKEAWQRYHRYEC 325


>gi|91089699|ref|XP_974834.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
 gi|270011321|gb|EFA07769.1| hypothetical protein TcasGA2_TC005323 [Tribolium castaneum]
          Length = 543

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 73/135 (54%), Gaps = 13/135 (9%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           GR V+A  D+  G+ L VEKP+  +++   S ++C+ C +     IPC  C  A+YCSES
Sbjct: 196 GRFVVATRDIKVGEVLAVEKPLVTLVVNELS-NHCHECVSLCYNLIPCKTCTQAMYCSES 254

Query: 150 CRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSS----------HSDRYKEVYH 197
           CR  A+  YH++EC     LR L+   +   ALK+ L  +           SDRY+E+++
Sbjct: 255 CRDYAFDMYHKYECSILATLRFLQFDKLKLFALKISLQYNDLGETSDLLYRSDRYREIHN 314

Query: 198 LETHLQDMRPEDLYQ 212
           L T+       DL++
Sbjct: 315 LVTNTTKRSVPDLFE 329


>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
 gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
          Length = 660

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C  C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 360 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399


>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
 gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
          Length = 660

 Score = 80.1 bits (196), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  +  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C+ C
Sbjct: 241 VRIDRNRQEGRFARAAADVKAGEELLVERPFVAVLLEKFAKTHCENCFVRSGVPVACSRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  ++H++EC G + +L   G +   H+AL++I              
Sbjct: 301 ADVIYCSEQCREEAANKFHKYEC-GIVPILWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 360 IDEELSPEKLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399


>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
 gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
          Length = 666

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 108/240 (45%), Gaps = 32/240 (13%)

Query: 1   ALKQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSV 60
           ALK+ +  T  S+L   R +   HLD    +  +E     + +A   Q    + S++ ++
Sbjct: 172 ALKKCITATDDSTLPADR-RSKLHLDAMTMIKMLENDPRTAKQAAKQQKLKEAKSSTPTL 230

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
               +L   +       S+ +++  +  +GR   A  DV  G  L VE P   V+L   +
Sbjct: 231 EQAQTLPYEKEFV----SDLVRIDQNPQEGRFARAASDVQVGQELLVEHPYVAVLLEKYA 286

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---H 177
            ++C  C      P+ C  C   +YCSE C+++A  +YH++EC G L ++   G +   H
Sbjct: 287 QTHCEFCFMRTVVPVSCPGCSDVIYCSEQCQQKAAAKYHKFEC-GLLPVIWRSGASINNH 345

Query: 178 LALKLI-----------------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
           +AL++I                       L+S   D ++ V HLE H ++  P + +QYV
Sbjct: 346 MALRIIASKPLDYFMQLRASLDEELSLEQLLSLPKDDFRRVAHLERHQKERPPSNFFQYV 405


>gi|432896582|ref|XP_004076331.1| PREDICTED: SET and MYND domain-containing protein 4-like [Oryzias
           latipes]
          Length = 770

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 15/120 (12%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP---------------PCSMSNC 124
           K+ ++ +  KGRH++AVE +  GD +  ++P +FV++P                     C
Sbjct: 185 KVAIEYNLEKGRHLVAVERIAPGDVILTDRPYSFVLIPEMEEMREKDARADMFGTQYLRC 244

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
           + C       +PC  C  + YCS  C+REAW  +H WEC     L+    +AHLAL++ L
Sbjct: 245 HQCLAKTLCSVPCEGCSYSRYCSARCQREAWEEHHSWECSLGADLMAMGVMAHLALRITL 304


>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
 gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
          Length = 660

 Score = 79.7 bits (195), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C  C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 360 IDEQLTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399


>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
          Length = 666

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 49/182 (26%), Positives = 85/182 (46%), Gaps = 34/182 (18%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P+LA   + T   AS  +++ ++  KGR ++A +D+  G+ L V++P+  V+      S 
Sbjct: 223 PALAHLGNETHPNASRALRVGSTPEKGRQILADQDICLGEVLMVDQPITSVLFQDHHASR 282

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA---VGIAHLAL 180
           C HC     AP PC E    ++CS  CR +AW  Y ++E   A  +L+A     I  LAL
Sbjct: 283 CYHCFQRCLAPFPCFETSSVIFCSRECRDKAWNTYQKYEYQYAA-VLDANFCGKIGQLAL 341

Query: 181 KLIL------------------------------VSSHSDRYKEVYHLETHLQDMRPEDL 210
           +++L                                 +   Y+ +YH++TH  + +P+DL
Sbjct: 342 RMLLECNLEGVLEFVRGLDKTGDVDDMEKVGLDKEGKYCSDYRSLYHMKTHADERKPDDL 401

Query: 211 YQ 212
           ++
Sbjct: 402 FE 403


>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
 gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
 gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
          Length = 567

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C  C
Sbjct: 148 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 207

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 208 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 266

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                    L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 267 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 306


>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
 gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
          Length = 646

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 77/159 (48%), Gaps = 27/159 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ ++  +GR   A  DV  G+ L VE+P   V+L   + S+C +C      P+ C  C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKSHCENCFMRTVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
              +YCSE CR EA  +YH++EC G + ++   G +   H+AL++I              
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359

Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
                    L+S   D ++ V  LE H  + +P + +Q+
Sbjct: 360 IDEQLTPEQLISLPKDDFRRVAQLERHQGERQPPNFFQH 398


>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 635

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 78/152 (51%), Gaps = 9/152 (5%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           LA G S     AS+ I +  S   GRH++A ++   GD + +E P A+VI      ++C+
Sbjct: 229 LAYGPSKEAPAASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYYTHCH 288

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLLEAVGIAHLA- 179
           HC +     IPC+ C +A YCSE CR+ AW   H+ EC      G L  ++   I  L  
Sbjct: 289 HCLSRSYNLIPCSYCPVAQYCSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTK 348

Query: 180 --LKLILVSSHSDRYKEVYHLETHLQDMRPED 209
               LI+V+S   ++ E+  ++  L +  P++
Sbjct: 349 IIRFLIVVTSKGKKFDEL-RVDMELAESNPDN 379


>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 674

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/119 (34%), Positives = 63/119 (52%), Gaps = 4/119 (3%)

Query: 49  VSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVE 108
           VS  ++S+++ V ++PS+   +   L   S+ + +Q    +GRH++A  ++  G  L VE
Sbjct: 193 VSLENASSANRVTAIPSIDGKQHAMLESCSDAVTLQFDEKRGRHLVATRNIKAGYILIVE 252

Query: 109 KPVAFVILPPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            P AF         NC +C  ++    S  IPC  C    +CSE CR EAW  YHR+EC
Sbjct: 253 SPFAFSTNNEAFDRNCLNCHVTLKLNDSVKIPCYFCRTVSFCSEKCRNEAWKLYHRYEC 311


>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 635

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 78/152 (51%), Gaps = 9/152 (5%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           LA G S     AS+ I +  S   GRH++A ++   GD + +E P A+VI      ++C+
Sbjct: 229 LAYGPSKEAPAASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYYTHCH 288

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLLEAVGIAHLA- 179
           HC +     IPC+ C +A YCSE CR+ AW   H+ EC      G L  ++   I  L  
Sbjct: 289 HCLSRSYNLIPCSHCPVAQYCSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTK 348

Query: 180 --LKLILVSSHSDRYKEVYHLETHLQDMRPED 209
               LI+++S   ++ E+  ++  L +  P++
Sbjct: 349 IIRFLIVITSKGKKFDEL-RVDMELAESNPDN 379


>gi|195160613|ref|XP_002021169.1| GL24960 [Drosophila persimilis]
 gi|194118282|gb|EDW40325.1| GL24960 [Drosophila persimilis]
          Length = 661

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 4/135 (2%)

Query: 59  SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           S   +P LA GE+  L  A+N +K+  +  KGR V+A E +  GD +  E PVA  ++P 
Sbjct: 235 STEKVPQLAHGENPELSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPS 294

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
              ++C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + LL   G++  
Sbjct: 295 FFGTHCHHCFKRLHTPVACLHCSGIAFCSAQCMGEACESYHRFECE-FMDLLIGSGMSIL 353

Query: 177 -HLALKLILVSSHSD 190
             +AL++   ++  D
Sbjct: 354 CFIALRIFTQAASVD 368


>gi|125978717|ref|XP_001353391.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
 gi|54642149|gb|EAL30898.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
          Length = 661

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 4/135 (2%)

Query: 59  SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           S   +P LA GE+  L  A+N +K+  +  KGR V+A E +  GD +  E PVA  ++P 
Sbjct: 235 STEKVPQLAHGENPELSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPS 294

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
              ++C+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + LL   G++  
Sbjct: 295 FFGTHCHHCFKRLHTPVACLHCSGIAFCSAQCMGEACESYHRFECE-FMDLLIGSGMSIL 353

Query: 177 -HLALKLILVSSHSD 190
             +AL++   ++  D
Sbjct: 354 CFIALRIFTQAASVD 368


>gi|156543612|ref|XP_001604392.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 629

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 2/144 (1%)

Query: 20  QDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASN 79
           +D  +L F ++++  E +  + +E +S + ++     +    +   LA G S      S+
Sbjct: 169 KDEKNLTFIKELIDKEDAATDEMELISKKPANVPRYLADE--NDLKLANGPSDEAPSISD 226

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
            IK+  S   GRH+IA +    GD L +EKP A VI      ++C++C       IPC  
Sbjct: 227 GIKIAYSEKYGRHLIATKPFEPGDILLLEKPYANVIYREKYYTHCHYCLARSYNLIPCPH 286

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C L++YCSE+CR  AW + H  EC
Sbjct: 287 CPLSLYCSENCRTLAWSKGHEIEC 310


>gi|322794112|gb|EFZ17321.1| hypothetical protein SINV_03681 [Solenopsis invicta]
          Length = 464

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 106/249 (42%), Gaps = 49/249 (19%)

Query: 4   QDVQETLSSS---LKLCRLQDTSH--LDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASS 58
           Q V +TL      L    L DTS   L  K DVL    ++         ++ +T +    
Sbjct: 157 QSVDDTLKKVQHWLDKMSLNDTSREKLKTKLDVLRNRNTQ--------TKLCATDTRRKV 208

Query: 59  SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
             + LP++A   +  + CAS+ + ++ +N  GRH++A  D+  G+ + VEKP + ++   
Sbjct: 209 LESPLPTIA-SHNDEVPCASDAVVIKYNNHYGRHIVATRDICPGEVIAVEKPYSLILKQE 267

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-------------- 164
              ++C++C     A IPCN C  A+YCSE C+   W + H  EC               
Sbjct: 268 NMQTHCSNCLRVCWANIPCNYCTYAMYCSEECKNIEWKKCHDVECAIFPALIEYSFYNLD 327

Query: 165 ------GALRLLEAVGIAHLALKLILVSS---------------HSDRYKEVYHLETHLQ 203
                   L L +A G+  L   L  V                 HSDRY  VY L T+ +
Sbjct: 328 WCSIRLAVLALRDAGGMKELRTMLKEVDEHADPRTRGFSQDGKLHSDRYISVYSLVTNSE 387

Query: 204 DMRPEDLYQ 212
                DL++
Sbjct: 388 KRSVSDLFK 396


>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4-like [Ornithorhynchus anatinus]
          Length = 815

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 73/153 (47%), Gaps = 18/153 (11%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS------- 122
           E+  +  AS+ + ++    +GRH+IA ED+  G+ L  E+    V+ P   +        
Sbjct: 219 ENDQIPQASSCVLLRADIPRGRHLIAKEDILPGELLVREEAFVSVLTPGEDLGLQEGLET 278

Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                      +C+HC   +S+ IPCN C  A YCS+ C ++AW RYH+ EC     LL 
Sbjct: 279 KWNPGITHGDLHCHHCLKPVSSSIPCNGCSYAKYCSQECLQQAWERYHKIECPLGGILLT 338

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
                H+AL+ +LV+   D  + V      L +
Sbjct: 339 LGVFCHVALRTVLVAGFEDVSRSVERFPEQLTN 371


>gi|307183763|gb|EFN70437.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 646

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 1/101 (0%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LP++A   +  + CAS+ + ++ +   GRHVIA  +++ G+ + VEKP A +++     +
Sbjct: 219 LPTIA-SYNDEVPCASDAVAIKYNTRCGRHVIATRNINPGEVIAVEKPYALLLVQQNMQT 277

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           +C++C     A IPCN C  A+YCSE CR   W R H  EC
Sbjct: 278 HCSNCLKVCWANIPCNYCTYAMYCSEECRYAEWKRCHDVEC 318


>gi|195435832|ref|XP_002065883.1| GK20590 [Drosophila willistoni]
 gi|194161968|gb|EDW76869.1| GK20590 [Drosophila willistoni]
          Length = 661

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 8/156 (5%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LP L  G +  L  A+  +K+  +  KGR V+A   +  GD +  E PVA  +LP    +
Sbjct: 239 LPELTHGPNPELSGAAQGVKLVETKEKGRFVVANGGLKTGDVVLCENPVAACLLPNFFGT 298

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLA 179
           NC+HC   +  P+ C  C    +CS  C  EA   YHR+EC   + LL   G++    +A
Sbjct: 299 NCHHCFKRLRNPVACLHCSGVAFCSAQCMGEASESYHRFECE-FMDLLIGSGMSILCFIA 357

Query: 180 LKLIL-VSSHSDRYKEVYHLETHL---QDMRPEDLY 211
           L++    SS  D       L  HL   +D+R  D Y
Sbjct: 358 LRVFTQASSLEDGLNTANLLFEHLCSHEDVRQADDY 393


>gi|328785040|ref|XP_001122116.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 629

 Score = 77.4 bits (189), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 61/105 (58%), Gaps = 5/105 (4%)

Query: 61  ASLPSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           A LP +   ++H +   CAS+ I ++ ++  GRH++A   ++ G+ + +EKP + ++ P 
Sbjct: 206 APLPKI---KTHNIEVPCASDAITIKYNDKYGRHIVATRKINPGEVIAIEKPYSLILTPD 262

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              ++C++C     A IPC  C  A+YCSE C+   W +YH  EC
Sbjct: 263 NIYTHCSNCLEVSWANIPCEYCTYAMYCSEECKAMEWKKYHDIEC 307


>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
 gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
          Length = 909

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 72/154 (46%), Gaps = 13/154 (8%)

Query: 20  QDTSHLDFKEDVLTIEKSE--ENSVEAM-----------SMQVSSTSSSASSSVASLPSL 66
           +D +  + K  +  IEK++  E+  EAM           S Q        S+    +P L
Sbjct: 168 KDDAKKEVKGTLEAIEKADIPEDRKEAMRKELKLAGRNISKQEEGKEGEESAEEVKVPKL 227

Query: 67  ACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNH 126
             GES  +  AS  ++M+    KGR ++A +    G  L VE+P A V+L     ++C+ 
Sbjct: 228 QYGESQQVAHASGGVEMREEEDKGRMLVAQKAFEPGSVLIVEQPYAAVLLQKHHSTHCHT 287

Query: 127 CCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
           C T +  P PC  C    YCS +C  +AW  YHR
Sbjct: 288 CVTPVLVPHPCRGCQYVQYCSGTCEEQAWREYHR 321


>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
 gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
          Length = 660

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 73/159 (45%), Gaps = 25/159 (15%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  +GR   A  DV  G+ L VE+P   V+L   + S+C +C      P+ C  C
Sbjct: 241 VRIDRSPQEGRFARAAADVKAGEELLVERPFVSVLLEKFAKSHCENCFVRSVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWEC---------------HGALRLLEAVGIAH-LALKLI- 183
              +YCSE CR EA  +YH++EC               H ALR+  +  + + L LK   
Sbjct: 301 ADVIYCSEQCRDEAAKKYHKYECGIVPTIWRSGASINNHIALRIFASKPLEYFLQLKPTI 360

Query: 184 --------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                   L+S   D ++ V  LE H     P + +Q+V
Sbjct: 361 DKELSPEELLSLPKDDFRRVAQLERHQNKRPPSNFFQHV 399


>gi|47221832|emb|CAG08886.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 743

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 68/173 (39%), Gaps = 50/173 (28%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------------------CSMSNCN 125
           S  KGRH++A E +  GD +  ++P + V++P                          C+
Sbjct: 195 SPEKGRHLVATERIAAGDVILSDRPYSCVLIPGMKEVKGKGAKQGTDGGELFGIEQRRCH 254

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL- 184
            C      P+PC+ C  + YCS SC++EAW  +HRWEC     L+   G+  LAL++ L 
Sbjct: 255 RCLAETLCPLPCDGCSYSRYCSASCQQEAWEEHHRWECPLGAELMVMGGMLQLALRVALK 314

Query: 185 -----------------------------VSSHSDRYKEVYHLETHLQDMRPE 208
                                           H D Y  V+HL  H+    P+
Sbjct: 315 AGQENIQTARKLIRDERASSKCSSQSSFQTPGHGDSYLSVFHLLHHMNQHSPQ 367


>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 631

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 60/111 (54%), Gaps = 4/111 (3%)

Query: 57  SSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVIL 116
              +A++P +       L   S+ ++++   A+GRH++A  D+  G  L V++P +F   
Sbjct: 157 GKRIAAMPVVDGKPHDCLASCSDSVELRLDEARGRHLVAARDIRPGAVLIVDQPFSFSTD 216

Query: 117 PPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            P    NC +C  ++    S  IPC  C    +C+E+CR++AW  YH++EC
Sbjct: 217 GPALSRNCLYCHATLKLEDSVKIPCRNCQTVSFCTETCRKKAWEAYHQYEC 267


>gi|332023172|gb|EGI63428.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 624

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 5/100 (5%)

Query: 69  GESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
           G+ H+ L+   + +++Q   A+GRH++A  D+  G  L V++P +F         NC HC
Sbjct: 167 GKPHSCLVSCPDSVELQVDEARGRHLVATRDIRPGTVLIVDRPFSFSTDASALDRNCLHC 226

Query: 128 CTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
             ++    S  IPC  C    +C+E CR+EAW  YH++EC
Sbjct: 227 HATLKLEDSVRIPCRNCQTVAFCTEVCRKEAWETYHQYEC 266


>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
 gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
          Length = 660

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 73/159 (45%), Gaps = 25/159 (15%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  +GR   A  DV  G+ L VE+P   V+L   + ++C +C      P+ C  C
Sbjct: 241 VRIDRSPQEGRFARAAADVKAGEELLVERPFVSVLLEKFAKTHCENCFVRSVVPVACPRC 300

Query: 141 ILAVYCSESCRREAWLRYHRWEC---------------HGALRLLEAVGIAH-LALKLI- 183
              +YCSE CR EA  +YH++EC               H ALR+  +  + + L LK   
Sbjct: 301 ADVIYCSEQCRDEAAKKYHKYECGIVPTIWRSGASINNHIALRIFASKPLEYFLQLKPTI 360

Query: 184 --------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                   L+S   D ++ V  LE H     P + +Q+V
Sbjct: 361 DKELSPEELLSLPKDDFRRVAQLERHQNKRPPSNFFQHV 399


>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 621

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 61/107 (57%), Gaps = 3/107 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           + +Q +   GRH++A  D++ G+ +F+E+P    +      + C HC T     IPC  C
Sbjct: 202 VDVQYNEKYGRHLVAKRDINPGEIIFIEEPYMHCLDLVRGYTYCFHCLTPCLITIPCEHC 261

Query: 141 ILAVYCSESCRREAWLRYHRWEC---HGALRLLEAVGIAHLALKLIL 184
             A++CSE C+++AW++YH  EC     A   ++  G+ H+A+K ++
Sbjct: 262 GWAMFCSEGCKQQAWVKYHDLECAVYDFAKENVDGDGVKHMAVKSLI 308


>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
          Length = 704

 Score = 76.3 bits (186), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 91/219 (41%), Gaps = 40/219 (18%)

Query: 34  IEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCA-SNKIKMQTSNAKGRH 92
           +EK +  ++    +  +S S  A+++   +P++   + +  + + S+ + + +  AKGR 
Sbjct: 230 LEKKQSLALSQRLLDAASRSRRATANQIRIPTVHGAKPNDFLTSCSDALALGSDEAKGRK 289

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----IPCNECILAVYCSE 148
           + A  D+  G  L V++P A          NC  C  S+  P    +PC  C    +CSE
Sbjct: 290 LYATRDIKPGTVLIVDRPFASTTDVAALCRNCLCCHASLKLPESIRVPCEHCQAVQFCSE 349

Query: 149 SCRREAWLRYHRWECH------------------------GALRLLEAVGIAHLALKL-- 182
            CRR AW ++H++EC                          ALR  +   +    LKL  
Sbjct: 350 DCRRRAWNKFHKYECRIFDYFYAKQAQKTNCLLAYRAVVAAALRNEKEFSLDQEFLKLHE 409

Query: 183 ---------ILVSSHSDRYKEVYHLETHLQDMRPEDLYQ 212
                     L S  S  YK V+ LE+H  D  P D  Q
Sbjct: 410 NDKDSLDIASLASYESRDYKTVFSLESHCADADPADNLQ 448


>gi|195382848|ref|XP_002050140.1| GJ20356 [Drosophila virilis]
 gi|194144937|gb|EDW61333.1| GJ20356 [Drosophila virilis]
          Length = 662

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 76/163 (46%), Gaps = 27/163 (16%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           S+ +++  +  +GR   A  DV  G  L VE P   V+L   + ++C +C      P+ C
Sbjct: 240 SSLVRIDQNAQEGRFARAAADVQVGQELLVEHPYVAVLLEKFAQTHCEYCFVRTVVPVAC 299

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------L 184
             C   +YCSE C+  A  +YH++EC G L ++   G +   H+AL++I          L
Sbjct: 300 PGCSDVIYCSEQCQERASAKYHKYEC-GILPVIWRSGASINNHMALRIIASKPLDYFLQL 358

Query: 185 VSSH-------------SDRYKEVYHLETHLQDMRPEDLYQYV 214
            SS               D ++ V HLE H    +P + +QYV
Sbjct: 359 KSSLDEELSLEQLLSLPKDDFRRVAHLERHEGQRQPSNFFQYV 401


>gi|340724466|ref|XP_003400603.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 631

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 70/135 (51%), Gaps = 11/135 (8%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
           CAS+ + ++ +   GRH++A   +  G+ + VEKP + ++ P    ++C++C     A I
Sbjct: 222 CASDAVTIKYNEEYGRHIVATHKIRPGEVIAVEKPYSLILTPDNIHTHCSNCLEVSWANI 281

Query: 136 PCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLEAVGIAHLALKLILVSSHSDRYK 193
           PC  C  A+YCSE C+   W +YH  EC    ++  +  V +   +L+L + +       
Sbjct: 282 PCEHCTYAMYCSEECKIMEWKKYHDVECTVFPSMLKMNFVKLDLFSLRLAIQAVRE---- 337

Query: 194 EVYHLETHLQDMRPE 208
                 TH+Q++R E
Sbjct: 338 -----ATHIQELRKE 347


>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 668

 Score = 75.9 bits (185), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 63  LPSLACGESHTLM-CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
           +P+L  G+ HT++   S+ + +Q    +GRH+IA +++  G  L VE P AF        
Sbjct: 203 IPTLN-GKQHTILKSCSDAVTLQFDEKRGRHLIATKNIKAGSVLIVETPFAFSTNKEALG 261

Query: 122 SNCNHC----CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            NC HC     +S S  IPC  C    +CSE CR +AW  YH++EC
Sbjct: 262 RNCLHCHITLMSSNSVKIPCYYCQTVSFCSEKCRSKAWQIYHQYEC 307


>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 681

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 41/124 (33%), Positives = 64/124 (51%), Gaps = 1/124 (0%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           LA  E+  L  AS  + ++ +   G+  IA E +  GD L VE P A  +LP    ++C 
Sbjct: 191 LASDENPRLPGASTLLDIEETENAGKRAIAAEPIEPGDRLVVEAPYAATLLPEFFGTHCQ 250

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV 185
           HC +   API C +C    +C   CR  A   YH++EC   L LL   G++ L++  + +
Sbjct: 251 HCFSRFVAPIGCPDCSSVAFCGRECRDAAMASYHKYECK-VLALLIGSGMSVLSMLALRM 309

Query: 186 SSHS 189
           ++ S
Sbjct: 310 TTQS 313


>gi|380021898|ref|XP_003694793.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 629

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 60/105 (57%), Gaps = 5/105 (4%)

Query: 61  ASLPSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           A LP +   ++H +   CAS+ I ++ ++  GRH++A   +  G+ + +EKP + ++ P 
Sbjct: 206 APLPKI---KTHNIEVPCASDAITIKYNDKYGRHIVATRKISPGEVVAIEKPYSLILTPD 262

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              ++C++C     A IPC  C  A+YCSE C+   W +YH  EC
Sbjct: 263 NIHTHCSNCLEVSWANIPCEYCTYAMYCSEECKAMEWKKYHDIEC 307


>gi|350424957|ref|XP_003493967.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 631

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 86/162 (53%), Gaps = 14/162 (8%)

Query: 12  SSLKLCRLQDTSHLDFKEDVLTIEK--SEENSVEAMSMQVSSTSSSASS------SVASL 63
           ++LK  R++D      KE    +EK  S++ + + +S ++++T  + +S      ++   
Sbjct: 152 NALKHPRMEDA----IKEAQYWLEKMSSDDINRKRLSEKLNNTKHNLASQKLKKQNIIKQ 207

Query: 64  PSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
           P L   ++  +   CAS+ + ++ +   GRH++A   +  G+ + VEKP + ++ P    
Sbjct: 208 PPLPKIKTRNIEVPCASDAVTIKYNEEYGRHIVATRKIRPGEVIAVEKPYSLILTPDNIH 267

Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           ++C++C     A IPC  C  A+YCSE C+   W +YH  EC
Sbjct: 268 THCSNCLEVSWANIPCEHCTYAMYCSEECKAMEWKKYHDVEC 309


>gi|383866011|ref|XP_003708465.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 636

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 53/98 (54%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           L  G S     AS+ + +  S   GRH++A ++   GD + +E P AF I      ++C+
Sbjct: 230 LTYGPSKEAPAASDGVSIAFSEKYGRHLVATKEFKPGDIVTIEDPFAFTIYMQSYFTHCH 289

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           HC       IPC +C +A YCSE+CR+ AW   H+ EC
Sbjct: 290 HCLARSFNLIPCLKCPVAQYCSETCRKLAWEMAHQIEC 327


>gi|328776415|ref|XP_003249162.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 714

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 46/153 (30%), Positives = 77/153 (50%), Gaps = 2/153 (1%)

Query: 28  KEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSN 87
           KE      ++ E S+ ++  QVS   + A  +      L  GE+  L  AS  + ++ + 
Sbjct: 178 KETCEKFLRANEKSLVSIE-QVSDRRNGALVATEEKYVLKGGENPRLPGASTFLDVEETV 236

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
             G+ VIA +++  G+ L +E P A ++LP    ++C HC +   API C +C    +C 
Sbjct: 237 NAGKRVIAAKNIEPGNRLIIESPHAAILLPEFFGTHCQHCFSRFKAPIGCPDCSSVAFCG 296

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
             CR  A   YH++EC   L LL   G++ L++
Sbjct: 297 RKCRDTALASYHKYECK-ILVLLIGSGMSVLSM 328


>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
 gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
          Length = 650

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 48/152 (31%), Positives = 68/152 (44%), Gaps = 26/152 (17%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  +GR      D+     L +E+P   V+L   S+ +C HC   +S PI C  C   V+
Sbjct: 236 SPDEGRFARTNTDLKPNTILLLERPHVSVLLEDYSLDHCTHCFKRVSVPIACPLCADVVF 295

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHS------------- 189
           CS+ C  +A   YHR+EC G L +L   G +   H+AL++I   S               
Sbjct: 296 CSDECETKANATYHRYEC-GFLPILWGSGASITCHMALRMITQKSEEYFLKLKPELAGLT 354

Query: 190 ---------DRYKEVYHLETHLQDMRPEDLYQ 212
                    D Y++VY L TH     PED +Q
Sbjct: 355 NEQIDKLPVDDYRKVYKLVTHESTRSPEDFFQ 386


>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
          Length = 661

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 47/169 (27%), Positives = 77/169 (45%), Gaps = 23/169 (13%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P L+   +     AS  +++     +GR   A  D+  G+ L +E+P   V+L   S S+
Sbjct: 228 PKLSGKHNKLFPAASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEFSKSH 287

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
           C +C      P+PC +C   ++CSE C   A   YH +ECH  L LL   G +   H+AL
Sbjct: 288 CQNCFNKCLIPLPCPKCPNVIFCSEKCLDIALKSYHGYECH-ILPLLWKSGCSVTCHIAL 346

Query: 181 KLILVSS-------------------HSDRYKEVYHLETHLQDMRPEDL 210
           ++I  +S                    ++ Y+ +YHL +H      +D+
Sbjct: 347 RMITQNSKDYFMKIMQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDI 395


>gi|158287355|ref|XP_309411.4| AGAP011232-PA [Anopheles gambiae str. PEST]
 gi|157019610|gb|EAA05175.4| AGAP011232-PA [Anopheles gambiae str. PEST]
          Length = 434

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 53/98 (54%), Gaps = 2/98 (2%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
           GRHV+A   +  GD + VEKP A V+        C  C        IPC EC +A+YCS+
Sbjct: 162 GRHVVATRQLRVGDVVMVEKPYATVLSDHMKRVRCAFCHAEEPFLLIPCEECTIAMYCSQ 221

Query: 149 SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
            C R AW +YHR+EC   L  +  +G  +LAL +  V+
Sbjct: 222 KCLRAAWQQYHRYEC-PILNDMRTIGTEYLALAVRTVA 258


>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 666

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 40/110 (36%), Positives = 59/110 (53%), Gaps = 6/110 (5%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS KI M+  +  GR   A E++  G+T+ VEKP   V+L    + NC HC      P P
Sbjct: 233 ASVKIFMKNDSTVGRFASASENIFPGETILVEKPYCSVLLEDYHLINCQHCFKRTQGPYP 292

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI 183
           C  C   ++CS +CR  A   YH++EC   L+ L   G++    +AL++I
Sbjct: 293 CENCSQVIFCSLNCRESA--NYHKYEC-SILQTLHKSGVSITVLMALRMI 339


>gi|345482384|ref|XP_001608081.2| PREDICTED: hypothetical protein LOC100124188 isoform 1 [Nasonia
           vitripennis]
 gi|345482386|ref|XP_003424586.1| PREDICTED: hypothetical protein LOC100124188 isoform 2 [Nasonia
           vitripennis]
          Length = 654

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 2/101 (1%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           LPS    +  +  CAS+ + +      GRH+IA  D+  G+ + VE+     + P    +
Sbjct: 222 LPSFKPNKEAS--CASDAVAIAHHEIWGRHIIAERDIEPGEIIVVEENYLSFLDPTKMYA 279

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            C+ C       IPCN CI  VYCSE C+ EAW +YH++EC
Sbjct: 280 FCSTCMKPSLCLIPCNNCIYDVYCSEECKSEAWKKYHQFEC 320


>gi|195119678|ref|XP_002004356.1| GI19663 [Drosophila mojavensis]
 gi|193909424|gb|EDW08291.1| GI19663 [Drosophila mojavensis]
          Length = 762

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 78/158 (49%), Gaps = 15/158 (9%)

Query: 31  VLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKG 90
           V  +E+ +E   +    QVS      +S  A  PSL        + A N     +   KG
Sbjct: 187 VNQLEQLQEQLTKLQEQQVSEPKEELTSEQA--PSLQ-------ILARNHKIWNSPGVKG 237

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYCSE 148
           RH++A + +  GD +F E+   FV L P  +  C  C  S+  APIPC +C    VYCS 
Sbjct: 238 RHMVATQSIQPGDIIFKEQTACFVPLEPGLI--CQQCAASVLYAPIPCPQCHQQVVYCSR 295

Query: 149 SCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
            CR+    + H +EC G  + L   +G++HLAL+++L 
Sbjct: 296 HCRQLN-EQIHSYECAGYRKNLFTMLGVSHLALRMVLF 332


>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
          Length = 675

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 72/152 (47%), Gaps = 27/152 (17%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S+A+GR      D+     L +E+P    +L   S+ +C++C   +S PI C  C   V+
Sbjct: 250 SDAEGRFAKTSADLKPNTILLLERPHVSALLEDYSLDHCSNCFKRVSVPIACPLCSDVVF 309

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI------------------L 184
           CS+ C R A   +HR+EC G L +L   G +   H+AL++I                  L
Sbjct: 310 CSDECERRANASFHRYEC-GFLPILWGSGASITCHMALRMITQKSSEYFAKLQPELAAGL 368

Query: 185 VSSHSDR-----YKEVYHLETHLQDMRPEDLY 211
            +   DR     Y++VY L TH     PEDL+
Sbjct: 369 TNEQIDRLPVNDYRKVYQLVTHEATRTPEDLF 400


>gi|195027211|ref|XP_001986477.1| GH21386 [Drosophila grimshawi]
 gi|193902477|gb|EDW01344.1| GH21386 [Drosophila grimshawi]
          Length = 664

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 77/163 (47%), Gaps = 27/163 (16%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           S+ +++  +  +GR   A  DV  G  L VE P   V+L   + ++C +C      P+ C
Sbjct: 242 SSLVRIDQNPQEGRFARAAADVQVGQELLVEHPYVAVLLEKYAHTHCEYCFVRTVVPVAC 301

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------- 183
             C   +YCSE C++ +  +YH++EC G L ++   G +   H+AL++I           
Sbjct: 302 PGCSDVIYCSEQCQQRSADKYHKYEC-GILPIIWRSGASINNHMALRIIASKPLDYFLQL 360

Query: 184 ------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                       L+S   D ++ V HLE H ++    + +QYV
Sbjct: 361 RPSLDEDLSLEQLLSLPKDDFRRVAHLERHQKERAASNFFQYV 403


>gi|410915300|ref|XP_003971125.1| PREDICTED: SET and MYND domain-containing protein 4-like [Takifugu
           rubripes]
          Length = 723

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 63/129 (48%), Gaps = 20/129 (15%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------CSMSN------------CNHCC 128
           KGRH++A E V  GD +  ++P + V++P            SN            C+ C 
Sbjct: 170 KGRHLVATEKVAAGDVILRDRPYSCVLIPGRKEVQGKEIKHSNDGDILFGMEHKLCHRCL 229

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
           T    P+PC+ C  + YCS SC++EAW  +HRWEC     L+    ++ LAL++ L +  
Sbjct: 230 TETLCPLPCDGCSYSRYCSASCQQEAWEEHHRWECPLGAELMVMGVLSQLALRVALKAGK 289

Query: 189 SDRYKEVYH 197
            +  +   H
Sbjct: 290 KNIQRGRVH 298


>gi|345486427|ref|XP_001607097.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 596

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 4/90 (4%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA--FVILPPCSMSNCNHCCTSISAP 134
           +S+ ++++ S   GRH++A +D+  G+ +F+EKP    + I  P     C HC +     
Sbjct: 163 SSSLVEVRHSEKYGRHLVATQDIKPGEIIFIEKPYISCYNIKKP--YLYCCHCLSIAWTG 220

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECH 164
           IPC+ C   V+CSE C++EAW +YH  ECH
Sbjct: 221 IPCDNCGWFVFCSEKCKKEAWTQYHDIECH 250


>gi|158287323|ref|XP_309376.4| AGAP011270-PA [Anopheles gambiae str. PEST]
 gi|157019595|gb|EAA05063.4| AGAP011270-PA [Anopheles gambiae str. PEST]
          Length = 341

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 7/152 (4%)

Query: 29  EDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNA 88
           E ++  EK  +N++ A +   +++SS AS  V   P L+          +N ++++ +  
Sbjct: 76  EKLIQREKEVKNAL-AKARNKNASSSKASPDVVEEPELSYAAKENAPQVANCLELRKNEE 134

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
            GRHV+    +  GD + +E+P   V+        C+ C        IPC  C +A+YCS
Sbjct: 135 YGRHVVTTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHEERPFTLIPCEGCTMAMYCS 194

Query: 148 ESCRREAWLRYHRWECHGALR----LLEAVGI 175
           E C  +A+  YHR+EC G LR    + EAV +
Sbjct: 195 EECLSKAYNNYHRYEC-GLLRDLWEVFEAVPL 225


>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
          Length = 648

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 36/102 (35%), Positives = 57/102 (55%)

Query: 62  SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
           S P L    +  L  AS  +++  +  +GR ++A ED+  G+T+ VEK  A ++      
Sbjct: 230 SAPELHRARNPLLPSASASVELINTPDRGRCLVATEDIKIGETVIVEKAHASILQYEFKE 289

Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           S+C+HC      P+PC++C    +CS  CR EAW  YH++EC
Sbjct: 290 SHCHHCLHWTPGPVPCHKCSQVGFCSTQCRDEAWDSYHQFEC 331


>gi|332025618|gb|EGI65780.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 630

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 74/154 (48%), Gaps = 16/154 (10%)

Query: 62  SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
           +   L  G S  +   S+ + +  S   GRH IA  +   GD + +E P A VI      
Sbjct: 219 NFSKLMNGPSREIPAVSDGVMIFFSERYGRHYIATREFKPGDIISIEDPYAHVIYEERYY 278

Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV-------- 173
           ++C++C       IPC+EC +A YCSE+CR+ AW   H  EC   L+LL  +        
Sbjct: 279 THCHYCLARSYNLIPCSECPIAQYCSENCRKLAWEACHETEC-PILKLLTNLLNVDKDKI 337

Query: 174 -GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
             I+ +   LI+V+ +  R KE+       QDM+
Sbjct: 338 RMISKIIRLLIVVTENGTRIKEL------QQDMK 365


>gi|357620108|gb|EHJ72415.1| hypothetical protein KGM_11009 [Danaus plexippus]
          Length = 477

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 39/104 (37%), Positives = 59/104 (56%)

Query: 60  VASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
           V SL ++      T +CAS+K+++  S+  GRHV A ED++ G+ L +E P   ++L   
Sbjct: 19  VESLVTMKGPCDPTYVCASSKLEVVYSDEMGRHVCAKEDINVGEVLVIEDPYFTLLLKSQ 78

Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            +  C++C  S    +PCN C  A+YCSE C+  A   YH  EC
Sbjct: 79  YLLCCSYCLLSNLNLLPCNNCCFAMYCSEECKVRALKEYHSVEC 122


>gi|170042501|ref|XP_001848962.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167866038|gb|EDS29421.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 667

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 13/161 (8%)

Query: 63  LPSLACGESHTLMCASNKIKM---QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
           LP ++ G +  L  AS+K+ +   +    KGR+++A +D+   + + VE  V   +    
Sbjct: 243 LPEVS-GANADLPYASHKLALGGAREDTTKGRYIVAAQDLKPAEPIVVEPAVGACLYTKF 301

Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA--- 176
             S+CN C   + AP+ C EC    +CS  CR  A   YHR+ECH  L LL   G++   
Sbjct: 302 FGSHCNACFAKLVAPVACPECAGVAFCSPECRDRACSGYHRFECH-YLDLLIGSGMSVLC 360

Query: 177 HLALKLILVSSHSDRYKEV-----YHLETHLQDMRPEDLYQ 212
           HLAL+L+  +   ++  E+       L  H     P D +Q
Sbjct: 361 HLALRLVTQAGTPEKAIELGAELKQTLCAHEDRREPSDYFQ 401


>gi|115495645|ref|NP_001070062.1| SET and MYND domain-containing protein 4 [Danio rerio]
 gi|115313830|gb|AAI24342.1| SET and MYND domain containing 4 [Danio rerio]
          Length = 753

 Score = 73.2 bits (178), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 74/140 (52%), Gaps = 15/140 (10%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS--------------- 122
           S+ + +  S+ KGRH++ +E+   G+ +  ++    V++P    +               
Sbjct: 185 SDGVSVYFSSDKGRHMLVMENKPAGEVVLEDEAYCSVLIPANIFNTGTNKAVETFGTEDR 244

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
           +C+HC +   + +PC +C  A YC ESC+++AW ++H+WEC     LL    + HLAL++
Sbjct: 245 HCHHCLSQSLSFVPCPKCSYARYCGESCQKDAWDQWHQWECPVGADLLAIGVLGHLALRV 304

Query: 183 ILVSSHSDRYKEVYHLETHL 202
           +L +  ++    + + + H+
Sbjct: 305 VLKAGQTEVQMGIKNTKDHV 324


>gi|195430720|ref|XP_002063397.1| GK21416 [Drosophila willistoni]
 gi|194159482|gb|EDW74383.1| GK21416 [Drosophila willistoni]
          Length = 661

 Score = 72.8 bits (177), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 75/163 (46%), Gaps = 27/163 (16%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           S+ +++  +  +GR   A  DV  G  L VE+P   V+L   + ++C  C      P+ C
Sbjct: 238 SDLVRIDQNKQEGRFARAAADVKIGQELLVERPYVAVLLEKYAKTHCEACFMRTVVPVAC 297

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------- 183
             C   VYCSE C++ A  +YH++EC G L  +   G +   H+AL++            
Sbjct: 298 PRCADVVYCSEKCQQLAANKYHKYEC-GILPSIWRSGASINNHIALRIFASKPWSYFKDL 356

Query: 184 ------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
                       L+S   D ++ V  LE H  + +P + +Q+V
Sbjct: 357 KPLIDVELPVEKLISLPKDDFRRVAQLERHQTERQPSNFFQHV 399


>gi|118781832|ref|XP_311885.3| AGAP002999-PA [Anopheles gambiae str. PEST]
 gi|116129279|gb|EAA07926.3| AGAP002999-PA [Anopheles gambiae str. PEST]
          Length = 664

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 48/162 (29%), Positives = 86/162 (53%), Gaps = 12/162 (7%)

Query: 62  SLPSLAC-GESHTLMCASNKIKMQTS--NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
           SLP LA  GE+  ++ AS+KIK+  S  + +GR+V+A  D+  G+ +  E   A  +   
Sbjct: 237 SLPKLAGEGENGEILGASSKIKLAGSKEDPRGRYVVAAADLGPGEVILTEPAYAACLHAK 296

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
              ++C+ C + + AP+ C +C    +CS +CR +A   YHR+EC   L L+   G++  
Sbjct: 297 YYGTHCSACFSRLIAPVACPDCCGVAFCSVACRDKACATYHRFECQ-YLDLMIGSGMSIL 355

Query: 177 -HLALKLILVSSHSDRYKEVYHLE-----THLQDMRPEDLYQ 212
            H+AL+++  +   ++  E   +       H +   PED ++
Sbjct: 356 CHVALRMVTQAGTPEKVLEEGKMLRDTFCAHTEHRDPEDHFK 397


>gi|328722737|ref|XP_001943013.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 661

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 63/123 (51%), Gaps = 4/123 (3%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P +  G S      SN + ++ +  +GRH +A  DV  G  L  E+  A V+L   S S+
Sbjct: 227 PEVHGGRSEVYPAFSNAVSVEYNETEGRHAVAGSDVPVGKVLLTERAYASVLLREYSHSH 286

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
           C +C   + AP+PC +C    +C E C+R A   YH  EC   L  L + G++    +AL
Sbjct: 287 CTNCFVRLVAPLPCPDCERVAFCGEFCKRSALESYHAVEC-SMLPALFSAGVSITCLIAL 345

Query: 181 KLI 183
           ++I
Sbjct: 346 RII 348


>gi|357603437|gb|EHJ63766.1| putative set and mynd domain-containing protein [Danaus plexippus]
          Length = 734

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/140 (33%), Positives = 66/140 (47%), Gaps = 9/140 (6%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
           ++ S S S    P L  G +  L   S  +K+   N KGR+ +A   V  GD + VE P 
Sbjct: 221 STQSESPSKLKEPQLTSGATLNLPALSKLLKITEDNEKGRYAVANAPVKTGDIVLVESPY 280

Query: 112 AFVILPPCSMSNCNHCCTSI-----SAPIPCNECILAVYCSESCRREAWLRYHRWECHGA 166
           A  +L  C  S+C HC   +     SAPI C  C    +CS  CR  A   YH +EC   
Sbjct: 281 AACLLADCHGSHCLHCFVRLEDFEDSAPIWCPNCSGVAFCSIQCRDAAISTYHLYEC-PF 339

Query: 167 LRLLEAVG---IAHLALKLI 183
             L    G   ++H+AL+++
Sbjct: 340 FNLFIGSGMSVLSHIALRMV 359


>gi|307207016|gb|EFN84839.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 638

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 16/150 (10%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           L  G S  +   S+ + +  S   GRH IA  D   GD + +E P A VI      ++C+
Sbjct: 231 LTYGSSKEVPATSDGVTVSFSKRYGRHYIATRDFKPGDIVSIEDPYAHVIYEERYYTHCH 290

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV---------GIA 176
           +C +     IPC++C +A YCSE CR+ AW   H  EC   L+LL ++          I+
Sbjct: 291 YCLSRSYNLIPCSKCPIAQYCSEKCRKLAWEVCHMTEC-PILKLLTSLLNVDKDKIRMIS 349

Query: 177 HLALKLILVSSHSDRYKEVYHLETHLQDMR 206
            +   LI+V+++  + +E+       QDM+
Sbjct: 350 KIIRLLIIVTANGSKIEEL------QQDMK 373


>gi|194858456|ref|XP_001969182.1| GG25277 [Drosophila erecta]
 gi|190661049|gb|EDV58241.1| GG25277 [Drosophila erecta]
          Length = 750

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 14/119 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
           +GR+++A E++ KG+ +F E+   FV  P   +  C  C  S+ SAPIPC +C    VYC
Sbjct: 221 RGRYMVAKENISKGNVIFSERASCFV--PLQQLLICQQCAASLMSAPIPCPKCHQRVVYC 278

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
           S  C REA    H++EC    + LL+ +G++HLAL+L+L          + H+  HLQ+
Sbjct: 279 SRKC-REAHSAIHKYECAAYRKDLLKLLGVSHLALRLVLAY--------IPHIMPHLQE 328


>gi|270011320|gb|EFA07768.1| hypothetical protein TcasGA2_TC005322 [Tribolium castaneum]
          Length = 1086

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 70/125 (56%), Gaps = 5/125 (4%)

Query: 71  SHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS 130
           ++++ CA+N ++++      + V+A +D+  G  + VE P    ++    + +C+ C   
Sbjct: 195 NNSIPCATNSVEIKVDEKLRKRVVATKDIQIGQVIAVETPCVAALIN-VVLFHCHDCYIL 253

Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI-AHL--ALKL-ILVS 186
              PIPC  C   VYCSE+CR  A+ +YH+ EC   L + + VGI  H   ALK+ +LV 
Sbjct: 254 CYNPIPCKTCTEVVYCSEACRENAFAKYHQKECPIYLSMRKLVGIDTHFQWALKMTLLVQ 313

Query: 187 SHSDR 191
           + +D+
Sbjct: 314 TQADK 318



 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 63/129 (48%), Gaps = 4/129 (3%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS+ I+++      R V A   +  G+ + VEKP  F  L    + +C+ C      PIP
Sbjct: 722 ASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFT-LAAADLYHCHECYQLCYNPIP 780

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSDRYK 193
           C  C   +YC E CR +A  +YH++EC   + L   VG      LA+K+  + S  +   
Sbjct: 781 CEICSQTLYCGEECRDKAREKYHQYECPILISLKNIVGKHKAFLLAIKMSFMISDENDVP 840

Query: 194 EVYHLETHL 202
           EVY L  +L
Sbjct: 841 EVYALVENL 849


>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 718

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 1/120 (0%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
           E+  L  AS  + ++ +   G+  IA + +  GD L VE P A  +LP    ++C HC +
Sbjct: 232 ENPRLPGASTLLDIEETENAGKRAIAAKPIEPGDRLVVEAPYAATLLPEFFGTHCQHCFS 291

Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
              API C +C    +C   CR  A   YH++EC   L LL   G++ L++  + +++ S
Sbjct: 292 RFVAPIGCPDCSSVAFCGRKCRDAAMASYHKYECK-ILALLIGSGMSVLSMLALRMATQS 350


>gi|158287868|ref|XP_309762.4| AGAP010931-PA [Anopheles gambiae str. PEST]
 gi|157019395|gb|EAA05512.4| AGAP010931-PA [Anopheles gambiae str. PEST]
          Length = 383

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 4/113 (3%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPI 135
           A +K + Q ++  GRHV+    +  GD + +EKP   V         C+ C    +   I
Sbjct: 100 ALSKARNQNASKYGRHVVTKRKLKVGDVVMIEKPFVTVAKETFQYIRCDFCQAKRLFTLI 159

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAH-LALKLILVS 186
           PC  C +A+YCSE C  +A+ +YHR+EC G LR L   +GI+  +AL++I ++
Sbjct: 160 PCEGCTVAMYCSEECISKAYGKYHRYEC-GVLRDLWTVLGISGVIALRMIAIA 211


>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
 gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
          Length = 649

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 26/152 (17%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  +GR      D+     + +EKP   V+L   S+S+C+ C   ++ P+ C +C   ++
Sbjct: 235 SQDEGRFAKTNMDLKPNTIILLEKPHVSVLLEEYSLSHCSTCFKRVTIPVCCPKCSDVIF 294

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSS--------------- 187
           CSE+C ++A   YHR+EC G L +    G +   H+AL++I   +               
Sbjct: 295 CSEACEKKANSSYHRYEC-GFLPIFWKSGASITCHMALRMITQKTEEYFVGLRHELDGLT 353

Query: 188 -------HSDRYKEVYHLETHLQDMRPEDLYQ 212
                  ++D Y++VY L TH     PED +Q
Sbjct: 354 SEVTDKLNNDDYRKVYKLVTHEDKRSPEDYFQ 385


>gi|158284652|ref|XP_307655.3| Anopheles gambiae str. PEST AGAP012638-PA [Anopheles gambiae str.
           PEST]
 gi|157020944|gb|EAA03452.3| AGAP012638-PA [Anopheles gambiae str. PEST]
          Length = 452

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 79/161 (49%), Gaps = 7/161 (4%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           EK  +N++ A +   +++SS  S  V   P L+          +N ++++ +   GRHV+
Sbjct: 95  EKEVKNAL-AKARNKNASSSKVSPDVVEEPELSYPVKKNAPQVANCLELRKNEEYGRHVV 153

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRRE 153
               +  GD + +E+P   V+        C+ C        IPC  C +A+YCSE C  +
Sbjct: 154 TTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHGERPFTLIPCEGCTMAMYCSEECLSK 213

Query: 154 AWLRYHRWECHGALR----LLEAVGIAHLALKLILVSSHSD 190
           A+ +YHR+EC G LR    + E V +  + +  I +++  +
Sbjct: 214 AYNKYHRYEC-GLLRDMWEVFEEVSLIDIRMIAIAITTFDN 253


>gi|194752637|ref|XP_001958627.1| GF12496 [Drosophila ananassae]
 gi|190619925|gb|EDV35449.1| GF12496 [Drosophila ananassae]
          Length = 751

 Score = 70.5 bits (171), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/100 (40%), Positives = 61/100 (61%), Gaps = 6/100 (6%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
           +GR+++A E + +G  +F E+   FV L    +  C  C +S+ SAPIPC +C    VYC
Sbjct: 224 RGRYMVATEAIPQGTAIFSERASCFVPLEQRLI--CQQCASSLMSAPIPCPQCHQKVVYC 281

Query: 147 SESCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV 185
           S  C RE     HR+EC G  + L + +G++HLAL+++LV
Sbjct: 282 SRKC-REGHANIHRYECAGYRMDLFKLLGVSHLALRMVLV 320


>gi|307171148|gb|EFN63141.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 641

 Score = 70.1 bits (170), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 1/105 (0%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           L  G S  +   S+ + +  S   GRH IA  +   GD + +E P A VI      ++C+
Sbjct: 234 LTYGPSQEVPAISDGVTISFSERYGRHYIATREFQPGDIISIEDPYAHVIYEERYYTHCH 293

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           +C +     IPC +C +A YCSE CR+  W   H+ EC   L+LL
Sbjct: 294 YCLSKSYNLIPCAKCPIAQYCSEKCRKLGWEACHKTEC-PILKLL 337


>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
          Length = 706

 Score = 69.7 bits (169), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 20/140 (14%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-------- 121
           E+ ++  AS+ +++     +GRH++A ED+  G +L  E+    V+ P  S         
Sbjct: 132 ENDSISGASSSLRLNFDMDRGRHLVASEDILPGQSLLEEEAFVSVLCPGDSFLLQDSAET 191

Query: 122 ------SN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
                 +N    C+HC   + A +PC  C  A YCS+ C   AW +YHR EC  GAL L 
Sbjct: 192 VWDTQVTNADLYCHHCLRQLLASVPCQGCSYAKYCSQICADVAWQQYHRTECSLGALLLT 251

Query: 171 EAVGIAHLALKLILVSSHSD 190
             V   H+AL+ +L++  ++
Sbjct: 252 LGV-FCHVALRTVLLAGFAE 270


>gi|345482388|ref|XP_001608083.2| PREDICTED: hypothetical protein LOC100124189 [Nasonia vitripennis]
          Length = 493

 Score = 69.7 bits (169), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 31/88 (35%), Positives = 47/88 (53%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
           CAS  + ++ +   GRH++A   +  G+ + +EK     I        C +C     A I
Sbjct: 220 CASTAVSIRYNKKWGRHIVADRRIEPGEVIAIEKSYLTSICLDGMYLFCANCVQQTWASI 279

Query: 136 PCNECILAVYCSESCRREAWLRYHRWEC 163
           PC  CI  VYCS+ C+ EAW +YH++EC
Sbjct: 280 PCESCIYNVYCSQKCKSEAWKKYHQYEC 307


>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
           africana]
          Length = 800

 Score = 69.3 bits (168), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 69/149 (46%), Gaps = 22/149 (14%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSNCNH-- 126
           ES  +  AS+ + + T   KGRH+IA +D+  G+ L   K  AFV +L P  M   +H  
Sbjct: 224 ESEQIPSASSSVSLCTDPLKGRHLIATKDILPGELLV--KEAAFVSVLNPGEMPQLHHGL 281

Query: 127 -----------------CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
                            C     A +PC++C  A YCS+ C ++AW  YH  EC     L
Sbjct: 282 ESKWNTRVTNGDLHCHCCLKHTLATVPCDQCSYAKYCSQECMQQAWELYHTVECPQGGLL 341

Query: 170 LEAVGIAHLALKLILVSSHSDRYKEVYHL 198
           L      H+AL++ L++   D  K V  L
Sbjct: 342 LTLGIFCHIALRVTLLARFEDVGKVVRKL 370


>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
          Length = 739

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 15/145 (10%)

Query: 58  SSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP 117
           +S   L  +A   +  +  AS  + +     KGRH++A +++ +G+ L  E+  A VI+P
Sbjct: 180 TSTERLKEMALDRNPHITNASCALHLNFDTRKGRHLLASQNIEQGEVLIWEEAFASVIIP 239

Query: 118 P-----------CSMSNCNH----CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWE 162
                         ++ C+H    C   + A +PC  C  A YCS+ C  +AW  YH  E
Sbjct: 240 ERKQWRKEIKWDTRITACDHYCHYCLNRVIASLPCQYCSFARYCSQECMDKAWRSYHYIE 299

Query: 163 CHGALRLLEAVGIAHLALKLILVSS 187
           C     LL      H AL+ +LV+ 
Sbjct: 300 CSMGDLLLALGMFCHTALRAVLVAG 324


>gi|340728154|ref|XP_003402393.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Bombus terrestris]
          Length = 350

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 2/104 (1%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSM 121
           +PSL+  E+  +   S+ +++  +   G +  A + +  GD + +EKP    +LPP  ++
Sbjct: 82  IPSLSHDENKDIPDMSDAVRLVHNKKYGVNFEATKPIGTGDVIPIEKPQVTSVLPPDVAI 141

Query: 122 SN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           +N C++C    SA +PC  C  A+YCS+ CR +A+  YHR+ C+
Sbjct: 142 ANVCHYCLKRYSALLPCERCNSALYCSKQCRAQAFEEYHRFHCN 185


>gi|24652121|ref|NP_724802.1| CG1868, isoform B [Drosophila melanogaster]
 gi|21645544|gb|AAM71072.1| CG1868, isoform B [Drosophila melanogaster]
          Length = 751

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
           +GR+++A E + KG+ +F E+   FV  P   +  C  C  T +SAPIPC  C    VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 282

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           S  C REA    H++EC    + +L  +GI+HLAL+L+L          + ++  HLQ+M
Sbjct: 283 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 333


>gi|54650928|gb|AAV37042.1| AT13626p [Drosophila melanogaster]
          Length = 751

 Score = 68.9 bits (167), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
           +GR+++A E + KG+ +F E+   FV  P   +  C  C  T +SAPIPC  C    VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 282

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           S  C REA    H++EC    + +L  +GI+HLAL+L+L          + ++  HLQ+M
Sbjct: 283 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 333


>gi|19921918|ref|NP_610500.1| CG1868, isoform A [Drosophila melanogaster]
 gi|7303878|gb|AAF58923.1| CG1868, isoform A [Drosophila melanogaster]
 gi|15291779|gb|AAK93158.1| LD26240p [Drosophila melanogaster]
 gi|220945802|gb|ACL85444.1| CG1868-PA [synthetic construct]
 gi|220955560|gb|ACL90323.1| CG1868-PA [synthetic construct]
          Length = 718

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
           +GR+++A E + KG+ +F E+   FV  P   +  C  C  T +SAPIPC  C    VYC
Sbjct: 192 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 249

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           S  C REA    H++EC    + +L  +GI+HLAL+L+L          + ++  HLQ+M
Sbjct: 250 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 300


>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Oreochromis niloticus]
          Length = 725

 Score = 68.6 bits (166), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/170 (24%), Positives = 75/170 (44%), Gaps = 30/170 (17%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           EK  E+  +  S     + ++ S SV  L    C           ++ + ++  KGRH++
Sbjct: 117 EKGREDDQKPASNDCKGSDTANSPSVGPLTVGIC----------PQVVVDSTVEKGRHLV 166

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSM--------------------SNCNHCCTSISAP 134
           A   +  G+ +  ++P + V++P                          C+ C T   + 
Sbjct: 167 AANRIAAGEVILTDRPYSCVLIPGMEEVKGKVGRYSRDSEVLFGVEDRRCHRCLTETLSL 226

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
           +PC+ C  + YCS +C++EAW  +HRWEC     L+    ++ LAL++ L
Sbjct: 227 VPCDGCSYSRYCSTACQQEAWEEHHRWECALGAHLMAMGVMSQLALRVAL 276


>gi|158287327|ref|XP_309378.2| AGAP011267-PA [Anopheles gambiae str. PEST]
 gi|157019597|gb|EAA05164.2| AGAP011267-PA [Anopheles gambiae str. PEST]
          Length = 540

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 59/120 (49%), Gaps = 2/120 (1%)

Query: 50  SSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEK 109
           +++SS  S  V   P L+          +N ++++ +   GRHV+    +  GD + +E+
Sbjct: 109 NASSSKISPDVVEEPELSYPSKENAPQVANCLELRKNEEYGRHVVTTRKLKVGDVVMIER 168

Query: 110 PVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
           P   V+        C+ C        IPC  C  A+YCSE C  +A+ +YHR+EC G LR
Sbjct: 169 PFVTVLRDSLRYVRCDFCHEERPFTLIPCEGCTAAMYCSEECLSKAYNKYHRYEC-GLLR 227


>gi|345486792|ref|XP_003425557.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 719

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 54/101 (53%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC 128
           G   T  C S+ I +  S   G+ ++A  D+  G+ +FV++     +    + + C HC 
Sbjct: 251 GTMVTETCTSSGINVSYSPDSGQQLVASHDLQPGEIIFVQQSYVTSVNTNKACAYCCHCM 310

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           T     IPC+ C L +YCS+ C+ EAW +YH  EC+ A+ +
Sbjct: 311 TPTWCTIPCDHCSLNMYCSKQCKDEAWNKYHDIECNMAMHM 351


>gi|340720355|ref|XP_003398606.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 485

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 63/114 (55%), Gaps = 2/114 (1%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
           T  +  +    +PSL+  E+  +   S+ +++  +   G +  A + +  GD + +EKP 
Sbjct: 58  TKQNNRNHFEDIPSLSHDENKDIPGMSDAVRLVHNKKYGVNFEATKPIGTGDVILIEKPQ 117

Query: 112 AFVILPP-CSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              +LPP  +++N C++C    SA +PC  C  A+YCS+ CR +A+  +HR +C
Sbjct: 118 VTSVLPPDVAIANVCHYCLKRYSALLPCERCNSALYCSKECRAKAYEEHHRIQC 171


>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 676

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 47/153 (30%), Positives = 66/153 (43%), Gaps = 31/153 (20%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH +A +D+  G+TL +EKP   V+L    +SNC+HC   I  PIP  C+ C    YCS
Sbjct: 251 GRHAVATKDIEPGETLVIEKPHCAVLLAEYRLSNCHHCFAKIFVPIPTSCDTCNFVAYCS 310

Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVS------------------- 186
             CR +     H+ EC    +L   E      LALK I+                     
Sbjct: 311 IPCRNKD-AEIHKNECMILPSLWFSETSVNCFLALKAIVQKPFEELLALKDKLKATKGRF 369

Query: 187 -------SHSDRYKEVYHLETHLQDMRPEDLYQ 212
                     D ++ +Y L TH ++   EDL+ 
Sbjct: 370 ETSTQRPRRHDDFEAIYGLITHEEERTSEDLFH 402


>gi|195332853|ref|XP_002033107.1| GM20595 [Drosophila sechellia]
 gi|194125077|gb|EDW47120.1| GM20595 [Drosophila sechellia]
          Length = 751

 Score = 67.8 bits (164), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 14/120 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
           +GR+++A E + KG+ +F E+   FV L    +  C HC  S+ SAPIPC  C    VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFVPLEQQLI--CQHCAASLMSAPIPCPNCHQRVVYC 282

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           S  CR    +  H++EC    + +L  +GI+HLAL+L+L          + ++  HLQ+M
Sbjct: 283 SRKCRDPQSV-IHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIMPHLQEM 333


>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
          Length = 596

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 69/152 (45%), Gaps = 25/152 (16%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
           ++  +GR + AV ++  G+ + +EKP A V+L     ++C+HC       IPC  C  A 
Sbjct: 184 STENEGRFITAVRNIAPGEIVLIEKPFASVLLRANYSNHCHHCLKHTLEGIPCRTCPDAR 243

Query: 145 YCSESCRREAWLRYHRWEC-------------HGALRLLEAVGIAHLALK------LILV 185
           +CSE+CR  A   YH++EC              G L        ++ +LK      L L 
Sbjct: 244 FCSEACRDTAMQTYHQYECSVLNTLHHSQINKFGCLAFRAITKQSYQSLKDIRAQDLPLN 303

Query: 186 SSHSD------RYKEVYHLETHLQDMRPEDLY 211
             HSD       Y  +  L TH +D   +DL+
Sbjct: 304 GCHSDGLYRPQDYNTIIQLVTHAKDRPVQDLF 335


>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
           harrisii]
          Length = 802

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 62/127 (48%), Gaps = 22/127 (17%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN---------------- 123
           + +++  ++GR++IA EDV  G+ L  E+  AFV +L P   +                 
Sbjct: 235 VSLRSDPSRGRYLIATEDVFPGELLVKEE--AFVSVLNPGETAGLRRGLEAQRDGRATPG 292

Query: 124 ---CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
              C+ C   I A +PC  C  A YCS+ C + AW RYHR EC     LL      H+AL
Sbjct: 293 DCHCHRCLKPIVATVPCEGCSYAKYCSQQCLQVAWERYHRVECPLGGTLLTLGVFCHVAL 352

Query: 181 KLILVSS 187
           + +L++ 
Sbjct: 353 RTVLLAG 359


>gi|328777896|ref|XP_001120776.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 633

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 49/98 (50%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
           L  G S      S+ I +  S   GRH++  ++   GD + +E P A+VI      ++C+
Sbjct: 227 LIYGPSKEAPAISDGISISFSEKYGRHLVVTKEFKPGDIITIEDPYAYVIYTQRYYTHCH 286

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           HC +     IPC  C +A YCSE CR  AW   H  EC
Sbjct: 287 HCLSRSYNLIPCLHCPVAQYCSEKCRILAWEMAHDIEC 324


>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
          Length = 802

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 69/148 (46%), Gaps = 18/148 (12%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
           GE+  L  AS+ + +     KGR+++A +D+  G+ L  E     V+    LPP      
Sbjct: 223 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 282

Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
                 ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL
Sbjct: 283 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 342

Query: 171 EAVGIAHLALKLILVSSHSDRYKEVYHL 198
                 H+AL+L LV    D  K V  L
Sbjct: 343 TLGVFCHIALRLTLVVGVEDVRKIVKKL 370


>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
           [Acyrthosiphon pisum]
          Length = 645

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 80/169 (47%), Gaps = 7/169 (4%)

Query: 23  SHLDFKEDVLTIEKSE--ENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNK 80
           SHL   E +++ EK +  E  ++    + +   ++      ++  L  G +  +   S  
Sbjct: 181 SHLS--EAIISKEKKKGIEMQIKNFLRETNKKETNNKQDKLNVIKLFGGPNKNIPALSKF 238

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +KM+ S + GR ++   D++ G+ L +EKP   V+       NC +C       IPC +C
Sbjct: 239 VKMKYSESMGRCLVVSSDINPGEVLAIEKPYVGVLRRESYEYNCRNCFKRCLNGIPCLKC 298

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVS 186
            L +YC E+CR +++   H++EC          G   + HL+L + L S
Sbjct: 299 TLVIYCDETCRIKSYESGHKYECSLFSTFNNWPGMDHMEHLSLNIFLKS 347


>gi|158287331|ref|XP_309383.4| AGAP011257-PA [Anopheles gambiae str. PEST]
 gi|157019599|gb|EAA05326.4| AGAP011257-PA [Anopheles gambiae str. PEST]
          Length = 540

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 83/182 (45%), Gaps = 8/182 (4%)

Query: 14  LKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHT 73
           LK  RL   S  ++  + L   + E  +  A +   +++SS  S  V   P L+      
Sbjct: 75  LKNIRLARES--NYSGEKLNQREKEVKNALAKARNKNASSSKISPDVVEEPELSYPAKEN 132

Query: 74  LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSIS 132
               +N ++++ +   GRHV+    +  GD + +E+P   V+        C+ C      
Sbjct: 133 APQVANCLELRKNEEYGRHVVTTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHGERPF 192

Query: 133 APIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL----EAVGIAHLALKLILVSSH 188
             IPC  C  A+YCSE C  +A+  YHR++C G LR L    E V +  + +  I +++ 
Sbjct: 193 TLIPCEGCTAAMYCSEECLSKAYNNYHRYDC-GILRDLYEDFEEVSLIDIRMIAIAITTF 251

Query: 189 SD 190
            +
Sbjct: 252 DN 253


>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
          Length = 802

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 20/143 (13%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
           E+  +  AS+ + +     KGR+++A +D+  G+ L  E     V+ P      P  + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
                       C+ C     AP+PC+ C  A YCS+ C ++AW RYH  EC  GAL L 
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343

Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
             V   H+AL+  L++   D  K
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGK 365


>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
          Length = 802

 Score = 67.4 bits (163), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 20/143 (13%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
           E+  +  AS+ + +     KGR+++A +D+  G+ L  E     V+ P      P  + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
                       C+ C     AP+PC+ C  A YCS+ C ++AW RYH  EC  GAL L 
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343

Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
             V   H+AL+  L++   D  K
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGK 365


>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
          Length = 802

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
           E+  +  AS+ + +     KGR ++A +D+  G+ L  E     V+ P      P S+ +
Sbjct: 224 ENEQIPGASSSVTLHIDPLKGRFLVATKDILPGELLVKEDAFVSVLNPGEVPLRPHSLES 283

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
                       C+ C     AP+PC  C  A YCS+ C ++AW RYH  EC  GAL L 
Sbjct: 284 KWEIQATSGDLHCHRCLRHTLAPVPCEGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343

Query: 171 EAVGIAHLALKLILVSSHSDRYKEVYHL 198
             V   H+AL+  L++   D  K +  L
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGKVIKKL 370


>gi|195475072|ref|XP_002089809.1| GE22225 [Drosophila yakuba]
 gi|194175910|gb|EDW89521.1| GE22225 [Drosophila yakuba]
          Length = 753

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 62/105 (59%), Gaps = 6/105 (5%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC- 140
           +  S  +GR+++A + + KG  +F E+   FV L    +  C  C  S+ SAPIPC +C 
Sbjct: 219 LTDSGQRGRYMVAKDTISKGKVIFSERASCFVPLEQHLI--CQQCAASLMSAPIPCPKCH 276

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
              VYCS  C REA    H++EC    + LL+ +G++HLAL+L+L
Sbjct: 277 QRVVYCSRKC-REAHSAIHKFECAAYRKNLLKLLGVSHLALRLVL 320


>gi|345496800|ref|XP_001601864.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 607

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 4/95 (4%)

Query: 71  SHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCC 128
           S  + CA+  + +  + + GRH+IA  D+  G+ +  E+   + + P        C+HC 
Sbjct: 178 SQKIPCAAESVTIAYNESIGRHLIATRDIKPGEVIIAEE--GYAVFPKIKKMYLFCSHCL 235

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           T     IPC+ C LA+YCSE C+++A   YH  EC
Sbjct: 236 TFAWNGIPCDNCALALYCSEECKKKALEEYHDVEC 270


>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
          Length = 666

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 49/97 (50%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           GR   A+ED   G  +  E P   VI    ++ NC  CC S   P+ C  C  AV+CS +
Sbjct: 236 GRFARALEDFDTGVIIVEETPHCAVISQENALMNCQFCCISTQQPVACRNCGHAVFCSLN 295

Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
           C R+A L +H++EC     L  A    + A+ L ++S
Sbjct: 296 CERQANLTFHKYECKAQPVLFHAGASINCAMALRMIS 332


>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 704

 Score = 67.0 bits (162), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 81/168 (48%), Gaps = 12/168 (7%)

Query: 30  DVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMC-ASNKIKMQTSNA 88
           D +T E  +   ++ +  + S+ +  + +    +P     E  +++  AS  +  +    
Sbjct: 125 DSITGEDKKRQEIKTLLQESSNPTYVSFADHFDIPDEEIKEKSSMLANASAAVTCKHQEG 184

Query: 89  KGRHVIAVEDVHKGDTLFV--------EKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           KGR + A  D+  GD L          EKP A +IL     S+C+ C    S PIPC+ C
Sbjct: 185 KGRFLEATRDIAAGDRLLKAVSELLIKEKPYAAIILKEEESSHCHQCFEQCS-PIPCSNC 243

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI-AHLALKLILVSS 187
           I A YCS  CR +   +YH  EC G   LL+ V + + L+L++++ + 
Sbjct: 244 IHARYCSSRCRSDCLSQYHSIEC-GTEGLLQQVSVFSRLSLRILITAG 290


>gi|170048026|ref|XP_001851501.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870252|gb|EDS33635.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 521

 Score = 66.6 bits (161), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 42/76 (55%), Gaps = 1/76 (1%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSE 148
           GR + A  D   G+ +  E P   VI P  + + CNHC        IPC  C  A+YCSE
Sbjct: 172 GRGLTAKMDFLVGEIVLHENPSLVVIEPEVTFTRCNHCGRRNEYDLIPCKTCSSAMYCSE 231

Query: 149 SCRREAWLRYHRWECH 164
            CR+EA+ +YHR+EC 
Sbjct: 232 QCRKEAFTKYHRFECE 247


>gi|345486839|ref|XP_003425566.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 621

 Score = 66.6 bits (161), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 48/83 (57%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +K++     GRH++A  D+  G+ ++VE+P    +      + C+HC T+  + +PC+ C
Sbjct: 202 VKIEHDEKWGRHLVATRDIKPGEVIYVEEPYTKCLTIKHLRAYCSHCLTTTWSNVPCDHC 261

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
              ++CSE+C+  AW  YH  EC
Sbjct: 262 SWTMFCSEACKDLAWKNYHSSEC 284


>gi|444516396|gb|ELV11145.1| SET and MYND domain-containing protein 4 [Tupaia chinensis]
          Length = 536

 Score = 66.6 bits (161), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 23/175 (13%)

Query: 39  ENSVEAMSMQVSSTSSSASSSVASLPSLACG-----ESHTLMCASNKIKMQTSNAKGRHV 93
           + ++  + M+V        +S A+L   + G     E+  +  AS  + ++T   KGRH+
Sbjct: 103 QENLHRLKMKVQEKKILTKTSPAALAKASNGHDLREENEQISSASPSVGLRTHPLKGRHL 162

Query: 94  IAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN------------CNHCCTSISAPI 135
           IA +D+  G+ L  E     V+    +PP   S+ N            C+ C     A +
Sbjct: 163 IATKDILPGELLVKEDAFVSVLNPGDMPPLRHSLENKWDTRVTNGDLYCHRCLKHTLATV 222

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
           PC+ C  A YCS +C ++AW  YH  EC     LL      H+AL+  L++   D
Sbjct: 223 PCDGCSYAKYCSHACTQQAWELYHSTECSLGGLLLALGVFCHVALRATLIARFED 277


>gi|91089701|ref|XP_974846.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 561

 Score = 66.6 bits (161), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 63/129 (48%), Gaps = 4/129 (3%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS+ I+++      R V A   +  G+ + VEKP  F  L    + +C+ C      PIP
Sbjct: 197 ASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFT-LAAADLYHCHECYQLCYNPIP 255

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSDRYK 193
           C  C   +YC E CR +A  +YH++EC   + L   VG      LA+K+  + S  +   
Sbjct: 256 CEICSQTLYCGEECRDKAREKYHQYECPILISLKNIVGKHKAFLLAIKMSFMISDENDVP 315

Query: 194 EVYHLETHL 202
           EVY L  +L
Sbjct: 316 EVYALVENL 324


>gi|71897069|ref|NP_001025886.1| SET and MYND domain-containing protein 4 [Gallus gallus]
 gi|82125404|sp|Q5F3V0.1|SMYD4_CHICK RecName: Full=SET and MYND domain-containing protein 4
 gi|60098707|emb|CAH65184.1| hypothetical protein RCJMB04_6f24 [Gallus gallus]
          Length = 742

 Score = 66.2 bits (160), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 22/126 (17%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------------CSMSN----CN 125
           S  +GRH++A +D+  G  L  EK    V+ P                   ++N    C+
Sbjct: 237 STERGRHLVASQDILPGQNLLKEKAFVSVLCPGEGDSLLLQDSSETVWDTRVTNADLYCH 296

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALKLIL 184
           HC   + A IPC  C  A YCS++C   AW +YHR EC  GAL L   V   H+AL+ +L
Sbjct: 297 HCLKQLLASIPCCGCSYAKYCSQNCADVAWEQYHRTECPLGALLLTLGV-FFHVALRTVL 355

Query: 185 VSSHSD 190
           ++  S+
Sbjct: 356 LAGFSE 361


>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
          Length = 806

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
           GE+  L  AS+ + +     KGR+++A +D+  G+ L  E     V+    LPP      
Sbjct: 227 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 286

Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
                 ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL
Sbjct: 287 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 346

Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
                 H+AL+L LV    D  K
Sbjct: 347 TLGVFCHIALRLTLVVGVEDVRK 369


>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
           mulatta]
          Length = 802

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
           GE+  L  AS+ + +     KGR+++A +D+  G+ L  E     V+    LPP      
Sbjct: 223 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 282

Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
                 ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL
Sbjct: 283 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 342

Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
                 H+AL+L LV    D  K
Sbjct: 343 TLGVFCHIALRLTLVVGVEDVRK 365


>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
          Length = 806

 Score = 66.2 bits (160), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
           GE+  L  AS+ + +     KGR+++A +D+  G+ L  E     V+    LPP      
Sbjct: 227 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 286

Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
                 ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL
Sbjct: 287 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 346

Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
                 H+AL+L LV    D  K
Sbjct: 347 TLGVFCHIALRLTLVVGVEDVRK 369


>gi|156548936|ref|XP_001606913.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 612

 Score = 66.2 bits (160), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 35/156 (22%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           + +Q +   GRH+IA   +  G+ + V KP    +      + C HC  +  A IPC+ C
Sbjct: 193 VSIQHNEKFGRHLIANRYIKPGEIIMVIKPYIKCLNLKNMHAFCGHCLKTSWATIPCDYC 252

Query: 141 ILAVYCSESCRREAWLRYHRWEC----------------HGALR-----LLEAVGIAHLA 179
              ++CSE C++EAW +YH  EC                  ALR     + EA GI +L 
Sbjct: 253 NWCMFCSEDCKQEAWQQYHDIECPVIPYIMFDQLGDYWKQLALRSTVMAIREAGGIQNLK 312

Query: 180 LKLILVSS--------------HSDRYKEVYHLETH 201
            +L LV +              ++DR++ +Y L  H
Sbjct: 313 EELDLVDNCNSKLHKGFLNGKFYNDRFQSIYSLSDH 348


>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 650

 Score = 66.2 bits (160), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 26/152 (17%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S+ +GR      D+     + VEKP   V+L   S ++C+ C   +S P+ C +C   V+
Sbjct: 236 SSEEGRFAKTNVDLKPNTIVLVEKPHVSVLLEEYSKTHCSTCFKRVSVPVCCPKCSDVVF 295

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-----------------LV 185
           CSE C   A   YH++EC G L +    G +   H+AL++I                 L 
Sbjct: 296 CSEDCESSANSGYHKYEC-GFLPIFWKSGASITCHMALRIITQQSEEYFLQLRPELDGLT 354

Query: 186 SSHSDR-----YKEVYHLETHLQDMRPEDLYQ 212
           S  +D+     Y+++Y L TH +    ED +Q
Sbjct: 355 SEQTDKLKHDDYRKIYKLVTHEETRSAEDFFQ 386


>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
 gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
          Length = 650

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 26/154 (16%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           + ++GR       +   + L  E P A V++  CS S+C+ CC+ +     C  C+  VY
Sbjct: 235 TESEGRFARTRNHLKPNNVLLKELPHASVVMSECSGSHCDQCCSRVEVLFSCPNCVDVVY 294

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVSSHS------------- 189
           CSE C+++A   +H +EC G L  L   G   ++ LAL+++   S               
Sbjct: 295 CSEECQKQAISGHHAFEC-GFLSFLRNSGANVVSMLALRIVSQKSEKYFYELQDELDNLQ 353

Query: 190 ---------DRYKEVYHLETHLQDMRPEDLYQYV 214
                    D Y++VY+  TH Q    ED  ++ 
Sbjct: 354 NDFVDSLFFDDYRKVYNFVTHGQQRNAEDYLKWT 387


>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Strongylocentrotus purpuratus]
          Length = 679

 Score = 65.9 bits (159), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 56/97 (57%), Gaps = 3/97 (3%)

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREA 154
           A  D+  G+ L  EKP A +IL     S+C+HC    S PIPC  CI A YCS  CR ++
Sbjct: 178 ATRDIAAGELLIKEKPYAAIILKEEESSHCHHCFEQCS-PIPCPNCIHARYCSSRCRSDS 236

Query: 155 WLRYHRWECHGALRLLEAVGI-AHLALKLILVSSHSD 190
             +YH  EC G  +LL+ V + + L+L++++ +   +
Sbjct: 237 LTQYHSIEC-GTEQLLQQVSVFSRLSLRILITAGREE 272


>gi|195581916|ref|XP_002080775.1| GD10068 [Drosophila simulans]
 gi|194192784|gb|EDX06360.1| GD10068 [Drosophila simulans]
          Length = 751

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 67/120 (55%), Gaps = 14/120 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
           +GR+++A E + KG+ +F E+   FV L    +  C  C  S+ SAPIPC  C    V+C
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFVPLEQQLI--CQQCAASLMSAPIPCPNCHQRVVFC 282

Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           S  C R+A    H++EC    + +L  +GI+HLAL+L+L          + ++  HLQ+M
Sbjct: 283 SRKC-RDAQSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIMPHLQEM 333


>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Monodelphis domestica]
          Length = 845

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 18/129 (13%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP---------PCSMSNC--- 124
           AS+ + + +  ++GR+++A ED+  G+ L  E+    V+ P         P +  NC   
Sbjct: 229 ASSSVSLCSDPSRGRYLVATEDILPGELLVKEEAFVSVLNPGETSWLRSDPGAKWNCRVT 288

Query: 125 ------NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
                 + C   + A IPC  C  A YCS+ C ++AW  YHR EC     LL      H+
Sbjct: 289 TGDLHCHRCLKPVWATIPCQGCSYARYCSQQCMQQAWECYHRTECSLGGTLLALGVFCHV 348

Query: 179 ALKLILVSS 187
           AL+ +L++ 
Sbjct: 349 ALRTVLLAG 357


>gi|189240077|ref|XP_971167.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 637

 Score = 65.5 bits (158), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 3/105 (2%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCCTSISAPI 135
           AS+KI ++TS   GR+V++  D+  G+ L  E P A   L P    ++C HC   + AP+
Sbjct: 228 ASSKITLKTSPDVGRYVVSNVDIATGEILVAE-PAAVACLNPEKFGTHCQHCFARLLAPV 286

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
            C  C    +CS  CR +A   YH++EC     LL   G++ L+L
Sbjct: 287 GCPHCSSVAFCSPKCRDDAITTYHKYECK-FFDLLLGSGMSVLSL 330


>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
           gorilla gorilla]
          Length = 804

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR+++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLPSASSSIGLCVDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
          Length = 802

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 18/139 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
           E+  +  AS+ + +     KGR+++A +D+  G+ L  E     V+ P      P  + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     AP+PC+ C  A YCS+ C ++AW RYH  EC     LL 
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343

Query: 172 AVGIAHLALKLILVSSHSD 190
                H+AL+  L++   D
Sbjct: 344 LGLFCHVALRSTLLARFED 362


>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
          Length = 800

 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 69/147 (46%), Gaps = 17/147 (11%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP-CSMSN 123
           GE+  +  AS+ + ++T   +GR+++A +D+  G+ L  E     V+    +PP C + +
Sbjct: 223 GENGQVPSASSSVSLRTDPLRGRYLVATKDILPGELLVKEDAFVSVLHPGEMPPLCHLES 282

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     A +PC+ C  A YCS  C ++AW  YH  EC     LL 
Sbjct: 283 KWDTRVTNGDLYCHRCLDHTLATVPCDGCSYAKYCSHDCMQQAWDLYHSVECSLGGLLLT 342

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+  LV+   D  K +  L
Sbjct: 343 LGVFCHIALRSTLVARFEDASKVIRKL 369


>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
           garnettii]
          Length = 749

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 72/158 (45%), Gaps = 24/158 (15%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
           E+  +  AS+ + + T   KGR++IA  D+  G+ L  E   AFV +L P  M       
Sbjct: 226 ENEQISSASSSVTLCTDPLKGRYLIATRDILPGELLVKED--AFVSVLNPGEMQPLHHGL 283

Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
                         C+ C     A +PC+ C    YCS+ C ++AW  YH  EC     L
Sbjct: 284 ESKWRTRVTNGDLYCHRCLRHTLATVPCDGCSYVKYCSQECLQQAWELYHSIECSLGGLL 343

Query: 170 LEAVGIAHLALKLILVSSHSDRYKEVYHL--ETHLQDM 205
           L      H+AL+L L++   D  K +  L  ET L+D+
Sbjct: 344 LTLGVFCHIALRLTLLARFEDVGKIIRKLRDETSLKDI 381


>gi|350410051|ref|XP_003488929.1| PREDICTED: hypothetical protein LOC100742343 [Bombus impatiens]
          Length = 788

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
           T  +  +    +PSL+  E+  +   S+ +++  S   G +  A + +  GD + +EKP 
Sbjct: 140 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 199

Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           V  +I     ++  C +C     A +PC  C  A+YCS+ CR +A+  YHR++C+
Sbjct: 200 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 254



 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
           T  +  +    +PSL+  E+  +   S+ +++  S   G +  A + +  GD + +EKP 
Sbjct: 380 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 439

Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           V  +I     ++  C +C     A +PC  C  A+YCS+ CR +A+  YHR++C+
Sbjct: 440 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 494


>gi|270014900|gb|EFA11348.1| hypothetical protein TcasGA2_TC010888 [Tribolium castaneum]
          Length = 1112

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 49/97 (50%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           GR   A+ED   G  +  E P   VI    ++ NC  CC S   P+ C  C  AV+CS +
Sbjct: 236 GRFARALEDFDTGVIIVEETPHCAVISQENALMNCQFCCISTQQPVACRNCGHAVFCSLN 295

Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
           C R+A L +H++EC     L  A    + A+ L ++S
Sbjct: 296 CERQANLTFHKYECKAQPVLFHAGASINCAMALRMIS 332


>gi|328696607|ref|XP_003240079.1| PREDICTED: n-lysine methyltransferase SMYD2-B-like [Acyrthosiphon
           pisum]
          Length = 407

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 43/74 (58%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           GR + A +++  G+ L +EKP+A V      M NCN+C     + IPC++C   VYC E+
Sbjct: 2   GRGLFASKNIKPGEILIIEKPIAGVFKNSMWMFNCNYCFQRCLSAIPCSKCSQVVYCDET 61

Query: 150 CRREAWLRYHRWEC 163
           C R+A   YH  EC
Sbjct: 62  CLRKAHTCYHGIEC 75


>gi|170040660|ref|XP_001848110.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864249|gb|EDS27632.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 574

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 2/106 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
           + M+   A GR ++A  +   GD +  EKPV  V+  P    NC+HC  S   + IPC  
Sbjct: 177 VGMKQYPASGRGLVAERNFKAGDVILDEKPVVGVVAWPYKYLNCSHCGISNQHSLIPCPN 236

Query: 140 CILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLIL 184
           C++ +YCSE C  E     HR+EC  GA         ++ ALKL  
Sbjct: 237 CVMYMYCSEECLAEDQRLTHRFECGFGAQAENTTFNSSNNALKLFF 282


>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 676

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH +A +D+  G+TL +E+P   V+L    ++NC+HC   I  PIP  C+ C    YCS
Sbjct: 251 GRHAVATKDIEPGETLVIERPHCAVLLAEYRLTNCHHCFKKIFVPIPTSCDTCNFVAYCS 310

Query: 148 ESCRREAWLRYHRWEC 163
             CR +     H+ EC
Sbjct: 311 ILCRNKD-AEIHKNEC 325


>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
           caballus]
          Length = 802

 Score = 64.3 bits (155), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/140 (31%), Positives = 65/140 (46%), Gaps = 20/140 (14%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP----PCSMS--- 122
           E+  +  AS  + + T   KGR++IA +D+  G+ L  E     V+ P    P  +    
Sbjct: 224 ENKQISGASASVSLCTDPFKGRYLIATKDILPGELLVKEDAFVSVLNPAEVLPLHLGLES 283

Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
                       C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC  GAL L 
Sbjct: 284 KWDTRVTNGDLYCHRCLKRSLAPVPCDGCSYAKYCSQECMQQAWDLYHNIECSLGALLLT 343

Query: 171 EAVGIAHLALKLILVSSHSD 190
             V   H+AL+  L++   D
Sbjct: 344 LGV-FCHVALRSTLLARFED 362


>gi|350410049|ref|XP_003488928.1| PREDICTED: hypothetical protein LOC100742217 [Bombus impatiens]
          Length = 987

 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
           T  +  +    +PSL+  E+  +   S+ +++  S   G +  A + +  GD + +EKP 
Sbjct: 140 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 199

Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           V  +I     ++  C +C     A +PC  C  A+YCS+ CR +A+  YHR++C+
Sbjct: 200 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 254



 Score = 64.3 bits (155), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
           T  +  +    +PSL+  E+  +   S+ +++  S   G +  A + +  GD + +EKP 
Sbjct: 386 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 445

Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           V  +I     ++  C +C     A +PC  C  A+YCS+ CR +A+  YHR++C+
Sbjct: 446 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 500



 Score = 52.8 bits (125), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 2/65 (3%)

Query: 102 GDTLFVEKP-VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYH 159
           GD + +EKP V  +I     ++  C +C     A +PC  C  A+YCS+ CR +A+  YH
Sbjct: 682 GDVILIEKPQVTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYH 741

Query: 160 RWECH 164
           R++C+
Sbjct: 742 RFQCN 746


>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
 gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
 gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
          Length = 804

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 66/142 (46%), Gaps = 18/142 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ + +     KGR+++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSSASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYK 193
                H+AL+L L+    D  K
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRK 365


>gi|380014448|ref|XP_003691244.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 661

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 40/159 (25%)

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
           ASLP L  GE+  L  AS  + ++ +   G+ VIA +++  GD L +E P A ++LP   
Sbjct: 117 ASLPVLKGGENPRLPGASTLLDVEETANAGKRVIAAKNIEPGDRLIIESPHAAILLPEFF 176

Query: 121 MSNCNHC-----------------------------------CTSI----SAPIPCNECI 141
            ++C HC                                   C +I     API C +C 
Sbjct: 177 GTHCQHCFSRKESDLQFSLICNTAYAFTTYETISCVKTNSFVCLTIIFRFKAPIGCPDCS 236

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
              +C   CR  A   YH++EC   L LL   G++ L++
Sbjct: 237 SVAFCGRKCRDAALAGYHKYECK-ILALLIGSGMSVLSM 274


>gi|198459501|ref|XP_002138697.1| GA24934 [Drosophila pseudoobscura pseudoobscura]
 gi|198136711|gb|EDY69255.1| GA24934 [Drosophila pseudoobscura pseudoobscura]
          Length = 760

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 6/102 (5%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILA 143
           S ++GR++IA E + +G  +F E+   FV L    +  C  C  S+  APIPC +C    
Sbjct: 227 SGSRGRYMIAKEAIPQGKIIFREQATCFVPLEQRLI--CQQCAASLLCAPIPCPQCRQRV 284

Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
           VYCS  C RE     H++EC    + LL+ VG++HLAL+++L
Sbjct: 285 VYCSRQC-RERHKPIHKFECAAYRKDLLKMVGVSHLALRMVL 325


>gi|195172794|ref|XP_002027181.1| GL20110 [Drosophila persimilis]
 gi|194112994|gb|EDW35037.1| GL20110 [Drosophila persimilis]
          Length = 760

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 6/102 (5%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILA 143
           S ++GR++IA E + +G  +F E+   FV L    +  C  C  S+  APIPC +C    
Sbjct: 227 SGSRGRYMIAKEAIPQGKIIFREQATCFVPLEQRLI--CQQCAASLLCAPIPCPQCRQRV 284

Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
           VYCS  C RE     H++EC    + LL+ VG++HLAL+++L
Sbjct: 285 VYCSRQC-RERHKPIHKFECAAYRKDLLKMVGVSHLALRMVL 325


>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
          Length = 500

 Score = 63.9 bits (154), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 3/89 (3%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP- 136
           S  + ++++  KG H++A++    GDT+   +P AF I P      CN C    +  I  
Sbjct: 43  SFSVGVRSTENKGSHLVALKAFRPGDTVLRSRPYAFEIFPELREERCNECFRRPAEGISL 102

Query: 137 --CNECILAVYCSESCRREAWLRYHRWEC 163
             C+ C +  YC + C+  AW R H++EC
Sbjct: 103 LRCSSCKITRYCGKECQARAWKRSHKYEC 131


>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
 gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
          Length = 801

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 81/177 (45%), Gaps = 27/177 (15%)

Query: 39  ENSVEAMSMQVSSTSSSASSSVASLPS----LACGESHT-LMCASNKIKMQTSNAKGRHV 93
           + +V+ + M++    +   S  A+L      +A GE +  +  AS  + + T   KGRH+
Sbjct: 188 QRNVQQLKMKIQEKETLPESFPAALTKALEDMALGEENKQISGASLSVSLCTDPLKGRHL 247

Query: 94  IAVEDVHKGDTLFVEKPVAFV-ILPPCSMSNCNHCC-----TSIS--------------A 133
           +A +D+  G+ L  E   AFV +L P  M    HC      T ++              A
Sbjct: 248 VATKDILPGELLVKED--AFVSVLNPGEMPPLRHCLENKWDTRVTSADLYCHRCLRHTLA 305

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
            +PC  C  A YCS+ C ++AW  YH  EC     LL      H+AL++ L++   D
Sbjct: 306 TVPCGGCSYAKYCSQECVQQAWDHYHSTECSLGGLLLTLGVFCHVALRMTLLARFED 362


>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Taeniopygia guttata]
          Length = 797

 Score = 63.5 bits (153), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 19/120 (15%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS-----------------NCNHCCTSI 131
           +GRH++A +D+  G +L  E+    V+ P  S+                  +C+ C   +
Sbjct: 242 RGRHLVASQDIVPGQSLVKEEAFVSVLCPGESLPLQDGGTAWDTRATNADLHCHRCLRQL 301

Query: 132 SAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALKLILVSSHSD 190
            A +PC  C  A YCS+ C   AW +YHR EC  GAL L   V   H+AL+ +L++  ++
Sbjct: 302 LASVPCQGCSYAKYCSQDCADLAWQQYHRTECSLGALLLTLGV-FCHVALRTVLLAGFAE 360


>gi|345496836|ref|XP_003427832.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 520

 Score = 63.5 bits (153), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 41/75 (54%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           +GRH+    DV  G+ L+   P     +     +NC+HC +   A IPC+ C+ A+YCSE
Sbjct: 215 QGRHLTVTRDVGPGEALYASSPYVVCPIETRIYNNCSHCLSFAWACIPCDSCVYAMYCSE 274

Query: 149 SCRREAWLRYHRWEC 163
            C+ EA   YH  EC
Sbjct: 275 RCKNEAKDLYHDIEC 289


>gi|312381827|gb|EFR27477.1| hypothetical protein AND_05788 [Anopheles darlingi]
          Length = 479

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +N +++  S   GR+VI   D+  GD +  E+P   +++       C++C    +    P
Sbjct: 195 ANCLELAESKQFGRYVITNRDLKAGDIVIHEQPTHSLLIDTYRHMRCDYCLQKHLYTLRP 254

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C +A+YCSE CR++A L YHR+EC
Sbjct: 255 CEGCTVAMYCSEECRKQAQLTYHRYEC 281


>gi|158292886|ref|XP_001688544.1| AGAP005253-PA [Anopheles gambiae str. PEST]
 gi|157017203|gb|EDO64127.1| AGAP005253-PA [Anopheles gambiae str. PEST]
          Length = 638

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           ++++ ++  GRH++  + +  GD L +EKP A ++        C  C    +   IPC  
Sbjct: 209 VELRQNSEFGRHLVTTQHLKAGDVLLIEKPYASMLNDKERYKRCAFCHNEDTFTLIPCEG 268

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
           C L +YCS+ C  +A+ +YHR+EC G LR
Sbjct: 269 CTLTMYCSDECMDKAYKQYHRYEC-GVLR 296


>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
           porcellus]
          Length = 800

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 66/141 (46%), Gaps = 22/141 (15%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
           E+  +  AS+ +++ T   KGR++IA +D+  G+ L  E   AFV +L P  M       
Sbjct: 224 ENEHISSASSSVRLCTDPLKGRYLIATKDIIPGELLVKED--AFVSVLNPGEMPPPHQGL 281

Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
                         C+ C     A +PC+ C  A YCS+ C ++AW  YH  EC     L
Sbjct: 282 ESRWDTRVTLGDLYCHRCLKHTLAMVPCHGCSYAKYCSQECMQQAWELYHSVECSLGELL 341

Query: 170 LEAVGIAHLALKLILVSSHSD 190
           L      H+AL+L L++   D
Sbjct: 342 LTLGVFCHVALRLTLLARFED 362


>gi|312377245|gb|EFR24125.1| hypothetical protein AND_11518 [Anopheles darlingi]
          Length = 479

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +N +++  S   GR+VI   D+  GD +  E+P   +++       C++C    +    P
Sbjct: 195 ANCLELAESKQFGRYVITNRDLKAGDIVIHEQPTHSLLIDTYRHMRCDYCLQKHLYTLRP 254

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C +A+YCSE CR++A L YHR+EC
Sbjct: 255 CEGCTVAMYCSEECRKQAQLTYHRYEC 281


>gi|158292888|ref|XP_314169.4| AGAP005253-PB [Anopheles gambiae str. PEST]
 gi|157017204|gb|EAA09348.4| AGAP005253-PB [Anopheles gambiae str. PEST]
          Length = 613

 Score = 63.2 bits (152), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           ++++ ++  GRH++  + +  GD L +EKP A ++        C  C    +   IPC  
Sbjct: 209 VELRQNSEFGRHLVTTQHLKAGDVLLIEKPYASMLNDKERYKRCAFCHNEDTFTLIPCEG 268

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
           C L +YCS+ C  +A+ +YHR+EC G LR
Sbjct: 269 CTLTMYCSDECMDKAYKQYHRYEC-GVLR 296


>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           florea]
          Length = 678

 Score = 63.2 bits (152), Expect = 8e-08,   Method: Composition-based stats.
 Identities = 42/119 (35%), Positives = 61/119 (51%), Gaps = 6/119 (5%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH IA +D+  G+ L +EKP +  +L    + NC +C T I  P+P  C  C    YCS
Sbjct: 253 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPVPAVCQTCSCVAYCS 312

Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
            SCR +   + H  EC     L L +      LAL++I+  S    YK +  L+T+ +D
Sbjct: 313 ISCRNKD-AKIHENECPILPTLWLSKTSINCFLALRIIVQQSFDKLYK-LKDLKTNSKD 369


>gi|57970742|ref|XP_564258.1| AGAP011234-PA [Anopheles gambiae str. PEST]
 gi|55244729|gb|EAL41557.1| AGAP011234-PA [Anopheles gambiae str. PEST]
          Length = 646

 Score = 62.8 bits (151), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           ++++  +  GRH++  + +  GD L +EKP A ++        C  C    +   IPC  
Sbjct: 207 VELRQDSEFGRHLVTTQHLKAGDVLMIEKPYASLLCERDQYKRCAFCHNEDTFTLIPCEG 266

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C +A+YCSE CR +A  +YHR+EC
Sbjct: 267 CTVAMYCSEECRDKAHKQYHRYEC 290


>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
          Length = 799

 Score = 62.8 bits (151), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 219 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 276

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 277 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 336

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 337 LGGLLLTLGVFCHVALRMTLLARFED 362


>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
 gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
 gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
          Length = 799

 Score = 62.8 bits (151), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 219 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 276

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 277 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 336

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 337 LGGLLLTLGVFCHVALRMTLLARFED 362


>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
           glaber]
          Length = 793

 Score = 62.8 bits (151), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 72/144 (50%), Gaps = 22/144 (15%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  +  AS+ +++ T   KGR+++A++D+  G+ L  E     V+    +PP       
Sbjct: 224 ENEQISNASSSVRLCTDPLKGRYLVAIKDILPGEILVKEDAFVSVLNPGEMPPLHQGLES 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YH  EC     LL 
Sbjct: 284 RWDTRVTNGDLYCHKCLKHTLAMVPCHGCSYANYCSQECMQQAWELYHSIECSLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEV 195
                H+AL+L L++    R+K+V
Sbjct: 344 LGMFCHVALRLTLLA----RFKDV 363


>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
          Length = 666

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 23/154 (14%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 86  IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 143

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 144 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 203

Query: 165 GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
               LL      H+AL++ L++   D  + V  L
Sbjct: 204 LGGLLLTLGVFCHVALRMTLLARFEDDDRVVRML 237


>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
          Length = 804

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
          Length = 804

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|158287350|ref|XP_309407.4| AGAP011238-PA [Anopheles gambiae str. PEST]
 gi|157019608|gb|EAA45344.4| AGAP011238-PA [Anopheles gambiae str. PEST]
          Length = 497

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           ++++  +  GRH++  + +  GD L +EKP A +++       C  C        IPC  
Sbjct: 210 VELRQDSEFGRHLVTTQHLKAGDVLLIEKPYANLLIDVERHVRCAFCQNEDRFTLIPCEG 269

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
           C + +YCSE CR +A  +YHR+EC G LR
Sbjct: 270 CTVTMYCSEECRDKAHKQYHRYEC-GVLR 297


>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
 gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
          Length = 804

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
          Length = 828

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 248 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 305

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 306 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 365

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 366 LGGLLLTLGVFCHVALRMTLLARFED 391


>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 677

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 7/109 (6%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH IA +D+  G+ L +EKP +  +L    + NC +C T I  PIP  C  C    YCS
Sbjct: 252 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPIPAVCQTCSCVAYCS 311

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRYK 193
            SCR +   + H  EC   L  L A   +    LAL++I+  S    YK
Sbjct: 312 ISCRDKD-AKIHENEC-SILPTLWASKTSINCFLALRIIVQQSFEKLYK 358


>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
 gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
          Length = 666

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 86  IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 143

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 144 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 203

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 204 LGGLLLTLGVFCHVALRMTLLARFED 229


>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
          Length = 716

 Score = 62.4 bits (150), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 136 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 193

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 194 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 253

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 254 LGGLLLTLGVFCHVALRMTLLARFED 279


>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
          Length = 743

 Score = 62.0 bits (149), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)

Query: 66  LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
           +A GE +T +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M  
Sbjct: 163 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 220

Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
                              C+ C     A +PC  C  A YCS+ C ++AW  YH  EC 
Sbjct: 221 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 280

Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
               LL      H+AL++ L++   D
Sbjct: 281 LGGLLLTLGVFCHVALRMTLLARFED 306


>gi|195430416|ref|XP_002063251.1| GK21490 [Drosophila willistoni]
 gi|194159336|gb|EDW74237.1| GK21490 [Drosophila willistoni]
          Length = 745

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 6/103 (5%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNEC-ILA 143
           SN   R++IA  ++   + +F E   AFV  P      C+ C TS   APIPC  C    
Sbjct: 223 SNCDNRYMIAKNNIAAHEVIFQEMASAFV--PIEDNIICHQCATSFMCAPIPCPHCHHRV 280

Query: 144 VYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV 185
           +YCS +C R++ L  H++EC    L LL+ +G++HLA++L+L 
Sbjct: 281 IYCSRAC-RQSHLYIHQYECAAYRLNLLQMLGVSHLAMRLVLT 322


>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
 gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
          Length = 835

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIIPKL 370


>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
           troglodytes]
          Length = 835

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIIPKL 370


>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Meleagris gallopavo]
          Length = 706

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 20/122 (16%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------------CSMSN----CNHCC 128
           +GRH++A +++  G  L  EK    V+ P                   ++N    C+HC 
Sbjct: 147 RGRHLVASQNILPGQNLLKEKAFVSVLCPGEGDSLLLQDSSETVWDTQVTNADLYCHHCL 206

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
             + A +PC  C  A YCS++C   AW +YHR EC     LL      H+AL+ +L++  
Sbjct: 207 KQLLASVPCCGCSYAKYCSQNCADVAWEQYHRTECPLGALLLVLGVFCHVALRTVLLAGF 266

Query: 189 SD 190
           S+
Sbjct: 267 SE 268


>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
          Length = 558

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 28  ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 87

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 88  KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 147

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 148 LGVFCHIALRLTLLVGFEDVRKIITKL 174


>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
          Length = 835

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGIFCHIALRLTLLVGFEDVRKIIPKL 370


>gi|170060861|ref|XP_001865989.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879226|gb|EDS42609.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 588

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 77/158 (48%), Gaps = 16/158 (10%)

Query: 48  QVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFV 107
           +V +T S A++ +  +P++  G           I+++ S+  GRH++A +    GDTL +
Sbjct: 180 EVEATLSHAANPI--IPNVVAG-----------IELEVSDVYGRHLVARQYFDFGDTLLI 226

Query: 108 EKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECH-- 164
           + P   V         C+HC    +   +PC +C+  ++CS  C + A  RYHR+EC   
Sbjct: 227 DDPYVVVADLGQQYRQCHHCLKFCTLKLVPCPDCVEVMFCSPECAKLAQQRYHRFECPIL 286

Query: 165 GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHL 202
             L  L+ +G   + +  I+ ++ +    ++  L  H+
Sbjct: 287 RGLYTLDKIGRMAILVARIVFTAVTGFNDDLDALREHI 324


>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
           jacchus]
          Length = 798

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 18/142 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPCSMS--- 122
           E+  L  AS+ + +     KGR+++A  ++  G+ L  E     V+    LPP   S   
Sbjct: 224 ENKQLSNASSSVGLCMDPLKGRYLVATREILPGEPLVKEDAFVSVLNPGELPPLHHSLES 283

Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWKLYHRIECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYK 193
                H+AL+L L+    D  K
Sbjct: 344 LGVFCHIALRLTLLVGFQDVSK 365


>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
           boliviensis boliviensis]
          Length = 800

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 18/142 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPCSMS--- 122
           E+  L  AS+ + +     KGR+++A  ++  G+ L  E     V+    LPP   S   
Sbjct: 224 ENKQLSNASSSVSLCMDPVKGRYLVATREILPGEPLVKEDAFVSVLNPGELPPLHHSLGS 283

Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     A +PC+ C  A YCS+ C ++AW  YHR EC     LL 
Sbjct: 284 KWNTRVTNADLYCHRCLRHTLATVPCDGCSYAKYCSQECLQQAWGLYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYK 193
                H+AL+L L+    D  K
Sbjct: 344 LGVFCHIALRLTLLVGFEDVSK 365


>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
          Length = 777

 Score = 61.6 bits (148), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 65/139 (46%), Gaps = 18/139 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  + CAS+ + + T   KGR +IA +D+  G+ L  E+    V+    +PP       
Sbjct: 199 ENEQISCASSSVTLCTDPFKGRSLIATKDILPGELLVREEAFVSVLNPGEMPPRRDGLES 258

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YH  EC     LL 
Sbjct: 259 KWDTRVTNGDLYCHRCLKHTVATVPCDGCSYAKYCSQECMQQAWDLYHSAECSLGGLLLT 318

Query: 172 AVGIAHLALKLILVSSHSD 190
                H AL+  L++   D
Sbjct: 319 LGVFCHTALRATLLARCED 337


>gi|170029107|ref|XP_001842435.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880642|gb|EDS44025.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 574

 Score = 61.2 bits (147), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 2/94 (2%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIP 136
           +  IKM++    GR ++A +D   GD +  EKP+A  I       +C++C     ++ IP
Sbjct: 189 AKGIKMESLPEFGRSMMAEKDFRAGDVILSEKPLAAAIDSSLRYVDCSYCTIVHFNSLIP 248

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C+  +YC E CR+  W + HR+EC  A +L+
Sbjct: 249 CPGCVSFMYCDEECRQADW-KTHRFECGVAEKLM 281


>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
          Length = 804

 Score = 60.8 bits (146), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 18/147 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
           E+  L  AS+ I +     KGR ++A +D+  G+ L  E     V+    LPP       
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS+ C ++ W  YHR EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQVWELYHRTECPLGGLLLT 343

Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
                H+AL+L L+    D  K +  L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370


>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
          Length = 799

 Score = 60.8 bits (146), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 22/141 (15%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
           E+  +  AS  + + T   KGRH++A +D+  G+ L  E   AFV +L P  M       
Sbjct: 224 ENKQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPRPHHCL 281

Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
                         C+ C     A +PC  C  A YCS+ C ++AW  YH  EC     L
Sbjct: 282 ENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECSLGGLL 341

Query: 170 LEAVGIAHLALKLILVSSHSD 190
           L      H+AL++ L++   D
Sbjct: 342 LTLGVFCHVALRMTLLARFED 362


>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 4 [Nomascus leucogenys]
          Length = 703

 Score = 60.8 bits (146), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 71/158 (44%), Gaps = 22/158 (13%)

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--- 117
           A+L  L   E+  L  AS+ I +     KGR+++A +D+  G+ L  E   AFV +    
Sbjct: 114 AALFHLGQYENEQLSSASSSIGLCVDPLKGRYLVATKDILPGELLVKED--AFVSVLNPE 171

Query: 118 -------------PCSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
                           ++N    C+ C     A +PC+ C  A YCS+ C ++AW  YHR
Sbjct: 172 ELPPPPHGLDSKWDTRVTNGDLYCHRCLKHTLATVPCDRCSYAKYCSQECLQQAWELYHR 231

Query: 161 WECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
            EC     LL      H+AL+L L+    D  K +  L
Sbjct: 232 TECPLGGLLLTLGVFCHIALRLTLLVGFEDVLKIITKL 269


>gi|195029279|ref|XP_001987502.1| GH21957 [Drosophila grimshawi]
 gi|193903502|gb|EDW02369.1| GH21957 [Drosophila grimshawi]
          Length = 758

 Score = 60.8 bits (146), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 43/119 (36%), Positives = 65/119 (54%), Gaps = 7/119 (5%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
           E   L   ++KI M +S  +GR++IA + + +GD +F E+   FV  P      C  C  
Sbjct: 211 ELQKLTGDTHKISMDSST-RGRYMIATKRLSEGDIIFTEQADCFV--PIEQRLICQKCAA 267

Query: 130 S-ISAPIPCNEC-ILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
           S + APIPC +C    VYCS  C R+     H +EC    + LL  +G++HLA++L+L 
Sbjct: 268 SLLCAPIPCPQCHQRVVYCSRRC-RDLHKYIHSYECAAYRKDLLIMLGVSHLAMRLLLT 325


>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
          Length = 801

 Score = 60.5 bits (145), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 59/134 (44%), Gaps = 18/134 (13%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------------CSMSN----C 124
           + T  +KGR ++A +DV  G+ L  E     V++P                 +SN    C
Sbjct: 237 LGTDPSKGRCLVATKDVLPGELLVKEDAFVSVLIPGEVPPLPHGLGSRWDTRISNGDLYC 296

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
           + C     A +PC+ C  A YCS  C R AW  YH  EC     LL      H+AL+L L
Sbjct: 297 HRCLKHTLATVPCDGCSYAKYCSLECLRRAWELYHSAECALGGLLLTLGIFCHVALRLTL 356

Query: 185 VSSHSDRYKEVYHL 198
           V+   D  K +  L
Sbjct: 357 VARFQDFGKVIRKL 370


>gi|195485531|ref|XP_002091128.1| GE12406 [Drosophila yakuba]
 gi|194177229|gb|EDW90840.1| GE12406 [Drosophila yakuba]
          Length = 573

 Score = 60.5 bits (145), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 48/84 (57%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ S  +GR V+   D+  GD + VE+P    +LPP     C  C   +    IPC+ 
Sbjct: 187 LELRESADEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A++CSE C+++A   +HR+EC
Sbjct: 247 CCSAMFCSEECKQQAMSTFHRFEC 270


>gi|158287352|ref|XP_309409.4| AGAP011237-PA [Anopheles gambiae str. PEST]
 gi|157019609|gb|EAA05217.4| AGAP011237-PA [Anopheles gambiae str. PEST]
          Length = 481

 Score = 60.1 bits (144), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 2/89 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           ++++ ++  GRH++  + +  GD L +EKP A ++        C+ C        IPC  
Sbjct: 197 VELRQNSEVGRHLVTTQKLKVGDVLLIEKPYASMLNDQERYKRCDFCQNEDRFTLIPCEG 256

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
           C + +YCS+ C  +A  +YHR+EC G LR
Sbjct: 257 CTVTMYCSKECMDKAHKQYHRYEC-GVLR 284


>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
          Length = 680

 Score = 59.7 bits (143), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 65/135 (48%), Gaps = 17/135 (12%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAPI 135
           S KI ++     GR+ +A   +  GD + V+ P A V+ P    ++C+HC     +   +
Sbjct: 242 SKKILVEEDAVSGRYGVAASPIRVGDVIAVDAPYASVMNPEKFSTHCHHCYQILELGEVL 301

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRY 192
           PC+ C L  +CS +CR  A   YH  EC   L  L A GI+   +L+L++I +       
Sbjct: 302 PCSHCDLVSFCSVNCRSRAMEIYHAIEC-PILSCLYAAGISIICYLSLRMIAI------- 353

Query: 193 KEVYHLETHLQDMRP 207
               H  +   D+RP
Sbjct: 354 ----HPPSFFMDVRP 364


>gi|158299652|ref|XP_319721.4| AGAP008973-PA [Anopheles gambiae str. PEST]
 gi|157013620|gb|EAA14846.4| AGAP008973-PA [Anopheles gambiae str. PEST]
          Length = 668

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Composition-based stats.
 Identities = 40/102 (39%), Positives = 51/102 (50%), Gaps = 8/102 (7%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMS-----NCNHCCTSISAPIPCNECILA 143
           GR+V+A E +   DT+  E  V+FV +  P S S     +C  C      P PC  C  A
Sbjct: 165 GRYVVAAEAIKANDTVARETAVSFVPVYDPESSSTLPSFDCQKCAKVNVVPFPCPTCGRA 224

Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
            YCS  C R A    HR+EC G  + L   +GIAHL L+  L
Sbjct: 225 CYCSTRC-RVAHRPVHRFECFGYQKHLWYQIGIAHLGLRCFL 265


>gi|170036325|ref|XP_001846015.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878892|gb|EDS42275.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 559

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++A  D + GD +   +P+   I       NC+HC    S + IPC  C+  +YC E 
Sbjct: 183 RILVAERDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGCVFFMYCGEE 242

Query: 150 CRREAWLRYHRWECHGALRL 169
           CR+++W  +HR+EC  A +L
Sbjct: 243 CRQKSWKLWHRFECPVATKL 262


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score = 59.7 bits (143), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 8/140 (5%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           ++ +++  +  KGR +    D+  G+ +F E   A V+L   S+  C+ C      P  C
Sbjct: 6   TDPVEVFGAGLKGRGLRGTRDLSAGEVVFAEASFAAVVLDSLSLQVCHSCFRRQVNPHRC 65

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSD----- 190
            +C  A YC  +C+R AW   HR EC     + +A    +  +A  L  +  H+      
Sbjct: 66  AQCKFAHYCDRTCQRAAW-DEHRKECSAIRNIGKAPNENVRLVARILWRIQKHTGLVSDS 124

Query: 191 RYKEVYHLETHLQDMRPEDL 210
           +   +  LE HL  M PEDL
Sbjct: 125 QLTTLDMLEDHLSRMTPEDL 144


>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 675

 Score = 59.7 bits (143), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 54/126 (42%), Gaps = 3/126 (2%)

Query: 40  NSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDV 99
           N +   S +++        S  S P +  G    L C+            GRH +A  ++
Sbjct: 200 NQIAQKSNKMNLKKDKIDGSKRSCPKIEDGNPLFLSCSKAVTIRNDGGDIGRHAVATRNI 259

Query: 100 HKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCSESCRREAWLR 157
             G+ L VEKP    IL    +++C+ C   I  PIP  C  C    YCS SC R+A  +
Sbjct: 260 EPGEILVVEKPHCSFILAEYRLTHCHLCMNKIFVPIPSKCQVCSYLAYCSTSC-RDADAK 318

Query: 158 YHRWEC 163
            H  EC
Sbjct: 319 IHESEC 324


>gi|290976641|ref|XP_002671048.1| SET domain-containing protein [Naegleria gruberi]
 gi|284084613|gb|EFC38304.1| SET domain-containing protein [Naegleria gruberi]
          Length = 914

 Score = 59.3 bits (142), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 3/94 (3%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
           C+     + T+   GR + A  ++  G  +  EKP   V L   + + CN+C   +   I
Sbjct: 274 CSPLHAHVITTEQYGRGMYASSELKSGQVVLSEKPFLCVTLD--TENTCNNCLNKLKKKI 331

Query: 136 PC-NECILAVYCSESCRREAWLRYHRWECHGALR 168
           PC N C +  YCS  CR EAW +YHR  C   +R
Sbjct: 332 PCKNNCGMEYYCSMECRSEAWDKYHRKLCGFDMR 365


>gi|170036323|ref|XP_001846014.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878891|gb|EDS42274.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 576

 Score = 59.3 bits (142), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 3/96 (3%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP--VAFVILPPCSMSNCNHCCTSI-SAP 134
           +  I M+     GR +IA++D   GD +  EKP   A       S   C HC +   ++ 
Sbjct: 183 AKGIGMKQYPGLGRGLIALKDFQPGDVILDEKPELCAVSFNNNFSYHYCFHCGSEFQTSL 242

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           IPC +C   +YCSESC    W + HR+EC  A++L 
Sbjct: 243 IPCPKCTNHMYCSESCLETDWKQAHRFECAVAMKLF 278


>gi|57974582|ref|XP_566179.1| AGAP000216-PA [Anopheles gambiae str. PEST]
 gi|55243600|gb|EAL41256.1| AGAP000216-PA [Anopheles gambiae str. PEST]
          Length = 526

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 8/138 (5%)

Query: 32  LTIEKSEENSVEAMSMQVSST--SSSASSSVASLP-SLACGESHTLMCASNKIKMQTSNA 88
           L   K +E+   A SM +S+T  ++  SS   + P  L+          +N ++M+ S  
Sbjct: 97  LLRRKLDESERCARSMLLSATDDATPGSSRHGAAPLKLSFPAYENAPRVANCLEMRRSPQ 156

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP---IPCNECILAVY 145
            GR++   + +  GD + +++P   V+ P    + C+HC     AP   IPC  C  A+Y
Sbjct: 157 YGRYLQTNKALKVGDVVMIDEPYVSVLEPEFCYARCDHC--QRPAPFTLIPCERCTKAMY 214

Query: 146 CSESCRREAWLRYHRWEC 163
           CS++C R A   YH +EC
Sbjct: 215 CSKNCLRRARTEYHEFEC 232


>gi|170038782|ref|XP_001847227.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167882473|gb|EDS45856.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 560

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 55/107 (51%), Gaps = 1/107 (0%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++A +D++ GD +   +P+   I       NC+HC    S + IPC +C+  +YC E 
Sbjct: 231 RILVAEKDINPGDVIMDAEPLLTAIDFNLCYKNCSHCGVKFSNSLIPCPDCVFFMYCEEE 290

Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
           CR+++W  +H++EC  A  L        L+   +   +  +R++  Y
Sbjct: 291 CRQKSWKLWHQFECPVATELRNFSNFNLLSTPRLSTGAELNRFELDY 337


>gi|170036321|ref|XP_001846013.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878890|gb|EDS42273.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 559

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIP 136
           ++ I M      GR +IA +D + GD +  EK     +    S  NCNHC +  + + IP
Sbjct: 175 ADGIGMNYEPKFGRGLIAEKDFNPGDIILEEKSELCGVDFNLSYRNCNHCSSRFNYSLIP 234

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C   +YCS+ C  + W  +HR+EC
Sbjct: 235 CPTCPFMMYCSQECLEQNWNLFHRFEC 261


>gi|170058970|ref|XP_001865157.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167877852|gb|EDS41235.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 556

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 46/93 (49%), Gaps = 1/93 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
           +N I M      GR ++AV D + GD +  EK          S SNC HC     S+ IP
Sbjct: 168 ANGIDMTQYEDMGRGLVAVHDFNAGDLILNEKIELCFGSFERSYSNCAHCGAEFRSSLIP 227

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C  C+  +YC + CR E +   HR+EC  A +L
Sbjct: 228 CPGCVAFMYCGKKCRNEDFRTVHRFECAVATKL 260


>gi|170072158|ref|XP_001870107.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167868273|gb|EDS31656.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 554

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
           +  I M      GR ++A +D + GD +  E    FV         CN C  S+    IP
Sbjct: 172 AKGIAMGYDPKFGRGLVAEKDFNAGDLILDETSELFVHEFSLCFQRCNQCSASLYQILIP 231

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C+L +YCSE CR   W  +HR+EC
Sbjct: 232 CRTCVLFMYCSEKCRELHWKLFHRFEC 258


>gi|157136250|ref|XP_001656795.1| hypothetical protein AaeL_AAEL003447 [Aedes aegypti]
 gi|108881063|gb|EAT45288.1| AAEL003447-PA [Aedes aegypti]
          Length = 537

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 7/97 (7%)

Query: 97  EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAW 155
           +++  G+   VE+P+  V+ P    S CN+C +      IPC +C+  +YCSE CR EA+
Sbjct: 190 KELRVGEVAVVERPLLVVVEPEAVQSRCNYCGSKNELDLIPCRKCVSVMYCSEKCRDEAY 249

Query: 156 LRYHRWEC------HGALRLLEAVGIAHLALKLILVS 186
             YH++EC          R  +   +  L LKL  ++
Sbjct: 250 SCYHKFECAVIKDLKNLFRGPKPTRMFQLTLKLFWMA 286


>gi|354508096|ref|XP_003516089.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
           [Cricetulus griseus]
          Length = 391

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN 123
           E+  +  AS  + + T   KGRH++A +D+  G+ L  E     V+     PP    + N
Sbjct: 101 ENKQISGASLSVSLHTDPLKGRHLVAAKDILPGELLVKEDAFVSVLNPGETPPLHYGLEN 160

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     A +PC  C  A YCS+ C ++AW  YH  EC     LL 
Sbjct: 161 KWDTRVTNGDLYCHRCLKHTLATVPCGGCSYARYCSQECMQKAWECYHSTECPLGGLLLT 220

Query: 172 AVGIAHLALKLILVSSHSD 190
                H+AL++ L++   D
Sbjct: 221 LGVFCHVALRITLLARFED 239


>gi|195402463|ref|XP_002059824.1| GJ15028 [Drosophila virilis]
 gi|194140690|gb|EDW57161.1| GJ15028 [Drosophila virilis]
          Length = 723

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 6/103 (5%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIPCNEC-ILA 143
           +  +GR++IA + + +G+ +F E+   FV  P      C  C  S + APIPC  C    
Sbjct: 195 AGGRGRYMIASQRIKEGEIVFTEQAECFV--PTQGRLICQQCAASLLCAPIPCTRCHQRV 252

Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
           VYCS  C R+     H +EC    R LL  +G  HLAL+L+L 
Sbjct: 253 VYCSRRC-RQLHATIHGYECEAYRRDLLLPIGPFHLALRLVLT 294


>gi|195429543|ref|XP_002062817.1| GK19653 [Drosophila willistoni]
 gi|194158902|gb|EDW73803.1| GK19653 [Drosophila willistoni]
          Length = 574

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
           ++++ S  +GR ++  +D+  GD + +E+P    +LPP     C  C   +    IPC+ 
Sbjct: 187 LELRESPDEGRFIVTKQDLVVGDLVAIEQPFCSTLLPPMRYIRCATCKMENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+ +A   +HR+EC
Sbjct: 247 CCSVMFCSEECKTKAMTTFHRYEC 270


>gi|344235250|gb|EGV91353.1| SET and MYND domain-containing protein 4 [Cricetulus griseus]
          Length = 370

 Score = 58.9 bits (141), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN 123
           E+  +  AS  + + T   KGRH++A +D+  G+ L  E     V+     PP    + N
Sbjct: 80  ENKQISGASLSVSLHTDPLKGRHLVAAKDILPGELLVKEDAFVSVLNPGETPPLHYGLEN 139

Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                       C+ C     A +PC  C  A YCS+ C ++AW  YH  EC     LL 
Sbjct: 140 KWDTRVTNGDLYCHRCLKHTLATVPCGGCSYARYCSQECMQKAWECYHSTECPLGGLLLT 199

Query: 172 AVGIAHLALKLILVSSHSD 190
                H+AL++ L++   D
Sbjct: 200 LGVFCHVALRITLLARFED 218


>gi|170047663|ref|XP_001851333.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870014|gb|EDS33397.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 561

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 11/110 (10%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSMSNCNHCCTS-ISAPIPCN 138
           I M      GR ++A +D + G+ +  ++P+    L P   +S+CN C    +   IPC 
Sbjct: 176 IGMSYDAKFGRGLVAEKDFNPGNVI-CDEPIELCALEPNLLLSSCNQCSAELVEMLIPCL 234

Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
            C + ++CSE CR   W  YHR+EC        AV   H     + V++H
Sbjct: 235 TCPMVMFCSEKCRELNWQLYHRFEC--------AVASKHFISSFVTVATH 276


>gi|170070169|ref|XP_001869489.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167866058|gb|EDS29441.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 541

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 1/80 (1%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++A +D++ GD +   +P+   I       NC+HC    S + IPC +C+  +YC E 
Sbjct: 231 RILVAEKDINPGDVIMDAEPLLTAIDFNLCYKNCSHCGVKFSNSLIPCPDCVFFMYCGEE 290

Query: 150 CRREAWLRYHRWECHGALRL 169
           CR+++W  +H++EC  A  L
Sbjct: 291 CRQKSWKLWHQFECPVATEL 310


>gi|170047661|ref|XP_001851332.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870013|gb|EDS33396.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 465

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 59/128 (46%), Gaps = 13/128 (10%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSMSNCNHCCTS-ISAPIPCN 138
           I M      GR ++A +D + G+ +  ++P+    L P   +S+CN C    +   IPC 
Sbjct: 80  IGMSYDAKFGRGLVAEKDFNPGNVI-CDEPIELCALEPNLLLSSCNQCSAELVEMLIPCL 138

Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
            C + ++CSE CR   W  YHR+EC        AV   H     + V++H   +   Y L
Sbjct: 139 TCPMVMFCSEKCRELNWQLYHRFEC--------AVASKHFISSFVTVATHPRLF--FYGL 188

Query: 199 ETHLQDMR 206
                D++
Sbjct: 189 SQFGDDLQ 196


>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
          Length = 648

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/157 (24%), Positives = 70/157 (44%), Gaps = 26/157 (16%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
            + ++  ++   GR+  A +++     +  EKP    ++   S ++C+ C   +     C
Sbjct: 223 DDAVRYDSNETDGRYATAQKNLKPNQHILKEKPHGATLVQEYSSTHCSTCFERVEILFCC 282

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVSSH------ 188
             C+  V+CS  C + A   YHR+EC G LR L + G   ++ LAL+++    +      
Sbjct: 283 PNCVDVVFCSGRCEKTANQSYHRYEC-GFLRSLWSSGATIVSLLALRIVTQKPYSYFEAI 341

Query: 189 ----------------SDRYKEVYHLETHLQDMRPED 209
                           SD Y++V++L TH     PED
Sbjct: 342 RDELPNLTANFTDKLSSDDYRKVFNLVTHSDKRDPED 378


>gi|158284775|ref|XP_307865.2| AGAP009448-PA [Anopheles gambiae str. PEST]
 gi|157020893|gb|EAA03631.2| AGAP009448-PA [Anopheles gambiae str. PEST]
          Length = 561

 Score = 58.2 bits (139), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 64/142 (45%), Gaps = 4/142 (2%)

Query: 34  IEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHV 93
           ++K EE + E ++ + +            LP L+      +   +N +++Q +   GRHV
Sbjct: 112 LDKREEQAKEELA-KSNEKQKKKDDPKPELP-LSFPGHENMPHVANCLQLQQNEQYGRHV 169

Query: 94  IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRR 152
           +    +  GD + ++ P    +  P     C+ C        IPC  C   +YCS  C  
Sbjct: 170 VTTRRLKVGDVVMLDTPFVKTLHDPLRHVRCDFCHAERPFTLIPCEGCTWVMYCSAECLG 229

Query: 153 EAWLRYHRWECHGALRLLEAVG 174
           +A+ +YHR+EC G +R L  V 
Sbjct: 230 KAYSQYHRYEC-GVMRDLWRVA 250


>gi|170041125|ref|XP_001848325.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864690|gb|EDS28073.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 592

 Score = 57.8 bits (138), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 56/101 (55%), Gaps = 1/101 (0%)

Query: 64  PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
           P L+   + ++   S+ ++++  +  GR+++   ++  GD + VEKP + ++       N
Sbjct: 171 PKLSYKPNPSIPHVSDCLELKEDSQFGRYLVTNRNLSAGDVVIVEKPFSRLLAAQLRYLN 230

Query: 124 CNHCCTSIS-APIPCNECILAVYCSESCRREAWLRYHRWEC 163
           C++C   +  + IPC  C   ++CSE+CR++A   YH  EC
Sbjct: 231 CDYCHQDVFLSLIPCESCSFTMFCSENCRKKANEGYHAVEC 271


>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
           familiaris]
          Length = 794

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 19/135 (14%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP-----------CSMSN----CNHCCT 129
           KGR++IA +D+  G+ L  E     V+    +PP             ++N    C+ C  
Sbjct: 243 KGRYLIATKDILPGELLVKEDAFVSVLNPGEMPPGLHHGLENKWDTRITNGDLYCHRCLK 302

Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
              A +PC+ C  A YCS  C ++AW  YH+ EC     LL      H+AL+  L++   
Sbjct: 303 HTLATVPCDGCSYAKYCSNECMQQAWDLYHQIECSLGGLLLTLGIFCHIALRSTLLARFE 362

Query: 190 DRYKEVYHLETHLQD 204
           D  K +  L + + +
Sbjct: 363 DASKVIRKLYSEISN 377


>gi|170040654|ref|XP_001848107.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864246|gb|EDS27629.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 561

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 2/108 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
           + M+     GR ++A  +   GD +  EK V  V+  P     C+HC  S   + IPC +
Sbjct: 178 VGMKQYPGSGRGLVAERNFKTGDVILDEKTVISVVGSPFKFLYCSHCGISNQHSLIPCPD 237

Query: 140 CILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLILVS 186
           C++ +YCSE C  E     HR+EC  GA         +++A KL L  
Sbjct: 238 CVMYMYCSEECLAEDKRLTHRFECGFGAQSENITFNGSNIAPKLFLYG 285


>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Ailuropoda melanoleuca]
 gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
          Length = 793

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 18/142 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------- 118
           E+  +  AS+ + ++    +GR++IA +D+  G+ L  E     V+ P            
Sbjct: 224 ENEQIASASSSVGLRVDPLRGRYLIATKDILPGELLVKEDAFVSVLNPGERPPLHHGLES 283

Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                ++N    C+ C     A +PC+ C  A YCS  C ++AW  YH  EC     LL 
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLAMVPCDGCSYAKYCSHECMQQAWDLYHNIECPLGGLLLA 343

Query: 172 AVGIAHLALKLILVSSHSDRYK 193
                H+AL+  L++   D  K
Sbjct: 344 LGVFCHVALRSTLLARFEDAGK 365


>gi|157130709|ref|XP_001661975.1| hypothetical protein AaeL_AAEL011838 [Aedes aegypti]
 gi|108871817|gb|EAT36042.1| AAEL011838-PA [Aedes aegypti]
          Length = 625

 Score = 57.4 bits (137), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 42/76 (55%), Gaps = 3/76 (3%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSE 148
           GR V+A  +   GD +  EK +   I P     NCNHC   +  + IPC +C+  ++CS+
Sbjct: 249 GRSVVAERNFKPGDVILNEKAMLTAISPEVKYKNCNHCSMENFHSLIPCPKCVSVMFCSK 308

Query: 149 SCRREAWLRY-HRWEC 163
            C  E  LRY HR+EC
Sbjct: 309 EC-LEKGLRYSHRFEC 323


>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
           intestinalis]
          Length = 773

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 8/149 (5%)

Query: 16  LCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLM 75
           L + +D    + ++ +  + +++E   E     V        S++ +       E+ ++ 
Sbjct: 128 LTKWKDGIGHELQDKLKKVNRNKETKFEWREYIVRRKEKKKGSNLNN-------ENKSVR 180

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAP 134
             S K+KM  +  KGRH     +    + L  E     V+ P    ++C++C T   S+ 
Sbjct: 181 NLSAKVKMNITPQKGRHYFTTFNTETNECLLEEVAYLGVLNPEFFSTHCSYCLTPCKSSG 240

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
           IPC  C   +YC E CR  AW  YH  EC
Sbjct: 241 IPCLGCSCTIYCDEQCRISAWKIYHWMEC 269


>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
           scrofa]
          Length = 788

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 18/132 (13%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP----------CSMS 122
           AS+ + ++    KGR ++A +D+  G+ L  E     V+    +PP            ++
Sbjct: 231 ASSSVSLRVDPLKGRSLVATKDILPGELLVKEDAFVSVLNPGEMPPRHEGLESKWDTRVT 290

Query: 123 N----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
           N    C+ C     A +PC+ C  A YCS+ C ++AW  YH  EC     LL      H+
Sbjct: 291 NGDLYCHRCLKPTLATVPCDGCSYAKYCSQECLQQAWDLYHSVECSLGGLLLTLGVFCHV 350

Query: 179 ALKLILVSSHSD 190
           AL+  L++   D
Sbjct: 351 ALRSTLLARFED 362


>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
           rubripes]
          Length = 434

 Score = 57.0 bits (136), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 67/126 (53%), Gaps = 5/126 (3%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   S  KG  + A   + +G+ ++V +P+A+ +    S + C+ C T     + C++
Sbjct: 4   KLERFVSPGKGNGLRATVRIEEGELVYVTEPLAYCVSQKQSRNVCHQCFTRHETLLRCSQ 63

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI--AHLALKLI--LVSSHSDRYKEV 195
           C +A YCS +C+R AW   H+ EC     LL  +      LA +LI  ++SS S   +E+
Sbjct: 64  CKMARYCSATCQRRAW-SDHKRECKCLQSLLPRIPTDSVRLAARLIFAMLSSCSSSSEEL 122

Query: 196 YHLETH 201
           Y LE H
Sbjct: 123 YTLEEH 128


>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 647

 Score = 56.6 bits (135), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 26/141 (18%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           +  +GR      D+     L  + P A V+      S+C+HCC  +     C  C+  +Y
Sbjct: 234 TETEGRFARTRIDLRPNHVLLRQLPHAAVVTGEFCESHCDHCCRRVEILFSCPRCMDVIY 293

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLI-----------------LV 185
           CS  C++ A   YHR+EC G L  L+  G   +A LAL+++                 L 
Sbjct: 294 CSSECQKTAQESYHRFEC-GFLSYLKNSGANVVAMLALRIVTQKSLDYFVEIKDELGSLG 352

Query: 186 SSHSDR-----YKEVYHLETH 201
           S   DR     Y+ +Y+  TH
Sbjct: 353 SEEVDRLPVDDYRRIYNFVTH 373


>gi|194883800|ref|XP_001975985.1| GG20247 [Drosophila erecta]
 gi|190659172|gb|EDV56385.1| GG20247 [Drosophila erecta]
          Length = 573

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ S  +GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 187 LELRESADEGRFVVTNRDLVVGDLVAVEQPFCSTLLAPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+++A   YHR+EC
Sbjct: 247 CCSVMFCSEECKQQAMSTYHRFEC 270


>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 572

 Score = 56.6 bits (135), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 5/129 (3%)

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
             +P L   +S      SN +++       R +IA +D++ G+ L VEKP + +++P   
Sbjct: 162 GEIPPLDREKSSFFDSTSNVVELSEDIYSNRKLIANDDINVGEVLIVEKPYSSIVIPKHF 221

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHL 178
            ++C++C       IPC+ C   ++C E C   A    H  EC   G L  L A     L
Sbjct: 222 YTHCSNCFRRSLNLIPCSNCCTGMFCREECLDIA---NHVIECSYLGILSELNADRKEML 278

Query: 179 ALKLILVSS 187
           ALK+++ +S
Sbjct: 279 ALKILIDAS 287


>gi|170028158|ref|XP_001841963.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167871788|gb|EDS35171.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 289

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 3/101 (2%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++A +D++ GD +   +P+   I       NC+HC    S + IPC +C+  ++C E 
Sbjct: 62  RILVAEKDINPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPDCVFFMFCGEE 121

Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
           CR+++W  +H++EC       E   +   AL L  V+S S+
Sbjct: 122 CRQKSWKLWHQFECPHLPG--ETSQVEGRALPLAAVTSRSN 160


>gi|158300865|ref|XP_320681.4| AGAP011835-PA [Anopheles gambiae str. PEST]
 gi|157013367|gb|EAA00753.4| AGAP011835-PA [Anopheles gambiae str. PEST]
          Length = 575

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
           +++  S   GR+V    ++  GD + +E+P + ++        C+ C   SI   +PC  
Sbjct: 174 LELSESEQFGRYVATNRNLEAGDVVIIEQPFSRLLRDIYRHVRCDFCHRESIFTLLPCEN 233

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C +A+YCS SC  +A  +YHR+EC
Sbjct: 234 CTVAMYCSGSCASQAARQYHRYEC 257


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 1/88 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           + +KGR + A + +  GDT+  E+PV +++        C+ C   +S    C+ C  A Y
Sbjct: 13  TESKGRGLRAAKPLKSGDTILSEQPVVYMLSNMLRGQRCDFCLEKLSDLQRCSRCKFARY 72

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAV 173
           C  SC+R AW R H+ EC    R+   V
Sbjct: 73  CGASCQRAAW-RIHKSECERLKRVFPRV 99


>gi|170058966|ref|XP_001865155.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167877850|gb|EDS41233.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 559

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 1/90 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
           I M+ S+     ++A  + + GD +  EK     +L   S S C HC +S     IPC  
Sbjct: 174 IAMRQSDGNEDTLVAEREFNTGDVILNEKIELCAVLFKKSNSYCAHCSSSFKQNLIPCPG 233

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRL 169
           C+  +YC E CR+E +   HR+EC  A +L
Sbjct: 234 CVTVMYCGEECRKEDFRTVHRFECSIATKL 263


>gi|125810725|ref|XP_001361596.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
 gi|195154214|ref|XP_002018017.1| GL16991 [Drosophila persimilis]
 gi|54636772|gb|EAL26175.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
 gi|194113813|gb|EDW35856.1| GL16991 [Drosophila persimilis]
          Length = 574

 Score = 56.2 bits (134), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ S  +GR V+   D+  GD + VE+P    +LPP     C  C   +    IPC+ 
Sbjct: 187 LELRESPDEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   +HR+EC
Sbjct: 247 CCSVMFCSEECKLRATSSFHRFEC 270


>gi|170029113|ref|XP_001842438.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880645|gb|EDS44028.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 567

 Score = 56.2 bits (134), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 1/89 (1%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-AP 134
           C ++ I+M+  +  GR + A  D   G  +  EK    +       S+C  C   ++ + 
Sbjct: 177 CVADGIRMKVYDEFGRGMFAERDFQAGAVILDEKAALAITTLETRYSHCGRCVKQLTYSL 236

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
           IPC+ C+  +YCSE C RE     HR+EC
Sbjct: 237 IPCSGCVSTLYCSEECLREDERFAHRFEC 265


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 2/96 (2%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           + +++  S  KGR + A +++  GD LF E P A V+    +   C+ C         C 
Sbjct: 2   DNVEVFDSPGKGRGLRATKELLSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDKLQRCG 61

Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
           +C  A YC ++C+   W + H+ EC GA++   A+G
Sbjct: 62  QCKFAQYCDKTCQSAGW-KEHKLEC-GAIKAFGAIG 95


>gi|157115764|ref|XP_001652685.1| hypothetical protein AaeL_AAEL007348 [Aedes aegypti]
 gi|108876753|gb|EAT40978.1| AAEL007348-PA [Aedes aegypti]
          Length = 543

 Score = 55.8 bits (133), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 19/155 (12%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCS 120
           ES   + +  +++ ++    GR ++   ++   + +  EK V+FV          +PP  
Sbjct: 220 ESDDGLLSFPELEEKSDTVHGRFIVTKNEIKADEPIVQEKAVSFVPVYDPRGHSEIPPF- 278

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLA 179
             +C  C      P PC  C  A YCS +C +E     HR+EC G  + L   +GIAHL 
Sbjct: 279 --DCQFCGEVNIIPFPCPSCGRACYCSVACCKEH-SELHRYECPGYEKHLWYLIGIAHLG 335

Query: 180 LKLILVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
           ++  L     D +    H    L++   E ++QYV
Sbjct: 336 MRSFL-----DGFASSTHNLEELKECNAEKMFQYV 365


>gi|194753329|ref|XP_001958966.1| GF12292 [Drosophila ananassae]
 gi|190620264|gb|EDV35788.1| GF12292 [Drosophila ananassae]
          Length = 574

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ S  +GR ++   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 187 LELRESGDEGRFIVTNRDLVVGDLVAVEQPFCSTLLSPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+++A   +HR+EC
Sbjct: 247 CCSVMFCSEECKQQAMSTFHRFEC 270


>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
          Length = 554

 Score = 55.5 bits (132), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 43/74 (58%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A+  + ++ +  KGRH +A E++ +G T+ +E   AFV+       NC+ C   ++  + 
Sbjct: 57  ANYPVHLKNTKTKGRHAVASEELPEGATVSLETATAFVVRSEFIDQNCHFCLADLTQKVI 116

Query: 137 CNECILAVYCSESC 150
           C+ C L+ YCSE+C
Sbjct: 117 CSNCSLSYYCSETC 130


>gi|19922072|ref|NP_610730.1| CG8378 [Drosophila melanogaster]
 gi|5052674|gb|AAD38667.1|AF145692_1 BcDNA.LD29892 [Drosophila melanogaster]
 gi|7303523|gb|AAF58578.1| CG8378 [Drosophila melanogaster]
 gi|220953632|gb|ACL89359.1| CG8378-PA [synthetic construct]
          Length = 573

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 187 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 247 CCSTMFCSEECKSIAMQTYHRYEC 270


>gi|429892168|gb|AGA18626.1| CG8378, partial [Drosophila melanogaster]
          Length = 556

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253


>gi|429892184|gb|AGA18634.1| CG8378, partial [Drosophila melanogaster]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253


>gi|429892182|gb|AGA18633.1| CG8378, partial [Drosophila melanogaster]
          Length = 555

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253


>gi|429892186|gb|AGA18635.1| CG8378, partial [Drosophila melanogaster]
          Length = 554

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251


>gi|429892176|gb|AGA18630.1| CG8378, partial [Drosophila melanogaster]
          Length = 553

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251


>gi|256070417|ref|XP_002571539.1| hypothetical protein [Schistosoma mansoni]
 gi|350645320|emb|CCD59943.1| hypothetical protein Smp_000700 [Schistosoma mansoni]
          Length = 782

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  ++  G H++A +++  G+ L ++KP +  +     + NC  C      PIPC  C
Sbjct: 270 VRLNYNDEFGWHIVATKNIQPGELLILDKPYSCRLHYSRLLLNCYRCSKRCINPIPCRSC 329

Query: 141 ILAVYCSESCRREAW 155
               +CSESC +EAW
Sbjct: 330 TQVGFCSESCEQEAW 344


>gi|429892154|gb|AGA18619.1| CG8378, partial [Drosophila melanogaster]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 169 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 228

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 229 CCSTMFCSEECKSIAMQTYHRYEC 252


>gi|429892172|gb|AGA18628.1| CG8378, partial [Drosophila melanogaster]
          Length = 555

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 169 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 228

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 229 CCSTMFCSEECKSIAMQTYHRYEC 252


>gi|429892164|gb|AGA18624.1| CG8378, partial [Drosophila melanogaster]
          Length = 553

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251


>gi|170041129|ref|XP_001848327.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864692|gb|EDS28075.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 592

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 51/87 (58%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
           +N ++++  +  GR ++   D+  GD + +EKP + ++       +C++C   +I + IP
Sbjct: 188 ANCLELKHEDQYGRCIVTNRDLKVGDVVIIEKPHSTLLDEELRYLHCDYCNQEAILSLIP 247

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C +C +A++CS +C + A   YHR EC
Sbjct: 248 CKQCSIAMFCSNACYQSALDSYHRLEC 274


>gi|328721096|ref|XP_003247209.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 611

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 4/106 (3%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++++S   GR V    D++ GD + +++P  +  L     S C +C  S   PIPC +C
Sbjct: 203 LELKSSKNMGRGVYTTRDINPGDVVAIDEPYIYGGLQ-LHNSCCRNCLKSPVVPIPCTKC 261

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI 183
            L  YCSE+C  ++    H  EC     +    GI     LALK +
Sbjct: 262 SLVYYCSENCLHKSMKDGHNLECPIIFFIRSLPGITDLNQLALKWL 307


>gi|195333608|ref|XP_002033482.1| GM21333 [Drosophila sechellia]
 gi|194125452|gb|EDW47495.1| GM21333 [Drosophila sechellia]
          Length = 573

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
           +N ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IP
Sbjct: 184 ANCLELRETAAEGRFVVTNRDLVVGDLVCVEEPFCSTLLTPMRYIRCATCKRENYLTLIP 243

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C   ++CSE C+  A   YHR+EC
Sbjct: 244 CDSCCSTMFCSEECKSIAMQTYHRFEC 270


>gi|429892162|gb|AGA18623.1| CG8378, partial [Drosophila melanogaster]
 gi|429892174|gb|AGA18629.1| CG8378, partial [Drosophila melanogaster]
          Length = 559

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 174 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 233

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 234 CCSTMFCSEECKSIAMQTYHRYEC 257


>gi|429892160|gb|AGA18622.1| CG8378, partial [Drosophila melanogaster]
          Length = 559

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 174 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 233

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 234 CCSTMFCSEECKSIAMQTYHRYEC 257


>gi|429892156|gb|AGA18620.1| CG8378, partial [Drosophila melanogaster]
 gi|429892180|gb|AGA18632.1| CG8378, partial [Drosophila melanogaster]
          Length = 558

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 172 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 231

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 232 CCSTMFCSEECKSIAMQTYHRYEC 255


>gi|429892178|gb|AGA18631.1| CG8378, partial [Drosophila melanogaster]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254


>gi|429892170|gb|AGA18627.1| CG8378, partial [Drosophila melanogaster]
          Length = 556

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254


>gi|429892158|gb|AGA18621.1| CG8378, partial [Drosophila melanogaster]
 gi|429892166|gb|AGA18625.1| CG8378, partial [Drosophila melanogaster]
          Length = 557

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ + A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254


>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
          Length = 576

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 22/151 (14%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN------ 123
           CAS+ + +     +GR++IA ED+  G+ L  E     V+     PP    ++N      
Sbjct: 18  CASSSVSLCMDPLRGRYLIATEDILPGELLLKEDAFVSVLNPGERPPLHQGLANKWDTRV 77

Query: 124 ------CNHCCTSISAPIPCNEC----ILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
                 C+ C T   A +PC+ C      A YCS  C ++AW  YH  EC     LL   
Sbjct: 78  TNGDLYCHRCLTHTLAMVPCDGCSYAKCYAKYCSHECMQQAWGLYHNRECPLGGLLLTLG 137

Query: 174 GIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
              H+AL+  L++      K +  L + + +
Sbjct: 138 IFCHVALRATLLARFEGASKVIRKLSSEISN 168


>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
          Length = 776

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 50/106 (47%), Gaps = 24/106 (22%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN-------------------CNHCC 128
           KGR++IA +D+  G+ L  E   AFV +L P  MS                    C+ C 
Sbjct: 237 KGRYLIATKDILPGELLVRED--AFVSVLNPGEMSPLCYGLDGKWNTRVTNGDLYCHRCL 294

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
               A +PC+ C  A YCS+ C ++AW  YH  EC  +L   E VG
Sbjct: 295 KHTLATVPCDGCSYAKYCSQECMQQAWDLYHSIEC--SLARFEDVG 338


>gi|170028853|ref|XP_001842309.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167877994|gb|EDS41377.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 566

 Score = 55.1 bits (131), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 78  SNKIKMQTSNAK-GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-I 135
           ++ I M+  N + GR +IA +D   G  +  EK V  +       SNC HC   ++   I
Sbjct: 176 ADGIAMKVYNDEHGRGLIAEKDFQAGAVIMNEKAVLTLTTLDAQYSNCGHCSVKLTYNMI 235

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           PC  C+  +YCSE C +E     HR+EC  A +L
Sbjct: 236 PCPGCVSMLYCSEECLQENERFTHRFECAIADKL 269


>gi|195582627|ref|XP_002081128.1| GD10841 [Drosophila simulans]
 gi|194193137|gb|EDX06713.1| GD10841 [Drosophila simulans]
          Length = 694

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
           ++++ ++A+GR V+   D+  GD + VE+P    +L P     C  C   +    IPC+ 
Sbjct: 187 LELRETSAEGRFVVTNRDLVVGDLVCVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 246

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   ++CSE C+  A   YHR+EC
Sbjct: 247 CCSTMFCSEECKSIAMQTYHRFEC 270


>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
           P131]
          Length = 582

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 11/92 (11%)

Query: 129 TSISAPIP-----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           TSI  P P     C+ C +  YCS++C+R AW  +H+ EC   L    A+G   LA+  +
Sbjct: 103 TSIMPPKPTAMLLCSGCKVGRYCSQACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRL 162

Query: 184 LVSSHSDRYKE-----VYHLETHLQDMRPEDL 210
           L+     R+ E     +  LE+H ++   EDL
Sbjct: 163 LLLKKDGRFTERQWKAIMQLESHFEEQN-EDL 193


>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
           Y34]
          Length = 581

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 11/92 (11%)

Query: 129 TSISAPIP-----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           TSI  P P     C+ C +  YCS++C+R AW  +H+ EC   L    A+G   LA+  +
Sbjct: 103 TSIMPPKPTAMLLCSGCKVGRYCSQACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRL 162

Query: 184 LVSSHSDRYKE-----VYHLETHLQDMRPEDL 210
           L+     R+ E     +  LE+H ++   EDL
Sbjct: 163 LLLKKDGRFTERQWKAIMQLESHFEEQN-EDL 193


>gi|326421768|gb|ADZ73912.1| SET and MYND domain containing protein 4 [Grampus griseus]
          Length = 212

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 40/75 (53%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C    SAP+PC+ C  A YCS+ C ++AW RYH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTSAPVPCDGCSYAKYCSQECLQQAWDRYHSIECSLGGLLLTLGAFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|170040662|ref|XP_001848111.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864250|gb|EDS27633.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 567

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSES 149
           R + A  D   GD +  EK V  ++  P    NC+HC   +  + IPC +C+ A+YCSE 
Sbjct: 191 RGLEAERDFRTGDVILDEKAVLSIVDVPLKYYNCSHCSAENQHSLIPCPDCVFAMYCSEE 250

Query: 150 CRREAWLRYHRWECHGALRL 169
           C        HR+EC  A +L
Sbjct: 251 CLERDKRLVHRFECGLATKL 270


>gi|195123617|ref|XP_002006300.1| GI20970 [Drosophila mojavensis]
 gi|193911368|gb|EDW10235.1| GI20970 [Drosophila mojavensis]
          Length = 574

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 48/87 (55%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +N ++M+ +  +GR ++  +D+  GD +  E+P    +L P     C++C        IP
Sbjct: 184 ANCLEMRETPDEGRFIVTTQDLVVGDIVATEEPFCSSLLAPMRYIRCSNCKEERYLTLIP 243

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C   ++CSE C++ A   +H++EC
Sbjct: 244 CDNCCSVMFCSEECKKRANSTFHKYEC 270


>gi|170049075|ref|XP_001870867.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167871002|gb|EDS34385.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 560

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +  I M      GR ++A +D + GD +  EK     I    +   CN C +      IP
Sbjct: 178 AKGIAMGYDPRFGRGLVAEKDFNAGDIILDEKLELCGIDYNVTFQTCNQCSSRRYHVLIP 237

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C +C   ++CSE CR   W  +HR+EC  A +L
Sbjct: 238 CPKCPFFMFCSEECRELNWKLFHRFECDVATKL 270


>gi|157104022|ref|XP_001648222.1| hypothetical protein AaeL_AAEL003992 [Aedes aegypti]
 gi|108880438|gb|EAT44663.1| AAEL003992-PA [Aedes aegypti]
          Length = 594

 Score = 54.3 bits (129), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
           I+++  +  GR++I   +++ GD +  EKP + +++      NC++C        IPC  
Sbjct: 193 IELKEDDQFGRYLITNRNLNAGDIVIEEKPFSSLLVNDHRYMNCDYCHDDKFLTLIPCKC 252

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C + ++CS  C+++A   YHR EC
Sbjct: 253 CTVTMFCSTKCQQKAMDNYHRIEC 276


>gi|170060884|ref|XP_001865999.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879236|gb|EDS42619.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 378

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 51/87 (58%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
           +N ++++  +  GR ++   D+  GD + +EKP + ++       +C++C   +I + IP
Sbjct: 182 ANCLELKHEDQYGRCIVTNRDLKVGDVVIIEKPHSTLLDEELRYLHCDYCNQEAILSLIP 241

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C +C +A++CS +C + A   YHR EC
Sbjct: 242 CKQCSIAMFCSNACYQSALDSYHRLEC 268


>gi|170072162|ref|XP_001870109.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167868275|gb|EDS31658.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 481

 Score = 53.9 bits (128), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 1/93 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +  I M      GR ++A +D + GD +  EK     I    +   CN C +      IP
Sbjct: 97  AKGIAMGYDPRFGRGLVAEKDFNAGDIILDEKLELCGIDYNVTFQTCNQCSSRRYHVLIP 156

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C +C   ++CSE CR   W  +HR+EC  A +L
Sbjct: 157 CPKCPFFMFCSEECRELNWKLFHRFECDVATKL 189


>gi|170049077|ref|XP_001870868.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167871003|gb|EDS34386.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 554

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
           +  I M      GR ++A +D + GD +  E    F         +C+ C  ++    IP
Sbjct: 172 AKGIAMGYDPRFGRGLVAEKDFNAGDLILDETSELFAHDFNLCFKHCSQCSANVCQILIP 231

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C+L +YCSE CR   W  +HR+EC
Sbjct: 232 CRTCVLFMYCSEECRELHWKLFHRFEC 258


>gi|326421786|gb|ADZ73921.1| SET and MYND domain containing protein 4 [Moschus moschiferus]
          Length = 212

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 2/85 (2%)

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALK 181
           +C+ C     AP+PC+ C  A YCS  C ++AW RYH  EC  GAL L   V   H+AL+
Sbjct: 18  HCHRCLRHTLAPVPCDGCSYAKYCSRECMQQAWDRYHSVECSLGALLLTLGV-FCHVALR 76

Query: 182 LILVSSHSDRYKEVYHLETHLQDMR 206
             L++   D  + +  L   + + R
Sbjct: 77  STLLARFEDVGEVIKKLCGEMNNQR 101


>gi|195024336|ref|XP_001985854.1| GH20862 [Drosophila grimshawi]
 gi|193901854|gb|EDW00721.1| GH20862 [Drosophila grimshawi]
          Length = 574

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +N ++M+ +  +GR ++   D+  GD + +E+P    +L P     C  C        IP
Sbjct: 184 ANCLEMRETPDEGRFIVTNRDLVVGDVIAIEQPFCSSLLAPMRYIRCATCKEERYLTLIP 243

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C  A++CSE C+  A   +H++EC
Sbjct: 244 CDSCCSAMFCSEECKERATSTFHKYEC 270


>gi|358342712|dbj|GAA50167.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
          Length = 761

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 76/198 (38%), Gaps = 34/198 (17%)

Query: 51  STSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
           ST+S   +    LP    G        S+ + +  S  KGR+V+A E+   GD L VE  
Sbjct: 233 STASDIDAYTHELPEPRYGFQEAYPGLSSGLLVTESEQKGRYVVATEEFQPGDVLAVEPA 292

Query: 111 VAF----------------VILPPCSMSNCNHCCTSISA-PIPCNECILAVYCSES---- 149
             +                ++LP   MS C  C T +SA    C  C    YC+      
Sbjct: 293 SGWPANGPQTTVNASGDCLLLLPSQRMSKCTACLTPLSAIGYLCPHCCDTAYCAPPSKCF 352

Query: 150 ----CRREAWLRYHRWECHGA----LRLLEAVGIAHLALKLILVSSH-----SDRYKEVY 196
                R +  + Y +   H A    + LL + G+ HL  +L  +        S +   + 
Sbjct: 353 FRHLTRSDEMISYSQPPWHTAECRFMFLLNSTGLGHLCFRLAWLRRKNHQQISSQQLTID 412

Query: 197 HLETHLQDMRPEDLYQYV 214
            L +H  +   EDL+QY 
Sbjct: 413 RLVSHFDEFEVEDLFQYA 430


>gi|270012868|gb|EFA09316.1| hypothetical protein TcasGA2_TC030778 [Tribolium castaneum]
          Length = 568

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 13/104 (12%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS+KI ++TS   GR+V++  D+  G+ L  E P A   L P              AP+ 
Sbjct: 205 ASSKITLKTSPDVGRYVVSNVDIATGEILVAE-PAAVACLNPEKF-----------APVG 252

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
           C  C    +CS  CR +A   YH++EC     LL   G++ L+L
Sbjct: 253 CPHCSSVAFCSPKCRDDAITTYHKYECK-FFDLLLGSGMSVLSL 295


>gi|170050386|ref|XP_001861286.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872021|gb|EDS35404.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 456

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 1/94 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
           + M+     GR +IA  D   GD +  EKP           +NC +C T  + + +PC  
Sbjct: 46  VGMKLYKQFGRGLIAERDFQVGDIILDEKPTLVTNALGYRYTNCANCSTGFNHSLLPCPG 105

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
           C+  +YCSE C +      HR+EC  A +L  A+
Sbjct: 106 CVSYMYCSEECLQRDKRFVHRFECGMAEKLRNAM 139


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score = 53.5 bits (127), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 5/120 (4%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KG  + A   + KG+ +   +P+A  +    +   C+HC T   + + C++C +A Y
Sbjct: 11  SPGKGNGLRAAGRIRKGELVISAEPLACCVSNRLAKEVCHHCFTRHESLLRCSQCKMARY 70

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGI--AHLALKLI--LVSSHSDRYKEVYHLETH 201
           C+ +C+++AW   H+ EC     LL  +      LA KLI  L+S      +E+Y LE H
Sbjct: 71  CNTTCQKQAW-SGHKRECKCLQSLLPRLPTDSVRLAAKLIFALLSPSKSSSEELYTLEEH 129


>gi|170045012|ref|XP_001850118.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167868070|gb|EDS31453.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score = 53.1 bits (126), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 7/117 (5%)

Query: 73  TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEK-PVAFVILPPCSMS--NCNHCCT 129
           T+   +  I M+     GR ++A  D   GD +  EK  + FV    C  +  NC HC +
Sbjct: 158 TIPFLAKGITMKQYGEMGRGLVAERDFTTGDVILDEKVDLCFV---SCFRNYLNCGHCGS 214

Query: 130 SI-SAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV 185
               + IPC  C++ +YCSE CR +     HR+EC  A +L     +  + +  +  
Sbjct: 215 EFCRSLIPCLGCVMFMYCSEKCREQDLRITHRFECSVASKLFSVTDLTAMMMARLFF 271


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score = 53.1 bits (126), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A ++   GD LF E P A V+    + S C+ C         C +C
Sbjct: 4   VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC ++C+R  W   H+ EC
Sbjct: 64  RFAQYCDKTCQRAGW-EEHKLEC 85


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A ++   GD LF E P A V+    + S C+ C         C +C
Sbjct: 4   VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC ++C+R  W   H+ EC
Sbjct: 64  RFAQYCDKTCQRAGW-EEHKLEC 85


>gi|170052730|ref|XP_001862354.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873576|gb|EDS36959.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 569

 Score = 53.1 bits (126), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 12/89 (13%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNH---CCTSI--SAPIPCNECILAV 144
           GR ++A  D   GD +  EKP+         +S+  H   CC +    + IPC  C+ A+
Sbjct: 197 GRGLVAERDFKPGDIILDEKPMLM------HLSSWRHYCSCCAAFFRGSLIPCPGCVNAM 250

Query: 145 YCSESCRREAWLRYHRWECHGALRLLEAV 173
           YCSE CR++ +  +HR++C G +  LE V
Sbjct: 251 YCSEECRQQDYRYWHRFQC-GFVEQLEHV 278


>gi|157115760|ref|XP_001652683.1| hypothetical protein AaeL_AAEL007340 [Aedes aegypti]
 gi|108876751|gb|EAT40976.1| AAEL007340-PA [Aedes aegypti]
          Length = 363

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/145 (27%), Positives = 65/145 (44%), Gaps = 19/145 (13%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCSMSNCNHCCTS 130
           +++ ++    GR ++   ++   + +  EK V+FV          +PP    +C  C   
Sbjct: 210 ELEEKSDTVHGRFIVTKNEIKADEPIVQEKAVSFVPVYDPRGHSEIPPF---DCQFCGEV 266

Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHS 189
              P PC  C  A YCS +C +E     HR+EC G  + L   +GIAHL ++  L     
Sbjct: 267 NIIPFPCPSCGRACYCSVACCKEH-SELHRYECPGYEKHLWYLIGIAHLGMRSFL----- 320

Query: 190 DRYKEVYHLETHLQDMRPEDLYQYV 214
           D      H    L++   E ++QYV
Sbjct: 321 DGLASSTHNLEELKECNAEKMFQYV 345


>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
           domestica]
          Length = 490

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K+++ TS  KGR + A ++    D +F E+  + V+    +   C+ C         C +
Sbjct: 8   KVEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQ 67

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A YC  +C+++AWL  H+ EC
Sbjct: 68  CKFAHYCDRTCQKDAWLN-HKHEC 90


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score = 52.8 bits (125), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 9/92 (9%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS----MSNCNHCCTSISA 133
           + K+ + T   KGR ++A  D + G+ +  ++P  +V +P  S       C+ C T+I+ 
Sbjct: 10  NRKLSLSTLPEKGRSLLATRDFYPGEVIISQEP--YVCVPNNSSVSPQKRCDGCFTTINN 67

Query: 134 PI--PCNECILAVYCSESCRREAWLRYHRWEC 163
            +   C+ C LA YC  +C+R  W + HR EC
Sbjct: 68  NVLSRCSRCQLAFYCGTACQRSEW-KLHRLEC 98


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 66/152 (43%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AGRGLTVASIPGKGRGLIATFTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
           C+ C +  YCS +C++E W + H+ EC     L E            +  LALK      
Sbjct: 72  CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTENRKKMLTPTIRLMVRLALKRKLQNE 130

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            ++ SS  D Y  V  LE+H+  +    L  Y
Sbjct: 131 KVIPSSSIDNYNLVDALESHISKVDENQLVLY 162


>gi|260826490|ref|XP_002608198.1| hypothetical protein BRAFLDRAFT_90370 [Branchiostoma floridae]
 gi|229293549|gb|EEN64208.1| hypothetical protein BRAFLDRAFT_90370 [Branchiostoma floridae]
          Length = 484

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 2/55 (3%)

Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLI 183
           +SAP+PC +C  A YCS  C   AW +YH  EC   G +      G+ HLAL+L+
Sbjct: 140 VSAPLPCQKCSTARYCSHECADLAWTQYHGVECQYLGLIHSSGVGGMGHLALRLV 194



 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 4/73 (5%)

Query: 57  SSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVIL 116
           +     +P+    E   L+ AS       S  +GRH +A +D+  GD L VEKP   V+L
Sbjct: 284 NGGFFRVPAEEVAEEDQLLVAS----YLQSEQRGRHAVAAQDIRVGDVLIVEKPYGSVVL 339

Query: 117 PPCSMSNCNHCCT 129
           P    ++C++CC+
Sbjct: 340 PEQGDTHCDYCCS 352


>gi|413933332|gb|AFW67883.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 190

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A+  + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
           C+ C +  YCS +C++E W + H+ EC     L E            +  LALK      
Sbjct: 72  CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            ++ SS  D Y  V  LE+H+  +    L  Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162


>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Meleagris gallopavo]
          Length = 486

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 4   VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+R+AWL  H+ EC
Sbjct: 64  KFAYYCDRTCQRDAWLN-HKNEC 85


>gi|170051200|ref|XP_001861657.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167872534|gb|EDS35917.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIP 136
           ++ I M+     GR + A +D   G+ +  E P+  + L     + CN+C  S     IP
Sbjct: 175 ADGIVMKRYPGFGRGLAAEKDFQPGNIILEESPM--LTLVGSRYNYCNNCAGSFQGCLIP 232

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
           C  C  A+YCSE C  + +  +HR+EC G    L+ V    +A+
Sbjct: 233 CPGCTSAMYCSEECLGKDYRFWHRFEC-GLAEKLQNVMFGTMAM 275


>gi|170029111|ref|XP_001842437.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167880644|gb|EDS44027.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 560

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 62/141 (43%), Gaps = 20/141 (14%)

Query: 65  SLACGESHTLM-CASNKIKMQTSN-----AKG----------RHVIAVEDVHKGDTLFVE 108
            LA GES  L+ C    I ++T+      AKG          R ++A  D   GD +  E
Sbjct: 144 KLAAGESKGLIPCPLMYINVETNPKIPFLAKGIHMEVLPDFDRSMVADRDFRTGDVILDE 203

Query: 109 KPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
           K +  V+        C  C T    + IPC  C+  +YCSE C    W   HR+EC  + 
Sbjct: 204 KCILAVVDAERRYHRCGRCSTEGFYSLIPCPHCVSVMYCSEKCLEWDWKYVHRFECAVSD 263

Query: 168 RLLEAVGIAHLAL--KLILVS 186
           + L+ +   H+A+  KL L  
Sbjct: 264 K-LQYISYGHVAMGSKLFLYG 283


>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Meleagris gallopavo]
          Length = 478

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 9   VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+R+AWL  H+ EC
Sbjct: 69  KFAYYCDRTCQRDAWLN-HKNEC 90


>gi|170065531|ref|XP_001867978.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167862497|gb|EDS25880.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 562

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 51/117 (43%), Gaps = 3/117 (2%)

Query: 56  ASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI 115
              S+ ++P    G     M A  +I+M+     GR   A  D   GD L  EK +  VI
Sbjct: 157 GGRSLGAVPCPRMGIKIPFMAA--QIRMEIFPELGRGFCAERDFRPGDVLLDEKSMMAVI 214

Query: 116 LPPCSMSNCNHCCTS-ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
                  NC+ C  +   + IPC EC+  +YC E CR       HR+EC    +L E
Sbjct: 215 DSGLRYFNCSWCSNANFHSLIPCLECVSYMYCCEECRDRDRETCHRFECGIGGKLRE 271


>gi|170044019|ref|XP_001849660.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867271|gb|EDS30654.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 589

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 49/95 (51%), Gaps = 5/95 (5%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM--SNCNHCCTSIS-AP 134
           ++ I M      GR +IA +D + GD L  EK V    L  C +   +C+ C      + 
Sbjct: 201 ASGIGMNYEAKFGRGLIAQKDFNPGDILLDEK-VELCGLE-CDLIYQSCSQCSGEFGYSL 258

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           IPC  C LA++CS+ C+   W  YHR+EC  A +L
Sbjct: 259 IPCPGCPLAMFCSKECQEMNWKLYHRFECGVASKL 293


>gi|195381641|ref|XP_002049556.1| GJ20689 [Drosophila virilis]
 gi|194144353|gb|EDW60749.1| GJ20689 [Drosophila virilis]
          Length = 574

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 46/87 (52%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
           +N ++M+ +  +GR ++  +D+  GD +  E+P    +L P     C  C        IP
Sbjct: 184 ANCLEMRETPDEGRFIVTTQDLVVGDVVAAEEPFCSSLLAPMRYIRCATCKEERYLTLIP 243

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C   ++CSE C++ A   +H++EC
Sbjct: 244 CDSCCSVMFCSEECKQRANSTFHKYEC 270


>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
 gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
          Length = 486

 Score = 52.4 bits (124), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 4   VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+R+AWL  H+ EC
Sbjct: 64  KFAYYCDRTCQRDAWLN-HKNEC 85


>gi|170036329|ref|XP_001846017.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878894|gb|EDS42277.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 574

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 12/113 (10%)

Query: 61  ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
           A LP +A G           I M+     GR ++A  D + GD +  EKP+   +     
Sbjct: 188 AKLPFVAKG-----------IAMKYYPEFGRGLVAERDFNTGDVILDEKPMLMAVEHGLR 236

Query: 121 MSNCNHCCTSI-SAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
            S C++C  S   + IPC  C+  +YCSE C        HR+EC    RL  A
Sbjct: 237 YSVCSYCTASFRKSLIPCPNCVSCMYCSEKCLEMDHRFTHRFECGIMERLRNA 289


>gi|393904729|gb|EJD73778.1| hypothetical protein LOAG_18824 [Loa loa]
          Length = 420

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 66/129 (51%), Gaps = 14/129 (10%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI--PCNECILAVYCSESCRREAWL 156
           +++    + ++P A++ +     + C++C       I   C++C  A YC++ C+R AW 
Sbjct: 11  IYQQRIRYFDEPFAYICINNQVNNYCSYCLRKPDKSILYKCSKCEFAKYCNKECQRLAWK 70

Query: 157 RYHRWEC------HGALRLLEAVGIAHLALKLILVSSHSDRYK-----EVYHLETHLQDM 205
           ++HR EC      +  L L E + ++ +  K++ +  + D+YK     +   L  H  D+
Sbjct: 71  KHHRMECQRLVMVYPNLPLTEVLFLSRIIDKVLFIEQNGDKYKWEKDRKWNSLIGHEDDI 130

Query: 206 RPEDLYQYV 214
           R ++L +YV
Sbjct: 131 RNDEL-KYV 138


>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
          Length = 473

 Score = 52.0 bits (123), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 4   VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+R+AWL  H+ EC
Sbjct: 64  KFAYYCDRTCQRDAWLN-HKNEC 85


>gi|157123231|ref|XP_001660071.1| hypothetical protein AaeL_AAEL009454 [Aedes aegypti]
 gi|108874450|gb|EAT38675.1| AAEL009454-PA [Aedes aegypti]
          Length = 586

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 1/75 (1%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
           GR ++A      GD +  EK     I P     NCN C   +  + IPC  C+  +YCSE
Sbjct: 211 GRSLVAERPFKAGDVILHEKAWMAAISPELKYQNCNRCSIANFHSLIPCPNCVSIMYCSE 270

Query: 149 SCRREAWLRYHRWEC 163
            C  E     HR+EC
Sbjct: 271 KCLEEDLKYSHRFEC 285


>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
 gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
          Length = 785

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 3/78 (3%)

Query: 80  KIKMQTSNA--KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
           +IK+Q+ +   KGR ++   DV  G  +  E+P A VIL  C  ++C++C   + A  +P
Sbjct: 214 QIKLQSVSVPDKGRGMVPPSDVSPGSLVHAEEPYAMVILKQCRETHCHYCLNDLPADRVP 273

Query: 137 CNECILAVYCSESCRREA 154
           C  C + +YCS  C+ +A
Sbjct: 274 CISCSIPLYCSHQCQIKA 291


>gi|270011318|gb|EFA07766.1| hypothetical protein TcasGA2_TC005320 [Tribolium castaneum]
          Length = 563

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 24/144 (16%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
           ++A  D+  G+ L +EKP A  +L    +++C+ C       IPC +C +A++CSE C+ 
Sbjct: 192 IVASRDIEIGEILAIEKPWA-AVLTTDRLTHCHECLCLCYNLIPCPKCTVALFCSELCKD 250

Query: 153 EAWLRYHRWECHGAL----RLLEAVGIAH---LALKLILVSS---------------HSD 190
           +A   YH++EC   L    R L    I     LA K+ +++                 SD
Sbjct: 251 KA-RSYHKYECPILLTICYRALCEFAIDEAMLLAFKVAILAKDDYSNLDYMRLDGTYRSD 309

Query: 191 RYKEVYHLETHLQDMRPEDLYQYV 214
           RY EV+ L+   +  R E  +Q +
Sbjct: 310 RYNEVHFLKPDGEFSRFEIFHQAI 333


>gi|170041205|ref|XP_001848363.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864728|gb|EDS28111.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 563

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 65/158 (41%), Gaps = 56/158 (35%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           GR++I   D+  GD + +EKP +                       PC  C +A++CS+ 
Sbjct: 199 GRYLITNRDLKAGDVVALEKPFS----------------------KPCKGCTIAMFCSDE 236

Query: 150 CRREAWLRYHRWEC---HGALRLLEAVGIAHL---------------ALKLILVSSHSDR 191
           C+++A   YHR+EC       RL+E V +  L               AL+L++ S   + 
Sbjct: 237 CQQKAMEEYHRFECPILQDIHRLVEHVLVPALRLTIKTLTTFNFDLEALRLLMESDGKNT 296

Query: 192 ----------------YKEVYHLETHLQDMRPEDLYQY 213
                           Y+ VY+LE+H     P+ L QY
Sbjct: 297 PNALDMDWTSIDPKETYEIVYNLESHRDKRSPDHLGQY 334


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A+  + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
           C+ C +  YCS +C++E W + H+ EC     L E            +  LALK      
Sbjct: 72  CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            ++ SS  D Y  V  LE+H+  +    L  Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162


>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 69/148 (46%), Gaps = 17/148 (11%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++     KGR ++  +D + G+ +  ++P   V     + S C+ C +S S    C+ C
Sbjct: 13  LRVANVTEKGRSLVTTKDFNPGEVIISQEPYVCVPNNSATESRCDRCFSS-SNVKKCSAC 71

Query: 141 ILAVYCSESCRREAWLRYHRWECHG--------------ALRLLEAVGIAH-LALKLILV 185
            +A YC  SC++  W + HR EC                ++RL+  + I   L  + I+ 
Sbjct: 72  QVAWYCGSSCQKLEW-KLHRIECEALSKLDKDRRKYVTPSIRLMVRLLIRRKLQSEKIIS 130

Query: 186 SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
           S+ +D Y  V  L  H++D+  + L  Y
Sbjct: 131 STATDNYDLVEALVAHMKDIDEKQLVLY 158


>gi|440801098|gb|ELR22123.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 397

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 62/149 (41%), Gaps = 11/149 (7%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           E++E+  +    + + S   ++  + A    +  G +     A      + +   GR + 
Sbjct: 23  ERNEDGMLREAGIGIKSVVITSGGTAAGAQEV--GAASDASGADVDAGAEQAETMGRGMF 80

Query: 95  AVEDVHKGDTLFVEKPVAFVILPP----CSMSNCNHCCTSI---SAPIPCNECILAVYCS 147
           A+ D   GD +  E P+A  I P       M +C+ C   +   +A + C  C L  YCS
Sbjct: 81  ALRDFKAGDVILEESPLA--IAPRDCVVAQMLHCSQCLRKLAASTAAVACPHCRLDPYCS 138

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIA 176
           + C  EAW  +HR  C  A    E  G A
Sbjct: 139 DRCAAEAWESHHRLLCEAAQADPETTGQA 167


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 18/138 (13%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+HC       + C++
Sbjct: 5   KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C +A YCS  C+++AW + H+ EC             ++RLL       + +KL+    S
Sbjct: 65  CRVAKYCSAKCQKKAW-QDHKQECKCLKSCKPRYPPDSVRLL-----GRVVIKLMEKTPS 118

Query: 188 HSDRYKEVYHLETHLQDM 205
            S++    Y LE+++  +
Sbjct: 119 ESEKLYSFYDLESNINKL 136


>gi|170058972|ref|XP_001865158.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167877853|gb|EDS41236.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 545

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 9/95 (9%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKP----VAFVILPPCSMSNCNHCCTS-ISAPI 135
           I M+     GR ++A  D   GD +  EK     ++FV     S   C HC ++ +++ I
Sbjct: 167 IHMKEFGDMGRGLVAERDFKTGDVILDEKSDLCSLSFVR----SFVECAHCGSAFLNSLI 222

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           PC+ C   +YC E CR E     HR+EC    +LL
Sbjct: 223 PCSLCFAVMYCGEKCREEDLRITHRFECSVVTKLL 257


>gi|328785875|ref|XP_003250668.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
           mellifera]
          Length = 554

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 62/128 (48%), Gaps = 16/128 (12%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVI-------LPPCSMSNCNHCCTSISAPIPCNECIL 142
           GR+ +AV+ +   D + ++K     +        P  +M  C++C     A IPC+ C  
Sbjct: 183 GRYFVAVKPIKMKDVILIDKSQITHLHKDDWDDDPTSNM--CHYCFKYCRALIPCDYCYH 240

Query: 143 AVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLIL-VSSHSDRYKEVYHLET 200
           A+YCS+ CR +A+  YH+  C +G L        +   LKL+L ++ +  R KE      
Sbjct: 241 ALYCSKECRGKAYQAYHQIYCRYGNLD-----NKSSFVLKLLLKITDNGARLKEALEYHK 295

Query: 201 HLQDMRPE 208
            L++M  E
Sbjct: 296 ELENMSEE 303


>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 19/143 (13%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI---------SAPIP 136
           S  KGR + A  D   G+ +   +P AFVI    +   C+HC   +         +  + 
Sbjct: 108 SEEKGRGIKAARDFKVGNLILKAEPFAFVIFDHMAEHVCHHCFNMVVRDRQGQPTTQLLR 167

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE-- 194
           C+ C  A YCS  C+++AW   H+ EC    R+        + +   ++   ++R ++  
Sbjct: 168 CSSCKFARYCSRECQKKAW-SMHKKECMAIKRIAPRTASDEVRMVSQILWKQAERGEKRA 226

Query: 195 -------VYHLETHLQDMRPEDL 210
                  V  L  HL DM  ED+
Sbjct: 227 KSEELCRVEELCDHLNDMSFEDV 249


>gi|189240591|ref|XP_001815228.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
          Length = 439

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 24/144 (16%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
           ++A  D+  G+ L +EKP A  +L    +++C+ C       IPC +C +A++CSE C+ 
Sbjct: 68  IVASRDIEIGEILAIEKPWA-AVLTTDRLTHCHECLCLCYNLIPCPKCTVALFCSELCKD 126

Query: 153 EAWLRYHRWECHGAL----RLLEAVGIAH---LALKLILVSS---------------HSD 190
           +A   YH++EC   L    R L    I     LA K+ +++                 SD
Sbjct: 127 KA-RSYHKYECPILLTICYRALCEFAIDEAMLLAFKVAILAKDDYSNLDYMRLDGTYRSD 185

Query: 191 RYKEVYHLETHLQDMRPEDLYQYV 214
           RY EV+ L+   +  R E  +Q +
Sbjct: 186 RYNEVHFLKPDGEFSRFEIFHQAI 209


>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
           rerio]
          Length = 231

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A ++   GD LF E P A V+    + S C+ C         C +C
Sbjct: 4   VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC ++C+R  W   H+ EC
Sbjct: 64  RFAQYCDKTCQRAGW-EEHKLEC 85


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A+  + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
           C+ C +  YCS +C++E W + H+ EC     L E            +  LALK      
Sbjct: 72  CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            ++ SS  D Y  V  LE+H+  +    L  Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162


>gi|384485526|gb|EIE77706.1| hypothetical protein RO3G_02410 [Rhizopus delemar RA 99-880]
          Length = 510

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 1/80 (1%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           + +  IA + +  G  LF+E P+  V LP      CN+C         C+ C  A +CS 
Sbjct: 16  ENKRFIASKQLEPGTALFIEPPLVSVPLPSKRHQRCNYCLCKAQLQC-CSRCRSAYFCSN 74

Query: 149 SCRREAWLRYHRWECHGALR 168
           +C R AWL++HR  C    R
Sbjct: 75  ACFRNAWLQFHRILCEPQER 94


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K +   S  KG  + A  ++H G+ L+  +P+A+ +   C+   C+ C +     + C++
Sbjct: 4   KFERFASAEKGNGLRATHEIHAGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLLRCSQ 63

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C +A YC  +C+++AW   H+ EC
Sbjct: 64  CKVARYCDVTCQKQAW-SDHKREC 86


>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
          Length = 491

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 9   VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+R AWL  H+ EC    R
Sbjct: 69  KFAYYCDRTCQRAAWLN-HKNECSAIKR 95


>gi|281201842|gb|EFA76050.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 965

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 40/85 (47%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S++ GR     + +     LF EK   F +      S C++C   I API C +C LA Y
Sbjct: 361 SDSVGRLGETTDLIASNTILFQEKAYVFCLDRYSHSSYCHNCLKEILAPIYCKKCNLAQY 420

Query: 146 CSESCRREAWLRYHRWECHGALRLL 170
           C + C +E +  +H  EC     LL
Sbjct: 421 CGDECLKEDYNNFHEKECGKGFLLL 445


>gi|157130707|ref|XP_001661974.1| hypothetical protein AaeL_AAEL011836 [Aedes aegypti]
 gi|108871816|gb|EAT36041.1| AAEL011836-PA [Aedes aegypti]
          Length = 571

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNE 139
           I+ +   ++G  ++A  D   GD +  EKP+   +      + CN+C  T++S  IPC  
Sbjct: 196 IQGRQHPSQGPIMVAKRDFKAGDVILREKPMIACVSWENVFTRCNYCISTNLSHLIPCPN 255

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A++C E C R+A    HR+EC
Sbjct: 256 CATAMFCDEECMRKAQ-NVHRFEC 278


>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
 gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
          Length = 478

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A+ +   GD +F E   + V+    S S C+ C       + C +C
Sbjct: 4   VEIFESEGKGRGLRAIRESWAGDIIFAEPAYSAVVFDNLSHSVCHSCFKRQEKLLRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C++E+W   H+ EC
Sbjct: 64  KFAHYCDRTCQKESWAN-HKNEC 85


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AGRGLAVASIPGKGRGLIATRTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +  YCS +C++E W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSMCQVTWYCSTNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLVLKRKLQNE 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS  D Y  V  LE+H+  +    L  Y
Sbjct: 131 KVIPSSSIDNYNLVDALESHIWKVDENHLVLY 162


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 798

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
           K++  ++  KGR +++  D+ +   + VE+P + VI   C  ++C+ C   + A  +PC 
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEPFSVVISKSCRETHCHFCLNELPADTVPCP 278

Query: 139 ECILAVYCSESCR 151
            C + VYCSESC+
Sbjct: 279 SCSIPVYCSESCQ 291


>gi|312381116|gb|EFR26935.1| hypothetical protein AND_06647 [Anopheles darlingi]
          Length = 545

 Score = 50.8 bits (120), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 7/96 (7%)

Query: 98  DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECILAVYCSESCRREAWL 156
           D+  G+ + +E+P   V+    + + C++C       + PC  C   +YCSE C+ +A  
Sbjct: 189 DLSVGEKVLLERPFVLVLEAEVAYARCDYCGACNEHNLRPCKHCTAVMYCSEECQEQALQ 248

Query: 157 RYHRWECH--GALRLL----EAVGIAHLALKLILVS 186
           RYH++EC     L+LL    +A  +  +AL+L  + 
Sbjct: 249 RYHQFECEVIDDLQLLYRGPKATRVLQVALRLFWLG 284


>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 282

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 13  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 72  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130

Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
               SS +D Y  V  LE+H+ ++    L  Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162


>gi|157130705|ref|XP_001661973.1| hypothetical protein AaeL_AAEL011840 [Aedes aegypti]
 gi|108871815|gb|EAT36040.1| AAEL011840-PA [Aedes aegypti]
          Length = 415

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNE 139
           I+ +   ++G  ++A  D   GD +  EKP+   +      + CN+C  T++S  IPC  
Sbjct: 40  IQGRQHPSQGPIMVAKRDFKAGDVILREKPMIACVSWENVFTRCNYCISTNLSHLIPCPN 99

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A++C E C R+A    HR+EC
Sbjct: 100 CATAMFCDEECMRKAQ-NVHRFEC 122


>gi|170038058|ref|XP_001846870.1| U5 snRNP-specific protein [Culex quinquefasciatus]
 gi|167881490|gb|EDS44873.1| U5 snRNP-specific protein [Culex quinquefasciatus]
          Length = 789

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSESCRRE 153
            + D + GD +   +P+   I       NC+HC    S + IPC  C+ ++YC E CR++
Sbjct: 149 GMRDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGCVFSMYCGEKCRQK 208

Query: 154 AWLRYHRW 161
           +W  +H++
Sbjct: 209 SWKLWHQF 216


>gi|170040667|ref|XP_001848113.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864252|gb|EDS27635.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 560

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 51/113 (45%), Gaps = 2/113 (1%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAP 134
           C +  I M+  +  GR ++A  +   GD +  EK +  V+        C+ C   +  + 
Sbjct: 174 CLAKGIAMKQYSGSGRGLVAERNFKAGDVILNEKAILSVVAVAFKYLYCSRCGIPNQHSL 233

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLILVS 186
           IPC  C+  +YCSE C  E     HR+EC  GA     A   ++LA KL    
Sbjct: 234 IPCPNCVHCMYCSEECLAEDSRMAHRYECSFGAQMANIAYDGSNLATKLFFYG 286


>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
          Length = 604

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 1/73 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
           K++  ++  KGR +++  D+ +   + VE+P + VI   C  ++C+ C   + A  +PC 
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEPFSVVISKSCRETHCHFCLNELPADTVPCP 278

Query: 139 ECILAVYCSESCR 151
            C + VYCSESC+
Sbjct: 279 SCSIPVYCSESCQ 291


>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
 gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
           [Ricinus communis]
          Length = 781

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 68/125 (54%), Gaps = 10/125 (8%)

Query: 35  EKSEENSVEAMSMQVSSTSSSA----SSSVASLPSLACGESHTLMCASNKIKMQTSNAKG 90
           +K  EN ++ ++ Q + ++S +     +S+ S  +L+C  +H+      K+   T+  KG
Sbjct: 176 KKQIENELKIIAGQCAGSASRSFNHIDNSLGSF-NLSCQMTHS----EYKLLRVTTPDKG 230

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILAVYCSES 149
           R + +  D+ +   + VEKP A ++L  C  ++C++C   + A  +PC  C + +YCS+ 
Sbjct: 231 RGMASGCDIPQASLVHVEKPYALIVLKNCRDTHCHYCLKELPADTVPCLSCSVPLYCSQL 290

Query: 150 CRREA 154
           C+  A
Sbjct: 291 CQVHA 295


>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 697

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 5/108 (4%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GR  +A +D+  G+ L VEKP   ++L    +++C+ C   I AP P  C  C    YCS
Sbjct: 271 GRFAVATKDIQPGELLVVEKPHCSMLLGEYRLTHCHRCSIRIVAPYPASCYLCSSVAYCS 330

Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYK 193
            +C R+   R H  EC   G+L   +A     +AL+ I+   + +  K
Sbjct: 331 PNC-RDLDERVHSIECGLLGSLWCSKASVTCMMALRAIIQKPYEEFIK 377


>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
          Length = 623

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 61/136 (44%), Gaps = 14/136 (10%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
           M A+       ++AKGR VIA      GD +  +   A+ ++     + C+ C    + P
Sbjct: 1   MAAAGWHIQDGADAKGRRVIADRAYRAGDVVLEDDAAAWCLISEQLGAWCDCCLAPAAQP 60

Query: 135 IPCNECILAVYCSESCRREAW-LRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSD 190
           + C+ C LA Y S   +R AW    HR EC  ALR      + H    A++L L      
Sbjct: 61  LRCSACKLAWYASREHQRRAWQAGAHRLEC-AALRSC----VPHTPPTAVRLAL-----R 110

Query: 191 RYKEVYHLETHLQDMR 206
           RY EV  L  H  D++
Sbjct: 111 RYGEVLGLRHHWGDLQ 126


>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
          Length = 455

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 60/122 (49%), Gaps = 17/122 (13%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
           +GD +  E+P A+V+LP      C+ C   +   + C+ C+   YC+ +C++EAW   H+
Sbjct: 8   RGDLILQEQPFAYVLLPQFRHERCDRC-FKLGKVLKCSGCLYVRYCNRACQQEAWPD-HQ 65

Query: 161 WECHGALRLLEA---VGIAHLALKLIL----------VSSHSDR-YKEVYHLETHLQDMR 206
            EC   L+LL A   V  A L L  I+             ++D+ Y+    L  H +D+R
Sbjct: 66  AECE-KLKLLPATLVVPSAALMLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIR 124

Query: 207 PE 208
            +
Sbjct: 125 KD 126


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 6/88 (6%)

Query: 82  KMQTSNAKGRHVIAVEDVH--KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--IPC 137
           ++++S+  GR + A E  H   GD++   +P+ + + P  + + C H C +  AP    C
Sbjct: 10  RIESSSVGGRGLFAGEGSHYIPGDSIATYRPIVYCVTPTMAKACC-HWCLNSDAPKYYQC 68

Query: 138 NECILAVYCSESCRREAW-LRYHRWECH 164
           + C  AVYCS+ C + A+ L  HR EC 
Sbjct: 69  SGCRYAVYCSKECAKAAYKLGTHRRECQ 96


>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Ornithorhynchus anatinus]
          Length = 490

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K+++ TS +KGR + A ++    D +F E+  + V+    +   C+ C         C +
Sbjct: 8   KVEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQ 67

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A YC  +C+++AWL  H+ EC
Sbjct: 68  CKFAHYCDRTCQKDAWLN-HKNEC 90


>gi|170058968|ref|XP_001865156.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167877851|gb|EDS41234.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 557

 Score = 50.4 bits (119), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 1/87 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
           +  I+M+     GR ++A  D   GD +  EK    V+    S  +C HC ++   + IP
Sbjct: 169 AKGIRMKEFGDMGRGLVAERDFKTGDVILDEKSDLCVVSFERSFISCAHCGSTFWKSLIP 228

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C   +YC E CR       HR+EC
Sbjct: 229 CLGCAAVMYCGEKCREADLQAIHRFEC 255


>gi|170047760|ref|XP_001851378.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167870065|gb|EDS33448.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 574

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 2/109 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIP 136
           ++ IKM      GR +IA +    GD +  EK +   I       NCN C      + IP
Sbjct: 188 ASGIKMARLEDFGRSMIATKRFKTGDVILHEKSILASIDLKLQFQNCNLCSAEEGLSLIP 247

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-AVGIAHLALKLIL 184
           C  C   +YCS+ C    +   HR+EC  A ++   + G  +L  +L L
Sbjct: 248 CPNCATVMYCSQECLERDFRLVHRFECPIAEKVQHLSYGFVNLGSRLFL 296


>gi|440793165|gb|ELR14357.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 387

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 5/98 (5%)

Query: 84  QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--PCSMSNCNHCCTSI---SAPIPCN 138
           + +   GR + A+ D   GD +  E P+AF         M +C+ C   +   +A + C 
Sbjct: 61  EQAETMGRGMFALRDFKAGDVILEESPLAFAPRDCVVAQMLHCSQCLRKLAASTAAVTCP 120

Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
            C L  YCS+ C  EAW  +HR  C  A    E  G A
Sbjct: 121 HCRLDPYCSDRCAAEAWESHHRLLCEAAQADPETTGQA 158


>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Taeniopygia guttata]
          Length = 491

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 9   VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+R AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQRAAWLS-HKNECSAIKR 95


>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Ornithorhynchus anatinus]
          Length = 477

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K+++ TS +KGR + A ++    D +F E+  + V+    +   C+ C         C +
Sbjct: 8   KVEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQ 67

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A YC  +C+++AWL  H+ EC
Sbjct: 68  CKFAHYCDRTCQKDAWLN-HKNEC 90


>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
           [Taeniopygia guttata]
          Length = 490

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 9   VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+R AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQRAAWLS-HKNECSAIKR 95


>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
           [Taeniopygia guttata]
          Length = 478

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++   GD +F E   A V+    +   C+ C         C +C
Sbjct: 9   VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+R AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQRAAWLS-HKNECSAIKR 95


>gi|327241846|gb|AEA40704.1| SET and MYND domain containing protein 4, partial [Ceratotherium
           simum]
          Length = 176

 Score = 50.1 bits (118), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 39/75 (52%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 16  CHRCLKHTLAPVPCDGCSYAKYCSQECMQQAWDLYHSVECSLGGLLLTLGVFCHIALRST 75

Query: 184 LVSSHSDRYKEVYHL 198
           L++   D  K + +L
Sbjct: 76  LLARFEDAGKVIRNL 90


>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Anolis carolinensis]
          Length = 485

 Score = 50.1 bits (118), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  ++ KGR + A ++   GD +F E+  A V+    +   C+ C    +    C +C
Sbjct: 4   VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AW+  H+ EC
Sbjct: 64  KFAHYCDRTCQKDAWVN-HKNEC 85


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   HR EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|158300855|ref|XP_552390.3| AGAP011843-PA [Anopheles gambiae str. PEST]
 gi|157013362|gb|EAL38849.3| AGAP011843-PA [Anopheles gambiae str. PEST]
          Length = 425

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 11/108 (10%)

Query: 84  QTSNAKG-RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECI 141
           +T + +G R   ++ED   G  + VEKP   V+    +   C+ C  +    + PC  C 
Sbjct: 156 ETGDGEGIRTNRSLED---GAKVLVEKPFVLVLEAELAYQRCDFCGATNEHNLRPCTGCT 212

Query: 142 LAVYCSESCRREAWLRYHRWECH--GALRLL----EAVGIAHLALKLI 183
             +YCSE C+ +++ RYH++EC     L+LL    +   + H+ L+L 
Sbjct: 213 GVMYCSEECQEQSYQRYHQFECEIVDDLQLLFRGPKPTRMFHVVLRLF 260


>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
          Length = 553

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 2/80 (2%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI--PCNECILAVYC 146
           +GR++ +   + KG  +   +P+  V LP      CN+C    + P    C+ C    +C
Sbjct: 20  RGRYIKSTATIPKGTVIITSQPLGTVALPQTINEYCNYCFRKQTRPPLQRCSRCKSVYFC 79

Query: 147 SESCRREAWLRYHRWECHGA 166
              C   AWL YH++ C  A
Sbjct: 80  DMGCFSNAWLSYHQFVCDPA 99


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   HR EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 838

 Score = 49.7 bits (117), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 3/87 (3%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS---APIP 136
           +++M+   A+GRH++A  D+  GD   V +P A  +      + C+H         A + 
Sbjct: 22  RVEMRVDPARGRHLVAAVDMRAGDEALVARPYACALHDSQRAARCDHTFKRADDGGALLR 81

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C  A Y     +  AW R HR EC
Sbjct: 82  CARCKHARYLGRDAQAAAWKRGHREEC 108


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score = 49.7 bits (117), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   HR EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score = 49.7 bits (117), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  ++ KGR + A ++   GD +F E+  A V+    +   C+ C    +    C +C
Sbjct: 4   VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AW+  H+ EC
Sbjct: 64  KFAHYCDRTCQKDAWVN-HKNEC 85


>gi|384485695|gb|EIE77875.1| hypothetical protein RO3G_02579 [Rhizopus delemar RA 99-880]
          Length = 394

 Score = 49.7 bits (117), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 1/85 (1%)

Query: 80  KIKMQTSNAKGRH-VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           K+K    + K  H +IA +D  + D L+ E P+   + P      CN C   +   + C 
Sbjct: 169 KLKELAQDDKVTHCLIAEQDFEENDVLYTETPLISALSPQLEGKYCNFCMNELKDVVECK 228

Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
            C    +CSE+C   A  +YH++ C
Sbjct: 229 NCDQVAFCSEACESAATKQYHQFLC 253


>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
           africana]
          Length = 478

 Score = 49.7 bits (117), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+      S C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLINSVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|170034229|ref|XP_001844977.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167875489|gb|EDS38872.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 582

 Score = 49.7 bits (117), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIPCNE 139
           ++M    A GR ++A      GD L  EK    +  P     +C+ C  S +   IPC++
Sbjct: 194 LRMTEVPAYGRAMVAERSFRPGDVLMKEKVFMTMSDPQYQYLHCHFCGESKLFNLIPCSD 253

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C   +YCS+ CR+E     HR+EC
Sbjct: 254 CASVMYCSDKCRQEDKRTLHRFEC 277


>gi|405974951|gb|EKC39558.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 670

 Score = 49.7 bits (117), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/107 (24%), Positives = 43/107 (40%), Gaps = 33/107 (30%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAF-----VILPPCSMSNCNHCCTSI------------- 131
           GR + A ED+ +G+T+F + P+        +    ++ +C+HC  S+             
Sbjct: 255 GRAIYATEDIPEGETVFEDSPIVLGKVFDTVHYKTTIQSCDHCAQSLLSPQEYFMETFST 314

Query: 132 ---------------SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
                          + PI C  C    YCS  CR ++W  YH+  C
Sbjct: 315 MESDLRDLVLKYWPNTKPIYCKNCKFVKYCSNDCRTQSWNTYHKIIC 361


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 44  KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 102

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 103 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 156

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   R E L Q V
Sbjct: 157 EKLYSFYDLESNINKLTEDRKEGLRQLV 184


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 17/141 (12%)

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
            KGR ++       G+ +   +P A         SNC+HC TS S    C+ C +A YC 
Sbjct: 24  GKGRGLVTTRRFFPGEVILCGEPYASTPNKILVGSNCDHCFTS-SNLRKCSVCRVAWYCG 82

Query: 148 ESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL----------VSSHSDRY 192
             C++E W + H+ EC     L +           L ++LIL           S+ +D Y
Sbjct: 83  SVCQKEEW-KLHQLECRAISALTDDRKKMLTPTIRLMVRLILRRKLQNEKVIPSTGTDNY 141

Query: 193 KEVYHLETHLQDMRPEDLYQY 213
             V  LE+H+ ++  + L  Y
Sbjct: 142 GLVDALESHISEVDDKQLVLY 162


>gi|170036319|ref|XP_001846012.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878889|gb|EDS42272.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 565

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 42/96 (43%), Gaps = 13/96 (13%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS------NCNHCCTSIS-A 133
           I M      GR + A  + + GD +  EK      L  C +       +CN C    S +
Sbjct: 173 IVMTYDPKFGRGLYAEREFNPGDIIINEK------LELCGLDYGYVYRHCNQCGADFSYS 226

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
            IPC  C   +YCSE C    W  YHR+EC  A +L
Sbjct: 227 LIPCPACTFCMYCSEECLEMNWKFYHRFECGVATKL 262


>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
           harrisii]
          Length = 490

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K+++ TS  KGR +   +++   D +F E   + V+    +   C+ C         C +
Sbjct: 8   KVEVFTSEGKGRGLKTTKELWAADVVFAEPAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQ 67

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C  A YC   C+++AWL  H+ EC
Sbjct: 68  CKFARYCGRPCQKDAWLN-HKNEC 90


>gi|327241840|gb|AEA40701.1| SET and MYND domain containing protein 4, partial [Hippopotamus
           amphibius]
          Length = 176

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 37/75 (49%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS  C +EAW  YH  EC     LL      H+AL+  
Sbjct: 16  CHRCLKHTLAPVPCDGCSYAKYCSHECMQEAWHLYHSVECSLGGLLLTLGVFCHIALRSA 75

Query: 184 LVSSHSDRYKEVYHL 198
           L++   D  K V  L
Sbjct: 76  LLARFEDVSKVVNKL 90


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 8/125 (6%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR +   ++   GD +F E  +A V+    +   C+ C         C++C  A Y
Sbjct: 9   SPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           C  +C+R  W   H+ EC GA++   A G A    + I V SH  +   V  LE H+ DM
Sbjct: 69  CDRTCQRAGWAE-HKQEC-GAIK---AYGKA--PNENIRVVSHM-QLITVEELEDHVADM 120

Query: 206 RPEDL 210
           + +++
Sbjct: 121 QEDEI 125


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 57/145 (39%), Gaps = 29/145 (20%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGR ++A  D + G T+  + P   V+      + C+ C     A   C+ C    YCS 
Sbjct: 24  KGRCLVADRDFYAGQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALRRCSACKFVFYCSP 83

Query: 149 SCRREAWLRYHRWECHGALRL--------------------LEAVGIAHLALKLILVSSH 188
           +C+R  W + H+ EC   +RL                    L+A G        +L  + 
Sbjct: 84  TCQRSQW-KIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATG--------VLPVTA 134

Query: 189 SDRYKEVYHLETHLQDMRPEDLYQY 213
            D Y+ V  L TH  +   E L  Y
Sbjct: 135 LDNYEIVRALPTHFSETGDERLVMY 159


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A  +   GD +F E   + V+    S   C+ C       + C +C
Sbjct: 4   VEIFDSEGKGRGLRATRESWAGDVIFAEPAYSAVVFDNLSHCVCHSCFKRQEKLLRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C++E+W+  H+ EC
Sbjct: 64  KFAHYCDRTCQKESWVN-HKHEC 85


>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 478

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)

Query: 75  MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           M A NK  +++  + AKGR + A ++   G+ +F E   + V+    +   C HCC    
Sbjct: 1   MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59

Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
           A +  C +C  A YC+ +C+   W   H+ EC
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90


>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
          Length = 428

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMHCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   HR EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|170040673|ref|XP_001848116.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864255|gb|EDS27638.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 586

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 54/127 (42%), Gaps = 11/127 (8%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--- 134
           +N ++M      GR +IA      GD +  EK +  V        NCN C  S   P   
Sbjct: 191 ANGLRMAELPTYGRAMIAERSFEPGDVILHEKSLLTVGDARYLHLNCNLC--SEWKPYNL 248

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL------LEAVGIAHLALKLILVSSH 188
           IPC  C+  +YCSE CR++     HR+EC    +L      +  +G       L L   +
Sbjct: 249 IPCPNCVWVMYCSEECRQQDLRTVHRFECGITAKLKHLNSWMVDMGPKLFFYGLTLFGDN 308

Query: 189 SDRYKEV 195
            D+ K+ 
Sbjct: 309 VDKIKQF 315


>gi|327241844|gb|AEA40703.1| SET and MYND domain containing protein 4, partial [Camelus
           bactrianus]
          Length = 175

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 2/71 (2%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKL 182
           C+ C     A +PC+ C  A YCS+ C ++AW  YH  EC  GAL L   V   H+AL+ 
Sbjct: 16  CHRCLKHTLATVPCDGCSYAKYCSQECMQQAWDLYHSVECSQGALLLTLGV-FCHIALRS 74

Query: 183 ILVSSHSDRYK 193
            L++   D  K
Sbjct: 75  TLLARFEDAGK 85


>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Cavia porcellus]
          Length = 490

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKHEC 90


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 9   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 67

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 68  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 121

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   R E L Q V
Sbjct: 122 EKLYSFYDLESNINKLTEDRKEGLRQLV 149


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   R E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDRKEGLRQLV 148


>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
           [Oryzias latipes]
          Length = 476

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)

Query: 75  MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           M A NK  +++  + AKGR + A ++   G+ +F E   + V+    +   C HCC    
Sbjct: 1   MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59

Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
           A +  C +C  A YC+ +C+   W   H+ EC
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   R E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDRKEGLRQLV 148


>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 489

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)

Query: 75  MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           M A NK  +++  + AKGR + A ++   G+ +F E   + V+    +   C HCC    
Sbjct: 1   MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59

Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
           A +  C +C  A YC+ +C+   W   H+ EC
Sbjct: 60  AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90


>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Cavia porcellus]
          Length = 477

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKHECSAIKR 95


>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 53/108 (49%), Gaps = 22/108 (20%)

Query: 80  KIKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
           +IK+Q  ++  KGR + ++ ++ +   +  E+P A +IL  C  ++C+ C   + A  +P
Sbjct: 219 QIKLQCVSTPTKGRGMASLSEISQSYLVHTEEPYAAIILKHCRDTHCHFCFNELPADSVP 278

Query: 137 CNECILAVYCSESCRREA-------------------WLRYHRWECHG 165
           C  C + +YCS+ C+ +A                   W+  H+ EC G
Sbjct: 279 CTSCSIPLYCSQHCQMQAGGQELRNNSKNHDSDSNIEWIAEHKHECKG 326


>gi|26352708|dbj|BAC39984.1| unnamed protein product [Mus musculus]
          Length = 114

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRL 169
           C +A YCS  C+++AW   HR EC    RL
Sbjct: 65  CRIAKYCSAKCQKKAWP-DHRRECSCLKRL 93


>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
          Length = 462

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|426334416|ref|XP_004028748.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
           gorilla gorilla]
          Length = 182

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE++++ +   + E L Q V
Sbjct: 121 EKLYSFYDLESNIKKLTEDKKEGLRQLV 148


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 2/83 (2%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A ++   GD +F E  +A V+    +   C+ C         C++C  A Y
Sbjct: 9   SPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWECHGALR 168
           C  +C+R  W   H+ EC GA++
Sbjct: 69  CDRTCQRAGWAE-HKQEC-GAIK 89


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score = 48.9 bits (115), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 1/94 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           S+ + +  S AKGR +IA    H+G+ +  ++P    +        C+ C    S    C
Sbjct: 16  SHNLSVSVSPAKGRCLIANRAFHRGEVILQQEPYVCALDDSSQALRCDRCYRQSSNLKRC 75

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           + C    YC  +C+R  W   H++EC   +RLL+
Sbjct: 76  SACKTVFYCCANCQRSGW-GLHKFECAILVRLLK 108


>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 627

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 4/123 (3%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-MSNC 124
           L  G +  +   S  ++++ S   GR V A  D++ GD + +++P   V  P  S + N 
Sbjct: 217 LVGGRNENIPALSAFVELKMSENMGRGVYATRDINPGDVVAIDEPYICVPFPDESEVCNY 276

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA--LRLLEAVG-IAHLALK 181
           N C     A   C +C++A YC++ C  + +   H   C     ++L   +  I  LALK
Sbjct: 277 NGCLKFDIALFRCPKCVMAYYCNKDCMNKDYKDGHHLVCPIICFIKLTPGISKINELALK 336

Query: 182 LIL 184
             L
Sbjct: 337 WFL 339


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   H+ EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  T+  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   H+ EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNISKL 136


>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
           [Ailuropoda melanoleuca]
          Length = 490

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
          Length = 490

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
 gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
           Full=CD8b-opposite; AltName: Full=Zinc finger protein
           BOP; Short=m-BOP
 gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
           Methyltransferase Smyd1
 gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
 gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
 gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
          Length = 490

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
          Length = 483

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 4   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 64  KFAHYCDRTCQKDAWLN-HKNECSAIKR 90


>gi|384498991|gb|EIE89482.1| hypothetical protein RO3G_14193 [Rhizopus delemar RA 99-880]
          Length = 511

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
           +A+ +   A + +  G TLF+E  +  + LP      CN+C +       C+ C  A +C
Sbjct: 14  SAENKQFTAAKHLEPGITLFIEPSLVSIPLPSKRHQRCNYCLSKAQLQC-CSRCRSAYFC 72

Query: 147 SESCRREAWLRYHRWEC 163
             +C R AWL +HR  C
Sbjct: 73  GNACFRNAWLHFHRVLC 89


>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 467

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 10/135 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           + +KGR + A ++++ G+ +F E   A V+    +   C+ C    +    C +C  A Y
Sbjct: 14  AGSKGRGLRAAKELNTGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL-ILVSSHSD-------RYKEVYH 197
           C  +C+   W   H+ EC GA++ L +    ++ L   +L   H D       +   V  
Sbjct: 74  CDRTCQTACW-NEHKQEC-GAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLISVDQ 131

Query: 198 LETHLQDMRPEDLYQ 212
           L+ H+ D+  ED  Q
Sbjct: 132 LQEHVADLAAEDFKQ 146


>gi|312384185|gb|EFR28972.1| hypothetical protein AND_02431 [Anopheles darlingi]
          Length = 641

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 73/165 (44%), Gaps = 45/165 (27%)

Query: 27  FKEDV--LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ 84
            KED+  L + K E  S E   +  + T  +A+S ++               AS+KIK+ 
Sbjct: 215 LKEDLEDLRVLKRETPSNEEEKVLPTFTGDAANSELSG--------------ASSKIKLV 260

Query: 85  TS--NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
            S  +++GR++ A ED+  G+ +  E  +A  + P                         
Sbjct: 261 GSKDDSRGRYIAAAEDIAPGEPILAEPAIAACLYPK-----------------------F 297

Query: 143 AVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLIL 184
             +CS +CR  A   YHR+EC   L L+   G++   HLAL+++L
Sbjct: 298 FAFCSVACRDRACASYHRFECQ-YLDLMIGSGMSILCHLALRIVL 341


>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
          Length = 485

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 4   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 64  KFAHYCDRTCQKDAWLN-HKNEC 85


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 4   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 64  KFAHYCDRTCQKDAWLN-HKNEC 85


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 10/133 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A ++   GD +F E  +A V+    +   C+ C         C +C  A Y
Sbjct: 9   SPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV-------SSHSD-RYKEVYH 197
           C  +C+R  W   H+ EC  A++    V   ++ L   ++       S+ SD +   +  
Sbjct: 69  CDRTCQRAGWAE-HKLEC-SAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDE 126

Query: 198 LETHLQDMRPEDL 210
           LE H+ DM  +DL
Sbjct: 127 LEDHIADMPEDDL 139


>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Canis lupus familiaris]
          Length = 490

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
 gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
          Length = 584

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 1/76 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
           K+   ++  KGR + +  D+ +   +  E+P A +IL  C  ++C++C   + A  +PC 
Sbjct: 16  KLSQVSTPDKGRGMASQCDIPQASLVHKEEPYALIILKSCRETHCHYCLNQLRADTVPCT 75

Query: 139 ECILAVYCSESCRREA 154
            C + +YCSE C+ +A
Sbjct: 76  SCTIPLYCSEHCQIQA 91


>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 687

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 63/134 (47%), Gaps = 19/134 (14%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH +A  +V  G+ + +E+P    +L    +++C+ C   I  PIP  C  C    YCS
Sbjct: 262 GRHAVATREVSPGEIVIMERPYCASLLAENRLTHCHLCFARIFVPIPAACRTCSCVAYCS 321

Query: 148 ESCR-REAWLRYHRWECHGALRLL---EAVGIAHLALKLILVSSHSDRYKEVYHLETHLQ 203
             CR R+A +  H  EC   L +L   +A     LALK I        ++EV  L+  L 
Sbjct: 322 RRCRDRDAQV--HLREC-SLLPVLWHSKASVTCFLALKAITQRP----FEEVMKLKERLG 374

Query: 204 D------MRPEDLY 211
           D      + PE+ Y
Sbjct: 375 DAAGALRISPENPY 388


>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
 gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
          Length = 490

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
 gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
           sativus]
          Length = 775

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 1/74 (1%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILA 143
           T++ KGR +++  ++     + VE+P A VIL  C  ++C++C   +    +PC  C + 
Sbjct: 219 TTSIKGRGMVSPTEIPPSSLVHVEEPYAVVILKHCRETHCHYCLNELPVDKVPCPSCSIP 278

Query: 144 VYCSESCRREAWLR 157
           +YCS+ C+ +A  R
Sbjct: 279 LYCSQHCQIQAGGR 292


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 10/133 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A ++   GD +F E  +A V+    +   C+ C         C +C  A Y
Sbjct: 9   SPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV-------SSHSD-RYKEVYH 197
           C  +C+R  W   H+ EC  A++    V   ++ L   ++       S+ SD +   +  
Sbjct: 69  CDRTCQRAGWAE-HKLEC-SAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDE 126

Query: 198 LETHLQDMRPEDL 210
           LE H+ DM  +DL
Sbjct: 127 LEDHIADMPEDDL 139


>gi|307202234|gb|EFN81718.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 343

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%)

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           MS+C++C     A IPC  C  A+YCSE CR   W +YH  EC
Sbjct: 1   MSHCSNCLRVCLATIPCKYCTYAMYCSEQCRDIEWEKYHDVEC 43


>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
           garnettii]
          Length = 490

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           +++  N  GR +IA +D+  G    +EKP   V+        C+ C    +    C+ C 
Sbjct: 9   QVENRNGYGRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQTCSGCFRLAAHMQQCSSCK 68

Query: 142 LAVYCSESCRREAWLRYHRWECHG 165
           +  YCS++C+R  W   H+ EC G
Sbjct: 69  VVQYCSQTCQRSDW-SIHKPECEG 91


>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
           cuniculus]
          Length = 490

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
 gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 490

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 454

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 10/135 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           + +KGR + A ++++ G+ +F E   A V+    +   C+ C    +    C +C  A Y
Sbjct: 14  AGSKGRGLRAAKELNTGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL-ILVSSHSD-------RYKEVYH 197
           C  +C+   W   H+ EC GA++ L +    ++ L   +L   H D       +   V  
Sbjct: 74  CDRTCQTACW-NEHKQEC-GAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLISVDQ 131

Query: 198 LETHLQDMRPEDLYQ 212
           L+ H+ D+  ED  Q
Sbjct: 132 LQEHVADLAAEDFKQ 146


>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
 gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
 gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
          Length = 391

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           +K++  ++  KG  + A+  +  G+ LF   P+A+ +        C+ C       + C+
Sbjct: 4   SKVEKFSTTDKGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63

Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
           +C +A YCS  C+++AW + H+ EC             ++RLL       +  KL+    
Sbjct: 64  QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117

Query: 187 SHSDRYKEVYHLETHLQDM 205
           S S++    Y LE+++  +
Sbjct: 118 SESEKLYSFYDLESNINKL 136


>gi|78070776|gb|AAI07726.1| SMYD3 protein [Homo sapiens]
          Length = 182

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
           cuniculus]
          Length = 477

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
          Length = 675

 Score = 48.1 bits (113), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 3/76 (3%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH +A  ++  G+ + VE+     +L    +++CN C   I  P P  C  C   +YCS
Sbjct: 249 GRHAVATREIVPGEIVIVERAHCAFLLEENRLTHCNLCFVRIFVPTPAACRTCNCVMYCS 308

Query: 148 ESCRREAWLRYHRWEC 163
             C R+A    H  EC
Sbjct: 309 RRC-RDADAEVHSREC 323


>gi|326421762|gb|ADZ73909.1| SET and MYND domain containing protein 4 [Neophocaena phocaenoides]
          Length = 212

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECMQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|326421780|gb|ADZ73918.1| SET and MYND domain containing protein 4 [Kogia sima]
          Length = 212

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 40/81 (49%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHLETHLQD 204
           L++   +  K +  L   + D
Sbjct: 79  LLARFEEAGKVIDKLCGEITD 99


>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
          Length = 476

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S+ KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFISDGKGRSLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAYYCDRTCQKDAWLN-HKNEC 90


>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
           griseus]
          Length = 498

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|197692952|gb|ACH71264.1| SET and MYND domain-containing 1 [Sus scrofa]
          Length = 194

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 6   VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 65

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 66  KFAHYCDRTCQKDAWLN-HKNEC 87


>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
           intestinalis]
          Length = 430

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 42/83 (50%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS   GR + A  +++ G T+   +P A+++        C+ C       + C+ C
Sbjct: 5   VELFTSEKSGRGLKAKRNLNPGSTVLSSEPYAYLLSKKQKGVYCDFCFKKQDGLLQCSGC 64

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
               YC+ +C++ AW  +H+ EC
Sbjct: 65  KYMKYCNRNCQKMAWNEHHKAEC 87


>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
          Length = 467

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|297722479|ref|NP_001173603.1| Os03g0704200 [Oryza sativa Japonica Group]
 gi|255674817|dbj|BAH92331.1| Os03g0704200 [Oryza sativa Japonica Group]
          Length = 174

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 17/140 (12%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR + A      G+ +  ++P A         SNC++C  S +    
Sbjct: 34  AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 92

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA-----VGIAHLALKLIL------- 184
           C+ C +A YC  +C+RE W + H+ EC     L E           L ++L+L       
Sbjct: 93  CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 151

Query: 185 ---VSSHSDRYKEVYHLETH 201
               SS +D Y  V  LE+H
Sbjct: 152 KAIPSSGTDNYNLVDALESH 171


>gi|326421778|gb|ADZ73917.1| SET and MYND domain containing protein 4 [Sousa chinensis]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|209878147|ref|XP_002140515.1| SET domain-containing protein [Cryptosporidium muris RN66]
 gi|209556121|gb|EEA06166.1| SET domain-containing protein [Cryptosporidium muris RN66]
          Length = 573

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 6/90 (6%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP---- 134
           N + ++ S  KGR +IA  D+  G+ +++E P  +++      S C+ C   I       
Sbjct: 9   NIVHLKVSEEKGRCIIANRDIKAGEVIWLEYPFIYIVFEEFLDSVCDTCLKYIGTEKVVN 68

Query: 135 -IPCNECILAVYCSESCRREAWLRYHRWEC 163
            I CN C +  YCS  C+  +    H++EC
Sbjct: 69  IIVCNGCSVVKYCSNICKSNS-EGIHKFEC 97


>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
 gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
          Length = 457

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 3/103 (2%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP A+V+     +  C++C  ++     C+ C    YC+ +C+++AW   
Sbjct: 19  IKRGTRILTEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAW-PL 77

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    R+   +    A +  +LIL   H    +  Y+ E
Sbjct: 78  HKHECPFLKRVQPRIVPDAARMLCRLILRLEHGGDLERGYYTE 120


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 10  KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 68

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 69  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 122

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 123 EKLYSFYDLESNINKLTEDKKEGLRQLV 150


>gi|326322048|gb|ADZ54024.1| SET and MYND domain containing 4 [Mesoplodon densirostris]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
           magnipapillata]
          Length = 1037

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 43/82 (52%), Gaps = 14/82 (17%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHCCTSISAPIPCNECI 141
           KGR V A++D+ +G  +  E+P+A      C ++N       C++C +     + C++C 
Sbjct: 15  KGRGVRALQDIKRGVEILKEEPLA------CILTNSKYRGIRCDYCYSEPEKLLKCSKCK 68

Query: 142 LAVYCSESCRREAWLRYHRWEC 163
              YC + C+   W + H++EC
Sbjct: 69  FIAYCGKVCQASDW-KMHKYEC 89


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           + + T + KGR +    D H GD +  ++P  +V +P  +   C+ C ++ +    C+ C
Sbjct: 13  LTVSTISEKGRSLFTTRDFHPGDVIISQEP--YVCVP--TQKRCDGCFSTTNLS-KCSRC 67

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRL 169
            +  YC   C++  W + HR EC    RL
Sbjct: 68  QVVWYCGTPCQKSEW-KLHRLECQALSRL 95


>gi|326421776|gb|ADZ73916.1| SET and MYND domain containing protein 4 [Sousa chinensis]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|326421772|gb|ADZ73914.1| SET and MYND domain containing protein 4 [Tursiops aduncus]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/152 (24%), Positives = 69/152 (45%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           ++N + + T+  KGR +++++D   G+ +  ++P   V       S C  C  S S    
Sbjct: 9   SANGLTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKK 67

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL-----------LEAVGIAHLALKL--- 182
           C+ C +  YC  +C++  W + HR EC+   RL           +  +   ++  KL   
Sbjct: 68  CSACHVVWYCGSTCQKSDW-KLHRLECNALSRLEKERQKSLTPSIRLMVKLYMRRKLQSE 126

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            I+ ++  D Y  V  L +H+ D+  + L  Y
Sbjct: 127 KIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
           harrisii]
          Length = 428

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 35/141 (24%), Positives = 66/141 (46%), Gaps = 18/141 (12%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   S  +G  + AV  +  G+ LF  +P+A+ +        C  C       + C++
Sbjct: 5   KLEKFRSPGRGNGLRAVAPLRPGELLFRSEPLAYTVCKGSRGVVCERCLCRKEKLLRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSS- 187
           C +A YC   C+++AW + H+ EC             ++RLL  V       KL+  SS 
Sbjct: 65  CKVARYCGSECQKQAW-QDHKRECKCLKSSEPKFPPDSVRLLGKV-----VFKLLRESSC 118

Query: 188 HSDRYKEVYHLETHLQDMRPE 208
            S++    + LE++++++  E
Sbjct: 119 ASEKLYSFFDLESNIKNLSEE 139


>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
          Length = 492

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 66/150 (44%), Gaps = 20/150 (13%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC---TSISA 133
           A +K+  + S   G+ + A +D+  GD +F  K     +L   S+ +C   C   T   A
Sbjct: 2   ADSKVLKKKSAISGKGLFANQDIAAGDLIFSLKRPLIAVLDNVSLDSCCANCFASTGFGA 61

Query: 134 P------IPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLL----EAVGIAHL 178
                    C  C    YC  SC+ ++W R+H+ EC      GA R L     AV I  L
Sbjct: 62  TNNDLSLRACTGCSTLKYCGRSCQSQSWKRHHKHECPVLKTLGAERQLPNAVRAV-IQTL 120

Query: 179 ALKLILVSSHSDRYKEVYHLETHLQDMRPE 208
            ++   + S  D +K +  L  HL ++R E
Sbjct: 121 VMRKSGLISDED-WKNLQELPAHLDELRQE 149


>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
          Length = 477

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 6/89 (6%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           + + T + KGR +    D H GD +  ++P  +V +P  +   C+ C ++ +    C+ C
Sbjct: 13  LTVSTISEKGRSLFTTRDFHPGDVIISQEP--YVCVP--TQKRCDGCFSTTNLS-KCSRC 67

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRL 169
            +  YC   C++  W + HR EC    RL
Sbjct: 68  QVVWYCGTPCQKSEW-KLHRLECQALSRL 95


>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 1240

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
           GR   A +D+ K +T+ +++      L   S S+ C++C     +PI C +C  + YCSE
Sbjct: 598 GRIAEATDDI-KANTILIKENAYVTCLDRASYSSHCHNCFKETLSPIFCKKCNHSQYCSE 656

Query: 149 SCRREAWLRYHRWECHGALRLL 170
            C  + +++YH  EC     LL
Sbjct: 657 KCLNQDFIKYHEMECSIGFLLL 678


>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
          Length = 786

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)

Query: 81  IKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPC 137
           IK+Q  ++  KGR +++   +  G  + VE+P A +IL  C  ++C++C   + A  +PC
Sbjct: 215 IKLQCVSTPDKGRGMVSSCVISPGSLVHVEEPYAMIILKQCRETHCHYCLNDLPADRVPC 274

Query: 138 NECILAVYCSESCRREA 154
             C + +YCS  C+  A
Sbjct: 275 ISCSIPLYCSHQCQTRA 291


>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
          Length = 354

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 52/110 (47%), Gaps = 4/110 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           V +G T+F  KP A+V+   C+   C++C  S      C+ C    YC  +C++E+W   
Sbjct: 8   VKRGITIFTSKPFAYVLTSKCNAICCDNCLKSGKLS-KCSGCQYVYYCDRNCQKESW-PI 65

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
           H+ EC    R+   +    A L  ++I+  +     +  Y+ ET  +  +
Sbjct: 66  HKAECANLKRISPKIIPDAARLMARIIIKLNQGGANEVGYYCETKFRKFK 115


>gi|193676233|ref|XP_001943351.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 636

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 10/126 (7%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS--MSN 123
           L  G +  +   S  ++++ S   GR V A  D++ GD + +++P    I  P S     
Sbjct: 214 LVGGRNENIPALSAFVELKMSKNMGRGVYATRDINPGDVVAIDEPY---ICGPISDHTGV 270

Query: 124 CNH--CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---L 178
           C +  C     A IPC +C+L  YC++ C  +A    H  EC     +    GI     L
Sbjct: 271 CQYSGCLNLDLALIPCPKCLLVYYCNKDCMNKANEDGHYLECPIMYFIKSTPGITRMNEL 330

Query: 179 ALKLIL 184
           A+K  L
Sbjct: 331 AMKWFL 336


>gi|326421788|gb|ADZ73922.1| SET and MYND domain containing protein 4 [Delphinapterus leucas]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 37/72 (51%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEV 195
           L++   +  K +
Sbjct: 79  LLARFEEAGKVI 90


>gi|326421784|gb|ADZ73920.1| SET and MYND domain containing protein 4 [Elaphurus davidianus]
          Length = 212

 Score = 47.8 bits (112), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
 gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
          Length = 448

 Score = 47.4 bits (111), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 49/103 (47%), Gaps = 4/103 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP A+V+     +  C++C  +    + C+ C    YC+ SC+++AW   
Sbjct: 9   IKRGTRILTEKPFAYVLKSKYRLERCDNCLEATKV-LKCSNCRYVSYCNRSCQQQAW-SI 66

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    R L  +    A +  +LIL   H    +  Y+ E
Sbjct: 67  HKHECLFLKRALPRIVPDAARMLCRLILRLEHGGDLERGYYTE 109


>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
           boliviensis boliviensis]
          Length = 490

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|326421766|gb|ADZ73911.1| SET and MYND domain containing protein 4 [Delphinus capensis]
 gi|326421770|gb|ADZ73913.1| SET and MYND domain containing protein 4 [Stenella attenuata]
 gi|326421774|gb|ADZ73915.1| SET and MYND domain containing protein 4 [Stenella coeruleoalba]
          Length = 212

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|326421782|gb|ADZ73919.1| SET and MYND domain containing protein 4 [Platanista gangetica]
          Length = 212

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 79  LLARFEEAGKVINKL 93


>gi|328710706|ref|XP_003244339.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 648

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 1/99 (1%)

Query: 66  LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-MSNC 124
           L  G +  +   S+ ++++ S +  R V A  D++ GD + +++P      P C+ + N 
Sbjct: 228 LVGGRNENIPALSSFVELKMSESMERGVYATRDINPGDVVAIDEPYIGWPFPECAGVCNF 287

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           N C    SA I C +C L  YC++ C  +     H  EC
Sbjct: 288 NGCMKINSALIQCPKCKLVSYCNKDCMNKDDKDGHNLEC 326


>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/152 (24%), Positives = 69/152 (45%), Gaps = 17/152 (11%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           ++N + + T+  KGR +++++D   G+ +  ++P   V       S C  C  S S    
Sbjct: 9   SANGLTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKK 67

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL-----------LEAVGIAHLALKL--- 182
           C+ C +  YC  +C++  W + HR EC+   RL           +  +   ++  KL   
Sbjct: 68  CSACHVVWYCGSTCQKSDW-KLHRLECNALSRLEKERQKSLTPSIRLMVKLYMRRKLQSE 126

Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
            I+ ++  D Y  V  L +H+ D+  + L  Y
Sbjct: 127 KIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + A+  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFSTAN-RGNGLRALAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEEKKEGLRQLV 148


>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
 gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
          Length = 453

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 2/63 (3%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
           +GD +  EKP A V+ P    S C+ C       + C+ C+   YC  SC++EAW   H+
Sbjct: 8   RGDVILQEKPFACVLDPRYRDSRCDRCFKETKV-MKCSNCLYVRYCGRSCQKEAWSD-HK 65

Query: 161 WEC 163
            EC
Sbjct: 66  EEC 68


>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
 gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
 gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
          Length = 490

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
          Length = 410

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 14/85 (16%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPC---SMSNCNHCCTSISAPI--------PCN 138
           G+ + A  D+ + + LF E+P  +V  PP    S++   H C     PI         C 
Sbjct: 105 GKGLFASRDIKQDEILFTEEP--YVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHCK 162

Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
            C +  YCS+SCR  AW ++H+ EC
Sbjct: 163 HCDM-YYCSKSCRATAWDQFHQLEC 186


>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
           jacchus]
          Length = 442

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIRR 95


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVTALRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGTPSES 120

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
          Length = 421

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 37/78 (47%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  +GR V     +  GD       +A V+LP    S+C+ C  S +    C  C  A Y
Sbjct: 14  SGDRGRRVETETALAAGDLALRAPALATVLLPALWDSHCHKCFASGTRLSRCGRCNTAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           CS++C++  W   HR EC
Sbjct: 74  CSKACQQADWKPDHRKEC 91


>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
          Length = 370

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           K +H+ AV+    GD +  E+P A+V+      S C+ C       + C+ C    YC+ 
Sbjct: 3   KLKHINAVK---TGDLILSEEPFAYVLSSKEKGSRCDFCLEK-GKVLKCSGCQFVHYCNR 58

Query: 149 SCRREAWLRYHRWECHGALRL 169
           SC+++AW   H+WEC    R+
Sbjct: 59  SCQKDAW-EDHKWECANLKRI 78


>gi|327241838|gb|AEA40700.1| SET and MYND domain containing protein 4, partial [Balaenoptera
           acutorostrata]
          Length = 176

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 16  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 75

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 76  LLARFEEAGKVINKL 90


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A ++   GD +F E  +A V+    +   C+ C         C +C  A Y
Sbjct: 9   SPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWEC 163
           C  +C+R  W   H+ EC
Sbjct: 69  CDRTCQRAGWAE-HKQEC 85


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A ++   GD +F E  +A V+    +   C+ C         C +C  A Y
Sbjct: 9   SPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHY 68

Query: 146 CSESCRREAWLRYHRWEC 163
           C  +C+R  W   H+ EC
Sbjct: 69  CDRTCQRAGWAE-HKQEC 85


>gi|299750613|ref|XP_001829706.2| hypothetical protein CC1G_09595 [Coprinopsis cinerea okayama7#130]
 gi|298408988|gb|EAU92074.2| hypothetical protein CC1G_09595 [Coprinopsis cinerea okayama7#130]
          Length = 951

 Score = 47.4 bits (111), Expect = 0.004,   Method: Composition-based stats.
 Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 8/70 (11%)

Query: 100 HKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP------CNECILAVYCSESCRRE 153
           H+ +    E P   V++  C  S+  H C + ++  P      C  C    YCS  C++E
Sbjct: 747 HEANESRYEPPERHVVI--CDNSSAPHSCPNPTSLDPTKAGKTCGGCHWVNYCSAKCQQE 804

Query: 154 AWLRYHRWEC 163
            W R HRWEC
Sbjct: 805 DWTRRHRWEC 814


>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
          Length = 449

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTSEGKGRGLKATKEFWASDVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AW+  H+ EC    R
Sbjct: 69  KFARYCDRTCQKDAWVD-HKKECLAVKR 95


>gi|327241836|gb|AEA40699.1| SET and MYND domain containing protein 4, partial [Tursiops
           truncatus]
          Length = 176

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 38/75 (50%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 16  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 75

Query: 184 LVSSHSDRYKEVYHL 198
           L++   +  K +  L
Sbjct: 76  LLARFEEAGKVINKL 90


>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
           NIH/UT8656]
          Length = 550

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 33/151 (21%), Positives = 65/151 (43%), Gaps = 20/151 (13%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVH-KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA 133
           +    ++  + +  +G  + AV+D+  +   LFV +P+   +      +NC +C      
Sbjct: 29  LVEDERVYTKRTADRGNGLFAVDDIAAESQLLFVARPLLVALENAKLPTNCYYCFREPGG 88

Query: 134 P-------------IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
                           C+ C +A +C + C+ EAW +YHR EC    RL   V  + +  
Sbjct: 89  QGLWQMTERKEDSLKTCSRCKVAKFCDQKCQTEAWSQYHRLECKLFSRLYPRVLPSTVRA 148

Query: 181 KLILVSSHS------DRYKEVYHLETHLQDM 205
            + L+  H         ++++  L++H QD+
Sbjct: 149 VIRLLKQHKAGILPPGEWEQLLALQSHQQDL 179


>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
 gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
 gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
 gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
 gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
 gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
 gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
 gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
 gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
          Length = 490

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
          Length = 493

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
          Length = 484

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 3   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 62

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 63  KFAHYCDRTCQKDAWLN-HKNECSAIKR 89


>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
          Length = 425

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 58/124 (46%), Gaps = 15/124 (12%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS--ISAPIPCNECILAVYCSESCRREAWL 156
           V KG T+   +P  FV+      + C++C  S  +   + C+ C    YC+ +C++++W 
Sbjct: 1   VKKGTTILTGRPFVFVLRSKYRTTRCDNCFKSEQLVKLLKCSSCQYVYYCNRNCQKQSW- 59

Query: 157 RYHRWECHGALRLLEAV------GIAHLALKLILVSSH------SDRYKEVYHLETHLQD 204
           R H+ EC    ++L  +       +A + +KL    +        + Y++   L +H  D
Sbjct: 60  RIHKRECVCLKKILPVILPDSVRLVARIIIKLKQGGADEMDYYTKENYRKFTDLMSHYSD 119

Query: 205 MRPE 208
           MR +
Sbjct: 120 MRAD 123


>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3-like [Apis florea]
          Length = 414

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + KG TLF  KP A+V+        C++C  S      C+ C    YC++SC++ +W   
Sbjct: 8   IKKGTTLFTAKPFAYVLYSKYRNERCDYCFKS-GKLFRCSVCKCIYYCNQSCQQMSW-TI 65

Query: 159 HRWECHGALRLLEAV--GIAHLALKLIL---------VSSHSD-RYKEVYHLETHLQDMR 206
           H  EC    R    V   +A L  ++I+         +  +S  +Y++   L +H  D++
Sbjct: 66  HSKECASLKRFSSKVIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHYSDIK 125

Query: 207 PED 209
            ++
Sbjct: 126 KDE 128


>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
          Length = 510

 Score = 47.4 bits (111), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
           rotundata]
          Length = 438

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 4/108 (3%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
           KG T+F  KP A+V+      + C++C TS      C+ C    YC+ SC++ +W   H 
Sbjct: 10  KGTTIFTAKPFAYVLSSKYRTNRCDYCLTS-GKLFRCSACQYVYYCNRSCQQGSWPT-HN 67

Query: 161 WECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
            EC    R+   V   +A L  ++I+  S     +  Y+ +T  +  +
Sbjct: 68  TECVNLKRVSPKVVPDMARLMARIIIKLSQGGDDEVEYYTKTKFRRFK 115


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           +K++  ++  +G  + A+  +  G+ LF   P+A+ +        C+ C       + C+
Sbjct: 4   SKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63

Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
           +C +A YCS  C+++AW + H+ EC             ++RLL       +  KL+    
Sbjct: 64  QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117

Query: 187 SHSDRYKEVYHLETHLQDM 205
           S S++    Y LE+++  +
Sbjct: 118 SESEKLYSFYDLESNINKL 136


>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
          Length = 816

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILAVYCS 147
           KGR + +  D+     +  E P+A +I+ PC  ++C++C +   A +  C  C + +YCS
Sbjct: 250 KGRGMTSPNDISPASLIHAEDPLAVIIMKPCRDTHCHYCFSEAPADVVFCPSCTIPIYCS 309

Query: 148 ESCRREA 154
             C+ +A
Sbjct: 310 NRCQEKA 316


>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
 gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
          Length = 448

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 4/102 (3%)

Query: 98  DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLR 157
           ++ +G  +  EKP A+V+     +  C++C  ++   + C+ C    YC+ +C+++AW  
Sbjct: 13  EIKRGTRILTEKPFAYVLKSKYRLERCDNCLEAVKV-LKCSNCRYVSYCNRACQQQAW-P 70

Query: 158 YHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYH 197
            H+ EC    R L  +    A +  +LIL   H    +  Y+
Sbjct: 71  LHKHECPFLKRALPRIVPDAARMLCRLILRLEHGGDLERGYY 112


>gi|170034243|ref|XP_001844984.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167875496|gb|EDS38879.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 556

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/119 (26%), Positives = 57/119 (47%), Gaps = 5/119 (4%)

Query: 48  QVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHK-GDTLF 106
           Q++   S A++  A LP +    +  +   +  + M+     GR +   +++ + GD L 
Sbjct: 172 QIAEGKSFANAPFARLP-MRLEPNPRIPFMAKGLAMKNDPDYGRCLFPDDEIFRPGDVLM 230

Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWEC 163
            E+ +  +  P   + NCN+C T + A   +PC  C   +YCSE C  +     HR+EC
Sbjct: 231 EEEALVAMNDPKFRLVNCNYC-TEMKAFQLVPCPGCCSVMYCSEECLEKDGQGGHRFEC 288


>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 1-like [Pongo abelii]
          Length = 490

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|406866444|gb|EKD19484.1| MYND domain protein [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 754

 Score = 47.0 bits (110), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVI-LPPCSMSNCNHCCTSIS------APIPCNECILAVY 145
           + A   + KG+ + ++K +  V  +P  S  +C+ C  S++       PIPC+ C  A+Y
Sbjct: 407 IFATRLIEKGELVILDKSITGVSNVPSSSFRHCDACHGSLANQYLRQPPIPCSSCRKAIY 466

Query: 146 CSESCRREAWLRYHRWEC 163
           CS  C + A   YH+  C
Sbjct: 467 CSPECYKTATEGYHKVLC 484


>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
          Length = 490

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
 gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
           nagariensis]
          Length = 265

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 4/128 (3%)

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
            KGRH++A    + G  +  + P   V+    +   C+ C      P+ C    LA YC 
Sbjct: 28  GKGRHLVATRWFNPGAVVLQQDPYVSVLSDERTPGFCDFCFRPCERPLRCTRSKLARYCC 87

Query: 148 ESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           +  +R AW+  ++ EC   +R    V      LA +L+   +       ++ LE H  ++
Sbjct: 88  KEHQRLAWVAGYKMECEALVRCAPRVPPPTVRLAARLLWRRARCGGINGLWRLEHHWDEL 147

Query: 206 --RPEDLY 211
             R + LY
Sbjct: 148 DDRRKQLY 155


>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
          Length = 497

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
 gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
          Length = 477

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 7/122 (5%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           S+ +++  +  KGR +   + +  G ++  E P   V+      S C+ C    S    C
Sbjct: 8   SDNVEIFETYEKGRGIRCKKQLAIGTSVGKENPFCHVVSQDMLSSYCHSCLLMQSELYKC 67

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSHSDR 191
           + C + +YC +SC++E W ++H++EC    RL   V       +  +A  ++     +D+
Sbjct: 68  SRCKIIMYCCKSCQKEDW-QWHKYECKSITRLGPKVPPDSIRLLGRVAYTILQGQDRADQ 126

Query: 192 YK 193
           +K
Sbjct: 127 FK 128


>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
           aries]
          Length = 391

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           +K++  ++  +G  + A+  +  G+ LF   P+A+ +        C+ C       + C+
Sbjct: 4   SKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63

Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
           +C +A YCS  C+++AW + H+ EC             ++RLL       +  KL+    
Sbjct: 64  QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117

Query: 187 SHSDRYKEVYHLETHLQDM 205
           S S++    Y LE+++  +
Sbjct: 118 SESEKLYSFYDLESNINKL 136


>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
           mellifera]
          Length = 440

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 14/123 (11%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + KG TLF  KP A+V+        C++C  S      C+ C    YC++SC++ +W   
Sbjct: 8   IKKGTTLFTAKPFAYVLYSKYRNERCDYCFKS-GKLFRCSVCKCIYYCNQSCQQMSW-TI 65

Query: 159 HRWECHGALRLLEAV--GIAHLALKLIL---------VSSHSD-RYKEVYHLETHLQDMR 206
           H  EC    R    V   +A L  ++I+         +  +S  +Y++   L +H  D++
Sbjct: 66  HSKECASLKRFSSKVIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHYSDIK 125

Query: 207 PED 209
            ++
Sbjct: 126 KDE 128


>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
           Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
           Methionine
          Length = 429

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 22/148 (14%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C         C++C 
Sbjct: 9   KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLXRCSQCR 67

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW   H+ EC             ++RLL       +  KL     S S
Sbjct: 68  VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLXDGAPSES 121

Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
           ++    Y LE+++  +   R E L Q V
Sbjct: 122 EKLYSFYDLESNINKLTEDRKEGLRQLV 149


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KG  + +   V  G+ L+  +P A+V+        C  C         C++C +A Y
Sbjct: 11  SPGKGSGLRSRRQVRPGELLYRAEPFAYVVTKEQLGGVCEQCLQRNEHLHRCSQCKVAKY 70

Query: 146 CSESCRREAWLRYHRWEC 163
           C +SC++EAWL  H+ EC
Sbjct: 71  CGKSCQKEAWLD-HKREC 87


>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
 gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
          Length = 457

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 3/103 (2%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP A+V+     +  C++C  ++     C+ C    YC+ +C+++AW   
Sbjct: 19  IKRGTRILSEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAW-PL 77

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    R+   +    A +  +LIL   H    +  Y+ E
Sbjct: 78  HKHECPFLKRVQPRIVPDAARMLCRLILRLEHGGDLERGYYTE 120


>gi|384485789|gb|EIE77969.1| hypothetical protein RO3G_02673 [Rhizopus delemar RA 99-880]
          Length = 459

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++   D+   + ++VEKP+   + P    S+CN C   ++   I C +C +  +CS+ 
Sbjct: 189 RGLVVNRDIAADEIIYVEKPIVSGLFPGLEGSHCNLCLKRLNDVKIECPDCDVVAFCSDE 248

Query: 150 CRREAWLRYHRWEC 163
           C   A   YH++ C
Sbjct: 249 CLNHAKEEYHQYIC 262


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 68/150 (45%), Gaps = 21/150 (14%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   +  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFATADRGNGLRAVTPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVS-S 187
           C +A YCS  C+++AW   H+ EC             ++RLL       +  KL+  S S
Sbjct: 65  CRVAKYCSSKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEGSPS 118

Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
            S++    Y LE+++  +   + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
           gorilla gorilla]
          Length = 459

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 40/86 (46%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  +GR V    D++ GD        A V+LP    S+C+ C  +      C  C  A Y
Sbjct: 16  SGDRGRRVETQIDLNAGDLALRASASAAVLLPELWGSHCHKCFATGVRLSRCGRCHTAFY 75

Query: 146 CSESCRREAWLRYHRWECHGALRLLE 171
           CS++C++  W   HR EC    +L +
Sbjct: 76  CSKACQQADWAPDHRRECKSMAQLAQ 101


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 67/150 (44%), Gaps = 21/150 (14%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   +  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFATAKRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S
Sbjct: 65  CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPS 118

Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
            S++    Y LE+++  +   + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 64/136 (47%), Gaps = 19/136 (13%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K  T+N +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++C 
Sbjct: 8   KFATAN-RGNGLRAVVALRPGELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLMRCSQCQ 66

Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
           +A YCS  C+++AW + H+ EC             ++RLL       +  KL+    S S
Sbjct: 67  VAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVIFKLMEETPSES 120

Query: 190 DRYKEVYHLETHLQDM 205
           ++    Y LE++++ +
Sbjct: 121 EKLYSFYDLESNIKKL 136


>gi|301603799|ref|XP_002931531.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
           (Silurana) tropicalis]
          Length = 176

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAV 144
           S  KG  V A++D+  G T+ + +P  + +   C +   C+HC       + C++C +  
Sbjct: 11  SPGKGNGVRALKDMSHGLTVMIAEPYVYTV---CRIKTACDHCLHRKEKLLRCSQCKVTR 67

Query: 145 YCSESCRREAWLRYHRWEC 163
           YC+  C+R+AW + H+ EC
Sbjct: 68  YCNSHCQRKAW-QGHKREC 85


>gi|119113125|ref|XP_564259.2| AGAP011233-PA [Anopheles gambiae str. PEST]
 gi|116131639|gb|EAL41558.2| AGAP011233-PA [Anopheles gambiae str. PEST]
          Length = 393

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 35/56 (62%), Gaps = 2/56 (3%)

Query: 122 SNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
           + CN+C    +   IPC  C +A++CS +C  EA+ +YHR+EC G +R L  +G A
Sbjct: 230 ARCNYCHSRRLFTFIPCEGCTVAMFCSVACLTEAYRKYHRYEC-GLVRDLWKIGGA 284


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 1/91 (1%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
           M +S+ ++++   +KGR + A +    G  +F   P+  +    C  S C+HC      P
Sbjct: 1   MSSSSPVEIRPHPSKGRALHATKSFPPGAVIFSFTPLLLLPTVACLTSVCSHCLRP-GEP 59

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHG 165
             C+ C  A YC  +C+  AW   H  EC  
Sbjct: 60  RLCSRCHAAAYCDATCQAAAWKAVHSRECKA 90


>gi|326421764|gb|ADZ73910.1| SET and MYND domain containing protein 4 [Lipotes vexillifer]
          Length = 212

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 4/72 (5%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     AP+PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 19  CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78

Query: 184 LVSSHSDRYKEV 195
           L++    R++E 
Sbjct: 79  LLA----RFEEA 86


>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
          Length = 548

 Score = 47.0 bits (110), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 9/95 (9%)

Query: 84  QTSNAKGRHVIAVEDVHKGDTLF-VEKPVAFVILPPC--SMSNCNHCCTSISAPIP---- 136
           +  +  GR + A      GD +   E+P   V+LPP   ++  CNHC        P    
Sbjct: 13  KGGDKGGRSIHATRRFKPGDVIACFEEPA--VVLPPGHRALEYCNHCLKKQQPGGPKLRA 70

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           C  C    YC  +C+R  W   H+ EC    RL E
Sbjct: 71  CTGCKTVAYCGPACQRANWSLVHKLECKAIQRLHE 105


>gi|350582164|ref|XP_003481215.1| PREDICTED: SET and MYND domain-containing protein 1-like [Sus
           scrofa]
          Length = 220

 Score = 47.0 bits (110), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ TS  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
           mulatta]
          Length = 399

 Score = 47.0 bits (110), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
             A YC  +C+++AWL  H+ EC
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNEC 90


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 2/95 (2%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A+  + + +   KGR +IA      GD +  ++P A         S+C+HC TS +    
Sbjct: 13  AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           C+ C +  YCS +C++E W + H+ EC     L E
Sbjct: 72  CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTE 105


>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
          Length = 515

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 70/150 (46%), Gaps = 16/150 (10%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS-NCNHCCTS-ISAPI 135
           S K K++ S   GR+++A +D+  G+ +  ++P  FV+ P    S  C +C    I+  +
Sbjct: 2   SVKYKVEKSEKLGRYLVAAKDIKAGERILSDQP--FVLGPSSDTSLVCFNCYLPLINKFL 59

Query: 136 PCNECILAVYCSESCRREAWLRYH-RWECH-----------GALRLLEAVGIAHLALKLI 183
            C  C +A  C      + + +YH R EC              + +++ VG   +   L+
Sbjct: 60  VCKNCAVAPLCPGDGCPDEFTKYHNRQECDVFRNLKLTKGISPMTMVQNVGSLSVLRALL 119

Query: 184 LVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
              ++   +K    LETHL+  R  +++QY
Sbjct: 120 KKETNLLEWKLFMELETHLERRRESNVWQY 149


>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
          Length = 799

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 46/78 (58%), Gaps = 3/78 (3%)

Query: 80  KIKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIP 136
           +IK+Q  ++  KGR + ++ ++ +   +  E+P A +IL  C  ++C+ C   + A  +P
Sbjct: 224 QIKLQCVSTPTKGRGMASLSEISQSYLVHTEEPYAAIILKHCRDTHCHFCFNELPADSVP 283

Query: 137 CNECILAVYCSESCRREA 154
           C  C + +YCS+ C+ +A
Sbjct: 284 CTSCSIPLYCSQHCQMQA 301


>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
          Length = 383

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 32/111 (28%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC----------------T 129
           +G+ + A +++ KG+T+FVE+PV    F+     +   C+HC                 +
Sbjct: 1   QGKGLFATKNIRKGETVFVERPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLLGNS 60

Query: 130 SISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
           S+  P P            C  C + +YCS  CR+ AW +YHR  C G  R
Sbjct: 61  SLVLPHPEQCSIRKDLHQQCPRCQV-MYCSAECRQAAWEQYHRVLCLGPSR 110


>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 302

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI--SAPIPCN 138
           +K+  S+ +GRHV+A  D+ +G+ L  E+    +         C+ C   +    P+ C 
Sbjct: 7   LKIAQSDDRGRHVVATTDIARGELLLSEECTVAMTYRDLESPYCSQCVKPMVDETPVECE 66

Query: 139 ECILAVYCSESC 150
            C L  YCS+SC
Sbjct: 67  GCFLERYCSKSC 78


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score = 46.6 bits (109), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 70/151 (46%), Gaps = 21/151 (13%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           +K++  ++  +G  + A+  +  G+ LF   P+A+ +        C+ C       + C+
Sbjct: 4   SKVEKFSTADRGNGLRALARLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCS 63

Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVS 186
           +C +A YCS  C+++AW + H+ EC             ++RLL       +  KL+    
Sbjct: 64  QCRVAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVIFKLMEETP 117

Query: 187 SHSDRYKEVYHLETHLQDM---RPEDLYQYV 214
           S S++    Y LE+++  +   + E L Q V
Sbjct: 118 SESEKLYSFYDLESNMNKLTEEKKEGLRQLV 148


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 46.6 bits (109), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 66/156 (42%), Gaps = 28/156 (17%)

Query: 75  MCASNKIKMQTSNAK---GRHVIAVEDVHKGDTLF-VEKPVAFVILPPC--SMSNCNHCC 128
           M A++ I+++ +      GR + A      GD +   + P   V+LPP   ++  CNHC 
Sbjct: 1   MLANDGIEVRGTKGGAKGGRSIYATRRFKPGDVIARFDNPA--VVLPPGHRALEYCNHCV 58

Query: 129 TSISAPI-----PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH------ 177
                P       C  C    YC  +C+R  W   H+ EC    RL EA   AH      
Sbjct: 59  KK-QRPAGVKLRACTGCKTVAYCGPACQRANWSLVHKLECKAIQRLHEA-KPAHQPDWVP 116

Query: 178 ----LALKLILVSSHSDRYKEVYHLETHLQDMRPED 209
                A +++L      R++E   LE H++  R +D
Sbjct: 117 TPIRAAAQVMLRPQVLARFEE---LEGHVEQWRKKD 149


>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
           [Equus caballus]
          Length = 490

 Score = 46.6 bits (109), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  S  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFNSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|356507528|ref|XP_003522516.1| PREDICTED: uncharacterized protein LOC100794609 [Glycine max]
          Length = 784

 Score = 46.2 bits (108), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 3/77 (3%)

Query: 81  IKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPC 137
           IK+Q  ++  KGR +++   +  G  +  E+P A +IL  C  ++C++C   + A  +PC
Sbjct: 217 IKLQRVSTPDKGRGIVSSCVISPGFLVHAEEPYAMIILKQCRETHCHYCLNDLPADRVPC 276

Query: 138 NECILAVYCSESCRREA 154
             C + +YCS  CR  A
Sbjct: 277 ILCSIPLYCSHQCRIRA 293


>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
          Length = 490

 Score = 46.2 bits (108), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
 gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
          Length = 490

 Score = 46.2 bits (108), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
 gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
          Length = 404

 Score = 46.2 bits (108), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
           ++ T+  A + +++     +GR ++A   V +G+ LF E P   V+L P ++S+    C+
Sbjct: 6   QTPTMAAAGDVLRVADLPGRGRGLVAARIVREGEVLFSEPP---VLLYPSTLSSLRSYCS 62

Query: 126 HCCTSIS--APIPCNECILAVYCSESC 150
            C  S++  A IPC  C  A +CS +C
Sbjct: 63  ACFRSLTPAATIPCVSCRAAAFCSPAC 89


>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
          Length = 490

 Score = 46.2 bits (108), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T+  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
          Length = 432

 Score = 46.2 bits (108), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++  H GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFHVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
 gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
          Length = 380

 Score = 46.2 bits (108), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 12/132 (9%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KG  + A+ ++  G+ ++  +P AF +      + C  C     +   C++C  A Y
Sbjct: 11  SEGKGNGLRALREIKPGEVIYSCEPFAFCVARDFLKTACQSCLKRGESLSRCSQCKTARY 70

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSHSDRYKEVYHL- 198
           C+  C+++AW   H+ EC     L   +       +A +  KL L  S SD+ +E+Y + 
Sbjct: 71  CNVQCQKQAWPD-HKRECKCLKHLQPRIPTDSVRLVARIIFKL-LSQSESDQ-EELYSIA 127

Query: 199 --ETHLQDMRPE 208
             ++HL DM  E
Sbjct: 128 EHQSHLADMSEE 139


>gi|327241842|gb|AEA40702.1| SET and MYND domain containing protein 4, partial [Sus scrofa]
          Length = 176

 Score = 46.2 bits (108), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 35/67 (52%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C  +  A +PC+ C  A YCS+ C ++AW  YH  EC     LL      H+AL+  
Sbjct: 16  CHRCLKTTLATVPCDGCSYAKYCSQECLQQAWDLYHSVECSLGGLLLTLGVFCHVALRST 75

Query: 184 LVSSHSD 190
           L++   D
Sbjct: 76  LLARFED 82


>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 697

 Score = 46.2 bits (108), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 6/70 (8%)

Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVG 174
           LPP    +C +C      P PC  C  A YCS +CR+      HR+EC G  + L   +G
Sbjct: 213 LPP---FDCQYCGEINVIPFPCITCGRACYCSVACRKNH--SSHRFECAGYGKHLWYLIG 267

Query: 175 IAHLALKLIL 184
           IAHL ++  L
Sbjct: 268 IAHLGIRSFL 277


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score = 46.2 bits (108), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K +T++ +G  + A   +  GD LF   P+A+ +      + C+ C       + C++C 
Sbjct: 8   KFRTAD-RGNGLRASAALRPGDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLLRCSQCR 66

Query: 142 LAVYCSESCRREAWLRYHRWEC 163
           +A YCS  C+++AW   H+ EC
Sbjct: 67  VAKYCSAKCQKKAWPD-HKREC 87


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score = 45.8 bits (107), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 18/138 (13%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  +S ++G  + A   +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFSSASRGNGLRASAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C +A YCS  C+++AW   H+ EC             ++RLL       +  KL+    S
Sbjct: 65  CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPS 118

Query: 188 HSDRYKEVYHLETHLQDM 205
            S++    Y LE+++  +
Sbjct: 119 ESEKLYSFYDLESNINKL 136


>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
           distachyon]
          Length = 791

 Score = 45.8 bits (107), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 54  SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
           SS+    A LP  A      L CAS      T N KGR ++++  +     + VE P+A 
Sbjct: 199 SSSDCQDAGLPHTAEPHKVVLECAS------TPN-KGRGMMSLNVISPASLIHVEDPLAA 251

Query: 114 VILPPCSMSNCNHCCTSISAPIP-CNECILAVYCSESCRREA 154
           +I+  C  ++C++C +   A +  C  C + VYCS+ C+ +A
Sbjct: 252 IIIKSCRETHCHYCFSEAPADVVFCPLCTIPVYCSKECQEQA 293


>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 552

 Score = 45.8 bits (107), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 61/146 (41%), Gaps = 24/146 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---- 136
           + ++TS+  GR + A E +  G TL   KP   V+    S  N    C++ ++P P    
Sbjct: 25  LDIRTSSTDGRGIWAREALSAGATLMALKPHVHVL----STRNLELYCSACTSPAPVTGL 80

Query: 137 --CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG-------------IAHLALK 181
             C  C +  YCS +C+   W   H+ EC    R                   +  L  +
Sbjct: 81  KRCTRCRVIWYCSTACQNNDWT-MHKPECAAIQRWAANAPSPDVSIPAEPIRCLGRLLWR 139

Query: 182 LILVSSHSDRYKEVYHLETHLQDMRP 207
              + S S   KEV  L++H +D++P
Sbjct: 140 KQKLRSSSTWAKEVDGLQSHRKDLQP 165


>gi|194742499|ref|XP_001953740.1| GF17913 [Drosophila ananassae]
 gi|190626777|gb|EDV42301.1| GF17913 [Drosophila ananassae]
          Length = 675

 Score = 45.8 bits (107), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 7/98 (7%)

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
           +K   V A  D+ KG+ +  E P  +    P   +NC+ C         C  C    YCS
Sbjct: 223 SKNAGVFATSDIDKGEIVLAENPTYYQFGAP--FNNCDLCGVPQELIYTCAGCRYKTYCS 280

Query: 148 ESCRREAWLRYHRWECH----GALRLLEAVGIAHLALK 181
           ESC ++     H++EC     G + ++E+  +  L LK
Sbjct: 281 ESCMKDDE-EIHQFECSGFKVGIIPMMESTTLFRLFLK 317


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score = 45.8 bits (107), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 18/138 (13%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   +  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFATAHRGNGLRAVVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C +A YCS  C+++AW   H+ EC             ++RLL       +  KL+  + S
Sbjct: 65  CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEEIPS 118

Query: 188 HSDRYKEVYHLETHLQDM 205
            S++    Y LE+++  +
Sbjct: 119 ESEKLYTFYDLESNINKL 136


>gi|307179276|gb|EFN67662.1| hypothetical protein EAG_07125 [Camponotus floridanus]
          Length = 275

 Score = 45.8 bits (107), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 37/72 (51%)

Query: 60  VASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
           +A++P +       L   S+ ++++   A+GRH++A  D+  G  L V++P +F    P 
Sbjct: 203 IAAMPVVDGKPHDCLASCSDSVELRLDEARGRHLVAARDIRPGAVLIVDQPFSFSTDGPA 262

Query: 120 SMSNCNHCCTSI 131
              NC + C  I
Sbjct: 263 LSRNCLYTCVYI 274


>gi|194900528|ref|XP_001979809.1| GG16796 [Drosophila erecta]
 gi|190651512|gb|EDV48767.1| GG16796 [Drosophila erecta]
          Length = 690

 Score = 45.8 bits (107), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
           V A  D+ KG+ + VE PV F    P    NC+ C         C+ C    YCS+SC  
Sbjct: 241 VFASCDIPKGEIVLVENPVHFQFSAP--FLNCDLCGVHQEQLYTCDNCRFRTYCSKSC-M 297

Query: 153 EAWLRYHRWECHG 165
           E+    H++EC+G
Sbjct: 298 ESDAEVHQYECYG 310


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score = 45.4 bits (106), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
          Length = 452

 Score = 45.4 bits (106), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 7/113 (6%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           K+   + K R+    E + KG T+    P  + +        C+ C  + S    C  C+
Sbjct: 41  KLLRGSLKMRNTGKHEPIPKGTTILESVPFVYCLKSSFRRELCDFCLKANSNLRKCLGCM 100

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSH 188
           +  YC   C+RE W + H+ EC   +R+   V       IA L LKL +++ +
Sbjct: 101 VVSYCGRVCQREGW-KDHKGECKNFVRVKPNVPTDSVRLIARLILKLQVINGY 152


>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
 gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=HSKM-B; AltName: Full=Histone methyltransferase
           SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
           AltName: Full=SET and MYND domain-containing protein 2
 gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
 gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
 gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
          Length = 433

 Score = 45.4 bits (106), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
 gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
 gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
          Length = 433

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|327241848|gb|AEA40705.1| SET and MYND domain containing protein 4, partial [Manis
           pentadactyla]
          Length = 176

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 36/75 (48%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           C+ C     A +PC+ C  A YCS  C+++AW  YH  EC     LL      H AL+  
Sbjct: 16  CHRCLKHTLATVPCDGCSHAKYCSPECKQQAWNLYHNIECSLGGLLLTLGVFCHTALRTT 75

Query: 184 LVSSHSDRYKEVYHL 198
           L++   D  K +  L
Sbjct: 76  LLARFEDVGKVIRKL 90


>gi|170040664|ref|XP_001848112.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864251|gb|EDS27634.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 558

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 2/84 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
           I M+     GR ++A  +   GD +  EK +  V+       NC+HC  S   + IPC  
Sbjct: 178 IGMKQYPGSGRGLVAERNFKTGDVILDEKTILSVVSVANRYLNCSHCGISNQHSLIPCPN 237

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C+  +YCS  C  E     HR+EC
Sbjct: 238 CVHCMYCSGECLAEDK-PLHRFEC 260


>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
          Length = 404

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 9/87 (10%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
           ++ T+  A + +++     +GR ++A  +V +G+ L  E P   V+L P ++S+    C+
Sbjct: 6   QTPTMAAAGDVLRVADLPGRGRGLVAARNVREGEVLLSEPP---VLLYPSTLSSLRSYCS 62

Query: 126 HCCTSI--SAPIPCNECILAVYCSESC 150
            C  S+  +A IPC  C  A +CS +C
Sbjct: 63  ACFRSLPPAATIPCVSCRAAAFCSPAC 89


>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
          Length = 433

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
           troglodytes]
          Length = 452

 Score = 45.4 bits (106), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  +  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFITEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
           gallopavo]
          Length = 427

 Score = 45.4 bits (106), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KG  + +   V  G+ L+  +P A+V+        C  C         C++C +A Y
Sbjct: 11  SPGKGSGLRSRRRVRPGELLYRAEPFAYVVTKEQLGGVCEQCFQRNEHLHRCSQCKVAKY 70

Query: 146 CSESCRREAWLRYHRWEC 163
           C +SC++E WL  H+ EC
Sbjct: 71  CGKSCQKEGWLD-HKREC 87


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score = 45.4 bits (106), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 33/141 (23%), Positives = 65/141 (46%), Gaps = 18/141 (12%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  ++  +G  + A+  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C +A YCS  C+++AW   H+ EC             ++RLL       + +KL+    S
Sbjct: 65  CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVVKLMEETPS 118

Query: 188 HSDRYKEVYHLETHLQDMRPE 208
            S++    Y LE+++  +  E
Sbjct: 119 ESEKLYSFYDLESNISKLTEE 139


>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
          Length = 433

 Score = 45.1 bits (105), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 40/139 (28%), Positives = 56/139 (40%), Gaps = 12/139 (8%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
           +S  KGR + A+     GD LF     A+V+      ++C HC         C  C  A 
Sbjct: 13  SSPGKGRGLRALRPFQVGDLLFSCPAYAYVLTVNERGNHCEHCFARKEGLSKCGRCKQAF 72

Query: 145 YCSESCRREAWLRYHRWECHGALRLLE----AVGIAHLALKLILVSSHSDRYKE-----V 195
           YC   C++E W   H+ EC   +   E    +  +   A  L   +SH DR        V
Sbjct: 73  YCDVECQKEDW-PMHKLECSPMVVFGENWNPSETVRLTARILAKQTSHPDRTPSEKLLAV 131

Query: 196 YHLETHLQ--DMRPEDLYQ 212
              E+HL   D   +DL Q
Sbjct: 132 KEFESHLDKLDNEKKDLIQ 150


>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
          Length = 490

 Score = 45.1 bits (105), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 1/88 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++  +  KGR + A ++    D +F E+  + V+        C+ C         C +C
Sbjct: 9   VEVFIAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
             A YC  +C+++AWL  H+ EC    R
Sbjct: 69  KFAHYCDRTCQKDAWLN-HKNECSAIKR 95


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score = 45.1 bits (105), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
           griseus]
          Length = 428

 Score = 45.1 bits (105), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 16/137 (11%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   +  +G  + AV  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFATADRGNGLRAVVPLRPGELLFRCDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
           C +A YCS  C+++AW   H+ EC             ++RLL  V +  +  K     S 
Sbjct: 65  CRIAKYCSAKCQKKAWPD-HKQECKCLKSCKPRYPPDSVRLLARVIVKLMDEK----PSE 119

Query: 189 SDRYKEVYHLETHLQDM 205
           S++    Y LE+++  +
Sbjct: 120 SEKLYSFYDLESNINKL 136


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score = 45.1 bits (105), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 6/83 (7%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS--MSNCNHCCTSISAPIPCNECILAVYC 146
           KGR +    +   G+ +  ++P  +V +P  S   S C+ C  SI+    C+ C +A YC
Sbjct: 21  KGRSLFTTRNFRPGEEIISQEP--YVCVPNNSPVESRCDRCFASINLK-KCSACKVAWYC 77

Query: 147 SESCRREAWLRYHRWECHGALRL 169
           S  C++  W + HR EC    RL
Sbjct: 78  SSVCQKSEW-KLHRLECDALARL 99


>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
          Length = 619

 Score = 45.1 bits (105), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 9/119 (7%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
           GRH +A   +  G+ + VE+P    +L    +++C+ C   I  P P  C       YCS
Sbjct: 193 GRHAVATRKIIPGEIVIVERPHCAFLLAETRLTHCHLCFVKIFVPTPAACRTYSCVAYCS 252

Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
             C R+A  + H  EC    AL   +A     LAL+ I        ++EV  L+   +D
Sbjct: 253 RRC-RDADAQVHSQECKLLPALWYSKASVTCFLALRAITQRP----FEEVMRLKEQFRD 306


>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
 gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 386

 Score = 45.1 bits (105), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 4/95 (4%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-- 136
           N +++++S  +GR++IA  D+  G+ L   K    V       ++C +C   + + I   
Sbjct: 6   NGLELKSSENEGRYLIATRDIQIGEDLLKCKSYFAVTSETLKTTSCFNCIKQLPSVIKLS 65

Query: 137 --CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
             CN+C    YC+E C+ E   ++  +EC    +L
Sbjct: 66  LKCNQCNEIWYCNEQCKNENINKHQHYECKFYKKL 100


>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
          Length = 433

 Score = 45.1 bits (105), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A V+      S+C HC         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFACPAYASVLTVGERGSHCEHCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
 gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
          Length = 459

 Score = 45.1 bits (105), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           I      +K +  I    + +G  +  EKP AFV+     +  C++C  +    + C+ C
Sbjct: 8   IAKANGASKSKSNIPNPQIKRGQRILTEKPFAFVLKSKYRLERCDNCLEATKV-LKCSNC 66

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
               YC+ +C+ +AW + H+ EC
Sbjct: 67  RYVSYCNRACQSQAWAQ-HKHEC 88


>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
 gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
          Length = 404

 Score = 44.7 bits (104), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 47/87 (54%), Gaps = 9/87 (10%)

Query: 70  ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
           ++ T   A + +++     +GR ++A  +V +G+ L  E P   V+L P ++S+    C+
Sbjct: 6   QTPTTAAAGDALRVADLPGRGRGLVAARNVREGEVLLSEPP---VLLYPSTLSSLASYCS 62

Query: 126 HCCTSI--SAPIPCNECILAVYCSESC 150
            C  S+  +A +PC  C  A +CS +C
Sbjct: 63  ACFRSLPAAATVPCASCRAAAFCSPAC 89


>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
          Length = 433

 Score = 44.7 bits (104), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF  +  A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCRAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
          Length = 542

 Score = 44.7 bits (104), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 9/140 (6%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A + + +Q    KGR +   + V  GD + V  P+  +       + C+ C  +   P P
Sbjct: 5   APDGVAIQPHKTKGRALHTTKTVAAGDVIAVFTPLLLLPSLSHLTTVCSFCLRA-GTPRP 63

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA------VGIAHLALKLILVS--SH 188
           C+ C  A YC   C+  AW   H  EC   +R +++      +     AL  +L+S    
Sbjct: 64  CSRCRAAYYCDARCQAAAWSGGHSLECAALVRAVKSSKKRREIPTPVRALVKVLLSCGQP 123

Query: 189 SDRYKEVYHLETHLQDMRPE 208
            D  K +  LE H+ + R E
Sbjct: 124 EDLSKNMDGLEGHVAERRRE 143


>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
 gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
          Length = 508

 Score = 44.7 bits (104), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 18/85 (21%)

Query: 84  QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP---CSMSNCNHCCT----------- 129
           Q  N KGR V A++   KGD +F EK +  VI+PP     + +   CCT           
Sbjct: 119 QDKNGKGRGVYALKHFKKGDMIFEEK-LPIVIIPPMEKLELMSKGKCCTLCGKSLYELST 177

Query: 130 --SISAPIPCNECILAVYCSESCRR 152
              +   + CN+C  AV+CS++C++
Sbjct: 178 HYIMMNGLDCNDCT-AVWCSKNCKK 201


>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
          Length = 429

 Score = 44.7 bits (104), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 7/135 (5%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++      KG  + A + V  G+ L+  +P+A  +   C    C+ C +     + C++
Sbjct: 4   KLERFVCPGKGNGLRANQRVLAGELLYTAEPLACCVSSQCRNVVCHSCFSRHETLLRCSQ 63

Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI------AHLALKLILVSSHSDRYK 193
           C +A YC  +C++ AW   H+ EC   L L   +        A +  +L+  S  + +  
Sbjct: 64  CKVARYCDVTCQKRAW-SIHKRECKCLLSLHPRIPTDSVRLAARIIFRLLCPSQMTPQLY 122

Query: 194 EVYHLETHLQDMRPE 208
                E+HL DM  E
Sbjct: 123 SFEEHESHLCDMGEE 137


>gi|414870818|tpg|DAA49375.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
          Length = 437

 Score = 44.7 bits (104), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 69  GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC 128
           G+ H ++  S    + T N KGR +I+ +D+     + VE P+A +I+     ++C+ C 
Sbjct: 212 GQPHKIVIES----ISTPN-KGRGMISTDDIPPASLIHVEDPLAAIIMKSSRETHCHFCF 266

Query: 129 TSISAPIP-CNECILAVYCSESCRREA 154
           +   A +  C  C + +YCS+ C+ ++
Sbjct: 267 SETPADVVFCPSCTIPIYCSKRCQEQS 293


>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3 [Equus caballus]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++   +  KG  + A+  +  G+ LF   P+A+ +        C+ C       + C++
Sbjct: 5   KVEKFITADKGNGLRALVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64

Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
           C +A YCS  C+++AW + H+ EC
Sbjct: 65  CRVAKYCSAKCQKKAW-QDHKREC 87


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score = 44.7 bits (104), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C T       C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
           boliviensis]
          Length = 433

 Score = 44.7 bits (104), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+E    GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALEPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
          Length = 433

 Score = 44.3 bits (103), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+E    GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALEPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
          Length = 559

 Score = 44.3 bits (103), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 10/99 (10%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--PCSMSNCNHCCTSIS----API--- 135
           +S   GR ++A +D   G +L        + LP  P   + CN+C    S    +P+   
Sbjct: 40  SSGPHGRSLVATQDFTPG-SLIATFSSPTLALPDGPSMRTTCNYCLRVGSNEGFSPVSLK 98

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
            C  C  AVYC  +C+R  W   H+ EC    R+ E  G
Sbjct: 99  ACTGCKAAVYCGPTCQRAHWKSIHKAECKMFARVRETTG 137


>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
 gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
          Length = 442

 Score = 44.3 bits (103), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 4/104 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           VH+G T++ EKP  +V+        C+ C       + C+ C    YC + C+++ W   
Sbjct: 7   VHEGCTIYKEKPFVYVLSSKLRTEYCDFCLKK-GQFMKCSGCHYVYYCGKVCQKDGW-SV 64

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLET 200
           H+ EC G  R+   +    A    +LI +        + Y+LE 
Sbjct: 65  HKSECRGLKRVAPRILPDAARFIARLIHILRKGGDLVKSYYLEN 108


>gi|146091661|ref|XP_001470086.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134084880|emb|CAM69278.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 712

 Score = 44.3 bits (103), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
           P+PC+    A+YCSESC+ +AW+ +H  EC    R  + V + H  L   + +
Sbjct: 426 PVPCDSNCSALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTT 478


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score = 44.3 bits (103), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 2/81 (2%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGR +    D   G+ +  +KP   V     S S C+ C  + +    C+ C +  YC  
Sbjct: 21  KGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCFKTNNLK-KCSACQVVWYCGS 79

Query: 149 SCRREAWLRYHRWECHGALRL 169
           SC++  W + HR EC    RL
Sbjct: 80  SCQKSEW-KLHRDECKALTRL 99


>gi|154340255|ref|XP_001566084.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063403|emb|CAM39582.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 713

 Score = 44.3 bits (103), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 27/44 (61%)

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH 177
           P+PC+    A+YCSESC+ +AW+ +H  EC    R  + V + H
Sbjct: 427 PVPCDSSCNALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVH 470


>gi|398017931|ref|XP_003862152.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322500381|emb|CBZ35458.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 712

 Score = 44.3 bits (103), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
           P+PC+    A+YCSESC+ +AW+ +H  EC    R  + V + H  L   + +
Sbjct: 426 PVPCDSNCSALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTT 478


>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 823

 Score = 44.3 bits (103), Expect = 0.033,   Method: Composition-based stats.
 Identities = 21/60 (35%), Positives = 36/60 (60%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           AS+ + ++ S   GR V+A ED+  G+ +  E+  A V+L P  +++C+HC T  +  IP
Sbjct: 256 ASSAVAIRRSPEAGRFVVAAEDIKLGEPVTREEAYAAVLLRPYELTHCHHCFTQEAVLIP 315


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 44.3 bits (103), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 12/101 (11%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC------TSIS 132
           +  K+   +  GR VIA  D+  G+ +  E P   V+   C+   C +CC        I 
Sbjct: 72  DTFKVTEDSVSGRCVIASRDLKAGELVLREPPFVKVVRRDCASRQCAYCCQQVTERGKIE 131

Query: 133 APIPCNECILAVYCSESCR-REAWLRYHRWECHGALRLLEA 172
           A +P      AVYCS +C+ RE  LR       G L  + A
Sbjct: 132 ADVP-----FAVYCSRACQAREDALRAAEASALGKLAGISA 167


>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 471

 Score = 44.3 bits (103), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 54/131 (41%), Gaps = 9/131 (6%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
              KGR + A +++  G+ +F E   + V+    +   C+ C    +    C +C  A Y
Sbjct: 14  GGIKGRGLRATKEISTGEVVFAEPSYSAVVFDSFASQVCHSCFRHQAQLHRCGQCKFAHY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD-------RYKEVYHL 198
           C+ +C+   W   H+ EC    +L    G        ++   H D       +   V  L
Sbjct: 74  CNRTCQTACW-EEHKQECQAIRKLGTVPGDKVRLAARVMWRIHKDTGVASDSQLLSVEEL 132

Query: 199 ETHLQDMRPED 209
           E H+ D+ PED
Sbjct: 133 EDHVADL-PED 142


>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
          Length = 554

 Score = 44.3 bits (103), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 4/90 (4%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP- 136
           S+K +M+    K R V A +    GDTLF  +P   V+     ++ C HC    S     
Sbjct: 15  SDKFEMRMHPIKKRSVHARQAFAAGDTLFETRPPFAVVGNSDILTRCFHCLKPTSPEREL 74

Query: 137 --CNECILAVYCSESCRREAWL-RYHRWEC 163
             C+ C LA +C  +C +  W    H++EC
Sbjct: 75  KRCSSCQLARFCGPACVKAGWTPEGHKYEC 104


>gi|347830753|emb|CCD46450.1| hypothetical protein [Botryotinia fuckeliana]
          Length = 240

 Score = 44.3 bits (103), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 21/111 (18%)

Query: 87  NAKGRHVIAVEDVHKGDTLF-VEKPV-AFVILPPCSMSNCNHCCT--------------- 129
           + K   +   + V  GD L  VEKP+ A V   P   + C++C                 
Sbjct: 51  DPKNTGLFVDKAVSAGDILLKVEKPLFAIVEETPMRETTCDNCFAYQGMEELLGTVSTAK 110

Query: 130 -SISAPIP---CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
             I   IP   C  C +  YC++ CR EAW  +H++EC    ++L  V +A
Sbjct: 111 KDIETAIPFNKCGSCRVFYYCNKKCREEAWEHHHKYECLILAKILATVNLA 161


>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
 gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
          Length = 468

 Score = 44.3 bits (103), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 27/103 (26%), Positives = 48/103 (46%), Gaps = 4/103 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP AFV+     +  C++C  +    + C+ C    YC+ +C+ +AW + 
Sbjct: 35  IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCNRACQMQAWAQ- 92

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    ++   V    A +  +LIL   H       Y+ E
Sbjct: 93  HKHECPFLKKVHPRVVPDAARMLCRLILRLEHGGDLIRGYYTE 135


>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
 gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 413

 Score = 44.3 bits (103), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---- 136
           +K+  S  +GR++IA  D+  G+++   K    V       ++C +C   I +P P    
Sbjct: 8   LKLSNSELEGRYIIANRDIDIGESILKCKSYFAVTCEDFKKNSCYNCIKLIKSPSPQQVP 67

Query: 137 -CNECILAVYCSESCRREAWLRYHRWEC 163
            C  C    YCSE C+++   ++  +EC
Sbjct: 68  RCFGCNEVWYCSEKCKQDNQAKHQHYEC 95


>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
 gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
          Length = 420

 Score = 44.3 bits (103), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           V +G  +  EKP AFV+     +  C++C  +      C+ C    YC+ SC+ +AW + 
Sbjct: 6   VKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKVR-KCSNCRYVSYCNRSCQTQAWAQ- 63

Query: 159 HRWEC 163
           H+ EC
Sbjct: 64  HKHEC 68


>gi|28317148|gb|AAO39589.1| LD11102p [Drosophila melanogaster]
          Length = 687

 Score = 43.9 bits (102), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 3/77 (3%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           K   V A  DV KG+ + VE PV F    P    NC  C         C+ C    YCS+
Sbjct: 238 KSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYRSYCSK 295

Query: 149 SCRREAWLRYHRWECHG 165
           SC + +    H++EC+G
Sbjct: 296 SCMK-SDAEVHQYECYG 311


>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
 gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
          Length = 469

 Score = 43.9 bits (102), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 4/103 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP AFV+     +  C++C  +    + C+ C    YC  SC+ +AW + 
Sbjct: 35  IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWAQ- 92

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    ++   V    A +  +LIL   H       Y+ E
Sbjct: 93  HKHECPFLKKVHPRVVPDAARMLCRLILRLEHGGDLIRGYYTE 135


>gi|157871558|ref|XP_001684328.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68127397|emb|CAJ05004.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 712

 Score = 43.9 bits (102), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 19/61 (31%), Positives = 32/61 (52%)

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYK 193
           P+PC+    A+YCSE C+ +AW+ +H  EC    R  + V + H  L   + +  +  + 
Sbjct: 426 PVPCDSNCSALYCSEGCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTTRSARSFV 485

Query: 194 E 194
           E
Sbjct: 486 E 486


>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 551

 Score = 43.9 bits (102), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 12/107 (11%)

Query: 80  KIKMQTSNAKG-RHVIAVEDVHKGDTL-FVEKPVAFVILPPC--SMSNCNHCC-----TS 130
           +++     AKG R + A      GD +   ++P   V+LPP   ++  CNHC        
Sbjct: 9   EVRGSKGGAKGGRSIHATRRFKPGDVIALFDEPA--VVLPPGHRALEYCNHCLKRQGHAG 66

Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH 177
           +     C  C    YC  +C++  W   H+ EC  A++ L  V  AH
Sbjct: 67  VGKLRACTGCKTVAYCGAACQKANWTLIHKVECK-AIQFLHEVKPAH 112


>gi|328708541|ref|XP_003243720.1| PREDICTED: hypothetical protein LOC100572177 [Acyrthosiphon pisum]
          Length = 125

 Score = 43.9 bits (102), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 5/44 (11%)

Query: 125 NHC----CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
           NHC    C +  A I CN+C +A YCSE CR+  W   H +ECH
Sbjct: 77  NHCHRENCNNFGATI-CNKCKMARYCSEICRKRHWWSKHSFECH 119


>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
 gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
          Length = 466

 Score = 43.9 bits (102), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP AFV+     +  C++C  +    + C+ C    YC  SC+ +AW + 
Sbjct: 33  IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWAQ- 90

Query: 159 HRWEC 163
           H+ EC
Sbjct: 91  HKHEC 95


>gi|201065683|gb|ACH92251.1| FI04131p [Drosophila melanogaster]
          Length = 682

 Score = 43.9 bits (102), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 3/77 (3%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           K   V A  DV KG+ + VE PV F    P    NC  C         C+ C    YCS+
Sbjct: 238 KSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYRSYCSK 295

Query: 149 SCRREAWLRYHRWECHG 165
           SC + +    H++EC+G
Sbjct: 296 SCMK-SDAEVHQYECYG 311


>gi|386765940|ref|NP_650589.2| CG18213 [Drosophila melanogaster]
 gi|383292760|gb|AAF55379.4| CG18213 [Drosophila melanogaster]
          Length = 879

 Score = 43.9 bits (102), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 3/82 (3%)

Query: 84  QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILA 143
           + S  K   V A  DV KG+ + VE PV F    P    NC  C         C+ C   
Sbjct: 233 EKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYR 290

Query: 144 VYCSESCRREAWLRYHRWECHG 165
            YCS+SC + +    H++EC+G
Sbjct: 291 SYCSKSCMK-SDAEVHQYECYG 311


>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
 gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
 gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
          Length = 433

 Score = 43.9 bits (102), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C  C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVSERGNHCEFCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C+RE W   H+ EC
Sbjct: 74  CNVECQREDW-PMHKLEC 90


>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
 gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
          Length = 439

 Score = 43.9 bits (102), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           V +G  +  EKP AFV+     +  C++C  +      C+ C    YC+ SC+ +AW + 
Sbjct: 6   VKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKVR-KCSNCRYVSYCNRSCQTQAWAQ- 63

Query: 159 HRWEC 163
           H+ EC
Sbjct: 64  HKHEC 68


>gi|352962201|gb|AEQ62998.1| RE05389p1 [Drosophila melanogaster]
          Length = 879

 Score = 43.9 bits (102), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 3/82 (3%)

Query: 84  QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILA 143
           + S  K   V A  DV KG+ + VE PV F    P    NC  C         C+ C   
Sbjct: 233 EKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYR 290

Query: 144 VYCSESCRREAWLRYHRWECHG 165
            YCS+SC + +    H++EC+G
Sbjct: 291 SYCSKSCMK-SDAEVHQYECYG 311


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score = 43.9 bits (102), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 2/71 (2%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + KGD L   KP A+V+      ++C++C  S    + C+ C    YC  SC++E+W   
Sbjct: 3   IKKGDCLISSKPFAYVLSSKHKDNHCDYCFKS-GKLLKCSGCQYVYYCDRSCQKESW-SV 60

Query: 159 HRWECHGALRL 169
           H+ EC    R+
Sbjct: 61  HKSECINLKRI 71


>gi|170073283|ref|XP_001870345.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167869783|gb|EDS33166.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 266

 Score = 43.9 bits (102), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 125 NHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           +HC    S  + PC  C+  +YC E CR+++W  +H++EC  A +L
Sbjct: 106 DHCGVKFSNSLKPCPGCVFFMYCGEECRQKSWKLWHQFECPVATKL 151


>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
          Length = 433

 Score = 43.9 bits (102), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C++C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCDYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C   C++E W   H+ EC
Sbjct: 74  CDVECQKEDW-PMHKLEC 90


>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 315

 Score = 43.9 bits (102), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 46/109 (42%), Gaps = 2/109 (1%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   +++  S   GR ++       G  L  E P A+V+        C+ C    S  + 
Sbjct: 7   AMESVRVVKSATCGRSLVFTSKFAAGKCLLEELPYAYVLCNKKRGLFCDFCLKQCSTLLR 66

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLI 183
           C+ C    YCS  C++  W R H+ +C    R+   V   +A L  ++I
Sbjct: 67  CSSCKYVRYCSRPCQKGDWKRCHKQDCKTLKRIHPRVPPDLAQLLAQII 115


>gi|340054943|emb|CCC49251.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 434

 Score = 43.9 bits (102), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 13/90 (14%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS------- 132
           ++++   +  GR + A+  +  G  + V++P     + P   S C HC + IS       
Sbjct: 154 ELRLPHESVYGRGIYALTAISAGTVVMVDEPFVVQRMVP---SACAHCLSDISTTSKEGG 210

Query: 133 ---APIPCNECILAVYCSESCRREAWLRYH 159
                + C  C +  YCS  CR  AW  YH
Sbjct: 211 AAVGVVLCTHCGVESYCSTGCREAAWREYH 240


>gi|170074355|ref|XP_001870561.1| U5 snRNP-specific protein [Culex quinquefasciatus]
 gi|167871141|gb|EDS34524.1| U5 snRNP-specific protein [Culex quinquefasciatus]
          Length = 951

 Score = 43.9 bits (102), Expect = 0.049,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           IPC  C+  +YC E CR+++W  +H++EC  A +L
Sbjct: 226 IPCPGCVFFMYCGEECRQKSWNLWHQFECPVAPKL 260


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score = 43.5 bits (101), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+ D   G+ +F     ++V+      ++C HC T       C +C  A Y
Sbjct: 14  SPDKGRGLRALRDFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C+R  W   H+ EC
Sbjct: 74  CNVDCQRGDW-PMHKLEC 90


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score = 43.5 bits (101), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+ D   G+ +F     ++V+      ++C HC T       C +C  A Y
Sbjct: 14  SPDKGRGLRALRDFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C+R  W   H+ EC
Sbjct: 74  CNVDCQRGDW-PMHKLEC 90


>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           terrestris]
          Length = 439

 Score = 43.5 bits (101), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 6/111 (5%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + KG T+   KP A+V+        C++C  S      C+ C    YC++SC++ +W   
Sbjct: 8   IKKGTTILTAKPFAYVLCSKYKNVRCDYCFKS-GKLFRCSACQYVYYCNQSCQQMSW-PM 65

Query: 159 HRWECHGALRLLEAVGI---AHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
           H  EC   L+     GI   A L  ++I+  +     ++ Y+ ET+ +  +
Sbjct: 66  HSKEC-ARLKKFSPWGISNVARLMARIIIKLNQGGDEEKGYYNETNYRKFK 115


>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
 gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
 gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
 gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
          Length = 468

 Score = 43.5 bits (101), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 2/65 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP AFV+     +  C++C  +    + C+ C    YC  SC+ +AW + 
Sbjct: 35  IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWGQ- 92

Query: 159 HRWEC 163
           H+ EC
Sbjct: 93  HKHEC 97


>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
           [Brachypodium distachyon]
 gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
           [Brachypodium distachyon]
          Length = 389

 Score = 43.5 bits (101), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 12/119 (10%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSNCNHCCTS 130
           A + +++     +GR ++A  D+ +G+ L  E P+  ++ P      P   S C  C   
Sbjct: 2   AGDALRVADLPGRGRGLVAARDILEGEVLLSEPPI--LLYPSSLASLPSYCSACFRCLPQ 59

Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
                PC  C  A +CS +C   A   + R  C  AL  L A   +H    L L+S++S
Sbjct: 60  APHAAPCPSCRAAAFCSPAC---AAASHPRLLC-AALSRLAAAPESHQEQLLFLLSAYS 114


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score = 43.5 bits (101), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 5/77 (6%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIPCNECILAVYC 146
           KGR +IA      GDT+    P   +++P  S ++  C+HC      P  C+ C    YC
Sbjct: 14  KGRCLIASRSFTPGDTILTFTPT--ILIPSLSHASTVCSHCFKP-GEPRSCSRCHAVAYC 70

Query: 147 SESCRREAWLRYHRWEC 163
             SC+  AW   H  EC
Sbjct: 71  DASCQSAAWKAIHSKEC 87


>gi|170033581|ref|XP_001844655.1| U5 snRNP-specific protein [Culex quinquefasciatus]
 gi|167874623|gb|EDS38006.1| U5 snRNP-specific protein [Culex quinquefasciatus]
          Length = 964

 Score = 43.5 bits (101), Expect = 0.055,   Method: Composition-based stats.
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           IPC  C+  +YC E CR+++W  +H++EC  A +L
Sbjct: 132 IPCPGCVFFMYCGEECRQKSWNLWHQFECPVAPKL 166


>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
          Length = 433

 Score = 43.5 bits (101), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
          Length = 433

 Score = 43.5 bits (101), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|440800364|gb|ELR21403.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 648

 Score = 43.5 bits (101), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 34/116 (29%)

Query: 124 CNHCCTSISAP--------------IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C HC   ++ P              +P  E    VYCSESCR+EAW  +H   C G  +L
Sbjct: 315 CPHCAGDLAKPPPVLWQSVFRAYPSLPPAESYQEVYCSESCRKEAWESHHHSLCPG--QL 372

Query: 170 LEAVGIAHLALKLILVSSHSDRYKEV-YHL-----------ETHLQDMRPEDLYQY 213
            E  G  H AL  +L+      Y+EV YH              H Q+ +P+ ++ +
Sbjct: 373 AENQG-KHFALLNLLIM-----YREVGYHTIAMMLRILAQKAKHQQEAKPDAVFPF 422


>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
           mulatta]
          Length = 433

 Score = 43.5 bits (101), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|170060243|ref|XP_001865716.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878780|gb|EDS42163.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 510

 Score = 43.5 bits (101), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           IPC  C+  +YC E CR+++W  +H++EC  A +L
Sbjct: 226 IPCPGCVFFMYCGEECRQKSWNLWHQFECQVAPKL 260


>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
          Length = 520

 Score = 43.5 bits (101), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 44/95 (46%), Gaps = 2/95 (2%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   + + +   KGR ++A      G+ +  ++P A         S+C+HC  S +    
Sbjct: 13  AGRGLAVASVPGKGRGLVAARSFFPGEVIICQEPYASTPNRISVGSSCDHCFASGNLR-K 71

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           C+ C +A YC   C++E W + H+ EC     L E
Sbjct: 72  CSVCRVAWYCGSVCQKEEW-KLHQLECQAISALTE 105


>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 558

 Score = 43.5 bits (101), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 19/108 (17%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGD-TLFVEKPVAFVILPPCSMSNCNHC------ 127
           M A+ ++K     + GR + A +D   GD  L + +P    +        C  C      
Sbjct: 1   MAANVQVKQSGIESAGRGLFAKKDFQPGDIVLALNRPFVAELDQDRLRDTCAWCLQRGAS 60

Query: 128 ------CTSISAPIP------CNECILAVYCSESCRREAWLRYHRWEC 163
                  TS+  PI       C  C    YCS++C+  AW R H++EC
Sbjct: 61  AEDRAKATSLGLPIAGIETKACTGCKRVRYCSKTCQTRAWKREHKYEC 108


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score = 43.5 bits (101), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 8/96 (8%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC---TSISAPIP- 136
           +++  ++AKGR + A      G  +  ++P A+ ++       C++C        AP+  
Sbjct: 6   VELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGAPVED 65

Query: 137 ---CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
              C  C  A YC++ C+++AW   H+ EC    R+
Sbjct: 66  LHRCTGCKFAQYCTKECQKKAWPE-HKQECAAIKRI 100


>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
           [Taeniopygia guttata]
          Length = 404

 Score = 43.1 bits (100), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 50/114 (43%), Gaps = 32/114 (28%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC-------------- 128
           S+AKG+ + A  ++ KG+T+FVE+PV    F+     +   C+HC               
Sbjct: 19  SSAKGKGLFATRNIRKGETVFVERPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLL 78

Query: 129 --TSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
             +S+  P P            C  C +  YCS  CR+ A  +YH+  C G  R
Sbjct: 79  GRSSLVLPHPEQCSIRKDLHQQCPRCQV-TYCSAECRQAALEQYHQVLCLGPSR 131


>gi|195441983|ref|XP_002068740.1| GK17864 [Drosophila willistoni]
 gi|194164825|gb|EDW79726.1| GK17864 [Drosophila willistoni]
          Length = 504

 Score = 43.1 bits (100), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 36/144 (25%), Positives = 71/144 (49%), Gaps = 16/144 (11%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ ++  GR V+AV+++  G+TL VE+P+  VILP      C  C    +    C
Sbjct: 9   AKKCQVKHNDTLGRFVVAVDNIRAGETLLVEEPI--VILPHLGERRCARCLNLTTN--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
           ++C L   C E        C+R A L+    +    L+  +   +  + +K +L+  H++
Sbjct: 65  SKCQLLPLCEECKDHEERDCQRLADLQLSEEQVQ-QLQDSQRTEVQSM-VKCLLLREHTE 122

Query: 191 R---YKEVYHLETHLQDMRPEDLY 211
               Y+E+Y ++  L   R  +++
Sbjct: 123 TRSLYEEMYQMDAQLDARRGTEVW 146


>gi|410929759|ref|XP_003978267.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Takifugu rubripes]
          Length = 790

 Score = 43.1 bits (100), Expect = 0.069,   Method: Composition-based stats.
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 5/63 (7%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECH----GALRLLEAVGIAHL 178
           C +C  S+S  + PC  C   +YCS  CR +AW   H+ EC     G    + AVG++  
Sbjct: 712 CYYCGRSVSVTLTPCYRCYKVLYCSRPCRLKAWDAIHKKECFRVKAGTRDCVAAVGLSQN 771

Query: 179 ALK 181
            L+
Sbjct: 772 GLE 774


>gi|401424796|ref|XP_003876883.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322493127|emb|CBZ28412.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 712

 Score = 43.1 bits (100), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 31/57 (54%)

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
           P+PC+    A YCSE+C+ +AW+ +H  EC    R  + V + H  L   + +  ++
Sbjct: 426 PVPCDSNCSAFYCSENCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTARSAN 482


>gi|328724231|ref|XP_003248075.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 624

 Score = 43.1 bits (100), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 10/140 (7%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
           T S+ +     +  L  G++  +   S  ++++ S   GR V A  D++ GD + +++P 
Sbjct: 205 TKSTTTMETLCVEQLVGGKNENIPALSAFVELKLSENMGRGVYATCDINPGDVVAIDEPY 264

Query: 112 AFVILPPCS----MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
              I  P S    + + N C    S+ I C +C+L  YC++ C  + +   H   C    
Sbjct: 265 ---ICGPLSEFREVCHYNGCMKIHSSLIHCPKCLLVFYCNKDCMNKDYKDGHNLLCPIMY 321

Query: 168 RLLEAVGIA---HLALKLIL 184
            +    GI     LA+K  L
Sbjct: 322 HIKSRPGITKINELAIKWFL 341


>gi|170044639|ref|XP_001849948.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867702|gb|EDS31085.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 282

 Score = 43.1 bits (100), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 1/72 (1%)

Query: 91  RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
           R ++A +D + GD +   + +   I       NC+HC    S + IP   C+ ++YC E 
Sbjct: 165 RILVAEKDFNPGDVIMDAEQLLTAIDFNLCYENCSHCGVKFSNSLIPFPGCVFSMYCGEE 224

Query: 150 CRREAWLRYHRW 161
           CR+ +W  +H++
Sbjct: 225 CRQNSWKLWHQF 236


>gi|299740414|ref|XP_001838814.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
 gi|298404233|gb|EAU83041.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
          Length = 1320

 Score = 43.1 bits (100), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 12/104 (11%)

Query: 72  HTLMCASNKIKM--QTSNAKGRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMS-----N 123
           + L  A N+IK+   T   KGR    V DV    D + V     F   PP +       N
Sbjct: 381 YRLQMAFNEIKIGSDTEYWKGRRYEDVSDVTVYLDLMNVLYGFGFGRAPPLTQDPRFCYN 440

Query: 124 CNHCCTS----ISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
             HC T     ++    C+ C    YCS  C+++ W R HR EC
Sbjct: 441 IAHCATGYHPYMAGTKWCSGCKSVAYCSRECQKQDWKRVHREEC 484


>gi|307111681|gb|EFN59915.1| hypothetical protein CHLNCDRAFT_132958 [Chlorella variabilis]
          Length = 853

 Score = 43.1 bits (100), Expect = 0.076,   Method: Composition-based stats.
 Identities = 40/145 (27%), Positives = 65/145 (44%), Gaps = 11/145 (7%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS---A 133
           A   + +Q +   GR + A E++  G  +  E+P AF +      S C+ C   ++   A
Sbjct: 327 AGPALVVQQTEDAGRGLSAAEELPAGRDVLAEQPFAFALTKLGRRSVCSTCLAPLADAPA 386

Query: 134 PIPCNECILAVYCSESCRREAWLRYHR---WECHGALRLL---EAVGIAHLALKLILVSS 187
           P  C  C +  YC+ +CR  A   +H+    EC     +L   EAV    LA +L    +
Sbjct: 387 PYYCRRCPMPAYCTPACR--AADPFHQPGGPECGRPWTVLLPAEAVAALRLARRLRFGGA 444

Query: 188 HSDRYKEVYHLETHLQDMRPEDLYQ 212
            S   + V  L TH  ++ P ++ Q
Sbjct: 445 GSPAARHVASLGTHFAELDPTEVVQ 469


>gi|168011005|ref|XP_001758194.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690650|gb|EDQ77016.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 532

 Score = 43.1 bits (100), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 4/76 (5%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI---SAPIPCNECILAVY 145
           +GR  IA  D+  G+ +  E+ VAFV       + C+ CC  +   + P+ C  C  A+Y
Sbjct: 93  RGRCAIASRDIKAGEVIVAEEAVAFVPRSQDRTAVCHECCRDLHQNARPVECPGCKHAIY 152

Query: 146 CSESCRREAWLRYHRW 161
           C + C   A   + +W
Sbjct: 153 C-QDCEASAMASHRKW 167


>gi|219122841|ref|XP_002181746.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407022|gb|EEC46960.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 596

 Score = 43.1 bits (100), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 2/93 (2%)

Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLA 179
           S  NC  C  S    + C  C  A YCS  C++E W   HR +C       +A    +  
Sbjct: 6   SRGNCQGCRKSTGKLLRCQACKQAWYCSIECQKEDWKTNHRKQCRDVAASNKAQATWN-E 64

Query: 180 LKLILVSSHSDRYKEVYHLETH-LQDMRPEDLY 211
           L+ ++ S+  D+ +E +H  +  ++ +R +D Y
Sbjct: 65  LRTLMASASLDQAQEGFHKASDAVKQLRSDDAY 97


>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
 gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
           Y34]
 gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
           P131]
          Length = 542

 Score = 43.1 bits (100), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 49/116 (42%), Gaps = 2/116 (1%)

Query: 90  GRHVIAVEDVHKGDTLF-VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV-YCS 147
           GR + A  D   GD +   + P+A       + + C+HC    +    C  C  AV YCS
Sbjct: 28  GRSLRATRDFQPGDLIAEFDNPLAAFPDAARASTTCHHCLDQNAKVFGCMGCEKAVKYCS 87

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQ 203
             C+R  W   H  EC    ++  AVG   L   +  +     R+ EV  L + L+
Sbjct: 88  SECQRANWKLVHSKECKVFRKVQTAVGKDWLPTPVRTLVQLLVRWAEVQQLVSQLE 143


>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
 gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
          Length = 378

 Score = 43.1 bits (100), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 18/99 (18%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTL-FVEKPVAFVI---LPPCSMSNCNHCCTSISAP---- 134
           ++ S  +G +V+A +D+  GD + F EKP +  I   +       C++C T  +      
Sbjct: 13  VKNSKDQGNYVVASKDIKAGDFIVFREKPYSCCINFKIEENEKKICHYCFTINNNNNLNN 72

Query: 135 ----------IPCNECILAVYCSESCRREAWLRYHRWEC 163
                     I CN C L  YCSE C+ E +   H++EC
Sbjct: 73  NIHNMHTDNFIICNNCKLNWYCSEKCKNEDYQTSHQYEC 111


>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
          Length = 464

 Score = 42.7 bits (99), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 6/104 (5%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
           + A +    G+++ VE+  ++V       + C+ C     +   C++C  A YC   C++
Sbjct: 3   LFATKPYEVGNSILVEEAYSWVTFRDNGPNICHTCFKFSKSLKKCSQCKFARYCGLDCQK 62

Query: 153 EAWLRY-HRWECHGALRLLEA-----VGIAHLALKLILVSSHSD 190
           +AW    HRWEC      +EA     V +A L L   L S+  D
Sbjct: 63  KAWKENSHRWECKAIASSVEAKIPSIVRLAALFLFRALNSNDKD 106


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score = 42.7 bits (99), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 2/81 (2%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGR ++  ++ + G+ +  ++P   V     ++S C+ C  S S    C+ C +  YC  
Sbjct: 21  KGRCLLTTKNFNPGEVILRQEPYVCVPNNSSTVSRCDGCFASESLK-KCSACQVVWYCGS 79

Query: 149 SCRREAWLRYHRWECHGALRL 169
           +C++  W + HR EC+   RL
Sbjct: 80  TCQKSEW-KLHRLECNALSRL 99


>gi|302835433|ref|XP_002949278.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
           nagariensis]
 gi|300265580|gb|EFJ49771.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
           nagariensis]
          Length = 467

 Score = 42.7 bits (99), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 6/82 (7%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--IPCN 138
           ++++T N KGR +     V  G+T+  E P+  ++ P  + S C  C  ++     I C+
Sbjct: 15  LEVRTHNRKGRSLYTTRLVKGGETVLDEAPLLLLVAPETAQSTCIACLRAVRPQECIECS 74

Query: 139 ECILAVYCSESCRREA----WL 156
            C  A +CS +C++ A    WL
Sbjct: 75  VCRQACFCSATCQQTAQGTPWL 96


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score = 42.7 bits (99), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 5/87 (5%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIP 136
           + + ++  + +GR + A + V  G  L V  P+  +++P  S     C+ C   ++ P P
Sbjct: 7   DGVVIKPHSTRGRALHATKTVAAGGVLAVFSPL--ILVPSLSHLTIVCSFCLRGVT-PRP 63

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C  A YC   C+  AW   H  EC
Sbjct: 64  CSRCRAAYYCDAQCQAAAWSSGHAAEC 90


>gi|66804135|ref|XP_635865.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
 gi|60464204|gb|EAL62363.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
          Length = 447

 Score = 42.7 bits (99), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 18/121 (14%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTL-FVEKP--VAFVILPPC 119
           +PSL   E   L+  S    ++ S  KGR+V+A  D+  G+++ F E+P  V      P 
Sbjct: 56  IPSLLSSE---LISNSKNSIIKQSKEKGRYVVAKRDIKAGESIVFREEPYVVCMNYKDPY 112

Query: 120 SMSNCNHCC--------TSISAP---IPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
               C+HC         + I+ P   + C  C +  YCS  C+    L YH+ EC    R
Sbjct: 113 ERKICHHCFGLSSTTSKSMITTPDFTLHCETCNIVWYCSNYCQSND-LTYHKHECFTYKR 171

Query: 169 L 169
           +
Sbjct: 172 M 172


>gi|66363372|ref|XP_628652.1| SET domain protein with MYND insert (Skm/BOP family)
           [Cryptosporidium parvum Iowa II]
 gi|46229836|gb|EAK90654.1| SET domain protein with MYND insert (Skm/BOP family)
           [Cryptosporidium parvum Iowa II]
          Length = 587

 Score = 42.7 bits (99), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 9/97 (9%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----- 134
           KI+++    KGR +IA E++  G+++ +E+P   ++        C+ C   + +      
Sbjct: 20  KIEIKEDERKGRSIIAKEEIQIGESILMEEPYCRILFSDNIEEICDTCFNYLRSEGSYSE 79

Query: 135 --IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
             + C EC    +CS+ C  E+    H +EC G L+L
Sbjct: 80  CILECQECKKIKFCSKKCMEES-KTIHHFEC-GILKL 114


>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
          Length = 456

 Score = 42.7 bits (99), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 6/81 (7%)

Query: 86  SNAKGRHVIAV-EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC--TSISAPIPCNECIL 142
           S A+ R V+A  + V  G  +     +A  +LP      C+ CC  T++ A   C+ C  
Sbjct: 12  SQARSRAVVAPGQTVKPGQVILRSSSLATSLLPKSRGQRCDECCRQTTVKA---CSRCKE 68

Query: 143 AVYCSESCRREAWLRYHRWEC 163
           A YC   C+  AW  +HR  C
Sbjct: 69  AFYCDTRCQSAAWKSHHRTTC 89


>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
 gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
 gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 793

 Score = 42.7 bits (99), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
           KGR + +  D+     +  E P+A +I+  C  ++C++C +   A  + C  C + +YCS
Sbjct: 227 KGRGMSSPNDISPASLIHAEDPLAVIIMKSCRDTHCHYCFSEAPADVVVCPSCTIPIYCS 286

Query: 148 ESCRREA 154
             C+ +A
Sbjct: 287 NRCQEKA 293


>gi|145517400|ref|XP_001444583.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412005|emb|CAK77186.1| unnamed protein product [Paramecium tetraurelia]
          Length = 354

 Score = 42.7 bits (99), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 2/62 (3%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE 194
           + C+ C  A YCS  CR   W  YH   C+G   L+  V ++ L  ++ L+ S S  + +
Sbjct: 23  VTCDSCSNAFYCSTKCRDFDWEGYHYLVCNGLFELVPKVSLSQLNDEMDLIGSGS--FGQ 80

Query: 195 VY 196
           VY
Sbjct: 81  VY 82


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 45/95 (47%), Gaps = 13/95 (13%)

Query: 73  TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           T+MC       + + AKG  VIA +D+  G T+  E  +A+  L    MS C+HC T + 
Sbjct: 57  TIMC-------EKTPAKGTVVIAKKDIPAGTTILAEPAIAWQPLEDRIMSVCHHCMTEVP 109

Query: 133 A-PIPCNE----CILAVYCSESCRREAWLRYHRWE 162
              + C E    C    YCS  C REA    HR E
Sbjct: 110 RWAVGCGEGAGGCSGLGYCSPKC-REASEALHRVE 143


>gi|190348269|gb|EDK40693.2| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 637

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 6/88 (6%)

Query: 124 CNHCCTSISA---PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAH 177
           C HC   + A      CN C L  YC++ C   +W + H++EC    +L  + G      
Sbjct: 72  CQHCTRRLPAHKSQYTCNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQLKSSFGYQEFVR 131

Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
           LAL+L  + S     K +  L TH +++
Sbjct: 132 LALRLCTLYSIPSYRKNLERLTTHREEV 159


>gi|312384571|gb|EFR29269.1| hypothetical protein AND_01935 [Anopheles darlingi]
          Length = 750

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 5/92 (5%)

Query: 93  VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCR 151
            +  + +  GD +  E+   + I P   +  C++C   ++ A +PC +C    YCSE+C 
Sbjct: 236 FVTTKRLKPGDVVLTEEACLWKISPEGQLELCDYCGADAVLALVPCVDCNRVFYCSEACH 295

Query: 152 REAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
           +     YH  EC    ++L   G++  A +L+
Sbjct: 296 KAGVSEYHWLEC----KVLRNQGLSLSATQLL 323


>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
           Group]
          Length = 687

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 1/67 (1%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
           KGR + +  D+     +  E P+A +I+  C  ++C++C +   A  + C  C + +YCS
Sbjct: 121 KGRGMSSPNDISPASLIHAEDPLAVIIMKSCRDTHCHYCFSEAPADVVVCPSCTIPIYCS 180

Query: 148 ESCRREA 154
             C+ +A
Sbjct: 181 NRCQEKA 187


>gi|146413731|ref|XP_001482836.1| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 637

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 6/88 (6%)

Query: 124 CNHCCTSISA---PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAH 177
           C HC   + A      CN C L  YC++ C   +W + H++EC    +L  + G      
Sbjct: 72  CQHCTRRLPAHKSQYTCNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQLKSSFGYQEFVR 131

Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
           LAL+L  + S     K +  L TH +++
Sbjct: 132 LALRLCTLYSIPSYRKNLERLTTHREEV 159


>gi|67596952|ref|XP_666111.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54657034|gb|EAL35879.1| hypothetical protein Chro.70568 [Cryptosporidium hominis]
          Length = 575

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 9/97 (9%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----- 134
           KI+++    KGR +IA E++  G+++ +E+P   ++        C+ C   + +      
Sbjct: 7   KIEIKEDKRKGRSIIAKEEIQIGESILMEEPYCRILFSDNIEEICDTCFNYLRSAGSYSE 66

Query: 135 --IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
             + C EC    +CS+ C  E+    H +EC G L+L
Sbjct: 67  CILECQECKKIKFCSKKCMEES-KTIHHFEC-GILKL 101


>gi|113206687|gb|ABI34494.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 156

 Score = 42.4 bits (98), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 51/132 (38%), Gaps = 10/132 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD +F     A+V+        C  C T       C +C  A Y
Sbjct: 14  SPGKGRGLKAIKHFKVGDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAYY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLE---AVGIAHLALKLILVSSH------SDRYKEVY 196
           C+  C+R  W   H+ EC       E         L  ++IL   H      S+R   + 
Sbjct: 74  CNVECQRGDWP-MHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTLR 132

Query: 197 HLETHLQDMRPE 208
            LE HL  +  E
Sbjct: 133 ELEAHLDKLDNE 144


>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
 gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
          Length = 461

 Score = 42.4 bits (98), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 4/103 (3%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + +G  +  EKP AFV+        C++C         C+ C    YC+ +C+ +AWL  
Sbjct: 22  IKRGQRILTEKPFAFVLKSKYRQQRCDNCLEEGKVR-KCSNCRYVYYCNRACQTQAWL-M 79

Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
           H+ EC    R    +    A +  +LIL   H       Y+ E
Sbjct: 80  HKHECPFLKRSYPRIVPDAARMLCRLILRLEHGGDLIRGYYTE 122


>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
 gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
          Length = 433

 Score = 42.4 bits (98), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD LF     A+V+      ++C  C         C  C  A Y
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEFCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 74  CNVECQKEDW-PMHKLEC 90


>gi|358253876|dbj|GAA53887.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
          Length = 869

 Score = 42.4 bits (98), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 32/67 (47%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
           G H+ A  ++  G+ + +EKP A  I     + +C  C       +PC +C    +CS  
Sbjct: 324 GWHLQASRNIEAGEVILIEKPYARRICRRFLLKHCYQCFKRCLNLLPCRKCSEVGFCSTE 383

Query: 150 CRREAWL 156
           C + +WL
Sbjct: 384 CEKTSWL 390


>gi|296422187|ref|XP_002840643.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636864|emb|CAZ84834.1| unnamed protein product [Tuber melanosporum]
          Length = 502

 Score = 42.0 bits (97), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/132 (25%), Positives = 61/132 (46%), Gaps = 14/132 (10%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           ++  + ++E ++  ++  S+ A   V  LP L       L  A +  +   S+ K   V 
Sbjct: 169 DRFSDKTLEVLNAAMAKCSAKAELKVVDLPVL----QRDLSFAGDGDR---SSDKHLGVY 221

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILA------VYCS 147
           A ED+  G+T F E     V+  P + S C +C  ++      C++C         ++CS
Sbjct: 222 AKEDIFLGETFFSEASPLTVVADPENSSLCEYCGGNLPDQYEVCDDCWEGDEDSGVMWCS 281

Query: 148 ESCRREAWLRYH 159
           E CR++A+  YH
Sbjct: 282 EECRQKAFKSYH 293


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score = 42.0 bits (97), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/151 (23%), Positives = 60/151 (39%), Gaps = 17/151 (11%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
           C    + ++ S   GR +     + +G +L   +P   V+  P   S+C+ C    S   
Sbjct: 29  CIPPTLDLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPER 88

Query: 136 P----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV------ 185
           P    C +C +  YCS+ C+   W + H+ EC    R   A     LA+    V      
Sbjct: 89  PQLKRCAKCHVVHYCSQKCQNADW-KAHKPECDALQRWATAAPSPDLAVPEEAVRCLGRM 147

Query: 186 ------SSHSDRYKEVYHLETHLQDMRPEDL 210
                 +  S   +E+  +++H   +RPE  
Sbjct: 148 LWQKQRNPSSIWSREIDSMQSHRSSLRPESF 178


>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
 gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
          Length = 435

 Score = 42.0 bits (97), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 51/132 (38%), Gaps = 10/132 (7%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A++    GD +F     A+V+        C  C T       C +C  A Y
Sbjct: 14  SPGKGRGLKAIKHFKVGDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAYY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLE---AVGIAHLALKLILVSSH------SDRYKEVY 196
           C+  C+R  W   H+ EC       E         L  ++IL   H      S+R   + 
Sbjct: 74  CNVECQRGDW-PMHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTLR 132

Query: 197 HLETHLQDMRPE 208
            LE HL  +  E
Sbjct: 133 ELEAHLDKLDNE 144


>gi|299744169|ref|XP_001840927.2| hypothetical protein CC1G_03156 [Coprinopsis cinerea okayama7#130]
 gi|298405996|gb|EAU80980.2| hypothetical protein CC1G_03156 [Coprinopsis cinerea okayama7#130]
          Length = 486

 Score = 42.0 bits (97), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 7/54 (12%)

Query: 117 PP---CSMSNCNHCCTSISAPIP----CNECILAVYCSESCRREAWLRYHRWEC 163
           PP   C++ + +       AP+P    C+ C+  VYCS  C+ E WL  HR+EC
Sbjct: 268 PPVMCCNLRHLDKTHGRTPAPLPEPKACSRCLCVVYCSTECQTEDWLAIHRFEC 321


>gi|328715975|ref|XP_003245797.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 632

 Score = 42.0 bits (97), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 5/115 (4%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
           T    +    S+  L  G +  +   S  ++++ S   GR V A  D++ GD + +++P 
Sbjct: 199 TEQKRTMETPSVEHLVGGRNENIPALSAFVELKMSEDMGRGVYATRDIYPGDVVAIDEP- 257

Query: 112 AFVILP---PCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            ++  P      + + N C     A   C +C L  YC+E C  + +   H   C
Sbjct: 258 -YICGPFRDDIEVCHYNGCLKLGFALFRCPKCQLVYYCNEDCMNKDYKDGHNLAC 311


>gi|195488849|ref|XP_002092487.1| GE11640 [Drosophila yakuba]
 gi|194178588|gb|EDW92199.1| GE11640 [Drosophila yakuba]
          Length = 497

 Score = 42.0 bits (97), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/145 (25%), Positives = 68/145 (46%), Gaps = 16/145 (11%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ ++  GR V+A+ ++  G+TL +E P+  V+LP      C+ C     +   C
Sbjct: 9   ARKCEIKQNDTLGRFVVALCNLRAGETLLLENPI--VVLPQIGDRRCSKCFKLTQS--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
            +C L   C +        CRR A +     +    L+  E   I   ALK +L+  H +
Sbjct: 65  RKCRLLALCEDCSDHDGRDCRRLAEMNLSNDQVE-LLQRKEHTEIQS-ALKCLLLREHEE 122

Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
               Y+E+  +E+ L   R  D+++
Sbjct: 123 TLPLYEEMSQMESQLASRRGTDIWK 147


>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
           carolinensis]
          Length = 334

 Score = 42.0 bits (97), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  +G  +   + V  G  L+ E+P A+ +      + C  C       + C++C +A Y
Sbjct: 41  SPGRGNGLRLSKAVKAGALLYREEPFAYAVTKKRLGAACERCLRRKERLLRCSQCKVARY 100

Query: 146 CSESCRREAWLRYHRWEC 163
           C   C++EAW   H+ EC
Sbjct: 101 CDARCQKEAW-EDHKREC 117


>gi|358398595|gb|EHK47946.1| hypothetical protein TRIATDRAFT_290442 [Trichoderma atroviride IMI
           206040]
          Length = 561

 Score = 41.6 bits (96), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 70/166 (42%), Gaps = 24/166 (14%)

Query: 3   KQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVAS 62
           +QD   +      L R +     D + D  + E     S+++++  +   +   +  VA+
Sbjct: 179 RQDFDASDLPDFGLVRREVYPWNDHEPDRFSPE-----SLQSLNDGLKEMAPKCAVQVAT 233

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM- 121
           LP L  GES T     N   + T    G  V A ED+  G+ +  E    + +L   +  
Sbjct: 234 LPILLEGESET----DNYEIIPTCKQLG--VFAQEDIEAGEVVLRE----YTLLTANNRH 283

Query: 122 --SNCNHCCTSISA------PIPCNECILAVYCSESCRREAWLRYH 159
             S C+ C + ++       PI C EC   V+CS+ C   A  RYH
Sbjct: 284 KDSVCDACSSDLAPLGSDNQPIRCEECFDTVFCSQECHDLAQDRYH 329


>gi|328699920|ref|XP_001945553.2| PREDICTED: hypothetical protein LOC100161821 [Acyrthosiphon pisum]
          Length = 719

 Score = 41.6 bits (96), Expect = 0.20,   Method: Composition-based stats.
 Identities = 20/57 (35%), Positives = 27/57 (47%)

Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           + KP  FV  P   M+ C     + SA I C+ C    YCS +C+R  W   H  +C
Sbjct: 656 INKPKTFVKPPQIIMTKCYARNCTKSATIMCSSCTSVKYCSHNCQRHDWYDSHINDC 712


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score = 41.6 bits (96), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 64/150 (42%), Gaps = 21/150 (14%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  +S  +G  + A+  +  G+ LF   P+A+ +        C  C       + C+ 
Sbjct: 5   KVEKFSSANRGNGLRALVQLRPGELLFRSDPLAYTVCKGSRGVVCVRCIIGRKYLLYCSR 64

Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
           C L   C   CR++AW + H+ EC             ++RLL       +  KL+    S
Sbjct: 65  CRLLAQCGAKCRKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPS 118

Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
            S++    Y LE+++  +   + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148


>gi|299754623|ref|XP_001841076.2| hypothetical protein CC1G_04920 [Coprinopsis cinerea okayama7#130]
 gi|298410845|gb|EAU80810.2| hypothetical protein CC1G_04920 [Coprinopsis cinerea okayama7#130]
          Length = 600

 Score = 41.6 bits (96), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 134 PIP--CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
           P+P  C  C++  YCS  C+RE W   HR+EC G  R  E++
Sbjct: 402 PVPKACRGCLVVGYCSAECQREDWQLIHRFECDGLKRQKESM 443


>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
 gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
 gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
          Length = 420

 Score = 41.6 bits (96), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 32/114 (28%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC-------------- 128
           S+AKG+ + A   + KG+ +FVEKPV    F+     +   C+HC               
Sbjct: 35  SSAKGKGLFATRSIRKGEAVFVEKPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLL 94

Query: 129 --TSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
             +S+  P P            C  C +  YCS  CR+ A  +YH+  C G  R
Sbjct: 95  GRSSLVLPHPEQCSIRKDLHQQCPRCQV-TYCSAECRQAALEQYHQVLCLGPSR 147


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score = 41.6 bits (96), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 36/81 (44%), Gaps = 2/81 (2%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGR +    D+  G+ +  +KP   V       S C+ C  + +    C+ C +  YC  
Sbjct: 21  KGRSLFTTRDIRPGEVILSQKPYICVPNNTSLESRCDGCFKTNNLK-KCSGCQVVWYCGS 79

Query: 149 SCRREAWLRYHRWECHGALRL 169
            C++  W + HR EC    RL
Sbjct: 80  FCQKSEW-KLHRHECKALTRL 99


>gi|384490543|gb|EIE81765.1| hypothetical protein RO3G_06470 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 41.6 bits (96), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 6/128 (4%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT---SISAPIPCNECILAV-Y 145
           G  V+A   + +G T+  E P    +      ++C +C T    I       +C   V Y
Sbjct: 43  GERVVANAKLKQGRTILEEDPFIRQLNDANRTTHCTYCFTEFKKIKKLCRNKDCKWEVLY 102

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
           CS +C+++ WL  H+W C     L         AL+  + S     Y  +  L +++   
Sbjct: 103 CSLNCQQQHWLTEHKWFCRFP-NLANTDKNVLFALQGYIASRSKSEY-TLPGLVSNIDSY 160

Query: 206 RPEDLYQY 213
           +PEDL +Y
Sbjct: 161 KPEDLEEY 168


>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
           niloticus]
          Length = 435

 Score = 41.6 bits (96), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + AV     G+ +F     ++V+      ++C HC T       C +C  A Y
Sbjct: 14  SPDKGRGLRAVRHFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C+R  W   H+ EC
Sbjct: 74  CNVDCQRGDW-PMHKLEC 90


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score = 41.2 bits (95), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 43/98 (43%), Gaps = 7/98 (7%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC---CTSISA 133
           A   + +  S  KGR +IA     +G+ + ++ P   V+        C+ C   CT++  
Sbjct: 22  AEKGLAISISPEKGRCLIAKRSFTRGEIVLLQDPYVSVLDSASVNKRCDVCFRLCTNLKR 81

Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
              C+ C    YC  +C+R  W R H+ EC     L E
Sbjct: 82  ---CSVCKTTWYCGGTCQRNGW-RLHQHECKAITSLKE 115


>gi|195570243|ref|XP_002103118.1| GD20257 [Drosophila simulans]
 gi|194199045|gb|EDX12621.1| GD20257 [Drosophila simulans]
          Length = 408

 Score = 41.2 bits (95), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 3/83 (3%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
            + S  K   V A  DV KG+ + VE PV F    P    NC  C         C+ C  
Sbjct: 232 FEKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRY 289

Query: 143 AVYCSESCRREAWLRYHRWECHG 165
             YCS SC + +    H++EC G
Sbjct: 290 RSYCSMSCMK-SDAEVHQYECTG 311


>gi|409042906|gb|EKM52389.1| hypothetical protein PHACADRAFT_148986 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 436

 Score = 41.2 bits (95), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)

Query: 49  VSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ-TSNAKGRHVIAVEDVHKGDTLFV 107
            +S+ S + +  A  P+ A  E   +   ++KI+++     KG+ ++A E + +G+ ++ 
Sbjct: 64  AASSRSKSGTGNAPYPTSAVIEGLDVSKWTSKIQVKYFGKHKGKGLVATERIAEGEVVWK 123

Query: 108 EKPVAFVILPPCSMSN-------CNHCCTSISAPIPCNECILAVYCSESCR 151
           E P  F++ P  S+ +       C HC T      P N   LAV CS S +
Sbjct: 124 EDP--FILAPEWSLYDLQTASVACTHCST------PLNNSPLAVPCSGSAK 166


>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 487

 Score = 41.2 bits (95), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 21/87 (24%), Positives = 39/87 (44%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   +++  S   GR ++      +G  +  E P A+ +        C+ C    S    
Sbjct: 5   AMESVRVVKSATCGRSLVFTSKFARGQCILEELPYAYTLHDNKRGLFCDFCLKKCSTLKK 64

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C    YC++SC+++ W R H+ +C
Sbjct: 65  CSSCNYVSYCNKSCQKKDWARCHKQDC 91


>gi|350593085|ref|XP_001927076.4| PREDICTED: tudor domain-containing protein 1-like [Sus scrofa]
          Length = 1373

 Score = 41.2 bits (95), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 59/148 (39%), Gaps = 12/148 (8%)

Query: 18  RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSST----SSSASSSVASLPSLACGESHT 73
           RL+     + K + L  E++++  V      VSS     S  A+ S      L  G   +
Sbjct: 245 RLKSPETGNKKNNFLLCEQTKQYLVSQEDSSVSSNPNGVSGEAAGSRGDRKMLPTGNPVS 304

Query: 74  LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF-----VEKPVAFVIL-PPCSMSNCNHC 127
            +   N    +  N+K  + +      KGD L      +  P  F  L PP   + C+ C
Sbjct: 305 PLNIGNNSPPKEINSKPSNNVPPAKSKKGDKLVENSLSINNPALFSSLGPPLRSTTCHRC 364

Query: 128 CTSISAPIPCNECILAVYCSESCRREAW 155
              +   + C++C    YCS +C+R  W
Sbjct: 365 --GLFGSLRCSQCKQTYYCSTACQRRDW 390


>gi|145518734|ref|XP_001445239.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124412683|emb|CAK77842.1| unnamed protein product [Paramecium tetraurelia]
          Length = 356

 Score = 41.2 bits (95), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 2/62 (3%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE 194
           I C+ C    YCS  CR   W  YH   C+G   L+  + ++ L  ++ L+ S S  + +
Sbjct: 23  ITCDSCSSTFYCSSKCRDFDWEAYHYLVCNGQFELVPKLSLSQLNDEMNLIGSGS--FGQ 80

Query: 195 VY 196
           VY
Sbjct: 81  VY 82


>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 578

 Score = 41.2 bits (95), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
           +S  I ++ + +KGRH +A   +  G T+ VE P AFV+    +   C++C
Sbjct: 13  SSYPITLKATKSKGRHFVAKSQLQPGTTVLVESPCAFVLFKDNARDYCSYC 63


>gi|405974952|gb|EKC39559.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
          Length = 766

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 48/105 (45%), Gaps = 31/105 (29%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPV--AFVILPPCSMS-NCNHCCTSI--------------- 131
           G+ V+A + + +G+ LF EKP+    V+    + + +C++C  SI               
Sbjct: 338 GKCVVAKKAIKEGEELFTEKPLIMGHVMDKDNNFALSCDNCAASILTAEDYFGSTLETME 397

Query: 132 ---------SAP-IP---CNECILAVYCSESCRREAWLRYHRWEC 163
                    S P IP   C++C    YCSE CRR+AW+  H   C
Sbjct: 398 LDLKELIKESWPDIPTVTCDKCQKVKYCSEDCRRQAWVSQHELIC 442


>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           impatiens]
          Length = 439

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 50/111 (45%), Gaps = 6/111 (5%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           + KG T+   KP A+V+        C++C  S      C+ C    YC++SC++ +W   
Sbjct: 8   IKKGTTILTAKPFAYVLCSKYKNVRCDYCFKS-GKLFRCSACQYVYYCNQSCQQMSW-PM 65

Query: 159 HRWECHGALRLLEAVGI---AHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
           H  EC   L+     GI   A L  ++I+  +     +  Y+ ET  +  +
Sbjct: 66  HSKEC-ARLKKCSPWGISNVARLMARIIIKLNQGGDEERGYYNETSYRKFK 115


>gi|363740353|ref|XP_003642314.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Gallus gallus]
          Length = 708

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGAL 167
           C  C  S+   + PC  C     CSE+CRR++W   H  EC G L
Sbjct: 375 CYQCGRSVGVQLSPCKRCYEVFTCSEACRRKSWNERHSLECSGLL 419


>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
 gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
          Length = 331

 Score = 40.8 bits (94), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
           A + ++      +GR ++A   + +G+ +  E+P+   +L P S+++    C+ C  S+S
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLASLPSFCSACFRSLS 58

Query: 133 API-PCNECILAVYCSESCRREAWLRYH 159
           A   PC  C  A +CS SC   +  R H
Sbjct: 59  AAASPCPSCRAAGFCSPSCAAASHPRLH 86


>gi|358387028|gb|EHK24623.1| hypothetical protein TRIVIDRAFT_30213 [Trichoderma virens Gv29-8]
          Length = 561

 Score = 40.8 bits (94), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 19/128 (14%)

Query: 41  SVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVH 100
           S+++++  +   +   +  VA+LP L  GES T     N   + T    G  V A ED+ 
Sbjct: 212 SLQSLNDGLREMAPKCAVQVATLPVLLEGESET----DNYEIIPTCKQLG--VFAQEDIE 265

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAP-------IPCNECILAVYCSESCR 151
            G+ +  E    + +L   +    + C  C+S  AP       I C EC   V+CS+ C 
Sbjct: 266 AGEVVLRE----YTLLTANNRHKDSVCDACSSELAPLGSENRSIQCEECFDTVFCSQECH 321

Query: 152 REAWLRYH 159
            EA  RYH
Sbjct: 322 DEAQSRYH 329


>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 509

 Score = 40.8 bits (94), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 44/101 (43%), Gaps = 10/101 (9%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCSMSNCNHCCTSI 131
           + +  + A GRH+ A  D   G+ +   KP A V           P C  +       + 
Sbjct: 26  LALAYTPALGRHITAKRDFRAGELVLASKPYAAVADTDGPAAGKAPECFQAQDEDADVAA 85

Query: 132 SAPIP-CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           +A +  C +C  A YCS  C+R AW   H+ EC   +R L+
Sbjct: 86  AAEMKRCAQCRRAQYCSVECQRAAWHGGHKAECAAWVRGLQ 126


>gi|440802445|gb|ELR23374.1| SET and MYND domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 521

 Score = 40.8 bits (94), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 46/124 (37%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-----CNHCCTSI------------ 131
           KG+ ++A+ D  +GD +F E+P+  V  PP   +      C++C   +            
Sbjct: 170 KGKGLVALRDFKQGDEIFHEEPL--VCAPPFFPAQEKPLCCSNCMRYLDTREAMVDRMVA 227

Query: 132 ---------SAPIP----------------CNECILAVYCSESCRREAWLRYHRWEC--H 164
                    +A +P                C  C + VYC+  C+ +AW RYH   C  +
Sbjct: 228 HVKQSDKAKAAALPRFDELCPPFSETEGQMCQGCGVEVYCTTQCKDDAWRRYHEAACPAY 287

Query: 165 GALR 168
            ALR
Sbjct: 288 AALR 291


>gi|328876145|gb|EGG24508.1| hypothetical protein DFA_02751 [Dictyostelium fasciculatum]
          Length = 915

 Score = 40.8 bits (94), Expect = 0.38,   Method: Composition-based stats.
 Identities = 15/41 (36%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLR-YHRWEC 163
           C+ C  ++  P+ C+ C   VYCS++C++E+W    HR +C
Sbjct: 847 CHQCKQTLMKPLLCSGCKAVVYCSKACQKESWFHDGHRDKC 887


>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
          Length = 392

 Score = 40.8 bits (94), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 8/79 (10%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
           A + ++      +GR ++A   + +G+ +  E+P+   +L P S+S+    C+ C  S+S
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLSSLPSFCSACFRSLS 58

Query: 133 API-PCNECILAVYCSESC 150
           A   PC  C  A +CS SC
Sbjct: 59  AAASPCPSCRAAGFCSPSC 77


>gi|255085856|ref|XP_002505359.1| predicted protein [Micromonas sp. RCC299]
 gi|226520628|gb|ACO66617.1| predicted protein [Micromonas sp. RCC299]
          Length = 435

 Score = 40.8 bits (94), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 4/79 (5%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKP-VAFVILPPCSMSNCNHCCTSISAPIP---CNECILAV 144
           +GR+ + ++D    D LF+  P +        SM  C  C        P   C++C LA 
Sbjct: 264 RGRYAMGMDDPTAADRLFIAAPDLGLEQAKIDSMLKCYMCGECALPHEPHEFCSKCHLAC 323

Query: 145 YCSESCRREAWLRYHRWEC 163
           YCS  C+++ W   H+ EC
Sbjct: 324 YCSRDCQKQHWKWRHKREC 342


>gi|328705723|ref|XP_003242888.1| PREDICTED: hypothetical protein LOC100576056 [Acyrthosiphon pisum]
          Length = 669

 Score = 40.8 bits (94), Expect = 0.40,   Method: Composition-based stats.
 Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
           V  G  L  ++P   VI   C M+ C +     SA I C+ C +  YCS SC+++ W+  
Sbjct: 601 VDVGTHLAGKQPTTDVI--KCYMNKCEN-----SAKIMCSNCKIVKYCSLSCQQQHWIIK 653

Query: 159 HRWECHGALRLLEAV 173
           H  EC    RL +A+
Sbjct: 654 HIDECDNLRRLNQAL 668


>gi|195349219|ref|XP_002041144.1| GM15391 [Drosophila sechellia]
 gi|194122749|gb|EDW44792.1| GM15391 [Drosophila sechellia]
          Length = 686

 Score = 40.4 bits (93), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 35/83 (42%), Gaps = 3/83 (3%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
            + S  K   V A  DV KG  + VE PV F    P    NC  C         C+ C  
Sbjct: 232 FEKSIEKSGGVFASCDVPKGKIVLVENPVYFQFSAP--FLNCELCGVHQQQLYSCDNCRY 289

Query: 143 AVYCSESCRREAWLRYHRWECHG 165
             YCS SC + +    H++EC G
Sbjct: 290 RSYCSMSCMK-SDAEVHQYECTG 311


>gi|407424679|gb|EKF39087.1| hypothetical protein MOQ_000686 [Trypanosoma cruzi marinkellei]
          Length = 429

 Score = 40.4 bits (93), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 9/76 (11%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
           GR + A+  ++ G  + +++P    ++       C HC  +I      +  + C  C   
Sbjct: 163 GRGIYALTRINSGTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219

Query: 144 VYCSESCRREAWLRYH 159
           VYCS +CR  AW  YH
Sbjct: 220 VYCSVACRDAAWREYH 235


>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 522

 Score = 40.4 bits (93), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 4/88 (4%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI---PC 137
           + ++   A+GR+++A   +  G+T+FV  P A+ +   C    C  C            C
Sbjct: 25  VAVREDAARGRYMVATRAIAAGETVFVALPYAYAVFTNCRKRVCAFCLLYDEQGRLFEHC 84

Query: 138 NECILAVYCSESCR-REAWLRYHRWECH 164
             C    YCS  CR R+  +  H+  CH
Sbjct: 85  AMCDQVYYCSAQCRQRDMDIGLHQRLCH 112


>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
 gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
           181]
          Length = 544

 Score = 40.4 bits (93), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 32/142 (22%), Positives = 62/142 (43%), Gaps = 14/142 (9%)

Query: 86  SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
           S   G  + A  D+  G T + +++P   V+        C+      HC   ++  +  C
Sbjct: 65  SPGAGTGIFAATDIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
             C +  YC+++C+ + W   H +EC     L+  V    A   L+++L S      KE 
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFAHSFECRVYKELMPRVLPSNARAILRMVLRSERGKYPKEE 184

Query: 195 ---VYHLETHLQDMRPEDLYQY 213
                 LE+H+ +++ ++  Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206


>gi|195500128|ref|XP_002097242.1| GE26113 [Drosophila yakuba]
 gi|194183343|gb|EDW96954.1| GE26113 [Drosophila yakuba]
          Length = 690

 Score = 40.4 bits (93), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 12/105 (11%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
           N+ G  V A  ++ KG+ + VE PV F    P    NC+ C         C++C    YC
Sbjct: 238 NSGG--VFASCEMPKGEIVLVENPVHFQFSAP--FLNCDLCGVHQEQLYTCDDCRYRTYC 293

Query: 147 SESCRREAWLRYHRWECH----GALRLLEAVGIAHL---ALKLIL 184
           S  C  E+    H++EC+    G + +LEA  +  L   AL+ IL
Sbjct: 294 SRLC-MESDAEIHQYECYGYRIGLIPMLEASMLFRLFYEALEYIL 337


>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
          Length = 445

 Score = 40.4 bits (93), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 6/66 (9%)

Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILAVYCSESCRREAWLR 157
           T+  EKP+A V+ P    + C  C + I      S  + C++C    YCS  C+R+ W  
Sbjct: 3   TVAREKPLAAVLSPEFQDTYCATCFSEIDPSHLDSEILTCDDCTQVSYCSLKCQRKDWKT 62

Query: 158 YHRWEC 163
            H+ EC
Sbjct: 63  VHQLEC 68


>gi|326477204|gb|EGE01214.1| hypothetical protein TEQG_00267 [Trichophyton equinum CBS 127.97]
          Length = 1184

 Score = 40.4 bits (93), Expect = 0.53,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 97   EDVHKGDTLFVEKPVAFVILPPCSMSNCN-HCCTSISAPIPCNECILAVYCSESCRREAW 155
            +D  +G +  +E  +  + L   S SNC      S +  + C+ C +A YCS++C+RE W
Sbjct: 1110 KDTKEGVSQKIESLLQLMDLKKGSCSNCGGKKPGSGTKLLRCSGCKVAEYCSKNCQREDW 1169

Query: 156  LRYHRWEC 163
            +  H++ C
Sbjct: 1170 MARHKFAC 1177


>gi|194882335|ref|XP_001975267.1| GG20655 [Drosophila erecta]
 gi|190658454|gb|EDV55667.1| GG20655 [Drosophila erecta]
          Length = 498

 Score = 40.4 bits (93), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 36/145 (24%), Positives = 69/145 (47%), Gaps = 16/145 (11%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ ++  GR V+A+ ++  G+TL +E P+  V+LP      C+ C     +   C
Sbjct: 9   AKKCEIKQNDTLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCSKCFKLTQS--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
            +C L   C +        CRR A + +   +    L+  E   I  + LK +L+  H +
Sbjct: 65  RKCHLLALCEDCSNHDERDCRRLAEMHFSADQVE-LLQRKEHTEIQPV-LKCLLLREHEE 122

Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
               Y+E+  +E+ L   R  D+++
Sbjct: 123 TLPLYEEMSQMESQLATRRGTDIWK 147


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score = 40.4 bits (93), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 50/123 (40%), Gaps = 12/123 (9%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
           KGR  +A   +  GD + +   V     P    S+C  C         C+ C  A YCS 
Sbjct: 12  KGRCAVASTSLRAGDRVLITSAVCAASSP----SSCGWCFAPGDVFSRCSACRKARYCSR 67

Query: 149 SCRREAWLRYHRWECHGALRLLE------AVGIAHLALKLILVSSHSDRYKE-VYHLETH 201
           +C++  W + HR EC     + E       + +A LA KL L S      K  V  L  H
Sbjct: 68  ACQQRDWPQ-HRHECAAWRSIPERNPSPTVLLVARLAAKLFLGSQVDQEEKNGVLKLRDH 126

Query: 202 LQD 204
           L D
Sbjct: 127 LAD 129


>gi|326471930|gb|EGD95939.1| hypothetical protein TESG_03400 [Trichophyton tonsurans CBS 112818]
          Length = 1217

 Score = 40.4 bits (93), Expect = 0.54,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 97   EDVHKGDTLFVEKPVAFVILPPCSMSNCN-HCCTSISAPIPCNECILAVYCSESCRREAW 155
            +D  +G +  +E  +  + L   S SNC      S +  + C+ C +A YCS++C+RE W
Sbjct: 1143 KDTKEGVSQKIESLLQLMDLKKGSCSNCGGKKPGSGTKLLRCSGCKVAEYCSKNCQREDW 1202

Query: 156  LRYHRWEC 163
            +  H++ C
Sbjct: 1203 MARHKFAC 1210


>gi|410914273|ref|XP_003970612.1| PREDICTED: SET and MYND domain-containing protein 5-like [Takifugu
           rubripes]
          Length = 417

 Score = 40.0 bits (92), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 31/111 (27%), Positives = 45/111 (40%), Gaps = 32/111 (28%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
           N KG+ + A     KGDT+F+E+P+    F+         C +C                
Sbjct: 29  NIKGKGLFAKRSFRKGDTIFIERPLVSAQFLWNSLYKYRACEYCLRALETAEENARRLSG 88

Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGA 166
              +S P P            C +C + +YCS  CR+ A  +YHR  C GA
Sbjct: 89  VPGLSLPHPELCCVRPELHQCCPQCQV-MYCSSECRQAAADQYHRVLCLGA 138


>gi|390603364|gb|EIN12756.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 450

 Score = 40.0 bits (92), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 30/69 (43%)

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREA 154
           A      GD + V++ +   ILP  +   CNHC    S    C  C    YCS  C+ + 
Sbjct: 24  ATRSFAPGDVILVDEALTTAILPSDAGRRCNHCLRLTSDLKRCAGCAAYHYCSTQCQSQQ 83

Query: 155 WLRYHRWEC 163
           W  +H+  C
Sbjct: 84  WSIHHKRIC 92


>gi|169864358|ref|XP_001838788.1| hypothetical protein CC1G_08952 [Coprinopsis cinerea okayama7#130]
 gi|116500120|gb|EAU83015.1| hypothetical protein CC1G_08952 [Coprinopsis cinerea okayama7#130]
          Length = 617

 Score = 40.0 bits (92), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C++C   VYCS+ C+RE W R HR+EC
Sbjct: 456 CSQCHSVVYCSDECQREDWKRVHRFEC 482


>gi|392567641|gb|EIW60816.1| hypothetical protein TRAVEDRAFT_71094 [Trametes versicolor
           FP-101664 SS1]
          Length = 316

 Score = 40.0 bits (92), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 10/91 (10%)

Query: 116 LPPCSMSNCNHCCTSISAPIP-------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
           LPP  ++ C HC       +P       C  C  A+YCS +C+R AW R H+ +C     
Sbjct: 16  LPPPCLTRCAHC-NKRERQLPAGAKLKRCKGCTGAMYCSRACQRAAWPR-HKNKCGSVGS 73

Query: 169 LLEAVGIAHLALKLILVSSHSDRYKEVYHLE 199
             +A G A  A    LV +  D   ++ HL+
Sbjct: 74  CYQAEGCAGYASSAALVEAMHDWVDDI-HLQ 103


>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
 gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
          Length = 392

 Score = 40.0 bits (92), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 34/124 (27%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSI---- 131
           N  +++    KGR +IA ++    + +F E+P     F        + C+HC   +    
Sbjct: 2   NHFEIREIPGKGRAMIATKNFAANEVIFEEEPFVSSQFSWNTAYGYAACDHCMRPLETLL 61

Query: 132 -------SAP---IP----------------CNECILAVYCSESCRREAWLRYHRWECHG 165
                  S P   +P                C  C +  YCSE C  EA  RYHR  C G
Sbjct: 62  ENVRRLASDPKVEVPLLQHDPTAAWVAQFTQCPRCKVR-YCSEDCLMEAQKRYHRVACMG 120

Query: 166 ALRL 169
           A RL
Sbjct: 121 AFRL 124


>gi|170067052|ref|XP_001868327.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863295|gb|EDS26678.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 288

 Score = 40.0 bits (92), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 7/74 (9%)

Query: 88  AKGRHV------IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNEC 140
           AKG+ V      +A +D + GD +   +P+   I       NC+HC    S + IPC   
Sbjct: 92  AKGKKVDHRRILVAEKDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGS 151

Query: 141 ILAVYCSESCRREA 154
           +  +YC E CR+++
Sbjct: 152 VFFMYCGEECRQKS 165


>gi|302682538|ref|XP_003030950.1| expressed protein [Schizophyllum commune H4-8]
 gi|300104642|gb|EFI96047.1| expressed protein, partial [Schizophyllum commune H4-8]
          Length = 423

 Score = 40.0 bits (92), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---C 137
           I++Q        V+A+ D  +G  +F +  +A  +L       C+  CT +++  P   C
Sbjct: 1   IQLQPHPRARDRVVAITDFIEGSVIFGQYALATALLSTEKGRRCD-ACTRLASDKPLRRC 59

Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
           + C    YC E C+R  W R H+  C
Sbjct: 60  SGCAEYYYCDEHCQRYHWQRNHKRIC 85


>gi|255732233|ref|XP_002551040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240131326|gb|EER30886.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 474

 Score = 40.0 bits (92), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 44/82 (53%), Gaps = 18/82 (21%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS----MSN---CNHC--------CTS 130
           S+ KG+ + A +D+ KGD ++ E+P+ FV  PP +    M N   C +C         ++
Sbjct: 114 SSKKGKGLYAKKDIKKGDLMWSEEPLFFV--PPLANVHLMKNASACTYCGKLLQQSEASA 171

Query: 131 ISAPIPCNECILAVYCSESCRR 152
           +   + CN C   V+CS+ C++
Sbjct: 172 VLKGLDCNVCA-EVWCSKKCKQ 192


>gi|118777349|ref|XP_001230586.1| AGAP009449-PA [Anopheles gambiae str. PEST]
 gi|116132908|gb|EAU77762.1| AGAP009449-PA [Anopheles gambiae str. PEST]
          Length = 257

 Score = 40.0 bits (92), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 2/63 (3%)

Query: 107 VEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
           ++ P    +   C    C+ C        IPC  C   +YCS+ C  +A+ +YHR+EC G
Sbjct: 2   LDTPFVKTLQEDCRYVRCDFCHAERPFTLIPCEGCTWVMYCSQECLSKAFDQYHRYEC-G 60

Query: 166 ALR 168
            +R
Sbjct: 61  VMR 63


>gi|326923979|ref|XP_003208210.1| PREDICTED: tudor domain-containing protein 1-like [Meleagris
           gallopavo]
          Length = 946

 Score = 40.0 bits (92), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 12/104 (11%)

Query: 62  SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI------ 115
           +LP    G S        K+ +  +N  G  V+  + V K   L       F I      
Sbjct: 4   ALPVKDAGSSMESTATRRKLPLTDTNDNGEMVLHSDSVEKPKKLQYASKNVFSIDYDKSL 63

Query: 116 ----LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAW 155
                PP  M  C+HC   +S  + C++C    YCS  C++  W
Sbjct: 64  FNTLTPPPQMRTCHHC--GLSGSLRCSQCKQIYYCSVDCQKRDW 105


>gi|326930121|ref|XP_003211200.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Meleagris gallopavo]
          Length = 1010

 Score = 40.0 bits (92), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGAL 167
           C  C  S+   + PC  C     CSE+CRR++W   H  EC G L
Sbjct: 797 CYQCGRSVGVQLSPCKHCYEVFTCSEACRRKSWNERHSRECSGLL 841


>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
           ND90Pr]
          Length = 529

 Score = 39.7 bits (91), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 8/85 (9%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI--AHLA-LKLILVSSHS---D 190
           C  C    YCS++C++EAW R HR EC   L+ +   G+  A LA ++L+    H    D
Sbjct: 94  CAACQRFRYCSKACQKEAWNRGHRHECK-ILKPMAGRGLPKAFLASMELLTRRKHGLIPD 152

Query: 191 RYKEVY-HLETHLQDMRPEDLYQYV 214
           +  E+   L +H+ D +    YQ +
Sbjct: 153 KEWELLCQLPSHIDDFKRNGTYQNI 177


>gi|299754705|ref|XP_001828134.2| hypothetical protein CC1G_12282 [Coprinopsis cinerea okayama7#130]
 gi|298410885|gb|EAU93684.2| hypothetical protein CC1G_12282 [Coprinopsis cinerea okayama7#130]
          Length = 580

 Score = 39.7 bits (91), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 26/62 (41%), Positives = 32/62 (51%), Gaps = 12/62 (19%)

Query: 107 VEKPVAFVILPPCSMSNCNHCCTSIS---APIP-CNECILAVYCSESCRREAWLRYHRWE 162
           VEKPV       C   N NH  T +S    PI  C+ C   VYCS +C +E W R H+ E
Sbjct: 411 VEKPVL------CD--NSNHLSTRVSRKRGPIKSCSRCRTVVYCSVACHKEDWGRRHQNE 462

Query: 163 CH 164
           C+
Sbjct: 463 CN 464


>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
           Japonica Group]
 gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
          Length = 392

 Score = 39.7 bits (91), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
           A + ++      +GR ++A   + +G+ +  E+P+   +L P S+++    C+ C  S+S
Sbjct: 2   AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLASLPSFCSACFRSLS 58

Query: 133 API-PCNECILAVYCSESC 150
           A   PC  C  A +CS SC
Sbjct: 59  AAASPCPSCRAAGFCSPSC 77


>gi|402220572|gb|EJU00643.1| hypothetical protein DACRYDRAFT_108715 [Dacryopinax sp. DJM-731
           SS1]
          Length = 400

 Score = 39.7 bits (91), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 32/121 (26%), Positives = 47/121 (38%), Gaps = 17/121 (14%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP----------PCSMSNC 124
           M  +  + +  S  KGR +     +  G  +    P A  +LP          PCS+   
Sbjct: 1   MAPTPLLNIVHSPTKGRCIYPTTPLSSGAAVLHALPYALGVLPSHRKRLCAYLPCSLPTT 60

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
           ++  TS      C+ C    YCS  C+R      H W C G  RL       H+A  ++L
Sbjct: 61  SYYTTS------CSGCGAVFYCSIDCKRGN-EEKHSWICAGLRRLKGWAKDRHIAGMMVL 113

Query: 185 V 185
           V
Sbjct: 114 V 114


>gi|166240380|ref|XP_001733010.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|165988569|gb|EDR41059.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 513

 Score = 39.7 bits (91), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 33/79 (41%), Gaps = 22/79 (27%)

Query: 97  EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWL 156
           E + K   LF+EKP +F          C+  C               VYCSE+C+ EAW 
Sbjct: 225 EKMKKQKELFIEKP-SFT---------CDKNCK------------FEVYCSENCKNEAWK 262

Query: 157 RYHRWECHGALRLLEAVGI 175
           +YH   C     +L+  G 
Sbjct: 263 QYHNLLCPNYDLILKLYGF 281


>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
 gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 537

 Score = 39.7 bits (91), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 42/100 (42%), Gaps = 16/100 (16%)

Query: 90  GRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------- 135
           GR V + E    G+T+  +E P+  +       + CN+C   +S  I             
Sbjct: 23  GRTVFSTERFGAGETIAILENPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVKAGPRTC 81

Query: 136 -PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
             C  C  AVYC+  C+R+ W   H+ EC    R+ E  G
Sbjct: 82  KACTGCKAAVYCNAECQRQHWKLVHKAECKMFKRIRERTG 121


>gi|66809987|ref|XP_638717.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854407|sp|Q54Q80.1|Y4059_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0284059
 gi|60467339|gb|EAL65370.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1280

 Score = 39.7 bits (91), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 38/88 (43%)

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
           +  S+  GR   A + +     L+ E+P    +        C +C   I +PI C EC  
Sbjct: 492 LYQSDLVGRISEASDFIPSNTVLYQEEPYVSCLDRNYHSQYCYNCFKEILSPIYCKECSN 551

Query: 143 AVYCSESCRREAWLRYHRWECHGALRLL 170
           + YCS  C  E +++ H  EC     ++
Sbjct: 552 SQYCSNKCLNEDYVKQHGRECGKGFLII 579


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 449

 Score = 39.7 bits (91), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 2/106 (1%)

Query: 79  NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
           N + +Q     G  +    D+ +G+ L  ++   +V+        C++CC+       C 
Sbjct: 3   NLVSVQEEPGAGLGLGVTADIERGECLLEKRLDLYVLNNELLNEACSYCCSQTKPTKRCA 62

Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
            C    YCS+ C+++ W   H+ EC  ALR     G+     +L++
Sbjct: 63  ACKQVHYCSKICQKQDW-SMHKLECK-ALRNASTNGLLPTVCRLLI 106


>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
           rubripes]
          Length = 434

 Score = 39.7 bits (91), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
            S  KGR + AV     G+ +F     ++V+      + C  C T       C +C  A 
Sbjct: 13  VSTDKGRGLRAVRHFSVGELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAY 72

Query: 145 YCSESCRREAWLRYHRWEC 163
           YC+  C+R  W   H+ EC
Sbjct: 73  YCNVDCQRGDW-SMHKMEC 90


>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
          Length = 433

 Score = 39.7 bits (91), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 1/79 (1%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
            S  KGR + AV     G+ +F     ++V+      + C  C T       C +C  A 
Sbjct: 13  VSTDKGRGLRAVRHFSVGELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAY 72

Query: 145 YCSESCRREAWLRYHRWEC 163
           YC+  C+R  W   H+ EC
Sbjct: 73  YCNVDCQRGDW-SMHKMEC 90


>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
 gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
          Length = 463

 Score = 39.7 bits (91), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 1/86 (1%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           ++ +++ +S  KGR + A +    G+ +    P A+V+        C+ C         C
Sbjct: 2   ADGVEIFSSAEKGRGLCATKVFKPGNLVRAADPYAYVLCNSERGKRCDFCFARKDDMSRC 61

Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
           + C  A YC   C++ AW   H+ EC
Sbjct: 62  SGCKFARYCDGKCQKAAWTE-HKSEC 86


>gi|170056789|ref|XP_001864189.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876476|gb|EDS39859.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 508

 Score = 39.7 bits (91), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 57/127 (44%), Gaps = 4/127 (3%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S+  GR+++A  D+ +G+ + VE+P   V+ P      C   C   S  I C  C  A  
Sbjct: 13  SDRLGRYLVAKRDLARGELILVERPT--VVGPHWDSDVCCLNCFRNSCTI-CRLCRRAPL 69

Query: 146 CSE-SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
           C + +   EA  R++R        L     +      L+L  +  DR+ E+  +E HL+ 
Sbjct: 70  CYDCTGHDEAECRFYRESNLDINFLFNHFNVVTSVRCLLLYRTDRDRFWEMRRMEAHLEK 129

Query: 205 MRPEDLY 211
            R  +++
Sbjct: 130 RRGSEIW 136


>gi|299744348|ref|XP_001840718.2| hypothetical protein CC1G_09769 [Coprinopsis cinerea okayama7#130]
 gi|298406082|gb|EAU81127.2| hypothetical protein CC1G_09769 [Coprinopsis cinerea okayama7#130]
          Length = 723

 Score = 39.7 bits (91), Expect = 0.87,   Method: Composition-based stats.
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 132 SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           +  + C  C   VYCSE CR E W  +HR EC
Sbjct: 556 TGAVICATCEAQVYCSEECRTEDWDEFHRNEC 587


>gi|291226945|ref|XP_002733450.1| PREDICTED: MORN repeat containing 1-like [Saccoglossus kowalevskii]
          Length = 1129

 Score = 39.7 bits (91), Expect = 0.87,   Method: Composition-based stats.
 Identities = 25/91 (27%), Positives = 38/91 (41%), Gaps = 21/91 (23%)

Query: 85  TSNAKGRHVI---AVEDVH--------KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA 133
           TS   G+ V+     +D H        + D ++V KP+            C  C  SI  
Sbjct: 904 TSGEDGKQVVFHSIAKDSHGQEYPLLDRNDDIYVRKPL---------FKYCYECGRSIGV 954

Query: 134 PIP-CNECILAVYCSESCRREAWLRYHRWEC 163
            +  C  C    YCS++C+ +AW   H+ EC
Sbjct: 955 RLSACTRCKEVYYCSKACKLKAWNARHKEEC 985


>gi|195146441|ref|XP_002014193.1| GL22989 [Drosophila persimilis]
 gi|194103136|gb|EDW25179.1| GL22989 [Drosophila persimilis]
          Length = 709

 Score = 39.7 bits (91), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 43/104 (41%), Gaps = 4/104 (3%)

Query: 63  LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
           +P    G S  +  AS+  + + +      V A  ++ KG  + V+ PV F    P    
Sbjct: 211 VPFNGFGGSKKIPFASSACEYKRTMNGPAGVFARVNIPKGKIILVDTPVYFQFSAP--FL 268

Query: 123 NCNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHG 165
           NC  C          C+ C    YC+++C    W   H+ EC+G
Sbjct: 269 NCEMCGVHQELVFHTCSRCRYKTYCTQTCMELDW-EIHQTECYG 311


>gi|440802660|gb|ELR23589.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 669

 Score = 39.7 bits (91), Expect = 0.90,   Method: Composition-based stats.
 Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           C +C  ++  P+ C+ C   VYCS +C+R+ W + H+  C
Sbjct: 620 CWNCKRTLLKPLQCSRCQSVVYCSRTCQRDDWAK-HKVAC 658


>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
          Length = 475

 Score = 39.7 bits (91), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 1/83 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           ++++ +   GR + + ++   G  +    P    +        C  CC      + C  C
Sbjct: 5   VEIRNTKEAGRGLYSTKNFETGMNILNSTPYVHTLALKYHGVRCAQCCVFKPKLLRCAGC 64

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
            L  YCS+ C++ AW + H+ EC
Sbjct: 65  KLQYYCSKDCQKSAW-KIHKEEC 86


>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
 gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
          Length = 453

 Score = 39.7 bits (91), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 33/135 (24%), Positives = 50/135 (37%), Gaps = 29/135 (21%)

Query: 99  VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
            + G T+  + P   V+      + C+ C     A   C+ C    YCS +C+R  W + 
Sbjct: 1   FYPGQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALKRCSACKFVFYCSPTCQRSQW-KI 59

Query: 159 HRWECHGALRL--------------------LEAVGIAHLALKLILVSSHSDRYKEVYHL 198
           H+ EC   +RL                    L+A G        +L  +  D Y+ V  L
Sbjct: 60  HQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATG--------VLPVTALDNYEIVRAL 111

Query: 199 ETHLQDMRPEDLYQY 213
            TH  +   E L  Y
Sbjct: 112 PTHFSETGDERLVMY 126


>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
 gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
          Length = 555

 Score = 39.3 bits (90), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 86  SNAKGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTS---------ISAPI 135
           S   G  + A +D+  G+T+  ++KP   V+      S C+ C  +         + A  
Sbjct: 66  SPGSGTGLFASKDIQPGETVVLIQKPFVAVLDTAQLESKCSGCLGAHANRQEAVELKA-- 123

Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYK 193
            C  C +  YC+++C+ + W  +H  EC     L   V    A   L+++++S    RY 
Sbjct: 124 -CTGCRVVKYCNKTCQAKDWKLFHSLECRIFQNLKPRVLPNNARAILRMVMLSERG-RYP 181

Query: 194 ----EVYH-LETHLQDM 205
               E++  LE+H++D+
Sbjct: 182 AGEMELFKGLESHMKDV 198


>gi|167521854|ref|XP_001745265.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776223|gb|EDQ89843.1| predicted protein [Monosiga brevicollis MX1]
          Length = 630

 Score = 39.3 bits (90), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 4/69 (5%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISAPIPCNECILAVY 145
           KGR +     +  GD L  E+  A ++ P  +  N   C+ C     AP+PC  C   V+
Sbjct: 152 KGRALFTSTPLAAGDVLLRERAFASILAPTTTQENYLRCHECLDESWAPLPCPNCPR-VF 210

Query: 146 CSESCRREA 154
           CS  C  +A
Sbjct: 211 CSTECHAKA 219


>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
           harrisii]
          Length = 609

 Score = 39.3 bits (90), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 42/171 (24%), Positives = 65/171 (38%), Gaps = 39/171 (22%)

Query: 32  LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLAC--GESHTLMCASNKIKMQTSNA- 88
           L++ +  +    A+ +  S      S    +LP  AC  G     + A  K +   S   
Sbjct: 162 LSLSRPPDGGASALRIPRSPQPPWESPRRRALPQ-ACVQGGPGCSLGAPAKPRESGSPGP 220

Query: 89  ---KGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA--------- 133
              +G+ + A   V KG+T+FVE+P+    F+         C+HC  ++           
Sbjct: 221 LALQGKGLFATRPVQKGETIFVERPLVASQFLWNALYRYRACDHCLRALETAEENAQRLS 280

Query: 134 -------PIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
                  P P            C  C +A YCS  CR+ A  +YHR  C G
Sbjct: 281 GRTCGALPHPEQCSIRKELHQLCPHCQVA-YCSAECRQAALEQYHRALCLG 330


>gi|194756634|ref|XP_001960581.1| GF11433 [Drosophila ananassae]
 gi|190621879|gb|EDV37403.1| GF11433 [Drosophila ananassae]
          Length = 497

 Score = 39.3 bits (90), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 20/147 (13%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ +   GR VIA+ ++  G+TL +E+P+  ++LP      C+ C         C
Sbjct: 9   AKKCQVKQNETLGRFVIALGNIRAGETLLLEEPI--LVLPHAGDRRCSKCFKLTEE--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
            +C L   C +        C+R + L+    +    + LL+    + +  ALK IL+  H
Sbjct: 65  RKCHLLALCGDCSEHDERDCKRLSDLKLSEEQ----VLLLQEKKHSEVQSALKCILLREH 120

Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
            +    Y+E+  +E  L   R  D+++
Sbjct: 121 LETQPLYEEISQMENQLTARRGTDIWK 147


>gi|47226174|emb|CAG08321.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 417

 Score = 39.3 bits (90), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 43/110 (39%), Gaps = 32/110 (29%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
           N KG+ + A     KGDT+F E+P+    F+         C +C                
Sbjct: 29  NVKGKGLFAKRSFRKGDTIFTERPLVSAQFLWNSLYKYKACEYCLHALETAEENARRLSG 88

Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
              +S P P            C +C + +YCS  CR+ A  +YHR  C G
Sbjct: 89  VPGLSLPYPELCRVRPELHQACPQCQV-MYCSSECRQAAADQYHRVLCLG 137


>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
          Length = 420

 Score = 39.3 bits (90), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 1/95 (1%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   ++   S  KGR + A+   H GD LF     A V+       +C  C         
Sbjct: 5   ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           C  C  A YC   C++E W   H+ EC   + L E
Sbjct: 65  CGRCKQAFYCDVECQKEDW-PLHKLECSSMVVLGE 98


>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 311

 Score = 39.3 bits (90), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 3/97 (3%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILAVYCSESCRREAWLRYH 159
           KGD +    P ++          C+ C ++  A I  C  C +  YC + C+R +W   H
Sbjct: 5   KGDEILKCYPFSYAFREEHQHKVCDFCFSTGQAIISRCARCCVIYYCGKQCQRSSWKETH 64

Query: 160 RWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
           + EC   L++L+ +   H+ L LI+ +    RY   Y
Sbjct: 65  KHECK-YLKMLD-IKEPHMDLLLIIRTLCKLRYDGGY 99


>gi|157818831|ref|NP_001099672.1| tudor domain-containing protein 1 [Rattus norvegicus]
 gi|149040467|gb|EDL94505.1| tudor domain containing 1 (predicted) [Rattus norvegicus]
          Length = 1173

 Score = 39.3 bits (90), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/155 (25%), Positives = 64/155 (41%), Gaps = 26/155 (16%)

Query: 18  RLQDTSHLDFKEDVLTIEKS-------EENSVEAMSMQVSSTSSSASSSVASLPSLACGE 70
           RL+   + + K + L  E++       E+NSV  +S   +  +     S     +L  G 
Sbjct: 46  RLKSPENGNKKNNFLLCEQNKQYLASQEDNSV--VSSNPAGINGEVGGSKGDRKTLPTGN 103

Query: 71  SHTLMCASN----KIKMQTSNAKGRHVIAVEDVHKGDTLF-----VEKPVAFVIL-PPCS 120
             +L   +N    ++K + SNA     +  E   K   LF     V  P  F  L PP  
Sbjct: 104 PVSLSLGNNSPPNQVKTKPSNA-----VTPEKSKKSHKLFENALSVNNPALFNSLGPPLR 158

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAW 155
            + C+ C   +   + C++C    YCS +C+R  W
Sbjct: 159 STTCHRC--GLFGSLRCSQCKQTYYCSTACQRRDW 191


>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
 gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
          Length = 430

 Score = 39.3 bits (90), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 58/120 (48%), Gaps = 16/120 (13%)

Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
           T+  E+P+A V+ P  S   C  C   I +      + C++C+   YC+  C+R+ W   
Sbjct: 3   TVAREQPLAAVLSPQFSELYCATCFLEIDSSQETEILTCDDCLAVSYCTLKCQRKDWKSC 62

Query: 159 HRWECHGALRLLEAVGIAHLALKL---ILVSSHSDRYKE-----VYHLETHLQDMR--PE 208
           H++EC   LR   +     L +KL   +L++S S +        +  LET+ ++ R  PE
Sbjct: 63  HQFECE-ILRCQGSSTPMTLTMKLCIRVLLASRSTQTPSFNGAVLEDLETNYKEFRSSPE 121


>gi|400600152|gb|EJP67843.1| Wd tetratricopeptide repeat domain-containing protein [Beauveria
           bassiana ARSEF 2860]
          Length = 564

 Score = 39.3 bits (90), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 18/134 (13%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           ++ ++ S+++++  +   +   +  VA+LP L      T   +++ +++     K   V 
Sbjct: 201 DRFDQESLDSLNADLKKMAPKCAIKVATLPVLL----ETASSSTDSLEI-IPTCKQLGVF 255

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSISA------PIPCNECILAVY 145
           A ED+  G+ +  E    + +L   +    S C+ C   +        P+ C+EC   V+
Sbjct: 256 AEEDIEPGEAVLRE----YTLLTANNRLKDSACDACSCDLPPIGSEQEPVQCDECYDTVF 311

Query: 146 CSESCRREAWLRYH 159
           CS+ C  EA  RYH
Sbjct: 312 CSQYCHDEAQERYH 325


>gi|443920735|gb|ELU40589.1| SET domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 380

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 4/86 (4%)

Query: 79  NKIKMQTSNAKGR-HVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           N +    S+++ R   +A   +  G+ + VE P+A  I P      C+ C     +   C
Sbjct: 8   NGVFQLKSDSRARSRAVAASPLEAGNIVLVETPLAVAIHPTYKGRRCDGCLREFDSLQKC 67

Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
           + C +  YC  S    +W R+HR  C
Sbjct: 68  SGCGVYWYCGVSA---SWKRHHRRLC 90


>gi|260834511|ref|XP_002612254.1| hypothetical protein BRAFLDRAFT_100073 [Branchiostoma floridae]
 gi|229297629|gb|EEN68263.1| hypothetical protein BRAFLDRAFT_100073 [Branchiostoma floridae]
          Length = 143

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 34/74 (45%), Gaps = 3/74 (4%)

Query: 81  IKMQTSNAKGRHVIA---VEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           +++ T   KG+ V A      + +G  +  E P  F +    S   C+ C    + P+ C
Sbjct: 6   VELFTEPGKGKGVRARTVGRGIRQGSLIIAEDPYIFSLEVEESKRRCHSCLQKKTVPLVC 65

Query: 138 NECILAVYCSESCR 151
             C  A YCSE+C+
Sbjct: 66  YGCKYARYCSETCQ 79


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 14/131 (10%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
           GRH+IA  D+  GD +  EKP+  V+ P  +    C  C + +     C++C   +   +
Sbjct: 53  GRHLIACRDLKAGDVILQEKPI--VMGPKHTAGQICLGCYSGVDGRTRCSQCGWPMCGRD 110

Query: 149 SCRREAWLRYHRWECH----GALRLLEAVGI----AHLALK-LILVSSHSDRYKEVYHLE 199
            C   A    H  EC     G   ++ ++ +    + + L+ L L   + +R+ E+  LE
Sbjct: 111 DC--HAHESDHAAECGVMASGGRPIVGSLPVQAYQSVMVLRCLALRDQNPERWDELMQLE 168

Query: 200 THLQDMRPEDL 210
            H+Q+ R + +
Sbjct: 169 AHVQERRQKGM 179


>gi|170042649|ref|XP_001849030.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167866157|gb|EDS29540.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 822

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 123 NCNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
           NC+HC     ++ IPC  C+  +YC E CR+++W
Sbjct: 201 NCSHCGVKFRNSLIPCPGCVFFMYCGEECRQKSW 234


>gi|323455804|gb|EGB11672.1| hypothetical protein AURANDRAFT_61652 [Aureococcus anophagefferens]
          Length = 859

 Score = 39.3 bits (90), Expect = 1.2,   Method: Composition-based stats.
 Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 1/62 (1%)

Query: 110 PVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           P A ++  P  + +C+ C         C+ C    YCS  C+R AW R HR  C    +L
Sbjct: 143 PAASLLSLP-RVQHCDGCGLRQPGMARCSRCRHVAYCSNGCQRNAWARRHRKTCAELAKL 201

Query: 170 LE 171
            E
Sbjct: 202 TE 203


>gi|401415234|ref|XP_003872113.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488335|emb|CBZ23582.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 494

 Score = 39.3 bits (90), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 20/91 (21%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
           GR + A   +   +++  E P+   ++  C  + C HC   +SA                
Sbjct: 186 GRGLYATRHIQPRNSILCESPL---LVQRCDDTKCAHCLAPLSARSGSSITNTGVSDPLA 242

Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
             + C  C    YCSE CR  AW +YH   C
Sbjct: 243 LGVACPHCEQETYCSEDCRDAAWEQYHVCSC 273


>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 450

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 1/95 (1%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   ++   S  KGR + A+   H GD LF     A V+       +C  C         
Sbjct: 5   ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
           C  C  A YC   C++E W   H+ EC   + L E
Sbjct: 65  CGRCKQAFYCDVECQKEDW-PLHKLECSSMVVLGE 98


>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
 gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Cofactor Product Adohcy
 gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Methyltransferase Inhibitor
           Sinefungin
 gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 433

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 35/86 (40%), Gaps = 1/86 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+   H GD LF     A V+       +C  C         C  C  A Y
Sbjct: 14  SAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWECHGALRLLE 171
           C   C++E W   H+ EC   + L E
Sbjct: 74  CDVECQKEDW-PLHKLECSSMVVLGE 98


>gi|344299040|ref|XP_003421196.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1
           [Loxodonta africana]
          Length = 868

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C+ C  SI   + PC  C   + CS+ C+  AW  +H+ +C  AL ++
Sbjct: 815 CHQCGRSIGVRLSPCTRCYGILTCSKFCKTRAWNEFHKRDCGAALTMM 862


>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
 gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
          Length = 427

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 6/68 (8%)

Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE-----VYHLETHL 202
           ++C+R AW  +H+ EC   L    A+G   LA+  +L+     R+ E     +  LE+H 
Sbjct: 15  KACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRLLLLKKDGRFTERQWKAIMQLESHF 74

Query: 203 QDMRPEDL 210
           ++   EDL
Sbjct: 75  EEQN-EDL 81


>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
 gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
          Length = 433

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A+   H GD LF     A V+       +C  C         C  C  A Y
Sbjct: 14  SAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFY 73

Query: 146 CSESCRREAWLRYHRWEC 163
           C   C++E W   H+ EC
Sbjct: 74  CDVECQKEDW-PLHKLEC 90


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 2/86 (2%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           ++     +GR + A  D+ +G+++  E+    V+L     + C  C  +    I C++C 
Sbjct: 9   RVDRDAVRGRMLRATRDLQEGESVLSERAYGNVVLSQHRATLCAVCLCAADPDICCDDCS 68

Query: 142 LAVYCSESCRREAWLRYHRWECHGAL 167
              +CS++C +E     H  EC GAL
Sbjct: 69  KVFFCSDAC-QEKLQDVHEKEC-GAL 92


>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
 gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
          Length = 544

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/142 (22%), Positives = 62/142 (43%), Gaps = 14/142 (9%)

Query: 86  SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
           S   G  + A  ++  G T + +++P   V+        C+      HC   ++  +  C
Sbjct: 65  SPGAGTGIFAATNIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
             C +  YC+++C+ + W   H +EC     L+  V    A   L++IL S      KE 
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFVHSFECRVYKELMPRVLPSNARAILRMILRSERGKYPKEE 184

Query: 195 ---VYHLETHLQDMRPEDLYQY 213
                 LE+H+ +++ ++  Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206


>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
           B]
          Length = 604

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 58/151 (38%), Gaps = 24/151 (15%)

Query: 76  CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
           C  + ++++     GR +I  E    G TL   KP   V+      S C+ CC     P 
Sbjct: 74  CLPSYLEVRQVRDAGRGIITKERQKAGSTLIATKPHVTVLSTTYLDSYCSACC----GPA 129

Query: 136 P------CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL--------- 180
           P      C  C +  YC   C+   W   H+ EC    R   +     +A+         
Sbjct: 130 PETGLKRCTRCRVVWYCDSCCQNNDW-PIHKDECIAIQRWASSAPSPDVAIPSDAIRCLG 188

Query: 181 KLILVSSH----SDRYKEVYHLETHLQDMRP 207
           +++L   H    S   KE+  +++H   M+P
Sbjct: 189 RILLTQQHRGLDSIWSKEINAMQSHRSSMQP 219


>gi|449686608|ref|XP_002163562.2| PREDICTED: SET and MYND domain-containing protein 5-like [Hydra
           magnipapillata]
          Length = 395

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 52/122 (42%), Gaps = 33/122 (27%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSI 131
           M AS+ + ++ + +KG+ + A   + KGDT+  EKP+           N   C++C  S+
Sbjct: 6   MLASDNVYVKETLSKGKGLFAASAIKKGDTILTEKPLVLCQFSWNRQYNYVACDYCMRSL 65

Query: 132 SAP--------------IPCNE-CILAV---------------YCSESCRREAWLRYHRW 161
                            +P +E C L+V               +CS+ C  EA+ +YH+ 
Sbjct: 66  ETAQNMARRLAADYTLELPYHEQCSLSVQRVNSIYKCPNCCIPFCSKECYSEAYEKYHKS 125

Query: 162 EC 163
            C
Sbjct: 126 LC 127


>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
           [Strongylocentrotus purpuratus]
          Length = 396

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 41/105 (39%), Gaps = 30/105 (28%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPV--AFVILPPC-----------SMSNCNHCCTSISAP- 134
           KGR + A  D+  G  LF EKP+  A  +   C           SM N       +SA  
Sbjct: 16  KGRGLFASRDIAAGTVLFEEKPLVCAQFLWNECYHYEACEYCMRSMENAEEMARRLSANP 75

Query: 135 ---IPCNECILA-------------VYCSESCRREAWLRYHRWEC 163
              +PC EC  A              YC   CR +AW +YH+  C
Sbjct: 76  SLVLPCMECCKAEKQNFVHCPHCRISYCCSECREQAWNQYHQVLC 120


>gi|170097029|ref|XP_001879734.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164645137|gb|EDR09385.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 435

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 18/126 (14%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQT-SNAKGRHV 93
           EK     ++   + +SSTS   S+ V   PS        +   +NK+++      KG+ +
Sbjct: 44  EKRTRKILQNEGLVLSSTSIKRSTIV--YPSSRVIPQLDVTQWTNKVEVTFYDQQKGKGL 101

Query: 94  IAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHCCTSI------SAPIPCNEC 140
           IA  D+ +G+ ++ E P  F+I P   + +       C +C T +       +PIPC   
Sbjct: 102 IATSDIEEGEVIWKEDP--FIIAPEWDLFDLQLKSTACAYCTTPLVDSPIAGSPIPCPAS 159

Query: 141 ILAVYC 146
           I + +C
Sbjct: 160 ISSAFC 165


>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Strongylocentrotus purpuratus]
          Length = 319

 Score = 38.9 bits (89), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 12/103 (11%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-------APIPCNECI 141
           KGR + AV  +  G  +  ++P  +++        C HC  ++S           C  C 
Sbjct: 14  KGRGLRAVSTIQAGTCVLEDEPYVYILNEEQLQLRCGHCLRALSLGNTSEVESKNCKSCK 73

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
             VYC+++C + A  + HR+EC    +LL+      + L+L++
Sbjct: 74  RVVYCNKAC-KVAGRKEHRYEC----KLLQGKEKDDVTLRLLM 111


>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
           FGSC 2508]
          Length = 668

 Score = 38.9 bits (89), Expect = 1.4,   Method: Composition-based stats.
 Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 16/101 (15%)

Query: 89  KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
           +GR + + E    G+T+  ++ P+  +       + CN+C   +S  I            
Sbjct: 22  RGRSLFSTERFGAGETIAIIDNPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGPRT 80

Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
              C  C  AVYC+  C+R  W   H+ EC    R+ E  G
Sbjct: 81  CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIKERTG 121


>gi|195335121|ref|XP_002034224.1| GM21751 [Drosophila sechellia]
 gi|194126194|gb|EDW48237.1| GM21751 [Drosophila sechellia]
          Length = 498

 Score = 38.9 bits (89), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 68/147 (46%), Gaps = 20/147 (13%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ ++  GR V+A+ ++  G+TL +E P+  V+LP      C+ C     +   C
Sbjct: 9   ARKCEIKHNDMLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCSKCFNLTES--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
            +C L   C +        C+R A + +        + LL+      +   LK +L+  H
Sbjct: 65  RKCRLLALCEDCSYHDERDCKRLAEMNFS----DDQVELLQKKEHTEIQPVLKCLLLREH 120

Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
            +    Y+E+  +E+ L   R  D+++
Sbjct: 121 EETLPLYEEMSQMESQLMTRRGTDIWK 147


>gi|299741263|ref|XP_001834348.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
 gi|298404633|gb|EAU87325.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
          Length = 434

 Score = 38.9 bits (89), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 17/107 (15%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHK----GDTLFVEKP-----------VAFVILPPCSMS 122
             K + +T N   R     ED H     G  L++ KP           V  +IL  C+  
Sbjct: 290 GKKTRPETVNHNPRAPNGPEDAHDYLWYGQDLWM-KPDVWNWVNSREDVKEIILRTCNHV 348

Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
            C+   T ++    C+ C L VYC + C++E W R H+ EC   L+L
Sbjct: 349 GCSARETEVAQFKRCSACRLVVYCGQECQKEDWKR-HKPECKQHLKL 394


>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
          Length = 565

 Score = 38.9 bits (89), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/97 (27%), Positives = 35/97 (36%), Gaps = 2/97 (2%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
              +++ TS   GR +    D   G  L   +P    +       NC   CTS      C
Sbjct: 45  GTNLRLGTSTLGGRGLFTQTDAKPGTRLLSVRPHIHAVSARFLEDNCT-LCTSEENVRRC 103

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
             C    YCS  C+   W   H+ EC    R  EA G
Sbjct: 104 TRCKKVAYCSTECQTADW-GIHKQECQSLRRWAEASG 139


>gi|340522060|gb|EGR52293.1| predicted protein [Trichoderma reesei QM6a]
          Length = 562

 Score = 38.9 bits (89), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 56/129 (43%), Gaps = 20/129 (15%)

Query: 41  SVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVH 100
           S++A++  +   +   +  VA+LP L  G+S T    S          K   V A ED+ 
Sbjct: 212 SLQALNDGLKEMAPKCAVQVATLPVLLEGDSDTDSLDS------IPTCKQLGVFAQEDIE 265

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAP--------IPCNECILAVYCSESC 150
            G+ +  E    + +L   +    + C  C+S  AP        I C EC   V+CS+ C
Sbjct: 266 PGEVVLRE----YTLLTANNRHKDSVCDACSSELAPLGSSENQSIQCEECFDTVFCSQEC 321

Query: 151 RREAWLRYH 159
             +A  RYH
Sbjct: 322 YDQAQSRYH 330


>gi|346318901|gb|EGX88503.1| MYND domain protein [Cordyceps militaris CM01]
          Length = 565

 Score = 38.9 bits (89), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 18/134 (13%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           ++ +  S++A++  +   +   +  VA+LP L      T   +++ +++     +   V 
Sbjct: 201 DRFDPASLDALNADLKKMAPKCAIKVATLPVLL----ETASSSTDSLEI-IPTCQQLGVF 255

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSISA------PIPCNECILAVY 145
           A ED+  G+ +  E    + +L   +    S C+ C + +        P+ C+EC   V+
Sbjct: 256 AQEDIEPGEAVLRE----YTLLTANNRLKDSVCDACSSDLPPIGSEQEPVKCDECYDTVF 311

Query: 146 CSESCRREAWLRYH 159
           CS+ C  EA  RYH
Sbjct: 312 CSQFCHNEAQERYH 325


>gi|403366172|gb|EJY82881.1| Serine/threonine protein kinase [Oxytricha trifallax]
          Length = 1151

 Score = 38.9 bits (89), Expect = 1.5,   Method: Composition-based stats.
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
           +PC  C +  YCSE C ++ W R H+++C
Sbjct: 31  LPCPYCAVQFYCSERCVQQDWFRSHQFKC 59


>gi|169848656|ref|XP_001831033.1| hypothetical protein CC1G_09298 [Coprinopsis cinerea okayama7#130]
 gi|116507926|gb|EAU90821.1| hypothetical protein CC1G_09298 [Coprinopsis cinerea okayama7#130]
          Length = 661

 Score = 38.9 bits (89), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 7/67 (10%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWECH---GALRLLEAVGIAHLA----LKLILVSSHS 189
           C+ C+   YC + C+ + W R HR+ECH      R L  +GIA       L L+++    
Sbjct: 455 CSSCLSVTYCGQPCQSDDWDRVHRYECHILASDHRALTKLGIALPPRLKHLHLVIIQRQM 514

Query: 190 DRYKEVY 196
           D   + Y
Sbjct: 515 DDMGDAY 521


>gi|308468789|ref|XP_003096635.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
 gi|308241582|gb|EFO85534.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
          Length = 428

 Score = 38.9 bits (89), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 5/65 (7%)

Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
           TL  EKP+A V+ P      C  C   +        + C++C    YCS  C+R+ W   
Sbjct: 3   TLAREKPLAAVLCPQFQDFYCATCFAELDVNGETEILMCDDCSEVSYCSPKCQRQDWRSV 62

Query: 159 HRWEC 163
           H++EC
Sbjct: 63  HQFEC 67


>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
          Length = 456

 Score = 38.9 bits (89), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 4/80 (5%)

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAP---IPCNECILA 143
            +GR V A + + +G T+ +  P+AFV+     ++  C  C  S         C+ C  A
Sbjct: 9   GRGRAVRATKALARGQTVLLNPPLAFVLRHEERVARRCEDCFVSEKPEHRLANCSLCHTA 68

Query: 144 VYCSESCRREAWLRYHRWEC 163
            YCS+ C+   W R H+  C
Sbjct: 69  AYCSKPCQTRNWKRAHKHVC 88


>gi|339896842|ref|XP_003392212.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321398900|emb|CBZ08344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 2122

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 26/101 (25%)

Query: 91   RHV--IAVEDVHKGDTLFVEKPVAFV----ILP---------------PCSMSNCNHCCT 129
            RH+   A   + K + +FVE+P   +    +LP               P S +    CC 
Sbjct: 1436 RHIRLSATRRIQKDEVIFVERPAILMPFWPLLPSRLASASTGAGVSDRPVSTTPEPSCCA 1495

Query: 130  S-----ISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
                  ++ P+ C     +VYCSE CR+ A   YH  EC G
Sbjct: 1496 QCGRQRLAKPVNCPGGCHSVYCSEQCRQRALRLYHVVECGG 1536


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score = 38.5 bits (88), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A      G+ LF      +V+      ++C+ C T       C +C  A Y
Sbjct: 12  SPGKGRGLKATRSFALGELLFSCPAYTYVLTDNERGNHCDFCFTRKEGLSKCGKCKQAFY 71

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++  W   H+ EC
Sbjct: 72  CNVDCQKGDW-PMHKLEC 88


>gi|38566954|emb|CAE76256.1| conserved hypothetical protein [Neurospora crassa]
          Length = 533

 Score = 38.5 bits (88), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 16/101 (15%)

Query: 89  KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
           +GR + + E    G+T+  +E P+  +       + CN+C   +S  I            
Sbjct: 22  RGRSLFSTERFGAGETIAIIENPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGART 80

Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
              C  C  AVYC+  C+R  W   H+ EC    R+ E  G
Sbjct: 81  CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIQERTG 121


>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
 gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
          Length = 400

 Score = 38.5 bits (88), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 16/96 (16%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISAP--- 134
           ++++ S  +GR++IA  D+  G+ L   K  A+  +P C +     C +C   I  P   
Sbjct: 8   LELKKSETEGRYIIATRDIDVGENLL--KCKAYFSMP-CEIYKKITCGNCVKIIKRPQQE 64

Query: 135 -------IPCNECILAVYCSESCRREAWLRYHRWEC 163
                    CN C    YCSE C++E+   +  +EC
Sbjct: 65  KKKKVKEYKCNVCNEIWYCSEFCKKESQPIHAHYEC 100


>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
          Length = 561

 Score = 38.5 bits (88), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 44/113 (38%), Gaps = 31/113 (27%)

Query: 80  KIKMQ-TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN------CNHC----- 127
           KI+++ T   + + + A  DV KGD +F +  +   I  P    N      C+HC     
Sbjct: 192 KIELRYTDEIRRKSLFARCDVKKGDVVFTDTDLYANIKSPSLDINTYQRLFCDHCMGQLP 251

Query: 128 -----------------CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
                            CT I   I  N      YCSE CR +AW  YH  EC
Sbjct: 252 KVSETETAEKVVLCKYGCTRIM--IGSNTVDTTKYCSEKCRDDAWDEYHCQEC 302


>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
          Length = 349

 Score = 38.5 bits (88), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 35/87 (40%), Gaps = 1/87 (1%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
           A   ++   S  KGR + A+   H GD LF     A V+       +C  C         
Sbjct: 5   ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C  A YC   C++E W   H+ EC
Sbjct: 65  CGRCKQAFYCDVECQKEDWP-LHKLEC 90


>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
 gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
           3D7]
          Length = 509

 Score = 38.5 bits (88), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 22/102 (21%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-------------------M 121
            K++    +G+ V+AV  +  G  L    P  F+  P C                    +
Sbjct: 2   FKIEYKEDRGKCVVAVNQIRAGYCLVESHPEIFI--PLCVKYMAPRIIDQNNKKNNFKII 59

Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           + C +C    +  I C  C   VYCS+ C   AW + HR EC
Sbjct: 60  NTCFYCLEKFNKCICCPNCKYVVYCSDMCLERAW-KSHREEC 100


>gi|327267201|ref|XP_003218391.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Anolis carolinensis]
          Length = 1012

 Score = 38.5 bits (88), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C HC  S+    +PC  C     CS+SC+ ++W   H+ EC  A R+
Sbjct: 945 CYHCGRSVGVNLVPCARCFETYSCSKSCKLKSWNERHKRECLQAGRM 991


>gi|219114284|ref|XP_002176313.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217402716|gb|EEC42705.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 705

 Score = 38.5 bits (88), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 45/121 (37%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN------CNHCCTSISAPI---- 135
           S  +GRH++A +   +G  L++E+P+  + LP  S+ N      C  C   +  P+    
Sbjct: 19  SPTRGRHLVATKAQPRGRLLWIERPL--LALP--SLENRPDAWTCAACHAFLGGPVRALQ 74

Query: 136 -------------------------------PCNECILAVYCSESCRREAWLRYHRWECH 164
                                          PC      VYCS  C R+ W  +HR+ C 
Sbjct: 75  HRWSRSPTPSSASSTLDDDDDDDHSEEYRTVPCRRNCGFVYCSADCERDVWAAHHRYLCT 134

Query: 165 G 165
           G
Sbjct: 135 G 135


>gi|50554579|ref|XP_504698.1| YALI0E32725p [Yarrowia lipolytica]
 gi|49650567|emb|CAG80302.1| YALI0E32725p [Yarrowia lipolytica CLIB122]
          Length = 655

 Score = 38.1 bits (87), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC----HGALRLLEAVGIAHLALKLILVSSHSDRY 192
           C  C LA+YC   C++  W  +H++EC    +    LL  V +A +      ++  +D  
Sbjct: 89  CGGCGLAMYCRAECQKSDWTEHHKYECKIFSNWTTPLLPGVSLALIVRTAFAMAQDTDLR 148

Query: 193 KEVYHLET 200
             ++ + T
Sbjct: 149 TRIFQMHT 156


>gi|85117964|ref|XP_965349.1| hypothetical protein NCU02962 [Neurospora crassa OR74A]
 gi|28927157|gb|EAA36113.1| predicted protein [Neurospora crassa OR74A]
          Length = 610

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 16/101 (15%)

Query: 89  KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
           +GR + + E    G+T+  +E P+  +       + CN+C   +S  I            
Sbjct: 22  RGRSLFSTERFGAGETIAIIENPLLALPDDANMRTTCNYCL-YVSGTIEFEGDVEAGART 80

Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
              C  C  AVYC+  C+R  W   H+ EC    R+ E  G
Sbjct: 81  CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIQERTG 121


>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
          Length = 583

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 42/111 (37%), Gaps = 20/111 (18%)

Query: 73  TLMCASNKIKMQTSNAK--GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS 130
           T+  A   + +Q S     G  +    DV +G+ +F   P+         M  C+ C   
Sbjct: 78  TIKAAEGSLSIQESTIPNAGSGMFTTRDVEEGELIFTSVPLVTCAEVGPGMEACDFCFQQ 137

Query: 131 ---ISAPIP---------------CNECILAVYCSESCRREAWLRYHRWEC 163
              +  P+                CN C L  YCS+SC + AW   H +EC
Sbjct: 138 RRRVFHPVEDRFLQPGEVLPPLHICNGCRLYQYCSQSCSQRAWDTGHLYEC 188


>gi|330791333|ref|XP_003283748.1| hypothetical protein DICPUDRAFT_147450 [Dictyostelium purpureum]
 gi|325086371|gb|EGC39762.1| hypothetical protein DICPUDRAFT_147450 [Dictyostelium purpureum]
          Length = 1240

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 48/112 (42%), Gaps = 5/112 (4%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S+  GR   A   + +   LF E+P    +        C +C   +  P+ C  C  A Y
Sbjct: 611 SDKVGRITEANSPIPENSLLFQERPYVSCLDRNYHSLYCYNCFKELLVPVYCKGCNNAQY 670

Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI-LVSSHSDRYKEVY 196
           C +SC +E +   H  EC     L      +H +L +I ++   +  Y+E+Y
Sbjct: 671 CGDSCLKEDYKNQHSVECDKGFLL----TCSHESLLVIRILIKKAKEYRELY 718


>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
 gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
          Length = 495

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 36/145 (24%), Positives = 69/145 (47%), Gaps = 18/145 (12%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ +   GR+V+A  ++  G+T+ +E+PV  ++L       C +C    S+   C
Sbjct: 9   AKKCEVKQNETLGRYVVAATNIKAGETVLLEQPV--LLLANNGDRRCCNCHQLTSS--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
            +C L   C++        CRR A L+    +        E       ALK +L+  H++
Sbjct: 65  GKCRLMPLCADCVEHHGPDCRRLAELQLQEQQVEQLQAHTEVCS----ALKYLLLREHAE 120

Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
           +   Y+++  LE HL   R  D+++
Sbjct: 121 KRSHYEQLLQLEAHLARRRDTDIWR 145


>gi|146071727|ref|XP_001463183.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134067266|emb|CAM65535.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 501

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 20/91 (21%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
           GR + A   +    ++  E P+   ++  C  + C HC   +SA                
Sbjct: 193 GRGLYATRHIQPRSSILCESPL---LVQRCDDTKCAHCLAPLSARRGSSITDTGASDPLA 249

Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
             + C  C    YCSE CR  AW +YH   C
Sbjct: 250 SGVACPHCQHETYCSEDCRDAAWEQYHVCSC 280


>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
 gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
           42464]
          Length = 573

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 43/109 (39%), Gaps = 14/109 (12%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCC-TSISAPIP- 136
           +I    S+ K R + A      G T+         +    +M + CN+C  T+  +P P 
Sbjct: 10  RIHNDPSSPKRRSLRATRAFSPGSTIATFSNPLLALPDGATMRTTCNYCLRTTAPSPSPS 69

Query: 137 -----------CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
                      C  C  AVYC  +C+R  W   H+ EC    R+ E  G
Sbjct: 70  QSAATPPAFKACTACKAAVYCGPACQRAHWKAAHKAECKMFARVREQAG 118


>gi|242007166|ref|XP_002424413.1| protein msta, isoform A, putative [Pediculus humanus corporis]
 gi|212507813|gb|EEB11675.1| protein msta, isoform A, putative [Pediculus humanus corporis]
          Length = 540

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 37/157 (23%), Positives = 70/157 (44%), Gaps = 34/157 (21%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF----VILPPCSMSNCNHCCTSISAPIPC 137
           K++ ++  GR++IA E++ KGD +F E P+      V +P C    C++ C ++     C
Sbjct: 47  KLEQNDRLGRYLIASENISKGDVIFYESPLIVGPKTVTVPVC--LGCHNVC-NMDNYHEC 103

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALR---------------------LLEAVGIA 176
           + C   + CS  C+       H  EC+  ++                       E     
Sbjct: 104 DGCGFPL-CSPQCQVSD---NHVDECYQMMKSNYRIKMDSKKKKNNDDNNGGGKEIFYFP 159

Query: 177 HLALKLILVSS--HSDRYKEVYHLETHLQDMRPEDLY 211
            + L+ IL+    +S++YK++ +LE+H  D   + L+
Sbjct: 160 IVPLRCILLKGKKNSEKYKKIMNLESHYDDRINKPLF 196


>gi|398010425|ref|XP_003858410.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322496617|emb|CBZ31687.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 501

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 20/91 (21%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
           GR + A   +    ++  E P+   ++  C  + C HC   +SA                
Sbjct: 193 GRGLYATRHIQPRSSILCESPL---LVQRCDDTKCAHCLAPLSARRGSSITDTGASDPLA 249

Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
             + C  C    YCSE CR  AW +YH   C
Sbjct: 250 SGVACPHCQHETYCSEDCRDAAWEQYHVCSC 280


>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
          Length = 517

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 68/143 (47%), Gaps = 16/143 (11%)

Query: 75  MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA- 133
           M    K K+  S   GR+++A +++  G+ +  E+P+A   +       C  C  S+S  
Sbjct: 1   MAPIPKYKVTHSGRLGRYLVAAKNIAAGEVIIREEPIAVGPMTYRKDRLCFACLRSLSKI 60

Query: 134 --PIPCNECILAVYCSESCRREAWLRYHRW-EC--HGALRLL------EAVGIAHLALKL 182
                C+ C LA  CS +C  E   ++H   EC   G  R L      +A GI  LAL+L
Sbjct: 61  GRQYVCSRCNLAPLCSVAC--ETRTKHHTSDECLILGENRDLLAKNGIDATGIL-LALRL 117

Query: 183 ILVSSHSDR-YKEVYHLETHLQD 204
            L+   +   +++V H+E HL +
Sbjct: 118 WLIKCRNPAVWEQVDHMEDHLHE 140


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 6/100 (6%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
           KG  +  EKP A V+        C+ C       + C+ C+   YC+  C++EAW   H+
Sbjct: 8   KGSLILKEKPFACVLHSRYRAERCDKCFKP-GKVLKCSNCLYVRYCNRFCQKEAWPD-HQ 65

Query: 161 WECHGALRLLEAVGIAHLAL---KLILVSSHSDRYKEVYH 197
            EC G L+ +    +   AL   ++I        Y++ Y+
Sbjct: 66  DEC-GKLKAIGTRTVPDAALMISRIIRKLQKGGDYQKGYY 104


>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
          Length = 781

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 16/72 (22%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
           K++  ++  KGR +++  D+ +   + VE+P  F +LP              +  +PC  
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEP--FSVLP--------------ADTVPCPS 262

Query: 140 CILAVYCSESCR 151
           C + VYCSESC+
Sbjct: 263 CSIPVYCSESCQ 274


>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
          Length = 367

 Score = 38.1 bits (87), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 19/101 (18%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKP-VAFVILPPCSMSNCNHCCTSISAP----- 134
           IK    + +GR +    DV  G  ++ EKP V++  L   + + CNHC  S+        
Sbjct: 60  IKDSGDSLRGRGLFTKLDVQSGQVIYTEKPFVSYQSLEIDNKTICNHCLKSLDKDKIEKS 119

Query: 135 ------------IPCNECILAVYCSESCRREAWLRYHRWEC 163
                         C +C    YCSE C+ E+ ++YH   C
Sbjct: 120 KQKKDDDFDDLFSKCVKCS-TRYCSEQCKSESDIQYHLASC 159


>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 463

 Score = 38.1 bits (87), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 8/83 (9%)

Query: 89  KGRHVIAVEDVHKGDTLFVE-KPVAFVILPPCSMSNCNHCCTS-------ISAPIPCNEC 140
           KG  + + +   +G+ L+V  KP+  V+      + C++C  S       I   I C  C
Sbjct: 20  KGYGLYSKKRFAEGERLYVFVKPLVAVLDEKHIKNTCSNCFISEEDKDDDIFHLIQCKGC 79

Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
            +  YC + C++E W  +H++EC
Sbjct: 80  KILQYCCQKCQKEDWEIFHQYEC 102


>gi|328697017|ref|XP_003240210.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 635

 Score = 38.1 bits (87), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 23/115 (20%), Positives = 49/115 (42%), Gaps = 5/115 (4%)

Query: 52  TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
           T    +  +  +  L  G +  +   S  ++++ S    R V A  D++ GD + +++P 
Sbjct: 203 TEQERAMKIPIIEHLVGGRNDNIPALSAFVELKMSEDMRRGVYATRDINPGDVVAIDEP- 261

Query: 112 AFVILPPCSMSNC---NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            ++  P  + +     N C     A   C +C +  YC++ C  + +   H + C
Sbjct: 262 -YICGPSGNDTEVCHYNGCLKMDFALFRCPKCFMVYYCTKDCMDKDYKEGHNFAC 315


>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
          Length = 544

 Score = 38.1 bits (87), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 14/142 (9%)

Query: 86  SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
           S   G  + A  ++  G T + +++P   V+        C+      HC   ++  +  C
Sbjct: 65  SPGAGTGIFAATNIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124

Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
             C +  YC+++C+ + W   H +EC     L+  V    A   L+++L S      KE 
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFAHSFECRVYKELMPRVLPSNARAILRMVLRSERGKYPKEE 184

Query: 195 ---VYHLETHLQDMRPEDLYQY 213
                 LE+H+ +++ ++  Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206


>gi|196012830|ref|XP_002116277.1| hypothetical protein TRIADDRAFT_60207 [Trichoplax adhaerens]
 gi|190581232|gb|EDV21310.1| hypothetical protein TRIADDRAFT_60207 [Trichoplax adhaerens]
          Length = 543

 Score = 38.1 bits (87), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 37/93 (39%), Gaps = 3/93 (3%)

Query: 73  TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
           T    S  IK  T   +G      ++  +G          FV  P  S   C  C  SI 
Sbjct: 366 TQTVGSLGIKTSTVGGEGEIAENTKNSSRGSRRLTPGRHRFVAKP--SFCYCYECGRSIG 423

Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWECH 164
             +  C+ C+   YCS+ C+  AW   H+ EC+
Sbjct: 424 VRLTACSRCLKVYYCSKQCKLRAWNARHKDECY 456


>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
 gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
          Length = 421

 Score = 38.1 bits (87), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 44/115 (38%), Gaps = 32/115 (27%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA-------- 133
            S+ KG+ + A   + KG+T+F EKP+    F          C+HC  S+          
Sbjct: 27  VSSGKGKGLFATRAIRKGETIFQEKPLVSSQFQWNALYRYRACDHCLRSLETAEENAQRL 86

Query: 134 --------PIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
                   P P            C  C +  YCS  C + A  +YHR  C GA R
Sbjct: 87  SGNAHVVLPYPELCTVRNGLHQQCPRCQV-TYCSAECLKAAADQYHRALCLGASR 140


>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
           morsitans]
          Length = 395

 Score = 38.1 bits (87), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 28/114 (24%), Positives = 47/114 (41%), Gaps = 34/114 (29%)

Query: 88  AKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC---------------- 128
            +GR +++ + ++  D +F E+P     F        + C+HC                 
Sbjct: 11  GRGRAMVSTKSLNTDDIIFEEEPFVSCQFSWNAAYGYAACDHCMRPLETITENIRRLANQ 70

Query: 129 TSISAPIP--------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
            +++ P+               C++C +  YCSE CR EA  RYHR  C G+ R
Sbjct: 71  PALAVPLTEYDPTAQWLKQFTSCSKCRVR-YCSEECRIEALKRYHRIACLGSYR 123


>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
 gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
          Length = 430

 Score = 38.1 bits (87), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR + A      G+ LF      +V+      ++C+ C         C +C  A Y
Sbjct: 12  SPGKGRGLKATRSFALGELLFTCPAYTYVLTDTERGNHCDFCFARKEGLSKCGKCKQAFY 71

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++  W   H+ EC
Sbjct: 72  CNVDCQKGDW-PMHKLEC 88


>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
          Length = 420

 Score = 38.1 bits (87), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 44/114 (38%), Gaps = 32/114 (28%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA--------- 133
           S+ KG+ + A   + KG+T+F EKP+    F          C+HC  S+           
Sbjct: 26  SSGKGKGLFATRAIRKGETIFQEKPLVSSQFQWNALYRYRACDHCLRSLETAEENAQRLS 85

Query: 134 -------PIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
                  P P            C  C +  YCS  C + A  +YHR  C GA R
Sbjct: 86  GNAHVVLPYPELCTVRNGLHQQCPRCQV-TYCSAECLKAAADQYHRALCLGASR 138


>gi|322707137|gb|EFY98716.1| wd and tetratricopeptide repeat protein [Metarhizium anisopliae
           ARSEF 23]
          Length = 553

 Score = 38.1 bits (87), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 60/139 (43%), Gaps = 29/139 (20%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           ++  E+S++A++  + + +   +  VA+LP L  G S+T     +   + T    G  V 
Sbjct: 201 DRFAEDSLKALNADLKTMAPKCAVQVATLPVLLEGASNT----DHYDIIPTCKQLG--VF 254

Query: 95  AVEDVHKGDTLFVE-----------KPVAFVI---LPPCSMSNCNHCCTSISAPIPCNEC 140
           AVED+  G+ +  E            PV       LPP   +N           + C+EC
Sbjct: 255 AVEDIEPGEDVLREYSLLTANNRLKDPVCDACSSDLPPLGSNN---------EAVNCDEC 305

Query: 141 ILAVYCSESCRREAWLRYH 159
              V+CS+ C   A  RYH
Sbjct: 306 YDTVFCSQFCHDVAQERYH 324


>gi|299752592|ref|XP_001831035.2| hypothetical protein CC1G_09300 [Coprinopsis cinerea okayama7#130]
 gi|298409907|gb|EAU90823.2| hypothetical protein CC1G_09300 [Coprinopsis cinerea okayama7#130]
          Length = 573

 Score = 37.7 bits (86), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 23/44 (52%)

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
           T I+    C+ C   VYCS  C+RE W   HR EC   LR  EA
Sbjct: 389 TEITMGAQCSGCHSVVYCSLQCQREDWNERHRTECLAMLRSYEA 432


>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 547

 Score = 37.7 bits (86), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 11/85 (12%)

Query: 90  GRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMSNCNHC-----CTSISAPIP-----CN 138
           G+ V A  DV KG D L V+ P+  ++      + C+ C       SI    P     C 
Sbjct: 46  GQGVFATIDVKKGADVLVVDNPLIALVEEAQLQNICSGCYDTSKAGSIDNRRPDLVKACT 105

Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
            C +  YC ++C+R+ W   H  EC
Sbjct: 106 RCKVVYYCDKNCQRKDWKAGHSLEC 130


>gi|338725783|ref|XP_003365201.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
           domain-containing protein 1-like [Equus caballus]
          Length = 917

 Score = 37.7 bits (86), Expect = 2.8,   Method: Composition-based stats.
 Identities = 22/68 (32%), Positives = 32/68 (47%), Gaps = 11/68 (16%)

Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE---------AV 173
           C  C  SI    +PC  C   + CS+ C+ +AW  +H+ +C  AL  L          + 
Sbjct: 815 CYQCGRSIGVRLVPCTRCYGILTCSKYCKTKAWTDFHKKDC-SALAALAGDFVQHFDMSA 873

Query: 174 GIAHLALK 181
           G  HL+LK
Sbjct: 874 GKEHLSLK 881


>gi|328698136|ref|XP_003240555.1| PREDICTED: hypothetical protein LOC100574406 isoform 1
           [Acyrthosiphon pisum]
 gi|328698140|ref|XP_003240557.1| PREDICTED: hypothetical protein LOC100574406 isoform 3
           [Acyrthosiphon pisum]
          Length = 766

 Score = 37.7 bits (86), Expect = 2.9,   Method: Composition-based stats.
 Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 12/125 (9%)

Query: 55  SASSSVASLPSLACGESHTLM--CASNKIKMQTSN---AKGRHVIAVEDVHK---GDT-- 104
           +AS S  +LP  A   +  ++     N I M  S+   A+G+ +I     +K   G T  
Sbjct: 639 TASLSKTALPIPATPTTFVIIKPAVPNNISMANSSLNRAEGKTIIVGNKQYKFVKGPTGE 698

Query: 105 --LFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWE 162
               V K   FV  PP   + C     +    I C+ C    YCS  C++  W   H  +
Sbjct: 699 LRAVVNKTKTFVKPPPTIKTKCYSRNCTKPVTIMCSSCTSVKYCSNICQQHDWYDGHIND 758

Query: 163 CHGAL 167
           CH  L
Sbjct: 759 CHHLL 763


>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
          Length = 431

 Score = 37.7 bits (86), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 15/45 (33%), Positives = 21/45 (46%)

Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           C    C  C   ++ P+ C  C    YCS  C+R+ W   H+ EC
Sbjct: 378 CRGPQCAACNCFLTGPLLCGRCRQVAYCSSECQRQHWKAAHKREC 422


>gi|237757339|ref|NP_766438.2| ankyrin repeat and MYND domain-containing protein 1 [Mus musculus]
          Length = 1001

 Score = 37.7 bits (86), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW+ +H+ +C+  + ++
Sbjct: 940 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 987


>gi|148708043|gb|EDL39990.1| ankyrin repeat and MYND domain containing 1 [Mus musculus]
          Length = 1002

 Score = 37.7 bits (86), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW+ +H+ +C+  + ++
Sbjct: 941 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 988


>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
           rubripes]
          Length = 432

 Score = 37.7 bits (86), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 32/78 (41%), Gaps = 1/78 (1%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KGR +        G+ L   +  ++V+       +C  C T   +   C +C  A Y
Sbjct: 13  SPGKGRGLRVTRAFQVGELLLSSQAYSYVLSVKERGEHCESCFTRKKSLARCGKCKKAFY 72

Query: 146 CSESCRREAWLRYHRWEC 163
           C   C++  W   HR EC
Sbjct: 73  CDVKCQKGDWA-MHRLEC 89


>gi|391337345|ref|XP_003743030.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Metaseiulus occidentalis]
          Length = 437

 Score = 37.7 bits (86), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 14/79 (17%)

Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-----APIPCNECILAVYCSESCRR--- 152
           KG+ +   +P AF +      S C+ C  +       API C+ C +  YCS+ C R   
Sbjct: 6   KGEVILFSRPYAFALYESFVDSTCSFCFRTQGDGHDFAPIECHSCKVYKYCSDECLRLDQ 65

Query: 153 -EAWLRYHRWECHGALRLL 170
            + WL     EC  A R L
Sbjct: 66  MDHWL-----ECQVAKRKL 79


>gi|403259468|ref|XP_003922234.1| PREDICTED: tudor domain-containing protein 1 [Saimiri boliviensis
           boliviensis]
          Length = 1234

 Score = 37.7 bits (86), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 32/151 (21%), Positives = 69/151 (45%), Gaps = 19/151 (12%)

Query: 18  RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCA 77
           RL+ + + + K + L  E++++     ++ Q  +++SS S S+      + G+   L   
Sbjct: 54  RLKSSENANKKNNFLLCEQTKQ----YLASQEDNSASSNSGSINGEVVASQGDRKKLPAG 109

Query: 78  SNK--IKMQTSNAKGRHVIAVEDVHKG----------DTLFVEKPVAFVIL-PPCSMSNC 124
           +++    ++ ++ KG +     +VH            ++L +  P  F  L PP   + C
Sbjct: 110 NSESSPSVENNSPKGVNTKPENNVHPAKSKKLHKLVENSLSINNPELFTSLGPPLRSTTC 169

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAW 155
           + C   +   + C++C    YCS +C+R  W
Sbjct: 170 HRC--GLFGSLRCSQCKQTYYCSTACQRRDW 198


>gi|113197611|gb|AAI20888.1| Ankmy1 protein [Mus musculus]
          Length = 905

 Score = 37.7 bits (86), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW+ +H+ +C+  + ++
Sbjct: 844 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 891


>gi|81898358|sp|Q8C0W1.1|ANMY1_MOUSE RecName: Full=Ankyrin repeat and MYND domain-containing protein 1
 gi|26325636|dbj|BAC26572.1| unnamed protein product [Mus musculus]
 gi|113197927|gb|AAI20889.1| Ankyrin repeat and MYND domain containing 1 [Mus musculus]
          Length = 906

 Score = 37.7 bits (86), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW+ +H+ +C+  + ++
Sbjct: 845 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 892


>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
 gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
          Length = 400

 Score = 37.7 bits (86), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 24/98 (24%), Positives = 44/98 (44%), Gaps = 7/98 (7%)

Query: 74  LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCCTSIS 132
           L C   + +    N KG        +  G  +  E+P ++ +     + + C++C   + 
Sbjct: 8   LFCEIGQGRGIRCNKKGS-----SGIEPGTLIVKEEPYSYTLTDGELLRTRCHYCLKRLE 62

Query: 133 APIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
             + C+ C  A YC+E C++ A   +H  EC G  RL+
Sbjct: 63  NSVSCDACRTAKYCNEECKKAAKF-HHTPECRGYSRLM 99


>gi|320165180|gb|EFW42079.1| SMYD5 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 429

 Score = 37.7 bits (86), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 40/86 (46%), Gaps = 4/86 (4%)

Query: 53  SSSASSSVASLPSLACGESHTLMCASNKIKMQTSNA-KGRHVIAVEDVHKGDTLFVEKPV 111
           S  A+  V++ P     E    +CA   ++++  +A KGR + A +   KGD +F E P+
Sbjct: 19  SMPAAELVSARPLPLADEYFATLCAGKHVQVRVEDARKGRGLFATQAFKKGDIVFTEAPL 78

Query: 112 A---FVILPPCSMSNCNHCCTSISAP 134
               F+         C+ C  S+ +P
Sbjct: 79  VCAQFLWNEAYGYKACHQCMRSLESP 104


>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
          Length = 426

 Score = 37.4 bits (85), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 5/85 (5%)

Query: 102 GDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHR 160
           GD +   +P ++          C++C +S  A I  C+ C +  YC   C+R +W   H+
Sbjct: 6   GDEVLRCRPFSYAFKEKNRDKICDYCFSSRQAKILRCSRCCIIYYCGRQCQRLSWSEIHK 65

Query: 161 WECHGALRLLEAVGIAHLALKLILV 185
            EC    + L+ + +    L L+ +
Sbjct: 66  KEC----KYLKMLDLKEPPLDLLFI 86


>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
          Length = 511

 Score = 37.4 bits (85), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           +++ C +C   ++  I C  C    YCS+SC   AW ++HR EC
Sbjct: 58  TINICFYCFEKVNKCIYCPNCKYVAYCSDSCLERAW-KFHREEC 100


>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
          Length = 536

 Score = 37.4 bits (85), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 11/138 (7%)

Query: 28  KEDVLTIEKSEENSVEAMSMQVSSTSSSASS--SVASLPSLACGESHTLMCASNKIKMQT 85
           +ED+   +  EE+     S  +  T S A    +V SL     G           ++   
Sbjct: 65  REDMGGEDHGEEDGSTGFSPWILETPSKAPPLRTVRSLEDEGKG--------LGGMERFC 116

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
           S  KG           GD LF     A+V+      ++C +C         C  C  A Y
Sbjct: 117 SPGKGPGAAGSAAFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 176

Query: 146 CSESCRREAWLRYHRWEC 163
           C+  C++E W   H+ EC
Sbjct: 177 CNVECQKEDW-PMHKLEC 193


>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 363

 Score = 37.4 bits (85), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 30/141 (21%)

Query: 78  SNKIKMQTSNAK-GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISA 133
           S+ I+++ +  K GR + A+    KG  +F E+P  FV  P   +     CNHC   ++ 
Sbjct: 63  SSSIQIKDNGTKSGRGLFAITAFKKGSQIFQEEP--FVSYPSLDVDKSTICNHCLKQLNN 120

Query: 134 PI------------PCNECILAVYCSESCRREAWLRYHRWECHGA--------LRLLEAV 173
                          C +C    YCSE CR  A +++H   C  +          L+E  
Sbjct: 121 NGNNNNDNKSKQQQQCTKCG-ERYCSEKCRSSAEVQHHLASCPTSSNIDNITRYSLVEKR 179

Query: 174 GIAHLALKL---ILVSSHSDR 191
               LA ++   IL+ +H D+
Sbjct: 180 RFPLLAARILSRILLENHFDK 200


>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
           [Oreochromis niloticus]
          Length = 417

 Score = 37.4 bits (85), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 45/110 (40%), Gaps = 32/110 (29%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
           N KG+ + A   + KG+T+F+E+P+    F+         C +C                
Sbjct: 29  NVKGKGLFAKRSMKKGETIFIERPLVSAQFLWNTLYKYKACEYCLRALETAEGNARRLSG 88

Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
              +S P P            C +C + +YCS  CR+ A  +YHR  C G
Sbjct: 89  IPGLSLPHPELCRVRPEQHQACPQCQV-MYCSSECRQAAADQYHRALCLG 137


>gi|322698402|gb|EFY90172.1| MYND domain protein, putative [Metarhizium acridum CQMa 102]
          Length = 553

 Score = 37.4 bits (85), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 37/135 (27%), Positives = 58/135 (42%), Gaps = 29/135 (21%)

Query: 39  ENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVED 98
           E+S++A++  + + +   +  VA+LP L  G S+T     +   + T    G  V AVED
Sbjct: 205 EDSLKALNADLKTMAPKCAVQVATLPVLLEGASNT----DHYDIIPTCKQLG--VFAVED 258

Query: 99  VHKGDTLFVE-----------KPVAFVI---LPPCSMSNCNHCCTSISAPIPCNECILAV 144
           +  G+ +  E            PV       LPP   +N           + C+EC   V
Sbjct: 259 IGPGEVVLREYSLLTANNRLKDPVCDACSSDLPPLGSNN---------EAVNCDECYDTV 309

Query: 145 YCSESCRREAWLRYH 159
           +CS+ C   A  RYH
Sbjct: 310 FCSQYCHDVAQERYH 324


>gi|293349946|ref|XP_001068522.2| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Rattus norvegicus]
 gi|392350877|ref|XP_003750785.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Rattus norvegicus]
 gi|149037544|gb|EDL91975.1| rCG55486 [Rattus norvegicus]
          Length = 1006

 Score = 37.4 bits (85), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW+ +H+ +C+  + ++
Sbjct: 946 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 993


>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
           2509]
          Length = 547

 Score = 37.4 bits (85), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 16/101 (15%)

Query: 89  KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
           +GR + + E    G+T+  ++ P+  +       + CN+C   +S  I            
Sbjct: 22  RGRSLFSTESFGAGETIAIIDNPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGPRT 80

Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
              C  C  AVYC+  C+R  W   H+ EC    R+ E  G
Sbjct: 81  CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIKERTG 121


>gi|302674593|ref|XP_003026981.1| hypothetical protein SCHCODRAFT_238272 [Schizophyllum commune H4-8]
 gi|300100666|gb|EFI92078.1| hypothetical protein SCHCODRAFT_238272 [Schizophyllum commune H4-8]
          Length = 634

 Score = 37.4 bits (85), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 13/79 (16%)

Query: 115 ILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR---------WECHG 165
           +L PC+ +N       + A  PC E   A YCS+ C+RE W   HR         W   G
Sbjct: 425 LLFPCNNTNAVVHEAEVRA-CPCGE---AFYCSKKCQREHWKTSHRDQCRRADDVWGTQG 480

Query: 166 ALRLLEAVGIAHLALKLIL 184
            + L +AV +   AL++++
Sbjct: 481 DITLGDAVFLFESALRVVV 499


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 37.4 bits (85), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 8/124 (6%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
           ++ +   +GR + A  +V  G+++  +     V+L       C  C  +  A I C++C 
Sbjct: 9   RVDSDATRGRLLRATRNVKMGESILRDCAYGNVVLSGNRAELCAVCLRAADADICCDDCS 68

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETH 201
              +CSE C RE     H  EC       EA+    L      V  HSD    V     +
Sbjct: 69  KVFFCSEEC-REKLRDIHENEC-------EALEEVDLIASKTSVDLHSDEEGNVRSSYAN 120

Query: 202 LQDM 205
           ++D+
Sbjct: 121 VKDL 124


>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 424

 Score = 37.4 bits (85), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 10/95 (10%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV----AFVILPPCSMSNCNHCCTSI--SAP 134
           +++     +GR ++A + +  G  +  E P+    AF  L       C+HC   +  SA 
Sbjct: 12  LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 71

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
             C  C L  +CS +C    +  +  W C   LRL
Sbjct: 72  QKCQSCSLVSFCSPNC----FASHTPWLCESLLRL 102


>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 517

 Score = 37.4 bits (85), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 37/103 (35%), Gaps = 18/103 (17%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------------- 123
           S   K+     KG+ V+AV  +  G  +    P  FV L    M+               
Sbjct: 6   SQMYKVDYKEDKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDTENKKTDYKT 65

Query: 124 ---CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
              C +C    S    C  C   VYCSE C    W + HR EC
Sbjct: 66  VNLCFYCFEKFSKSFYCPNCKYVVYCSEVCLERGW-KLHRDEC 107


>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 573

 Score = 37.4 bits (85), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 57/139 (41%), Gaps = 34/139 (24%)

Query: 77  ASNKIKMQTSNAKGRHVIAVEDVH------------KGDTLFVEKPVAFVILPPCSMS-- 122
           A + ++++TS  +GR + A  D+              G+TL   KP+  V+  P      
Sbjct: 55  AGSPLEVKTSPQRGRGLYAARDIKAGELVWRAPKLTPGETLLKTKPLVSVLSKPNLYEPA 114

Query: 123 -------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI 175
                  +CNHC     A   C+ C    YC  +C+++ W   H+ EC    R       
Sbjct: 115 AGQKMELHCNHCYLLKPAQR-CSGCKGVYYCGAACQQDDWPS-HKTECKALTR------- 165

Query: 176 AHLALKLILVSSHSDRYKE 194
               ++ + V S+ ++ KE
Sbjct: 166 ----VRQLWVQSYPEKAKE 180


>gi|348515417|ref|XP_003445236.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 2
           [Oreochromis niloticus]
          Length = 373

 Score = 37.4 bits (85), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 45/110 (40%), Gaps = 32/110 (29%)

Query: 87  NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
           N KG+ + A   + KG+T+F+E+P+    F+         C +C                
Sbjct: 29  NVKGKGLFAKRSMKKGETIFIERPLVSAQFLWNTLYKYKACEYCLRALETAEGNARRLSG 88

Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
              +S P P            C +C + +YCS  CR+ A  +YHR  C G
Sbjct: 89  IPGLSLPHPELCRVRPEQHQACPQCQV-MYCSSECRQAAADQYHRALCLG 137


>gi|356546765|ref|XP_003541793.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
           max]
          Length = 990

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 115 ILPPCSMS-NCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWECH 164
           ++PP S S N +H C   SAP    C+ C +  YCS +C+   W   H+ EC 
Sbjct: 62  VIPPVSASKNEHHVCARCSAPARTRCSRCKIVRYCSGNCQIIHWRLIHKQECQ 114


>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
 gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
          Length = 849

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 2/90 (2%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
           S +  M  S  K   +    +V  G  +  E+ +A ++L     ++C+ C   + A  +P
Sbjct: 206 SAQTAMYWSADKQWGLRTTSEVQAGTLVHSEEALAGILLKKHRPTHCHGCFGVVPADSVP 265

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGA 166
           C  C +  YC+++CR +A +  H+ EC G+
Sbjct: 266 CIGCGVVSYCNDACRDDATVE-HKLECGGS 294


>gi|426253397|ref|XP_004020382.1| PREDICTED: tudor domain-containing protein 1 [Ovis aries]
          Length = 1326

 Score = 37.0 bits (84), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 66/150 (44%), Gaps = 16/150 (10%)

Query: 18  RLQDTSHLDFKEDVLTIEK------SEENSVEAMSMQVSSTSSSASSSVASLPSLACGES 71
           RL+   + + K + L  E+      S+E+S  ++S   +  +   + S     +L  G S
Sbjct: 183 RLKSPENGNKKNNFLLCEQTKQFLASQEDS--SLSSHPNGVNGEMAGSKEDRKTLPTGNS 240

Query: 72  HTLMCASNKIKMQTSNAK-GRHVIAVE--DVHK--GDTLFVEKPVAFVIL-PPCSMSNCN 125
            + + A N    +  N+K   HV   +   VHK   ++L +  P  F  L PP   + C+
Sbjct: 241 VSPLNAGNNSPPKEVNSKPSNHVPPAKSKKVHKLVENSLSINNPALFNSLGPPLRSTTCH 300

Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAW 155
            C   +   + C++C    YCS  C+R  W
Sbjct: 301 RC--GLFGSLRCSQCKQTYYCSSVCQRRDW 328


>gi|303278744|ref|XP_003058665.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459825|gb|EEH57120.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 620

 Score = 37.0 bits (84), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/106 (25%), Positives = 46/106 (43%), Gaps = 2/106 (1%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
           +++ T   +GR ++A   +  G       P+A       +   C  C         C  C
Sbjct: 24  VEVVTREGRGRCLVATRSLTPGTLALSSLPLACAPFQELTRQRCGACFERPPIVRRCGAC 83

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
             A++C  +C+  AW+  H  EC  A + LE +  A +A + +LVS
Sbjct: 84  DRALFCDRACQAAAWVAGHSRECEKA-KELEKLPTA-VASEAVLVS 127


>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
 gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
 gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
          Length = 429

 Score = 37.0 bits (84), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 9/96 (9%)

Query: 108 EKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
           E   AF +      S CN C TS++    C+ C    YCS+ C+R  W + H+ EC    
Sbjct: 10  ENVYAFALARDQVESYCNQCLTSMAELKKCSACRRLAYCSQECQRADW-KLHKVECKAIK 68

Query: 168 RLLEAVGIAHLALKLIL-----VSSHSDRYKEVYHL 198
              E   +A+ +++L++     +S + D   E Y++
Sbjct: 69  THNE---VANDSIRLVMRIAGKLSRNEDGEIEAYYI 101


>gi|440901267|gb|ELR52242.1| Ankyrin repeat and MYND domain-containing protein 1, partial [Bos
           grunniens mutus]
          Length = 958

 Score = 37.0 bits (84), Expect = 5.0,   Method: Composition-based stats.
 Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
           C  C  SI   + PC  C   + CS+ C+ +AW  +H+ +C+  + +
Sbjct: 912 CYQCGRSIGVRLAPCTRCYGILTCSKYCKTKAWSDFHKKDCNALMAI 958


>gi|281341007|gb|EFB16591.1| hypothetical protein PANDA_012308 [Ailuropoda melanoleuca]
          Length = 951

 Score = 37.0 bits (84), Expect = 5.0,   Method: Composition-based stats.
 Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWEC 163
           C+ C  S+    +PC  C   + CS+ C+  AW  +H+ +C
Sbjct: 908 CSQCGRSVGVRLVPCTRCCGVLTCSKYCKTRAWGDFHKRDC 948


>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 524

 Score = 37.0 bits (84), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 14/29 (48%), Positives = 20/29 (68%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
           I C  C +  YCSESC++ AW +YH+ +C
Sbjct: 259 IVCGHCKVEYYCSESCQQLAWSQYHQIQC 287


>gi|169866977|ref|XP_001840071.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
 gi|116498849|gb|EAU81744.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
          Length = 574

 Score = 37.0 bits (84), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 123 NCNHCCTSISAP------IPCNECILAVYCSESCRREAWLRYHRWEC 163
           N  H      AP        C+ C   VYCSE+C+++ W R H+ EC
Sbjct: 376 NIGHLDRGFRAPSQGEGVRQCSRCRSVVYCSEACQKQDWERLHKREC 422


>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
 gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
          Length = 495

 Score = 37.0 bits (84), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 34/148 (22%), Positives = 58/148 (39%), Gaps = 13/148 (8%)

Query: 71  SHTLMCASNKIKMQTS-NAKGRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMSNCNHCC 128
           S TL   S +   Q + N  G  + A  D++ G D L ++ P   V+  P     C  C 
Sbjct: 4   SKTLPSVSTRTVPQPAPNGMGNGLFATADINPGEDVLHIKTPFVAVLDSPRLEDTCAGCF 63

Query: 129 T-----SISAPIPCNECILAVYCSESCRREAWLRYHRWEC------HGALRLLEAVGIAH 177
                 + +    C  C +  YC  +C+ + W   H  EC         +    A  +  
Sbjct: 64  GKRQMQNGNELKACTGCRVVKYCDRACQSKDWKFAHSVECLIFKNVKPMVLPNNARALLR 123

Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
           + L+ +     S+ +K    LETH+ ++
Sbjct: 124 IVLRTVRNKYDSEEFKVFDGLETHMNEI 151


>gi|47213741|emb|CAF96102.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 787

 Score = 37.0 bits (84), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 1/41 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWEC 163
           C +C  S+   + PC  C    YCS  CR  AW   H+ EC
Sbjct: 747 CYYCGRSVLVRLTPCYRCYKVFYCSRPCRLRAWDELHKNEC 787


>gi|302787779|ref|XP_002975659.1| hypothetical protein SELMODRAFT_415607 [Selaginella moellendorffii]
 gi|300156660|gb|EFJ23288.1| hypothetical protein SELMODRAFT_415607 [Selaginella moellendorffii]
          Length = 1052

 Score = 37.0 bits (84), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 2/90 (2%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
           S +  M  S  K   +    +V  G  +  E+ +A ++L     ++C+ C   + A  +P
Sbjct: 176 SAQTAMYWSADKQWGLRTTSEVQAGTLVHSEEALAGILLKKHRPTHCHGCFGVVPADSVP 235

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGA 166
           C  C +  YC+++CR +A +  H+ EC G+
Sbjct: 236 CIGCGVVSYCNDACRDDATVE-HKLECGGS 264


>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
          Length = 501

 Score = 37.0 bits (84), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
            K++ S   GR+ +A ED+  GD +F E P A+    P S S+C   C    +P+ C
Sbjct: 8   FKVEKSETVGRYAVASEDLKPGDVIFSETPFAY---GPKSDSSC--LCLGCHSPVDC 59


>gi|302506607|ref|XP_003015260.1| hypothetical protein ARB_06383 [Arthroderma benhamiae CBS 112371]
 gi|291178832|gb|EFE34620.1| hypothetical protein ARB_06383 [Arthroderma benhamiae CBS 112371]
          Length = 1143

 Score = 37.0 bits (84), Expect = 5.4,   Method: Composition-based stats.
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 137  CNECILAVYCSESCRREAWLRYHRWEC 163
            C+ C +A YCS+ C+RE W+  H+  C
Sbjct: 1110 CSGCKVAEYCSKDCQREDWMARHKLAC 1136


>gi|410969783|ref|XP_003991371.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1 [Felis
            catus]
          Length = 1104

 Score = 37.0 bits (84), Expect = 5.5,   Method: Composition-based stats.
 Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 124  CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
            C  C  S+    +PC  C   + CS++C+  AW  +H+ +C   L ++
Sbjct: 1040 CYQCGRSVGVRLVPCTRCYGILTCSKACKTRAWGDFHKRDCGSLLAIV 1087


>gi|302653465|ref|XP_003018558.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
 gi|291182212|gb|EFE37913.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
          Length = 1143

 Score = 37.0 bits (84), Expect = 5.6,   Method: Composition-based stats.
 Identities = 12/27 (44%), Positives = 18/27 (66%)

Query: 137  CNECILAVYCSESCRREAWLRYHRWEC 163
            C+ C +A YCS+ C+RE W+  H+  C
Sbjct: 1110 CSGCKVAEYCSKDCQREDWMARHKLAC 1136


>gi|367050672|ref|XP_003655715.1| hypothetical protein THITE_60220 [Thielavia terrestris NRRL 8126]
 gi|347002979|gb|AEO69379.1| hypothetical protein THITE_60220 [Thielavia terrestris NRRL 8126]
          Length = 592

 Score = 37.0 bits (84), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 33/134 (24%), Positives = 61/134 (45%), Gaps = 19/134 (14%)

Query: 35  EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
           ++  E S+  ++ Q+S  +      V++LP L  GES+T     +  ++     K   + 
Sbjct: 214 DRFSEQSLSYLNQQLSEVAPKCEVRVSNLPVLLDGESNT-----DGYEI-VPTCKQLGLF 267

Query: 95  AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSI------SAPIPCNECILAVY 145
           A ED+  G+T+  E    + +L   +    S C+ C T +      S  + C +C   ++
Sbjct: 268 AKEDIAPGETVLRE----YSLLTANNRHKESTCDACGTELPPLGSNSTAVSCPDCYDTIF 323

Query: 146 CSESCRREAWLRYH 159
           C E C ++A   YH
Sbjct: 324 CDEFCFKKAQELYH 337


>gi|125809191|ref|XP_001361025.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
 gi|54636198|gb|EAL25601.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
          Length = 501

 Score = 37.0 bits (84), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 35/146 (23%), Positives = 70/146 (47%), Gaps = 20/146 (13%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ +   GR V+A  D+  G+TL +E+P+  V+LP      C+  C  ++A   C
Sbjct: 9   AKKCELKQNETLGRFVVARCDLSAGETLLLEQPI--VVLPHIGDRRCSK-CFKLTATF-C 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
            +C L   C +        C+R   L+    +    +++L+      +  AL  +L+  H
Sbjct: 65  GKCRLLSLCEDCAGHDERDCQRLRELQLSDEQ----VQILQEKANTEVQPALACLLLGEH 120

Query: 189 SDR---YKEVYHLETHLQDMRPEDLY 211
                 ++E+  +ETHL+  R  +++
Sbjct: 121 QHSRPLFEEMSQMETHLEARRGTEIW 146


>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
          Length = 576

 Score = 37.0 bits (84), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 18/63 (28%), Positives = 24/63 (38%)

Query: 109 KPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
           +P+  V     +   CNHC         C  C    YC   C+R  W   H+ EC    +
Sbjct: 54  EPLVVVPFGVAAAQTCNHCLDPRRPAKACTGCRAVAYCGSRCQRAHWTAVHKLECRVLKK 113

Query: 169 LLE 171
            LE
Sbjct: 114 ALE 116


>gi|195584110|ref|XP_002081858.1| GD11242 [Drosophila simulans]
 gi|194193867|gb|EDX07443.1| GD11242 [Drosophila simulans]
          Length = 498

 Score = 37.0 bits (84), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 66/147 (44%), Gaps = 20/147 (13%)

Query: 78  SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           + K +++ ++  GR V+A+ ++  G+TL +E P+  V+LP      C  C     +   C
Sbjct: 9   ARKCEIKHNDMLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCTKCFNLTES--FC 64

Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
            +C L   C +        C+R A +          + LL+      +   LK +L+  H
Sbjct: 65  RKCRLLALCEDCSDHDERDCKRLAEMNLS----DDQVELLQKKEHTEIQPVLKCLLLREH 120

Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
            +    Y+E+  +E+ L   R  D+++
Sbjct: 121 EETLPLYEEMSQMESQLMTRRGTDIWK 147


>gi|342321104|gb|EGU13041.1| Hypothetical Protein RTG_00754 [Rhodotorula glutinis ATCC 204091]
          Length = 1150

 Score = 37.0 bits (84), Expect = 5.8,   Method: Composition-based stats.
 Identities = 14/28 (50%), Positives = 17/28 (60%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWECH 164
           C  C  A YCS+ C+ EAW + HR  CH
Sbjct: 927 CRRCRRAKYCSKMCQSEAWNQGHRHWCH 954


>gi|326431298|gb|EGD76868.1| hypothetical protein PTSG_08216 [Salpingoeca sp. ATCC 50818]
          Length = 564

 Score = 37.0 bits (84), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 58/150 (38%), Gaps = 38/150 (25%)

Query: 48  QVSSTSSSASSSVASLPS--LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTL 105
           Q +S    A S+  S PS  L C E   ++       + TS    R ++A +D   GD +
Sbjct: 23  QATSRHVEAHSTAHSTPSSTLGCTEQRRVVPI-----VYTSTMTERRLVADKDYIVGDVV 77

Query: 106 FVEKPVA---FVILPPCSMSNCNHCCTSISA---------------PIP----------- 136
             + P     F     C    C+HC  S+ +               P P           
Sbjct: 78  LKDTPFVCAQFSWNRKCGYVACSHCLRSLESAEAMATRLAGQDVKLPHPELCPNNGRERV 137

Query: 137 -CNECILAVYCSESCRREAWLRYHRWECHG 165
            C +C   VYC+E+CR +AW  +H+  C G
Sbjct: 138 TCTQC-EEVYCNEACRDKAWQLHHKVLCPG 166


>gi|71420095|ref|XP_811365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70876023|gb|EAN89514.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 429

 Score = 37.0 bits (84), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
           GR + A++ ++    + +++P    ++       C HC  +I      +  + C  C   
Sbjct: 163 GRGIYALDRINSSTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219

Query: 144 VYCSESCRREAWLRYH 159
            YCS +CR  AW  YH
Sbjct: 220 TYCSVACRDAAWREYH 235


>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 518

 Score = 37.0 bits (84), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 38/99 (38%), Gaps = 18/99 (18%)

Query: 82  KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-----------------C 124
           K+     KG+ V+AV  +  G  +    P  FV L    M+                  C
Sbjct: 10  KVDYKENKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDAENEETNYRTVNIC 69

Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            +C    S  + C  C   VYCS+ C   AW + HR EC
Sbjct: 70  FYCFEKFSKSVYCPNCKYVVYCSDVCLDLAW-KLHREEC 107


>gi|296221259|ref|XP_002756485.1| PREDICTED: tudor domain-containing protein 1 [Callithrix jacchus]
          Length = 1205

 Score = 37.0 bits (84), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 60/147 (40%), Gaps = 11/147 (7%)

Query: 18  RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSV----ASLPSLACGESHT 73
           RL+ + + + K + L  E++++          SS  SS +  V            G S +
Sbjct: 80  RLKSSENGNKKNNFLLCEQTKQYLASQEDNSASSNPSSINGEVVGSQGDRKKSPAGNSES 139

Query: 74  LMCASNK--IKMQTSNAKGRHVIAVEDVHK--GDTLFVEKPVAFVIL-PPCSMSNCNHCC 128
                N    ++ T      H    + +HK  G++L +  P  F  L PP   + C+ C 
Sbjct: 140 SPSVENNSPKEVNTKPENNVHPAKSKKLHKLVGNSLSINNPGLFTSLRPPLRSTTCHRC- 198

Query: 129 TSISAPIPCNECILAVYCSESCRREAW 155
             +   + C++C    YCS +C+R  W
Sbjct: 199 -GLFGSLRCSQCKQTYYCSTACQRRDW 224


>gi|315045996|ref|XP_003172373.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
 gi|311342759|gb|EFR01962.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
          Length = 1210

 Score = 36.6 bits (83), Expect = 6.2,   Method: Composition-based stats.
 Identities = 16/42 (38%), Positives = 23/42 (54%)

Query: 122  SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            SNC     S +  + C+ C +A YCS+ C+RE W   H+  C
Sbjct: 1162 SNCGKKPGSGTKLLRCSGCKIAEYCSKECQREDWKVTHKSVC 1203


>gi|356556906|ref|XP_003546761.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
           max]
          Length = 989

 Score = 36.6 bits (83), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)

Query: 115 ILPPCSMS-NCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWECH 164
           ++PP S S N +H C   SAP    C+ C +  YCS +C+   W   H+ EC 
Sbjct: 62  VIPPVSASKNEHHVCARCSAPARTRCSRCKIVRYCSGNCQIIHWRLNHKQECQ 114


>gi|384497176|gb|EIE87667.1| hypothetical protein RO3G_12378 [Rhizopus delemar RA 99-880]
          Length = 288

 Score = 36.6 bits (83), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 23/48 (47%)

Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           L  C+  +C    T       C  C  A YCS+SC+ +AW   HRW C
Sbjct: 172 LRRCANMSCQRWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWC 219


>gi|351737484|gb|AEQ60519.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
          Length = 374

 Score = 36.6 bits (83), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           C +C  +I  P+ CN+C    YCSE C+ E +  YH  +C
Sbjct: 328 CFYCNKNIEKPVVCNKCFRIKYCSEKCQSE-YNSYHSDDC 366


>gi|311977714|ref|YP_003986834.1| putative zinc finger MYND domain-containing protein [Acanthamoeba
           polyphaga mimivirus]
 gi|82000266|sp|Q5UQS2.1|YR331_MIMIV RecName: Full=Putative zinc finger MYND domain-containing protein
           R331
 gi|55416950|gb|AAV50600.1| unknown [Acanthamoeba polyphaga mimivirus]
 gi|308204761|gb|ADO18562.1| putative zinc finger MYND domain-containing protein [Acanthamoeba
           polyphaga mimivirus]
 gi|339061264|gb|AEJ34568.1| hypothetical protein MIMI_R331 [Acanthamoeba polyphaga mimivirus]
          Length = 374

 Score = 36.6 bits (83), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           C +C  +I  P+ CN+C    YCSE C+ E +  YH  +C
Sbjct: 328 CFYCNKNIEKPVVCNKCFRIKYCSEKCQSE-YNSYHSDDC 366


>gi|328700944|ref|XP_003241434.1| PREDICTED: hypothetical protein LOC100575277 [Acyrthosiphon pisum]
          Length = 1525

 Score = 36.6 bits (83), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 2/36 (5%)

Query: 128  CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
            CT+ S+   CN C+ A YCS  C++  W  YH+ EC
Sbjct: 1488 CTNFSS-FSCNTCLAAFYCSIQCQKSDW-HYHQNEC 1521


>gi|449542915|gb|EMD33892.1| hypothetical protein CERSUDRAFT_86668 [Ceriporiopsis subvermispora
           B]
          Length = 245

 Score = 36.6 bits (83), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 24/117 (20%)

Query: 121 MSNCNHCCTSISAP---IPCNECILAVYCSESCRREAWLRYHRWEC-----------HGA 166
           +S C HC +S         C  C +  YCS  C++ AW   H+ +C           H  
Sbjct: 25  LSQCGHCYSSRGDGTTLFKCGACQVDTYCSRQCQKAAWA-VHKEKCAINRCAQEAGVHAQ 83

Query: 167 LRLLEAV------GIAHLALKLILVS---SHSDRYKEVYHLETHLQDMRPEDLYQYV 214
           L+ L          IA  A++ + ++   + ++R+  + HL+T     R + LYQ V
Sbjct: 84  LKNLRGYTDKHRPTIAEAAIRSLGLAQDVTRAERWCLIVHLQTRPNAQRTDRLYQCV 140


>gi|440463414|gb|ELQ32994.1| hypothetical protein OOU_Y34scaffold01005g18 [Magnaporthe oryzae
           Y34]
 gi|440481340|gb|ELQ61939.1| hypothetical protein OOW_P131scaffold01138g58 [Magnaporthe oryzae
           P131]
          Length = 374

 Score = 36.6 bits (83), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)

Query: 124 CNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
           CN  CT++ +AP+ C  C  A YCS+ C+R  W
Sbjct: 29  CNALCTNMHTAPLRCTGCGAATYCSKRCQRADW 61


>gi|431912257|gb|ELK14394.1| Ankyrin repeat and MYND domain-containing protein 1 [Pteropus
           alecto]
          Length = 1079

 Score = 36.6 bits (83), Expect = 6.8,   Method: Composition-based stats.
 Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWEC 163
           C  C  S+   + PC  C   + CS++C+ +AW  +H+ +C
Sbjct: 934 CYQCGRSVGVRLAPCARCYGMLACSKNCKTKAWADFHKRDC 974


>gi|407859371|gb|EKG06994.1| hypothetical protein TCSYLVIO_001875 [Trypanosoma cruzi]
          Length = 429

 Score = 36.6 bits (83), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 9/76 (11%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
           GR + A++ ++    + +++P    ++       C HC  +I      +  + C  C   
Sbjct: 163 GRGIYALDRINSSTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVRCAHCDRE 219

Query: 144 VYCSESCRREAWLRYH 159
            YCS +CR  AW  YH
Sbjct: 220 TYCSVACRDAAWREYH 235


>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 477

 Score = 36.6 bits (83), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 7/64 (10%)

Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECH------GALRLLEAVGIAHLALKLILVSSH 188
           + C  C  A YC + C+R AW ++HR EC         L L E + ++ +  +++ + ++
Sbjct: 50  LKCASCEFARYCDKECQRIAW-KHHRNECRRLKAVFPNLPLTEVLFMSRIVDRVLFLEAN 108

Query: 189 SDRY 192
            D+Y
Sbjct: 109 GDKY 112


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 36.6 bits (83), Expect = 7.1,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI----PCNECILAVY 145
           GR V+A ED+ KG  +    P A V+       NC  C   I+  I     C  C   + 
Sbjct: 222 GRCVLASEDLPKGTMVLRVSPFASVLEDHKIEKNCGFCFKKINKSIRINQTCKNCKNHLL 281

Query: 146 CSESCRREAWLRYHRWEC 163
           C +    E  L YH+ EC
Sbjct: 282 CPQCSVDEYSLNYHKDEC 299


>gi|154336305|ref|XP_001564388.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061423|emb|CAM38448.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 700

 Score = 36.6 bits (83), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 68/146 (46%), Gaps = 22/146 (15%)

Query: 32  LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCAS---------NKIK 82
           L +E+  +  +EA S Q SS ++++  ++A +     GE+   M A+          +++
Sbjct: 172 LGLEELGQKDIEA-SKQESSDAANSPEAMAKV----FGETAGGMAAACLPPRARVNGEVR 226

Query: 83  MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI- 141
           ++ S AKGR ++AV  + +G  L  E P A V      +S C++C    +     +E   
Sbjct: 227 IERS-AKGRGLVAVSQLTEGSVL-EETPYAIVARTETLLSVCSYCLQRSTCLYHGDEYRQ 284

Query: 142 -----LAVYCSESCRREAWLRYHRWE 162
                   +CS SC + AW  Y + E
Sbjct: 285 HHVKSRGFFCSPSCAKAAWEHYGQQE 310


>gi|354474200|ref|XP_003499319.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
           [Cricetulus griseus]
          Length = 1012

 Score = 36.6 bits (83), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
           C  C  SI   + PC  C   + CS+ C+ +AW  +H+ +C+  + L+
Sbjct: 948 CYQCGRSIGVRLTPCPRCYGILTCSKYCKTKAWTEFHKKDCNDIMILM 995


>gi|168002744|ref|XP_001754073.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694627|gb|EDQ80974.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 671

 Score = 36.6 bits (83), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 13/91 (14%)

Query: 89  KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILAVYC- 146
           K R + A +D+  G  +  E+P+A +IL     ++C+ C   +   P+ C  C + +YC 
Sbjct: 134 KQRGMHATKDLEAGSLVLEEEPMAALILKGHRNTHCHFCFEMLPPDPVVCFTCAIPLYCD 193

Query: 147 -----------SESCRREAWLRYHRWECHGA 166
                      SE    + W   H  EC GA
Sbjct: 194 VPCMGAACDESSEELDGQKWKGEHMHECGGA 224


>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
          Length = 530

 Score = 36.6 bits (83), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 5/80 (6%)

Query: 86  SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIPCNECILA 143
           S+ KGR + + +    GD +    P   +++P  S  N  C+HC         C+ C   
Sbjct: 11  SHRKGRGIFSTKSFAPGDVILPFTPT--ILIPSLSHINTICSHCFKQAEVR-ACSRCHAV 67

Query: 144 VYCSESCRREAWLRYHRWEC 163
            YC  +C+   W   H  EC
Sbjct: 68  SYCDAACQAANWTAVHSKEC 87


>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 508

 Score = 36.6 bits (83), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 9/104 (8%)

Query: 68  CGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF-VEKPVAFVILPPCSMSNCNH 126
           CG +  ++C   + + + +   G  + A   +  G  +F +    A V+  P     C++
Sbjct: 2   CG-TERIVCIDPQAEPKQTVDMGLGLFATSKIRPGANVFAITANFATVLDTPRLKDTCSN 60

Query: 127 CCTSISAPI-------PCNECILAVYCSESCRREAWLRYHRWEC 163
           C  ++   +        C  C +  YC+E C+ E+W   H+ EC
Sbjct: 61  CFVTVGDEVNPGLTLKACTGCRVVKYCNERCQTESWAASHKKEC 104


>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
          Length = 369

 Score = 36.6 bits (83), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 31/131 (23%), Positives = 57/131 (43%), Gaps = 22/131 (16%)

Query: 96  VEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP------CNECILAVYCSES 149
           +  +H    +F E P A  +L P +   C +C   +  P P      C  C  ++YCS+ 
Sbjct: 24  ISSIHARIKIF-ETPFATQVLNPKANEFCANC---LRGPAPGEKLLRCGGCNFSMYCSKD 79

Query: 150 CRREAWLRYHRWEC------HGALRLLEAVGIAHLALKLILVSSHSDRY-----KEVYHL 198
           C+  AWL  H+ EC         L L E + ++ +  ++  +  + D+      +    L
Sbjct: 80  CQAIAWL-VHKPECKRLKASFPNLPLTEVLFLSKVIDRIQFLEKNGDKLGVEAERTFSSL 138

Query: 199 ETHLQDMRPED 209
             H +D+R ++
Sbjct: 139 VDHKKDIREDE 149


>gi|261329776|emb|CBH12758.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 429

 Score = 36.6 bits (83), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA--------PIPCNECI 141
           GR + A+  V  G  +  ++P  FV+    S + C HC +SI++         + C  C 
Sbjct: 161 GRGIYALTRVPSGTAVLADQP--FVVQRMNS-TTCAHCLSSITSASSTSSAAGVVCPHCG 217

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRL 169
              YCS SCR  AW  YH   CH   ++
Sbjct: 218 QESYCSISCRDAAWREYHSCCCHATNKM 245


>gi|255072833|ref|XP_002500091.1| predicted protein [Micromonas sp. RCC299]
 gi|226515353|gb|ACO61349.1| predicted protein [Micromonas sp. RCC299]
          Length = 465

 Score = 36.2 bits (82), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 12/74 (16%)

Query: 94  IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-------CNECILAVYC 146
           + V+D+H     F + P    + PP   +N    C +  A IP       C+ C +A YC
Sbjct: 167 LLVDDIH---VTFRQLPTDVALTPPTPTANI--LCVACRASIPIDTRKKFCSGCNIAAYC 221

Query: 147 SESCRREAWLRYHR 160
             +C++  W R+ +
Sbjct: 222 CRACQKADWKRHKK 235


>gi|384497433|gb|EIE87924.1| hypothetical protein RO3G_12635 [Rhizopus delemar RA 99-880]
          Length = 382

 Score = 36.2 bits (82), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 23/48 (47%)

Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           L  C+  +C    T       C  C  A YCS+SC+ +AW   HRW C
Sbjct: 245 LRRCANLSCRKWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWC 292


>gi|299749011|ref|XP_001838436.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
 gi|298408241|gb|EAU83370.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
          Length = 619

 Score = 36.2 bits (82), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 18/32 (56%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALR 168
           C  C   VYCS +C+ E W R H+ EC G  R
Sbjct: 439 CGRCHSVVYCSSACQHEDWERRHQRECKGMAR 470


>gi|389626527|ref|XP_003710917.1| hypothetical protein MGG_04563 [Magnaporthe oryzae 70-15]
 gi|351650446|gb|EHA58305.1| hypothetical protein MGG_04563 [Magnaporthe oryzae 70-15]
          Length = 374

 Score = 36.2 bits (82), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 124 CNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
           CN  CT + +AP+ C  C  A YCS+ C+R  W
Sbjct: 29  CNALCTKMHTAPLRCTGCGAATYCSKRCQRADW 61


>gi|72391782|ref|XP_846185.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62358390|gb|AAX78854.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70802721|gb|AAZ12626.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 429

 Score = 36.2 bits (82), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA--------PIPCNECI 141
           GR + A+  V  G  +  ++P  FV+    S + C HC +SI++         + C  C 
Sbjct: 161 GRGIYALTRVPSGTAVLADQP--FVVQRMNS-TTCAHCLSSITSASSTSSAAGVVCPHCG 217

Query: 142 LAVYCSESCRREAWLRYHRWECHGALRL 169
              YCS SCR  AW  YH   CH   ++
Sbjct: 218 QESYCSISCRDAAWREYHSCCCHATNKM 245


>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
 gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
          Length = 492

 Score = 36.2 bits (82), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 3/65 (4%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
           GR ++A +++ +G  +  E PV  ++ P       C +CC  +     C +C +AV C+ 
Sbjct: 17  GRFIVASKNIKQGQLILKENPV--LLCPQVGGPIICFNCCAQLKKLFFCPDCRIAVLCNP 74

Query: 149 SCRRE 153
           +C+ +
Sbjct: 75  NCKNQ 79


>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
           vitripennis]
          Length = 386

 Score = 36.2 bits (82), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 28/110 (25%), Positives = 39/110 (35%), Gaps = 32/110 (29%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA-------- 133
           T   KG+ + AV    +GD +  E+P+    F          C++C   +          
Sbjct: 12  TDEEKGKGLFAVSSYKEGDIILEERPLVCCQFAWNADYKYLACDYCMKPLETAEENARRL 71

Query: 134 --------PIP------------CNECILAVYCSESCRREAWLRYHRWEC 163
                   P P            C  C  A YCS  C+ EAW RYH+  C
Sbjct: 72  SGKSDLILPFPECCETKKDSISECESC-GAKYCSTECQNEAWQRYHQILC 120


>gi|150866965|ref|XP_001386745.2| hypothetical protein PICST_63965 [Scheffersomyces stipitis CBS
           6054]
 gi|172044079|sp|A3M0J3.2|SET5_PICST RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|149388221|gb|ABN68716.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 478

 Score = 36.2 bits (82), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 18/88 (20%)

Query: 80  KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHC----- 127
           KI++  ++ +G+ + A   + KG+ ++ E P+ F+  PP +  N       C+HC     
Sbjct: 110 KIQIVMTSKRGKGLYAKHKIAKGELIWEETPLFFI--PPLANINLVKTGKACSHCGKLLQ 167

Query: 128 ---CTSISAPIPCNECILAVYCSESCRR 152
               +S+   + CN C   V+CS+ C++
Sbjct: 168 TTTGSSMLKGLDCNVC-PEVWCSQQCKK 194


>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 541

 Score = 36.2 bits (82), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 4/127 (3%)

Query: 81  IKMQTSNA--KGRHVIAVEDVHKGDTLFV-EKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
           IKM  +     GR + A  D   G  + V + P+        S + C+HC    +    C
Sbjct: 14  IKMHGTKGFSPGRCLRAARDYKPGQLMAVFDDPLVAFPDAAGSKTTCHHCLVHNAKVFGC 73

Query: 138 NECILAV-YCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
             C  AV YCS+ C++  W   H  EC    ++  AV    L   +  +     R  EV+
Sbjct: 74  TGCDKAVSYCSDECQKVNWKLIHSKECKVFRKVQAAVSKNWLPTPVRALVQILLRLSEVH 133

Query: 197 HLETHLQ 203
            + T L+
Sbjct: 134 EVVTGLE 140


>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
 gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
          Length = 387

 Score = 36.2 bits (82), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 47/118 (39%), Gaps = 32/118 (27%)

Query: 81  IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV---------AFVILP------PCSMSNCN 125
           ++++T+  KGR + A + +  G  LF E P+         A+  L       P   + CN
Sbjct: 4   VEVKTTAEKGRGLFARQPIEAGTVLFEESPLVSSQFCWNAAYGYLACEYCLRPLETAECN 63

Query: 126 --HCCTSISAPIPCNECI---------------LAVYCSESCRREAWLRYHRWECHGA 166
                +  +  +P  EC                  +YCS+ C  +A  RYH+  C GA
Sbjct: 64  VRRLASDFAIDLPYPECCSVQEQLERHTRCADCGTLYCSQGCLNDALERYHKGLCLGA 121


>gi|299739523|ref|XP_001839567.2| hypothetical protein CC1G_12030 [Coprinopsis cinerea okayama7#130]
 gi|298403816|gb|EAU82242.2| hypothetical protein CC1G_12030 [Coprinopsis cinerea okayama7#130]
          Length = 628

 Score = 36.2 bits (82), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 27/47 (57%), Gaps = 3/47 (6%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC---HGALRLLEAVGIAHLAL 180
           C+ C   +YCS+ C++E W   HR+EC     + R+ EA G+ +  L
Sbjct: 428 CSGCQTVLYCSQQCQKEDWESRHRYECMVMRHSYRVNEARGVKYSPL 474


>gi|299744221|ref|XP_001840954.2| hypothetical protein CC1G_03183 [Coprinopsis cinerea okayama7#130]
 gi|298406021|gb|EAU81007.2| hypothetical protein CC1G_03183 [Coprinopsis cinerea okayama7#130]
          Length = 590

 Score = 36.2 bits (82), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 5/54 (9%)

Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWEC----HG-ALRLLEAVGIAH 177
           T +  P  C+ C   VYCSE C++  W   H+ EC    HG  +R    VG +H
Sbjct: 375 TELVLPKACSGCHTVVYCSEKCQKADWESRHKDECNVMKHGHMIRKQNGVGYSH 428


>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
          Length = 436

 Score = 36.2 bits (82), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 33/79 (41%), Gaps = 1/79 (1%)

Query: 85  TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
            S  KGR + A+     G+ LF       V+      S+C+ C         C  C  A 
Sbjct: 16  ASPGKGRGLRALRRYAVGELLFSCPAYTAVLTVSERGSHCDGCFARKEGLSKCGRCKQAF 75

Query: 145 YCSESCRREAWLRYHRWEC 163
           YC+  C++E W   H+ EC
Sbjct: 76  YCNVECQKEDW-PMHKLEC 93


>gi|303282879|ref|XP_003060731.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458202|gb|EEH55500.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 375

 Score = 36.2 bits (82), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 4/56 (7%)

Query: 121 MSNCNHCCTSISAPIP----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
           +SN    C + +A +     C  C L  YC ++C++E W + H+ EC  A    E 
Sbjct: 255 VSNACFSCGARAAGVERYSQCGGCHLVAYCGKACQKEDWKKRHKAECRPAKSFREG 310


>gi|344282241|ref|XP_003412882.1| PREDICTED: DNA polymerase theta [Loxodonta africana]
          Length = 2567

 Score = 36.2 bits (82), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 15/109 (13%)

Query: 13   SLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTS----------SSASSSVAS 62
            SL LC  +D+ +LD + + +  + + EN+ +  +   S+T+          SSASSS+  
Sbjct: 1285 SLALCDFEDSLYLDTQSEKIIQQMATENAKQEGAKDSSTTAGMRQTSVDGRSSASSSLKE 1344

Query: 63   LPSLACGESHTL-MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
            +P    GE H++    +N ++++TS+   +      D H  D L  + P
Sbjct: 1345 IPVTLPGERHSVGTTGTNHLELKTSDTPKQR----SDSHAIDILTSQSP 1389


>gi|308485700|ref|XP_003105048.1| CRE-SET-10 protein [Caenorhabditis remanei]
 gi|308256993|gb|EFP00946.1| CRE-SET-10 protein [Caenorhabditis remanei]
          Length = 436

 Score = 36.2 bits (82), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 5/65 (7%)

Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
           TL  E P+A V+ P      C  C   +        + C++C    YCS  C+R+ W   
Sbjct: 3   TLARENPLAAVLCPQFQDFYCATCFAELDVNGEAEILMCDDCSEVSYCSLKCQRKDWRSV 62

Query: 159 HRWEC 163
           H++EC
Sbjct: 63  HQFEC 67


>gi|395327793|gb|EJF60190.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
          Length = 442

 Score = 36.2 bits (82), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 49/110 (44%), Gaps = 13/110 (11%)

Query: 47  MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
           +Q  S S +AS  V     L  G   T      ++K     AKG+ ++A E + +G+ ++
Sbjct: 61  VQTLSASGAASGQVHPTSKLIEGLDVTKWTTKAEVKY-FGKAKGKGLVAKEAIAEGEVIW 119

Query: 107 VEKPVAFVILPPCSMSN-------CNHCCTSISAP---IPCNECILAVYC 146
            E P  F++ P  ++ +       C HC T ++     +PC+    +  C
Sbjct: 120 KEDP--FILAPEWNLYDLQMSSHACGHCSTPLTTSPLIVPCSASTSSTPC 167


>gi|384494170|gb|EIE84661.1| hypothetical protein RO3G_09371 [Rhizopus delemar RA 99-880]
          Length = 432

 Score = 36.2 bits (82), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
           C  C  A YCS++C+ +AW   HRW C
Sbjct: 276 CRRCRKAKYCSKACQSKAWADGHRWWC 302


>gi|330790441|ref|XP_003283305.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
 gi|325086730|gb|EGC40115.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
          Length = 754

 Score = 36.2 bits (82), Expect = 9.8,   Method: Composition-based stats.
 Identities = 13/43 (30%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
           +  C  C  ++  P+ C  C    YCS+ C+++ WL  H+ +C
Sbjct: 668 LYKCWSCKLTLIKPLICGYCKTVAYCSKECQKDHWL-VHKEQC 709


>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
 gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
          Length = 518

 Score = 36.2 bits (82), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 58/131 (44%), Gaps = 20/131 (15%)

Query: 90  GRHVIAVEDVHKGDTLFVEKPVAFVILP---------PCSMSNCNHCCTSISAPIPCNEC 140
           GR V+A  D+ +G+ + +E+P  +V  P          C   +C+ C     AP+ C +C
Sbjct: 17  GRFVVAKRDLTRGEQILLEEP--YVTGPYWDADVSCLNCFRDSCSTCRQCGVAPL-CYDC 73

Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLET 200
            +      S  RE+ L  H    H          +      L+L  S+ +RY EV  +E+
Sbjct: 74  KIHDEAECSFYRESSLDRHFLYNH--------FNVVMPVRCLMLYRSNRERYDEVMTMES 125

Query: 201 HLQDMRPEDLY 211
            L++ R  +++
Sbjct: 126 RLEERRGTEIW 136


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.128    0.384 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,983,881,819
Number of Sequences: 23463169
Number of extensions: 104969295
Number of successful extensions: 434407
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 559
Number of HSP's successfully gapped in prelim test: 637
Number of HSP's that attempted gapping in prelim test: 433090
Number of HSP's gapped (non-prelim): 1373
length of query: 214
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 78
effective length of database: 9,168,204,383
effective search space: 715119941874
effective search space used: 715119941874
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)