BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy6242
         (541 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242007168|ref|XP_002424414.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507814|gb|EEB11676.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 563

 Score =  355 bits (911), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 193/472 (40%), Positives = 291/472 (61%), Gaps = 40/472 (8%)

Query: 1   MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           M+++ E  D+ L +C +C     +KC++CK+V YC K+HQKE WK HK +C+  P+++KS
Sbjct: 1   MSTVGEEIDKLLEKCKICDVQTTNKCTSCKKVNYCCKEHQKEDWKNHKIQCR--PFQMKS 58

Query: 61  SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
           S  LGR+L+AT  ++ G+ +  E P++ GP+    E +CLGC+K          +CP+C 
Sbjct: 59  SLELGRFLEATRRINAGEILFTEYPIVFGPRPE--EIVCLGCYK-----TEPEEKCPKCL 111

Query: 121 WPACSARCSGLSDAHTHAPECAILKLGC-----ETLLAYNDYKYEAILPLRCLILQRRSP 175
           WPACS+ C GL++   H  EC +L+L       ++ +  + ++++ I PLRCL+LQ+++P
Sbjct: 112 WPACSSSCPGLTNPDHHGSECFLLRLYTRNNQQQSYVTDDYFRFDVIFPLRCLLLQKKNP 171

Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGD-----------VLDD 223
             ++ +  ME+HM KRGP TE+Y++  + IVKYL  NF + L              + D+
Sbjct: 172 NGWKIITGMESHMKKRGPNTEIYKQTNEKIVKYLNDNFFKFLNDSGNREPDTEMILLPDE 231

Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKH----SNMMQSKL 275
            + + LH ICG+I+VN ++I    G     LY    LMEHNCLPN +H    S   + K+
Sbjct: 232 RTPEILHKICGIIDVNSLEIQSPKGGTLNALYETAFLMEHNCLPNTRHIFQPSRRFKIKV 291

Query: 276 FVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
           F   D     HISTMYT+ALWGTQ R++HL  TK+F C C RC DP+E+GT  S +KCL 
Sbjct: 292 FANNDIQPGNHISTMYTHALWGTQQRQDHLRSTKHFICRCRRCGDPSEMGTNLSGLKCLA 351

Query: 332 EHKDQGDCW--ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
              DQ  C    LP+ PLD+ + W C  C   L +  V    + +G++VD+++ + P ++
Sbjct: 352 PTVDQKICGGTQLPMAPLDSITVWQCDKCPTSLTSGQVSDFVNMIGDEVDRILNKQPKIE 411

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
           SLE +L+KL+ + H +HYHCY+V HSL+QLYG QPGY   Q++  +LE+KI 
Sbjct: 412 SLETLLSKLKKLLHKNHYHCYSVMHSLVQLYGRQPGYLPHQMNDLMLEKKIG 463



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 5/52 (9%)

Query: 452 DEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
           +E +CLGC+K          +CP+C WPACS+ C GL++   H  EC +L+L
Sbjct: 91  EEIVCLGCYK-----TEPEEKCPKCLWPACSSSCPGLTNPDHHGSECFLLRL 137


>gi|193683766|ref|XP_001950081.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 533

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 181/446 (40%), Positives = 272/446 (60%), Gaps = 19/446 (4%)

Query: 4   IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           II    + +  CA C  +A  KC+AC  V YC K+HQK HWK HK +C  L YE++    
Sbjct: 7   IISKHLKNISHCAFCGNSASSKCAACTLVVYCSKEHQKAHWKQHKNEC--LSYELQIDSN 64

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA-DNARCPRCFWP 122
           +GR+L A   ++P + I +E PL++GPK   +EPIC+ C + L  + +   + C +C WP
Sbjct: 65  VGRHLIAKRTINPFEIIIQEEPLVIGPKFPTSEPICIKCLQRLKRSESLVESLCEKCLWP 124

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
            C   C    + + H  EC +L  G E +   NDY Y+A+ PL+CL+LQ     K+  L 
Sbjct: 125 ICGTGCVTSINKNIHEGECTVLVKGSEKIAKNNDYMYDALTPLKCLLLQFTDKNKWNRLM 184

Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD 242
           ++++HM  RGP +EVYEEI+S+  YL++N+   L G+   +TS+  +H ICG+++VN +D
Sbjct: 185 ELKSHMEYRGPESEVYEEINSVYNYLKNNY---LSGEEF-ETSSDLIHTICGILDVNALD 240

Query: 243 I---GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWG 293
           +   G     +Y  +  +EHNCLPN   S     +++V+      +  HI+TMYTNALWG
Sbjct: 241 VQVAGLELTAIYPTVSKLEHNCLPNTGISFDKCGRIYVYASRKITKGEHITTMYTNALWG 300

Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDW 353
           T+ RR HL  TKYF C C+RCSD TELGT FS + C  +   +G+  + P++PLD+ S+W
Sbjct: 301 TRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVCNVKGFCKGN--LTPIHPLDDSSEW 358

Query: 354 TCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVK 413
            C  C   +++  +  + ++L   VDK +Q NP++ SLE+  +KL+   H +HY C+ VK
Sbjct: 359 ECDRCPNTVSSDKIDAILTELNHIVDKALQ-NPSINSLEDAFSKLKTRIHSNHYLCFNVK 417

Query: 414 HSLIQLYGTQPGYAYTQLSSSLLERK 439
           H+LIQLYG   GY ++ L+  LL+RK
Sbjct: 418 HTLIQLYGHSMGYKHSMLTDDLLKRK 443



 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 443 VMSPNLKATDEPICLGCHKPLNPNLA-DNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           V+ P    T EPIC+ C + L  + +   + C +C WP C   C    + + H  EC +L
Sbjct: 88  VIGPKF-PTSEPICIKCLQRLKRSESLVESLCEKCLWPICGTGCVTSINKNIHEGECTVL 146

Query: 502 KLGCETLLAYNDY 514
             G E +   NDY
Sbjct: 147 VKGSEKIAKNNDY 159


>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
          Length = 563

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 196/453 (43%), Positives = 280/453 (61%), Gaps = 34/453 (7%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           +CA C + A  KCSACK V+YC K HQK+HWK HK  C+  P+EI ++  +G+ L AT D
Sbjct: 5   KCAECDKPAELKCSACKLVSYCCKDHQKKHWKSHKTLCR--PFEITTTKEVGKCLVATRD 62

Query: 74  LHPGDRIARESPLIVGPKLALAE--PI-CLGCHKPLNPNLADNA-RCPRCFWPACSARCS 129
           L PGD I  E PL+ GP+  + E  P+ C GC + +   + +N+ RCP C +P C  RC 
Sbjct: 63  LSPGDVIISELPLVYGPRPHMVEEGPVPCPGCCRLI---ICENSPRCPGCDFPVCHPRCP 119

Query: 130 GLSDAHTHAPECAILKLGCETLLAYND----YKYEAILPLRCLILQRRSPKKYQELKDME 185
           GL D   H  EC IL L    + A N     Y+ + +L LRCL+LQ+++PKK+ +L +ME
Sbjct: 120 GLKDMEKHGHECLILSL--REIRAINGLHDFYRQDTLLALRCLLLQKKNPKKFAQLMEME 177

Query: 186 AHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIE 237
           AH+ KRGP TE+Y+++ +    YL   F   L         +VL D S + +H ICG+I+
Sbjct: 178 AHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISKETVHKICGIID 237

Query: 238 VNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHSN---------MMQSKLFVFRDTHISTM 286
           VN ++I +  +   LY    LMEH+CL N  HS           +++ L + +  HISTM
Sbjct: 238 VNALEINQDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTM 297

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YT+ALWGTQ RREHL  TKYF+C+C+RC DPTE+G+Y SA++CL    D  D + LP +P
Sbjct: 298 YTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRCLGTGVDSCDGYQLPADP 357

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
            D+++ W C  C  +L   +V  + +Q+GE+VD +   +P VK L+ +L+K+    HP+H
Sbjct: 358 TDDNTQWLCNKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDNLLSKMLTFLHPNH 417

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           YH Y+VKHSL+QLYG Q GY  +Q+S   + +K
Sbjct: 418 YHVYSVKHSLVQLYGYQQGYTPSQISDDTVLKK 450



 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 469 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
           ++ RCP C +P C  RC GL D   H  EC IL L
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHGHECLILSL 136


>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
          Length = 575

 Score =  347 bits (889), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 196/453 (43%), Positives = 280/453 (61%), Gaps = 34/453 (7%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           +CA C + A  KCSACK V+YC K HQK+HWK HK  C+  P+EI ++  +G+ L AT D
Sbjct: 5   KCAECDKPAELKCSACKLVSYCCKDHQKKHWKSHKTLCR--PFEITTTKEVGKCLVATRD 62

Query: 74  LHPGDRIARESPLIVGPKLALAE--PI-CLGCHKPLNPNLADNA-RCPRCFWPACSARCS 129
           L PGD I  E PL+ GP+  + E  P+ C GC + +   + +N+ RCP C +P C  RC 
Sbjct: 63  LSPGDVIISELPLVYGPRPHMVEEGPVPCPGCCRLI---ICENSPRCPGCDFPVCHPRCP 119

Query: 130 GLSDAHTHAPECAILKLGCETLLAYND----YKYEAILPLRCLILQRRSPKKYQELKDME 185
           GL D   H  EC IL L    + A N     Y+ + +L LRCL+LQ+++PKK+ +L +ME
Sbjct: 120 GLKDMEKHGHECLILSL--REIRAINGLHDFYRQDTLLALRCLLLQKKNPKKFAQLMEME 177

Query: 186 AHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIE 237
           AH+ KRGP TE+Y+++ +    YL   F   L         +VL D S + +H ICG+I+
Sbjct: 178 AHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISKETVHKICGIID 237

Query: 238 VNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHSN---------MMQSKLFVFRDTHISTM 286
           VN ++I +  +   LY    LMEH+CL N  HS           +++ L + +  HISTM
Sbjct: 238 VNALEINQDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTM 297

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YT+ALWGTQ RREHL  TKYF+C+C+RC DPTE+G+Y SA++CL    D  D + LP +P
Sbjct: 298 YTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRCLGTGVDSCDGYQLPADP 357

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
            D+++ W C  C  +L   +V  + +Q+GE+VD +   +P VK L+ +L+K+    HP+H
Sbjct: 358 TDDNTQWLCNKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDNLLSKMLTFLHPNH 417

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           YH Y+VKHSL+QLYG Q GY  +Q+S   + +K
Sbjct: 418 YHVYSVKHSLVQLYGYQQGYTPSQISDDTVLKK 450



 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 469 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
           ++ RCP C +P C  RC GL D   H  EC IL L
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHGHECLILSL 136


>gi|357620844|gb|EHJ72885.1| hypothetical protein KGM_13877 [Danaus plexippus]
          Length = 517

 Score =  327 bits (837), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 173/413 (41%), Positives = 250/413 (60%), Gaps = 35/413 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI----CLGCHKPLNPNLA 111
           YEI+ S  LGRYL A  DL P D +  E PL+ GPK ++ +P     C+GC+KP+  ++ 
Sbjct: 6   YEIRKSEELGRYLVAARDLTPDDVVLTELPLVYGPK-SMPDPEALMPCVGCYKPIFTDVG 64

Query: 112 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLL--AYNDYKYEAILPLRCLI 169
           +  RC RC WP CS  C GL D   H  EC IL    E +L    + Y+++A+LPLRC +
Sbjct: 65  E--RCSRCGWPVCSGNCPGLKDPLHHGVECEILSARPECVLDNMADYYRHDALLPLRCAL 122

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSI-VKYLRSNFLEKLPGDV----LDDT 224
           LQ     K+++L ++++HM  R PGT+ Y+E D   VKYL + F+ K+  +V    L+  
Sbjct: 123 LQYTDDDKWKKLLELQSHMECRVPGTDAYDEADEFTVKYLMNVFINKIDKNVKNKYLNLI 182

Query: 225 SAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSK------ 274
           S + LH ICG+I+ N ++I    G     LY+  C+MEH+C+PN KH      K      
Sbjct: 183 SGELLHKICGIIDTNALEIRLPNGSELNALYATTCMMEHSCVPNTKHLFNTSGKDVKDKY 242

Query: 275 -------LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
                  + + +  H++TMY++ALWGTQ RR+HL  TKYF+C C RCSDPTELGTY SAM
Sbjct: 243 KITVKVVVPINKGDHVATMYSHALWGTQARRQHLKDTKYFSCKCIRCSDPTELGTYLSAM 302

Query: 328 KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
           KC  + K   D   LP +PLD ++DW C  C+ ++N   ++++ S++GE+V+ +     +
Sbjct: 303 KCFGDDKGPCDGIHLPEDPLDEETDWACNKCTVKVNNSQINILISEMGEEVENVQMMGGS 362

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           V  LE +L +L    HP+HYH Y++KHSLIQLYG Q  Y    +S  +L++KI
Sbjct: 363 VNMLENILCRLSTFLHPNHYHLYSIKHSLIQLYGRQSSY----MSEEILDKKI 411



 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 2/46 (4%)

Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           C+GC+KP+  ++ +  RC RC WP CS  C GL D   H  EC IL
Sbjct: 52  CVGCYKPIFTDVGE--RCSRCGWPVCSGNCPGLKDPLHHGVECEIL 95


>gi|170034227|ref|XP_001844976.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167875488|gb|EDS38871.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 548

 Score =  311 bits (796), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 174/453 (38%), Positives = 256/453 (56%), Gaps = 35/453 (7%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
           +CA+C   A  KC+ CK V YC  +HQK+HWK  HK +C K PYE+  +  +GR+  AT 
Sbjct: 7   KCALCGVEASLKCAGCKRVVYCSPEHQKKHWKRQHKNECAK-PYELARNEEVGRFFLATE 65

Query: 73  DLHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACS 125
            +     +  ESP+++GPK  L E         C+GC       +    RC RC WP C 
Sbjct: 66  AIPKDTILFTESPMVIGPKWNLDEYEQRSAIVPCVGCFTDCQLGVY---RCDRCNWPVCK 122

Query: 126 ARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
             C GL +A+ HA EC IL+ G       +    ++ Y+Y+A+L L+CL LQ  +   + 
Sbjct: 123 PDCPGLVNANLHAIECPILRFGGGPKPRDDPEAVFDYYRYDAMLVLKCLALQIHNRPLFD 182

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
           ++  +E+H   R  G++ Y + D   V+YL  NFL  L     ++   K    ICG++EV
Sbjct: 183 QMMQLESHYEAR-KGSQYYRDADDRTVQYLLKNFLAPLKKQ--EEIQGKT---ICGILEV 236

Query: 239 NGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF------VFRDTHISTMYT 288
           N + I    GR   GLY + C++EH C+PN  +++   + L            H+ST YT
Sbjct: 237 NAMVIPLTNGREICGLYPIGCMLEHCCMPNCFYTSTFLNLLVYDLTNSFLSGEHLSTTYT 296

Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLD 348
           +ALWGTQ RR+HL   KYF+CSC RC+DPTELGTY SA++CL         + LP++ L+
Sbjct: 297 HALWGTQQRRDHLKTNKYFSCSCARCADPTELGTYLSALRCLGVDGVGCSGYQLPIDSLN 356

Query: 349 NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYH 408
           + SDW C  C  ++ A  V+ + +++GE++D  +++  ++K  E+M+ KL    HP+HYH
Sbjct: 357 DASDWKCNQCPVQIEADQVNFLLAKIGEEMDDAIEQKTSIKQKEDMIAKLLTFLHPNHYH 416

Query: 409 CYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
            +A+KHSLIQ+YG   GY   QLS   L  KI 
Sbjct: 417 IFALKHSLIQMYGHLKGYQTPQLSDRDLADKIG 449



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 3/49 (6%)

Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG 504
           C+GC       +    RC RC WP C   C GL +A+ HA EC IL+ G
Sbjct: 99  CVGCFTDCQLGVY---RCDRCNWPVCKPDCPGLVNANLHAIECPILRFG 144


>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
 gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
          Length = 565

 Score =  302 bits (773), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 180/460 (39%), Positives = 263/460 (57%), Gaps = 39/460 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
           CA+C   +  KC+ CK V YC  +HQK+HW++ HK +C K PYE+  +  +GR+  AT +
Sbjct: 8   CALCGVPSSLKCAGCKLVVYCSPEHQKKHWRMQHKNECAK-PYELSRNEEVGRFFLATEE 66

Query: 74  LHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACSA 126
           +     +  E+P+++GPK  LAE         C+GC       +    RC  C WPAC  
Sbjct: 67  IAKDTILFTEAPMVIGPKWNLAEYEQRSMTVPCVGCFADCQLGMY---RCEACQWPACKP 123

Query: 127 RCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQE 180
            C GL +A+ HA EC IL+ G       +    ++ Y+Y+A+L L+CL LQ R+ + + +
Sbjct: 124 DCPGLENANLHALECGILRFGSGPKPRDDPEAVFDYYRYDALLALKCLALQIRNRQLFDQ 183

Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFL------EKLPGD-VLDDTSAKCLHWI 232
           +  +E+H   R  GT  Y+E D   V+YL  NFL      EK  G  VL     K LH I
Sbjct: 184 MLQLESHYEAR-KGTPFYKEADERTVQYLLKNFLNPLKKMEKQQGKTVLPVCDVKTLHKI 242

Query: 233 CGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFR-------D 280
            G++EVN + I    GR   GLY + C++EH C+PN  ++ N  +     F+        
Sbjct: 243 SGILEVNAMVIPLSNGREICGLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKG 302

Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
            H++T YT++LWGTQ RR+HL   KYF+CSC RC+DPTELGTY SA+KC+    +    +
Sbjct: 303 EHLTTTYTHSLWGTQQRRDHLKANKYFSCSCARCADPTELGTYLSALKCMGIDGNACAGY 362

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
            LP++PL   SDW C  C   + +  V+ + S++GE+VD  ++   +VK LE++++KL  
Sbjct: 363 QLPIDPLKETSDWKCNQCPITIESDQVNFLLSKIGEEVDDAMERKSSVKQLEDLISKLLT 422

Query: 401 MFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             HP+H+    +KHSLIQ+YG   GYA  QLS  +L  KI
Sbjct: 423 FLHPNHHFLLQLKHSLIQMYGHFKGYATNQLSDRILAEKI 462



 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 36/82 (43%), Gaps = 11/82 (13%)

Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGC--------ET 507
           C+GC       +    RC  C WPAC   C GL +A+ HA EC IL+ G         E 
Sbjct: 99  CVGCFADCQLGMY---RCEACQWPACKPDCPGLENANLHALECGILRFGSGPKPRDDPEA 155

Query: 508 LLAYNDYKSLSRAYLDHQQVRS 529
           +  Y  Y +L        Q+R+
Sbjct: 156 VFDYYRYDALLALKCLALQIRN 177


>gi|198461482|ref|XP_001362030.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
 gi|198137355|gb|EAL26610.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 179/458 (39%), Positives = 253/458 (55%), Gaps = 44/458 (9%)

Query: 13  MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
           M CAVC   A   C+ CK V YC K+HQK+HW  HK  CK  P   K   +LGR+L AT 
Sbjct: 4   MDCAVCGVAAAQTCTRCKIVRYCDKEHQKQHWPEHKRCCK--PIREKKDDVLGRFLVATQ 61

Query: 73  DLHPGDRIARESPLIVGPKLALAEPI-------CLGCHKP--LNPNLADNARCPRCFWPA 123
           D+     I  E PL+VGPK  ++E         C+GC+ P  L  +L     C  C WP 
Sbjct: 62  DIKANQIIFVEEPLVVGPKWYMSEAEKSSTTVPCVGCYTPCRLGKHL-----CRNCRWPV 116

Query: 124 CSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
           CSA C         A EC++L LG       +T    + Y+ +A+L L+CL+LQR+ PKK
Sbjct: 117 CSASCE------HEALECSVLSLGPGPSARADTRGLNDFYRGDALLVLKCLLLQRKDPKK 170

Query: 178 YQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKL---PGDVLDDTSAKCLHWIC 233
           + +L +M++H  +R  GTE+++E  D IV YL   FL++L     DVL+    + LH +C
Sbjct: 171 WTDLLEMQSHEEER-KGTELHQEAEDGIVSYLSQRFLQRLEQSKKDVLEHCEPELLHRLC 229

Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN---AKHSNMMQSKLFVFRD----TH 282
           G+IE N + +    G    GL+   C+MEH C PN       + MQ ++    D     H
Sbjct: 230 GIIETNYMVVELPTGIELSGLFRQACMMEHACQPNCYFQFDGSTMQIEVRAGGDLKKGDH 289

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           +   YTN LWGT  R+ HL +TK+F C CERC DPTE G+Y SA+ CL +         L
Sbjct: 290 LRITYTNILWGTHLRQHHLRLTKHFKCGCERCLDPTEFGSYVSALTCLGDVNKSCGGKQL 349

Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
           P++P+D  S W C +C   ++   V  + + + EQVDK++   P+   +E  L +L  M 
Sbjct: 350 PLDPVDEHSQWKCDACPMAMDTAYVTELQTHMTEQVDKMLAGRPSASEIELTLARLTQML 409

Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           HP+H+H + +KH+LIQLYG + G   + LS  +LERK+
Sbjct: 410 HPNHFHMFNLKHTLIQLYGNENGLELSSLSDGILERKL 447


>gi|350413549|ref|XP_003490027.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 480

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 161/417 (38%), Positives = 241/417 (57%), Gaps = 36/417 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C +C + A  KC+ CK+  YC K+HQ++ W  H+  C+   +EI+ S  LGRYL A+ DL
Sbjct: 9   CPICNQRATLKCNGCKQQFYCKKEHQRQDWPRHRSNCQA--WEIRESCELGRYLVASRDL 66

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
           +PGD I  ESPL+ GP L   + +C+GC K  N   + + RC +C WPAC   CSGL+D 
Sbjct: 67  NPGDVILSESPLVWGPALHSDQRVCVGCGKQCN---SSDTRCTKCLWPACDTDCSGLTDK 123

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
           + H  EC+ L      + A    + + +L +R LIL R+  K +  ++ +++H   RG G
Sbjct: 124 NRHGLECSFL------VKARIIPRCDVLLVIRMLILWRKKLKYWYSIEKLQSHQDSRGQG 177

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQGLYS 252
           T  YEE  +I +++     E+L  D  + +S   +H ICG+I+VN ++         +Y 
Sbjct: 178 TNAYEETMNIYQHI-----ERLLPD--NPSSKDIVHKICGLIDVNALETIPPEGCVAIYE 230

Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
             CL+EH+CL N +HS  +  K            + +  H+STMYT+ALW T+ RR HL 
Sbjct: 231 TACLLEHSCLANTRHSFKIDDKGRPRITVIAVCSIQKWDHLSTMYTHALWSTRARRAHLL 290

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
            TKYF+C C+RC+DPTELGT+   +KC +++      +ILP +PL+ DS W C SC   L
Sbjct: 291 ETKYFSCHCKRCADPTELGTHLGTLKCPHDND-----FILPKDPLNFDSQWGCISCPGTL 345

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            A +V     +L E VD+++ +  N   L ++L++L  + HP H  C +V HSLIQL
Sbjct: 346 TASEVMQFVGKLEEDVDEIMYQ-ANKNKLVDLLSRLTTLLHPGHQLCTSVSHSLIQL 401



 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           +D+ +C+GC K  N   + + RC +C WPAC   CSGL+D + H  EC+ L
Sbjct: 86  SDQRVCVGCGKQCN---SSDTRCTKCLWPACDTDCSGLTDKNRHGLECSFL 133


>gi|340710128|ref|XP_003393648.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
          Length = 480

 Score =  285 bits (729), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 160/417 (38%), Positives = 241/417 (57%), Gaps = 36/417 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C +C + A  KC++CK+  YC K+HQ++ W  H+  C+   +EI+ S  LGRYL A+ DL
Sbjct: 9   CPICNQRATLKCNSCKQQFYCKKEHQRQDWPRHRSNCQA--WEIRESCELGRYLVASRDL 66

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
           +PGD I  ESPL+ GP L   + +C+GC K      + + RC +C WPAC   CSGL+D 
Sbjct: 67  NPGDVILSESPLVWGPALHSDQRVCVGCGKQC---YSGDTRCTKCLWPACDTNCSGLTDK 123

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
           + H  EC+ L      + A    + + +L +R LIL R+  K +  ++ +++H   RG G
Sbjct: 124 NRHGLECSFL------IKARIIPRCDVLLVIRMLILWRKKLKHWYSIEKLQSHQDSRGQG 177

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQGLYS 252
           T  YEE  +I  ++     E+L  D  + +S   +H ICG+I+VN ++         +Y 
Sbjct: 178 TNAYEETMNIYHHI-----ERLLPD--NPSSKDIVHKICGLIDVNALETIPPEGCVAIYE 230

Query: 253 VICLMEHNCLPNAKHSNMMQSKLF----------VFRDTHISTMYTNALWGTQPRREHLA 302
             CL+EH+CL N +HS  +  K            + +  H+STMYT+ALW T+ RR HL+
Sbjct: 231 TACLLEHSCLANTRHSFKIDDKGRPRITVIAVCPIQKWDHLSTMYTHALWSTRARRAHLS 290

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
            TKYF+C C+RC+DPTELGT+   +KC +++      +ILP +PL+ DS W C SC   L
Sbjct: 291 ETKYFSCHCKRCADPTELGTHLGTLKCPHDND-----FILPKDPLNFDSQWGCNSCPGTL 345

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            A +V     +L E VD+++ +  N   L ++L++L  + HP H  C +V HSLIQL
Sbjct: 346 TASEVMQFVGKLEEDVDEIMYQ-ANKNKLVDLLSRLTTLLHPGHQLCTSVSHSLIQL 401



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 3/51 (5%)

Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           +D+ +C+GC K      + + RC +C WPAC   CSGL+D + H  EC+ L
Sbjct: 86  SDQRVCVGCGKQC---YSGDTRCTKCLWPACDTNCSGLTDKNRHGLECSFL 133


>gi|28573973|ref|NP_610202.3| CG14590 [Drosophila melanogaster]
 gi|19528095|gb|AAL90162.1| AT24727p [Drosophila melanogaster]
 gi|28380690|gb|AAF57282.2| CG14590 [Drosophila melanogaster]
 gi|220949792|gb|ACL87439.1| CG14590-PA [synthetic construct]
          Length = 553

 Score =  281 bits (720), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 172/454 (37%), Positives = 252/454 (55%), Gaps = 40/454 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC   A   C+ CK V YC ++HQK+HW  HK +C+  P+  +    LGRYL+ T ++
Sbjct: 7   CPVCGVAASQACTRCKMVRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQNI 64

Query: 75  HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
             G  +  E PL+VGPK  L++         C+GC+ P         +C RC WP CSA 
Sbjct: 65  AAGQIVFIEEPLVVGPKWYLSDADKEASNVPCVGCYTPCR---LGKHQCRRCRWPVCSAG 121

Query: 128 CSGLSDAHTHAPECAILKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQEL 181
           C   S       EC++L LG  +       + NDY + +A+L L+CL+LQR+SP K+  L
Sbjct: 122 CKHES------MECSVLSLGSGSPTRADARSLNDYFRGDALLVLKCLLLQRQSPTKWSAL 175

Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIE 237
            +M++H  +R  GT++YEE +  +V YL+  FL +L     ++L D   + LH +CG+IE
Sbjct: 176 LEMQSHEEER-KGTDLYEEAEKRVVTYLQKRFLCRLKQTNPNLLTDCGPEMLHRLCGIIE 234

Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTM 286
            N + I    G    GL+   C+MEH C PN       +++    R         H+   
Sbjct: 235 TNFMVIELPSGVELSGLFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRIT 294

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YTN LWGTQ R+ HL +TK+F+C C RC DPTE GTY SA+ CL +         LPV+P
Sbjct: 295 YTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALTCLGDVNQTCGGTHLPVDP 354

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
           LD ++ W C +C   ++   V  + S + EQV+ L+   P+   +E +L +L  M HP+H
Sbjct: 355 LDENTQWKCDTCPMIVDGAYVAELQSHMTEQVEGLLAGCPSANQVELLLARLTHMLHPNH 414

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           +H + +KH+LIQLYG + G     LS++ LERK+
Sbjct: 415 FHTFNLKHTLIQLYGNEAGLELGVLSNTQLERKL 448


>gi|345483721|ref|XP_001601934.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
          Length = 480

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 160/418 (38%), Positives = 238/418 (56%), Gaps = 40/418 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC + A  +C +CK   YCGK+HQ+EHW  HK  C+   YE + +  LGR+L AT DL
Sbjct: 11  CAVCSKAATTRCGSCKLAFYCGKEHQREHWPRHKTSCRA--YEERENQELGRHLLATRDL 68

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 133
            P D I  E+P++ GP   + E +C+GC       + D   RCP C WPAC   C GL D
Sbjct: 69  KPDDLIISEAPIVWGPSSHVEERVCVGC------GVRDAVCRCPNCAWPACRLACDGLVD 122

Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGP 193
            + H  EC IL      L+   DY     +    LIL+R++PKK++ L  +++H   RGP
Sbjct: 123 ENRHGYECGILAKA--RLMPRCDYLLILRV----LILRRKNPKKWRVLSKLQSHEESRGP 176

Query: 194 GTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLY 251
           GTE +EE++++ +YL S          LD  SA+ L  +CG+I+VN ++      +  +Y
Sbjct: 177 GTEAHEEMEAVKQYLGSLL-------SLDQGSAEALPKVCGLIDVNALETNPPEGSAAIY 229

Query: 252 SVICLMEHNCLPNAKHSNMMQSK------LFVFRDT----HISTMYTNALWGTQPRREHL 301
              CL+EH C+ N ++S  +  +      ++   D     H+ST YT+ LW T+ RREHL
Sbjct: 230 ETACLLEHRCVANTRYSFELDEQGRPRINVYAVTDIKKADHLSTTYTHVLWATRVRREHL 289

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TKYF+C CERC+DPTELG++   +KC  E       ++ P +PL+ ++DW+C  C   
Sbjct: 290 LATKYFSCRCERCADPTELGSHLGTLKCPCEKG-----FVTPDDPLNPETDWSCKDCPGV 344

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           L+A +V  +T +LGE+VD  +    +   + ++L +L  + HP H HC  + HSL+QL
Sbjct: 345 LSAGEVAQLTDRLGEEVDAAMS-AADKDIMSDLLFRLTVLLHPCHQHCLTIGHSLMQL 401



 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 27/52 (51%), Gaps = 7/52 (13%)

Query: 451 TDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSDAHTHAPECAIL 501
            +E +C+GC       + D   RCP C WPAC   C GL D + H  EC IL
Sbjct: 88  VEERVCVGC------GVRDAVCRCPNCAWPACRLACDGLVDENRHGYECGIL 133


>gi|380026797|ref|XP_003697129.1| PREDICTED: uncharacterized protein LOC100866234 [Apis florea]
          Length = 480

 Score =  278 bits (710), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 162/418 (38%), Positives = 233/418 (55%), Gaps = 38/418 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C +C + A  KCS CK+  YC K+HQ+  W  HK  C+   +EI+ +  LGR+L A+ DL
Sbjct: 9   CPICNQRATLKCSGCKQQFYCKKEHQRVDWPRHKLTCQ--AWEIRENSELGRHLLASRDL 66

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
           +PGD I  ESPL  GP +   + +C+GC K      + N RC +C WPAC+  CSGL+D 
Sbjct: 67  NPGDVILSESPLAWGPSIHSDQRLCVGCGKQCK---SANTRCTKCLWPACAVDCSGLTDK 123

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
           + H  EC+ L      + A    + + +L +R LIL R+  K +  ++ +++H   RGPG
Sbjct: 124 NRHDLECSFL------IKAKIIPRCDVLLVIRMLILWRKKSKHWNSIQKLQSHEDSRGPG 177

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK-CLHWICGVIEVNGVDI--GRYTQGLY 251
           T  YEE  +I  +++      LP    D++S K  +  ICG+I+VN ++         +Y
Sbjct: 178 TSAYEETMNIYYHIQ----RLLP----DNSSTKDIVSKICGLIDVNALETVPPEGCVAIY 229

Query: 252 SVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHL 301
              CL+EH+CL N +HS  +  K            + +  H+STMYT+ALW T+ RR HL
Sbjct: 230 ETACLLEHSCLANTRHSFTIDDKGRPRITVKAVCSIQKGDHLSTMYTHALWATRARRSHL 289

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TKYF+C C+RC+DPTELGT+   +KC      Q D +I   NPLD DS+W C SC   
Sbjct: 290 LETKYFSCHCKRCADPTELGTHLGTLKC-----PQDDGFISSKNPLDFDSEWGCDSCPGT 344

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           L A ++     +L E VD ++      K L  +L++L  + HP H  C  V HSLIQL
Sbjct: 345 LTASEMMQFIGKLEEDVDDVMCSAQKDK-LVNLLSRLTKLLHPGHQLCITVSHSLIQL 401



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           +D+ +C+GC K      + N RC +C WPAC+  CSGL+D + H  EC+ L
Sbjct: 86  SDQRLCVGCGKQCK---SANTRCTKCLWPACAVDCSGLTDKNRHDLECSFL 133


>gi|335892814|ref|NP_001229486.1| uncharacterized protein LOC724300 [Apis mellifera]
          Length = 479

 Score =  277 bits (709), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 161/417 (38%), Positives = 236/417 (56%), Gaps = 37/417 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C +C + A  KCS CK+  YC K+HQ+  W  HK  C+   +EI+ +  LGR+L A+ DL
Sbjct: 9   CPICNQRATLKCSGCKQQFYCKKEHQRMDWPRHKLTCQ--AWEIRENSELGRHLLASRDL 66

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
           +PGD I  ESPL+ GP +   + +C+GC K      + N RC +C WPAC+  CSGL+D 
Sbjct: 67  NPGDVILSESPLVWGPSIHSDQRLCVGCGKQCK---SANIRCTKCLWPACAVDCSGLTDK 123

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
           + H  EC+ L      + A    + + +L +R LIL  +  K +  ++ +++H   RGPG
Sbjct: 124 NRHDLECSFL------IKAKIIPRCDVLLVIRMLILWCKKSKYWNSIQKLQSHEDSRGPG 177

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK-CLHWICGVIEVNGVDI--GRYTQGLY 251
           T VYEE  +I  +++      LP    D++S K  +  ICG+I++N ++         +Y
Sbjct: 178 TSVYEETMNIYYHIQ----RLLP----DNSSTKDIVSKICGLIDINALETVPPEGCVAIY 229

Query: 252 SVICLMEHNCLPNAKHSNMM---------QSKLFVFRDTHISTMYTNALWGTQPRREHLA 302
              CL+EH+CL N +HS  +         ++   + +  H+STMYT+ALW T+ RR HL 
Sbjct: 230 ETACLLEHSCLANTRHSFTIDKGRPRITVKALCSIQKGDHLSTMYTHALWATRVRRSHLL 289

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
            TKYF+C C+RC+DPTELGT+   +KC      Q + +ILP NPLD DS+W C SC   L
Sbjct: 290 ETKYFSCHCKRCADPTELGTHLGTLKC-----PQDNGFILPKNPLDFDSEWGCDSCPGTL 344

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            A ++     +L E VD  V  +     L  +L++L  + HP H  C  V HSLIQL
Sbjct: 345 TASEMIQFIGKLEEDVDD-VMCSARKDKLVNLLSRLTTLLHPGHQLCITVSHSLIQL 400



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           +D+ +C+GC K      + N RC +C WPAC+  CSGL+D + H  EC+ L
Sbjct: 86  SDQRLCVGCGKQCK---SANIRCTKCLWPACAVDCSGLTDKNRHDLECSFL 133


>gi|307187820|gb|EFN72772.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 484

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 154/417 (36%), Positives = 232/417 (55%), Gaps = 36/417 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC + A  KC  C+   YC K HQ   W  H+  C  + +EI     LGRYL AT DL
Sbjct: 10  CPVCGKNATLKCGNCRREFYCDKSHQSRDWPRHRSAC--IAWEIGRDADLGRYLLATRDL 67

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            PGD I  E+PL+ GP     + +C+GC K  N ++  + RC  C WPAC + C GLSD 
Sbjct: 68  APGDAILSEAPLVWGPSTHATQRLCVGCGKRCNDDI--DMRCGVCRWPACRSNCEGLSDK 125

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
             H  ECA+L        A    + E +L +R LIL R   K++  L ++++H   RGPG
Sbjct: 126 KRHESECALLAR------ARIIPRCEVLLVIRMLILWRTKSKRWTSLANLQSHEGSRGPG 179

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLYS 252
           T+ ++E+ SI ++L    +     D+L          ICG+I+VN ++      +  +Y 
Sbjct: 180 TDAHDEVTSISQHLGPLLMAFDCQDILPK--------ICGLIDVNALETMPPEGSMAIYE 231

Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
              L+EH+C+ N +HS  +  K            + +  H+STMYT+ALW T+ RREHL 
Sbjct: 232 NASLLEHSCVANTRHSFRIDDKGRPRITVYAVTSIKKGEHLSTMYTHALWSTRIRREHLL 291

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
           +TKYF C C+RC+DPTELGT+   +KC  ++       +LP +PL++++DW+C  C   +
Sbjct: 292 VTKYFACRCKRCADPTELGTHLGTLKCPCKNG-----LMLPNDPLNSNTDWSCNVCPGIV 346

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            A +V  +  +L E+V  ++++     +L ++L++L  + HP H HC +V HSLIQL
Sbjct: 347 TASEVAQLMDRLEEEVTGIMKQATE-HTLSDLLSRLTVLLHPGHQHCISVSHSLIQL 402



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 8/73 (10%)

Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
           V  P+  AT + +C+GC K  N ++  + RC  C WPAC + C GLSD   H  ECA+L 
Sbjct: 80  VWGPSTHAT-QRLCVGCGKRCNDDI--DMRCGVCRWPACRSNCEGLSDKKRHESECALLA 136

Query: 503 LG-----CETLLA 510
                  CE LL 
Sbjct: 137 RARIIPRCEVLLV 149


>gi|195380866|ref|XP_002049182.1| GJ20889 [Drosophila virilis]
 gi|194143979|gb|EDW60375.1| GJ20889 [Drosophila virilis]
          Length = 553

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 172/457 (37%), Positives = 249/457 (54%), Gaps = 43/457 (9%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           +CAVC   A   C+ CK V YC  +HQK+HW  HK  C+  P+ ++    LGRYL AT D
Sbjct: 7   KCAVCDVAASLTCTRCKMVRYCNGEHQKQHWPQHKSCCR--PFRVQQDEQLGRYLVATQD 64

Query: 74  LHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACSA 126
           +     I  E PL+VGPK  L E         C+GC+ P         +C  C WP CSA
Sbjct: 65  IKAKQIIFVEEPLVVGPKWFLTEHEKSASIVPCVGCYTPCRLG---KHQCRNCRWPVCSA 121

Query: 127 RCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQE 180
            C           EC++L LG       +    ++ Y+++A+L L+CL+LQR+ P+++Q 
Sbjct: 122 SCLH------EKLECSVLSLGTAPTAKSDARALHDYYRHDALLVLKCLLLQRQQPERWQA 175

Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGD------VLDDTSAKCLHWIC 233
           L +M++H  +R  GTE+ +E +  IV YL   FL++L          L D   + LH +C
Sbjct: 176 LLEMQSHAEER-LGTELQQEAEQRIVSYLEQRFLQRLKQSKNRECCQLSDYEPELLHRLC 234

Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTH 282
           G+IE N + I    G    GL+   C+MEH C PN       A     +++   + R  H
Sbjct: 235 GIIETNYMVIELATGVELSGLFRQACMMEHACQPNCYFQFDSATQRIAVRAGCDMKRGDH 294

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           +   YTN LWGTQ R+ HL +TK+F+C C RC DPTE G+Y SAM+CL +         L
Sbjct: 295 LKITYTNILWGTQMRQHHLRMTKHFSCRCARCEDPTEYGSYVSAMRCLGDVSKSCVGVQL 354

Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
           P++PL+  + W C +C   L+A  V  V + + E+VD+L+   P    +E +L +L  + 
Sbjct: 355 PLDPLNEQTQWKCDTCPVLLDAAYVTEVQTHMTEEVDQLLAGRPTASQVELLLARLSQLL 414

Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           HP+H+H + +KH+LIQLYG +PG   T+LS   L RK
Sbjct: 415 HPNHFHMFNLKHTLIQLYGHEPGLELTELSDLQLGRK 451


>gi|194864122|ref|XP_001970781.1| GG10833 [Drosophila erecta]
 gi|190662648|gb|EDV59840.1| GG10833 [Drosophila erecta]
          Length = 553

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 171/454 (37%), Positives = 250/454 (55%), Gaps = 40/454 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC   A   C+ CK V YC ++HQK+HW  HK +C+  P+  +    LGRYL+ T D+
Sbjct: 7   CPVCGVAASQACTRCKMVRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQDI 64

Query: 75  HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
             G  +  E PL+VGPK  L++         C+GC    NP      +C RC WP CSA 
Sbjct: 65  AAGQIVFIEEPLVVGPKWYLSDADKEASIVPCVGC---CNPCRLGKHQCRRCRWPVCSAG 121

Query: 128 CSGLSDAHTHAPECAILKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQEL 181
           C   S       EC++L L   +     + + NDY + +A+L L+CL+LQR+SP K+  L
Sbjct: 122 CEHES------LECSVLSLRSGSPARADVRSLNDYFRGDALLVLKCLLLQRQSPTKWSAL 175

Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIE 237
            +M++H  +R  GTE++EE +  +V YL+  FL +L     ++L D   + LH +CG+IE
Sbjct: 176 LEMQSHEDER-KGTELHEEAEQRVVSYLQQRFLCRLKQTNPNLLHDCGPEMLHRLCGIIE 234

Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTM 286
            N + I    G    GL+   C+MEH C PN       +++    R         H+   
Sbjct: 235 TNFMVIELPTGVELSGLFRQACMMEHACQPNCDFQFDNKTQQIAVRAGCDLRKGDHLRIT 294

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YTN LWGTQ R+ HL +TK+F+C C RC DPTE GTY SA+ CL +         L V+P
Sbjct: 295 YTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALACLGDVNQTCGGTHLAVDP 354

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
           LD ++ W C +C   L+   V  + S + E V+ L+   P+   +E +L +L  M HP+H
Sbjct: 355 LDENTQWKCDTCPMLLDGAYVSELQSHMTEHVEGLLGGCPSANQVELLLARLTHMLHPNH 414

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           +H + +KH+LIQLYG + G     LS++ L+RK+
Sbjct: 415 FHTFNLKHTLIQLYGNEAGLELGVLSNTQLKRKL 448


>gi|195023574|ref|XP_001985712.1| GH20950 [Drosophila grimshawi]
 gi|193901712|gb|EDW00579.1| GH20950 [Drosophila grimshawi]
          Length = 557

 Score =  271 bits (693), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 168/455 (36%), Positives = 249/455 (54%), Gaps = 42/455 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC   A   C+ CK V YC  +HQK+HW  HK  CK  P+++++ P LGRYL AT  +
Sbjct: 8   CAVCNVAAPLTCTRCKLVRYCNGEHQKQHWPTHKTCCK--PFKVQNDPQLGRYLVATQTI 65

Query: 75  HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
                I  E PL+VGPK  L +         C+ C+ P   N      C  C WP CS  
Sbjct: 66  QAKQIIFVEEPLVVGPKWFLTDDEKSASIVPCVACYTPCRIN---KHLCRSCRWPVCSIS 122

Query: 128 CSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
           C           EC+IL LG       +    ++ Y+++A+L L+CL+LQR+ P+++  L
Sbjct: 123 CEH------EKLECSILSLGSPPSGKSDARGLHDYYRHDALLVLKCLLLQRQHPERWAAL 176

Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG-----DVLDDTSAKCLHWICGV 235
            +M++H  +R  GTE+++E +  IV YL+  FL+++        +L + SA+ LH +CG+
Sbjct: 177 LEMQSHEEER-LGTELHQEAEQRIVSYLQQRFLQRIKAAKNRESLLSEYSAELLHRLCGI 235

Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTHIS 284
           IE N + I    G    G++   C+MEH C PN       A     +++   + +  H+ 
Sbjct: 236 IETNYMVIELATGVELSGVFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLKKGDHLK 295

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YTN LWGTQ R+ HL +TK+FNC C RC DP+E GTY SAM+CL +          PV
Sbjct: 296 ITYTNILWGTQMRQHHLRMTKHFNCRCSRCEDPSEYGTYVSAMRCLGDVSKSCGGIQQPV 355

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           +PLD  + W C +C   ++A  V  V + + ++V+KL+   P    +E +L +L  + HP
Sbjct: 356 DPLDEQTQWKCDTCPMLVDAAYVSEVQTHMTDEVEKLLAGQPTASQVELLLARLSQLLHP 415

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           +H+H + +KH+LIQLYG +PG     LS   L RK
Sbjct: 416 NHFHMFNLKHTLIQLYGHEPGLELATLSDLQLGRK 450


>gi|383853714|ref|XP_003702367.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 484

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/418 (36%), Positives = 237/418 (56%), Gaps = 37/418 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C +C   A  KCS CK+  YC K HQ++ WK H+  C+   +EI     LGR+L A+ DL
Sbjct: 9   CPICNGHATLKCSGCKQQFYCNKDHQRQDWKRHRSVCQA--WEIHEDSELGRHLLASRDL 66

Query: 75  HPGDRIARESPLIVGPKL-ALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
            PGD +  E+P++ GP L    + +C+GC K      + + RC  C W AC   C GL+D
Sbjct: 67  DPGDVVLSEAPVVWGPALHGDDQRLCVGCGKQCK---SSSTRCNSCGWAACKMDCPGLTD 123

Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGP 193
             TH  EC+ L      +      + + +L +R L+L R+  K++  L+ +++H   RGP
Sbjct: 124 NTTHGLECSYL------IKTAIFPRSDVLLVIRMLLLWRKKSKQWMALEKLQSHDDSRGP 177

Query: 194 GTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLY 251
           GT+ YEE  ++ ++++  FL  +      +  AK    ICG+I+VN ++    R    +Y
Sbjct: 178 GTQAYEEAMNMCEHIQ-RFLPAIGAG--KEIVAK----ICGLIDVNALETAPPRGCVAIY 230

Query: 252 SVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHL 301
           +  CL+EH+C+ N +HS  +  K            + +  H+STMYT+ALW T+ RR HL
Sbjct: 231 NTACLLEHSCVANTRHSFSIDDKGRPKITVKALCSIKKGEHLSTMYTHALWATRARRAHL 290

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TKYF+C C+RC+DPTELG++   ++C +   D G  +ILP +PLD ++DW C SC   
Sbjct: 291 LETKYFSCRCKRCADPTELGSHLGTLRCPH---DNG--FILPKDPLDFETDWACDSCPGV 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           L A +V   T +L E VD+++ + P+  +L ++ ++L  + HP H  C  V HSLIQL
Sbjct: 346 LTASEVMQFTGKLEEDVDEVMYQ-PSKNALIDLYSRLTTLLHPSHQQCITVSHSLIQL 402



 Score = 46.2 bits (108), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           V  P L   D+ +C+GC K      + + RC  C W AC   C GL+D  TH  EC+ L
Sbjct: 79  VWGPALHGDDQRLCVGCGKQCK---SSSTRCNSCGWAACKMDCPGLTDNTTHGLECSYL 134


>gi|194743442|ref|XP_001954209.1| GF16861 [Drosophila ananassae]
 gi|190627246|gb|EDV42770.1| GF16861 [Drosophila ananassae]
          Length = 550

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 168/451 (37%), Positives = 247/451 (54%), Gaps = 37/451 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC   A   C+ CK V YC  +HQK+ W  HK +CK  P+ +     LGR+L AT ++
Sbjct: 7   CPVCGVAATLVCTRCKLVRYCDPEHQKQDWAQHKRRCK--PFRVAEDEQLGRFLVATQNI 64

Query: 75  HPGDRIARESPLIVGPKL--ALAEPI--CLGCHKPLNPNLADNARCPRCFWPACSARCSG 130
                   E PL+VGPK   +  E I  C+GCH P         +C  C WP CSA C+ 
Sbjct: 65  AAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCR---LGKHQCRSCHWPVCSAACAD 121

Query: 131 LSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
                  A EC++L LG       +     + ++ +A+L LRCL+LQR++P K+  L +M
Sbjct: 122 ------EALECSVLSLGPTPGARADNRTLNDYFRGDALLVLRCLLLQRQNPAKWTALLEM 175

Query: 185 EAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNG 240
           ++H  +R  GTE++ E +  +V +L+  FL++L     D+L D  ++ LH ICG+IE N 
Sbjct: 176 QSHEEER-QGTELHVEAEMRVVDFLQQRFLKRLKQTKPDLLADCGSELLHRICGIIETNY 234

Query: 241 VDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTN 289
           + I    G    GL+   C+MEH C PN        ++    R         H+   YTN
Sbjct: 235 MVIELPSGVELSGLFRQACMMEHACQPNCYFQFDAFTQQITVRAGCDLQKGDHLRIAYTN 294

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
            LWGTQ R+ HL +TK+F+C C RC DPTE G+Y SA+ CL +         LPV+PL+ 
Sbjct: 295 ILWGTQLRQHHLRLTKHFSCHCARCLDPTENGSYVSALACLGDVNLCCTGTHLPVDPLNE 354

Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
           ++ W C +C   ++A  V  + + + +QV+ L+   P  K +E +L +L  M HP+H+H 
Sbjct: 355 ETQWKCDTCPMLVDAAYVSELQTHMTDQVESLLAGRPTAKEVELLLARLSQMLHPNHFHM 414

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           + +KH+LIQLYG +PG     LS+  LERK+
Sbjct: 415 FNLKHTLIQLYGNEPGLEMDTLSNGQLERKL 445



 Score = 38.5 bits (88), Expect = 10.0,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 17/81 (20%)

Query: 432 SSSLLERKISYVMSPNL------KATDEPI--CLGCHKPLNPNLADNARCPRCFWPACSA 483
           + ++  ++I++V  P +       + DE I  C+GCH P         +C  C WP CSA
Sbjct: 61  TQNIAAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCR---LGKHQCRSCHWPVCSA 117

Query: 484 RCSGLSDAHTHAPECAILKLG 504
            C+        A EC++L LG
Sbjct: 118 ACAD------EALECSVLSLG 132


>gi|307197684|gb|EFN78851.1| Protein msta, isoform A [Harpegnathos saltator]
          Length = 487

 Score =  268 bits (685), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 157/420 (37%), Positives = 228/420 (54%), Gaps = 41/420 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC + A  KC  C+   YC K HQ   W  HK  C    +EI   P LGR+L A  DL
Sbjct: 13  CPVCGKDATLKCGNCRREFYCDKSHQSRDWPRHKSLC--CAWEIGQDPQLGRHLLAARDL 70

Query: 75  HPGDRIARESPLIVGPKL-ALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
            PGD I  E PL  GP + A  + +C+GC K        +ARC +C WP C   C GL+D
Sbjct: 71  APGDVILSEVPLAWGPSIHARDQRLCVGCGKQCA---EIDARCGKCRWPVCRLDCEGLTD 127

Query: 134 AHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKR 191
              H PECA        LLA+     + + +L +R LIL R   K++  L ++++H   R
Sbjct: 128 KKRHQPECA--------LLAHIRIIPRCDVLLVIRMLILWRTKSKRWTSLTNLQSHEDSR 179

Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQG 249
           GPGTE YEE+ SI +YL        P   +       L  ICG+I+VN ++      +  
Sbjct: 180 GPGTEAYEEVMSISQYLE-------PIVAMVPCCKSVLPKICGLIDVNALETMPPEGSVA 232

Query: 250 LYSVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRRE 299
           +Y  + L+EH+C+ N +HS  +  K           F+ +  H+STMYT+ALW T+ RRE
Sbjct: 233 IYENMSLLEHSCMANTRHSFRLDDKGRPRITVNAITFIKKGEHLSTMYTHALWSTRARRE 292

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
           HL  TKYF+C CERC+DPTELG +  +++C  ++       +L  +PL+ D++W+C +C 
Sbjct: 293 HLLATKYFSCRCERCADPTELGAHLGSLQCPCKNG-----LMLADDPLNPDTEWSCTACP 347

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
             + + +V  ++ +L E+V   V       +L ++L++L  + HP H HC +V HSLIQL
Sbjct: 348 GTVTSAEVAQLSDKLEEEVTG-VMRKATAHTLSDLLSRLTVLLHPGHQHCVSVSHSLIQL 406



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
           ++     P++ A D+ +C+GC K        +ARC +C WP C   C GL+D   H PEC
Sbjct: 79  EVPLAWGPSIHARDQRLCVGCGKQCA---EIDARCGKCRWPVCRLDCEGLTDKKRHQPEC 135

Query: 499 AIL 501
           A+L
Sbjct: 136 ALL 138


>gi|322799089|gb|EFZ20542.1| hypothetical protein SINV_07869 [Solenopsis invicta]
          Length = 491

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 159/428 (37%), Positives = 236/428 (55%), Gaps = 37/428 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC + A  KC  CK   YC K HQ + W  HK  C    +EI   P LG +L AT DL
Sbjct: 10  CPVCGKNATLKCGNCKREFYCDKSHQSQDWPRHKSTCGG--WEIGHDPALGHHLLATRDL 67

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            PGD I  E PL+ GP     + +C+GC K  +     + RC RC WP C   C GLSD 
Sbjct: 68  SPGDVILSEVPLVWGPSTHANQRVCVGCSKRCDDV---DVRCGRCCWPVCRPDCEGLSDK 124

Query: 135 HTHAPECAILKLGCETLLAYNDYKY----EAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  ECA+L   C  ++     +Y    E +L +R LIL R   +++  L ++++H+  
Sbjct: 125 KRHQQECALL--ACARIVPRYLLRYACRCEVLLIIRILILWRTKSRRWTSLANLQSHVES 182

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ 248
           RGPGTE Y+E+ S+ + L    L  +  D  D      L  ICG+++VN ++      + 
Sbjct: 183 RGPGTEAYDEVSSVSQRLGPLLL--MASDCKD-----ILPMICGLVDVNALETAPPEGSM 235

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRR 298
            +Y    L+EH+C+ N +H+  +  K            + R  H+STMYT+ALW T+ RR
Sbjct: 236 AIYENASLLEHSCIANTRHNFRVDDKGRPRITVYAVTSIKRGEHLSTMYTHALWSTRARR 295

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
           EHL  TKYF C C+RC+DPTELGT+   + C  ++       +LP +PL+ ++DW+C +C
Sbjct: 296 EHLLATKYFACRCKRCADPTELGTHLGTLNCPCKNG-----LMLPNDPLNPNADWSCNAC 350

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
           S  + + +V  +T +L E+V ++++E     +L ++L++L  + HP H HC +V HSL+Q
Sbjct: 351 SGTVTSAEVAQLTDRLEEEVTEVMKEATE-HTLSDLLSRLTVLLHPGHQHCISVSHSLMQ 409

Query: 419 LY-GTQPG 425
           L   T PG
Sbjct: 410 LLPATDPG 417



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 12/94 (12%)

Query: 408 HCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNL 467
           H  A+ H L+      PG         ++  ++  V  P+  A ++ +C+GC K  +   
Sbjct: 53  HDPALGHHLLATRDLSPG--------DVILSEVPLVWGPSTHA-NQRVCVGCSKRCDDV- 102

Query: 468 ADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
             + RC RC WP C   C GLSD   H  ECA+L
Sbjct: 103 --DVRCGRCCWPVCRPDCEGLSDKKRHQQECALL 134


>gi|195431128|ref|XP_002063600.1| GK21998 [Drosophila willistoni]
 gi|194159685|gb|EDW74586.1| GK21998 [Drosophila willistoni]
          Length = 557

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 173/463 (37%), Positives = 250/463 (53%), Gaps = 49/463 (10%)

Query: 13  MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
           + CA+C   A   C+ C+ V YC  +HQKEHW  HK KCK  P+       LGRYL AT 
Sbjct: 4   LDCAICGVAASRTCNRCRMVRYCDGEHQKEHWPSHKRKCK--PFREVQDEQLGRYLVATH 61

Query: 73  DLHPGDRIARESPLIVGPKLALAEPI-------CLGCHKP--LNPNLADNARCPRCFWPA 123
           D+     I  E PL+VGPK   +E         C+GC+ P  L  +L     C  C WP 
Sbjct: 62  DIEAKQIIFVEEPLVVGPKWYKSEAEKSSVVVPCVGCYTPCRLGKHL-----CHSCRWPV 116

Query: 124 CSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
           CSA C   S       EC+IL LG       +T   ++ Y+ +A+L L+CL+LQR+ P++
Sbjct: 117 CSAMCQHES------LECSILSLGPGPAARSDTRGLHDYYREDALLVLKCLLLQRQHPER 170

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP--------GDVLDDTSAKC 228
           +  L +M++H  +R  GTE+++E +  IV YL   FL +L           ++ +   + 
Sbjct: 171 WSALLEMQSHEEER-KGTELHQEAEKRIVNYLHQKFLHRLKQATKQSKESRLIAEYDPEL 229

Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFV 277
           LH +CG++E N + I    G    GL+   C+MEH C PN       A     +++   +
Sbjct: 230 LHRLCGIVETNYMVIELPTGVELSGLFRQACMMEHACQPNCYFTFDAATQQIAVRAGCNL 289

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            +  H+   YTN LWGTQ R+ HL +TK+F C C RC DPTE G+Y SAM CL +     
Sbjct: 290 KKGDHLKITYTNILWGTQLRQHHLKMTKHFVCRCSRCLDPTEYGSYVSAMACLGDVNKTC 349

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
               LPVNPLD ++ W C +C   + A  V  + + +  +VDKL+   P+   +E +L +
Sbjct: 350 VGRQLPVNPLDEETQWKCDTCPMVVEAGYVTELQTHMTAEVDKLMSGKPSANEVELLLAR 409

Query: 398 LEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           L  M HP+H+H + +KH+LIQLYG + G   +QL +  LERK+
Sbjct: 410 LSLMLHPNHFHMFNLKHTLIQLYGHEEGLQMSQLGAGQLERKL 452


>gi|195580894|ref|XP_002080269.1| GD10336 [Drosophila simulans]
 gi|194192278|gb|EDX05854.1| GD10336 [Drosophila simulans]
          Length = 531

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 167/438 (38%), Positives = 244/438 (55%), Gaps = 42/438 (9%)

Query: 32  VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
           V YC ++HQK+HW  HK +C+  P+  +    LGRYL+ T D+  G  +  E PL+VGPK
Sbjct: 2   VRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQDIAAGQIVFIEEPLVVGPK 59

Query: 92  LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 143
             L++         C+GC+ P         +C RC WP CSA C        H P EC++
Sbjct: 60  WYLSDADKEASIVPCVGCYTPCR---LGKHQCRRCRWPVCSAGCE-------HEPMECSV 109

Query: 144 LKLGCETLL-----AYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV 197
           L LG  +       + NDY + +A+L L+CL+LQR+SP K+  L +M++H  +R  GTE+
Sbjct: 110 LSLGSGSSTRADTRSLNDYFRGDALLVLKCLLLQRQSPSKWSALLEMQSHEEER-KGTEL 168

Query: 198 YEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQG 249
           +EE +  +V YL+  FL +L      +L D   + LH +CG+IE N + I    G    G
Sbjct: 169 HEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHRLCGIIETNFMVIELPSGVELSG 228

Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLA 302
           L+   C+MEH C PN       +++    R         H+   YTN LWGTQ R+ HL 
Sbjct: 229 LFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLR 288

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
           +TK+F+C C RC DPTE GTY SA+ CL +         LPV+PLD ++ W C +C   +
Sbjct: 289 LTKHFSCRCSRCLDPTEYGTYISALTCLGDVNQTCGGTHLPVDPLDENTQWKCDTCPMLV 348

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
           +   V  + S + EQV+ L+   P+   +E +L +L  M HP+H+H + +KH+LIQLYG 
Sbjct: 349 DGAYVAELQSHMTEQVEGLLAGCPSANQVELLLARLTHMLHPNHFHTFNLKHTLIQLYGN 408

Query: 423 QPGYAYTQLSSSLLERKI 440
           + G     LS++ LERK+
Sbjct: 409 EAGLELGVLSNTQLERKL 426


>gi|195475636|ref|XP_002090090.1| GE19428 [Drosophila yakuba]
 gi|194176191|gb|EDW89802.1| GE19428 [Drosophila yakuba]
          Length = 526

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 161/437 (36%), Positives = 244/437 (55%), Gaps = 40/437 (9%)

Query: 32  VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
           V YC ++HQK+HW  HK +C+  P+  +   +LGRYL+AT ++  G  +  E PL+VGPK
Sbjct: 2   VRYCDREHQKQHWPQHKRRCR--PFSEEQDAVLGRYLKATQEIAAGQIVFIEEPLVVGPK 59

Query: 92  LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 144
              ++         C+GC  P         +C RC WP CSA C   S       EC++L
Sbjct: 60  WYHSDADNEASIVPCVGCCTPCR---LGKHQCRRCCWPVCSAGCEHES------MECSVL 110

Query: 145 KLGCETLL-----AYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
            LG  +       + NDY + +A+L L+CL+LQR+SP K+  L +M++H  +R  GTE++
Sbjct: 111 SLGTGSPTRVDPRSLNDYFRGDALLVLKCLLLQRQSPTKWLALLEMQSHEDER-RGTELH 169

Query: 199 EEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQGL 250
           +E +  +V YL++ FL +L      +L D   + LH +CG+IE N + I    G    GL
Sbjct: 170 DEAEKRVVSYLQNRFLCRLKQTNPSLLHDCEPEMLHRLCGIIETNFMVIELPTGLELSGL 229

Query: 251 YSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
           +   C+MEH C PN         H   +++   + +  H+   YTN LWGTQ R+ HL +
Sbjct: 230 FRQACMMEHACQPNCDFQFDNKTHQVAVRAGCDLRKGDHLQITYTNILWGTQLRQHHLRL 289

Query: 304 TKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLN 363
           TK+F+C C RC DPTE GTY SAM CL +         L V+PLD ++ W C +C   ++
Sbjct: 290 TKHFSCRCSRCLDPTEYGTYISAMACLGDVNQTCGGTHLAVDPLDENTQWKCDTCPILVD 349

Query: 364 ARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQ 423
              V  + S + +QV+ L+   P+   +E +L +L  + HP+H+H + +KH+LIQLYG +
Sbjct: 350 GAYVAELQSHMTKQVEGLLAGCPSANQVELLLARLTHVLHPNHFHTFNLKHTLIQLYGNE 409

Query: 424 PGYAYTQLSSSLLERKI 440
            G     LS++ L RK+
Sbjct: 410 AGLESCVLSNNQLIRKL 426


>gi|332017349|gb|EGI58093.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 481

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 150/417 (35%), Positives = 231/417 (55%), Gaps = 36/417 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC + A  KC +C+   YC K HQ + W  H+  C    +EI    +LGR+L AT DL
Sbjct: 10  CPVCGKNATLKCGSCRREFYCDKSHQSQDWPRHRSTCSG--WEIGRDSVLGRHLLATRDL 67

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            PGD I  E+PL+ GP     + +C+GC +  +     +ARC  C WP C + C GLSD 
Sbjct: 68  APGDVILTETPLVWGPSTHTNQRVCVGCSERCDDI---DARCSECCWPVCRSDCEGLSDK 124

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
             H  ECA+L        A    + E +L +R LIL R   +++  L +++ H+  RGPG
Sbjct: 125 KRHQLECALLAR------ARIIPRCEVLLIIRLLILWRMKSRRWTSLANLQNHVESRGPG 178

Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLYS 252
           TE ++E+ S+ + L    L  +  D  D      L  ICG+I+VN ++      +  +Y 
Sbjct: 179 TEAHDEVSSVSQRLGPLLL--MASDCKD-----VLPMICGLIDVNALETAPPEGSVAIYE 231

Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
              L+EH+C+ N +HS  +  K            + +  H+STMYT+ALW T+ RREHL 
Sbjct: 232 HASLLEHSCIANTRHSFRIDDKGRPRITIYAVTSIKKGQHLSTMYTHALWSTRARREHLL 291

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
            TKYF C C+RC+DPTELGT+   + C  ++       +LP +PL+ +++W+C +C   +
Sbjct: 292 TTKYFACRCKRCADPTELGTHLGTLNCPCKNG-----LMLPNDPLNPNTNWSCDACPGTI 346

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            + +V  +  +L E+V +++++     +L E+L++L  + HP H HC +V HSLIQL
Sbjct: 347 TSAEVAELIDRLEEEVVEVMKQATEC-TLSELLSRLTVLLHPGHQHCISVSHSLIQL 402



 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 3/51 (5%)

Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           T++ +C+GC +  +     +ARC  C WP C + C GLSD   H  ECA+L
Sbjct: 87  TNQRVCVGCSERCDDI---DARCSECCWPVCRSDCEGLSDKKRHQLECALL 134


>gi|242007166|ref|XP_002424413.1| protein msta, isoform A, putative [Pediculus humanus corporis]
 gi|212507813|gb|EEB11675.1| protein msta, isoform A, putative [Pediculus humanus corporis]
          Length = 540

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 155/464 (33%), Positives = 245/464 (52%), Gaps = 53/464 (11%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC + +   C  CK + YC K+HQK HW      C +  Y+++ +  LGRYL A+ ++
Sbjct: 5   CAVCGQPSSSICGGCKRLFYCSKEHQKLHWMKEHKGCCRAVYKLEQNDRLGRYLIASENI 64

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
             GD I  ESPLIVGPK  +  P+CLGCH     N+ +   C  C +P CS +C  +SD 
Sbjct: 65  SKGDVIFYESPLIVGPK-TVTVPVCLGCHNVC--NMDNYHECDGCGFPLCSPQCQ-VSD- 119

Query: 135 HTHAPEC-AILKLGCETLL-------------AYNDYKYEAILPLRCLILQ-RRSPKKYQ 179
             H  EC  ++K      +                +  Y  I+PLRC++L+ +++ +KY+
Sbjct: 120 -NHVDECYQMMKSNYRIKMDSKKKKNNDDNNGGGKEIFYFPIVPLRCILLKGKKNSEKYK 178

Query: 180 ELKDMEAHMSKR--GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
           ++ ++E+H   R   P  ++Y++  ++  ++R    E+L    +DD   K +  I G+++
Sbjct: 179 KIMNLESHYDDRINKPLFDLYKK--NVADFIR----ERLK---MDDVDEKEILTITGILD 229

Query: 238 VNGVDIGRYT---QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------ISTMY 287
            N  +I +     +GLY+V  ++ H+C PN KHS +      V   T        I+  Y
Sbjct: 230 TNAFEIVKNNVKLRGLYNVSSMLSHDCKPNTKHSFLQPDMTIVVTATRDIKSGELITATY 289

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC-----LNEHKDQGDCWIL 342
           T   W T  RREHL   K F+C CERCSDP+ELGTY  A++C      ++     +C+++
Sbjct: 290 TQTFWNTLSRREHLKTIKCFDCVCERCSDPSELGTYAGAIRCSKCTMTSDKNSVENCYVI 349

Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL---- 398
           PV+PL+++S W C +C   ++ + + +    L  ++ K       V  LEE L K     
Sbjct: 350 PVDPLNSNSVWKCMNCQREMSHKQIKMGNDNLKNELTK--SNVKTVTGLEEFLIKYGVYE 407

Query: 399 EAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
             + H ++YH   VKH+LI+L G   GY Y+ +S  LL+RKI Y
Sbjct: 408 NGVLHSNNYHVVQVKHALIRLIGNTDGYTYSDISDDLLDRKIIY 451


>gi|195171611|ref|XP_002026597.1| GL11805 [Drosophila persimilis]
 gi|194111523|gb|EDW33566.1| GL11805 [Drosophila persimilis]
          Length = 528

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 159/424 (37%), Positives = 230/424 (54%), Gaps = 44/424 (10%)

Query: 47  HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-------C 99
           HK  CK  P   +   +LGR+  AT D+     I  E PL+VGPK  ++E         C
Sbjct: 17  HKRCCK--PIREEKDDVLGRFWVATQDIKANQIIFVEEPLVVGPKWYMSEAEKSSTTVPC 74

Query: 100 LGCHKP--LNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG------CETL 151
           +GC+ P  L  +L     C  C WP CSA C         A EC++L LG       +T 
Sbjct: 75  VGCYTPCRLGKHL-----CRNCRWPVCSASCE------HEALECSVLSLGPGPSARADTR 123

Query: 152 LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRS 210
              + Y+ +A+L L+CL+LQR+ PKK+ +L +M++H  +R  GTE+++E  D IV YL  
Sbjct: 124 GLNDFYRGDALLVLKCLLLQRKDPKKWTDLLEMQSHEEER-KGTELHQEAEDGIVSYLSQ 182

Query: 211 NFLEKL---PGDVLDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLP 263
            FL++L     DVL+    + LH +CG+IE N + +    G    GL+   C+MEH C P
Sbjct: 183 RFLQRLEQSKKDVLEHCEPELLHRLCGIIETNYMVVELPTGIELSGLFRQACMMEHACQP 242

Query: 264 N---AKHSNMMQSKLFVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
           N         MQ ++    D     H+   YTN LWGT  R+ HL +TK+F C CERC D
Sbjct: 243 NCYFQFDGTTMQIEVRAGGDLKKGDHLRITYTNILWGTHLRQHHLRLTKHFKCGCERCLD 302

Query: 317 PTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGE 376
           PTE G+Y SA+ CL +         LP++P+D  S W C +C   ++   V  + + + E
Sbjct: 303 PTEFGSYVSALTCLGDVNKSCGGKQLPLDPVDEQSQWKCDACPMAMDTAYVTELQTHMTE 362

Query: 377 QVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLL 436
           QVDK++   P+   +E  L +L  M HP+H+H + +KH+LIQLYG + G   + LS  +L
Sbjct: 363 QVDKMLAGRPSASEIELTLARLTQMLHPNHFHMFNLKHTLIQLYGNENGLELSSLSDGIL 422

Query: 437 ERKI 440
           ERK+
Sbjct: 423 ERKL 426


>gi|195124734|ref|XP_002006842.1| GI21287 [Drosophila mojavensis]
 gi|193911910|gb|EDW10777.1| GI21287 [Drosophila mojavensis]
          Length = 551

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 162/457 (35%), Positives = 241/457 (52%), Gaps = 46/457 (10%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC   A   C+ CK V YC  +HQ++ W  HK  C+  P+  +    LGRYL AT ++
Sbjct: 8   CAVCHVAARLTCTRCKVVRYCNGEHQRQDWPQHKSCCR--PFREQQDEQLGRYLVATHNV 65

Query: 75  HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
                I  E PL+VGPK  L +         C+GC+ P      D   C  C WP CS  
Sbjct: 66  KAKQIIFVEDPLVVGPKWFLTDEEKSASIVPCVGCYTPCR---VDKHHCRNCRWPVCSVG 122

Query: 128 CSGLSDAHTHAPECAILKLG--------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
           C           EC++L LG           L+ Y  Y+++A+L L+CL+LQ++ P +++
Sbjct: 123 CEH------EKLECSVLSLGNAPKTKTDVRALVDY--YRHDALLVLKCLLLQQQHPDRWR 174

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP-----GDVLDDTSAKCLHWIC 233
            L +M++H  +R  GTE+++E +  +V YL+  FL+++        +  D   + LH +C
Sbjct: 175 ALLEMQSHEEER-LGTELHQEAEQRVVGYLQQRFLQRIKQSKQRDSMPSDYEPQLLHRLC 233

Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTH 282
           G+IE N + I    G    GL+   C+MEH C PN       A     +++   + +  H
Sbjct: 234 GIIETNYMVIELATGIELSGLFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLKKGDH 293

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           +   YTN LWGTQ R  HL +TK+F+C C RC DPTE G+Y SAM+CL +         L
Sbjct: 294 LKITYTNILWGTQMRHHHLLMTKHFSCRCARCEDPTEYGSYVSAMRCLGDVSKTCVGVQL 353

Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
           PV+PLD  + W C +C   ++A  V  V + + EQV+ L+   P    +E +L +L  + 
Sbjct: 354 PVDPLDEKTQWKCDTCPMVVDAAYVTEVQTHMTEQVESLLAGRPTASQVELLLARLSQLL 413

Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           H +H+H + +KH+L+QLYG +PG     LS   L RK
Sbjct: 414 HTNHFHMFNLKHTLVQLYGHEPGLELATLSDLQLGRK 450


>gi|195440652|ref|XP_002068154.1| GK12548 [Drosophila willistoni]
 gi|194164239|gb|EDW79140.1| GK12548 [Drosophila willistoni]
          Length = 534

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 153/457 (33%), Positives = 249/457 (54%), Gaps = 50/457 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA C+  A   C+ACK V YC ++HQKEHWK  HK +CK   YE+ S+ +LGR+L+A
Sbjct: 5   LTNCAFCQARASQICAACKSVVYCSREHQKEHWKKGHKRECK--CYEVASNDMLGRHLRA 62

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLN-PNLADNA-RCPRCFWPACSARC 128
           T D+  G++I RE+PL+VGPK+A A PICLGCH+ +  P  + N  +C  C WP C   C
Sbjct: 63  TRDIKIGEQILREAPLVVGPKVASA-PICLGCHRNIGAPGKSMNYYKCSSCTWPLCGKEC 121

Query: 129 SGLSDAHTHAPECAIL-KLGCETLLAYN----------DYKYEAILPLRCLILQRRSPKK 177
                +  H  EC ++ +   ++ + Y+          +  Y  I+ LRC++L+   P+ 
Sbjct: 122 E---QSPHHRGECQLMAQSNFQSKINYHPDQDQNQERKESAYCVIMLLRCMLLKTSQPED 178

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICG 234
           +  L  +E H+ +R   T +Y+        LR+N   F++ + G  L + S   +  I  
Sbjct: 179 FVRLFSLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--LREWSEMEILRIAA 228

Query: 235 VIEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH---SNM---MQSKLFVFRDTHI 283
           +++ N  ++      R  + LY    ++ H+C+PN +H    NM     +K  + +   +
Sbjct: 229 ILDTNTFEVRQPKERRKVRALYPGAAMISHDCVPNMRHRFDDNMNIIFLAKRAISKGEIL 288

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
           +  YT  L  T  RR HL   K F+C+C+RCSDP+ELGT+ +A  CL     +    I+P
Sbjct: 289 NISYTQPLRSTIQRRLHLRQAKCFDCACQRCSDPSELGTFAAAQMCLKCKAGK----IIP 344

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
           +NPL N S W C  C+ + +A+++  + +QL  +++ L +  P   +LE+ + +     H
Sbjct: 345 LNPLQNSSPWKCQLCNVKKSAKEIITIDNQLQLEIESLDKSTPT--ALEDFIYRHRIDLH 402

Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             + H   VK++L QLYG  PG++  +L+   L+RK+
Sbjct: 403 ETNTHVLQVKYALTQLYGNAPGFSMDELTEDNLKRKV 439


>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
          Length = 524

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 144/441 (32%), Positives = 230/441 (52%), Gaps = 43/441 (9%)

Query: 18  CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPG 77
           C + A  +CS C+ V YC + HQK  WK H+  CK   Y I+ +  +GR+L A  DL  G
Sbjct: 8   CGKEAHQRCSGCQAVFYCSRDHQKSDWKKHRSHCK--AYRIEQTLTVGRHLIACRDLKAG 65

Query: 78  DRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH 137
           D I +E P+++GPK   A  ICLGC+  ++       RC +C WP C     G  D H H
Sbjct: 66  DVILQEKPIVMGPKHT-AGQICLGCYSGVD----GRTRCSQCGWPMC-----GRDDCHAH 115

Query: 138 ----APECAILKLGCETLLAYNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMS-KR 191
               A EC ++  G   ++     + Y++++ LRCL L+ ++P+++ EL  +EAH+  +R
Sbjct: 116 ESDHAAECGVMASGGRPIVGSLPVQAYQSVMVLRCLALRDQNPERWDELMQLEAHVQERR 175

Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD----IGRYT 247
             G E  ++  + V+++R     ++P +++          +CG++ VN  +     G   
Sbjct: 176 QKGMEDVDQA-TAVRFIRETLGLQIPEELILQ--------LCGILMVNSFEQPPMKGNSQ 226

Query: 248 QGL---YSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRR 298
            GL   YS   L+EH+C+ NA      K   ++++ + + +   I+  YT  LWGT  R+
Sbjct: 227 HGLVAVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQ 286

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
            HL+ TK+F C CERC DPTEL T+ S   C  +   Q    +L  NPLD+ SDW C  C
Sbjct: 287 RHLSQTKFFQCVCERCKDPTELDTFTSGFYC-QKCPQQSAGILLTENPLDDASDWICRQC 345

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
            AR  A     +   +G+++  L  +  +V+  E  L K   + HP+H++   VK +L Q
Sbjct: 346 GARQPANYAAEIIESVGKEIVAL--KKGSVRDCEGFLRKFSKVLHPNHFYLTDVKMALCQ 403

Query: 419 LYGTQPGYAYTQLSSSLLERK 439
           +YG   G +  +++   L  K
Sbjct: 404 MYGHLEGQSLIEMNDDTLNAK 424


>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
 gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
          Length = 529

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 156/456 (34%), Positives = 242/456 (53%), Gaps = 56/456 (12%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
           CA C++ A   C+ C+ V YC ++HQKEHWK  HK +CK   YE+ S+ LLGR+L+AT D
Sbjct: 7   CAYCQQRATQLCAGCRSVVYCSREHQKEHWKRGHKRECK--CYEVTSNELLGRHLRATRD 64

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSARCS 129
           +  G++I  E+PL+VGPK+A A P+CLGCH+    P  P +    +C  C WP CS  C 
Sbjct: 65  IRMGEQIMCEAPLVVGPKVA-ATPLCLGCHRNLLAPQKPAMT-FYKCSSCSWPMCSKECE 122

Query: 130 GLSDAHTHAPECAILKLG-CETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQEL 181
               +  H  EC ++     ++ + YN       +  Y  I+ LRC+ L+R +P+ +  L
Sbjct: 123 ---KSPFHLDECRLMAASNFQSKINYNPADPEGKESAYCVIMLLRCMQLKRSNPEAFARL 179

Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEV 238
             +E H+ +R   T +Y+        LR+N   F++ + G    D     +  I  +++ 
Sbjct: 180 AALEDHLKERIE-TPLYQ-------VLRANLITFIKTVLGQ--RDWPELEILRIAAILDT 229

Query: 239 NGVDIG-----RYTQGLYSVICLMEHNCLPNAKH---SNM---MQSKLFVFRDTHISTMY 287
           N  ++      R  + ++    ++ H+C+PN +H    NM     +K  + +   +S  Y
Sbjct: 230 NAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFLAKRPIPKGAILSISY 289

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV--- 344
           T  L  T  RR HL   K F+C+CERCSDPTELGT+  A  C       G C +  V   
Sbjct: 290 TQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTFAGAHLC-------GKCKVGKVISQ 342

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL+N ++W C  C+ + +AR+V    ++L ++++ L +  P    LEE +       H 
Sbjct: 343 NPLENAANWHCQLCNVKKSAREVLTQDARLQQEIESLDKTTP--VHLEEFIQGHRIDLHD 400

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYGT PGYA  +LS   L+RK+
Sbjct: 401 TNTHILQAKYALTQLYGTAPGYAMDELSEESLKRKV 436


>gi|194873875|ref|XP_001973295.1| GG13432 [Drosophila erecta]
 gi|190655078|gb|EDV52321.1| GG13432 [Drosophila erecta]
          Length = 530

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 149/456 (32%), Positives = 242/456 (53%), Gaps = 50/456 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +CK   +EI ++ +LGR+L A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRAECK--CFEIATNEVLGRHLTA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
           T D+  G++I +ESPL++GPK+A A P+CLGCH+    P NP   +  +C  C WP C  
Sbjct: 64  TRDIKIGEQILKESPLVLGPKVASA-PLCLGCHRNLLAPENPR-GNYHKCSSCSWPLCGK 121

Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
            C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+ + P+ +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMQLKAKDPEAF 178

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
             L  +E H+ +R   T +Y+        LR+N   F++ + G  + D     +  I  +
Sbjct: 179 LRLDTLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWLEMDILRIAAI 228

Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
           ++ N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRRIAKGEILS 288

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL N + W C  CS + +A++V    ++L ++++ L +  P   +LEE + +  A  H 
Sbjct: 345 NPLLNSAPWKCQLCSYKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYG+ PG+A  +LS   L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438



 Score = 38.5 bits (88), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 10/63 (15%)

Query: 443 VMSPNLKATDEPICLGCHK----PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
           V+ P  K    P+CLGCH+    P NP   +  +C  C WP C   C    D+  H  EC
Sbjct: 80  VLGP--KVASAPLCLGCHRNLLAPENPR-GNYHKCSSCSWPLCGKEC---EDSVHHKAEC 133

Query: 499 AIL 501
            ++
Sbjct: 134 QLM 136


>gi|195353810|ref|XP_002043396.1| GM16483 [Drosophila sechellia]
 gi|194127519|gb|EDW49562.1| GM16483 [Drosophila sechellia]
          Length = 486

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 156/438 (35%), Positives = 225/438 (51%), Gaps = 87/438 (19%)

Query: 32  VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
           V YC ++HQK+HW  HK +C+  P+  +    LGRYL+ T D+  G  +  E PL+VGPK
Sbjct: 2   VRYCDREHQKQHWPQHKRRCR--PFREEQDAELGRYLKVTQDIAAGQIVFIEEPLVVGPK 59

Query: 92  LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 143
             L++         C+GC+ P   +     +C RC WP CSA C        H P EC++
Sbjct: 60  WYLSDADKEASIVPCVGCYAP---SRLGKHQCRRCRWPVCSAGCE-------HEPMECSV 109

Query: 144 LKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV 197
           L LG  +       + NDY + +A+L L+CL+LQR+SP K+  L +M++H  +R  GTE+
Sbjct: 110 LSLGSGSPTRADTRSLNDYFRGDALLVLKCLLLQRQSPTKWSALLEMQSHEEER-KGTEL 168

Query: 198 YEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQG 249
           +EE +  +V YL+  FL +L      +L D   + LH +CG+IE N + I    G    G
Sbjct: 169 HEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHRLCGIIETNFMVIELPSGVELSG 228

Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLA 302
           L+   C+MEH C PN       +++    R         H+   YTN LWGTQ R+ HL 
Sbjct: 229 LFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLR 288

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
           +TK+F+C C RC DPTE GTY SA+ CL      GD                        
Sbjct: 289 LTKHFSCHCNRCLDPTEYGTYISALTCL------GD------------------------ 318

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
                          V+ L+   P+   +E +L +L  M HP+H+H + +KH+LIQLYG 
Sbjct: 319 ---------------VEGLLAGCPSANQVELLLARLTHMLHPNHFHTFNLKHTLIQLYGN 363

Query: 423 QPGYAYTQLSSSLLERKI 440
           + G     LS++ LERK+
Sbjct: 364 EAGLELGVLSNTQLERKL 381


>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
 gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
          Length = 532

 Score =  228 bits (580), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 144/455 (31%), Positives = 237/455 (52%), Gaps = 48/455 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  HK +CK   YE+ ++  LGR+L+A
Sbjct: 5   LTSCALCQAKASQLCAACRSVVYCSREHQKEHWKQGHKRECK--CYEVATNETLGRHLRA 62

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
           T D+  G+ I RE+PL++GPK+A A PICLGCH+ L        +  +C  C WP C   
Sbjct: 63  TRDVKIGEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCSWPLCGRE 121

Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
           C    ++  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+   P+ + 
Sbjct: 122 CE---ESSHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQLKASDPEGFA 178

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
            L  +E H+ +R   T +Y+        LR+N   F++ + G  + D S   +  I  ++
Sbjct: 179 RLSALEDHLEER-LATPLYQ-------VLRANLITFIKTILG--MRDWSEVDILRIAAIL 228

Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
           + N  ++      R  + L+    ++ H+C PN +H      + +  +K  + +   ++ 
Sbjct: 229 DTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILTI 288

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YT  L  T  RR HL   K F+C+C+RC DPTELGT+  A  C+     +    I+ VN
Sbjct: 289 SYTQPLRSTIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTCVKCKAGK----IISVN 344

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
           PL N ++W C  C+ + +A++V L  ++L ++++ L +  P     E+ + +     H  
Sbjct: 345 PLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTTP--VDFEDFIYRHRVELHET 402

Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           + H    K++L QLYG  PG+   +LS   L RK+
Sbjct: 403 NTHVLQAKYALTQLYGNAPGFTMDELSEESLSRKV 437


>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
          Length = 544

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 153/463 (33%), Positives = 230/463 (49%), Gaps = 50/463 (10%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC      KCS C+ V YC + HQK HWKLHK  C  +P +IK  P +GRYL+AT  +
Sbjct: 7   CEVCLTPTEQKCSGCQIVHYCSRDHQKRHWKLHKLHC--VPAKIKELPGIGRYLEATRTI 64

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
             GD I +E+PLI GP   +  P+CLGC+  L        RC +C WP CS +C+  S  
Sbjct: 65  KAGDVIMKEAPLITGPA-QITPPVCLGCYVLLEE--GKTTRCDKCGWPFCSEKCTHSS-- 119

Query: 135 HTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
             H PEC   +   E + + + Y      Y+ +  LRCL  +  + K + +L+ +E H  
Sbjct: 120 -VHEPECFYTQQRGEKV-SISTYGIPHPNYQCVTVLRCLYQRDHNMKLWNKLQALETHTE 177

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
            R  GT+ +E    +V     +F  KL     ++   KC    CG+I++NG ++      
Sbjct: 178 DR-KGTDKWENDRKMVAQFIWSFF-KLERLFNEEEIMKC----CGIIQINGHEVPLLEPE 231

Query: 249 --GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
              ++  I ++EHNC  N   S       ++ + + + R +HIS  YT+ LWGT+ RR H
Sbjct: 232 YVAVFDKISMVEHNCRANCNKSFTSNGEIILSAGVAIPRGSHISVCYTDPLWGTEARRHH 291

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ---GDCWILPV-----------NP 346
           L+ +K+F CSCERCSD TE GT FSA+KC  ++       D +ILP+             
Sbjct: 292 LSDSKFFECSCERCSDVTEFGTMFSAVKCKKKNCKGYLLPDTFILPILHKTISPDPATRG 351

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
           LDN   W C  C   ++   +  +   +G ++  + + NP+  + E  +       HP H
Sbjct: 352 LDNKF-WKCKECKDEVSDEIIQQLLQDIGRELSIMEKGNPD--ACERFIEHCATYLHPSH 408

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLE----RKISYVMS 445
           Y+      +L QL G   G A       LL+    RKI+ +++
Sbjct: 409 YYMIDAGLALAQLVGQDTGLAVVSDDRLLLKTQLCRKITELLN 451



 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/94 (27%), Positives = 38/94 (40%), Gaps = 7/94 (7%)

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLN 464
           H  HC   K   +   G       T  +  ++ ++   +  P       P+CLGC+  L 
Sbjct: 39  HKLHCVPAKIKELPGIGRYLEATRTIKAGDVIMKEAPLITGP--AQITPPVCLGCYVLLE 96

Query: 465 PNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
                  RC +C WP CS +C+  S    H PEC
Sbjct: 97  E--GKTTRCDKCGWPFCSEKCTHSS---VHEPEC 125


>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
 gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 145/455 (31%), Positives = 236/455 (51%), Gaps = 48/455 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  HK +CK   YE+ ++  LGR+L+A
Sbjct: 5   LTSCALCQAKASQLCAACRSVVYCSREHQKEHWKQGHKRECK--CYEVATNETLGRHLRA 62

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
           T D+  G+ I RE+PL++GPK+A A PICLGCH+ L        +  +C  C WP C   
Sbjct: 63  TRDVKIGEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCSWPLCGRE 121

Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
           C    ++  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+   P+ + 
Sbjct: 122 CE---ESPHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQLKSSDPEGFA 178

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
            L  +E H+ +R   T +Y+        LR+N   F++ + G  + D S   +  I  ++
Sbjct: 179 RLSALEDHLEER-LATPLYQ-------VLRANLITFIKTILG--MRDWSEVDILRIAAIL 228

Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
           + N  ++      R  + L+    ++ H+C PN +H      + +  +K  + +   +S 
Sbjct: 229 DTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILSI 288

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YT  L  T  RR HL   K F+C+C RC DPTELGT+  A  C+     +    I+ VN
Sbjct: 289 SYTQPLRSTIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTCVKCKAGK----IISVN 344

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
           PL N ++W C  C+ + +A++V L  ++L ++++ L +  P     E+ + +     H  
Sbjct: 345 PLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTTP--VDFEDFIYRHRVELHET 402

Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           + H    K++L QLYG  PG+   +LS   L RK+
Sbjct: 403 NTHVLQAKYALTQLYGNAPGFTMDELSEESLSRKV 437


>gi|332373498|gb|AEE61890.1| unknown [Dendroctonus ponderosae]
          Length = 359

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 109/232 (46%), Positives = 153/232 (65%), Gaps = 12/232 (5%)

Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNAKH---------S 268
           VL D S + +H ICG+++VN ++I +  +   LY    LMEHNCL N KH         +
Sbjct: 12  VLKDISKEIIHQICGIVDVNALEINQDAEVTALYPTAFLMEHNCLSNTKHMFDNADKGYT 71

Query: 269 NMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
             +++ L + +  HISTMYT+ALWGTQ RREHL  TKYF+C C RC+DPTELGTY SA+K
Sbjct: 72  ITIRASLPIAKGDHISTMYTHALWGTQARREHLRETKYFSCKCLRCADPTELGTYLSALK 131

Query: 329 CLNEHKDQGDCWI-LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
           C+    D+    I LP++PLD++++W C  C A+L   +V  + +Q+GE+VD +   NP 
Sbjct: 132 CIGTQDDEPCGGIQLPIDPLDDNTEWACNKCDAKLTNHEVCFLVNQIGEEVDNVQLSNPT 191

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           V+ L+ +LTK+    HP+HYH Y+VKHSL+QLYG Q GY   QLS  ++ +K
Sbjct: 192 VRELDSILTKMLNFVHPNHYHVYSVKHSLVQLYGYQQGYMPNQLSDDIIAKK 243


>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
          Length = 528

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 140/407 (34%), Positives = 219/407 (53%), Gaps = 39/407 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +E+K+S ++GRY+ +  +L PGD I  E+PL++GP     +  CLGC+K L  +     +
Sbjct: 39  FEVKTSKVMGRYMVSRKNLKPGDVILSEAPLVIGPCTG-CKVQCLGCYKTLESSFC--TK 95

Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYNDYK-----YEAILPLRCL 168
           C  C WP CS +C GL     H+  ECA+LK       L Y++++     + AI+PLRCL
Sbjct: 96  CT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCL 154

Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAK 227
           +L+    + YQ L  ME H        E+   I  + +  +   ++K+  D  L + S +
Sbjct: 155 LLKETDARSYQHLMTMETH-------NEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQE 207

Query: 228 CLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT--- 281
            +H +CG++EVN  +IG+     +GLY    LM H+C+PN  H +   +     R +   
Sbjct: 208 EIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEESTFRLTVRASTRI 267

Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                I+  Y   L  T  RREHL   K+F C C RCSDPTELGT+ SA+ C        
Sbjct: 268 EPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTSALIC----PKCK 323

Query: 338 DCWILPVNPLDNDSDWTCGSCSA----RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           D ++L  NPLD ++DWTC + +     +++A+ + L+  ++ ++VD +  +  +V ++E 
Sbjct: 324 DGFVLSSNPLDPEADWTCSNYNKCPGYKISAKSMQLLLDKITQEVDNI--DCNDVITMET 381

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            L K   + HP HY   +VK SL QLYG   G+   +LS  +L RK+
Sbjct: 382 FLKKYRNVLHPTHYINLSVKLSLSQLYGRINGFLIDELSDEVLTRKL 428



 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYND 513
           CLGC+K L  +     +C  C WP CS +C GL     H+  ECA+LK       L Y++
Sbjct: 81  CLGCYKTLESSFC--TKCT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDN 137

Query: 514 YKSL 517
           ++ +
Sbjct: 138 FEEM 141


>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
          Length = 529

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 140/407 (34%), Positives = 219/407 (53%), Gaps = 39/407 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +E+K+S ++GRY+ +  +L PGD I  E+PL++GP     +  CLGC+K L  +     +
Sbjct: 40  FEVKTSKVMGRYMVSRKNLKPGDVILSEAPLVIGPCTG-CKVQCLGCYKTLESSFC--TK 96

Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYNDYK-----YEAILPLRCL 168
           C  C WP CS +C GL     H+  ECA+LK       L Y++++     + AI+PLRCL
Sbjct: 97  CT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCL 155

Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAK 227
           +L+    + YQ L  ME H        E+   I  + +  +   ++K+  D  L + S +
Sbjct: 156 LLKETDARSYQHLMTMETH-------NEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQE 208

Query: 228 CLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT--- 281
            +H +CG++EVN  +IG+     +GLY    LM H+C+PN  H +   +     R +   
Sbjct: 209 EIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEESTFRLTVRASTRI 268

Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                I+  Y   L  T  RREHL   K+F C C RCSDPTELGT+ SA+ C        
Sbjct: 269 EPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTSALIC----PKCK 324

Query: 338 DCWILPVNPLDNDSDWTCGSCSA----RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           D ++L  NPLD ++DWTC + +     +++A+ + L+  ++ ++VD +  +  +V ++E 
Sbjct: 325 DGFVLSSNPLDPEADWTCSNYNKCPGYKISAKSMQLLLDKITQEVDNI--DCNDVITMET 382

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            L K   + HP HY   +VK SL QLYG   G+   +LS  +L RK+
Sbjct: 383 FLKKYRNVLHPTHYINLSVKLSLSQLYGRINGFLIDELSDEVLTRKL 429



 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 5/64 (7%)

Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYND 513
           CLGC+K L  +     +C  C WP CS +C GL     H+  ECA+LK       L Y++
Sbjct: 82  CLGCYKTLESSFC--TKCT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDN 138

Query: 514 YKSL 517
           ++ +
Sbjct: 139 FEEM 142


>gi|242397545|gb|ACS92862.1| MIP11718p [Drosophila melanogaster]
          Length = 514

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 146/456 (32%), Positives = 243/456 (53%), Gaps = 50/456 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +C+   +EI ++ +LGR+L+A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
           T D+  G++I +E+PL++GPK+A A P+CLGCH+    P  P   +  +C  C WP C  
Sbjct: 64  TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPR-GNYHKCSSCSWPLCGK 121

Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
            C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+ + P  +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDKDPDAF 178

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
            +L ++E H+ +R   T +Y+        LR+N   F++ + G  + D     +  I  +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228

Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
           ++ N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNILFLAKRKIAKGEILS 288

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL N + W C  C+ + +A+DV    ++L ++++ L +  P   +LEE + +  A  H 
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKDVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYG+ PG+A  +LS   L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSGESLNRKL 438


>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
 gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
          Length = 530

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 146/456 (32%), Positives = 243/456 (53%), Gaps = 50/456 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +C+   +EI ++ +LGR+L+A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
           T D+  G++I +E+PL++GPK+A A P+CLGCH+    P  P   +  +C  C WP C  
Sbjct: 64  TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPR-GNYHKCSSCSWPLCGK 121

Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
            C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+ + P  +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDKDPDAF 178

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
            +L ++E H+ +R   T +Y+        LR+N   F++ + G  + D     +  I  +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228

Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
           ++ N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL N + W C  C+ + +A+DV    ++L ++++ L +  P   +LEE + +  A  H 
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKDVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYG+ PG+A  +LS   L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSGESLNRKL 438


>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
          Length = 458

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 155/450 (34%), Positives = 226/450 (50%), Gaps = 51/450 (11%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CA C+  + + CS C+ V YC ++HQK  W  HK  C   P++IK S   GRYL A+  +
Sbjct: 1   CAKCQLPSTNCCSKCRRVYYCSREHQKADWTNHKTLCGP-PFKIKESEAEGRYLVASRLI 59

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
             G+ I +E PL+VGPKL    P+CLGC+K    ++ D  RC RC WP CSA C   S  
Sbjct: 60  KAGEVILQELPLVVGPKLNTL-PLCLGCYK----SITDTYRCSRCNWPLCSAACEE-SAL 113

Query: 135 HTHAPECAILKLGCETLL-------AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH 187
           H +  EC ++     TL+       A N   ++ I PLR L L        + L ++ +H
Sbjct: 114 HKNG-ECRMID---PTLMTNHLSQGAINSQVFQCITPLRYLTL---PDSDRERLDELVSH 166

Query: 188 MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--- 243
           + +R  G ++Y  ++ +I  +LR   L       L    ++ +  +CG++E N  +I   
Sbjct: 167 LEQRR-GMDIYRLVEQNISSFLRYRLL-------LTQYDSESIQRVCGILETNCFEIRIQ 218

Query: 244 GRYT-QGLYSVICLMEHNCLPNAKHS---NMMQSKLFVFRDT----HISTMYTNALWGTQ 295
           GR + +GLY    LM H+C+ N +H       + ++   +D      IS  YT +LW T 
Sbjct: 219 GRVSVRGLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTL 278

Query: 296 PRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
            RR HL  TK+F C C RC+DP ELGT  SA+KC           +L  NPL   S+W C
Sbjct: 279 DRRLHLKSTKHFWCQCSRCADPRELGTLLSAVKCTG-----CGGAVLSQNPLYQMSNWEC 333

Query: 356 GSCSARL---NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
             C +       + VH       +Q+ +L +  P +  LE+ + K     HP   H    
Sbjct: 334 SDCGSVQKVEQVKRVHDSARMELKQIAQLARNRPEL--LEDFIRKYSGAIHPDSCHVIEA 391

Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
           K++L+QLYG  P   Y  LS  L+ERK+ Y
Sbjct: 392 KYALVQLYGNTPQLLYHDLSEELIERKLDY 421



 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           +  ++ +++  V+ P L     P+CLGC+K    ++ D  RC RC WP CSA C
Sbjct: 61  AGEVILQELPLVVGPKLNTL--PLCLGCYK----SITDTYRCSRCNWPLCSAAC 108


>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
 gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
          Length = 530

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 145/456 (31%), Positives = 242/456 (53%), Gaps = 50/456 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +C+   +EI ++ +LGR+L+A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
           T D+  G++I +E+PL++GPK+A A P+CLGCH+    P  P   +  +C  C WP C  
Sbjct: 64  TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPEKPR-GNYHKCSSCSWPLCGK 121

Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
            C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+   P  +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVAGEEERKESAYCVIMLLRCMQLKAMDPDAF 178

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
            +L ++E H+ +R   T +Y+        LR+N   F++ + G  + D     +  I  +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228

Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
           ++ N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL N + W C  C+ + +A++V    ++L ++++ L +  P   +LEE + +  A  H 
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYG+ PG+A  +LS   L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438


>gi|195496367|ref|XP_002095664.1| GE22530 [Drosophila yakuba]
 gi|194181765|gb|EDW95376.1| GE22530 [Drosophila yakuba]
          Length = 530

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 142/455 (31%), Positives = 242/455 (53%), Gaps = 48/455 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +CK   +EI ++ +LGR+L+A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECK--CFEIATNEVLGRHLKA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
           T D+  G++I +ESPL++GPK+A A P+CLGCH+ L     +  +  +C  C WP C   
Sbjct: 64  TRDIKIGEQILKESPLVLGPKVASA-PLCLGCHRNLLAPEKSRGNYHKCSSCSWPLCGKE 122

Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKYQ 179
           C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+ + P+ + 
Sbjct: 123 CE---DSVHHKAECQLMSGSNFQSKINYVPGEDERKESAYCVIMLLRCMQLKAKDPEAFL 179

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEK--LPGDVLDDTSAKCLHWICGVI 236
           +L  +E H+ +R   T +Y+ + + ++ ++++    K  L  D+L          I  ++
Sbjct: 180 KLYTLEDHLKERLE-TPLYQVLRANLITFIKTVLGMKNWLEMDILR---------IAAIL 229

Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
           + N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S 
Sbjct: 230 DTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSI 289

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+ +N
Sbjct: 290 SYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISLN 345

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
           PL N + W C  C+ + +A++V    ++L ++++ L +  P   +LEE + +  A  H  
Sbjct: 346 PLLNSAPWKCQLCNYKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHET 403

Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           + H    K++L QLYG+ P +   +LS   L RK+
Sbjct: 404 NTHVLQAKYALTQLYGSAPRFGMEELSEESLNRKV 438


>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
          Length = 468

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 146/408 (35%), Positives = 220/408 (53%), Gaps = 42/408 (10%)

Query: 53  KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
           K P+ +K   +LGR + AT D+  G+ IA   PLIVGPK+A + P+CLGCHK L   LA+
Sbjct: 8   KAPFMVKRDEVLGRCVVATRDIRSGEVIAESLPLIVGPKMA-SPPLCLGCHKKLA--LAE 64

Query: 113 NA-RCPRCFWPACSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYK---YEAILPLRC 167
           +   C +CFWP C   C     +  H  EC I  K G +  +  ++ K   Y  I PLR 
Sbjct: 65  SRYDCSKCFWPLCGQACES---SPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRA 121

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSA 226
           L+L++ SP+++  L D ++H+      T+VY+ +  S+V +     L KL      DT+ 
Sbjct: 122 LLLKKNSPERFNLLLDFQSHLQDH-EKTQVYQVLKKSLVPFFTQ--LLKL------DTNE 172

Query: 227 KCLHWICGVIEVNGVDIGRYTQ------GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
             +  IC + + N  ++ R TQ      GLY  I  + H+C  N KH     +   V   
Sbjct: 173 SEILTICSIFDTNCFEV-RDTQRLVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRLVLTA 231

Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
           T        ++T YT  LWGT  RR HL + K+F+C CERC+DPTE GTY SA+ C    
Sbjct: 232 TTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTEFGTYLSAVNC---- 287

Query: 334 KDQGD-CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
              GD   +   +PL+ ++DW C +C   +   D+      L ++++ L ++NPN  SLE
Sbjct: 288 SVCGDGSKVTSSDPLNCEADWFCENCGFAITGSDMLWGNEVLQKEINHLNKDNPN--SLE 345

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           E L +   + H  + +C  VK++L Q+YG+  G+ + +LSS L++RKI
Sbjct: 346 EFLERYAEVLHGRNSYCLQVKYALCQIYGSLDGFKFEELSSDLIDRKI 393



 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 8/71 (11%)

Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 490
           S  ++   +  ++ P  K    P+CLGCHK L   LA++   C +CFWP C   C     
Sbjct: 31  SGEVIAESLPLIVGP--KMASPPLCLGCHKKLA--LAESRYDCSKCFWPLCGQACES--- 83

Query: 491 AHTHAPECAIL 501
           +  H  EC I 
Sbjct: 84  SPLHEAECEIF 94


>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
          Length = 498

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 146/408 (35%), Positives = 220/408 (53%), Gaps = 42/408 (10%)

Query: 53  KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
           K P+ +K   +LGR + AT D+  G+ IA   PLIVGPK+A + P+CLGCHK L   LA+
Sbjct: 8   KAPFMVKRDEVLGRCVVATRDIRSGEVIAESLPLIVGPKMA-SPPLCLGCHKKLA--LAE 64

Query: 113 NA-RCPRCFWPACSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYK---YEAILPLRC 167
           +   C +CFWP C   C     +  H  EC I  K G +  +  ++ K   Y  I PLR 
Sbjct: 65  SRYDCSKCFWPLCGQACES---SPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRA 121

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSA 226
           L+L++ SP+++  L D ++H+      T+VY+ +  S+V +     L KL      DT+ 
Sbjct: 122 LLLKKNSPERFNLLLDFQSHLQDH-EKTQVYQVLKKSLVPFFTQ--LLKL------DTNE 172

Query: 227 KCLHWICGVIEVNGVDIGRYTQ------GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
             +  IC + + N  ++ R TQ      GLY  I  + H+C  N KH     +   V   
Sbjct: 173 SEILTICSIFDTNCFEV-RDTQRLVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRLVLTA 231

Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
           T        ++T YT  LWGT  RR HL + K+F+C CERC+DPTE GTY SA+ C    
Sbjct: 232 TTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTEFGTYLSAVNC---- 287

Query: 334 KDQGD-CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
              GD   +   +PL+ ++DW C +C   +   D+      L ++++ L ++NPN  SLE
Sbjct: 288 SVCGDGSKVTSSDPLNCEADWFCENCGFAITGSDMLWGNEVLQKEINHLNKDNPN--SLE 345

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           E L +   + H  + +C  VK++L Q+YG+  G+ + +LSS L++RKI
Sbjct: 346 EFLERYAEVLHGRNSYCLQVKYALCQIYGSLDGFKFEELSSDLIDRKI 393



 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 8/71 (11%)

Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 490
           S  ++   +  ++ P  K    P+CLGCHK L   LA++   C +CFWP C   C     
Sbjct: 31  SGEVIAESLPLIVGP--KMASPPLCLGCHKKLA--LAESRYDCSKCFWPLCGQACES--- 83

Query: 491 AHTHAPECAIL 501
           +  H  EC I 
Sbjct: 84  SPLHEAECEIF 94


>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
 gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
          Length = 530

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 145/456 (31%), Positives = 240/456 (52%), Gaps = 50/456 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  H+ +C+   +EI ++ +LGR+L+A
Sbjct: 6   LTSCALCQAKASQLCAACRNVVYCCREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
           T D+  G++I +E+PL++GPK+A A P+CL CH+    P  P   +  +C  C WP C  
Sbjct: 64  TRDIKIGEQILKEAPLVLGPKVASA-PLCLSCHRNLLAPEKPR-GNYHKCSSCSWPLCGK 121

Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
            C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+   PK +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMQLKAIDPKAF 178

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
            +L ++E H+ +R   T +Y+        LR+N   F+  + G  + D     +  I  +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFINTVLG--MKDWPEMDILRIAAI 228

Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
           ++ N  ++      R  + LY    ++ H+C+PN +H      + +  +K  + +   +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR HL   K F+CSC RC DP ELG++  A  CL     +    I+  
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISQ 344

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
           NPL N + W C  C+ + +A++V    ++L ++++ L +  P   +LEE + +  A  H 
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402

Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            + H    K++L QLYG+ PG+A  +LS   L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438


>gi|195377589|ref|XP_002047571.1| GJ13517 [Drosophila virilis]
 gi|194154729|gb|EDW69913.1| GJ13517 [Drosophila virilis]
          Length = 528

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 151/454 (33%), Positives = 239/454 (52%), Gaps = 52/454 (11%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
           CA C+  A   C+ C+ V YC ++HQKEHWK  HK +CK   +E+ S+  LGR+L+AT D
Sbjct: 7   CAFCQSHATQLCAGCRSVVYCSREHQKEHWKRGHKRECK--CFELSSNETLGRHLRATRD 64

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNA-RCPRCFWPACSARCSG 130
           +  G++I RE+PL+VGPK+A + P+CLGCHK L P    A N  +C  C WP CSA C  
Sbjct: 65  IKMGEQIMREAPLVVGPKVA-SVPLCLGCHKNLLPPAKPAQNYYKCSACTWPLCSAECEQ 123

Query: 131 ----LSDAHTHAP---ECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
               L++    A    EC I     E    + +  Y  I+ LRC+ L+  SP  +  L +
Sbjct: 124 SPYHLAECRLMAGSNFECKINYKAAEQ--EHKESAYCVIMLLRCVQLKLESPAAFARLYE 181

Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEVNG 240
           +E H+ +R   T +Y+        LR+N   F++ + G  L++ S   +  I  +++ N 
Sbjct: 182 LEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--LNNCSELEILRIAAILDTNA 231

Query: 241 VDIGRY-----TQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTN 289
            ++ +       +GLY    +  H+C+PN +H      + +  +K  + +   +S  YT 
Sbjct: 232 FELRQLGGCVKVRGLYPGAAMFAHDCVPNMRHRFDDDMNIVFLAKRPIAKGEVLSISYTQ 291

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC---WILPVNP 346
            L  T  RR HL   K F+C+C RC DPTELGT+  A  C       G C    I+ ++P
Sbjct: 292 PLRSTIQRRVHLKQVKCFDCACARCEDPTELGTFAGAHVC-------GKCKVGKIISMDP 344

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
           L   ++W C  C+ + +A++     +++ ++++ L +  P   SLE+ L +     H  +
Sbjct: 345 LQLAANWKCEVCNLKRSAKEFLTQDAKIEQELESLDKTTP--LSLEDFLYRHRVELHETN 402

Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            H    K++L QLYG   G+A  +LS   L+RK+
Sbjct: 403 THILQAKYALTQLYGHATGFAMDELSEKSLKRKV 436



 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 8/67 (11%)

Query: 438 RKISYVMSPNLKATDEPICLGCHKPLNP--NLADN-ARCPRCFWPACSARCSGLSDAHTH 494
           R+   V+ P  K    P+CLGCHK L P    A N  +C  C WP CSA C     +  H
Sbjct: 73  REAPLVVGP--KVASVPLCLGCHKNLLPPAKPAQNYYKCSACTWPLCSAEC---EQSPYH 127

Query: 495 APECAIL 501
             EC ++
Sbjct: 128 LAECRLM 134


>gi|194751716|ref|XP_001958171.1| GF10789 [Drosophila ananassae]
 gi|190625453|gb|EDV40977.1| GF10789 [Drosophila ananassae]
          Length = 438

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 140/442 (31%), Positives = 233/442 (52%), Gaps = 48/442 (10%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
           L  CA+C+  A   C+AC+ V YC ++HQKEHWK  HK +CK   YE+ ++  LGR+L+A
Sbjct: 7   LTSCALCQARASQLCAACRNVVYCSREHQKEHWKKGHKAECK--CYEVATNETLGRHLRA 64

Query: 71  TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
           T D+  G++I +E+PLI+GPK+A A P+CLGCH+ L        +  +C  C WP C  +
Sbjct: 65  TRDIKIGEQILKEAPLILGPKVASA-PLCLGCHRNLLAPEKRKGNYYKCSSCSWPLCGKQ 123

Query: 128 CSGLSDAHTHAPECAIL-KLGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKYQ 179
           C    D+  H  EC ++     ++ + Y        +  Y  I+ LRC+ L+ + P+ + 
Sbjct: 124 C---EDSPHHKAECQLMSSSNFQSKINYVPGEAERKESAYCVIMLLRCMQLKTKDPESFA 180

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
           +L  +E H+ +R   T +Y+        LR+N   F++ + G  L D S   +  I  ++
Sbjct: 181 KLYTLEDHLKER-LDTPLYQ-------VLRANLITFIKTVLG--LKDWSELDILRIAAIL 230

Query: 237 EVNGVDIGR-----YTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
           + N  ++ +       + L+    ++ H+C+PN +H      + +  +K  + +   +S 
Sbjct: 231 DTNTFEVRQPRERIKIRALFPGAAMISHDCVPNMRHRFDDDMNIIFLAKRKISKGEILSI 290

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YT  L  T  RR HL   K F+CSC RC DP ELG++  A  C+     +    I+ +N
Sbjct: 291 SYTQPLRSTIQRRLHLRQAKCFDCSCARCQDPEELGSFAGAQNCVKCKAGK----IVSMN 346

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
           PL N ++W C  C+ +  A+DV    +QL ++++ L +  P   +LE+ + +     H  
Sbjct: 347 PLQNSANWKCQICNLKRTAKDVLTSDAQLQQELETLDKSTP--VALEDFIYRHRVELHET 404

Query: 406 HYHCYAVKHSLIQLYGTQPGYA 427
           + H    K++L QLYG  PG+ 
Sbjct: 405 NTHILQAKYALTQLYGNAPGFG 426


>gi|189239848|ref|XP_973542.2| PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
          Length = 829

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 217/425 (51%), Gaps = 31/425 (7%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           RC VC + ALHKC  C    YC K+HQ+EHWK HK  C+  P++I    +LGR+L AT  
Sbjct: 6   RCEVCEKPALHKCGKCHNAHYCDKKHQREHWKQHKSVCQ--PFKICEDAVLGRHLVATRA 63

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
           L PG+ + +E+PLI GP    + P+CLGC K ++   +    C +C WP CS  C     
Sbjct: 64  LKPGEVVIQEAPLIWGPSYN-SIPVCLGCGKAVDETCS--RPCQKCGWPVCSDLCE---K 117

Query: 134 AHTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
           + +H PEC       E +           Y+ I  LRCL  ++  PK +++L+ ++ H+ 
Sbjct: 118 SPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLE 177

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---Y 246
           +    T    +  +I +++R  F  KL     ++   K +H    + +VNG ++     +
Sbjct: 178 EHKSTTNYESDRVTIAEFIRRFF--KLSATFSEEDIMK-VHGSTLIFQVNGHEVPLTEPF 234

Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREH 300
              +YS   ++EH+C PN   S   Q  + +          H+S  Y++ LWGT  RR  
Sbjct: 235 HVAIYSSASMLEHSCGPNCTKSFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYF 294

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP--VNPLDNDSDWTCGSC 358
           L  TK+F C CERC DP+E GT FSA+KC    K  G  ++LP     L    DW C  C
Sbjct: 295 LHETKFFWCHCERCEDPSEFGTNFSAIKC--STKSCGG-YLLPATFTELQKLPDWVCDKC 351

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
              ++   VH +  ++G  +  L  +  +++  ++ L   + + H +HY+C  VK++L Q
Sbjct: 352 DKTMSFYSVHDILERIGRDLFDL--DKSSIEDCKDFLNTHQDLLHENHYYCTEVKYTLSQ 409

Query: 419 LYGTQ 423
           L G +
Sbjct: 410 LIGQR 414


>gi|170056265|ref|XP_001863952.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876021|gb|EDS39404.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 526

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 140/403 (34%), Positives = 210/403 (52%), Gaps = 34/403 (8%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           PY ++ S + GRYL A+ +L  G+ I +  PL VGP  A ++P+CLGCH+      A   
Sbjct: 46  PYVVRHSDVWGRYLIASRNLKAGEVIIQVEPLAVGP-WAESDPVCLGCHRTFEVG-AKTV 103

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKL-GCETLL------AYNDYKYEAILPLRC 167
           RCP C W  CS  C+GL+  H+   EC  LK  G   LL            YEAIL LRC
Sbjct: 104 RCPTCNWRICSPTCTGLTTHHSRL-ECIPLKEHGVAKLLETCSSATQIKLAYEAILALRC 162

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEE-IDSIVKYLRSNFLEKLPGDVLDDTSA 226
           L+L+R +P +Y++L +M+A   +R   T +++   ++IV+ +R  +         ++ S 
Sbjct: 163 LLLKRTAPDRYEKLLEMDAMNERRQQITSLWKRNQEAIVQRIRDEW-------KFEEFSE 215

Query: 227 KCLHWICGVIEVNGVDIG---RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
             +H +CG+IEVN   IG    + + LY     + H+C PN  H++   S     R T  
Sbjct: 216 AEVHTVCGIIEVNAFQIGPTEVHARALYPEAFYIMHDCTPNTTHTDQPGSHELTVRPTRD 275

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 ++  Y+  L GT  RR+HL  +K+F C C RC DPTE GT+ S +KC    K  
Sbjct: 276 LKAGEPLTLSYSYTLQGTLKRRQHLCDSKFFWCQCPRCEDPTEFGTHCSTLKCAKCPKG- 334

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
               +L  NPLD D+ W C +C   + A+ V  + + L ++++ +     NV+  E  L 
Sbjct: 335 ---LVLSSNPLDQDAPWKCKNCPHTIPAQSVTQLLNGLYKELESI--GGNNVELFENFLL 389

Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           K E + H +HY   + KHSL +LYG  P Y   +L+   + RK
Sbjct: 390 KYEPILHKNHYLFLSAKHSLCELYGKAPNYLMHELTLDQINRK 432



 Score = 46.6 bits (109), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 23/53 (43%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
           A  +P+CLGCH+      A   RCP C W  CS  C+GL+  H+   EC  LK
Sbjct: 83  AESDPVCLGCHRTFEVG-AKTVRCPTCNWRICSPTCTGLTTHHSRL-ECIPLK 133


>gi|321449930|gb|EFX62153.1| hypothetical protein DAPPUDRAFT_120473 [Daphnia pulex]
          Length = 510

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 149/443 (33%), Positives = 228/443 (51%), Gaps = 54/443 (12%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVCR  A   C  C E++YC K+HQKEHW +HK  CK  PY+I      GR + A+ +L
Sbjct: 10  CAVCRAEASQICGGCGEISYCSKEHQKEHWLVHKSLCK--PYKIVHDEKFGRCIFASKNL 67

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLN-PNLADNARCPRCFWPACSARCSGLSD 133
            PG+ I  E+ +I GPK     P CL C+  L+    A   RCP C +P C  +C+    
Sbjct: 68  KPGEIIFGETAVITGPKQGCT-PCCLKCYASLDRVQEASLFRCPNCNFPFCQEQCA---K 123

Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH--MSKR 191
           +  H  EC IL    ++ +  ND             + R  P    E K +E H  + ++
Sbjct: 124 SPEHEAECLILSRA-KSCIVIND-------------VHRIHP----EYKMLENHNDLRRQ 165

Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI----GRYT 247
                +Y+   ++V++LR     K+ G + D+ S + +H  CGVI+VN  +I     +Y 
Sbjct: 166 SDMWRIYQV--NVVQFLR-----KICG-LADEFSEEEIHASCGVIDVNAFEIRLAGNQYQ 217

Query: 248 Q--GLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
           Q  G++ +  +M HNC+ N +H      +  +++ + + +   I T YT  L GT+ RR+
Sbjct: 218 QVLGVFPLASMMSHNCVANTQHVIDANYTMTVRASVPIMKGEQIFTSYTLPLEGTKERRD 277

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN-DSDWTCGSC 358
            L  +K F C C RCSDPTE  TY SA++C N      +  +LP  PLD  ++DW C  C
Sbjct: 278 VLRHSKLFECDCSRCSDPTECSTYLSALRCPN----CSNGVVLPERPLDEPETDWQCSQC 333

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
             +L A  V  V  +L E+++ +  E   V+  E+ L++  ++ HP+H+     + SL  
Sbjct: 334 PYKLTAAVVTQVIDKLKEELETI--EPDQVEKFEDFLSRHASLLHPNHFLLTRARQSLSI 391

Query: 419 LYGTQPGYAYTQLSSSLLERKIS 441
           LYG    +    L+S  LERK+S
Sbjct: 392 LYGRNERFLLNTLTSQQLERKVS 414


>gi|270011901|gb|EFA08349.1| hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
          Length = 803

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 215/425 (50%), Gaps = 34/425 (8%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           RC VC + ALHKC  C    YC K+HQ+EHWK HK  C+  P++I    +LGR+L AT  
Sbjct: 6   RCEVCEKPALHKCGKCHNAHYCDKKHQREHWKQHKSVCQ--PFKICEDAVLGRHLVATRA 63

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
           L PG+ + +E+PLI GP    + P+CLGC K ++   +    C +C WP CS  C     
Sbjct: 64  LKPGEVVIQEAPLIWGPSYN-SIPVCLGCGKAVDETCS--RPCQKCGWPVCSDLCE---K 117

Query: 134 AHTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
           + +H PEC       E +           Y+ I  LRCL  ++  PK +++L+ ++ H+ 
Sbjct: 118 SPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLE 177

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI---GRY 246
           +    T    +  +I +++R  F  KL     ++   K    + G   VNG ++     +
Sbjct: 178 EHKSTTNYESDRVTIAEFIRRFF--KLSATFSEEDIMK----VHGSTLVNGHEVPLTEPF 231

Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREH 300
              +YS   ++EH+C PN   S   Q  + +          H+S  Y++ LWGT  RR  
Sbjct: 232 HVAIYSSASMLEHSCGPNCTKSFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYF 291

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP--VNPLDNDSDWTCGSC 358
           L  TK+F C CERC DP+E GT FSA+KC    K  G  ++LP     L    DW C  C
Sbjct: 292 LHETKFFWCHCERCEDPSEFGTNFSAIKC--STKSCGG-YLLPATFTELQKLPDWVCDKC 348

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
              ++   VH +  ++G  +  L  +  +++  ++ L   + + H +HY+C  VK++L Q
Sbjct: 349 DKTMSFYSVHDILERIGRDLFDL--DKSSIEDCKDFLNTHQDLLHENHYYCTEVKYTLSQ 406

Query: 419 LYGTQ 423
           L G +
Sbjct: 407 LIGQR 411


>gi|357603991|gb|EHJ63998.1| hypothetical protein KGM_04190 [Danaus plexippus]
          Length = 564

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 141/408 (34%), Positives = 211/408 (51%), Gaps = 49/408 (12%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           YEIK S ++GRY+ AT D+ PG+ I     L+VGP       ICLGC++ L    +   +
Sbjct: 81  YEIKHSDVMGRYIVATRDIEPGEVIITAPALVVGPCSGCG-LICLGCYRELEE--SHLTK 137

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYK--------YEAILPLR 166
           C  C WP CS  C GL     H+      K  CETL     DYK        Y AI PLR
Sbjct: 138 CSGCKWPLCSNSCFGLGKYTGHS------KYECETLKKIPPDYKNMEDLRDSYHAITPLR 191

Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP--GDVLDDT 224
           CL+L++  P K+  L +ME+H + R    ++Y   D        N +++L   G   DD 
Sbjct: 192 CLLLKKEDPSKWCALSEMESHNAIRRSRGDIYPMNDR-------NVVQRLKKWGLGYDDD 244

Query: 225 SAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN---------MMQ 272
               +H +CG++EVN  ++G      + LYS   L+ H+C P+  H++          ++
Sbjct: 245 E---VHTVCGILEVNAFEVGASGANARALYSEAYLLAHDCTPSTSHTDHERRPGRPLTIR 301

Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
           + +       IS  Y   L GT  RREHLA++K+F CSC RC+DPTE GT  S+++C   
Sbjct: 302 AAIRHVPGDVISLCYAYTLQGTLKRREHLAVSKFFMCSCVRCADPTERGTQASSLRC--- 358

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
              +    +L  +PL +D+ W C  C+  ++A  V L+  +L ++ D++  +  +V+  E
Sbjct: 359 --PRCAGSVLSTDPLRSDALWRCAECTYEMSAATVKLLLKRLTDEFDQI--DANDVQGFE 414

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             LTK   +  P+HY   + KHSL QLYG    Y   +++  LL+RKI
Sbjct: 415 NYLTKYRNVLQPNHYLNLSAKHSLSQLYGKVADYMIHEMTEELLQRKI 462


>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
 gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
          Length = 527

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 142/457 (31%), Positives = 237/457 (51%), Gaps = 56/457 (12%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
           +CA C+  A   C+AC+ V YC ++HQKEHWK  HK +CK   YE+ S+ LLGR+L+AT 
Sbjct: 6   KCAYCQAHATQLCAACRNVVYCSREHQKEHWKRGHKLECK--CYELASNELLGRHLRATR 63

Query: 73  DLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSARC 128
           D+  G+ I RE+PL++GPK+A + PICLGCH+    P  P L +  +C  C WP C   C
Sbjct: 64  DIKMGELIMREAPLVLGPKVA-SMPICLGCHRNLLPPQKPAL-NYYKCSACSWPLCGPEC 121

Query: 129 SGLSDAHTHAPECAILKLG-CETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQE 180
                +  H  EC ++     ++ + YN       +  Y  I+ LRC+ L+   P  +  
Sbjct: 122 E---RSAFHVDECRLMAASKFQSKINYNPGQPVGKESAYCVIMLLRCMQLKESKPAAFAR 178

Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNF-LEKLPG-DVLDDTSAKCLHWICGVIE 237
           L ++E H+ +R   T +Y+ + + ++ ++++   L + P  D+L          I  +++
Sbjct: 179 LCELEDHLKERLE-TPLYQVLRANLISFIKTVLGLREWPDLDILR---------IAAILD 228

Query: 238 VNGVDIGRY-----TQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTM 286
            N  ++ +       +GL+    +  H+C+PN +H      + M  +K  + +   +S  
Sbjct: 229 TNAFEVRQAGDRIKVRGLFPGGAMFSHDCVPNMRHRFDDDMNIMFLAKRPIAKGEILSIS 288

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI---LP 343
           YT  L  T  RR HL   K F C+C RC+DPTELGTY  A  C         C +   + 
Sbjct: 289 YTQQLRSTIQRRLHLKQVKCFECACARCADPTELGTYAGAHMC-------AKCKVGKFIS 341

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
           ++PL N ++W C  C+    A++  +  +++  +++ L +  P  +S E+ L +     H
Sbjct: 342 MDPLQNAANWRCEVCNLIRPAKEYLINDAKIEAELESLDKTTP--QSFEDFLYRHRVELH 399

Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             + H    K++L Q+YG   G+   +LS   L+RK+
Sbjct: 400 ETNTHILQAKYALTQIYGNVKGFGMDELSDVSLKRKV 436


>gi|157125398|ref|XP_001654321.1| hypothetical protein AaeL_AAEL001920 [Aedes aegypti]
 gi|108882685|gb|EAT46910.1| AAEL001920-PA [Aedes aegypti]
          Length = 527

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 143/436 (32%), Positives = 223/436 (51%), Gaps = 42/436 (9%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           PY ++ S + GRYL A  +L  G++I +  PL +GP  A ++P+CLGC+K      +   
Sbjct: 46  PYIVRYSDVWGRYLVAARNLKAGEKIIQVEPLAMGP-WAASDPVCLGCYKRFAKG-SKII 103

Query: 115 RCPRCFWPACSARCSGLSDAHTHAP-ECAILKL-GCETLLAYNDYK-----YEAILPLRC 167
           RC  C W  CS  CSGL+    H   EC  LK      LL  +  +     YEAI  LRC
Sbjct: 104 RCSGCGWRICSHTCSGLTANGAHKRLECNPLKEHNVVNLLETSSTEQIKLMYEAIFALRC 163

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSA 226
           L+L++  P++Y  L +ME+         E+ ++I S+ K  +   + ++  +   D+ S 
Sbjct: 164 LLLKKTDPERYATLLEMESL-------NELRQKISSLWKRNQEMIVGRIRDEWRFDEFSE 216

Query: 227 KCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
             +H +CG+IEVN   IG+   Y + LY     + H+C PN  H++  ++ +   R T+ 
Sbjct: 217 GEIHTVCGIIEVNSFQIGQDDVYARALYPEAFYIMHDCTPNTTHTDDPKTHVLSIRLTND 276

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+  Y+  L GT  RR+HL  +K+F C C+RCSDPTE GT  S +KC    K  
Sbjct: 277 LKAGDPITLSYSYTLQGTLKRRQHLFESKFFWCQCKRCSDPTEFGTNCSTIKCTKCPKG- 335

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
              +IL  NPLD D+DW C SCS  + A+ +  +  +L +++D +  +  ++   E+   
Sbjct: 336 ---YILTTNPLDQDADWKCKSCSHTMPAKIIVQLLDRLFKELDAI--DGNSIDLYEQFNV 390

Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK---------ISYVMSPN 447
           K + + H +HY   + KHSL QLYG   GY  +++S   +++K         +  V  P 
Sbjct: 391 KYQNILHHNHYLFLSTKHSLCQLYGKISGYLISEMSLQQIKQKEKICRDLLSVVDVYEPG 450

Query: 448 LKATDEPICLGCHKPL 463
           L      I    H P+
Sbjct: 451 LSRLRGVIMYELHAPI 466



 Score = 38.9 bits (89), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK 502
           A  +P+CLGC+K      +   RC  C W  CS  CSGL+    H   EC  LK
Sbjct: 83  AASDPVCLGCYKRFAKG-SKIIRCSGCGWRICSHTCSGLTANGAHKRLECNPLK 135


>gi|242013801|ref|XP_002427589.1| protein msta, isoform B, putative [Pediculus humanus corporis]
 gi|212512004|gb|EEB14851.1| protein msta, isoform B, putative [Pediculus humanus corporis]
          Length = 472

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/399 (31%), Positives = 212/399 (53%), Gaps = 45/399 (11%)

Query: 67  YLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN-ARCPRCFWPACS 125
           YL A+  +  G+ I  + P++VGP    ++P+C+GC+  ++ N ++   RC  C WP CS
Sbjct: 1   YLVASKSIKQGEIILEDEPIVVGP-CQESKPLCIGCYTDIDLNDSNKYFRCVNCTWPLCS 59

Query: 126 ARCSGLSDAHTHAPE-CAILK---LGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
            +C G  +   H+ E C       +  + +  + ++ Y A+ P RCL+L++ +P+K++ +
Sbjct: 60  NQCKGFFNKFGHSKEECETFSKSFVKNKEIKTFVNFYY-ALTPFRCLLLKKINPEKWKII 118

Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT-------SAKCLHWICG 234
           K+ME+H ++R    +++E          +N  EK+  D++ +            +H++CG
Sbjct: 119 KNMESHTNERKKDGKLWE----------TN--EKMAVDIIKNVWGLSDDDDDNDIHFVCG 166

Query: 235 VIEVNGVDIG----RYTQGLYSVICLMEHNCLPNAKHSNMMQSKL------FVFRDTHIS 284
           ++EVN  ++     +  + LYS   L  HNC+PN  HSN  + KL      F+ +++ ++
Sbjct: 167 ILEVNSFEVNFQNVQSLRALYSNAFLFSHNCIPNTSHSNNNEFKLIFRTTTFIQKNSAVT 226

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             Y   L  T  RR+HL  TK+F C+C+RCSDPTEL T+   +KC    +   + + LP 
Sbjct: 227 ISYAYTLQPTLLRRKHLEKTKFFQCNCQRCSDPTELNTFIGGVKC----QICKNGFYLPE 282

Query: 345 NPLDNDSDWTCGS---CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
           N LDN S W C S   C A++ A ++     +   ++D +     +V+S E  + K + +
Sbjct: 283 NSLDNKSMWFCNSIKGCKAQIQASNIECTIERFFNELDGI--NGNDVESYEYFIIKCQEI 340

Query: 402 FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           FH +HY C +  H L QLYG   GY    L+   L RKI
Sbjct: 341 FHKNHYLCLSAMHCLSQLYGKINGYLINDLTEEQLLRKI 379



 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 20/80 (25%), Positives = 42/80 (52%), Gaps = 16/80 (20%)

Query: 453 EPICLGCHKPLNPNLADN-ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAY 511
           +P+C+GC+  ++ N ++   RC  C WP CS +C G  +   H+ E       CET    
Sbjct: 29  KPLCIGCYTDIDLNDSNKYFRCVNCTWPLCSNQCKGFFNKFGHSKE------ECET---- 78

Query: 512 NDYKSLSRAYLDHQQVRSWT 531
                 S++++ +++++++ 
Sbjct: 79  -----FSKSFVKNKEIKTFV 93


>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
          Length = 516

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 137/448 (30%), Positives = 214/448 (47%), Gaps = 65/448 (14%)

Query: 32  VAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGP 90
           V YC K+HQK  WK  HK +CK  PY+I+ S   GR++ AT D+  G+ I +E P ++GP
Sbjct: 2   VYYCSKEHQKFDWKKSHKVECK--PYKIQYSDSCGRFMTATRDIKQGEVILKEKPAVIGP 59

Query: 91  KLALAEPICLGCHKPLNPNLADNA-----RCPRCFWPACSARCSGLSDAHTHAPECAIL- 144
           ++   +  CL C + L P   D+      RC  C WP C   C     A  H  EC I+ 
Sbjct: 60  RM-YCKVQCLSCGRQLQPIPNDDGNMDFIRCSSCNWPVCGIDCE---KAEVHREECKIMV 115

Query: 145 --KLGCETLLAYNDYKYE----------AILPLRCLILQRRSPKKYQELKDMEAHMSKRG 192
             K  C       D KYE           I PLR L+++  +P+ ++ + ++E+H+ +R 
Sbjct: 116 QSKYKC-------DIKYECPDKAEAAYCVIAPLRVLLMKDSNPRMFESIMNLESHVDQR- 167

Query: 193 PGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----G 244
             T +Y+        LR N   F+ ++ G   DD   K +  I  + + N  D+      
Sbjct: 168 INTPIYQ-------VLRPNLIMFIRQVLGMNFDD---KIILEISSIFDNNSFDVRSADKT 217

Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF-------VFRDTHISTMYTNALWGTQPR 297
           +  + +Y +  +M H+C PN +H  + + K         + +   I+  YT  LWGT  R
Sbjct: 218 KRLRAIYLLASMMNHSCRPNTRHIYLGEDKTLALIATVHIAKGEEITATYTQPLWGTLDR 277

Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKC-----LNEHKDQGDCWILPVNPLDNDSD 352
           R+ L   KYF+C CERC+DPTE GTY   + C      N    +    ++  NPLD  + 
Sbjct: 278 RKFLKTNKYFDCKCERCADPTEFGTYLGNIYCTMCNSTNADVSRKGAMLVSTNPLDETAL 337

Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
           W C +C   +  R +      L E++  L +  P  +  E  + K +   HP ++     
Sbjct: 338 WKCENCGYFIQGRQMVWGNKALKEELSSLNKTGP--QDFEHFIEKYKLTLHPRNHLLIQA 395

Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           K +L+++YG   GY+  +L  +LL RKI
Sbjct: 396 KLALMEIYGNYKGYSLKELPENLLSRKI 423


>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
 gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
          Length = 497

 Score =  211 bits (537), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 209/407 (51%), Gaps = 33/407 (8%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           PY++  +P LGRY  +  DL  GD I  E P  +GPK+  + P+CL C  P++ + A+  
Sbjct: 4   PYKVLDNPELGRYAVSAKDLKAGDVILEEIPFAIGPKVN-SSPLCLECCCPVDGS-ANGP 61

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLI 169
           +C  C WP C        +   H  ECA+   G        D      + + I PLR L+
Sbjct: 62  KCSDCGWPMCEECNEDKDNVVYHKKECALFVKGKAKFQNVEDSTASCMQLDCITPLRMLL 121

Query: 170 LQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
            +   P+++  EL  ME H   R  GT  +++  +IV YLR       P  + D  S + 
Sbjct: 122 AKEDHPERWDAELSVMEYHNDARKDGTTWHQDQQNIVGYLRG------PCGLKDRFSEEV 175

Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLF 276
           +  + G++++N  ++    G   +GLY  + +M H+C+PN  HS         + +  + 
Sbjct: 176 IQQVIGILDINAFEVRTSKGYSARGLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAID 235

Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +     + T+YT  L GT  R+  L  +KYF C C RC DPTELGT+FS++KC  +  D 
Sbjct: 236 IPEGEKLYTVYTYTLNGTSTRQAALKSSKYFTCRCPRCLDPTELGTHFSSLKC--QKCDN 293

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENP-NVKSLEE 393
           G   I+  NPLD++++W C +C  +     V      +  ++D+L  ++  P  + + E+
Sbjct: 294 G--LIVSSNPLDDEAEWRCSNCEFKTRGAAVQKAIQVMQAEIDQLSCLENGPEKLTAFEK 351

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           +  K  ++ HP H+   +++HSLI+LYG  PGY   +L   LLERK+
Sbjct: 352 LYKKYRSILHPLHFIKTSIRHSLIELYGRVPGYEMPELPDILLERKV 398


>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 467

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 137/448 (30%), Positives = 222/448 (49%), Gaps = 50/448 (11%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA 70
           E   C +C   ++HKCSAC+ V YC K+HQKE WK H   CK   +++  +P LGR+  A
Sbjct: 2   ESQTCVICGGHSVHKCSACENVYYCSKKHQKEDWKKHSKVCKS--FKLAENPSLGRHYVA 59

Query: 71  TLDLHPGDRIARE-SPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS 129
           T ++  G+ I R+  PLI G  +  + P+CL C+  LN  +A    C +C WP C     
Sbjct: 60  TRNIKVGEIILRDDQPLITGL-MYNSVPVCLQCYTMLNQEIA--IPCEKCGWPLCQ---- 112

Query: 130 GLSDAHTHAPECAILKLGCE--------TLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
              + + H  EC   K  C         T   Y    Y+ I  +R L L+  +P+ Y++L
Sbjct: 113 ---NCNEHGLEC---KFSCSRRDSKISITEFGYPHPSYQCINIIRALSLKDTNPESYKKL 166

Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGV 241
             +E+H ++     E       I ++ ++           DD S + +  I G+++VNG 
Sbjct: 167 ISLESHCNEINNSKEPLNIAHFIKRFFKA-----------DDISEEEIVTIIGILQVNGH 215

Query: 242 DIGRYTQ---GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALW 292
           ++         +Y +  L+EHNC  N   S       ++++ L + +  HIS  YT+ LW
Sbjct: 216 EVPLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGLIIRAALPITKGDHISICYTDPLW 275

Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSD 352
           GT  RR HL  TK+F C C RC DPTE  T F+A+KC   +K     ++LP   L+ + D
Sbjct: 276 GTANRRHHLLKTKFFECICNRCQDPTEFQTMFNALKC---NKINCSGYVLPKTFLEQEQD 332

Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
           + C +C + ++  ++  +   +G ++  +  +  ++ + +E L + +   HP+HY+   V
Sbjct: 333 YVCKTCESIVSCTEIEKMLEDIGIRLSNM--KKNDIVACKEFLDRYKNNLHPNHYYNIDV 390

Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             +L QL G Q G     +   LL  KI
Sbjct: 391 TIALTQLIGQQTG-GLAVIEEDLLIEKI 417


>gi|350420138|ref|XP_003492412.1| PREDICTED: hypothetical protein LOC100743563 [Bombus impatiens]
          Length = 442

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 133/448 (29%), Positives = 228/448 (50%), Gaps = 49/448 (10%)

Query: 10  EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           E    CA+C + A+HKCSAC  V YC KQHQKE W+ H   CK   +++    LLGR+  
Sbjct: 2   ESQTTCAICGKNAIHKCSACGNVYYCSKQHQKEDWRNHAKACKS--FKLAEDSLLGRHCI 59

Query: 70  ATLDLHPGDRIARES-PLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARC 128
           AT ++  G+ + ++  PLI GP    A P+CL C+  LN  +A    C +C WP C    
Sbjct: 60  ATRNIKVGEIVLKDDEPLIAGPMYNCA-PVCLRCYMVLNETIA--VACEKCGWPLCQ--- 113

Query: 129 SGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKD 183
               D   H  EC       +  ++  ++      Y+ I  +R L  +  + + Y++L  
Sbjct: 114 ----DCKDHGLECNFSSSRRDHKVSITEFGHPHPSYQCITVIRALASKDVNLESYKKLLS 169

Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI 243
           +E+H  +         E+ + V++++  F         DD S + +  I G+++VNG ++
Sbjct: 170 LESHCDRINS-----HELSNTVRFIKRFFKT-------DDISEEEMTKIVGILQVNGHEV 217

Query: 244 GRYT---QGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGT 294
                    +Y +  L+EHNC  N   S       ++ + + + +  HIS  YT+ LWGT
Sbjct: 218 PLTDPPYVAVYELTSLLEHNCKANCSKSFTDTGGLIIHAAVPIAKGDHISICYTDPLWGT 277

Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILPVNPLDNDSD 352
             RR HL  TK+F C+C+RC DPTE GT F+A++C     ++ +C  +ILP    + + D
Sbjct: 278 ANRRHHLLKTKFFECTCDRCKDPTEFGTMFNALRC-----NRMNCPGYILPKTFFEQEHD 332

Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
           + C  C + +   ++  +   +G  +  ++++N ++ + +E +++ ++  HP+H++   V
Sbjct: 333 YICKICESIVPYVEIEKMLEDIGIYL-SIIKKN-DIIACKEFISRYKSTLHPNHFYNIDV 390

Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             +L QL G Q G     + + LL  KI
Sbjct: 391 TIALAQLIGQQTG-GLAAVETDLLIEKI 417


>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 440

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 136/445 (30%), Positives = 221/445 (49%), Gaps = 44/445 (9%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA 70
           E   C +C   ++HKCSAC+ V YC K+HQKE WK H   CK   +++  +P LGR+  A
Sbjct: 2   ESQTCVICGGHSVHKCSACENVYYCSKKHQKEDWKKHSKICKS--FKLAENPSLGRHYIA 59

Query: 71  TLDLHPGDRIARE-SPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS 129
           T ++  G+ I R+  PLI G       P+CL C+  LN  +A    C +C WP C     
Sbjct: 60  TRNIKVGEIILRDDQPLITGLMYNTV-PVCLQCYTVLNQEIA--IPCEKCGWPLCQ---- 112

Query: 130 GLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKDM 184
              + + H  EC       ++ ++  ++      Y+ I  +R L L+  +P+ Y++L  +
Sbjct: 113 ---NCNEHGLECKFSSSRRDSKISITEFGYPHPSYQCINVIRALSLKDTNPESYKKLISL 169

Query: 185 EAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG 244
           E+H ++     E       I ++ ++           DD S + +  I G+++VNG ++ 
Sbjct: 170 ESHCNEMNNSKEPLNIAHFIKRFFKA-----------DDISEEEIATIIGILQVNGHEVP 218

Query: 245 RYTQ---GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGTQ 295
                   +Y +  L+EHNC  N   S       ++++ L + +  HIS  YT+ LWGT 
Sbjct: 219 LTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGLIIRAALPITKGDHISICYTDPLWGTA 278

Query: 296 PRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
            RR HL  TK+F C C RC DPTE  T F+A+KC N +      +ILP   L+ + D+ C
Sbjct: 279 NRRHHLFKTKFFECICNRCQDPTEFQTMFNALKCNNINCSG---YILPKTFLEQEQDYIC 335

Query: 356 GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHS 415
             C + ++  ++  V   +G  +D    +  ++ + +E L + +   HP+HY+   V  +
Sbjct: 336 KICESVVSCTEIEKVLEDIG--IDLSNMKKNDIVACKEFLDRYKNDLHPNHYYNIDVTIA 393

Query: 416 LIQLYGTQPGYAYTQLSSSLLERKI 440
           L QL G Q G     +  +LL  KI
Sbjct: 394 LAQLIGQQTG-GLAVIEENLLIEKI 417


>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
 gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
          Length = 546

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 155/507 (30%), Positives = 238/507 (46%), Gaps = 43/507 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC + A  +CS C    YC  +HQ++ WK HK  C   P++I S    GRY+ AT D+
Sbjct: 26  CPVCNQPAKKRCSGCSAAYYCSVEHQRQDWKNHKNVCH--PFKICSDERYGRYMMATKDI 83

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
             GD + +ESPL+ GP   +  P+C+GC + L  N      C RC WP C   C    D 
Sbjct: 84  KAGDVVLKESPLVYGPA-QITNPVCVGCLQGLTEN--QFLECERCGWPVCKRECQ---DH 137

Query: 135 HTHAPECAI-LKLGCETLLA--YNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
            +H  EC   +  G +  L   YN +  Y+ ++PLRCL+L  R P K+Q L  +E+H  +
Sbjct: 138 PSHKAECKFTIARGSKMQLRHFYNPHPVYQCLIPLRCLLLAERDPAKWQALIKLESHEEQ 197

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRY---T 247
           R    +   + + + K +   F  K      +D   K    I G+++VNG ++      +
Sbjct: 198 RRGSEQWRNDREGVAKLIPRFF--KCENKWSEDEILK----IVGILQVNGHEVPLTEPPS 251

Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF-------RDTHISTMYTNALWGTQPRREH 300
             +Y+   ++EH+C PN   S     K  VF       +   +S  Y++ LWGT  R++H
Sbjct: 252 VAIYNNASMLEHSCRPNLSKS-FTSKKEIVFWAPNPIKQGERLSISYSDVLWGTANRQDH 310

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSCS 359
           L  TK F C+C RC DPTE GTY SA+KC    KD      +LP N  +    + C  C 
Sbjct: 311 LQQTKLFRCTCVRCLDPTEFGTYLSALKCSGFKKDSNCSGLLLPENLKNWYGGYICNKCR 370

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
             ++ +++  +  +   +VD    +  N +   + +        P+H+    VK SL Q+
Sbjct: 371 GLVDGKEITNILDRA--RVDHEAMQKDNEQHCHKYIAHYGRWLGPNHHLLVDVKISLSQI 428

Query: 420 YGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC------LGCHK-PLNPNLADNAR 472
            G     A  ++S   L  K+   +   L    + +C      +G  +  L+  LA+ AR
Sbjct: 429 MGGGDPNAIQKISDEDLMTKMK--ICQELIDVFQRVCPAEARVIGTTRFELHAALAEFAR 486

Query: 473 CPRCFWPACSARCSGLSDAHTHAPECA 499
             R      SA  S L D+  +A EC 
Sbjct: 487 --RGVESMNSAVRSALEDSLYNAEECV 511


>gi|195566137|ref|XP_002106647.1| GD16997 [Drosophila simulans]
 gi|194204029|gb|EDX17605.1| GD16997 [Drosophila simulans]
          Length = 532

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/457 (29%), Positives = 222/457 (48%), Gaps = 46/457 (10%)

Query: 37  KQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE 96
           +Q +KEH +    + K+LPY ++ S + GRYL A   L  G+ + RE PL +GP ++  +
Sbjct: 31  EQQEKEHSRAPTEE-KELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-GD 88

Query: 97  PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAIL--------KLG 147
           P+CLGC++P++    D  RCP C WP C + C+GL   H H   EC +         KL 
Sbjct: 89  PVCLGCYQPVSLK-PDQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYGERRAVAGKLL 147

Query: 148 CETLL-AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IV 205
            E    A     YE ++ +R L+L++  P+++  +  ME+H  +R     ++   +  +V
Sbjct: 148 TERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV 207

Query: 206 KYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCL 262
           + LR  +        L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+C 
Sbjct: 208 QRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCT 260

Query: 263 PNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           PN  H++  +S   + R +        ++  Y   L GT  RR  +   K F C C RCS
Sbjct: 261 PNTAHTDDPRSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCS 320

Query: 316 DPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLG 375
           DP ELGT  SA+ C           +  V+PL    DW C  C+ ++ A +V     ++ 
Sbjct: 321 DPRELGTDCSALVCATCRTGS----VRAVDPLQQSGDWACDRCAHKMGATEVERQLDRIN 376

Query: 376 EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSL 435
             ++ +  +  ++  LE  L +   +  P+HY   + K+SL Q+YG   GY   Q+S   
Sbjct: 377 NDLEDI--DVHDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPED 434

Query: 436 LERKISY---------VMSPNLKATDEPICLGCHKPL 463
           + RK SY         V+ P L      I    H P+
Sbjct: 435 IARKESYCREFLEIVDVLDPGLTRLRGLIMYELHAPV 471



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS I  YG          +   L R+    + P +  + +P+CLGC++P++    D
Sbjct: 49  YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-PD 103

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
             RCP C WP C + C+GL   H H   EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135


>gi|24641162|ref|NP_572675.1| CG11160, isoform A [Drosophila melanogaster]
 gi|22832070|gb|AAF47987.2| CG11160, isoform A [Drosophila melanogaster]
          Length = 532

 Score =  201 bits (512), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 140/460 (30%), Positives = 222/460 (48%), Gaps = 50/460 (10%)

Query: 37  KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
           KQ Q++  +   P + K+LPY ++ S + GRYL A   L  G+ + RE PL +GP ++  
Sbjct: 29  KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87

Query: 96  EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
           +P+CLGC+ P++   AD  RCP C WP C + C+GL   H H   EC +      +  E 
Sbjct: 88  DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146

Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEID 202
           L      A     YE ++ +R L+L++  P+++  +  ME+H  +R     +   YEE  
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE-- 204

Query: 203 SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEH 259
            +V+ LR  +        L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H
Sbjct: 205 KVVQRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAH 257

Query: 260 NCLPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCE 312
           +C PN  H++   S   + R +        ++  Y   L GT  RR  +   K F C C 
Sbjct: 258 DCTPNTAHTDDPSSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCR 317

Query: 313 RCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTS 372
           RCSDP ELGT  SA+ C           +  V+PL    DW C  C+ ++ A +V     
Sbjct: 318 RCSDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLD 373

Query: 373 QLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLS 432
           ++   ++ +  +  ++  LE  L +   +  P+HY   + K+SL Q+YG   GY   Q+S
Sbjct: 374 RINNDLEDI--DVHDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMS 431

Query: 433 SSLLERKISY---------VMSPNLKATDEPICLGCHKPL 463
              + RK SY         V+ P L      I    H P+
Sbjct: 432 PEDIARKESYCREFLEIVDVLDPGLTRLRGLIMYELHAPV 471



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS I  YG          +   L R+    + P +  + +P+CLGC+ P++   AD
Sbjct: 49  YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
             RCP C WP C + C+GL   H H   EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135


>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
           [Megachile rotundata]
          Length = 523

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 136/436 (31%), Positives = 215/436 (49%), Gaps = 57/436 (13%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC   ++ KCSAC  V YC K+HQKE WK H   C+   +++  S  LGRY  A  ++
Sbjct: 7   CAVCGGESVFKCSACGNVYYCSKKHQKEDWKRHVQFCR--AFKLAESSSLGRYYIAARNI 64

Query: 75  HPGDRIARES-PLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
             G+ + ++  PL+ GP +  + P+CLGC+  L+ N A    C +C WP C        +
Sbjct: 65  KAGEIVLKDDRPLVAGP-MHNSVPVCLGCYMVLHENTA--VPCTKCGWPLCQ-------N 114

Query: 134 AHTHAPEC-----------AILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
              H  EC           +I K G      Y    Y+ I  +R L L+  + + Y +L 
Sbjct: 115 CKEHGTECDFTSSRRSDKVSITKFG------YPHPSYQCINVIRALSLKDNNIESYNKLL 168

Query: 183 DMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGV 241
            +E+H  K +   T ++EE  ++V +++  F         +D   + +  I GV++VNG 
Sbjct: 169 TLESHSDKIKNKETFIFEESSNLVNFIKRFFKS-------EDIPEEDIRKIIGVLQVNGH 221

Query: 242 DIGRYT---QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT------HISTMYTNALW 292
           ++         +Y +  L+EHNC  N   S      L +   T      HIS  YT+ LW
Sbjct: 222 EVPLTDPPYVAVYELASLLEHNCKANCSKSFTDTGGLIIHAATPITKGDHISICYTDPLW 281

Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILPVNPL-DN 349
           GT  RR HL  TKYF C+C+RC DPTE G  F+A++C      + DC  ++LP   L D+
Sbjct: 282 GTMNRRHHLLETKYFECTCDRCRDPTEFGAMFNAIRC-----SEIDCSGYVLPKTFLGDH 336

Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
             D+ C +C+  +    +      +G  + ++ + + +V   +E L + + + HP+H++ 
Sbjct: 337 REDYVCTNCTTVVPLEIIEETLEDIGIHLSEMRKNDIDV--CKEFLNRYKNVLHPNHFYN 394

Query: 410 YAVKHSLIQLYGTQPG 425
             V  +L QL G Q G
Sbjct: 395 IDVTIALAQLIGQQSG 410


>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 514

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/411 (32%), Positives = 198/411 (48%), Gaps = 40/411 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  +  +GR++ A  DL PG+ I  E P +VGPK A   P+CL C+ P  P+L +   
Sbjct: 20  YKILKNDKVGRFMVANKDLQPGEEIVTEMPFVVGPK-AFTYPLCLSCYAPWPPSLTEKPL 78

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
           C RC WP CS  C        +  EC I      K   +  L   +     + E I PLR
Sbjct: 79  CSRCGWPVCSTECENQPQHKDY--ECQIFVQANEKFNVQAALEEANENGVPQLECITPLR 136

Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
            L+   ++P+++  E+K+MEAH   R        +  ++V+YLR+          LD  S
Sbjct: 137 LLLESEKNPERWNNEVKNMEAHNKIRSQKAHWKSDQVNVVEYLRNQL-------KLDRFS 189

Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
            + +   CG++E+N  ++    G   + LY  + LM H+C+ N  HS          R T
Sbjct: 190 EELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTDYRIRLRTT 249

Query: 282 -------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
                   +   YT++L+ T  RREHL   K+F C+C RCSDPTELGT+ S++KC     
Sbjct: 250 VHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC--NKC 307

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN-----PNVK 389
           D G   +L ++ LD DS W C  C    +   V  V   +  +VD + Q +       ++
Sbjct: 308 DNG--IVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQISGADGADAIQ 365

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             E ++ K  ++ HP H     ++HSL Q+YG    Y    L   +LE KI
Sbjct: 366 ERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 416



 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +V+ P  KA   P+CL C+ P  P+L +   C RC WP CS  C
Sbjct: 47  EMPFVVGP--KAFTYPLCLSCYAPWPPSLTEKPLCSRCGWPVCSTEC 91


>gi|195479534|ref|XP_002100923.1| GE15901 [Drosophila yakuba]
 gi|194188447|gb|EDX02031.1| GE15901 [Drosophila yakuba]
          Length = 532

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/444 (30%), Positives = 215/444 (48%), Gaps = 49/444 (11%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA 111
           K+ PY I+ S + GRYL A+  L  G+ + RE PL +GP ++  +P+CLGC++P++    
Sbjct: 45  KEQPYRIEHSDIYGRYLVASRQLEAGETLIREEPLAIGPCVS-GDPVCLGCYQPVSLK-P 102

Query: 112 DNARCPRCFWPACSARCSGLSDAHTHA-PECAIL----KLGCETLL-----AYNDYKYEA 161
           D  RCP C WP C   C+GL   H H   EC +      +  E L      A     YE 
Sbjct: 103 DQYRCPGCAWPLCGITCTGLKHRHGHTETECQLFGERRAVAGELLTERAGPAEVRDLYEL 162

Query: 162 ILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNFLEKLPG 218
           ++ +R L+L++  P+++  +  ME+H  +R     +   YEE   +V+ LR  +      
Sbjct: 163 VMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRVTW------ 214

Query: 219 DVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
             L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+C PN  H++  +S  
Sbjct: 215 -QLEDLEAEQVHDVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPRSFE 273

Query: 276 FVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
            + R +        ++  Y   L GT  RR  +   K F C C RCSDP ELGT  SA+ 
Sbjct: 274 ILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALV 333

Query: 329 CLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV 388
           C           +  V+PL+   DW C  CS ++ A +V     ++   ++ +  +  ++
Sbjct: 334 CATCRTGS----VRAVDPLEQTGDWACDRCSHKMGATEVERQLDRINNDLEDI--DVHDI 387

Query: 389 KSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY------ 442
             LE  L +   +  P+HY   + K+SL Q+YG   GY   Q+S   + RK SY      
Sbjct: 388 PGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPEDIARKESYCREFLE 447

Query: 443 ---VMSPNLKATDEPICLGCHKPL 463
              V+ P L      I    H P+
Sbjct: 448 IVDVLDPGLTRLRGLIMYELHAPV 471



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 6/93 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y ++HS   +YG     +    +   L R+    + P +  + +P+CLGC++P++    D
Sbjct: 49  YRIEHS--DIYGRYLVASRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-PD 103

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAIL 501
             RCP C WP C   C+GL   H H   EC + 
Sbjct: 104 QYRCPGCAWPLCGITCTGLKHRHGHTETECQLF 136


>gi|156544373|ref|XP_001607398.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 507

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 146/461 (31%), Positives = 224/461 (48%), Gaps = 60/461 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I+ +  +GR+L A+ DL PG++I  E P +VGPK A   P+CL C+  + P   D+++
Sbjct: 12  YKIQQNDQVGRFLVASKDLEPGEQILTELPFVVGPKAA-TYPVCLSCYS-VWPATEDDSK 69

Query: 116 --CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETL---LAYNDY------KYEAILP 164
             C RC WP C   C   ++      EC I +   E     +A ++       + E I P
Sbjct: 70  PLCSRCSWPVCGPECE--NNPQHKDYECPIFEAAKEKFSIDVALSEEHQNGVPQLECITP 127

Query: 165 LRCLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
           LR L+   + P++++ E+KDMEAH  KR    + + +  +IV+Y+R           LD 
Sbjct: 128 LRLLLAAEKDPERWKSEIKDMEAHNKKRAQKNQWHIDHVNIVEYIRKRL-------KLDR 180

Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQ 272
            S + +   CG+++VN  +I    G   +GLY  + +M HNC+ N  HS         ++
Sbjct: 181 FSEEDIQTACGILDVNSHEIRTAKGFLARGLYPKVAIMNHNCVSNTAHSIDPNDYRIFLR 240

Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
           S + V     +   YT+AL  T  RREHL  +K+F C+C RCSDPTELGT+ S++KC   
Sbjct: 241 SAVKVPAAGELFGSYTHALLPTLLRREHLLESKHFACACNRCSDPTELGTHMSSLKC--S 298

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL----VQENPN- 387
             D G   +LP++ LD +S W C  C    N   V  V   +  ++D++      + P+ 
Sbjct: 299 KCDNG--VVLPLDSLDAESQWKCTHCEFSTNGLAVAKVQKIIQAEMDQVEMYSAADGPDA 356

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY----- 442
           +++ E  + K  ++ HP H      + SL QLYG    Y    L   +LE KI       
Sbjct: 357 IQNRETFMKKYHSVLHPRHALLTIPRFSLSQLYGRVEEYFLDDLPDIVLEHKIDMCRLVL 416

Query: 443 ----VMSPNLKATDEPICLGCHKPL--------NPNLADNA 471
               V+ P    +   I    H PL        N  + DNA
Sbjct: 417 QVLDVVEPGKTRSRGMILYELHAPLLFIAKGQWNAGVIDNA 457


>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
 gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
          Length = 485

 Score =  197 bits (502), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 144/453 (31%), Positives = 232/453 (51%), Gaps = 59/453 (13%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
           RCAVC  +   +CS C++V+YCG+ HQ++HWK  H+ +C+   ++I S+ +LGR+L AT 
Sbjct: 7   RCAVCDASGGKQCSGCQQVSYCGRDHQRQHWKAGHREQCR--CFKISSNAILGRHLTATR 64

Query: 73  DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNL-----ADNARCPRCFWPACSAR 127
            +  G+ I  +SPL++GPK+A + P+CLGCH+ L P +      D   C  C WP C   
Sbjct: 65  PIRRGELIFNDSPLLLGPKIA-SIPVCLGCHRNLPPLMNSHLQVDFYHCRHCSWPLCGPS 123

Query: 128 CSGLSDAHTHAPECAIL-----KLGCETLLAYND-----YKYEAILPLRCLILQRRSPKK 177
           C     A +H  EC +L      +G      ++D       Y A++PLR L+L+R +P++
Sbjct: 124 CE---TATSHKDECPLLARSSKSIGSRLQFCFSDPTRKESVYCAVVPLRALLLKRSNPER 180

Query: 178 YQE-LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
           +++    +E+H+  R   T +Y  + S +V ++R   +  + G+V    + + L  I  +
Sbjct: 181 FRDGFLTLESHLETR-IKTPLYAALRSNLVPFVRD--ILGMGGEV----TERELLEIAAI 233

Query: 236 IEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHIS 284
           ++ N  D+         + LY +  +M H+C PN KH    + +L V       +D  IS
Sbjct: 234 LDTNCYDVRLPELDVKVRALYELGAMMSHHCRPNTKHYFDERLRLVVVATVDIPKDAVIS 293

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L  T  RR  +  +K F C C+RC DPTELGTY  ++ C +  K +    ++  
Sbjct: 294 ISYTQPLLSTIQRRYAIQQSKCFECCCDRCRDPTELGTYLGSIVCPSCRKAK----VVAF 349

Query: 345 NPLDNDSDWTCGS--CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT--KLEA 400
           +PLD  SDW C S  CS R +A D          ++  + +        E  L   +LE+
Sbjct: 350 DPLDLRSDWQCESRKCSFRESAVDAVRRNELRQAEIMAVARTVSPRDQYERFLVTHQLES 409

Query: 401 ------MFHPHHYHCYAVKHSLIQLY---GTQP 424
                 + HP + +   VK++L QLY    T+P
Sbjct: 410 SKEELGLLHPWNSNVLQVKYALTQLYRGAATEP 442


>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 510

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 131/409 (32%), Positives = 202/409 (49%), Gaps = 39/409 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++  +  +GRY+ A+ +L  G+ I  E+P IVGPK A   P+CL C+ P  P+  D   
Sbjct: 19  YKVLQNDQVGRYMIASRELQAGEEITSETPFIVGPK-ACTYPLCLSCYTPWPPSPDDKPL 77

Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAIL-----KLGCETLLAYNDY-KYEAILPLRCL 168
           C +C W  C+  C    +A  H   EC I      K   + +L  +   + E I PLR L
Sbjct: 78  CSKCGWSVCNQECE---NAPQHKDYECPIFVEANEKFNVDVVLDGDGVPQLECITPLRLL 134

Query: 169 ILQRRSPKKY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           +   R+ +++ +E+KDMEAH   R    +   +  +IV YLR           LD  S +
Sbjct: 135 LESERNVERWNKEIKDMEAHNKIRSEKIQWKSDHVNIVDYLRKRL-------KLDRFSEE 187

Query: 228 CLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLF 276
            +  ICG++++N  ++    G   +GLY  + +M H+C+ N  HS         +++ L 
Sbjct: 188 FIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLK 247

Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           V     +   YT++L  T  RREHL   KYF C+C RCSDPTELGT+ S++KC     D 
Sbjct: 248 VPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRCSDPTELGTHMSSLKC--NKCDN 305

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSL 391
           G   IL ++ LD+ S W C  C    + + V  +   +  +VD +           +   
Sbjct: 306 G--VILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKIIQAEVDTVEAISGADGADAILER 363

Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           E ++ K  ++ HPHH     ++HSL Q+YG    Y    L   +LE K+
Sbjct: 364 ENVIKKYRSVLHPHHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHKV 412


>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
 gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
          Length = 514

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 210/419 (50%), Gaps = 36/419 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC E   +KCS C +V+YC  QHQK+ WK HKP C   P++I  +  LGR+L AT  +
Sbjct: 4   CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKAHKPSCH--PFKIAHNEQLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+CLGC   + P+  D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEPD--DHIECEQCGWPLCGPECKSLDE- 117

Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
             H  EC + K   +  +   ++      Y  +  +RCL++   SP+K  + +D+E+  S
Sbjct: 118 --HKAECRLTK-DRDQRVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQDLESLES 174

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
            R    +   ++ SI +++   F  +      ++   K +    G +++NG ++      
Sbjct: 175 TRRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEQIMKAV----GALQINGHEVPTTDPS 227

Query: 249 --GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
              ++      E++CLPN AK  N     ++ +   + ++ H+S  Y++A+WGT  R+ H
Sbjct: 228 HVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRH 287

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
           L  TK F C+CERC D TEL T +SA+KC +    Q    +LP    D +  W C  C  
Sbjct: 288 LMQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADDWNGSWRCRECHK 344

Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           ++    V  +  + G+ +  +     N ++  + L   E    P H+H   +K  L+QL
Sbjct: 345 QVQKHYVERILERAGKDIQSM---EKNAENGLKYLKHYEKWLPPQHFHMSEIKILLVQL 400



 Score = 38.9 bits (89), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 5/49 (10%)

Query: 454 PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
           P+CLGC   + P+  D+  C +C WP C   C  L +   H  EC + K
Sbjct: 83  PVCLGCLNGIEPD--DHIECEQCGWPLCGPECKSLDE---HKAECRLTK 126


>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
 gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
          Length = 522

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 134/456 (29%), Positives = 217/456 (47%), Gaps = 32/456 (7%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC++ A  +CS C  V YC  +HQ++ WK+HK  C+  P++I S+   GR+L AT D+
Sbjct: 2   CPVCKKEASKRCSRCAMVYYCCVEHQQQDWKVHKTTCQ--PFKIFSNEQYGRFLVATRDI 59

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
             G+ + +ESPL+ GP   +  P+C+GC + L         C RC WP C   C    D+
Sbjct: 60  KAGEIVLKESPLVHGPA-QITGPVCVGCLQGLEEK--KYLDCERCGWPVCKRSCQ---DS 113

Query: 135 HTHAPECAILKLGCETLLAYNDY----KYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
            +H  EC         +   + Y     Y+ ++P+RCL+L    P +++ L  +E+H  +
Sbjct: 114 PSHQAECKFTIARGSKISIQHFYVPHPTYQCLMPVRCLLLAESDPARWETLLKLESHEEE 173

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRY---T 247
           R    +   + + + K +   F  K      +D   +    + G+I+VNG ++      +
Sbjct: 174 RRGSEQWRNDREGVAKLIPRFF--KCENKWDEDEILR----VVGIIQVNGHEVPMTEPSS 227

Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHL 301
             +Y++  ++EH+C PN   S   + ++ ++      R   +S  YT+ LW T  R EHL
Sbjct: 228 VAIYNMASMLEHSCRPNLAKSFTNRGEVVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHL 287

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSCSA 360
             TK F C CERCSD TE  TYFSA++C    KD     ++LPV+      +W C  CS 
Sbjct: 288 QQTKMFRCECERCSDRTEYETYFSAVRCSGFQKDSKCKGYLLPVDNAQWTGEWQCLRCSK 347

Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
            ++   V  +  +   ++D    E          +         +H +   VK SL Q+ 
Sbjct: 348 EVSGATVCQILERA--RMDLEAMEKHREDHCNRYMKHYSKWLTANHQYMVDVKISLSQVI 405

Query: 421 GTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC 456
           G +   A  ++    L  KI   +  +L A  E IC
Sbjct: 406 GGKNPEAIKKIPYEKLMNKIK--ICQDLIALFEKIC 439


>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
 gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
          Length = 515

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 126/421 (29%), Positives = 208/421 (49%), Gaps = 40/421 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC +    KCS C +V+YC  QHQK+ WK HK  C   P++I  + LLGR+L AT  +
Sbjct: 4   CNVCEQPTKSKCSNCNQVSYCSVQHQKQDWKAHKSSCH--PFKIAHNDLLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + I +E+PL+ GP   +  P+C+GC   + P   D+  C  C WP C   C  L + 
Sbjct: 62  KPYEIILKEAPLVRGPA-QITPPVCMGCLNSIEPE--DHIDCDLCGWPLCGPECKSLGE- 117

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPL-------RCLILQRRSPKKYQELKDMEAH 187
             H  EC + +   E     N +++    PL       RCL++   SP+K  + +++E+ 
Sbjct: 118 --HRAECQLTQ---ERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESL 172

Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
            S R    +   ++ SI +++   F         +  S + +    G +++NG ++    
Sbjct: 173 ESTRRGSNQWKADLASIGQFIPKFF-------KTEKFSEEEVMRAVGALQINGHEVPTSD 225

Query: 248 Q---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRR 298
                ++      EH+C+PN AK  N     ++ +   + +++H+S  Y++A+WGT  R+
Sbjct: 226 PPHVAVFYTASFTEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQ 285

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
            HL  TK F C+C+RC+D TELGT +SA+KC +    + D  +LP    D +  W C  C
Sbjct: 286 RHLVQTKLFKCACDRCTDVTELGTNYSAIKCEDR---RCDGLMLPSKTDDWNGSWKCREC 342

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
             ++    +  +  + G+ +  + +   N     + L   E    P HYH   VK  L+Q
Sbjct: 343 QKQVQKHYIDQILERAGKDLQSMEKTAENGL---KYLKHYEKWLPPRHYHMSEVKILLVQ 399

Query: 419 L 419
           L
Sbjct: 400 L 400


>gi|195060014|ref|XP_001995739.1| GH17917 [Drosophila grimshawi]
 gi|193896525|gb|EDV95391.1| GH17917 [Drosophila grimshawi]
          Length = 545

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 127/438 (28%), Positives = 215/438 (49%), Gaps = 44/438 (10%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y +  S + GRYL A   L  G+ +  E PL +GP ++  EP+CLGC++P+  + +   R
Sbjct: 61  YRVAHSDVFGRYLVANRRLEAGELLISEEPLAIGPCVS-CEPVCLGCYQPVQLD-SQQYR 118

Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK---------LGCETLLAYNDYKYEAILPL 165
           CP C WP C   C G+     H A ECA+           L     L      YE +L +
Sbjct: 119 CPDCNWPLCGPTCRGIRQRTGHSAEECALYAERRALLADVLTANATLEQRRDLYELVLIV 178

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDT 224
           R ++L++ S ++Y  ++ ME+H ++R     +++  +  +V+ LR+++      D L+  
Sbjct: 179 RIVLLKQHSAEQYANIRRMESHTAERRENATLWQHYEQKVVQRLRNDWQLANVCDELE-- 236

Query: 225 SAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
               LH ICG+++VN  +IG+     + LY    L+ H+C PN  H++       ++++ 
Sbjct: 237 ----LHEICGILDVNCFEIGQRGAKARTLYPSAFLLAHDCTPNTAHTDDPATFAILLRTS 292

Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
             V     ++  Y   L GT  RR  +   K F C C+RC+DP ELG+  SA+ C +   
Sbjct: 293 RRVLEQEPLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSALVCRSCRL 352

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEM 394
                 I    PL+  +DW C  C  RL+A D+  +  ++ + ++++  +  ++ +LE  
Sbjct: 353 GS----IRATAPLEQTADWACDRCDCRLSAADLERLIDRINDDLEEI--DVHDIPALENF 406

Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------VMS 445
           LT+   +  P+HY   + K+SL Q+YG   GY   ++S   +ERK  Y         ++ 
Sbjct: 407 LTRYREVLRPNHYLLLSAKYSLCQIYGRIEGYLLPEMSPQDIERKERYCRDFLEIVDLLD 466

Query: 446 PNLKATDEPICLGCHKPL 463
           P L      I    H P+
Sbjct: 467 PGLTRLRGLIMYELHAPI 484



 Score = 46.6 bits (109), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)

Query: 453 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-APECAI 500
           EP+CLGC++P+  + +   RCP C WP C   C G+     H A ECA+
Sbjct: 100 EPVCLGCYQPVQLD-SQQYRCPDCNWPLCGPTCRGIRQRTGHSAEECAL 147


>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
 gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
          Length = 515

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 126/421 (29%), Positives = 208/421 (49%), Gaps = 40/421 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC +    KCS C +V+YC  QHQK+ WK HK  C   P++I  + LLGR+L AT  +
Sbjct: 4   CNVCDQPTKSKCSNCNQVSYCSVQHQKQDWKAHKSSCH--PFKIAHNDLLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + I +E+PL+ GP   +  P+C+GC   + P   D+  C  C WP C   C  L + 
Sbjct: 62  KPYEIILKEAPLVRGPA-QITPPVCMGCLNSIEPE--DHIDCDLCGWPLCGPECKSLGE- 117

Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPL-------RCLILQRRSPKKYQELKDMEAH 187
             H  EC + +   E     N +++    PL       RCL++   SP+K  + +++E+ 
Sbjct: 118 --HRAECQLTQ---ERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESL 172

Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
            S R    +   ++ SI +++   F         +  S + +    G +++NG ++    
Sbjct: 173 ESTRRGSNQWKADLASIGQFIPKFF-------KTEKFSEEEVMRAVGALQINGHEVPTSD 225

Query: 248 Q---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRR 298
                ++      EH+C+PN AK  N     ++ +   + +++H+S  Y++A+WGT  R+
Sbjct: 226 PPHVAVFYTASFTEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQ 285

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
            HL  TK F C+C+RC+D TELGT +SA+KC +    + D  +LP    D +  W C  C
Sbjct: 286 RHLVQTKLFKCACDRCTDVTELGTNYSAIKCEDR---RCDGLMLPSKTDDWNGSWKCREC 342

Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
             ++    +  +  + G+ +  + +   N     + L   E    P HYH   VK  L+Q
Sbjct: 343 QKQVQKHYIDQILERAGKDLQSMEKTAENGL---KYLKHYEKWLPPRHYHMSEVKILLVQ 399

Query: 419 L 419
           L
Sbjct: 400 L 400


>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
 gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
          Length = 515

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/420 (29%), Positives = 212/420 (50%), Gaps = 38/420 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC E   + CS C +V+YC  QHQK+ WK HKP C   PY+I  + +LGR+L AT ++
Sbjct: 4   CQVCAEPTKNNCSNCNQVSYCSVQHQKQDWKSHKPNCH--PYKIAHNEVLGRHLVATRNI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + I +E+PL+ GP   ++ P+C+GC   + P   D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIILKEAPLVRGPA-QISAPVCMGCLNSIEP--TDHIECEQCGWPLCGPECKSLDE- 117

Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
             H  EC + K   + +    +++     Y  +  +RCL+L   +    ++  D+E+  S
Sbjct: 118 --HRAECKLTKDRGQKV-NVQEFQGPHPLYTCVSTVRCLLLGETNAANAKKFSDLESLES 174

Query: 190 KRGPGTEVYEEIDSIVKYLRSNF-LEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ 248
            R   ++   ++ SI +++   F  +K   + +  T         G +++NG ++     
Sbjct: 175 TRRGSSQWKADLTSIGQFIPKFFKTQKFSEEEIMRT--------VGALQINGHEVPTSDP 226

Query: 249 ---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
               ++ V    E++CLPN AK  N     ++ +   + +++H+S  Y++A+WGT  R+ 
Sbjct: 227 PHVAVFYVASFTENSCLPNLAKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQR 286

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
           HL  TK F C+C RC DPTELGT +SA+KC  + K  G   +LP    D    W C  C 
Sbjct: 287 HLMQTKLFKCACLRCQDPTELGTNYSAIKC-EDRKCSG--LMLPTKADDWHGGWQCKECG 343

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
            ++  + V  +  + G+ +  +      +++  + L   E      H+H   +K  L+Q+
Sbjct: 344 KQVPKQYVEGILERAGQDIQSM---EKTIENGFKYLKHYERWLPSQHFHMSEIKILLVQI 400


>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 514

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/411 (31%), Positives = 198/411 (48%), Gaps = 40/411 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++  S  +GRY+ A+ +L  G+ I  E P +VGPK A   P+CL C+ P  P+  D   
Sbjct: 20  YKVLRSDQVGRYMVASRELQAGEEIMTEMPFVVGPK-ACTYPLCLACYTPWPPSPDDKPL 78

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
           C +C WP C   C        +  EC +      K      L  N+     + E I PLR
Sbjct: 79  CSKCGWPVCGEECENSLQHKDY--ECQVFVQANEKFNVTAALDENNENGVPQLECITPLR 136

Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
            L+   +S +++  E++DMEAH   R   ++   +  +IV+YLR           LD  S
Sbjct: 137 LLLESEKSVERWNSEVRDMEAHTKIRCQKSQWKSDHVNIVEYLRKRL-------KLDRFS 189

Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
            +C+   CG++E+N  +I    G   + LY  + +M H+C+ N  H+         +++ 
Sbjct: 190 EECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDYRIRLRTT 249

Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
           L +     +   YT++L  T  RREHL   K+F C+C RCSDPTEL T+ S++KC     
Sbjct: 250 LKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRCSDPTELATHMSSLKC--NKC 307

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
           D G   +LP++ LD +S W C  C   +N + V  V   +  +VD +           + 
Sbjct: 308 DNG--IVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAISGADGADAIH 365

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             E +L K  ++ HP H     ++HSL Q+YG    Y    L   +LE K+
Sbjct: 366 ERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHKV 416



 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +V+ P  KA   P+CL C+ P  P+  D   C +C WP C   C
Sbjct: 47  EMPFVVGP--KACTYPLCLACYTPWPPSPDDKPLCSKCGWPVCGEEC 91


>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
 gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
          Length = 517

 Score =  191 bits (486), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 208/419 (49%), Gaps = 36/419 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC+E    KCS C +V+YC  QHQK+ WK HK  C   P++I  S  LGR+L AT  +
Sbjct: 6   CHVCQEPTKTKCSNCNQVSYCSVQHQKQDWKAHKTNCH--PFKIAHSEQLGRHLVATRTI 63

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + I RE+PL+ GP   ++ P+C+GC   +     D+  C +C WP C   C  L + 
Sbjct: 64  KPYEIILREAPLVRGPA-QISAPVCMGCLNSIEAE--DHIDCEQCGWPLCGPECKSLEE- 119

Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
             H  EC + K   + +   N++      Y  +  +RCL++   SP+K  + +++E+  S
Sbjct: 120 --HQAECKLTKDRGQKV-NVNEFNGPHPLYTCVSTVRCLLIGETSPEKAAKFQELESLES 176

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
            R    +   ++ SI +++   F           T  + +  + G +++NG +I      
Sbjct: 177 TRRGSNQWKADLASIGQFIPKFF------KTQKFTEEQIMRAV-GALQINGHEIPTSDPP 229

Query: 249 --GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
             G++      E++CLPN AK  N     ++ +   + ++ H+S  Y++A+WGT  R+ H
Sbjct: 230 HVGVFYTASFTENSCLPNLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRH 289

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
           L  TK F C+C RC+D TEL T++SA+KC +    Q    +LP    + +  W C  C  
Sbjct: 290 LMQTKLFKCACARCADVTELDTFYSALKCEDR---QCGGLMLPSKSDEWNGSWRCRECHK 346

Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           ++    V  +  + G+ +  + +   N     + L   E    P H+H   +K  L+QL
Sbjct: 347 QVQKHYVEGILERAGKDIQSMEKTAENGL---KYLKHYEKWLPPQHFHMSEIKILLVQL 402


>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
 gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
          Length = 514

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 122/418 (29%), Positives = 208/418 (49%), Gaps = 34/418 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC +   +KCS C +V+YC  QHQK+ WK HKP C   P++I  +  LGR+L AT  +
Sbjct: 4   CHVCEKPTKNKCSNCNQVSYCSVQHQKQDWKTHKPSCH--PFKIAHNEQLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+CLGC   +     D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117

Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC++ K   + +           Y  +  +RCL++   SP+K  + +++E+  S 
Sbjct: 118 --HKAECSLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLEST 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
           R    +   ++ SI +++   F  +         S + +    G +++NG ++       
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ-------KFSEEEIMKAVGALQINGHEVPTTDPSH 228

Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
             ++      E++CLPN AK  N     ++ +   + ++ H+S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F C+CERC D TEL T++SA+KC +    Q    +LP    D + +W C  C  +
Sbjct: 289 MQTKLFRCACERCVDVTELDTFYSAIKCEDR---QCGGLMLPTKTDDWNGNWRCRECHKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           +    V  +  + G+ +  +      V++  + L   E    P H+H   +K  L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSM---EKTVENGLKYLKHYEKWLPPQHFHISEIKILLVQL 400


>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
          Length = 513

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 130/411 (31%), Positives = 195/411 (47%), Gaps = 40/411 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  +  +GRY+ A+ +L  G+ I  E P +VGPK A   P+CL C+    P+L+    
Sbjct: 19  YKILQNDKVGRYMVASKELEAGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPL 77

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
           C +C WP CS  C        +  EC +      K   +  L   +     + E I PLR
Sbjct: 78  CSKCGWPVCSEECENQPQHKDY--ECQVFVQANEKFNVQAALEEANENGVPQLECITPLR 135

Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
            L+   R+P+++  E+K+MEAH   R           +IV Y+R           LD  S
Sbjct: 136 LLLESERNPERWNDEIKNMEAHNKIRSQKKHWQSNQINIVDYIRKQL-------KLDRFS 188

Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-------NMMQSK 274
            + +H  CG++E+N  +I    G   + LY  + LM H+C+ N  HS         +++ 
Sbjct: 189 EEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRLRTT 248

Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
           L +     +   YT++L  T  RREHL   KYF C+C RCSDPTELGT+ S++KC     
Sbjct: 249 LKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRCSDPTELGTHVSSLKC--NKC 306

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
           D G   +L ++ LD DS W C  C    +   V  V   +  +VD +           ++
Sbjct: 307 DNG--IVLSLDSLDPDSSWKCTHCEFTTSGSAVRKVLHIIQAEVDAVETISGADGADAIQ 364

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             E ++ K  ++ HP H     ++HSL Q+YG    Y    L   +LE KI
Sbjct: 365 ERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 415



 Score = 38.9 bits (89), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +V+ P  KA   P+CL C+    P+L+    C +C WP CS  C
Sbjct: 46  EMPFVVGP--KAFTYPLCLSCYATWPPSLSAKPLCSKCGWPVCSEEC 90


>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 513

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 130/411 (31%), Positives = 195/411 (47%), Gaps = 40/411 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  +  +GRY+ A+ +L  G+ I  E P +VGPK A   P+CL C+    P+L+    
Sbjct: 19  YKILQNDKVGRYMVASKELEAGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPL 77

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
           C +C WP CS  C        +  EC +      K   +  L   +     + E I PLR
Sbjct: 78  CSKCGWPVCSEECENQPQHKDY--ECQVFVQAKEKFNVQAALEEANENGVPQLECITPLR 135

Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
            L+   R+P+++  E+KDMEAH   R           +IV Y+R           LD  S
Sbjct: 136 LLLESERNPERWNNEIKDMEAHNKIRSQKKHWQSNQINIVDYIRKQL-------KLDRFS 188

Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-------NMMQSK 274
            + +H  CG++E+N  +I    G   + LY  + LM H+C+ N  HS         +++ 
Sbjct: 189 EEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRLRTT 248

Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
           L +     +   YT++L  T  RREHL   KYF C+C RCSDPTELGT+ S++KC     
Sbjct: 249 LKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRCSDPTELGTHVSSLKC--NKC 306

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
           D G   +L ++ LD +S W C  C    +   V  V   +  +VD +           ++
Sbjct: 307 DNG--IVLSLDSLDPESSWKCTHCEFTTSGSAVRKVLHIIQAEVDAVETISGADGADAIQ 364

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             E ++ K  ++ HP H     ++HSL Q+YG    Y    L   +LE KI
Sbjct: 365 ERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 415



 Score = 38.9 bits (89), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +V+ P  KA   P+CL C+    P+L+    C +C WP CS  C
Sbjct: 46  EMPFVVGP--KAFTYPLCLSCYATWPPSLSAKPLCSKCGWPVCSEEC 90


>gi|170029518|ref|XP_001842639.1| msta [Culex quinquefasciatus]
 gi|167863223|gb|EDS26606.1| msta [Culex quinquefasciatus]
          Length = 495

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 202/403 (50%), Gaps = 30/403 (7%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++   P +GRY  A+ DL  G+    ++P  VGP +  A P+CL C  P++   A  AR
Sbjct: 6   YKVLECPEMGRYGVASRDLRAGEIAYEDTPFAVGPSVGSA-PLCLECACPVD-GCAGGAR 63

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
           CPRC WP C    + +  +  H  EC +            D      + + I PLR L+ 
Sbjct: 64  CPRCGWPLCEKCGAEIEASVYHKAECELFAKHGSKFQNVEDSSEACVQLDCITPLRVLLA 123

Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +  +P  +  E+  ME H ++R        +  ++V +LR +   K      D  S + +
Sbjct: 124 KEANPDWWNAEIVMMEDHRAERDGNAFWKADQSNVVAFLRDSCGLK------DRCSEELI 177

Query: 230 HWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSN--------MMQSKLFVFRD 280
             + G+++VN  +    + +GLY  + +M H+C+ N  H+         + ++ + +   
Sbjct: 178 QKVIGILDVNAFEAHTCSLRGLYPKMGIMAHSCVTNLAHTVHPSKGYRLIARAAVDIEEG 237

Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
             + T YT+ L GT+ R+  L  TKYF C C+RC DPTELGT+FS+MKC  +  D G   
Sbjct: 238 AMLCTTYTHLLAGTRTRQAELQRTKYFTCQCKRCLDPTELGTHFSSMKC--QKCDNG--L 293

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKSLEEMLTK 397
           +    P D +++W C  C  +L    +      +  ++D+L  ++  P  ++S E +  K
Sbjct: 294 VESTKPTDEEAEWKCTHCEHKLKGVLIAKAIQVMQAEIDELAYMEYGPERLESFERVFKK 353

Query: 398 LEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             ++ HP H+   +++HSLI+LYG  PGY   +L   LLERKI
Sbjct: 354 FRSVLHPLHFINTSIRHSLIELYGRIPGYMMQELPDILLERKI 396



 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           + + P++ +   P+CL C  P++   A  ARCPRC WP C    + +  +  H  EC + 
Sbjct: 36  FAVGPSVGSA--PLCLECACPVD-GCAGGARCPRCGWPLCEKCGAEIEASVYHKAECELF 92


>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
          Length = 501

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 127/413 (30%), Positives = 208/413 (50%), Gaps = 49/413 (11%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           ++++ S  +GRY  A+ DL PGD I  E+P   GPK + +  +CLGCH P++        
Sbjct: 8   FKVEKSETVGRYAVASEDLKPGDVIFSETPFAYGPK-SDSSCLCLGCHSPVDCAYL---- 62

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDY--KYEAILPLRCLIL 170
           C  C WP C   C  L   H    EC +    K+  +T+    D   +YE I PLR L+ 
Sbjct: 63  CSTCSWPVCGPECEQLK-VHKEN-ECQVFSNAKVKFQTVEDPTDVCLQYECITPLRMLLE 120

Query: 171 QRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKC 228
           + R PK+++E +  MEAH  +R            I ++ + N +E + G   +D  S + 
Sbjct: 121 KERDPKRWEEEISIMEAHNEERKS--------RPIWEFNQHNIVEYIRGPCKMDKYSEEL 172

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICL------MEHNCLPNAKHSN------------- 269
           +H ICG++E+N  +    T   Y++ CL      + HNC+ N  H+              
Sbjct: 173 IHLICGILEINAFEA--RTPSCYAIRCLFPKLAILSHNCVSNIHHAVDCNGDGELKDCVV 230

Query: 270 MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +++ + + +   + + YT +LW T  RRE L  +K+F+C+C RCS PTEL T+ S++KC
Sbjct: 231 TVRAAVDIPKGGELFSSYTFSLWPTLVRREFLRESKFFDCTCPRCSSPTELDTHMSSLKC 290

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN 387
             +  D G   I+  N  D+  +W C  C  + N R V  V + +  ++D++  +     
Sbjct: 291 --QRCDNG--VIISTNTADDACEWKCTHCDYKTNGRAVRKVFAAIQNEIDQVEYIGGPQG 346

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++  E +  K  ++ HP++ +   ++ +LIQLYG   GY    L   +LERKI
Sbjct: 347 IQQRETIFRKYRSVLHPNNSYMTTLRSALIQLYGKTEGYTLEDLPDVILERKI 399


>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
          Length = 513

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 129/413 (31%), Positives = 195/413 (47%), Gaps = 40/413 (9%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           + Y+I  +  +GRY+ A  +L  G+ I  E P IVGPK A   P+CL C+    P+L + 
Sbjct: 17  VTYKILQNDKVGRYMVANKELEAGEEIITEMPFIVGPKAA-TYPLCLSCYVSWPPSLNEK 75

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILP 164
             C +C WP CS  C        +  EC I      K   +  L   +     + E I P
Sbjct: 76  PLCSKCGWPVCSTECENQPQHKDY--ECQIFAQANEKFNVQAALEETNENGVPQLECITP 133

Query: 165 LRCLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
           LR L+   ++P++++ E+KDMEAH   R    +   +  +IV+Y+R           LD 
Sbjct: 134 LRLLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKLDQVNIVEYIRKQL-------KLDR 186

Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
            S K +  +CG++E+N  ++    G   + +Y  + LM H+C+ N  HS          R
Sbjct: 187 FSEKEIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRLR 246

Query: 280 DT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
            T        +   YT++L  T  RREHL   K+F C+C RCSDPTELGT+ S++KC   
Sbjct: 247 TTLKIPPGGELYGSYTHSLLPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC--N 304

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPN 387
             D G   +L ++ LD +S W C  C    +   V  V   +  +VD +           
Sbjct: 305 KCDNG--IVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQIIHMEVDAVEAISGADGADA 362

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++  E +  K  ++ HP H     ++HSL Q+YG    Y    L   +LE KI
Sbjct: 363 IQERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKI 415



 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +++ P  KA   P+CL C+    P+L +   C +C WP CS  C
Sbjct: 46  EMPFIVGP--KAATYPLCLSCYVSWPPSLNEKPLCSKCGWPVCSTEC 90


>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
 gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
          Length = 515

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 123/418 (29%), Positives = 207/418 (49%), Gaps = 34/418 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC E   +KCS C +V+YC  QHQK+ WK HKP C   P++I  +  LGR+L AT  +
Sbjct: 4   CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKRHKPSCH--PFKIDHNEQLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+CLGC   +     D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117

Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC + K   + +           Y  +  +RCL++   SP+K  + +++E+  S 
Sbjct: 118 --HKAECRLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLEST 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ-- 248
           R    +   ++ SI +++   F  +      ++   K +    G +++NG ++       
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEEIMKAV----GALQINGHEVPTTDPSH 228

Query: 249 -GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
             ++      E++CLPN AK  N     ++ +   + ++ H+S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F C+C+RC D TEL T +SA+KC +    Q    +LP    D +  W C  C  +
Sbjct: 289 MQTKLFKCACDRCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADDWNGSWRCRECHKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           +    V  +  + G+ +  +     NV++  + L   E    P H+H   +K  L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSM---EKNVENGLKYLKHYEKWLPPQHFHMSEIKILLVQL 400


>gi|195399143|ref|XP_002058180.1| GJ15632 [Drosophila virilis]
 gi|194150604|gb|EDW66288.1| GJ15632 [Drosophila virilis]
          Length = 527

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 128/440 (29%), Positives = 208/440 (47%), Gaps = 48/440 (10%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           + +  S L GRYL A   L  G+ +  E PL +GP ++  EP+CLGC++P+    A   R
Sbjct: 43  FRVAHSELFGRYLVANRQLAAGELLITEQPLAIGPCVS-CEPVCLGCYRPVQLT-AQQYR 100

Query: 116 CPRCFWPACSARCSGLSD---AHTHAPECAILK----LGCETLLAYNDYK-----YEAIL 163
           C  C WP C A C G+      HT   EC I      L  E L  +         YE +L
Sbjct: 101 CAGCGWPLCGASCRGMQQRRLGHTE-EECEIYGQRRGLAAELLTTHATPDQVRDLYELVL 159

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
            +R  +L+  +P  Y +++ ME+H + R     +++  +  +V+ LR ++        L 
Sbjct: 160 IVRIALLRHLTPDLYAQIRRMESHTAARRENATLWQHYEQKVVRRLRDDW-------QLQ 212

Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQ 272
              A  LH ICG+++VN  +IG++    + LY    L+ H+C PN  H++       +++
Sbjct: 213 QLEAAELHEICGILDVNCFEIGQHGAKARTLYPSAFLLAHDCSPNTAHTDDPLSYAILLR 272

Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
           +   V     ++  Y   L GT  RR  +   K F C C RC+DP ELG+  SA+ C + 
Sbjct: 273 TSRAVREQETLTLSYAYTLQGTLKRRSFIQGGKLFWCRCRRCADPRELGSDCSALVCRSC 332

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
                   I  ++PLD  +DW C  C+ RL+A  +  +  ++ + ++ +  +  ++ +LE
Sbjct: 333 QLGS----IRAIDPLDQSADWACDRCAHRLSAVQLERLLDRINDDLESI--DVHDIPALE 386

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------V 443
             L +   +  P+HY   + K+SL Q YG   GY   +LS   + RK  Y         +
Sbjct: 387 NFLARYREVLRPNHYLLLSAKYSLCQTYGRIEGYLLPELSPQDIARKERYCREFLAVIDL 446

Query: 444 MSPNLKATDEPICLGCHKPL 463
           + P L      I    H P+
Sbjct: 447 LEPGLTRLRGVIMYELHAPI 466



 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 27/51 (52%), Gaps = 5/51 (9%)

Query: 453 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD---AHTHAPECAI 500
           EP+CLGC++P+    A   RC  C WP C A C G+      HT   EC I
Sbjct: 82  EPVCLGCYRPVQLT-AQQYRCAGCGWPLCGASCRGMQQRRLGHTE-EECEI 130


>gi|170029520|ref|XP_001842640.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863224|gb|EDS26607.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 498

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 125/406 (30%), Positives = 203/406 (50%), Gaps = 36/406 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +++   P LGRY   T  L  G+ +  E P  +GPK + + P+CL C  P++ + A   +
Sbjct: 6   FKVLECPELGRYAVTTKKLKAGEELFEEKPFAIGPK-SDSPPVCLECCVPVDGS-AFGPK 63

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
           CP+C WP C   C  +     H  EC +           +D      + + I PLR L+ 
Sbjct: 64  CPQCGWPLCE-EC--VRSVVYHKAECDLFVKNKVKFQNVDDSTAGCVQLDCITPLRVLLA 120

Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +   P+++  E+  M+ H  +R   T    +  ++V++LRS    K      D  S + +
Sbjct: 121 KEADPERWNAEISRMQDHREERRGSTFWDADQKNVVEFLRSECGLK------DRCSEELI 174

Query: 230 HWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
             + G++EVN  +     G   +GLY  + +M H+C+PN  HS           ++ + V
Sbjct: 175 QQVIGILEVNAFEARTSSGYTIRGLYPKLAIMAHSCVPNVVHSIHPSKDYRLTARTAIDV 234

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
              + + T YT  L GT  R+  L  TKYF C C+RC DPTELGT+FS++KC  +  D G
Sbjct: 235 EEGSKLYTTYTYTLSGTMVRQAALKSTKYFTCQCKRCLDPTELGTHFSSLKC--QKCDNG 292

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKSLEEM 394
              I+   P D +++W C  C  +L    +      +  ++D+L  ++  P  ++S E +
Sbjct: 293 --VIVSSKPTDEEAEWHCTHCEYKLKGAAMAKAIQVMQAEIDELAYMEYGPERLESFERV 350

Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             K  ++ HP H+   ++++SLI+LYG  PGY   +L   LLERKI
Sbjct: 351 FKKYRSVLHPLHFINTSIRNSLIELYGRIPGYMMPELPDILLERKI 396


>gi|170029522|ref|XP_001842641.1| msta [Culex quinquefasciatus]
 gi|167863225|gb|EDS26608.1| msta [Culex quinquefasciatus]
          Length = 489

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 130/407 (31%), Positives = 203/407 (49%), Gaps = 36/407 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           + +  SP LGRY  A  +L  G+ I  E    +GPK A   P+CL C  P++   AD  +
Sbjct: 6   FTVLESPELGRYGVARRNLRAGEIIFEEQVFAIGPK-ASTSPLCLECASPVDGG-ADGPK 63

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYND--YKYEAILPLRCLIL 170
           CP+C WP C   C G      H  EC +    K+  +     +    + + I PLR L+ 
Sbjct: 64  CPQCGWPLC-GECVG--SVVYHKGECELFVQHKVKFQNQQNSDGCCAQLDCITPLRVLLA 120

Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +   P+++  E+  ME H ++R        + +++V+YLR      L   + D  S + +
Sbjct: 121 KEAHPERWNAEICMMEDHRAERCGSVYWNADQNNVVRYLR------LACGLKDRCSEELI 174

Query: 230 HWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
             + G++EVN  +     G   +GLY  + +M H+C+ N  HS         + ++ + V
Sbjct: 175 QQVIGILEVNAFEARTHRGYAVRGLYPKLAIMAHSCVRNVVHSIHPSKDYRLIARTAIDV 234

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                + T YT A   T  R+  L  +KYF+C CERC DPTELGTYFS++KC   +K   
Sbjct: 235 EEGGKLYTSYTFAQDPTHYRQTTLKDSKYFSCQCERCLDPTELGTYFSSLKCRKCNKGS- 293

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEM 394
              ++   P D +++W C SC   L A  + +VT  + + V K+V        ++S E+ 
Sbjct: 294 ---MVSSKPTDEEAEWHCTSCDRVLEADAMAIVTKVMQDDVGKIVYMESGTERLESYEQA 350

Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
             K  A+  P HY   +++ SLI+LYG  P Y   +LS + LERKI+
Sbjct: 351 FAKYSAVLEPLHYLLTSIRQSLIELYGRVPEYRLHELSDAKLERKIT 397



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 4/59 (6%)

Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           V +   KA+  P+CL C  P++   AD  +CP+C WP C   C G      H  EC + 
Sbjct: 35  VFAIGPKASTSPLCLECASPVDGG-ADGPKCPQCGWPLC-GECVG--SVVYHKGECELF 89


>gi|195133792|ref|XP_002011323.1| GI16059 [Drosophila mojavensis]
 gi|193907298|gb|EDW06165.1| GI16059 [Drosophila mojavensis]
          Length = 526

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 126/439 (28%), Positives = 216/439 (49%), Gaps = 43/439 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y ++ S L GRYL A   L  G+++ RE PL +GP ++  EP+CLGC++P+     +  R
Sbjct: 37  YRVEHSELYGRYLVANRALAAGEQLIREQPLAIGPCVS-CEPVCLGCYQPVQLG-PEQCR 94

Query: 116 CPRCFWPACSARCSGLSDAHTHA-----------PECAILKLGCETLLAYNDYKYEAILP 164
           C  C WP C+A C GL  +  H+           P+ A L     +  A  D  YE +L 
Sbjct: 95  CKGCGWPLCAADCRGLGQSSGHSREECELYGRRRPQAAELLKESASPDAVRDL-YELVLI 153

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDD 223
           +R L++++  P+KY +++ ME+H  +R     +++  +  +V+ L  ++  +L  ++   
Sbjct: 154 VRILLMRKHQPEKYAKIRLMESHTEQRRENLTLWQHYEKKVVQRLLHDW--QLAQELSGL 211

Query: 224 TSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQS 273
             A+ +H ICG+++VN  +IG+     + LY    L+ H+C PN  H++       ++++
Sbjct: 212 EPAE-MHEICGILDVNCFEIGQRGGKARTLYPSAFLLAHDCRPNTSHTDDPLDYSIILRT 270

Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
              V     ++  Y   L GT  RR  +   K F C C+RC+DP ELG+  S + C    
Sbjct: 271 SRSVREQETLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSDLVC---- 326

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           K      I    PL  ++DW C  C+ R+ +  V  +   + + ++ +  +  ++ +LE 
Sbjct: 327 KICRAGSIRATEPLKQEADWACDRCAHRIESVQVERLLDHINDDLESI--DVHDIPALEN 384

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------VM 444
            L +   +  P+HY   + K+SL Q+YG   GY   +LS   +ERK  Y         ++
Sbjct: 385 FLIRYREVLRPNHYLLLSAKYSLCQIYGRVEGYLLPELSPQDIERKERYCREFLAVVDIL 444

Query: 445 SPNLKATDEPICLGCHKPL 463
            P L      I    H P+
Sbjct: 445 EPGLTRLRGLIMYELHAPI 463



 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 5/88 (5%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS  +LYG          +   L R+    + P +    EP+CLGC++P+     +
Sbjct: 37  YRVEHS--ELYGRYLVANRALAAGEQLIREQPLAIGPCVSC--EPVCLGCYQPVQLG-PE 91

Query: 470 NARCPRCFWPACSARCSGLSDAHTHAPE 497
             RC  C WP C+A C GL  +  H+ E
Sbjct: 92  QCRCKGCGWPLCAADCRGLGQSSGHSRE 119


>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
          Length = 513

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 126/411 (30%), Positives = 195/411 (47%), Gaps = 40/411 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  +  +GRY+ A  +L  G+ I  E P IVGPK A   P+CL C+    P L +   
Sbjct: 19  YKILQNDKVGRYMVANKELEAGEEIITEMPFIVGPKAA-TYPLCLSCYVSWPPTLNEKPL 77

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
           C +C WP CS  C        +  EC I      K   +  L   +     + E I PLR
Sbjct: 78  CSKCGWPVCSTECENQPQHKDY--ECQIFAQANEKFNVQAALEETNENGVPQLECITPLR 135

Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
            L+   ++P++++ E+KDMEAH   R    +   +  +IV+Y+R           L+  S
Sbjct: 136 LLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKSDQINIVEYIRKQL-------KLERFS 188

Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
            + +  +CG++E+N  ++    G   + +Y  + LM H+C+ N  HS         +++ 
Sbjct: 189 EEQIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRLRTT 248

Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
           L +     +   YT++L  T  RREHL   K F C+C RCSDPTELGT+ S++KC     
Sbjct: 249 LKIPVGGELYGSYTHSLLPTMLRREHLLEGKNFACACARCSDPTELGTHMSSLKC--NKC 306

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
           D G   +L ++ LD +S W C  C    +   V  V   +  +VD +           ++
Sbjct: 307 DNG--IVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQIIQMEVDAVEAISGADGADAIQ 364

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
             E +  K  ++ HP H     ++HSL Q+YG    Y    L   +LE KI
Sbjct: 365 ERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKI 415



 Score = 39.3 bits (90), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 2/47 (4%)

Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
           ++ +++ P  KA   P+CL C+    P L +   C +C WP CS  C
Sbjct: 46  EMPFIVGP--KAATYPLCLSCYVSWPPTLNEKPLCSKCGWPVCSTEC 90


>gi|195383960|ref|XP_002050693.1| GJ20068 [Drosophila virilis]
 gi|194145490|gb|EDW61886.1| GJ20068 [Drosophila virilis]
          Length = 515

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 128/442 (28%), Positives = 212/442 (47%), Gaps = 35/442 (7%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC     +KCS C +V+YC  QHQK+ WK HK +C   P++I  + LLGR+L AT ++
Sbjct: 4   CQVCETATKNKCSNCNQVSYCSVQHQKQDWKAHKSQC--YPFKIAQNELLGRHLVATRNI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+C+GC   + PN  D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVIKEAPLMRGPS-QISAPVCMGCLNCIEPN--DHITCDKCGWPLCGPECHALDE- 117

Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC + +   +   +  +N     Y  +  +RCL++   +P    + K +E     
Sbjct: 118 --HRAECQLTQERGQKVNVQEFNGPHPLYTCVSTVRCLLIGETNPANAAKFKQLETLEQT 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
           R    +   ++ SI +++   F  +         S + +    G +++NG ++       
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGALQINGHEVPTSDPPH 228

Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
             ++      E++C+PN AK  N     M+ +   + +++H+S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFNKHGHCMLWAPKEIKKNSHLSICYSDAVWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F CSCERC D TELGT +SA+KC  + K  G   +LP+   D    W C  C  +
Sbjct: 289 MQTKLFKCSCERCLDVTELGTNYSALKC-EDRKCPG--LLLPIKADDWHGSWRCRECQKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
           +  + V  +  +  + +  + +   N     + L   E      HYH   VK   +QL  
Sbjct: 346 VQRKYVDGILERASQDIQSMEKTAENGF---KYLKHYEKWLPTQHYHLSEVKILQVQLIA 402

Query: 422 TQPGYAYTQLSSSLLERKISYV 443
                    LS   L+ K+ Y 
Sbjct: 403 KDQQELMI-LSDEHLQLKLKYA 423


>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
 gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
 gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
 gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
 gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
          Length = 513

 Score =  186 bits (471), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 122/418 (29%), Positives = 205/418 (49%), Gaps = 34/418 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC E   +KCS C +V+YC  QHQK+ WK+HKP C   P++I  +  LGR+L AT  +
Sbjct: 4   CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKVHKPSCH--PFKIAHNEQLGRHLVATRTI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+CLGC   +     D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117

Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC + K   + +           Y  +  +RCL++   S +K  + +D+E+  S 
Sbjct: 118 --HKAECGLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSTEKASKFQDLESLEST 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
           R    +   ++ SI +++   F  +      ++   K +    G +++NG ++       
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEEIMKAV----GALQINGHEVPTTDPSH 228

Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
             ++      E++CLPN AK  N     ++ +   + ++ H+S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F C+CERC D TEL T +SA+KC +    Q    +LP    + +  W C  C  +
Sbjct: 289 MQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADEWNGSWRCRECHKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           +    V  +  + G+ +  + +   N     + L   E    P H+H   +K  L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSMEKIAENGL---KYLKHYEKWLPPQHFHMSEIKILLVQL 400


>gi|91088043|ref|XP_966391.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270012080|gb|EFA08528.1| hypothetical protein TcasGA2_TC006181 [Tribolium castaneum]
          Length = 492

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 122/405 (30%), Positives = 195/405 (48%), Gaps = 39/405 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           + IK +  +GR+  A+  L PGD I  E+P   GPK + + P+CLGC  P++  +     
Sbjct: 5   FTIKRNEKVGRFAVASRALKPGDLIFTETPFAYGPK-SDSPPLCLGCFAPVDCTVL---- 59

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
           C  C WP C   C           EC +         A  D      +YE I PLR L++
Sbjct: 60  CSHCSWPVCGPECEAKPSHRDF--ECGVFASAGVKFQAVEDPTNVCLQYECITPLRVLLM 117

Query: 171 QRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKC 228
           + + PK++QE ++ ME+H   R        +   I ++ + N ++ L G   LD  + + 
Sbjct: 118 KEKDPKRWQEEIEAMESHNDVR--------KTKPIWEFNQVNIVDYLKGPCKLDRFADEL 169

Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFV 277
           +H +CG++E+N  +     G   + L+  + ++ HNC+ N  HS         + + + V
Sbjct: 170 IHTVCGILEINAFEARASSGYLIRCLFPKLAILSHNCVSNVHHSVDCDTFQVSVSASVEV 229

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                + + YT +LW T  RRE L  +KYF C C+RC D TELGT+   +KC     D G
Sbjct: 230 PESGELFSSYTYSLWPTLVRREFLKESKYFECMCDRCKDKTELGTHLGTLKC--NKCDNG 287

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPNVKSLEEML 395
              IL  +PL    +W C  C  + +A  V  V   +  ++D  +++     ++  E + 
Sbjct: 288 --VILSTDPLSFTCEWKCTHCEFKTSAIAVRKVYEAVQTEIDAVEMLSGPEGIEQREAIF 345

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            K  ++FHP + +   ++ SL QLYG   GY    L   LLERK+
Sbjct: 346 RKYRSVFHPKNAYMTILRTSLSQLYGRAEGYTVEDLPDLLLERKV 390


>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 483

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 125/399 (31%), Positives = 188/399 (47%), Gaps = 40/399 (10%)

Query: 68  LQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSAR 127
           + A  +L  G+ I  E P ++GPK A   P+CL C  P      D + C +C WP C   
Sbjct: 1   MVANRELRAGEEIITEMPFVIGPK-ACTYPLCLSCFTPWPLEPDDKSLCSKCGWPVCGEE 59

Query: 128 CSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLRCLILQRRSPKKY 178
           C        +  EC +      K   +  L  N      + E I PLR L+   R+ +++
Sbjct: 60  CENAPQHKDY--ECQVFAQANEKFNVDAALDGNSENGVPQLECITPLRLLLESERNVERW 117

Query: 179 -QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
            +E+KDMEAH   R   ++   +  +IV YLR           LD  S K +  ICG++E
Sbjct: 118 NKEVKDMEAHNKTRCQKSQWKSDQINIVDYLRKRL-------KLDRFSEKYIQTICGILE 170

Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHISTM 286
           +N  ++    G   +GLY  + +M H+C+ N  HS         +++ L +     +   
Sbjct: 171 INTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLKIPAGGELYAS 230

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YT++L  T  RREHL   K+F C+C RCSDPTELGT+ S++KC     D G   +LP++ 
Sbjct: 231 YTHSLLPTILRREHLLEGKHFACACPRCSDPTELGTHMSSLKC--NKCDNG--IVLPLDS 286

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSLEEMLTKLEAM 401
           LD++S W C  C    N + V  +   +  QVD             +   E ++ K   +
Sbjct: 287 LDSESTWKCTHCDFSTNGQAVRKILRIIQAQVDAAEAISGADGADAIYKRETVMKKYRLV 346

Query: 402 FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            HPHH     ++HSL Q+YG    Y    L   +LE K+
Sbjct: 347 LHPHHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKV 385


>gi|194762638|ref|XP_001963441.1| GF20287 [Drosophila ananassae]
 gi|190629100|gb|EDV44517.1| GF20287 [Drosophila ananassae]
          Length = 528

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 203/409 (49%), Gaps = 36/409 (8%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P+ I+ S + GRYL A+  L  G+ +  E PL +GP ++  +P+CLGC++P++ + +   
Sbjct: 44  PFRIEHSDIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDPVCLGCYQPVSLD-STQY 101

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPE----------CAILKLGCETLLAYNDYKYEAILP 164
           RCP C WP C   C G+   H H  E           A   L   T  A     YE ++ 
Sbjct: 102 RCPGCQWPLCRDSCPGIHQPHGHTSEECQLYGERRAVAAGLLTDRTGPAEVRDLYELVMI 161

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDD 223
           +R L+L++R P+++  ++ ME+H  +R     +++  +  +V+ LR  +        L+D
Sbjct: 162 VRILLLRQRDPEQFAAIERMESHTEERRKNQVLWQHYEQKVVQRLREVW-------QLED 214

Query: 224 TSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
              + +H +CG+++VN  +IG+     + LY    L+ H+C PN  H++  +S   + R 
Sbjct: 215 LQPEQIHQVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCSPNTAHTDHPESYAIILRT 274

Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
           +        ++  Y   L GT  RR  +   K F C C+RCSDP ELG+  SA+ C    
Sbjct: 275 SRRVREREALTLSYAYTLQGTLKRRSFMHEGKLFWCVCQRCSDPRELGSDCSALVC--PV 332

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
             QG   +    PL+ +  W C  C  ++ A+D+     ++   ++ +  +  ++  LE 
Sbjct: 333 CRQGS--LRAQEPLNQEGSWGCDKCDHKMLAKDLESRLDRINNDLEAI--DVHDIPGLEN 388

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
            L +   +  P+HY   + K+SL Q+YG   GY   Q+S   + RK  Y
Sbjct: 389 FLVRYREVLRPNHYLLLSAKYSLCQIYGRIEGYLLPQMSPEDIARKERY 437



 Score = 45.8 bits (107), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPE 497
            + +P+CLGC++P++ + +   RCP C WP C   C G+   H H  E
Sbjct: 81  VSGDPVCLGCYQPVSLD-STQYRCPGCQWPLCRDSCPGIHQPHGHTSE 127


>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 519

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 128/437 (29%), Positives = 202/437 (46%), Gaps = 56/437 (12%)

Query: 8   EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
           E EE+  CAVC+  A  KC  CK V YCG++HQK HW+ H  KCK   Y++  +  LGR+
Sbjct: 6   EREEV--CAVCKVPAKQKCGGCKSVFYCGREHQKAHWREHSAKCKS--YKLVENEKLGRH 61

Query: 68  LQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSAR 127
             A   +  G+ + RE   +V        PICLGC+KPL  + A    C  C WP C A 
Sbjct: 62  YVAIRRIEAGEVVIREDDALVQAPQQETVPICLGCYKPLESDTAKP--CELCGWPLC-AS 118

Query: 128 CSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELK 182
           C+      +H  EC   K   +T +    +      Y+ I  +R L L+  +P+ Y+   
Sbjct: 119 CT------SHGAECDFTKRHRDTKVTITGFGIAHPTYKCIGVVRALALRETNPEAYERFT 172

Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNG-- 240
            +          +   EE   I ++++  F      + L D S   +    G++++NG  
Sbjct: 173 KL---------ASGELEEPREIARFIKRFF------NKLHDFSEDEIARAAGIMQINGHE 217

Query: 241 ------VDIGRYTQGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYT 288
                 V I  Y +  Y     +EH+C  N   S       ++++ + + +  HI   YT
Sbjct: 218 VPISEPVLIAVYDESSY-----VEHSCRANCSKSFTSSGGIVIRAAMEIEKGEHIMICYT 272

Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLD 348
           + LWGT  RR HL  TK+F C+C RCSDP E G+ F+A++C     D   C +     + 
Sbjct: 273 DPLWGTANRRHHLLRTKFFECTCPRCSDPAEFGSMFNAIEC--SKSDCSGCMLPKSFTVQ 330

Query: 349 NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYH 408
              D+ C  C    ++  V  +  Q+G+++  + + +    +      K  +  H +HY+
Sbjct: 331 EIPDYRCNKCDNSRSSNSVDEMLEQIGKEMASMRKNDAG--ACRSFARKHASKLHDNHYY 388

Query: 409 CYAVKHSLIQLYGTQPG 425
              VK +L Q+ G Q G
Sbjct: 389 MTDVKLALAQMIGQQTG 405


>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
 gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
          Length = 515

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 119/420 (28%), Positives = 203/420 (48%), Gaps = 34/420 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC+    +KCS C +V+YC  QHQK+ WK HK  C   P+ +  + LLGR+L AT ++
Sbjct: 4   CNVCQAETKNKCSNCNQVSYCSVQHQKQDWKSHKSNCH--PFRVAHNELLGRHLVATRNI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+C+GC   + PN  D+  C +C WP C   C  L + 
Sbjct: 62  KPYEIVLKEAPLMRGPA-QISVPVCMGCLNAIEPN--DHITCDKCGWPLCGPECQALDE- 117

Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC + +   +   +  +N     Y  +  +RCL+L   +P   ++ + +E+    
Sbjct: 118 --HKAECQLTEARGQKVNVQEFNGPHPLYTCVSTVRCLLLSETNPANAEKFQQLESLEQT 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
           R    +   ++ SI +++   F  +         S + +    G +++NG ++       
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGALQINGHEVPTSDPPH 228

Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHL 301
             ++      E++C+PN   S        ++      +  ++S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F C+CERC D TELGTY+SA+KC  + K  G   +LP+   D +  W C  C  +
Sbjct: 289 MQTKLFKCTCERCVDVTELGTYYSALKC-EDRKCTG--LLLPIKADDWNGCWRCRDCQKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
           +  + V  +  +  + +  + +   N     + L   E    P HYH   VK   +QL  
Sbjct: 346 VQRKYVDGILERAIKDIQSMEKTAENGF---KYLKHYEKWLPPQHYHLSEVKILQVQLIA 402



 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 5/47 (10%)

Query: 454 PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAI 500
           P+C+GC   + PN  D+  C +C WP C   C  L +   H  EC +
Sbjct: 83  PVCMGCLNAIEPN--DHITCDKCGWPLCGPECQALDE---HKAECQL 124


>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
 gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
          Length = 515

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 126/441 (28%), Positives = 214/441 (48%), Gaps = 35/441 (7%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           C VC     +KCS C +VAYC  QHQKE WK HK  C   P++I  + +LGR+L AT ++
Sbjct: 4   CHVCETPTKNKCSNCNQVAYCSVQHQKEDWKAHKIHCH--PFKIAKNEVLGRHLVATRNI 61

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
            P + + +E+PL+ GP   ++ P+C+GC   +  N  D+  C +C WP C   C  L++ 
Sbjct: 62  KPYEIVLKEAPLMRGPA-QISAPVCMGCLNCIEAN--DHISCEKCGWPLCGPECPSLTE- 117

Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
             H  EC + +   +   +  +N     Y  +  +RCL+L   +P   ++   +E+    
Sbjct: 118 --HRAECQLTEERGQKVNVTEFNGPHPLYTCVSTVRCLLLNTTNPANAEKFNQLESLEQT 175

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
           R    +   ++ SI +++   F  +         S + +    G++++NG ++       
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGILQINGHEVPISDPPH 228

Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
             ++      E++C+PN AK  N     ++ +   + +++H+S  Y++A+WGT  R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHL 288

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
             TK F C+CERC D TELGT +SA+KC  + K  G   +LP    D +  W C  C  +
Sbjct: 289 MQTKLFKCACERCLDVTELGTNYSAIKC-EDRKCTGQ--LLPSKADDWNGSWRCLDCQKQ 345

Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
           +  + +  +  +  + +  + +   N     + L   E    P HYH   VK   +QL  
Sbjct: 346 VQRKYIDGILERASKDIQNMEKTAENGF---KYLKHYEKWLPPQHYHLSEVKILQVQLIA 402

Query: 422 TQPGYAYTQLSSSLLERKISY 442
                    LS   L+ K+ Y
Sbjct: 403 KDQKELMI-LSDEQLQLKLKY 422


>gi|242019692|ref|XP_002430293.1| protein msta, isoform A, putative [Pediculus humanus corporis]
 gi|212515408|gb|EEB17555.1| protein msta, isoform A, putative [Pediculus humanus corporis]
          Length = 501

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 132/428 (30%), Positives = 209/428 (48%), Gaps = 55/428 (12%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           ++I  +   GRY  A  D  PGD I +ESP   GPK + + P+CLGCH+ ++  +     
Sbjct: 8   FDILKNDKFGRYAVALQDYEPGDLIIQESPFTYGPK-SDSPPVCLGCHRKVDGKIL---- 62

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK--------YEAILPLRC 167
           C  C WP C  +C      H+   EC I        + +   K        YE I PLR 
Sbjct: 63  CSTCGWPVCDEKCEK-EPIHSEN-ECIIFS---SVKVRFTKVKNPQGNCPQYECITPLRV 117

Query: 168 LILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVY--EEIDSIVKYLRSNFLEKLPGDVLDDT 224
           L+ + + P+ +  ++  +E+H  +R   T+++  E I+++       FL K+   + +  
Sbjct: 118 LLGKEKFPEIWNSQISKLESHNEERQK-TDIWKVESINTV------GFLHKV-CKLQERF 169

Query: 225 SAKCLHWICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKH-----------SN 269
           S+  +H+ CG++E N        G   + LY    L  H+C+PN  H           SN
Sbjct: 170 SSDLIHFACGILETNAFSSPSSYGYDMRYLYPKSSLFLHSCIPNVGHYIFKNSNDGERSN 229

Query: 270 MM--QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
           ++  ++ L + +    +  YT  LW T  RR HL   K+F+C C+RCSDPTEL T FS +
Sbjct: 230 VIHVRASLKIKKGQEFTLSYTQTLWPTLLRRSHLKEGKFFDCCCQRCSDPTELKTNFSTI 289

Query: 328 KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN-- 385
           KC N +    + WIL  +PL  D+ W C SC+ +L    V  + S + ++V+ L + +  
Sbjct: 290 KCSNCY----NGWILNTDPLVVDAPWNCTSCNWKLTGDSVSKLFSNIQQEVENLNKLDNY 345

Query: 386 -PNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS--Y 442
              V+  E  + K +++ HP++     VK+SL QLYG    Y    L   LLE+K+   Y
Sbjct: 346 LDPVEIRENFIKKYKSLLHPNNGFLIIVKYSLAQLYGQAKEYFLDDLPDILLEKKLKLCY 405

Query: 443 VMSPNLKA 450
            +   LK 
Sbjct: 406 TLDEVLKT 413


>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
 gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
          Length = 504

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 126/410 (30%), Positives = 202/410 (49%), Gaps = 47/410 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ S + GR+L A   +  G+ + +E PL +GPK      +CL C++P         RC 
Sbjct: 14  VECSRIYGRFLVANEPVQRGELLVQEQPLAIGPKCN-GYVLCLACYRP-----DVQERCS 67

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLL-----AYNDYKYEAILPLRCLILQR 172
            C WP CS  C G +       ECA+L+L           + +  + + I+PLR L+ + 
Sbjct: 68  LCGWPLCSL-CDGNASNQHLELECAVLQLAKARFYEMPAGSLHWPQLDCIMPLRVLLAKE 126

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
             P+++  E+  ME H   R    +V+  +  +I +YLR       P  + +  S + + 
Sbjct: 127 AQPERWLNEVAPMEHHEQARRENADVWHADRVNIAQYLRG------PCKLAERFSEELIM 180

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
            + GV+EVN  +  R +QG     LY    ++ HNC+PN   +          +++ + +
Sbjct: 181 QVVGVLEVNAFE-ARTSQGYPLRCLYPYTGILAHNCVPNTARTIHPSEGYKIRLRAMVAL 239

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                +   YT  L GT  R+ HL   KYF CSCERC D +EL T+FS+MKC       G
Sbjct: 240 EAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFSSMKC-------G 292

Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE- 393
            C   W+LP +P   +S W C +C    ++ +V  + + L  +V+ +   +   K LEE 
Sbjct: 293 QCTEGWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQALDMGPKRLEES 352

Query: 394 --MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
             +L K +++ HP HY   +++  LI++YG   GY   QL+  LLERK S
Sbjct: 353 ERLLRKYKSLLHPSHYIATSLRQLLIEMYGRVQGYEMVQLTDQLLERKAS 402


>gi|158297510|ref|XP_317734.4| AGAP007779-PA [Anopheles gambiae str. PEST]
 gi|157015230|gb|EAA12332.4| AGAP007779-PA [Anopheles gambiae str. PEST]
          Length = 498

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 129/404 (31%), Positives = 200/404 (49%), Gaps = 31/404 (7%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++   P LGRY  A  DL  G+ +  E+P  VGPKL  + P+CL C  P++       R
Sbjct: 6   YKVLECPELGRYGVAARDLKAGELLFEETPFAVGPKLD-SPPLCLECCCPVDGG-EGGPR 63

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
           C RC WP C    +G ++   H  EC +            D      + + I PLR L+ 
Sbjct: 64  CSRCGWPLCEDCSAGGAELVYHRGECEVFAANGVRFRPVEDSTAGCVQLDCITPLRVLLA 123

Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +     +++ E++ ME H ++R  G     + ++IV +LR       P  +    SA+ +
Sbjct: 124 KESDEARWKREIEPMEYHDAERREGANWKVDENNIVAFLRG------PCGLGKRFSAELV 177

Query: 230 HWICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKHS----NMMQSKLFV--FR 279
               G+++VN  +     G   +GLY  + +M HNC+PN  HS    N M +++ V    
Sbjct: 178 QRAIGLLDVNAFEGRTGSGYSCRGLYPQLAIMAHNCVPNVVHSIHPSNGMVARVAVDVKE 237

Query: 280 DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC 339
              + T YT  L GT  R+  L  +K+F C C RC DPTELGT+FS + C    K  G  
Sbjct: 238 GDKLYTTYTYTLTGTVARQSILKASKFFTCRCTRCLDPTELGTHFSTLLC---SKCSGG- 293

Query: 340 WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEMLT 396
            I   +PLD +++W CG C  + +   V      +  ++D+L     E   +++ E +  
Sbjct: 294 LITSTDPLDENAEWKCGQCGFKTSGAAVQKAVMTIHNEIDELACFEYEAGRLEAYETVYK 353

Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           K  ++ HP H+   +++HSLI+LYG  PGY   +L   LLERK+
Sbjct: 354 KYRSVLHPLHFINTSIRHSLIELYGRIPGYEMAELPDVLLERKV 397


>gi|198467928|ref|XP_001354556.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
 gi|198146172|gb|EAL31609.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
          Length = 533

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 124/410 (30%), Positives = 203/410 (49%), Gaps = 36/410 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           + I  S + GRYL A+  L  G+ +  E PL +GP ++  + +CLGC++P+     +  R
Sbjct: 46  FHIAQSEIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDAVCLGCYQPVGLG-REQYR 103

Query: 116 CPRCFWPACSARCSGLSDAHTHAPE-CAIL---KLGCETLLAYNDYK--------YEAIL 163
           CP C WP C A C GL     H  E CA+    +     LL  ND          YE ++
Sbjct: 104 CPGCGWPLCGASCPGLGRRIGHTTEECALYGERRPAAAELL--NDRTSPPRVRDLYELVM 161

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
            +R L+L+  +P++Y  +  ME+H  +R    +++   +  +V+ LR  +  KL  D+  
Sbjct: 162 IVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEW--KL-DDLAA 218

Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
           +  A+ +H ICG+++VN  +IG+     + LY    L+ H+C PN  H++  +S   + R
Sbjct: 219 EEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPNTAHTDDPRSYAILLR 278

Query: 280 DTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
            +        ++  Y   L GT  RR  +   K F C C+RC+DP ELG+  SA+ C + 
Sbjct: 279 TSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALVCRSC 338

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
            K      +  V PL+  +DW C  C  R  A D+     ++   ++ +  +  +V  LE
Sbjct: 339 LKGT----VRAVTPLEQTADWACDRCDHRTTALDLENQLDRINNDLEAI--DVHDVPGLE 392

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
             L +      P+HY   + K+SL Q+YG   GY   ++S + + RK  Y
Sbjct: 393 NFLLRYRETLRPNHYLLLSAKYSLCQIYGRIKGYLLPEMSLNDIVRKERY 442



 Score = 42.7 bits (99), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 500
            + + +CLGC++P+     +  RCP C WP C A C GL     H   ECA+
Sbjct: 82  VSGDAVCLGCYQPVGLG-REQYRCPGCGWPLCGASCPGLGRRIGHTTEECAL 132


>gi|195117408|ref|XP_002003239.1| GI23580 [Drosophila mojavensis]
 gi|193913814|gb|EDW12681.1| GI23580 [Drosophila mojavensis]
          Length = 507

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 129/415 (31%), Positives = 197/415 (47%), Gaps = 58/415 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A+  +  G+ +  E P  +GPK      +CL C++P       + RC 
Sbjct: 14  VEWSPIYGRYLTASEPVKRGELLIEEQPFALGPKCN-GYVLCLNCYRP-----EVHERCA 67

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA------------ILPL 165
            C WP C     G  +          L+L C  L A     YE             I+PL
Sbjct: 68  LCGWPLCGPCSDGEEEEENQH-----LQLECSVLRAARARFYEMSAGSKYWPQLDCIMPL 122

Query: 166 RCLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDD 223
           R L+ +   P+++  E+  ME H  +R    +V+  +  +I +YLR       P  + D 
Sbjct: 123 RVLLAKEAQPERWLNEVAPMEHHELERRENADVWHADRVNIAQYLRG------PCKLADR 176

Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSN--------M 270
            S + +  + GV+EVN  +  R +QG     L+    ++ HNC+PN   +          
Sbjct: 177 FSEELIMQVVGVLEVNAFE-ARTSQGYPLRCLFPYTGILAHNCVPNTARTIHPSEGYKIR 235

Query: 271 MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
           +++ + +     +   YT  L GT  R+ HL   KYF CSCERC D +EL T+FS++KC 
Sbjct: 236 LRAMVALEEGQQLQHSYTYTLDGTAQRQAHLREGKYFVCSCERCLDASELQTHFSSLKC- 294

Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
                 G C   W+LP  P   DS W+C SC    +A DV  +   L  +V+ +   +  
Sbjct: 295 ------GQCTEGWLLPKQPTVVDSSWSCRSCDYSTSAEDVRQIVDALQAEVNAVQALDMG 348

Query: 388 VKSLEE---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            K LEE   +L K +++ HP HY   +++  LI++YG   GY   QL+  LLERK
Sbjct: 349 PKRLEEAERLLRKYKSLLHPSHYIATSLRQLLIEMYGRVQGYEMVQLTDQLLERK 403


>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
          Length = 510

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 123/401 (30%), Positives = 189/401 (47%), Gaps = 41/401 (10%)

Query: 66  RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACS 125
           RY+  + +L  G+ +  E P +VGPK A   P+CL C+ P  P   D   C  C WP C 
Sbjct: 27  RYMVTSRELQAGEELVTEMPFVVGPK-ACTYPLCLSCYTPWPPEPDDKQLCSTCGWPVCG 85

Query: 126 ARCSGLSDAHTHAP-ECAIL-----KLGCETLLAYNDY---KYEAILPLRCLILQRRSPK 176
             C    +A  H   EC +      K   +  L  N+    + E I PLR L+   R+ +
Sbjct: 86  QECE---NALQHKDYECQVFAQANEKFNIDAALQNNENGVPQLECITPLRLLLESERNVE 142

Query: 177 KY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
           ++ +E+KDME H   R    +   +  +IV YLR           LD  S + +  +CG+
Sbjct: 143 RWNKEIKDMEPHNKIRCQKAQWKSDHVNIVDYLRKRL-------KLDRFSEEHIQTVCGI 195

Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHIS 284
           +E+N  ++    G   +GLY  + +M H+C+ N  HS         +++ L +     + 
Sbjct: 196 LEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLKIPAGGELY 255

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT++L  T  RREHL   K+F C+C RCSDPTELGT+ S++KC     D G   +L +
Sbjct: 256 ASYTHSLLPTMLRREHLLEGKHFACACPRCSDPTELGTHMSSLKC--NKCDNG--IVLSL 311

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSLEEMLTKLE 399
           + LD++S W C  C      + V  V   +  +VD             +   E ++ K  
Sbjct: 312 DSLDSESKWKCTHCDFSTTGQAVRKVFRIIQAEVDAAEAISGADGADAIHERETIMKKYR 371

Query: 400 AMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++ HP H     ++HSL Q+YG    Y    L   +LE K+
Sbjct: 372 SVLHPQHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKV 412


>gi|195169991|ref|XP_002025797.1| GL18252 [Drosophila persimilis]
 gi|194110650|gb|EDW32693.1| GL18252 [Drosophila persimilis]
          Length = 533

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/410 (30%), Positives = 202/410 (49%), Gaps = 36/410 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           + I  S + GRYL A+  L  G+ +  E PL +GP ++  + +CLGC++P+     +  R
Sbjct: 46  FHIAQSEIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDAVCLGCYQPVGLG-REQYR 103

Query: 116 CPRCFWPACSARCSGLSDAHTHAPE-CAIL---KLGCETLLAYNDYK--------YEAIL 163
           CP C WP C A C GL     H  E CA+    +     LL  ND          YE ++
Sbjct: 104 CPGCGWPLCGASCPGLGRRIGHTTEECALYGERRPAAAELL--NDRTSPPRVRDLYELVM 161

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
            +R L+L+  +P++Y  +  ME+H  +R    +++   +  +V+ LR  +  KL  D+  
Sbjct: 162 IVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEW--KL-DDLAA 218

Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
           +  A+ +H ICG+++VN  +IG+     + LY    L+ H+C PN  H++  +S   + R
Sbjct: 219 EEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPNTAHTDDPRSYAILLR 278

Query: 280 DTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
            +        ++  Y   L GT  RR  +   K F C C+RC+DP ELG+  SA+ C + 
Sbjct: 279 TSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALVCRSC 338

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
            K      +  V PL+  +DW C  C  R  A D+     ++   ++ +  +  +V  LE
Sbjct: 339 LKGT----VRAVTPLEQTADWACDRCDHRTKALDLENQLDRINNDLEAI--DVHDVPGLE 392

Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
             L +      P+HY   + K+SL Q+YG   GY   ++S   + RK  Y
Sbjct: 393 NFLLRYRETLRPNHYLLLSAKYSLCQIYGRIKGYLLPEMSLHDIVRKERY 442



 Score = 42.7 bits (99), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 500
            + + +CLGC++P+     +  RCP C WP C A C GL     H   ECA+
Sbjct: 82  VSGDAVCLGCYQPVGLG-REQYRCPGCGWPLCGASCPGLGRRIGHTTEECAL 132


>gi|195050973|ref|XP_001993006.1| GH13590 [Drosophila grimshawi]
 gi|193900065|gb|EDV98931.1| GH13590 [Drosophila grimshawi]
          Length = 504

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 135/417 (32%), Positives = 200/417 (47%), Gaps = 61/417 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ S L GRYL A+     G+ +  E PL +GPK      +CL C++P  P      RC 
Sbjct: 14  VEWSRLYGRYLVASGSATHGELLVEELPLAIGPKCN-GPVVCLVCYRPDVPE-----RCI 67

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE------------AILPL 165
            C WP CSA C+  ++A+ HA      +L C  LLA     YE            +I+PL
Sbjct: 68  HCGWPLCSA-CNE-TEANLHA------QLECPLLLASRARFYEMPTGSRYWPQLDSIMPL 119

Query: 166 RCLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDD 223
           R L+ +   P+++  E+  ME H   R    +++  +  +I +YLR       P  + + 
Sbjct: 120 RVLLAKEAQPERWLNEVAPMEHHEEARRENADIWHADRVNIAQYLRG------PCKLAER 173

Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHS----NMMQSK 274
            S + +  + GV+EVN  +  R   G     LY    ++ HNC+PN   S       + +
Sbjct: 174 FSEELIMQVVGVLEVNAFE-ARTINGYPLRCLYPYTGILAHNCVPNTARSIHPSEGYKVR 232

Query: 275 LFVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
           L    D      +   YT  L GT  R+ HL   KYF CSC+RC D +EL T+FS++KC 
Sbjct: 233 LRAMVDLEAGQQLQHSYTYTLDGTVQRQAHLQQGKYFRCSCKRCLDASELQTHFSSLKC- 291

Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
                 G C   W+LP  P   +S W C +C    +++DV  +   L  +V+ +      
Sbjct: 292 ------GQCTEGWLLPKQPTALESSWQCNACDNTTSSQDVREIVENLQAEVNGVQALEMG 345

Query: 388 VKSLEE---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
            K LEE   +L K +++ HP HY    ++  LI++YG   GY   QLS  LLERK S
Sbjct: 346 AKRLEESERLLRKYKSLLHPSHYIATGLRQLLIEMYGRVQGYEMVQLSDQLLERKSS 402


>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
 gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
          Length = 519

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 130/435 (29%), Positives = 201/435 (46%), Gaps = 43/435 (9%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
           C VC   AL +C+ C++V YCG+ HQ+  WK  H+ +C++  +++  +  LGR+L AT  
Sbjct: 6   CRVCSAPALQRCAGCQQVGYCGRDHQRADWKAQHRDQCRR--FKVVRNDRLGRHLVATRH 63

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHKPL------NPNLADNARCPRCFWPACSAR 127
           +  G+ I R+ P  VGPK+A   P+CLGC++ L         L     C RC WP C   
Sbjct: 64  IKQGEIIYRDEPYAVGPKIANV-PLCLGCNRNLMAGWDATRGLDRFHECSRCGWPLCGPG 122

Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
           C  ++    H PEC++L   G    +  N       +  Y  I+PLR L+L+R +P++Y 
Sbjct: 123 CEEVAQ---HRPECSVLAGSGYRPNIRPNPSNPEQRESAYCVIVPLRVLLLERIAPERYA 179

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKCLHWICGVIEV 238
            ++  E+H+ +R            +   LRSN +  L   + L   S + +  +  +++ 
Sbjct: 180 TVQGFESHLDER--------LASPLYGVLRSNLVPFLRQVLRLQQYSEQTVLKLSAILDT 231

Query: 239 NGVDIGRYTQ-----GLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDTH----ISTMY 287
           N  +I    Q     GLY +  ++ H+C PN KH   + +   L    D      I   Y
Sbjct: 232 NCYEIRLPEQHVKVRGLYPLGAMLSHDCRPNTKHYFDDRLHMVLVATVDIPAGGVIHASY 291

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
           T  L GT  RR  L   K F+C CERC+DPTE GT  S  +C N  +      +L   P 
Sbjct: 292 TQPLLGTVQRRLALRQAKCFDCCCERCADPTEYGTSASGFRCPNCRRTPS--LVLAAEPT 349

Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
           +  + W C +       R    V      Q + L  +       EE L +     HP + 
Sbjct: 350 NYRTVWRCQNQRCAYQERPDQYVARCERMQQELLALDRTEPAGYEEFLARHATTLHPWNA 409

Query: 408 HCYAVKHSLIQLYGT 422
           +    K++L QL G+
Sbjct: 410 YMLQAKYALTQLLGS 424


>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 499

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 126/404 (31%), Positives = 188/404 (46%), Gaps = 36/404 (8%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           ++   P +GRY+ A +DL        E PLI GP   +  P+CLGC K +  + A   RC
Sbjct: 7   KVAEDPTVGRYVAANVDLAAWQVALVERPLIRGPS-QVTRPVCLGCLKSITADSA--VRC 63

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQ 171
            RC WP C    S  SD   H  EC       +  +   D+      Y++I  +RCL  +
Sbjct: 64  ERCGWPMCPD--SNCSDDEWHKAECDWTVTRRKQKIEIKDFTNPHPSYQSITVIRCLYQK 121

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
             +P+ + +L  +E+H + R  G++   +   I  +L+  F +  P +   D   +    
Sbjct: 122 HNNPEVWAKLTKLESHCASRRGGSKYETDRIWIADHLQ-RFFKLDPTEWPVDEILR---- 176

Query: 232 ICGVIEVNGVDIGRYTQ---GLYSVICLMEHNCLPNAKHSNMMQSKLF-------VFRDT 281
           +CG+++VNG ++         +Y    ++EH+C+PN   S      L        V    
Sbjct: 177 VCGIVQVNGHEVPLTDPPYVAIYDAGSMLEHSCVPNCSKSFTRDGHLLIRTAAAAVESGG 236

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC-- 339
           H+S  YT+ LWGT  R  HLA TK+F C C RCSDPTELGTYFS +KC  E     DC  
Sbjct: 237 HLSISYTDVLWGTAQRLAHLADTKFFVCKCPRCSDPTELGTYFSGVKCATE-----DCMG 291

Query: 340 WILP-VNPLDNDS---DWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
           + LP V+P  ND    +W C  C    +   + L  + +G  +  + + +P    +    
Sbjct: 292 YSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIELSLAHVGRDMAAMNRRDPKHCHMFLDH 351

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
                  HP+HY+   V+ +L QLYG   G     LS   L  K
Sbjct: 352 YVSAGDLHPNHYYFTEVRLNLAQLYGQLEGQPLDTLSPEQLSHK 395



 Score = 38.5 bits (88), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 46/112 (41%), Gaps = 7/112 (6%)

Query: 428 YTQLSSSLLERKISYVMSPNLKATDE---PICLGCHKPLNPNLADNARCPRCFWPACSAR 484
           Y   +  L   +++ V  P ++   +   P+CLGC K +  + A   RC RC WP C   
Sbjct: 17  YVAANVDLAAWQVALVERPLIRGPSQVTRPVCLGCLKSITADSA--VRCERCGWPMCPD- 73

Query: 485 CSGLSDAHTHAPECAILKLGCETLLAYNDYKSLSRAYLDHQQVRSWTQSSDN 536
            S  SD   H  EC       +  +   D+ +   +Y     +R   Q  +N
Sbjct: 74  -SNCSDDEWHKAECDWTVTRRKQKIEIKDFTNPHPSYQSITVIRCLYQKHNN 124


>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
 gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
          Length = 507

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 132/441 (29%), Positives = 202/441 (45%), Gaps = 53/441 (12%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A         +  E P  VGPK      +CLGC++  + +  +   C 
Sbjct: 14  VQWSPVCGRYLVARGAAVGHGLLIEELPFAVGPKCN-GPVVCLGCYQ-FDAD-TEEEYCS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
            C WP CS  C+     H    EC +L+                 + + ILPLR L+ + 
Sbjct: 71  ECGWPLCS-ECAQDEGNHHFRAECRVLQDARARFFPLPKGSSHCPQLDCILPLRVLLAKE 129

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
             P+++  +++ ME H  +R    +V+  +  +I +YLR       P  +    S + + 
Sbjct: 130 TDPERWDSQVEPMEHHEEERKEDADVWHADRVNIAQYLRG------PCKLSSRFSEELIM 183

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHS-------NMMQSKLFVF 278
            + G++EVN  +  R T+G     LY    ++ HNC+PN   S        +    +   
Sbjct: 184 QVVGILEVNAFE-ARTTRGFPLRCLYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDL 242

Query: 279 RDTH-ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
           +D   +   YT  L GT  R+ HL   KYF+C+CERC DPTELGT+FS++KC       G
Sbjct: 243 KDGQPLHHSYTYTLDGTAQRQRHLKQGKYFSCTCERCQDPTELGTHFSSLKC-------G 295

Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE- 393
            C   + +P  P D+DS W C +C A  +  +V  +   L  +V+K+       K LEE 
Sbjct: 296 QCTEGYQVPRQPTDSDSSWNCNNCGASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEA 355

Query: 394 --MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY--------- 442
             +L K +++ HP H+    ++  LI++YG   GY   QL    LERK+           
Sbjct: 356 ERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPEHQLERKVELCRQVLGVLN 415

Query: 443 VMSPNLKATDEPICLGCHKPL 463
           V  P L  T        H PL
Sbjct: 416 VFEPGLSRTRAMNLYEMHVPL 436


>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
          Length = 486

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 152/526 (28%), Positives = 236/526 (44%), Gaps = 100/526 (19%)

Query: 7   IEDEELMR----CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
           +EDE+  R    C VC+  A  +CS C  V YC  +HQ++HWK HK  C   P+++ S+ 
Sbjct: 2   LEDEDSDRIEKICPVCQVEATKRCSRCALVYYCSVEHQQQHWKKHKATCN--PFKVFSNE 59

Query: 63  LLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
             GR+L AT ++  G+ + +E PL+ GP   +  PI +                      
Sbjct: 60  QFGRFLVATREIRAGEVVIKEPPLVHGPA-QITGPIAI---------------------- 96

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
                       H + P                   Y+ ++PLRCL+L    P K++ L+
Sbjct: 97  -----------QHFYVPHPM----------------YQCLMPLRCLLLAESDPAKWELLQ 129

Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD 242
            +E+H  +R  G+E++      V  L   F +   GD  D+     +  I G+I+VNG +
Sbjct: 130 QLESHEEERR-GSELWRNDREGVAKLIPRFFKT--GDRWDEDE---ILRIVGIIQVNGHE 183

Query: 243 IGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWG 293
           I      +  +Y+   ++EH+C PN   S   + ++ ++      R   +S  YT+A+W 
Sbjct: 184 IPLSVPSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWA 243

Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDN--D 350
           T  R EHL  TK F C CERCSDPTE  TYFSA++C    KD     +ILPV+  DN   
Sbjct: 244 TGNRLEHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQKDSKCKGYILPVDS-DNWTT 302

Query: 351 SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE-----NPNVKSLEEMLTKLEAMFHPH 405
            DW+C  C   +    V  +  +    ++ + +      N  +K   + LT       P+
Sbjct: 303 GDWSCQKCRGVVTGAAVGQILERARMDLEAMQKHREDHCNKYMKHYSKWLT-------PN 355

Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC------LGC 459
           H +   VK SL Q+ G +   A  ++    L  KI   +  +L    E IC       G 
Sbjct: 356 HQYLVDVKISLSQVIGGKAPEAIKKIPYEKLMSKIK--ICQDLLTLFEKICPAEARAFGA 413

Query: 460 HK-PLNPNLADNA-RCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
            +  L+  LA+ A R      PAC+   S L ++  +A EC  + L
Sbjct: 414 TRFELHAALAELARRSTESNNPACT---SQLEESLYNANECVRMLL 456


>gi|194889892|ref|XP_001977181.1| GG18383 [Drosophila erecta]
 gi|190648830|gb|EDV46108.1| GG18383 [Drosophila erecta]
          Length = 402

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 113/379 (29%), Positives = 189/379 (49%), Gaps = 37/379 (9%)

Query: 37  KQHQKEHWKLHKPKC--KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLAL 94
           KQ Q++  +   P    K+ PY ++ S + GRYL A+  L  G+ + RE PL +GP ++ 
Sbjct: 27  KQEQQDKERSASPTAEGKEQPYRVEHSDIYGRYLVASRQLEAGETLIREEPLAIGPCVS- 85

Query: 95  AEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAIL----KLGCE 149
            +P+CLGC++P++   +D  RCP C WP C + C+GL   H H   EC +      +  E
Sbjct: 86  GDPVCLGCYQPVSLK-SDQYRCPGCAWPLCGSACAGLKQRHGHTETECQLYGERRAVAGE 144

Query: 150 TLL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS- 203
            L      A     YE I+ +R L+L++  P+++  +  ME+H  +R     ++   +  
Sbjct: 145 LLTERAGPAEVRDLYELIMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEK 204

Query: 204 IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHN 260
           +V  LR  +        L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+
Sbjct: 205 VVHRLRVTWQ-------LEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHD 257

Query: 261 CLPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCER 313
           C PN  H++  +S   + R +        ++  Y   L GT  RR  +   K F C C R
Sbjct: 258 CTPNTAHTDDPRSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRR 317

Query: 314 CSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ 373
           CSDP ELGT  SA+ C           +  V+PL+   DW C  CS ++ A +V     +
Sbjct: 318 CSDPRELGTDCSALVCATCRTGS----VRAVDPLEQTGDWACDRCSHKMGATEVERQLDR 373

Query: 374 LGEQVDKL-VQENPNVKSL 391
           +   ++ + V + P +++ 
Sbjct: 374 INNDLEDIDVHDIPGLENF 392



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 6/92 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS I  YG     +    +   L R+    + P +  + +P+CLGC++P++   +D
Sbjct: 48  YRVEHSDI--YGRYLVASRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-SD 102

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
             RCP C WP C + C+GL   H H   EC +
Sbjct: 103 QYRCPGCAWPLCGSACAGLKQRHGHTETECQL 134


>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
 gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
          Length = 458

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 121/414 (29%), Positives = 200/414 (48%), Gaps = 59/414 (14%)

Query: 63  LLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
           L GRYL A  DL  G  +  E P++ GP    +  +CLGC   ++ +     RC  C WP
Sbjct: 4   LTGRYLVANGDLPMGHLVISEEPILTGPSYTTS-ALCLGCFDSVDGSY----RCKGCNWP 58

Query: 123 ACSARCSGLSDAH----TH---APECAILKLGCETLLAYNDY-----KYEAILPLRCLIL 170
            C+ +C           TH   +  CA+LK    T +   D+      Y+ I PLRCL+L
Sbjct: 59  LCNRKCETAGSHQLEVCTHDYSSLSCAMLKT---TDIDITDFVGPNTSYQFITPLRCLLL 115

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           + R P +++ +  ME+H   +        E+  ++V+ +R           +D+ S   +
Sbjct: 116 KSRDPARWKLISGMESHTEDQTREYRWGMEMGRNVVRVIRQTL-------GMDEFSEHEI 168

Query: 230 HWICGVIEVNGVDIGRYTQG-------------LYSVICLMEHNCLPN------AKHSNM 270
             +CG++ VN  ++  +T G             +Y    LM H+CL N      +K+   
Sbjct: 169 MKVCGILTVNAFEL--FTTGPNKDRDSTPRAWAVYPTTYLMNHDCLANTLTSIDSKNKMH 226

Query: 271 MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
           +++++ V +   ++  Y+  LWGT+ RR  L   KYF CSC+RC DPTELG++ S+ +C 
Sbjct: 227 VRTRMPVKKGEALTAEYSECLWGTEIRRHQLHRYKYFWCSCQRCRDPTELGSFISSHRCT 286

Query: 331 NEHKDQGDCWILPVNPLD-NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQ---ENP 386
           N   +     ++   PLD N + + C SC   +    +     Q+ E+V +L     E P
Sbjct: 287 NCGGN-----VVATKPLDFNSTGFRCESCRVIIRGEVLKGTNHQVEEEVFELESKELETP 341

Query: 387 NVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            ++  E+++ + E++ HP+HY     K  L   YG QPGY+  +LS   L +K+
Sbjct: 342 -IEEYEDLVERHESILHPNHYAMLRCKKFLAHAYGRQPGYSLAELSEDRLNKKM 394


>gi|24641164|ref|NP_727478.1| CG11160, isoform B [Drosophila melanogaster]
 gi|22832071|gb|AAN09278.1| CG11160, isoform B [Drosophila melanogaster]
          Length = 403

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 188/378 (49%), Gaps = 36/378 (9%)

Query: 37  KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
           KQ Q++  +   P + K+LPY ++ S + GRYL A   L  G+ + RE PL +GP ++  
Sbjct: 29  KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87

Query: 96  EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
           +P+CLGC+ P++   AD  RCP C WP C + C+GL   H H   EC +      +  E 
Sbjct: 88  DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146

Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-I 204
           L      A     YE ++ +R L+L++  P+++  +  ME+H  +R     ++   +  +
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKV 206

Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNC 261
           V+ LR  +        L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+C
Sbjct: 207 VQRLRVTWQ-------LEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDC 259

Query: 262 LPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
            PN  H++   S   + R +        ++  Y   L GT  RR  +   K F C C RC
Sbjct: 260 TPNTAHTDDPSSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRC 319

Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
           SDP ELGT  SA+ C           +  V+PL    DW C  C+ ++ A +V     ++
Sbjct: 320 SDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRI 375

Query: 375 GEQVDKL-VQENPNVKSL 391
              ++ + V + P +++ 
Sbjct: 376 NNDLEDIDVHDIPGLENF 393



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS I  YG          +   L R+    + P +  + +P+CLGC+ P++   AD
Sbjct: 49  YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
             RCP C WP C + C+GL   H H   EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135


>gi|20152089|gb|AAM11404.1| RE25548p [Drosophila melanogaster]
          Length = 403

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 188/378 (49%), Gaps = 36/378 (9%)

Query: 37  KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
           KQ Q++  +   P + K+LPY ++ S + GRYL A   L  G+ + RE PL +GP ++  
Sbjct: 29  KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87

Query: 96  EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
           +P+CLGC+ P++   AD  RCP C WP C + C+GL   H H   EC +      +  E 
Sbjct: 88  DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146

Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-I 204
           L      A     YE ++ +R L+L++  P+++  +  ME+H  +R     ++   +  +
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKV 206

Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNC 261
           V+ LR  +        L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+C
Sbjct: 207 VQRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDC 259

Query: 262 LPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
            PN  H++   S   + R +        ++  Y   L GT  RR  +   K F C C RC
Sbjct: 260 TPNTAHTDDPSSFEILLRTSRRVREREALTLGYAYTLQGTLKRRAFMHEGKLFWCCCRRC 319

Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
           SDP ELGT  SA+ C           +  V+PL    DW C  C+ ++ A +V     ++
Sbjct: 320 SDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRI 375

Query: 375 GEQVDKL-VQENPNVKSL 391
              ++ + V + P +++ 
Sbjct: 376 NNDLEDIDVHDIPGLENF 393



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)

Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
           Y V+HS I  YG          +   L R+    + P +  + +P+CLGC+ P++   AD
Sbjct: 49  YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103

Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
             RCP C WP C + C+GL   H H   EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135


>gi|118776986|ref|XP_307207.3| AGAP011523-PA [Anopheles gambiae str. PEST]
 gi|116133074|gb|EAA03232.3| AGAP011523-PA [Anopheles gambiae str. PEST]
          Length = 473

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/396 (31%), Positives = 180/396 (45%), Gaps = 40/396 (10%)

Query: 66  RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACS 125
           RY+ A  DL  GD I    P +VGP  A  EP+CLGCH    P  +   RC  C W  CS
Sbjct: 1   RYIVAARDLKAGDVIIETPPFVVGP-CAETEPVCLGCHSAFQPG-SVVYRCEICNWRICS 58

Query: 126 ARCSGLSDAHTHAP-ECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKY 178
             C GL  A +H   EC  L+          +  A     YEAIL LRC++L+     +Y
Sbjct: 59  PSCQGLVPAGSHRQLECIPLRDKAVHNHLQTSPAAQVKLMYEAILTLRCMLLKTVDRPQY 118

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAKCLHWICGVIE 237
             L  M+       P  +V ++I  +    +   +E++       + S   LH ICG+IE
Sbjct: 119 DRLLGMD-------PLNDVRQQIPKLWNRNQKEIVERIRNQWGFAEYSELELHTICGIIE 171

Query: 238 VNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQS---KLFVFRDTH----ISTMY 287
           VN  ++G+     + L+    L+ H+C PN  H++  Q+    + V RD      ++  Y
Sbjct: 172 VNAFEVGQEPTKARALFPEAYLLMHDCTPNTGHTDAPQTHHLTVRVLRDVKAGEPLTLTY 231

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL----NEHKDQGDCWILP 343
            + L GT  RR+HL   K+F+C+C RC+DP+ELGT  SA++C       H   G      
Sbjct: 232 AHILQGTLKRRQHLREEKFFSCNCRRCTDPSELGTNCSALRCTKCPKGTHTADGSAQ--- 288

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
                      C  CS  ++   V  +                +V  LE  L    A+ H
Sbjct: 289 ----PRKRKCHCRQCSTVMSCEAV--ILLLEQLSQQLETIGGNDVDGLETFLRVQGAVLH 342

Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            +HY   +VKHSL +LYG   G+   QLS   L+RK
Sbjct: 343 DNHYLLLSVKHSLCELYGKIEGFLIPQLSREQLKRK 378



 Score = 41.6 bits (96), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH 494
           +V+ P   A  EP+CLGCH    P  +   RC  C W  CS  C GL  A +H
Sbjct: 21  FVVGPC--AETEPVCLGCHSAFQPG-SVVYRCEICNWRICSPSCQGLVPAGSH 70


>gi|357606572|gb|EHJ65115.1| hypothetical protein KGM_22194 [Danaus plexippus]
          Length = 482

 Score =  165 bits (417), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 184/385 (47%), Gaps = 43/385 (11%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLA-LAEPICLGCHKPLNPNLADNA 114
           Y++K +  LGRYL A  +L  GD I  + P  VGPK      P+CL C+ P+     ++ 
Sbjct: 7   YDVKRNVKLGRYLVANAELGCGDLIFTDYPFAVGPKPGKYTPPLCLSCYCPI-----ESK 61

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA-----ILPLRCLI 169
            C RC WP CSA C     +  H PEC++            D+   A     + PLR L+
Sbjct: 62  YCSRCSWPICSAECEL---SPNHQPECSVFSKAKIKFQPVEDWTVSAPQLDCVTPLRLLV 118

Query: 170 LQRRSPKKY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
            + + P ++ +E++ ME H  +R   +    +  +IV YL            +D     C
Sbjct: 119 AKEQDPDRWSKEVQAMETHTEQRRKRSTWKADQINIVNYL------------VDHCKLNC 166

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-------MQSKLFVFRDT 281
                 V +V     G   + +Y  + +  H+C+PN  HS +       +++ + + +  
Sbjct: 167 RFSKELVEQVCASRGGFSIRAVYPRLAIAAHSCVPNIVHSILQPDYRVEVRAAVPLQKGQ 226

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
            +   YT+ L  T  RREHL  +K+F+C C RC+DPTELGT+ S  KC    K      +
Sbjct: 227 RLHLSYTHVLSPTILRREHLRESKFFDCDCPRCTDPTELGTHLSTFKCSKCKKG----IV 282

Query: 342 LPVNPLDNDSDWTCG--SCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE---MLT 396
           L  NPLD ++ W C    C  R ++  +H + S L +++D L   +   +++E+    + 
Sbjct: 283 LSKNPLDKEASWNCSENDCDFRTSSAVMHKLLSDLQDELDSLDSLDTASEAVEQREAFIN 342

Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYG 421
           K + + H  H     VKH+L+QLYG
Sbjct: 343 KYQLILHHRHSFMLCVKHALVQLYG 367



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 24/93 (25%)

Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
           + + P       P+CL C+ P+     ++  C RC WP CSA C     +  H PEC++ 
Sbjct: 37  FAVGPKPGKYTPPLCLSCYCPI-----ESKYCSRCSWPICSAECEL---SPNHQPECSV- 87

Query: 502 KLGCETLLAYNDYKSLSRAYLDHQQVRSWTQSS 534
                           S+A +  Q V  WT S+
Sbjct: 88  ---------------FSKAKIKFQPVEDWTVSA 105


>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
 gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
 gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
 gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
 gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
          Length = 507

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 123/410 (30%), Positives = 197/410 (48%), Gaps = 48/410 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A   +     +  E P  VGPK      +CLGC++P NP+  +   C 
Sbjct: 14  VQWSPVCGRYLVAKGAIRGHGLLIEELPFAVGPKCN-GPVVCLGCYEP-NPD-PEEELCS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
            C WP C   C+  +D      EC+ LK                 + + I+PLR L+ + 
Sbjct: 71  ECGWPLC-VECAQQADNAHFRLECSQLKDARARFFRLPSGSRHCPQLDCIMPLRVLLAKE 129

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +P+++  E+  ME H  +R    +V+  +  +I +YLR       P  + +  S + + 
Sbjct: 130 ANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRG------PCQLANRFSEELIM 183

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R   +  
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R++HL   K+F C CERC DPTELGT+FS++KC       
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC------- 294

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
           G C   + +P  P + D+ W C +C +  +  D   +   L  +V+  VQ  P     ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWNCANCGSDTSNADALAMLQSLQSEVNA-VQALPMAAKRLE 353

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            +E +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 354 EIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403


>gi|195432146|ref|XP_002064087.1| GK19981 [Drosophila willistoni]
 gi|194160172|gb|EDW75073.1| GK19981 [Drosophila willistoni]
          Length = 390

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/398 (28%), Positives = 187/398 (46%), Gaps = 49/398 (12%)

Query: 36  GKQHQKEHWKLHKPKCKKL----------------PYEIKSSPLLGRYLQATLDLHPGDR 79
           GK+HQ+ H    +P  K                   + I+ S + GRYL A   L PG+ 
Sbjct: 6   GKKHQRHHQTQRQPHQKAEVDEVQVEEEEEPLVIPAFRIEKSEIYGRYLVANRQLEPGEL 65

Query: 80  IARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-A 138
           +  E PL +GP ++  +P+CLGC+ P+  + A   RC  C WP C ++C GL  A  H A
Sbjct: 66  LITEQPLAIGPCVS-GDPVCLGCYSPVTLD-ASQYRCSACGWPLCGSKCKGLDTAVGHTA 123

Query: 139 PECAILK---------LGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
            EC +L          L   T        YE I+ +R L+L++ +P++Y  +  ME+H  
Sbjct: 124 AECKVLAGRRSLAGDLLTLGTAPGQVRDLYELIMIVRILLLRQNAPEQYALVMRMESHSE 183

Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---Y 246
           +R     +++  +  V        +   GD+ +   A+ LH ICG+++VN  +IG+    
Sbjct: 184 ERRKNATLWQHYEEKVVQRLIELWQM--GDLAE---AQQLHEICGILDVNCFEIGQNGAK 238

Query: 247 TQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
            + LY    L+ H+C PN  H++       ++++   V     ++  Y   L GT  RR 
Sbjct: 239 ARTLYPSAFLLAHDCSPNTAHTDDPGTYAILLRTSRRVHEQEALTLSYAYTLQGTLKRRS 298

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
            +   K F C C RC+DP ELGT  SA+ C +         I  V PL   +DW C  C 
Sbjct: 299 FMHEGKLFWCQCRRCADPKELGTDCSALACRSCQLGS----IRAVQPLKQTADWACDRCK 354

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
            +++A++V     ++   ++ +  +  ++  LE  L +
Sbjct: 355 RQVSAQEVEHQLDRINNDLEAI--DVHDIPGLENFLIR 390



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 2/53 (3%)

Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-APECAIL 501
            + +P+CLGC+ P+  + A   RC  C WP C ++C GL  A  H A EC +L
Sbjct: 78  VSGDPVCLGCYSPVTLD-ASQYRCSACGWPLCGSKCKGLDTAVGHTAAECKVL 129


>gi|170061020|ref|XP_001866058.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879295|gb|EDS42678.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 494

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 124/413 (30%), Positives = 196/413 (47%), Gaps = 46/413 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-- 115
           + ++  LGR+L AT  +  G+ I RESP ++GPK A + P+CLGCH+ L+P   D  +  
Sbjct: 8   LSTNATLGRHLVATRPIRSGEVIFRESPTVLGPKTA-SVPLCLGCHRNLDPITTDAGKKY 66

Query: 116 --CPRCFWPACSARCSGLSDAHTHAPECAIL-------KLGCETLLAYNDYK-YEAILPL 165
             C  C WP CS  C     +  H  EC +        ++  + L+    +  Y AI+PL
Sbjct: 67  YNCQHCGWPLCSPSCE---TSCYHRDECQLFASKSYRPQIRFDELVPSKKHSGYCAIVPL 123

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDT 224
           R ++L+R+ P ++ +L  +E+H+  R   T +Y  + S +V ++R         ++ ++ 
Sbjct: 124 RAILLKRKEPARFAKLATLESHVETRQ-STPLYAAVQSNLVPFVRDVL------NLRNEV 176

Query: 225 SAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKH------SNMMQS 273
           S   L  I G+ + N  +I     G   + LY +  +M H C PN KH      + +M +
Sbjct: 177 SVGQLMEIAGIFDTNSYEIRIPERGIKIRALYELGAMMAHCCQPNTKHFFDEELNLVMIA 236

Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
            + + +D  IS  Y   L  T  RR  +   K F C C RCSDPTE  TY  ++ C    
Sbjct: 237 AVDIPKDEMISISYAQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCPQCS 296

Query: 334 KDQGDCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
           K      ++ VNPLD  SDW C    CS R +A+           ++ +L          
Sbjct: 297 KS----MVVAVNPLDFRSDWRCEDKKCSYRESAQQYITRNEAFRAEIGQL-SSGAGPDGY 351

Query: 392 EEMLTKLEA--MFHPHHYHCYAVKHSLIQLYGTQPGYAYT--QLSSSLLERKI 440
           E +L + ++  + HP + +   VK++L QLYG           LS   L RK+
Sbjct: 352 ECLLGRFQSTDVLHPWNTNVLQVKYALTQLYGGGGPGGINLKDLSEKQLRRKV 404



 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)

Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNAR----CPRCFWPACSARC 485
           S  ++ R+   V+ P  K    P+CLGCH+ L+P   D  +    C  C WP CS  C
Sbjct: 26  SGEVIFRESPTVLGP--KTASVPLCLGCHRNLDPITTDAGKKYYNCQHCGWPLCSPSC 81


>gi|195472006|ref|XP_002088293.1| GE18493 [Drosophila yakuba]
 gi|194174394|gb|EDW88005.1| GE18493 [Drosophila yakuba]
          Length = 507

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 121/410 (29%), Positives = 196/410 (47%), Gaps = 48/410 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GR+L A         +  E PL VGPK      +CLGC++P +P+  +   C 
Sbjct: 14  VQWSPVCGRFLVAKGATLGHGLLIEELPLAVGPKCN-GPVVCLGCYQP-DPDPVEGL-CS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
            C WP C   CS  +D   +  EC  L+                 + + I+PLR L+ + 
Sbjct: 71  ECGWPVCE-ECSQQADNTHYQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE 129

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +P+++  E+  ME H  +R    +V+  +  +I +YLR       P  +    S + + 
Sbjct: 130 ANPERWDNEVAPMEHHEEERKKDADVWHADRVNIAQYLRG------PCKLASRFSEELIM 183

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R   +  
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R++HL   K+F C C+RC DPTELGT+FS++KC       
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSLKC------- 294

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
           G C   + +P  P + D+ W C SC +  +  +   +   L  +V+  VQ  P     ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWNCASCGSDTSNVEALTLLQSLQSEVNA-VQALPMAAKRLE 353

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            +E +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 354 EVERLLRKYKSLLHPQHFIATGLRQLLIEMYGRVKGYEMVQLPDHQLERK 403


>gi|225713426|gb|ACO12559.1| msta, isoform A [Lepeophtheirus salmonis]
          Length = 505

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 136/447 (30%), Positives = 211/447 (47%), Gaps = 47/447 (10%)

Query: 15  CAVCRETALHKCSACKEVAYCGK-QHQKEHWKLHKPKCKK-LPYEIKSSPLLGRYLQATL 72
           C  C   A   C+ C  V++C +  H+  H+   K +     P+ +K +  LGR+L AT 
Sbjct: 7   CFYCGNPATASCTTCGLVSFCSEPSHKALHYGHKKDETSFCFPFAVKRNSQLGRHLVATR 66

Query: 73  DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
           D+   + I  ESPL++GP ++ +   CL C K L+        C  C +P CS  CS   
Sbjct: 67  DIKALEVILEESPLVIGPYIS-SNNQCLECFKTLD---TSPHSCISCNYPFCSQECS--- 119

Query: 133 DAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE-LKDMEAHMSK 190
               H  EC A  K G +     ND+ Y  I  LR L LQ   P+ +   +KDM  H S 
Sbjct: 120 QGPWHKIECSAFQKYGHKRDENDNDF-YSIITTLRLLSLQESKPEFWNTIMKDMLDHNSD 178

Query: 191 RGPGT----EVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--- 243
               T      YEE  ++VK ++S  L K  G+   DT    L  + GVI+ NG D+   
Sbjct: 179 IQEYTPEKWTSYEE--TVVKKIQS--LTK--GNADTDT----LTRLIGVIDTNGADLNLP 228

Query: 244 --GRYTQG--LYSVICLMEHNCLPNAKHSNM------MQSKLFVFRDTHISTMYTNALWG 293
             G++ +G  LY +   + H+C+ N K  N       M++ + + +   I   Y      
Sbjct: 229 HDGKHGKGTGLYLIYSNLNHSCVCNTKTINFQDRRIEMRASVNISKGEQICNQYVRPNKC 288

Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDW 353
           T  RR  +    YF+C C+RC+DPTE G+Y  A+ C           +LP+NPL+ +S+W
Sbjct: 289 TLARRTTIKNKWYFDCCCKRCADPTECGSYLGALLC-----SSCKSVVLPINPLETESNW 343

Query: 354 TCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVK 413
            C  CS   +   V  V   +GE +++    +PN  S E++L   + + HP+HY     +
Sbjct: 344 ACKGCSKEYDVEYVMGVLISVGESLEEATPYDPN--SYEKILYDTQDLLHPNHYLRIEAE 401

Query: 414 HSLIQLYGTQPGYAYTQ-LSSSLLERK 439
             +  L G  P  ++++ +    +ERK
Sbjct: 402 LKVAALIGNIPTSSFSKGMPRPAVERK 428


>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
 gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
          Length = 507

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 123/409 (30%), Positives = 194/409 (47%), Gaps = 46/409 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A         +  E PL VGPK      +CLGC++  +PN  +   C 
Sbjct: 14  VQWSPVCGRYLVAKGATLGHGLLIEELPLAVGPKCN-GPVVCLGCYE-TDPN-PEEELCS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
            C WP C A CS  +D      EC  L+                 + + I+PLR L+ + 
Sbjct: 71  ECGWPLC-AECSEQADNAHFQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE 129

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +P+++  E+  ME H  +R    +V+  +  +I +YLR       P  +    S + + 
Sbjct: 130 ANPERWDSEVAPMEHHEEERKKDADVWHADRVNIAQYLRG------PCKLASRFSEELIM 183

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R   +  
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R++HL   K+F C C+RC DPTELGT+FS++KC       
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSLKC------- 294

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           G C   + +P  P + D+ W+C SC +  +  +   +   L  +V+ +       K LEE
Sbjct: 295 GQCAEGYQVPRQPTEADTSWSCASCGSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEE 354

Query: 394 ---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
              +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 355 VERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403


>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
 gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
          Length = 507

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 120/410 (29%), Positives = 195/410 (47%), Gaps = 48/410 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A         +  E P  +GPK      +CLGC++  +P+  +   C 
Sbjct: 14  VQWSPVCGRYLVAKGATRGHGLLIEELPFAIGPKCN-GPVVCLGCYE-TDPD-PEEELCS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
            C WP C   C+  +D      EC  L+                 + + I+PLR L+ + 
Sbjct: 71  ECGWPLC-VECAQQADNAHFQLECRELRDARARFFRLPSGSSHCPQLDCIMPLRVLLAKE 129

Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +P+++  E+  ME H  +R    +V+  +  +I +YLR       P  + +  S + + 
Sbjct: 130 ANPERWDNEVAPMEHHKEERQKDADVWHADRVNIAQYLRG------PCQLANRFSEELIM 183

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R   +  
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTARS-IYPSEGYKIRLRAMVD 241

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R++HL   K+F C CERC DPTELGT+FS++KC       
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC------- 294

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
           G C   + +P  P + D+ W+C SC +  +  D   +   L  +V+  VQ  P     ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWSCASCGSETSNVDALAMLQSLQSEVNA-VQALPMAAKRLE 353

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            +E +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 354 EIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403


>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
 gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
          Length = 503

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 197/416 (47%), Gaps = 48/416 (11%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   +LGR L  +  +  G+ +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLQEDAVLGRCLATSSSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLA--YNDY---KYEAILPLR 166
           +  RC  C WP C + C+   D   H  EC +      T      +D    + + I  LR
Sbjct: 61  SLDRCELCDWPLCGS-CADDEDVTEHRGECQVFSAARVTFAGNVSDDGVCPQLDCITILR 119

Query: 167 CLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDT 224
            L+ +  +P+++  E+  ME H  +R    +V+  +  +I +YLR       P  + +  
Sbjct: 120 VLLAKEANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRG------PCQLANRF 173

Query: 225 SAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
           S + +  + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R
Sbjct: 174 SEELIMQVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIR 231

Query: 280 DTHISTM---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
              +  +         YT  L GT  R++HL   K+F C CERC DPTELGT+FS++KC 
Sbjct: 232 LRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC- 290

Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP- 386
                 G C   + +P  P + D+ W C +C +  +  D   +   L  +V+  VQ  P 
Sbjct: 291 ------GQCAEGFQVPRQPTEPDTSWNCANCGSDTSNADALAMLQSLQSEVNA-VQALPM 343

Query: 387 ---NVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
               ++ +E +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 344 AAKRLEEIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 399


>gi|125986939|ref|XP_001357232.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
 gi|54645563|gb|EAL34301.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 124/409 (30%), Positives = 195/409 (47%), Gaps = 45/409 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ S + GRYL A         +  E P  +GPK      +CLGC +P  P       C 
Sbjct: 14  VQWSAVCGRYLVAQESTPARSLLIEELPFAIGPKCN-GPVVCLGCCRP-APATDAEELCV 71

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCET---LLAYNDY--KYEAILPLRCLILQR 172
            C WP C   CS   D+     EC   ++       L + + Y  + + ILPLR L+ + 
Sbjct: 72  LCGWPLCE-ECSENEDSPHVQLECREFQISGARFYRLPSGSSYCPQLDCILPLRVLLAKE 130

Query: 173 RSPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +P+++  E+  ME H S+R    +V+  ++ +I +YLR       P  +    S + + 
Sbjct: 131 ANPRRWDLEVAPMEHHESERRLNEDVWHADLVNIAQYLRG------PCKLASRFSEELIM 184

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     LY    ++ H+C+PN   S +  S+ F  R   +  
Sbjct: 185 QVVGVLEVNAFE-ARTARGYPLRCLYPYTGILAHSCVPNTARS-IYPSEGFKIRLRAMVD 242

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R+ HL   K+F C+CERC DPTEL T+FS++KC       
Sbjct: 243 LDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLKC------- 295

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKS 390
           G C   ++ P  P + D+ W C SC A  +  DV      +  +V  +  ++  P+ ++ 
Sbjct: 296 GQCAEGFLAPKQPTEVDTSWNCCSCGASTSNVDVETTIKSIQSEVSGVQALEMGPHRLEE 355

Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            E +L K +++ HP H+    ++  LI++YG    Y   QLS  LLERK
Sbjct: 356 AERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQRYEMVQLSDDLLERK 404


>gi|195146636|ref|XP_002014290.1| GL19122 [Drosophila persimilis]
 gi|194106243|gb|EDW28286.1| GL19122 [Drosophila persimilis]
          Length = 508

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 124/409 (30%), Positives = 195/409 (47%), Gaps = 45/409 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ S + GRYL A         +  E P  +GPK      +CLGC +P  P       C 
Sbjct: 14  VQWSAVCGRYLVAQESTPARSLLIEELPFAIGPKCN-GPVVCLGCCRP-APATDVEELCV 71

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCET---LLAYNDY--KYEAILPLRCLILQR 172
            C WP C   CS   D+     EC   ++       L + + Y  + + ILPLR L+ + 
Sbjct: 72  LCGWPLCE-ECSENEDSPHVQLECREFQISGARFYRLPSGSSYCPQLDCILPLRVLLAKE 130

Query: 173 RSPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            +PK+++ E+  ME H S+R    +V+  ++ +I +YLR       P  +    S + + 
Sbjct: 131 ANPKRWEVEVAPMEHHESERRLNEDVWHADLVNIAQYLRG------PCKLASRFSEELIM 184

Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
            + GV+EVN  +  R  +G     LY    ++ H+C+PN   S +  S+ F  R   +  
Sbjct: 185 QVVGVLEVNAFE-ARTARGYPLRCLYPYTGILAHSCVPNTARS-IYPSEGFKIRLRAMVD 242

Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT  L GT  R+ HL   K+F C+CERC DPTEL T+FS++KC       
Sbjct: 243 LDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLKC------- 295

Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKS 390
           G C   ++    P + D+ W C SC A  +  DV      +  +V  +  ++  P+ ++ 
Sbjct: 296 GQCAEGFLASKQPTEVDTSWNCCSCGASTSNVDVETTIKSIQSEVSGVQALEMGPHRLEE 355

Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
            E +L K +++ HP H+    ++  LI++YG    Y   QLS  LLERK
Sbjct: 356 AERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQRYEMVQLSDDLLERK 404


>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
          Length = 423

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 108/329 (32%), Positives = 167/329 (50%), Gaps = 47/329 (14%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---------- 106
           ++  +  LGRY+ AT ++  G+ I R++P +VGPK+A + P+CLGC++ L          
Sbjct: 13  QVVENACLGRYVVATRNIRQGEVIFRDAPAVVGPKMA-SVPVCLGCNRDLMATATVRQQG 71

Query: 107 NPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG--------CETLLAYNDYK 158
           +PN   + +C RC WP C A C     A  H  EC+I+                A  +  
Sbjct: 72  DPNFRFH-QCSRCGWPVCGAECEA---ADQHRSECSIMAGSSYRPKICPAREEQARRESA 127

Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP 217
           Y  I+PLR L+LQ  SP  Y +L+ +E+H+++R   + +YE + S +V ++RS       
Sbjct: 128 YCVIVPLRVLLLQSHSPDTYAQLQRLESHVAERL-KSPLYEVVRSNLVPFIRSVL----- 181

Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIG---RYTQ--GLYSVICLMEHNCLPNAKH----- 267
              L   SA+ +  +C +++ N  +I    R T+   LY +  ++ H+C PN KH     
Sbjct: 182 --GLQQYSAETILEVCAMLDTNCFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDA 239

Query: 268 -SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
              ++ + + + R   IST YT  L GT  RR  L  +K+F+C CERC+DPTELGT  S 
Sbjct: 240 LRMVLVATVDIERGATISTSYTQPLLGTLHRRLALKQSKHFDCQCERCTDPTELGTNLSG 299

Query: 327 MKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
            +C    K      +L   P    S W C
Sbjct: 300 FRCPVCRKG----IVLSAQPCSPQSVWRC 324



 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 16/77 (20%)

Query: 435 LLERKISYVMSPNLKATDEPICLGCHKPL----------NPNLADNARCPRCFWPACSAR 484
           ++ R    V+ P + +   P+CLGC++ L          +PN   + +C RC WP C A 
Sbjct: 35  VIFRDAPAVVGPKMASV--PVCLGCNRDLMATATVRQQGDPNFRFH-QCSRCGWPVCGAE 91

Query: 485 CSGLSDAHTHAPECAIL 501
           C     A  H  EC+I+
Sbjct: 92  CEA---ADQHRSECSIM 105


>gi|195350806|ref|XP_002041929.1| GM11450 [Drosophila sechellia]
 gi|194123734|gb|EDW45777.1| GM11450 [Drosophila sechellia]
          Length = 462

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 121/447 (27%), Positives = 190/447 (42%), Gaps = 94/447 (21%)

Query: 37  KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
           KQ Q+E  +   P + K+LPY ++ S + GRYL A   L  G+ + RE PL +GP ++  
Sbjct: 29  KQEQQEKEQSTSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87

Query: 96  EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN 155
           +P+CLGC++P++    D  RCP C WP                                 
Sbjct: 88  DPVCLGCYQPVSLQ-PDQYRCPGCAWP--------------------------------- 113

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNF 212
                  +P          P+++  +  ME+H  +R     +   YEE   +V+ LR  +
Sbjct: 114 -------IP---------DPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRGTW 155

Query: 213 LEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQ 272
                   L+D  A+ +H I          IG   + LY    L+ H+C PN  H++  +
Sbjct: 156 -------QLEDLEAEQVHEIG--------QIGVKVRTLYPSAFLLAHDCTPNTAHTDDPR 200

Query: 273 SKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
           S   + R +        ++  Y   L GT  RR  +   K F C C RCSDP ELGT  S
Sbjct: 201 SFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCS 260

Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
           A+ C           +  V+PL    DW C  C+ ++ A +V     ++   ++ +  + 
Sbjct: 261 ALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRINNDLEDI--DV 314

Query: 386 PNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY--- 442
            ++  LE  L +   +  P+HY   + K+SL Q+YG   GY   Q++   + RK SY   
Sbjct: 315 HDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMTPEDIARKESYCRE 374

Query: 443 ------VMSPNLKATDEPICLGCHKPL 463
                 V+ P L      I    H P+
Sbjct: 375 FLEIVDVLDPGLTRLRGLIMYELHAPV 401


>gi|239791604|dbj|BAH72247.1| ACYPI001902 [Acyrthosiphon pisum]
          Length = 241

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 72/154 (46%), Positives = 103/154 (66%), Gaps = 3/154 (1%)

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
           MYTNALWGT+ RR HL  TKYF C C+RCSD TELGT FS + C  +   +G+  + P++
Sbjct: 1   MYTNALWGTRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVCNVKGFCKGN--LTPIH 58

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
           PLD+ S+W C  C   +++  +  + ++L   VDK +Q NP++ SLE+  +KL+   H +
Sbjct: 59  PLDDSSEWECDRCPNTVSSDKIDAILTELNHIVDKALQ-NPSINSLEDAFSKLKTRIHSN 117

Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           HY C+ VKH+LIQLYG   GY ++ L+  LL+RK
Sbjct: 118 HYLCFNVKHTLIQLYGHSMGYKHSMLTDDLLKRK 151


>gi|322792382|gb|EFZ16366.1| hypothetical protein SINV_10209 [Solenopsis invicta]
          Length = 513

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 188/382 (49%), Gaps = 32/382 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++  S  LGRYL    ++ PG+ I RE P+ VGP +   +  C  C + L P +    +
Sbjct: 7   YKVAHSNKLGRYLMTAKNIAPGEVIIREEPIAVGPMMYKKDCFCFACMRTL-PKIGKGRQ 65

Query: 116 --CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
             C RC F P CS  C   ++ HT+  EC I K   + L          +L LR  +++ 
Sbjct: 66  YVCSRCNFAPLCSVACEVRTNHHTNE-ECQIFKDNRDLLTESGIDVARILLALRLWLIKH 124

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
           ++P  ++++  MEAH+ KR  GT V+ + + ++V  +R     ++P ++ +  SA+ +  
Sbjct: 125 KNPAVWEKIDRMEAHLDKR-IGTPVWNDREVNVVNVIRR---LRIPAEI-EPPSAELMQK 179

Query: 232 ICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVF 278
           +CG+++VN  ++       G   +GLY  + LM H+C  N       K    + + + + 
Sbjct: 180 LCGILDVNAFELRSPGALDGFLLRGLYIEVTLMAHDCRGNTHVTVDDKFQLTVYASVAIK 239

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM---KCLNEHKD 335
           +D  I   YT++L GT  R+++L   KYF C C  C+DP EL ++ S++   +C      
Sbjct: 240 QDETIFFNYTSSLLGTADRKDYLREGKYFECECSLCADPYELQSHLSSVLCPRCKKGFVG 299

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
             D  I  V+P +  S W C  C      R +    +     +D+   E+ ++K ++ +L
Sbjct: 300 MQDTSI--VDPYER-SRWQCQMCKKTYGGRLIRATLNICRTLIDEC--EDVDIKRIDGLL 354

Query: 396 TKLEAMFHPHHYHCYAVKHSLI 417
            +L    H +H+   ++K  L+
Sbjct: 355 KRLSRSLHTNHFLMLSLKQKLL 376


>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 554

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/390 (29%), Positives = 186/390 (47%), Gaps = 37/390 (9%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L Y+I  S  LGRYLQA+ DL  G+ I RE P+ VGP   +  PIC  C   L P++  +
Sbjct: 15  LKYKIAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCMKNPICFECLSLL-PDIEQD 73

Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
               C  C     C   C   +  H+   EC I+K   E  +   D     +  LR  +L
Sbjct: 74  FHYICSGCNVVTLCGFSCEERAYYHSMY-ECEIIKNNMELSIENTDTLAAVLFVLRLWLL 132

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +++ P+ ++ +  +E+H++KR   T V+E+ + +IV  L+S  L  +P +  D + ++ +
Sbjct: 133 KQKDPELWKRVLSLESHLNKR-RNTIVWEDREINIVNVLKS--LNFVPEN--DPSVSEII 187

Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
             +CG+++VN  ++       G   +GLY    LM H+C  N   +      L V+    
Sbjct: 188 QQLCGILDVNSFELRSPGGMDGLLLRGLYLEASLMAHDCRGNVHVTADDNFHLTVYASIP 247

Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           I         YT++L GT  RRE+L   KYF C C+ C DP E+G+Y S + C      Q
Sbjct: 248 IKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDLCKDPYEMGSYMSCILC--PRCRQ 305

Query: 337 GDCWI---LPVNPLDNDSDWTCGSCSARLNA---RDVHLVTSQLGEQVDKLVQENPNVKS 390
           G   +   L   P   +S W C  C   ++    +    ++  L + VD+      N+K 
Sbjct: 306 GYVGMQDPLTKFPYSKNSKWQCNKCKKIISGCLIKTSLSISKNLVDDVDEY-----NIKV 360

Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
           +E +  KL    H +H+   ++K  L+ +Y
Sbjct: 361 MENLKEKLSKSLHKNHFLMLSLKQKLLNVY 390


>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
          Length = 530

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 183/385 (47%), Gaps = 27/385 (7%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L Y++  S   GRYLQA  +L  G+ I RE P+ VGP  +  + +C  C + L P L   
Sbjct: 13  LKYKVACSEKFGRYLQAAKNLQAGEVILREKPIAVGPITSSNDYLCFACLRLL-PKLKKG 71

Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLI 169
            +  C +C   P C   C      HT + EC I K   E L A N      IL PLR  +
Sbjct: 72  TQYVCSKCNIAPLCGTACQEQPKHHT-SYECEIFKTN-ENLSADNIETLIGILLPLRLWL 129

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +++  P+  + ++ MEAHM +R   +   +   +++  +++  L  +P D  D +  + L
Sbjct: 130 VKQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVMKA--LHLIPED--DTSIPEYL 185

Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
             +CG+++VN  ++       G   +GLY     M H+C  N   +     +L V+    
Sbjct: 186 QQLCGILDVNTFELRSPGGLDGLLLRGLYVEASFMAHDCRGNTHLTVDDHFQLTVYASIP 245

Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD- 335
           I         YT++L GT  RREHL   KYF C C  CSDP E+G+Y S++ C    K  
Sbjct: 246 IKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLCSDPYEMGSYMSSILCPRCRKGY 305

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
            G    L +   +  + W C  C + ++ R +    +     +D +  ++ ++K +E + 
Sbjct: 306 MGIRTPLTIYAYEKSTKWQCDKCRSCISGRLIRATVNITKTLIDDM--DDYDIKGMETLT 363

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLY 420
            KL   FHP+HY   ++K  L+  Y
Sbjct: 364 KKLLGSFHPNHYLMLSLKQKLLAAY 388


>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
          Length = 517

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 187/381 (49%), Gaps = 31/381 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++  S  LGRYL A  ++  G+ I RE P+ VGP     + +C  C + L+  +     
Sbjct: 7   YKVTHSGRLGRYLVAAKNIAAGEVIIREEPIAVGPMTYRKDRLCFACLRSLS-KIGRQYV 65

Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLILQRR 173
           C RC   P CS  C   +  HT + EC IL      LLA N      IL  LR  +++ R
Sbjct: 66  CSRCNLAPLCSVACETRTKHHT-SDECLILGEN-RDLLAKNGIDATGILLALRLWLIKCR 123

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
           +P  ++++  ME H+ +R  GT V+++ + ++V  +R   + ++P D ++  SA+ +  +
Sbjct: 124 NPAVWEQVDHMEDHLHERM-GTPVWKDREVNVVNVIR---MLRVP-DGIEPPSAELMQKL 178

Query: 233 CGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
           CG+++VN  ++       G   +GLY    LM H+C  N   +     +L V+    I  
Sbjct: 179 CGILDVNTFELRSPGALDGLLLRGLYVEAALMAHDCRGNTHLTVDDDFRLTVYASVAIGE 238

Query: 286 M------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM---KCLNEHKDQ 336
                  YT++L GT  RREHL   KYF C C+ C D  EL ++ S++   +C     + 
Sbjct: 239 NEPILFNYTSSLLGTADRREHLREGKYFECECDLCKDSCELQSHLSSLLCPRCKKGFIEM 298

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
            D  I  V+P   +S W C +C      R +    +     +++   E+ ++KSL+ +L 
Sbjct: 299 QDTSI--VDPYARESRWQCQTCKRTYGGRLIRATLNICRTLIEEC--EDIDIKSLDGLLK 354

Query: 397 KLEAMFHPHHYHCYAVKHSLI 417
           KL    H +H+   ++K  L+
Sbjct: 355 KLSRSLHTNHFLMLSLKQKLL 375


>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
          Length = 513

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 120/408 (29%), Positives = 189/408 (46%), Gaps = 53/408 (12%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHK---PLNPNLA 111
           + I+ SPL GR + AT D+  G+ I  + PL++GP+      P+C+ C+K   PL P   
Sbjct: 39  WSIEYSPLGGRGMFATRDIQAGELIFTDVPLLIGPRCYRKCFPMCVVCYKNNCPLFPCDH 98

Query: 112 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETL-LAYNDYKYEAILPLRCLIL 170
           D      C  P CS  C    ++  H+ EC +L+    T   ++++    A++P+R L L
Sbjct: 99  D------CGLPICSTECE---NSTMHSQECRLLREWMPTCGSSWSNELLLAVIPIRGLTL 149

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
              S ++ + L   E H+      T  +E     +  L+ N ++K P    +    + + 
Sbjct: 150 ---SKEQRKLLYAFECHLEL----TRTHE-----IDLLKRN-VKKFP----NKEQLELMK 192

Query: 231 WICGVIEVNGVDI--------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-- 280
            ICGV   N  ++            +GLY +  L  H C+PN +H    Q +L V     
Sbjct: 193 RICGVFNTNSFEVLVASNKDCTTSLRGLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLP 252

Query: 281 ----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+  YT+ LW T  RR+ L  TK+F+C+C RCSDP+E G+   A+ C    KD 
Sbjct: 253 IAAGEEITMSYTDLLWDTSSRRQFLKATKHFSCNCSRCSDPSEFGSQLGALLC---AKDD 309

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--VKSLEEM 394
               +LP NPL+  S W C  C   +N + +  + SQL   V   +   P+  VK +E  
Sbjct: 310 CSGHLLPSNPLNFISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTPHEIVKFIETA 369

Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
           L+KL       ++    VK  +I  +G  P   +  L+ + L  K  Y
Sbjct: 370 LSKL---VPTSNFILIDVKFRIISYFGRTPNLKWEDLTDAELRIKKMY 414


>gi|332018227|gb|EGI58832.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 518

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 184/382 (48%), Gaps = 32/382 (8%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  S  LGRYL    ++ PG+ + RE P+ +GP +   +  C  C + L P + +  +
Sbjct: 7   YKIAHSKKLGRYLVTAKNIVPGEILIREEPIAIGPMVYKKDCFCFACMRTL-PKIGEGRQ 65

Query: 116 --CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLILQ 171
             C RC F P CS  C   ++ HT+  EC I K     LL  N      IL  LR  I++
Sbjct: 66  YACSRCKFVPLCSVACEIRTNHHTNE-ECQIFK-DNRDLLTENRIDVAGILLALRLWIIK 123

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            ++P  ++++  MEAH+ KR  GT V+ + + ++V  +R     + P ++ +  + + + 
Sbjct: 124 HKNPAVWEKIDCMEAHLDKR-IGTSVWRDREVNVVDVIRR---LRWPAEI-ESPNTELMQ 178

Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
            +CG+++VN  ++       G   +GLY    LM H+C  N   +     +L V+    I
Sbjct: 179 KLCGILDVNTFELRSPSALDGLLLRGLYIEATLMAHDCRGNTHLTVDDNFQLTVYASVAI 238

Query: 284 STM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                    YT++L GT  R++HL   KYF C C  C DP EL ++ S++ C    K  G
Sbjct: 239 KQNEPILFNYTSSLLGTVDRKDHLREGKYFECECSLCMDPYELQSHLSSVLCPRCKK--G 296

Query: 338 DCWILPVNPLD--NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
              I  ++ +D    S W C  C      R +    +     +D+   E+ ++K ++ +L
Sbjct: 297 FIGIQDISTIDPYEQSRWQCQMCKKTYGGRLIRATLNICRTLIDEC--EDVDIKKIDSLL 354

Query: 396 TKLEAMFHPHHYHCYAVKHSLI 417
            KL    H +H+   ++K  L+
Sbjct: 355 KKLSRSLHTNHFLMLSLKQKLL 376


>gi|321474905|gb|EFX85869.1| hypothetical protein DAPPUDRAFT_237516 [Daphnia pulex]
          Length = 595

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 129/464 (27%), Positives = 204/464 (43%), Gaps = 77/464 (16%)

Query: 35  CGKQHQ-KEHWKLHKPKCKKL-------PYEIKSSPLLGRYLQATLDLHPGDRIARESPL 86
           CG + Q +E+  + +P  +K           + SSP LGRY  A+    PGD + +E PL
Sbjct: 43  CGSRTQGRENRGMTQPNNRKSSSGGSKPAIHLDSSPELGRYFVASRAFKPGDIVMKEFPL 102

Query: 87  IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL-K 145
              P   L+E +C  CH+      A    C  C  P CS  C     +  H  EC  L K
Sbjct: 103 ASAPG-PLSEHVCFACHRLF----ARAHICSDCRSPLCSRNCQV---SPEHQRECRYLTK 154

Query: 146 LGCETLLAYNDYKYEAILPLRCLILQR---------RSPKKYQELKDMEAHMSKRGPGTE 196
           +  E      + K + + P++  +L             P K+ +L D+E+H + R   + 
Sbjct: 155 IIAEIENRQRNGKIQRV-PVQLGMLMLPLRLLMLRREQPDKWNQLMDLESHTTARQATSI 213

Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG------RYT--- 247
                 +I  +L +  ++++P  + DD     +  +CG ++VN  +I       R+T   
Sbjct: 214 WGFNQRNITPFLETMLIDEMPDIINDD----LVQQMCGAVDVNSFEIRIETPDRRHTNDL 269

Query: 248 ------------QGLYSVICLMEHNCLPNAKHSNMMQSKLFV-----FRDTH-ISTMYTN 289
                       +G + +  LM H+C+ NA+ S    S++ V      R+   I   YT+
Sbjct: 270 PQQPMPNSQEVLRGTFYMASLMAHSCVVNAQISIGNDSQMTVKATVPIREGEGIFVSYTD 329

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV-NPLD 348
            L  T  RR  L   K+F C C RC DPTE GT+ SA++C    +     W+LP  +P  
Sbjct: 330 PLQTTLQRRTFLEKGKHFICRCRRCQDPTEFGTFASAVRC----RSCTSGWVLPNDDPAS 385

Query: 349 NDS---------DWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL----EEML 395
             S          W C  CS+ L + ++   T ++ + + K +Q  P  + L    EE+ 
Sbjct: 386 IASAAAVEGVALRWKCSECSSELKSEEIGR-TEEVIQNIVKNIQRLPVDRHLIDRCEELF 444

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
             L    H +H     ++ SLI LYG  PG+   +LS  LL+RK
Sbjct: 445 VTLPKKIHVNHAFLLQLRISLIHLYGNVPGFLIPELSVGLLQRK 488


>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
 gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
          Length = 518

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 176/384 (45%), Gaps = 51/384 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y+I  S  LGR++ A  DL  G++I  E P + GP    A+  CL C +           
Sbjct: 8   YQILESEQLGRFVVAKRDLTRGEQILLEEPYVTGPYWD-ADVSCLNCFRDSCSTCRQCGV 66

Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK-LGCETLLAYNDYKYEAILPLRCLILQRR 173
            P C+            D   H   EC+  +    +    YN +    ++P+RCL+L R 
Sbjct: 67  APLCY------------DCKIHDEAECSFYRESSLDRHFLYNHFN--VVMPVRCLMLYRS 112

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSN--FLEKLPGDVLDDTSAKCLH 230
           + ++Y E+  ME+ + +R  GTE+++  +  +VK L  +  F EK  G    + + + + 
Sbjct: 113 NRERYDEVMTMESRLEER-RGTEIWDIHEKFVVKPLMESGIFAEKFEGL---EVTGELIQ 168

Query: 231 WICGVIEVNGVDI-----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
            ICG+++ N  +I           G   +GLY    LM H+C+PN   S    S + VF 
Sbjct: 169 RICGILDANTFEIRGNVDSRESQMGNLARGLYPKTALMMHSCMPNTLLSVDGYSNVRVFT 228

Query: 280 DTHI------STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
              +      +  YT +L+GT  R+ HL   KYF CSC RC DPTELGT+ S+++C    
Sbjct: 229 SAPVKMGEILNISYTRSLFGTYDRQTHLRKGKYFICSCRRCKDPTELGTHLSSIRC--SE 286

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
            D G C   P  P      W C  C   L    V+ +   L    D ++     V+ LE 
Sbjct: 287 CDDGLCSFYPDEP-----RWECNKCHKLLKREYVNEI---LCAARDDIMACPLEVRDLER 338

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLI 417
           +++K   + +PHH      K +L+
Sbjct: 339 VISKHSKILNPHHSLVLEAKQTLM 362


>gi|312373518|gb|EFR21239.1| hypothetical protein AND_17329 [Anopheles darlingi]
          Length = 445

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 171/358 (47%), Gaps = 42/358 (11%)

Query: 137 HAPECAILKLGCETLLAYND-----YKYEAILPLRCLILQRRSPKKYQ-ELKDMEAHMSK 190
           H  ECA+         +  D      + + I PLR L+ +  +P++++ E+  ME H  K
Sbjct: 30  HGAECAVFAEKRVRFQSVEDSTAGCVQLDCITPLRMLLAKEANPERWEREIVPMEYHDEK 89

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI----GRY 246
           R  G     +  +IV +LR+         + D  S   +  + GV++VN  +     G  
Sbjct: 90  RREGANWKVDDHNIVSFLRNAC------GLADRFSVDLIQQVIGVLDVNAFEGRTCNGYS 143

Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLF---------VFRDTHISTMYTNALWGTQPR 297
            +GLY  + +M H+C+PN  HS +  S+ F         V     + T YT  L GT  R
Sbjct: 144 VRGLYPQLAIMAHSCVPNVVHS-IHPSEAFRLEARVAVDVAEGEKLYTTYTYTLTGTVAR 202

Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGS 357
           +  L  +K+F C C RC DP+ELGT+FS++ C   +K  G   I+  +PLD +++W CG 
Sbjct: 203 QAILKASKFFTCMCPRCVDPSELGTHFSSLLC---NKCVGGI-IISSDPLDENAEWKCGD 258

Query: 358 CSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKH 414
           C  + +   V      +  ++D+L     +   +++ E +  K  ++ HP H+   +++H
Sbjct: 259 CGFKTSGAAVQKAILTMHNELDELACLEYDAGRLEAYERLYKKFRSVLHPRHFINTSIRH 318

Query: 415 SLIQLYGTQPGYAYTQLSSSLLERKISY---------VMSPNLKATDEPICLGCHKPL 463
           SLI+LYG  PGY   +L   LLERK+           V+ P L  +   +    H PL
Sbjct: 319 SLIELYGRIPGYEMAELPDVLLERKVDLCRDVLRVLDVVEPGLSRSRAMVLYELHAPL 376


>gi|158289967|ref|XP_559081.3| AGAP010377-PA [Anopheles gambiae str. PEST]
 gi|157018413|gb|EAL41037.3| AGAP010377-PA [Anopheles gambiae str. PEST]
          Length = 503

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/372 (30%), Positives = 165/372 (44%), Gaps = 49/372 (13%)

Query: 66  RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF-WPAC 124
           R+L A  D+ PG+ I  E P++VGP    A+  CLGC  P +        C  C   P C
Sbjct: 1   RFLVANRDIKPGELILAEDPIVVGPYWD-ADISCLGCLGPASRT------CRVCLKGPLC 53

Query: 125 SARCSGLSDAHTHAP-ECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
                   D   H P EC+  +   +    +    +  I PLRCL+L R    KY E+  
Sbjct: 54  R-------DCSRHDPVECSFYEQASKLNKNFLFDHFNIITPLRCLLLYRSDRDKYNEMMA 106

Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD--TSAKCLHWICGVIEVN-- 239
           ME+H   R   TE++   +   +Y+    L +     +DD   + + L  ICG+++VN  
Sbjct: 107 MESHFESR-RDTEIWHIHN---QYVVEPMLNEKDFQQIDDLTVTGELLQRICGILDVNTF 162

Query: 240 ---------GVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST----- 285
                    GV +    +GLY    LM HNC  N   +    SKL ++    I       
Sbjct: 163 EIRGNMDSQGVQMNNLARGLYPKTSLMTHNCQTNTLIAVDGMSKLRLYSSIGIKAGELLY 222

Query: 286 -MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
             YT  L+ T  R+ HL   KYF C+C RCSDPTELGT+ S++KC     D G C   P 
Sbjct: 223 YNYTRVLFSTFERQTHLRKGKYFICNCARCSDPTELGTHLSSLKC--TACDDGLCVFHPE 280

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
            P      W C +C  +L    V+ +  +  + +     ++ N   LE+++ K     +P
Sbjct: 281 TP-----KWECLTCGHQLPREYVNQIMCEARDDISSCALDSRN---LEKVIGKHSKTLNP 332

Query: 405 HHYHCYAVKHSL 416
           HH      K SL
Sbjct: 333 HHSLVLEAKQSL 344


>gi|328713664|ref|XP_003245146.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 464

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 112/404 (27%), Positives = 180/404 (44%), Gaps = 53/404 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           PY +  SP LGR+  A  D+  G+ +  E PL+VGPK A + P+CL C+ P         
Sbjct: 17  PYAVVQSPELGRHWVAVRDIVAGEVLLEERPLVVGPK-AGSPPVCLTCYAP-----TAGY 70

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC-LILQRR 173
           +C +C WP C  RC   +  H  A EC ++    ++        Y  ++PLRC L+L +R
Sbjct: 71  KCSKCGWPVCGPRCEA-APVHRDA-ECPLIDGHYDS---RRSAAYCFVMPLRCMLLLHQR 125

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
             ++  E + +++H+  R   T +Y     ++  ++      +  G   +  SA      
Sbjct: 126 DGRRAAEFRSLQSHLDDR-LNTPLYRAYAVNVAAFVLDRLGLRSAGYGHNHRSAL---EA 181

Query: 233 CGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM-----------MQSKLFV 277
             V++ N  ++    GR  + +Y    +M H C PN KH  +           + + + +
Sbjct: 182 AAVLDTNAFEVRRPGGRKFRAVYGRASMMAHCCTPNTKHVFIGDETDGQPTIRVVAAVPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            R   I+  YT  LW T+ RR HL+  K F C+C RC+DP ELGT+  ++ C       G
Sbjct: 242 ARGYPITATYTQTLWCTRDRRRHLSAAKCFECACARCADPVELGTHLGSVAC-----GGG 296

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
            C   P         W C +C  RL A DV    +    Q    + +  +    E  L +
Sbjct: 297 QC---PGGRATAAGQWLCTTC-GRL-ATDVEAAHAL---QTVGALSKTRDCAGFERFLER 348

Query: 398 LEAM----FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLE 437
           +        H +H+    VK++L QLY  +     + LS+  LE
Sbjct: 349 VRDGTMPPLHANHHVTVGVKYALAQLYADR----ISDLSTKQLE 388


>gi|321473703|gb|EFX84670.1| hypothetical protein DAPPUDRAFT_314752 [Daphnia pulex]
          Length = 497

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 116/407 (28%), Positives = 196/407 (48%), Gaps = 40/407 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIA-RESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           + I  S  +G++L AT ++   + I   E  L VGPK A   P+CL C++ ++ +     
Sbjct: 54  FTIGKSSDIGKFLVATRNIRADEIIIDAEEALAVGPKQA-THPVCLNCYRRVDGSYL--- 109

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILK----LGCETLLAYNDYK-YEAILPLRCLI 169
            CP C WP CS  C+ + +      EC +L        +  L   D + Y+ I PLR ++
Sbjct: 110 -CPLCQWPLCSEVCAQMENGAHQKYECQLLANREAYPTKEELEQEDCRLYDCIAPLRIVL 168

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPG--DVLDDTSAK 227
              R+P++ +    +E H         +Y +   I    R   +E +     + ++T+ +
Sbjct: 169 DNHRNPERSRLWLGLEDH--------RLYRQKIGIWHVDRKTVVEPVLNKWQLGENTNEE 220

Query: 228 CLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKL 275
            L   CG++EVN  ++         + +YS  CLM HNC+PN       A     +++ +
Sbjct: 221 ELQRACGILEVNAFEVCDDDKTISARAVYSHACLMAHNCVPNTACCVNPATQRMTVRAAV 280

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            +     I+T YT  L GT  RR HL  TK+F+C C RC DPTELG++ S++ C    + 
Sbjct: 281 DIPAGQMITTSYTFTLDGTLRRRSHLKETKFFDCCCPRCCDPTELGSHLSSLAC---PRC 337

Query: 336 QGDCWILPVNPLDNDSDWTCGS--CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
            G  + +P +PL+  S W C    C  R+ +  V+ + + L  + D+L  +  + + LE 
Sbjct: 338 SGRNYTVPEDPLNPASFWKCQGEKCGNRIGSDQVNQLLACLQRKADELNYD--DFQGLEI 395

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            +    ++ HP+H     +K+ L   YG  PG+   ++S   L RK+
Sbjct: 396 FIETWSSLLHPNHSLLIGIKYYLCHFYGNAPGFLLKEMSWRFLTRKM 442


>gi|350403683|ref|XP_003486875.1| PREDICTED: hypothetical protein LOC100747756 [Bombus impatiens]
          Length = 814

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 122/392 (31%), Positives = 192/392 (48%), Gaps = 41/392 (10%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L Y +  S  LGRYLQA  +L  G+ I RE P+ VGP  +  +P C  C + L P +   
Sbjct: 13  LKYRVAYSEKLGRYLQAAKNLTAGEVILREEPIAVGPITSSKDPACFACLRLL-PKIKKE 71

Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
            +  C +C   P CS  C   S  HT   EC I K   + L+   +     +LPLR  +L
Sbjct: 72  LQYVCSKCNIAPLCSTACEERSKHHT-PDECEIFKRNKDLLINNTENIIGVLLPLRLWLL 130

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           ++R  + +++++ MEAH  KR   T V+++ + ++V  ++S  L  +P    D + ++ L
Sbjct: 131 RQRDAELWKQVESMEAHTDKRR-NTSVWKDRELNVVNVIKS--LHLVPDG--DPSVSELL 185

Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF---- 278
             +CG+++VN  ++       G   +GLY    LM H+C  N   +     +L V+    
Sbjct: 186 QLLCGILDVNCFELRSPGGLDGLLLRGLYVEASLMAHDCRGNTHLTADDNFQLTVYASLP 245

Query: 279 --RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I   YT++L GT  RREHL   KYF C C  CSDP E+G+Y S++ C       
Sbjct: 246 IKEGDEIYFNYTSSLLGTLGRREHLRGGKYFECECPLCSDPYEMGSYMSSILCPRCR--- 302

Query: 337 GDCWILPVNPL-----DNDSDWTCGSCSARLNARDVHL---VTSQLGEQVDKLVQENPNV 388
            + +I   NPL     +  + W C  C + +  R V     +T  L + VD     + ++
Sbjct: 303 -EGYIGMQNPLTKFPYEKGTRWQCNKCKSSIGGRLVRATLNITKTLIDDVD-----DYDI 356

Query: 389 KSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
           K LE ++ KL   FH +H+   ++K  L+  Y
Sbjct: 357 KGLETLMAKLSKSFHRNHFLMLSLKQKLLAAY 388


>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
          Length = 490

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 118/407 (28%), Positives = 184/407 (45%), Gaps = 46/407 (11%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK---PLNPNLADN 113
           +  SPL GR L AT D+  G+ I  + PL++GP+      P+C+ C+K   PL P   D 
Sbjct: 29  VDRSPLGGRGLFATRDIQAGELIFTDVPLLMGPRCYNKHLPMCVVCYKNNCPLFP--CDY 86

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLA-YNDYKYEAILPLRCLILQR 172
                C  P CSA C   S  H    EC  L+    T  + ++      ++ +R L L  
Sbjct: 87  G----CGLPICSAECEN-SAMHVQG-ECQFLREWAPTCGSTWSKDLLLVVVLIRGLAL-- 138

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
            S ++   L   E H +      EV++ I      L+ + L++   ++  +   K +  +
Sbjct: 139 -SEEQRTLLYAFECH-ANLNRNYEVHQFI------LKIDLLKRNVANLPSEEQMKLMQRV 190

Query: 233 CGVIEVNGVDI---------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV------ 277
           C V   N  ++             +GLY +  L  H C+PN++H    Q +L V      
Sbjct: 191 CDVFNTNSFEVVVPSRNNDHTMSLRGLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPI 250

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                I+  YT+ LW T  RR+ L ITK F+C+C RCSDP E G+  SA+ C    KD+ 
Sbjct: 251 AAGEEITMSYTDLLWDTSTRRQFLRITKRFSCNCNRCSDPLEFGSRLSALLC---AKDEC 307

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--VKSLEEML 395
              +LP NPL+ +S W C  C   +N R +  + S L   V  ++ + P    +  E +L
Sbjct: 308 LGHLLPRNPLNYESSWICDKCQTSVNHRQIECIHSGLNTFVSNVMYKTPREIQRFTEAIL 367

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
            KL       +Y    +K+ +I  +G  P   +  L+   L  K +Y
Sbjct: 368 PKLVPAI---NYIMLDIKYCIISYFGRVPDLKWEDLTDVELRIKRTY 411


>gi|350424241|ref|XP_003493732.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
          Length = 498

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 116/429 (27%), Positives = 195/429 (45%), Gaps = 55/429 (12%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHKPLNPNLADN 113
           P+ I+SS L GR L AT D+   + I  +SPL+VGPK +     +C+ C+K    N    
Sbjct: 32  PWTIRSSLLGGRGLFATRDIKQNELIFIDSPLLVGPKYIGKYFEMCVSCYK----NECAL 87

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETL-LAYNDYKYEAILPLRCLILQ 171
             C R C  P CS +C   S  H +  EC  LK    T    ++     A++P+R L L 
Sbjct: 88  FPCDRGCGLPVCSTQCEN-SPKHVNY-ECEYLKSLVPTCGTDWSPNLLLAVVPIRALFLT 145

Query: 172 RRSPKK------------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
            +  K             Y E++ ++ +++   P  E  E +  + + L +N  E     
Sbjct: 146 EQQRKCLATLQSDKNLTCYPEIEQLKKNVTN-SPSEEDMELMKHMCRILNTNSFE----- 199

Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV-- 277
                           I V+  +     +GLYS+     H C+PN +H    + +++V  
Sbjct: 200 ---------------TIMVHDKEHSVSLRGLYSIASFQNHCCVPNTRHHFDGEFRMYVSA 244

Query: 278 ----FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
                    I++ YT+  W T  RR+ L+ITK+F+C C+RCSDPTE  +   A+ C +  
Sbjct: 245 ALPIAAGEEITSTYTSLFWDTTLRRKFLSITKHFSCMCKRCSDPTEFKSRLGALLCAS-- 302

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
            D+    +LP++PL+  + W C  C+  LN R +  + S +   + +++ + P  + + +
Sbjct: 303 -DKCSGELLPLDPLNMTTPWICDKCAITLNHRQICSIRSGIAGIIKEVLCKTP--REIYK 359

Query: 394 MLTKLEAMFHPHHYHCYA-VKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATD 452
            L K  ++  P   +C   +K  ++  YG   G  +  LS + L  K  +  +  L   D
Sbjct: 360 FLQKELSLLTPWSNYCIVDMKFRIVSYYGRSDGAKWEDLSDTELNLKAKFC-NDLLSVLD 418

Query: 453 EPICLGCHK 461
           +  C  C K
Sbjct: 419 DLNCGDCRK 427


>gi|345480417|ref|XP_001607394.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
          Length = 557

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 187/407 (45%), Gaps = 49/407 (12%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           YE+  S +LGRYL A  +L  G+ I RE  L+VGP +   +  C G +       A NA 
Sbjct: 15  YELAYSDVLGRYLVAAKNLSAGEVIFREDALVVGPAMFANDVFCFGLNPAKLATRAPNAA 74

Query: 116 CPRCFWPACS-----------ARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAIL 163
            PR                   +   L D+  H+  EC +L+   + +L   +     ++
Sbjct: 75  SPRSAIELARYIRILHVSHSLFQRDALQDSGLHSTAECDLLRAKRDVMLENMEDLMHTLV 134

Query: 164 PLRC-LILQRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL 221
            L+  L++   S +  +E L  MEAHM++R  G+EV+ E ++ V  +   F     G V 
Sbjct: 135 HLKLWLVMSAGSDQDVRERLSKMEAHMNER-RGSEVWFEREANVVDVFRRF-----GLVR 188

Query: 222 DDTSAKCLHWICGVIEVNGVDIGR------------YTQGLYSVICLMEHNCLPNAKHSN 269
            +  A+ + W+CGV++VN  ++                +G++    LM H C   A  + 
Sbjct: 189 SENEAELVQWLCGVLDVNSFELRTPVPGSNGNNGSPLLRGIFLEAALMAHACRGTAHIAV 248

Query: 270 MMQSKLFVFRDT------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
             + ++ V+          I+  YT++L GT  R+EHL + KYF C C  C DP E G++
Sbjct: 249 DDRFQMTVYAAVPIPAGETIAFNYTSSLLGTIERQEHLQVGKYFRCECSMCVDPLEHGSF 308

Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDND-----SDWTCGSCSARLNARDVHLVTSQLGEQV 378
            S + C    KD    ++   N  D D     + W C  C  R+    +   T ++G+ +
Sbjct: 309 VSCILCPRCKKD----YVAIQNARDKDPYGRKTKWQCRKCK-RIFRGCLVKSTIEIGKDL 363

Query: 379 DKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
            + V +  N K++E +L +L + FH +H+    +K  ++ +Y  + G
Sbjct: 364 IENVGDG-NSKAMESLLVRLSSTFHTNHFVMLTLKQKVLAVYRKEVG 409


>gi|328785466|ref|XP_001121590.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 526

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/391 (28%), Positives = 182/391 (46%), Gaps = 68/391 (17%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L Y++  S  LGRYLQA+ DL  G+ I RE P+ VGP   + +PIC  C   L PN+ ++
Sbjct: 15  LKYKVAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCVKDPICFECLSIL-PNIEED 73

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKL---GCETLLAYNDYKYEAILPLRCLIL 170
                      +  CSG          C ++ L    CE    Y+   +         +L
Sbjct: 74  V----------NYVCSG----------CNVVTLCGVTCEERGIYHSALW---------LL 104

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRS-NFLEKLPGDVLDDTSAKC 228
           +++ P+ ++ +  +E+H++KR   T V+E+ + +IV  L+S NFLE       D + ++ 
Sbjct: 105 KQKDPELWKRVLSLESHLNKR-RNTIVWEDREINIVNVLKSLNFLEN------DPSVSEM 157

Query: 229 LHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
           +  +CG+++VN  ++       G   +GLY    +M H+C  N   +      L V+   
Sbjct: 158 IQQLCGILDVNSFELRSPGGMDGLLLRGLYLEASMMAHDCRGNVHVTADDNFHLTVYASI 217

Query: 282 HIST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            I         YT++L GT  RRE+L   KYF C C+ C DP E+G+Y S + C      
Sbjct: 218 PIKEGDTIFFNYTSSLLGTTGRREYLRTGKYFECECDLCKDPYEMGSYMSCIVC--PRCR 275

Query: 336 QGDCWI---LPVNPLDNDSDWTCGSCSARLNA---RDVHLVTSQLGEQVDKLVQENPNVK 389
           QG   +   L   P    + W C  C   ++    +    ++  L + VD+      N+K
Sbjct: 276 QGYVGMQDPLTKFPYTKSAKWQCNKCKKIISGCLIKTSLSISKNLVDDVDEY-----NIK 330

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
            +E +  KL    H +H+   ++K  L+ +Y
Sbjct: 331 VMENLKDKLSKSLHKNHFLMLSLKQKLLNVY 361


>gi|170044116|ref|XP_001849704.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867385|gb|EDS30768.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 502

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 168/379 (44%), Gaps = 56/379 (14%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +++  +  L R++ A  DL  G++I  E PL+ GP    A+  CL C +      +   +
Sbjct: 8   FQLLHTEQLDRFVVAKRDLPRGEQILLEEPLVTGPYWD-ADVSCLSCLRDSCSTCSKCGK 66

Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK--LGCETLLAYNDYKYEAILPLRCLILQR 172
            P C+            D   H A EC   +     +    YN +    I P+RCL+L R
Sbjct: 67  APLCY------------DCTEHDATECDFFRESTALDKHFLYNHFN--VITPVRCLLLYR 112

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD----TSAKC 228
           ++   + E+  ME+H+ +R  G+E+++  D   KY+ +  LE    D L+D     + + 
Sbjct: 113 KNRDLFDEMMRMESHVEER-RGSEIWQIHD---KYVVTPLLEAGILDSLNDGELEVTGEL 168

Query: 229 LHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
           +  ICG+++VN  +I              +GLY    LM HNC+PN   S      L VF
Sbjct: 169 IQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLSIDGVGNLRVF 228

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGD 338
                      +  GT  R+ HL   KYF C C RC DPTELGT+ S++KC     D+G 
Sbjct: 229 ----------TSAPGTFERQTHLRQGKYFTCYCRRCKDPTELGTHLSSIKCT--ECDEGL 276

Query: 339 CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL 398
           C   P  P      W C  C   L    V+ V      ++ +   E   V+ LE ++ K 
Sbjct: 277 CSFYPSEP-----RWECNKCRKLLKREYVNEVMCAARSEIMECALE---VRDLERVIGKH 328

Query: 399 EAMFHPHHYHCYAVKHSLI 417
               +PHH      K +L+
Sbjct: 329 SRTLNPHHSLVLEAKQTLL 347


>gi|91088045|ref|XP_966477.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270012081|gb|EFA08529.1| hypothetical protein TcasGA2_TC006182 [Tribolium castaneum]
          Length = 481

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 107/407 (26%), Positives = 183/407 (44%), Gaps = 53/407 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P+ IK +    R++ A   +  G+ I  E+P + GPK   +  +CLGC + ++   A   
Sbjct: 5   PFTIKQTDQ-NRHVVANRLIKSGEIILTETPFVYGPKPG-SPVLCLGCFECVDSKTA--- 59

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--------KYEAILPLR 166
            C +C WP C   C  LS    +  EC I     +  + +ND         +YE I PLR
Sbjct: 60  -CSKCSWPVCGPECEFLSSHKDN--ECEIF---AKNRVKFNDVDDPYDVCLQYEGITPLR 113

Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
            L+ + + PK++ E             G ++ +   ++V ++R N  +      +D    
Sbjct: 114 MLLAKEKDPKRWLE----------EIGGLKIRKNNVNVVDFVR-NVCK------MDQFCD 156

Query: 227 KCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
             +H +CG++++N +++    G   + LY  +  + HNC+ N   +   +      R   
Sbjct: 157 DLIHTVCGILDLNAIEVRAPSGYLIRCLYPKLSQISHNCVTNLFQTVTPEDFKITLRASG 216

Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
               +  +   Y   LW T  RR+ L   K  +C C+RC D TEL T+ S +KC     D
Sbjct: 217 NISENQELFYNYVYPLWPTLIRRDFLKENKNLDCCCKRCGDKTELRTHLSTLKC--SKCD 274

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPNVKSLEE 393
            G   +L  + L +  DW C  C  + NA  V  V   +  +++  ++V     ++  E 
Sbjct: 275 NG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEGIEQREA 332

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           +  K  ++FHP + +   ++  L QLYG  PGY   +L   L E K+
Sbjct: 333 IFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 379


>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
          Length = 490

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 127/441 (28%), Positives = 199/441 (45%), Gaps = 71/441 (16%)

Query: 30  KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVG 89
           KEV      H KE+    + +   + Y    S L GR + AT D+   + I  ++PLIVG
Sbjct: 12  KEVEEVFISHLKENKIFQEQQLWTVGY----SSLGGRGMFATRDIKQNELIFIDAPLIVG 67

Query: 90  PK-LALAEPICLGCHK---PLNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAIL 144
           PK L     +C+ C+K   PL P       C + C  P CS +C    ++  H  EC  L
Sbjct: 68  PKCLNKQTKMCICCYKNECPLFP-------CDKGCGLPVCSIQCE---NSPNHVNECEYL 117

Query: 145 KL---GCETLLAYNDYKYEAILPLRCLIL---QRRSPKKYQELKDM----EAHMSKRG-- 192
           +     C T    N     A++P+R L +   QR+     Q  K++    E  + K+   
Sbjct: 118 RSLIPTCGTDWCLN--LLLAMIPIRGLFMTKMQRKCLATLQYNKNLFSKYEIELLKKNVM 175

Query: 193 --PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--Q 248
             P  E  E ++ + +   +N  E +           C+H           D   +T  +
Sbjct: 176 NSPSDEDIEMMERVCRAFNTNSFETI-----------CVH-----------DKDHFTNLR 213

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS-----TM-YTNALWGTQPRREHLA 302
           GLY +  L  H C+PN +H    + +L+V     IS     TM YT+  W T  RR+ L 
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273

Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
           +TK F+C C+RCSDPTE  +  SA+ C +   D+    +LP NPL+  + W C  CS  +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNSKLSALLCAS---DKCSGELLPKNPLNMRTSWICNKCSITV 330

Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH-HYHCYAVKHSLIQLYG 421
           N R +  + S L   +++++ + P  + + + L K  ++  P  +Y    VK  +I  YG
Sbjct: 331 NHRQICSIRSGLAAIIEEIIYKTP--REIFKFLEKDLSIIVPQSNYLIMDVKFRIISYYG 388

Query: 422 TQPGYAYTQLSSSLLERKISY 442
                 +  LS   L+ K +Y
Sbjct: 389 RSNDLQWKDLSDIELDIKANY 409


>gi|170056789|ref|XP_001864189.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876476|gb|EDS39859.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 508

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 170/387 (43%), Gaps = 55/387 (14%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           YEI  S  LGRYL A  DL  G+ I  E P +VGP    ++  CL C +          R
Sbjct: 8   YEILESDRLGRYLVAKRDLARGELILVERPTVVGPHWD-SDVCCLNCFRNSCTICRLCRR 66

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQRR 173
            P C+       C+G  +A     EC   +   E+ L  N     +  +  +RCL+L R 
Sbjct: 67  APLCY------DCTGHDEA-----ECRFYR---ESNLDINFLFNHFNVVTSVRCLLLYRT 112

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL---- 229
              ++ E++ MEAH+ KR  G+E++   D + KY+    L+    D + D   + L    
Sbjct: 113 DRDRFWEMRRMEAHLEKR-RGSEIW---DILSKYVVEPLLQNGAFDRVGDGEGEKLTVNE 168

Query: 230 ---HWICGVIEVNGVDIGR-----------YTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
                 CGV +VN  +I             + +GLY    LM H+C+PNA  S   +S L
Sbjct: 169 DLIQHCCGVFDVNAFEIRGNGTSQSVQQNCFVRGLYPRAALMAHDCVPNAFISVDGESNL 228

Query: 276 FVF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            V+      +   +   YT  L+GT  RR HL   KYF C+C RC DPTELGT+  +++C
Sbjct: 229 RVYAAVPIEKGQMVLFNYTRCLFGTFERRAHLRKGKYFLCTCARCEDPTELGTHLCSVRC 288

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
            N       C     +     + W C  C   L    V  V +      +  +  +  + 
Sbjct: 289 TN-------CSAGLCSYYSKVTKWKCDKCPHELTREYVKQVFTN---ARNDAMARSFEIP 338

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSL 416
            LEE +T+  A  +P +     +K +L
Sbjct: 339 ELEEAITRHSATLNPRNSLVLELKQTL 365


>gi|170065144|ref|XP_001867817.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167882269|gb|EDS45652.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 506

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 166/379 (43%), Gaps = 59/379 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +++  +  L R++ A  DL  G++I  E PL+ GP    A+  CL C +      +   +
Sbjct: 8   FQLLHTEQLDRFVVAKRDLPRGEQILLEEPLVTGPYWD-ADVSCLSCLRDSCSTCSKCGK 66

Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK--LGCETLLAYNDYKYEAILPLRCLILQR 172
            P C+            D   H   EC   +     +    YN +    I P+RCL+L R
Sbjct: 67  APLCY------------DCSEHDTTECDFFRETTALDKHFLYNHFN--VITPVRCLLLYR 112

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD----TSAKC 228
           ++   + E+  ME+H+ +R  G+E+++  D   KY+ +  LE    D L+D     + + 
Sbjct: 113 KNRDLFDEMMRMESHVEER-RGSEIWQIHD---KYVVTPLLEAGILDSLNDGELEVTGEL 168

Query: 229 LHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
           +  ICG+++VN  +I              +GLY    LM HNC+PN   S      L VF
Sbjct: 169 IQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLSIDGVGNLRVF 228

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGD 338
                      +  GT  R+ HL   KYF C C RC DPTELGT+ S++KC     D+G 
Sbjct: 229 ----------TSAPGTFERQTHLRQGKYFTCYCRRCKDPTELGTHLSSIKCT--ECDEGL 276

Query: 339 CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL 398
           C   P  P      W C  C   L    V+ V      ++ +       V+ LE ++ K 
Sbjct: 277 CSFYPSEP-----RWECNKCRKLLKREYVNEVMCAARSEIMEC------VRDLERVIGKH 325

Query: 399 EAMFHPHHYHCYAVKHSLI 417
               +PHH      K +L+
Sbjct: 326 SRTLNPHHSLVLEAKQTLV 344


>gi|442615898|ref|NP_001259436.1| CG11160, isoform C [Drosophila melanogaster]
 gi|440216646|gb|AGB95279.1| CG11160, isoform C [Drosophila melanogaster]
          Length = 369

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/323 (27%), Positives = 151/323 (46%), Gaps = 37/323 (11%)

Query: 163 LPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNFLEKLPGD 219
           + +R L+L++  P+++  +  ME+H  +R     +   YEE   +V+ LR  +       
Sbjct: 1   MIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRVTW------- 51

Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSN------- 269
            L+D  A+ +H +CG+++VN  +IG+     + LY    L+ H+C PN  H++       
Sbjct: 52  QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPSSFEI 111

Query: 270 MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           ++++   V     ++  Y   L GT  RR  +   K F C C RCSDP ELGT  SA+ C
Sbjct: 112 LLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 171

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
                      +  V+PL    DW C  C+ ++ A +V     ++   ++ +  +  ++ 
Sbjct: 172 ATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRINNDLEDI--DVHDIP 225

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY------- 442
            LE  L +   +  P+HY   + K+SL Q+YG   GY   Q+S   + RK SY       
Sbjct: 226 GLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPEDIARKESYCREFLEI 285

Query: 443 --VMSPNLKATDEPICLGCHKPL 463
             V+ P L      I    H P+
Sbjct: 286 VDVLDPGLTRLRGLIMYELHAPV 308


>gi|237651911|gb|ACR08644.1| SET and MYND domain containing 1, partial [Drosophila silvestris]
          Length = 246

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 135/248 (54%), Gaps = 31/248 (12%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC   A   C+ CK V YC  +HQK+HW  HK  CK  P++++  P LGRYL AT ++
Sbjct: 8   CAVCNVAAPLTCTRCKLVRYCNGEHQKQHWPAHKTCCK--PFKVQDDPQLGRYLVATQNI 65

Query: 75  HPGDRIARESPLIVGPKLALAEP-------ICLGCHKPLNPNLADNARCPRCFWPACSAR 127
                I  E PL+VGPK  L +         C+ C+ P   N      C  C WP CS  
Sbjct: 66  QAKQIIFVEEPLVVGPKWFLTDDEKSANIVPCVACYTPCRIN---KHLCRSCRWPVCSIS 122

Query: 128 CSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
           C           EC+IL LG       +    ++ Y+++A+L L+CL+LQR+ P+++  L
Sbjct: 123 CEH------EKFECSILSLGSPPSGKSDARGLHDYYRHDALLVLKCLLLQRQHPERWTAL 176

Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG-----DVLDDTSAKCLHWICGV 235
            +M++H  +R  GTE+++E +  IV YL+  FL+++ G      +L +  A+ LH +CG+
Sbjct: 177 LEMQSHEEERV-GTELHQEAEQRIVSYLQQRFLQRIKGAKNRESLLSEYRAELLHRLCGI 235

Query: 236 IEVNGVDI 243
           IE N + I
Sbjct: 236 IETNYMVI 243


>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
          Length = 501

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 115/428 (26%), Positives = 196/428 (45%), Gaps = 57/428 (13%)

Query: 37  KQHQKEHWKLH--KPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LA 93
           ++H   H  L   +P C    ++I++S L GR + A+ D+   + +  ++ ++ GP+  +
Sbjct: 11  RRHLVRHGLLEAAQPTC----WKIENSKLSGRGVIASRDIAVDELVYADAAVVQGPRCYS 66

Query: 94  LAEPICLGCHKP---LNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCE 149
              P+C+ C+K    L P       C R C  P CS  C   +D H    ECA LK G E
Sbjct: 67  KYLPLCVSCYKTGCVLFP-------CDRGCGLPVCSEVCQ--NDPHHVEAECAYLK-GLE 116

Query: 150 TLLA--YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKY 207
                 ++    +A++P+R L+L +   + +  L+  EA    R        E+D     
Sbjct: 117 PTCGSGWSLELLQAVVPIRALMLPKELTRVFYSLQCHEAAQHGR--------EVD----M 164

Query: 208 LRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHN 260
           L+SN    + GD+ D    + +  +C  ++ N  +      G+ T  + L+ V  L  H 
Sbjct: 165 LKSN----VKGDI-DADDEELMRRVCRAMDTNSFETAVIRGGKSTSLRALFPVGALTNHQ 219

Query: 261 CLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           C+PN +H    + +L V+          I+  Y + LW TQ RR  L  TK+F C C RC
Sbjct: 220 CVPNTRHIVNAEGELLVYAAVPIAEGQEITMSYADVLWDTQMRRHFLLATKHFACQCPRC 279

Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
           +D TE G+   A+ C  ++  QG+  +LP  PL  +  WTC  C   +  R +  + + +
Sbjct: 280 TDVTECGSLLGALSCAIDNC-QGN--LLPNTPLSLNCPWTCNECGLIIKGRQISSIRAGI 336

Query: 375 GEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSS 434
               ++   ENP  K L  +  +L  +    +Y    +   ++ L+G   G  +++L+  
Sbjct: 337 IAATEEFKYENPR-KVLRFVNKELSLLVPETNYVMMEMMFRIVSLFGRAEGCTWSELTDE 395

Query: 435 LLERKISY 442
            L  KI Y
Sbjct: 396 ELNIKILY 403


>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
 gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
          Length = 492

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 173/390 (44%), Gaps = 56/390 (14%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           Y++  + + GR++ A+ ++  G  I +E+P+++ P++     IC  C   L         
Sbjct: 8   YQVAKNEIFGRFIVASKNIKQGQLILKENPVLLCPQVG-GPIICFNCCAQLKKLFF---- 62

Query: 116 CPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQR 172
           CP C     C+  C    +   +  EC  LK      + ++D     E + PLRCL+ ++
Sbjct: 63  CPDCRIAVLCNPNCK---NQFHNLQECTALK---SLPIKHDDVLSNPEIVTPLRCLLWRQ 116

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPG-DVLDDTSAK--CL 229
            +  + +    +EAH++ R            I +  R N  + L    +LD++  K   +
Sbjct: 117 FNKNQLENFLQLEAHLNFRRDS--------QIWRRNRVNIEDVLFKLKILDESDLKEEVV 168

Query: 230 HWICGVIEVNGVDI------------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
             ICG++++N  D+                +GLY    LM H+C+ N   +      L+V
Sbjct: 169 QKICGILDINTFDVRQPQRNRLGFNQAENLRGLYLRAALMAHDCVANTHLAVDDDFVLYV 228

Query: 278 F------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
                    + I   YT+ L G   R+  L   K+F C C RC D TE GT  S++KC  
Sbjct: 229 HAAVDIPEGSPIYFNYTDVLQGNDERKRRLLNAKHFECQCSRCRDSTEFGTEMSSLKCPR 288

Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
            HK      +L  NP      W C SC    +   + +  ++   ++D+   E P +++L
Sbjct: 289 CHKG-----LLRPNP------WQCSSCQRIFHDSLIKMTVTECRRRIDEF--ETPTIENL 335

Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
           E+ L KL   FHP HY    V+ +L++LY 
Sbjct: 336 EKFLKKLSFTFHPQHYLVLEVEQNLVRLYA 365


>gi|195479030|ref|XP_002086547.1| GE22790 [Drosophila yakuba]
 gi|194186337|gb|EDW99948.1| GE22790 [Drosophila yakuba]
          Length = 395

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 150/296 (50%), Gaps = 30/296 (10%)

Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEK 215
           Y  I+ LRC+ L+ + P+ + +L  +E H+ +R   T +Y+        LR+N   F++ 
Sbjct: 24  YCVIMLLRCMQLKAKDPEAFLKLYTLEDHLKERLE-TPLYQ-------VLRANLITFIKT 75

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH--- 267
           + G  + D     +  I  +++ N  ++      R  + LY    ++ H+C+PN +H   
Sbjct: 76  VLG--MKDWLEMDILRIAAILDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFD 133

Query: 268 ---SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYF 324
              + +  +K  + +   +S  YT  L  T  RR HL   K F+CSC RC DP ELG++ 
Sbjct: 134 DDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFA 193

Query: 325 SAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE 384
            A  CL     +    I+ +NPL N + W C  C+ + +A++V    ++L ++++ L + 
Sbjct: 194 GAQTCLKCKAGK----IISLNPLLNSAPWKCQLCNYKRSAKEVVTSDAELQQELESLDKT 249

Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
            P   +LEE + +  A  H  + H    K++L QLYG+ PG+   +LS   L RK+
Sbjct: 250 TP--VALEEFIYRHRADLHETNTHVLQAKYALTQLYGSAPGFGMEELSEESLNRKV 303


>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
          Length = 485

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 122/444 (27%), Positives = 196/444 (44%), Gaps = 77/444 (17%)

Query: 30  KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVG 89
           KEV      H KE+    + +   + Y    S L GR + AT D+   + I  ++PLIVG
Sbjct: 10  KEVEEVFVSHLKENKIFQEQQLWTVGY----SSLGGRGMFATRDIKQNELIFIDAPLIVG 65

Query: 90  PKLALA-EPICLGCHK---PLNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAIL 144
           PK +     +C+ C+K   PL P       C + C  P CS +C    ++  H  EC  L
Sbjct: 66  PKCSNKHTKMCVCCYKNECPLFP-------CDKGCGLPVCSTQCE---NSPNHVNECEYL 115

Query: 145 KL--------GCETLLAYNDYKYEAILPLRCLIL---QRRSPKKYQELKDM----EAHMS 189
           +          C  LL        A +P+R L +   QR+     Q  K++    E  + 
Sbjct: 116 RSLVPICGTDWCLNLLL-------ATIPIRGLFMTKMQRKCLATLQYNKNLFSKYEIELL 168

Query: 190 KRG----PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR 245
           K+     P  E  E ++ + +   +N  E                     I V+  D   
Sbjct: 169 KKNVMNSPSDEDIEMMERVCRAFNTNSFE--------------------TIRVHDKDHST 208

Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST------MYTNALWGTQPRRE 299
             +GLY +  L  H C+PN +H    + +L+V     IST       YT+  W T  RR+
Sbjct: 209 NLRGLYPLGSLQNHCCIPNTRHYFDEEFQLYVRAALPISTGEEITMSYTSLFWDTTLRRQ 268

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
            L +TK F+C C+RCSDPTE  +   A+ C ++ K  G+  +LP NPL+  + W C  C+
Sbjct: 269 FLNVTKNFSCMCKRCSDPTEFNSKLGALLCASD-KCCGE--LLPKNPLNMRTSWICNKCT 325

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH-HYHCYAVKHSLIQ 418
             +N R +  + S L   +++++ + P  + + + L K  ++  P  +Y    VK  +I 
Sbjct: 326 ISVNHRQICSIRSGLAAIIEEVIYKTP--REIFKFLQKDLSIIVPQSNYLIMDVKFRIIS 383

Query: 419 LYGTQPGYAYTQLSSSLLERKISY 442
            YG      +  L+ + L+ K +Y
Sbjct: 384 YYGRSDDLQWKDLTDTELDIKANY 407


>gi|157124069|ref|XP_001660316.1| hypothetical protein AaeL_AAEL009751 [Aedes aegypti]
 gi|108874117|gb|EAT38342.1| AAEL009751-PA [Aedes aegypti]
          Length = 500

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 166/380 (43%), Gaps = 48/380 (12%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EI  S  LGRYL A  DL  G +I  E P++VGP    +   CL C +          R 
Sbjct: 9   EILESDELGRYLVAKRDLARGQQILVEQPIVVGPYWD-SGIRCLNCFRNSCTICRKCKRA 67

Query: 117 PRCFWPACSARCSGLSDAHTH-APECAI-LKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           P CF            D  TH   EC   L  G      ++ +    + P+RCL+L   +
Sbjct: 68  PLCF------------DCTTHNETECEFYLNSGLNINFLFDHFN--VVTPVRCLLLASSN 113

Query: 175 PKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD-DTSAKCLHWI 232
             K+ E+  MEAH  +R   T+++   +  +V+ L  N  +   G + +   +++ +  I
Sbjct: 114 RAKFDEMMTMEAHTEER-RDTDIWNAHEKKVVRPLLEN--KAFDGKMENLKVTSELIQRI 170

Query: 233 CGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
           CG  +VN  +I              +GLY    LM HNC+ N   S    + L ++  T 
Sbjct: 171 CGFFDVNSFEIRGNMDDAQQEESLIRGLYPKAALMAHNCVSNTLISVDGDANLRLYVTTP 230

Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
           +         YT +L+GT  RR HL + KYF C+C RC DPTELGT+ S++KC     D 
Sbjct: 231 VKKGEMLFYNYTRSLFGTFERRTHLKLGKYFVCTCSRCEDPTELGTHLSSVKCTG--CDD 288

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
           G C          ++ W C  C+  L    V+ V S   +   +      N+  LE ++ 
Sbjct: 289 GLCSYYA-----EENKWRCNKCAKELEETYVNNVLSDARKDAFECAN---NIDELERVIA 340

Query: 397 KLEAMFHPHHYHCYAVKHSL 416
           K     HP +     +K SL
Sbjct: 341 KHSETLHPCNSIILELKQSL 360


>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
          Length = 494

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 183/409 (44%), Gaps = 54/409 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHK---PLNPNLA 111
           + I+ S L GR + AT D+   + I  ++PLIVGP+ +     +C+ C+K   PL P   
Sbjct: 33  WTIRYSNLGGRGMFATRDIAANELIFIDAPLIVGPRCVGKHLEMCVCCYKSECPLFP--- 89

Query: 112 DNARCPR-CFWPACSARCSGLSDAHTHAPECAILKL---GCETLLAYNDYKYEAILPLRC 167
               C R C  P CS  C   S  H +  EC  L+     C T  + N     A++P+R 
Sbjct: 90  ----CDRGCGLPICSLECEN-SPKHVNY-ECNYLRSLVPTCGTDWSLN--LLLAVIPIRA 141

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           L +  +  K    L+  +   S+       YE     +  L+ N +EK P +       +
Sbjct: 142 LFMTEQQRKCLAALQCDQTLTSQ-------YE-----INLLQRN-VEKSPSE----ADVE 184

Query: 228 CLHWICGVIEVNGV--------DIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
            +  +CG    N          D     +GLY +  L  HNC PN +H      +L+V  
Sbjct: 185 LMKRVCGAFNSNSFETVSVLDKDHSTSLRGLYPLGALQNHNCAPNTRHHFDENYRLYVSA 244

Query: 280 DTHIS-----TM-YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
              IS     TM YT+  W T  RR+ L+ITK+F+C+C+RCSDPTE G+   A+ C    
Sbjct: 245 TVPISAGEELTMSYTSLFWDTTLRRQFLSITKHFSCTCKRCSDPTESGSMLCALLC---A 301

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
            D     +LP +PL   + W C  C   +N R +  + S L    ++ + ++P  +  + 
Sbjct: 302 FDNCSGNLLPRDPLRIGTPWVCDMCGKSINYRQICSIRSGLAAITEEALYKSPR-EIFKF 360

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
           M  +L  +    +Y    +K  +I  YG   G  +  L+   L+ K  Y
Sbjct: 361 MQKELFRIVPLSNYLIIDMKFRIISYYGRTEGLEWADLTDVELDTKSEY 409


>gi|170061022|ref|XP_001866059.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879296|gb|EDS42679.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 495

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 112/400 (28%), Positives = 172/400 (43%), Gaps = 64/400 (16%)

Query: 87  IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL-- 144
           +VGPK A + P+CLGCH+ L+     N  C  C WP C + C     +  H  EC +L  
Sbjct: 1   MVGPKTA-SVPVCLGCHRNLDATKQFN-NCQHCGWPMCGSSCE---TSRYHREECQLLAA 55

Query: 145 -----KLGCETLLAYNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
                ++G +       Y  Y  I+PLR ++L+R  P+ + ++  ME+H+  R   T +Y
Sbjct: 56  KNYLPQIGFDECNPGKKYSAYCTIVPLRAILLKRTDPQLWAKVVSMESHLETRK-NTPLY 114

Query: 199 EEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYS 252
             I S +V ++R      +   +  + S + L  + G+ + N  +I         + LY 
Sbjct: 115 AAIRSNVVPFVR------MVLGLRTEASEEELMEVTGIFDTNSFEIRIPEKSIKIRALYE 168

Query: 253 VICLMEHNCLPNAKHSNMMQSKLFVF-------------------------RDTHISTMY 287
           +  LM H C PN +H    +  L +                          +D  IS  Y
Sbjct: 169 LGALMNHCCQPNTRHYFDEEFNLVMIAAGRILEQLQDRTNSYFELLSVDIPKDEMISISY 228

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
              L  T  RR  +   K F C C RCSDPTE  TY  ++ C    K      ++ VNPL
Sbjct: 229 AQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCPQCSKS----MVVAVNPL 284

Query: 348 DNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLV-QENPNVKSLEEMLTKLEA--MF 402
           D  SDW C    CS R +A+           ++ +L     P+    E +L + ++  + 
Sbjct: 285 DFRSDWRCEDKKCSYRESAQQYITRNEAFRAEIGQLSCGAGPD--GYECLLGRFQSTDVL 342

Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYT--QLSSSLLERKI 440
           HP + +   VK++L QLYG           LS   L RK+
Sbjct: 343 HPWNTNVLQVKYALTQLYGGGGPGGINLKDLSEKQLRRKV 382


>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
 gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
           humanus corporis]
          Length = 487

 Score =  121 bits (304), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 172/384 (44%), Gaps = 52/384 (13%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-CPRC-FWP 122
           G ++ A+ DL PG+ I +E PLI+GP +  +  +CLGC + +   L +  + C  C   P
Sbjct: 3   GTFMVASKDLKPGEIILKEQPLILGPSIN-SNILCLGCFRRI---LFEKFKFCRECQIAP 58

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY---KYEAILPLRCLILQ-------- 171
            C   CS L   H+   EC   K   E      ++    ++ +LPL+C  L+        
Sbjct: 59  VCGEDCS-LKSQHSKY-ECKYYKKLFENDKIDKNFIVSNFQIVLPLKCFSLKLFLFPDDD 116

Query: 172 RRSPKKYQELKDMEAHMSKRG--PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
                 +    +ME+H+ +R   P    Y+E  +I+K L S        ++L+D     +
Sbjct: 117 DDDDHLWDNFINMESHVDERQNEPIWNYYQE--NIIKVLNS-------VNILNDEECLLM 167

Query: 230 HWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHS------NMMQS 273
             ICG+++VN  ++          G   +G+Y     M H+C+ N   S        + +
Sbjct: 168 QKICGILDVNSFEMRCPDDEQISAGNQLRGIYREAAFMSHDCIGNTHVSLNDFFEMTIHA 227

Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
            + +     I   Y N    T  R++ L   KYFNC C+RC+DPTELGTY  ++ C   H
Sbjct: 228 SVPIKSGQAIFFNYANPFECTAARQQRLKKGKYFNCFCKRCTDPTELGTYSGSIICPRCH 287

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           +      I+  NPL + S W C  C+     + +   T+  G+ +   V E  N   LE 
Sbjct: 288 EG----CIISQNPLSDSSGWNCEKCNHFFYNQLIK-ATNVEGQNLLSDVDET-NSTQLEN 341

Query: 394 MLTKLEAMFHPHHYHCYAVKHSLI 417
           +L K    F  ++Y    +K  LI
Sbjct: 342 LLKKCLKSFPQNNYVMTEIKQKLI 365


>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
          Length = 497

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 105/416 (25%), Positives = 183/416 (43%), Gaps = 58/416 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
           P+EIK S L G  + AT  +  G+ I ++ P+I+GP+ A   P+ C+ C+   +  L   
Sbjct: 27  PWEIKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGK 86

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL---PLRCLIL 170
               +C    CS +C     +  H  EC +++      ++  D   + +    P+R L+L
Sbjct: 87  ----KCGLLVCSEKCE---KSLGHQKECKVVRQWQSKPIS-EDLSVQLVRVLSPIRSLLL 138

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                   + LK   AH S +        E+D +   L  N         + +   K + 
Sbjct: 139 GEEDKSVVKCLK---AHKSDQHGF-----EVDVMKDALALN---------VKEDEEKFMR 181

Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFV 277
           ++C V++ N  ++           +GLY +  L  H+C PN       K   ++++  F+
Sbjct: 182 FVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            + + I   Y+  +W T  RR HL  TK+F C C+RC DPTE G+Y  ++ C    K   
Sbjct: 242 PQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSILC----KVCK 297

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV--KSLEEML 395
              ++P N L  D  W C  C + +   +V  +T  LG  ++    ++  V  + L + L
Sbjct: 298 TGKVIPTNSLQTDK-WQCEVCGSLIKKEEVANITFVLGSALNSFDDDDVAVMLRLLHQKL 356

Query: 396 ---------TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
                    TK   +   ++     +K+ LI + G Q G+A+  LS   L  K+++
Sbjct: 357 PNLSCCDYSTKTAKLVPENNQTVVELKYKLIWILGHQQGFAWNDLSDEYLSLKVNF 412


>gi|198458396|ref|XP_001361024.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
 gi|198136331|gb|EAL25600.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
          Length = 489

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 166/377 (44%), Gaps = 47/377 (12%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A +++  G+ +  E PL+V P    A+  C  C +           C +C  +P
Sbjct: 16  LGRHLVANINIDAGETLLEERPLLVAPHWECAQLKCAQCMEE------SYVMCRKCQVFP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
            C   CS  +D      EC     G     C+ LL  N      I  L  L+L    P  
Sbjct: 70  LC-MDCSQHNDF-----ECEFFASGAGSRICKDLLVKN----YGICALLKLLLLLEDPAT 119

Query: 178 -------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL--DDTSAKC 228
                  +Q   D+E +    G   E  E++  +   L S  +E LP   L  D   A C
Sbjct: 120 RADCQTLFQRSVDLEDYRDGEGMWQEHEEQV--VRPLLASGLVEALPTQQLTADVLHAHC 177

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTH 282
           +   C   EV G D G   +G++     + H+C+PN      +  NM + + + +     
Sbjct: 178 VRIDCNAFEVTGRD-GETLKGVFICGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDV 236

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           I   YTN L GT  R+  L ++++ +CSC RC D TE+GT+ S+MKC         C I 
Sbjct: 237 IYNSYTNPLMGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRECSAGFSICEIG 296

Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
           P    D   +W C  C+A L A DVH V ++ G   D LV     ++  E +L + + + 
Sbjct: 297 P----DGSGNWRCVDCNAILPAVDVHEVLAEAG---DALVDAKGELQVYESLLVRYQQLL 349

Query: 403 HPHHYHCYAVKHSLIQL 419
           HP+H+    +K ++  +
Sbjct: 350 HPNHFLLLDIKQNIASI 366


>gi|195154475|ref|XP_002018147.1| GL17551 [Drosophila persimilis]
 gi|194113943|gb|EDW35986.1| GL17551 [Drosophila persimilis]
          Length = 489

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 168/372 (45%), Gaps = 37/372 (9%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A +++  G+ +  E PL+V P    A+  C  C +           C +C  +P
Sbjct: 16  LGRHLVANMNIDAGETLLEERPLLVAPHWECAQLKCAQCMEE------SYVMCRKCQVFP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
            C   CS  +D      EC     G     C+ LL  N Y   A+L L  L+    +   
Sbjct: 70  LC-MDCSQHNDF-----ECEFFASGAGSRICKDLLVKN-YGICALLKLLLLLEDPATRAD 122

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVK--YLRSNFLEKLPGDVL--DDTSAKCLHWIC 233
            Q L +   ++     G  +++E +  V    L S  +E LP   L  D   A C+   C
Sbjct: 123 CQTLFERSVNLEDYRDGEGMWQEHEEQVVGPLLASGLVEALPTQQLTADVLHAHCVRIDC 182

Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTHISTMY 287
              EV G D G   +G++     + H+C+PN      +  NM + + + +     I   Y
Sbjct: 183 NAFEVTGRD-GETLKGVFVCGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDVIYNSY 241

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
           TN L GT  R+  L ++++ +CSC RC D TE+GT+ S+MKC         C I P    
Sbjct: 242 TNPLMGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRECSAGFSICEIGP---- 297

Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
           D   +W C  C+A L A DVH V ++ G   D LV     ++  E +L + + + HP+H+
Sbjct: 298 DGSGNWRCVDCNAILPAVDVHEVLAEAG---DALVDAKGELQVYESLLVRYQQLLHPNHF 354

Query: 408 HCYAVKHSLIQL 419
               +K ++  +
Sbjct: 355 LLLDIKQNIASI 366


>gi|270011595|gb|EFA08043.1| hypothetical protein TcasGA2_TC005637 [Tribolium castaneum]
          Length = 319

 Score =  118 bits (296), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 75/247 (30%), Positives = 122/247 (49%), Gaps = 18/247 (7%)

Query: 205 VKYLRSNFL-------EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLM 257
           ++YL   FL       E+    +L+D S + L  I   ++ N     +     Y    L+
Sbjct: 8   IEYLVQKFLTPLKKLEEETHQIILEDKSEETLGKIFNFLQSN-THHNKQLISFYPNASLL 66

Query: 258 EHNCLPNAKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFN--CS 310
           +H+C PN   +       +++   V   T ++  YT+   GT  R ++L   K  N  C 
Sbjct: 67  KHSCTPNTIFTTENFEITLRAISGVSEGTPLTCSYTDLFSGTHVRLKNL---KKLNKTCF 123

Query: 311 CERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
           C RC+DPTE GT+FSA+KC+   +D      LPVNP + ++ W C  C   L  + +   
Sbjct: 124 CARCTDPTEFGTFFSALKCMGTKQDPCGGNQLPVNPTNEETLWVCDKCKIELPHKQIVKF 183

Query: 371 TSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQ 430
              LG QV K +    +++ LEE L KL    HP+H++ +AVK++L++LYG +      Q
Sbjct: 184 VKHLGGQVGKTLARKVSIEELEEFLGKLLMFLHPNHFYIFAVKNALVELYGEESLATKIQ 243

Query: 431 LSSSLLE 437
           +   L++
Sbjct: 244 ICEELID 250


>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
 gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
          Length = 325

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/229 (32%), Positives = 115/229 (50%), Gaps = 31/229 (13%)

Query: 232 ICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
           + GV+EVN  +  R  +G     L+    ++ HNC+PN   S +  S+ +  R   +  +
Sbjct: 3   VVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTARS-IYPSEGYKIRLRAMVDL 60

Query: 287 ---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                    YT  L GT  R++HL   K+F C CERC DPTELGT+FS++KC       G
Sbjct: 61  EEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC-------G 113

Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVKS 390
            C   + +P  P + D+ W+C  C +  +  D   +   L  +V+  VQ  P     ++ 
Sbjct: 114 QCAEGFQVPRQPTEPDTSWSCAGCGSDTSNADALAMLQSLQSEVNA-VQALPMAAKRLEE 172

Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
           +E +L K +++ HP H+    ++  LI++YG   GY   QL    LERK
Sbjct: 173 IERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 221


>gi|307211936|gb|EFN87850.1| Protein msta, isoform B [Harpegnathos saltator]
          Length = 331

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 96/338 (28%), Positives = 156/338 (46%), Gaps = 47/338 (13%)

Query: 49  PKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL-ALAEPICLGCHK--- 104
           P+ +   + I+ SPL GR + A  D+ PG+ I  + PL++GP+  +   P+C+ C+K   
Sbjct: 20  PQDESRSWRIERSPLGGRGMFAMRDIQPGELIFMDVPLLLGPRCHSKYLPMCVVCYKNDC 79

Query: 105 PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK-LGCETLLAYNDYKYEAIL 163
           PL P   D+     C  P CS +C   S  H  A EC IL+ L       ++      ++
Sbjct: 80  PLFP--CDHG----CGLPICSTKCEN-SLMHARA-ECRILRELEPTCGSTWSKDLLLTVV 131

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
            +R L L   S ++ + L   E H++   P  E+             + L++    + D+
Sbjct: 132 LIRALAL---SSEQRRLLFAFECHLNL-APNYEI-------------DLLKRNITHLPDE 174

Query: 224 TSAKCLHWICGVIEVNGVDIGRYT--------QGLYSVICLMEHNCLPNAKHSNMMQSKL 275
                +  ICG+   N  ++            +G+Y +  L  H C+PN +H    Q +L
Sbjct: 175 EQLALMKRICGIFNTNSFEVVAVRDKDRVVSLRGIYPIGALQNHCCVPNTRHHFDDQQRL 234

Query: 276 F------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           +      +     I+  YT+ LW T+ RR  L +TK+F+C C RCSDP E G+   A+ C
Sbjct: 235 YMSAALPIAAGQEITMSYTDLLWDTRSRRCFLRVTKHFSCDCNRCSDPLEFGSQLGALLC 294

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV 367
             E+       +LP + L  +S WTC  C   ++ R V
Sbjct: 295 AKENCPGH---LLPCDSLSYESSWTCSECQTSVSHRQV 329


>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
          Length = 515

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 103/393 (26%), Positives = 175/393 (44%), Gaps = 61/393 (15%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL-NPNLAD 112
           + Y+++ S  LGRYL A  D+  G+RI  + P ++GP  +    +C  C+ PL N  L  
Sbjct: 3   VKYKVEKSEKLGRYLVAAKDIKAGERILSDQPFVLGPS-SDTSLVCFNCYLPLINKFLV- 60

Query: 113 NARCPRC-FWPACSARCSGLSDAHT---HAPECAI---LKL--GCETLLAYNDYKYEAIL 163
              C  C   P C     G  D  T   +  EC +   LKL  G   +    +    ++L
Sbjct: 61  ---CKNCAVAPLCPG--DGCPDEFTKYHNRQECDVFRNLKLTKGISPMTMVQNVGSLSVL 115

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
             R L+ +  +  +++   ++E H+ +R   + V++  D+ VK+++S       G + + 
Sbjct: 116 --RALLKKETNLLEWKLFMELETHLERRRE-SNVWQYYDNTVKFIQS------LGLLENG 166

Query: 224 TSAKCLHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHS----N 269
            +   +  IC  I+VN  ++              +G+Y    L+ H+C+ N   S    N
Sbjct: 167 QNQDLVQKICAAIDVNSFEVRGPPIPAIGCAEILRGVYLQAALLSHDCIANTHMSIDDNN 226

Query: 270 MM--QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
           M+   + + + +   I+  YT+ L GT PR++HL + KYF C+C RC+D TELGT+ S+ 
Sbjct: 227 MLVCHASVDIKKGESINYNYTDPLKGTIPRQQHLIVGKYFKCTCTRCTDNTELGTFMSSA 286

Query: 328 KCLNEHK----DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQ 383
            C    K      GD WI             C  C   + + D+        ++++  V 
Sbjct: 287 ICPGCKKGYITKNGDVWI-------------CFDCKKEVESMDIEPKIKCCSDKLE--VI 331

Query: 384 ENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
              + K LEE +  +  +  P HY     K  L
Sbjct: 332 NKKDEKELEEYIKYVSLVLAPGHYLLLDAKQRL 364


>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
 gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
          Length = 522

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 46/353 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNP-NLAD 112
           P++I +    GR L AT D+   + +  + P++VGP++   E I C  C   L    L  
Sbjct: 61  PWDIATFEGYGRGLVATRDIAVNELVFVDRPVLVGPRVNNYEVIFCASCCSILRRLQLCT 120

Query: 113 NARCPRCFWPACSARCS-GLSDAHTHAPECAILK-LGCETLLAYNDYKYEAILPLRCLIL 170
                 C  P CS RC   +  +  HA ECA+++    +    Y+     A+  +R  +L
Sbjct: 121 GG----CRLPICS-RCDYSVGASSPHAAECALIQSWQPKDAGRYSKNILYALTSIRGFLL 175

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
             R        + +   M    P  E+  EI+   + L+  +   L G+     + + L 
Sbjct: 176 SERD-------RSIVLQMEGHPPRKEMTTEIE---RLLKDGYFWNLDGE---GPAVRYLR 222

Query: 231 WICGVIEVNGVDIGRYT------------QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
            +  ++  N  +  R              +GLY +  LM H C PN ++    + ++ V 
Sbjct: 223 QVVNILNTNAFETSRIVADEENNQHEIILRGLYILGALMNHCCRPNVRYVFDGELRMRVH 282

Query: 279 ------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
                 +   I   Y+  LWG+Q R  HL  +K F C C+RC DPTE GTY  A+KC+ E
Sbjct: 283 ASRPIKKGEQIMNNYSKILWGSQHRIIHLCFSKNFLCCCDRCKDPTEFGTYLGALKCVRE 342

Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
              +G   +LPVNPL   S W C  CS +++    ++  S++ E   +++  N
Sbjct: 343 SCPEGR--LLPVNPLKISSVWRCDRCSLKMD----NIKASKIQEIASRMIMNN 389


>gi|290462845|gb|ADD24470.1| Protein msta, isoform A [Lepeophtheirus salmonis]
          Length = 486

 Score =  115 bits (287), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 96/423 (22%), Positives = 189/423 (44%), Gaps = 58/423 (13%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK--LPYEIKSSPLLGRYLQ 69
           +  C +C     + CS CK+   C       H++LH  +  +  LP++++   + GR L 
Sbjct: 1   MQTCFLCGSPVSNICSYCKDCYVCDT-----HFELHHQRELRICLPFDVEYDSVKGRILV 55

Query: 70  ATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-----CPRCFWPAC 124
           A+ D+ P + +  +   + GP L    P+CLGC + +      N       CP+CF+P+C
Sbjct: 56  ASRDIKPEEVVLFDKAAVFGPNLL---PVCLGCLEKIKSTRGKNGELHYSVCPKCFFPSC 112

Query: 125 SARCSGLSDAHTHAP-ECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKY 178
             +   +S+   H+  EC+I          + D++     Y  I  LR ++  +R+    
Sbjct: 113 GNK-ECISETSVHSKYECSIFA-ATGFKPKFKDFESPHFIYNVITILRLILEMQRNEDNR 170

Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
             +  +  H  +R      + E++  +K + S  ++ L    + + +   +    GV++ 
Sbjct: 171 ILVNQLMTHHDQR---VNKFPEVEIGIKRITSFLMDDLN---IPNVNGDDVIQAFGVMKT 224

Query: 239 NGVDIGRYT-QGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNAL 291
           N +  G    +GLY +  ++ H+C+ N +         +++SK+ + +   ++  Y  + 
Sbjct: 225 NAMTFGDGNGRGLYPIGSIISHSCVSNLEPLTEPMECLVLRSKISIKKGNELTIRYM-SF 283

Query: 292 WGTQPR-REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDND 350
           W  + + ++ L    +FN  CERC DP+++ T  S+  C    K +   ++ P+N L+ +
Sbjct: 284 WSHRTQLKDSLNNQWFFNYECERCQDPSDMKTMLSSELC---SKCKQKSYLKPINSLNLN 340

Query: 351 SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE------MLTKLEAMFHP 404
           +DW C  C              +  +QV  L+ E P  +S          + K E +FH 
Sbjct: 341 TDWQCDFCQE-----------VRAKKQVCDLLAEFPEPESFTSTDEITFFIKKSEKVFHA 389

Query: 405 HHY 407
           HH+
Sbjct: 390 HHF 392


>gi|195431056|ref|XP_002063564.1| GK21977 [Drosophila willistoni]
 gi|194159649|gb|EDW74550.1| GK21977 [Drosophila willistoni]
          Length = 487

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/396 (28%), Positives = 169/396 (42%), Gaps = 43/396 (10%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
            GR+L A LD+  GD +  E PL+V P     +  C  C +           C  C  +P
Sbjct: 16  FGRHLVAQLDIEAGDTLVEERPLLVAPHWECGQMKCAKCLQE------SYVMCRSCKMFP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPK 176
            C   C G  D      EC     G      C+ LL  N Y   A++    L+    + K
Sbjct: 70  LC-MDCQGHDDF-----ECQFFASGGPGGYICKDLLVKN-YGVCALMKFLLLLEDPSTRK 122

Query: 177 KYQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGD---VLDDTSAKCLHW 231
             + L +   ++     G  +++E +  ++   + S  L  L        DD  A C+  
Sbjct: 123 DCERLLETPINLDDYRDGDGMWQEHEELAVRPLMASGLLMSLTQAQSLTADDLHAYCIRI 182

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN--AKHSNMMQSKLFVFRDTH----IST 285
                EV   D G   +G++ V   + H+C+PN           KL+   D      I  
Sbjct: 183 DSYSFEVTARD-GDTLKGVF-VCGSLPHHCVPNTVVALDEEFNLKLYAAVDLKSGEVIYN 240

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YTN L GT  R+  L +T+   CSC RC DPTE GT  S++KC    ++  D + L   
Sbjct: 241 SYTNPLMGTSQRQHQLRLTRRLECSCSRCLDPTECGTNMSSVKC----RECPDGYAL--C 294

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
            L +  DW C  CSA L A DVH + ++LG     LV+    +++ E +L + + + HP+
Sbjct: 295 ELSSGGDWRCSKCSAILPAADVHELLAELG---GALVEAKGELQAFESLLAQYKTLLHPN 351

Query: 406 HYHCYAVKHSLIQ-LYGTQPGYAYTQLSSSLLERKI 440
           H+    +K ++   L  T       Q S  LL R++
Sbjct: 352 HFLMLDIKQNIASILRATALMNCMDQPSKKLLARRV 387


>gi|194756636|ref|XP_001960582.1| GF11432 [Drosophila ananassae]
 gi|190621880|gb|EDV37404.1| GF11432 [Drosophila ananassae]
          Length = 489

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 164/370 (44%), Gaps = 32/370 (8%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A +++ PGD +  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVAGMNIEPGDTLLEERPLLVTPHWECGQLKCAQCFQ------ESYVICRRCQVYP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE-- 180
            C   CS   D           K  C+ +L  N      I  L  L+L    P    +  
Sbjct: 70  LC-MDCSQHDDFECDFFASGAGKGLCKDILVKN----YGICGLLKLLLLLEDPATRPDCA 124

Query: 181 -LKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWICGV 235
            L +   ++     G  +++E +  V    + S  +E L    L  D   A C+      
Sbjct: 125 TLVETPVNLDDYRDGDGMWQEHEEQVVRPLMASGLVEVLSSQQLTCDSLHAHCIRIDSNA 184

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST------MYTN 289
            EV   D G   +G++     + HNC+PN   +   Q  + ++    + T       YTN
Sbjct: 185 FEVTARD-GDTLKGVFVCGASLPHNCVPNTVVALDEQFNMKLYAAVPLQTGDIIYNSYTN 243

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
            L GT  R+  L +++  +CSC RC DPTE+GT+ S+MKC    +    C    ++P   
Sbjct: 244 PLMGTSQRQHQLRLSRRLDCSCSRCLDPTEMGTHMSSMKCRECPQGYAVC---EMDPDGR 300

Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
             DW C  C A ++A +VH + +++G     LV+   +++  E +LT+ ++M HP+H+  
Sbjct: 301 PGDWRCLECKASMSAANVHELLAEVG---GALVEARGDLQVYESLLTQYKSMVHPNHFLL 357

Query: 410 YAVKHSLIQL 419
             +K ++  +
Sbjct: 358 LDIKQNIASI 367


>gi|312384987|gb|EFR29584.1| hypothetical protein AND_01294 [Anopheles darlingi]
          Length = 366

 Score =  110 bits (274), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 98/339 (28%), Positives = 153/339 (45%), Gaps = 36/339 (10%)

Query: 104 KPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK----Y 159
           KPL  N      C RC WP C   C    D   H  EC +       +   + Y     Y
Sbjct: 5   KPLEEN--KYLECERCGWPVCKRSCQ---DQRGHRAECELTVARGSKVSIQHFYVPHPLY 59

Query: 160 EAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
           + +LPLRCL+L    P +++ L  +++H  +   G+E +      V  L   F  K    
Sbjct: 60  QCLLPLRCLLLAESDPARWKALMRLQSH-DEEWRGSEQWRTDRRGVAQLIPRFF-KCDNR 117

Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLF 276
             +D   + +    G+I+VNG ++      +  +YSV  ++EH+C PN   S   Q ++ 
Sbjct: 118 WQEDDILQAI----GIIQVNGHEVPLTEPASVAIYSVASMLEHSCRPNLAKSFSDQGEII 173

Query: 277 VF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
           ++        T +S  YT+ LW T  RR HL  TK F+C+C+RC DPTE GT+FSA++C 
Sbjct: 174 LWAPVAIAEGTRLSISYTDVLWTTAHRRAHLLQTKLFDCACDRCLDPTEFGTFFSALRCS 233

Query: 331 N-EHKDQGD----CWILPVN-PLDNDSDWTCGSCSARLNARDVHLVTSQLGE--QVDKLV 382
              + DQ       ++LP +  +  D DW CG C    + +    V +Q+ E  ++D   
Sbjct: 234 GFRNSDQSSQMCKGYLLPCDISVQGDDDWVCGRC----HGKQPGTVVAQILERARIDIQA 289

Query: 383 QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
            +        + L        P+H +    K +L Q  G
Sbjct: 290 MQKQREDHCVKYLRHYAKWLTPNHQYMVEAKMALSQAIG 328


>gi|195335119|ref|XP_002034223.1| GM21750 [Drosophila sechellia]
 gi|194126193|gb|EDW48236.1| GM21750 [Drosophila sechellia]
          Length = 488

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 168/378 (44%), Gaps = 49/378 (12%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ + PGD I  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVASIGIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69

Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
            C        D + H   EC     G     C+ +L  N      I  L  L+L   +P 
Sbjct: 70  LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN----FGICGLLKLLLLLENPS 118

Query: 177 KY---QELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHW 231
                Q L D+  ++S    G  +++E + +V    + S   + LP   L   S+  LH 
Sbjct: 119 TKADCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQGL---SSDALHA 175

Query: 232 ICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT------ 281
            C  I+ N  ++    G   +G++     + H+C+PN   +   Q  + ++         
Sbjct: 176 HCIRIDSNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGD 235

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
            I   YTN L GT  R+  L +++   C+C RC DPTE+GT+ S++KC    K+ G   +
Sbjct: 236 IIYNSYTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKC----KECGGFSV 291

Query: 342 LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
             ++      DW C  C A L A +VH + +++G     LV    +++  E +LT+   +
Sbjct: 292 CEIDSNGKLGDWRCPDCRALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPL 348

Query: 402 FHPHHYHCYAVKHSLIQL 419
            HP+H+    +K ++  +
Sbjct: 349 LHPNHFMLLDIKQNIASI 366


>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
          Length = 397

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 160/361 (44%), Gaps = 53/361 (14%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           + + +  SPL GR L AT D+  G+ +  + PL+ GP+  +   +  GC    N +    
Sbjct: 28  VKWAVYESPLGGRGLVATQDIKAGEVLFVDYPLVYGPRSGII--VQRGCTVCKNIDSDIF 85

Query: 114 ARCPRCFWPACSARCSGLSDAH-------THAPECAILKLGCETLLAYNDYKYEAILPLR 166
            +C +C    CS +C   SD H       +H P    ++   +TLL+       A+  +R
Sbjct: 86  FKCSKCALILCSVQCQN-SDFHSGDCSIISHWPNKVPIEEVDDTLLS------RALTAIR 138

Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
            L+L     +KY         + + G       EI  + +Y           D+      
Sbjct: 139 ALLLNE--DQKYLLTSLQANKLPQYG------SEIRDLKQYF----------DIPLHEEE 180

Query: 227 KCLHWICGVIEVNGVDIGR-------YTQGLYSVICLMEHNCLPNAKH------SNMMQS 273
             +  IC ++  N   +           +GLY V  ++ HNC+PN ++         +++
Sbjct: 181 FMILVIC-ILNTNAFQMATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQMTVKA 239

Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
              +   + I T Y+  LWGT  RR +L  +K+F C CERC+DPTE GT  +A+KC +  
Sbjct: 240 TKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHFLCDCERCADPTERGTLLAALKCFST- 298

Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
           + QG   +LP+ PL   + W C  C  R+   ++ ++ S LG  +  L  +  NV  LE 
Sbjct: 299 ECQGS--LLPIQPLKTTTAWRCLECGMRVPNDNICVIQSALGSLMGSL--DLKNVDELEN 354

Query: 394 M 394
            
Sbjct: 355 F 355


>gi|225717680|gb|ACO14686.1| msta, isoform A [Caligus clemensi]
          Length = 521

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 119/493 (24%), Positives = 204/493 (41%), Gaps = 65/493 (13%)

Query: 15  CAVCRETALHKCSAC---KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQAT 71
           C +C + +   C      + V +C K+H++ H       C   PY +  S  +GR+L A+
Sbjct: 14  CFICHQPSSDSCRKSNHDETVYFCSKEHERLHSGDKTSSC--FPYRVTKSHQVGRFLVAS 71

Query: 72  LDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS-- 129
             +  G+ I +E P IVGP   ++ PIC+ C+K + P     + CP+C +P C   C   
Sbjct: 72  RKILRGEIIFQELPGIVGPNPVVSSPICVACYKQVGP----RSSCPKCGFPLCEESCERD 127

Query: 130 -GLSDAHTHAPECAIL-KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH 187
            G    H H     +  +  C++ + +      AILPL+ L+ +  +P  Y  L  +  H
Sbjct: 128 HGFICEHLHRFGIDLTPEEMCDSSILH------AILPLKLLLSKESNPHVYNCLSILMDH 181

Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
              R    E  E I + +  L+S  LE+       D   K L    G++EVN  ++   T
Sbjct: 182 REDRLNDPEYLEGIQTPI--LKS--LERSLSTYSTDELLKAL----GIVEVNSYEVYNLT 233

Query: 248 -----QGLYSVICLMEHNCLPNAK--------HSNMMQSKLFVFRDTHISTMYTNALWGT 294
                +GL+ +  L+ H+C+P  +        +   M S   +     +S  Y +     
Sbjct: 234 GHSGFRGLFCLTSLLSHDCVPKRRPIIGHETPYGLTMISTRNINEGEILSINYVHTQKPN 293

Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT 354
           + RR  L    YF CSC+RC D  E G +  A+ C         C    + P  +   + 
Sbjct: 294 RIRRRTLKDNWYFECSCKRCEDEAEFGLHPDAIWC-------DSCQEGLIKP-SHKGIYC 345

Query: 355 CGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKH 414
           C  CS   +   +  +   L  Q  ++ ++  +V    E+  +   +   +HY   + + 
Sbjct: 346 CDKCSREKSQGCIEDLIESLESQKAEIDRQ--DVNQYLELYRRSSVLLGENHYFLNSPRR 403

Query: 415 SLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLN---PNLADNA 471
            +I L+  +P  +  +L           V SP++      +C G  + LN   P L+ N 
Sbjct: 404 WIIPLF-CRPLTSSPRL-----------VPSPDMLRDKIQLCEGYLRLLNKIEPGLSKNR 451

Query: 472 RCPRCFWPACSAR 484
                 W  CS +
Sbjct: 452 GKVLYEWAECSLK 464


>gi|194882337|ref|XP_001975268.1| GG20654 [Drosophila erecta]
 gi|190658455|gb|EDV55668.1| GG20654 [Drosophila erecta]
          Length = 488

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 167/372 (44%), Gaps = 37/372 (9%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ + PG+ +  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVASISIEPGETVLEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKY---Q 179
            C   CS   +           K  C+ +L  N      I  L  L+L   +P      Q
Sbjct: 70  LC-MDCSQHDEFECEFFASGAGKALCKDILVKN----FGICGLLKLLLLLENPNTQADCQ 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
            L D+  ++S    G  +++E + +V    + S  ++ LP   L   ++  LH  C  I+
Sbjct: 125 MLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLVDVLPTQQL---TSDVLHAHCIRID 181

Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTMY 287
            N  ++    G   +G++     + H+C+PN   +   Q  + ++          I   Y
Sbjct: 182 SNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNSY 241

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
           TN L GT  R+  L +++   CSC RC DPTE+GT+ S++KC    K+ G   +  ++  
Sbjct: 242 TNPLMGTSQRQHQLRLSRRLECSCSRCLDPTEMGTHMSSIKC----KECGGFSVCEIDLN 297

Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
               DW C  C A L A +VH + +++G     LV    +++  E +LT+   + HP+H+
Sbjct: 298 GKLGDWRCPDCRALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNHF 354

Query: 408 HCYAVKHSLIQL 419
               +K ++  +
Sbjct: 355 MLLDIKQNIASI 366


>gi|195584108|ref|XP_002081857.1| GD11241 [Drosophila simulans]
 gi|194193866|gb|EDX07442.1| GD11241 [Drosophila simulans]
          Length = 488

 Score =  108 bits (269), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 98/373 (26%), Positives = 165/373 (44%), Gaps = 39/373 (10%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ + PGD I  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVASIGIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69

Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
            C        D + H   EC     G     C+ +L  N +    +L L  L+    +  
Sbjct: 70  LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN-FGICGLLKLLLLLENPSTKA 121

Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLP--GDVLDDTSAKCLHWI 232
             Q L D+  ++S    G  +++E + +V    + S   + LP  G   D   A C+   
Sbjct: 122 DCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQGLTSDALHAHCIRID 181

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTM 286
               EV   D G   +G++     + H+C+PN   +   Q  + ++          I   
Sbjct: 182 SNSFEVTAKD-GDTLKGVFIWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNS 240

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YTN L GT  R+  L +++   C+C RC DPTE+GT+ S++KC    K+ G   +  ++ 
Sbjct: 241 YTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKC----KECGGFSVCEIDS 296

Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
                DW C  C A + A +VH + +++G     LV    +++  E +LT+   + HP+H
Sbjct: 297 NGKLGDWRCPDCRALITAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNH 353

Query: 407 YHCYAVKHSLIQL 419
           +    +K ++  +
Sbjct: 354 FMLLDIKQNIASI 366


>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
 gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
          Length = 787

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 143/328 (43%), Gaps = 47/328 (14%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK--PLNP 108
           KK  + + +S + GR + AT D+  G+ I RE  L+VGP     +   C+ CH+  P   
Sbjct: 378 KKPNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRLLPTQG 437

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L       RC  P C+A CSG   + TH  EC   +          D + E + PL   
Sbjct: 438 FLCKQ----RCTLPVCAA-CSG---SVTHRAECEHFR---RWQPKDADAEEEQVNPLSLR 486

Query: 169 ILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           IL         K+ + L D     ++RG      +EI    +  R NF         D  
Sbjct: 487 ILTAVRVFHLGKEQRHLVDAMQANAERG----FRQEIIKAAQCFR-NF------PTTDKP 535

Query: 225 SAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKL 275
               L  I GV+  N  +      GR T  +GL+ +  +M H C PNA H   N   + +
Sbjct: 536 FMDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAIV 595

Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
              RD      I+T YT  LWG   R   L +TK+F C C RC+D TE GTY SA+ C  
Sbjct: 596 RAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFC-- 653

Query: 332 EHKDQG-DCWILPVNPLDNDSDWTCGSC 358
             ++QG    ++PV       DW C SC
Sbjct: 654 --REQGCRGLVIPVQTRTLQPDWRCLSC 679



 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 108/477 (22%), Positives = 170/477 (35%), Gaps = 93/477 (19%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPN 109
           K+  + +  SP+ GR + AT D+  G+ + RE  ++VGP    +  +  C  C++ +   
Sbjct: 49  KEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVPGQ 108

Query: 110 LADNARC-PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---- 164
               A C   C  P C     G S  H         K  C+  L +     E I P    
Sbjct: 109 TDAEALCLAGCGLPVCGE--CGKSPRH---------KTECDLFLKWKPKVTERIDPRSLR 157

Query: 165 ----LRCLIL---QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP 217
               +RC  L   QRR     Q   D   +M +       +E+     + L   +     
Sbjct: 158 ILSVVRCFFLDEQQRRLLYAMQANAD-RYYMKEVERAAACFEDFPREQEMLDFFYRTICA 216

Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
            +     S  C         V G ++    + L+ +  L+ H C PNA H       + V
Sbjct: 217 FNTNAFESRSC---------VEGHEV--LIRALFPLAGLLNHQCTPNAGHHFQDGETIIV 265

Query: 278 FR------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT-----YFSA 326
                      I+  Y   LW T  R+  L +TK F C C RC DPT L +     +  A
Sbjct: 266 CATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRCQDPTSLTSSGIIGHHRA 325

Query: 327 MKCLNEHKDQGDCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQE 384
              + E + +   W    N ++  +        C      +   L+   LGE    L ++
Sbjct: 326 SSGIAEQRQRTTTW---TNRMNTTATMLAHMAPCRDTTPEQLAQLIDMHLGE----LREK 378

Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQL-SSSLLERKISYV 443
            PN                      + +  S +   G    +A   + +  L+ R+ + V
Sbjct: 379 KPN----------------------WTLAASTVAGRGV---FATRDIVAGELIFRERALV 413

Query: 444 MSPNLKATDEPICLGCHK--PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
           + P  +      C+ CH+  P    L       RC  P C+A CSG   + TH  EC
Sbjct: 414 VGPTARKGQLSTCVCCHRLLPTQGFLCKQ----RCTLPVCAA-CSG---SVTHRAEC 462


>gi|24654325|ref|NP_611181.1| CG9640, isoform A [Drosophila melanogaster]
 gi|442623987|ref|NP_001261039.1| CG9640, isoform B [Drosophila melanogaster]
 gi|7302840|gb|AAF57914.1| CG9640, isoform A [Drosophila melanogaster]
 gi|54650638|gb|AAV36898.1| RE22408p [Drosophila melanogaster]
 gi|440214465|gb|AGB93571.1| CG9640, isoform B [Drosophila melanogaster]
          Length = 488

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 165/376 (43%), Gaps = 45/376 (11%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ + PGD I  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVASIAIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69

Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
            C        D + H   EC     G     C+ +L  N +    +L L  L+   R+  
Sbjct: 70  LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN-FGICGLLKLLLLLENPRTKG 121

Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWI 232
             Q L D+  ++S    G  +++E + +V    + S   + LP   L  D   A C+   
Sbjct: 122 DCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQELTSDALHAHCIRID 181

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTM 286
               EV   D G   +G++     + H+C+PN   +   Q  + ++          I   
Sbjct: 182 SNSFEVTAKD-GDTLKGIFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNS 240

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
           YTN L GT  R+  L +++   C C RC DPTE+GT+ S++KC        +C    V  
Sbjct: 241 YTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKC-------KECPGFSVCE 293

Query: 347 LDND---SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
           +D++    DW C  C A L A +VH + + +G     LV    +++  E +LT+   + H
Sbjct: 294 IDSNGKLGDWRCPDCRALLTAAEVHELQAAVG---SALVDAMGDLQVYEALLTQYGPLLH 350

Query: 404 PHHYHCYAVKHSLIQL 419
           P+H+    +K ++  +
Sbjct: 351 PNHFMLLDIKQNIASI 366


>gi|170067842|ref|XP_001868639.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863902|gb|EDS27285.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 504

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 165/385 (42%), Gaps = 74/385 (19%)

Query: 32  VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
           VA+ G+       KL+ P+    P++I      GR L AT D+   + I  + P++VGP+
Sbjct: 29  VAFLGE-------KLYSPE---QPWDIAVFEGYGRGLVATRDIAVNELIFLDRPILVGPR 78

Query: 92  LALAEPI-CLGCHKPLNPNL---ADNARCPRCFWPACSARCSGLSDAHT-HAPECAIL-- 144
           +   + I C  C + +   L   +   R P C      A C   +   T H  EC ++  
Sbjct: 79  VNNYDVIFCASCCR-IQKRLTLCSGGCRLPIC------ADCDRSAAPETPHTAECTVITS 131

Query: 145 ------KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
                 K  C+T+L        A+  +R ++L     +    + +ME H     P  ++ 
Sbjct: 132 WQPKDQKRYCKTILY-------ALTSIRAMLLNDLESRI---VFNMEGH----APRKDMI 177

Query: 199 EEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ---------- 248
            EID   + L+      LP    + T    L  I  V+  N  +  R  Q          
Sbjct: 178 TEID---RLLKDQIFANLPEGSANLTF---LRRIVNVLNTNAFETFRIVQDEENNDHEII 231

Query: 249 --GLYSVICLMEHNCLPNAKHSNMMQSKLF------VFRDTHISTMYTNALWGTQPRREH 300
             GLY +  LM H+CL N ++       ++      + +   I   Y+  LWGTQ R  H
Sbjct: 232 LRGLYILGALMNHHCLANVRYVFDENQVMWCHASRPIRKGEQIFNNYSKVLWGTQHRIIH 291

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
           L  +K+F C CERC D TELGTY  A+KC+ E    G   ++ +NPL   S W C  C  
Sbjct: 292 LWFSKHFLCECERCRDVTELGTYLGALKCVRERCPNGR--LVSLNPLKIASAWQCDKCGL 349

Query: 361 RLNARDVHLVTSQLGEQVDKLVQEN 385
           ++     +L  +++ E   ++V  N
Sbjct: 350 QMG----NLKVTKIQEIAGRMVLNN 370


>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
          Length = 478

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 39/326 (11%)

Query: 60  SSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-----PICLGCHKPLNPNLADNA 114
           S+ + GR + AT D   GD I  + PLIV P+    +     P+C  C+  +        
Sbjct: 33  SATVAGRGVVATRDYSVGDVIFVDVPLIVSPRAMGMDGTGSWPVCAVCYAVMMTPTG--- 89

Query: 115 RCPR-CFWPACSARCSGLSDAHTHAPECA-ILKLGCETLLAYNDYK---YEAILPLRCLI 169
            CP  C  P C  +C+   D   H  EC  + +L  +T      +    Y AI  +R L 
Sbjct: 90  -CPGGCRLPVCGRQCA---DRPEHRDECRYVRQLRPKTKSDGQTWSVGIYNAITAVRGLS 145

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL--PGDVLDDTSAK 227
           ++     KY+   D+   + K+      +E     V+ L+ N   +L    + +  T+ K
Sbjct: 146 IR---DGKYRYFLDV---LQKKSTDKLTFE-----VEELKRNISMELDKEDEQIMVTTCK 194

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS------NMMQSKLFVFRDT 281
            +   C    VN  +     +GLY V   M H C+PN  H+       + ++ L +++  
Sbjct: 195 IMDANCFETIVNRSNKLTSLKGLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQ 254

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
            I+T YT ++W T  R+ HL  +K F C+C RC D  E GT  +A+ C+ ++ D     I
Sbjct: 255 EITTSYTYSIWPTSLRQNHLLTSKQFICTCSRCCDTEEFGTKLAALNCIVKNCDGR---I 311

Query: 342 LPVNPLDNDSDWTCGSCSARLNARDV 367
           LP+NPLD  S W C  C+  +++ ++
Sbjct: 312 LPINPLDKKSIWQCKMCTKLVSSTEM 337


>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
          Length = 387

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 90/351 (25%), Positives = 156/351 (44%), Gaps = 47/351 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
           P+EIK S L G  + AT  +  G+ I ++ P+I+GP+ A   P+ C+ C+   +  L   
Sbjct: 27  PWEIKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGK 86

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL---PLRCLIL 170
               +C    CS +C     +  H  EC +++      ++  D   + +    P+R L+L
Sbjct: 87  ----KCGLLVCSEKCE---KSLGHQKECKVVRQWQSKPIS-EDLSVQLVRVLSPIRSLLL 138

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                   + LK   AH S +  G EV    D++   ++ +               K + 
Sbjct: 139 GEEDKSVVKCLK---AHKSDQH-GFEVDVMKDALALNVKED-------------EEKFMR 181

Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFV 277
           ++C V++ N  ++           +GLY +  L  H+C PN       K   ++++  F+
Sbjct: 182 FVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            + + I   Y+  +W T  RR HL  TK+F C C+RC DPTE G+Y  ++ C    K   
Sbjct: 242 PQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSILC----KVCK 297

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV 388
              ++P N L  D  W C  C + +   +V  +T  LG  ++    ++  V
Sbjct: 298 TGKVIPTNSLQTDK-WQCEVCGSLIKKEEVANITFVLGSALNSFDDDDVAV 347


>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
 gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
          Length = 844

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 143/328 (43%), Gaps = 47/328 (14%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK--PLNP 108
           KK  + + +S + GR + AT D+  G+ I RE  L+VGP     +   C+ CH+  P   
Sbjct: 435 KKPNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRLLPTQG 494

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L       RC  P C+A CSG   + TH  EC   +          D + E + PL   
Sbjct: 495 FLCKQ----RCTLPVCAA-CSG---SVTHRAECEHFR---RWQPKDADAEEEQVNPLSLR 543

Query: 169 ILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           IL         K+ + L D     ++RG      +EI    +  R NF         D  
Sbjct: 544 ILTAVRVFHLGKEQRHLVDAMQANAERG----FRQEIIKAAQCFR-NF------PTTDKP 592

Query: 225 SAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKL 275
               L  I GV+  N  +      GR T  +GL+ +  +M H C PNA H   N   + +
Sbjct: 593 FMDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAIV 652

Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
              RD      I+T YT  LWG   R   L +TK+F C C RC+D TE GTY SA+ C  
Sbjct: 653 RAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFC-- 710

Query: 332 EHKDQG-DCWILPVNPLDNDSDWTCGSC 358
             ++QG    ++PV       DW C SC
Sbjct: 711 --REQGCRGLVIPVQTRTLQPDWRCLSC 736



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/287 (24%), Positives = 106/287 (36%), Gaps = 43/287 (14%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPN 109
           K+  + +  SP+ GR + AT D+  G+ + RE  ++VGP    +  +  C  C++ +   
Sbjct: 49  KEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVPGQ 108

Query: 110 LADNARC-PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---- 164
               A C   C  P C     G S  H         K  C+  L +     E I P    
Sbjct: 109 TDAEALCLAGCGLPVCGE--CGKSPRH---------KTECDLFLKWKPKVTERIDPRSLR 157

Query: 165 ----LRCLIL---QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP 217
               +RC  L   QRR     Q   D   +M +       +E+     + L   +     
Sbjct: 158 ILSVVRCFFLDEQQRRLLYAMQANAD-RYYMKEVERAAACFEDFPREQEMLDFFYRTICA 216

Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
            +     S  C         V G ++    + L+ +  L+ H C PNA H       + V
Sbjct: 217 FNTNAFESRSC---------VEGHEV--LIRALFPLAGLLNHQCTPNAGHHFQDGETIIV 265

Query: 278 FR------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
                      I+  Y   LW T  R+  L +TK F C C RC DPT
Sbjct: 266 CATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRCQDPT 312


>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
 gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
          Length = 445

 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 140/325 (43%), Gaps = 49/325 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + IKSS + GR + AT D+  G+ I RE  L+ GP     +   C+ CH+ L P      
Sbjct: 40  WTIKSSKVAGRGVFATRDIAAGELIFRERALVTGPTARKGQLSSCVCCHQML-PQSGFLC 98

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDYKYEAILPLRCLI 169
           R  RC  P C A CS   DA  H  EC        K   E L   N      +  +R   
Sbjct: 99  R-HRCTLPVC-ADCS---DAEIHRAECEHFRRWQPKDADEELEQVNPLSLRILTAVRVFH 153

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN--FLEKLPGDVLDDTSAK 227
           L     K+ + L D     ++RG   E+ +       +  ++  F+++L           
Sbjct: 154 LS----KEQRHLVDAMQANAQRGYRQEIIKAAQCFRNFPTTDREFMDQL----------- 198

Query: 228 CLHWICGVIEVNGVDI----GRYT---QGLYSVICLMEHNCLPNAKH--SNMMQSKLFVF 278
               I GV+  N  +     G +    +GL+ +  +M H C PNA H   N   + +   
Sbjct: 199 --FRIVGVLNTNAFEAPCRSGGHEVLLRGLFPLTAIMNHECTPNASHYFDNGTLAVVRAA 256

Query: 279 RDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
           RD      I+T YT  LWG   R   L +TK FNC C RC+D TE GTY SA+ C    +
Sbjct: 257 RDIPKGGEITTTYTKILWGNLTRGIFLKMTKNFNCDCVRCNDNTENGTYLSALFC----R 312

Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
           +QG    ++PV       DW C SC
Sbjct: 313 EQGCRGLVIPVQTRTLQPDWKCLSC 337


>gi|91088051|ref|XP_966825.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270012083|gb|EFA08531.1| hypothetical protein TcasGA2_TC006184 [Tribolium castaneum]
          Length = 331

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)

Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
           +D      +H +CG++++N +++    G   + LY  +  + HNC+ N   +        
Sbjct: 1   MDQFCDNLIHTVCGILDLNAIEVRASSGYLIRCLYPKLSQISHNCVTNLFQTVTPEDFKI 60

Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +++ + +  +  +   Y   LW T  RR+ L   K  +C C+RC D TEL T+ S +KC
Sbjct: 61  TLKASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 120

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
                D G   +L  +PL +  DW C  C  + NA  V  V   +  +++  ++V     
Sbjct: 121 --SKCDNG--ILLSSDPLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 176

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++  E +  K  ++FHP + +   ++  L QLYG  PGY   +L   L E K+
Sbjct: 177 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 229


>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
 gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
          Length = 460

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/339 (28%), Positives = 141/339 (41%), Gaps = 57/339 (16%)

Query: 52  KKLP-YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNP 108
           ++LP + +  SP+ GR + AT D+  G+ I RE  L+VGP       +  C+ CH+ L  
Sbjct: 39  EELPNWTLAPSPVAGRGVFATRDIAAGELIFRERALVVGPTARKGSQLNTCVCCHRLLG- 97

Query: 109 NLADNARCP-RCFWPACSARCSGLSDAHTH-------------APECAILKLGCETLLAY 154
             A    CP RC  P C   C+  S+AH +             + EC  L    E +   
Sbjct: 98  --AKQFLCPHRCTLPVC-GECAD-SEAHHNECAHFQRWQPKDLSEECKELSPTGEPVDVV 153

Query: 155 NDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE 214
           N      +  +R   L     K+ + L D     ++RG   E+ +      K+       
Sbjct: 154 NPLSLRILTAVRVFYLS----KEQRALVDAMQANAERGYRQEIIKAAQCFRKF------- 202

Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKH 267
                  D      L  + GV+  N  +            +GL+ +  +M H C PNA H
Sbjct: 203 ----PTTDKPFMDQLFRVVGVLNTNAFEAPCRVDGHESLLRGLFPLTAIMNHECTPNASH 258

Query: 268 SNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
                 +L V R          I+T YT  LW    R   L +TKYF C+C+RC+D +E 
Sbjct: 259 Y-FENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSEN 317

Query: 321 GTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSC 358
           GTY SA+ C    ++QG    ++PV       DW C SC
Sbjct: 318 GTYLSALFC----REQGCKGLVIPVQTKTLQPDWRCLSC 352


>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
          Length = 421

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 119/257 (46%), Gaps = 30/257 (11%)

Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-------GRYT---QGLYSVI 254
           +  L+ N + KLP +       + +   CGV   N  ++         YT   +GLY + 
Sbjct: 98  IDLLKRN-VAKLPSE----EQMELMRRTCGVFNTNSFEVVVVPSRDNDYTTSLRGLYPMG 152

Query: 255 CLMEHNCLPNAKHSNMMQSKLFV------FRDTHISTMYTNALWGTQPRREHLAITKYFN 308
            L  H C+PN +H    Q +L+V           I+  YT+ LW T  RR+ L +TK F+
Sbjct: 153 ALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLKVTKRFS 212

Query: 309 CSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV- 367
           C+C RCSDP E G+  S + C    KD     +LP NPL+++S W C  C   +N R V 
Sbjct: 213 CNCNRCSDPLEFGSQLSTLLC---AKDNCSGCLLPRNPLNHESSWICDKCQISVNYRQVI 269

Query: 368 HLVTSQLGEQVDKLVQENPN--VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
             + S L   V   + + P   ++  E +L++L       +Y    +K+ +I  +G  P 
Sbjct: 270 ECIHSGLNTFVSDAMYKTPREILRFTEAILSRL---VPATNYIMMDMKYRIISYFGRVPD 326

Query: 426 YAYTQLSSSLLERKISY 442
             +  L+ + L  K  Y
Sbjct: 327 LKWENLTDTELRIKKMY 343


>gi|157117568|ref|XP_001658830.1| hypothetical protein AaeL_AAEL008032 [Aedes aegypti]
 gi|108876001|gb|EAT40226.1| AAEL008032-PA [Aedes aegypti]
          Length = 527

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 150/353 (42%), Gaps = 56/353 (15%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
           P+++      GR + AT D+   + I  + P++VGP++   + I C  C + L   L   
Sbjct: 66  PWDLGVFEGSGRGMVATRDIAENELIFMDRPILVGPRVNNYDVIFCASCCR-LQKKLTLC 124

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILK--------LGCETLLAYNDYKYEAILPL 165
           +    C  P CS      +   +H  EC  +           C+T+L        A+  +
Sbjct: 125 S--GGCRLPVCSECDISAAPETSHTAECRRINSWEPKDQNRYCKTILY-------ALTSI 175

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
           R  +L   S       + +  +M    P  ++  E+D I   L+     KLP D  D   
Sbjct: 176 RGFLLDDLS-------RSIVLNMQAHPPRKDMTTEVDRI---LKDQVFNKLPEDGSD--- 222

Query: 226 AKCLHWICGVIEVNGVDIGRYTQ------------GLYSVICLMEHNCLPNAKHSNMMQS 273
            + L  +  V+  N  +  R  +            GLY +  LM H C PN ++  +   
Sbjct: 223 VQYLRKVIDVLNTNAFETFRMVKDEDNNDHEIVLRGLYILGALMNHCCRPNIRY--VFDE 280

Query: 274 KLF--------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
           KL         + +   I   Y+ +LWGTQ R  HL  +K+F C C+RC DPTE+GTY  
Sbjct: 281 KLVMHCYASKPIRKGEQIFNNYSKSLWGTQHRIIHLCFSKHFLCGCDRCLDPTEMGTYLG 340

Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
           A+KC+ +   QG   ++ +NP    S W C +C  ++N   +  +    G  +
Sbjct: 341 ALKCVRDSCPQGR--MVSMNPQKISSAWQCDTCGLKMNNLKITRIQEIAGRMI 391


>gi|195488851|ref|XP_002092488.1| GE11639 [Drosophila yakuba]
 gi|194178589|gb|EDW92200.1| GE11639 [Drosophila yakuba]
          Length = 488

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 168/392 (42%), Gaps = 34/392 (8%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ + PGD I  E PL+V P     +  C  C +           C RC  +P
Sbjct: 16  LGRHLVASISIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKY---Q 179
            C   CS   +           K  C+ +L  N      I  L  L+L   +P      Q
Sbjct: 70  LC-MDCSQHDEFECEFFASGAGKALCKDILVKN----FGICGLLKLLLLLENPNTQADCQ 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWICGV 235
            L  +  ++     G  +++E + +V    +    ++ LP   L  D   A C+      
Sbjct: 125 MLIGVPINLGDYRDGEGMWQEHEELVVRPLMECGLVDCLPTQQLTSDILHAHCIRIDSNS 184

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTMYTN 289
            EV   D G   +G++     + H+C+PN   +   Q  + ++          I   YTN
Sbjct: 185 FEVTAKD-GDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNSYTN 243

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
            L GT  R+  L +++   C+C RC DPTE+GT+ S++KC    K+ G   +  ++    
Sbjct: 244 PLMGTSQRQHQLRLSRRLECTCTRCLDPTEMGTHMSSLKC----KECGGFSVCEIDSNGK 299

Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
             DW C  C A L A +VH + +++G     LV    +++  E +LT+   + HP+H+  
Sbjct: 300 LGDWRCPDCKALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNHFML 356

Query: 410 YAVKHSLIQ-LYGTQPGYAYTQLSSSLLERKI 440
             +K ++   L       A  Q    LL R++
Sbjct: 357 LDIKQNIASILRAAALMTAMDQPCKKLLARRV 388


>gi|195056335|ref|XP_001995066.1| GH22836 [Drosophila grimshawi]
 gi|193899272|gb|EDV98138.1| GH22836 [Drosophila grimshawi]
          Length = 489

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/384 (26%), Positives = 167/384 (43%), Gaps = 61/384 (15%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A+L +  GD +  E PL+V P     +  C  C +           C +C  +P
Sbjct: 16  LGRHLVASLSIEAGDTLLEERPLLVTPHWECGQLKCAQCLQE------SYVMCRKCQVYP 69

Query: 123 ACSARCSGLSDAHTH-APECAILKLGC-----ETLLAYN----------DYKYEAILPLR 166
            C        D   H A EC     G      + LL  N              +  L  +
Sbjct: 70  LCM-------DCSEHDAFECEFFASGAGCRIGKDLLVKNYSICGLLKLLLLLEQPELRAQ 122

Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVK-YLRSNFLEKLPGDVL--DD 223
           C +L  R P K  + ++ EA M +        E  + +V+  + S   E LPG  L  D 
Sbjct: 123 CQLLVSR-PIKLDDYRN-EAGMWQ--------EHEEHVVRPLMASGLAEALPGQQLSSDV 172

Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
             A C+       EV G D G   +G+Y     + H+C+PN   +   Q  + ++    +
Sbjct: 173 LHAHCIRIDSNSFEVTGRD-GEQLKGVYVCGAGLPHHCVPNTVVALDDQFNMKLYAAVPL 231

Query: 284 ST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
           +T       YTN L GT  R+  L ++++  C+C RC DPTELGT+ S+MKC    ++  
Sbjct: 232 ATGDIIYASYTNPLMGTSQRQHQLRLSRHMECACTRCLDPTELGTHMSSMKC----RECT 287

Query: 338 DCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
           D + +P   L    DW C    C   + A DVH + +++G     +V+    + + E +L
Sbjct: 288 DGYAVP--DLSGTGDWHCLGVDCEGVIPAADVHELLAEVG---GAMVEAKGELNAYEALL 342

Query: 396 TKLEAMFHPHHYHCYAVKHSLIQL 419
            + + + HP+H+    +K ++  +
Sbjct: 343 AQHKPLLHPNHFLLLDIKQNIASI 366


>gi|125809191|ref|XP_001361025.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
 gi|54636198|gb|EAL25601.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
          Length = 501

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 104/396 (26%), Positives = 172/396 (43%), Gaps = 74/396 (18%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+K +  LGR++ A  DL  G+ +  E P++V P +                    + RC
Sbjct: 13  ELKQNETLGRFVVARCDLSAGETLLLEQPIVVLPHIG-------------------DRRC 53

Query: 117 PRCFWPACS--ARCSGLS---DAHTH-APECAILK---LGCETLLAYNDYKYEAILP-LR 166
            +CF    +   +C  LS   D   H   +C  L+   L  E +    +     + P L 
Sbjct: 54  SKCFKLTATFCGKCRLLSLCEDCAGHDERDCQRLRELQLSDEQVQILQEKANTEVQPALA 113

Query: 167 CLIL--QRRSPKKYQELKDMEAHMSKRGPGTEVYE----------EIDSIVKYLRSNFLE 214
           CL+L   + S   ++E+  ME H+  R  GTE++           E   +VK L S    
Sbjct: 114 CLLLGEHQHSRPLFEEMSQMETHLEAR-RGTEIWNRYQAHAYAPLEATGVVKQLHSA--- 169

Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSN 269
              G+ +D+     +  + G+++VN  +I     G   +GLY    L  H+C+PN + + 
Sbjct: 170 --SGNAVDEI---LVQRLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAI 224

Query: 270 MMQSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
             + ++ VF +  I+        YT+ L G + RR  L   K F+CSC RC DPTELGT+
Sbjct: 225 DDERRIKVFANRQITAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTH 284

Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV---HLVTSQLGEQVDK 380
            S+  C       G  +I  V  L+  + W C      LN         V++ L    ++
Sbjct: 285 MSSFICSGCACSGG--YI--VRQLETGT-WQC-----LLNPEHTLKPEFVSNMLERAKEE 334

Query: 381 LVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
           +     ++  LE +L K+  + H +HY    +K ++
Sbjct: 335 IFHARGDIYRLELLLVKMSRLLHGNHYLMLDLKQNI 370


>gi|312380816|gb|EFR26711.1| hypothetical protein AND_07035 [Anopheles darlingi]
          Length = 368

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 91/183 (49%), Gaps = 35/183 (19%)

Query: 83  ESPLIVGPKLALAE-------PICLGCHKPLNPNLAD----NARCPRCFWPACSARCSGL 131
           E+PL++GPK  L E         C+GC        AD      RC  C WPAC   C GL
Sbjct: 15  ENPLVIGPKWNLDEYEQRSTIVPCVGC-------FADCHLGQFRCEHCHWPACKPDCPGL 67

Query: 132 SDAHTHAPECAILKLGC--------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
           ++A+ H  EC IL+ G         ET   Y  Y+Y+A+L L+CL +Q RSP ++ ++ +
Sbjct: 68  TNANLHGIECGILRFGRAPKPTDDPETFFDY--YRYDALLVLKCLAIQIRSPTQFDQMMN 125

Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE-------KLPGDVLDDTSAKCLHWICGVI 236
           +E+H  +R       +  D +V YL  NFL+       K    VL     K LH I G++
Sbjct: 126 LESHYEERKKTHYYRDTDDRVVSYLLRNFLDPLQKLEKKQDKQVLKVCDRKALHKIAGIL 185

Query: 237 EVN 239
           EVN
Sbjct: 186 EVN 188



 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 53/76 (69%)

Query: 366 DVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
           +V+ + S++GE+VD+L+    +VK  E+++ KL+   HP+H+H   ++HSLIQ+YG   G
Sbjct: 186 EVNFLLSKIGEEVDELMGRKSSVKEFEDLIQKLQNFLHPNHFHLLTLQHSLIQMYGHFNG 245

Query: 426 YAYTQLSSSLLERKIS 441
           Y    LS +LL+RK+S
Sbjct: 246 YQLQVLSDALLQRKLS 261



 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 33/69 (47%), Gaps = 8/69 (11%)

Query: 472 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGC--------ETLLAYNDYKSLSRAYLD 523
           RC  C WPAC   C GL++A+ H  EC IL+ G         ET   Y  Y +L      
Sbjct: 51  RCEHCHWPACKPDCPGLTNANLHGIECGILRFGRAPKPTDDPETFFDYYRYDALLVLKCL 110

Query: 524 HQQVRSWTQ 532
             Q+RS TQ
Sbjct: 111 AIQIRSPTQ 119


>gi|195124463|ref|XP_002006712.1| GI21214 [Drosophila mojavensis]
 gi|193911780|gb|EDW10647.1| GI21214 [Drosophila mojavensis]
          Length = 489

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 166/382 (43%), Gaps = 57/382 (14%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A+L + PG+ +  E PL+V P        C  C +           C RC  +P
Sbjct: 16  LGRHLVASLSIEPGETLLEEQPLLVAPHWECDRLKCAQCLQE------SYVICRRCQVYP 69

Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYN----DYKYEAILPL-----------RC 167
            C   CS    AH  A EC     G  + +  +    +Y   A+L L           +C
Sbjct: 70  LC-MDCS----AHD-AFECEFFASGAGSGIGKDLLIKNYGICALLKLLLLLEQPDQRAQC 123

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL--DDTS 225
            IL  R  K       +E + ++ G   E  E++  +   + S   + LP   L  D   
Sbjct: 124 QILLERPVK-------LEDYRNEDGVWQEHEEQV--VRPIMESGLAQVLPDQQLTSDALH 174

Query: 226 AKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------R 279
           A C+       EV   D G   +G+Y     + H+C+PN   +   Q  L ++       
Sbjct: 175 AHCIRIDTSAYEVTARD-GDTLKGVYVRGANLPHSCVPNTVVALDEQFNLKLYAAVPLES 233

Query: 280 DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC 339
              I T Y N L GT  R+  L +T++  C+C RC DPTELGT+ S+MKC    +D  D 
Sbjct: 234 GQIIFTSYANPLMGTSQRQHQLRMTRHMECACVRCLDPTELGTHMSSMKC----RDCSDG 289

Query: 340 WILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
           +   V  L    +W C   SC A + A DVH + +++G     LV     + + E +L +
Sbjct: 290 Y--AVCDLGGTGNWRCLDVSCGAVIPAADVHELLAEVG---GALVDAKGELSAYESLLAQ 344

Query: 398 LEAMFHPHHYHCYAVKHSLIQL 419
            + + HP+H+    +K ++  +
Sbjct: 345 NKQLLHPNHFLLLDIKQNIASI 366


>gi|195441983|ref|XP_002068740.1| GK17864 [Drosophila willistoni]
 gi|194164825|gb|EDW79726.1| GK17864 [Drosophila willistoni]
          Length = 504

 Score =  101 bits (252), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 105/387 (27%), Positives = 168/387 (43%), Gaps = 53/387 (13%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           ++K +  LGR++ A  ++  G+ +  E P+++ P L   E  C  C      NL  N  C
Sbjct: 13  QVKHNDTLGRFVVAVDNIRAGETLLVEEPIVILPHLG--ERRCARC-----LNLTTNF-C 64

Query: 117 PRC-FWPACSA-------RCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RC 167
            +C   P C          C  L+D          L+L  E +    D +   +  + +C
Sbjct: 65  SKCQLLPLCEECKDHEERDCQRLAD----------LQLSEEQVQQLQDSQRTEVQSMVKC 114

Query: 168 LILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEIDSIVK--YLRSNFLEKLPGDVLDD 223
           L+L+  +  +  Y+E+  M+A +  R  GTEV+    + V      S  L KL      +
Sbjct: 115 LLLREHTETRSLYEEMYQMDAQLDAR-RGTEVWNNYQANVANPLESSGILGKLWQGSDLE 173

Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNA-----KHSNMMQS 273
              + +  +  + ++N  +I     G   +G+Y    L  H+C+PN        SN  Q 
Sbjct: 174 KDVELVQRLLAIFDINAFEIRAPESGGSMRGIYRRASLFPHSCMPNLVSAIDDASNGRQL 233

Query: 274 KLFVFRDTHISTM----YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           K++  R      +    YTN L GTQ RRE L   K FNC+C RC DPTELGT+ S+  C
Sbjct: 234 KMYASRFIAAGEILYNCYTNVLLGTQERREILKQGKCFNCNCPRCVDPTELGTHMSSFMC 293

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
            N     G    + +NP  N   W C         R+   V + L    +++     N+ 
Sbjct: 294 NNCSSTGG---YIVLNPETN--QWQCLLNPEHTLKRE--FVANMLERAKEEVFHARDNIY 346

Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSL 416
            +E +L KL  + H +HY    +K ++
Sbjct: 347 RMELLLAKLSRLLHGNHYVMLDLKQNI 373


>gi|194882335|ref|XP_001975267.1| GG20655 [Drosophila erecta]
 gi|190658454|gb|EDV55667.1| GG20655 [Drosophila erecta]
          Length = 498

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 81/297 (27%), Positives = 140/297 (47%), Gaps = 47/297 (15%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EIK +  LGR++ A  +L  G+ +  E+P++V P++   +  C  C K         + C
Sbjct: 13  EIKQNDTLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCSKCFK------LTQSFC 64

Query: 117 PRCFWPACSARCSG--------LSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRC 167
            +C   A    CS         L++ H  A +  +L+            ++  I P L+C
Sbjct: 65  RKCHLLALCEDCSNHDERDCRRLAEMHFSADQVELLQ----------RKEHTEIQPVLKC 114

Query: 168 LILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDD 223
           L+L+        Y+E+  ME+ ++ R  GT++++     +         L++L G   +D
Sbjct: 115 LLLREHEETLPLYEEMSQMESQLATR-RGTDIWKNYQEHAFAPLDAGGVLKQLRGSADED 173

Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
                +  + G++++N  +I     G   +GLY    L  H+C+PN   S   + ++ V+
Sbjct: 174 L----VQGLLGILDINAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDEKRIKVY 229

Query: 279 RDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +  I+        YTN L GT+ RR+ L   K F+C+C RC DPTELGT+ S+  C
Sbjct: 230 ANRFIAAGEILYNCYTNVLLGTEERRQILKEGKCFDCTCARCQDPTELGTHMSSFMC 286


>gi|195154477|ref|XP_002018148.1| GL17552 [Drosophila persimilis]
 gi|194113944|gb|EDW35987.1| GL17552 [Drosophila persimilis]
          Length = 501

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 103/396 (26%), Positives = 172/396 (43%), Gaps = 74/396 (18%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+K +  LGR++ A  DL  G+ +  E  ++V P +                    + RC
Sbjct: 13  ELKQNETLGRFVVARCDLSAGETLLLEQSIVVLPHIG-------------------DRRC 53

Query: 117 PRCFWPACS--ARCSGLS---DAHTH-APECAILK---LGCETLLAYNDYKYEAILP-LR 166
            +CF    +   +C  LS   D   H   +C  L+   L  E +    +     + P L 
Sbjct: 54  SKCFKLTATFCGKCRLLSLCEDCAGHDERDCQRLRELQLSDEQVQILQEKANTEVQPALA 113

Query: 167 CLIL--QRRSPKKYQELKDMEAHMSKRGPGTEVYE----------EIDSIVKYLRSNFLE 214
           CL+L   + S   ++E+  ME H+  R  GTE+++          E   +VK L S    
Sbjct: 114 CLLLGEHQHSRPLFEEMSQMETHLEAR-RGTEIWKRYQAHAYAPLEATGVVKQLHSA--- 169

Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSN 269
              G+ +D+     +  + G+++VN  +I     G   +GLY    L  H+C+PN + + 
Sbjct: 170 --SGNAVDEI---LVQRLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAI 224

Query: 270 MMQSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
             + ++ VF +  I+        YT+ L G + RR  L   K F+CSC RC DPTELGT+
Sbjct: 225 DDERRIKVFANRQIAAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTH 284

Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV---HLVTSQLGEQVDK 380
            S+  C       G  +I  V  L+  + W C      LN         V++ L    ++
Sbjct: 285 MSSFICSGCACSGG--YI--VRQLETGT-WQC-----LLNPEHTLKPEFVSNMLERAKEE 334

Query: 381 LVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
           +     ++  LE +L K+  + H +HY    +K ++
Sbjct: 335 IFHARGDIYRLELLLVKMSRLLHGNHYLMLDLKQNI 370


>gi|195584110|ref|XP_002081858.1| GD11242 [Drosophila simulans]
 gi|194193867|gb|EDX07443.1| GD11242 [Drosophila simulans]
          Length = 498

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 81/289 (28%), Positives = 138/289 (47%), Gaps = 31/289 (10%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EIK + +LGR++ A  +L  G+ +  E+P++V P++   +  C  C      NL ++  C
Sbjct: 13  EIKHNDMLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCTKCF-----NLTESF-C 64

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
            +C   A    CS   +        A + L  + +      ++  I P L+CL+L+    
Sbjct: 65  RKCRLLALCEDCSDHDE--RDCKRLAEMNLSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122

Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
               Y+E+  ME+ +  R  GT++++     +         L +L G   +D     +  
Sbjct: 123 TLPLYEEMSQMESQLMTR-RGTDIWKNYQEHAFTPLNSGGVLAQLRGAADEDL----VQG 177

Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
           + G++++N  +I     G   +GLY    L  H+C+PN   S   + ++ V+ +  I+  
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYANRFIAAG 237

Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                 YTN L GT+ RR+ L   K F+CSC RC DPTELGT+ S+  C
Sbjct: 238 EILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286


>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
          Length = 462

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + I SS + GR + AT D+  G+ I +E  L+ GP     +   C+ CH+ L P      
Sbjct: 57  WTISSSTVAGRGVFATRDISAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           R  RC  P C       SD+  H  EC   +      +   D + E + P+   IL    
Sbjct: 116 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 167

Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                K+ + L D     ++R    E+ +       +              D      L 
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216

Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
            I GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R    
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+T YT  LWG   R   L +TK+F C C RC D TE GTY SA+ C    ++Q
Sbjct: 276 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 331

Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
           G    ++PV       DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354


>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
 gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
 gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
          Length = 462

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + I SS + GR + AT D+  G+ I +E  L+ GP     +   C+ CH+ L P      
Sbjct: 57  WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           R  RC  P C       SD+  H  EC   +      +   D + E + P+   IL    
Sbjct: 116 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 167

Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                K+ + L D     ++R    E+ +       +              D      L 
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216

Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
            I GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R    
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+T YT  LWG   R   L +TK+F C C RC D TE GTY SA+ C    ++Q
Sbjct: 276 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 331

Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
           G    ++PV       DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354


>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
          Length = 495

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 132/322 (40%), Gaps = 43/322 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + I SS + GR + AT D+  G+ I +E  L+ GP     +   C+ CH+ L P      
Sbjct: 90  WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 148

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           R  RC  P C       SD+  H  EC   +      +   D + E + P+   IL    
Sbjct: 149 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 200

Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                K+ + L D     ++R    E+ +       +              D      L 
Sbjct: 201 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 249

Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT 281
            I GV+  N  +      G  T  +GL+ +  +M H C PNA H   N   + +   RD 
Sbjct: 250 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHYFENGRLAVVRAARDI 309

Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
                I+T YT  LWG   R   L +TK+F C C RC D TE GTY SA+ C    ++QG
Sbjct: 310 PKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQG 365

Query: 338 -DCWILPVNPLDNDSDWTCGSC 358
               ++PV       DW C +C
Sbjct: 366 CRGLVIPVQTRTLQPDWRCITC 387


>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
          Length = 991

 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + I SS + GR + AT D+  G+ I +E  L+ GP     +   C+ CH+ L P      
Sbjct: 586 WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 644

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           R  RC  P C       SD+  H  EC   +      +   D + E + P+   IL    
Sbjct: 645 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 696

Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                K+ + L D     ++R    E+ +       +              D      L 
Sbjct: 697 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 745

Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
            I GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R    
Sbjct: 746 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 804

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+T YT  LWG   R   L +TK+F C C RC D TE GTY SA+ C    ++Q
Sbjct: 805 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 860

Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
           G    ++PV       DW C +C
Sbjct: 861 GCRGLVIPVQTRTLQPDWRCITC 883



 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 80/321 (24%), Positives = 124/321 (38%), Gaps = 58/321 (18%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL--ALAEPICLGCHKPLNPNLADN 113
           + +  SP+ GR + AT ++  G+ + RE  L+VGP    ++    C  C++ +  +    
Sbjct: 44  WRVADSPISGRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSA 103

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
           A CP  C  P CS       D+  H  EC + +        +   + + I P        
Sbjct: 104 ALCPAGCGLPVCSE----CRDSTRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           +RC  L   S K    L  M+A+M  R    EV    D    + R         D+LD  
Sbjct: 153 VRCFFLDEASRKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201

Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
                 +     E    V+G ++    + L+ +  L+ H C PNA H       + V   
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259

Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----------- 323
                   I+  Y   LW T  R+  L +TK+F C C RC DPT + +            
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPTFMASITNRKQTSVKGA 319

Query: 324 --FSAMKCLNEHKDQGDCWIL 342
             FS  +   E  D G C ++
Sbjct: 320 GGFSHSEVFGEQIDSGKCSLM 340


>gi|195335121|ref|XP_002034224.1| GM21751 [Drosophila sechellia]
 gi|194126194|gb|EDW48237.1| GM21751 [Drosophila sechellia]
          Length = 498

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 81/289 (28%), Positives = 137/289 (47%), Gaps = 31/289 (10%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EIK + +LGR++ A  +L  G+ +  E+P++V P++   +  C  C      NL ++  C
Sbjct: 13  EIKHNDMLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCSKCF-----NLTESF-C 64

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
            +C   A    CS            A +    + +      ++  I P L+CL+L+    
Sbjct: 65  RKCRLLALCEDCS--YHDERDCKRLAEMNFSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122

Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
               Y+E+  ME+ +  R  GT++++     +         L +L G   +D     +  
Sbjct: 123 TLPLYEEMSQMESQLMTR-RGTDIWKNYQEHAFTPLNSGGVLAQLRGAADEDL----VQG 177

Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
           + G++++N  +I     G   +GLYS   L  H+C+PN   S   + ++ V+ +  I+  
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYSRAGLFAHSCMPNLVISIDDERRIKVYANRFIAAG 237

Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                 YTN L GT+ RR+ L   K F+CSC RC DPTELGT+ S+  C
Sbjct: 238 EILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286


>gi|321474757|gb|EFX85721.1| hypothetical protein DAPPUDRAFT_98275 [Daphnia pulex]
          Length = 222

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 4/116 (3%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVCR  A   C  C E++YC K+HQK+HW  HK +CK  PY+I      GR + A+ ++
Sbjct: 16  CAVCRSKASQICGGCGEISYCSKEHQKQHWATHKSQCK--PYKIVFDQKYGRIMVASKNI 73

Query: 75  HPGDRIARESPLIVGPKLALAEPICLGCHKPL-NPNLADNARCPRCFWPACSARCS 129
            PG+ I RE  +  GPK     P CL C+  L N + A   RCP C +P C  +C+
Sbjct: 74  KPGEIIFREKAIFTGPKQGCL-PCCLACYTSLENVDEASLFRCPGCNFPFCQEKCA 128


>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
 gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
          Length = 462

 Score = 94.7 bits (234), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/323 (28%), Positives = 130/323 (40%), Gaps = 45/323 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
           + I SS + GR + AT D+  G+ I +E  L+ GP     +   C+ CH+ L P      
Sbjct: 57  WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
           R  RC  P C       +D+  H  EC   +      +   D + E + PL   IL    
Sbjct: 116 R-HRCTLPVCET----CADSEEHQAECEHFRRWQPKDV---DAEQEQVNPLSLRILTAVR 167

Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                K+ + L D     ++R    E+ +       +              D      L 
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216

Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
            I GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R    
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275

Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                 I+T YT  LW    R   L +TK+F C C RC D TE GTY SA+ C    ++Q
Sbjct: 276 IPKGGEITTTYTKILWSNLTRNIFLKMTKHFACDCARCHDNTENGTYLSALFC----REQ 331

Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
           G    ++PV       DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354


>gi|24654327|ref|NP_611182.1| CG9642 [Drosophila melanogaster]
 gi|7302839|gb|AAF57913.1| CG9642 [Drosophila melanogaster]
          Length = 498

 Score = 94.0 bits (232), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/289 (28%), Positives = 134/289 (46%), Gaps = 31/289 (10%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EIK +  LGR+  A  ++  G+ +  E+P++V P   + E  C  C      NL ++  C
Sbjct: 13  EIKQNDTLGRFAVALCNVRAGETLLLENPIVVLP--LMGERRCSKCF-----NLTESF-C 64

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
            +C   A    CS   +        A +    + +      ++  I P L+CL+L+    
Sbjct: 65  RKCRLLALCEDCSDHDE--RDCKRLAEMNFSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122

Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
               Y+E+  M++ +  R  GTEV++     +         L +L G   +D     +  
Sbjct: 123 TLPLYEEMSQMDSQLMTR-RGTEVWKNYQEHAFTPLDYGGVLAQLRGAADEDL----VQG 177

Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
           + G++++N  +I     G   +GLY    L  H+C PN   S   + ++ V+ +  I+  
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCTPNLVISIDDEQRIKVYANRFIAAG 237

Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                 YTN L GT+ RR+ L + K F+CSC RC DPTELGT+ S+  C
Sbjct: 238 EILYNCYTNVLLGTEERRKILKVGKCFDCSCPRCQDPTELGTHMSSFIC 286


>gi|195488849|ref|XP_002092487.1| GE11640 [Drosophila yakuba]
 gi|194178588|gb|EDW92199.1| GE11640 [Drosophila yakuba]
          Length = 497

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 137/296 (46%), Gaps = 45/296 (15%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EIK +  LGR++ A  +L  G+ +  E+P++V P++   +  C  C K         + C
Sbjct: 13  EIKQNDTLGRFVVALCNLRAGETLLLENPIVVLPQIG--DRRCSKCFK------LTQSFC 64

Query: 117 PRCFWPACSARCSG--------LSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            +C   A    CS         L++ +    +  +L+    T         E    L+CL
Sbjct: 65  RKCRLLALCEDCSDHDGRDCRRLAEMNLSNDQVELLQRKEHT---------EIQSALKCL 115

Query: 169 ILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDT 224
           +L+        Y+E+  ME+ ++ R  GT++++     +         L++L G   +D 
Sbjct: 116 LLREHEETLPLYEEMSQMESQLASR-RGTDIWKSYQEHAFAPLDSGGVLKQLRGSADEDL 174

Query: 225 SAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
               +  + G+++VN  +I     G   +GLY    L  H+C+PN   S   + ++ V+ 
Sbjct: 175 ----VQGLLGILDVNAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYA 230

Query: 280 DTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           +  I+        YTN L GT+ R++ L   K F+C+C RC DPTELGT+ S+  C
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERQQILKEGKCFDCTCPRCKDPTELGTHMSSFMC 286


>gi|194756634|ref|XP_001960581.1| GF11433 [Drosophila ananassae]
 gi|190621879|gb|EDV37403.1| GF11433 [Drosophila ananassae]
          Length = 497

 Score = 93.2 bits (230), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 83/300 (27%), Positives = 137/300 (45%), Gaps = 49/300 (16%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK---------LALAEPICLGCHKPLN 107
           ++K +  LGR++ A  ++  G+ +  E P++V P            L E  C  CH    
Sbjct: 13  QVKQNETLGRFVIALGNIRAGETLLLEEPILVLPHAGDRRCSKCFKLTEEFCRKCHLL-- 70

Query: 108 PNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKY-EAILPLR 166
                 A C  C        C  LSD          LKL  E +L   + K+ E    L+
Sbjct: 71  ------ALCGDCSEHD-ERDCKRLSD----------LKLSEEQVLLLQEKKHSEVQSALK 113

Query: 167 CLILQRRSPKK--YQELKDMEAHMSKRGPGTEVYE--EIDSIVKYLRSNFLEKLPGDVLD 222
           C++L+     +  Y+E+  ME  ++ R  GT++++  +  +     +S  L++L  +  +
Sbjct: 114 CILLREHLETQPLYEEISQMENQLTAR-RGTDIWKNNQEHAYNPLEKSGVLKQLNSEANE 172

Query: 223 DTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
           D     +  + G+++VN  +I     G   +GLY    L  H+C+PN   +   + ++ V
Sbjct: 173 DL----VQGLLGMLDVNAFEIRAPESGGSMRGLYRRAGLYPHSCMPNLVTAIDDERRIKV 228

Query: 278 FRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
           + +  I+        YTN L GT  RR  L   K F+C+C RC DPTELGT+ S+  C N
Sbjct: 229 YANRFIAAGEILYNCYTNILLGTDERRHILKAGKCFDCTCSRCQDPTELGTHMSSFVCSN 288


>gi|270012082|gb|EFA08530.1| hypothetical protein TcasGA2_TC006183 [Tribolium castaneum]
          Length = 331

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)

Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
           +D      +H +CG++++N +++    G  T+ LY  +  + HNC+ N   +        
Sbjct: 1   MDQFCDDLIHTVCGILDLNAIEVRAPSGYLTRCLYPKLSQISHNCVTNLFQTVTPEDFKI 60

Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +++ + +  +  +   Y   LW T  RR+ L   K  +C C+RC D TEL T+ S +KC
Sbjct: 61  TLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 120

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
                D G   +L  + L +  DW C  C  + NA  V  V   +  +++  ++V     
Sbjct: 121 --SKCDNG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 176

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++  E +  K  ++FHP + +   ++  L QLYG  PGY   +L   L E K+
Sbjct: 177 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 229


>gi|189239865|ref|XP_966657.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 356

 Score = 92.8 bits (229), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)

Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
           +D      +H +CG++++N +++    G  T+ LY  +  + HNC+ N   +        
Sbjct: 26  MDQFCDDLIHTVCGILDLNAIEVRAPSGYLTRCLYPKLSQISHNCVTNLFQTVTPEDFKI 85

Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +++ + +  +  +   Y   LW T  RR+ L   K  +C C+RC D TEL T+ S +KC
Sbjct: 86  TLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 145

Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
                D G   +L  + L +  DW C  C  + NA  V  V   +  +++  ++V     
Sbjct: 146 --SKCDNG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 201

Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
           ++  E +  K  ++FHP + +   ++  L QLYG  PGY   +L   L E K+
Sbjct: 202 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 254


>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
 gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
 gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
          Length = 448

 Score = 92.4 bits (228), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 134/325 (41%), Gaps = 50/325 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL--ALAEPICLGCHKPLNPNLADN 113
           + +  SP+ GR + AT ++  G+ + RE  L+VGP    ++    C  C++ +  +    
Sbjct: 44  WRVADSPISGRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSA 103

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
           A CP  C  P CS       D+  H  EC + +        +   + + I P        
Sbjct: 104 ALCPAGCGLPVCSE----CRDSTRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           +RC  L   S K    L  M+A+M  R    EV    D    + R         D+LD  
Sbjct: 153 VRCFFLDEASRKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201

Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
                 +     E    V+G ++    + L+ +  L+ H C PNA H       + V   
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259

Query: 281 ------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
                   I+  Y   LW T  R+  L +TK+F C C RC DPTE GTY SA+ C    +
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPTENGTYLSALFC----R 315

Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
           +QG    ++PV       DW C +C
Sbjct: 316 EQGCRGLVIPVQTRTLQPDWRCITC 340


>gi|21430220|gb|AAM50788.1| LD23906p [Drosophila melanogaster]
          Length = 417

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 69/260 (26%), Positives = 119/260 (45%), Gaps = 24/260 (9%)

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKC 228
           R+    Q L D+  ++S    G  +++E + +V    + S   + LP   L  D   A C
Sbjct: 47  RTKGDCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQELTSDALHAHC 106

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ 282
           +       EV   D G   +G++     + H+C+PN   +   Q  + ++          
Sbjct: 107 IRIDSNSFEVTAKD-GDTLKGIFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDI 165

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           I   YTN L GT  R+  L +++   C C RC DPTE+GT+ S++KC        +C   
Sbjct: 166 IYNSYTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKC-------KECPGF 218

Query: 343 PVNPLDND---SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLE 399
            V  +D++    DW C  C A L A +VH + + +G     LV    +++  E +LT+  
Sbjct: 219 SVCEIDSNGKLGDWRCPDCRALLTAAEVHELQAAVGS---ALVDAMGDLQVYEALLTQYG 275

Query: 400 AMFHPHHYHCYAVKHSLIQL 419
            + HP+H+    +K ++  +
Sbjct: 276 PLLHPNHFMLLDIKQNIASI 295


>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
 gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
          Length = 766

 Score = 91.7 bits (226), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 136/333 (40%), Gaps = 58/333 (17%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + + +S + GR + AT D+  G+ I RE  L+VGP       +  C+ CH  L    A  
Sbjct: 354 WTLAASVVAGRGVFATRDIARGELIFRERALVVGPTARKGAQLNTCVCCHCLLP---AQG 410

Query: 114 ARCP-RCFWPACSARCSGLSDAHTHAPECAILKL---------GCETLLAYNDYKYEAIL 163
             C  RC  P C       SD+ TH  EC   +          G E   A      E I 
Sbjct: 411 FLCAHRCTLPVCGP----CSDSETHREECQHFQRWQPIDAVPEGAEPSGA------EQIN 460

Query: 164 PLRCLILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
           PL   IL         ++ + L D     ++RG      +EI    +  R NF       
Sbjct: 461 PLSLRILTAVRVFHLGQQQRSLVDAMQANAERG----YRQEIIKAAQCFR-NF------P 509

Query: 220 VLDDTSAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNM 270
             D      L  I GV+  N  +      G  T  +GL+ +  +M H C PNA H   N 
Sbjct: 510 TTDKPFMDQLFRIVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHYFDNG 569

Query: 271 MQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
             + +   RD      I+T YT  LW    R   L +TK F C C RC+D TE GTY SA
Sbjct: 570 RMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKNFMCDCPRCNDNTENGTYLSA 629

Query: 327 MKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSC 358
           + C    ++QG    ++PV       DW C SC
Sbjct: 630 LFC----REQGCKGLVIPVQTKTMQPDWRCLSC 658



 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 117/287 (40%), Gaps = 50/287 (17%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHK-PLNPNLAD 112
           +++  SP+ GR + A  D+  G+ + RE  L+VGP    +  +  C  C+  P   +   
Sbjct: 37  WQVADSPISGRGIFAIRDIEQGEELFREHTLLVGPTAHRSSNLRTCTICYNLPNGCDTDA 96

Query: 113 NARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP------- 164
            A CP  C  P C +          H  EC + +        +     E I P       
Sbjct: 97  QALCPSGCGLPVCPS----CRPTERHQLECKLFR-------KWKPKDTERIEPRALRILS 145

Query: 165 -LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP--GDVL 221
            +RC  L   + K+ Q L  M+A+ + R    EV    D           E  P   D+L
Sbjct: 146 VVRCFFL---TEKQRQLLYAMQAN-ADRYYMKEVQRAADC---------FEHFPHEQDML 192

Query: 222 DDTSAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMM 271
           D        +     E    V G +I    + L+ +  L+ H+C PNA H      + ++
Sbjct: 193 DYFYRTICAFNTNAFESRSRVGGNEI--LVRALFPLAGLLNHHCTPNAAHHFEDGETIVV 250

Query: 272 QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
            +   + + T I+  Y   LW T  R+  LA+TK+F C C RC DPT
Sbjct: 251 SATERIPKGTEITMTYAKLLWSTLARKLFLAMTKHFICQCPRCQDPT 297


>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
 gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
          Length = 1033

 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 97/196 (49%), Gaps = 15/196 (7%)

Query: 233 CGVIEVNGVDIGRYTQ---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHI 283
            G +++NG ++         ++      E++CLPN AK  N     ++ +   + ++ H+
Sbjct: 730 VGALQINGHEVPTTDPPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHL 789

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
           S  Y++A+WGT  R+ HL  TK F C+CERC D TEL T +SA+KC +    Q    +LP
Sbjct: 790 SICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLP 846

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
               D +  W C  C  ++    V  +  + G+ +  + +   N ++  + L   E    
Sbjct: 847 TKADDWNGSWRCRECHKQVQKHYVERILERAGKDIQSMEK---NAENGLKYLKHYEKWLP 903

Query: 404 PHHYHCYAVKHSLIQL 419
           P H+H   +K  L+QL
Sbjct: 904 PQHFHMSEIKILLVQL 919


>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
 gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
          Length = 495

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 77/292 (26%), Positives = 139/292 (47%), Gaps = 37/292 (12%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+K +  LGRY+ A  ++  G+ +  E P+++       +  C  CH+        ++ C
Sbjct: 13  EVKQNETLGRYVVAATNIKAGETVLLEQPVLLLANNG--DRRCCNCHQ------LTSSFC 64

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK--YEAILPLRCLILQRR 173
            +C   P C A C        H P+C  L             +   E    L+ L+L+  
Sbjct: 65  GKCRLMPLC-ADC-----VEHHGPDCRRLAELQLQEQQVEQLQAHTEVCSALKYLLLREH 118

Query: 174 SPKK--YQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSN--FLEKLPGDVLDDTSAKC 228
           + K+  Y++L  +EAH+++R   T+++    + +++ L+++   L+   G  +D+   + 
Sbjct: 119 AEKRSHYEQLLQLEAHLARR-RDTDIWRSYRAEVIEPLQASGLLLQLRNGAEIDE---EL 174

Query: 229 LHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
           L  +  + ++NG +I     G   +G+Y    L  H+CLPN   +   Q ++ ++ +  I
Sbjct: 175 LQRLLAIADINGFEIRAPESGGAMRGVYMTAALFAHSCLPNLVTAVDEQRRIKLYANRII 234

Query: 284 STM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           +        YT+ L GT  RR  L   K F+C C RC+DPTELGT+ S+  C
Sbjct: 235 AAGEILYNCYTDILLGTDERRRILKTGKCFDCQCARCTDPTELGTHMSSFVC 286


>gi|195381153|ref|XP_002049319.1| GJ20817 [Drosophila virilis]
 gi|194144116|gb|EDW60512.1| GJ20817 [Drosophila virilis]
          Length = 489

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 163/376 (43%), Gaps = 45/376 (11%)

Query: 64  LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
           LGR+L A++ +  G+ +  E PL+V P     +  C  C +           C +C  +P
Sbjct: 16  LGRHLVASMSIEAGETLLDERPLLVAPHWECGQLKCAQCLQE------SYVMCRKCQVYP 69

Query: 123 ACSARCSGLSDAHTH-APECAILKLGCETLLAYN----DYKYEAILPLRCLILQRRSPKK 177
            C        D   H A EC     G  + ++ +    +Y   A+L L  L+ Q      
Sbjct: 70  LCM-------DCSEHDAFECEFFASGAGSRISKDLLVKNYGICALLKLLLLLEQPDQRAL 122

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
            Q L D    +        +++E D  V    + S  ++  P   L   +A+ LH  C  
Sbjct: 123 CQTLLDTPVKLEDYRNEDGLWQERDEQVVRPLMSSGLVQAFPDQQL---TAEVLHAHCIR 179

Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTHIST 285
           I+ +  ++    G   +G+Y     + H+C+PN      +  NM + + + +     I T
Sbjct: 180 IDNSAYEVTARDGDTLKGVYVRGASLPHSCVPNTVVALDEQFNMKLYAAVPLESGDIIYT 239

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            YTN L GT  R+  L ++++  C+C RC D TEL T+ S+MKC         C      
Sbjct: 240 SYTNPLMGTSQRQHQLRLSRHKECACTRCLDATELNTHMSSMKCRGCPDGYAVCQ----- 294

Query: 346 PLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
            L    DW C   SC A + A +VH + +++G     LV+    + + E +L + + + H
Sbjct: 295 -LSGSGDWRCLNASCGAVIPATEVHELLAEVG---SALVEARGELSAYETLLDQHKQLLH 350

Query: 404 PHHYHCYAVKHSLIQL 419
           P+H+    +K ++  +
Sbjct: 351 PNHFLLLDIKQNIASI 366


>gi|195056333|ref|XP_001995065.1| GH22837 [Drosophila grimshawi]
 gi|193899271|gb|EDV98137.1| GH22837 [Drosophila grimshawi]
          Length = 495

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 135/291 (46%), Gaps = 35/291 (12%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+K +  +GRY+ A+ ++  G+ +  E P+++       +  C  C+K        N+ C
Sbjct: 13  EVKENETMGRYVIASTNIREGETVLCEQPILLLSNNT--DRRCSSCYK------LTNSFC 64

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL--PLRCLILQ-- 171
            +C   P C   C      H  A +C  L             + +A +   L  L+L+  
Sbjct: 65  GKCRLLPLC-GEC-----VHHDARDCKRLAELQLEEEQVEKLQAQAEVNNALTYLVLREH 118

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSN-FLEKLPGDVLDDTSAKCL 229
           + +  +Y++L  ME+H++ R   TE++    + +V  L+S   L++L      +   + L
Sbjct: 119 KETQPQYEQLLQMESHLALR-RNTEIWRTYREQVVLPLQSTGLLQQLRNGA--EIDEELL 175

Query: 230 HWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS 284
             + G  +VNG +I     G   +GLY    L  H+C PN   +   + ++ V+ + +I 
Sbjct: 176 QHLLGTTDVNGFEIRAPESGGTLRGLYIRAALFPHSCTPNLVTAIDEERRIKVYANRYIP 235

Query: 285 TM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                   YTN L GT+ RR  L   K F+C C RC DPTELGT+ S+  C
Sbjct: 236 AGEILYNCYTNILLGTEERRLILKQGKCFDCQCARCQDPTELGTHMSSFVC 286


>gi|195124465|ref|XP_002006713.1| GI21216 [Drosophila mojavensis]
 gi|193911781|gb|EDW10648.1| GI21216 [Drosophila mojavensis]
          Length = 491

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 82/297 (27%), Positives = 138/297 (46%), Gaps = 47/297 (15%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+K +  +GRYL A +++  G+ +  E PL++    A  +  C  CH+        +  C
Sbjct: 9   ELKENETMGRYLVAAVNIKAGETLLLEQPLLLLANNA--DRRCCNCHQ------LTSTFC 60

Query: 117 PRC-FWPACSA-------RCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            +C   P C+         C  L+    H  +   L+   E     N  K+        L
Sbjct: 61  GKCRLMPLCAECVDHHGLDCLRLAKLQLHVEQVQQLQANAEI---ANVLKF--------L 109

Query: 169 ILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSN--FLEKLPGDVLDD 223
           +L+  +     Y+EL  MEAH+++R   TE++      +++ L+S    L+   G  +D+
Sbjct: 110 LLREHAEMHSWYEELLQMEAHLARR-RDTEIWRTYQKQVIEPLQSAGLLLQLRNGSEIDE 168

Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
                L  + G+ ++NG +I     G   +G+Y    L  H+C+PN   +   Q ++ ++
Sbjct: 169 ---DLLQRMLGIADINGFEIRAPESGSSMRGVYLRAGLFAHSCVPNVVAAIDEQHRIKLY 225

Query: 279 RDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            +  I+        YTN + GT  RR+ L I K F+C C RC DPTELGT+ S+  C
Sbjct: 226 ANRSIAAGEILYNCYTNIMLGTDERRQILKIGKCFDCQCVRCLDPTELGTHMSSFVC 282


>gi|321472908|gb|EFX83877.1| hypothetical protein DAPPUDRAFT_239494 [Daphnia pulex]
          Length = 291

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/202 (29%), Positives = 96/202 (47%), Gaps = 29/202 (14%)

Query: 260 NCLPNAKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
           N + +AK+   +++ + + R       Y  AL G + R+  L  +K F C C RCSDPTE
Sbjct: 2   NHVIDAKYQMTVRASVPIMRGD-----YALALEGLRERQSLLRQSKLFECDCSRCSDPTE 56

Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNPLD-NDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
             TY SA +C    +     ++LP+ PLD  +++W    CS +L A  V+ V  +L E+ 
Sbjct: 57  CSTYLSAQRC----QKCPTGFVLPIRPLDEKETEWKGNLCSHKLTAVVVNRVVDKLKEEF 112

Query: 379 DKL-----------------VQENPNVKS--LEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
           + +                 V   P + +   E  L +  ++ HP+H+   + + SL QL
Sbjct: 113 ETIGPNEVEKYFEILYLIIKVAIKPFLGTGRFEGFLKRHASLVHPNHFLFTSARQSLSQL 172

Query: 420 YGTQPGYAYTQLSSSLLERKIS 441
           YG    Y    L+   LERK++
Sbjct: 173 YGRDEKYLVNTLTMEQLERKVA 194


>gi|195454533|ref|XP_002074283.1| GK18438 [Drosophila willistoni]
 gi|194170368|gb|EDW85269.1| GK18438 [Drosophila willistoni]
          Length = 232

 Score = 85.9 bits (211), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 110/230 (47%), Gaps = 23/230 (10%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A  +      +  E P  VGPK +    +CLGC+ P   N      CP
Sbjct: 15  VEWSPIYGRYLVAQQNTSGHSLLIEELPFAVGPK-SQGGVVCLGCYSPELENC-----CP 68

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY----KYEAILPLRCLILQRR 173
           +C WP C   CS + D      EC I K          +     + + I+PLR L+ + +
Sbjct: 69  QCGWPLCE-ECSKIEDNVHKQMECRIFKEAKARFYRIANGGQCPQLDCIMPLRVLLAKEQ 127

Query: 174 SPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
            P++++ ++  ME H ++R   ++++  +  +I +YLR       P  +    S + +  
Sbjct: 128 QPERWEKDVAPMEHHEAERRENSDIWHADRVNIAQYLRG------PCKLASRFSEELIMQ 181

Query: 232 ICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
           + G++EVN  +    +G   + LY    ++ HNC+PN    ++ Q +L V
Sbjct: 182 VVGILEVNAFEARTTVGYPLRCLYPYTGILAHNCVPNTARIDISQRELQV 231


>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
 gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
          Length = 600

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 70/145 (48%), Gaps = 20/145 (13%)

Query: 229 LHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
           L  + GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R  
Sbjct: 353 LFRVVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 411

Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
                   I+T YT  LW    R   L +TKYF C+C+RC+D +E GTY SA+ C    +
Sbjct: 412 RDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLSALFC----R 467

Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
           +QG    ++PV       DW C SC
Sbjct: 468 EQGCKGLVIPVQTKTLQPDWRCLSC 492



 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 66/279 (23%), Positives = 114/279 (40%), Gaps = 32/279 (11%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + +  SP+ GR + A  D+  G+ + RE  L+VGP       +  C  C++   P   D+
Sbjct: 28  WRVADSPISGRGIFAVRDIAQGELLFRERTLLVGPTAHRGRNLRTCTQCYR--QPAGEDS 85

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RCLIL-- 170
                C        C G   +  HA EC + +       +  D +   IL + RC  L  
Sbjct: 86  DAAALCTAGCGLPVCGGCVASPRHAVECQLFRKWQPKDPSRIDPRALRILSVVRCFFLNE 145

Query: 171 -QRRSPKKYQELKDMEAHMSKRGPGTEVYEEI---DSIVKYLRSNFLEKLPGDVLDDTSA 226
            QR+     Q   D   +M +     E +E       +++Y    F   +     +   +
Sbjct: 146 TQRKLLYAMQANAD-RYYMREVERAAECFEHFPREQDMLEY----FYRTVCAFNTNAFES 200

Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRD 280
           +C        +V+G ++    + L+ +  ++ H C PNA H      + ++ +   +   
Sbjct: 201 RC--------QVDGREV--VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLG 250

Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
             I+  Y   LW T  R+  L +TK+F C C RC DPT+
Sbjct: 251 AEITMSYAKLLWSTLARKMFLGMTKHFMCQCPRCQDPTQ 289


>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
 gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
          Length = 302

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 69/145 (47%), Gaps = 20/145 (13%)

Query: 229 LHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
           L  I GV+  N  +      G  T  +GL+ +  +M H C PNA H      +L V R  
Sbjct: 58  LFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 116

Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
                   I+T YT  LWG   R   L +TK+F C C RC+D TE GTY SA+ C    +
Sbjct: 117 RDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCNDNTENGTYLSALFC----R 172

Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
           +QG    ++PV       DW C +C
Sbjct: 173 EQGCRGLVIPVQTRTLQPDWRCITC 197


>gi|307172339|gb|EFN63827.1| Protein msta, isoform A [Camponotus floridanus]
          Length = 352

 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/215 (24%), Positives = 95/215 (44%), Gaps = 22/215 (10%)

Query: 219 DVLDDTSAKCLHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMM 271
           D  +  SA+ +  +CG++++N  ++       G   +GLY    LM H+C  N   +   
Sbjct: 3   DKTEPPSAELIQKLCGILDINTFELRSPSILDGLLLRGLYIEASLMSHDCRGNTYLTMDD 62

Query: 272 QSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
             +L V+    I         YT++L GT  RR+HL   KY  C C  C DP EL ++ S
Sbjct: 63  NFQLTVYASVPIKQNEPILFNYTSSLLGTAERRQHLREGKYLECECSLCKDPYELQSHLS 122

Query: 326 AM---KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV 382
           ++   +C        D ++  + P +  S W C  C    + R +    +     +DK  
Sbjct: 123 SVLCPRCKEGFVGMQDTFV--IAPYERASRWQCQMCKKTYSGRLIRATLNICKTLIDKC- 179

Query: 383 QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLI 417
               +   ++++L +L    H +H+   ++K  L+
Sbjct: 180 ---EDFDGIDDLLKRLSRSLHTNHFLMLSLKQKLL 211


>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
 gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
          Length = 611

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 20/145 (13%)

Query: 229 LHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
           L  + GV+  N  +            +GL+ +  +M H C PNA H      +L V R  
Sbjct: 364 LFRVVGVLNTNAFEAPCRVDTHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 422

Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
                   I+T YT  LW    R   L +TKYF C+C+RC+D +E GTY +A+ C    +
Sbjct: 423 RAIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLAALFC----R 478

Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
           +QG    ++PV       DW C SC
Sbjct: 479 EQGCKGLVIPVQTKTLQPDWRCLSC 503



 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 114/276 (41%), Gaps = 27/276 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           Y +  SP+ GR + A  D+  G+ I RE  L+VGP       +  C  C+     +   +
Sbjct: 39  YRVADSPISGRGIFALRDIAKGELIFREHTLLVGPTAHRGRNLRTCTQCYGQPPGDSDAS 98

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL---RCLI 169
           A CP  C  P C       + +  H PEC + +            + +A+  L   RC  
Sbjct: 99  ALCPAGCGLPVCG----NCASSTRHTPECQLFRKWQPKEATKAGIEPQALRILSVVRCFF 154

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS-NFLEKLPGDVLDDTSAKC 228
           L    P++ + L  M+A+ + R    EV    +   ++ R    LE     V     A  
Sbjct: 155 LN--EPQR-KLLYAMQAN-ADRYYIREVERAAECFEQFPRDPELLEYFYRTV----CAFN 206

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTH 282
            +       ++G ++    + L+ +  L+ H C PNA H      + ++ +   +     
Sbjct: 207 TNAFESRFHIDGQEV--VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAE 264

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
           I+  Y   LW T  R+  L +TK+F C C RC DPT
Sbjct: 265 ITMSYAKLLWSTLARKMFLGMTKHFMCRCPRCQDPT 300


>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
          Length = 429

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 121/296 (40%), Gaps = 47/296 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           +EI  S + GR +  T  L  GD I R+ P+++G      + +  C  C K L P+    
Sbjct: 25  WEIAVSKIAGRGVFTTRKLKRGDIILRDIPILIGLAARQEDTLNSCSICFKVL-PDTKFM 83

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            R   C  P CS      +    H  +CA+ K   E +    D     I+ L C+    R
Sbjct: 84  CR-QGCALPVCSV----CAKKKQHKDDCALFK-SWEPIEP--DVANSVIIRLICIA---R 132

Query: 174 SPKKYQELKDM----EAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
           +    +E +D+    +A++      TEV            +    K P D       + +
Sbjct: 133 AINLSKEQRDLIYCLQANLDN-NHRTEVRNA---------AKCFNKFPTD---KKLIEIM 179

Query: 230 HWICGVIEVNGVDIGR---------YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
           +    V+  NG D            + + L+ +  L+ H+C+PN+ ++   ++   V R 
Sbjct: 180 NRTVAVLRTNGFDETADRTNDNQEFFYRALFPLFALVNHDCVPNSYYTFEEKTNYMVLRA 239

Query: 280 ------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                  T I+T YT    G   R   L + K F C C RCSDPTE G + SA+ C
Sbjct: 240 SVDLPEGTEITTTYTKLFTGNIARHLFLKMKKNFTCKCSRCSDPTEKGAFISAVYC 295


>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 606

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 99/417 (23%), Positives = 163/417 (39%), Gaps = 67/417 (16%)

Query: 53  KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
           +LP+ + SS  LGR+  A  D+ PGD +  E+P+    +    + +C  C++ L P LA 
Sbjct: 94  RLPWSLASSARLGRHALAKRDISPGDLVLIEAPVAAVVRSQFVKTVCHTCYREL-PALAP 152

Query: 113 NARCPRCFWPA-----CSARCSGLSD---AHTHAPECAILKLGCET---------LLAYN 155
               P     A     CS  C+ L+D   A T+     I K+  ET         +L  +
Sbjct: 153 PGSVPDRAAAAPNKQYCSRACA-LADTLAAVTNPIHAKIDKMAVETQVDPQLLRLILELD 211

Query: 156 DYKYEAILPLR-------CLILQRRSPKKYQ------ELKDMEAHMSKRGPGTEVYEE-I 201
             +  +  P +         +L+    K  Q       L+DME   S      E + + +
Sbjct: 212 SKRATSAPPGKPTPTPEGTWVLKFSRTKAGQLEAVHCTLEDMETLTSPWDKSPEAWRKSV 271

Query: 202 DSIVKYLRSNFLEK---LPGDV--LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICL 256
            +  + L+   + +    PG +  L D +A   +   G+   N  +      GLY  + +
Sbjct: 272 RAGCEALQQAIIAEGAYQPGSLQQLLDMAAVINNNAHGMGAANSTNTD-SALGLYPALSM 330

Query: 257 MEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSC 311
           + H+CLPN   A   + M  +    V     ++  Y N +   + R   L  TK+F C+C
Sbjct: 331 LNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVTYINIMEPRRIRARELMDTKHFACAC 390

Query: 312 ERCSDPTEL--GTYFSAMKCLNEHKDQGDC--WILP--VNPLDNDSDWTCGSCSARLNAR 365
           ERC  P E     +  A+KC       G C  W+L    + L     W C  C+  + +R
Sbjct: 391 ERCVSPLETHPDRFLEAVKC-----GAGGCDGWLLEQRADNLSEGDTWKCTVCATAVPSR 445

Query: 366 DVHLVTS--QLGEQVDKLVQENPNV----------KSLEEMLTKLEAMFHPHHYHCY 410
            V   T    + E       +  N+             E +L + E + HP H   +
Sbjct: 446 SVDGRTGPQDITESFRAAWHQAANILRYKGHAAGRPVFEAVLARAEGVLHPMHVSAF 502


>gi|194913085|ref|XP_001982623.1| GG12637 [Drosophila erecta]
 gi|190648299|gb|EDV45592.1| GG12637 [Drosophila erecta]
          Length = 309

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 114/284 (40%), Gaps = 45/284 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + +  SP+ GR + AT ++ PG+ + RE  L+VGP    +  +  C  C++ +  +    
Sbjct: 44  WRVADSPISGRGIFATREIAPGEELFREHTLLVGPTAHRSTNLRTCTLCYRLIPGSTDSA 103

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
           A CP  C  P CS       D+  H  EC + +        +   + + I P        
Sbjct: 104 ALCPAGCGLPVCSE----CRDSSRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           +RC  L   + K    L  M+A+M  R    EV    D    + R         D+LD  
Sbjct: 153 VRCFFLDEAARKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201

Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
                 +     E    V+G ++    + L+ +  L+ H C PNA H       + V   
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259

Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
                   I+  Y   LW T  R+  L +TK+F C C RC DPT
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPT 303


>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
           max]
          Length = 484

 Score = 71.6 bits (174), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 97/419 (23%), Positives = 167/419 (39%), Gaps = 68/419 (16%)

Query: 62  PLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADNARCPRC 119
           P  GR L AT D +PG+ I  + P +  P  +   P   C GC   +N N+   +RC RC
Sbjct: 19  PEKGRSLLATRDFYPGEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVL--SRCSRC 76

Query: 120 ---FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL-------- 168
              F+  C   C           E  + +L CE L + + YK +++ P   L        
Sbjct: 77  QLAFY--CGTAC--------QRSEWKLHRLECEVLSSLHKYKRKSLTPSIRLMLRLYLRR 126

Query: 169 ------ILQRRSPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVL 221
                 I+   +   Y  ++ + AHMS       V Y ++ ++V     N + + PG  +
Sbjct: 127 KLQNDKIIPSTAMDNYNLVEALVAHMSDITEEQLVLYAQMANLV-----NSILEWPGINI 181

Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLF 276
            + +     + C    +   ++     GLY VI ++ H+CLPN+       S ++++   
Sbjct: 182 KEIAENFSKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGSSALVRAVQH 241

Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCLN 331
           +   T +   Y      T  R++ L     F C+C RCS     D  +        KC +
Sbjct: 242 IPSGTEVLISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDDIQESAILEGYKCKS 301

Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDK----LVQE 384
           E      C    +   D    + C  C    +  ++  +T++   L E   K      QE
Sbjct: 302 E-----KCGGFLLRTTDGKG-FQCQGCGLIRDKEEIKRITTEIKLLSEDASKPSATYYQE 355

Query: 385 NPNV-KSLEEMLTKLEAMFHPHHYHCYAVKH----SLIQLYGTQPGYAYTQLSSSLLER 438
             ++ K +E++ T+L   FHP   +    +     SL++L       AY +L+    +R
Sbjct: 356 AISIYKRIEKLQTEL---FHPLSINLMHTREKILKSLMELEHWTEALAYCKLTIPFYQR 411


>gi|195477726|ref|XP_002100289.1| GE16967 [Drosophila yakuba]
 gi|194187813|gb|EDX01397.1| GE16967 [Drosophila yakuba]
          Length = 306

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 114/284 (40%), Gaps = 45/284 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + +  SP+ GR + AT ++ PG+ + RE  L+VGP    +  +  C  C++ +  +    
Sbjct: 45  WRVADSPISGRGIFATREIAPGEELFREHTLLVGPTAHRSTNLRTCTLCYRLIPGSTDSA 104

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
           A CP  C  P CS       D+  H  EC + +        +   + + I P        
Sbjct: 105 ALCPAGCGLPVCSE----CRDSSRHDLECKLFR-------KWKPLESQRIEPRALRILSV 153

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           +RC  L   + K    L  M+A+M  R    EV    D    + R         D+LD  
Sbjct: 154 VRCFFLDEAARKL---LYAMQANMD-RYYMKEVQRAADCFKHFPREQ-------DMLDYF 202

Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
                 +     E    V+G ++    + L+ +  L+ H C PNA H       + V   
Sbjct: 203 YRTICAFNTNAFEARSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 260

Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
                   I+  Y   LW T  R+  L +TK+F C C RC DPT
Sbjct: 261 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPT 304


>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
 gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
          Length = 496

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/390 (22%), Positives = 155/390 (39%), Gaps = 63/390 (16%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLN 107
           K K   +E+  S + GR L AT  +  G+ I ++ PL++       + +  C  C + L 
Sbjct: 19  KDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLREL- 77

Query: 108 PNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           P+     R   C  P CS           C  L     + PE A   I++L C    A N
Sbjct: 78  PDTRFMCRIG-CGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRLLC-VARALN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            +++E ++  V  LR+N  +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHRV-EVRNAAKCFKNFPTDKKIFEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
                 D+             E N        + LY +  +M H+C+PN+ ++   ++ +
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKTNI 234

Query: 276 FVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
            + R          I+T YT    G   R   L I + F C C RCSDPTE G Y S + 
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLY 294

Query: 329 CLNEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVT 371
           C + +       +  G      +C +         +    D+  G+ +A+ N+  +  + 
Sbjct: 295 CRDTNCSGLVVPETTGLPHPSYNCLVCKQKSTHAQMMKSQDFASGAINAKANSNSLRTLV 354

Query: 372 SQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
             L E+ D  +  N N   ++  L+ L+ +
Sbjct: 355 QYLNEKSDNFI-PNSNYVVIDAKLSVLQRL 383


>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
 gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
          Length = 498

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/390 (22%), Positives = 155/390 (39%), Gaps = 63/390 (16%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLN 107
           K K   +E+  S + GR L AT  +  G+ I ++ PL++       + +  C  C + L 
Sbjct: 19  KDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLREL- 77

Query: 108 PNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           P+     R   C  P CS           C  L     + PE A   I++L C    A N
Sbjct: 78  PDTRFMCRMG-CGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRLLC-VARALN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            +++E ++  V  LR+N  +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHRV-EVRNAAKCFKNFPTDKKIFEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
                 D+             E N        + LY +  +M H+C+PN+ ++   ++ +
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKTNI 234

Query: 276 FVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
            + R          I+T YT    G   R   L I + F C C RCSDPTE G Y S + 
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLY 294

Query: 329 CLNEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVT 371
           C + +       +  G      +C +         +    D+  G+ +A+ N+  +  + 
Sbjct: 295 CRDTNCSGLVVPETTGLPHPSYNCLVCKQKSTHAQMMKSQDFASGAINAKANSNSLRTLV 354

Query: 372 SQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
             L E+ D  +  N N   ++  L+ L+ +
Sbjct: 355 QYLNEKSDNFI-PNSNYVVIDAKLSVLQRL 383


>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
 gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
           sativus]
          Length = 482

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/382 (21%), Positives = 157/382 (41%), Gaps = 49/382 (12%)

Query: 62  PLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW 121
           P  GR L  T +  PG+ I  + P +  P  +  E  C  C   +N  L   + C   ++
Sbjct: 19  PEKGRSLFTTRNFRPGEEIISQEPYVCVPNNSPVESRCDRCFASIN--LKKCSACKVAWY 76

Query: 122 PACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR---- 173
             CS+ C           E  + +L C+ L   +  +++++ P +R +I   ++R+    
Sbjct: 77  --CSSVC--------QKSEWKLHRLECDALARLDKDRHKSVTPSIRLMIKLFIRRKLQSE 126

Query: 174 ------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
                 +   Y+ ++++  HMS       V Y ++ ++V     NF+ + PG  + + + 
Sbjct: 127 KVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLV-----NFILQWPGMNVKEIAV 181

Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
                 C    +   ++     GLY VI ++ H+CLPNA       + ++++   +    
Sbjct: 182 NFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLPNAVLVFEGRTAVVRAVQHIPAGA 241

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHKDQ 336
            +S  Y      T  R++ L     F C+C RC      D  +        +C N   DQ
Sbjct: 242 EVSISYIETAGSTMTRQKTLKENYLFTCTCSRCVKVAQEDEIKESAILEGYRCRN---DQ 298

Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
            D ++L  +   +D+ +TC  C    +  D+  + S++    D+    + + +S  E L 
Sbjct: 299 CDGFLLRNS---DDTGFTCQQCGLVRSKEDIKNIASKIKSISDE-ASTSLSSQSYAEALF 354

Query: 397 KLEAMFHPHHYHCYAVKHSLIQ 418
             E +       C+    SL+Q
Sbjct: 355 MYEKVEKLQRILCHPYSISLMQ 376


>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
 gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
          Length = 503

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 123/301 (40%), Gaps = 45/301 (14%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGISKIAGRGVVATRHLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCMIL 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS           C        + PE A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKTDCDLFRSWGPNEPEVANSVIIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++      K     +V E ++  V  +R+N  +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKKFPTDKKVIEIMNRTVAVIRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  +M H+C+PN+      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVMNHDCIPNSYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     I+T YT    G   R   L + K F C C RCSDPTE G Y S + 
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKNFTCKCPRCSDPTEKGAYISGLY 294

Query: 329 C 329
           C
Sbjct: 295 C 295


>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
 gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
          Length = 506

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 91/390 (23%), Positives = 159/390 (40%), Gaps = 59/390 (15%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +E+  S + GR + AT  L  G+ I ++SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEVGVSKIAGRGVMATRQLKRGEIIFQDSPLLIG--LAAQEEDTLNACSICLKM 76

Query: 110 LADNARCPR--CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC 167
           L D     R  C  P CS      +    H  +C + K          D     I+ L C
Sbjct: 77  LPDTRFMCRQGCGLPICSL----CAKKKQHKSDCDMFKSWGPN---EPDVANSVIIRLLC 129

Query: 168 LILQRRSPKKYQELKD-MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
           +       K  ++L   ++A++           E+ +  K  ++   +K   ++++ T A
Sbjct: 130 VARAINLTKDQRDLIYCLQANLDNNH-----RTEVRNAAKCFKNFPTDKKLVEIMNRTVA 184

Query: 227 KCLHWICGVIEVNGVD--IGRYT-------QGLYSVICLMEHNCLPNA------KHSNMM 271
                   V+  NG D    R T       + LY +  +M H+C+PN+      K +NM+
Sbjct: 185 --------VLRTNGFDKTTDRTTDNQEFSYRALYPLFAVMNHDCIPNSYYTFEEKTNNMI 236

Query: 272 -QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
            ++ + +     I+T YT    G   R  +L + K F C C RCSDPTE G Y S + C 
Sbjct: 237 VRAAVDIAEGEEITTTYTKLFTGNIARHLYLKMKKGFTCKCPRCSDPTEKGAYISGLYCR 296

Query: 331 NEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVTSQ 373
           +         +  G      +C +         +    D+  G+ +A++N+  +  +   
Sbjct: 297 DTKCSGLVVPETTGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNSLRTLVHY 356

Query: 374 LGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
           L E+ D  +  + N   ++  ++ L+ + H
Sbjct: 357 LNEKSDSFI-PSSNYVVIDAKMSVLQRLAH 385


>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
 gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
          Length = 503

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 124/301 (41%), Gaps = 45/301 (14%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C  L     + P+ A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLLKSWGPNEPDVANSVIIRLLC-VARALN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKIIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEDKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 C 329
           C
Sbjct: 295 C 295


>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
 gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
          Length = 496

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 93/395 (23%), Positives = 155/395 (39%), Gaps = 73/395 (18%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C        + P+ A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
           C +      +C  L V  +                          D+  G+ +A++N+  
Sbjct: 295 CRDT-----NCTGLVVPEISGLPHPNWNCLECKQKSTHAQMMKSQDFASGAINAKVNSNS 349

Query: 367 VHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
           +  +   L E+ D  +  N N   ++  ++ L+ +
Sbjct: 350 LRTLVQYLNEKSDSFI-PNSNYVVIDAKMSVLQRL 383


>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
 gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
          Length = 492

 Score = 68.9 bits (167), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 80/303 (26%), Positives = 127/303 (41%), Gaps = 49/303 (16%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
           K K   +E+  S + GR + AT +L  G+ I ++SPL++G  LA  E      C  C K 
Sbjct: 19  KDKDPAWEVGVSKIAGRGVMATRNLKRGEIIFQDSPLLIG--LAAHEEDTLNACSVCFKM 76

Query: 106 LNPNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLA 153
           L P+     R   C  P CS           C       ++ P+ A   I++L C    A
Sbjct: 77  L-PDTRFMCR-QGCGLPVCSLCAKKKQHKTDCDMFKAWGSNEPDVANSVIIRLLC-VARA 133

Query: 154 YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL 213
            N  K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  
Sbjct: 134 INLSKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKIVEIMNRTVAVLRTNGF 191

Query: 214 EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KH 267
           +K      D+             E N        + LY +  +M H+C+PN+      K 
Sbjct: 192 DKTTDRTTDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKT 232

Query: 268 SNMM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
           +NM+ ++ + +     I+T YT    G   R   L + K F C C RC DPTE G++ S 
Sbjct: 233 NNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCLDPTEKGSFISG 292

Query: 327 MKC 329
           + C
Sbjct: 293 LYC 295


>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
 gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
          Length = 499

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 82/304 (26%), Positives = 128/304 (42%), Gaps = 51/304 (16%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
           K K   +EI  S + GR + AT +L  G+ I ++ PL++G  LA  E      C  C K 
Sbjct: 19  KVKDPAWEIGISKIAGRGVMATRNLKRGEIIFQDMPLLIG--LAAQEEDSLNACSICLKL 76

Query: 106 LNPNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLA 153
           L P+     R   C  P CS           C        + P+ A   I++L C    A
Sbjct: 77  L-PDTRFMCR-QGCGLPICSLCGKKRQHKTDCDMFKSWGPNEPDVANSVIIRLLC-VARA 133

Query: 154 YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL 213
            N  K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  
Sbjct: 134 INLTKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLVEIMNRTVAVLRTNGF 191

Query: 214 EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNA------K 266
           +K        T+ +C             D   ++ + LY +  +M H+C+PN+      K
Sbjct: 192 DK--------TTDRC------------TDNQEFSYRALYPLFGVMNHDCIPNSYYTFEEK 231

Query: 267 HSNMM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
            +NM+ ++ + +     I+T YT    G   R   L + K F C C RCSDPTE G Y S
Sbjct: 232 TNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCSDPTEKGAYIS 291

Query: 326 AMKC 329
            + C
Sbjct: 292 GLYC 295


>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
 gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
          Length = 497

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C        + P+ A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
           C +      +C  L V  +                          D+  G+ +A++N+  
Sbjct: 295 CRDT-----NCTGLVVPEISGLPHPNWNCLECKQKSTHAQMMKSQDFASGAINAKVNSNS 349

Query: 367 VHLVTSQLGEQVDKLV 382
           +  +   L E+ D  +
Sbjct: 350 LRTLVQYLNEKSDSFI 365


>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
 gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
          Length = 500

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C        + P+ A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
           C +      +C  L V  +                          D+  G+ +A++N+  
Sbjct: 295 CRDT-----NCTGLVVPEITGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNS 349

Query: 367 VHLVTSQLGEQVDKLV 382
           +  +   L E+ D  +
Sbjct: 350 LRTLVQYLNEKSDSFI 365


>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
 gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
          Length = 302

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 116/278 (41%), Gaps = 33/278 (11%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + +  SP+ GR + A  ++  G+ + RE  L+VGP       +  C+ C+        ++
Sbjct: 42  WRVGDSPISGRGVFAARNIEKGEELFREHTLLVGPTAHRGRNLRTCIHCYHQPPGESEES 101

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RCLILQ 171
           A C   C  P C       + +  HA EC + +      L+  + +   IL + RC  L 
Sbjct: 102 ALCSAGCGLPVCP----DCAPSERHAVECQLFRKWQPKHLSIIEPRALRILSVVRCFFL- 156

Query: 172 RRSPKKYQELKDMEA-----HMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
             S  + Q L  M+A     +M +     E +E        L   F   +     +   +
Sbjct: 157 --SEPQRQLLYAMQANTDRYYMREVQRAAECFEHFPRDPDML-EYFYRTVCAFNTNAFES 213

Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRD 280
           +C         ++G ++   T+ L+ +  L+ H C PNA H      + ++ +   +   
Sbjct: 214 RC--------HIDGQEV--VTRALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAG 263

Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
             I+  YT  LW T  R+  L +TK+F C C RC DPT
Sbjct: 264 AEITMSYTKLLWSTLARKIFLGMTKHFMCQCPRCQDPT 301


>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
          Length = 500

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 89/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C        + P+ A   +++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVVIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
           C +      +C  L V  +                          D+  G+ +A++N+  
Sbjct: 295 CRDT-----NCAGLVVPEITGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNS 349

Query: 367 VHLVTSQLGEQVDKLV 382
           +  +   L E+ D  +
Sbjct: 350 LRTLVQYLNEKSDSFI 365


>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
 gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
          Length = 499

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 123/302 (40%), Gaps = 47/302 (15%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
           K K   +EI  S + GR + AT  L  G+ I ++SPL++G  LA  E      C  C K 
Sbjct: 19  KDKDPAWEIGVSKIAGRGVMATRSLKRGEIIFKDSPLLIG--LAAHEEDALNACTVCLKE 76

Query: 106 LNPNLADNAR--CPRCFWPACSAR------CSGLSDAHTHAPECA---ILKLGCETLLAY 154
           L P+     R  C     P C+ +      C        + PE A   I++L C    A 
Sbjct: 77  L-PDTKFMCRQGCGLPVCPLCAKKKQHKTDCDMFKSWGPNEPEVANSVIIRLLC-IARAI 134

Query: 155 NDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE 214
           N  K +  L + CL     +  +  E+++            ++ E ++  V  +R+N  +
Sbjct: 135 NLTKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVMRTNGFD 192

Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSK 274
           K      D+             E N        + LY +  +M H+C+PN+ ++   ++ 
Sbjct: 193 KTTDRTNDNQ------------EFN-------YRALYPLFGVMNHDCIPNSYYTFDEKTN 233

Query: 275 LFVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
             V R          I+T YT    G   R   L + K F C C RCSDPTE G + S +
Sbjct: 234 HMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCSDPTEKGAFISGL 293

Query: 328 KC 329
            C
Sbjct: 294 YC 295


>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
 gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
          Length = 497

 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 80/301 (26%), Positives = 123/301 (40%), Gaps = 45/301 (14%)

Query: 50  KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
           K K   +EI  S + GR + AT  L  G+ I R+SPL++G  LA  E   L         
Sbjct: 19  KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLMG--LAAHEEDSLNACSVCLKM 76

Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
           L D     R  C  P CS         + C        + P+ A   I++L C    A N
Sbjct: 77  LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135

Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
             K +  L + CL     +  +  E+++            ++ E ++  V  LR+N  +K
Sbjct: 136 LTKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
                 D+             E N        + LY +  ++ H+C+PNA      K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234

Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
           M+ ++ + +     ++T YT    G   R   L + K F C C RCSDPTE G + S + 
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294

Query: 329 C 329
           C
Sbjct: 295 C 295


>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
          Length = 535

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 142/359 (39%), Gaps = 38/359 (10%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P EI+  P  GR L AT    PG  I   +PL++ P +A    +C  C +P  P L    
Sbjct: 6   PVEIRPHPSKGRALHATKSFPPGAVIFSFTPLLLLPTVACLTSVCSHCLRPGEPRL---- 61

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL--PLRCLILQ 171
            C RC   A C A C   +    H+ EC  L    +T+    D +   +L  P R LI  
Sbjct: 62  -CSRCHAAAYCDATCQAAAWKAVHSRECKAL---TKTI---RDEERRRMLPTPTRALIQA 114

Query: 172 RRSPKKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
               K    L D+E H + KR  G E + +I+ ++      F     GD     +A+ L 
Sbjct: 115 LLWGKIRDGLADLEGHVLEKRAEGDE-WRDIE-MMAVAGCAFSGLGRGDQDVRMAAEMLC 172

Query: 231 WICGVIEVNGVDIGRYTQGLY--SVICLMEHNCLPNAKHSNMMQSKLFVFRD-----THI 283
            I         D      GLY    + +  H+C+PNA    + ++ L +  +       I
Sbjct: 173 KIQNN-SFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRAGDEI 231

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
              YT        RRE LA  K F C C RC D   +    +   C+N +       +  
Sbjct: 232 EIAYTFYTDPLPKRREALAHYK-FTCQCLRCRDNLNVYQVAAISPCVNLN---SQSLVPD 287

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
           V+ L N    +  +  A +N         Q  E V+ +V     ++SL E    L++ F
Sbjct: 288 VSKLRNHPATSSAAKQAIVN---------QHSESVEHIVDALAVIESLSERRKLLKSEF 337


>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
 gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
          Length = 458

 Score = 65.5 bits (158), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 75/381 (19%), Positives = 151/381 (39%), Gaps = 53/381 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + + P  GR L  T + +PG+ I R+ P +  P                  N +  +RC 
Sbjct: 15  VSNLPEKGRCLLTTKNFNPGEVILRQEPYVCVPN-----------------NSSTVSRCD 57

Query: 118 RCFWPACSARCSGLS-----DAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
            CF      +CS         +     E  + +L C  L      K +A+ P   L+++ 
Sbjct: 58  GCFASESLKKCSACQVVWYCGSTCQKSEWKLHRLECNALSRLEKEKRKAVTPSIRLMVRL 117

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
              +K Q   D++           +Y ++ ++V     +F+ + P   L + +       
Sbjct: 118 YLRRKLQNEMDLDEKQ------LVLYAQMANLV-----HFILQWPEINLKEIAENFSKLA 166

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMY 287
           C    +   ++     GLY V+ ++ H+C+PNA       S+++++   +     +S  Y
Sbjct: 167 CNAHTICDCELRPLGTGLYPVVSIINHSCMPNAVLTFEGKSSVVRAVEHIPEGAEVSIAY 226

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC---SDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
            +    T  R++ L    +F C+C RC    D  +        +C +   D+ + ++L  
Sbjct: 227 IDTAGSTMTRQKALKEQYFFTCTCPRCIKVYDDIQESAILEGYRCKD---DRCNGFLLRD 283

Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP-----NVKSLEEMLTKLE 399
           +    D  + C +C  R +  +V  +  ++    DK ++         V SL +M+ KL+
Sbjct: 284 S---EDKGFICQTCGLRRSKEEVKRIVCEITAISDKKLKSTSPGNHEEVISLYKMIEKLQ 340

Query: 400 A-MFHPHHYHCYAVKHSLIQL 419
             + HP        +  L+++
Sbjct: 341 MELCHPFSISLMRTQEELLKI 361



 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 14 RCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C  + +L KCSAC+ V YCG   QK  WKLH+ +C  L
Sbjct: 55 RCDGCFASESLKKCSACQVVWYCGSTCQKSEWKLHRLECNAL 96


>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
          Length = 481

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 89/412 (21%), Positives = 170/412 (41%), Gaps = 55/412 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L A     PG+ +  + P    P        C  C    + NL   + C 
Sbjct: 19  VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 76  RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +  +F  +L    + 
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
            T +K   + C    +   ++     GLY V+ ++ H+C+PNA       +  +++   +
Sbjct: 185 HTFSK---FACNAHTICDPELRSLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            ++  +S  Y      T  R++ L    YF C+C RC   +E        +C N+ K  G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQ---ENPNVKSL 391
             ++LP      +  +TC  CS   +  ++  + S    L ++V  LV    +N  V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEVGSM 354

Query: 392 EEMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
            + + +LE  ++HP        + +L+++Y      Q    Y +L+  + ER
Sbjct: 355 YKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 406



 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
          Length = 425

 Score = 65.1 bits (157), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 89/411 (21%), Positives = 169/411 (41%), Gaps = 54/411 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L A     PG+ +  + P    P        C  C    + NL   + C 
Sbjct: 19  VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 76  RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +  +F  +L    + 
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
            T +K   + C    +   ++     GLY V+ ++ H+C+PNA       +  +++   +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            ++  +S  Y      T  R++ L    YF C+C RC   +E        +C N+ K  G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQE--NPNVKSLE 392
             ++LP      +  +TC  CS   +  ++  + S    L ++V  LV    N  V S+ 
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVGSMY 354

Query: 393 EMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
           + + +LE  ++HP        + +L+++Y      Q    Y +L+  + ER
Sbjct: 355 KTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 405



 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 481

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 89/412 (21%), Positives = 170/412 (41%), Gaps = 55/412 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L A     PG+ +  + P    P        C  C    + NL   + C 
Sbjct: 19  VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 76  RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +  +F  +L    + 
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
            T +K   + C    +   ++     GLY V+ ++ H+C+PNA       +  +++   +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            ++  +S  Y      T  R++ L    YF C+C RC   +E        +C N+ K  G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQ---ENPNVKSL 391
             ++LP      +  +TC  CS   +  ++  + S    L ++V  LV    +N  V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEVGSM 354

Query: 392 EEMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
            + + +LE  ++HP        + +L+++Y      Q    Y +L+  + ER
Sbjct: 355 YKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 406



 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
 gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 480

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 89/411 (21%), Positives = 169/411 (41%), Gaps = 54/411 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L A     PG+ +  + P    P        C  C    + NL   + C 
Sbjct: 19  VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 76  RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +  +F  +L    + 
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
            T +K   + C    +   ++     GLY V+ ++ H+C+PNA       +  +++   +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            ++  +S  Y      T  R++ L    YF C+C RC   +E        +C N+ K  G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQE--NPNVKSLE 392
             ++LP      +  +TC  CS   +  ++  + S    L ++V  LV    N  V S+ 
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVGSMY 354

Query: 393 EMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
           + + +LE  ++HP        + +L+++Y      Q    Y +L+  + ER
Sbjct: 355 KTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 405



 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L AT    PGD I  + P    P   L    C  C    + NL   + C 
Sbjct: 19  VASIPGKGRGLIATFTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
           R  W  CS+ C           E  + +L C  + A  + + + + P +R ++   L+R+
Sbjct: 76  RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTENRKKMLTPTIRLMVRLALKRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                     S   Y  +  +E+H+SK      V Y ++ ++V  +       LP   LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDALESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
             + +     + C    +   ++     GLY VI ++ H+C+PNA       +  +++  
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            + +D  +S  Y      T+ R   L    +F C+C RC    +        +C N+   
Sbjct: 240 PIDKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVKGFDEDPLLEGFRCKNQ--- 295

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
             D ++LP +       +TC  C A  +  ++  + S++ +  DK
Sbjct: 296 TCDGFLLPSS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337


>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
 gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
           77-13-4]
          Length = 530

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 69/273 (25%), Positives = 113/273 (41%), Gaps = 34/273 (12%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P+++K   L GR L A+    PGD I   +P I+ P L+ A  +C  C KP  P      
Sbjct: 6   PFQVKRH-LKGRCLIASRSFTPGDTILTFTPTILIPSLSHASTVCSHCFKPGEPR----- 59

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYKYEAILPLRCLILQR 172
            C RC   A C A C   +    H+ EC +L K+  +        +     P+R ++   
Sbjct: 60  SCSRCHAVAYCDASCQSAAWKAIHSKECKVLQKVSAQG-------RPGLPTPVRAVVQAL 112

Query: 173 RSPKKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
              +    L+D+E + +S RG       E+ ++     +            +   K L  
Sbjct: 113 VKAEVGAALEDLEGNDLSWRGSDKWADMEMMAMGASAFAGL------GTTQEAVKKALSL 166

Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAK-----HSNMMQSKLFVFRDT 281
           +C  I+ N       DIG+    L   + +  H+C+PNA         +++++  +  D 
Sbjct: 167 LCK-IQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADE 225

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
            I   YT+  +    R+E LA   +F C C RC
Sbjct: 226 EIEISYTDYNYPLSKRKEALA-PYFFTCECTRC 257


>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
           vinifera]
          Length = 477

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 82/391 (20%), Positives = 157/391 (40%), Gaps = 54/391 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + ++P  GR L +  D  PG+ I  + P +  P  +     C GC +  + NL   + C 
Sbjct: 15  VSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFR--SSNLKKCSACH 72

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
             ++  C + C   SD   H       +L C  L      + +++ P   L+++    +K
Sbjct: 73  VVWY--CGSTCQK-SDWKLH-------RLECNALSRLEKERQKSLTPSIRLMVKLYMRRK 122

Query: 178 YQELKDM-----------EAHMSKRGPGTE----VYEEIDSIVKYLRSNFLEKLPGDVLD 222
            Q  K M           EA +S      E    +Y ++ ++V     N + + P   + 
Sbjct: 123 LQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLV-----NLILQWPDINVK 177

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
           + +       C    +   ++     GLY VI ++ H+CLPN+    + + +L V R   
Sbjct: 178 EIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVL--VFEERLAVVRAVQ 235

Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCL 330
                T +   Y      T  R++ L     F C+C RC      D  +        +C 
Sbjct: 236 HIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYRCK 295

Query: 331 NEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK-LVQENPNVK 389
           +   D+ D ++L  +   +D  + C  C    N  ++  + S+L    DK  +  +    
Sbjct: 296 D---DRCDGFLLRDS---DDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSSSQAT 349

Query: 390 SLEEMLTKLEA-MFHPHHYHCYAVKHSLIQL 419
           S+ +M+ KL+  +FHP   +    + +++++
Sbjct: 350 SIYKMIEKLQTKLFHPFSINLMRTREAILKI 380



 Score = 43.1 bits (100), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C R + L KCSAC  V YCG   QK  WKLH+ +C  L
Sbjct: 55 RCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLECNAL 96


>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
          Length = 443

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L AT    PGD I  + P    P   L    C  C    + NL   + C 
Sbjct: 19  VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
           R  W  CS+ C           E  + +L C  + A  + + + + P +R ++   L+R+
Sbjct: 76  RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                     S   Y  +  +E+H+SK      V Y ++ ++V  +       LP   LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
             + +     + C    +   ++     GLY VI ++ H+C+PNA       +  +++  
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            + +D  +S  Y      T+ R   L    +F C+C RC    +        +C N+   
Sbjct: 240 PINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ--- 295

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
             D ++LP +       +TC  C A  +  ++  + S++ +  DK
Sbjct: 296 ACDGFLLPNS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337


>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
 gi|194691468|gb|ACF79818.1| unknown [Zea mays]
 gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
          Length = 482

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L AT    PGD I  + P    P   L    C  C    + NL   + C 
Sbjct: 19  VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
           R  W  CS+ C           E  + +L C  + A  + + + + P +R ++   L+R+
Sbjct: 76  RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                     S   Y  +  +E+H+SK      V Y ++ ++V  +       LP   LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
             + +     + C    +   ++     GLY VI ++ H+C+PNA       +  +++  
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            + +D  +S  Y      T+ R   L    +F C+C RC    +        +C N+   
Sbjct: 240 PINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ--- 295

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
             D ++LP +       +TC  C A  +  ++  + S++ +  DK
Sbjct: 296 ACDGFLLPNS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337


>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
 gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
          Length = 482

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 89/413 (21%), Positives = 161/413 (38%), Gaps = 58/413 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L AT    PGD I  + P    P   L    C  C    + NL   + C 
Sbjct: 19  VASIPGKGRGLIATRTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMCQ 76

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
             ++  CS  C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 77  VTWY--CSTNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLVLKRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                     S   Y  +  +E+H+ K      V Y ++ ++V  +       LP   LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDALESHIWKVDENHLVLYAQMANLVSLI-------LPLIELD 179

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
             + +     + C    +   ++     GLY VI ++ H+C+PNA       +  +++  
Sbjct: 180 LKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            + ++  +S  Y      T+ R   L    +F CSC RC   +E        +C N+   
Sbjct: 240 PIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTCSCPRCVKGSEEDALLEGFRCKNQ--- 295

Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL---------VQENP 386
             D ++LP +       +TC  CS   +  ++  + S++ +  DK           +   
Sbjct: 296 TCDGFLLPDS---GKKAYTCQKCSVSRDEEEIQKMRSEILQLSDKASSFLSSGNKAEAGS 352

Query: 387 NVKSLEEMLTKLEAMFHPHHYH-CYAVKHSLIQLYGTQPGYAYTQLSSSLLER 438
             K++E++   L   F     H C  +    ++L   +   AY +L+  + ER
Sbjct: 353 VYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQDWRTALAYCRLTVPVYER 405


>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
 gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
          Length = 301

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 75/281 (26%), Positives = 112/281 (39%), Gaps = 40/281 (14%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+  SP  GR L+AT +L  GD +  E P       +LA+ IC  C +  +       RC
Sbjct: 5   EVFDSPGKGRGLRATKELLSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDK----LQRC 60

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR--- 172
            +C F   C   C   +    H  EC  +K        +     E I  L   I+ R   
Sbjct: 61  GQCKFAQYCDKTCQS-AGWKEHKLECGAIKAFGAIGDNFGKAPNENI-RLAARIMWRLDK 118

Query: 173 ----RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV----LDDT 224
                S  +   L D+E H+S   P  ++ E     +K    NFL+  P +     +DD 
Sbjct: 119 EGGVMSDMQMTSLDDLENHISDM-PEDDLKE-----LKVDIHNFLDYWPRNSKQHRVDDI 172

Query: 225 SAKCLHWICGVIEVNGVDIGR------YTQGLYSVICLMEHNCLPN----AKHSNMMQSK 274
           S      I GVI  NG  +           GL+  +CL+ H+C PN      H  +    
Sbjct: 173 SH-----IFGVINCNGFTVSDQRGLNAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRA 227

Query: 275 LFVFR-DTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           L        ++  Y + L  T+ R+  L +  +F+C+CE C
Sbjct: 228 LGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHC 268


>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 82/394 (20%), Positives = 157/394 (39%), Gaps = 57/394 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + ++P  GR L +  D  PG+ I  + P +  P  +     C GC +  + NL   + C 
Sbjct: 15  VSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFR--SSNLKKCSACH 72

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
             ++  C + C   SD   H       +L C  L      + +++ P   L+++    +K
Sbjct: 73  VVWY--CGSTCQK-SDWKLH-------RLECNALSRLEKERQKSLTPSIRLMVKLYMRRK 122

Query: 178 YQELKDM-----------EAHMSKRGPGTE----VYEEIDSIVKYLRSNFLEKLPGDVLD 222
            Q  K M           EA +S      E    +Y ++ ++V     N + + P   + 
Sbjct: 123 LQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLV-----NLILQWPDINVK 177

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
           + +       C    +   ++     GLY VI ++ H+CLPN+    + + +L V R   
Sbjct: 178 EIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNS--VLVFEERLAVVRAVQ 235

Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCL 330
                T +   Y      T  R++ L     F C+C RC      D  +        +C 
Sbjct: 236 HIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYRCK 295

Query: 331 NEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--- 387
           +   D+ D ++L  +   +D  + C  C    N  ++  + S+L    DK    + +   
Sbjct: 296 D---DRCDGFLLRDS---DDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSSSHYV 349

Query: 388 -VKSLEEMLTKLEA-MFHPHHYHCYAVKHSLIQL 419
              S+ +M+ KL+  +FHP   +    + +++++
Sbjct: 350 EATSIYKMIEKLQTKLFHPFSINLMRTREAILKI 383



 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C R + L KCSAC  V YCG   QK  WKLH+ +C  L
Sbjct: 55 RCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLECNAL 96


>gi|281208900|gb|EFA83075.1| hypothetical protein PPL_03863 [Polysphondylium pallidum PN500]
          Length = 470

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 42/295 (14%)

Query: 136 THAPECA-ILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ----ELKDMEAHMSK 190
            H  EC+ ++K+   T+   ND  + ++L LR L+ +    + Y      L   +    K
Sbjct: 127 VHQLECSFVVKVAQATVDVQNDLSH-SLLVLRILLKKNILSELYNGGVGTLSVQQQEFEK 185

Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD-------TSAKCLHWICGVIEVNGVDI 243
           R P     + ++   ++    F+E L  D   D       T  + ++  C  + VN    
Sbjct: 186 RAP-----QWLEQTSRFA-DRFIELLQEDKHIDPTLLSFFTKQEIINTACATM-VNSFAS 238

Query: 244 GR----YTQGLYSVICLMEHNCLPNAKHS---NMMQSKLF--VFRDTHISTMYTNALWGT 294
            +     + G +    L+ H+C PN  +S   N +Q +    + +   I   Y + L  T
Sbjct: 239 TKNSIIVSNGFFYEAALLNHSCQPNTFYSIQNNQLQMRCIQDIEQGGDIFDSYVDLLEPT 298

Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV---NPLDN-- 349
             R++ L  +K+F C CERC DPTE G Y S+ KC +  +     W+ P+   N  DN  
Sbjct: 299 FERQQVLLQSKHFYCRCERCQDPTECGRYLSSFKCPSCQEG----WVSPIISYNEKDNVL 354

Query: 350 DSDWTCGSCSARLNARDVHLV--TSQLGEQV-DKLVQENPNVKSLEEMLTKLEAM 401
              W C  C    N R +  +    QL + + +K  + +PN   LE++ +K+E +
Sbjct: 355 ARLWRCLGCRQEQNDRIIQQLKGIDQLKQTINEKKWRVDPNA-YLEQIGSKMEEV 408


>gi|194768767|ref|XP_001966483.1| GF21974 [Drosophila ananassae]
 gi|190617247|gb|EDV32771.1| GF21974 [Drosophila ananassae]
          Length = 364

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 71/284 (25%), Positives = 113/284 (39%), Gaps = 45/284 (15%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
           + +  SP+ GR + AT D+  G+ + RE  L+VGP    +  +  C  C+  +  +   +
Sbjct: 49  WRVADSPISGRGIFATRDIAAGEELFREHTLLVGPTAHRSTNLRTCTLCYCLIPGDTDLD 108

Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
           A CP  C  P C       SD H         +L C+    +   + + I P        
Sbjct: 109 ALCPAGCGLPVCPD--CRTSDRH---------QLECKLFRKWKPLETQRIEPRALRILSV 157

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           +RC  L    P++ + L  M+A+ + R    EV    +    + +         D+LD  
Sbjct: 158 VRCFFLD--EPQR-KLLYAMQAN-ADRYYMKEVQRAAECFQHFPKEQ-------DMLDYF 206

Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
                 +     E    V G ++    + L+ +  L+ H C PNA H       + V   
Sbjct: 207 YRTICAFNTNAFESRSHVGGHEV--LVRALFPLAGLLNHQCTPNAAHHFEDGETIVVCST 264

Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
                   I+  Y   LW T  R+  L +TK+F C C RC DPT
Sbjct: 265 ERIPCGAEITMSYAKLLWSTLARKIFLGMTKHFMCQCVRCQDPT 308


>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 490

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 116/278 (41%), Gaps = 36/278 (12%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           +  SP  GR L A    H G+ I ++ P +     +     C  C++  + NL   + C 
Sbjct: 21  VSVSPAKGRCLIANRAFHRGEVILQQEPYVCALDDSSQALRCDRCYRQ-SSNLKRCSACK 79

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL---ILQRRS 174
             F+  C A C   S    H  ECAIL      +    + K      LR +   +++RR 
Sbjct: 80  TVFY--CCANCQR-SGWGLHKFECAIL------VRLLKEKKRSPTPSLRLVMRFLIKRRL 130

Query: 175 PKKYQELKD----------MEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDV-LD 222
             +  +LK           +  HMS+      V Y ++ SIV+ + +      P +V + 
Sbjct: 131 QAERVQLKAAVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMA------PDEVNVK 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAK-HSNMMQSKLFVFRDT 281
           + +     + C    +   ++     GLY VI ++ H+C+PNA  H +  ++ L    DT
Sbjct: 185 EVTQMICRFACNAHTICDEEVRPLGTGLYPVISIVNHSCVPNAVLHFDGNRAALRALEDT 244

Query: 282 H----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
                I+  Y      T  RR+ L    YF+C+C RCS
Sbjct: 245 QEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRCS 282



 Score = 38.9 bits (89), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 15/78 (19%)

Query: 8   EDEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK---KLPYEIKSSP 62
           +  + +RC  C  + + L +CSACK V YC    Q+  W LHK +C    +L  E K SP
Sbjct: 55  DSSQALRCDRCYRQSSNLKRCSACKTVFYCCANCQRSGWGLHKFECAILVRLLKEKKRSP 114

Query: 63  ----------LLGRYLQA 70
                     L+ R LQA
Sbjct: 115 TPSLRLVMRFLIKRRLQA 132


>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
 gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
          Length = 484

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 138/340 (40%), Gaps = 50/340 (14%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR L A  D + G  +  + P +           C  C K  +P L   + C   F+  C
Sbjct: 25  GRCLVADRDFYAGQTVLEQDPYVAVLDGDSRGNRCDACFKQ-SPALRRCSACKFVFY--C 81

Query: 125 SARCSGLSDAHTHAPECAIL-KLGCE----------TLLAYNDYKYEAILPLRCLILQRR 173
           S  C   S    H  EC +L +L  E           LL   + +   +LP+  L     
Sbjct: 82  SPTCQ-RSQWKIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATGVLPVTAL----- 135

Query: 174 SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVK-YLRSNFLEKLPGDVLDDTSAKC-LH 230
               Y+ ++ +  H S+ G    V Y ++  ++K  L + + E +     D     C  H
Sbjct: 136 --DNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRISCNGH 193

Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHIS 284
            IC   E+  V IG     L+ V+ ++ H+C  N+      KH+ ++++   + R   ++
Sbjct: 194 TICD-DELRPVGIG-----LFPVVSIINHSCSSNSLLLFDGKHA-VVRALGTISRGCEVT 246

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDCW 340
             Y      T  RRE L+   YF+C C RC+D +E G Y      A+ CL       D  
Sbjct: 247 VSYIELGASTNSRREALSDQYYFHCKCPRCTDDSEAGLYKDDVLEAVACL-------DPA 299

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
                 L N S W C SC +     +V+ ++++    ++K
Sbjct: 300 CESFMRLSNGS-WRCMSCGSSREPNEVNKLSTEAEGMIEK 338



 Score = 42.0 bits (97), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 9   DEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           D    RC  C  +  AL +CSACK V YC    Q+  WK+H+ +CK L
Sbjct: 53  DSRGNRCDACFKQSPALRRCSACKFVFYCSPTCQRSQWKIHQEECKVL 100


>gi|358388868|gb|EHK26461.1| hypothetical protein TRIVIDRAFT_35412 [Trichoderma virens Gv29-8]
          Length = 535

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 110/269 (40%), Gaps = 18/269 (6%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P EI+  P  GR L AT    PG  I   +PL++ P ++    +C  C +P NP      
Sbjct: 6   PLEIRPHPSKGRGLYATKSFSPGAVIFPFTPLLLLPTVSCLSSVCSYCLRPGNPRA---- 61

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            C RC   + C A C   +    H+ EC  L+ G +        + +   P R L+    
Sbjct: 62  -CSRCHAASYCDAACQAAAWKAVHSRECKALRQGIKD----EGRRRQLPTPTRALMQALL 116

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI- 232
             +    LKD+E H+ ++    + + +I+ ++      F  K   + L   +A+ L  I 
Sbjct: 117 CGEIGDGLKDLEGHVLEKKAEGDEWRDIE-MMAMAACAFSGKGTAEELVRRAAEMLCKIQ 175

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-----THISTMY 287
               +    D+G     L   + +  H+C+PNA    + ++ L V          I   Y
Sbjct: 176 NNSFQRFDPDLGVVGLFLEPTLAMANHSCIPNAAVQFIGRNTLLVAESPIRAGDEIELAY 235

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSD 316
           T        RRE LA  K F C C RC D
Sbjct: 236 TFYTDPLLKRREALAHYK-FVCQCLRCRD 263


>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
          Length = 433

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 109/274 (39%), Gaps = 39/274 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+ T +   GD I  E  L      +LAE IC  C +          +C 
Sbjct: 6   IFDSPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEK----LQKCS 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
           +C F   C   C     A  H  EC  +K       AY     E I      ++      
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKQECGAIK-------AYGKAPNENI-----RVVSHMQLI 108

Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVI 236
             +EL+D  A M +        +EI  + K    NFL+  P +    T     H I GVI
Sbjct: 109 TVEELEDHVADMQE--------DEIKEL-KVDIHNFLDYWPRNSKQHTIDDISH-IFGVI 158

Query: 237 EVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM-MQSKLFVFRDTHIST 285
             NG  +   R  Q    GL+  +C++ HNC PN      H  + ++S   +     ++ 
Sbjct: 159 NCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGKIELRSLGKIAEGEELTV 218

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
            Y + L  ++ RR  L    +F+C C+ C + T+
Sbjct: 219 AYVDFLNLSEERRRLLKTQYFFDCQCDYCKNGTK 252


>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
          Length = 598

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 80/331 (24%), Positives = 129/331 (38%), Gaps = 53/331 (16%)

Query: 47  HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL 106
           H+P    L + +   P+ GR + A  D+  G+ I  E PL+     A             
Sbjct: 112 HQPHNSSLAWHVGHCPVKGRIMVAARDIEAGEIILEEEPLVAASWHA------------- 158

Query: 107 NPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLR 166
                   RC  C  P  S+ CS +   H + P  A      E  L       E +  + 
Sbjct: 159 -------HRCIECHEPHRSSECSRVQ--HKYPPAVAANMEEIERFL-------EGLESIE 202

Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEK--LPGDVLDD 223
            L   RR  K   +     A M    P T    E     V  +R + + K  +P  V D+
Sbjct: 203 ELDSARRFIKLLSKSSSDPAVMVHLSPCTAANMESCRKAVSAVRKHKVAKAIIPPTVSDE 262

Query: 224 TSAKCLHWICGVIEVNGVDIGRY-TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF---- 278
            +A+ L     ++  N  ++G +   GL+ + CLMEH+C PN   S    ++L+V     
Sbjct: 263 DAARML----SILNTNSHELGEWGGSGLFLLGCLMEHSCAPNCNFST-YGTQLWVTALNS 317

Query: 279 --RDTHISTMYTNALWG-TQPRREHLAITKYFNCSCERCSD-PTELGTYFSAMKCLNEHK 334
             +   +S  Y ++ +  T  RR+ L  +  F C C RC+  P  +  +   +K      
Sbjct: 318 IKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTLPDVMRAFICPVK------ 371

Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNAR 365
            Q    + P+      SDW C  C ++L+ +
Sbjct: 372 -QCKGKVCPLGEGSKPSDWVCEKCKSQLDKK 401


>gi|440790376|gb|ELR11659.1| peptidylprolyl cis-trans isomerase, FKBP-type domain containing
          protein [Acanthamoeba castellanii str. Neff]
          Length = 165

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 24/45 (53%), Positives = 30/45 (66%), Gaps = 4/45 (8%)

Query: 15 CAVC----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          CAVC    RE  L +C+AC+ V YC  +HQK+HWK HK  CK +P
Sbjct: 14 CAVCQVSSREKELKRCAACQAVWYCDAEHQKQHWKTHKAICKLIP 58


>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oryzias latipes]
          Length = 476

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/287 (27%), Positives = 116/287 (40%), Gaps = 43/287 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+A  +   GD I  E+ L      +LAE +C  C +          RC 
Sbjct: 6   IFDSPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEK----LQRCG 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
           +C F   C   C     A  H  EC+ +K       AY     E I  L   IL R    
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKQECSAIK-------AYGKVPNENI-RLVARILWRLDKD 112

Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
               S  +   L ++E H+      T++ E+    +K    NFL+  P      T     
Sbjct: 113 GSTVSDMQLTTLDELEDHI------TDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDIS 166

Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM-MQSKLFVF 278
           H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN      H  + ++S   + 
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIA 225

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC----SDPTELG 321
               ++  Y + +  ++ R+  L    +F+C+CE C     D  +LG
Sbjct: 226 EGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLG 272


>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/300 (22%), Positives = 121/300 (40%), Gaps = 42/300 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + + P  GR L  T D+ PG+ I  + P I  P     E  C GC K    NL   + C 
Sbjct: 15  VSNLPHKGRSLFTTRDIRPGEVILSQKPYICVPNNTSLESRCDGCFK--TNNLKKCSGCQ 72

Query: 118 RCFWPACSARCSGLSDAHTHAPEC-AILKLGCE-------------TLLAYNDYKYEAIL 163
             ++  C + C   S+   H  EC A+ +L  E              L    + + E +L
Sbjct: 73  VVWY--CGSFCQK-SEWKLHRHECKALTRLEKEKRKFVTPTIRLMVKLYIKRNLQNEKVL 129

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE-VYEEIDSIVKYLRSNFLEKLPGDVLD 222
           P+        S   Y  ++ + +HMS+       +Y ++ ++V     N + + P   L 
Sbjct: 130 PI-------TSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLV-----NLILQFPSVDLR 177

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
           + +     + C    +   ++     GL+ ++ ++ H+C PNA     +   ++++   +
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRAMDNI 237

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD-----PTELGTYFSAMKCLNE 332
            +D+ I+  Y      T  R++ L     F+C C RCS+       E        +C NE
Sbjct: 238 SKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNVGKPHDIEESAILEGYRCSNE 297



 Score = 42.0 bits (97), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C +T  L KCS C+ V YCG   QK  WKLH+ +CK L
Sbjct: 55 RCDGCFKTNNLKKCSGCQVVWYCGSFCQKSEWKLHRHECKAL 96


>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
 gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
           Full=ASH1-related protein 1; AltName: Full=Protein SET
           DOMAIN GROUP 37
 gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
 gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
          Length = 480

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/300 (21%), Positives = 120/300 (40%), Gaps = 42/300 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + + P  GR L    D  PG+ I  + P I  P    +E  C GC K    NL   + C 
Sbjct: 15  VSNLPQKGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCFK--TNNLKKCSACQ 72

Query: 118 RCFWPACSARCSGLSDAHTHAPEC-AILKLGCET-------------LLAYNDYKYEAIL 163
             ++  C + C   S+   H  EC A+ +L  E              L    + + E +L
Sbjct: 73  VVWY--CGSSCQK-SEWKLHRDECKALTRLEKEKRKFVTPTIRLMVRLYIKRNLQNEKVL 129

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE-VYEEIDSIVKYLRSNFLEKLPGDVLD 222
           P+        +   Y  ++ + +HMS+       +Y ++ ++V     N + + P   L 
Sbjct: 130 PI-------TTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLV-----NLILQFPSVDLR 177

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
           + +     + C    +   ++     GL+ ++ ++ H+C PNA     +   ++++   +
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRAMDNI 237

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD-----PTELGTYFSAMKCLNE 332
            +D+ I+  Y      T  R++ L     F+C C RCS+       E        +C NE
Sbjct: 238 SKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNFGKPHDIEESAILEGYRCANE 297



 Score = 45.1 bits (105), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 23/42 (54%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C +T  L KCSAC+ V YCG   QK  WKLH+ +CK L
Sbjct: 55 RCDGCFKTNNLKKCSACQVVWYCGSSCQKSEWKLHRDECKAL 96


>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Oreochromis niloticus]
          Length = 476

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 76/278 (27%), Positives = 113/278 (40%), Gaps = 39/278 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+AT +   GD I  E+ +      +LAE IC  C +          RC 
Sbjct: 6   IFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEK----LQRCG 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
           +C F   C   C     A  H  EC+ +K       AY     E I  L   I+ R    
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKLECSAIK-------AYGKVPNENI-RLVARIMWRLDKE 112

Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
               S  +   L ++E H++   P  ++ E     +K    NFL+  P +    T     
Sbjct: 113 GSTVSDMQLTTLDELEDHIADM-PEDDLKE-----LKVDIHNFLDYWPHNSKQHTIDDIS 166

Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNMMQSKLF-VF 278
           H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN      H  +    L  + 
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIA 225

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
               ++  Y + L  ++ R+  L    +F+C+CE C +
Sbjct: 226 EGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 263


>gi|321448192|gb|EFX61351.1| hypothetical protein DAPPUDRAFT_273988 [Daphnia pulex]
          Length = 326

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 30/44 (68%), Gaps = 2/44 (4%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
          CAVCR  A   C  C E++YC K+HQK+HW  HK +CK  PY+I
Sbjct: 19 CAVCRSKASQICGGCGEISYCSKEHQKQHWATHKSQCK--PYKI 60


>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
 gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
          Length = 482

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/276 (19%), Positives = 106/276 (38%), Gaps = 49/276 (17%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR L  T D +PG+ I  + P +  P                  N A  +RC RCF  + 
Sbjct: 22  GRSLVTTKDFNPGEVIISQEPYVCVPN-----------------NSATESRCDRCFSSSN 64

Query: 125 SARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
             +CS    A      C  L     ++ CE L   +  + + + P   L+++    +K Q
Sbjct: 65  VKKCSACQVAWYCGSSCQKLEWKLHRIECEALSKLDKDRRKYVTPSIRLMVRLLIRRKLQ 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS----------------NFLEKLPGDVLDD 223
             K + +        T+ Y+ ++++V +++                 N + + P   + +
Sbjct: 125 SEKIISS------TATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWPDVNIKE 178

Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVF 278
            +       C    +   ++     GLY V+ ++ H+CLPNA         ++ +   + 
Sbjct: 179 IAENFSKLACNAHTICDSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLAVVHTVQHIP 238

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           + + +   Y      T  R++ L    +F C+C RC
Sbjct: 239 KGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRC 274



 Score = 38.9 bits (89), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C  ++ + KCSAC+   YCG   QK  WKLH+ +C+ L
Sbjct: 55 RCDRCFSSSNVKKCSACQVAWYCGSSCQKLEWKLHRIECEAL 96


>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 562

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/298 (24%), Positives = 118/298 (39%), Gaps = 44/298 (14%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           +++ SP  GR L   + +  G  +    P +           C  C  P +P      RC
Sbjct: 35  DLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPERPQLKRC 94

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYKYEAILPL-RCLILQRR 173
            +C     CS +C   +D   H PEC  L+       + +     EA+  L R L  ++R
Sbjct: 95  AKCHVVHYCSQKCQN-ADWKAHKPECDALQRWATAAPSPDLAVPEEAVRCLGRMLWQKQR 153

Query: 174 SPKKY--QELKDMEAHMSKRGPGTEVYEE----IDSIVKYLRSNFLEKLP-------GDV 220
           +P     +E+  M++H S   P  E +E       ++V+YL  +  EKL        GDV
Sbjct: 154 NPSSIWSREIDSMQSHRSSLRP--ESFESHAYLAHALVRYLGLDAPEKLASFGLRSAGDV 211

Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----------KHSN 269
           +D  S     +    I +    +      +   + L+ H+C PNA               
Sbjct: 212 VDLIS----RFTTNAITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQEP 267

Query: 270 MMQSKLFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTE 319
           +MQ  +   RD H    ++T Y +     + R++ L  T +F CSC  C+    DP E
Sbjct: 268 VMQ--VIALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCAAEDPDPRE 323


>gi|281204009|gb|EFA78205.1| MYND-type zinc finger-containing protein [Polysphondylium
          pallidum PN500]
          Length = 854

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 37/66 (56%), Gaps = 6/66 (9%)

Query: 1  MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
          MAS+     EEL  CA+C  T +  C+ C  V YCG +HQ +HW  HK +C  L    K 
Sbjct: 1  MASLT---TEELHHCAICSSTQVQICTGCLSVYYCGSEHQLQHWPQHKSQCSGL---NKR 54

Query: 61 SPLLGR 66
          S LLGR
Sbjct: 55 SDLLGR 60


>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
           rubripes]
          Length = 489

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 76/290 (26%), Positives = 114/290 (39%), Gaps = 44/290 (15%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+AT +   GD I  E  L      +LAE IC  C +          +C 
Sbjct: 6   IFDSPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEK----LQKCS 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
           +C F   C   C     A  H  EC  +K       AY     E I  +  L+   R  K
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKQECGAIK-------AYGKAPNENIRLVARLMW--RLDK 111

Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS---NFLEKLPGDVLDDTSAKCLHWIC 233
           +   + DM+  ++       V +  +  +K L+    NFL+  P +    T     H I 
Sbjct: 112 EGSVVSDMQL-ITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISH-IF 169

Query: 234 GVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM------------- 270
           GVI  NG  +   R  Q    GL+  +C++ HNC PN      H N              
Sbjct: 170 GVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRI 229

Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
            ++S   +     ++  Y + L  ++ RR  L    +F+C C+ C + T+
Sbjct: 230 ELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCKNGTK 279


>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
 gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
          Length = 696

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 73/336 (21%), Positives = 131/336 (38%), Gaps = 51/336 (15%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA 111
           ++ P E+      GR + A+ DL  G  + R SP     +    E  C  C K +N ++ 
Sbjct: 209 QQFPIELVWRDGFGRCVLASEDLPKGTMVLRVSPFASVLEDHKIEKNCGFCFKKINKSIR 268

Query: 112 DNARCPRC----FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC 167
            N  C  C      P CS     L   + H  EC IL    +   +     +  +  +  
Sbjct: 269 INQTCKNCKNHLLCPQCSVDEYSL---NYHKDECDILNFLKQYYPSSQTRDFRFMFRVLL 325

Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--DTS 225
            +++ +  K + +    +  ++ + P   +++    ++   R+  L+K+  + ++    S
Sbjct: 326 NVIKDKKNKSFSKENQSKQWLNHQNPF--IFDSYKYLINLSRT--LDKVQPEQMEAFKRS 381

Query: 226 AKCLHWI------------CGV-----IEVNGVDIGR------------YTQGLYSVICL 256
           A+ +  I            CGV     IE+  + +              Y  G++     
Sbjct: 382 AQSVIAIFNKLRGPKFFDECGVTIDEIIEIYSIVLSNGHEMLHPLNCHTYGLGIFPTGSY 441

Query: 257 MEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRREHLAITKYFNC 309
           + H+CLPNA   N  Q  + VFR          I T YT+       RR+HL    +F C
Sbjct: 442 LNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFC 500

Query: 310 SCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
            C++C   ++L T  S + C N+       + +P N
Sbjct: 501 QCQQCKFQSKL-TDQSCLSCKNQLSSSNTSFKIPSN 535


>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
 gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
          Length = 128

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 8/87 (9%)

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNAL 291
           V G +I    + L+ +  L+ H+C PNA H      + ++ +   + + T I+  Y   L
Sbjct: 42  VGGNEI--LVRALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLL 99

Query: 292 WGTQPRREHLAITKYFNCSCERCSDPT 318
           W T  R+  LA+TK+F C C RC DPT
Sbjct: 100 WSTLARKLFLAMTKHFICQCPRCQDPT 126


>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oryzias latipes]
          Length = 489

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 79/300 (26%), Positives = 116/300 (38%), Gaps = 56/300 (18%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+A  +   GD I  E+ L      +LAE +C  C +          RC 
Sbjct: 6   IFDSPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEK----LQRCG 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
           +C F   C   C     A  H  EC+ +K       AY     E I  L   IL R    
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKQECSAIK-------AYGKVPNENI-RLVARILWRLDKD 112

Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
               S  +   L ++E H+      T++ E+    +K    NFL+  P      T     
Sbjct: 113 GSTVSDMQLTTLDELEDHI------TDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDIS 166

Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM--------- 270
           H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN      H N          
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHS 225

Query: 271 -----MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC----SDPTELG 321
                ++S   +     ++  Y + +  ++ R+  L    +F+C+CE C     D  +LG
Sbjct: 226 QRRIELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLG 285


>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 352

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 75/348 (21%), Positives = 141/348 (40%), Gaps = 47/348 (13%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L A     PG+ +  + P    P        C  C    + NL   + C 
Sbjct: 19  VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  + + + + P    +  L+L+R+
Sbjct: 76  RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +  +F  +L    + 
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
            T +K   + C    +   ++     GLY V+ ++ H+C+PNA       +  +++   +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
            ++  +S  Y      T  R++ L    YF C+C RC   +E        +C N+ K  G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299

Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLV 382
             ++LP      +  +TC  CS   +  ++  + S    L ++V  LV
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLV 342



 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
           [Oreochromis niloticus]
          Length = 489

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 115/291 (39%), Gaps = 52/291 (17%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I  SP  GR L+AT +   GD I  E+ +      +LAE IC  C +          RC 
Sbjct: 6   IFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEK----LQRCG 61

Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
           +C F   C   C     A  H  EC+ +K       AY     E I  L   I+ R    
Sbjct: 62  QCKFAHYCDRTCQRAGWAE-HKLECSAIK-------AYGKVPNENI-RLVARIMWRLDKE 112

Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
               S  +   L ++E H++   P  ++ E     +K    NFL+  P +    T     
Sbjct: 113 GSTVSDMQLTTLDELEDHIADM-PEDDLKE-----LKVDIHNFLDYWPHNSKQHTIDDIS 166

Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSN------MMQS 273
           H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN      H N      M  S
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHS 225

Query: 274 KLF--------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
           +          +     ++  Y + L  ++ R+  L    +F+C+CE C +
Sbjct: 226 QRRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 276


>gi|353238271|emb|CCA70222.1| hypothetical protein PIIN_04161 [Piriformospora indica DSM 11827]
          Length = 311

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 29/40 (72%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           +C VC+E A  +CS CK V YCG +HQ++HW  HK +C+K
Sbjct: 121 KCVVCKEYAPKQCSNCKTVQYCGAEHQRQHWSEHKAQCRK 160


>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
 gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
          Length = 474

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 117/295 (39%), Gaps = 44/295 (14%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCH-KPLNPN--LADN 113
           E+  +   GR L+AT     G  + ++ P       +  + +C  C   P  P   + D 
Sbjct: 7   ELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGAPVEDL 66

Query: 114 ARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
            RC  C F   C+  C   +    H  ECA +K      +       +  L  R L  ++
Sbjct: 67  HRCTGCKFAQYCTKECQKKAWPE-HKQECAAIKR-----ITPGKPVDQTRLVGRILWRRK 120

Query: 173 RSPKKYQELKD----------------MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL 216
           R      E KD                +E H+SKR    E  E ID  V     +F +  
Sbjct: 121 REENLNGEKKDGKENDEKKVELVKIEELEDHLSKRN--AEEKEAIDEKV----YSFGDYF 174

Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-AKHSN 269
             D + D+  +  H +  +I+ N + +   R  Q    G+Y  I ++ H+C PN    +N
Sbjct: 175 TYDEMPDSDEEMAH-LFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNN 233

Query: 270 MMQSKLFVFR----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
             + ++   R       +   Y ++L  T+ RRE L +  YF+C C+ C+   EL
Sbjct: 234 GPRLEVRALRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEEL 288


>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
           [Brachypodium distachyon]
          Length = 482

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 88/392 (22%), Positives = 155/392 (39%), Gaps = 56/392 (14%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           I S P  GR L  T    PG+ I    P    P   L    C  C    + NL   + C 
Sbjct: 19  IASIPGKGRGLVTTRRFFPGEVILCGEPYASTPNKILVGSNCDHCFT--SSNLRKCSVC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
           R  W  C + C           E  + +L C  + A  D + + + P    +  LIL+R+
Sbjct: 76  RVAW-YCGSVC--------QKEEWKLHQLECRAISALTDDRKKMLTPTIRLMVRLILRRK 126

Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
                         Y  +  +E+H+S+      V Y ++ ++V+ +       LP   LD
Sbjct: 127 LQNEKVIPSTGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLI-------LPAIELD 179

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
             +T+     + C    +   ++     GL+  I  + H+C+PNA       +  +++  
Sbjct: 180 LKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTINHSCVPNAVLLFEGRTAYVRALQ 239

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
            +  +T +S  Y      T  R   L    +F C+C RC   +E        +C    KD
Sbjct: 240 PLSNNTEVSISYIETAATTLKRHNDLK-HYFFTCTCPRCIKGSEEDPLLEGYRC----KD 294

Query: 336 QG-DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQENPN--VK 389
           Q  D ++LP +       +TC  CS   +  +V  ++S+   L ++    V    N    
Sbjct: 295 QNCDGFLLPDS---GKKAYTCQKCSMCRDGEEVKKLSSEILLLSDKASSFVSSGNNNEAG 351

Query: 390 SLEEMLTKLE-AMFHPHHYHCYAVKHSLIQLY 420
           S+ +++ +LE  ++H         + +L++LY
Sbjct: 352 SVYKIIEQLERKLYHSFSITLLHTRETLLKLY 383



 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   QKE WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSVCQKEEWKLHQLECRAI 100


>gi|440800401|gb|ELR21440.1| hypothetical protein ACA1_183760 [Acanthamoeba castellanii str.
           Neff]
          Length = 454

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 145/381 (38%), Gaps = 86/381 (22%)

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI---L 170
           RC +C   A CS  C    +   H  EC  +K      L +     E +L LR +I   L
Sbjct: 11  RCNKCHTIAFCSPACRAEGEV-LHRVECFYVKK-----LGHLIVNLEILLLLRIVIARWL 64

Query: 171 QRRSPKK--------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS--NFLEKLPGDV 220
           +  S +K        +  +  M+AH            +ID +  Y  +  + L  LP ++
Sbjct: 65  ETLSSQKRGVEGTPVFDRMMHMQAHAES---------QIDLLQTYAPAVEDMLRVLPAEM 115

Query: 221 LDDTSAKCLHWICGVIEVNGVDIGR---------------YTQGLYSVICLMEHNCLPNA 265
               + + +  I  +  VN   +GR                  GL+    ++ H+C  NA
Sbjct: 116 --KVAKEAMLIIVCICSVNQFAVGRPPAENESGEGQSKDGEGSGLFLTSAIVSHSCRENA 173

Query: 266 KHSN-----MMQSKLFVFRDTHISTMYT-NALWGTQPRREHLAITKYFNCSCERCSDPTE 319
            H        M +   +   + I+  Y  N L  TQ R+  L  TK F C CERC D TE
Sbjct: 174 MHRTKGRRLAMLALEPIAAGSEITWSYLGNPLLPTQIRQRELYQTKVFLCQCERCLDLTE 233

Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNP--LDNDSD-WTCGSCS---ARLNARDVHLVTSQ 373
                 A K  NE    G    LP+    ++ + D W C + +   A   ARDV  + + 
Sbjct: 234 CHLRAFACKACNE----GLLVPLPLTQAGIETEEDRWRCDNAACGRAAGFARDVERLEAG 289

Query: 374 LGEQVDKLVQENPNVKSLEEMLTKLEAMF--------HPHHYHCYAVKHSLIQLYGTQPG 425
           L  + ++        + L   L +LEA+         HPHHY        L+Q+Y     
Sbjct: 290 LVRRFEEAHAALGATQDLAAALQQLEAVIADHAGPVMHPHHY-------LLLQIY----- 337

Query: 426 YAYTQLSSSLLERKISYVMSP 446
                L   LL RK+   + P
Sbjct: 338 ----LLHGELLRRKLDGRLPP 354


>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
          Length = 530

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 82/364 (22%), Positives = 148/364 (40%), Gaps = 50/364 (13%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P+ +KS    GR + +T    PGD I   +P I+ P L+    IC  C K      A+  
Sbjct: 6   PFRVKSHRK-GRGIFSTKSFAPGDVILPFTPTILIPSLSHINTICSHCFKQ-----AEVR 59

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            C RC   + C A C   +    H+ EC +L+   E        +     P+R ++    
Sbjct: 60  ACSRCHAVSYCDAACQAANWTAVHSKECKVLRKVTE------QGRPGLPTPIRAVVQALV 113

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWIC 233
            P+    ++D+E + +     +E + +++ ++    S F     G    +   K L  +C
Sbjct: 114 KPEIGNAIEDLEGNAAS-WRKSEKWADME-MMAMGASAFAGLGTG---QEDVQKALTILC 168

Query: 234 GVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHI 283
             I+ N       D+G+    L + + +  H+C+PNA         +++++  +  D  I
Sbjct: 169 K-IQTNAFHRYDTDLGQVGIFLGTKLAMANHSCIPNAMVQFIGRRAILRAEKPIKIDDEI 227

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDC 339
              YT+  +    R+  LA   +F+C C RC    +L  Y     S +  +N H    D 
Sbjct: 228 EISYTDYTFPLSNRKLALA-PYFFDCMCLRCE--KDLNVYQVCAGSPIIDMNRHSLVADV 284

Query: 340 WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLE 399
             L  +P   D +      +AR  +           E++ +L+ +  +  SLE     L+
Sbjct: 285 GKLGRHPGATDPNKV---STARRQS-----------EELSELLDKKDSAGSLETRRASLQ 330

Query: 400 AMFH 403
             +H
Sbjct: 331 KRYH 334


>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
          Length = 582

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 91/423 (21%), Positives = 153/423 (36%), Gaps = 69/423 (16%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIAR 82
           L +C  C    YC K+ Q   W +HKP+CK+L   I  +P   + L   ++    + +  
Sbjct: 62  LLRCGGCNFSMYCSKECQATAWLVHKPECKRLK-AIFETPFATQVLNPKVNEFCANCLRG 120

Query: 83  ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECA 142
            +P   G KL      C GC+                F   CS  C   +    H PEC 
Sbjct: 121 PAP---GEKLLR----CGGCN----------------FSMYCSKECQATA-WLVHKPECK 156

Query: 143 ILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK-------KYQELKDMEAHMSKRGPGT 195
            LK     L    +  + + +  R   L++   K       K+  L D +  + +     
Sbjct: 157 RLKASFPNL-PLTEVLFLSKVIDRIQFLEKNGDKLGIEAERKFSSLVDHKTDIREDEEKM 215

Query: 196 EVYEEIDSIVKYLR-SNFLEKLPGDVLD-----DTSAKCLHWICGVIEVNGVDIGRYTQG 249
             +E++ + +   R  + +EK  G+  D       ++  +H   G      +D+G     
Sbjct: 216 THFEKVFTKMGLFRKDDMIEK--GEFFDIFCKASINSHSIHTNAGNEVGMALDLG----- 268

Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR---------DTHISTM-YTNALWGTQPRRE 299
               +    H+C P    S +        R         DT  + + Y +       RR 
Sbjct: 269 ----VSKYNHSCRPTC--SMVFDGYRVCLRPLVPGVDAEDTEQAFISYIDVGRSKFVRRR 322

Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
            L    YFNC C RC DP +     +A++C N      D  IL  +  +   +  C  C 
Sbjct: 323 DLKSRWYFNCECTRCMDPED--DALTAIRCANP---SCDAPIL-TSETEEPMNIACDKCK 376

Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
             +   DV      +         + P    L ++L K E + HP + +   ++ +L  +
Sbjct: 377 TIVEENDVKAAQEYMKSLPASFDPKCP-ADILRDLLAKAEQVLHPSNVYVARLRTALFHV 435

Query: 420 YGT 422
            GT
Sbjct: 436 TGT 438


>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
          Length = 513

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 116/293 (39%), Gaps = 36/293 (12%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P E +  P   R L AT    P   I    PLI+ P L+    +C  C +P +P     +
Sbjct: 3   PIETRPHPSKHRALHATQPFRPSQTIHTFHPLILHPSLSHLASVCTHCLRPGSPRAC--S 60

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDYKYEAILPLRCLILQ 171
           RC   ++  C+  C   +    H+ EC  L   KLG  T         +   P+R L+  
Sbjct: 61  RCHAAYY--CNTSCQQAAWTAVHSKECKALQQRKLGSRT-------GADLPTPVRALLQT 111

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
               +    +  ++ H  +R   T+ + +++ +     +       G   +    K +  
Sbjct: 112 LLIKEIEDGVAALDGHTEERR-KTKSWPDLEMMALAACA-----FAGRQGESNVRKAVEL 165

Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
           +C  I+ N      VD+G+    L   + +  H+C+PNA         +++++  +    
Sbjct: 166 LCK-IQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILRAERPIQAGD 224

Query: 282 HISTMYTNALWGTQPRREHLAITKY-FNCSCERCSDPTELGTYFSAMKCLNEH 333
            I   YT+       RRE  A+ +Y F C+C RC    +L  Y     C N +
Sbjct: 225 EIEISYTDYTMPLSARRE--ALEQYSFECTCARCKG--DLNVYQVRTACPNTY 273


>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
 gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
          Length = 451

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 125/320 (39%), Gaps = 43/320 (13%)

Query: 99  CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK 158
           C+ C + L  N     +C R  W    A C  L  A+    +  +  LG           
Sbjct: 41  CMRCRRALFCN----EQCQRLGWWDHQAECGALKRAYPVVFDVEVRLLGRIAARHMQISS 96

Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK-LP 217
            +  +      L R+S +   +L D+  H+       E   +   I   L   +  K LP
Sbjct: 97  GKDAVDRENFYLNRQSER---QLGDLWHHVGSMRENVEEMRKFRRIQASLERFYKAKYLP 153

Query: 218 GDV----LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQ 272
            +     L   +    H I   + +N  +IG+   GLY  +C  +H+C PNA ++     
Sbjct: 154 NEQTLFKLHSRNYINRHAIGDQLYLN--EIGK---GLYLDLCAYDHSCRPNAIYTCEGFV 208

Query: 273 SKLFVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
           ++L    D+          T Y   L     RR  L  + YF+C CERC+DP +     +
Sbjct: 209 ARLKALDDSVNLSDRKRTHTCYITPLDSRAERRRLLKESWYFDCFCERCTDPND--HLLT 266

Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSD------WTCGSCSARLNAR-DVHLVTSQL---G 375
           A+KC N          +P++  DND D      +TC  C  R  +R DV     ++    
Sbjct: 267 ALKCNNCST------AVPIHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIE 320

Query: 376 EQVDKLVQENPNVKSLEEML 395
           +Q+ K   E  +   L+ ML
Sbjct: 321 KQLAKTGSETLSKSELDSML 340


>gi|390597639|gb|EIN07038.1| hypothetical protein PUNSTDRAFT_104486 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 440

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%)

Query: 1   MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            A +  +  +EL  C VC +  L +CS C+   YCG + QK HW+ HKP C+ +
Sbjct: 253 FADVARLNADELKGCIVCGKQTLKECSRCRSAKYCGPECQKSHWREHKPVCRSI 306


>gi|195123617|ref|XP_002006300.1| GI20970 [Drosophila mojavensis]
 gi|193911368|gb|EDW10235.1| GI20970 [Drosophila mojavensis]
          Length = 574

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 74/307 (24%), Positives = 119/307 (38%), Gaps = 52/307 (16%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E++ +P  GR++  T DL  GD +A E P        +    C  C +     L     C
Sbjct: 188 EMRETPDEGRFIVTTQDLVVGDIVATEEPFCSSLLAPMRYIRCSNCKEERYLTLIPCDNC 247

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKL-----------GCETLLAYNDYKYEAILPL 165
             C    CS  C   +++  H  EC I+ L              T L+  D  Y +I  L
Sbjct: 248 --CSVMFCSEECKKRANSTFHKYECPIIDLLHRMFNKIHCIALRTTLSALDL-YGSIEEL 304

Query: 166 RCLILQRRSPKK------YQELKDMEAHMSKRGPGT--------EVYE------------ 199
                Q ++  K      Y +L   E + +  G  T        ++++            
Sbjct: 305 MAFCEQPQNQHKSVFDLDYTQLTPQEHYRAIHGLVTNQHLRSVSDLFQRSVVCAVLKHFI 364

Query: 200 -EIDSIVKYLRSNFLEKLPGDVLD---DTSAKCLHWICGVIEVNGV-DIGRYTQGLYSVI 254
            E   + +YL          D+L     TS   +H I  V + N   D   ++ G Y+ +
Sbjct: 365 MEYTPLKEYLGGEEGRNFFTDLLFRHLQTSPSNMHGIDLVEQANETKDDQSHSSGAYAFL 424

Query: 255 CLMEHNCLPNA-KHSNMMQSKLFVFRDTHIS-TMYTN-----ALWGTQPRREHLAITKYF 307
            L+ H+C PN  +    +++ LFV R       +Y N     A++G + R+E L++   F
Sbjct: 425 SLLNHSCAPNTLRIYEGVKAYLFVLRPIKAGGVLYDNYGAHFAVFGKKQRQETLSMQYRF 484

Query: 308 NCSCERC 314
           NC CE C
Sbjct: 485 NCKCEAC 491


>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
          Length = 520

 Score = 52.4 bits (124), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 141/362 (38%), Gaps = 56/362 (15%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P+ +KS    GR + +T    PGD I   +P I+ P L+    IC  C K      A+  
Sbjct: 6   PFRVKSHHR-GRGIFSTKSFVPGDVILPFTPRILIPSLSHINNICSHCFKQ-----AEVR 59

Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            C RC   + C A C   +    H+ EC +L+   E        +     P+R ++    
Sbjct: 60  ACSRCHAVSYCDAACQAANWTAVHSKECKVLRKVTE------QGRPGLPTPIRAVVQALV 113

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEE---IDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
            P+    ++D+E      G GT   +     D  +  + ++    L G   +D      H
Sbjct: 114 KPEIGNAIEDLE------GNGTSWRKSEKWADMEMMAMGASAFAGL-GTGQEDIQTNAFH 166

Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHIST 285
                      D+G+    L   + +  H+C+PNA         +++++  +  D  I  
Sbjct: 167 RY-------DTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEI 219

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDCWI 341
            YT+  +    R+  LA   +F+C C RC    +L  Y     S +  +N H    D   
Sbjct: 220 SYTDYTFPLSKRKLALA-PYFFDCMCLRCE--KDLNVYQVCAGSPIIDMNRHSLVADVGK 276

Query: 342 LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
           L  +P   DS+        +++A   H       E + +L+++  +  SLE     L   
Sbjct: 277 LGRHPGATDSN--------KVSAARRH------SEGLSELLEDKDSAGSLETRRASLRKR 322

Query: 402 FH 403
           +H
Sbjct: 323 YH 324


>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
          Length = 526

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 62/274 (22%), Positives = 111/274 (40%), Gaps = 34/274 (12%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
           P E +  P   R L AT    P   I    PLI+ P L+    +C  C +P +P     +
Sbjct: 3   PIETRPHPSKHRALHATQPFKPSQIIHTFHPLILHPSLSQLASVCTHCLRPGSPRAC--S 60

Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDYKYEAILPLRCLILQ 171
           RC   ++  C+  C   +    H+ EC  L   KLG  T         +   P+R L+  
Sbjct: 61  RCHAAYY--CNTSCQQAAWTAVHSKECKALQQRKLGSRT-------GADLPTPVRALLQT 111

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
             + +    +  ++ H  +R   T+ + +++ +     +       G   +    + +  
Sbjct: 112 LLNKETEDGIAALDGHTEERR-KTKSWPDLEMMAMAACAFA-----GRQGESNIRRAVEL 165

Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
           +C  I+ N      VD+G+    L   + +  H+C+PNA         +++++  +    
Sbjct: 166 LCK-IQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILRAERPIHAGD 224

Query: 282 HISTMYTNALWGTQPRREHLAITKY-FNCSCERC 314
            I   YT+       RRE  A+ +Y F C+C RC
Sbjct: 225 EIEISYTDYTMPLSTRRE--ALEQYSFECTCARC 256


>gi|348689633|gb|EGZ29447.1| hypothetical protein PHYSODRAFT_473651 [Phytophthora sojae]
          Length = 500

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 73/306 (23%), Positives = 119/306 (38%), Gaps = 51/306 (16%)

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           L+CL L  R  +   E+  +     +R           S    LR    +  P    D+ 
Sbjct: 124 LKCLALYERDARSLDEVLGLTCAGQQRAM---------SAALQLRRQLPDIFPAGFSDEQ 174

Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVF 278
            A     + GV+  N  ++      GL+   C MEH+CLPN   +       M +   + 
Sbjct: 175 MAT----LIGVLNTNSHELENLGGSGLFLSACRMEHSCLPNCSFTTFDSTLWMTAIRPIA 230

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKY-FNCSCERC---SDPTELGTYFSAMKCLNEHK 334
               +S  Y N  +   P R+   +  Y F C+CE C    DPT       A++CL+ + 
Sbjct: 231 PGDALSIDYGNFFYRPTPERQESLLESYGFVCTCEACVSMPDPTR------AVRCLSGNC 284

Query: 335 DQGDCWILP---------VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
            QG     P         + P     +W C +C    +A + H  T  L  + D  + EN
Sbjct: 285 PQGVMLPCPIRTNAPSSSIGPNQLQFEWRCQTCGKVADAAE-H--TRILAAEQD--LMEN 339

Query: 386 PNVKSLEEMLTKL-EAMFHPHHYHCY-AVKHSLIQLYGTQPGYAYTQLS------SSLLE 437
              +SLEE+   +   + H  HY  + A+ +   +   T    A  Q++      + + E
Sbjct: 340 GFPESLEEVDAVVRRGVLHERHYLLFWALDNIGCEAAATTAFQAEAQMAQHRRALAQVWE 399

Query: 438 RKISYV 443
           R I+Y+
Sbjct: 400 RIIAYM 405


>gi|403332639|gb|EJY65354.1| Zinc finger MYND domain-containing protein 17 [Oxytricha
          trifallax]
          Length = 370

 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 34/52 (65%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          C +C++ A + CS CK V YC + HQ E WK+HK  CK+L  +++++  L +
Sbjct: 8  CNICKKPATNLCSKCKSVYYCNRDHQLEDWKIHKQVCKQLFEQLQNNAALSQ 59


>gi|393247959|gb|EJD55466.1| hypothetical protein AURDEDRAFT_155706 [Auricularia delicata
           TFB-10046 SS5]
          Length = 191

 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 13  MRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           M CA C      H C+ CK VAYC K HQ+ HWK+HKP C++
Sbjct: 140 MACAFCNSNEKTHLCADCKAVAYCSKAHQRAHWKVHKPTCRE 181


>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
          Length = 515

 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 6/84 (7%)

Query: 243 IGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKL----FVFRDTHISTMYTNALWGTQP 296
           I + +  LY +   + HNC PN ++  +N ++ K+    F+ +D  + T Y N L G   
Sbjct: 335 INQLSGQLYFLYSFLNHNCEPNVRYDINNKLELKVYARKFIKKDEELVTTYVNPLHGVSL 394

Query: 297 RREHLAITKYFNCSCERCSDPTEL 320
           RR  L +   F C+C+RC+   EL
Sbjct: 395 RRRELRVNWGFICNCDRCAKEIEL 418


>gi|384254219|gb|EIE27693.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
          Length = 728

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
          CAVC E A H+C  CK   YC ++HQ++HW   HK  C+KL   I + 
Sbjct: 49 CAVCSEPAPHRCGGCKASRYCSREHQQQHWAAGHKQVCRKLALGIAAG 96


>gi|66815955|ref|XP_641994.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
 gi|60470035|gb|EAL68016.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
          Length = 563

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/262 (26%), Positives = 108/262 (41%), Gaps = 37/262 (14%)

Query: 124 CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
           CS  CS   +  TH  EC+ +    E+ +         +L LR +I  +    KYQE   
Sbjct: 89  CSEICS---NDMTHQLECSFINKLIESSIKNECDVSTCLLALRIMIRNKIESDKYQET-- 143

Query: 184 MEAHMSKRGPGTEVYEEID-SIVKYLRSNF------LEKLPGD--VLDDTSAKCLHWICG 234
               + K     E + EI+ S ++    NF      L++   D  +L   +      I  
Sbjct: 144 ----VGKLSNQLEKFIEINKSFIEKYDINFQQFINQLQQENSDPSLLAIFNKDEFLQIIS 199

Query: 235 VIEVNGV-----DIGR--YTQGLYSVICLMEHNCLPNA----KHSNM-MQSKLFVFRDTH 282
            I +N       D  R   + G +    L+ H+C PN     K  N+ M++   + +D  
Sbjct: 200 SIYINSFSGLSNDFNRKPISNGYFYKPALLNHSCEPNIFFTIKDKNLEMRACKKIEKDEE 259

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           I   Y + L  T  R++ L  +K F C C RCSD TE   Y S++ C   H + G  +I 
Sbjct: 260 IVDSYVDLLLPTIERQKILYNSKNFLCKCSRCSDSTEDERYLSSIYCF--HCNDGVEFIS 317

Query: 343 PVNPLDNDS-----DWTCGSCS 359
           P    + D+     +W C + S
Sbjct: 318 PEIYFNKDTKKLQENWKCSNTS 339


>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
          Length = 540

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 109/272 (40%), Gaps = 28/272 (10%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           IK     GR L AT  +  G  +A  SPLI+ P L+    +C  C + + P     +RC 
Sbjct: 11  IKPHSTRGRALHATKTVAAGGVLAVFSPLILVPSLSHLTIVCSFCLRGVTPR--PCSRCR 68

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI----LQRR 173
             ++  C A+C   + +  HA ECA L       +     + E   P+R L+    L R 
Sbjct: 69  AAYY--CDAQCQAAAWSSGHAAECAALT----GAIPPARRRREIPAPVRALVQLLLLLRS 122

Query: 174 SPKKYQELKD-MEAHMS--KRGPGTEVYEEIDSIVKYLRSNF--LEKLPGDVLDDTSAKC 228
                ++L D +E H +  +R PG    E    ++      F  +E   G V +     C
Sbjct: 123 GGGHTRDLLDGLEGHAAQRRRAPGWADME----LMAMAGCAFAGMETSEGAVREAAEILC 178

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHI 283
                        D+G     L   + +  H+CLPNA       + +++++  +     I
Sbjct: 179 -KLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLRAEQRIQSGDEI 237

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              YT+       R+  LA   +F C C RC+
Sbjct: 238 EISYTDYTSSLSKRQAALA-PYHFECRCRRCT 268


>gi|440801743|gb|ELR22748.1| MYND finger domain containing protein, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 320

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 5/47 (10%)

Query: 11  ELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           +L +C +C     R      C ACK+VAYC ++HQK+HW+ HKP C+
Sbjct: 269 QLAKCGLCEAKEARRGDFPVCGACKQVAYCTREHQKQHWRTHKPHCR 315


>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
 gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
          Length = 555

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 120/295 (40%), Gaps = 34/295 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGP--KLALAEPICLGCHKPLNPNLADNARCPR---- 118
           GRY++AT  L PGD +  E+     P       +  C  C K         +  P+    
Sbjct: 16  GRYVEATDILKPGDSLFSENSFASIPLENFLYKKDYCFNCCKYQISTEQQTSEQPKSTPG 75

Query: 119 --------CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
                   C    CS  C+   D   H  EC+ +    E  L ++     ++L LR ++ 
Sbjct: 76  FPFQCKYGCNMWFCSNECA---DGMVHQLECSFISKFVELSLKHDCDASISLLALRVVMK 132

Query: 171 QRRSPKKYQE----LKDMEAHMSKRGPG-----TEVYEEIDSIVKYLRSNFLEKL-PGDV 220
            + + + +++    L + ++   +         +++++++  IVK   S         + 
Sbjct: 133 NKVASEVFEQTVGKLTNQKSIFIQNNKSFLDKYSKLFDQVYLIVKSENSELSSIFNKEEY 192

Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFR 279
           L+   +  ++   G+   N  +    + G +    L+ H+C PN   S    + ++ V +
Sbjct: 193 LEVVCSIFVNSFAGL--SNNFNRTPISNGYFYKAALLNHSCEPNTFFSIQNSTLEMRVVK 250

Query: 280 DTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
           D +    I   Y + L  T  R++ L  +K F C C RC+D TE   + S++ C+
Sbjct: 251 DINKGEQIFDSYVDLLLPTIERQKILLKSKNFVCDCSRCTDITENSRFVSSIHCI 305


>gi|403341486|gb|EJY70048.1| MYND finger family protein [Oxytricha trifallax]
          Length = 425

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 32/47 (68%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
          C +C++ A + CS CK V YC + HQ E WK+HK  CK+L  +++++
Sbjct: 8  CNICKKPATNLCSKCKSVYYCNRDHQLEDWKIHKQVCKQLFEQLQNN 54


>gi|389739118|gb|EIM80312.1| hypothetical protein STEHIDRAFT_125885 [Stereum hirsutum FP-91666
           SS1]
          Length = 718

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 27/44 (61%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           EL RC+ CR     LHKC  C +V YC    QK+HWK HKP CK
Sbjct: 674 ELYRCSWCRNPSAVLHKCRNCGKVRYCDSVCQKQHWKAHKPACK 717


>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
           bisporus H97]
          Length = 674

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 106/268 (39%), Gaps = 56/268 (20%)

Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSV---------------ICLMEHNCLPNA 265
           LD+ S+  +  I G++++    + R+T   ++V               I L  H+C PNA
Sbjct: 263 LDEFSSYGIDSIAGLVDL----VSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNA 318

Query: 266 ----------KHSNMMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSC 311
                       +N   + +   RD      +   Y +     + R++ L  T YF CSC
Sbjct: 319 VPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSC 378

Query: 312 ERCSDPTELGTYFSAMKCLNEHKD-QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
             C  P + G Y  + + L   K+  G CW+    P ++D    C  C   +   D  L 
Sbjct: 379 SLCEKPRD-GAYVDSRESLYCPKNCGGLCWL----PTEDDPLARCAKCGTVVKDSDAILD 433

Query: 371 TSQLG----EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGY 426
             ++G    ++V+K+   +P  K++ ++ T L  +        +           TQP  
Sbjct: 434 VIRIGFVALKKVNKIQISDPQ-KAI-QLTTNLIRILASARVCPF-----------TQPLL 480

Query: 427 AYTQLSSSLLERKISYVMSPNLKATDEP 454
           A ++L+++LL         PN+     P
Sbjct: 481 ALSRLNTTLLISNYRSPFDPNVVEISSP 508


>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
 gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
          Length = 453

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 79/333 (23%), Positives = 134/333 (40%), Gaps = 52/333 (15%)

Query: 74  LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
            +PG  +  + P +           C  C K  +P L   + C   F+  CS  C   S 
Sbjct: 1   FYPGQTVLEQDPYVAVLDGDSRGNRCDACFKQ-SPALKRCSACKFVFY--CSPTCQR-SQ 56

Query: 134 AHTHAPECAIL-KLGCE----------TLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
              H  EC +L +L  E           LL   + +   +LP+  L         Y+ ++
Sbjct: 57  WKIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATGVLPVTAL-------DNYEIVR 109

Query: 183 DMEAHMSKRGPGTEV-YEEIDSIVK-YLRSNFLEKLPGDVLDDTSAKC-LHWICGVIEVN 239
            +  H S+ G    V Y ++  ++K  L + + E +     D     C  H IC   E+ 
Sbjct: 110 ALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRISCNGHTICD-DELR 168

Query: 240 GVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTNALWG 293
            V IG     L+ V+ ++ H+C  N+      KH+ ++++   + R   ++  Y      
Sbjct: 169 PVGIG-----LFPVVSVINHSCSSNSLLLFDGKHA-VVRALGTISRGCEVTVSYIELGAS 222

Query: 294 TQPRREHLAITKYFNCSCERCSD--PTELGTY----FSAMKCLNEHKDQGDCWILPVNPL 347
           T  RRE L+   YF+C C RC+D  P   G Y      A+ CL+   +           L
Sbjct: 223 TNSRREALSDQYYFHCKCPRCTDVSPQAHGLYKDDVLEAVACLDPACES-------FMRL 275

Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
            N S W C SC +     +V+ ++++    ++K
Sbjct: 276 SNGS-WRCMSCGSSREPNEVNKLSTEAEGMIEK 307



 Score = 41.6 bits (96), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)

Query: 9  DEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          D    RC  C  +  AL +CSACK V YC    Q+  WK+H+ +CK L
Sbjct: 20 DSRGNRCDACFKQSPALKRCSACKFVFYCSPTCQRSQWKIHQEECKVL 67


>gi|170041205|ref|XP_001848363.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167864728|gb|EDS28111.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 563

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 69/317 (21%), Positives = 119/317 (37%), Gaps = 88/317 (27%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+      GRYL    DL  GD +A E P         ++P C GC   +          
Sbjct: 191 ELVQDEKFGRYLITNRDLKAGDVVALEKPF--------SKP-CKGCTIAM---------- 231

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAIL---------------KLGCETLLAYNDYKYEA 161
                  CS  C   +    H  EC IL               +L  +TL  +N +  EA
Sbjct: 232 ------FCSDECQQKAMEEYHRFECPILQDIHRLVEHVLVPALRLTIKTLTTFN-FDLEA 284

Query: 162 ILPLRCLILQ--RRSP-------------KKYQELKDMEAHMSKRGPGTEVYEEIDSIVK 206
              LR L+    + +P             + Y+ + ++E+H  KR P       I++I+ 
Sbjct: 285 ---LRLLMESDGKNTPNALDMDWTSIDPKETYEIVYNLESHRDKRSPDHLGQYAINAII- 340

Query: 207 YLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIEVNGVDIGR------------YT 247
            +    L++          + L + +   +     ++ VN  ++                
Sbjct: 341 -MSEILLQRTKIRDNCKNSNALRNITELLIMHHAMILPVNSFELSFEDYEDKACKREVLM 399

Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKL--FVFRDTHISTMYT------NALWGTQPRRE 299
           +G+Y+++ ++ H+C PN++  N     L  +V R     +  T       A+   Q RRE
Sbjct: 400 EGIYTILSMINHSCAPNSQPMNGSDDNLALYVLRPIKTGSPITIKYCVKFAITPLQERRE 459

Query: 300 HLAITKYFNCSCERCSD 316
           +L+   YF C CE C++
Sbjct: 460 YLSENYYFECQCEACAN 476


>gi|307187760|gb|EFN72732.1| Zinc finger MYND domain-containing protein 10 [Camponotus
           floridanus]
          Length = 434

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           EL RC++C+E +  +CS CKEV YC +Q Q + W+ HK  CKK
Sbjct: 391 ELKRCSLCQEVSKKRCSKCKEVWYCSRQCQIKDWEEHKKICKK 433


>gi|332018517|gb|EGI59107.1| Zinc finger MYND domain-containing protein 17 [Acromyrmex
           echinatior]
          Length = 400

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 16/150 (10%)

Query: 15  CAVCRETA----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP-----YEIKSSPLLG 65
           C VC+       L KC  C+ +AYC +QHQK+HW  HK  CK +      ++++    L 
Sbjct: 2   CHVCKRFGDGVCLKKCGGCRMIAYCNRQHQKQHWSQHKSLCKVIQDVLRDFKMEDHGTLE 61

Query: 66  RYLQATLDLHPGDRIARESPL-IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
            +    L+L        E PL I   ++      C  CH+     L D   C  C   A 
Sbjct: 62  EWDDQKLNLILLISFKLERPLEIYEKEMFQFLRECTVCHEQDGQLLED---CQSC---AS 115

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAY 154
            + C    ++  H   C  L+L   + L++
Sbjct: 116 VSFCKDHKNSIKHKDTCVSLELCLRSNLSF 145


>gi|440799553|gb|ELR20597.1| MYND finger domain containing protein, partial [Acanthamoeba
           castellanii str. Neff]
          Length = 140

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 5/47 (10%)

Query: 11  ELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           +L +C +C     R      C ACK+VAYC ++HQK+HW+ HKP C+
Sbjct: 89  QLAKCGLCEAKEARRGDFPVCGACKQVAYCTREHQKQHWRTHKPHCR 135


>gi|380025446|ref|XP_003696485.1| PREDICTED: uncharacterized protein LOC100869161, partial [Apis
           florea]
          Length = 414

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 61/148 (41%), Gaps = 22/148 (14%)

Query: 15  CAVCR----ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL-LGRYLQ 69
           C +C+    E +L +C  C  ++YC K+HQK HW  HK  C  +   +K + + +   ++
Sbjct: 5   CHICKRFGKEISLKRCGNCSMISYCSKEHQKIHWSQHKDLCNAICSVLKDNTISMTLNIK 64

Query: 70  ATLDLHPGDRIARESPLIVGPKL----------ALAEP-ICLGCHKPLNPNLADNARCPR 118
             +DL    ++     L+V  KL              P  C+ CH      L D + CP 
Sbjct: 65  QNVDLKTWVQMKMNFMLLVAIKLDRKLEYYEEQMFKFPRSCVTCHDQNIKILEDCSNCPN 124

Query: 119 CFWPACSARCSGLSDAHTHAPECAILKL 146
                  + C    D   H   C +LKL
Sbjct: 125 I------SFCIKHKDNIIHKKFCNLLKL 146


>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 579

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 70/314 (22%), Positives = 115/314 (36%), Gaps = 43/314 (13%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +++    + GR + A+ DL  G+ + RE P +   +   A   C  C + +       A 
Sbjct: 74  FKVTEDSVSGRCVIASRDLKAGELVLREPPFVKVVRRDCASRQCAYCCQQVTERGKIEAD 133

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP------LRCLI 169
            P  F   CS  C    DA   A   A+ KL    + A  D   + +         R   
Sbjct: 134 VP--FAVYCSRACQAREDALRAAEASALGKLA--GISAARDVDIDLLRMLLRLLITRAKA 189

Query: 170 LQRRSPK----KYQELKDMEAHMSKRGPGTEVYEEID-----------------SIVKYL 208
           L  R P           D E      G G  + ++ +                 S+V+  
Sbjct: 190 LGLREPSGDSDSVSRGVDEEGEDGTMGEGLFLRQQWENLYALMHHREAMAPDWISVVREA 249

Query: 209 RSNFLEKLPGDVLDDTSAKCLHWICGV-IEVNGV--DIGR---YTQGLYSVICLMEHNCL 262
             + L+ LP  V  D   + +   C V +  +G+  D G       G++ +  ++ H C 
Sbjct: 250 GEDLLQLLPEWVRFDVE-EVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACR 308

Query: 263 PNAKH----SNMMQSKLF-VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
           PN        N+    L  V     +S  Y + L  T  RR+ L  +K+F C C RC +P
Sbjct: 309 PNCTFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENP 368

Query: 318 TELGTYFSAMKCLN 331
           + +  Y   + C +
Sbjct: 369 SSMDDYLDGVCCTD 382


>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
 gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
          Length = 511

 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 89/443 (20%), Positives = 166/443 (37%), Gaps = 94/443 (21%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR L  T D HPGD I  + P +  P     +  C GC      NL+  +RC   ++  C
Sbjct: 22  GRSLFTTRDFHPGDVIISQEPYVCVP----TQKRCDGCFS--TTNLSKCSRCQVVWY--C 73

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
              C           E  + +L C+ L   +  K +++ P   L+L+    +K Q+ K +
Sbjct: 74  GTPC--------QKSEWKLHRLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKII 125

Query: 185 EAH---------------------MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGD--- 219
            +                      +S R    E     + S+V  L + +  +   D   
Sbjct: 126 PSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKE 185

Query: 220 ---VLDDTSAKCLHWICGVIEVNGVDIGRYTQ-------------------GLYSVICLM 257
              VL    A  +H I    E+N  +I  +                     GLY V+ ++
Sbjct: 186 EQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSII 245

Query: 258 EHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCE 312
            H+CLPN+          +++   + + T +   Y      T  R++ L     F C C 
Sbjct: 246 NHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYLFQCVCP 305

Query: 313 RCSDPTELGTY--------FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNA 364
            CS   ++G Y            +C NE     D ++L      +   + C  C    + 
Sbjct: 306 LCS---KVGQYEDVRENAILEGYRCKNE---TCDGFLLRTT---DGKAFQCQECGLVRDK 356

Query: 365 RDVHLVTSQLGEQVDKLVQENPN----VKSLEEMLTKLEA-MFHPHHYHCYAVKH----S 415
            ++  + +++   +++  + + N      S+ +M+ KL+  ++HP   +    +     S
Sbjct: 357 EEIKQIATEIKFLLEEASKPSSNDSHEAISIHKMIEKLQTKLYHPFSINLMQTRETILKS 416

Query: 416 LIQLYGTQPGYAYTQLSSSLLER 438
           L++L   +   AY +L+  + +R
Sbjct: 417 LMKLEYWREALAYCKLTIPIYQR 439


>gi|291223160|ref|XP_002731579.1| PREDICTED: egl nine homolog 1-like [Saccoglossus kowalevskii]
          Length = 615

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 27/39 (69%), Gaps = 1/39 (2%)

Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          RCAVC +   L +CS C  V YC ++HQK+HWK+HK  C
Sbjct: 11 RCAVCEKIEDLKRCSRCHVVHYCSREHQKQHWKVHKTSC 49


>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
           rotundata]
          Length = 674

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 80/333 (24%), Positives = 117/333 (35%), Gaps = 93/333 (27%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW--- 121
           GR+L AT ++  G  +  ESP          +  CL CH  L  N  D+ + P C++   
Sbjct: 234 GRHLVATRNIKAGYILIVESPFAFSTNNEAFDRNCLNCHVTLKLN--DSVKIP-CYFCRT 290

Query: 122 -PACSARCSGLSDAHTHAPECAILKLGCET------------LLAY-------------- 154
              CS +C   +    H  EC+I  + C              LLAY              
Sbjct: 291 VSFCSEKCRNEAWKLYHRYECSIFDIFCGNDSEQPQRQSSYLLLAYRMTIAGCLLSNTDK 350

Query: 155 -NDYKYEAILPLRCLILQRR-----------------SPKKYQELKDMEAHMSKRGPGTE 196
            N+     I  L    L+                   SP+ Y+ +  ++ H  K  P   
Sbjct: 351 VNNMDKTEIPVLNDNFLRYHGTNTNQECSDLGINEVYSPRDYRTILKLKTHCEKIEPNIN 410

Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-----------CGVIEVNGVDIG- 244
           +   I++I       FL K    VL      CL                 I  N  +I  
Sbjct: 411 LIRAIEAI-------FLTKCFTFVLSKMDVVCLKETFISLAVAMLHHLQAINCNAYEIVE 463

Query: 245 ------------RYTQG-LYSVICLMEHNCLPNA-KHSN-----MMQSKLFVFRDTHIST 285
                       RY  G +Y  + L+ H+C PN  +HS      +++S  F+ + T I  
Sbjct: 464 NLYDKKAHVWEPRYVGGAIYPSVSLVNHSCYPNVVRHSYPSGVVVVRSLRFIGKGTEIVD 523

Query: 286 MYTNALW---GTQPRREHLAITKYFNCSCERCS 315
            Y    W   G  PRRE+L     F C+CE C+
Sbjct: 524 CY-GPHWLSEGRLPRREYLWKKYRFLCACEACT 555


>gi|302692314|ref|XP_003035836.1| hypothetical protein SCHCODRAFT_84443 [Schizophyllum commune H4-8]
 gi|300109532|gb|EFJ00934.1| hypothetical protein SCHCODRAFT_84443 [Schizophyllum commune H4-8]
          Length = 283

 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 43/109 (39%), Gaps = 9/109 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
           CAVC      +C+ C  +AYCG+  QK HWK HK  CK +      +   G++ Q    +
Sbjct: 109 CAVCGNKTTSRCTGCLSIAYCGQACQKAHWKEHKGFCKTIRGGTWRTMTFGQHFQVGGQV 168

Query: 75  HPGDRIARESPLIVGPKLALAEP---------ICLGCHKPLNPNLADNA 114
                I   S     P     EP           +   +PL P+L   A
Sbjct: 169 MSAVSINHSSGKANTPINKKNEPPANVHGDKLFLVKIQRPLVPDLTQQA 217


>gi|238601033|ref|XP_002395301.1| hypothetical protein MPER_04666 [Moniliophthora perniciosa FA553]
 gi|215465804|gb|EEB96231.1| hypothetical protein MPER_04666 [Moniliophthora perniciosa FA553]
          Length = 129

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 2/54 (3%)

Query: 6  EIEDEELMRCAVCRETALH--KCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          ++ +EE+  C VC E  +H  +CS CK VAYC  +HQK  WK HK  C K  +E
Sbjct: 42 DLWEEEIKSCVVCGEKDIHLSRCSRCKIVAYCNAEHQKADWKQHKKHCIKPVWE 95


>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 643

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 58/268 (21%), Positives = 105/268 (39%), Gaps = 56/268 (20%)

Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSV---------------ICLMEHNCLPNA 265
           LD+ S+  +  I G++++    + R+T   ++V               I L  H+C PNA
Sbjct: 230 LDELSSYGIDSIAGLVDL----VSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNA 285

Query: 266 ----------KHSNMMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSC 311
                       +N   + +   RD      +   Y +     + R++ L  T YF C+C
Sbjct: 286 VPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCNC 345

Query: 312 ERCSDPTELGTYFSAMKCLNEHKD-QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
             C  P + G Y  + + L   K+  G CW+    P ++D    C  C   +   D  L 
Sbjct: 346 SLCEKPRD-GAYVDSRESLYCPKNCGGLCWL----PTEDDPLARCAKCGTVVKESDAILD 400

Query: 371 TSQLG----EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGY 426
             ++G    ++V+K+   +P  K++ ++ T L  +        +           TQP  
Sbjct: 401 VIRIGFVALKKVNKVQISDPQ-KAI-QLTTNLIRILASARVCPF-----------TQPLL 447

Query: 427 AYTQLSSSLLERKISYVMSPNLKATDEP 454
           A ++L ++LL         PN+     P
Sbjct: 448 ALSRLHTTLLISNYRSPFDPNVAEISSP 475


>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
 gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
          Length = 680

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 78/343 (22%), Positives = 131/343 (38%), Gaps = 49/343 (14%)

Query: 19  RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGD 78
           RE  L      KE   C     K+  K    +  + P E+  S   GR + A+ DL  G 
Sbjct: 157 REIQLKIIKRNKEEKDC-----KDRIKRDFDQFNQFPVELVWSDQFGRRVLASEDLPKGS 211

Query: 79  RIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS--DAHT 136
            + R SP     +       C  C+K +  +  D  +C  C       RCS      +  
Sbjct: 212 MLLRVSPFGSCLEDDKIFKNCGFCYKKIEKSKRDQCKCKICNNFLLCERCSTTDPFASEY 271

Query: 137 HAPECAILKLGCETL--LAYNDYKYEAILPLRCLILQR-RSPKK---------------- 177
           H  EC IL+   E        D+++   + L  L  ++ +S KK                
Sbjct: 272 HKEECDILQFLKEYYPSTETKDFRFMFRVVLNALKEKKFQSFKKENCSSNWKKHEHPFIF 331

Query: 178 --YQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGD--VLDDTSAKCLHWI 232
             YQ+L+++   + K  P   E ++   S +  +    + KL G+  +L   +   +  +
Sbjct: 332 DEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAV----ISKLRGEDAILKYITLNEIIEL 387

Query: 233 CGVIEVNGVDI------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD------ 280
             ++  NG ++        Y  G++     + H+C PNA   N  Q  + VFR       
Sbjct: 388 YSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQG-MMVFRSLRPLKK 446

Query: 281 -THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT 322
              + T YT+     + RR++L    +F C C +C   + L T
Sbjct: 447 GEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQCQYQSNLVT 489


>gi|195024336|ref|XP_001985854.1| GH20862 [Drosophila grimshawi]
 gi|193901854|gb|EDW00721.1| GH20862 [Drosophila grimshawi]
          Length = 574

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 75/308 (24%), Positives = 115/308 (37%), Gaps = 54/308 (17%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E++ +P  GR++    DL  GD IA E P        +    C  C +     L     C
Sbjct: 188 EMRETPDEGRFIVTNRDLVVGDVIAIEQPFCSSLLAPMRYIRCATCKEERYLTLIPCDSC 247

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKL------------------------GCETLL 152
             C    CS  C   + +  H  EC I+ L                          E L+
Sbjct: 248 --CSAMFCSEECKERATSTFHKYECPIIDLLHRMFNKIHGIALRTTLSALDLHGSIEELM 305

Query: 153 AYNDYKYEAILPLRCLILQRRSPKKY----------QELKDMEAHMSKRGPGTEVYE--- 199
           AY D        +  L   + +P+++          Q L+ + + + +R     V +   
Sbjct: 306 AYCDQPQNQHKSVFDLDYTQLTPQEHYRAIHGLVTNQHLRSV-SDLFQRSVVCAVLKHFI 364

Query: 200 -EIDSIVKYLRS----NFLEKLPGDVLDDTSAKCLHWICGVIEVNGV-DIGRYTQGLYSV 253
            E   + +YL      NF   L    L  TS   +H I  V + N   D   Y  G Y+ 
Sbjct: 365 IEYTPLKEYLGGEDGRNFFTDLLFRHLQ-TSPSNMHGIDLVEQPNETKDDQSYASGAYAF 423

Query: 254 ICLMEHNCLPNAKHSNM-MQSKLFVFRDTHIS-TMYTN-----ALWGTQPRREHLAITKY 306
           + L+ H+C PN    N  + + LFV R       +Y N     A++  + R+E L++   
Sbjct: 424 LSLINHSCAPNTMRINQGVHAYLFVLRPIKAGDVLYDNYGAHFAVFSKKQRQETLSMQYR 483

Query: 307 FNCSCERC 314
           FNC CE C
Sbjct: 484 FNCKCEAC 491


>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
 gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
          Length = 485

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 75/301 (24%), Positives = 120/301 (39%), Gaps = 54/301 (17%)

Query: 52  KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGC-HKPLNPNL 110
           K  P E+  + L GR L+ T DL  G+ +  E+        +L+  +C  C  + +NP+ 
Sbjct: 5   KTDPVEVFGAGLKGRGLRGTRDLSAGEVVFAEASFAAVVLDSLSLQVCHSCFRRQVNPH- 63

Query: 111 ADNARCPRCFWPACSARCSGLSDAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLI 169
               RC +C +     R    +    H  EC AI  +G     A N+      + L   I
Sbjct: 64  ----RCAQCKFAHYCDRTCQRAAWDEHRKECSAIRNIG----KAPNEN-----VRLVARI 110

Query: 170 LQR-------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD 222
           L R        S  +   L  +E H+S+  P  E  +E+ + VK   + + +K      D
Sbjct: 111 LWRIQKHTGLVSDSQLTTLDMLEDHLSRMTP--EDLKELKADVKTFYTYWPKKSKAVGED 168

Query: 223 DTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-----------A 265
             S      + GVI  NG  +   R  Q    GL+  +CL+ H+C PN           A
Sbjct: 169 YVSH-----LFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDCWPNCTVILNHGDQSA 223

Query: 266 KHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
             ++   S+    R          ++  Y + L  +  R+  L    YF+C CE C + T
Sbjct: 224 LDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHCVNGT 283

Query: 319 E 319
           +
Sbjct: 284 K 284


>gi|343980823|gb|AEM76787.1| hypothetical protein [Armillaria mellea]
          Length = 267

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 22/39 (56%), Positives = 27/39 (69%), Gaps = 2/39 (5%)

Query: 15  CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C  R+  L KCS CK+V+YCG + Q+ HWK HKP C
Sbjct: 226 CAACEKRDMRLQKCSRCKKVSYCGPECQRAHWKKHKPVC 264


>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
 gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
          Length = 473

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 70/279 (25%), Positives = 111/279 (39%), Gaps = 39/279 (13%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+  SP  GR L+AT +   GD +  E P       + A  IC  C +          RC
Sbjct: 5   EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFR----RQEKLQRC 60

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
            +C F   C   C   +    H  ECA +K        Y     E +  L   IL R   
Sbjct: 61  GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111

Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                S  +   L+D+E H+       ++ E+     K    NFL+  P +    T    
Sbjct: 112 QGSVVSDNQLTTLEDLEDHI------CDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
            H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN     ++  ++ +    +
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGKIELRALGKI 224

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
                ++  Y + L  +  R+  L    +F+C+C+ C++
Sbjct: 225 SAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTE 263


>gi|328865944|gb|EGG14330.1| hypothetical protein DFA_12100 [Dictyostelium fasciculatum]
          Length = 521

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 58/241 (24%), Positives = 99/241 (41%), Gaps = 38/241 (15%)

Query: 247 TQGLYSVICLMEHNCLPNAKHS---NMMQSKLF--VFRDTHISTMYTNALWGTQPRREHL 301
           T G +    L+ H+C PNA  S   + +Q ++   + +D  I   Y + L  T  R+  L
Sbjct: 212 TNGFFYQAALLNHSCQPNAFFSFNGDKLQMRVVKDMDKDDSIYDSYVDLLLPTYERQLGL 271

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILP-VNPLDNDSD------ 352
             +K F C C+RCS  T    + S++ C+ +     DC  WI P +   +N  D      
Sbjct: 272 LKSKNFFCQCKRCS--TREDCHLSSINCVGK-----DCQSWIAPLIGKKENKEDGSTSLT 324

Query: 353 --WTCGSCSARLNARDVHLVTSQLGEQVDKL----VQENPN--VKSLEEMLTKLEAMFHP 404
             W C  C+   +   V  +  QL      L     +EN +  V+     + +LE   H 
Sbjct: 325 VTWGCDDCTGTASESHVKNIQDQLVSFTKYLEKRSYRENADRFVEQASVTMAQLENKIHK 384

Query: 405 HHYHCYAVKHSLIQ--LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKP 462
           +H       H+ +Q  L+ +       Q + S+   K+   +  N+   D    + C+K 
Sbjct: 385 NH-------HARLQYHLHMSNAYIKLQQFNESIYHAKVVKRLVKNVLQCDRGELVDCYKT 437

Query: 463 L 463
           +
Sbjct: 438 I 438


>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
 gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
           ATCC 50983]
          Length = 409

 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 111/280 (39%), Gaps = 29/280 (10%)

Query: 58  IKSSPLLGRYLQATLDLH--PGDRIARESPLI--VGPKLALAEPICLGCHKPLNPNLADN 113
           I+SS + GR L A    H  PGD IA   P++  V P +A A   C  CH  LN +    
Sbjct: 11  IESSSVGGRGLFAGEGSHYIPGDSIATYRPIVYCVTPTMAKA---C--CHWCLNSDAPKY 65

Query: 114 ARCPRCFWPA-CSARCSGLS-DAHTHAPECAILK-LGCETLLAYNDYKYEAILPLRCLIL 170
            +C  C +   CS  C+  +    TH  EC ++K L  E   +     + A   L  L+ 
Sbjct: 66  YQCSGCRYAVYCSKECAKAAYKLGTHRRECQLIKKLPAEHASSAPLTTFLAAAKLHWLV- 124

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
                ++ + ++     M +    ++  E        L S +L+    D++       + 
Sbjct: 125 -----QEDEAVRHSVDEMCRHADSSDTLEADSGSTAILLSRYLDGTQADLIYSPPVPVML 179

Query: 231 WICGVIEVNGVDIGR-----YTQGLYSVICLMEHNCLPNA----KHSNMMQSKLFVFRD- 280
            +  V+  N V I          GLY+ +  M H+C PN       S +    +    D 
Sbjct: 180 DLLRVLRYNAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDG 239

Query: 281 THISTMYTNALWGTQPRR-EHLAITKYFNCSCERCSDPTE 319
             +   Y +     + +R + L     F+CSCERC+   E
Sbjct: 240 AELFISYVDVCISPKAKRCQRLRDQYKFDCSCERCTREDE 279


>gi|407405659|gb|EKF30538.1| hypothetical protein MOQ_005654 [Trypanosoma cruzi marinkellei]
          Length = 585

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 77/338 (22%), Positives = 127/338 (37%), Gaps = 63/338 (18%)

Query: 107 NPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL 165
            P  A   +C RC F   CS  C+   D      EC +     E L +      +A +P 
Sbjct: 203 EPTAAALRKCRRCQFVSYCSDTCASCHDRQHEEYECRLFFRLREMLRSMQ--SCDAAVP- 259

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE---------KL 216
                         +   M  H      G +V +E    V  L S+ +E         +L
Sbjct: 260 -------------DDFVTMATHCITTVSGVKVRKEGHEAVLRLESHEVEVSQGLTPLLRL 306

Query: 217 PGDVLDDTSAKCLHWICGVIEVN--------GVDIGRYTQGLYSVICLMEHNCLPN-AKH 267
             D+ ++     +  I GV+  N        G+ +G+    + SV     H+CLPN A  
Sbjct: 307 VQDLFNEEDPTFITRILGVVRCNALAVCDASGLPVGQALH-VASVTSYFNHSCLPNCAIE 365

Query: 268 SNMMQSKLFVFRDTHISTMYTNAL-WGTQPRREHLAITKYFNCSCERCSDPTE--LGTYF 324
           ++ + +   +     ++  Y   L W    RRE LA   +F+C C RC D       +  
Sbjct: 366 ADAIVTTRAIRPGEELTISYLPQLYWPAWLRREELAERYFFDCRCMRCGDGDRHPFESAL 425

Query: 325 SAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE 384
           SA   L   +++   +I  V  L       CG    R+ A+DV  V   +G++       
Sbjct: 426 SATLRLGGSREKEREYISSVQIL-------CG----RVRAKDVGDV--DVGDR------- 465

Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
                +L  +L +      P HY C+ ++++L  +Y  
Sbjct: 466 ----DALLHLLQECRQHLFPFHYLCHELQNTLSFVYAV 499


>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
 gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
          Length = 410

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 66/291 (22%), Positives = 110/291 (37%), Gaps = 44/291 (15%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           + S P  GR L AT    PGD I  + P    P   L    C  C    + NL   + C 
Sbjct: 19  VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
           R  W  CS+ C           E  + +L C  + A  + + + + P   L+++    +K
Sbjct: 76  RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
            Q  K M   +S   P    + E+D                  L + +     + C    
Sbjct: 127 LQNEK-MANLVSLILP----FIELD------------------LKEIAQTFSKFACNAHT 163

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALW 292
           +   ++     GLY VI ++ H+C+PNA       +  +++   + +D  +S  Y     
Sbjct: 164 ICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETAT 223

Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
            T+ R   L    +F C+C RC    +        +C N+     D ++LP
Sbjct: 224 VTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ---ACDGFLLP 270


>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
           206040]
          Length = 543

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 67/274 (24%), Positives = 112/274 (40%), Gaps = 31/274 (11%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           +I+  P  GR L AT    PG  I   +PL++ P ++    +C  C +P NP       C
Sbjct: 12  QIRPHPTKGRGLYATQSFSPGQVILPFTPLLLLPTVSYLNIVCSYCLRPGNPRA-----C 66

Query: 117 PRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
            RC   + C A C   +    H+ EC +L+ G +      D +     P R LI      
Sbjct: 67  SRCHAASYCDATCQAAAWKAVHSRECKVLRQGIKD----EDRRRRLPTPTRALIQAVLRK 122

Query: 176 KKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICG 234
           +    L+ +E H + K+    + +++I+ I+      F  K  G   +D   K    +C 
Sbjct: 123 EIGDGLEGLEGHVLEKKAAEGDEWKDIE-IMAMAACAFSGK--GTAEEDIR-KAAEMLCK 178

Query: 235 V----IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------ 284
           +     +    D+G     L   + +  H+C+PNA    + ++ L +  +   +      
Sbjct: 179 IQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLIAENPIRAGDEMEL 238

Query: 285 --TMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
             T YT+ L    P+R+       F C C RC D
Sbjct: 239 AYTFYTDPL----PKRKEALAHYKFVCQCLRCRD 268


>gi|299116928|emb|CBN75038.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 457

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 114/290 (39%), Gaps = 65/290 (22%)

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           ++C+++ R+ P     L  + A    R       E + SI + +   F    P  + +D 
Sbjct: 102 IQCMLMYRKDPLSLDPLAPLTAANLPR-----CLESVVSIRELIPGLF----PDGMTNDR 152

Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF----- 278
           +A+    I G +  N  ++ +    GL+ + C+MEH+C PN   +     KL+V      
Sbjct: 153 AAR----ILGTLNTNSHELEQVEGSGLFLMACIMEHDCTPNCSFTT-FGDKLWVTAIRQV 207

Query: 279 -RDTHISTMYTNALW-GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
                +S  Y N  +  T  R+  L     F C+C  C   T L     A +C ++    
Sbjct: 208 GEGERLSIDYGNNFYLPTAERKAELEDIYGFVCTCRAC---TVLPDRCRAFRCPSK---- 260

Query: 337 GDCWILPVNPLDNDSD------WTCGSCSARLNARDVHLV-------------TSQLG-E 376
            DC    V P  +  D      W C +C  + +  ++ ++               +LG E
Sbjct: 261 -DCRTGKVCPHGDGGDESAERSWGCLACGHQCDPEEIEVLLEAEESLSEALEMADELGVE 319

Query: 377 QVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCY-AVKHSLIQLYGTQPG 425
           Q+D LVQ+   V              HP HY  + A++ +  +L G   G
Sbjct: 320 QIDALVQDARVV--------------HPTHYVVFWALEATAKRLSGESAG 355


>gi|393243929|gb|EJD51442.1| hypothetical protein AURDEDRAFT_181799 [Auricularia delicata
          TFB-10046 SS5]
          Length = 394

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 36/65 (55%), Gaps = 3/65 (4%)

Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP-LLGRYLQAT 71
          MRCAVC E     CS+CK V YC  + QK HW  H+  C+  P +I SS  L+   L+  
Sbjct: 1  MRCAVCPEPGTKLCSSCKSVRYCSAKCQKAHWAEHRFTCR--PDKISSSDRLVHAVLKNA 58

Query: 72 LDLHP 76
          +  HP
Sbjct: 59 MPTHP 63


>gi|301105252|ref|XP_002901710.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100714|gb|EEY58766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 500

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 58/263 (22%), Positives = 97/263 (36%), Gaps = 44/263 (16%)

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
           L+CL +    P     + D+     +R         + +    LR    +  P    D  
Sbjct: 124 LKCLAMYECDPHALDGMMDLTCVGQQR---------VTNATLQLRRQLPDIFPAGFTDQQ 174

Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVF 278
            A     + GV+  N  ++      GL+   C MEHNC PN   +       M +   + 
Sbjct: 175 MAT----LIGVLNTNSHELENLGGSGLFLSACRMEHNCKPNCSFTTFDSTLWMTAIRPIA 230

Query: 279 RDTHISTMYTNALWGTQPRREHLAITKY-FNCSCERC---SDPTELGTYFSAMKCLNEHK 334
               +S  Y N  +   P R+   +  Y F C+CE C    DPT       A++CL+ + 
Sbjct: 231 PGDALSIDYGNFFYRPTPERQECLLESYGFICTCEACVSLPDPTR------AVRCLSRNC 284

Query: 335 DQGDCWILP---------VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
            QG     P         + P     +W C +C    +  +   + +      ++ + EN
Sbjct: 285 PQGVMLPCPARTNTASSSIQPKKMQFEWRCQTCGTTADTAEHSRIFA-----AEQELLEN 339

Query: 386 PNVKSLEEMLTKL-EAMFHPHHY 407
              +SLEE+   +   + H  HY
Sbjct: 340 GFPESLEEVDAVVSRGVLHESHY 362


>gi|341874291|gb|EGT30226.1| CBN-SET-30 protein [Caenorhabditis brenneri]
          Length = 555

 Score = 48.1 bits (113), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 99/231 (42%), Gaps = 38/231 (16%)

Query: 124 CSARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAIL-----PLRCLILQR 172
           CS  C   S A  H  EC  LK  C      E +L     +Y+ I       +      R
Sbjct: 62  CSKECQA-SGAFDHKYECGALK-KCSDLNTDERMLIRIIGRYKEIQDGNDKKIEGFYKNR 119

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
            S +   ++ +  A M K      V+++I D + ++   N+L       +D+ +A  LH 
Sbjct: 120 ESIRSVMQIWEHCADMKKDESAMNVFKKIYDRVKEFGDKNYL-------VDEETAFQLHS 172

Query: 232 --ICGVIEVNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HI 283
                   ++ VD  R   +GLY  +C  +H+C PNA +S N   +KL    D     ++
Sbjct: 173 RNFINRHSISNVDYLREIGKGLYLDLCRYDHSCRPNAIYSCNGTIAKLRALHDNVDLENV 232

Query: 284 STMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
            T  T+  +   P     RR  L  T YF C C+RC DPT+   + +A+ C
Sbjct: 233 ET--THYTYIELPPCKIQRRHMLKETWYFECHCDRCDDPTD--NWLTAVIC 279


>gi|392568936|gb|EIW62110.1| hypothetical protein TRAVEDRAFT_27483 [Trametes versicolor
           FP-101664 SS1]
          Length = 399

 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 7/58 (12%)

Query: 3   SIIEIEDEELMRC--AVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           S  E++   L RC  A C     R     +C  CKEV YCG+  QKE WK HKP+CK+
Sbjct: 299 SDAEVKSYVLKRCVRAGCDTREVRVAEFKRCGGCKEVVYCGQACQKEDWKAHKPRCKE 356


>gi|342179951|emb|CCC89425.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 433

 Score = 47.8 bits (112), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 60/263 (22%), Positives = 103/263 (39%), Gaps = 22/263 (8%)

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL-EKLPGDVLDDTSAKC 228
           ++ R P     L +++ H        EV + +  +V+ +      E+LP +V     A+ 
Sbjct: 110 IKSRRPGYRVVLNELQGH------AAEVSQNLSHMVRMVGELLRDEELPLEV-----ARL 158

Query: 229 LHWI-CGVIEVNGVDIGRYTQGLY--SVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHIS 284
           +  I C  IEVN        Q L+  ++     H+C PN A     + +   +     ++
Sbjct: 159 IGIIRCNTIEVNNELSLGVGQALHATTITSYFNHSCSPNCAIQGEFIVTTRVIAAGEELT 218

Query: 285 TMYTNAL-WGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
             Y   L W    RRE LA T YF+CSCERC D        + +  L    ++       
Sbjct: 219 ISYMPQLYWPVALRREELANTYYFHCSCERCRDNENDPFEAALVAVLRSGCEETGGGGGA 278

Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQL----GEQVDKLVQENPNVKSLEEMLTKLE 399
            +  +  +   CG     L A+  H+   Q        +D       +   L+ +L K  
Sbjct: 279 QDGNNAKAARVCG-LENHLEAQRKHITDVQTLCNRIRSMDPSDITTTDRNELQRLLQKCC 337

Query: 400 AMFHPHHYHCYAVKHSLIQLYGT 422
              +P HY C+ ++++L   Y  
Sbjct: 338 EELYPFHYLCHELRNALSFTYAA 360


>gi|323449643|gb|EGB05529.1| hypothetical protein AURANDRAFT_72192 [Aureococcus anophagefferens]
          Length = 1848

 Score = 47.8 bits (112), Expect = 0.014,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 9   DEELMR-CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           D+ + R CA C   A   C +CK   YC ++ QK+HW LHK  C
Sbjct: 185 DDRMRRMCAACGARAKFSCGSCKAARYCSRECQKDHWGLHKDDC 228


>gi|393213478|gb|EJC98974.1| hypothetical protein FOMMEDRAFT_31532 [Fomitiporia mediterranea
           MF3/22]
          Length = 374

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           C +C   A   C AC+   YCG  HQ+  WK HK  C++ PYE
Sbjct: 326 CELCPLEATKTCGACRGAWYCGDAHQRRDWKFHKSWCRQHPYE 368


>gi|290990111|ref|XP_002677680.1| predicted protein [Naegleria gruberi]
 gi|284091289|gb|EFC44936.1| predicted protein [Naegleria gruberi]
          Length = 1179

 Score = 47.8 bits (112), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 5/57 (8%)

Query: 10   EELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
            +E+ +C  C      E++L +CSAC  V YC  + QK+ WK HK KCKK+  E K +
Sbjct: 997  KEIGKCNTCGKGSSSESSLKRCSACNSVRYCSVECQKKDWKNHKEKCKKIQEESKQT 1053


>gi|393243935|gb|EJD51448.1| hypothetical protein AURDEDRAFT_159180 [Auricularia delicata
          TFB-10046 SS5]
          Length = 395

 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK-SSPLLGRYLQAT 71
          MRC VC E  + +C++C+ V YC ++ Q+ HW  HK  C+  P +I  +  L+   L+ +
Sbjct: 1  MRCEVCSEPGVKRCASCQSVRYCSQKCQQAHWAEHKFTCR--PDKITDADQLVHAVLKGS 58

Query: 72 LDLHPGDRIA 81
          +  HP  ++A
Sbjct: 59 MPTHPDVQLA 68


>gi|168063352|ref|XP_001783636.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664826|gb|EDQ51531.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 582

 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 7/78 (8%)

Query: 2   ASIIEIEDEELMR---CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           A  + I   ELM    CA C++ A+ +CS CK   YCG++ Q   WK+HK  C  L  ++
Sbjct: 508 AQTVPIWGIELMEQPTCAACKKAAVQRCSRCKSEWYCGRECQVSVWKMHKGACDILAGKV 567

Query: 59  KSSPLLGRYLQATLDLHP 76
           + S +     QA + L P
Sbjct: 568 EKSGV----AQADISLDP 581


>gi|118387165|ref|XP_001026696.1| MYND finger family protein [Tetrahymena thermophila]
 gi|89308463|gb|EAS06451.1| MYND finger family protein [Tetrahymena thermophila SB210]
          Length = 1283

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 30/49 (61%)

Query: 11   ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
            E  +C +C++ A  +CS CK V YC ++ Q  HWK HK  CKK+  E K
Sbjct: 1044 EGFKCELCKKEATKRCSQCKTVWYCTRECQVAHWKDHKIACKKIVEENK 1092


>gi|66815713|ref|XP_641873.1| hypothetical protein DDB_G0279115 [Dictyostelium discoideum AX4]
 gi|60469913|gb|EAL67896.1| hypothetical protein DDB_G0279115 [Dictyostelium discoideum AX4]
          Length = 1045

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 12  LMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKC 51
           L RC  C++T +    C  CK VAYC K  QK+HW +HKP C
Sbjct: 808 LYRCWSCKQTLMKPLICGYCKTVAYCSKDCQKDHWPIHKPSC 849


>gi|406867546|gb|EKD20584.1| hypothetical protein MBM_01266 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 205

 Score = 47.4 bits (111), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 2   ASIIEIEDEELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           A  IEI    L  CAVC ++  +  CS C++ AYC K+ QK HW  HK  C+ +   +K+
Sbjct: 132 APGIEILKPGLKSCAVCGQSDTVKPCSGCRDFAYCSKECQKTHWPFHKKHCRHV-QAVKA 190

Query: 61  SPLLGRYLQATL 72
           +   G    A L
Sbjct: 191 ATEAGSMTAAAL 202


>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
           mellifera]
          Length = 440

 Score = 47.4 bits (111), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 33/219 (15%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAILPLRCLILQ 171
           N  C +  W   S  C+ L    +   P+ A  +L    ++  N    E I         
Sbjct: 55  NQSCQQMSWTIHSKECASLKRFSSKVIPDVA--RLMARIIIKLNQGGGEEI--------G 104

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
             S  KY++ KD+ +H S      +  E    +        L +  GD+    SA+ L  
Sbjct: 105 YYSKTKYRKFKDLMSHYSDIKKDEKKMEHFVCVC-----GVLYEFLGDMSIPNSAE-LMG 158

Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT----- 281
           I G I +N      +D+     G+Y    +++H+C PNA  +   +    + R T     
Sbjct: 159 IYGRIYINSFNISDLDMNNIGAGIYLGPSILDHSCKPNAVAT--FEGTTIIIRTTEDLPC 216

Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
                I   Y + +  T+ RRE L  + YF C+C++C +
Sbjct: 217 LDLSQIRISYIDVIKTTKDRREELQSSYYFWCNCKKCEE 255


>gi|395325441|gb|EJF57863.1| hypothetical protein DICSQDRAFT_149470 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 424

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 21/104 (20%)

Query: 14  RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQAT 71
           RCA C  +A  L  C+AC  + YC KQ Q+ HW LHK +C  +P+   S+  L  + +  
Sbjct: 238 RCAACGSSAGTLKFCNACHAIRYCSKQCQRSHWSLHKTRC--IPFSADSTVTLTPFYED- 294

Query: 72  LDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
                           +GP ++LA+        P+    A NAR
Sbjct: 295 ----------------IGPVMSLADVARSAFGFPVEKQPARNAR 322


>gi|366998786|ref|XP_003684129.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
 gi|357522425|emb|CCE61695.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
          Length = 523

 Score = 47.4 bits (111), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 12/89 (13%)

Query: 243 IGRY-----TQGLYSVICLMEHNCLPNAK-----HSNMMQ--SKLFVFRDTHISTMYTNA 290
           IGRY     +  LYS+  L+ HNC PN +     H+  ++  ++  + +++ + T Y N 
Sbjct: 347 IGRYNLNHKSAQLYSIYSLVNHNCEPNVRVEVDYHTKELKLYARKSISKNSELLTTYINP 406

Query: 291 LWGTQPRREHLAITKYFNCSCERCSDPTE 319
           L   + RR+ L I   F C+CERC +  +
Sbjct: 407 LHDVELRRKILLINYGFACNCERCENEID 435


>gi|261329994|emb|CBH12977.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 443

 Score = 47.4 bits (111), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 53/116 (45%), Gaps = 12/116 (10%)

Query: 18  CRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHP 76
           C    L +CS+CK V YC ++ Q+EHW   H+  CK   Y+ ++  +L  Y+ A  ++  
Sbjct: 326 CGGEGLLRCSSCKAVYYCSEECQREHWSAAHRVPCKN--YKQRAETILAEYIAANKNVVG 383

Query: 77  GDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
           G R  R S       L   E +      PL P+L    R  R  +    A  SG+S
Sbjct: 384 GKRGKRMS-------LQQQEELVAVLEVPLEPSLFYETR--RYLYDHRDASFSGVS 430


>gi|195029123|ref|XP_001987424.1| GH19974 [Drosophila grimshawi]
 gi|193903424|gb|EDW02291.1| GH19974 [Drosophila grimshawi]
          Length = 346

 Score = 47.4 bits (111), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 21/37 (56%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C     H CS CK + YC   HQ+ HW LHKP C
Sbjct: 128 CAACGCLGPHTCSRCKSINYCSSTHQRAHWPLHKPNC 164


>gi|242216123|ref|XP_002473871.1| predicted protein [Postia placenta Mad-698-R]
 gi|220726971|gb|EED80904.1| predicted protein [Postia placenta Mad-698-R]
          Length = 1108

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D +  RC VCR      +  C  CK+V YC    QK+HWK+HK +CK
Sbjct: 1060 RDPDAHRCFVCRGKGKPRIRACGGCKKVRYCSAACQKKHWKIHKSQCK 1107


>gi|307171609|gb|EFN63394.1| Egl nine-like protein 1 [Camponotus floridanus]
          Length = 491

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)

Query: 3   SIIEIEDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           +I+      +  CAVC R   L +CS CK V YC K+HQ+  WK H+  C   P   +  
Sbjct: 26  AIVGDASSRVTVCAVCDRTDKLLRCSRCKAVFYCTKEHQRRDWKRHREFCATHP--ARED 83

Query: 62  PLLGRYLQATLDLHPGDRIARESPL 86
           P    +  +    +P D  +R+ P+
Sbjct: 84  PTESAHPSSPGSRNPRDTPSRQRPV 108


>gi|328872812|gb|EGG21179.1| MYND-type zinc finger-containing protein [Dictyostelium
          fasciculatum]
          Length = 961

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 3/52 (5%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          C VC  T +  C+ C  V YCG+ HQ  HW +HK KC  L    K + L+GR
Sbjct: 28 CEVCGSTNVQICTGCLLVYYCGENHQLLHWPVHKNKCSALD---KRTDLIGR 76


>gi|242215015|ref|XP_002473326.1| predicted protein [Postia placenta Mad-698-R]
 gi|220727553|gb|EED81468.1| predicted protein [Postia placenta Mad-698-R]
          Length = 1161

 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D +  RC VCR      +  C  CK+V YC    QK+HWK+HK +CK
Sbjct: 1113 RDPDAHRCFVCRGKGKPRIRACGGCKKVRYCSAACQKKHWKIHKSQCK 1160


>gi|72392066|ref|XP_846327.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175451|gb|AAX69592.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70802863|gb|AAZ12768.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 443

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 36/116 (31%), Positives = 53/116 (45%), Gaps = 12/116 (10%)

Query: 18  CRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHP 76
           C    L +CS+CK V YC ++ Q+EHW   H+  CK   Y+ ++  +L  Y+ A  ++  
Sbjct: 326 CGGEGLLRCSSCKAVYYCSEECQREHWSAAHRVPCKN--YKQRAETILAEYIAANKNVVG 383

Query: 77  GDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
           G R  R S       L   E +      PL P+L    R  R  +    A  SG+S
Sbjct: 384 GKRGKRMS-------LQQQEELVAVLEVPLEPSLFYETR--RYLYDHRDASFSGVS 430


>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFV--FRDTH----ISTMYTNALWGTQPRREHLA 302
           GL+S+  L+ H+C PN   SN   S + V   RD      I+  Y +       R++ L 
Sbjct: 164 GLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKELR 223

Query: 303 ITKYFNCSCERCSDP 317
            TK+F+C CERCS P
Sbjct: 224 DTKHFDCQCERCSTP 238


>gi|194771646|ref|XP_001967706.1| GF20518 [Drosophila ananassae]
 gi|190617390|gb|EDV32914.1| GF20518 [Drosophila ananassae]
          Length = 288

 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 69  CAACGCLAPHTCSKCKSIHYCSSEHQRAHWPQHKPNC 105


>gi|344301790|gb|EGW32095.1| hypothetical protein SPAPADRAFT_61174 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 415

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 68/300 (22%), Positives = 111/300 (37%), Gaps = 56/300 (18%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNA----R 115
           G+ L A  ++  G+ I  E PL   P LA    I     C  C + L  +  +       
Sbjct: 122 GKGLYAKHNIAKGELIWEEEPLFFIPPLANINLIKSGTACSHCGRLLQQSEGNTVLRGLD 181

Query: 116 CPRCFWPACSARCS-------GLSDAHTHAPECAILKL-GCETLLAYNDYKYE------- 160
           C  C    C  +C        GL   + + P     KL   E  L   DY  E       
Sbjct: 182 CQVCAEVWCGTKCKKLDGVLHGLLKHNVYNPNAKRSKLFDPEAFLHMQDYCLEDQWNALY 241

Query: 161 AILPLRCLILQRRSPKKYQELKDME----------------AHMSKRGPGTEVYEEIDSI 204
           AI  +   IL  +S  K ++ K M                 A  +  G    V E+ +++
Sbjct: 242 AITLIYADILNDKSGTKAKQFKAMARVSQDIRYKALNSSAGAFDTLSGGALFVQEQQENL 301

Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN 264
            K     FL   P + +D    +   ++ G   +N +D       ++ +   + HNC PN
Sbjct: 302 WKLGYEKFLRIFPSNPID---YQEFMYMMGTYNINNLD-----SNVFLIQSHLNHNCNPN 353

Query: 265 AK-HSNMMQS---KLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
               + ++++   K++  RD      ++T Y N     Q R+  L +   F C CE+C +
Sbjct: 354 TSVETELLRTDGLKVYAARDIRAGEELTTTYVNPKNTVQQRQRELRVNWGFLCGCEKCKE 413


>gi|169865882|ref|XP_001839538.1| hypothetical protein CC1G_08917 [Coprinopsis cinerea okayama7#130]
 gi|116499410|gb|EAU82305.1| hypothetical protein CC1G_08917 [Coprinopsis cinerea okayama7#130]
          Length = 1180

 Score = 47.0 bits (110), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 3/42 (7%)

Query: 14   RCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            +C+VCR   +  L  CSAC+ V YC  + QK+ WK HKPKC+
Sbjct: 1136 KCSVCRGQGKPKLKACSACQRVRYCSTECQKKDWKAHKPKCQ 1177


>gi|290988356|ref|XP_002676887.1| hypothetical protein NAEGRDRAFT_58086 [Naegleria gruberi]
 gi|284090492|gb|EFC44143.1| hypothetical protein NAEGRDRAFT_58086 [Naegleria gruberi]
          Length = 499

 Score = 47.0 bits (110), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 25/38 (65%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           CAVC++ A  KC+ CK V YC    QK+HW +HK  CK
Sbjct: 459 CAVCKKAASKKCANCKAVYYCSADCQKKHWSVHKKLCK 496


>gi|194756844|ref|XP_001960680.1| GF11375 [Drosophila ananassae]
 gi|190621978|gb|EDV37502.1| GF11375 [Drosophila ananassae]
          Length = 347

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 128 CAACGCLAPHTCSKCKSIHYCSSEHQRAHWPQHKPNC 164


>gi|336364484|gb|EGN92841.1| hypothetical protein SERLA73DRAFT_116652 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336388527|gb|EGO29671.1| hypothetical protein SERLADRAFT_445456 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1163

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D E  RC VCR   +  L  C+ CK+V YC  + QK+ WK+HK +CK
Sbjct: 1115 RDPEAYRCFVCRGKGKPKLKMCTVCKKVRYCSSECQKKDWKVHKLRCK 1162


>gi|299741263|ref|XP_001834348.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
 gi|298404633|gb|EAU87325.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
          Length = 434

 Score = 47.0 bits (110), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 43/80 (53%), Gaps = 19/80 (23%)

Query: 18  CRET---ALHKCSACKEVAYCGKQHQKEHWKLHKPKCK---KLPYEIKSSPLLGRYLQAT 71
            RET      +CSAC+ V YCG++ QKE WK HKP+CK   KL   IK            
Sbjct: 352 ARETEVAQFKRCSACRLVVYCGQECQKEDWKRHKPECKQHLKLKSTIK------------ 399

Query: 72  LDLHPGDRIARESPLIVGPK 91
            + + G  +A ++P+++ P+
Sbjct: 400 -NWNNGRPVAADAPMLLSPE 418


>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 3-like [Apis florea]
          Length = 414

 Score = 47.0 bits (110), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 33/219 (15%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAILPLRCLILQ 171
           N  C +  W   S  C+ L    +   P+ A  +L    ++  N    E I         
Sbjct: 55  NQSCQQMSWTIHSKECASLKRFSSKVIPDVA--RLMARIIIKLNQGGGEEI--------G 104

Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
             S  KY++ KD+ +H S      +  E    +        L +  GD+    SA+ L  
Sbjct: 105 YYSKTKYRKFKDLMSHYSDIKKDEKKMEHFVCVC-----GVLYEFLGDMSIPNSAE-LMG 158

Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT----- 281
           I G I +N      +D+     G+Y    +++H+C PNA  +   +    + R T     
Sbjct: 159 IYGRIYINSFNISDLDMNNIGAGIYLGPSILDHSCKPNAVAT--FEGTTIIIRTTEDLPC 216

Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
                I   Y + +  T+ RRE L  + YF C+C++C +
Sbjct: 217 LDLSQIRISYIDVIKTTKDRREELQNSYYFWCNCKKCEE 255


>gi|145523317|ref|XP_001447497.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415008|emb|CAK80100.1| unnamed protein product [Paramecium tetraurelia]
          Length = 143

 Score = 47.0 bits (110), Expect = 0.028,   Method: Composition-based stats.
 Identities = 21/49 (42%), Positives = 26/49 (53%)

Query: 4   IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           II  E  +   C +C++    +C  CK V YCG   QK HWK HK  CK
Sbjct: 95  IILYEPAKQTICGICKKNGELRCGQCKLVYYCGVDCQKTHWKQHKGFCK 143


>gi|401417428|ref|XP_003873207.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489435|emb|CBZ24694.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 622

 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 83/202 (41%), Gaps = 40/202 (19%)

Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
           C  +EV   +G+ + +      S+     H+C PN    + ++  +   R  H+    S 
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCAPNCA-IDTVRHAIVTTRTIHVGEELSI 436

Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
            Y   L W T+ RRE L+   YF C C+RC     DP E      +M+     KD    +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSGTKDPFERAL---SMELPTARKDATKHF 493

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
               +P+   +   C +  +R+               VD + Q++ +   L ++L+ L  
Sbjct: 494 ----HPIVQAA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529

Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
              P HY C+ V++ L  +Y  
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551


>gi|169846118|ref|XP_001829775.1| hypothetical protein CC1G_05984 [Coprinopsis cinerea okayama7#130]
 gi|116509102|gb|EAU91997.1| hypothetical protein CC1G_05984 [Coprinopsis cinerea okayama7#130]
          Length = 254

 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 2/43 (4%)

Query: 14  RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           RC  C  R+  L KCSACK V YC    QK +W+ HK +CK+L
Sbjct: 207 RCKACGKRDAKLMKCSACKSVTYCSADCQKANWRDHKARCKEL 249


>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
 gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
          Length = 486

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 112/292 (38%), Gaps = 52/292 (17%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+  SP  GR L+AT +   GD +  E P       + A  IC  C +          RC
Sbjct: 5   EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKL----QRC 60

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
            +C F   C   C   +    H  ECA +K        Y     E +  L   IL R   
Sbjct: 61  GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111

Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                S  +   L+D+E H+       ++ E+     K    NFL+  P +    T    
Sbjct: 112 QGSVVSDNQLTTLEDLEDHIC------DISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN--AKHSNMMQSKL-FVFR 279
            H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN     +N  QS +  VF 
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFH 224

Query: 280 DT---------------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
                             ++  Y + L  +  R+  L    +F+C+C+ C++
Sbjct: 225 SQKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTE 276


>gi|397640191|gb|EJK73983.1| hypothetical protein THAOC_04367, partial [Thalassiosira oceanica]
          Length = 922

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 5/71 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+ +  CA C   A     L KC+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 84  SCVPVADDGIETCANCGTIASDTVKLKKCTACQLVKYCGVDCQRAHRKQHKKACKQRAAE 143

Query: 58  IKSSPLLGRYL 68
           +K   L G+ L
Sbjct: 144 LKDEQLYGQGL 154



 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 7/89 (7%)

Query: 5   IEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
           ++ + E    CA C E       L  C+AC+ V YCG   Q+ H K HK  CK+   E+K
Sbjct: 675 VDRKSETGEACAKCGEHGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELK 734

Query: 60  SSPLLGRYLQATLDLHPGDRIARESPLIV 88
              L      A +      R+A++ P+ +
Sbjct: 735 DEQLYAN--DAGILAMVRARVAKKDPVAI 761


>gi|170118408|ref|XP_001890383.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164634653|gb|EDQ98981.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1168

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D +  +C VCR      + +CS C++V YC  + QK+ WK HKPKCK
Sbjct: 1120 RDPDARKCRVCRGKGKPKIKECSVCQKVRYCSPECQKKDWKAHKPKCK 1167


>gi|397602865|gb|EJK58288.1| hypothetical protein THAOC_21605, partial [Thalassiosira oceanica]
          Length = 245

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 5/72 (6%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+ +  CA C   A     L KC+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 93  SCVPVADDGIETCANCGTIASDTVKLKKCTACQLVKYCGVDCQRAHRKQHKKACKQRAAE 152

Query: 58  IKSSPLLGRYLQ 69
           +K   L G+ L+
Sbjct: 153 LKDEQLYGQGLE 164


>gi|339897286|ref|XP_003392344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|321399152|emb|CBZ08492.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 622

 Score = 46.6 bits (109), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 82/202 (40%), Gaps = 40/202 (19%)

Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
           C  +EV   +G+ + +      S+     H+C PN    + ++  +   R  H+    S 
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIITTRTIHVGEELSI 436

Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
            Y   L W T+ RRE L+   YF C C+RC     DP E       M+     KD    +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFERAL---VMELPTARKDATKHF 493

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
               +P+   +   C +  +R+               VD + Q++ +   L ++L+ L  
Sbjct: 494 ----HPIVQTA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529

Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
              P HY C+ V++ L  +Y  
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551


>gi|428177962|gb|EKX46839.1| hypothetical protein GUITHDRAFT_107195 [Guillardia theta CCMP2712]
          Length = 504

 Score = 46.6 bits (109), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 52/124 (41%), Gaps = 21/124 (16%)

Query: 15  CAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKC--------KKLPYEIKSS 61
           CA C++T      L KC  C++  YCG + QKEHW  H+  C        K    E+K+ 
Sbjct: 14  CANCKKTRDMVETLLKCPWCRKFFYCGSECQKEHWPEHRNFCPPRSSTSSKGEKLELKAK 73

Query: 62  PLLGRYLQATLDLHPGDRIARESPL---IVGPKLALAEPICLGCHKPLNPNLADNARCPR 118
           PL  + L+   +     RI R   L   +    L     +C  C    NP L     CP 
Sbjct: 74  PLEQKILETETEEQR--RIRRRKELAENVDDDDLPEGWTMCSSCQ---NPVLIAWLECPS 128

Query: 119 CFWP 122
           CF P
Sbjct: 129 CFRP 132


>gi|307108397|gb|EFN56637.1| hypothetical protein CHLNCDRAFT_144433 [Chlorella variabilis]
          Length = 142

 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 4/50 (8%)

Query: 15  CAVCRETALH--KCSACKEVAYCGKQHQKEHWKL--HKPKCKKLPYEIKS 60
           CA C +TALH  KCS C+ VAYC +  Q +HWK   HK +CK+L  E  S
Sbjct: 92  CAQCDQTALHLSKCSGCRAVAYCSRACQVKHWKEGGHKKECKRLAAEQSS 141


>gi|440800582|gb|ELR21618.1| MYND finger domain containing protein [Acanthamoeba castellanii
          str. Neff]
          Length = 254

 Score = 46.2 bits (108), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 3/47 (6%)

Query: 11 ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
          E+ +CAVC+  E  L +CS CKEV YC ++HQ E WK  H   CK L
Sbjct: 2  EVKKCAVCQKSEGELQRCSRCKEVHYCSREHQVEDWKKGHNKACKAL 48


>gi|397567676|gb|EJK45717.1| hypothetical protein THAOC_35652 [Thalassiosira oceanica]
          Length = 892

 Score = 46.2 bits (108), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 7/61 (11%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPL------LGR 66
          +CA C E A   C  CK V YCG+  Q++HW+  HK  C K+P  ++ + +      LG+
Sbjct: 6  KCAACGEEASKICQGCKLVGYCGRDCQQQHWREGHKVICSKVPKAVQKAEVKIDNFSLGQ 65

Query: 67 Y 67
          Y
Sbjct: 66 Y 66


>gi|397575310|gb|EJK49633.1| hypothetical protein THAOC_31473 [Thalassiosira oceanica]
          Length = 1054

 Score = 46.2 bits (108), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/47 (51%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 15   CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
            CAVC  T +L  CS C  VAYC + HQ +HW  HK  CK+L  E K+
Sbjct: 1006 CAVCGITVSLKLCSLCSSVAYCCRDHQVKHWPNHKADCKRLKKESKT 1052


>gi|195586319|ref|XP_002082925.1| GD11837 [Drosophila simulans]
 gi|194194934|gb|EDX08510.1| GD11837 [Drosophila simulans]
          Length = 347

 Score = 46.2 bits (108), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164


>gi|21064263|gb|AAM29361.1| GM13546p [Drosophila melanogaster]
          Length = 347

 Score = 46.2 bits (108), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164


>gi|194886157|ref|XP_001976561.1| GG22945 [Drosophila erecta]
 gi|190659748|gb|EDV56961.1| GG22945 [Drosophila erecta]
          Length = 345

 Score = 46.2 bits (108), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 127 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 163


>gi|290985106|ref|XP_002675267.1| predicted protein [Naegleria gruberi]
 gi|284088862|gb|EFC42523.1| predicted protein [Naegleria gruberi]
          Length = 1914

 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 15   CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            CA C +  +  KC AC+ V YC K+ Q  HWK+HK +CKKL
Sbjct: 1874 CACCGKPNSKKKCGACQAVVYCSKECQASHWKVHKTQCKKL 1914


>gi|255548628|ref|XP_002515370.1| cysteine-type endopeptidase, putative [Ricinus communis]
 gi|223545314|gb|EEF46819.1| cysteine-type endopeptidase, putative [Ricinus communis]
          Length = 632

 Score = 46.2 bits (108), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 13/80 (16%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL-----------PYEIKSSP 62
           CA C      KCS CK V YC  + Q+ HWK  HK KCK L            + +K+S 
Sbjct: 68  CANCGNLGTKKCSRCKSVRYCSTECQEVHWKSGHKSKCKNLVRVNSVENAKTNFGVKASG 127

Query: 63  LLGRYLQATLDLHPGDRIAR 82
           L GR + + + L P + I +
Sbjct: 128 LGGR-ISSGIALVPSNGITK 146


>gi|398012025|ref|XP_003859207.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322497420|emb|CBZ32495.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 622

 Score = 46.2 bits (108), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 82/202 (40%), Gaps = 40/202 (19%)

Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
           C  +EV   +G+ + +      S+     H+C PN    + ++  +   R  H+    S 
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIITTRTIHVGEELSI 436

Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
            Y   L W T+ RRE L+   YF C C+RC     DP E       M+     +D    +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFERAL---GMELPTARRDATKHF 493

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
               +P+   +   C +  +R+               VD + Q++ +   L ++L+ L  
Sbjct: 494 ----HPIVQTA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529

Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
              P HY C+ V++ L  +Y  
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551


>gi|195341850|ref|XP_002037518.1| GM18310 [Drosophila sechellia]
 gi|194132368|gb|EDW53936.1| GM18310 [Drosophila sechellia]
          Length = 347

 Score = 45.8 bits (107), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164


>gi|343475522|emb|CCD13111.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 433

 Score = 45.8 bits (107), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 19/154 (12%)

Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL-EKLPGDVLDDTSAKC 228
           ++ R P     L +++ H        EV + +  +V+ +      E+LP +V     A+ 
Sbjct: 110 IKSRRPGYRVVLNELQGH------AAEVSQNLSHMVRMVGELLRDEELPLEV-----ARL 158

Query: 229 LHWI-CGVIEV-NGVDIGRYTQGLY--SVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
           +  I C  IEV N + +G   Q L+  ++     H+C PN A     + +   +     +
Sbjct: 159 IGIIRCNTIEVSNELSLG-VGQALHATTITSYFNHSCSPNCAIQGEFIVTTRVIAAGEEL 217

Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERCSD 316
           +  Y   L W    RRE LA T YF+CSCERC D
Sbjct: 218 TISYMPQLYWPVALRREELANTYYFHCSCERCRD 251


>gi|145499622|ref|XP_001435796.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402931|emb|CAK68399.1| unnamed protein product [Paramecium tetraurelia]
          Length = 539

 Score = 45.8 bits (107), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 24/41 (58%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          CAVC       CS CK V YC  +HQ++HW +H+  C+ + 
Sbjct: 8  CAVCSLKTTFGCSQCKSVFYCSVEHQRQHWSVHQQSCQSMK 48


>gi|195387832|ref|XP_002052596.1| GJ20766 [Drosophila virilis]
 gi|194149053|gb|EDW64751.1| GJ20766 [Drosophila virilis]
          Length = 126

 Score = 45.8 bits (107), Expect = 0.052,   Method: Composition-based stats.
 Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 6/93 (6%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++  P+ GR L  +  +  GD +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLEHDPVYGRCLATSEAVESGDVVVEELPFAYGPKRE-SGIVCLGCYRFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C    S + DA  H  EC I 
Sbjct: 61  SLDRCELCDWPICG---SCMDDADDHRGECEIF 90


>gi|452978501|gb|EME78264.1| hypothetical protein MYCFIDRAFT_80693 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 466

 Score = 45.8 bits (107), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 37/161 (22%), Positives = 61/161 (37%), Gaps = 25/161 (15%)

Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICL 256
           VY  +     Y+ +  LE L G+V+            G+   N   +    QG++++   
Sbjct: 188 VYVAMSPRYHYVSNEALETLKGEVVK---------AMGIFSANNFILAGGNQGVFALASR 238

Query: 257 MEHNCLPNAKHS---NMMQSKLFVFRDTHIS----TMYTNALWGTQP---RREHLAITKY 306
           + H+C+PN  H+   N+ +  +   RD          Y  A     P   R E L     
Sbjct: 239 INHSCVPNVHHTNNPNIRRETVHAMRDIEAGEELLANYLGAGATYDPRLTRMEALRNNHG 298

Query: 307 FNCSCERCSDPT------ELGTYFSAMKCLNEHKDQGDCWI 341
           F C C+ C DP        + + F  +    EH  +G  +I
Sbjct: 299 FICQCQACMDPNSDERRHSISSIFWGLNAYMEHSTEGHPYI 339


>gi|444320821|ref|XP_004181067.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
 gi|387514110|emb|CCH61548.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
          Length = 644

 Score = 45.8 bits (107), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 8/85 (9%)

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPN----AKHSNMMQSKLFVFRDTHIS----TMYTN 289
           +   +I +    LY +  L+ HNC PN       SN +   L   R+ +I     T Y N
Sbjct: 415 IGRFNINQINGQLYPIASLINHNCQPNIHIEVSDSNSLSLTLITRRNINIDEELLTTYIN 474

Query: 290 ALWGTQPRREHLAITKYFNCSCERC 314
            L G + RR  L +   F C CERC
Sbjct: 475 PLHGVKLRRRELLVNWGFLCHCERC 499


>gi|24762574|ref|NP_611890.1| zinc finger protein RP-8 [Drosophila melanogaster]
 gi|7291749|gb|AAF47170.1| zinc finger protein RP-8 [Drosophila melanogaster]
          Length = 347

 Score = 45.8 bits (107), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSPEHQRAHWPQHKPNC 164


>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
 gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
          Length = 485

 Score = 45.8 bits (107), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 74/310 (23%), Positives = 110/310 (35%), Gaps = 56/310 (18%)

Query: 53  KLPYEIKSSPLLG---RYLQATLDLHPGDRIARESPLIVGP---KLALAE-----PICLG 101
           KLP  ++     G   R L AT  L  GD I  + PL+V P   KL L +     P+C  
Sbjct: 104 KLPRSVRIGACSGDKQRGLFATNALKAGDMIFEDLPLVVVPPMDKLVLMQSGKACPLCGS 163

Query: 102 CHKPLNPNLADNA-RCPRCFWPACSARCSGLSDAHTH----APECAILKLG--------C 148
                   +  N   C  C    CS  C      HT        C  + L         C
Sbjct: 164 SVSHSQHTIIMNGLDCGECGAVWCSKNCKKNDLTHTSLRHTRSRCRQINLNGWKKYESFC 223

Query: 149 ETLLAYNDYKYEAILPLRCLILQRRSPK----KYQELKDMEAHMSK-RGPGTEVYEEIDS 203
           +  +    Y    I     +IL +        K+Q L D+   + K     T +   +D+
Sbjct: 224 KQYVFVAAYSIGIIYA--SMILDKNGKNGVHPKFQLLADVSQRIRKDSSDSTNLGGTLDA 281

Query: 204 IVKYLRSN------------FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLY 251
                 S+            F E  P   LDD   +      G   +N V     +  ++
Sbjct: 282 SSGAFTSDDPEPMWTQALDLFKEAFPE--LDDMDMEAYLNCIGKFNINQV-----SGQIF 334

Query: 252 SVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNALWGTQPRREHLAITK 305
            V   + H+C PN ++    + +L ++   HI       T Y N L G + RR  L +  
Sbjct: 335 PVYSFINHDCEPNVRYEIDDKLRLKLYARKHIKKGEELLTTYVNPLHGVKLRRRELRVNW 394

Query: 306 YFNCSCERCS 315
            F C C+RC+
Sbjct: 395 GFLCHCDRCN 404


>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
 gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
          Length = 478

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 74/282 (26%), Positives = 115/282 (40%), Gaps = 39/282 (13%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           EI  S   GR L+AT +   GD I  E          L+  +C  C K     L    RC
Sbjct: 5   EIFDSEGKGRGLRATRESWAGDVIFAEPAYSAVVFDNLSHCVCHSCFKRQEKLL----RC 60

Query: 117 PRC-FWPACSARCSGLSDAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
            +C F   C   C   S  + H  EC AI K G     A N+      L  R L    R 
Sbjct: 61  GQCKFAHYCDRTCQKESWVN-HKHECLAIKKAG----KAPNE---NIRLAARILWRIERE 112

Query: 175 PKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
                E     + D++ H+ K G G     E  S+++ ++  F++  P         + +
Sbjct: 113 GGGLTEGCLVSIDDLQNHLDKFGDG-----EKSSLMEDVQK-FMDFWPSQS-QQFGMQYI 165

Query: 230 HWICGVIEVNGVDIGRYTQGLYSV-------ICLMEHNCLPNAK---HSNMMQSKLF--V 277
             I  VI  NG  +    +GL +V       +CL  H+C PN     ++  ++ +    +
Sbjct: 166 SHIFSVISCNGFTLSD-QRGLQAVGVAIFPNLCLTNHDCWPNCTVIFNNGKIELRALGKI 224

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
            +   ++  Y + L  T+ R E L    YF+C+CE C++ T+
Sbjct: 225 NKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCTNKTK 266


>gi|255732233|ref|XP_002551040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240131326|gb|EER30886.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 474

 Score = 45.8 bits (107), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 66/300 (22%), Positives = 107/300 (35%), Gaps = 56/300 (18%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNA----R 115
           G+ L A  D+  GD +  E PL   P LA    +     C  C K L  + A        
Sbjct: 118 GKGLYAKKDIKKGDLMWSEEPLFFVPPLANVHLMKNASACTYCGKLLQQSEASAVLKGLD 177

Query: 116 CPRCFWPACSARCS-------GLSDAHTHAPECAILK-LGCETLLAYNDY----KYEAIL 163
           C  C    CS +C        GL   + + P     K +  E  L   +Y    ++ A+ 
Sbjct: 178 CNVCAEVWCSKKCKQLEGTLHGLLKHNVYNPSSKRSKTIDAEAFLEIQEYCLEEQWNALY 237

Query: 164 PLRCL----ILQRRSPK--------------KYQELKDMEAHM-SKRGPGTEVYEEIDSI 204
            +  +    ++ R   K              +Y+ L        S  G    V E+ +S+
Sbjct: 238 AITLIYANCVIDRTGVKQKQFNAMARVSQDIRYKALNSSAGTFDSLNGGALFVQEQQESL 297

Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN 264
            +     FL   P + +D    +   ++ G   +N +D       ++ +   + HNC PN
Sbjct: 298 WRIGYEKFLRVFPKNPVD---YREFLYMMGTYNINNLD-----SNVFLIQSHLNHNCSPN 349

Query: 265 AKHSNMMQS----KLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
                 MQ     K+F  RD      ++T Y N       R+  L +   F C C +C +
Sbjct: 350 TSVETEMQRTDGLKVFAARDIKSGEELTTTYVNPSHTVHQRQRELRVNWGFVCGCAKCKE 409


>gi|201066195|gb|ACH92507.1| FI09725p [Drosophila melanogaster]
          Length = 350

 Score = 45.8 bits (107), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 22/37 (59%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC  +HQ+ HW  HKP C
Sbjct: 131 CAACGCHAPHACSKCKAIHYCSPEHQRAHWPQHKPNC 167


>gi|345560264|gb|EGX43389.1| hypothetical protein AOL_s00215g125 [Arthrobotrys oligospora ATCC
           24927]
          Length = 484

 Score = 45.8 bits (107), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 9/98 (9%)

Query: 232 ICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHS---NMMQSKLFVFRDTHISTM 286
           + G+ + N +  G  +   G+Y  ICL+ H+C+PN  H+   N+ Q  +   R  +    
Sbjct: 239 LSGIFKTNAIPCGYNSSIAGIYLTICLINHSCIPNTYHNWNENLEQETIHAIRPINAGEE 298

Query: 287 YTNALWG---TQPRREHLAITKYFNCSCERC-SDPTEL 320
            T +      + PRR+ L     F+C C+ C SDP +L
Sbjct: 299 ITISYISESMSNPRRKRLQECFGFDCQCQLCTSDPKQL 336


>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
          Length = 441

 Score = 45.8 bits (107), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 48/216 (22%), Positives = 85/216 (39%), Gaps = 37/216 (17%)

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHL 301
           GLY VI ++ H+C PNA    + + +  V R        + ++  Y      T  R++ L
Sbjct: 160 GLYPVISIINHSCFPNA--VLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKKSL 217

Query: 302 AITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCG 356
               +F+C C RC      D           +C ++H     C    ++  D+     C 
Sbjct: 218 KEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDH-----CEGFLLHDPDDAQSLVCQ 272

Query: 357 SCSARLNARDVHLVTSQLGEQVDKLVQENPNV---------KSLEEMLTKLEA-MFHPHH 406
            C    N  +    T +   +VDKL +E   +         +SL E + +L+  ++HP+ 
Sbjct: 273 LCGCGRNEEE----TKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQTQLWHPYS 328

Query: 407 YHCYAVKHSLI----QLYGTQPGYAYTQLSSSLLER 438
                   +L+    +LY  +    Y +L+    ER
Sbjct: 329 VILLRTGDTLLKICMELYDWKQALKYCRLTIPAYER 364



 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 14  RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           RC VC    T L +CS CK   YCG   Q+  W+LH+ +CK +
Sbjct: 68  RCDVCFRLCTNLKRCSVCKTTWYCGGTCQRNGWRLHQHECKAI 110


>gi|195146634|ref|XP_002014289.1| GL19121 [Drosophila persimilis]
 gi|194106242|gb|EDW28285.1| GL19121 [Drosophila persimilis]
          Length = 142

 Score = 45.8 bits (107), Expect = 0.062,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 3/93 (3%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   + GR L     +  GD +  E P   GPK   +  +CLGCH+ L      +
Sbjct: 2   LKARLEHDAVFGRCLAVAEPVQRGDLLVEELPFAFGPKRD-SGIVCLGCHQYLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C     G  D   H  EC + 
Sbjct: 61  SLDRCELCDWPLCGVCAGGDDDGIHHRSECVVF 93


>gi|170113033|ref|XP_001887717.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164637355|gb|EDR01641.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1168

 Score = 45.8 bits (107), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 9    DEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            D +  +C VCR      +  C+ C++V YC  + QK+ WK HKPKCK
Sbjct: 1121 DPDAHKCRVCRRKGKPKIKVCAGCQKVRYCSSECQKKDWKTHKPKCK 1167


>gi|302692316|ref|XP_003035837.1| hypothetical protein SCHCODRAFT_105332 [Schizophyllum commune H4-8]
 gi|300109533|gb|EFJ00935.1| hypothetical protein SCHCODRAFT_105332, partial [Schizophyllum
           commune H4-8]
          Length = 543

 Score = 45.8 bits (107), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CA+C +    +C+ C  V+YCG+  QK HWK HK  C+ +
Sbjct: 366 CAICGKKTTSRCTGCLSVSYCGQACQKAHWKEHKEFCRTI 405


>gi|302681111|ref|XP_003030237.1| hypothetical protein SCHCODRAFT_236142 [Schizophyllum commune H4-8]
 gi|300103928|gb|EFI95334.1| hypothetical protein SCHCODRAFT_236142 [Schizophyllum commune H4-8]
          Length = 565

 Score = 45.4 bits (106), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 23/34 (67%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
           L +C+ C+   YCG+QHQ+ HW  HK +C K+ +
Sbjct: 532 LLRCAKCRSTVYCGQQHQRAHWSQHKQRCFKVEW 565


>gi|195489535|ref|XP_002092780.1| GE14382 [Drosophila yakuba]
 gi|194178881|gb|EDW92492.1| GE14382 [Drosophila yakuba]
          Length = 347

 Score = 45.4 bits (106), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 21/37 (56%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC   HQ+ HW  HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSDHQRAHWPQHKPNC 164


>gi|195121108|ref|XP_002005063.1| GI20261 [Drosophila mojavensis]
 gi|193910131|gb|EDW08998.1| GI20261 [Drosophila mojavensis]
          Length = 349

 Score = 45.4 bits (106), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 25/49 (51%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           CA C   A   CS CK + YC   HQ+ HWK HKP C        S+PL
Sbjct: 128 CAACGCLAPLACSRCKNINYCSSSHQRAHWKQHKPSCATGQSAQPSAPL 176


>gi|383849521|ref|XP_003700393.1| PREDICTED: uncharacterized protein LOC100881996, partial
          [Megachile rotundata]
          Length = 402

 Score = 45.4 bits (106), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 15 CAVCRE----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C VC++     +L +C  C  ++YC K+HQKEHW  HK  CK +
Sbjct: 6  CHVCKKFGEWISLKRCGNCTMISYCSKEHQKEHWSQHKDLCKAI 49


>gi|198476005|ref|XP_001357231.2| GA14318 [Drosophila pseudoobscura pseudoobscura]
 gi|198137505|gb|EAL34300.2| GA14318 [Drosophila pseudoobscura pseudoobscura]
          Length = 142

 Score = 45.4 bits (106), Expect = 0.071,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 3/93 (3%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   + GR L     +  GD +  E P   GPK   +  +CLGCH+ L      +
Sbjct: 2   LKARLEHDAVFGRCLAVAEPVQRGDLLVEELPFAFGPKRD-SGIVCLGCHQYLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C     G  D   H  EC + 
Sbjct: 61  SLDRCELCDWPLCGVCAGGDDDGIHHRSECVVF 93


>gi|397567519|gb|EJK45634.1| hypothetical protein THAOC_35746 [Thalassiosira oceanica]
          Length = 351

 Score = 45.4 bits (106), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 10/98 (10%)

Query: 234 GVIEVNGVDIG--RYTQGLYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDTHIS---- 284
           GV + N   +G      GL+  +  + H+C PN  H     + Q+ +F  RD  I     
Sbjct: 145 GVYDTNSYQLGGDESHGGLFLTLARINHSCRPNVVHCWRPGLQQTLVFALRDIEIGEELF 204

Query: 285 TMYT-NALWGTQPRREHLAITKYFNCSCERCSDPTELG 321
           T+Y  +    T+ RRE+L     FNC+CE C++    G
Sbjct: 205 TIYGPSECMDTKGRREYLHDRFAFNCTCEMCTEGNNNG 242


>gi|170036771|ref|XP_001846235.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879678|gb|EDS43061.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 446

 Score = 45.4 bits (106), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 38/87 (43%), Gaps = 22/87 (25%)

Query: 14 RCAVCRETA-----------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
          RC VC   A           L  C  C+ + YCG+QHQKE WK HK  C+     + +  
Sbjct: 13 RCNVCFTRATEHRLTLVGIKLRYCKGCRLIGYCGEQHQKEDWKHHKDFCR-----VVTKV 67

Query: 63 LLGRYLQATLDLHPGDRIARESPLIVG 89
          L  R +   LD      +  E P IVG
Sbjct: 68 LTIRKIDHILD------VRTECPRIVG 88


>gi|449669894|ref|XP_002157768.2| PREDICTED: uncharacterized protein LOC100209808 [Hydra
          magnipapillata]
          Length = 519

 Score = 45.4 bits (106), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 1  MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
          M S I + +     C V     L +C  CK + YC ++HQK+HW +HK  CK+   E+ S
Sbjct: 1  MTSSISLNELYCFYCGV-DNIPLLRCVQCKSIWYCSREHQKKHWSIHKSFCKQKKKELVS 59


>gi|195383402|ref|XP_002050415.1| GJ20214 [Drosophila virilis]
 gi|194145212|gb|EDW61608.1| GJ20214 [Drosophila virilis]
          Length = 354

 Score = 45.4 bits (106), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 3/51 (5%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC--KKLPYEIKSSPL 63
           CA C   A H CS CKE+ YC   HQ+ HW+ HK  C   +LP E+ + PL
Sbjct: 128 CAACGCLAPHACSRCKEINYCCAAHQRAHWQQHKQHCGGGRLP-EVANKPL 177


>gi|397600303|gb|EJK57632.1| hypothetical protein THAOC_22302 [Thalassiosira oceanica]
          Length = 300

 Score = 45.4 bits (106), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 6/82 (7%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+  I ++D     CA C +T      L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 1  MSRCIAVDDGSAGVCANCGKTGSDTVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 60

Query: 56 YEIKSSPLLGRYLQ-ATLDLHP 76
           E+K   L  + L+ A  D  P
Sbjct: 61 AELKDEQLYSQGLERAAGDFCP 82


>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
          Length = 410

 Score = 45.4 bits (106), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 59/296 (19%), Positives = 110/296 (37%), Gaps = 37/296 (12%)

Query: 62  PLLGRYLQATLDLHPGDRIARESPLIVGP-----KLALAEPICLGCHKPLNPNLADNARC 116
           P+ G+ L A+ D+   + +  E P +  P      LA +  +C  C KP+         C
Sbjct: 102 PVTGKGLFASRDIKQDEILFTEEPYVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHC 161

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAY-NDYKYEAILPLRCLILQRRSP 175
             C    CS  C   +    H  EC  L       + +    +++A + +  +  +    
Sbjct: 162 KHCDMYYCSKSCRATAWDQFHQLECTRLNPAITPFMNFCEGERWQAPMAVARIYARLILA 221

Query: 176 KKYQELKDM------------EAHMSKRG-------PGTEVYEEIDSIVK--YLRSNFLE 214
            + +EL D+            E   +K         P  E++ +   +++  Y +     
Sbjct: 222 HQREELNDVIGRLDAFATVSQEERQAKETEWIFMEQPTRELWNKARDLLRKAYKQPPAKC 281

Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNG-VDIGRYTQGLYSVICLMEHNCLPN--------- 264
           K+  D+ +    K        +   G  +I     G+Y V   + HNC PN         
Sbjct: 282 KITQDLPEALKEKLFEDEETFLNFMGKFNINNQNGGMYLVHSHINHNCYPNVSIDYPNPK 341

Query: 265 AKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
           +++   +++   + +   +   Y N  W  + R+ +L  +  FNC CERC    EL
Sbjct: 342 SQYKIAVRAVRDIKQGEQLFETYVNPRWNKETRQTYLDKSYLFNCKCERCEKDLEL 397


>gi|396484759|ref|XP_003842008.1| hypothetical protein LEMA_P077680.1 [Leptosphaeria maculans JN3]
 gi|312218584|emb|CBX98529.1| hypothetical protein LEMA_P077680.1 [Leptosphaeria maculans JN3]
          Length = 382

 Score = 45.4 bits (106), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 28/49 (57%)

Query: 5  IEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          +++  + L  CA C   A  KCS C+ + YCGK  Q  HWK HK  C++
Sbjct: 1  MQMTPQPLTTCAACGNPATDKCSGCRSINYCGKFCQTTHWKSHKLTCEQ 49


>gi|356518197|ref|XP_003527768.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
          max]
          Length = 734

 Score = 45.4 bits (106), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
          CAVCR  A  KCS CK V YC +  Q+ HWK  HK +CK+ 
Sbjct: 57 CAVCRNPANKKCSRCKSVRYCSQACQQAHWKSEHKMRCKEF 97


>gi|260835866|ref|XP_002612928.1| hypothetical protein BRAFLDRAFT_278662 [Branchiostoma floridae]
 gi|229298310|gb|EEN68937.1| hypothetical protein BRAFLDRAFT_278662 [Branchiostoma floridae]
          Length = 287

 Score = 45.4 bits (106), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 36/69 (52%), Gaps = 2/69 (2%)

Query: 14  RCAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE-IKSSPLLGRYLQAT 71
           +C  C E  A  KCSACK+V YCG+  QK HW  HK  C  L  E ++   L  +  Q +
Sbjct: 219 QCFTCGELQAEKKCSACKKVKYCGQACQKLHWFTHKKVCATLKAEYLQEQELAQKMKQQS 278

Query: 72  LDLHPGDRI 80
           L+   G  I
Sbjct: 279 LEGQEGTDI 287


>gi|452980714|gb|EME80475.1| hypothetical protein MYCFIDRAFT_156218 [Pseudocercospora
          fijiensis CIRAD86]
          Length = 235

 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)

Query: 7  IEDEELM-RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          I  EEL  +CA C +T   L KC+ C+ + YC ++ Q  HWK+HK +C +L
Sbjct: 9  INAEELQQKCARCSKTGGPLKKCAKCRSILYCDRECQTLHWKMHKKECSRL 59


>gi|356509862|ref|XP_003523663.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
           max]
          Length = 742

 Score = 45.1 bits (105), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           CAVCR  A  KCS CK V YC +  Q+ HWK  HK +CK+ 
Sbjct: 60  CAVCRNPANKKCSRCKSVRYCSQACQQMHWKSEHKVRCKEF 100


>gi|332018918|gb|EGI59464.1| Egl nine-like protein 1 [Acromyrmex echinatior]
          Length = 500

 Score = 45.1 bits (105), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 1/42 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          CAVC +T  L +CS CK V YC K+HQ+  WK H+  C   P
Sbjct: 28 CAVCDKTDKLSRCSRCKVVFYCTKEHQRRDWKRHREFCATHP 69


>gi|321478511|gb|EFX89468.1| hypothetical protein DAPPUDRAFT_40294 [Daphnia pulex]
          Length = 464

 Score = 45.1 bits (105), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 21/33 (63%)

Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          E  L  CS C+ +AYCG  HQKEHW  HK  CK
Sbjct: 46 EIKLLSCSGCRMIAYCGLAHQKEHWAQHKDLCK 78


>gi|397634138|gb|EJK71301.1| hypothetical protein THAOC_07282, partial [Thalassiosira oceanica]
          Length = 1664

 Score = 45.1 bits (105), Expect = 0.093,   Method: Composition-based stats.
 Identities = 25/60 (41%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 15   CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
            CA C E       L  C+AC  V YCG   Q+ H   HK  CKK   EIK   L G+ L+
Sbjct: 1256 CANCGEEGSDAVKLRNCTACFLVKYCGVDCQRAHRTRHKKACKKRAAEIKDEKLYGQGLE 1315


>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
           vitripennis]
          Length = 621

 Score = 45.1 bits (105), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 70/335 (20%), Positives = 122/335 (36%), Gaps = 73/335 (21%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           +++ +   GR+L A  D++PG+ I  E P +    L      C  C  P    +     C
Sbjct: 203 DVQYNEKYGRHLVAKRDINPGEIIFIEEPYMHCLDLVRGYTYCFHCLTPCLITIP----C 258

Query: 117 PRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQ---- 171
             C W   CS  C   +    H  ECA+     E +   +  K+ A+  L C + +    
Sbjct: 259 EHCGWAMFCSEGCKQQAWVKYHDLECAVYDFAKENVDG-DGVKHMAVKSLICAVREAGGV 317

Query: 172 ---RRSPKKYQELKDMEAHMSKRGP-GTEVYEEI----------------DSIVKYLR-- 209
              R   K +    D      K G   +  ++ I                ++ +  LR  
Sbjct: 318 DQLRDELKAFDSCTDKLKGFVKDGKIQSSGFKSIYALSSNTSDKAEPIHKNNTIMILRAL 377

Query: 210 ---SNFLEKLPG-----DVLDDTSAKCL----HWICGVIEVNG--VDIGR--YTQGLYSV 253
              + +  K PG     ++  D     L    + +  + ++N   + IGR  YT GL + 
Sbjct: 378 VKNTKYFGKKPGFEKTEELKKDDKVLFLGSLVYKLSKIFQLNSRIIPIGRDFYTTGLDAR 437

Query: 254 IC-----------------LMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNA 290
           +C                 L+ H+C+PN K        + V+      + + +   Y   
Sbjct: 438 MCENKQCCTTGLYIAPITSLLNHSCIPNVKRCFSNNYSVIVYAVQPIKKGSQLFDCYQQE 497

Query: 291 LW--GTQPRREHLAITKYFNCSCERCSDPTELGTY 323
            +     PR++HL  T  FNC C+ C +  ++  Y
Sbjct: 498 FYEYNISPRQKHLKKTYNFNCDCKACKEKWDIVEY 532


>gi|397588547|gb|EJK54309.1| hypothetical protein THAOC_26078, partial [Thalassiosira
          oceanica]
          Length = 314

 Score = 45.1 bits (105), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 5/78 (6%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+ +  I D+ +  CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 1  MSCVPVIMDDGIKTCANCGKQGSDSVVLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 60

Query: 56 YEIKSSPLLGRYLQATLD 73
           E+K   L G+  +   D
Sbjct: 61 AELKDEQLFGQGKRGMKD 78


>gi|156408512|ref|XP_001641900.1| predicted protein [Nematostella vectensis]
 gi|156229041|gb|EDO49837.1| predicted protein [Nematostella vectensis]
          Length = 430

 Score = 45.1 bits (105), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
           C VC E  ++ KC+ACK+V YC    QK HW  HK  C++L  E +   L+
Sbjct: 315 CYVCGERRSVKKCAACKKVGYCSVSCQKLHWSTHKKHCQRLAKEYEQELLM 365


>gi|198438401|ref|XP_002126749.1| PREDICTED: similar to programmed cell death 2 [Ciona intestinalis]
          Length = 340

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
           C +C    + KCS C+EV YC K HQ  HWK LHK +C K
Sbjct: 129 CGLCGNHGVKKCSRCREVQYCSKDHQVFHWKLLHKYQCSK 168


>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
           rerio]
          Length = 231

 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 86/222 (38%), Gaps = 34/222 (15%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+  SP  GR L+AT +   GD +  E P       + A  IC  C +          RC
Sbjct: 5   EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFR----RQEKLQRC 60

Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
            +C F   C   C   +    H  ECA +K        Y     E +  L   IL R   
Sbjct: 61  GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111

Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                S  +   L+D+E H+       ++ E+     K    NFL+  P +    T    
Sbjct: 112 QGSVVSDNQLTTLEDLEDHI------CDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN 264
            H I GVI  NG  +   R  Q    GL+  +CL+ H+C PN
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPN 206


>gi|389747799|gb|EIM88977.1| hypothetical protein STEHIDRAFT_52815 [Stereum hirsutum FP-91666 SS1]
          Length = 1144

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 3/49 (6%)

Query: 7    IEDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            + D E  +C VCR   +  L  C  CK+V YC    QK  WK+HK +CK
Sbjct: 1095 LRDPEARKCFVCRGKGKPKLMTCKGCKKVRYCSDACQKRDWKVHKKRCK 1143


>gi|195429543|ref|XP_002062817.1| GK19653 [Drosophila willistoni]
 gi|194158902|gb|EDW73803.1| GK19653 [Drosophila willistoni]
          Length = 574

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 49/99 (49%), Gaps = 8/99 (8%)

Query: 224 TSAKCLHWICGVIEVNGV-DIGRYTQGLYSVICLMEHNCLPNA-KHSNMMQSKLFVFRDT 281
           TS   +H I  V +VN   D   ++ G Y+ + L+ H+C PN  +     ++ LFV R  
Sbjct: 393 TSPSNMHGIDLVEQVNETKDDQTHSSGAYAFLSLLNHSCAPNTLRIYEGTKAYLFVLRPI 452

Query: 282 HIS-TMYTN-----ALWGTQPRREHLAITKYFNCSCERC 314
                +Y N     A++  + R+E L++   FNC CE C
Sbjct: 453 KAGDVLYDNYGAHFAIFSKKERQETLSMQYRFNCKCEAC 491


>gi|449541346|gb|EMD32330.1| hypothetical protein CERSUDRAFT_118710 [Ceriporiopsis subvermispora
           B]
          Length = 446

 Score = 44.7 bits (104), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 5/49 (10%)

Query: 9   DEELMR-CAVCR----ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           DE L+R C  CR    +  L +C+AC  V YC KQ QKE W+ HK  CK
Sbjct: 211 DERLLRRCETCRVREPQKTLFRCAACGMVWYCSKQCQKEGWQTHKEICK 259


>gi|308512211|ref|XP_003118288.1| CRE-SET-30 protein [Caenorhabditis remanei]
 gi|308238934|gb|EFO82886.1| CRE-SET-30 protein [Caenorhabditis remanei]
          Length = 548

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 61/230 (26%), Positives = 97/230 (42%), Gaps = 36/230 (15%)

Query: 124 CSARCSGLSDAHTHAPECAILKLGC-----ETLLAYNDYKYEAIL-----PLRCLILQRR 173
           CS +C     A  H  EC  ++        E +L     +Y+ IL      +      R 
Sbjct: 62  CSKQCQVFG-AFDHKYECGAIQKCADLNTDERMLIRIIGRYKDILDGNDKKIDGFYTNRE 120

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGDVLDDTSAKCLHW- 231
           S +   ++ +  A M K      V+++I D + ++   N+L       +D+  A  LH  
Sbjct: 121 SGRTVMQIWEHCADMKKDEHAMNVFKKIYDRVKQFGDKNYL-------VDEEVAFQLHSR 173

Query: 232 -ICGVIEVNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HIS 284
                  ++ VD  R   +GLY  +C  +H+C PNA +S N   +KL    D     ++ 
Sbjct: 174 NFINRHSISNVDYLREIGKGLYLDLCRYDHSCRPNAIYSCNGTVAKLRALHDNVDLENVE 233

Query: 285 TMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
           T  T+  +   P     RR  L  T YF C CERC DP +   + +A+ C
Sbjct: 234 T--THYTYIELPPCKIQRRHMLKETWYFECHCERCEDPED--NWLTAVIC 279


>gi|397600839|gb|EJK57744.1| hypothetical protein THAOC_22182 [Thalassiosira oceanica]
          Length = 415

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 5/74 (6%)

Query: 1   MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M+  + + DE    CA C   A     L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 111 MSCFVPVADEGNETCANCGTAASDSVKLRDCAACRLVKYCGVDCQRAHRKQHKKACKQRA 170

Query: 56  YEIKSSPLLGRYLQ 69
            E+K   L  + L+
Sbjct: 171 AELKDEELYSQGLE 184


>gi|302828706|ref|XP_002945920.1| hypothetical protein VOLCADRAFT_86388 [Volvox carteri f.
          nagariensis]
 gi|300268735|gb|EFJ52915.1| hypothetical protein VOLCADRAFT_86388 [Volvox carteri f.
          nagariensis]
          Length = 1263

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 28/43 (65%)

Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          +  CA+C + AL +C+ CK   YC ++ Q++HW  HK +C +L
Sbjct: 1  MASCAICDKPALFRCARCKVERYCSRECQRDHWPHHKTRCVQL 43


>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
          Length = 348

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 19/98 (19%)

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLF---------VFRDTH 282
           CG IE    +IG+   GLY  +C  +H+C PN  ++ N   + L              TH
Sbjct: 59  CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCNSFVATLRGLTAGVDLRNLNSTH 111

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
            S  Y + +  TQ RR+ L  T YF C C RC DP ++
Sbjct: 112 YS--YIDLINTTQQRRKLLKDTWYFECHCTRCDDPDDV 147


>gi|156551085|ref|XP_001602963.1| PREDICTED: egl nine homolog 1-like [Nasonia vitripennis]
          Length = 440

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C VC R   L +CS CK + YC K+HQK  WK HK  C +L
Sbjct: 26 CVVCGRNDKLLRCSRCKSIFYCTKEHQKLDWKHHKSVCSRL 66


>gi|125810725|ref|XP_001361596.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
 gi|195154214|ref|XP_002018017.1| GL16991 [Drosophila persimilis]
 gi|54636772|gb|EAL26175.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
 gi|194113813|gb|EDW35856.1| GL16991 [Drosophila persimilis]
          Length = 574

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 79/325 (24%), Positives = 124/325 (38%), Gaps = 57/325 (17%)

Query: 44  WKLHKPKCKKLPY-----EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI 98
           +KL      ++PY     E++ SP  GR++    DL  GD +A E P        +    
Sbjct: 170 FKLSFDSHAQVPYIADCLELRESPDEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIR 229

Query: 99  CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK-----------LG 147
           C  C +     L     C  C    CS  C   + +  H  EC I+            + 
Sbjct: 230 CATCKRENYLTLIPCDSC--CSVMFCSEECKLRATSSFHRFECPIIDFLHRMFNKIHGIA 287

Query: 148 CETLLAYNDYKYEAILPLRCLILQRRSPKK------YQELKDMEAHMSKRGPGTEVYEEI 201
             T +A  D  Y +I  L     Q ++  K      Y +L   E + +  G  T  +   
Sbjct: 288 LRTTMAALDL-YPSIEELMAFCEQPQNQHKCAFDLDYSQLSAQEHYRAIHGLVTNQHLRS 346

Query: 202 DS---------------IVKYL-RSNFLEKLPG-----DVLD---DTSAKCLHWICGVIE 237
            S               I++Y    +FL    G     D+L     TS   +H I  V +
Sbjct: 347 VSDLFQRSVVCAVLKHFIIEYTPLKDFLGGEDGRNFFTDLLFRHLQTSPSNMHGIDLVEQ 406

Query: 238 VNGV-DIGRYTQGLYSVICLMEHNCLPNA-KHSNMMQSKLFVFRDTHI-STMYTN----- 289
           VN   D   ++ G Y+ + L+ H+C PN  +     ++ LFV R     + +Y N     
Sbjct: 407 VNETKDDQTHSSGAYAFLSLLNHSCAPNTLRIYEGTKAYLFVLRPIKAGNVLYDNYGAHF 466

Query: 290 ALWGTQPRREHLAITKYFNCSCERC 314
           A++  Q R + L++   F+C CE C
Sbjct: 467 AVFSKQQRLDTLSMQYRFDCKCEGC 491


>gi|389743859|gb|EIM85043.1| hypothetical protein STEHIDRAFT_81876 [Stereum hirsutum FP-91666
           SS1]
          Length = 171

 Score = 44.7 bits (104), Expect = 0.12,   Method: Composition-based stats.
 Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 2/41 (4%)

Query: 15  CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           CA C  RE  + +C+ C +V YCG    ++HW  HKP CK+
Sbjct: 128 CAGCGKREVKMKQCNKCHKVLYCGAVCLEQHWPTHKPICKQ 168


>gi|158289961|ref|XP_311564.4| AGAP010381-PA [Anopheles gambiae str. PEST]
 gi|157018410|gb|EAA07249.4| AGAP010381-PA [Anopheles gambiae str. PEST]
          Length = 91

 Score = 44.7 bits (104), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 44/82 (53%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
          C  C   A  KC+ C++V YC   HQK+HWK         PYE+  S  +GR+  AT  +
Sbjct: 5  CGYCGVPAKLKCAGCQQVYYCNPDHQKKHWKAKHKHECVKPYELTKSDEIGRHFVATKTI 64

Query: 75 HPGDRIARESPLIVGPKLALAE 96
               +  E+PL++GPK  LA+
Sbjct: 65 EKDTILFSENPLVIGPKWNLAD 86


>gi|340714736|ref|XP_003395881.1| PREDICTED: hypothetical protein LOC100649411 [Bombus terrestris]
          Length = 376

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          C +C  T  L +CS CK V+YC K+HQ+  WK HK  C
Sbjct: 24 CVICNRTDKLLRCSRCKAVSYCTKEHQRRDWKRHKEFC 61


>gi|321462665|gb|EFX73687.1| hypothetical protein DAPPUDRAFT_227055 [Daphnia pulex]
          Length = 1025

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 4/52 (7%)

Query: 7    IEDEELMR--CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            IEDE      CAVC++T   L +CS C+   YC  + Q  HW +HK  CKKL
Sbjct: 974  IEDEPADEKCCAVCKKTPEKLKRCSRCRSTVYCSVECQHSHWPMHKLICKKL 1025


>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 445

 Score = 44.7 bits (104), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 63/266 (23%), Positives = 102/266 (38%), Gaps = 40/266 (15%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR + AT DL  G     E P +     A     C GC + L  ++   + C    +  C
Sbjct: 17  GRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQTCSGCFR-LAAHMQQCSSCKVVQY--C 73

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE---- 180
           S  C   SD   H PEC   K                I P    +L R   K+ ++    
Sbjct: 74  SQTCQR-SDWSIHKPECEGFK-----------AVQPRIPPSPVRLLGRMMFKRAKDCNEF 121

Query: 181 ---LKDMEAHMSKRGPGTEVYEEIDSIVKYLR--SNFLEKLPGDVLDDTSAKCLHWICGV 235
              +  +E+H  KR       ++I+ I   L+  S F+   P  +L  T+A  +  +C  
Sbjct: 122 ERVVGQLESHRDKRAS-----KDIEHIAAMLQMASGFI---PPALLLSTTADMIA-LCCK 172

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTN 289
           I+VN +   R    +Y  +  + H+C+PNA           +     +     I+  Y +
Sbjct: 173 IQVNTMTTERGV-AIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVD 231

Query: 290 ALWGTQPRREHLAITKYFNCSCERCS 315
                + R+  L    YF+C+C  C+
Sbjct: 232 VFQSCETRQRQLKEQYYFDCTCRLCT 257



 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 7/61 (11%)

Query: 15  CAVCRETALH--KCSACKEVAYCGKQHQKEHWKLHKPKC---KKLPYEIKSSP--LLGRY 67
           C+ C   A H  +CS+CK V YC +  Q+  W +HKP+C   K +   I  SP  LLGR 
Sbjct: 51  CSGCFRLAAHMQQCSSCKVVQYCSQTCQRSDWSIHKPECEGFKAVQPRIPPSPVRLLGRM 110

Query: 68  L 68
           +
Sbjct: 111 M 111


>gi|170595415|ref|XP_001902374.1| MYND finger family protein [Brugia malayi]
 gi|158589993|gb|EDP28779.1| MYND finger family protein [Brugia malayi]
          Length = 464

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
           CG IE    +IG+   GLY  +C  +H+C PNA ++          + + + +   +   
Sbjct: 231 CGYIE----EIGK---GLYLDLCAYDHSCRPNAIYTCNSFVATLRGLTANVDLRNLSSTY 283

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
             Y + +  TQ RR+ L  T YF C C RC DP +
Sbjct: 284 YSYIDLINTTQQRRKLLKDTWYFECHCTRCDDPDD 318


>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
          Length = 428

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 64/279 (22%), Positives = 107/279 (38%), Gaps = 34/279 (12%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPN 109
           L  E  S+   G  L+A   L PG+ + R  PL      G +  + +   LG  K +   
Sbjct: 4   LKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--- 60

Query: 110 LADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
                RC +C     CSA+C   +    H  EC  LK  C+     +  +    + ++ +
Sbjct: 61  -----RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLM 113

Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD------ 222
                  +K     D+E+++SK      + EE    ++ L   F   +  ++ D      
Sbjct: 114 EETPSESEKLYSFYDLESNISK------LTEEKKEGIRQLAMTFQHFMREEIQDVSQLLP 167

Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFR 279
             D         C    +   ++     GLY  + L+ H+C PN +   N     L   R
Sbjct: 168 AFDIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVR 227

Query: 280 DTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           D      ++  Y + L  ++ RR+HL     F+C C RC
Sbjct: 228 DIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC 266


>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
          Length = 549

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 70/279 (25%), Positives = 110/279 (39%), Gaps = 31/279 (11%)

Query: 65  GRYLQATLDLHPGDRIAR-ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPA 123
           GR + AT    PGD IAR ++P +V P    A   C  C K   P       C  C   A
Sbjct: 19  GRSIYATRRFKPGDVIARFDNPAVVLPPGHRALEYCNHCVKKQRPAGVKLRACTGCKTVA 78

Query: 124 -CSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYKYEAILPLRCLILQRRSPKKYQEL 181
            C   C   + +  H  EC  ++   E   A+  D+      P+R        P+     
Sbjct: 79  YCGPACQRANWSLVHKLECKAIQRLHEAKPAHQPDW---VPTPIRAAAQVMLRPQVLARF 135

Query: 182 KDMEAHMS--KRGPGTEVYEEIDSIVK--YLRSNFLEKLPG--DVLDDTSAKCLHWICGV 235
           +++E H+   ++  GT++  +   +VK   L +   E+L     VL              
Sbjct: 136 EELEGHVEQWRKKDGTDLQLQSHGVVKCLGLDTVTFERLETSFQVLCKLQTNAFSRTEEY 195

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNA 290
            E  GV        L + + ++ H+C+PNA       +  +++  F+     I   Y + 
Sbjct: 196 YETGGV-------FLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIEISYIDQ 248

Query: 291 LWGTQPR-REHLAITKY-FNCSCERCSDPTELGTYFSAM 327
              TQPR + H  +  Y F CSC +C    +L  Y  AM
Sbjct: 249 ---TQPRGKRHGELDLYHFECSCYKCQ--KDLDEYQVAM 282


>gi|357613780|gb|EHJ68714.1| putative Egl nine-like protein 1 [Danaus plexippus]
          Length = 473

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          L  CAVC +    +C  C  V YC  +HQ++ WK HK  C
Sbjct: 7  LASCAVCNQQTQRRCGRCFSVYYCNTEHQRQDWKRHKINC 46


>gi|453089339|gb|EMF17379.1| hypothetical protein SEPMUDRAFT_35708 [Mycosphaerella populorum
          SO2202]
          Length = 240

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
          CA C  T   L KC+ CK V YC ++ Q+ HWK HK +C +L  E    P
Sbjct: 10 CAKCNNTEGELKKCTKCKSVLYCNRECQRAHWKKHKRQCSRLAGEGAPPP 59


>gi|395327455|gb|EJF59854.1| hypothetical protein DICSQDRAFT_137978 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 443

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CAVC + A  +C+ C+ +AYCG + Q+  W  HK  C+ L
Sbjct: 258 CAVCGKRATSRCAGCQSIAYCGSECQRSDWPDHKITCRSL 297


>gi|406988119|gb|EKE08230.1| hypothetical protein ACD_17C00276G0002 [uncultured bacterium]
          Length = 361

 Score = 44.3 bits (103), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 2/40 (5%)

Query: 15 CAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          C+ C +   +L +C  C    YCGKQHQ++HW  HKP C+
Sbjct: 23 CSQCEKYNQSLLRCKQCLSTLYCGKQHQRQHWPTHKPVCQ 62


>gi|328778894|ref|XP_397368.4| PREDICTED: egl nine homolog 1-like [Apis mellifera]
          Length = 486

 Score = 44.3 bits (103), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          C VC  T  L +CS CK V YC K+HQK  WK HK  C
Sbjct: 24 CVVCNRTDKLLRCSRCKAVFYCTKEHQKRDWKRHKEFC 61


>gi|323451400|gb|EGB07277.1| hypothetical protein AURANDRAFT_64957 [Aureococcus anophagefferens]
          Length = 1781

 Score = 44.3 bits (103), Expect = 0.15,   Method: Composition-based stats.
 Identities = 18/33 (54%), Positives = 18/33 (54%)

Query: 26   CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
            CS CK V YC    QK HW  HK  C K P EI
Sbjct: 1747 CSRCKAVVYCDAACQKAHWATHKKHCAKKPAEI 1779


>gi|242208886|ref|XP_002470292.1| predicted protein [Postia placenta Mad-698-R]
 gi|220730599|gb|EED84453.1| predicted protein [Postia placenta Mad-698-R]
          Length = 380

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 27/38 (71%), Gaps = 3/38 (7%)

Query: 19  RETALHK---CSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           RET + +   C ACK+  YCG++ QK+ WK HKPKC++
Sbjct: 325 RETTVAQFKCCGACKDAVYCGQECQKKDWKAHKPKCQE 362


>gi|51535049|dbj|BAD37420.1| putative ubiquitin-specific protease 15 [Oryza sativa Japonica
           Group]
 gi|51535599|dbj|BAD37542.1| putative ubiquitin-specific protease 15 [Oryza sativa Japonica
           Group]
          Length = 1069

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSP 62
           CA C   A  KCS CK V YC +  Q +HW+  HK KCK++    KSSP
Sbjct: 105 CAACGYIATKKCSGCKRVRYCSQGCQSKHWQSGHKFKCKQMK---KSSP 150


>gi|350414632|ref|XP_003490373.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
           [Bombus impatiens]
          Length = 439

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 30/49 (61%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
           ++  C++C E A  +CS CKEV YCG++ Q + W  HK  C K+   I+
Sbjct: 387 DVKECSLCHEEAKKRCSKCKEVWYCGRECQVKDWVKHKNICDKITKSIE 435


>gi|336375477|gb|EGO03813.1| hypothetical protein SERLA73DRAFT_119451 [Serpula lacrymans var.
          lacrymans S7.3]
          Length = 98

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 8  EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           D E  RC VCR   +  L  C+ CK+V YC  + QK+ WK+HK +CK
Sbjct: 50 RDPEAYRCFVCRGKGKPKLKMCTVCKKVRYCSSECQKKDWKVHKLRCK 97


>gi|393911885|gb|EJD76488.1| CBR-EGL-9 protein [Loa loa]
          Length = 483

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 43/92 (46%), Gaps = 9/92 (9%)

Query: 15 CAVCRE------TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
          CAVC         AL +C  C  VAYCG++H+   WK HKP CK +  ++  +      L
Sbjct: 10 CAVCGAPKSIDGNALSQCKQCSMVAYCGEEHRNFDWKRHKPLCKTVQSQLTRTG--ATTL 67

Query: 69 QATLDLHPGDRIA-RESPLIVGPKLALAEPIC 99
           AT+     D++   +   +V     L + +C
Sbjct: 68 SATIGQRSVDKLTDSDGYTVVTAAKWLGDKLC 99


>gi|342320836|gb|EGU12774.1| Microtubule-associated protein, putative [Rhodotorula glutinis
          ATCC 204091]
          Length = 560

 Score = 44.3 bits (103), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 3/62 (4%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
          CAVC + A  +C  CK V +C  + QK  W  HK  CK  P      PL  R L    DL
Sbjct: 12 CAVCDQPAKSRCGGCKRVPFCSPRCQKLIWSTHKALCKADPDVFHPPPLSARELG---DL 68

Query: 75 HP 76
           P
Sbjct: 69 RP 70



 Score = 42.4 bits (98), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 29/54 (53%)

Query: 10  EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           E+ ++CAVC +    +CS C  + +CG   QK  W  HK  CK  P    S+PL
Sbjct: 281 EKTVKCAVCDKDTKLRCSGCGILPFCGAACQKLLWPTHKTLCKVDPEVFHSAPL 334


>gi|195472004|ref|XP_002088292.1| GE18492 [Drosophila yakuba]
 gi|194174393|gb|EDW88004.1| GE18492 [Drosophila yakuba]
          Length = 151

 Score = 44.3 bits (103), Expect = 0.16,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   LLGR L  +  +  G+ +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLQEDALLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C + C+   D   H  EC + 
Sbjct: 61  SLDRCELCDWPLCGS-CADDEDVTEHRGECQVF 92


>gi|321469693|gb|EFX80672.1| hypothetical protein DAPPUDRAFT_103160 [Daphnia pulex]
          Length = 2032

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 2/41 (4%)

Query: 15   CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
            CA C++T   L +CS C+ ++YC  + Q++HWK HK  C K
Sbjct: 1991 CAACKQTTQNLKRCSRCRSISYCSVECQRQHWKQHKVFCTK 2031



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 2/54 (3%)

Query: 15   CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
            C  C++    L +CS C+   YC  + Q++HW  HK   K L    + SP+LG+
Sbjct: 953  CNACQKVPQNLKRCSRCRTAVYCSVECQRQHWAQHKKASKMLRTPSRGSPVLGK 1006


>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
          Length = 535

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
           CG IE    +IG+   GLY  +C  +H+C PN  ++          + + + +   +   
Sbjct: 232 CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAH 284

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
             Y + +  TQ RR+ L  T YF C C RCSDP +
Sbjct: 285 YSYIDLINTTQQRRKLLKDTWYFECHCTRCSDPDD 319


>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
           70294]
 gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 499

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 11/83 (13%)

Query: 243 IGRY-----TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNAL 291
           IGRY     +  ++ +  L+ HNC PN +   +   ++ V+   +IS      T Y N L
Sbjct: 330 IGRYNINQISDQMFFLPSLINHNCEPNVRFEVVSNKEIRVYARKNISAGQELLTNYINPL 389

Query: 292 WGTQPRREHLAITKYFNCSCERC 314
            G + RR  L +   F C C+RC
Sbjct: 390 HGVKLRRRELRVNYGFLCHCDRC 412


>gi|71654175|ref|XP_815712.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70880787|gb|EAN93861.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 584

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 127/322 (39%), Gaps = 49/322 (15%)

Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAIL-----KLG----CETLLAYNDYKYEAILPL 165
           C RC F   CS RC+          EC +L      LG    CE  +  +D+   A   +
Sbjct: 211 CRRCQFVSYCSDRCASCHGRQHEEYECRLLFRLREMLGSMRSCEAAVP-DDFFTMATHCI 269

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
             +   +   + ++ +  +E+H  +   G T +   +  +       F+ ++ G V  + 
Sbjct: 270 TTMSGVKMRKEGHEAVLRLESHEVEVSQGLTPLVRLVQDLFSGEDPTFVTRILGVVRCNA 329

Query: 225 SAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
            A C        + +G+ +G+      SV     H+CLPN A  +  + +   +     +
Sbjct: 330 LAVC--------DASGLPVGQALHA-ASVTSYFNHSCLPNCAIEAGAIVTTRAIRPGEEL 380

Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERC--SDPTELGTYFSAMKCLNEHKDQGDCW 340
           +  Y   L W    RRE LA   +F+C C RC   D +   +  SA   L   +++   +
Sbjct: 381 TISYLPQLYWPAWLRREELAERYFFDCRCVRCDDGDRSPFESALSATLRLGGSREKEREY 440

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
           I  V  L       CG    R+ A+DV  V   +G++            +L  +L +   
Sbjct: 441 ISSVQIL-------CG----RVRAKDVGDV--DVGDR-----------DALLHLLQECRQ 476

Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
              P HY C+ + ++L  +Y  
Sbjct: 477 HLFPFHYLCHELHNTLSFVYAV 498


>gi|340714716|ref|XP_003395871.1| PREDICTED: egl nine homolog 1-like [Bombus terrestris]
          Length = 486

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          C +C  T  L +CS CK V+YC K+HQ+  WK HK  C
Sbjct: 24 CVICNRTDKLLRCSRCKAVSYCTKEHQRRDWKRHKEFC 61


>gi|194862042|ref|XP_001969907.1| GG23679 [Drosophila erecta]
 gi|190661774|gb|EDV58966.1| GG23679 [Drosophila erecta]
          Length = 207

 Score = 44.3 bits (103), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 43/94 (45%), Gaps = 4/94 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   LLGR L  +  +  G+ +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLQEDALLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILK 145
           +  RC  C WP C + C+   D   H  EC +  
Sbjct: 61  SLDRCELCDWPLCGS-CADDEDVTEHRGECQVFS 93


>gi|397616736|gb|EJK64110.1| hypothetical protein THAOC_15187 [Thalassiosira oceanica]
          Length = 299

 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 10/74 (13%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+ I E+E      CA C + A     L +C+AC+ V YCG   Q+ H KLHK  CK+  
Sbjct: 1  MSGIQEVET-----CANCGKEASEAVKLKQCTACRLVKYCGVDCQRAHRKLHKKACKQRA 55

Query: 56 YEIKSSPLLGRYLQ 69
           E+K   L  + L+
Sbjct: 56 AELKDEQLYSQGLE 69


>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
          Length = 545

 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
           CG IE    +IG+   GLY  +C  +H+C PN  ++          + + + +   +   
Sbjct: 232 CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAH 284

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
             Y + +  TQ RR+ L  T YF C C RCSDP +
Sbjct: 285 YSYIDLINTTQQRRKLLKDTWYFECHCTRCSDPDD 319


>gi|125598076|gb|EAZ37856.1| hypothetical protein OsJ_22202 [Oryza sativa Japonica Group]
          Length = 995

 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSP 62
          CA C   A  KCS CK V YC +  Q +HW+  HK KCK++    KSSP
Sbjct: 31 CAACGYIATKKCSGCKRVRYCSQGCQSKHWQSGHKFKCKQMK---KSSP 76


>gi|157866065|ref|XP_001681739.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68125037|emb|CAJ03122.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 622

 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 48/211 (22%), Positives = 83/211 (39%), Gaps = 45/211 (21%)

Query: 229 LHWICGVI--------EVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
           L+ I GV+        + +G+ + +      S+     H+C PN    + ++  +   R 
Sbjct: 369 LYTIIGVLCCNALELADPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIVTTRT 427

Query: 281 THI----STMYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLN 331
            H+    S  Y   L W T+ RRE L+ + YF C C+RC     DP E          L 
Sbjct: 428 IHVGEELSIAYIPQLYWPTRLRRERLSESYYFVCRCQRCESSNKDPFERALSME----LP 483

Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
             + +      P+         TC +  +R+    V  V+ +  +++ KL++E       
Sbjct: 484 TARREATKHFHPI------VQTTCATVRSRM----VGDVSQKDADELSKLLRE------- 526

Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
                 L     P HY C+ V++ L  +Y  
Sbjct: 527 ------LSTHLFPFHYLCHEVRNCLSFVYAV 551


>gi|336375469|gb|EGO03805.1| hypothetical protein SERLA73DRAFT_158294 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 1147

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D    RC VCR   +  L  C+ CK+V YC    QK+ WK+HK +CK
Sbjct: 1099 RDPAAYRCFVCRGKGKPKLKMCTVCKKVRYCSSGCQKKDWKIHKLRCK 1146


>gi|336388523|gb|EGO29667.1| hypothetical protein SERLADRAFT_445454 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1163

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D    RC VCR   +  L  C+ CK+V YC    QK+ WK+HK +CK
Sbjct: 1115 RDPAAYRCFVCRGKGKPKLKMCTVCKKVRYCSSGCQKKDWKIHKLRCK 1162


>gi|440799318|gb|ELR20373.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 362

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%)

Query: 10  EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
           E++  CA C   A  +CSACK V YC   HQKEHW + HK +C+++
Sbjct: 107 EKVFFCAECGMRAEKRCSACKMVHYCTVTHQKEHWGRGHKEECEQI 152


>gi|326525078|dbj|BAK07809.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 372

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 27/50 (54%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C      +C +C  VAYC   HQ  HWK+HK +C +L  ++    +L ++
Sbjct: 13 CTSGTHRRCGSCGAVAYCSNDHQFIHWKVHKEECARLATQMSRIDMLSQF 62


>gi|224100385|ref|XP_002311856.1| predicted protein [Populus trichocarpa]
 gi|222851676|gb|EEE89223.1| predicted protein [Populus trichocarpa]
          Length = 391

 Score = 43.9 bits (102), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 28/54 (51%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          R   C  +A  +C  C  VAYC   HQ  HW  HK +C++L  ++K   +L  +
Sbjct: 8  RGTRCMGSARRRCGRCGAVAYCSVSHQMSHWNEHKEECERLEQQMKRVDVLNDF 61


>gi|358382303|gb|EHK19975.1| hypothetical protein TRIVIDRAFT_47985 [Trichoderma virens Gv29-8]
          Length = 448

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RCA C  T   L +C+ C  V YC ++HQ  HW LHK  C K+
Sbjct: 21 RCAHCLSTGPKLLRCTGCLAVHYCSREHQAAHWPLHKSTCIKI 63


>gi|383853834|ref|XP_003702427.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
           [Megachile rotundata]
          Length = 441

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 31/50 (62%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           ++  C +C E A ++CS CKE  YCG++ Q + W+ HK  C K+   +K+
Sbjct: 388 DVKECFLCHEEAKNRCSKCKEAWYCGRECQVKDWENHKHICNKITESVKN 437


>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
          Length = 464

 Score = 43.9 bits (102), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 79/208 (37%), Gaps = 39/208 (18%)

Query: 124 CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ---- 179
           CSA C   +D   H PECA LK            +   ++P   ++   R  +K +    
Sbjct: 93  CSAACQK-ADWKDHKPECAALK------------RISPVVPATFVMFLARILRKMERNTG 139

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLR------SNFLEKLPGDVLDDTSAKCLHWIC 233
           E+  ++ HM       +    + +I+++LR         L K    VL  TSA       
Sbjct: 140 EMDVLQLHMPGEPSDPQQQRGLFAILEHLRHFLPDAEKHLLKSAYPVLRITSANSF---- 195

Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------ISTM 286
           G+  V G ++G    GLY  +  + H+C PN   S          R  H       ++  
Sbjct: 196 GISGVEGNNLG---VGLYDTVSYINHSCAPNC--SITFSGVYARVRSVHDLPPNQELTIA 250

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y +       RR HL     F+C C RC
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRC 278


>gi|397595780|gb|EJK56578.1| hypothetical protein THAOC_23507, partial [Thalassiosira oceanica]
          Length = 564

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 27/47 (57%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L GR L+
Sbjct: 384 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGRGLE 430


>gi|392559211|gb|EIW52396.1| hypothetical protein TRAVEDRAFT_75375 [Trametes versicolor
           FP-101664 SS1]
          Length = 178

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 22/114 (19%)

Query: 13  MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
           M C VC E A   CSACK   YC +  QK +WK HK  C+           + + L    
Sbjct: 3   MPCEVCYEDATRVCSACKYTRYCSEACQKANWKTHKKGCE-----------IQQMLNRMN 51

Query: 73  DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSA 126
           + H     AR +P             C GC+     +   +  CP C + AC +
Sbjct: 52  EEHAAAPRARPNPK-----------RCTGCNVRFTEDYPCDGECPDCGYVACES 94


>gi|324503373|gb|ADY41469.1| Egl nine 1 [Ascaris suum]
          Length = 727

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           L +C  C  VAYCG++HQK  WK HKP CK +  ++
Sbjct: 146 LLRCQHCDMVAYCGEEHQKFDWKRHKPLCKTVQSQL 181


>gi|194762120|ref|XP_001963206.1| GF19729 [Drosophila ananassae]
 gi|190616903|gb|EDV32427.1| GF19729 [Drosophila ananassae]
          Length = 186

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   + GR L     +  GD +  E P   GPK   +  +CL C++ L      +
Sbjct: 2   LKARLEQDEVFGRCLAVAEPVQRGDLVLEELPFAQGPKRD-SGIVCLACYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C+A C+  SD   H+ EC + 
Sbjct: 61  SLDRCEICDWPLCAA-CAIGSDVTDHSGECEVF 92


>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
 gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
          Length = 614

 Score = 43.9 bits (102), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 50/237 (21%), Positives = 91/237 (38%), Gaps = 33/237 (13%)

Query: 249 GLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAI 303
           GL+  + ++ H+C PN   A   ++M  +    + +   +   Y N     + R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 304 TKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCWILPVNPLDND----SDWTCGS 357
           TK+F+C+C+RC +P  T +  +     C   H        L    L  D    + W C  
Sbjct: 356 TKHFDCTCDRCVEPLNTSIDRFLEGCVC---HVRGCGGVFLRTAALHEDQASSTTWECDL 412

Query: 358 CSARLNARDVHL------VTSQLGEQVDKLV------QENPNVKSLEEMLTKLEAMFHPH 405
           CS  L+   +        V ++  E++   V      +     K LE  L++     HP 
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472

Query: 406 HYHCYAVKHSLIQ----LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLG 458
           H   +     L+     +   + G    +L    +E+ +     P+L+  +   CLG
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLP---GPSLELANFYFCLG 526


>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
          Length = 584

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 72/322 (22%), Positives = 127/322 (39%), Gaps = 49/322 (15%)

Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAIL-----KLG----CETLLAYNDYKYEAILPL 165
           C RC F   CS RC+          EC +L      LG    CE  +  +D+   A   +
Sbjct: 211 CRRCQFVSYCSDRCASCHGRQHEEYECRLLFRLREMLGSMRSCEAAVP-DDFFTMATHCI 269

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
             +   +   + ++ +  +E+H  +   G T +   +  +       F+ ++ G V  + 
Sbjct: 270 TTVSGVKMRKEGHEAVLRLESHEVEVSQGLTPLVRLVQDLFSGEDPTFVTRILGVVRCNA 329

Query: 225 SAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
            A C        + +G+ +G+      SV     H+CLPN A  +  + +   +     +
Sbjct: 330 LAVC--------DASGLPVGQALHA-ASVTSYFNHSCLPNCAIEAGAIVTTRAIRPGEEL 380

Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCW 340
           +  Y   L W    RRE LA   +F+C C RC D   +   +  SA   L   +++   +
Sbjct: 381 TISYLPQLYWPAWLRREELAERYFFDCRCVRCGDGDRSPFESALSATLRLGGSREKEREY 440

Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
           I  V  L       CG    R+ A+DV  V   +G++            +L  +L +   
Sbjct: 441 ISSVQIL-------CG----RVRAKDVGDV--DVGDR-----------DALLHLLQECRQ 476

Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
              P HY C+ + ++L  +Y  
Sbjct: 477 HLFPFHYLCHELHNTLSFVYAV 498


>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
 gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
          Length = 429

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          L KCSAC++VAYC  + Q+  WKLHK +CK +
Sbjct: 36 LKKCSACRKVAYCSAECQRADWKLHKRECKAI 67


>gi|209879507|ref|XP_002141194.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
 gi|209556800|gb|EEA06845.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
          Length = 232

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 27/47 (57%), Gaps = 2/47 (4%)

Query: 8   EDEELMRCAVCRETAL--HKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           E  +L RC  CR T+L   +C  CK+V YC    QKE WK HK  CK
Sbjct: 186 EKADLTRCENCRITSLPLMRCGRCKKVLYCSVTCQKEDWKYHKRICK 232


>gi|357122838|ref|XP_003563121.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
          [Brachypodium distachyon]
          Length = 372

 Score = 43.9 bits (102), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 28/50 (56%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C   A  +C +C  VAYC + HQ  HW LHK +C +L  ++    +L ++
Sbjct: 13 CAGGAHRRCGSCGAVAYCSRAHQFIHWTLHKEECARLATQMSHIDVLSQF 62


>gi|397571596|gb|EJK47869.1| hypothetical protein THAOC_33384 [Thalassiosira oceanica]
          Length = 1255

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 32/60 (53%), Gaps = 5/60 (8%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          CA C + A     L  C+AC+ V YCG   QK H KLHK  CKK   E+K   L  + L+
Sbjct: 18 CANCGKFASNTVKLKDCTACRLVKYCGVDCQKAHRKLHKKACKKRAAELKDEQLYSQGLE 77


>gi|389745230|gb|EIM86411.1| hypothetical protein STEHIDRAFT_121370 [Stereum hirsutum FP-91666
           SS1]
          Length = 307

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 23/39 (58%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           C   +E  L  CS CK V YC K+ QKEHW  HK  CKK
Sbjct: 263 CEAIKEKDLATCSRCKLVYYCSKECQKEHWPRHKIWCKK 301


>gi|170036329|ref|XP_001846017.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167878894|gb|EDS42277.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 574

 Score = 43.9 bits (102), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 7/98 (7%)

Query: 53  KLPY-----EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLN 107
           KLP+      +K  P  GR L A  D + GD I  E P+++  +  L   +C  C     
Sbjct: 189 KLPFVAKGIAMKYYPEFGRGLVAERDFNTGDVILDEKPMLMAVEHGLRYSVCSYCTASFR 248

Query: 108 PNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 145
            +L     C  C +  CS +C  +    TH  EC I++
Sbjct: 249 KSLIPCPNCVSCMY--CSEKCLEMDHRFTHRFECGIME 284


>gi|391224601|gb|AFM37574.1| HIF prolyl hydroxylase [Trichoplax adhaerens]
          Length = 300

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 15 CAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
          C++C +  +  KCS C+   YC K+HQ+ HWK+H+  C             G  L+ T  
Sbjct: 26 CSLCFKKLISVKCSLCQSAYYCSKEHQQSHWKIHRENCMDERRPKIGCDKTGLALEKTWQ 85

Query: 74 LHPGDRIA 81
          L   +R++
Sbjct: 86 LQKNERLS 93


>gi|170086215|ref|XP_001874331.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651883|gb|EDR16123.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 98

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)

Query: 8  EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           D +  +C VCR      + +C+ C++V YC  + QK+ WK HKPKCK
Sbjct: 50 RDPDARKCRVCRGKGKPRIKECAGCQKVRYCSPECQKKDWKAHKPKCK 97


>gi|392562241|gb|EIW55422.1| hypothetical protein TRAVEDRAFT_131337 [Trametes versicolor FP-101664
            SS1]
          Length = 1177

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 10   EELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
            E   RCA C   +  + KCS CK V+YC +  QK  WK HKP C
Sbjct: 1129 EHATRCAKCSWPDNKMRKCSRCKTVSYCSEGCQKADWKKHKPAC 1172


>gi|307203234|gb|EFN82389.1| Egl nine-like protein 1 [Harpegnathos saltator]
          Length = 473

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          C VC  T  L +CS CK V YC K+HQ+  WK H+  C   P
Sbjct: 24 CVVCERTDKLLRCSRCKAVFYCTKEHQRSDWKRHRDFCATHP 65


>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
 gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
          Length = 486

 Score = 43.9 bits (102), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 6/84 (7%)

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNAL 291
           V   ++ + +  +Y +   + H+C PN ++    + +L VF    I       T Y N L
Sbjct: 324 VGKFNLNQISGQIYPIYSFINHDCEPNVRYEIDDKLRLKVFARKPIKAGEELLTTYVNPL 383

Query: 292 WGTQPRREHLAITKYFNCSCERCS 315
            G + RR  L +   F C CERC+
Sbjct: 384 HGVKLRRRELRVNWGFLCQCERCT 407


>gi|330944524|ref|XP_003306390.1| hypothetical protein PTT_19525 [Pyrenophora teres f. teres 0-1]
 gi|311316122|gb|EFQ85513.1| hypothetical protein PTT_19525 [Pyrenophora teres f. teres 0-1]
          Length = 414

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 1/41 (2%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C +C +     KC  CK VAYCG++HQ++ W  HK +C++L
Sbjct: 369 CKICGKYEGARKCVRCKTVAYCGEEHQRQDWPSHKRECRQL 409


>gi|383849148|ref|XP_003700208.1| PREDICTED: egl nine homolog 1-like [Megachile rotundata]
          Length = 486

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          C VC  T  L +CS CK V YC K+HQ+  WK HK  C
Sbjct: 24 CVVCNRTDKLLRCSRCKAVFYCTKEHQRRDWKRHKEFC 61


>gi|330792277|ref|XP_003284216.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
 gi|325085913|gb|EGC39312.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
          Length = 507

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 42/170 (24%), Positives = 81/170 (47%), Gaps = 20/170 (11%)

Query: 247 TQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTH----ISTMYTNALWGTQPRREHL 301
           T GLY ++    H+C PN +  +N  + K+   RD +    +   Y+ ++  +  RRE L
Sbjct: 228 TIGLYPLMVFFNHSCKPNLSILNNKKELKIICNRDINENEELFINYSPSICYSNERREML 287

Query: 302 AITKYFNCSCERC-SDPTE--LGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
               +F+C CE C  D  E  +  Y    KC   +K + + W       + +    C +C
Sbjct: 288 KQCFFFDCKCELCLKDELEKSMDLYILCNKC---NKGRVNIWY----DENKNRIVKCLNC 340

Query: 359 SARLNARDV---HLVTSQLGEQVDKLVQENP--NVKSLEEMLTKLEAMFH 403
           S+  NA ++   +L+T++L + +D+ + +N   N+   + +L +    F+
Sbjct: 341 SSDHNADEILKKNLITTKLQKCLDQFINQNSELNIDEFKLLLNQYCKQFY 390


>gi|326426901|gb|EGD72471.1| hypothetical protein PTSG_11593 [Salpingoeca sp. ATCC 50818]
          Length = 457

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +CAVC E A  +CS CK   YC +Q Q EHW  HK  C  +
Sbjct: 408 KCAVCGEPATKRCSKCKNEWYCRRQCQVEHWPKHKKLCATI 448


>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
 gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
          Length = 391

 Score = 43.5 bits (101), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 69/282 (24%), Positives = 107/282 (37%), Gaps = 36/282 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAE---PICLGCHKPLNPNLADNARCPRC-F 120
           GR L +T  L  G  +  +SP+++   L L +      L C K      + +  CP C  
Sbjct: 20  GRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCFKTIQSASVSCPTCSH 79

Query: 121 WPACSARCSGLSDAHTHAP-ECAILKL--GCETLLAYN--DYKYEAILPLRCLILQRRSP 175
              CS  C   + A +H P  C  L     C+  L ++  + + +A   +    L   SP
Sbjct: 80  QRFCSPTCLSAALASSHTPWVCQSLSRLRDCQDFLQHHSVERQIQAQFLVAAYNLAFVSP 139

Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYL--------RSNFLEKLPGDVLDDT-SA 226
             +Q L  ++       P   + + + S++  L         S  LE +   V  D  +A
Sbjct: 140 SDFQILLSLQGRAEDEDPA--IVQSLHSVISSLCPPPPIEGFSFSLELIAALVAKDRFNA 197

Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
             L     + E NG        G+Y    L  H+CLPNA       +N   +   V R  
Sbjct: 198 FGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVDTNNSGNTDIVVRMI 257

Query: 282 H---------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           H         +S    N+ + T  RR+ L     F C C+RC
Sbjct: 258 HDVPQGREICLSYFPVNSNYST--RRKRLLEDYGFTCDCDRC 297


>gi|307109858|gb|EFN58095.1| hypothetical protein CHLNCDRAFT_142413 [Chlorella variabilis]
          Length = 318

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 9/99 (9%)

Query: 1  MASIIEIEDEELMRCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI- 58
          MA++ + E  + + CA C  T  L KCS C    +CG + QK +W  H+ +CK+  +   
Sbjct: 1  MAAMHQDELPDEVECANCGTTQNLLKCSRCHTAWFCGVKCQKAYWPFHRTQCKRNEFADA 60

Query: 59 --KSSPLLGRYL-----QATLDLHPGDRIARESPLIVGP 90
            +S P   R++     QA L     DR+ R      GP
Sbjct: 61 IEESEPKFARWMRKHGKQAVLKDDEVDRLERAGAAASGP 99


>gi|170113176|ref|XP_001887788.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164637149|gb|EDR01436.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 420

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 3/47 (6%)

Query: 9   DEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           D +  +C  CR  A   +  C+ C++V YC  + QK+ WK HKPKCK
Sbjct: 373 DPDAHKCRSCRRKAKPKIKVCAGCQKVRYCSSECQKKDWKAHKPKCK 419


>gi|168060010|ref|XP_001781992.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666565|gb|EDQ53216.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 388

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 16/47 (34%), Positives = 28/47 (59%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
          C   A  +C AC  + YC ++HQK HW  H   C ++ ++++ +P+L
Sbjct: 12 CSGPATLRCGACGAIRYCSRKHQKAHWDEHALVCSRMAHQMQLAPVL 58


>gi|405978745|gb|EKC43109.1| Egl nine-like protein 1 [Crassostrea gigas]
          Length = 319

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 3/60 (5%)

Query: 1  MASIIEIEDEE--LMRCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          MA +  + D     M C +C  +  L  CS C+   YC K+HQK  WK HK  CK++  E
Sbjct: 1  MAEVKSVRDRPNISMACQLCSSSERLMVCSGCRRTWYCSKEHQKLDWKYHKKNCKRMREE 60


>gi|189197141|ref|XP_001934908.1| conserved hypothetical protein [Pyrenophora tritici-repentis
          Pt-1C-BFP]
 gi|187980856|gb|EDU47482.1| conserved hypothetical protein [Pyrenophora tritici-repentis
          Pt-1C-BFP]
          Length = 240

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 20/39 (51%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          CA C      KCS CK + YC    Q   W LHKP CK+
Sbjct: 31 CATCHTPTPRKCSTCKNIRYCSATCQTTDWPLHKPVCKR 69


>gi|413943501|gb|AFW76150.1| putative ubiquitin carboxyl-terminal hydrolase superfamily protein
           [Zea mays]
          Length = 944

 Score = 43.5 bits (101), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           CA C      KCS CK V YC ++ Q +HW++ HK KCK++
Sbjct: 91  CAACGVVTSKKCSGCKRVRYCSQECQTKHWQIDHKFKCKQM 131


>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
           familiaris]
          Length = 428

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 63/283 (22%), Positives = 110/283 (38%), Gaps = 31/283 (10%)

Query: 46  LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLG 101
           +  PK +K      SS   G  L+A+  L PG+ + R  PL      G +  + +   LG
Sbjct: 1   MEPPKVEKF-----SSASRGNGLRASAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLG 55

Query: 102 CHKPLNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE 160
             K +        RC +C     CSA+C   +    H  EC  LK  C+     +  +  
Sbjct: 56  KEKLM--------RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLL 105

Query: 161 AILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE--VYEEIDSIVKYLRSNFLE--KL 216
             +  + +       +K     D+E++++K     +  + + + +   ++R    +  +L
Sbjct: 106 GRVVFKLMEETPSESEKLYSFYDLESNINKLTEDKKDGLRQLVMTFQHFMREEIQDASQL 165

Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKL 275
           P     D        IC    +   ++     GLY  + L+ H+C PN +   N     L
Sbjct: 166 PSSF--DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLL 223

Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
              RD      ++  Y + L  ++ RR+ L     F C C RC
Sbjct: 224 RAVRDIEAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRC 266


>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
          Length = 428

 Score = 43.5 bits (101), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 23/34 (67%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          T L KC+ C E  YC ++ QK  WK+HKP+CK +
Sbjct: 34 TRLKKCAGCNEFFYCSQECQKNDWKVHKPECKSI 67


>gi|398389124|ref|XP_003848023.1| hypothetical protein MYCGRDRAFT_50206 [Zymoseptoria tritici
          IPO323]
 gi|339467897|gb|EGP82999.1| hypothetical protein MYCGRDRAFT_50206 [Zymoseptoria tritici
          IPO323]
          Length = 543

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 27/53 (50%)

Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
          +E   CA C +    KC AC  V YCG++ Q  HWK HK  CK    +I   P
Sbjct: 2  KEGYECASCHQYGNLKCKACHLVVYCGRRCQTAHWKEHKKYCKSPFMKIDWQP 54


>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
 gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
          Length = 453

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C +ET + KCS C  V YCG+  QKE W  HK +C+KL
Sbjct: 30 RCDRCFKETKVMKCSNCLYVRYCGRSCQKEAWSDHKEECEKL 71


>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 382

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 12/120 (10%)

Query: 211 NFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN 264
           NFL+  P +    T     H + GVI  NG  +   R  Q    GL+  +CL+ H+C PN
Sbjct: 54  NFLDYWPRNSKQHTIDDISH-LFGVINCNGFTVSDQRGLQAVGVGLFPNLCLVNHDCWPN 112

Query: 265 ----AKHSNM-MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
                 H  + ++S   +     ++  Y + L  ++ RR  L    +F+C CE C + T+
Sbjct: 113 CTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCEHCRNRTK 172


>gi|345483923|ref|XP_001603005.2| PREDICTED: zinc finger MYND domain-containing protein 10-like
           [Nasonia vitripennis]
          Length = 333

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 2/50 (4%)

Query: 5   IEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +EI D +  +C  C+E A ++CS CKE  YCG++ Q + W  HK  CK +
Sbjct: 275 LEIVDSK--KCFRCKEVAKNRCSRCKEAWYCGRECQVKDWTNHKTICKNI 322


>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
 gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
          Length = 614

 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 50/237 (21%), Positives = 91/237 (38%), Gaps = 33/237 (13%)

Query: 249 GLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAI 303
           GL+  + ++ H+C PN   A   ++M  +    + +   +   Y N     + R+  L  
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355

Query: 304 TKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCWILPVNPLDND----SDWTCGS 357
           TK+F+C+C+RC +P  T +  +     C   H        L    L  D    + W C  
Sbjct: 356 TKHFDCTCDRCVEPLNTSIDRFLEGCVC---HVRGCGGVFLRTAALHEDQASSTTWECDL 412

Query: 358 CSARLNARDVHL------VTSQLGEQVDKLV------QENPNVKSLEEMLTKLEAMFHPH 405
           CS  L+   +        V ++  E++   V      +     K LE  L++     HP 
Sbjct: 413 CSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472

Query: 406 HYHCYAVKHSLIQ----LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLG 458
           H   +     L+     +   + G    +L    +E+ +     P+L+  +   CLG
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLP---GPSLELANFYFCLG 526


>gi|323452529|gb|EGB08403.1| hypothetical protein AURANDRAFT_64147 [Aureococcus anophagefferens]
          Length = 1455

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 23/40 (57%), Gaps = 1/40 (2%)

Query: 15   CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
            CAVC   A L  C+ C    YC K HQ+ HWK H+  CKK
Sbjct: 1410 CAVCGVRANLQACARCGAAFYCSKAHQESHWKEHRRDCKK 1449


>gi|169600053|ref|XP_001793449.1| hypothetical protein SNOG_02856 [Phaeosphaeria nodorum SN15]
 gi|111068467|gb|EAT89587.1| hypothetical protein SNOG_02856 [Phaeosphaeria nodorum SN15]
          Length = 414

 Score = 43.5 bits (101), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 24/45 (53%), Gaps = 2/45 (4%)

Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          EEL  CA+C    +H CS C  + YC K  QK  W LHK  CK  
Sbjct: 8  EEL--CAMCNNMGVHACSGCHSIRYCSKLCQKTDWSLHKLLCKSF 50


>gi|195454531|ref|XP_002074282.1| GK18437 [Drosophila willistoni]
 gi|194170367|gb|EDW85268.1| GK18437 [Drosophila willistoni]
          Length = 129

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   +++  + GR L  T  +H G+ +  E P   GPK   +  +CLGC+  L       
Sbjct: 2   LKARLENDSVFGRCLATTEQVHRGEIVVEELPFAYGPKRE-SGIVCLGCYCYLQFGEDGE 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILK 145
           +  RC  C WP C A C+     + H  EC I  
Sbjct: 61  SLDRCELCDWPLC-AMCASNQYENDHQGECVIFS 93


>gi|330928344|ref|XP_003302230.1| hypothetical protein PTT_13958 [Pyrenophora teres f. teres 0-1]
 gi|311322550|gb|EFQ89675.1| hypothetical protein PTT_13958 [Pyrenophora teres f. teres 0-1]
          Length = 249

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 7/48 (14%)

Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          +CA C++TA       L +C+ CK + YCG+  QK H+K HK  C +L
Sbjct: 3  QCAKCKKTAADCGVASLKRCAKCKSIVYCGRDCQKAHYKAHKAACNQL 50


>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
 gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 521

 Score = 43.5 bits (101), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 72/318 (22%), Positives = 113/318 (35%), Gaps = 66/318 (20%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGC--HKPLNPNL-- 110
           P  + S P+ GRYL AT DL     I R+ P       A  + +C  C    PLN  +  
Sbjct: 123 PIHVYSHPINGRYLVATKDLDEQTVILRDLPYTWAVDHATCDSVCQHCFLEVPLNQQILP 182

Query: 111 ADNARCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI 169
            D   C  C     CSA C  + D   H  EC I K         +  +Y   L     +
Sbjct: 183 TDFYMCEGCQRVGYCSANCRCI-DYSQHRFECQIFK-------ELDTEEYSPFLMSEIKL 234

Query: 170 LQRRSPKKYQELK-------DMEAHMSKRGPGTEVYEEIDSIV---KYLRSNFLEKLPGD 219
           L R   +K+ E         D+     K+      Y+   S++     LR N   +L  +
Sbjct: 235 LVRTLSRKWLEDSITQTAGIDINDETIKKQNTYNQYKNPQSLIPQDNGLRYNDYAELVSN 294

Query: 220 V--LDDTSAKCL-HWICGVIEVNGVDIGR-------------------YTQ--------- 248
           V   +++  + L +WIC  +      +G+                   Y Q         
Sbjct: 295 VENYNESLKESLSYWICKYVVKLSAKLGKIEDEFDLLNILLRNRCNAFYIQGRPRDGSSG 354

Query: 249 -----GLYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDT----HISTMYTNALWGTQP 296
                G+Y       H+C PN  +   +N ++ +  + ++      ++  Y +       
Sbjct: 355 ESRGCGVYVRNSFFNHSCDPNVNYWVVNNTLEVECTLLKNVKEGDELTISYIDTTSPLNK 414

Query: 297 RREHLAITKYFNCSCERC 314
           RRE L     FNC C +C
Sbjct: 415 RREKLLEGYLFNCLCTKC 432


>gi|389749892|gb|EIM91063.1| hypothetical protein STEHIDRAFT_91033 [Stereum hirsutum FP-91666
           SS1]
          Length = 739

 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 3/55 (5%)

Query: 2   ASIIEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKK 53
           A  ++++  EL RC+ CR     L +C  C    YCG   Q EHWK  HK  CKK
Sbjct: 673 ARSVDMKAVELYRCSWCRNPSAVLKRCKGCGGTTYCGASCQMEHWKTGHKKVCKK 727


>gi|397616553|gb|EJK64030.1| hypothetical protein THAOC_15280 [Thalassiosira oceanica]
          Length = 323

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 3  SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D+    CA C +       L  C+AC+ V YCG   QK H K HK  CK+   E
Sbjct: 2  SCVPVVDDGAEVCANCGKQGSDTVKLRNCTACRLVKYCGVDCQKAHRKQHKNACKRRAAE 61

Query: 58 IKSSPLLGRYLQ 69
          +K   L  R L+
Sbjct: 62 LKDEQLYSRGLE 73


>gi|443702950|gb|ELU00773.1| hypothetical protein CAPTEDRAFT_224225 [Capitella teleta]
          Length = 1175

 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 6/56 (10%)

Query: 14  RCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
           RC+ CR        L +CS C + AYC +Q Q + W  HKP CK +P E+   P +
Sbjct: 674 RCSCCRRDCTANMQLKRCSKCFKTAYCDRQCQTQDWVRHKPNCKFVP-ELIGQPFM 728


>gi|321474624|gb|EFX85589.1| hypothetical protein DAPPUDRAFT_237743 [Daphnia pulex]
          Length = 908

 Score = 43.1 bits (100), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 15  CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C VC++    L +CS C+ +AYC  + Q  HW +HK  CKKL
Sbjct: 867 CTVCKKIPEKLKRCSRCRYIAYCSVECQHSHWPIHKAICKKL 908


>gi|403413554|emb|CCM00254.1| predicted protein [Fibroporia radiculosa]
          Length = 336

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 4/43 (9%)

Query: 15  CAVCRE---TALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
           CA C      AL  CS CK++ YC  QHQK HWK +HK +C +
Sbjct: 290 CAACGNPDAKALKVCSGCKQIMYCSPQHQKAHWKTVHKNQCSR 332


>gi|410904557|ref|XP_003965758.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Takifugu rubripes]
          Length = 419

 Score = 43.1 bits (100), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E    K CSACK V YC K  QK HW  HK  CKKL
Sbjct: 318 CTSCGEKGAQKRCSACKMVVYCDKACQKLHWFTHKKVCKKL 358


>gi|325182386|emb|CCA16839.1| ubiquitinspecific protease putative [Albugo laibachii Nc14]
          Length = 1031

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 2/49 (4%)

Query: 8   EDEELMRCAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           E  +L  C VCR T  A+ +C  C+ V YC    Q  HWK H+  C KL
Sbjct: 293 EGVQLKSCIVCRSTSEAVKRCGRCRRVYYCTVDCQGSHWKYHRTVCSKL 341


>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
          Length = 358

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 73/290 (25%), Positives = 111/290 (38%), Gaps = 39/290 (13%)

Query: 61  SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA---EPIC--LGCHKPLNPNLADNAR 115
           +PL GR L     +  G+ +  + P  V P +A+    + IC  LGC + ++ + A++  
Sbjct: 59  NPLKGRTLSTNGPIRAGELLMADLPYAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118

Query: 116 CPR-----CFWPACSARCSGLSDAHTHAPECAILKLGCETLLA-YNDYKYEAILPLRCLI 169
           C        +W  C+  C    D   H  ECA LK     L     D+ Y  +  +  L+
Sbjct: 119 CETGCSLDIWW--CNESCKD-EDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLL 175

Query: 170 LQRRSPKKYQEL---KDMEAHMSKRG--PGTEVYEEIDSIVKYLRSNFL--EKLPGDVLD 222
             R S  K   L   K ++   S R   P +++        KYL  + L  E L  D L 
Sbjct: 176 AARNSHSKIASLAAGKGVQLLRSNRDSWPTSQLQHWSTLAEKYLLQSSLLPETLDLDTLV 235

Query: 223 D----TSAKCLHWICGVIEV-----NGVDIG-RYTQGLYSVICLMEHNCLPNAKHSNMMQ 272
           D              G  E+      GV+ G +Y   ++  I L  H+C PN  H    +
Sbjct: 236 DLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSCAPNVTHQADDR 295

Query: 273 SKLFV--FRDTH------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
            ++ V   RD         S    +     Q RR+       F C+C RC
Sbjct: 296 GRMMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTFTCTCPRC 345


>gi|393232939|gb|EJD40515.1| hypothetical protein AURDEDRAFT_69933 [Auricularia delicata
           TFB-10046 SS5]
          Length = 351

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 23/34 (67%)

Query: 20  ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
            + L +CS CKEV YC +QHQK  W +HK  C+K
Sbjct: 317 SSGLIRCSRCKEVKYCSEQHQKYDWGIHKLVCEK 350


>gi|390598125|gb|EIN07524.1| hypothetical protein PUNSTDRAFT_136202 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 214

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%)

Query: 1   MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +A+    E+E+        +T+L +CS CK   YCGK  Q+E W++HK  CK L
Sbjct: 141 LAACASCEEEKTADINTPGKTSLMRCSGCKVTRYCGKACQREDWRVHKEFCKIL 194


>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
           africana]
          Length = 428

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 62/268 (23%), Positives = 103/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A+  L PGD + R  PL      G + A+ +   LG  K L        RC +C 
Sbjct: 15  GNGLRASAALRPGDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLL--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C      +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-NCRPRYPPDSVRLLGRVVCKLMEGTPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++++K      + E+    ++ L   F + +  ++ D        D        
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLAMTFQDFMREEIQDASQLPPSFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD  +    +  
Sbjct: 179 ICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|260818611|ref|XP_002604476.1| hypothetical protein BRAFLDRAFT_280521 [Branchiostoma floridae]
 gi|229289803|gb|EEN60487.1| hypothetical protein BRAFLDRAFT_280521 [Branchiostoma floridae]
          Length = 379

 Score = 43.1 bits (100), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 20/33 (60%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          L +C  C+   YC K+HQ+ HWK HK  CK  P
Sbjct: 4  LLRCGGCRTAYYCSKEHQRTHWKKHKRVCKSKP 36


>gi|302692318|ref|XP_003035838.1| hypothetical protein SCHCODRAFT_84444 [Schizophyllum commune H4-8]
 gi|300109534|gb|EFJ00936.1| hypothetical protein SCHCODRAFT_84444 [Schizophyllum commune H4-8]
          Length = 530

 Score = 43.1 bits (100), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C VC +    +C+ C  VAYCG + Q+ HWK HK  C+ +
Sbjct: 355 CVVCGKRTTSRCTGCLAVAYCGPECQRAHWKDHKSFCRAI 394


>gi|310800311|gb|EFQ35204.1| MYND finger [Glomerella graminicola M1.001]
          Length = 344

 Score = 43.1 bits (100), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 34/65 (52%), Gaps = 7/65 (10%)

Query: 14 RCAVCR-ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
          +CAVC   T L +C+ C+ V YCG+ HQ  H K HK  C ++  E +      RYLQ   
Sbjct: 13 QCAVCHSRTKLSRCAGCQVVYYCGRDHQVRHRKAHKSDCNQIK-EYRE-----RYLQEEN 66

Query: 73 DLHPG 77
           L  G
Sbjct: 67 ALRNG 71


>gi|328771148|gb|EGF81188.1| hypothetical protein BATDEDRAFT_87837 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 391

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 2/85 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKKLPYEIKSSPLLGRYLQATL 72
           C +C      KCSACK V YC K+H    W++  HK  C     + + + +  + +    
Sbjct: 157 CHLCGLKGSLKCSACKNVYYCSKEHSVLDWQIGGHKEHCGNSISDDQIAIVRAKAIFPEF 216

Query: 73  DLHPGDRIARESPLIVGPKLALAEP 97
           +L   D +  E P  V   L L EP
Sbjct: 217 ELQEEDEMEFELPATVTDSLVLEEP 241


>gi|350412393|ref|XP_003489630.1| PREDICTED: egl nine homolog 1-like [Bombus impatiens]
          Length = 486

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          C +C  T  L +CS CK V YC K+HQ+  WK HK  C
Sbjct: 24 CVICNRTDKLLRCSRCKAVFYCTKEHQRRDWKRHKEFC 61


>gi|221505568|gb|EEE31213.1| hypothetical protein TGVEG_082100 [Toxoplasma gondii VEG]
          Length = 858

 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 9/89 (10%)

Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
           +Y +     H+CLPNA       H+ +++S+  +     ++  Y    + +  T  RR+ 
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRSRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
           L+ TK F C+CERCS P +    F    C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256


>gi|397635097|gb|EJK71718.1| hypothetical protein THAOC_06815 [Thalassiosira oceanica]
          Length = 300

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 6/73 (8%)

Query: 3  SIIEIEDEELMRCAVCRE------TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
          S I ++D     CA C +        L  C+AC+ V YCG   Q+ H KLHK  CK+   
Sbjct: 2  SCIAVDDGSTDVCANCGKGCEGGAIKLKSCTACRLVKYCGVDCQRAHRKLHKKACKQRVA 61

Query: 57 EIKSSPLLGRYLQ 69
          E+K   L  + L+
Sbjct: 62 ELKDEELYSQGLE 74


>gi|397579542|gb|EJK51248.1| hypothetical protein THAOC_29597 [Thalassiosira oceanica]
          Length = 365

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 5/73 (6%)

Query: 1   MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M+  + + D+    CA C +       L  C+AC+ V YC    Q+ H K HK  CK+  
Sbjct: 65  MSCFVSVADDGYEACANCGKQGSEAVKLRNCTACRLVKYCSVDCQRAHRKQHKKACKQRA 124

Query: 56  YEIKSSPLLGRYL 68
            E+K   L G+ L
Sbjct: 125 AELKDEQLYGQGL 137


>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
           [Anolis carolinensis]
          Length = 472

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 106/280 (37%), Gaps = 43/280 (15%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+  +   GR L+A  +   GD I  E         +L   +C  C K      A   RC
Sbjct: 5   EVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFK----RHAKLHRC 60

Query: 117 PRC-FWPACSARCSGLSDAHT-HAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
            +C F   C   C    DA   H  ECA +K        +     E I L  R +    R
Sbjct: 61  GQCKFAHYCDRTCQ--KDAWVNHKNECAAIK-------KHGKAPNENIRLAARIMWRIER 111

Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                 E     +  ++ H+   G      EE    ++    +FLE  P D     + + 
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFG------EEEKKELRLDLESFLEFWPHDG-KQFAMQY 164

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-------AKHSNMMQSKL 275
           +  I GVI  NG  +   R  Q    G++  +CL+ H+C PN        K       K+
Sbjct: 165 ISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGKIELRALGKI 224

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
            +  +  +S  Y + L   + RR+ L    YF+C+CE C 
Sbjct: 225 SIGEELTVS--YVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 262


>gi|115472083|ref|NP_001059640.1| Os07g0481000 [Oryza sativa Japonica Group]
 gi|113611176|dbj|BAF21554.1| Os07g0481000 [Oryza sativa Japonica Group]
 gi|215701040|dbj|BAG92464.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222637030|gb|EEE67162.1| hypothetical protein OsJ_24245 [Oryza sativa Japonica Group]
          Length = 380

 Score = 43.1 bits (100), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 26/50 (52%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C   A  +C  C  VAYC + HQ  HW  HK +C +L  ++    +L ++
Sbjct: 13 CAGEARRRCGGCGAVAYCSRAHQTVHWGFHKEECARLAEQMSRIDMLKQF 62


>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
 gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
 gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
          Length = 429

 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          L KCSAC+ +AYC ++ Q+  WKLHK +CK +
Sbjct: 36 LKKCSACRRLAYCSQECQRADWKLHKVECKAI 67


>gi|340715205|ref|XP_003396109.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
           isoform 1 [Bombus terrestris]
 gi|340715207|ref|XP_003396110.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
           isoform 2 [Bombus terrestris]
          Length = 440

 Score = 43.1 bits (100), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 27/44 (61%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           ++  C +C E A  +CS CKEV YCG++ Q + W  HK  C K+
Sbjct: 388 DVKECPLCHEEAKKRCSKCKEVWYCGRECQVKDWVKHKNICDKI 431


>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
          Length = 472

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 39/289 (13%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+ +S   GR L+AT +    D I  E         +L   +C  C K          RC
Sbjct: 5   EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQEKLHRC 60

Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
            +C F   C   C    DA   H  ECA +K        Y     E I L  R +    R
Sbjct: 61  GQCKFAHYCDRTCQ--KDAWLNHKNECAAIK-------KYGKVPNENIRLAARIMWRVER 111

Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                 E     + D++ H+   G      EE    ++     FL+  P       S + 
Sbjct: 112 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 164

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
           +  I GVI  NG  +   R  Q    G++  + L+ H+C PN     ++  ++ +    +
Sbjct: 165 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 224

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
                ++  Y + L  ++ RR  L    YF+CSCE C    +   + +A
Sbjct: 225 SEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAA 273


>gi|403411611|emb|CCL98311.1| predicted protein [Fibroporia radiculosa]
          Length = 675

 Score = 42.7 bits (99), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 16  AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           ++C      +C+ACK  +YC K+ QK+HWK H+  CK 
Sbjct: 112 SICPREGSMRCAACKLTSYCSKECQKDHWKTHRRDCKN 149


>gi|390599653|gb|EIN09049.1| hypothetical protein PUNSTDRAFT_102596, partial [Punctularia
           strigosozonata HHB-11173 SS5]
          Length = 722

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)

Query: 5   IEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           + I+   L  C+ C     AL KC  C +  YC    QK HWK HKP C++
Sbjct: 670 VTIDSVSLYLCSFCENPSAALKKCGGCGKTRYCDAACQKGHWKAHKPLCQR 720


>gi|348689068|gb|EGZ28882.1| hypothetical protein PHYSODRAFT_353603 [Phytophthora sojae]
          Length = 175

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 10  EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCK 52
           E  +RC VC+  +   CS CK V YCG + QK+ WK  HK KCK
Sbjct: 128 EAALRCRVCKAPSTLVCSRCKVVRYCGAECQKQDWKASHKQKCK 171


>gi|297825441|ref|XP_002880603.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
           lyrata subsp. lyrata]
 gi|297326442|gb|EFH56862.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 650

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 2/84 (2%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQ 69
           + + C+VC +T+  KCS CK V YC    Q   WK  HK KCK       SSP+    + 
Sbjct: 60  DTVECSVCGKTSTKKCSRCKSVRYCSADCQTSDWKSGHKLKCKVF-RSTDSSPVGRDDID 118

Query: 70  ATLDLHPGDRIARESPLIVGPKLA 93
               L      +++S + + P+L+
Sbjct: 119 YKASLFGNMSASKKSKVALVPQLS 142


>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 425

 Score = 42.7 bits (99), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 5/104 (4%)

Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFRDT----HISTMYT 288
           G   +NGV       GL+    +  H+C PN   S   Q   + V +D      ++  Y 
Sbjct: 184 GFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKDVKEGQELTISYV 243

Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
             L  T+ R++ L  + +F C C RC+  T    Y   ++C N+
Sbjct: 244 EVLQSTKKRQKILKDSYFFECQCSRCTTETTDDWYLDGLQCGNK 287


>gi|302676301|ref|XP_003027834.1| hypothetical protein SCHCODRAFT_237843 [Schizophyllum commune
          H4-8]
 gi|300101521|gb|EFI92931.1| hypothetical protein SCHCODRAFT_237843 [Schizophyllum commune
          H4-8]
          Length = 224

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 24/51 (47%)

Query: 1  MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          M++ + I D    RC VC      +CS C    YC K H    WK HK +C
Sbjct: 1  MSAAVPIPDGFETRCGVCAAPTSMRCSRCGSAYYCSKDHSAMDWKYHKKQC 51


>gi|156552215|ref|XP_001606321.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Nasonia vitripennis]
          Length = 411

 Score = 42.7 bits (99), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C  E A  KCS CK+V YC ++ Q+ HW +HK  C +L
Sbjct: 339 CVTCGEEKATKKCSKCKQVQYCDRECQRLHWFIHKKTCSRL 379


>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 541

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 61/267 (22%), Positives = 107/267 (40%), Gaps = 35/267 (13%)

Query: 65  GRYLQATLDLHPGDRIAR-ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW-- 121
           GR L+A  D  PG  +A  + PL+  P  A ++  C  C       L  NA+   C    
Sbjct: 25  GRCLRAARDYKPGQLMAVFDDPLVAFPDAAGSKTTCHHC-------LVHNAKVFGCTGCD 77

Query: 122 ---PACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI-LQRRSPKK 177
                CS  C  ++    H+ EC + +   +  ++ N        P+R L+ +  R  + 
Sbjct: 78  KAVSYCSDECQKVNWKLIHSKECKVFR-KVQAAVSKNWLP----TPVRALVQILLRLSEV 132

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
           ++ +  +E H  K     E++E +      L++       G   DD +      I   I+
Sbjct: 133 HEVVTGLEGHGDKFRARKELWENMK-----LQAYAGIHYAGRKEDDANLNLAAEILCKIQ 187

Query: 238 VNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMY 287
            N       D G+    L  ++ ++ H+C+PNA     K    ++++  V     I+  Y
Sbjct: 188 TNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLRAETPVKAGEDITISY 247

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            +       RR+ L +  +F C C RC
Sbjct: 248 IDYTKPLSFRRQDLELY-HFQCGCLRC 273


>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
          Length = 437

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 14/102 (13%)

Query: 240 GVDIGRYTQGLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGT 294
            V IG Y +G      L  H+C PN   +     MQ ++   +  D  ++  Y   L  T
Sbjct: 196 AVGIGLYPEG-----ALFNHDCDPNCVVSFKGREMQVRVVRDIEVDEELTVSYVELLQST 250

Query: 295 QPRREHLAITKYFNCSCERCSDPTELGT----YFSAMKCLNE 332
           + RR  L  + +F+C C+RC   T   +    Y   ++C N+
Sbjct: 251 KARRRELKESYFFDCECKRCKAATNGQSNEDWYLDGLRCSNK 292


>gi|256075329|ref|XP_002573972.1| hypothetical protein [Schistosoma mansoni]
 gi|360044826|emb|CCD82374.1| hypothetical protein Smp_135930 [Schistosoma mansoni]
          Length = 478

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           RC  C E A  +CS C++  YCG++ Q +HW  HK  C  L   I S
Sbjct: 427 RCPTCGEIASKRCSRCRQEWYCGRECQVKHWLKHKDACDLLTEAITS 473


>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
          Length = 433

 Score = 42.7 bits (99), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 58/265 (21%), Positives = 102/265 (38%), Gaps = 20/265 (7%)

Query: 61  SPLLGRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           SP  GR L+A      GD +    A  S L VG + +  E  C          L+   RC
Sbjct: 14  SPGKGRGLRALQPFQVGDLLFACPAYASVLTVGERGSHCEH-CFA----RKEGLSKCGRC 68

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
            + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I   R+P 
Sbjct: 69  KQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPS 125

Query: 177 KYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
           +    +K+ E+H+ K     E  + I S +  L   + + L     D          C  
Sbjct: 126 EMLLAVKEFESHLDKLD--NEKKDLIQSDIAMLHQFYSKHLEFPDHDSLVVLFAQVNCNG 183

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHISTMYTNA 290
             +   ++      ++  + LM H+C PN   +       +++   V     + T Y + 
Sbjct: 184 FTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEVLPGEEVFTSYIDL 243

Query: 291 LWGTQPRREHLAITKYFNCSCERCS 315
           L+ T+ R + L  + +F C C+ C+
Sbjct: 244 LYPTEDRNDRLRDSYFFTCECQECT 268


>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
           vitripennis]
          Length = 391

 Score = 42.7 bits (99), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 71/166 (42%), Gaps = 25/166 (15%)

Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
           Y++ KD+ +H +      +  E   S+ + L  +F+E   G  L +++   +  + G I 
Sbjct: 106 YRKFKDLMSHYTDIKNDPKRIEHFVSLCQVLE-DFME---GTTLPNSAE--ILGLYGRIC 159

Query: 238 VNGV-----DIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD---------THI 283
           VN       D+     G+Y    +++H+C PNA    + +    + R          + I
Sbjct: 160 VNSYNILDPDMNSIGVGIYLGPSVIDHSCKPNA--VAVFEGTTILIRALEDIPRLDWSQI 217

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
              Y + L  T  R   L  T YF C CERC DP    TY +A  C
Sbjct: 218 HISYIDVLNTTSTRCTELQNTYYFLCECERCKDPE---TYATAAIC 260


>gi|117650662|gb|ABK54276.1| Egln3 [Branchiostoma belcheri]
          Length = 804

 Score = 42.7 bits (99), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 7/67 (10%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK------KLPYEIKSSPLLGRY 67
           CAVC  ++ L +CS C+ V YC  +HQ ++WK HK  C+       +  + +S+   G  
Sbjct: 83  CAVCGAKSNLKRCSRCQGVWYCSSEHQSQNWKQHKKICRAKSAAQSVQSDTQSAAGKGHV 142

Query: 68  LQATLDL 74
            Q+ +DL
Sbjct: 143 SQSAVDL 149


>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
 gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
 gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
          Length = 477

 Score = 42.7 bits (99), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 39/289 (13%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+ +S   GR L+AT +    D I  E         +L   +C  C K          RC
Sbjct: 10  EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQEKLHRC 65

Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
            +C F   C   C    DA   H  ECA +K        Y     E I L  R +    R
Sbjct: 66  GQCKFAHYCDRTCQ--KDAWLNHKNECAAIK-------KYGKVPNENIRLAARIMWRVER 116

Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                 E     + D++ H+   G      EE    ++     FL+  P       S + 
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 169

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
           +  I GVI  NG  +   R  Q    G++  + L+ H+C PN     ++  ++ +    +
Sbjct: 170 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 229

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
                ++  Y + L  ++ RR  L    YF+CSCE C    +   + +A
Sbjct: 230 SEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAA 278


>gi|389604014|ref|XP_003723152.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322504894|emb|CBZ14681.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 623

 Score = 42.7 bits (99), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 46/205 (22%), Positives = 80/205 (39%), Gaps = 45/205 (21%)

Query: 229 LHWICGVIEVNGVDIGR-----YTQGLY---SVICLMEHNCLPN-----AKHSNMMQSKL 275
           L+ I GV+  N +++         Q L+   ++     H+C+PN      +H+ +    +
Sbjct: 370 LYIIIGVLRCNALEVTDPSGLGVAQALHVGNTIASFFNHSCMPNCAIDTVRHAIVTTRII 429

Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLN 331
           +V  +  I+ +     W T+ RRE L+   YF C C RC     DP EL         L 
Sbjct: 430 YVGEELSIAYI-PQLYWPTRLRRERLSEGYYFVCHCPRCQRCDKDPFELALSME----LP 484

Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
             +        P+          C +  +R+               V  + QE+ +   L
Sbjct: 485 AARRGATTHFHPI------VQTACATIRSRM---------------VGDVSQEDAD--KL 521

Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSL 416
            ++L+ L     P HY C+ V++ L
Sbjct: 522 RKLLSDLLKHLFPFHYLCHEVRNCL 546


>gi|307213312|gb|EFN88764.1| Ankyrin repeat and MYND domain-containing protein 2 [Harpegnathos
           saltator]
          Length = 406

 Score = 42.7 bits (99), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E  A+ KCS CK V YC ++ Q+ HW +HK  C +L
Sbjct: 335 CVTCGEDKAIKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375


>gi|427778123|gb|JAA54513.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 468

 Score = 42.7 bits (99), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           C  C E  A  +CSACK V YCG   QK HW  HK  C +L  E K +
Sbjct: 374 CTACGEPQADKRCSACKSVQYCGAPCQKLHWFTHKRHCARLAEEYKKA 421


>gi|242045648|ref|XP_002460695.1| hypothetical protein SORBIDRAFT_02g033310 [Sorghum bicolor]
 gi|241924072|gb|EER97216.1| hypothetical protein SORBIDRAFT_02g033310 [Sorghum bicolor]
          Length = 165

 Score = 42.7 bits (99), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 28/47 (59%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
           A  +C  C  VAYC + HQ  HW++HK +C++   +++   +L ++
Sbjct: 17 VADRRCGGCGAVAYCSRAHQIIHWRVHKEECERFAEQMRRVDVLNQF 63


>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Ailuropoda melanoleuca]
          Length = 477

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 65/287 (22%), Positives = 110/287 (38%), Gaps = 39/287 (13%)

Query: 46  LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLG 101
           +  PK +K      S+   G  L+A   L PG+ + R  PL      G +  + +   LG
Sbjct: 1   MEPPKVEKF-----STANRGNGLRALAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLG 55

Query: 102 CHKPLNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE 160
             K +        RC +C     CSA+C   +    H  EC  LK  C+     +  +  
Sbjct: 56  KEKLM--------RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLL 105

Query: 161 AILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK----- 215
             +  + +       +K     D+E++++K     E  E +  +V   +    E+     
Sbjct: 106 GRVVFKLMEETPSESEKLYSFYDLESNINKLT--EEKKEGLRQLVMTFQHFMREEIQDAS 163

Query: 216 ---LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMM 271
              LP D+ +  +      IC    +   ++     GLY  + L+ H+C PN +   N  
Sbjct: 164 QLPLPFDIFEAFA----KVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGP 219

Query: 272 QSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
              L   RD      ++  Y + L  ++ RR+ L     F C C RC
Sbjct: 220 HLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
 gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
          Length = 429

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 109/281 (38%), Gaps = 27/281 (9%)

Query: 56  YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
           +E  +S   G  L+AT ++H G+ +    PL        A+ +C  C       L    R
Sbjct: 5   FERFASAEKGNGLRATHEIHAGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLL----R 60

Query: 116 CPRC-FWPACSARCS--GLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
           C +C     C   C     SD   H  EC  L+      +  +  +  A +  R L   +
Sbjct: 61  CSQCKVARYCDVTCQKQAWSD---HKRECKCLR-SLHPRIPTDSVRLAARIIFRLLSPSQ 116

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
            SP     L++ E+H+   G   E  E +  +   L+    +++P    +  +   L  +
Sbjct: 117 TSPG-LCSLEEHESHLCDMG--EERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLL 173

Query: 233 ----CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFRDTH----I 283
               C    ++  ++     GLY  + L+ H+C P+       ++  L   RD      +
Sbjct: 174 AKLTCNCFTISDGELREIGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEEL 233

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
           +  Y   L  T+ RR  L    +F C C+RC+    DP  L
Sbjct: 234 TISYIGTLAPTRDRRTQLEEQYHFTCQCQRCTTADMDPVML 274


>gi|270011013|gb|EFA07461.1| HIF prolyl hydroxylase [Tribolium castaneum]
          Length = 478

 Score = 42.7 bits (99), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI-KSSPLLG 65
          + CAVC  +T L +C+ CK   YC K+HQ++ W  H+ +C +L   + K SP  G
Sbjct: 5  LSCAVCGTKTNLLRCARCKTRIYCSKEHQRKDWPNHRRECAQLALALDKQSPKSG 59


>gi|427784613|gb|JAA57758.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 430

 Score = 42.7 bits (99), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           C  C E  A  +CSACK V YCG   QK HW  HK  C +L  E K +
Sbjct: 336 CTACGEPQADKRCSACKSVQYCGAPCQKLHWFTHKRHCARLAEEYKKA 383


>gi|282158095|ref|NP_001164091.1| HIF prolyl hydroxylase [Tribolium castaneum]
          Length = 356

 Score = 42.7 bits (99), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 2/55 (3%)

Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI-KSSPLLG 65
          + CAVC  +T L +C+ CK   YC K+HQ++ W  H+ +C +L   + K SP  G
Sbjct: 5  LSCAVCGTKTNLLRCARCKTRIYCSKEHQRKDWPNHRRECAQLALALDKQSPKSG 59


>gi|72003678|ref|NP_508850.2| Protein SET-30 [Caenorhabditis elegans]
 gi|351064639|emb|CCD73128.1| Protein SET-30 [Caenorhabditis elegans]
          Length = 560

 Score = 42.7 bits (99), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 238 VNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HISTMYTNAL 291
           ++ VD  R   +GLY  +C  +H+C PNA +S N + +KL    D     ++ T  T+  
Sbjct: 181 ISNVDYLREIGKGLYLDLCKYDHSCRPNAIYSCNGIVAKLRALHDNVDLENVET--THYT 238

Query: 292 WGTQP-----RREHLAITKYFNCSCERCSDPTE 319
           +   P     RR  L  T YF C CERC DP +
Sbjct: 239 YIELPPCKIQRRHMLKETWYFECHCERCDDPDD 271


>gi|195339973|ref|XP_002036591.1| GM18750 [Drosophila sechellia]
 gi|194130471|gb|EDW52514.1| GM18750 [Drosophila sechellia]
          Length = 141

 Score = 42.7 bits (99), Expect = 0.52,   Method: Composition-based stats.
 Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   +LGR L  +  +  G+ +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLQEDAVLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C + C+   D   H  EC + 
Sbjct: 61  SLDRCELCDWPLCGS-CADDEDVTEHRGECQVF 92


>gi|348680121|gb|EGZ19937.1| hypothetical protein PHYSODRAFT_298276 [Phytophthora sojae]
          Length = 423

 Score = 42.7 bits (99), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 2   ASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKK 53
           A+ ++  D+++  CA+C + A   CSAC    YC K HQK+HW    HK  C +
Sbjct: 136 AAQLKQVDDKVDLCALCGQRATFTCSACHVAQYCSKAHQKDHWTAGGHKQTCGQ 189


>gi|242096572|ref|XP_002438776.1| hypothetical protein SORBIDRAFT_10g026010 [Sorghum bicolor]
 gi|241916999|gb|EER90143.1| hypothetical protein SORBIDRAFT_10g026010 [Sorghum bicolor]
          Length = 979

 Score = 42.7 bits (99), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           CA C   A  KCS CK V YC ++ Q  HW+  HK KCK++
Sbjct: 85  CAACGGVASKKCSRCKRVRYCSQECQTNHWQSDHKFKCKQM 125


>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
 gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
          Length = 485

 Score = 42.7 bits (99), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 6/87 (6%)

Query: 235 VIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT----HISTMYT 288
           +I +   ++ +    ++++   + HNC PN ++        KLF  +D      + T Y 
Sbjct: 321 MILIGKFNLNQREGQIFTLYSYLNHNCEPNVRYELDGKTGLKLFARKDIKKGEELLTTYV 380

Query: 289 NALWGTQPRREHLAITKYFNCSCERCS 315
           N L G   RR  L +   F C CERC+
Sbjct: 381 NPLHGVTLRRRELLVNWGFLCDCERCN 407


>gi|393236698|gb|EJD44245.1| hypothetical protein AURDEDRAFT_137472 [Auricularia delicata
          TFB-10046 SS5]
          Length = 222

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 16/85 (18%)

Query: 7  IEDEELMRCAVCRET----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS- 61
          +E  +   CAV   T     L  CS CK VAYC K+ Q+ HW LHK  C+     I +S 
Sbjct: 1  MEQRQCDNCAVTVSTEAGVVLGICSNCKFVAYCSKECQRAHWALHKDVCRLTAAIISASR 60

Query: 62 -----------PLLGRYLQATLDLH 75
                     P L +YL+    +H
Sbjct: 61 AHGVASGQYPTPNLAKYLRRFCSIH 85


>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
          Length = 475

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 28/63 (44%), Positives = 34/63 (53%), Gaps = 8/63 (12%)

Query: 13  MRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP-----LLG 65
           +RCA C   +  L +C+ CK   YC K  QK  WK+HK +CK L  E KS P      LG
Sbjct: 46  VRCAQCCVFKPKLLRCAGCKLQYYCSKDCQKSAWKIHKEECKMLKSE-KSMPDDITLFLG 104

Query: 66  RYL 68
           R L
Sbjct: 105 RIL 107


>gi|126646557|ref|XP_001388057.1| MYND finger domain protein [Cryptosporidium parvum Iowa II]
 gi|126117145|gb|EAZ51245.1| MYND finger domain protein [Cryptosporidium parvum Iowa II]
 gi|323509519|dbj|BAJ77652.1| cgd4_1070 [Cryptosporidium parvum]
          Length = 234

 Score = 42.4 bits (98), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8   EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           ED+ L RC  CR   E  L  CS CK+VAYC    Q++ W  HK  CK
Sbjct: 187 EDKILNRCESCRKESEKTLLICSRCKKVAYCNVDCQRKDWSYHKQFCK 234


>gi|301118408|ref|XP_002906932.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262108281|gb|EEY66333.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 175

 Score = 42.4 bits (98), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 1/44 (2%)

Query: 10  EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCK 52
           E  +RC VC+  +   CS CK V YCG   QK+ WK  HK KCK
Sbjct: 128 EAALRCRVCKAPSTLVCSRCKAVRYCGADCQKQDWKASHKQKCK 171


>gi|397570679|gb|EJK47406.1| hypothetical protein THAOC_33872 [Thalassiosira oceanica]
          Length = 306

 Score = 42.4 bits (98), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          T L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L  + L+
Sbjct: 28 TKLKNCTACRLVKYCGVDCQRSHRKLHKKACKQRVAELKDEQLYSQGLE 76


>gi|397593190|gb|EJK55891.1| hypothetical protein THAOC_24316 [Thalassiosira oceanica]
          Length = 304

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 26/44 (59%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L G+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYGQ 72


>gi|390331847|ref|XP_003723365.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like
            [Strongylocentrotus purpuratus]
          Length = 1640

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 5/49 (10%)

Query: 12   LMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
            L+ C+ C+++     +L +C+ C  VAYC +  QK +W  H+P C++ P
Sbjct: 1063 LLHCSHCKKSCEKGRSLKRCTKCYRVAYCDQACQKSNWNTHRPTCRRFP 1111


>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 552

 Score = 42.4 bits (98), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 85/358 (23%), Positives = 136/358 (37%), Gaps = 59/358 (16%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           +I++S   GR + A   L  G  +    P +        E  C  C  P    +    RC
Sbjct: 26  DIRTSSTDGRGIWAREALSAGATLMALKPHVHVLSTRNLELYCSACTSPAP--VTGLKRC 83

Query: 117 PRC--FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---LRCL--I 169
            RC   W  CS  C   +D   H PECA ++       A N    +  +P   +RCL  +
Sbjct: 84  TRCRVIW-YCSTACQN-NDWTMHKPECAAIQR-----WAANAPSPDVSIPAEPIRCLGRL 136

Query: 170 LQRR-----SPKKYQELKDMEAHMSKRGPGTEVYEE----IDSIVKYLRSNFLEKL---- 216
           L R+     S    +E+  +++H     P T  YE       ++VKYL  +   +L    
Sbjct: 137 LWRKQKLRSSSTWAKEVDGLQSHRKDLQPST--YESHTHLAHAVVKYLGLDAPGQLGAYG 194

Query: 217 ---PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-------- 265
                D++D  S     +I   I +    +      +  ++ L+ H+C PNA        
Sbjct: 195 ITTAADLVDLIS----RFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRST 250

Query: 266 ----KHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
               +    MQ      +  D  I T Y +       R+  L  T  F C C  CS P  
Sbjct: 251 SDPSQQEPQMQVVAIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDG 310

Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQ 377
           +     A+ C    K+ G    LP    + ++   CG C A +++ D  L   ++G++
Sbjct: 311 MD-LREAIWC---PKNCGGTCPLPT---EENTLSRCGKCKAIVSSTDSVLDALRVGQE 361


>gi|397619755|gb|EJK65389.1| hypothetical protein THAOC_13754, partial [Thalassiosira oceanica]
          Length = 234

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 27/47 (57%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L GR L+
Sbjct: 93  LKNCTACRLVKYCGVDCQRTHRKQHKKACKQRAAELKDEQLYGRGLE 139


>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
 gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
          Length = 446

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 91/242 (37%), Gaps = 50/242 (20%)

Query: 115 RCPRCFWPACSARCSGLSDAH------------THAPECAILK-LGCETLLAYNDYKYEA 161
           RC +CF P    +CS                   H  EC  LK +G  T+        +A
Sbjct: 30  RCDKCFKPGKVLKCSNCLYVRYCNRFCQKEAWPDHQDECGKLKAIGTRTVP-------DA 82

Query: 162 ILPLRCLI--LQR--------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN 211
            L +  +I  LQ+         + K Y+   D+  H           E   S++  LRS 
Sbjct: 83  ALMISRIIRKLQKGGDYQKGYYTAKFYRRFHDLMTHEDDIKNDAPRMEHFQSLLVVLRS- 141

Query: 212 FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAK 266
               L  +    + A+ L  I G + +N  +I          G+Y  I +++H+C PNA 
Sbjct: 142 ----LVEEAAMPSKAELLQ-IFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNAL 196

Query: 267 HS-NMMQSKLFVFRDTH--------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
            + +     L +  D H        I   Y + +   + R+  L    YF C+CERC D 
Sbjct: 197 ATFDGTTIHLRLLEDYHGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERCRDE 256

Query: 318 TE 319
            E
Sbjct: 257 QE 258


>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
          magnipapillata]
          Length = 1037

 Score = 42.4 bits (98), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 30/52 (57%), Gaps = 5/52 (9%)

Query: 13 MRCAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
          +RC  C      L KCS CK +AYCGK  Q   WK+HK +CK L    KS+P
Sbjct: 49 IRCDYCYSEPEKLLKCSKCKFIAYCGKVCQASDWKMHKYECKCLT---KSAP 97


>gi|348687379|gb|EGZ27193.1| hypothetical protein PHYSODRAFT_468296 [Phytophthora sojae]
          Length = 430

 Score = 42.4 bits (98), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 26/45 (57%)

Query: 20  ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
           E  L  C  C+ VAYC ++ QK+ WK HKP CK +  + + +  L
Sbjct: 250 EGKLMTCGGCRSVAYCSRECQKKAWKKHKPDCKAIRAQREQAEAL 294


>gi|380011327|ref|XP_003689760.1| PREDICTED: zinc finger MYND domain-containing protein 10-like,
           partial [Apis florea]
          Length = 436

 Score = 42.4 bits (98), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 28/49 (57%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
           ++  C +C E A  +CS CKE  YCG++ Q + W  HK  C K+   I+
Sbjct: 388 DIKECFLCHEEAKKRCSKCKEAWYCGRECQIKDWIKHKDICDKITKNIE 436


>gi|299470253|emb|CBN79557.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 734

 Score = 42.4 bits (98), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)

Query: 14  RCAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           +CA C ++    +CS C++VAYC K+ Q+  WK HKP C++
Sbjct: 691 KCAQCGKSGKPLQCSRCRKVAYCSKECQRRDWKSHKPSCRE 731


>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           impatiens]
          Length = 439

 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 60/233 (25%), Positives = 88/233 (37%), Gaps = 35/233 (15%)

Query: 113 NARCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYE 160
           N RC  CF      RCS     +             H+ ECA LK  C      N  +  
Sbjct: 30  NVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPMHSKECARLK-KCSPWGISNVARLM 88

Query: 161 A--ILPLRCLILQRR---SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
           A  I+ L     + R   +   Y++ KD+ +H S+     +  E    +   L   FLE 
Sbjct: 89  ARIIIKLNQGGDEERGYYNETSYRKFKDLMSHCSEIEKDEKKMEHFVCLCNVLH-KFLED 147

Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ----GLYSVICLMEHNCLPNAKHSNMM 271
           +P       S   L  I G I +N   I         G+Y    +++H+C PNA  +   
Sbjct: 148 MP-----IPSTAELLGIYGRITINSFSIFNSDMNIGVGIYLGPSILDHSCKPNAVATFEG 202

Query: 272 QS-KLFVFRD------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
            +  +    D      + I   Y + +  T+ RR  L  + YF C CERC  P
Sbjct: 203 TTITVKAIEDLPSLDLSQIRIPYIDVIKTTRDRRAELQSSYYFWCDCERCEKP 255


>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
          Length = 428

 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 61/268 (22%), Positives = 101/268 (37%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRALARLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVIFKLMEETPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++M+K      + EE    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNMNK------LTEEKKEGLRQLVLTFQHFMREEIQDASQLPPSFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD      ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|330790441|ref|XP_003283305.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
 gi|325086730|gb|EGC40115.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
          Length = 754

 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 12  LMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKC 51
           L +C  C+ T +    C  CK VAYC K+ QK+HW +HK +C
Sbjct: 668 LYKCWSCKLTLIKPLICGYCKTVAYCSKECQKDHWLVHKEQC 709


>gi|397597652|gb|EJK57057.1| hypothetical protein THAOC_22945, partial [Thalassiosira oceanica]
          Length = 439

 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 15  CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           CA C ET      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 150 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRTHRKQHKKACKERVAELKDEKLYGQ 206


>gi|328793905|ref|XP_001122085.2| PREDICTED: zinc finger MYND domain-containing protein 10-like [Apis
           mellifera]
          Length = 373

 Score = 42.4 bits (98), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 29/50 (58%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           ++  C +C E A  +CS CKE  YCG++ Q + W  HK  C K+   I++
Sbjct: 323 DIKECFLCHEEAKKRCSKCKEAWYCGRECQIKDWIKHKNICDKITKNIEN 372


>gi|443687503|gb|ELT90474.1| hypothetical protein CAPTEDRAFT_222728 [Capitella teleta]
          Length = 128

 Score = 42.4 bits (98), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 24/39 (61%)

Query: 13  MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           ++C VC   A   C++CK+ AYCG+Q Q + W  HK  C
Sbjct: 87  VKCDVCTNLAAFLCASCKKAAYCGRQCQMQGWDQHKSTC 125


>gi|50550485|ref|XP_502715.1| YALI0D11792p [Yarrowia lipolytica]
 gi|74634629|sp|Q6C9E7.1|SET5_YARLI RecName: Full=Potential protein lysine methyltransferase SET5;
           AltName: Full=SET domain-containing protein 5
 gi|49648583|emb|CAG80903.1| YALI0D11792p [Yarrowia lipolytica CLIB122]
          Length = 438

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 105/277 (37%), Gaps = 35/277 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNARCPRC 119
           G+ L A  D+   + +  E   ++ P L     +     C  C +P   +  +   CP C
Sbjct: 118 GKGLYAKKDIKKDEELWNEQAFLLVPPLEHVGIMRKGMGCAFCSRPFQSH--NGVECPSC 175

Query: 120 FWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDY----KYEAILPLRCLILQR-R 173
               C  +C      H     E    K+      A+ D+     + A+  L  L L+  R
Sbjct: 176 AAKWCDNQCKKKDVTHVAMWHESRHGKVTRVEWRAFEDFCVENGWMALYALGLLWLRVIR 235

Query: 174 SPKKYQELKDMEAHMSKRGPGT--EVYEEIDSIVKYLRSNFLEKLPGDVLD----DTS-- 225
            PKK +  K M A  ++ G     +  E+ +S+    +S  L K   ++LD    DT   
Sbjct: 236 DPKKDEVQKQMMA-FARVGQDERHKAVEQSNSLFASEQSEVLWKKGYEMLDKLLLDTEFS 294

Query: 226 -AKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR----- 279
             K      G+  +N VD       +Y     + H+C PN    N+ +++    R     
Sbjct: 295 YEKDFLPGLGMFNINNVD-----GNMYLTQSHLNHSCEPNVDVKNVGRTQGISVRAKRDI 349

Query: 280 --DTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
                + T Y N       RR +L +   FNC+C RC
Sbjct: 350 KTGEELFTTYVNPEHQLDDRRYNLRVNWGFNCNCTRC 386


>gi|397586045|gb|EJK53487.1| hypothetical protein THAOC_27074, partial [Thalassiosira
          oceanica]
          Length = 150

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 26/44 (59%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L G+
Sbjct: 35 LKDCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYGQ 78


>gi|330795484|ref|XP_003285803.1| hypothetical protein DICPUDRAFT_97170 [Dictyostelium purpureum]
 gi|325084267|gb|EGC37699.1| hypothetical protein DICPUDRAFT_97170 [Dictyostelium purpureum]
          Length = 1002

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CAVC+   +  C+ C  V YCG +HQ + W  HK  C  L
Sbjct: 71  CAVCKSKNVQICTGCLLVYYCGTEHQNQDWPNHKDFCSGL 110


>gi|189205749|ref|XP_001939209.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187975302|gb|EDU41928.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 327

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 20/31 (64%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           + KCS C +V YCGKQHQK  W  H+  C K
Sbjct: 296 VSKCSGCGQVYYCGKQHQKLAWGFHRGYCAK 326


>gi|281210969|gb|EFA85135.1| hypothetical protein PPL_02134 [Polysphondylium pallidum PN500]
          Length = 819

 Score = 42.4 bits (98), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKCK 52
           +L RC  C+ + +    C  CK V+YC K  Q+EHW  HK  CK
Sbjct: 760 QLYRCGKCKLSLMKPLICGNCKSVSYCSKICQQEHWDFHKDICK 803


>gi|195578357|ref|XP_002079032.1| GD23737 [Drosophila simulans]
 gi|194191041|gb|EDX04617.1| GD23737 [Drosophila simulans]
          Length = 141

 Score = 42.4 bits (98), Expect = 0.65,   Method: Composition-based stats.
 Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 54  LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
           L   ++   +LGR L  +  +  G+ +  E P   GPK   +  +CLGC++ L      +
Sbjct: 2   LKARLQEDAVLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60

Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
           +  RC  C WP C + C+   D   H  EC + 
Sbjct: 61  SLDRCELCDWPLCGS-CADDEDVTEHRGECHVF 92


>gi|169601910|ref|XP_001794377.1| hypothetical protein SNOG_03831 [Phaeosphaeria nodorum SN15]
 gi|111067916|gb|EAT89036.1| hypothetical protein SNOG_03831 [Phaeosphaeria nodorum SN15]
          Length = 354

 Score = 42.4 bits (98), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 23/40 (57%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          CAVC + A  KC+ C+  AYC K+ Q   WK HK  C  L
Sbjct: 9  CAVCEKPASDKCARCRASAYCSKECQAADWKTHKTACADL 48


>gi|397632769|gb|EJK70688.1| hypothetical protein THAOC_07931, partial [Thalassiosira
          oceanica]
          Length = 283

 Score = 42.4 bits (98), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L  R
Sbjct: 7  LKNCAACRLVKYCGADCQRGHRKLHKTACKQRAAELKDEQLYSR 50


>gi|321464470|gb|EFX75478.1| hypothetical protein DAPPUDRAFT_214409 [Daphnia pulex]
          Length = 355

 Score = 42.4 bits (98), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           CA C +E    KCS CK V YC +  QK HW +HK +C K
Sbjct: 316 CATCGQEKVGSKCSICKSVQYCNRDCQKLHWPIHKKECDK 355


>gi|410077102|ref|XP_003956133.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
 gi|372462716|emb|CCF56998.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
          Length = 484

 Score = 42.4 bits (98), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 47/108 (43%), Gaps = 11/108 (10%)

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT----HISTMYTNAL 291
           +   +I + +  +Y +  L  H+C PNA++   + ++ K++  R       I   Y N L
Sbjct: 324 IGKFNINQISNQVYQLTSLANHSCEPNARYEIDSKLELKVYARRKIKPGEEIFLTYVNPL 383

Query: 292 WGTQPRREHLAITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHK 334
            G   RR  L +   F C C RC          + +  S +K L+E K
Sbjct: 384 HGVNLRRRELRVNWGFLCKCSRCLKELKYSKLRINSETSQLKVLDEKK 431


>gi|193676233|ref|XP_001943351.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 636

 Score = 42.4 bits (98), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 8/91 (8%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICL--GCHKPLNPNLADNA 114
           E+K S  +GR + AT D++PGD +A + P I GP ++    +C   GC   LN +LA   
Sbjct: 230 ELKMSKNMGRGVYATRDINPGDVVAIDEPYICGP-ISDHTGVCQYSGC---LNLDLAL-I 284

Query: 115 RCPRCFWP-ACSARCSGLSDAHTHAPECAIL 144
            CP+C     C+  C   ++   H  EC I+
Sbjct: 285 PCPKCLLVYYCNKDCMNKANEDGHYLECPIM 315


>gi|397593560|gb|EJK55991.1| hypothetical protein THAOC_24202 [Thalassiosira oceanica]
          Length = 302

 Score = 42.4 bits (98), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+S + + D+    CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 1  MSSCVPVADDGDETCANCGKQGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKTACKQRA 60

Query: 56 YEIKSSPL 63
           E+K   L
Sbjct: 61 AELKDEQL 68


>gi|327292924|ref|XP_003231159.1| hypothetical protein TERG_08982 [Trichophyton rubrum CBS 118892]
 gi|326466578|gb|EGD92031.1| hypothetical protein TERG_08982 [Trichophyton rubrum CBS 118892]
          Length = 1458

 Score = 42.4 bits (98), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 14   RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            +C VC       C  CK+VAYCG +HQ+  W  HK  CK
Sbjct: 1416 KCVVCSAPG-KLCLGCKKVAYCGTEHQRHDWANHKTTCK 1453


>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
           protein 2-like [Xenopus (Silurana) tropicalis]
          Length = 430

 Score = 42.4 bits (98), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 33/153 (21%), Positives = 68/153 (44%), Gaps = 13/153 (8%)

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
           +R + +++  +KD E+H+SK     E  E I + +  L   + + L      D +A+   
Sbjct: 119 ERTASERFLSVKDFESHLSKLD--NEKLELIQNDIAALHRFYSKNLH---YSDNAAQVFL 173

Query: 231 WI---CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
           +    C    +   ++      ++  + LM H+C PN   +     +++   ++ H    
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDE 233

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + + NC C  CS
Sbjct: 234 VITSYIDLLYPTEDRNDRLIDSXFCNCDCRECS 266


>gi|328854461|gb|EGG03593.1| hypothetical protein MELLADRAFT_117274 [Melampsora
          larici-populina 98AG31]
          Length = 340

 Score = 42.4 bits (98), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 8/74 (10%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
          RCA C     L  C+ CK   YC + HQ+E W  HK  CK     IK S +    L+  +
Sbjct: 10 RCARCPSNQDLKSCTGCKVQQYCSETHQREDWTNHKKDCK----SIKKSRM---RLEEAI 62

Query: 73 DLHPGDRIARESPL 86
            HP D     +PL
Sbjct: 63 QEHPNDPSFNTAPL 76


>gi|406698010|gb|EKD01257.1| hypothetical protein A1Q2_04414 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 359

 Score = 42.4 bits (98), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 10/98 (10%)

Query: 230 HWICGVIEVNGVDIGRYT-QG-LYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDTHIS 284
           +W     + N + +G  + QG +Y  +C + H+C+PNA H    N+    +   RD    
Sbjct: 149 YW--STFDTNALPLGPNSPQGAIYPTVCYINHSCIPNANHCWNKNLKMETIHALRDIKKG 206

Query: 285 TMYTNA--LWGTQP-RREHLAITKYFNCSCERCSDPTE 319
              T A  L GT+  RR H+     F C CE C  P +
Sbjct: 207 EEITIAYILEGTRAERRVHVHERLGFECQCEACGAPDD 244


>gi|403417395|emb|CCM04095.1| predicted protein [Fibroporia radiculosa]
          Length = 1169

 Score = 42.4 bits (98), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 3/49 (6%)

Query: 8    EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
             D    +C VCR      L  C  C +V YC +  QK+HWK HK  CK+
Sbjct: 1121 RDPSAHKCFVCRGKGKPRLLACKGCMKVRYCSEACQKKHWKAHKANCKR 1169


>gi|195347882|ref|XP_002040480.1| GM18905 [Drosophila sechellia]
 gi|194121908|gb|EDW43951.1| GM18905 [Drosophila sechellia]
          Length = 71

 Score = 42.4 bits (98), Expect = 0.69,   Method: Composition-based stats.
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELG 321
           Y   LW T  R+  L +TK+F C C RC DPT  G
Sbjct: 3   YAKLLWSTLARKIFLGMTKHFICKCVRCQDPTGAG 37


>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
          Length = 384

 Score = 42.4 bits (98), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 45/211 (21%), Positives = 85/211 (40%), Gaps = 17/211 (8%)

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I   R
Sbjct: 17  GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPER 73

Query: 174 SP-KKYQELKDMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
           +P +K   +K+ E+H+ K      ++ +   + + +  S  LE L  D L    A+    
Sbjct: 74  TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFLDNDSLVVVFAQV--- 130

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
            C    +   ++      ++  + LM H+C PN   +   +  L   R          + 
Sbjct: 131 NCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPGEEVF 188

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 189 TSYIDLLYPTEDRNDRLRDSYFFTCECQECT 219


>gi|449541748|gb|EMD32730.1| hypothetical protein CERSUDRAFT_118469 [Ceriporiopsis
          subvermispora B]
          Length = 523

 Score = 42.4 bits (98), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 35/73 (47%), Gaps = 9/73 (12%)

Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHK----PKCKKLPYEIKSSPLL---GRYLQATL 72
          E     CS CK  +YC K+ QK+HWK+HK    PK  K       SP L     Y+ A  
Sbjct: 23 EQPFSICSVCKRASYCSKECQKKHWKVHKLLCQPKGSKPGERPNPSPHLPADSPYVVAET 82

Query: 73 D--LHPGDRIARE 83
          D  +H GD I  E
Sbjct: 83 DPNVHTGDEIRTE 95


>gi|358056892|dbj|GAA97242.1| hypothetical protein E5Q_03919 [Mixia osmundae IAM 14324]
          Length = 1165

 Score = 42.4 bits (98), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 22/38 (57%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           CA C   A H+CS C    YC K+ Q   W LHKP+C+
Sbjct: 927 CAHCHLRASHRCSLCNTERYCSKECQAARWPLHKPRCQ 964


>gi|169846112|ref|XP_001829772.1| hypothetical protein CC1G_05981 [Coprinopsis cinerea okayama7#130]
 gi|116509099|gb|EAU91994.1| hypothetical protein CC1G_05981 [Coprinopsis cinerea okayama7#130]
          Length = 277

 Score = 42.4 bits (98), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 23/43 (53%), Gaps = 2/43 (4%)

Query: 14  RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +C  C  R+  L KC ACK V YCG   QK  W  HK  CK L
Sbjct: 233 KCKACGKRDAKLMKCGACKSVNYCGSTCQKADWPAHKAICKSL 275


>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
 gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
           Full=Histone methyltransferase SMYD2-B; AltName:
           Full=SET and MYND domain-containing protein 2B
 gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
          Length = 430

 Score = 42.0 bits (97), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 44/222 (19%), Positives = 89/222 (40%), Gaps = 33/222 (14%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
           N  C +  WP     CS +  +  +      ++L               IL  +    +R
Sbjct: 73  NVDCQKGDWPMHKLECSSMCSSGQNWCPSETVRLTAR------------ILAKQKTQTER 120

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
            + +++  +K+ E+H+SK     E  E I++ +     + L +     + +     L ++
Sbjct: 121 TASERFMSVKEFESHLSKLD--NEKKELIENDI-----SALHRFYSKNVHNCDNAALEFL 173

Query: 233 CGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
              +  NG  I           ++  + LM H+C PN   +     +++   ++ H    
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEE 233

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
           + T Y + L+ T+ R + L  + +F+C C  CS    DP +L
Sbjct: 234 VFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECSTKQKDPAKL 275


>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
          Length = 442

 Score = 42.0 bits (97), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 62/289 (21%), Positives = 112/289 (38%), Gaps = 43/289 (14%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
           + +C +  WPA    C  L       P  ++ +L    ++  N  +           ++ 
Sbjct: 48  DTQCQKGGWPAHKEECFFLVKVQPRVP-TSMARLLARVIIRTNQKE----------TIRA 96

Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIV-KYLRSNFLEKLPGDVLDDTSAKCLHW 231
            + + +  L      + + G  +E +  +  ++ +Y+ +++L   P   L     K +  
Sbjct: 97  FNGRTFDSLLSHSDEIKEDGEKSEFFVTLSHVLFEYMGADYLP--PASELLTIFGKVM-- 152

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR---------DTH 282
              V  ++  D+     GLY  + +++H+C P+A    +      V R         D+ 
Sbjct: 153 -VNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDA--FVLFNGTKAVLRPLKQYITAYDSS 209

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
           +   Y + L  T  RR  L    +F C C  C D     T  S ++C   H   G C   
Sbjct: 210 LRIAYCDLLDLTSMRRNQLKQQFFFTCECSACLDLEREKTARS-VRC--RHCVDGYC--- 263

Query: 343 PVNPLDNDSD----WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
              PLD + +    W CG+ S       VHL+  Q+  +V +L QE  N
Sbjct: 264 ---PLDVNENSLVCWQCGATSEVHVDEAVHLM-QQVEWEVKRL-QERRN 307


>gi|449549245|gb|EMD40211.1| hypothetical protein CERSUDRAFT_122273 [Ceriporiopsis subvermispora
           B]
          Length = 704

 Score = 42.0 bits (97), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 4   IIEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           +I     EL RC+ CR    AL +C  C +  YC    QK HW  HK  CK
Sbjct: 652 LIRTNSVELYRCSWCRNPSAALRRCGRCGKTRYCDGACQKAHWGEHKAACK 702


>gi|170101224|ref|XP_001881829.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164643184|gb|EDR07437.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 621

 Score = 42.0 bits (97), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 24/42 (57%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           E+ +   C       CS CK V+YC K+ QK HW++HK  C+
Sbjct: 90  EVEKYKACDNPGTMACSGCKLVSYCSKECQKYHWRVHKRDCR 131


>gi|299743144|ref|XP_001835571.2| hypothetical protein CC1G_03353 [Coprinopsis cinerea
          okayama7#130]
 gi|298405520|gb|EAU86142.2| hypothetical protein CC1G_03353 [Coprinopsis cinerea
          okayama7#130]
          Length = 465

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 2  ASIIEIEDEELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          A  + +  ++L +C+VC++T  L  CSAC E  YCG + Q   W  HK +C K
Sbjct: 15 ARTVMLPLKKLEKCSVCQKTDGLRLCSACGEKVYCGSECQTNDWAEHKKQCGK 67


>gi|323452238|gb|EGB08113.1| hypothetical protein AURANDRAFT_71676 [Aureococcus anophagefferens]
          Length = 1696

 Score = 42.0 bits (97), Expect = 0.74,   Method: Composition-based stats.
 Identities = 19/58 (32%), Positives = 27/58 (46%), Gaps = 5/58 (8%)

Query: 2    ASIIEIEDEELMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            A +  +E  E      C  T      L +C+ CK   Y  K+HQ+ HW LHK  C+ +
Sbjct: 1177 AQLARLEPVEFATTRCCARTYATGVPLLRCARCKRTFYASKEHQEAHWPLHKRSCRHV 1234


>gi|221484463|gb|EEE22759.1| hypothetical protein TGGT1_035070 [Toxoplasma gondii GT1]
          Length = 859

 Score = 42.0 bits (97), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 9/89 (10%)

Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
           +Y +     H+CLPNA       H+ +++++  +     ++  Y    + +  T  RR+ 
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
           L+ TK F C+CERCS P +    F    C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256


>gi|237837951|ref|XP_002368273.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
 gi|211965937|gb|EEB01133.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
          Length = 859

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 9/89 (10%)

Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
           +Y +     H+CLPNA       H+ +++++  +     ++  Y    + +  T  RR+ 
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
           L+ TK F C+CERCS P +    F    C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256


>gi|393229006|gb|EJD36638.1| hypothetical protein AURDEDRAFT_74033 [Auricularia delicata
          TFB-10046 SS5]
          Length = 239

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 23/42 (54%)

Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C  CK  AYC K  QKEHW  HKP C+     + +S   GR+
Sbjct: 44 CGGCKFAAYCSKACQKEHWPSHKPICRFTVATVGASRAEGRF 85


>gi|302688639|ref|XP_003033999.1| hypothetical protein SCHCODRAFT_107214 [Schizophyllum commune
          H4-8]
 gi|300107694|gb|EFI99096.1| hypothetical protein SCHCODRAFT_107214, partial [Schizophyllum
          commune H4-8]
          Length = 216

 Score = 42.0 bits (97), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 21/38 (55%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          C VC      +CS C+ V YC K+H  + WK HKP C 
Sbjct: 16 CRVCGSPTSMRCSRCRVVYYCSKEHIAQDWKNHKPGCN 53


>gi|321471928|gb|EFX82900.1| hypothetical protein DAPPUDRAFT_316237 [Daphnia pulex]
          Length = 1216

 Score = 42.0 bits (97), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 15   CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            CAVC++    L +CS C+   YC  + Q  HW +HK  CKKL
Sbjct: 1175 CAVCKKIPEKLKRCSRCRSTVYCSVECQHSHWPMHKLICKKL 1216


>gi|323449447|gb|EGB05335.1| hypothetical protein AURANDRAFT_66543 [Aureococcus
          anophagefferens]
          Length = 380

 Score = 42.0 bits (97), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 1/54 (1%)

Query: 8  EDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
          +D+++ RCA C RE A   C+ C+ V YC    Q+  WK HK  C      ++S
Sbjct: 11 DDDDVKRCAACGREAARLVCTRCRAVRYCDGDCQRRAWKAHKASCPTRDQRVES 64


>gi|169868772|ref|XP_001840957.1| hypothetical protein CC1G_03186 [Coprinopsis cinerea okayama7#130]
 gi|116498115|gb|EAU81010.1| hypothetical protein CC1G_03186 [Coprinopsis cinerea okayama7#130]
          Length = 445

 Score = 42.0 bits (97), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 44/93 (47%), Gaps = 5/93 (5%)

Query: 20  ETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGD 78
           E AL  C+ CK  AYCG++ Q+E WK +HK +C  L  E +      R++  T  L  GD
Sbjct: 272 ELALKICTNCKVTAYCGERCQREDWKQIHKRECGDLAKEYRKLVDAKRWVSPTTKL--GD 329

Query: 79  RIARESPLIVGPKLA--LAEPICLGCHKPLNPN 109
            +A  S       L   + + IC    +  N N
Sbjct: 330 LLAINSTFRAHDTLVPQILQQICRAGPQATNKN 362


>gi|15930025|gb|AAH15453.1| ANKMY2 protein, partial [Homo sapiens]
          Length = 161

 Score = 42.0 bits (97), Expect = 0.80,   Method: Composition-based stats.
 Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C  C E  A  +CS CK V YC +  QK HW  HK  CK L
Sbjct: 40 CTTCGEKGASKRCSVCKMVIYCDQTCQKTHWFTHKKICKNL 80


>gi|449540948|gb|EMD31935.1| hypothetical protein CERSUDRAFT_119267 [Ceriporiopsis subvermispora
           B]
          Length = 214

 Score = 42.0 bits (97), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 22  ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +L +CS C++  YCG +HQ   WK HK +CK L
Sbjct: 162 SLSRCSGCRKAMYCGPEHQSGTWKDHKSECKAL 194


>gi|326524007|dbj|BAJ97014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1026

 Score = 42.0 bits (97), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
          EL  CA C      KCS CK + YC  + Q +HW+  HK KCK++
Sbjct: 26 ELGPCAQCGAAGTKKCSGCKRMRYCSGECQSKHWQSDHKFKCKQM 70


>gi|390599611|gb|EIN09007.1| hypothetical protein PUNSTDRAFT_143642 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 729

 Score = 42.0 bits (97), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 12  LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           L RC+ C     AL KC  C   +YC    QK HWK HK +C+K
Sbjct: 685 LYRCSYCGNPSAALKKCGGCGNTSYCDASCQKNHWKAHKNQCQK 728


>gi|268579277|ref|XP_002644621.1| C. briggsae CBR-SET-30 protein [Caenorhabditis briggsae]
          Length = 634

 Score = 42.0 bits (97), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 65/238 (27%), Positives = 100/238 (42%), Gaps = 35/238 (14%)

Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAIL----- 163
           C +C     CS +C   S A  H  EC  L   C      E +L     +Y+ IL     
Sbjct: 136 CDKCKVAKFCSKQCQT-SGAFDHKYECGPLH-NCKDLNTDERMLLRIIGRYKDILDGNDK 193

Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYL--RSNFLEKLPGDVL 221
            +      R S +   ++ +  A M K      V+++I   VK    +++ ++K     L
Sbjct: 194 KIDGFYTNRESGRSVMQIWEHCADMKKDENAMNVFKKIYDRVKQYGDKTHLVDKDTAFQL 253

Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRD 280
              +    H I  V  +   +IG+   GLY  +C  +H+C PNA +S N   +KL    D
Sbjct: 254 HSRNFINRHSISNVDYLR--EIGK---GLYLDLCRYDHSCRPNAIYSCNGTVAKLRALHD 308

Query: 281 T----HISTMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
                ++ T  T+  +   P     RR  L  T YF C CERC DP +   + +A+ C
Sbjct: 309 NVDLENVET--THYTYIELPPCKIQRRHMLKETWYFECHCERCDDPED--NWLTAVIC 362


>gi|302656038|ref|XP_003019776.1| hypothetical protein TRV_06153 [Trichophyton verrucosum HKI 0517]
 gi|291183546|gb|EFE39152.1| hypothetical protein TRV_06153 [Trichophyton verrucosum HKI 0517]
          Length = 1485

 Score = 42.0 bits (97), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 14   RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            +C VC       C  CK+VAYCG +HQ+  W  HK  CK
Sbjct: 1443 KCVVCSAPG-KLCLGCKKVAYCGTEHQRRDWVNHKTTCK 1480


>gi|397585818|gb|EJK53401.1| hypothetical protein THAOC_27174, partial [Thalassiosira
          oceanica]
          Length = 144

 Score = 42.0 bits (97), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          CA C ET      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 16 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRAHRKEHKKACKQRVAELKDEKLYGQ 72


>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 445

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 44/162 (27%), Positives = 64/162 (39%), Gaps = 22/162 (13%)

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK-LPGDVLDDTSAKCLHWI 232
           + K Y+   D+  H        +  E   S+   LRS   E  +PG+       + L  I
Sbjct: 105 TDKCYRRFWDLMPHEEDIKKDEKRMEHFQSLTVVLRSLIDEAAMPGN-------QELLRI 157

Query: 233 CGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKH-SNMMQSKLFVFRDTH---- 282
            G + +N  +I          G+Y    +M+H+C PNA    +     + +  D H    
Sbjct: 158 FGKMCINSFNILDDEMNSIGTGMYLGASIMDHSCRPNAVAIFDGCNLNVRLLEDYHGAEI 217

Query: 283 ----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
               I   Y + L  T  RR+ L    YF C CERC D  EL
Sbjct: 218 DFSKIFISYIDLLNPTDVRRDMLRKRYYFECGCERCRDEQEL 259


>gi|147838237|emb|CAN78307.1| hypothetical protein VITISV_005598 [Vitis vinifera]
          Length = 494

 Score = 42.0 bits (97), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
           C VC      KCS CK V YC +  Q  HWK  HK KCK      K++
Sbjct: 90  CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 137


>gi|7510854|pir||T30001 hypothetical protein ZC8.3 - Caenorhabditis elegans
          Length = 306

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 13/93 (13%)

Query: 238 VNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HISTMYTNAL 291
           ++ VD  R   +GLY  +C  +H+C PNA +S N + +KL    D     ++ T  T+  
Sbjct: 159 ISNVDYLREIGKGLYLDLCKYDHSCRPNAIYSCNGIVAKLRALHDNVDLENVET--THYT 216

Query: 292 WGTQP-----RREHLAITKYFNCSCERCSDPTE 319
           +   P     RR  L  T YF C CERC DP +
Sbjct: 217 YIELPPCKIQRRHMLKETWYFECHCERCDDPDD 249


>gi|170101132|ref|XP_001881783.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164643138|gb|EDR07391.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 428

 Score = 42.0 bits (97), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 22/40 (55%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C +C +    KC+ C  V YCGK+ Q   WK HK  CK L
Sbjct: 249 CELCGKREASKCTGCFSVVYCGKECQTADWKDHKKTCKTL 288


>gi|198458079|ref|XP_001360905.2| GA17017 [Drosophila pseudoobscura pseudoobscura]
 gi|198136215|gb|EAL25480.2| GA17017 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score = 42.0 bits (97), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 20/37 (54%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC   HQ+ HW  HK  C
Sbjct: 128 CAACGCYAPHACSRCKTIYYCSSAHQRAHWSDHKANC 164


>gi|340718013|ref|XP_003397467.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
           domain-containing protein 2-like [Bombus terrestris]
          Length = 407

 Score = 42.0 bits (97), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L   I  S
Sbjct: 335 CITCGEDKAXQKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNITPS 382


>gi|397624310|gb|EJK67342.1| hypothetical protein THAOC_11640 [Thalassiosira oceanica]
          Length = 304

 Score = 42.0 bits (97), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+ L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQLYGQGLE 75


>gi|431908963|gb|ELK12554.1| Ankyrin repeat and MYND domain-containing protein 2 [Pteropus
           alecto]
          Length = 441

 Score = 42.0 bits (97), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  CRE  A  +CS CK V YC +  QK HW  HK  CK L
Sbjct: 320 CTTCREKGASKRCSVCKMVIYCDQTCQKTHWFAHKKMCKNL 360


>gi|195151327|ref|XP_002016599.1| GL11671 [Drosophila persimilis]
 gi|194110446|gb|EDW32489.1| GL11671 [Drosophila persimilis]
          Length = 348

 Score = 42.0 bits (97), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 20/37 (54%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK + YC   HQ+ HW  HK  C
Sbjct: 128 CAACGCYAPHACSRCKTIYYCSSAHQRAHWSDHKANC 164


>gi|397644046|gb|EJK76226.1| hypothetical protein THAOC_02027 [Thalassiosira oceanica]
          Length = 271

 Score = 42.0 bits (97), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   QK H KLHK  CK+   E+K   L  R
Sbjct: 31 LKSCTACRLVKYCGVDCQKAHRKLHKKACKQRAAELKDEQLYSR 74


>gi|297838801|ref|XP_002887282.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297333123|gb|EFH63541.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 392

 Score = 42.0 bits (97), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 25/50 (50%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C      +C  C  VAYC   HQ  HW  HK +C++L  +++   LL  +
Sbjct: 12 CVGPPTRRCGQCGSVAYCSVSHQSSHWSYHKEECERLEEQMRRVDLLNDF 61


>gi|405974013|gb|EKC38689.1| Ankyrin repeat and MYND domain-containing protein 2 [Crassostrea
           gigas]
          Length = 420

 Score = 42.0 bits (97), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 28/50 (56%), Gaps = 2/50 (4%)

Query: 8   EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           ED+    C   R  A  KCSACK V YC ++ QK HW  HK  CK+L  E
Sbjct: 314 EDDVCSTCGSLR--AKKKCSACKMVCYCDQRCQKLHWSTHKKLCKQLAEE 361


>gi|428180679|gb|EKX49545.1| hypothetical protein GUITHDRAFT_93215 [Guillardia theta CCMP2712]
          Length = 483

 Score = 41.6 bits (96), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 1/55 (1%)

Query: 1   MASIIEIED-EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           M  ++ +ED  E  +CAVC   A  +CS CK   YCG++ Q   WK HKP C+ +
Sbjct: 395 MYDMLNVEDFMEDPKCAVCGAPAEKRCSKCKMEWYCGRKCQVAAWKKHKPFCEMI 449


>gi|198425974|ref|XP_002127497.1| PREDICTED: similar to EGL nine homolog 1 [Ciona intestinalis]
          Length = 329

 Score = 41.6 bits (96), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          C VC     L  CS C+ V YC ++HQ+  WK+HK  CK
Sbjct: 11 CQVCNTPEQLRTCSECRTVWYCSREHQRSDWKVHKTNCK 49


>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
          Length = 432

 Score = 41.6 bits (96), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 58/265 (21%), Positives = 101/265 (38%), Gaps = 20/265 (7%)

Query: 61  SPLLGRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           SP  GR L+A    H GD +    A    L V  +    E  C          L+   RC
Sbjct: 14  SPGKGRGLRALQPFHVGDLLFSCPAYAYVLTVNERGNHCE-YCFA----RKEGLSKCGRC 68

Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP- 175
            + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I   R+P 
Sbjct: 69  KQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPS 125

Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
           +K   +K+ E+H+ K     E  + I S V  L   + + L     D          C  
Sbjct: 126 EKLLAVKEFESHLEKLD--NEKRDLIQSDVAALHHFYSKHLEFPDTDSLVVLFAQVNCNG 183

Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHISTMYTNA 290
             +   ++      ++  + LM H+C PN   +       +++   +     + T Y + 
Sbjct: 184 FTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDL 243

Query: 291 LWGTQPRREHLAITKYFNCSCERCS 315
           L+ T+ R + L  + +F C C+ C+
Sbjct: 244 LYPTEDRNDRLRDSYFFTCQCQECT 268


>gi|67585983|ref|XP_665160.1| MYND finger domain protein [Cryptosporidium hominis TU502]
 gi|54655649|gb|EAL34930.1| MYND finger domain protein [Cryptosporidium hominis]
          Length = 123

 Score = 41.6 bits (96), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 3/48 (6%)

Query: 8   EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           ED+ L RC  CR   E  L  CS CK+VAYC    Q++ W  HK  CK
Sbjct: 76  EDKILNRCESCRKESEKTLLICSRCKKVAYCNVDCQRKDWSYHKQFCK 123


>gi|410912118|ref|XP_003969537.1| PREDICTED: programmed cell death protein 2-like [Takifugu rubripes]
          Length = 365

 Score = 41.6 bits (96), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK-LPYEIKSSPLLGR 66
           L  C VC       CS C  V YCGK HQ  HWK  HK +C       +K+SP L R
Sbjct: 141 LKLCWVCGCLGNKACSRCHMVTYCGKHHQTVHWKHTHKKECSSPETTTVKTSPFLFR 197


>gi|390340692|ref|XP_791789.3| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Strongylocentrotus purpuratus]
          Length = 452

 Score = 41.6 bits (96), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
           C+ C E  A  KCSACK+V YC    QK  W  HK +CK++  E K
Sbjct: 321 CSTCGEAKASKKCSACKQVNYCDAVCQKLEWFTHKKQCKRIAEEHK 366


>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
           mutus]
          Length = 383

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 44/212 (20%), Positives = 86/212 (40%), Gaps = 11/212 (5%)

Query: 110 LADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI 169
           L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I
Sbjct: 11  LSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKI 67

Query: 170 LQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
              R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D      
Sbjct: 68  HPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLF 125

Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----I 283
               C    +   ++      ++  + LM H+C PN   +     +++   ++ H    +
Sbjct: 126 AQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEV 185

Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS 315
            T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 186 FTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 217


>gi|397647092|gb|EJK77557.1| hypothetical protein THAOC_00606 [Thalassiosira oceanica]
          Length = 396

 Score = 41.6 bits (96), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 24/41 (58%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L  C+AC+ V YCG   Q+ H KLHK  CK+   EIK   L
Sbjct: 122 LKDCTACRLVKYCGVDCQRAHRKLHKKACKRRAAEIKDEQL 162


>gi|397646698|gb|EJK77388.1| hypothetical protein THAOC_00784 [Thalassiosira oceanica]
          Length = 290

 Score = 41.6 bits (96), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   QK H KLH+  CK+   E+K   L  + L+
Sbjct: 27 LKDCTACRLVKYCGVDCQKAHRKLHRKACKQRAAELKDEKLYSQGLE 73


>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
          Length = 441

 Score = 41.6 bits (96), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 82/193 (42%), Gaps = 39/193 (20%)

Query: 237 EVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTN 289
           E+N + +G Y + +     +++H+C PNA    + +      R       +  +   Y +
Sbjct: 171 EMNPIGVGVYIRQIKPC-SVLDHSCRPNA--VAVFEGTTLRIRCVEPVDSEQDLRISYID 227

Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS----AMKCLNEHKDQGDCWILPVN 345
            L  T  RR +L    YFNC C  C D  +    FS    +++C      +G    +P  
Sbjct: 228 TLDDTTTRRRNLQQQYYFNCLCGECKDSEKDLIKFSFNCTSVEC------KGHVTQVPA- 280

Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEM----------L 395
               D  ++C  C   ++  DV+L  +Q  E+  K+++    +K   E           +
Sbjct: 281 ----DDRFSCDICGTTVD--DVNL--NQAAEKAQKIIKHLNELKKQREHESIRMKGLSCV 332

Query: 396 TKLEAMFHPHHYH 408
           +++ A+ HP++ H
Sbjct: 333 SEMRAILHPYNVH 345


>gi|397634288|gb|EJK71362.1| hypothetical protein THAOC_07207, partial [Thalassiosira
          oceanica]
          Length = 117

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 3  SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D+    CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 2  SCVPVMDDGDEACANCGKQGSDTVVLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 61

Query: 58 IKSSPLLGRYLQ 69
          ++   L G+ L+
Sbjct: 62 LEDEQLYGQGLE 73


>gi|359476454|ref|XP_002271840.2| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Vitis
           vinifera]
          Length = 940

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
           C VC      KCS CK V YC +  Q  HWK  HK KCK      K++
Sbjct: 87  CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 134


>gi|342881956|gb|EGU82734.1| hypothetical protein FOXB_06742 [Fusarium oxysporum Fo5176]
          Length = 498

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 1/43 (2%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
          C+    + C+ C+ V YCG + QK HW +HK  CK  PY  K+
Sbjct: 18 CQNIGKYSCANCRLVVYCGSECQKAHWLIHKVDCKS-PYTKKT 59


>gi|393238670|gb|EJD46206.1| hypothetical protein AURDEDRAFT_151705 [Auricularia delicata
           TFB-10046 SS5]
          Length = 741

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 3/40 (7%)

Query: 15  CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKL-HKPKC 51
           CA C E +  L KC+AC++V YC KQ QK+ WK  H+ +C
Sbjct: 693 CAYCGEESSLLRKCAACRDVRYCSKQCQKKDWKFTHREQC 732


>gi|323447944|gb|EGB03849.1| hypothetical protein AURANDRAFT_67642 [Aureococcus anophagefferens]
          Length = 1642

 Score = 41.6 bits (96), Expect = 1.0,   Method: Composition-based stats.
 Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 6/66 (9%)

Query: 14   RCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL-PYEIKSSPLLGRY 67
            RCA C         +HKC+ C+ V YC +  QK HW  HK  C    P E K + +  +Y
Sbjct: 1555 RCAHCARLPGPTVRVHKCAQCRRVYYCSRDCQKSHWPKHKAFCSTSGPTEPKPNYIPPKY 1614

Query: 68   LQATLD 73
               + D
Sbjct: 1615 HAHSYD 1620


>gi|320162592|gb|EFW39491.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1600

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 24/38 (63%)

Query: 17   VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            VC +TA  +C+ CK   YC ++ Q   WK HKP CK++
Sbjct: 1466 VCGKTAKFRCTMCKSQYYCSRECQTSDWKTHKPACKRM 1503


>gi|397563830|gb|EJK43958.1| hypothetical protein THAOC_37548 [Thalassiosira oceanica]
          Length = 300

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 3  SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D+    CA C +       L  C+AC+ V YCG   Q+ H KLHK  CK+   E
Sbjct: 2  SCVPVVDDSGEACANCGKQGSDTVKLRNCAACRLVKYCGVDCQRAHRKLHKKACKQRAAE 61

Query: 58 IKSSPLLGRYLQ 69
          ++   L  + L+
Sbjct: 62 LEDEKLYSQGLE 73


>gi|170115743|ref|XP_001889065.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164636013|gb|EDR00313.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 633

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 19/35 (54%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          CR      CSACK V+YC K  Q  HW  HK  CK
Sbjct: 52 CRNPGTMACSACKLVSYCSKDCQTTHWHRHKKDCK 86


>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
           porcellus]
          Length = 428

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    + ++ +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WQDHKQECKCLK-SCKPRYPPDSVRLLGRVVIKLMEKTPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++++K      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD  +    +  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|367055708|ref|XP_003658232.1| hypothetical protein THITE_2124766 [Thielavia terrestris NRRL
          8126]
 gi|347005498|gb|AEO71896.1| hypothetical protein THITE_2124766 [Thielavia terrestris NRRL
          8126]
          Length = 447

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C  T   L +CS C+ V YCG+ HQ  H  +HK  C K+
Sbjct: 21 RCPQCTATGPKLLRCSGCRAVRYCGRDHQVAHRTMHKSACTKI 63


>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
           Neff]
          Length = 548

 Score = 41.6 bits (96), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 7/101 (6%)

Query: 249 GLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
           GLY +  L+ H+C PN    N      +++   V     +   Y +       R+  L  
Sbjct: 255 GLYPLASLINHSCQPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLE 314

Query: 304 TKYFNCSCERCSDPT--ELGTYFSAMKCLNEHKDQGDCWIL 342
           TK+F+C C RCS P    +  Y S  +C N+ K   D  ++
Sbjct: 315 TKHFDCLCNRCSPPITDSVDRYLSGFQCPNKAKTSCDGLLV 355


>gi|407408418|gb|EKF31865.1| hypothetical protein MOQ_004294 [Trypanosoma cruzi marinkellei]
          Length = 744

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 24/126 (19%)

Query: 239 NGVDIGRYTQGL--YSVICLMEHNCLPNA----KHSNMMQSKLFV-FRDT-------HIS 284
           NGV +G    G+  +  I L+EH+C PNA    +H    +S +F   R T        I+
Sbjct: 362 NGVSLGTLETGIGVFGGISLIEHSCHPNALVVFRHGCTSESIVFAELRATRPIGIGERIT 421

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCS---DPTELGTYFSAMKCLNEH-----KDQ 336
             Y       + R++ L    +F+C+C+ C+   D T L   F++   L E      K +
Sbjct: 422 IAYVPTFLPKEERKKRLRAKFFFSCACDHCTAGHDTTRL--MFASCGSLEERVVMCPKGR 479

Query: 337 GDCWIL 342
           G  W L
Sbjct: 480 GSEWTL 485


>gi|397602518|gb|EJK58199.1| hypothetical protein THAOC_21697 [Thalassiosira oceanica]
          Length = 309

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          CA C +T      L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRAHRKLHKRACKQRAAELKDEEL 68


>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 578

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 68/325 (20%), Positives = 110/325 (33%), Gaps = 60/325 (18%)

Query: 55  PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN- 113
           P  +K++   GR+  A   L PG  +  ESP         A   C  C K  N   A   
Sbjct: 16  PITLKATKSKGRHFVAKSQLQPGTTVLVESPCAFVLFKDNARDYCSYCMKRFNRMTAATR 75

Query: 114 ---------------------ARCPRC----FWPACSARCSGLSDAH------------- 135
                                  C  C    FW  CS  C     AH             
Sbjct: 76  PASTETSLFSLSSQSDSSNTCIVCTDCRNQTFW--CSESCKKADQAHQLICAAIVELPGI 133

Query: 136 --THAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHM-SKRG 192
              H+ + A+++L  + L    +   EAI  +        +P     + D+ +H  S +G
Sbjct: 134 AGMHSVDYALVRLIVQVL---ANTAVEAIWQMDSSCRVESTPMAC--ISDLLSHQKSYQG 188

Query: 193 PGTEVY----EEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ 248
           P         E+I + + +L+ N   K   +++   + +      GV +  G   G    
Sbjct: 189 PWLNAIDCAAEDIAACLSHLKINDSLKSKQEIIK-LACRINSNSHGVYDATGNKSGEIGV 247

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNALWGTQPRREHLA 302
           G++  + ++ H+C PN         ++ V   T +S        Y +       RR  L 
Sbjct: 248 GMFPFVAMLNHSCAPNCAFVTSQSGQMLVRTLTQVSPGTELCVGYIDLFTPRWERRGKLL 307

Query: 303 ITKYFNCSCERCSDPTELGTYFSAM 327
            TK F C+C RC    E     + +
Sbjct: 308 TTKLFWCTCIRCGPRAEETVVLTTL 332


>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
          Length = 333

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 9/107 (8%)

Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRR 298
           Y  G++     + H+CLPNA   N  Q  + VFR          I T YT+       RR
Sbjct: 68  YGLGIFPTGSYLNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERR 126

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
           +HL    +F C C++C    +L    S + C N+       + +P N
Sbjct: 127 KHLLKQYFFFCQCQQCKFQKKLADQ-SCLSCKNQLSSSNTSFKIPSN 172


>gi|397643232|gb|EJK75731.1| hypothetical protein THAOC_02539 [Thalassiosira oceanica]
          Length = 295

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 31/63 (49%), Gaps = 12/63 (19%)

Query: 13 MRCAVCRET------------ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
          M+C+ C ET             L  C+AC+ V YCG + QK H K HK  CK+   EIK 
Sbjct: 1  MKCSPCEETCANCGGRGSDSLKLKDCTACRLVKYCGVECQKVHRKQHKTICKQRAAEIKD 60

Query: 61 SPL 63
            L
Sbjct: 61 EQL 63


>gi|397575638|gb|EJK49803.1| hypothetical protein THAOC_31288 [Thalassiosira oceanica]
          Length = 342

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 34/66 (51%), Gaps = 5/66 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+    CA C  TA     L  C+AC+ V YCG   Q+ H KLHK  CK+   E
Sbjct: 42  SCVPVTDDCDEACANCGLTASDTAKLKNCNACRLVKYCGVDCQRAHRKLHKKACKQRAAE 101

Query: 58  IKSSPL 63
           +K   L
Sbjct: 102 LKDEQL 107


>gi|392588951|gb|EIW78282.1| hypothetical protein CONPUDRAFT_167323 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 381

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 7/61 (11%)

Query: 9   DEELMR---CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           +E LM    CA C  R   + +CSAC    YC  + Q+ HWK HKP C   P+  K++  
Sbjct: 189 EEGLMEDKNCANCHKRRERMKQCSACHTTMYCSTECQQSHWKAHKPSCN--PFSGKNTVT 246

Query: 64  L 64
           L
Sbjct: 247 L 247


>gi|290975218|ref|XP_002670340.1| predicted protein [Naegleria gruberi]
 gi|284083898|gb|EFC37596.1| predicted protein [Naegleria gruberi]
          Length = 700

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 5/45 (11%)

Query: 14  RCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           +CA C +T      L +CS C++V YC    QK HW  HK  CKK
Sbjct: 652 KCANCNKTGTKSAPLKRCSGCQKVFYCSGNCQKTHWSSHKTACKK 696


>gi|301107195|ref|XP_002902680.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262098554|gb|EEY56606.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 337

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 24/39 (61%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           L KC+ C++  YC +  QKE WKLHK  C K   EI +S
Sbjct: 293 LSKCARCRDAVYCSRDCQKEAWKLHKRLCGKTTEEITAS 331


>gi|395330558|gb|EJF62941.1| hypothetical protein DICSQDRAFT_83701 [Dichomitus squalens LYAD-421
           SS1]
          Length = 398

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 20/36 (55%)

Query: 18  CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
            R     +CS CKEV YC    QKE W  HK KCK+
Sbjct: 320 VRAAEFKRCSGCKEVIYCSPSCQKEDWPAHKKKCKE 355


>gi|440794150|gb|ELR15321.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 974

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 14  RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
           RCA+C R   L  CS C+ V YCG +HQ  HW++ H+  C+  P+ + S 
Sbjct: 887 RCAICSRVDQLRVCSGCRLVHYCGPEHQNLHWRVSHRGSCEG-PFRLLSG 935


>gi|189199084|ref|XP_001935879.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187982978|gb|EDU48466.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 276

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 232 ICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNA-----KHSNMMQSKLF--VFRDTH 282
             G+ + N +  G  T+   +Y+ + L+ H+C+PN+     +H NM        +     
Sbjct: 38  FSGIFQTNALPCGSETKLYAVYATVSLINHDCMPNSSPNWNRHLNMQTIHALCDIKAGEE 97

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           I+  YTNA+  TQ R+ +L     F+C+C  CS
Sbjct: 98  ITVCYTNAV-TTQERQVYLKQHFGFDCACSICS 129


>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
           rotundata]
          Length = 438

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 68/156 (43%), Gaps = 18/156 (11%)

Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWIC 233
           +  K++  KD+ +H S      +  E    +   L   FL    GD     SA+ L  I 
Sbjct: 107 TKTKFRRFKDLMSHYSDIKKDEKKMEHFMFLCGVLFG-FL----GDTPMPNSAE-LMGIY 160

Query: 234 GVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRD------T 281
           G I +N      +D+     G+Y    +++H+C+PNA  +   +   +    D      +
Sbjct: 161 GRICINSFNIFDLDMNSIGVGIYLAPSVVDHSCVPNAVATFEGITLNIRTIEDLPSLDWS 220

Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
            I   Y + L  T+ RR  L  + YF C+C++C +P
Sbjct: 221 QIRISYIDVLKTTKERRSELQSSYYFWCNCKKCEEP 256


>gi|339236015|ref|XP_003379562.1| Egl nine protein [Trichinella spiralis]
 gi|316977745|gb|EFV60808.1| Egl nine protein [Trichinella spiralis]
          Length = 395

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/54 (31%), Positives = 30/54 (55%), Gaps = 5/54 (9%)

Query: 6  EIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          ++ +  ++ C+ C + +     L KCS C +V YC + HQ++HW  HK  C  +
Sbjct: 4  QMVNRSVLLCSFCGKPSSDNVPLFKCSGCLKVCYCNRLHQQQHWNDHKKHCDNM 57


>gi|449686514|ref|XP_002164124.2| PREDICTED: ankyrin repeat and MYND domain-containing protein
           2-like, partial [Hydra magnipapillata]
          Length = 317

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 2/58 (3%)

Query: 3   SIIEIEDEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           SI  ++  +   C  C     + +KCS+CK V YC +  QK HW  HK  C++L  E 
Sbjct: 227 SINGLQSADFTECCSCCGNRRSFNKCSSCKMVKYCNQSCQKLHWGTHKKFCERLKEEF 284


>gi|397630366|gb|EJK69745.1| hypothetical protein THAOC_08966 [Thalassiosira oceanica]
          Length = 716

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+ L+
Sbjct: 30 LKSCTACRLVKYCGVDCQRAHRKQHKKTCKQRAAELKDEELYGQGLE 76


>gi|307213998|gb|EFN89205.1| hypothetical protein EAI_07708 [Harpegnathos saltator]
          Length = 128

 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 4/73 (5%)

Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESP 85
          C  CK ++YCG++H++ H+  H   C  +   +KS P    Y  ATL+L+  D I     
Sbjct: 3  CDQCKMISYCGEKHKQMHYTQHMEFCAVIQKLLKSYP----YFWATLELNLEDWIQSRKE 58

Query: 86 LIVGPKLALAEPI 98
          L+   K  L+  +
Sbjct: 59 LVHLTKQELSRAL 71


>gi|440792097|gb|ELR13325.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 480

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 31/67 (46%), Gaps = 14/67 (20%)

Query: 7   IEDEELMRCAVCRETALHK------------CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           + D  + RCA+C +    +            CS C+EV YC    Q+EHW  HKP C   
Sbjct: 407 LPDAGMQRCAMCAKQVKGRNTDGKCGVQVKWCSRCREVGYCSPACQREHWPAHKPACA-- 464

Query: 55  PYEIKSS 61
           P  I +S
Sbjct: 465 PASITAS 471


>gi|302500465|ref|XP_003012226.1| hypothetical protein ARB_01486 [Arthroderma benhamiae CBS 112371]
 gi|291175783|gb|EFE31586.1| hypothetical protein ARB_01486 [Arthroderma benhamiae CBS 112371]
          Length = 1480

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)

Query: 14   RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            +C VC       C  CK+VAYCG +HQ+  W  HK  CK
Sbjct: 1438 KCVVCSAPG-KLCLGCKKVAYCGTEHQRLDWANHKATCK 1475


>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
           aries]
          Length = 428

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  IAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMQETPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++++K      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLALTFQHFMREEIQDASQLPPSFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD      ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDVEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F+C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFDCDCFRC 266


>gi|321474405|gb|EFX85370.1| hypothetical protein DAPPUDRAFT_237975 [Daphnia pulex]
          Length = 842

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)

Query: 15  CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CAVC++    L +CS C+   YC  + Q  HW +HK  CKKL
Sbjct: 801 CAVCKKIPEKLKRCSRCRSTFYCSVECQHSHWPMHKILCKKL 842


>gi|432113963|gb|ELK36028.1| SET and MYND domain-containing protein 3 [Myotis davidii]
          Length = 239

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 16  AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK-----KLPYEIKSSPLLGRYLQA 70
           AV R+  L +CS C+   YCG + QK+ W  HK +CK     K  Y   S  LLGR +  
Sbjct: 51  AVRRKEKLMRCSQCRVAKYCGAKCQKKAWPDHKRECKCLKSCKPRYPPDSVRLLGRVVVK 110

Query: 71  TLDLHPGD 78
            ++  P +
Sbjct: 111 LMEETPSE 118


>gi|307214000|gb|EFN89207.1| hypothetical protein EAI_07710 [Harpegnathos saltator]
          Length = 149

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 4/73 (5%)

Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESP 85
          C  CK ++YCG++H++ H+  H   C  +   +KS P    Y  ATL+L+  D I     
Sbjct: 1  CDQCKMISYCGEKHKQMHYTQHMEFCAVIQKLLKSYP----YFWATLELNLEDWIQSRKE 56

Query: 86 LIVGPKLALAEPI 98
          L+   K  L+  +
Sbjct: 57 LVHLTKQELSRAL 69


>gi|346324100|gb|EGX93697.1| Zinc finger domain-containing protein, MYND-type [Cordyceps
          militaris CM01]
          Length = 429

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 14 RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC+ C  R   L +C  C  V YC ++HQ  HW  HKP C ++
Sbjct: 18 RCSHCQARAAKLLRCVGCSAVRYCNREHQTAHWPHHKPLCVQI 60


>gi|356498052|ref|XP_003517868.1| PREDICTED: uncharacterized protein LOC100808017 [Glycine max]
          Length = 380

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 26/46 (56%)

Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          A   C+ C+ VAYC   HQ  HW  HK +C +L  ++KS  +L  +
Sbjct: 20 ATRLCARCEAVAYCSLSHQIAHWSRHKHECDRLQQQLKSVEVLNNF 65


>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
 gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
 gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
          Length = 433

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
                C    +   ++      ++  + LM H+C PN   +     +++   ++ H    
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 235

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
          Length = 395

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 23  GLSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 79

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 80  IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVL 137

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
                C    +   ++      ++  + LM H+C PN   +     +++   ++ H    
Sbjct: 138 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 197

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 198 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 230


>gi|345330186|ref|XP_001505544.2| PREDICTED: annexin A7-like [Ornithorhynchus anatinus]
          Length = 584

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 22  ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +LH+C  C+ V YCG + Q+ +W +HK  CKKL
Sbjct: 171 SLHRCKRCQNVYYCGPECQRANWPVHKRFCKKL 203


>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
          Length = 477

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 68/278 (24%), Positives = 106/278 (38%), Gaps = 39/278 (14%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
           E+ +S   GR L+AT +    D I  E         +L   +C  C K          RC
Sbjct: 10  EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQERLHRC 65

Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
            +C F   C   C    DA   H  EC+ +K        Y     E I L  R +    R
Sbjct: 66  GQCKFAHYCDRTCQ--KDAWLNHKNECSAIK-------RYGKVPNENIRLAARIMWRVER 116

Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
                 E     + D++ H+   G      EE    ++     FL+  P       S + 
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 169

Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
           +  I GVI  NG  +   R  Q    G++  + L+ H+C PN     ++  ++ +    +
Sbjct: 170 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 229

Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
                ++  Y + L  ++ RR+ L    YF+CSCE C 
Sbjct: 230 SEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 267


>gi|242002976|ref|XP_002422563.1| ankyrin repeat and mynd domain-containing protein, putative
           [Pediculus humanus corporis]
 gi|212505353|gb|EEB09825.1| ankyrin repeat and mynd domain-containing protein, putative
           [Pediculus humanus corporis]
          Length = 331

 Score = 41.2 bits (95), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 1/44 (2%)

Query: 9   DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           D ++  CA C  E    KCS CK V YC K  QK HW +HK  C
Sbjct: 250 DSDVDSCATCGEEKRSKKCSRCKSVQYCDKDCQKYHWFVHKKVC 293


>gi|440490913|gb|ELQ70409.1| hypothetical protein OOW_P131scaffold00034g31 [Magnaporthe oryzae
          P131]
          Length = 462

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          CAVC E A L  C+ C+ V YCG++HQ++H   HK  C  +
Sbjct: 12 CAVCGEKAGLLFCTGCRVVHYCGRKHQEDHRPAHKSACNAI 52


>gi|357613394|gb|EHJ68480.1| pcdc2/rp-8 [Danaus plexippus]
          Length = 351

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/40 (47%), Positives = 23/40 (57%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C +C   A   CS CK+V YC +QHQ   WK HK +C  L
Sbjct: 131 CYICGARAPSHCSKCKKVYYCSRQHQILDWKNHKIECPNL 170


>gi|440794148|gb|ELR15319.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 715

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 3/50 (6%)

Query: 14  RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
           RCA+C R   L  CS C+ V YCG +HQ  HW++ H+  C+  P+ + S 
Sbjct: 628 RCAICSRVDQLRVCSGCRLVHYCGPEHQNLHWRVSHRGSCEG-PFRLLSG 676


>gi|397615238|gb|EJK63308.1| hypothetical protein THAOC_16041 [Thalassiosira oceanica]
          Length = 397

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 26/44 (59%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L KC+AC+ V YC  + QK H K HK  CKK   E++   L G+
Sbjct: 27 LKKCNACQLVKYCSVECQKAHRKQHKKACKKRSAELRDETLFGQ 70


>gi|321460587|gb|EFX71628.1| hypothetical protein DAPPUDRAFT_255583 [Daphnia pulex]
          Length = 981

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)

Query: 15  CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CA C++ +  L +CS C+   YC ++ Q++HW  HK  CKK+
Sbjct: 940 CAACKKVSQSLKRCSRCRSTVYCCEECQRQHWAQHKMDCKKV 981


>gi|390596497|gb|EIN05899.1| hypothetical protein PUNSTDRAFT_145782 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 665

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 21/35 (60%)

Query: 18  CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           C +  L  CS C+ V YC K+ Q +HWK+H   CK
Sbjct: 100 CNKEGLFSCSTCRLVKYCSKECQAKHWKIHSRDCK 134


>gi|389742410|gb|EIM83597.1| hypothetical protein STEHIDRAFT_141295 [Stereum hirsutum FP-91666
           SS1]
          Length = 678

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 5/48 (10%)

Query: 15  CAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           C VC +T     +L +CS CK   YC  + QK  WK HK  C  LP E
Sbjct: 453 CQVCSKTPEGDESLKRCSICKNRFYCSSRCQKRDWKKHKVDCSLLPSE 500


>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           terrestris]
          Length = 635

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 110/320 (34%), Gaps = 70/320 (21%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
           I  S   GR+L AT +  PGD +  E+P      +   +     CH  L+   NL   + 
Sbjct: 246 ISFSEKYGRHLVATKEFKPGDVVTIENPYAY---VIYTQRYYTHCHHCLSRSYNLIPCSY 302

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
           CP   +  CS +C  L+    H  EC I+ L    L    D        +R LI+     
Sbjct: 303 CPVAQY--CSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTKIIRFLIVVTSKG 360

Query: 176 KKYQELK-DMEAHMS---KRGPGTEVYEEIDS---------------------------- 203
           KK+ EL+ DME   S    R  G      +DS                            
Sbjct: 361 KKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPLIGISAFACIS 420

Query: 204 ----IVKYLRSNFL-EKLPGDVLDDT--------SAKCLHWICGVIEVN--------GVD 242
               I+   ++NF   K   D L D         S   +   C ++  N        G+ 
Sbjct: 421 ALAAILLATQTNFFCNKYEVDQLKDISNYPKIIFSGSLMFRACVIMSSNCFSVQQEPGIK 480

Query: 243 IGRYTQGLYSVICLMEHNCLPNA----KHSNMMQSKLF-VFRDTHISTMY--TNALWGTQ 295
            G    GLY    L  H+C PN     +   M+   L  ++    I T Y    A     
Sbjct: 481 TG---SGLYVTHSLYNHSCAPNTFRHFEELTMITRALRPIYPGDQIFTNYGAAYAYMTKS 537

Query: 296 PRREHLAITKYFNCSCERCS 315
            RRE +    +F C C  C+
Sbjct: 538 ERREKIIQDYFFECDCIACA 557


>gi|296083782|emb|CBI23999.3| unnamed protein product [Vitis vinifera]
          Length = 892

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
          C VC      KCS CK V YC +  Q  HWK  HK KCK      K++
Sbjct: 11 CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 58


>gi|321449733|gb|EFX62041.1| hypothetical protein DAPPUDRAFT_302174 [Daphnia pulex]
          Length = 211

 Score = 41.2 bits (95), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)

Query: 15  CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CA C++ +  L +CS C+   YC ++ Q++HW  HK  CKK+
Sbjct: 170 CAACKKVSQSLKRCSRCRSTVYCCEECQRQHWAQHKMDCKKV 211


>gi|397629349|gb|EJK69320.1| hypothetical protein THAOC_09432 [Thalassiosira oceanica]
          Length = 872

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
          L  C AC+ V YC    Q+ HWK HK  CKKL  E++
Sbjct: 24 LKICKACRSVKYCSVGCQRAHWKQHKAACKKLAKELE 60


>gi|326430459|gb|EGD76029.1| hypothetical protein PTSG_00738 [Salpingoeca sp. ATCC 50818]
          Length = 1663

 Score = 41.2 bits (95), Expect = 1.4,   Method: Composition-based stats.
 Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 6/47 (12%)

Query: 12   LMRCAVCRETALH------KCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            L +C  C   A H      +C+ CK+V YC    Q+ HW  HK  C+
Sbjct: 1206 LAKCHACATLAEHATGALKRCARCKQVCYCSTACQQSHWPQHKSSCR 1252


>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
 gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
          Length = 433

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDSDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
                C    +   ++      ++  + LM H+C PN   +     +++   ++ H    
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSXIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 235

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|397624311|gb|EJK67343.1| hypothetical protein THAOC_11642 [Thalassiosira oceanica]
          Length = 1256

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23   LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
            L  C+AC+ V YCG   QK H KLH+  CK+   E+K   L  + L+
Sbjct: 982  LKDCTACRLVKYCGVDCQKAHRKLHREACKQRAAELKDEQLYSQGLE 1028


>gi|397636925|gb|EJK72461.1| hypothetical protein THAOC_06008, partial [Thalassiosira
          oceanica]
          Length = 328

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 5/68 (7%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+  + + D+    CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 24 MSCFVPVADDGYEACANCGKQGSDTVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 83

Query: 56 YEIKSSPL 63
           E+K   L
Sbjct: 84 AELKDEQL 91


>gi|397625914|gb|EJK67961.1| hypothetical protein THAOC_10918, partial [Thalassiosira oceanica]
          Length = 383

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 3   SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S I ++D     CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 80  SCIAVDDGSAGVCANCGKHGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 139

Query: 58  IKSSPLLGRYLQ 69
           +K   L  + L+
Sbjct: 140 LKDERLYSQGLE 151


>gi|397620610|gb|EJK65805.1| hypothetical protein THAOC_13300 [Thalassiosira oceanica]
          Length = 876

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 5/71 (7%)

Query: 1   MASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M S   + D+ +  CA C +       L  C+AC+ V YCG   Q+ H K HK  C +  
Sbjct: 505 MMSCAPVADDGVEACANCGKQGSGTVTLKNCTACRLVKYCGVDCQRAHRKQHKRACNQRV 564

Query: 56  YEIKSSPLLGR 66
            E+K+  L G+
Sbjct: 565 EELKNEQLYGQ 575


>gi|392562230|gb|EIW55411.1| hypothetical protein TRAVEDRAFT_23090 [Trametes versicolor FP-101664
            SS1]
          Length = 1167

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D    +C+VCR   +  L  CS C +V YC +  QK+ W  HK KCK
Sbjct: 1119 RDPAAHKCSVCRGRGQPKLKACSGCSKVRYCSQACQKKDWPRHKSKCK 1166


>gi|390599613|gb|EIN09009.1| hypothetical protein PUNSTDRAFT_143643 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 729

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 2/44 (4%)

Query: 12  LMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           L  C+ C+  +  L KC  C    YC    QK+HWKLHK  C++
Sbjct: 684 LYECSYCQNASAVLKKCGGCGNTRYCNSSCQKKHWKLHKASCRR 727


>gi|195455340|ref|XP_002074677.1| GK23035 [Drosophila willistoni]
 gi|194170762|gb|EDW85663.1| GK23035 [Drosophila willistoni]
          Length = 348

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 20/37 (54%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C   A H CS CK V YC   HQ+ HW  HK  C
Sbjct: 128 CAACGCHAPHACSRCKAVHYCSSAHQRVHWTQHKATC 164


>gi|170093361|ref|XP_001877902.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
          [Laccaria bicolor S238N-H82]
 gi|164647761|gb|EDR12005.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
          [Laccaria bicolor S238N-H82]
          Length = 227

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 3/54 (5%)

Query: 14 RCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL-PYEIKSSPLL 64
          RC+ C+     ++ CS CK   YC K+ QK HW++HKP C+   P E+    LL
Sbjct: 5  RCSKCQFGHNEVNACSKCKTAWYCSKECQKAHWEIHKPLCRPYSPNEVWGIKLL 58


>gi|397582294|gb|EJK52236.1| hypothetical protein THAOC_28513, partial [Thalassiosira
          oceanica]
          Length = 69

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+  + + D++   CA C   A     L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 1  MSCFVPVADDDNEVCANCGTAASDSVKLRDCAACRLVKYCGVDCQRAHRKQHKKVCKQRV 60

Query: 56 YEIKSSPL 63
           E+K   L
Sbjct: 61 AELKDEEL 68


>gi|302692188|ref|XP_003035773.1| hypothetical protein SCHCODRAFT_105219 [Schizophyllum commune
          H4-8]
 gi|300109469|gb|EFJ00871.1| hypothetical protein SCHCODRAFT_105219, partial [Schizophyllum
          commune H4-8]
          Length = 213

 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 28/56 (50%)

Query: 8  EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          + +E   CA+C   A  +CS C +  YC K+HQ   W  HK  CK    + +  P+
Sbjct: 5  DTQERRCCAICSYPAPSQCSGCGQAFYCSKEHQTLAWSKHKRLCKIYQRQARGEPV 60


>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
          Length = 403

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 9/107 (8%)

Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRR 298
           Y  G++     + H+CLPNA   N  Q  + VFR          I T YT+       RR
Sbjct: 138 YGLGIFPTGSYLNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERR 196

Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
           +HL    +F C C++C    +L    S + C N+       + +P N
Sbjct: 197 KHLLKQYFFFCQCQQCKFQKKLADQ-SCLSCKNQLSSSNTSFKIPSN 242


>gi|397582835|gb|EJK52436.1| hypothetical protein THAOC_28284 [Thalassiosira oceanica]
          Length = 377

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 7/70 (10%)

Query: 1  MASIIEIEDEELMRCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          M+  I ++D     CA C + +       L  C+AC+ V YCG   Q+ H K HK  CK+
Sbjct: 1  MSRCIAVDDGSAGVCANCGKGSDAAGTVKLRNCNACRLVKYCGVDCQRAHRKQHKKACKQ 60

Query: 54 LPYEIKSSPL 63
             E+K   L
Sbjct: 61 RVAELKDEQL 70


>gi|449435374|ref|XP_004135470.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
          [Cucumis sativus]
 gi|449478708|ref|XP_004155398.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
          [Cucumis sativus]
          Length = 376

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 27/54 (50%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          R   C  +A  +C  C  VAYC   HQ +HW  HK +CK+   +++    L  +
Sbjct: 8  RGTRCTGSATRRCGRCGVVAYCSVDHQVKHWNDHKDECKRFELQMERIDALNEF 61


>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
           latipes]
          Length = 495

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 5/99 (5%)

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTH----ISTMY 287
           C    ++  ++     GLY  + L+ H+C PN     N  + +L   +D +    +   Y
Sbjct: 243 CNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFNGTELQLRAVQDINPGEELRISY 302

Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
              L  T+ R++ L    +F C C+RCS   + G   S 
Sbjct: 303 IETLSLTEDRQKQLEEQYHFTCHCQRCSSKEQDGVMLSG 341


>gi|397563809|gb|EJK43951.1| hypothetical protein THAOC_37556 [Thalassiosira oceanica]
          Length = 299

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 41/86 (47%), Gaps = 7/86 (8%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR-YLQATLDLHPGDRIA 81
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  + Y +  +D  P     
Sbjct: 27  LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSQGYERPEVDFCP--ICT 84

Query: 82  RESPLIVGPKLALA----EPICLGCH 103
              PLI+G     A    + IC GC+
Sbjct: 85  LPIPLIMGEHSVFAVCCMKQICNGCN 110


>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
          Length = 536

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 11/213 (5%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 164 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 220

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 221 IYPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 278

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTH 282
                C    +   ++      ++  + LM H+C PN   +       +++   +     
Sbjct: 279 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKTGEE 338

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 339 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 371


>gi|440475247|gb|ELQ43941.1| hypothetical protein OOU_Y34scaffold00119g4 [Magnaporthe oryzae
          Y34]
          Length = 459

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)

Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          CAVC E A L  C+ C+ V YCG++HQ++H   HK  C  +
Sbjct: 12 CAVCGEKAGLLFCTGCRVVHYCGRKHQEDHRPAHKSACNAI 52


>gi|397597649|gb|EJK57054.1| hypothetical protein THAOC_22942 [Thalassiosira oceanica]
          Length = 270

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 5/54 (9%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          CA C ET      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L
Sbjct: 13 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKEEQL 66


>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
          harrisii]
          Length = 428

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 26/39 (66%), Gaps = 1/39 (2%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          RC +CR+  L +CS CK   YCG + QK+ W+ HK +CK
Sbjct: 51 RC-LCRKEKLLRCSQCKVARYCGSECQKQAWQDHKRECK 88


>gi|321462662|gb|EFX73684.1| hypothetical protein DAPPUDRAFT_307653 [Daphnia pulex]
          Length = 1021

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 28/52 (53%), Gaps = 4/52 (7%)

Query: 7    IEDEELMR--CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            IEDE      CAVC++    L +CS C+   YC  + Q  HW  HK  CKKL
Sbjct: 970  IEDEPADEKCCAVCKKIPEKLKRCSRCRSAVYCSVECQHSHWPTHKHICKKL 1021


>gi|307189954|gb|EFN74190.1| Ankyrin repeat and MYND domain-containing protein 2 [Camponotus
           floridanus]
          Length = 405

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L   + +S
Sbjct: 333 CVTCGEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARLDPSLANS 380


>gi|326673176|ref|XP_002664281.2| PREDICTED: egl nine homolog 1-like [Danio rerio]
          Length = 328

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 9  DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          D +   C +C +   L KC  C+   YC K+HQKEHWK HK  C++
Sbjct: 11 DGDKQYCELCGKMENLLKCGRCRISFYCSKEHQKEHWKKHKLTCRE 56


>gi|322800676|gb|EFZ21610.1| hypothetical protein SINV_05901 [Solenopsis invicta]
          Length = 407

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L
Sbjct: 335 CVTCSEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375


>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 645

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 111/285 (38%), Gaps = 55/285 (19%)

Query: 203 SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD-IGRYTQGLYSV-------- 253
           S+V+YL  + LE+L    + +TS             + VD I R+T   +++        
Sbjct: 242 SLVRYLGLSSLEELAAFGMQNTS-------------DLVDLISRFTTNTFTITSPTLAPL 288

Query: 254 -------ICLMEHNCLPNA---------KHSNMMQ--SKLFVFRDTHISTMYTNALWGTQ 295
                  + L+ H+C PNA         +   +MQ  +  ++  D  I T Y +    T 
Sbjct: 289 GACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQVVALKYIGPDEEILTAYIDTTLPTG 348

Query: 296 PRREHLAITKYFNCSCERCS-DPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT 354
            R++ L  T +F C C  CS  PT    +  AM C    K  G C +    P + +S   
Sbjct: 349 LRQQALKETYHFVCECPLCSPPPTSSVDWREAMWC--PKKCGGVCRV----PTEGNSLTR 402

Query: 355 CGSCSARLNARDVHLVTSQLG-EQVDKL--VQENPNVKSLEEMLTKLEAMFHPHHYHCYA 411
           C  C A +   D  L   ++G E +DK   VQ +   K+L+     +  +          
Sbjct: 403 CEKCKAPVKDTDAVLDALRVGQEGLDKAEKVQFSDPAKALQLTTNLIPILI---SAGLVP 459

Query: 412 VKHSLIQLYGTQPGYAYTQLSSSLLERKIS--YVMSPNLKATDEP 454
             H L+ L      +    L SS L    S   + SP ++A   P
Sbjct: 460 ASHPLLGLSRLHTSFLIANLGSSPLSAAGSDEEIQSPQVQAEATP 504


>gi|432910654|ref|XP_004078459.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Oryzias latipes]
          Length = 415

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E    K CS CK V YC +  QK HW  HK  CKKL
Sbjct: 320 CTTCGEKGAQKRCSICKTVIYCDQTCQKMHWFTHKKVCKKL 360


>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
 gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
 gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
          Length = 428

 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTALRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGTPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|397574117|gb|EJK49043.1| hypothetical protein THAOC_32119, partial [Thalassiosira oceanica]
          Length = 1069

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   Q+ H K+HK  CK+   EIK   L  + L+
Sbjct: 415 LKNCNACRLVKYCGVDCQRAHRKVHKKACKQRAAEIKDEQLYSQGLK 461


>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
           troglodytes]
 gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
 gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
 gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
          Length = 428

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|397567134|gb|EJK45413.1| hypothetical protein THAOC_35971 [Thalassiosira oceanica]
          Length = 262

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 23/41 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          L  C+AC+ V YCG   QK H K HK  CKK   EIK   L
Sbjct: 37 LKSCTACRLVKYCGVDCQKAHRKQHKKACKKRAAEIKDEQL 77


>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
 gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|397588711|gb|EJK54371.1| hypothetical protein THAOC_26007, partial [Thalassiosira
          oceanica]
          Length = 284

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L  + L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYSQGLE 75


>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
          Length = 477

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C  T  L KCS C+ V YCG   QK  WKLH+ +C+ L
Sbjct: 51 RCDGCFSTTNLSKCSRCQVVWYCGTPCQKSEWKLHRLECQAL 92



 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 56/248 (22%), Positives = 91/248 (36%), Gaps = 63/248 (25%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR L  T D HPGD I  + P +  P     +  C GC      NL+  +RC   ++  C
Sbjct: 22  GRSLFTTRDFHPGDVIISQEPYVCVP----TQKRCDGCFS--TTNLSKCSRCQVVWY--C 73

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
              C           E  + +L C+ L   +  K +++ P   L+L+    +K Q+ K +
Sbjct: 74  GTPC--------QKSEWKLHRLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKII 125

Query: 185 EAH---------------------MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGD--- 219
            +                      +S R    E     + S+V  L + +  +   D   
Sbjct: 126 PSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKE 185

Query: 220 ---VLDDTSAKCLHWICGVIEVNGVDIGRYTQ-------------------GLYSVICLM 257
              VL    A  +H I    E+N  +I  +                     GLY V+ ++
Sbjct: 186 EQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSII 245

Query: 258 EHNCLPNA 265
            H+CLPN+
Sbjct: 246 NHSCLPNS 253


>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
 gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
           Sinefungin Bound
          Length = 438

 Score = 40.8 bits (94), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 17  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 68

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 69  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 126

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 127 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 181

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 182 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 241

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 242 LDMLMTSEERRKQLRDQYCFECDCFRC 268


>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
 gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
 gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
          Length = 494

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQ--SKLFVFRDT----HISTMYTNALWGTQPRREHLA 302
           GL+ +  L+ H+C PN   SN     +++   RD      I+  Y +       RR+ L 
Sbjct: 210 GLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKELR 269

Query: 303 ITKYFNCSCERCSDP 317
            TK+F+C C RC+ P
Sbjct: 270 ETKHFDCQCGRCAAP 284


>gi|334183803|ref|NP_177172.6| zinc ion binding protein [Arabidopsis thaliana]
 gi|332196904|gb|AEE35025.1| zinc ion binding protein [Arabidopsis thaliana]
          Length = 374

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 25/50 (50%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C      +C  C  VAYC   HQ  HW  HK +C++L  +++   LL  +
Sbjct: 12 CVGPPTRRCGQCGAVAYCSVSHQISHWSYHKEECERLEEQMRRVDLLNDF 61


>gi|195996589|ref|XP_002108163.1| hypothetical protein TRIADDRAFT_52356 [Trichoplax adhaerens]
 gi|190588939|gb|EDV28961.1| hypothetical protein TRIADDRAFT_52356 [Trichoplax adhaerens]
          Length = 389

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/45 (48%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           C  C E  A  KCSACK+V YC ++ QK HW  HK  CK    E 
Sbjct: 308 CYTCGEKRANMKCSACKKVKYCDQRCQKFHWFTHKKFCKYFAEEF 352


>gi|397623675|gb|EJK67098.1| hypothetical protein THAOC_11912, partial [Thalassiosira
          oceanica]
          Length = 330

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 6/70 (8%)

Query: 1  MASIIEIEDEELMRCAVCRETA------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          M   I ++D     CA C + +      L  C+AC+ V YCG   QK H K HK  CK+ 
Sbjct: 29 MNRCIAVDDGSTDVCANCGKGSEGGAIKLKNCTACRLVKYCGVDCQKAHRKQHKKACKQR 88

Query: 55 PYEIKSSPLL 64
            E+K   L 
Sbjct: 89 AAELKDERLF 98


>gi|397571653|gb|EJK47895.1| hypothetical protein THAOC_33352, partial [Thalassiosira oceanica]
          Length = 597

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 5/69 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+    CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 224 SCVPVVDDGAEVCANCGKQGSDAVKLKNCTACRLVKYCGVDCQRAHRKQHKQACKQRAAE 283

Query: 58  IKSSPLLGR 66
           +K   L G+
Sbjct: 284 LKDEQLYGQ 292


>gi|390599621|gb|EIN09017.1| hypothetical protein PUNSTDRAFT_102560 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 675

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 12  LMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           L RC+ C     AL KC  C++  YC    QK+HWK HK  C   P
Sbjct: 630 LYRCSFCGNPSAALKKCGRCEKTRYCDASCQKKHWKAHKTDCHTSP 675


>gi|389745836|gb|EIM87016.1| hypothetical protein STEHIDRAFT_95997 [Stereum hirsutum FP-91666
           SS1]
          Length = 286

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/43 (53%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPY 56
           C VC E A   CS CK V YCGK  Q + WK  HK KC K  Y
Sbjct: 244 CPVCMEEAELLCSKCKTVRYCGKVCQTKDWKAGHKTKCFKALY 286


>gi|357117231|ref|XP_003560376.1| PREDICTED: uncharacterized protein LOC100846465 [Brachypodium
           distachyon]
          Length = 1046

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           E   CA C   ++ +CS CK + YC  + Q +HW+  HK KCK++
Sbjct: 101 ETAPCAECGRASIKRCSGCKRMRYCSGECQSKHWQSDHKFKCKQM 145


>gi|397581200|gb|EJK51849.1| hypothetical protein THAOC_28941, partial [Thalassiosira oceanica]
          Length = 174

 Score = 40.8 bits (94), Expect = 1.7,   Method: Composition-based stats.
 Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 5/61 (8%)

Query: 11  ELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLG 65
           E+  CA C + A     L  C+AC+ V YC    QK H K HK  CK+   E+K   L  
Sbjct: 103 EIEACANCGKEASDAVKLKNCTACRLVKYCSVDCQKTHRKKHKKACKQRAAELKDERLFS 162

Query: 66  R 66
           +
Sbjct: 163 Q 163


>gi|299472616|emb|CBN78268.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1372

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 13/27 (48%), Positives = 22/27 (81%)

Query: 25   KCSACKEVAYCGKQHQKEHWKLHKPKC 51
            +C+ C++V+YC K+ Q++HWK+HK  C
Sbjct: 1345 QCTRCRKVSYCSKECQRQHWKIHKAAC 1371


>gi|157125230|ref|XP_001654253.1| hypothetical protein AaeL_AAEL010138 [Aedes aegypti]
 gi|108873686|gb|EAT37911.1| AAEL010138-PA [Aedes aegypti]
          Length = 445

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 11/52 (21%)

Query: 14 RCAVCRETALHK-----------CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC +C   A  +           C  C+ + YCG+QHQK+ WK HK  C+ +
Sbjct: 13 RCNICFTRATDRRLALADQKVRFCKGCRLIGYCGEQHQKDDWKNHKDFCRAV 64


>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
          Length = 455

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 67/291 (23%), Positives = 100/291 (34%), Gaps = 62/291 (21%)

Query: 115 RCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYEAI 162
           RC RCF      +CSG                  H  EC  LKL   TL          +
Sbjct: 30  RCDRCFKLGKVLKCSGCLYVRYCNRACQQEAWPDHQAECEKLKLLPATL----------V 79

Query: 163 LPLRCLILQR---------------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKY 207
           +P   L+L R                + K Y+   D+  H        +  E   ++   
Sbjct: 80  VPSAALMLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIRKDVKRIEHFHTLNVV 139

Query: 208 LRSNFLEKLPGDVLDDTSAKCLHWIC-GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAK 266
           L+    E  P     D   +    +C     V   ++     G+Y    +++H+C PNA 
Sbjct: 140 LQRLLDE--PAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGASILDHSCRPNAV 197

Query: 267 HSNM-MQSKLFVFRD--------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
            + +  Q +L +  D        + I   Y + +  +  RRE L+   YF C C RC D 
Sbjct: 198 ATFVGEQLQLRLLEDFAGPELDFSRIFISYIDLIDPSDTRREQLSERYYFRCECVRCRDE 257

Query: 318 TELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLN--ARD 366
            E      A  C N   D+      P+     D    C +C A  +  ARD
Sbjct: 258 AE-RELMGAAACQNRKCDE------PI----RDGQTLCSACEAPFDQSARD 297


>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
          Length = 716

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)

Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
           R  Q +YSV  L  H+C PN  H+  +   LF+    H++      L        W  + 
Sbjct: 457 RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 515

Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
           R++ L     F C C  CS+        +A +C+N
Sbjct: 516 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 550


>gi|225424795|ref|XP_002267535.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
          [Vitis vinifera]
          Length = 369

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 25/50 (50%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C   A  +C  C  VAYC   HQ  HW  HK +C +L  ++K   +L  +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEECGRLEQQMKLVDVLNDF 61


>gi|397646650|gb|EJK77370.1| hypothetical protein THAOC_00804 [Thalassiosira oceanica]
          Length = 303

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          L  C+AC+ V YCG + Q+ H KLHK  CK+   E+K   L
Sbjct: 27 LKNCTACRLVKYCGVECQRAHRKLHKKACKQRVAELKDEQL 67


>gi|397573252|gb|EJK48616.1| hypothetical protein THAOC_32572 [Thalassiosira oceanica]
          Length = 475

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 5/54 (9%)

Query: 15  CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           CA C +T      L  C+AC+ V YCG   Q+ H K HK  CKK   E+K   L
Sbjct: 187 CANCGKTGSDTVKLKDCTACRLVKYCGVDCQRAHRKQHKKACKKRAAELKDERL 240


>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
           garnettii]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 59/269 (21%), Positives = 101/269 (37%), Gaps = 34/269 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A + L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVVALRPGELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLM--------RCSQCQ 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVIFKLMEETPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++ K      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNIKK------LTEDKKEGLRQLAMTFQHFMREEIQDVSQLPPSFDVFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD      ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCS 315
           Y + L  ++ RR+ L     F C C RC 
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCLRCQ 267


>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
          Length = 799

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)

Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
           R  Q +YSV  L  H+C PN  H+  +   LF+    H++      L        W  + 
Sbjct: 483 RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 541

Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
           R++ L     F C C  CS+        +A +C+N
Sbjct: 542 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 576


>gi|332028331|gb|EGI68378.1| Ankyrin repeat and MYND domain-containing protein 2 [Acromyrmex
           echinatior]
          Length = 407

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%), Gaps = 1/41 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L
Sbjct: 335 CVTCGEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375


>gi|397647767|gb|EJK77844.1| hypothetical protein THAOC_00296 [Thalassiosira oceanica]
          Length = 304

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 29 LKDCTACRLVKYCGVDCQRAHRKQHKKACKRRAAELKDEQLYGQ 72


>gi|393233651|gb|EJD41221.1| hypothetical protein AURDEDRAFT_138760 [Auricularia delicata
           TFB-10046 SS5]
          Length = 222

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 19/30 (63%)

Query: 19  RETALHKCSACKEVAYCGKQHQKEHWKLHK 48
           RE  L KCS C+   YC  +HQ+ HW LHK
Sbjct: 163 REENLKKCSKCETAKYCSTEHQRAHWSLHK 192


>gi|397601218|gb|EJK57846.1| hypothetical protein THAOC_22072 [Thalassiosira oceanica]
          Length = 636

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 3   SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S I ++D     CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 335 SCIAVDDGSAGVCANCGKHGSDTVKLKDCAACRLVKYCGVDCQRAHRKQHKKACKQRASE 394

Query: 58  IKSSPL 63
           +K   L
Sbjct: 395 LKDEKL 400


>gi|301106410|ref|XP_002902288.1| programmed cell death protein 2, putative [Phytophthora infestans
           T30-4]
 gi|262098908|gb|EEY56960.1| programmed cell death protein 2, putative [Phytophthora infestans
           T30-4]
          Length = 390

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKK 53
           CA+C + A   CSAC    YC K HQK+HW    HK  C +
Sbjct: 148 CALCGQRATFTCSACHVAQYCSKAHQKDHWTAGGHKQTCAQ 188


>gi|389738208|gb|EIM79409.1| hypothetical protein STEHIDRAFT_126513 [Stereum hirsutum FP-91666
          SS1]
          Length = 262

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%)

Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          E     C ACKEV YC K+ QK HW LHK  C+
Sbjct: 42 EEPYQACGACKEVIYCSKECQKAHWPLHKGPCR 74


>gi|296086488|emb|CBI32077.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 25/50 (50%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C   A  +C  C  VAYC   HQ  HW  HK +C +L  ++K   +L  +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEECGRLEQQMKLVDVLNDF 61


>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
           Methyltransferase Inhibitor Sinefungin
 gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Ii)
 gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
 gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
 gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
          Length = 428

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|323450761|gb|EGB06641.1| hypothetical protein AURANDRAFT_65492 [Aureococcus anophagefferens]
          Length = 478

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 5/48 (10%)

Query: 13  MRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           ++CA C E +     + +C+ CK  +YC +  QK+HW  HKP C  +P
Sbjct: 427 LKCAQCGEVSSVRVTVSRCARCKTASYCSRHCQKKHWLKHKPTCVPVP 474


>gi|299740056|ref|XP_001840441.2| hypothetical protein CC1G_12970 [Coprinopsis cinerea okayama7#130]
 gi|298404065|gb|EAU81374.2| hypothetical protein CC1G_12970 [Coprinopsis cinerea okayama7#130]
          Length = 788

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 5   IEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           + ++   L RC+ C     AL KCS C++  YC    QK HW  HK  CK
Sbjct: 724 VRMDRVSLYRCSWCGNPSAALQKCSGCQKARYCDSTCQKSHWPDHKKPCK 773


>gi|307102719|gb|EFN50988.1| hypothetical protein CHLNCDRAFT_55450 [Chlorella variabilis]
 gi|307105437|gb|EFN53686.1| hypothetical protein CHLNCDRAFT_53614 [Chlorella variabilis]
          Length = 272

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)

Query: 14  RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           +C+ C +    L  C+AC  V+YC    Q+ HWK HKP CK    E++ 
Sbjct: 220 QCSGCGKHSIGLRLCAACHSVSYCSPACQRSHWKQHKPNCKVAQKEMRG 268


>gi|397578508|gb|EJK50901.1| hypothetical protein THAOC_29984 [Thalassiosira oceanica]
          Length = 118

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+    CA C   A     L  C+AC+ V YCG   Q+ H K HK  CKK   E
Sbjct: 45  SCVPVVDDGAEVCANCGAMASDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKACKKRAAE 104

Query: 58  IKSSPL 63
           +K   L
Sbjct: 105 LKDEEL 110


>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
          Length = 433

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 82/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C   C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CDVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  E I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKELIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
           Zinc Finger Mynd Domain-Containing Protein 1
          Length = 429

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 16  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 67

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 68  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 125

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 126 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 180

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 181 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 240

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 241 LDMLMTSEERRKQLRDQYCFECDCFRC 267


>gi|299749521|ref|XP_001836166.2| hypothetical protein CC1G_10947 [Coprinopsis cinerea okayama7#130]
 gi|298408476|gb|EAU85675.2| hypothetical protein CC1G_10947 [Coprinopsis cinerea okayama7#130]
          Length = 450

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 1/40 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
           C    + AL +C  C++V YC  +HQK HW+ +HK +C +
Sbjct: 404 CGKPGDEALKQCGGCRQVLYCSPEHQKFHWRAMHKNQCSR 443


>gi|66357248|ref|XP_625802.1| SET domain protein [Cryptosporidium parvum Iowa II]
 gi|46226909|gb|EAK87875.1| SET domain protein [Cryptosporidium parvum Iowa II]
          Length = 583

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 42/192 (21%), Positives = 85/192 (44%), Gaps = 24/192 (12%)

Query: 250 LYSVICLMEHNCLPN------AKHSNMMQSK--LFVFRDTHISTMYTNALWGTQPRREHL 301
           LY+VI +M HNC  +        ++ ++++K  L V  +  IS +  + L+     R  L
Sbjct: 314 LYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEVGDEITISYISDDDLFKCSKTRREL 373

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT----CGS 357
                F C CERC++PT+L      +KC         C +  +   ++++  T    C  
Sbjct: 374 LSNWLFYCQCERCNNPTDLS---RGLKC-------ASCGVGSMFFKEDENGVTRSSRCSV 423

Query: 358 CSARLNARDVHLVTSQLGEQVDKLVQENPN-VKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
           C ++ +   +   T    + V +L + + N V+ +E +  + + +F   H+  Y +   L
Sbjct: 424 CRSQPDREIIDSYTDLESQYVARLEETDKNDVEDVEAVYQEAQKVF-TQHWIMYELDTML 482

Query: 417 IQLYGTQPGYAY 428
            + Y     +++
Sbjct: 483 FEAYKESQQFSF 494


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 12  LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           L RCA C +  T L  C  CK VAYCG+  Q  H+K HK  C+
Sbjct: 419 LARCANCGKASTQLSLCGRCKSVAYCGRPCQVAHYKGHKATCR 461


>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like
          [Meleagris gallopavo]
          Length = 427

 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 21/34 (61%)

Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          R   LH+CS CK   YCGK  QKE W  HK +CK
Sbjct: 55 RNEHLHRCSQCKVAKYCGKSCQKEGWLDHKRECK 88


>gi|398394643|ref|XP_003850780.1| hypothetical protein MYCGRDRAFT_110262 [Zymoseptoria tritici
           IPO323]
 gi|339470659|gb|EGP85756.1| hypothetical protein MYCGRDRAFT_110262 [Zymoseptoria tritici
           IPO323]
          Length = 422

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 14  RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +C+ C + A  L +C+ CK  +YC +  Q  HWK+HK  C KL
Sbjct: 195 KCSQCDKPASSLKRCAKCKSTSYCDRACQTAHWKVHKKICAKL 237


>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus
          gallus]
          Length = 428

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 21/34 (61%)

Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          R   LH+CS CK   YCGK  QKE W  HK +CK
Sbjct: 55 RNEHLHRCSQCKVAKYCGKSCQKEAWLDHKRECK 88


>gi|397570025|gb|EJK47111.1| hypothetical protein THAOC_34195, partial [Thalassiosira oceanica]
          Length = 1544

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 24/38 (63%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K+
Sbjct: 807 LKNCTACRLVKYCGADCQRAHRKLHKKACKQRAAELKA 844



 Score = 39.3 bits (90), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 23/42 (54%)

Query: 23   LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
            L  C+AC+ V YC    Q+ H KLHK  CK+   E+K   L 
Sbjct: 1255 LKNCTACRLVKYCSADCQRAHRKLHKKACKQRAAELKDEQLF 1296


>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
 gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
          Length = 508

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 6/88 (6%)

Query: 235 VIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYT 288
           ++++   +I +    +Y +   + HNC PN ++    +  L VF    I       T Y 
Sbjct: 331 LLDIGKFNINQLEGQIYHLYSFINHNCEPNIRYEIDSKLCLRVFARKPIQPGEELLTTYV 390

Query: 289 NALWGTQPRREHLAITKYFNCSCERCSD 316
           N L G + RR  L +   F C C RC +
Sbjct: 391 NPLHGVKLRRRALKVNWGFLCQCARCEN 418


>gi|195493207|ref|XP_002094318.1| GE21757 [Drosophila yakuba]
 gi|194180419|gb|EDW94030.1| GE21757 [Drosophila yakuba]
          Length = 403

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
           C+ C  E    KCS CK V YC ++ Q+ HW +HK  C +L  + ++ P
Sbjct: 333 CSTCGAEKPDKKCSKCKAVQYCDRECQRLHWFMHKKNCARLLAQSQTQP 381


>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
 gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
          Length = 377

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 14 RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC  C E  + L +CS CK   YCG   Q+  W++HK +C++L
Sbjct: 50 RCDFCLEKLSDLQRCSRCKFARYCGASCQRAAWRIHKSECERL 92


>gi|397611228|gb|EJK61234.1| hypothetical protein THAOC_18316 [Thalassiosira oceanica]
          Length = 282

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 47/105 (44%), Gaps = 9/105 (8%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR-YLQATLDLHPGDRIA 81
           L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L  + +++  +D  P   + 
Sbjct: 22  LKNCTACRFVKYCGVDCQRAHRKLHKKACKQRAAELKEEQLYSQGHVRPEVDFCPICTLP 81

Query: 82  RESPLIVGPKLAL----AEPICLGCHKPLNPNLADNARCPRCFWP 122
              PL +G          + IC GC         D+  CP C  P
Sbjct: 82  --IPLPMGHHSVFRACCMQKICNGCDIAAKKRGMDD--CPFCRTP 122


>gi|56755307|gb|AAW25833.1| SJCHGC05428 protein [Schistosoma japonicum]
          Length = 131

 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 27/48 (56%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           RC  C E A  +CS C++  YCG++ Q +HW  HK  C  L   + ++
Sbjct: 80  RCPTCGEVASKRCSRCRQEWYCGRECQVKHWPRHKKACDLLTEAVSTN 127


>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
 gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form I)
          Length = 436

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 125 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|226505212|ref|NP_001151267.1| MYND finger family protein [Zea mays]
 gi|195645400|gb|ACG42168.1| MYND finger family protein [Zea mays]
          Length = 372

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
           A  +C  C  VAYC + HQ  HW++HK +C++   +++    L ++
Sbjct: 17 VADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHIDALSQF 63


>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
 gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC+ LK  C+     +  +    + ++ +  +    +K  
Sbjct: 67  IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++SK      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
           IC    +   ++     GLY  + L+ H+C PN          ++++   +     ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266


>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
 gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
           Full=Zinc finger MYND domain-containing protein 1
 gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
 gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC+ LK  C+     +  +    + ++ +  +    +K  
Sbjct: 67  IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++SK      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
           IC    +   ++     GLY  + L+ H+C PN          ++++   +     ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266


>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 647

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 26/51 (50%)

Query: 4   IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            +E +D EL   +      + +C  CK +AYCG   Q+  W  HKP+C  L
Sbjct: 100 FLEKQDFELDNPSTPSNRTIRRCGKCKVIAYCGSNCQQLDWASHKPECTAL 150


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 25/43 (58%), Gaps = 2/43 (4%)

Query: 12  LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           L RCA C +  T L  C  CK VAYCG+  Q  H+K HK  C+
Sbjct: 410 LARCANCGKASTQLSLCGRCKSVAYCGRPCQVAHYKGHKATCR 452


>gi|348673334|gb|EGZ13153.1| hypothetical protein PHYSODRAFT_510777 [Phytophthora sojae]
          Length = 560

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 7   IEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           +E+ EL RCA C + A     L +C+ CK   YC  Q Q+ HWK HK  C
Sbjct: 319 MENPELRRCAKCDKRAAPGKELKRCTRCKCTFYCDAQCQRGHWKDHKVSC 368


>gi|336388071|gb|EGO29215.1| hypothetical protein SERLADRAFT_359379 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 709

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           EL RC+ C      L KCS C +  YC    QK HW  HK  CK
Sbjct: 665 ELYRCSYCGNPSAVLRKCSGCAKARYCDNSCQKSHWSEHKIACK 708


>gi|405970379|gb|EKC35288.1| Ubiquitin carboxyl-terminal hydrolase 19 [Crassostrea gigas]
          Length = 1310

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 6/66 (9%)

Query: 14  RCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
           RCA CR        L +C+ C +V YC    QK HW +HK  C   P  +  SP +    
Sbjct: 685 RCASCRRLCPDNCKLKRCTRCFKVGYCDHSCQKNHWIMHKNHCNMTPEPV-GSPFIVSIP 743

Query: 69  QATLDL 74
           + ++D+
Sbjct: 744 EYSVDV 749


>gi|389747838|gb|EIM89016.1| hypothetical protein STEHIDRAFT_53749 [Stereum hirsutum FP-91666 SS1]
          Length = 1186

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 3/49 (6%)

Query: 7    IEDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
            I D E  +C  CR      L +C  C++V YC  + QK +W+ HK +CK
Sbjct: 1137 IRDPEARKCFACRGKGNPKLMECKGCRKVRYCSVECQKFNWEEHKKRCK 1185


>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
           [Oryzias latipes]
          Length = 435

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 40/210 (19%), Positives = 79/210 (37%), Gaps = 22/210 (10%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
           N  C R  WP              H  EC  +    E        +  + + ++  +   
Sbjct: 75  NVDCQRGDWP-------------MHKLECVAMCTHGENWCPSETVRLVSRIIMKQRVTTE 121

Query: 173 RSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
           R+P ++   LK+ E+H+ K     +   + D + + Y  S  +  LP D   + +     
Sbjct: 122 RTPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQ 179

Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHIST 285
             C    +   ++      ++  + LM H+C PN   +       +++   +     I  
Sbjct: 180 VNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFN 239

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
            Y + L+ T+ R+E L  + +F C C  C+
Sbjct: 240 SYIDLLYPTEDRKERLLDSYFFGCQCTECT 269


>gi|350417592|ref|XP_003491497.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Bombus impatiens]
          Length = 407

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L   I  S
Sbjct: 335 CITCGEDKANKKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNITPS 382


>gi|332020173|gb|EGI60617.1| Zinc finger MYND domain-containing protein 10 [Acromyrmex
           echinatior]
          Length = 437

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 17/44 (38%), Positives = 28/44 (63%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           E  +C++C+E +  +CS CKEV YC ++ Q + W+ HK  C  +
Sbjct: 389 ETKKCSLCQEPSKKRCSRCKEVWYCSRECQVKDWENHKNICNTI 432


>gi|198465840|ref|XP_001353783.2| GA20758 [Drosophila pseudoobscura pseudoobscura]
 gi|198150338|gb|EAL29517.2| GA20758 [Drosophila pseudoobscura pseudoobscura]
          Length = 404

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 10  EELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +E   C+ C +E    KCS CK V YC ++ Q+ HW +HK  C +L
Sbjct: 327 DETSYCSTCGQENPDKKCSKCKAVQYCDRECQRLHWFMHKKSCARL 372


>gi|397631988|gb|EJK70365.1| hypothetical protein THAOC_08280, partial [Thalassiosira
          oceanica]
          Length = 217

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  R L+
Sbjct: 22 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSRGLE 68


>gi|354801524|gb|AER39528.1| egl nine 1-like protein [Carassius carassius]
          Length = 359

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)

Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
          C +C +   L KC  C+   YC K+HQ++ WK HK  CK+   + ++ P
Sbjct: 18 CELCGKMENLMKCGRCRSSFYCSKEHQRQDWKKHKRMCKEADKQQQTPP 66


>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC+ LK  C+     +  +    + ++ +  +    +K  
Sbjct: 67  IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++SK      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
           IC    +   ++     GLY  + L+ H+C PN          ++++   +     ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266


>gi|290985293|ref|XP_002675360.1| predicted protein [Naegleria gruberi]
 gi|284088956|gb|EFC42616.1| predicted protein [Naegleria gruberi]
          Length = 618

 Score = 40.4 bits (93), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 6/54 (11%)

Query: 6   EIEDEELMRCAVCRET------ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           E E+E+   CAVC  T       L  C  CK+V YC ++ Q++ WK HK +C+K
Sbjct: 565 EAENEKPKVCAVCGATQAEGSKKLLVCGKCKKVHYCSRECQEKDWKTHKKECQK 618


>gi|397624983|gb|EJK67613.1| hypothetical protein THAOC_11329 [Thalassiosira oceanica]
          Length = 307

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 5/67 (7%)

Query: 2   ASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
           +S + I DE    CA C +       L  C+AC+ V YCG   QK H K HK  C+K   
Sbjct: 52  SSCVPIVDEGDEVCANCGKHGSDIVKLKHCNACRIVKYCGVDCQKAHRKQHKKACRKRAA 111

Query: 57  EIKSSPL 63
           E+K   L
Sbjct: 112 ELKDEQL 118


>gi|397617240|gb|EJK64346.1| hypothetical protein THAOC_14932 [Thalassiosira oceanica]
          Length = 289

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 26/49 (53%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
            L KC+AC  V YCG   QK H K HK  CKK   E+K   L  + L+
Sbjct: 15 VTLKKCNACHLVKYCGVDCQKAHRKEHKKVCKKRAAELKEERLYSQGLE 63


>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
 gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
           Full=Histone methyltransferase SMYD2-A; AltName:
           Full=SET and MYND domain-containing protein 2A
 gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
          Length = 430

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 21/164 (12%)

Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
           +R   + +  +K+ E+H+SK     E  E I+S +  L   + + L       T    L 
Sbjct: 119 ERTPSETFLSVKEFESHLSKLD--NEKKELIESDIAALHRFYSKNL-----HYTDNAALV 171

Query: 231 WICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH-- 282
           ++   +  NG  I           ++  + LM H+C PN   +     +++   ++ H  
Sbjct: 172 FLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAG 231

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
             + T Y + L+ T+ R + L  + +F C C  CS    DP +L
Sbjct: 232 DEVFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECSTKQKDPAKL 275


>gi|336375120|gb|EGO03456.1| hypothetical protein SERLA73DRAFT_83458 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 712

 Score = 40.4 bits (93), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           EL RC+ C      L KCS C +  YC    QK HW  HK  CK
Sbjct: 668 ELYRCSYCGNPSAVLRKCSGCAKARYCDSSCQKSHWSDHKAVCK 711


>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
           [Oryzias latipes]
          Length = 424

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 32/159 (20%), Positives = 65/159 (40%), Gaps = 9/159 (5%)

Query: 164 PLRCLILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVL 221
           P    +   R+P ++   LK+ E+H+ K     +   + D + + Y  S  +  LP D  
Sbjct: 102 PSETRVTTERTPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD-- 159

Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLF 276
            + +       C    +   ++      ++  + LM H+C PN   +       +++   
Sbjct: 160 QELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQE 219

Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           +     I   Y + L+ T+ R+E L  + +F C C  C+
Sbjct: 220 INPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT 258


>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
          Length = 272

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L  +F  K  G   +D+   
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174

Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
               + C    +   ++      ++  + LM H+C PN   +   +  L   R       
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232

Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|407915489|gb|EKG09079.1| Zinc finger MYND-type protein [Macrophomina phaseolina MS6]
          Length = 309

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 10/64 (15%)

Query: 7  IEDEELMR--CAVCR--------ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
          I+DE  +   CA CR        + +L++CS C  V+YC K+ Q  HW  HK  CK+  Y
Sbjct: 3  IDDEWAVESGCAGCRRQQSDLAGDQSLNRCSRCHCVSYCSKECQTAHWTTHKSNCKRPNY 62

Query: 57 EIKS 60
           I++
Sbjct: 63 VIEA 66


>gi|397627358|gb|EJK68445.1| hypothetical protein THAOC_10375, partial [Thalassiosira oceanica]
          Length = 352

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 26/46 (56%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+ L
Sbjct: 75  LKSCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQGL 120


>gi|336370221|gb|EGN98562.1| hypothetical protein SERLA73DRAFT_183631 [Serpula lacrymans var.
          lacrymans S7.3]
          Length = 432

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 10 EELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          ++L++CAVC ++  L  CS+C E  YC  + QK  W  HK  C K
Sbjct: 15 KDLVKCAVCSKKKDLRLCSSCGEQTYCSAECQKNDWPTHKTSCGK 59


>gi|193785783|dbj|BAG51218.1| unnamed protein product [Homo sapiens]
          Length = 441

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E  A  +CS CK V YC +  QK HW  HK  CK L
Sbjct: 320 CTTCGEKGASKRCSVCKMVIYCDQTRQKTHWFTHKKICKNL 360


>gi|449545771|gb|EMD36741.1| hypothetical protein CERSUDRAFT_50971 [Ceriporiopsis subvermispora
           B]
          Length = 425

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C +C      +CS C+ V+YCG++ Q+  W  HK  C+ L
Sbjct: 247 CVLCGSKTASRCSQCQSVSYCGQECQRADWPSHKQTCRSL 286


>gi|397624427|gb|EJK67380.1| hypothetical protein THAOC_11597 [Thalassiosira oceanica]
          Length = 305

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 30 LRDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 73


>gi|348507084|ref|XP_003441087.1| PREDICTED: egl nine homolog 1-like [Oreochromis niloticus]
          Length = 359

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 9  DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          D +   C +C R   L KC  C+   YC K+HQK+ WK HK  CK+
Sbjct: 9  DRDRQYCELCGRFDNLLKCGRCRSSFYCSKEHQKQDWKKHKLICKE 54


>gi|324508446|gb|ADY43564.1| SET domain-containing protein 14 [Ascaris suum]
          Length = 421

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          T L KC+ACK + YC +  QK  W +HK +CK
Sbjct: 35 TKLEKCAACKSIFYCNRSCQKADWPMHKVECK 66


>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
          [Schizosaccharomyces pombe 972h-]
 gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
 gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
          [Schizosaccharomyces pombe]
          Length = 483

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 3/56 (5%)

Query: 2  ASIIEIEDEELMR-CAVCRETAL--HKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            I+ ++   L R C+ C E  +   +C+ACK + YC K  QK  W  HK +CK L
Sbjct: 35 VDILSLDSANLTRTCSTCTEEKVKTQRCAACKIIHYCSKGCQKADWPFHKLECKAL 90


>gi|340369460|ref|XP_003383266.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
           [Amphimedon queenslandica]
          Length = 442

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           +CA+C   A  +CS C    YC ++ Q ++WK HKP C  L Y+ KSS
Sbjct: 396 KCALCGSEATKRCSRCCSEWYCSRECQVKNWKKHKPSCDIL-YQSKSS 442


>gi|225709842|gb|ACO10767.1| Programmed cell death protein 2 [Caligus rogercresseyi]
          Length = 374

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 6/63 (9%)

Query: 8   EDEELMRCAVCRETALHK-CSACKEVAYCGKQHQKEHWK--LHKPKCKKLPYEIKSSPLL 64
           ++  L  C +C   A +K CS CKE  YC K+HQ   WK   HK +C+      KS PL+
Sbjct: 133 QERHLQLCRLCGAGASNKTCSGCKEAKYCSKEHQTADWKSGSHKAECQS---PSKSVPLI 189

Query: 65  GRY 67
             Y
Sbjct: 190 SLY 192


>gi|348680577|gb|EGZ20393.1| hypothetical protein PHYSODRAFT_313078 [Phytophthora sojae]
          Length = 250

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query: 15 CAVCRET--ALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKL 54
          C+ C+ T  AL +CS CK  AYC K  Q++HWK  H+  C KL
Sbjct: 5  CSACQSTDKALSRCSQCKWTAYCSKTCQRDHWKNGHRLTCGKL 47


>gi|195622642|gb|ACG33151.1| MYND finger family protein [Zea mays]
 gi|414886684|tpg|DAA62698.1| TPA: MYND finger family protein [Zea mays]
          Length = 372

 Score = 40.4 bits (93), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
           A  +C  C  VAYC + HQ  HW++HK +C++   +++    L ++
Sbjct: 17 VADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHINALSQF 63


>gi|397589365|gb|EJK54635.1| hypothetical protein THAOC_25719 [Thalassiosira oceanica]
          Length = 147

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 3  SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D     CA C  TA     L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 2  SCVSVVDGSNEVCANCGTTASDIVKLKNCTACRLVKYCGVDCQRAHRKRHKKACKQRAAE 61

Query: 58 IKSSPLLGRYLQ 69
          ++   L  + L+
Sbjct: 62 LEDERLYSQGLK 73


>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
           impatiens]
          Length = 635

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 56/131 (42%), Gaps = 8/131 (6%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
           I  S   GR+L AT +  PGD +  E+P      +   +     CH  L+   NL   + 
Sbjct: 246 ISFSEKYGRHLVATKEFKPGDVVTIENPYAY---VIYTQRYYTHCHHCLSRSYNLIPCSH 302

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
           CP   +  CS +C  L+    H  EC I+ L    L    D        +R LI+     
Sbjct: 303 CPVAQY--CSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTKIIRFLIVITSKG 360

Query: 176 KKYQELK-DME 185
           KK+ EL+ DME
Sbjct: 361 KKFDELRVDME 371


>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
          Length = 349

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)

Query: 65  GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
           GR L+A    H GD +    A    L VG +    E     C       L+   RC + F
Sbjct: 18  GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72

Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA-ILPLRCLILQRRSPKKYQ 179
           +  C   C    D   H  EC+ + +  E        +  A IL  + +  +R   +K  
Sbjct: 73  Y--CDVECQK-EDWPLHKLECSSMVVFGENWNPSETVRLTARILAKQKMHPERTPSEKLL 129

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
            +++ E+H+ K     E  + I S +  L   + + L  +  D +S   L     C    
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKHL--EFPDHSSLVVLFAQVNCNGFT 185

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
           +   ++      ++  + LM H+C PN   +     +++   ++ H    + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245

Query: 293 GTQPRREHLAITKYFNCSCERCS 315
            T+ R + L  + +F C C  C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268



 Score = 38.9 bits (89), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C   R+  L KC  CK+  YC  + QKE W LHK +C  +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93


>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 63/283 (22%), Positives = 104/283 (36%), Gaps = 31/283 (10%)

Query: 46  LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKP 105
           +  PK +K      SS   G  L+A + L PG+ + R  PL           +C+ C   
Sbjct: 1   MEPPKVEKF-----SSANRGNGLRALVQLRPGELLFRSDPLAYTVCKGSRGVVCVRCIIG 55

Query: 106 LNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP 164
               L     C RC     C A+C   +    H  EC  LK  C+     +  +    + 
Sbjct: 56  RKYLLY----CSRCRLLAQCGAKCRKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVV 109

Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD-- 222
            + +       +K     D+E++++K      + E+    ++ L   F   +  ++ D  
Sbjct: 110 FKLMEETPSESEKLYSFYDLESNINK------LTEDKKEGLRQLVMTFQHFMREEIQDAS 163

Query: 223 ------DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKL 275
                 D        IC    V   ++     GLY  + L+ H+C PN +   N     L
Sbjct: 164 QLPPSFDIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLL 223

Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
              RD      ++  Y + L  ++ RR+ L     F C C RC
Sbjct: 224 RAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC 266


>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
          Length = 554

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 31/136 (22%), Positives = 56/136 (41%), Gaps = 22/136 (16%)

Query: 249 GLYSVICLM-EHNCLPNAKHSNMMQSKLF------VFRDTHISTMYTNALWGTQPRREHL 301
           GL+ +  L   H C PN     +   +L       V +D  +   Y +       RR+ L
Sbjct: 256 GLFPLGALFFNHGCNPNTAFVGLPNGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQEL 315

Query: 302 AITKYFNCSCERCSDPTE--LGTYFSAMKCLNEHKDQGDCWILPVNPLD----------N 349
             TK+F C C+RC+ P E  +  +   + C   ++ + D +++P   +D           
Sbjct: 316 LTTKHFWCKCKRCASPLEKSIDRFLQGVVC---NQCEKDVYMIPATNIDLLMKGERSLMI 372

Query: 350 DSDWTCGSCSARLNAR 365
           ++ + C SC   L  +
Sbjct: 373 ENTFKCASCGHELEGQ 388


>gi|167522379|ref|XP_001745527.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775876|gb|EDQ89498.1| predicted protein [Monosiga brevicollis MX1]
          Length = 386

 Score = 40.4 bits (93), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 23/38 (60%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           +C VC   A+++CS C+   YC +Q Q EHW  HK  C
Sbjct: 338 KCVVCGADAVNRCSRCQSEWYCRRQCQVEHWPKHKAAC 375


>gi|348681660|gb|EGZ21476.1| hypothetical protein PHYSODRAFT_557520 [Phytophthora sojae]
          Length = 341

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 18/38 (47%), Positives = 23/38 (60%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
           L KC+ C++  YC +  QK  WKLHK  C K P EI +
Sbjct: 297 LSKCARCRDAVYCSRDCQKAAWKLHKRLCGKTPEEISA 334


>gi|328872636|gb|EGG21003.1| hypothetical protein DFA_00872 [Dictyostelium fasciculatum]
          Length = 596

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 14  RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           RC  C + T L KC  CK+V YC K+ Q ++W  H+  C
Sbjct: 554 RCQTCFKSTNLKKCGRCKQVFYCSKECQIKNWAFHQSIC 592


>gi|397642774|gb|EJK75446.1| hypothetical protein THAOC_02829, partial [Thalassiosira oceanica]
          Length = 365

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 15  CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           CA C +T      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 90  CANCGKTGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHKKACKQRVAELKDEQLYGQ 146


>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
           terrestris]
          Length = 439

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 71/329 (21%), Positives = 119/329 (36%), Gaps = 64/329 (19%)

Query: 113 NARCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYE 160
           N RC  CF      RCS     +             H+ ECA LK        ++ +   
Sbjct: 30  NVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPMHSKECARLK-------KFSPWGIS 82

Query: 161 AILPLRCLILQR-----------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLR 209
            +  L   I+ +            +   Y++ KD+ +H S+     +  E    +   L 
Sbjct: 83  NVARLMARIIIKLNQGGDEEKGYYNETNYRKFKDLMSHCSEIKKDEKKMEHFVCLCNVLH 142

Query: 210 SNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ----GLYSVICLMEHNCLPNA 265
             FLE +P       S   L  I G I +N   I         G+Y    +++H+C PNA
Sbjct: 143 -KFLEDMP-----IPSTAELLGIYGRITINSFSIFNLDMNIGVGIYLGPSILDHSCKPNA 196

Query: 266 KHS------NMMQSKLFVFRD-THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
             +      N+   +     D + I   Y + +     RR  L  + YF C CE+C  P 
Sbjct: 197 VATFEGTTINVKAIEDLPSLDLSQIRIPYIDVIKTAGDRRAELQSSYYFWCDCEKCEKPE 256

Query: 319 ELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
            +     A  C N+         L   P D ++D  C  C+ +    +   +  ++ E  
Sbjct: 257 PMA---EAAACPNK---------LCTYPCDPNAD-LCEKCNTKF-PENFKEIFDEISEFT 302

Query: 379 DKLVQENPNVKSLEEM---LTKLEAMFHP 404
              ++   N+  L+     L++ E + HP
Sbjct: 303 AYHLENMKNIAYLDVSKMCLSRQEGILHP 331


>gi|397607285|gb|EJK59631.1| hypothetical protein THAOC_20114, partial [Thalassiosira oceanica]
          Length = 340

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 5/74 (6%)

Query: 1   MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M   + +E +    CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 62  MIKCVPVEGDGDDACANCGKLGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHKKACKQRV 121

Query: 56  YEIKSSPLLGRYLQ 69
            E+K   L  + L+
Sbjct: 122 AEVKDEQLYSQGLE 135


>gi|397594234|gb|EJK56180.1| hypothetical protein THAOC_23985 [Thalassiosira oceanica]
          Length = 303

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 29 LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 72


>gi|397565442|gb|EJK44625.1| hypothetical protein THAOC_36821, partial [Thalassiosira oceanica]
          Length = 447

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 70/174 (40%), Gaps = 39/174 (22%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+    CA C +       L KC+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 148 SCVPVTDDGDEACANCGKHGNDTIKLKKCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 207

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPIC-LGCHKPLNPNLADNARC 116
           +K   L   Y Q    L  GD                  PIC L  H P++ +   N  C
Sbjct: 208 LKDEEL---YSQGHERLE-GDFC----------------PICTLPVHIPIHEHSGFNTCC 247

Query: 117 PRCFWPAC--SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            +     C  +A+  G++D       CA     C T L  ND    A++  R L
Sbjct: 248 MKQICHGCNLAAQKRGMND-------CAF----CRTPLPKNDADGMAMVQSRVL 290


>gi|357484667|ref|XP_003612621.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
 gi|355513956|gb|AES95579.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
          Length = 384

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 66/291 (22%), Positives = 110/291 (37%), Gaps = 51/291 (17%)

Query: 62  PLLGRYLQATLDLHPGDRIARESPLIVGPKLAL----AEPICLGCHKPLNPNLADNARCP 117
           P  GR L AT DL  G  I  ESPL++     L      P C  C + L P  +   +CP
Sbjct: 43  PNKGRGLIATQDLKAGQIILTESPLLLYSATPLFSHTPSPYCHNCFRTLPP--SQTFQCP 100

Query: 118 RCF-WPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----------YKYEAILPL 165
            C  +  CS +C  ++   +H+         C+TL    +            + +A L +
Sbjct: 101 SCSNYLFCSQKCLSIALNSSHS------SWTCQTLSHLQNPTSPLSEKPCELQVQARLIV 154

Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS---NFLEKLPGDVLD 222
               L   +P K Q L  +  + + +    +  + + S++    S   NF  +L   ++ 
Sbjct: 155 AAYNLAIHTPSKLQTLLSLHGNPNDQDSIVDNAKFLHSLISPFCSPHMNFSAELAAKII- 213

Query: 223 DTSAKCLHWICGV--IEVNGVDIGRYTQGLYSVICLMEHNCLPNA-------------KH 267
             + + L+  C +      G        G+Y       H C+PNA             +H
Sbjct: 214 --AKERLNSFCLMEPYSQKGPQRSIKAYGIYQKATFFNHGCIPNACRFDYVESGEPGDEH 271

Query: 268 SNMMQSKLF----VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
           +  +  +L     V  +  IS    N  + T  R++ L     F C C+RC
Sbjct: 272 NTDIVIRLIKDVGVGSEICISYFRINKDYST--RKKILMEDYGFTCECDRC 320


>gi|195578359|ref|XP_002079033.1| GD23738 [Drosophila simulans]
 gi|194191042|gb|EDX04618.1| GD23738 [Drosophila simulans]
          Length = 161

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 53/119 (44%), Gaps = 9/119 (7%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
           ++ SP+ GRYL A         +  E P  VGPK      +CLGC++P +P+  +   C 
Sbjct: 14  VQWSPVCGRYLVAKGATRGHGLLIEELPFAVGPKCN-GPVVCLGCYEP-DPD-PEEELCS 70

Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQ 171
            C WP C A C+  +D      EC  LK                 + + I+PLR L+ +
Sbjct: 71  ECGWPLC-AECAQQADNAHFQLECRELKDARARFFRLPSGSSHCPQLDCIMPLRVLLAK 128


>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
           norvegicus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 60/275 (21%), Positives = 105/275 (38%), Gaps = 40/275 (14%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    + ++ +  +    +K  
Sbjct: 67  IAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLMDGKPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++SK      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
           IC    +   ++     GLY  + L+ H+C PN   S +      + R          ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNC--SIVFNGPHLLLRAVREIEAGEELT 236

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
             Y + L  ++ RR+ L     F C C RC  PT+
Sbjct: 237 ICYLDMLMTSEERRKQLRDQYCFECDCIRC--PTQ 269


>gi|397610576|gb|EJK60907.1| hypothetical protein THAOC_18674 [Thalassiosira oceanica]
          Length = 325

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   QK H KLH+  CK+   E+K   L  + L+
Sbjct: 103 LKDCTACRLVKYCGVDCQKAHRKLHRKACKQRAAELKDEKLYSQGLE 149


>gi|167533770|ref|XP_001748564.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773083|gb|EDQ86728.1| predicted protein [Monosiga brevicollis MX1]
          Length = 500

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 24/39 (61%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          +C+VC + A + CS C +  YCG+  QK HW +H+  C 
Sbjct: 58 QCSVCMKPASYLCSKCCQAWYCGRDCQKNHWSVHRQDCS 96


>gi|66810001|ref|XP_638724.1| hypothetical protein DDB_G0284069 [Dictyostelium discoideum AX4]
 gi|60467344|gb|EAL65375.1| hypothetical protein DDB_G0284069 [Dictyostelium discoideum AX4]
          Length = 426

 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           C  C   A   CS CK+V YCGK+HQ+  W+L H  +CK L
Sbjct: 133 CEYCGCFAKSNCSGCKKVHYCGKEHQQLDWQLGHSEQCKLL 173


>gi|449473708|ref|XP_002189720.2| PREDICTED: zinc finger MYND domain-containing protein 10
           [Taeniopygia guttata]
          Length = 452

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           RC VC   A  +CS C+   YC +  Q +HW+ HKP C
Sbjct: 402 RCRVCGVEATKRCSRCRNEWYCTRACQVQHWQKHKPAC 439


>gi|397614171|gb|EJK62639.1| hypothetical protein THAOC_16738 [Thalassiosira oceanica]
          Length = 302

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 8/84 (9%)

Query: 1  MASIIEIEDEELMRCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          M+  I ++D     CA C + +       L  C+AC+ V YC    QK H K HK  CK+
Sbjct: 1  MSRCIAVDDGSAGVCANCGKGSDAEGTVKLKNCTACRLVKYCSVDCQKAHRKQHKKACKQ 60

Query: 54 LPYEIKSSPLLGRYLQ-ATLDLHP 76
             E+K   L  + L+ A  D  P
Sbjct: 61 RAAELKDEQLYSQGLERAAGDFCP 84


>gi|397591586|gb|EJK55412.1| hypothetical protein THAOC_24860 [Thalassiosira oceanica]
          Length = 213

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 26/44 (59%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           L  C+AC+ V YCG   Q+ H +LHK  CK+   E+K   L G+
Sbjct: 139 LKNCTACRLVKYCGVDCQRAHRQLHKKACKQRVAELKDEELYGQ 182


>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
           jacchus]
          Length = 428

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 101/268 (37%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVTPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CS++C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSSKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMEGSPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++++K      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
           IC    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDVEVGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|403340343|gb|EJY69455.1| MYND finger family protein [Oxytricha trifallax]
          Length = 442

 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           RC  C + A  +CS CK   YC ++ Q + WK HKP C
Sbjct: 362 RCNECGKNATQRCSRCKNQWYCSRECQLKQWKAHKPLC 399


>gi|405971699|gb|EKC36522.1| Egl nine-like protein 1 [Crassostrea gigas]
          Length = 420

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          +RC VC  +  L +C+ CK V YC +QHQ + W  HK  C
Sbjct: 11 LRCVVCDSKENLKRCTRCKGVVYCCRQHQIQDWPSHKVAC 50


>gi|219123565|ref|XP_002182093.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406694|gb|EEC46633.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1183

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 15  CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY----EIKSSPLL 64
           C+ C R  A  +C  C  V YC +  Q+ HW+ HK  CK+L      E   SP+L
Sbjct: 497 CSACYRAGATMRCKRCMSVYYCQRSCQESHWQFHKAPCKRLAATNTEETVGSPVL 551


>gi|397644789|gb|EJK76551.1| hypothetical protein THAOC_01680 [Thalassiosira oceanica]
          Length = 309

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 28/54 (51%), Gaps = 5/54 (9%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          CA C +T      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQL 68


>gi|397634136|gb|EJK71299.1| hypothetical protein THAOC_07280 [Thalassiosira oceanica]
          Length = 311

 Score = 40.0 bits (92), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 23/41 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          L  C+AC  V YCG   QK H KLHK  CKK   E+K   L
Sbjct: 29 LKNCTACFLVKYCGVDCQKSHRKLHKRACKKRAAELKDEEL 69


>gi|406861729|gb|EKD14782.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
          [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 472

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C +C+   +  +CSACK   YC  + QK+ W LHKP CK L
Sbjct: 8  CTICQNPDSSKRCSACKVTRYCSHECQKKGWPLHKPVCKFL 48


>gi|348525586|ref|XP_003450303.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Oreochromis niloticus]
          Length = 426

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E    K CS CK V YC +  QK HW  HK  CKKL
Sbjct: 320 CTTCGEKGAEKRCSICKMVIYCDQACQKMHWFTHKKICKKL 360


>gi|291222899|ref|XP_002731455.1| PREDICTED: EGL nine homolog 1-like [Saccoglossus kowalevskii]
          Length = 440

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCK-KLPYEIKSS 61
          CS C++  YC ++HQK HWK HK +C  K P+ + +S
Sbjct: 27 CSRCRDTWYCCREHQKTHWKTHKKECGPKNPFILGTS 63


>gi|440793753|gb|ELR14928.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 252

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 9/54 (16%)

Query: 7   IEDEELMRCAVCR--------ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           + +E  +RCA  +        ET  ++CS CK+V YCG++ Q+  W  HKP CK
Sbjct: 200 VVEERAIRCANGKCGRSEEDGET-FNRCSRCKKVYYCGRECQRADWPKHKPTCK 252


>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
          Length = 386

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 45/210 (21%), Positives = 81/210 (38%), Gaps = 15/210 (7%)

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I   R
Sbjct: 19  GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPER 75

Query: 174 SP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
           +P +K   +K+ E+H+ K     E  + I S V  L   + + L     D          
Sbjct: 76  TPSEKLLAVKEFESHLDKLD--NEKKDLIQSDVAALHHFYSKHLEFPDSDSLVVLFAQVN 133

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IST 285
           C    +   ++      ++  + LM H+C PN   +   +  L   R          + T
Sbjct: 134 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPGEEVFT 191

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
            Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 192 SYIDLLYPTEDRNDRLRDSYFFTCQCQECT 221


>gi|193656959|ref|XP_001949267.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           isoform 1 [Acyrthosiphon pisum]
 gi|328709997|ref|XP_003244132.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 384

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 1/40 (2%)

Query: 15  CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           C  C E  A  KC+ CK V YC ++ Q+ HW +HK +C +
Sbjct: 318 CGTCAENNATKKCAKCKAVQYCDRECQRIHWFVHKKECNR 357


>gi|321474337|gb|EFX85302.1| hypothetical protein DAPPUDRAFT_314029 [Daphnia pulex]
          Length = 420

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)

Query: 14  RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           +CA C++T+  + +CS C+ + YC K+ Q  HW  HK  C+K
Sbjct: 378 QCAKCKKTSQNVKRCSRCRSIFYCSKECQFSHWSNHKTVCQK 419


>gi|398409022|ref|XP_003855976.1| hypothetical protein MYCGRDRAFT_29456, partial [Zymoseptoria
          tritici IPO323]
 gi|339475861|gb|EGP90952.1| hypothetical protein MYCGRDRAFT_29456 [Zymoseptoria tritici
          IPO323]
          Length = 61

 Score = 40.0 bits (92), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)

Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
          +C  C + A  L  C+ CK   YC ++ Q  HW  HK  CK+L Y  K S
Sbjct: 9  QCLSCSQIAAGLQTCARCKAAKYCSRECQAAHWTAHKSACKRLNYVFKVS 58


>gi|397643623|gb|EJK75984.1| hypothetical protein THAOC_02275 [Thalassiosira oceanica]
          Length = 388

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 26/47 (55%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YC    Q+ H K HK  CKK   E+K   L G+ L+
Sbjct: 123 LKDCTACRIVKYCSVDCQRAHRKQHKKACKKRAAELKDEKLYGQGLE 169


>gi|397641165|gb|EJK74510.1| hypothetical protein THAOC_03807, partial [Thalassiosira oceanica]
          Length = 560

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 8   EDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
           ED  +  CA C +       L KC+AC+ V YC    QK H K HK  CK+   E+K   
Sbjct: 148 EDACIETCANCGKPESDTVTLKKCNACRLVKYCSVDCQKAHRKQHKKACKQRAEELKDEE 207

Query: 63  LLGR 66
           L  R
Sbjct: 208 LFSR 211


>gi|397640514|gb|EJK74163.1| hypothetical protein THAOC_04176 [Thalassiosira oceanica]
          Length = 385

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 9/81 (11%)

Query: 3  SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D++   CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E
Sbjct: 2  SCVPVMDDDDEVCANCGKQGSDTIKLKICTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 61

Query: 58 IKSSPLLGRYLQATLDLHPGD 78
          +K   L   Y Q  LD   GD
Sbjct: 62 LKDEQL---YSQG-LDRPEGD 78


>gi|397579340|gb|EJK51173.1| hypothetical protein THAOC_29678, partial [Thalassiosira
          oceanica]
          Length = 167

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 5/74 (6%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M   + +E +    CA C +       L  C+AC+ V YCG   Q+ H KLHK  CK+  
Sbjct: 2  MMRCVPVEGDGDEACANCGKLGSDAVKLKNCTACRLVKYCGVDCQRAHRKLHKKVCKQRV 61

Query: 56 YEIKSSPLLGRYLQ 69
           E++   L  + L+
Sbjct: 62 AELRDEQLYSQGLK 75


>gi|380016914|ref|XP_003692413.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
           domain-containing protein 2-like [Apis florea]
          Length = 407

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L   I
Sbjct: 335 CITCGEDKANKKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNI 379


>gi|401401559|ref|XP_003881041.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325115453|emb|CBZ51008.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 946

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)

Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
           +Y +     H+CLPNA       H+ +++++  +     ++  Y    + +  T  RR  
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEEDLMRPTLHRRRV 227

Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
           L+ TK F C+CERC+ P +    F    C
Sbjct: 228 LSETKDFICACERCAAPVDFSRGFRCPSC 256


>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
          yFS275]
 gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
          yFS275]
          Length = 449

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 25 KCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          +C+ACK+V YC K  QK+ W +HK +CK L
Sbjct: 60 RCAACKQVHYCSKICQKQDWSMHKLECKAL 89


>gi|406864160|gb|EKD17206.1| hypothetical protein MBM_04783 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 450

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 21/129 (16%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKL--------PYEIKSSPLLGRYLQATLD 73
           L+KCS CK + YC ++ QK+ W   HK  CK L        P E ++     +Y Q  L 
Sbjct: 30  LNKCSRCKRIRYCSQKCQKKDWNDGHKKLCKVLLGLSDFYPPVEAQT---WQQYRQQVLS 86

Query: 74  LHPGDRI--ARESPLIVG-PKLALAEPICLGCHK-PLNPNLADNAR---CPRCFWPACSA 126
                R+  A+  P+ +   ++ L +  C  C++ PL   LA + +   CP CF  A   
Sbjct: 87  SVAALRVCGAQVKPITLNDERILLYQAYCSNCYRTPL--QLAPDTKLKLCPNCFIAATCT 144

Query: 127 RCSGLSDAH 135
            CS   + H
Sbjct: 145 SCSSTPETH 153


>gi|397639864|gb|EJK73806.1| hypothetical protein THAOC_04549 [Thalassiosira oceanica]
          Length = 175

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 1  MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
          M+ +   ED++   CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 1  MSCVPVGEDDDAEACANCGKQGSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKTCKQRA 60

Query: 56 YEIKSSPL 63
           E+K   L
Sbjct: 61 AELKDEQL 68


>gi|397563864|gb|EJK43971.1| hypothetical protein THAOC_37538 [Thalassiosira oceanica]
          Length = 316

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 5/72 (6%)

Query: 3  SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S I ++D     CA C +       L  C+AC+ V YCG   Q+ H K HK  C++   E
Sbjct: 2  SCIAVDDGSAGVCANCGKHETDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACRQRAAE 61

Query: 58 IKSSPLLGRYLQ 69
          +K   L  + L+
Sbjct: 62 LKDERLYSQGLE 73


>gi|342319761|gb|EGU11708.1| Serine/arginine repetitive matrix protein 2 [Rhodotorula glutinis
           ATCC 204091]
          Length = 437

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 5/48 (10%)

Query: 9   DEELMR-CAVCRE----TALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           D+ ++R C  CR+    T   +CS CK + YCG Q Q+EHW  HK  C
Sbjct: 53  DQPILRMCMTCRKMDLITKRSRCSGCKMIFYCGPQCQREHWPEHKIDC 100


>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
           Adohcy (Form Iii)
          Length = 464

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 51  GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 102

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 103 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 160

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
              D+E++++K     +  E +  +V   +    E++       P   L +  AK    I
Sbjct: 161 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 215

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
           C    +   ++     GLY  I L+ H+C PN +   N     L   RD  +    +  Y
Sbjct: 216 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 275

Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
            + L  ++ RR+ L     F C C RC
Sbjct: 276 LDMLMTSEERRKQLRDQYCFECDCFRC 302


>gi|397647582|gb|EJK77767.1| hypothetical protein THAOC_00381, partial [Thalassiosira oceanica]
          Length = 307

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 65  LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 108


>gi|159482787|ref|XP_001699449.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158272900|gb|EDO98695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 318

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 20/29 (68%)

Query: 26  CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           CS CK V YCG++ Q  HWK HK +C KL
Sbjct: 271 CSRCKSVRYCGRECQSAHWKEHKQECPKL 299


>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
          Length = 282

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100


>gi|322795587|gb|EFZ18266.1| hypothetical protein SINV_00088 [Solenopsis invicta]
          Length = 445

 Score = 40.0 bits (92), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 11  ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           E  +C VC+E +  +CS CKE  YC ++ Q + W+ HK  C
Sbjct: 391 EAKKCLVCQEASKKRCSKCKEAWYCSRECQVKDWENHKNIC 431


>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
          Length = 371

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDSDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|260799513|ref|XP_002594740.1| hypothetical protein BRAFLDRAFT_224157 [Branchiostoma floridae]
 gi|229279976|gb|EEN50751.1| hypothetical protein BRAFLDRAFT_224157 [Branchiostoma floridae]
          Length = 915

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 5/52 (9%)

Query: 12  LMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           + RC  C++ +     + +C+ C +  YC +Q QK+HW  HK  CK  P  I
Sbjct: 481 ITRCTSCQKISPDGCNMKRCTKCFKAGYCNQQCQKKHWPTHKAICKYSPEPI 532


>gi|118383381|ref|XP_001024845.1| TPR Domain containing protein [Tetrahymena thermophila]
 gi|89306612|gb|EAS04600.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 999

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 24/40 (60%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          +C +C +    +C ACK   YC ++H ++ W  HK KCK+
Sbjct: 7  KCVICGKQTNQRCQACKIPYYCSQEHLEQDWGNHKEKCKQ 46


>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
          Length = 420

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)

Query: 65  GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
           GR L+A    H GD +    A    L VG +    E     C       L+   RC + F
Sbjct: 18  GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72

Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
           +  C   C    D   H  EC+ + +  E        +  A +  +  I   R+P +K  
Sbjct: 73  Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
            +++ E+H+ K     E  + I S +  L   + + L  +  D +S   L     C    
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
           +   ++      ++  + LM H+C PN   +     +++   ++ H    + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245

Query: 293 GTQPRREHLAITKYFNCSCERCS 315
            T+ R + L  + +F C C  C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268



 Score = 38.9 bits (89), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C   R+  L KC  CK+  YC  + QKE W LHK +C  +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93


>gi|393239660|gb|EJD47190.1| hypothetical protein AURDEDRAFT_163797 [Auricularia delicata
           TFB-10046 SS5]
          Length = 722

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 8   EDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
           ED +L  CA C      L +CS C+EV YCG++ Q E W+  H+ +C +
Sbjct: 671 EDLDLHACAHCELESAVLRRCSLCREVRYCGRECQAEDWQNQHREECSR 719


>gi|307171148|gb|EFN63141.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
          Length = 641

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 56/131 (42%), Gaps = 8/131 (6%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
           I  S   GR+  AT +  PGD I+ E P      +   E     CH  L+   NL   A+
Sbjct: 251 ISFSERYGRHYIATREFQPGDIISIEDPY---AHVIYEERYYTHCHYCLSKSYNLIPCAK 307

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
           CP   +  CS +C  L     H  EC ILKL    L    D        +R LI+   + 
Sbjct: 308 CPIAQY--CSEKCRKLGWEACHKTECPILKLLTNLLNVDKDKIRMISKIIRLLIVVTENG 365

Query: 176 KKYQEL-KDME 185
            K +EL +DM+
Sbjct: 366 IKIKELEQDMK 376


>gi|198429203|ref|XP_002125038.1| PREDICTED: similar to ankyrin repeat and MYND domain containing 2
           [Ciona intestinalis]
          Length = 362

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 15  CAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKC-KKLPYEI 58
           C  C +T+   +CS CKE  YC K+ QK HW +HK  C K   YEI
Sbjct: 316 CDACGDTSSKLRCSVCKEALYCSKRCQKFHWFVHKKFCGKSKQYEI 361


>gi|397632503|gb|EJK70587.1| hypothetical protein THAOC_08041, partial [Thalassiosira oceanica]
          Length = 471

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 25/44 (56%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           L  C+AC  V YCG   QK H K+HK  CK+   E+K   L G+
Sbjct: 192 LKNCTACLLVKYCGVDCQKAHRKIHKKACKERAAELKDERLYGQ 235


>gi|397571081|gb|EJK47614.1| hypothetical protein THAOC_33653 [Thalassiosira oceanica]
          Length = 350

 Score = 40.0 bits (92), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  + L+
Sbjct: 75  LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLFSQGLE 121


>gi|397605791|gb|EJK59116.1| hypothetical protein THAOC_20701 [Thalassiosira oceanica]
          Length = 369

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 5/64 (7%)

Query: 5   IEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
           + + D+    CA C +       L  C+AC+ V YCG   Q+ H K HK  CKK   E+K
Sbjct: 74  VPVADDGAEACANCGKHGSDAVKLKDCTACRLVKYCGVDCQRAHRKQHKKTCKKRAAELK 133

Query: 60  SSPL 63
              L
Sbjct: 134 DEEL 137


>gi|414886685|tpg|DAA62699.1| TPA: hypothetical protein ZEAMMB73_171774 [Zea mays]
          Length = 83

 Score = 40.0 bits (92), Expect = 3.1,   Method: Composition-based stats.
 Identities = 15/46 (32%), Positives = 27/46 (58%)

Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          A  +C  C  VAYC + HQ  HW++HK +C++   +++    L ++
Sbjct: 18 ADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHINALSQF 63


>gi|336365142|gb|EGN93494.1| hypothetical protein SERLA73DRAFT_115908 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336377712|gb|EGO18873.1| hypothetical protein SERLADRAFT_454099 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1207

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 15   CAVCRETALHK---CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            CA C +    K   CS C++V+YC  + Q+  WK HKP+C  L
Sbjct: 1162 CAACEKEGKPKLLLCSKCRKVSYCSVECQRREWKKHKPQCAVL 1204


>gi|300121800|emb|CBK22374.2| unnamed protein product [Blastocystis hominis]
          Length = 57

 Score = 40.0 bits (92), Expect = 3.1,   Method: Composition-based stats.
 Identities = 17/40 (42%), Positives = 21/40 (52%)

Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          +C  C +    +CS CK V YC KQ Q   W +HK  C K
Sbjct: 4  KCQFCGKEGALRCSKCKRVYYCSKQCQVRDWPMHKSVCGK 43


>gi|397626218|gb|EJK68075.1| hypothetical protein THAOC_10788 [Thalassiosira oceanica]
          Length = 307

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L
Sbjct: 31 LKNCNACRLVKYCGVDCQRAHRKLHKKACKQRAAELKDEQL 71


>gi|326436236|gb|EGD81806.1| hypothetical protein PTSG_02518 [Salpingoeca sp. ATCC 50818]
          Length = 531

 Score = 40.0 bits (92), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 2/41 (4%)

Query: 15  CAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           CA C+ T      CS C+EVAYC    Q+ HW +HK  C +
Sbjct: 465 CAHCKRTLSRPLVCSGCREVAYCSGGCQRRHWAVHKQSCTR 505


>gi|328721096|ref|XP_003247209.1| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 611

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 75/175 (42%), Gaps = 21/175 (12%)

Query: 57  EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHK-PLNPNLADNAR 115
           E+KSS  +GR +  T D++PGD +A + P I G  L L    C  C K P+ P      +
Sbjct: 204 ELKSSKNMGRGVYTTRDINPGDVVAIDEPYIYG-GLQLHNSCCRNCLKSPVVP--IPCTK 260

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
           C   ++  CS  C   S    H  EC I+     +L    D    A+  L    L +   
Sbjct: 261 CSLVYY--CSENCLHKSMKDGHNLECPII-FFIRSLPGITDLNQLALKWL-SKELSKMGL 316

Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL---------EKLPGDVL 221
           KKY  L D  +  SK  P T  + E     +Y   NFL         E++P +VL
Sbjct: 317 KKYCSLVDTFSK-SKIDPKTRGFNENG---QYKSDNFLTAYSLDSCEEEMPIEVL 367


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 19/43 (44%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 13  MRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           +RCA C + +  L  C  CK V YCG+  Q  H+K HK KC++
Sbjct: 419 VRCANCGKNSVRLMMCGRCKAVMYCGRTCQVAHYKEHKAKCQR 461


>gi|41054031|ref|NP_956188.1| ankyrin repeat and MYND domain-containing protein 2 [Danio rerio]
 gi|27882498|gb|AAH44410.1| Zgc:55491 [Danio rerio]
          Length = 420

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C  C E    K CS CK V YCG+  QK HW  HK  CK L
Sbjct: 320 CTTCGEKGAEKRCSICKMVIYCGQACQKLHWFSHKKVCKML 360


>gi|290982213|ref|XP_002673825.1| predicted protein [Naegleria gruberi]
 gi|284087411|gb|EFC41081.1| predicted protein [Naegleria gruberi]
          Length = 492

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 25/42 (59%)

Query: 13  MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            +CA C + A H C  CK   YC  +  K +WK+HKP C+++
Sbjct: 427 FKCANCGKKANHYCGGCKIERYCSVECMKSNWKVHKPFCQEI 468


>gi|242822562|ref|XP_002487912.1| checkpoint protein kinase, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218712833|gb|EED12258.1| checkpoint protein kinase, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 1088

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 4   IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           +IE+ D   + C +C E    +C  C+ V YC K +Q++ WK +K +CK
Sbjct: 160 LIELSD---LSCVICGEPG-RRCQGCQLVTYCSKDYQRKDWKNYKRECK 204


>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
 gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
          Length = 448

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 24/35 (68%)

Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          ++ L+KCS CK + YC K  QKE W+ HK +CK +
Sbjct: 61 QSELYKCSRCKIIMYCCKSCQKEDWQWHKYECKSI 95


>gi|452843846|gb|EME45781.1| hypothetical protein DOTSEDRAFT_34222 [Dothistroma septosporum
           NZE10]
          Length = 226

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 19/32 (59%)

Query: 22  ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           AL  C  C E  YC K+ QK  WK+HK  CKK
Sbjct: 194 ALFLCGVCGEGRYCSKECQKREWKVHKSSCKK 225


>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 450

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)

Query: 65  GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
           GR L+A    H GD +    A    L VG +    E     C       L+   RC + F
Sbjct: 18  GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72

Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
           +  C   C    D   H  EC+ + +  E        +  A +  +  I   R+P +K  
Sbjct: 73  Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
            +++ E+H+ K     E  + I S +  L   + + L  +  D +S   L     C    
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
           +   ++      ++  + LM H+C PN   +     +++   ++ H    + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245

Query: 293 GTQPRREHLAITKYFNCSCERCS 315
            T+ R + L  + +F C C  C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268



 Score = 38.9 bits (89), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C   R+  L KC  CK+  YC  + QKE W LHK +C  +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93


>gi|397596181|gb|EJK56701.1| hypothetical protein THAOC_23362 [Thalassiosira oceanica]
          Length = 518

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           L  C+AC+ V YCG   Q+ H + HK  CK+   E+K   L G+
Sbjct: 245 LKDCTACRLVKYCGVDCQRAHRRQHKKACKRRAAELKDEQLYGQ 288


>gi|392558232|gb|EIW51426.1| hypothetical protein TRAVEDRAFT_25089 [Trametes versicolor
          FP-101664 SS1]
          Length = 288

 Score = 40.0 bits (92), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 1/50 (2%)

Query: 3  SIIEIEDEELMRCAVCRETALHKCSACKE-VAYCGKQHQKEHWKLHKPKC 51
          S++ + D  +  CAVC   ++ +C+ C    AYC   H +EHW  H   C
Sbjct: 2  SLVAVLDPPMPSCAVCELPSMLRCAYCNSGPAYCSSSHFREHWVTHAVVC 51


>gi|397599887|gb|EJK57538.1| hypothetical protein THAOC_22404, partial [Thalassiosira oceanica]
          Length = 440

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 5/71 (7%)

Query: 1   MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M+ +   E + +  CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+  
Sbjct: 165 MSCVPVAEVDVIEACANCGKQGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 224

Query: 56  YEIKSSPLLGR 66
            E+K   L G+
Sbjct: 225 AELKDEQLYGQ 235


>gi|397590002|gb|EJK54879.1| hypothetical protein THAOC_25455 [Thalassiosira oceanica]
          Length = 302

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC  V YCG   QK H K+HK  CK+   E+K   L G+
Sbjct: 28 LKNCTACLLVKYCGVDCQKAHRKIHKKACKERAAELKDERLYGQ 71


>gi|354801520|gb|AER39526.1| egl nine 1-like protein [Oncorhynchus mykiss]
          Length = 390

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 2  ASIIEIEDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          ++++   D +   C +C +   L KC  C+   YC K+HQK  WK HK  CK+
Sbjct: 9  SAVVTDLDRDQQYCELCGKMENLLKCGRCRNSFYCSKEHQKSDWKKHKQFCKE 61


>gi|397618435|gb|EJK64892.1| hypothetical protein THAOC_14321, partial [Thalassiosira oceanica]
          Length = 297

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 5/57 (8%)

Query: 15  CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           CA C +T      L  C+AC+ V YCG   Q+ H K H+  CK+   E+K   L G+
Sbjct: 90  CANCGKTGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHQKACKERAAELKDEQLYGQ 146


>gi|307207016|gb|EFN84839.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
          Length = 638

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 68/152 (44%), Gaps = 13/152 (8%)

Query: 58  IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
           +  S   GR+  AT D  PGD ++ E P      +   E     CH  L+   NL   ++
Sbjct: 248 VSFSKRYGRHYIATRDFKPGDIVSIEDPY---AHVIYEERYYTHCHYCLSRSYNLIPCSK 304

Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRS 174
           CP   +  CS +C  L+    H  EC ILKL   +LL  +  K   I   +R LI+   +
Sbjct: 305 CPIAQY--CSEKCRKLAWEVCHMTECPILKL-LTSLLNVDKDKIRMISKIIRLLIIVTAN 361

Query: 175 PKKYQEL-KDM---EAHMSKRGPGTEVYEEID 202
             K +EL +DM   E++   R  G    E +D
Sbjct: 362 GSKIEELQQDMKIAESNPDNRTAGFTDTEILD 393


>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 509

 Score = 40.0 bits (92), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSK---LFVFRDTH----ISTMYTNALWGTQPRREHL 301
           G+Y V  L+ H C+ N   +  ++SK   +   RD      I+  +T+A   T  R+ HL
Sbjct: 272 GIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEITHAFTDAASPTVVRKAHL 331

Query: 302 AITKYFNCSCERCSD 316
                F+C+C RC+D
Sbjct: 332 KSLYAFDCNCSRCND 346


>gi|397618815|gb|EJK65055.1| hypothetical protein THAOC_14145 [Thalassiosira oceanica]
          Length = 434

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/41 (51%), Positives = 23/41 (56%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L  C+AC  V YCG   QK H KLHK  CKK   E+K   L
Sbjct: 152 LKNCTACFLVKYCGVDCQKSHRKLHKRACKKRAAELKDEEL 192


>gi|397563281|gb|EJK43740.1| hypothetical protein THAOC_37790, partial [Thalassiosira oceanica]
          Length = 976

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 5/63 (7%)

Query: 21  TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA-----TLDLH 75
             L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G   +       LDLH
Sbjct: 315 VGLKNCTACRLVEYCGVDCQRAHRKKHKNVCKERAAELKDEELYGLGHERPEGGFLLDLH 374

Query: 76  PGD 78
             D
Sbjct: 375 AAD 377


>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
 gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
           Adohcy
          Length = 441

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L  +F  K  G   +D+   
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174

Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
               + C    +   ++      ++  + LM H+C PN   +   +  L   R       
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232

Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|397612058|gb|EJK61586.1| hypothetical protein THAOC_17901 [Thalassiosira oceanica]
          Length = 309

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 5/54 (9%)

Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
          CA C +T      L  C+AC+ V YCG   Q+ H K HK +CK+   E+K   L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRVHRKQHKKECKQRAAELKDEQL 68


>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
          Length = 433

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVV 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|298714742|emb|CBJ25641.1| pcdc2/rp-8 (programmed cell death protein 2) [Ectocarpus
           siliculosus]
          Length = 462

 Score = 40.0 bits (92), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 2/39 (5%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKC 51
           C VC ++A   CS C+   YCG++HQ  HWK   HK  C
Sbjct: 173 CCVCGQSAKSACSKCRIARYCGREHQALHWKTGGHKKAC 211


>gi|170118378|ref|XP_001890368.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164634638|gb|EDQ98966.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 532

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 3/43 (6%)

Query: 14 RCAVCRET---ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          +C V + +   A   CS CK V YC ++HQ + WK HK  CK+
Sbjct: 26 KCHVVKPSGAPAFSVCSGCKIVRYCSREHQVQDWKTHKAFCKR 68


>gi|116179140|ref|XP_001219419.1| hypothetical protein CHGG_00198 [Chaetomium globosum CBS 148.51]
 gi|88184495|gb|EAQ91963.1| hypothetical protein CHGG_00198 [Chaetomium globosum CBS 148.51]
          Length = 786

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 1/42 (2%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RCAVC   + L +C  C  + YCG +HQ  H + HK  C  +
Sbjct: 10 RCAVCPVRSGLLRCGGCNVIRYCGPEHQSAHRREHKKACTTI 51


>gi|449301926|gb|EMC97935.1| hypothetical protein BAUCODRAFT_407676 [Baudoinia compniacensis
           UAMH 10762]
          Length = 137

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 20/37 (54%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           C    E  L  CS CK   YC K+ QK HWK HKP C
Sbjct: 85  CGAASEIGLLTCSKCKTRKYCCKECQKAHWKNHKPVC 121


>gi|405968412|gb|EKC33486.1| Zinc finger MYND domain-containing protein 10 [Crassostrea gigas]
          Length = 451

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 22/38 (57%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           +CAVC   A  +CS C+   YC ++ Q  HWK HK  C
Sbjct: 400 KCAVCGAEATKRCSRCQNEWYCRRECQVSHWKKHKTAC 437


>gi|321460122|gb|EFX71168.1| hypothetical protein DAPPUDRAFT_112030 [Daphnia pulex]
          Length = 1050

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 2/42 (4%)

Query: 15   CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
            CA C++ +  L +CS C+   YC  + Q++HW  HK  CKK+
Sbjct: 1009 CAACKKVSQSLKRCSRCRSTVYCCVECQRQHWAQHKLVCKKV 1050


>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
          Length = 816

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)

Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
           R  Q +YSV  L  H+C PN  H+  +   LF+    H++      L        W  + 
Sbjct: 78  RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 136

Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
           R++ L     F C C  CS+        +A +C+N
Sbjct: 137 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 171


>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
          Length = 542

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 70/302 (23%), Positives = 127/302 (42%), Gaps = 39/302 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GR L  T  +  GD IA  +PL++ P L+    +C  C +   P     +RC   ++  C
Sbjct: 18  GRALHTTKTVAAGDVIAVFTPLLLLPSLSHLTTVCSFCLRAGTPR--PCSRCRAAYY--C 73

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK-- 182
            ARC   + +  H+ ECA L    ++    +  + E   P+R L+    S  + ++L   
Sbjct: 74  DARCQAAAWSGGHSLECAALVRAVKS----SKKRREIPTPVRALVKVLLSCGQPEDLSKN 129

Query: 183 --DMEAHMS--KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
              +E H++  +R PG    E    ++      F  +   +  +++  + +  +C  ++ 
Sbjct: 130 MDGLEGHVAERRREPGWADME----MMAMGGCAFAGR---ETSEESVRQAVEILCK-LQT 181

Query: 239 NG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYT 288
           N       D+G     L   + +  H+CLPNA       + +++++  + +   I   YT
Sbjct: 182 NAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLRAESRIQKGDEIEISYT 241

Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK----CLNEHKDQGDCWILPV 344
           +       R+  LA    F C C RC+   +L  Y ++       LNE    GD   L  
Sbjct: 242 DYTSSLGKRKAALAPYN-FECRCRRCTQ--DLCVYQASAHYRDYMLNEDSILGDGTKLQT 298

Query: 345 NP 346
           +P
Sbjct: 299 HP 300


>gi|397646269|gb|EJK77200.1| hypothetical protein THAOC_00984 [Thalassiosira oceanica]
          Length = 316

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L KC+AC  V YCG   QK H K HK  CK+   E+K   L G+
Sbjct: 31 LKKCTACLLVKYCGVDCQKAHRKQHKKLCKQRVAELKDERLYGQ 74


>gi|397637923|gb|EJK72864.1| hypothetical protein THAOC_05566 [Thalassiosira oceanica]
          Length = 305

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 5/66 (7%)

Query: 3  SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          S + + D+    CA C  TA     L  C+AC+ V YCG   Q  H K HK  CK+   E
Sbjct: 2  SCVPVVDDGNEVCANCGTTASNTVKLKNCTACRLVKYCGVDCQGAHRKHHKKACKQRAAE 61

Query: 58 IKSSPL 63
          +K   L
Sbjct: 62 LKDEQL 67


>gi|397601150|gb|EJK57830.1| hypothetical protein THAOC_22092 [Thalassiosira oceanica]
          Length = 284

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 26/49 (53%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          C    L KC+AC+ V YC    Q+ H K HK  CKK   E++   L G+
Sbjct: 22 CDAVKLKKCTACQLVKYCSVDCQRAHRKHHKGACKKRAAELEDEQLYGQ 70


>gi|397590805|gb|EJK55166.1| hypothetical protein THAOC_25129, partial [Thalassiosira oceanica]
          Length = 273

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   L
Sbjct: 232 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQL 272


>gi|397564868|gb|EJK44379.1| hypothetical protein THAOC_37083 [Thalassiosira oceanica]
          Length = 684

 Score = 39.7 bits (91), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H KLHK  CK+   E+K   +  + L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQMYSQGLE 75


>gi|356546765|ref|XP_003541793.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
           max]
          Length = 990

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL-PYEIKSSPL 63
           CA C   A  +CS CK V YC    Q  HW+L HK +C++L P++  S PL
Sbjct: 76  CARCSAPARTRCSRCKIVRYCSGNCQIIHWRLIHKQECQQLEPHKSSSFPL 126


>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
          Length = 417

 Score = 39.7 bits (91), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 11/91 (12%)

Query: 257 MEHNCLPNAKH-------SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNC 309
           + H C PNA++       S+ ++    +     I+  YT+ L  T+ R+ HLA   +F C
Sbjct: 242 LNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHLAQGFHFTC 301

Query: 310 SCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
           +C RC+ P    T    +  +N+ + Q + W
Sbjct: 302 TCPRCTSPITDIT----LSIINDLQAQLNDW 328


>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
 gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=HSKM-B; AltName: Full=Histone methyltransferase
           SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
           AltName: Full=SET and MYND domain-containing protein 2
 gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
 gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
 gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
          Length = 433

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L  +F  K  G   +D+   
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174

Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
               + C    +   ++      ++  + LM H+C PN   +   +  L   R       
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232

Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|449486818|ref|XP_004157412.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Cucumis
           sativus]
          Length = 711

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           C VC      KCS CK V YC    Q+ HWK  HK KCK  
Sbjct: 78  CVVCGNLGSKKCSRCKAVRYCSPTCQEIHWKAGHKTKCKDF 118


>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
 gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
          Length = 428

 Score = 39.7 bits (91), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 58/270 (21%), Positives = 102/270 (37%), Gaps = 38/270 (14%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A   L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    + ++ +  +    +K  
Sbjct: 67  IAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLMDGKPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E+++SK      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
           IC    +   ++     GLY  + L+ H+C PN   S +      + R          ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNC--SIVFNGPHLLLRAVREIEAGEELT 236

Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERC 314
             Y + L  ++ RR+ L     F C C RC
Sbjct: 237 ICYLDMLMTSEERRKQLRDQYCFECDCIRC 266


>gi|451996998|gb|EMD89464.1| hypothetical protein COCHEDRAFT_1205568 [Cochliobolus
           heterostrophus C5]
          Length = 437

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 53/127 (41%), Gaps = 15/127 (11%)

Query: 23  LHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKLPY---EIKSSPLLGRYLQATLDLHPGD 78
           L +C ACK V+YC  + QK+ W + HK  CKKL     + +  P  GR+           
Sbjct: 37  LKRCIACKRVSYCSVECQKQDWNRNHKQICKKLVTLNKQKRYVPSAGRHW---------- 86

Query: 79  RIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA 138
           R  RE   +   K    +P  LG        L D A    C +   ++ C+     H  A
Sbjct: 87  REYREEQELEFSKSGTMDPCELGTLIGRESQLEDGANLNACQYCHLTSFCTSCQQTHPSA 146

Query: 139 PECAILK 145
            ECA L+
Sbjct: 147 -ECATLQ 152


>gi|397607252|gb|EJK59617.1| hypothetical protein THAOC_20130 [Thalassiosira oceanica]
          Length = 312

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 5/103 (4%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA-TLDLHPGDRIA 81
           L  C+AC+ V YCG   Q+ H K HK  CK+   E++   L G+ L+    D  P   + 
Sbjct: 37  LKDCAACRLVKYCGVDCQRAHRKQHKKACKQRAAELEDEKLYGQGLERPKGDFCPICTLP 96

Query: 82  RESPLIVGP--KLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
              P+      K   ++ IC+GC         D+  CP C  P
Sbjct: 97  IPLPMDKHSYFKTCCSKMICIGCSMATQKRGMDD--CPFCRTP 137


>gi|397575063|gb|EJK49512.1| hypothetical protein THAOC_31605 [Thalassiosira oceanica]
          Length = 309

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 25/46 (54%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
          L KC+AC  V YC    QK H K HK  CKK   E+K   L G+ L
Sbjct: 23 LKKCNACHLVKYCSVGCQKAHRKQHKGACKKRAAELKDEQLYGQGL 68


>gi|397568200|gb|EJK46011.1| hypothetical protein THAOC_35344 [Thalassiosira oceanica]
          Length = 425

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 5/62 (8%)

Query: 7   IEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
           + D  +  CA C +       L  C+AC+ V YCG   Q+ H K HK  CK+   E+K  
Sbjct: 149 VADNGIEACANCGKEGSITVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDE 208

Query: 62  PL 63
            L
Sbjct: 209 KL 210


>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
 gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
          Length = 597

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 107/297 (36%), Gaps = 60/297 (20%)

Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL--IL 170
           N+ C    W      C+ L +   HAP   +                +A+   RCL  +L
Sbjct: 139 NSECQNKDWATHKRECTALQEWAKHAPAAEV------------SVPSDAV---RCLGRLL 183

Query: 171 QRRSPKKY-----QELKDMEAHMSKRGP-GTEVYEEID-SIVKYLR-------SNFLEKL 216
            +R  K       Q++  M++      P G E +  +  S+V+YL         +F    
Sbjct: 184 WKRQKKGLDSTWAQQIAAMQSSRKSLQPSGYEFHTHLSHSLVRYLGLSSPDGLQDFGISS 243

Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSK-- 274
           PGD+LD TS     +      V   D+      +  ++ L+ H+C PNA       SK  
Sbjct: 244 PGDLLDLTS----RFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTP 299

Query: 275 --------LFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT 322
                   +   RD      I T Y +       RR  L     F C C  C  P  +  
Sbjct: 300 STDEPLISVVAIRDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLCKTPPPV-- 357

Query: 323 YFSAMKCLNEHKDQGDCWILPV--NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQ 377
              A + L   K  G    LP+  NPL       C +C A +   D  L  +++G++
Sbjct: 358 --DAREALWCPKSCGGMCPLPIEENPLAQ-----CANCKAVVKDTDAVLDATRVGQE 407


>gi|297722479|ref|NP_001173603.1| Os03g0704200 [Oryza sativa Japonica Group]
 gi|255674817|dbj|BAH92331.1| Os03g0704200 [Oryza sativa Japonica Group]
          Length = 174

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   Q+E WKLH+ +C+ +
Sbjct: 90  LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 121


>gi|449447371|ref|XP_004141442.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Cucumis
           sativus]
          Length = 711

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           C VC      KCS CK V YC    Q+ HWK  HK KCK  
Sbjct: 78  CVVCGNLGSKKCSRCKAVRYCSPTCQEIHWKAGHKTKCKDF 118


>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
          Length = 436

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 52/230 (22%), Positives = 93/230 (40%), Gaps = 26/230 (11%)

Query: 99  CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK 158
           C GC       L+   RC + F+  C+  C    D   H  ECA +          N + 
Sbjct: 55  CDGCF-ARKEGLSKCGRCKQAFY--CNVECQK-EDWPMHKLECASM-----CTFGQNWHP 105

Query: 159 YEAI-LPLRCLILQRRSPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNF 212
            E + L  R L  Q+  P++ Q      +K+ E+H+ K     E  E I + +  L   +
Sbjct: 106 SETVRLTARILAKQKTHPERTQSEKLLAVKEFESHLDKLD--NEKRELIQNDIAALHHFY 163

Query: 213 LEKLPGDVLDDTSAKCL--HWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-N 269
            + L  D  D+ +   L     C    +   ++      ++  + LM H+C PN   +  
Sbjct: 164 SKHL--DYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYK 221

Query: 270 MMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              +++   R+      I + Y + L+ T+ R + L  + +F+C C  C+
Sbjct: 222 GTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCRECT 271


>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
          Length = 433

 Score = 39.7 bits (91), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 11/213 (5%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTH 282
                C    +   ++      ++  + LM H+C PN   +       +++   +     
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEE 235

Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|393238191|gb|EJD45729.1| hypothetical protein AURDEDRAFT_114035 [Auricularia delicata
          TFB-10046 SS5]
          Length = 249

 Score = 39.7 bits (91), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLG 65
          AL  C+ CK VAYC K+ Q+ HWK H+  C+     I +S   G
Sbjct: 40 ALSICAGCKFVAYCSKECQRAHWKSHQGVCRLTAATIGASRAQG 83


>gi|397646369|gb|EJK77239.1| hypothetical protein THAOC_00940, partial [Thalassiosira oceanica]
          Length = 616

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 15  CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           CA C +T      L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  + L+
Sbjct: 222 CANCGKTGSDTVKLKHCTACRIVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSQGLE 281


>gi|358388034|gb|EHK25628.1| hypothetical protein TRIVIDRAFT_32886 [Trichoderma virens Gv29-8]
          Length = 599

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 22/36 (61%)

Query: 17 VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          +C+E     C +C+ VAYCG   Q+ HW+ HK  CK
Sbjct: 18 LCKEIGKLPCKSCRMVAYCGISCQQSHWEEHKKHCK 53


>gi|397597611|gb|EJK57048.1| hypothetical protein THAOC_22954 [Thalassiosira oceanica]
          Length = 781

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S + + D+    CA C +       L  C+AC+ V YCG   Q++H K HK  CK+   E
Sbjct: 483 SCVPVADDGDEACANCGKQGGDTVKLKSCTACRLVKYCGVDCQRDHRKQHKKTCKQRAAE 542

Query: 58  IKSSPL 63
           +K   L
Sbjct: 543 LKDEEL 548


>gi|195064659|ref|XP_001996607.1| GH23322 [Drosophila grimshawi]
 gi|193899819|gb|EDV98685.1| GH23322 [Drosophila grimshawi]
          Length = 466

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 26/44 (59%)

Query: 8   EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           ++E++ +C  C + A  KC  CKE+ YC ++ Q + W  HK  C
Sbjct: 417 KEEKIYKCGACYKVAKKKCGNCKEMYYCSRECQLKDWPRHKESC 460


>gi|170108840|ref|XP_001885628.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164639504|gb|EDR03775.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 247

 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          ET L  C+ CK V YC +Q Q + WK HKP C
Sbjct: 36 ETTLKVCARCKAVKYCSRQCQTKDWKNHKPAC 67


>gi|406862909|gb|EKD15958.1| hypothetical protein MBM_05969 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 276

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 18/42 (42%), Positives = 22/42 (52%)

Query: 12  LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           ++ C +     L  C  CK   YCGK+HQK  WK HK  C K
Sbjct: 228 VVSCTLRGADKLTVCGVCKIQRYCGKEHQKADWKYHKHICAK 269


>gi|389750668|gb|EIM91741.1| hypothetical protein STEHIDRAFT_164225 [Stereum hirsutum FP-91666
           SS1]
          Length = 442

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 20/42 (47%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 13  MRCAVCRE---TALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           M C   R+     L  CS CK V YCGK  Q+ HWK HK  C
Sbjct: 399 MSCGKGRDGEPVDLQCCSRCKTVKYCGKDCQRLHWKFHKITC 440


>gi|330935391|ref|XP_003304944.1| hypothetical protein PTT_17678 [Pyrenophora teres f. teres 0-1]
 gi|311318134|gb|EFQ86881.1| hypothetical protein PTT_17678 [Pyrenophora teres f. teres 0-1]
          Length = 327

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
           + +CS C +  YCGKQHQK  W  H+  C K
Sbjct: 296 VSRCSGCGQAYYCGKQHQKLAWGFHRGYCAK 326


>gi|71991003|ref|NP_001023832.1| Protein EGL-9, isoform b [Caenorhabditis elegans]
 gi|22265838|emb|CAD44114.1| Protein EGL-9, isoform b [Caenorhabditis elegans]
          Length = 363

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 20/37 (54%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C  + L  C  C  VAYC K+HQ+  W  HK  CK L
Sbjct: 46 CTSSQLQTCLFCGTVAYCSKEHQQLDWLTHKMICKSL 82


>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
 gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
          Length = 433

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 45/216 (20%), Positives = 86/216 (39%), Gaps = 17/216 (7%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
           I   R+P +K   +K+ E+H+ K      ++ +   + + +  S  LE    D L    A
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVVFA 177

Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
           +     C    +   ++      ++  + LM H+C PN   +   +  L   R       
Sbjct: 178 QV---NCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232

Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
              + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|392567157|gb|EIW60332.1| hypothetical protein TRAVEDRAFT_45581 [Trametes versicolor
           FP-101664 SS1]
          Length = 704

 Score = 39.7 bits (91), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 24/49 (48%), Gaps = 2/49 (4%)

Query: 6   EIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           E+   EL RC+ C      L KC+ C +  YC    QK HW  HK  CK
Sbjct: 653 EMTSYELYRCSWCGNPSAVLRKCAGCGKTKYCDSGCQKSHWTEHKADCK 701


>gi|397642097|gb|EJK75025.1| hypothetical protein THAOC_03266 [Thalassiosira oceanica]
          Length = 286

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H + HK  CK+   E+K   L G+
Sbjct: 31 LKDCTACRLVKYCGVDCQRAHRRQHKKSCKRRAAELKDEQLYGQ 74


>gi|397575636|gb|EJK49801.1| hypothetical protein THAOC_31286 [Thalassiosira oceanica]
          Length = 490

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 7/66 (10%)

Query: 3  SIIEIEDEELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
          +++  +DE +  CA C         L  C+AC+ V YCG   Q+ H K HK +CK+   E
Sbjct: 4  TLVATDDEAV--CANCGLVSSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKECKRRATE 61

Query: 58 IKSSPL 63
          +K   L
Sbjct: 62 LKDEQL 67


>gi|330914309|ref|XP_003296581.1| hypothetical protein PTT_06719 [Pyrenophora teres f. teres 0-1]
 gi|311331195|gb|EFQ95321.1| hypothetical protein PTT_06719 [Pyrenophora teres f. teres 0-1]
          Length = 461

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 7/47 (14%)

Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          RCA C ++A       L +CS CK   YCG+  QK  WK HK  C +
Sbjct: 7  RCANCNKSAAECSLANLQQCSRCKTTEYCGRDCQKADWKTHKKICAQ 53


>gi|392567689|gb|EIW60864.1| hypothetical protein TRAVEDRAFT_71120 [Trametes versicolor
          FP-101664 SS1]
          Length = 322

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 5/49 (10%)

Query: 11 ELMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          +L RC+ C E       L KC+AC+ V YC KQ QK  W  H+  C+ +
Sbjct: 13 DLRRCSWCGEAEKPSQKLKKCAACEYVMYCSKQCQKSAWSDHRGSCQYM 61


>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
           mulatta]
          Length = 433

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
          Length = 384

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 44/210 (20%), Positives = 81/210 (38%), Gaps = 15/210 (7%)

Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
            RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  I   R
Sbjct: 17  GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHAER 73

Query: 174 SP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
           +P +K   +K+ E+H+ K     E  + I S +  L   + + L     D          
Sbjct: 74  TPSEKLLAVKEFESHLEKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVN 131

Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IST 285
           C    +   ++      ++  + LM H+C PN   +   +  L   R          + T
Sbjct: 132 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPGEEVFT 189

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
            Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 190 SYIDLLYPTEDRNDRLRDSYFFTCECQECT 219


>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
           [Oreochromis niloticus]
          Length = 439

 Score = 39.7 bits (91), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 60/281 (21%), Positives = 111/281 (39%), Gaps = 22/281 (7%)

Query: 61  SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           SP  G  L+A   +  G+ +    PL       LA+ +C  C       L    RC +C 
Sbjct: 11  SPGKGNGLRAAGRIRKGELVISAEPLACCVSNRLAKEVCHHCFTRHESLL----RCSQCK 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               C+  C   + +  H  EC  L+      L  +  +  A L    L   + S ++  
Sbjct: 67  MARYCNTTCQKQAWS-GHKRECKCLQ-SLLPRLPTDSVRLAAKLIFALLSPSKSSSEELY 124

Query: 180 ELKDMEAHMS-----KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-- 232
            L++ E+H++     K+   +++   ++  ++    + L+++ G  L  +  + L  I  
Sbjct: 125 TLEEHESHLTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEM-GSALPPSCREPLSLIAK 183

Query: 233 --CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HIST 285
             C    ++  ++     GLY  + L+ H+C PN        + +L   RD      ++ 
Sbjct: 184 VTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDIDPEDELTI 243

Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
            Y   L  T+ R+  L    +F C C+RC    + G   S 
Sbjct: 244 SYIETLSLTEDRQRQLEEQYHFTCHCQRCDSRDKDGLMLSG 284


>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
           Of Smyd2
 gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
 gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
 gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
 gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
          Length = 433

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|397569322|gb|EJK46672.1| hypothetical protein THAOC_34653 [Thalassiosira oceanica]
          Length = 392

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L G+
Sbjct: 22 LKDCTACRLVKYCGVDCQRAHRKRHKKACKQRAAELKDERLYGQ 65


>gi|321463445|gb|EFX74461.1| hypothetical protein DAPPUDRAFT_251928 [Daphnia pulex]
          Length = 790

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 2/39 (5%)

Query: 15  CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C++ +  L +CS C+ V YC  + Q+ HW LHK  C
Sbjct: 746 CASCKKISENLKRCSRCRTVVYCSVECQRGHWTLHKTVC 784


>gi|383860124|ref|XP_003705541.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
           [Megachile rotundata]
          Length = 407

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 15  CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
           C  C E  A  KCS CK V YC ++ Q+ HW +HK  C +L   I
Sbjct: 335 CITCGEDKANKKCSKCKIVQYCDRECQRLHWFMHKKACTRLGQSI 379


>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
          Length = 391

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 18  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 74

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 75  IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 132

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 133 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 190

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 191 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 225


>gi|443698784|gb|ELT98594.1| hypothetical protein CAPTEDRAFT_229252 [Capitella teleta]
          Length = 351

 Score = 39.7 bits (91), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
          L  C AC++  YC ++HQK HW+ HK +CK
Sbjct: 28 LSLCGACRQTWYCCREHQKSHWRQHKKECK 57


>gi|401883282|gb|EJT47497.1| hypothetical protein A1Q1_03609 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 358

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 10/101 (9%)

Query: 223 DTSAKCLHWICGVIEVNGVDIGRYT--QGLYSVICLMEHNCLPNAKHS---NMMQSKLFV 277
           D S  C ++   + + N + +G  +    +Y  +C + H+C+PNA HS   N+    +  
Sbjct: 141 DPSHPCANF--AIFDGNALPLGANSPKGAVYPTLCRINHSCIPNAHHSWNENLEMETIHA 198

Query: 278 FRDTHISTMYTNALWGT---QPRREHLAITKYFNCSCERCS 315
            RD       T   +       RR HL     F C+CE CS
Sbjct: 199 LRDISKGEEITIGYFLDGCRAERRAHLQNKFGFECNCEACS 239


>gi|397633519|gb|EJK71016.1| hypothetical protein THAOC_07582, partial [Thalassiosira oceanica]
          Length = 430

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L KC+AC+ V YCG   Q+ H K HK  CK+   E+K   L
Sbjct: 158 LRKCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQL 198


>gi|395323289|gb|EJF55768.1| hypothetical protein DICSQDRAFT_113934 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1176

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 8    EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
             D  L +C+VCR   +  L  C  C+ V YC    QK+ W  HK +CK
Sbjct: 1127 RDPTLHKCSVCRGKGKPKLMTCKGCQRVRYCSTACQKKDWPRHKAQCK 1174


>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
          griseus]
          Length = 404

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 22/36 (61%)

Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          R+  L KC  CK+  YCG + QKE W LHK +C  +
Sbjct: 29 RKEGLSKCGRCKQAFYCGVECQKEDWPLHKLECSSM 64



 Score = 38.5 bits (88), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 87/215 (40%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C   C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 32  GLSKCGRCKQAFY--CGVECQK-EDWPLHKLECSSMVVFGENWNPSETVRLTARILAKQK 88

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   +   L  +  D +S  
Sbjct: 89  IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHQFYSRHL--EFPDHSSLV 144

Query: 228 CL--HWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH-- 282
            L     C    +   ++      ++  + LM H+C PN   +     +++   ++ H  
Sbjct: 145 VLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPG 204

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C  C+
Sbjct: 205 DEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT 239


>gi|299740414|ref|XP_001838814.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
 gi|298404233|gb|EAU83041.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
          Length = 1320

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 26  CSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKS 60
           CS CK VAYC ++ QK+ WK +H+ +CK L  + KS
Sbjct: 458 CSGCKSVAYCSRECQKQDWKRVHREECKLLATQRKS 493


>gi|389739146|gb|EIM80340.1| hypothetical protein STEHIDRAFT_68351 [Stereum hirsutum FP-91666
           SS1]
          Length = 652

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 21/37 (56%)

Query: 16  AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           A C +     C +CK V YC K  Q +HWK+HK  CK
Sbjct: 115 AACPKRGSSLCGSCKLVQYCSKICQAQHWKIHKQDCK 151


>gi|384250513|gb|EIE23992.1| hypothetical protein COCSUDRAFT_62519 [Coccomyxa subellipsoidea
           C-169]
          Length = 620

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 43/187 (22%), Positives = 75/187 (40%), Gaps = 25/187 (13%)

Query: 244 GRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNALWGTQPR 297
           G+    +YS+I  + H+C  N  +      + ++++ + +  D  +   Y + L     R
Sbjct: 123 GQPPAAMYSLISRLNHSCRANTAYHFKAGGAVVVRAVVDIDPDEELCISYLDPLQPLSRR 182

Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV---NPLDNDSDWT 354
           +E L     F CSCERC+   ++    ++    +        W L     N +  + D  
Sbjct: 183 QEDLQQRFCFTCSCERCTLEQQM---LASTGLQDGTPAHAPTWFLSAIAPNAVGTEVD-- 237

Query: 355 CGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE-MLTKLEAMFHPHHYHCYAVK 413
             SC  RL        T Q  E+  +         +LE  +L  +E   HP+HY C A+ 
Sbjct: 238 --SCLTRL--------TRQAEEEFVRQGNATAAWTALEAGILQAVEEGAHPYHYRCLALY 287

Query: 414 HSLIQLY 420
             L+  Y
Sbjct: 288 ACLVSAY 294


>gi|147768947|emb|CAN69258.1| hypothetical protein VITISV_026383 [Vitis vinifera]
          Length = 502

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 24/50 (48%)

Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
          C   A  +C  C  VAYC   HQ  HW  HK  C +L  ++K   +L  +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEXCGRLEQQMKLVDVLNDF 61


>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
           [Acyrthosiphon pisum]
          Length = 742

 Score = 39.7 bits (91), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 117/341 (34%), Gaps = 57/341 (16%)

Query: 65  GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
           GRY  A+  +  G+ +  E+P             C  C  PL         C  C    C
Sbjct: 257 GRYFVASTKIEYGELLIFENPFAFVLLPEYYNSFCYNCCVPLKYYSIPCDNC--CTILFC 314

Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
             +C   +    H  EC   K G             A L LR  I   ++     ++ ++
Sbjct: 315 GDKCLQEARNSYHRWEC---KQGTSIFKCIGI----AHLALRLTIETSQANSNNDQIYNL 367

Query: 185 EAHMSKRGPGTEVYE---EIDSIVKYL--RSNFLEKLPGDVLDDTSAKCLHWICGVIEVN 239
             H+       E+Y+       ++ YL  +++F EK P  VLD    + LH +  ++  N
Sbjct: 368 LTHIDDL-KSLELYQYSLTATLLLIYLQKKTDFFEKHPNLVLDSVGNELLHHMTRLV-CN 425

Query: 240 GVDIG---------------------RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF-V 277
           G  I                      R    ++    L+ H+C PN   SN+++  +   
Sbjct: 426 GNAISTHMLSDYDSGSRTPIIDESQPRIGTAIFPTSSLLNHSCDPNIFSSNILKYVVIKA 485

Query: 278 FRDTHISTMYTNALWG-------TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK-- 328
            RD       TN  +G          R+  L    +F+C C  C DP     +F   +  
Sbjct: 486 SRDISEGEEITNC-YGPNFLRMRVVDRQASLKNQYHFDCECNTCLDPQADDLFFKTFEGL 544

Query: 329 -CLNEHKDQGDCWILPVNPLDNDSDWT--CGSCSARLNARD 366
            CL  H        +P    D D + T  CG C  R    D
Sbjct: 545 VCLRCHNG------IPATLSDLDVNETVYCGLCYVRFRTLD 579


>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
          Length = 520

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           L KCS C+   YCG   QKE WKLH+ +C+ +
Sbjct: 69  LRKCSVCRVAWYCGSVCQKEEWKLHQLECQAI 100


>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
          Length = 433

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|330934901|ref|XP_003304751.1| hypothetical protein PTT_17418 [Pyrenophora teres f. teres 0-1]
 gi|311318512|gb|EFQ87155.1| hypothetical protein PTT_17418 [Pyrenophora teres f. teres 0-1]
          Length = 557

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 7/47 (14%)

Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          RCA C ++A       L +CS CK   YCG+  QK  WK HK  C +
Sbjct: 7  RCANCNKSAAECSLANLQQCSRCKTTEYCGRDCQKADWKTHKKICAQ 53


>gi|169848916|ref|XP_001831162.1| hypothetical protein CC1G_09865 [Coprinopsis cinerea okayama7#130]
 gi|116507730|gb|EAU90625.1| hypothetical protein CC1G_09865 [Coprinopsis cinerea okayama7#130]
          Length = 443

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 11  ELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           +L  C VC  T    +CS C  V YCG + Q E W  HK  CK L
Sbjct: 254 DLSGCEVCGSTKGASRCSGCLSVLYCGTKCQTEDWPRHKATCKSL 298


>gi|405959392|gb|EKC25437.1| hypothetical protein CGI_10023056 [Crassostrea gigas]
          Length = 475

 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
           CA C+ T   +CS CK + YC  + QKEHW  H+P C
Sbjct: 436 CARCKTTGSLQCSRCK-MPYCSHECQKEHWPEHRPHC 471


>gi|440790393|gb|ELR11676.1| MYND finger domain containing protein [Acanthamoeba castellanii
           str. Neff]
          Length = 669

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 9   DEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
           D++   CA C +TA   C  CK V YC ++ Q +HW+  HK  C K+
Sbjct: 617 DKKKQSCARCGKTANKACGGCKLVWYCSRECQSDHWQQSHKAACSKV 663


>gi|397631833|gb|EJK70303.1| hypothetical protein THAOC_08347, partial [Thalassiosira oceanica]
          Length = 351

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 24/44 (54%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  R
Sbjct: 73  LKNCTACRLVKYCGVDCQRAHRKKHKKACKQRAAELKDEQLYSR 116


>gi|24644165|ref|NP_730906.1| HIF prolyl hydroxylase, isoform C [Drosophila melanogaster]
 gi|20151779|gb|AAM11249.1| RE70727p [Drosophila melanogaster]
 gi|23170431|gb|AAF52052.2| HIF prolyl hydroxylase, isoform C [Drosophila melanogaster]
 gi|220948766|gb|ACL86926.1| Hph-PB [synthetic construct]
 gi|220958252|gb|ACL91669.1| Hph-PB [synthetic construct]
          Length = 478

 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          RC++C  +  L +C+ CK V YC   HQ  HW  H+ +C+ L
Sbjct: 29 RCSICGTQQQLLRCAKCKAVYYCSPAHQHLHWPDHRTECRLL 70


>gi|397582820|gb|EJK52429.1| hypothetical protein THAOC_28291, partial [Thalassiosira
          oceanica]
          Length = 337

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 28/57 (49%)

Query: 17 VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
           C    L  C+AC+ V YC    Q+ H K+HK  C+    E++   L G+ L+   D
Sbjct: 17 ACGAAKLKTCTACRLVKYCSVDCQRAHRKIHKKACRARAAELRDERLYGQGLERPED 73


>gi|397569320|gb|EJK46670.1| hypothetical protein THAOC_34651 [Thalassiosira oceanica]
          Length = 303

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ + YCG   Q+ H + HK  CKK   E+K   L G+
Sbjct: 23 LKDCTACRLIKYCGVDCQRAHRRQHKKACKKRAAELKDERLYGQ 66


>gi|67613152|ref|XP_667280.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54658400|gb|EAL37050.1| hypothetical protein Chro.40234 [Cryptosporidium hominis]
          Length = 582

 Score = 39.3 bits (90), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 41/192 (21%), Positives = 84/192 (43%), Gaps = 24/192 (12%)

Query: 250 LYSVICLMEHNCLPN------AKHSNMMQSK--LFVFRDTHISTMYTNALWGTQPRREHL 301
           LY+VI +M HNC  +        ++ ++++K  L +  +  IS +  + L+     R  L
Sbjct: 313 LYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEIGDEITISYISDDDLFKCSKTRREL 372

Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT----CGS 357
                F C CERC++PT+L      +KC         C +  +   ++++  T    C  
Sbjct: 373 LSNWLFYCQCERCNNPTDLS---RGLKC-------ASCGVGSMFFKEDENGVTRSSRCSV 422

Query: 358 CSARLNARDVHLVTSQLGEQVDKLVQENPN-VKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
           C +  +   +   T    + V +L + + N V+ +E +  + + +F   H+  Y +   L
Sbjct: 423 CRSHPDREIIDSYTDLESQYVARLEETDKNDVEDVEAVYQEAQKVF-TQHWIMYELDTML 481

Query: 417 IQLYGTQPGYAY 428
            + Y     +++
Sbjct: 482 FEAYKESQQFSF 493


>gi|397568499|gb|EJK46168.1| hypothetical protein THAOC_35179 [Thalassiosira oceanica]
          Length = 374

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 27/56 (48%)

Query: 14  RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           R        L +C+AC+ V YC    QK H K HK  CKK   E+K   L  + L+
Sbjct: 99  RTKAGEAVVLKRCTACRLVKYCSVDCQKAHRKYHKQACKKRAAELKDEQLYEQGLE 154


>gi|397568098|gb|EJK45952.1| hypothetical protein THAOC_35408, partial [Thalassiosira oceanica]
          Length = 868

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L KC+AC+ V YCG   Q+ H K HK  CK+   E+K   L
Sbjct: 156 LRKCTACRIVKYCGVDCQRAHRKQHKKACKQRAAELKDERL 196


>gi|323454349|gb|EGB10219.1| hypothetical protein AURANDRAFT_62835 [Aureococcus anophagefferens]
          Length = 3543

 Score = 39.3 bits (90), Expect = 4.7,   Method: Composition-based stats.
 Identities = 14/31 (45%), Positives = 19/31 (61%)

Query: 23   LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
            L KC+ CK   YC ++ QK  W +HK  C+K
Sbjct: 3509 LLKCARCKRTHYCSRECQKADWSVHKLTCRK 3539


>gi|397568360|gb|EJK46098.1| hypothetical protein THAOC_35255 [Thalassiosira oceanica]
          Length = 307

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
          L  C+AC+ V YCG   Q+ H + HK  CK+   E+K   L G+
Sbjct: 30 LKDCTACRLVKYCGVDCQRAHRRQHKKACKQRAAELKDEQLYGQ 73


>gi|302677903|ref|XP_003028634.1| expressed protein [Schizophyllum commune H4-8]
 gi|300102323|gb|EFI93731.1| expressed protein [Schizophyllum commune H4-8]
          Length = 219

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 23/46 (50%), Gaps = 2/46 (4%)

Query: 9   DEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           D EL RC+ CR     L KC  C +  YC  + QK  W  HK  CK
Sbjct: 171 DFELYRCSYCRNPSAVLRKCGGCSKARYCDAECQKLGWAAHKKTCK 216


>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
          Length = 428

 Score = 39.3 bits (90), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)

Query: 65  GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
           G  L+A + L PG+ + R  PL      G +  + +   LG  K +        RC +C 
Sbjct: 15  GNGLRAVVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66

Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
               CSA+C   +    H  EC  LK  C+     +  +    +  + +       +K  
Sbjct: 67  VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMEEIPSESEKLY 124

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
              D+E++++K      + E+    ++ L   F   +  ++ D        D        
Sbjct: 125 TFYDLESNINK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDIFEAFAKV 178

Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
           IC    +   ++     GLY  + L+ H+C PN +   N     L   RD      ++  
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238

Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
           Y + L  ++ RR+ L     F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266


>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           ASHR1-like [Glycine max]
          Length = 401

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 5/76 (6%)

Query: 249 GLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
           GLY VI ++ H+CLPN+       S ++ +   +   T +   Y      T  ++  L  
Sbjct: 128 GLYPVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNALKS 187

Query: 304 TKYFNCSCERCSDPTE 319
           +  F C+C RCS   E
Sbjct: 188 STXFTCTCPRCSKVGE 203


>gi|281204212|gb|EFA78408.1| hypothetical protein PPL_09059 [Polysphondylium pallidum PN500]
          Length = 414

 Score = 39.3 bits (90), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 8   EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKK 53
           +D    +C  C   A  +C  C+ V YCGK HQ   W+L H+ +CK+
Sbjct: 135 QDARQPQCTFCGCAAPTRCGNCRRVHYCGKDHQSRDWELGHREQCKQ 181


>gi|397611439|gb|EJK61326.1| hypothetical protein THAOC_18212 [Thalassiosira oceanica]
          Length = 306

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 58/138 (42%), Gaps = 22/138 (15%)

Query: 1   MASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
           M   + + D++   CA C +       L +C+AC+ V YCG   QK H K HK  C +  
Sbjct: 1   MMKCLPVMDDDDKVCANCGKHGSDTVKLKRCTACRLVKYCGVDCQKAHRKQHKKACNQRV 60

Query: 56  YEIKSSPLLGRYLQA-TLDLHPGDRIARESPLIVGPKLALAEPICL---GCHKPLNPNLA 111
            E+K   L  + L+    D  P   +    P+    K +L  P C+   GC      N+A
Sbjct: 61  AELKDEQLYSQGLERPEGDFCPICTLPIPFPI---DKHSLFHPCCMTRIGC------NVA 111

Query: 112 DNAR----CPRCFWPACS 125
           D       CP C  P C 
Sbjct: 112 DTKTGIKPCPFCRAPLCD 129


>gi|356563968|ref|XP_003550229.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
           max]
          Length = 735

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
           CAVC   A  KCS CK V YC +  Q+ HW   HK  CK  
Sbjct: 71  CAVCATPAPKKCSRCKAVRYCSQACQQSHWMSGHKTVCKNF 111


>gi|449549246|gb|EMD40212.1| hypothetical protein CERSUDRAFT_112416 [Ceriporiopsis subvermispora
           B]
          Length = 659

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 21/44 (47%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           EL RC+ C+     L KC  C    YC    QK HW  HK  CK
Sbjct: 614 ELYRCSWCKNPSAVLRKCGGCGRTRYCDGVCQKSHWSEHKAACK 657


>gi|397623696|gb|EJK67106.1| hypothetical protein THAOC_11904 [Thalassiosira oceanica]
          Length = 309

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
          L  C+AC+ V YCG   Q+ H K HK +CK+   E+K   L  + L+
Sbjct: 35 LKNCTACRLVKYCGVDCQRAHRKQHKKECKQRAAELKDEQLYSQGLE 81


>gi|397573939|gb|EJK48939.1| hypothetical protein THAOC_32222 [Thalassiosira oceanica]
          Length = 392

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
           L KC+AC+ V YCG   Q+ H K HK  CK+   E+K   L
Sbjct: 119 LRKCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQL 159


>gi|392560004|gb|EIW53187.1| hypothetical protein TRAVEDRAFT_133378 [Trametes versicolor
           FP-101664 SS1]
          Length = 431

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 21/40 (52%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
           C VC ++   +CS C+   YC    QK  W  HKP C+ L
Sbjct: 255 CEVCGKSGTSRCSQCQSATYCSPACQKIDWPHHKPTCRSL 294


>gi|167519084|ref|XP_001743882.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777844|gb|EDQ91460.1| predicted protein [Monosiga brevicollis MX1]
          Length = 549

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 5  IEIEDEELMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
          + IE    + CA C  T   L +C+AC   AYC  + QK+HW+ HK +C
Sbjct: 1  MSIEALVHLVCANCGGTGERLKRCTACSMTAYCNAECQKQHWRHHKKEC 49


>gi|427794897|gb|JAA62900.1| Putative egl nine log 1, partial [Rhipicephalus pulchellus]
          Length = 427

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)

Query: 15  CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           C  C  T+ L +C+ C  + YC K+HQ+ HW  HK  CK
Sbjct: 63  CQFCGSTSGLLRCAQCHGIYYCSKEHQRLHWPAHKHICK 101


>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
 gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
           Full=Histone methyltransferase SMYD2; AltName: Full=SET
           and MYND domain-containing protein 2
 gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Cofactor Product Adohcy
 gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
           Smyd2 In Complex With The Methyltransferase Inhibitor
           Sinefungin
 gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
          Length = 433

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)

Query: 65  GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
           GR L+A    H GD +    A    L VG +    E     C       L+   RC + F
Sbjct: 18  GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72

Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
           +  C   C    D   H  EC+ + +  E        +  A +  +  I   R+P +K  
Sbjct: 73  Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129

Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
            +++ E+H+ K     E  + I S +  L   + + L  +  D +S   L     C    
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185

Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
           +   ++      ++  + LM H+C PN   +     +++   ++ H    + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245

Query: 293 GTQPRREHLAITKYFNCSCERCS 315
            T+ R + L  + +F C C  C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268



 Score = 38.9 bits (89), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 22/40 (55%)

Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
          C   R+  L KC  CK+  YC  + QKE W LHK +C  +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93


>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
          Length = 433

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)

Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
            L+   RC + F+  C+  C    D   H  EC+ + +  E        +  A +  +  
Sbjct: 61  GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117

Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
           I   R+P +K   +K+ E+H+ K     E  + I S +  L   + + L     D     
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVL 175

Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
                C    +   ++      ++  + LM H+C PN   +   +  L   R        
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233

Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
             + T Y + L+ T+ R + L  + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268


>gi|301123801|ref|XP_002909627.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262100389|gb|EEY58441.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 429

 Score = 39.3 bits (90), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 25/45 (55%)

Query: 20  ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
           E  L  C  C+ VAYC ++ QK+ WK HK  CK +  + + +  L
Sbjct: 249 EGKLMTCGGCRSVAYCSRECQKKAWKKHKTDCKAIRAQREQAEAL 293


>gi|397628595|gb|EJK68977.1| hypothetical protein THAOC_09810 [Thalassiosira oceanica]
          Length = 481

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 23  LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
           L  C+AC+ V YCG   Q+ H K HK  CK+   E+K   L  + L+
Sbjct: 207 LKDCTACRLVKYCGVDCQRAHRKKHKKACKQRAAELKDEQLFSQGLE 253


>gi|260800903|ref|XP_002595336.1| hypothetical protein BRAFLDRAFT_87573 [Branchiostoma floridae]
 gi|229280581|gb|EEN51348.1| hypothetical protein BRAFLDRAFT_87573 [Branchiostoma floridae]
          Length = 1307

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 2/44 (4%)

Query: 12 LMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
          L +C+ C   +    +CS+C +  YC +  QK+ WK HKP C++
Sbjct: 15 LHKCSTCSRVSRDAKRCSSCDKAWYCNQTCQKDDWKRHKPTCQE 58


>gi|356554161|ref|XP_003545417.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
           max]
          Length = 729

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)

Query: 15  CAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
           CAVC   A  KCS CK V YC +  Q+ HW   HK  CK  
Sbjct: 70  CAVCATPAPKKCSRCKAVRYCSQACQQSHWMSGHKTVCKDF 110


>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
           niloticus]
          Length = 435

 Score = 39.3 bits (90), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 37/165 (22%), Positives = 73/165 (44%), Gaps = 15/165 (9%)

Query: 163 LPLRCLILQR----RSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKL 216
           L  R ++ QR    R+P ++   LK+ EAH+ K     E   + D + + +  S  +  L
Sbjct: 108 LVARIIMKQRATTERTPSERLLLLKEFEAHLDKMDSEKEEMNQTDIAALHHFYSRHISNL 167

Query: 217 PGD-VLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSK 274
           P +  L +  A+     C    +   ++      ++  + LM H+C PN   +     ++
Sbjct: 168 PDEQALTELFAQV---NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAE 224

Query: 275 LFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
           +   ++ +    I   Y + L+ T+ R+E L  + +F C C  C+
Sbjct: 225 VRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECT 269


>gi|397570850|gb|EJK47491.1| hypothetical protein THAOC_33780 [Thalassiosira oceanica]
          Length = 338

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 5/69 (7%)

Query: 3   SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
           S I ++      CA C +       L  C+AC+ V YCG   Q+ H + HK  CK+   E
Sbjct: 49  SCITVDGSPADVCANCGKQGSDAVKLKNCTACRLVKYCGVDCQRAHRRQHKKACKQRAAE 108

Query: 58  IKSSPLLGR 66
           +K   L G+
Sbjct: 109 LKDEQLYGQ 117


>gi|340052290|emb|CCC46562.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 667

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 20/142 (14%)

Query: 181 LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-CGVIEV- 238
           L+D  A MS+R      +  +  +V+ L       L G+    +  + L  I C  +EV 
Sbjct: 334 LQDHAAEMSQR------FAPLVRLVEGL-------LEGEETPVSVVRLLGVIRCNALEVC 380

Query: 239 --NGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHISTMYTNAL-WGT 294
              G+ IG+      ++     H+C PN A   N + +   V     ++  Y   L W  
Sbjct: 381 DSTGLGIGQALHA-STITSYFNHSCAPNCAMDGNAIVTICEVLPGEELTIAYIPQLYWPV 439

Query: 295 QPRREHLAITKYFNCSCERCSD 316
           + RRE LA   +F+C C RC D
Sbjct: 440 ELRREELAGRYFFHCCCTRCGD 461


>gi|449549234|gb|EMD40200.1| hypothetical protein CERSUDRAFT_112406 [Ceriporiopsis subvermispora
           B]
          Length = 704

 Score = 39.3 bits (90), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)

Query: 11  ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
           EL RC+ CR     L KC  C +  YC    QK HW  HK  CK
Sbjct: 659 ELYRCSWCRNPSAMLRKCGGCGKTRYCDAICQKSHWGDHKIACK 702


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.135    0.441 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,791,966,964
Number of Sequences: 23463169
Number of extensions: 365933993
Number of successful extensions: 789832
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1569
Number of HSP's successfully gapped in prelim test: 929
Number of HSP's that attempted gapping in prelim test: 785455
Number of HSP's gapped (non-prelim): 3600
length of query: 541
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 393
effective length of database: 8,886,646,355
effective search space: 3492452017515
effective search space used: 3492452017515
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)