BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy6242
(541 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242007168|ref|XP_002424414.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507814|gb|EEB11676.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 563
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 193/472 (40%), Positives = 291/472 (61%), Gaps = 40/472 (8%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
M+++ E D+ L +C +C +KC++CK+V YC K+HQKE WK HK +C+ P+++KS
Sbjct: 1 MSTVGEEIDKLLEKCKICDVQTTNKCTSCKKVNYCCKEHQKEDWKNHKIQCR--PFQMKS 58
Query: 61 SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
S LGR+L+AT ++ G+ + E P++ GP+ E +CLGC+K +CP+C
Sbjct: 59 SLELGRFLEATRRINAGEILFTEYPIVFGPRPE--EIVCLGCYK-----TEPEEKCPKCL 111
Query: 121 WPACSARCSGLSDAHTHAPECAILKLGC-----ETLLAYNDYKYEAILPLRCLILQRRSP 175
WPACS+ C GL++ H EC +L+L ++ + + ++++ I PLRCL+LQ+++P
Sbjct: 112 WPACSSSCPGLTNPDHHGSECFLLRLYTRNNQQQSYVTDDYFRFDVIFPLRCLLLQKKNP 171
Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGD-----------VLDD 223
++ + ME+HM KRGP TE+Y++ + IVKYL NF + L + D+
Sbjct: 172 NGWKIITGMESHMKKRGPNTEIYKQTNEKIVKYLNDNFFKFLNDSGNREPDTEMILLPDE 231
Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKH----SNMMQSKL 275
+ + LH ICG+I+VN ++I G LY LMEHNCLPN +H S + K+
Sbjct: 232 RTPEILHKICGIIDVNSLEIQSPKGGTLNALYETAFLMEHNCLPNTRHIFQPSRRFKIKV 291
Query: 276 FVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
F D HISTMYT+ALWGTQ R++HL TK+F C C RC DP+E+GT S +KCL
Sbjct: 292 FANNDIQPGNHISTMYTHALWGTQQRQDHLRSTKHFICRCRRCGDPSEMGTNLSGLKCLA 351
Query: 332 EHKDQGDCW--ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
DQ C LP+ PLD+ + W C C L + V + +G++VD+++ + P ++
Sbjct: 352 PTVDQKICGGTQLPMAPLDSITVWQCDKCPTSLTSGQVSDFVNMIGDEVDRILNKQPKIE 411
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
SLE +L+KL+ + H +HYHCY+V HSL+QLYG QPGY Q++ +LE+KI
Sbjct: 412 SLETLLSKLKKLLHKNHYHCYSVMHSLVQLYGRQPGYLPHQMNDLMLEKKIG 463
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 5/52 (9%)
Query: 452 DEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
+E +CLGC+K +CP+C WPACS+ C GL++ H EC +L+L
Sbjct: 91 EEIVCLGCYK-----TEPEEKCPKCLWPACSSSCPGLTNPDHHGSECFLLRL 137
>gi|193683766|ref|XP_001950081.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 533
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 181/446 (40%), Positives = 272/446 (60%), Gaps = 19/446 (4%)
Query: 4 IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
II + + CA C +A KC+AC V YC K+HQK HWK HK +C L YE++
Sbjct: 7 IISKHLKNISHCAFCGNSASSKCAACTLVVYCSKEHQKAHWKQHKNEC--LSYELQIDSN 64
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA-DNARCPRCFWP 122
+GR+L A ++P + I +E PL++GPK +EPIC+ C + L + + + C +C WP
Sbjct: 65 VGRHLIAKRTINPFEIIIQEEPLVIGPKFPTSEPICIKCLQRLKRSESLVESLCEKCLWP 124
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
C C + + H EC +L G E + NDY Y+A+ PL+CL+LQ K+ L
Sbjct: 125 ICGTGCVTSINKNIHEGECTVLVKGSEKIAKNNDYMYDALTPLKCLLLQFTDKNKWNRLM 184
Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD 242
++++HM RGP +EVYEEI+S+ YL++N+ L G+ +TS+ +H ICG+++VN +D
Sbjct: 185 ELKSHMEYRGPESEVYEEINSVYNYLKNNY---LSGEEF-ETSSDLIHTICGILDVNALD 240
Query: 243 I---GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWG 293
+ G +Y + +EHNCLPN S +++V+ + HI+TMYTNALWG
Sbjct: 241 VQVAGLELTAIYPTVSKLEHNCLPNTGISFDKCGRIYVYASRKITKGEHITTMYTNALWG 300
Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDW 353
T+ RR HL TKYF C C+RCSD TELGT FS + C + +G+ + P++PLD+ S+W
Sbjct: 301 TRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVCNVKGFCKGN--LTPIHPLDDSSEW 358
Query: 354 TCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVK 413
C C +++ + + ++L VDK +Q NP++ SLE+ +KL+ H +HY C+ VK
Sbjct: 359 ECDRCPNTVSSDKIDAILTELNHIVDKALQ-NPSINSLEDAFSKLKTRIHSNHYLCFNVK 417
Query: 414 HSLIQLYGTQPGYAYTQLSSSLLERK 439
H+LIQLYG GY ++ L+ LL+RK
Sbjct: 418 HTLIQLYGHSMGYKHSMLTDDLLKRK 443
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 2/73 (2%)
Query: 443 VMSPNLKATDEPICLGCHKPLNPNLA-DNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
V+ P T EPIC+ C + L + + + C +C WP C C + + H EC +L
Sbjct: 88 VIGPKF-PTSEPICIKCLQRLKRSESLVESLCEKCLWPICGTGCVTSINKNIHEGECTVL 146
Query: 502 KLGCETLLAYNDY 514
G E + NDY
Sbjct: 147 VKGSEKIAKNNDY 159
>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Length = 563
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 196/453 (43%), Positives = 280/453 (61%), Gaps = 34/453 (7%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
+CA C + A KCSACK V+YC K HQK+HWK HK C+ P+EI ++ +G+ L AT D
Sbjct: 5 KCAECDKPAELKCSACKLVSYCCKDHQKKHWKSHKTLCR--PFEITTTKEVGKCLVATRD 62
Query: 74 LHPGDRIARESPLIVGPKLALAE--PI-CLGCHKPLNPNLADNA-RCPRCFWPACSARCS 129
L PGD I E PL+ GP+ + E P+ C GC + + + +N+ RCP C +P C RC
Sbjct: 63 LSPGDVIISELPLVYGPRPHMVEEGPVPCPGCCRLI---ICENSPRCPGCDFPVCHPRCP 119
Query: 130 GLSDAHTHAPECAILKLGCETLLAYND----YKYEAILPLRCLILQRRSPKKYQELKDME 185
GL D H EC IL L + A N Y+ + +L LRCL+LQ+++PKK+ +L +ME
Sbjct: 120 GLKDMEKHGHECLILSL--REIRAINGLHDFYRQDTLLALRCLLLQKKNPKKFAQLMEME 177
Query: 186 AHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIE 237
AH+ KRGP TE+Y+++ + YL F L +VL D S + +H ICG+I+
Sbjct: 178 AHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISKETVHKICGIID 237
Query: 238 VNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHSN---------MMQSKLFVFRDTHISTM 286
VN ++I + + LY LMEH+CL N HS +++ L + + HISTM
Sbjct: 238 VNALEINQDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTM 297
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YT+ALWGTQ RREHL TKYF+C+C+RC DPTE+G+Y SA++CL D D + LP +P
Sbjct: 298 YTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRCLGTGVDSCDGYQLPADP 357
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
D+++ W C C +L +V + +Q+GE+VD + +P VK L+ +L+K+ HP+H
Sbjct: 358 TDDNTQWLCNKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDNLLSKMLTFLHPNH 417
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
YH Y+VKHSL+QLYG Q GY +Q+S + +K
Sbjct: 418 YHVYSVKHSLVQLYGYQQGYTPSQISDDTVLKK 450
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 469 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
++ RCP C +P C RC GL D H EC IL L
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHGHECLILSL 136
>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
Length = 575
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 196/453 (43%), Positives = 280/453 (61%), Gaps = 34/453 (7%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
+CA C + A KCSACK V+YC K HQK+HWK HK C+ P+EI ++ +G+ L AT D
Sbjct: 5 KCAECDKPAELKCSACKLVSYCCKDHQKKHWKSHKTLCR--PFEITTTKEVGKCLVATRD 62
Query: 74 LHPGDRIARESPLIVGPKLALAE--PI-CLGCHKPLNPNLADNA-RCPRCFWPACSARCS 129
L PGD I E PL+ GP+ + E P+ C GC + + + +N+ RCP C +P C RC
Sbjct: 63 LSPGDVIISELPLVYGPRPHMVEEGPVPCPGCCRLI---ICENSPRCPGCDFPVCHPRCP 119
Query: 130 GLSDAHTHAPECAILKLGCETLLAYND----YKYEAILPLRCLILQRRSPKKYQELKDME 185
GL D H EC IL L + A N Y+ + +L LRCL+LQ+++PKK+ +L +ME
Sbjct: 120 GLKDMEKHGHECLILSL--REIRAINGLHDFYRQDTLLALRCLLLQKKNPKKFAQLMEME 177
Query: 186 AHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIE 237
AH+ KRGP TE+Y+++ + YL F L +VL D S + +H ICG+I+
Sbjct: 178 AHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISKETVHKICGIID 237
Query: 238 VNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHSN---------MMQSKLFVFRDTHISTM 286
VN ++I + + LY LMEH+CL N HS +++ L + + HISTM
Sbjct: 238 VNALEINQDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAALPIKKGDHISTM 297
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YT+ALWGTQ RREHL TKYF+C+C+RC DPTE+G+Y SA++CL D D + LP +P
Sbjct: 298 YTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRCLGTGVDSCDGYQLPADP 357
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
D+++ W C C +L +V + +Q+GE+VD + +P VK L+ +L+K+ HP+H
Sbjct: 358 TDDNTQWLCNKCDIKLTNSEVSYLINQIGEEVDHVQLSSPTVKDLDNLLSKMLTFLHPNH 417
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
YH Y+VKHSL+QLYG Q GY +Q+S + +K
Sbjct: 418 YHVYSVKHSLVQLYGYQQGYTPSQISDDTVLKK 450
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 469 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
++ RCP C +P C RC GL D H EC IL L
Sbjct: 102 NSPRCPGCDFPVCHPRCPGLKDMEKHGHECLILSL 136
>gi|357620844|gb|EHJ72885.1| hypothetical protein KGM_13877 [Danaus plexippus]
Length = 517
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 173/413 (41%), Positives = 250/413 (60%), Gaps = 35/413 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI----CLGCHKPLNPNLA 111
YEI+ S LGRYL A DL P D + E PL+ GPK ++ +P C+GC+KP+ ++
Sbjct: 6 YEIRKSEELGRYLVAARDLTPDDVVLTELPLVYGPK-SMPDPEALMPCVGCYKPIFTDVG 64
Query: 112 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLL--AYNDYKYEAILPLRCLI 169
+ RC RC WP CS C GL D H EC IL E +L + Y+++A+LPLRC +
Sbjct: 65 E--RCSRCGWPVCSGNCPGLKDPLHHGVECEILSARPECVLDNMADYYRHDALLPLRCAL 122
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSI-VKYLRSNFLEKLPGDV----LDDT 224
LQ K+++L ++++HM R PGT+ Y+E D VKYL + F+ K+ +V L+
Sbjct: 123 LQYTDDDKWKKLLELQSHMECRVPGTDAYDEADEFTVKYLMNVFINKIDKNVKNKYLNLI 182
Query: 225 SAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSK------ 274
S + LH ICG+I+ N ++I G LY+ C+MEH+C+PN KH K
Sbjct: 183 SGELLHKICGIIDTNALEIRLPNGSELNALYATTCMMEHSCVPNTKHLFNTSGKDVKDKY 242
Query: 275 -------LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
+ + + H++TMY++ALWGTQ RR+HL TKYF+C C RCSDPTELGTY SAM
Sbjct: 243 KITVKVVVPINKGDHVATMYSHALWGTQARRQHLKDTKYFSCKCIRCSDPTELGTYLSAM 302
Query: 328 KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
KC + K D LP +PLD ++DW C C+ ++N ++++ S++GE+V+ + +
Sbjct: 303 KCFGDDKGPCDGIHLPEDPLDEETDWACNKCTVKVNNSQINILISEMGEEVENVQMMGGS 362
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
V LE +L +L HP+HYH Y++KHSLIQLYG Q Y +S +L++KI
Sbjct: 363 VNMLENILCRLSTFLHPNHYHLYSIKHSLIQLYGRQSSY----MSEEILDKKI 411
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 28/46 (60%), Gaps = 2/46 (4%)
Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
C+GC+KP+ ++ + RC RC WP CS C GL D H EC IL
Sbjct: 52 CVGCYKPIFTDVGE--RCSRCGWPVCSGNCPGLKDPLHHGVECEIL 95
>gi|170034227|ref|XP_001844976.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875488|gb|EDS38871.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 548
Score = 311 bits (796), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 174/453 (38%), Positives = 256/453 (56%), Gaps = 35/453 (7%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
+CA+C A KC+ CK V YC +HQK+HWK HK +C K PYE+ + +GR+ AT
Sbjct: 7 KCALCGVEASLKCAGCKRVVYCSPEHQKKHWKRQHKNECAK-PYELARNEEVGRFFLATE 65
Query: 73 DLHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACS 125
+ + ESP+++GPK L E C+GC + RC RC WP C
Sbjct: 66 AIPKDTILFTESPMVIGPKWNLDEYEQRSAIVPCVGCFTDCQLGVY---RCDRCNWPVCK 122
Query: 126 ARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
C GL +A+ HA EC IL+ G + ++ Y+Y+A+L L+CL LQ + +
Sbjct: 123 PDCPGLVNANLHAIECPILRFGGGPKPRDDPEAVFDYYRYDAMLVLKCLALQIHNRPLFD 182
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
++ +E+H R G++ Y + D V+YL NFL L ++ K ICG++EV
Sbjct: 183 QMMQLESHYEAR-KGSQYYRDADDRTVQYLLKNFLAPLKKQ--EEIQGKT---ICGILEV 236
Query: 239 NGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF------VFRDTHISTMYT 288
N + I GR GLY + C++EH C+PN +++ + L H+ST YT
Sbjct: 237 NAMVIPLTNGREICGLYPIGCMLEHCCMPNCFYTSTFLNLLVYDLTNSFLSGEHLSTTYT 296
Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLD 348
+ALWGTQ RR+HL KYF+CSC RC+DPTELGTY SA++CL + LP++ L+
Sbjct: 297 HALWGTQQRRDHLKTNKYFSCSCARCADPTELGTYLSALRCLGVDGVGCSGYQLPIDSLN 356
Query: 349 NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYH 408
+ SDW C C ++ A V+ + +++GE++D +++ ++K E+M+ KL HP+HYH
Sbjct: 357 DASDWKCNQCPVQIEADQVNFLLAKIGEEMDDAIEQKTSIKQKEDMIAKLLTFLHPNHYH 416
Query: 409 CYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
+A+KHSLIQ+YG GY QLS L KI
Sbjct: 417 IFALKHSLIQMYGHLKGYQTPQLSDRDLADKIG 449
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG 504
C+GC + RC RC WP C C GL +A+ HA EC IL+ G
Sbjct: 99 CVGCFTDCQLGVY---RCDRCNWPVCKPDCPGLVNANLHAIECPILRFG 144
>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
Length = 565
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 180/460 (39%), Positives = 263/460 (57%), Gaps = 39/460 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
CA+C + KC+ CK V YC +HQK+HW++ HK +C K PYE+ + +GR+ AT +
Sbjct: 8 CALCGVPSSLKCAGCKLVVYCSPEHQKKHWRMQHKNECAK-PYELSRNEEVGRFFLATEE 66
Query: 74 LHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACSA 126
+ + E+P+++GPK LAE C+GC + RC C WPAC
Sbjct: 67 IAKDTILFTEAPMVIGPKWNLAEYEQRSMTVPCVGCFADCQLGMY---RCEACQWPACKP 123
Query: 127 RCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQE 180
C GL +A+ HA EC IL+ G + ++ Y+Y+A+L L+CL LQ R+ + + +
Sbjct: 124 DCPGLENANLHALECGILRFGSGPKPRDDPEAVFDYYRYDALLALKCLALQIRNRQLFDQ 183
Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFL------EKLPGD-VLDDTSAKCLHWI 232
+ +E+H R GT Y+E D V+YL NFL EK G VL K LH I
Sbjct: 184 MLQLESHYEAR-KGTPFYKEADERTVQYLLKNFLNPLKKMEKQQGKTVLPVCDVKTLHKI 242
Query: 233 CGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFR-------D 280
G++EVN + I GR GLY + C++EH C+PN ++ N + F+
Sbjct: 243 SGILEVNAMVIPLSNGREICGLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRNIKKG 302
Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
H++T YT++LWGTQ RR+HL KYF+CSC RC+DPTELGTY SA+KC+ + +
Sbjct: 303 EHLTTTYTHSLWGTQQRRDHLKANKYFSCSCARCADPTELGTYLSALKCMGIDGNACAGY 362
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
LP++PL SDW C C + + V+ + S++GE+VD ++ +VK LE++++KL
Sbjct: 363 QLPIDPLKETSDWKCNQCPITIESDQVNFLLSKIGEEVDDAMERKSSVKQLEDLISKLLT 422
Query: 401 MFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
HP+H+ +KHSLIQ+YG GYA QLS +L KI
Sbjct: 423 FLHPNHHFLLQLKHSLIQMYGHFKGYATNQLSDRILAEKI 462
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 36/82 (43%), Gaps = 11/82 (13%)
Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGC--------ET 507
C+GC + RC C WPAC C GL +A+ HA EC IL+ G E
Sbjct: 99 CVGCFADCQLGMY---RCEACQWPACKPDCPGLENANLHALECGILRFGSGPKPRDDPEA 155
Query: 508 LLAYNDYKSLSRAYLDHQQVRS 529
+ Y Y +L Q+R+
Sbjct: 156 VFDYYRYDALLALKCLALQIRN 177
>gi|198461482|ref|XP_001362030.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
gi|198137355|gb|EAL26610.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 179/458 (39%), Positives = 253/458 (55%), Gaps = 44/458 (9%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
M CAVC A C+ CK V YC K+HQK+HW HK CK P K +LGR+L AT
Sbjct: 4 MDCAVCGVAAAQTCTRCKIVRYCDKEHQKQHWPEHKRCCK--PIREKKDDVLGRFLVATQ 61
Query: 73 DLHPGDRIARESPLIVGPKLALAEPI-------CLGCHKP--LNPNLADNARCPRCFWPA 123
D+ I E PL+VGPK ++E C+GC+ P L +L C C WP
Sbjct: 62 DIKANQIIFVEEPLVVGPKWYMSEAEKSSTTVPCVGCYTPCRLGKHL-----CRNCRWPV 116
Query: 124 CSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
CSA C A EC++L LG +T + Y+ +A+L L+CL+LQR+ PKK
Sbjct: 117 CSASCE------HEALECSVLSLGPGPSARADTRGLNDFYRGDALLVLKCLLLQRKDPKK 170
Query: 178 YQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKL---PGDVLDDTSAKCLHWIC 233
+ +L +M++H +R GTE+++E D IV YL FL++L DVL+ + LH +C
Sbjct: 171 WTDLLEMQSHEEER-KGTELHQEAEDGIVSYLSQRFLQRLEQSKKDVLEHCEPELLHRLC 229
Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN---AKHSNMMQSKLFVFRD----TH 282
G+IE N + + G GL+ C+MEH C PN + MQ ++ D H
Sbjct: 230 GIIETNYMVVELPTGIELSGLFRQACMMEHACQPNCYFQFDGSTMQIEVRAGGDLKKGDH 289
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
+ YTN LWGT R+ HL +TK+F C CERC DPTE G+Y SA+ CL + L
Sbjct: 290 LRITYTNILWGTHLRQHHLRLTKHFKCGCERCLDPTEFGSYVSALTCLGDVNKSCGGKQL 349
Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
P++P+D S W C +C ++ V + + + EQVDK++ P+ +E L +L M
Sbjct: 350 PLDPVDEHSQWKCDACPMAMDTAYVTELQTHMTEQVDKMLAGRPSASEIELTLARLTQML 409
Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
HP+H+H + +KH+LIQLYG + G + LS +LERK+
Sbjct: 410 HPNHFHMFNLKHTLIQLYGNENGLELSSLSDGILERKL 447
>gi|350413549|ref|XP_003490027.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 480
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 161/417 (38%), Positives = 241/417 (57%), Gaps = 36/417 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C +C + A KC+ CK+ YC K+HQ++ W H+ C+ +EI+ S LGRYL A+ DL
Sbjct: 9 CPICNQRATLKCNGCKQQFYCKKEHQRQDWPRHRSNCQA--WEIRESCELGRYLVASRDL 66
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
+PGD I ESPL+ GP L + +C+GC K N + + RC +C WPAC CSGL+D
Sbjct: 67 NPGDVILSESPLVWGPALHSDQRVCVGCGKQCN---SSDTRCTKCLWPACDTDCSGLTDK 123
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
+ H EC+ L + A + + +L +R LIL R+ K + ++ +++H RG G
Sbjct: 124 NRHGLECSFL------VKARIIPRCDVLLVIRMLILWRKKLKYWYSIEKLQSHQDSRGQG 177
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQGLYS 252
T YEE +I +++ E+L D + +S +H ICG+I+VN ++ +Y
Sbjct: 178 TNAYEETMNIYQHI-----ERLLPD--NPSSKDIVHKICGLIDVNALETIPPEGCVAIYE 230
Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
CL+EH+CL N +HS + K + + H+STMYT+ALW T+ RR HL
Sbjct: 231 TACLLEHSCLANTRHSFKIDDKGRPRITVIAVCSIQKWDHLSTMYTHALWSTRARRAHLL 290
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
TKYF+C C+RC+DPTELGT+ +KC +++ +ILP +PL+ DS W C SC L
Sbjct: 291 ETKYFSCHCKRCADPTELGTHLGTLKCPHDND-----FILPKDPLNFDSQWGCISCPGTL 345
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
A +V +L E VD+++ + N L ++L++L + HP H C +V HSLIQL
Sbjct: 346 TASEVMQFVGKLEEDVDEIMYQ-ANKNKLVDLLSRLTTLLHPGHQLCTSVSHSLIQL 401
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+D+ +C+GC K N + + RC +C WPAC CSGL+D + H EC+ L
Sbjct: 86 SDQRVCVGCGKQCN---SSDTRCTKCLWPACDTDCSGLTDKNRHGLECSFL 133
>gi|340710128|ref|XP_003393648.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 480
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 160/417 (38%), Positives = 241/417 (57%), Gaps = 36/417 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C +C + A KC++CK+ YC K+HQ++ W H+ C+ +EI+ S LGRYL A+ DL
Sbjct: 9 CPICNQRATLKCNSCKQQFYCKKEHQRQDWPRHRSNCQA--WEIRESCELGRYLVASRDL 66
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
+PGD I ESPL+ GP L + +C+GC K + + RC +C WPAC CSGL+D
Sbjct: 67 NPGDVILSESPLVWGPALHSDQRVCVGCGKQC---YSGDTRCTKCLWPACDTNCSGLTDK 123
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
+ H EC+ L + A + + +L +R LIL R+ K + ++ +++H RG G
Sbjct: 124 NRHGLECSFL------IKARIIPRCDVLLVIRMLILWRKKLKHWYSIEKLQSHQDSRGQG 177
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQGLYS 252
T YEE +I ++ E+L D + +S +H ICG+I+VN ++ +Y
Sbjct: 178 TNAYEETMNIYHHI-----ERLLPD--NPSSKDIVHKICGLIDVNALETIPPEGCVAIYE 230
Query: 253 VICLMEHNCLPNAKHSNMMQSKLF----------VFRDTHISTMYTNALWGTQPRREHLA 302
CL+EH+CL N +HS + K + + H+STMYT+ALW T+ RR HL+
Sbjct: 231 TACLLEHSCLANTRHSFKIDDKGRPRITVIAVCPIQKWDHLSTMYTHALWSTRARRAHLS 290
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
TKYF+C C+RC+DPTELGT+ +KC +++ +ILP +PL+ DS W C SC L
Sbjct: 291 ETKYFSCHCKRCADPTELGTHLGTLKCPHDND-----FILPKDPLNFDSQWGCNSCPGTL 345
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
A +V +L E VD+++ + N L ++L++L + HP H C +V HSLIQL
Sbjct: 346 TASEVMQFVGKLEEDVDEIMYQ-ANKNKLVDLLSRLTTLLHPGHQLCTSVSHSLIQL 401
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 3/51 (5%)
Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+D+ +C+GC K + + RC +C WPAC CSGL+D + H EC+ L
Sbjct: 86 SDQRVCVGCGKQC---YSGDTRCTKCLWPACDTNCSGLTDKNRHGLECSFL 133
>gi|28573973|ref|NP_610202.3| CG14590 [Drosophila melanogaster]
gi|19528095|gb|AAL90162.1| AT24727p [Drosophila melanogaster]
gi|28380690|gb|AAF57282.2| CG14590 [Drosophila melanogaster]
gi|220949792|gb|ACL87439.1| CG14590-PA [synthetic construct]
Length = 553
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 172/454 (37%), Positives = 252/454 (55%), Gaps = 40/454 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC A C+ CK V YC ++HQK+HW HK +C+ P+ + LGRYL+ T ++
Sbjct: 7 CPVCGVAASQACTRCKMVRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQNI 64
Query: 75 HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
G + E PL+VGPK L++ C+GC+ P +C RC WP CSA
Sbjct: 65 AAGQIVFIEEPLVVGPKWYLSDADKEASNVPCVGCYTPCR---LGKHQCRRCRWPVCSAG 121
Query: 128 CSGLSDAHTHAPECAILKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQEL 181
C S EC++L LG + + NDY + +A+L L+CL+LQR+SP K+ L
Sbjct: 122 CKHES------MECSVLSLGSGSPTRADARSLNDYFRGDALLVLKCLLLQRQSPTKWSAL 175
Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIE 237
+M++H +R GT++YEE + +V YL+ FL +L ++L D + LH +CG+IE
Sbjct: 176 LEMQSHEEER-KGTDLYEEAEKRVVTYLQKRFLCRLKQTNPNLLTDCGPEMLHRLCGIIE 234
Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTM 286
N + I G GL+ C+MEH C PN +++ R H+
Sbjct: 235 TNFMVIELPSGVELSGLFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRIT 294
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YTN LWGTQ R+ HL +TK+F+C C RC DPTE GTY SA+ CL + LPV+P
Sbjct: 295 YTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALTCLGDVNQTCGGTHLPVDP 354
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
LD ++ W C +C ++ V + S + EQV+ L+ P+ +E +L +L M HP+H
Sbjct: 355 LDENTQWKCDTCPMIVDGAYVAELQSHMTEQVEGLLAGCPSANQVELLLARLTHMLHPNH 414
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+H + +KH+LIQLYG + G LS++ LERK+
Sbjct: 415 FHTFNLKHTLIQLYGNEAGLELGVLSNTQLERKL 448
>gi|345483721|ref|XP_001601934.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
Length = 480
Score = 281 bits (718), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 160/418 (38%), Positives = 238/418 (56%), Gaps = 40/418 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC + A +C +CK YCGK+HQ+EHW HK C+ YE + + LGR+L AT DL
Sbjct: 11 CAVCSKAATTRCGSCKLAFYCGKEHQREHWPRHKTSCRA--YEERENQELGRHLLATRDL 68
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 133
P D I E+P++ GP + E +C+GC + D RCP C WPAC C GL D
Sbjct: 69 KPDDLIISEAPIVWGPSSHVEERVCVGC------GVRDAVCRCPNCAWPACRLACDGLVD 122
Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGP 193
+ H EC IL L+ DY + LIL+R++PKK++ L +++H RGP
Sbjct: 123 ENRHGYECGILAKA--RLMPRCDYLLILRV----LILRRKNPKKWRVLSKLQSHEESRGP 176
Query: 194 GTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLY 251
GTE +EE++++ +YL S LD SA+ L +CG+I+VN ++ + +Y
Sbjct: 177 GTEAHEEMEAVKQYLGSLL-------SLDQGSAEALPKVCGLIDVNALETNPPEGSAAIY 229
Query: 252 SVICLMEHNCLPNAKHSNMMQSK------LFVFRDT----HISTMYTNALWGTQPRREHL 301
CL+EH C+ N ++S + + ++ D H+ST YT+ LW T+ RREHL
Sbjct: 230 ETACLLEHRCVANTRYSFELDEQGRPRINVYAVTDIKKADHLSTTYTHVLWATRVRREHL 289
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TKYF+C CERC+DPTELG++ +KC E ++ P +PL+ ++DW+C C
Sbjct: 290 LATKYFSCRCERCADPTELGSHLGTLKCPCEKG-----FVTPDDPLNPETDWSCKDCPGV 344
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
L+A +V +T +LGE+VD + + + ++L +L + HP H HC + HSL+QL
Sbjct: 345 LSAGEVAQLTDRLGEEVDAAMS-AADKDIMSDLLFRLTVLLHPCHQHCLTIGHSLMQL 401
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 27/52 (51%), Gaps = 7/52 (13%)
Query: 451 TDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+E +C+GC + D RCP C WPAC C GL D + H EC IL
Sbjct: 88 VEERVCVGC------GVRDAVCRCPNCAWPACRLACDGLVDENRHGYECGIL 133
>gi|380026797|ref|XP_003697129.1| PREDICTED: uncharacterized protein LOC100866234 [Apis florea]
Length = 480
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 162/418 (38%), Positives = 233/418 (55%), Gaps = 38/418 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C +C + A KCS CK+ YC K+HQ+ W HK C+ +EI+ + LGR+L A+ DL
Sbjct: 9 CPICNQRATLKCSGCKQQFYCKKEHQRVDWPRHKLTCQ--AWEIRENSELGRHLLASRDL 66
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
+PGD I ESPL GP + + +C+GC K + N RC +C WPAC+ CSGL+D
Sbjct: 67 NPGDVILSESPLAWGPSIHSDQRLCVGCGKQCK---SANTRCTKCLWPACAVDCSGLTDK 123
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
+ H EC+ L + A + + +L +R LIL R+ K + ++ +++H RGPG
Sbjct: 124 NRHDLECSFL------IKAKIIPRCDVLLVIRMLILWRKKSKHWNSIQKLQSHEDSRGPG 177
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK-CLHWICGVIEVNGVDI--GRYTQGLY 251
T YEE +I +++ LP D++S K + ICG+I+VN ++ +Y
Sbjct: 178 TSAYEETMNIYYHIQ----RLLP----DNSSTKDIVSKICGLIDVNALETVPPEGCVAIY 229
Query: 252 SVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHL 301
CL+EH+CL N +HS + K + + H+STMYT+ALW T+ RR HL
Sbjct: 230 ETACLLEHSCLANTRHSFTIDDKGRPRITVKAVCSIQKGDHLSTMYTHALWATRARRSHL 289
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TKYF+C C+RC+DPTELGT+ +KC Q D +I NPLD DS+W C SC
Sbjct: 290 LETKYFSCHCKRCADPTELGTHLGTLKC-----PQDDGFISSKNPLDFDSEWGCDSCPGT 344
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
L A ++ +L E VD ++ K L +L++L + HP H C V HSLIQL
Sbjct: 345 LTASEMMQFIGKLEEDVDDVMCSAQKDK-LVNLLSRLTKLLHPGHQLCITVSHSLIQL 401
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+D+ +C+GC K + N RC +C WPAC+ CSGL+D + H EC+ L
Sbjct: 86 SDQRLCVGCGKQCK---SANTRCTKCLWPACAVDCSGLTDKNRHDLECSFL 133
>gi|335892814|ref|NP_001229486.1| uncharacterized protein LOC724300 [Apis mellifera]
Length = 479
Score = 277 bits (709), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 161/417 (38%), Positives = 236/417 (56%), Gaps = 37/417 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C +C + A KCS CK+ YC K+HQ+ W HK C+ +EI+ + LGR+L A+ DL
Sbjct: 9 CPICNQRATLKCSGCKQQFYCKKEHQRMDWPRHKLTCQ--AWEIRENSELGRHLLASRDL 66
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
+PGD I ESPL+ GP + + +C+GC K + N RC +C WPAC+ CSGL+D
Sbjct: 67 NPGDVILSESPLVWGPSIHSDQRLCVGCGKQCK---SANIRCTKCLWPACAVDCSGLTDK 123
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
+ H EC+ L + A + + +L +R LIL + K + ++ +++H RGPG
Sbjct: 124 NRHDLECSFL------IKAKIIPRCDVLLVIRMLILWCKKSKYWNSIQKLQSHEDSRGPG 177
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK-CLHWICGVIEVNGVDI--GRYTQGLY 251
T VYEE +I +++ LP D++S K + ICG+I++N ++ +Y
Sbjct: 178 TSVYEETMNIYYHIQ----RLLP----DNSSTKDIVSKICGLIDINALETVPPEGCVAIY 229
Query: 252 SVICLMEHNCLPNAKHSNMM---------QSKLFVFRDTHISTMYTNALWGTQPRREHLA 302
CL+EH+CL N +HS + ++ + + H+STMYT+ALW T+ RR HL
Sbjct: 230 ETACLLEHSCLANTRHSFTIDKGRPRITVKALCSIQKGDHLSTMYTHALWATRVRRSHLL 289
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
TKYF+C C+RC+DPTELGT+ +KC Q + +ILP NPLD DS+W C SC L
Sbjct: 290 ETKYFSCHCKRCADPTELGTHLGTLKC-----PQDNGFILPKNPLDFDSEWGCDSCPGTL 344
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
A ++ +L E VD V + L +L++L + HP H C V HSLIQL
Sbjct: 345 TASEMIQFIGKLEEDVDD-VMCSARKDKLVNLLSRLTTLLHPGHQLCITVSHSLIQL 400
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+D+ +C+GC K + N RC +C WPAC+ CSGL+D + H EC+ L
Sbjct: 86 SDQRLCVGCGKQCK---SANIRCTKCLWPACAVDCSGLTDKNRHDLECSFL 133
>gi|307187820|gb|EFN72772.1| Protein msta, isoform B [Camponotus floridanus]
Length = 484
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 154/417 (36%), Positives = 232/417 (55%), Gaps = 36/417 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + A KC C+ YC K HQ W H+ C + +EI LGRYL AT DL
Sbjct: 10 CPVCGKNATLKCGNCRREFYCDKSHQSRDWPRHRSAC--IAWEIGRDADLGRYLLATRDL 67
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
PGD I E+PL+ GP + +C+GC K N ++ + RC C WPAC + C GLSD
Sbjct: 68 APGDAILSEAPLVWGPSTHATQRLCVGCGKRCNDDI--DMRCGVCRWPACRSNCEGLSDK 125
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
H ECA+L A + E +L +R LIL R K++ L ++++H RGPG
Sbjct: 126 KRHESECALLAR------ARIIPRCEVLLVIRMLILWRTKSKRWTSLANLQSHEGSRGPG 179
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLYS 252
T+ ++E+ SI ++L + D+L ICG+I+VN ++ + +Y
Sbjct: 180 TDAHDEVTSISQHLGPLLMAFDCQDILPK--------ICGLIDVNALETMPPEGSMAIYE 231
Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
L+EH+C+ N +HS + K + + H+STMYT+ALW T+ RREHL
Sbjct: 232 NASLLEHSCVANTRHSFRIDDKGRPRITVYAVTSIKKGEHLSTMYTHALWSTRIRREHLL 291
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
+TKYF C C+RC+DPTELGT+ +KC ++ +LP +PL++++DW+C C +
Sbjct: 292 VTKYFACRCKRCADPTELGTHLGTLKCPCKNG-----LMLPNDPLNSNTDWSCNVCPGIV 346
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
A +V + +L E+V ++++ +L ++L++L + HP H HC +V HSLIQL
Sbjct: 347 TASEVAQLMDRLEEEVTGIMKQATE-HTLSDLLSRLTVLLHPGHQHCISVSHSLIQL 402
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 8/73 (10%)
Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
V P+ AT + +C+GC K N ++ + RC C WPAC + C GLSD H ECA+L
Sbjct: 80 VWGPSTHAT-QRLCVGCGKRCNDDI--DMRCGVCRWPACRSNCEGLSDKKRHESECALLA 136
Query: 503 LG-----CETLLA 510
CE LL
Sbjct: 137 RARIIPRCEVLLV 149
>gi|195380866|ref|XP_002049182.1| GJ20889 [Drosophila virilis]
gi|194143979|gb|EDW60375.1| GJ20889 [Drosophila virilis]
Length = 553
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 172/457 (37%), Positives = 249/457 (54%), Gaps = 43/457 (9%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
+CAVC A C+ CK V YC +HQK+HW HK C+ P+ ++ LGRYL AT D
Sbjct: 7 KCAVCDVAASLTCTRCKMVRYCNGEHQKQHWPQHKSCCR--PFRVQQDEQLGRYLVATQD 64
Query: 74 LHPGDRIARESPLIVGPKLALAE-------PICLGCHKPLNPNLADNARCPRCFWPACSA 126
+ I E PL+VGPK L E C+GC+ P +C C WP CSA
Sbjct: 65 IKAKQIIFVEEPLVVGPKWFLTEHEKSASIVPCVGCYTPCRLG---KHQCRNCRWPVCSA 121
Query: 127 RCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQE 180
C EC++L LG + ++ Y+++A+L L+CL+LQR+ P+++Q
Sbjct: 122 SCLH------EKLECSVLSLGTAPTAKSDARALHDYYRHDALLVLKCLLLQRQQPERWQA 175
Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGD------VLDDTSAKCLHWIC 233
L +M++H +R GTE+ +E + IV YL FL++L L D + LH +C
Sbjct: 176 LLEMQSHAEER-LGTELQQEAEQRIVSYLEQRFLQRLKQSKNRECCQLSDYEPELLHRLC 234
Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTH 282
G+IE N + I G GL+ C+MEH C PN A +++ + R H
Sbjct: 235 GIIETNYMVIELATGVELSGLFRQACMMEHACQPNCYFQFDSATQRIAVRAGCDMKRGDH 294
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
+ YTN LWGTQ R+ HL +TK+F+C C RC DPTE G+Y SAM+CL + L
Sbjct: 295 LKITYTNILWGTQMRQHHLRMTKHFSCRCARCEDPTEYGSYVSAMRCLGDVSKSCVGVQL 354
Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
P++PL+ + W C +C L+A V V + + E+VD+L+ P +E +L +L +
Sbjct: 355 PLDPLNEQTQWKCDTCPVLLDAAYVTEVQTHMTEEVDQLLAGRPTASQVELLLARLSQLL 414
Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
HP+H+H + +KH+LIQLYG +PG T+LS L RK
Sbjct: 415 HPNHFHMFNLKHTLIQLYGHEPGLELTELSDLQLGRK 451
>gi|194864122|ref|XP_001970781.1| GG10833 [Drosophila erecta]
gi|190662648|gb|EDV59840.1| GG10833 [Drosophila erecta]
Length = 553
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 171/454 (37%), Positives = 250/454 (55%), Gaps = 40/454 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC A C+ CK V YC ++HQK+HW HK +C+ P+ + LGRYL+ T D+
Sbjct: 7 CPVCGVAASQACTRCKMVRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQDI 64
Query: 75 HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
G + E PL+VGPK L++ C+GC NP +C RC WP CSA
Sbjct: 65 AAGQIVFIEEPLVVGPKWYLSDADKEASIVPCVGC---CNPCRLGKHQCRRCRWPVCSAG 121
Query: 128 CSGLSDAHTHAPECAILKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQEL 181
C S EC++L L + + + NDY + +A+L L+CL+LQR+SP K+ L
Sbjct: 122 CEHES------LECSVLSLRSGSPARADVRSLNDYFRGDALLVLKCLLLQRQSPTKWSAL 175
Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIE 237
+M++H +R GTE++EE + +V YL+ FL +L ++L D + LH +CG+IE
Sbjct: 176 LEMQSHEDER-KGTELHEEAEQRVVSYLQQRFLCRLKQTNPNLLHDCGPEMLHRLCGIIE 234
Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTM 286
N + I G GL+ C+MEH C PN +++ R H+
Sbjct: 235 TNFMVIELPTGVELSGLFRQACMMEHACQPNCDFQFDNKTQQIAVRAGCDLRKGDHLRIT 294
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YTN LWGTQ R+ HL +TK+F+C C RC DPTE GTY SA+ CL + L V+P
Sbjct: 295 YTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALACLGDVNQTCGGTHLAVDP 354
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
LD ++ W C +C L+ V + S + E V+ L+ P+ +E +L +L M HP+H
Sbjct: 355 LDENTQWKCDTCPMLLDGAYVSELQSHMTEHVEGLLGGCPSANQVELLLARLTHMLHPNH 414
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+H + +KH+LIQLYG + G LS++ L+RK+
Sbjct: 415 FHTFNLKHTLIQLYGNEAGLELGVLSNTQLKRKL 448
>gi|195023574|ref|XP_001985712.1| GH20950 [Drosophila grimshawi]
gi|193901712|gb|EDW00579.1| GH20950 [Drosophila grimshawi]
Length = 557
Score = 271 bits (693), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 168/455 (36%), Positives = 249/455 (54%), Gaps = 42/455 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC A C+ CK V YC +HQK+HW HK CK P+++++ P LGRYL AT +
Sbjct: 8 CAVCNVAAPLTCTRCKLVRYCNGEHQKQHWPTHKTCCK--PFKVQNDPQLGRYLVATQTI 65
Query: 75 HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
I E PL+VGPK L + C+ C+ P N C C WP CS
Sbjct: 66 QAKQIIFVEEPLVVGPKWFLTDDEKSASIVPCVACYTPCRIN---KHLCRSCRWPVCSIS 122
Query: 128 CSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
C EC+IL LG + ++ Y+++A+L L+CL+LQR+ P+++ L
Sbjct: 123 CEH------EKLECSILSLGSPPSGKSDARGLHDYYRHDALLVLKCLLLQRQHPERWAAL 176
Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG-----DVLDDTSAKCLHWICGV 235
+M++H +R GTE+++E + IV YL+ FL+++ +L + SA+ LH +CG+
Sbjct: 177 LEMQSHEEER-LGTELHQEAEQRIVSYLQQRFLQRIKAAKNRESLLSEYSAELLHRLCGI 235
Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTHIS 284
IE N + I G G++ C+MEH C PN A +++ + + H+
Sbjct: 236 IETNYMVIELATGVELSGVFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLKKGDHLK 295
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YTN LWGTQ R+ HL +TK+FNC C RC DP+E GTY SAM+CL + PV
Sbjct: 296 ITYTNILWGTQMRQHHLRMTKHFNCRCSRCEDPSEYGTYVSAMRCLGDVSKSCGGIQQPV 355
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
+PLD + W C +C ++A V V + + ++V+KL+ P +E +L +L + HP
Sbjct: 356 DPLDEQTQWKCDTCPMLVDAAYVSEVQTHMTDEVEKLLAGQPTASQVELLLARLSQLLHP 415
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+H+H + +KH+LIQLYG +PG LS L RK
Sbjct: 416 NHFHMFNLKHTLIQLYGHEPGLELATLSDLQLGRK 450
>gi|383853714|ref|XP_003702367.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 484
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/418 (36%), Positives = 237/418 (56%), Gaps = 37/418 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C +C A KCS CK+ YC K HQ++ WK H+ C+ +EI LGR+L A+ DL
Sbjct: 9 CPICNGHATLKCSGCKQQFYCNKDHQRQDWKRHRSVCQA--WEIHEDSELGRHLLASRDL 66
Query: 75 HPGDRIARESPLIVGPKL-ALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
PGD + E+P++ GP L + +C+GC K + + RC C W AC C GL+D
Sbjct: 67 DPGDVVLSEAPVVWGPALHGDDQRLCVGCGKQCK---SSSTRCNSCGWAACKMDCPGLTD 123
Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGP 193
TH EC+ L + + + +L +R L+L R+ K++ L+ +++H RGP
Sbjct: 124 NTTHGLECSYL------IKTAIFPRSDVLLVIRMLLLWRKKSKQWMALEKLQSHDDSRGP 177
Query: 194 GTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLY 251
GT+ YEE ++ ++++ FL + + AK ICG+I+VN ++ R +Y
Sbjct: 178 GTQAYEEAMNMCEHIQ-RFLPAIGAG--KEIVAK----ICGLIDVNALETAPPRGCVAIY 230
Query: 252 SVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHL 301
+ CL+EH+C+ N +HS + K + + H+STMYT+ALW T+ RR HL
Sbjct: 231 NTACLLEHSCVANTRHSFSIDDKGRPKITVKALCSIKKGEHLSTMYTHALWATRARRAHL 290
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TKYF+C C+RC+DPTELG++ ++C + D G +ILP +PLD ++DW C SC
Sbjct: 291 LETKYFSCRCKRCADPTELGSHLGTLRCPH---DNG--FILPKDPLDFETDWACDSCPGV 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
L A +V T +L E VD+++ + P+ +L ++ ++L + HP H C V HSLIQL
Sbjct: 346 LTASEVMQFTGKLEEDVDEVMYQ-PSKNALIDLYSRLTTLLHPSHQQCITVSHSLIQL 402
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
V P L D+ +C+GC K + + RC C W AC C GL+D TH EC+ L
Sbjct: 79 VWGPALHGDDQRLCVGCGKQCK---SSSTRCNSCGWAACKMDCPGLTDNTTHGLECSYL 134
>gi|194743442|ref|XP_001954209.1| GF16861 [Drosophila ananassae]
gi|190627246|gb|EDV42770.1| GF16861 [Drosophila ananassae]
Length = 550
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 168/451 (37%), Positives = 247/451 (54%), Gaps = 37/451 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC A C+ CK V YC +HQK+ W HK +CK P+ + LGR+L AT ++
Sbjct: 7 CPVCGVAATLVCTRCKLVRYCDPEHQKQDWAQHKRRCK--PFRVAEDEQLGRFLVATQNI 64
Query: 75 HPGDRIARESPLIVGPKL--ALAEPI--CLGCHKPLNPNLADNARCPRCFWPACSARCSG 130
E PL+VGPK + E I C+GCH P +C C WP CSA C+
Sbjct: 65 AAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCR---LGKHQCRSCHWPVCSAACAD 121
Query: 131 LSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
A EC++L LG + + ++ +A+L LRCL+LQR++P K+ L +M
Sbjct: 122 ------EALECSVLSLGPTPGARADNRTLNDYFRGDALLVLRCLLLQRQNPAKWTALLEM 175
Query: 185 EAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNG 240
++H +R GTE++ E + +V +L+ FL++L D+L D ++ LH ICG+IE N
Sbjct: 176 QSHEEER-QGTELHVEAEMRVVDFLQQRFLKRLKQTKPDLLADCGSELLHRICGIIETNY 234
Query: 241 VDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTN 289
+ I G GL+ C+MEH C PN ++ R H+ YTN
Sbjct: 235 MVIELPSGVELSGLFRQACMMEHACQPNCYFQFDAFTQQITVRAGCDLQKGDHLRIAYTN 294
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
LWGTQ R+ HL +TK+F+C C RC DPTE G+Y SA+ CL + LPV+PL+
Sbjct: 295 ILWGTQLRQHHLRLTKHFSCHCARCLDPTENGSYVSALACLGDVNLCCTGTHLPVDPLNE 354
Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
++ W C +C ++A V + + + +QV+ L+ P K +E +L +L M HP+H+H
Sbjct: 355 ETQWKCDTCPMLVDAAYVSELQTHMTDQVESLLAGRPTAKEVELLLARLSQMLHPNHFHM 414
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ +KH+LIQLYG +PG LS+ LERK+
Sbjct: 415 FNLKHTLIQLYGNEPGLEMDTLSNGQLERKL 445
Score = 38.5 bits (88), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 17/81 (20%)
Query: 432 SSSLLERKISYVMSPNL------KATDEPI--CLGCHKPLNPNLADNARCPRCFWPACSA 483
+ ++ ++I++V P + + DE I C+GCH P +C C WP CSA
Sbjct: 61 TQNIAAKQIAFVEEPLVVGPKWYHSQDETIVPCVGCHTPCR---LGKHQCRSCHWPVCSA 117
Query: 484 RCSGLSDAHTHAPECAILKLG 504
C+ A EC++L LG
Sbjct: 118 ACAD------EALECSVLSLG 132
>gi|307197684|gb|EFN78851.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 487
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 157/420 (37%), Positives = 228/420 (54%), Gaps = 41/420 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + A KC C+ YC K HQ W HK C +EI P LGR+L A DL
Sbjct: 13 CPVCGKDATLKCGNCRREFYCDKSHQSRDWPRHKSLC--CAWEIGQDPQLGRHLLAARDL 70
Query: 75 HPGDRIARESPLIVGPKL-ALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
PGD I E PL GP + A + +C+GC K +ARC +C WP C C GL+D
Sbjct: 71 APGDVILSEVPLAWGPSIHARDQRLCVGCGKQCA---EIDARCGKCRWPVCRLDCEGLTD 127
Query: 134 AHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKR 191
H PECA LLA+ + + +L +R LIL R K++ L ++++H R
Sbjct: 128 KKRHQPECA--------LLAHIRIIPRCDVLLVIRMLILWRTKSKRWTSLTNLQSHEDSR 179
Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--GRYTQG 249
GPGTE YEE+ SI +YL P + L ICG+I+VN ++ +
Sbjct: 180 GPGTEAYEEVMSISQYLE-------PIVAMVPCCKSVLPKICGLIDVNALETMPPEGSVA 232
Query: 250 LYSVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRRE 299
+Y + L+EH+C+ N +HS + K F+ + H+STMYT+ALW T+ RRE
Sbjct: 233 IYENMSLLEHSCMANTRHSFRLDDKGRPRITVNAITFIKKGEHLSTMYTHALWSTRARRE 292
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
HL TKYF+C CERC+DPTELG + +++C ++ +L +PL+ D++W+C +C
Sbjct: 293 HLLATKYFSCRCERCADPTELGAHLGSLQCPCKNG-----LMLADDPLNPDTEWSCTACP 347
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ + +V ++ +L E+V V +L ++L++L + HP H HC +V HSLIQL
Sbjct: 348 GTVTSAEVAQLSDKLEEEVTG-VMRKATAHTLSDLLSRLTVLLHPGHQHCVSVSHSLIQL 406
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
++ P++ A D+ +C+GC K +ARC +C WP C C GL+D H PEC
Sbjct: 79 EVPLAWGPSIHARDQRLCVGCGKQCA---EIDARCGKCRWPVCRLDCEGLTDKKRHQPEC 135
Query: 499 AIL 501
A+L
Sbjct: 136 ALL 138
>gi|322799089|gb|EFZ20542.1| hypothetical protein SINV_07869 [Solenopsis invicta]
Length = 491
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 159/428 (37%), Positives = 236/428 (55%), Gaps = 37/428 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + A KC CK YC K HQ + W HK C +EI P LG +L AT DL
Sbjct: 10 CPVCGKNATLKCGNCKREFYCDKSHQSQDWPRHKSTCGG--WEIGHDPALGHHLLATRDL 67
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
PGD I E PL+ GP + +C+GC K + + RC RC WP C C GLSD
Sbjct: 68 SPGDVILSEVPLVWGPSTHANQRVCVGCSKRCDDV---DVRCGRCCWPVCRPDCEGLSDK 124
Query: 135 HTHAPECAILKLGCETLLAYNDYKY----EAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H ECA+L C ++ +Y E +L +R LIL R +++ L ++++H+
Sbjct: 125 KRHQQECALL--ACARIVPRYLLRYACRCEVLLIIRILILWRTKSRRWTSLANLQSHVES 182
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ 248
RGPGTE Y+E+ S+ + L L + D D L ICG+++VN ++ +
Sbjct: 183 RGPGTEAYDEVSSVSQRLGPLLL--MASDCKD-----ILPMICGLVDVNALETAPPEGSM 235
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRR 298
+Y L+EH+C+ N +H+ + K + R H+STMYT+ALW T+ RR
Sbjct: 236 AIYENASLLEHSCIANTRHNFRVDDKGRPRITVYAVTSIKRGEHLSTMYTHALWSTRARR 295
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
EHL TKYF C C+RC+DPTELGT+ + C ++ +LP +PL+ ++DW+C +C
Sbjct: 296 EHLLATKYFACRCKRCADPTELGTHLGTLNCPCKNG-----LMLPNDPLNPNADWSCNAC 350
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
S + + +V +T +L E+V ++++E +L ++L++L + HP H HC +V HSL+Q
Sbjct: 351 SGTVTSAEVAQLTDRLEEEVTEVMKEATE-HTLSDLLSRLTVLLHPGHQHCISVSHSLMQ 409
Query: 419 LY-GTQPG 425
L T PG
Sbjct: 410 LLPATDPG 417
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 12/94 (12%)
Query: 408 HCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNL 467
H A+ H L+ PG ++ ++ V P+ A ++ +C+GC K +
Sbjct: 53 HDPALGHHLLATRDLSPG--------DVILSEVPLVWGPSTHA-NQRVCVGCSKRCDDV- 102
Query: 468 ADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+ RC RC WP C C GLSD H ECA+L
Sbjct: 103 --DVRCGRCCWPVCRPDCEGLSDKKRHQQECALL 134
>gi|195431128|ref|XP_002063600.1| GK21998 [Drosophila willistoni]
gi|194159685|gb|EDW74586.1| GK21998 [Drosophila willistoni]
Length = 557
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 173/463 (37%), Positives = 250/463 (53%), Gaps = 49/463 (10%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
+ CA+C A C+ C+ V YC +HQKEHW HK KCK P+ LGRYL AT
Sbjct: 4 LDCAICGVAASRTCNRCRMVRYCDGEHQKEHWPSHKRKCK--PFREVQDEQLGRYLVATH 61
Query: 73 DLHPGDRIARESPLIVGPKLALAEPI-------CLGCHKP--LNPNLADNARCPRCFWPA 123
D+ I E PL+VGPK +E C+GC+ P L +L C C WP
Sbjct: 62 DIEAKQIIFVEEPLVVGPKWYKSEAEKSSVVVPCVGCYTPCRLGKHL-----CHSCRWPV 116
Query: 124 CSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
CSA C S EC+IL LG +T ++ Y+ +A+L L+CL+LQR+ P++
Sbjct: 117 CSAMCQHES------LECSILSLGPGPAARSDTRGLHDYYREDALLVLKCLLLQRQHPER 170
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP--------GDVLDDTSAKC 228
+ L +M++H +R GTE+++E + IV YL FL +L ++ + +
Sbjct: 171 WSALLEMQSHEEER-KGTELHQEAEKRIVNYLHQKFLHRLKQATKQSKESRLIAEYDPEL 229
Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFV 277
LH +CG++E N + I G GL+ C+MEH C PN A +++ +
Sbjct: 230 LHRLCGIVETNYMVIELPTGVELSGLFRQACMMEHACQPNCYFTFDAATQQIAVRAGCNL 289
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ H+ YTN LWGTQ R+ HL +TK+F C C RC DPTE G+Y SAM CL +
Sbjct: 290 KKGDHLKITYTNILWGTQLRQHHLKMTKHFVCRCSRCLDPTEYGSYVSAMACLGDVNKTC 349
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
LPVNPLD ++ W C +C + A V + + + +VDKL+ P+ +E +L +
Sbjct: 350 VGRQLPVNPLDEETQWKCDTCPMVVEAGYVTELQTHMTAEVDKLMSGKPSANEVELLLAR 409
Query: 398 LEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
L M HP+H+H + +KH+LIQLYG + G +QL + LERK+
Sbjct: 410 LSLMLHPNHFHMFNLKHTLIQLYGHEEGLQMSQLGAGQLERKL 452
>gi|195580894|ref|XP_002080269.1| GD10336 [Drosophila simulans]
gi|194192278|gb|EDX05854.1| GD10336 [Drosophila simulans]
Length = 531
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 167/438 (38%), Positives = 244/438 (55%), Gaps = 42/438 (9%)
Query: 32 VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
V YC ++HQK+HW HK +C+ P+ + LGRYL+ T D+ G + E PL+VGPK
Sbjct: 2 VRYCDREHQKQHWPQHKRRCR--PFSEEQDAELGRYLKVTQDIAAGQIVFIEEPLVVGPK 59
Query: 92 LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 143
L++ C+GC+ P +C RC WP CSA C H P EC++
Sbjct: 60 WYLSDADKEASIVPCVGCYTPCR---LGKHQCRRCRWPVCSAGCE-------HEPMECSV 109
Query: 144 LKLGCETLL-----AYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV 197
L LG + + NDY + +A+L L+CL+LQR+SP K+ L +M++H +R GTE+
Sbjct: 110 LSLGSGSSTRADTRSLNDYFRGDALLVLKCLLLQRQSPSKWSALLEMQSHEEER-KGTEL 168
Query: 198 YEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQG 249
+EE + +V YL+ FL +L +L D + LH +CG+IE N + I G G
Sbjct: 169 HEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHRLCGIIETNFMVIELPSGVELSG 228
Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLA 302
L+ C+MEH C PN +++ R H+ YTN LWGTQ R+ HL
Sbjct: 229 LFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLR 288
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
+TK+F+C C RC DPTE GTY SA+ CL + LPV+PLD ++ W C +C +
Sbjct: 289 LTKHFSCRCSRCLDPTEYGTYISALTCLGDVNQTCGGTHLPVDPLDENTQWKCDTCPMLV 348
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
+ V + S + EQV+ L+ P+ +E +L +L M HP+H+H + +KH+LIQLYG
Sbjct: 349 DGAYVAELQSHMTEQVEGLLAGCPSANQVELLLARLTHMLHPNHFHTFNLKHTLIQLYGN 408
Query: 423 QPGYAYTQLSSSLLERKI 440
+ G LS++ LERK+
Sbjct: 409 EAGLELGVLSNTQLERKL 426
>gi|195475636|ref|XP_002090090.1| GE19428 [Drosophila yakuba]
gi|194176191|gb|EDW89802.1| GE19428 [Drosophila yakuba]
Length = 526
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 161/437 (36%), Positives = 244/437 (55%), Gaps = 40/437 (9%)
Query: 32 VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
V YC ++HQK+HW HK +C+ P+ + +LGRYL+AT ++ G + E PL+VGPK
Sbjct: 2 VRYCDREHQKQHWPQHKRRCR--PFSEEQDAVLGRYLKATQEIAAGQIVFIEEPLVVGPK 59
Query: 92 LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 144
++ C+GC P +C RC WP CSA C S EC++L
Sbjct: 60 WYHSDADNEASIVPCVGCCTPCR---LGKHQCRRCCWPVCSAGCEHES------MECSVL 110
Query: 145 KLGCETLL-----AYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
LG + + NDY + +A+L L+CL+LQR+SP K+ L +M++H +R GTE++
Sbjct: 111 SLGTGSPTRVDPRSLNDYFRGDALLVLKCLLLQRQSPTKWLALLEMQSHEDER-RGTELH 169
Query: 199 EEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQGL 250
+E + +V YL++ FL +L +L D + LH +CG+IE N + I G GL
Sbjct: 170 DEAEKRVVSYLQNRFLCRLKQTNPSLLHDCEPEMLHRLCGIIETNFMVIELPTGLELSGL 229
Query: 251 YSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
+ C+MEH C PN H +++ + + H+ YTN LWGTQ R+ HL +
Sbjct: 230 FRQACMMEHACQPNCDFQFDNKTHQVAVRAGCDLRKGDHLQITYTNILWGTQLRQHHLRL 289
Query: 304 TKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLN 363
TK+F+C C RC DPTE GTY SAM CL + L V+PLD ++ W C +C ++
Sbjct: 290 TKHFSCRCSRCLDPTEYGTYISAMACLGDVNQTCGGTHLAVDPLDENTQWKCDTCPILVD 349
Query: 364 ARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQ 423
V + S + +QV+ L+ P+ +E +L +L + HP+H+H + +KH+LIQLYG +
Sbjct: 350 GAYVAELQSHMTKQVEGLLAGCPSANQVELLLARLTHVLHPNHFHTFNLKHTLIQLYGNE 409
Query: 424 PGYAYTQLSSSLLERKI 440
G LS++ L RK+
Sbjct: 410 AGLESCVLSNNQLIRKL 426
>gi|332017349|gb|EGI58093.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 481
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 150/417 (35%), Positives = 231/417 (55%), Gaps = 36/417 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + A KC +C+ YC K HQ + W H+ C +EI +LGR+L AT DL
Sbjct: 10 CPVCGKNATLKCGSCRREFYCDKSHQSQDWPRHRSTCSG--WEIGRDSVLGRHLLATRDL 67
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
PGD I E+PL+ GP + +C+GC + + +ARC C WP C + C GLSD
Sbjct: 68 APGDVILTETPLVWGPSTHTNQRVCVGCSERCDDI---DARCSECCWPVCRSDCEGLSDK 124
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPG 194
H ECA+L A + E +L +R LIL R +++ L +++ H+ RGPG
Sbjct: 125 KRHQLECALLAR------ARIIPRCEVLLIIRLLILWRMKSRRWTSLANLQNHVESRGPG 178
Query: 195 TEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQGLYS 252
TE ++E+ S+ + L L + D D L ICG+I+VN ++ + +Y
Sbjct: 179 TEAHDEVSSVSQRLGPLLL--MASDCKD-----VLPMICGLIDVNALETAPPEGSVAIYE 231
Query: 253 VICLMEHNCLPNAKHSNMMQSK----------LFVFRDTHISTMYTNALWGTQPRREHLA 302
L+EH+C+ N +HS + K + + H+STMYT+ALW T+ RREHL
Sbjct: 232 HASLLEHSCIANTRHSFRIDDKGRPRITIYAVTSIKKGQHLSTMYTHALWSTRARREHLL 291
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
TKYF C C+RC+DPTELGT+ + C ++ +LP +PL+ +++W+C +C +
Sbjct: 292 TTKYFACRCKRCADPTELGTHLGTLNCPCKNG-----LMLPNDPLNPNTNWSCDACPGTI 346
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ +V + +L E+V +++++ +L E+L++L + HP H HC +V HSLIQL
Sbjct: 347 TSAEVAELIDRLEEEVVEVMKQATEC-TLSELLSRLTVLLHPGHQHCISVSHSLIQL 402
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 451 TDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
T++ +C+GC + + +ARC C WP C + C GLSD H ECA+L
Sbjct: 87 TNQRVCVGCSERCDDI---DARCSECCWPVCRSDCEGLSDKKRHQLECALL 134
>gi|242007166|ref|XP_002424413.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212507813|gb|EEB11675.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 540
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 155/464 (33%), Positives = 245/464 (52%), Gaps = 53/464 (11%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC + + C CK + YC K+HQK HW C + Y+++ + LGRYL A+ ++
Sbjct: 5 CAVCGQPSSSICGGCKRLFYCSKEHQKLHWMKEHKGCCRAVYKLEQNDRLGRYLIASENI 64
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
GD I ESPLIVGPK + P+CLGCH N+ + C C +P CS +C +SD
Sbjct: 65 SKGDVIFYESPLIVGPK-TVTVPVCLGCHNVC--NMDNYHECDGCGFPLCSPQCQ-VSD- 119
Query: 135 HTHAPEC-AILKLGCETLL-------------AYNDYKYEAILPLRCLILQ-RRSPKKYQ 179
H EC ++K + + Y I+PLRC++L+ +++ +KY+
Sbjct: 120 -NHVDECYQMMKSNYRIKMDSKKKKNNDDNNGGGKEIFYFPIVPLRCILLKGKKNSEKYK 178
Query: 180 ELKDMEAHMSKR--GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
++ ++E+H R P ++Y++ ++ ++R E+L +DD K + I G+++
Sbjct: 179 KIMNLESHYDDRINKPLFDLYKK--NVADFIR----ERLK---MDDVDEKEILTITGILD 229
Query: 238 VNGVDIGRYT---QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------ISTMY 287
N +I + +GLY+V ++ H+C PN KHS + V T I+ Y
Sbjct: 230 TNAFEIVKNNVKLRGLYNVSSMLSHDCKPNTKHSFLQPDMTIVVTATRDIKSGELITATY 289
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC-----LNEHKDQGDCWIL 342
T W T RREHL K F+C CERCSDP+ELGTY A++C ++ +C+++
Sbjct: 290 TQTFWNTLSRREHLKTIKCFDCVCERCSDPSELGTYAGAIRCSKCTMTSDKNSVENCYVI 349
Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL---- 398
PV+PL+++S W C +C ++ + + + L ++ K V LEE L K
Sbjct: 350 PVDPLNSNSVWKCMNCQREMSHKQIKMGNDNLKNELTK--SNVKTVTGLEEFLIKYGVYE 407
Query: 399 EAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
+ H ++YH VKH+LI+L G GY Y+ +S LL+RKI Y
Sbjct: 408 NGVLHSNNYHVVQVKHALIRLIGNTDGYTYSDISDDLLDRKIIY 451
>gi|195171611|ref|XP_002026597.1| GL11805 [Drosophila persimilis]
gi|194111523|gb|EDW33566.1| GL11805 [Drosophila persimilis]
Length = 528
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 159/424 (37%), Positives = 230/424 (54%), Gaps = 44/424 (10%)
Query: 47 HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-------C 99
HK CK P + +LGR+ AT D+ I E PL+VGPK ++E C
Sbjct: 17 HKRCCK--PIREEKDDVLGRFWVATQDIKANQIIFVEEPLVVGPKWYMSEAEKSSTTVPC 74
Query: 100 LGCHKP--LNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG------CETL 151
+GC+ P L +L C C WP CSA C A EC++L LG +T
Sbjct: 75 VGCYTPCRLGKHL-----CRNCRWPVCSASCE------HEALECSVLSLGPGPSARADTR 123
Query: 152 LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRS 210
+ Y+ +A+L L+CL+LQR+ PKK+ +L +M++H +R GTE+++E D IV YL
Sbjct: 124 GLNDFYRGDALLVLKCLLLQRKDPKKWTDLLEMQSHEEER-KGTELHQEAEDGIVSYLSQ 182
Query: 211 NFLEKL---PGDVLDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLP 263
FL++L DVL+ + LH +CG+IE N + + G GL+ C+MEH C P
Sbjct: 183 RFLQRLEQSKKDVLEHCEPELLHRLCGIIETNYMVVELPTGIELSGLFRQACMMEHACQP 242
Query: 264 N---AKHSNMMQSKLFVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
N MQ ++ D H+ YTN LWGT R+ HL +TK+F C CERC D
Sbjct: 243 NCYFQFDGTTMQIEVRAGGDLKKGDHLRITYTNILWGTHLRQHHLRLTKHFKCGCERCLD 302
Query: 317 PTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGE 376
PTE G+Y SA+ CL + LP++P+D S W C +C ++ V + + + E
Sbjct: 303 PTEFGSYVSALTCLGDVNKSCGGKQLPLDPVDEQSQWKCDACPMAMDTAYVTELQTHMTE 362
Query: 377 QVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLL 436
QVDK++ P+ +E L +L M HP+H+H + +KH+LIQLYG + G + LS +L
Sbjct: 363 QVDKMLAGRPSASEIELTLARLTQMLHPNHFHMFNLKHTLIQLYGNENGLELSSLSDGIL 422
Query: 437 ERKI 440
ERK+
Sbjct: 423 ERKL 426
>gi|195124734|ref|XP_002006842.1| GI21287 [Drosophila mojavensis]
gi|193911910|gb|EDW10777.1| GI21287 [Drosophila mojavensis]
Length = 551
Score = 247 bits (631), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 162/457 (35%), Positives = 241/457 (52%), Gaps = 46/457 (10%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC A C+ CK V YC +HQ++ W HK C+ P+ + LGRYL AT ++
Sbjct: 8 CAVCHVAARLTCTRCKVVRYCNGEHQRQDWPQHKSCCR--PFREQQDEQLGRYLVATHNV 65
Query: 75 HPGDRIARESPLIVGPKLALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSAR 127
I E PL+VGPK L + C+GC+ P D C C WP CS
Sbjct: 66 KAKQIIFVEDPLVVGPKWFLTDEEKSASIVPCVGCYTPCR---VDKHHCRNCRWPVCSVG 122
Query: 128 CSGLSDAHTHAPECAILKLG--------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
C EC++L LG L+ Y Y+++A+L L+CL+LQ++ P +++
Sbjct: 123 CEH------EKLECSVLSLGNAPKTKTDVRALVDY--YRHDALLVLKCLLLQQQHPDRWR 174
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP-----GDVLDDTSAKCLHWIC 233
L +M++H +R GTE+++E + +V YL+ FL+++ + D + LH +C
Sbjct: 175 ALLEMQSHEEER-LGTELHQEAEQRVVGYLQQRFLQRIKQSKQRDSMPSDYEPQLLHRLC 233
Query: 234 GVIEVNGVDI----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKLFVFRDTH 282
G+IE N + I G GL+ C+MEH C PN A +++ + + H
Sbjct: 234 GIIETNYMVIELATGIELSGLFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLKKGDH 293
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
+ YTN LWGTQ R HL +TK+F+C C RC DPTE G+Y SAM+CL + L
Sbjct: 294 LKITYTNILWGTQMRHHHLLMTKHFSCRCARCEDPTEYGSYVSAMRCLGDVSKTCVGVQL 353
Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
PV+PLD + W C +C ++A V V + + EQV+ L+ P +E +L +L +
Sbjct: 354 PVDPLDEKTQWKCDTCPMVVDAAYVTEVQTHMTEQVESLLAGRPTASQVELLLARLSQLL 413
Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
H +H+H + +KH+L+QLYG +PG LS L RK
Sbjct: 414 HTNHFHMFNLKHTLVQLYGHEPGLELATLSDLQLGRK 450
>gi|195440652|ref|XP_002068154.1| GK12548 [Drosophila willistoni]
gi|194164239|gb|EDW79140.1| GK12548 [Drosophila willistoni]
Length = 534
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 153/457 (33%), Positives = 249/457 (54%), Gaps = 50/457 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA C+ A C+ACK V YC ++HQKEHWK HK +CK YE+ S+ +LGR+L+A
Sbjct: 5 LTNCAFCQARASQICAACKSVVYCSREHQKEHWKKGHKRECK--CYEVASNDMLGRHLRA 62
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLN-PNLADNA-RCPRCFWPACSARC 128
T D+ G++I RE+PL+VGPK+A A PICLGCH+ + P + N +C C WP C C
Sbjct: 63 TRDIKIGEQILREAPLVVGPKVASA-PICLGCHRNIGAPGKSMNYYKCSSCTWPLCGKEC 121
Query: 129 SGLSDAHTHAPECAIL-KLGCETLLAYN----------DYKYEAILPLRCLILQRRSPKK 177
+ H EC ++ + ++ + Y+ + Y I+ LRC++L+ P+
Sbjct: 122 E---QSPHHRGECQLMAQSNFQSKINYHPDQDQNQERKESAYCVIMLLRCMLLKTSQPED 178
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICG 234
+ L +E H+ +R T +Y+ LR+N F++ + G L + S + I
Sbjct: 179 FVRLFSLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--LREWSEMEILRIAA 228
Query: 235 VIEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH---SNM---MQSKLFVFRDTHI 283
+++ N ++ R + LY ++ H+C+PN +H NM +K + + +
Sbjct: 229 ILDTNTFEVRQPKERRKVRALYPGAAMISHDCVPNMRHRFDDNMNIIFLAKRAISKGEIL 288
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
+ YT L T RR HL K F+C+C+RCSDP+ELGT+ +A CL + I+P
Sbjct: 289 NISYTQPLRSTIQRRLHLRQAKCFDCACQRCSDPSELGTFAAAQMCLKCKAGK----IIP 344
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
+NPL N S W C C+ + +A+++ + +QL +++ L + P +LE+ + + H
Sbjct: 345 LNPLQNSSPWKCQLCNVKKSAKEIITIDNQLQLEIESLDKSTPT--ALEDFIYRHRIDLH 402
Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H VK++L QLYG PG++ +L+ L+RK+
Sbjct: 403 ETNTHVLQVKYALTQLYGNAPGFSMDELTEDNLKRKV 439
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 144/441 (32%), Positives = 230/441 (52%), Gaps = 43/441 (9%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPG 77
C + A +CS C+ V YC + HQK WK H+ CK Y I+ + +GR+L A DL G
Sbjct: 8 CGKEAHQRCSGCQAVFYCSRDHQKSDWKKHRSHCK--AYRIEQTLTVGRHLIACRDLKAG 65
Query: 78 DRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH 137
D I +E P+++GPK A ICLGC+ ++ RC +C WP C G D H H
Sbjct: 66 DVILQEKPIVMGPKHT-AGQICLGCYSGVD----GRTRCSQCGWPMC-----GRDDCHAH 115
Query: 138 ----APECAILKLGCETLLAYNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMS-KR 191
A EC ++ G ++ + Y++++ LRCL L+ ++P+++ EL +EAH+ +R
Sbjct: 116 ESDHAAECGVMASGGRPIVGSLPVQAYQSVMVLRCLALRDQNPERWDELMQLEAHVQERR 175
Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD----IGRYT 247
G E ++ + V+++R ++P +++ +CG++ VN + G
Sbjct: 176 QKGMEDVDQA-TAVRFIRETLGLQIPEELILQ--------LCGILMVNSFEQPPMKGNSQ 226
Query: 248 QGL---YSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRR 298
GL YS L+EH+C+ NA K ++++ + + + I+ YT LWGT R+
Sbjct: 227 HGLVAVYSTASLLEHDCVANAIKTFTNKGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQ 286
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
HL+ TK+F C CERC DPTEL T+ S C + Q +L NPLD+ SDW C C
Sbjct: 287 RHLSQTKFFQCVCERCKDPTELDTFTSGFYC-QKCPQQSAGILLTENPLDDASDWICRQC 345
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
AR A + +G+++ L + +V+ E L K + HP+H++ VK +L Q
Sbjct: 346 GARQPANYAAEIIESVGKEIVAL--KKGSVRDCEGFLRKFSKVLHPNHFYLTDVKMALCQ 403
Query: 419 LYGTQPGYAYTQLSSSLLERK 439
+YG G + +++ L K
Sbjct: 404 MYGHLEGQSLIEMNDDTLNAK 424
>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
Length = 529
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 156/456 (34%), Positives = 242/456 (53%), Gaps = 56/456 (12%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
CA C++ A C+ C+ V YC ++HQKEHWK HK +CK YE+ S+ LLGR+L+AT D
Sbjct: 7 CAYCQQRATQLCAGCRSVVYCSREHQKEHWKRGHKRECK--CYEVTSNELLGRHLRATRD 64
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSARCS 129
+ G++I E+PL+VGPK+A A P+CLGCH+ P P + +C C WP CS C
Sbjct: 65 IRMGEQIMCEAPLVVGPKVA-ATPLCLGCHRNLLAPQKPAMT-FYKCSSCSWPMCSKECE 122
Query: 130 GLSDAHTHAPECAILKLG-CETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQEL 181
+ H EC ++ ++ + YN + Y I+ LRC+ L+R +P+ + L
Sbjct: 123 ---KSPFHLDECRLMAASNFQSKINYNPADPEGKESAYCVIMLLRCMQLKRSNPEAFARL 179
Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEV 238
+E H+ +R T +Y+ LR+N F++ + G D + I +++
Sbjct: 180 AALEDHLKERIE-TPLYQ-------VLRANLITFIKTVLGQ--RDWPELEILRIAAILDT 229
Query: 239 NGVDIG-----RYTQGLYSVICLMEHNCLPNAKH---SNM---MQSKLFVFRDTHISTMY 287
N ++ R + ++ ++ H+C+PN +H NM +K + + +S Y
Sbjct: 230 NAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFLAKRPIPKGAILSISY 289
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV--- 344
T L T RR HL K F+C+CERCSDPTELGT+ A C G C + V
Sbjct: 290 TQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTFAGAHLC-------GKCKVGKVISQ 342
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL+N ++W C C+ + +AR+V ++L ++++ L + P LEE + H
Sbjct: 343 NPLENAANWHCQLCNVKKSAREVLTQDARLQQEIESLDKTTP--VHLEEFIQGHRIDLHD 400
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYGT PGYA +LS L+RK+
Sbjct: 401 TNTHILQAKYALTQLYGTAPGYAMDELSEESLKRKV 436
>gi|194873875|ref|XP_001973295.1| GG13432 [Drosophila erecta]
gi|190655078|gb|EDV52321.1| GG13432 [Drosophila erecta]
Length = 530
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 149/456 (32%), Positives = 242/456 (53%), Gaps = 50/456 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +CK +EI ++ +LGR+L A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRAECK--CFEIATNEVLGRHLTA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
T D+ G++I +ESPL++GPK+A A P+CLGCH+ P NP + +C C WP C
Sbjct: 64 TRDIKIGEQILKESPLVLGPKVASA-PLCLGCHRNLLAPENPR-GNYHKCSSCSWPLCGK 121
Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ + P+ +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMQLKAKDPEAF 178
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
L +E H+ +R T +Y+ LR+N F++ + G + D + I +
Sbjct: 179 LRLDTLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWLEMDILRIAAI 228
Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
++ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRRIAKGEILS 288
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL N + W C CS + +A++V ++L ++++ L + P +LEE + + A H
Sbjct: 345 NPLLNSAPWKCQLCSYKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ PG+A +LS L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438
Score = 38.5 bits (88), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 30/63 (47%), Gaps = 10/63 (15%)
Query: 443 VMSPNLKATDEPICLGCHK----PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
V+ P K P+CLGCH+ P NP + +C C WP C C D+ H EC
Sbjct: 80 VLGP--KVASAPLCLGCHRNLLAPENPR-GNYHKCSSCSWPLCGKEC---EDSVHHKAEC 133
Query: 499 AIL 501
++
Sbjct: 134 QLM 136
>gi|195353810|ref|XP_002043396.1| GM16483 [Drosophila sechellia]
gi|194127519|gb|EDW49562.1| GM16483 [Drosophila sechellia]
Length = 486
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 156/438 (35%), Positives = 225/438 (51%), Gaps = 87/438 (19%)
Query: 32 VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
V YC ++HQK+HW HK +C+ P+ + LGRYL+ T D+ G + E PL+VGPK
Sbjct: 2 VRYCDREHQKQHWPQHKRRCR--PFREEQDAELGRYLKVTQDIAAGQIVFIEEPLVVGPK 59
Query: 92 LALAEPI-------CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 143
L++ C+GC+ P + +C RC WP CSA C H P EC++
Sbjct: 60 WYLSDADKEASIVPCVGCYAP---SRLGKHQCRRCRWPVCSAGCE-------HEPMECSV 109
Query: 144 LKLGCET-----LLAYNDY-KYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV 197
L LG + + NDY + +A+L L+CL+LQR+SP K+ L +M++H +R GTE+
Sbjct: 110 LSLGSGSPTRADTRSLNDYFRGDALLVLKCLLLQRQSPTKWSALLEMQSHEEER-KGTEL 168
Query: 198 YEEIDS-IVKYLRSNFLEKLPG---DVLDDTSAKCLHWICGVIEVNGVDI----GRYTQG 249
+EE + +V YL+ FL +L +L D + LH +CG+IE N + I G G
Sbjct: 169 HEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHRLCGIIETNFMVIELPSGVELSG 228
Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLA 302
L+ C+MEH C PN +++ R H+ YTN LWGTQ R+ HL
Sbjct: 229 LFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLRKGDHLRITYTNILWGTQLRQHHLR 288
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
+TK+F+C C RC DPTE GTY SA+ CL GD
Sbjct: 289 LTKHFSCHCNRCLDPTEYGTYISALTCL------GD------------------------ 318
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
V+ L+ P+ +E +L +L M HP+H+H + +KH+LIQLYG
Sbjct: 319 ---------------VEGLLAGCPSANQVELLLARLTHMLHPNHFHTFNLKHTLIQLYGN 363
Query: 423 QPGYAYTQLSSSLLERKI 440
+ G LS++ LERK+
Sbjct: 364 EAGLELGVLSNTQLERKL 381
>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
Length = 532
Score = 228 bits (580), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 144/455 (31%), Positives = 237/455 (52%), Gaps = 48/455 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK HK +CK YE+ ++ LGR+L+A
Sbjct: 5 LTSCALCQAKASQLCAACRSVVYCSREHQKEHWKQGHKRECK--CYEVATNETLGRHLRA 62
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
T D+ G+ I RE+PL++GPK+A A PICLGCH+ L + +C C WP C
Sbjct: 63 TRDVKIGEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCSWPLCGRE 121
Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
C ++ H EC ++ ++ + Y + Y I+ LRC+ L+ P+ +
Sbjct: 122 CE---ESSHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQLKASDPEGFA 178
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
L +E H+ +R T +Y+ LR+N F++ + G + D S + I ++
Sbjct: 179 RLSALEDHLEER-LATPLYQ-------VLRANLITFIKTILG--MRDWSEVDILRIAAIL 228
Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
+ N ++ R + L+ ++ H+C PN +H + + +K + + ++
Sbjct: 229 DTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILTI 288
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YT L T RR HL K F+C+C+RC DPTELGT+ A C+ + I+ VN
Sbjct: 289 SYTQPLRSTIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTCVKCKAGK----IISVN 344
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
PL N ++W C C+ + +A++V L ++L ++++ L + P E+ + + H
Sbjct: 345 PLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTTP--VDFEDFIYRHRVELHET 402
Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG PG+ +LS L RK+
Sbjct: 403 NTHVLQAKYALTQLYGNAPGFTMDELSEESLSRKV 437
>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
Length = 544
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 153/463 (33%), Positives = 230/463 (49%), Gaps = 50/463 (10%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC KCS C+ V YC + HQK HWKLHK C +P +IK P +GRYL+AT +
Sbjct: 7 CEVCLTPTEQKCSGCQIVHYCSRDHQKRHWKLHKLHC--VPAKIKELPGIGRYLEATRTI 64
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
GD I +E+PLI GP + P+CLGC+ L RC +C WP CS +C+ S
Sbjct: 65 KAGDVIMKEAPLITGPA-QITPPVCLGCYVLLEE--GKTTRCDKCGWPFCSEKCTHSS-- 119
Query: 135 HTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
H PEC + E + + + Y Y+ + LRCL + + K + +L+ +E H
Sbjct: 120 -VHEPECFYTQQRGEKV-SISTYGIPHPNYQCVTVLRCLYQRDHNMKLWNKLQALETHTE 177
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
R GT+ +E +V +F KL ++ KC CG+I++NG ++
Sbjct: 178 DR-KGTDKWENDRKMVAQFIWSFF-KLERLFNEEEIMKC----CGIIQINGHEVPLLEPE 231
Query: 249 --GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
++ I ++EHNC N S ++ + + + R +HIS YT+ LWGT+ RR H
Sbjct: 232 YVAVFDKISMVEHNCRANCNKSFTSNGEIILSAGVAIPRGSHISVCYTDPLWGTEARRHH 291
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ---GDCWILPV-----------NP 346
L+ +K+F CSCERCSD TE GT FSA+KC ++ D +ILP+
Sbjct: 292 LSDSKFFECSCERCSDVTEFGTMFSAVKCKKKNCKGYLLPDTFILPILHKTISPDPATRG 351
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
LDN W C C ++ + + +G ++ + + NP+ + E + HP H
Sbjct: 352 LDNKF-WKCKECKDEVSDEIIQQLLQDIGRELSIMEKGNPD--ACERFIEHCATYLHPSH 408
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLE----RKISYVMS 445
Y+ +L QL G G A LL+ RKI+ +++
Sbjct: 409 YYMIDAGLALAQLVGQDTGLAVVSDDRLLLKTQLCRKITELLN 451
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 38/94 (40%), Gaps = 7/94 (7%)
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLN 464
H HC K + G T + ++ ++ + P P+CLGC+ L
Sbjct: 39 HKLHCVPAKIKELPGIGRYLEATRTIKAGDVIMKEAPLITGP--AQITPPVCLGCYVLLE 96
Query: 465 PNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
RC +C WP CS +C+ S H PEC
Sbjct: 97 E--GKTTRCDKCGWPFCSEKCTHSS---VHEPEC 125
>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 145/455 (31%), Positives = 236/455 (51%), Gaps = 48/455 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK HK +CK YE+ ++ LGR+L+A
Sbjct: 5 LTSCALCQAKASQLCAACRSVVYCSREHQKEHWKQGHKRECK--CYEVATNETLGRHLRA 62
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
T D+ G+ I RE+PL++GPK+A A PICLGCH+ L + +C C WP C
Sbjct: 63 TRDVKIGEEILREAPLLLGPKVASA-PICLGCHRNLLAPQKQRGNYYKCSSCSWPLCGRE 121
Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
C ++ H EC ++ ++ + Y + Y I+ LRC+ L+ P+ +
Sbjct: 122 CE---ESPHHRAECQLMSGSNFQSKINYTPGEDERKESAYCVIMLLRCMQLKSSDPEGFA 178
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
L +E H+ +R T +Y+ LR+N F++ + G + D S + I ++
Sbjct: 179 RLSALEDHLEER-LATPLYQ-------VLRANLITFIKTILG--MRDWSEVDILRIAAIL 228
Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
+ N ++ R + L+ ++ H+C PN +H + + +K + + +S
Sbjct: 229 DTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFLAKRPIAKGEILSI 288
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YT L T RR HL K F+C+C RC DPTELGT+ A C+ + I+ VN
Sbjct: 289 SYTQPLRSTIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTCVKCKAGK----IISVN 344
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
PL N ++W C C+ + +A++V L ++L ++++ L + P E+ + + H
Sbjct: 345 PLQNTANWKCQLCNLKRSAKEVLLSDAKLQQEIEALDKTTP--VDFEDFIYRHRVELHET 402
Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG PG+ +LS L RK+
Sbjct: 403 NTHVLQAKYALTQLYGNAPGFTMDELSEESLSRKV 437
>gi|332373498|gb|AEE61890.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 109/232 (46%), Positives = 153/232 (65%), Gaps = 12/232 (5%)
Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNAKH---------S 268
VL D S + +H ICG+++VN ++I + + LY LMEHNCL N KH +
Sbjct: 12 VLKDISKEIIHQICGIVDVNALEINQDAEVTALYPTAFLMEHNCLSNTKHMFDNADKGYT 71
Query: 269 NMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
+++ L + + HISTMYT+ALWGTQ RREHL TKYF+C C RC+DPTELGTY SA+K
Sbjct: 72 ITIRASLPIAKGDHISTMYTHALWGTQARREHLRETKYFSCKCLRCADPTELGTYLSALK 131
Query: 329 CLNEHKDQGDCWI-LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
C+ D+ I LP++PLD++++W C C A+L +V + +Q+GE+VD + NP
Sbjct: 132 CIGTQDDEPCGGIQLPIDPLDDNTEWACNKCDAKLTNHEVCFLVNQIGEEVDNVQLSNPT 191
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
V+ L+ +LTK+ HP+HYH Y+VKHSL+QLYG Q GY QLS ++ +K
Sbjct: 192 VRELDSILTKMLNFVHPNHYHVYSVKHSLVQLYGYQQGYMPNQLSDDIIAKK 243
>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
Length = 528
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/407 (34%), Positives = 219/407 (53%), Gaps = 39/407 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+E+K+S ++GRY+ + +L PGD I E+PL++GP + CLGC+K L + +
Sbjct: 39 FEVKTSKVMGRYMVSRKNLKPGDVILSEAPLVIGPCTG-CKVQCLGCYKTLESSFC--TK 95
Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYNDYK-----YEAILPLRCL 168
C C WP CS +C GL H+ ECA+LK L Y++++ + AI+PLRCL
Sbjct: 96 CT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCL 154
Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAK 227
+L+ + YQ L ME H E+ I + + + ++K+ D L + S +
Sbjct: 155 LLKETDARSYQHLMTMETH-------NEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQE 207
Query: 228 CLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT--- 281
+H +CG++EVN +IG+ +GLY LM H+C+PN H + + R +
Sbjct: 208 EIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEESTFRLTVRASTRI 267
Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
I+ Y L T RREHL K+F C C RCSDPTELGT+ SA+ C
Sbjct: 268 EPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTSALIC----PKCK 323
Query: 338 DCWILPVNPLDNDSDWTCGSCSA----RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
D ++L NPLD ++DWTC + + +++A+ + L+ ++ ++VD + + +V ++E
Sbjct: 324 DGFVLSSNPLDPEADWTCSNYNKCPGYKISAKSMQLLLDKITQEVDNI--DCNDVITMET 381
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
L K + HP HY +VK SL QLYG G+ +LS +L RK+
Sbjct: 382 FLKKYRNVLHPTHYINLSVKLSLSQLYGRINGFLIDELSDEVLTRKL 428
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYND 513
CLGC+K L + +C C WP CS +C GL H+ ECA+LK L Y++
Sbjct: 81 CLGCYKTLESSFC--TKCT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDN 137
Query: 514 YKSL 517
++ +
Sbjct: 138 FEEM 141
>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
Length = 529
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/407 (34%), Positives = 219/407 (53%), Gaps = 39/407 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+E+K+S ++GRY+ + +L PGD I E+PL++GP + CLGC+K L + +
Sbjct: 40 FEVKTSKVMGRYMVSRKNLKPGDVILSEAPLVIGPCTG-CKVQCLGCYKTLESSFC--TK 96
Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYNDYK-----YEAILPLRCL 168
C C WP CS +C GL H+ ECA+LK L Y++++ + AI+PLRCL
Sbjct: 97 CT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDNFEEMRPNFNAIVPLRCL 155
Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAK 227
+L+ + YQ L ME H E+ I + + + ++K+ D L + S +
Sbjct: 156 LLKETDARSYQHLMTMETH-------NEIRRNIPELWQTNQRTVVDKIRIDWGLREYSQE 208
Query: 228 CLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT--- 281
+H +CG++EVN +IG+ +GLY LM H+C+PN H + + R +
Sbjct: 209 EIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEESTFRLTVRASTRI 268
Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
I+ Y L T RREHL K+F C C RCSDPTELGT+ SA+ C
Sbjct: 269 EPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTSALIC----PKCK 324
Query: 338 DCWILPVNPLDNDSDWTCGSCSA----RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
D ++L NPLD ++DWTC + + +++A+ + L+ ++ ++VD + + +V ++E
Sbjct: 325 DGFVLSSNPLDPEADWTCSNYNKCPGYKISAKSMQLLLDKITQEVDNI--DCNDVITMET 382
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
L K + HP HY +VK SL QLYG G+ +LS +L RK+
Sbjct: 383 FLKKYRNVLHPTHYINLSVKLSLSQLYGRINGFLIDELSDEVLTRKL 429
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 34/64 (53%), Gaps = 5/64 (7%)
Query: 456 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK-LGCETLLAYND 513
CLGC+K L + +C C WP CS +C GL H+ ECA+LK L Y++
Sbjct: 82 CLGCYKTLESSFC--TKCT-CGWPLCSPKCKGLGKRFGHSQVECAVLKATKSSKFLNYDN 138
Query: 514 YKSL 517
++ +
Sbjct: 139 FEEM 142
>gi|242397545|gb|ACS92862.1| MIP11718p [Drosophila melanogaster]
Length = 514
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 146/456 (32%), Positives = 243/456 (53%), Gaps = 50/456 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +C+ +EI ++ +LGR+L+A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
T D+ G++I +E+PL++GPK+A A P+CLGCH+ P P + +C C WP C
Sbjct: 64 TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPR-GNYHKCSSCSWPLCGK 121
Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ + P +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDKDPDAF 178
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
+L ++E H+ +R T +Y+ LR+N F++ + G + D + I +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228
Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
++ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNILFLAKRKIAKGEILS 288
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL N + W C C+ + +A+DV ++L ++++ L + P +LEE + + A H
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKDVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ PG+A +LS L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSGESLNRKL 438
>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
Length = 530
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 146/456 (32%), Positives = 243/456 (53%), Gaps = 50/456 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +C+ +EI ++ +LGR+L+A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
T D+ G++I +E+PL++GPK+A A P+CLGCH+ P P + +C C WP C
Sbjct: 64 TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPGKPR-GNYHKCSSCSWPLCGK 121
Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ + P +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMHLKDKDPDAF 178
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
+L ++E H+ +R T +Y+ LR+N F++ + G + D + I +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228
Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
++ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL N + W C C+ + +A+DV ++L ++++ L + P +LEE + + A H
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKDVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ PG+A +LS L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSGESLNRKL 438
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 155/450 (34%), Positives = 226/450 (50%), Gaps = 51/450 (11%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CA C+ + + CS C+ V YC ++HQK W HK C P++IK S GRYL A+ +
Sbjct: 1 CAKCQLPSTNCCSKCRRVYYCSREHQKADWTNHKTLCGP-PFKIKESEAEGRYLVASRLI 59
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
G+ I +E PL+VGPKL P+CLGC+K ++ D RC RC WP CSA C S
Sbjct: 60 KAGEVILQELPLVVGPKLNTL-PLCLGCYK----SITDTYRCSRCNWPLCSAACEE-SAL 113
Query: 135 HTHAPECAILKLGCETLL-------AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH 187
H + EC ++ TL+ A N ++ I PLR L L + L ++ +H
Sbjct: 114 HKNG-ECRMID---PTLMTNHLSQGAINSQVFQCITPLRYLTL---PDSDRERLDELVSH 166
Query: 188 MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--- 243
+ +R G ++Y ++ +I +LR L L ++ + +CG++E N +I
Sbjct: 167 LEQRR-GMDIYRLVEQNISSFLRYRLL-------LTQYDSESIQRVCGILETNCFEIRIQ 218
Query: 244 GRYT-QGLYSVICLMEHNCLPNAKHS---NMMQSKLFVFRDT----HISTMYTNALWGTQ 295
GR + +GLY LM H+C+ N +H + ++ +D IS YT +LW T
Sbjct: 219 GRVSVRGLYPTASLMNHDCVANTRHVFDPADFRIRILATKDIPAGDKISATYTRSLWNTL 278
Query: 296 PRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
RR HL TK+F C C RC+DP ELGT SA+KC +L NPL S+W C
Sbjct: 279 DRRLHLKSTKHFWCQCSRCADPRELGTLLSAVKCTG-----CGGAVLSQNPLYQMSNWEC 333
Query: 356 GSCSARL---NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
C + + VH +Q+ +L + P + LE+ + K HP H
Sbjct: 334 SDCGSVQKVEQVKRVHDSARMELKQIAQLARNRPEL--LEDFIRKYSGAIHPDSCHVIEA 391
Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
K++L+QLYG P Y LS L+ERK+ Y
Sbjct: 392 KYALVQLYGNTPQLLYHDLSEELIERKLDY 421
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
+ ++ +++ V+ P L P+CLGC+K ++ D RC RC WP CSA C
Sbjct: 61 AGEVILQELPLVVGPKLNTL--PLCLGCYK----SITDTYRCSRCNWPLCSAAC 108
>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
Length = 530
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 145/456 (31%), Positives = 242/456 (53%), Gaps = 50/456 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +C+ +EI ++ +LGR+L+A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
T D+ G++I +E+PL++GPK+A A P+CLGCH+ P P + +C C WP C
Sbjct: 64 TRDIKIGEQILKEAPLVLGPKVASA-PLCLGCHRNLLAPEKPR-GNYHKCSSCSWPLCGK 121
Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ P +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVAGEEERKESAYCVIMLLRCMQLKAMDPDAF 178
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
+L ++E H+ +R T +Y+ LR+N F++ + G + D + I +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--MKDWPEMDILRIAAI 228
Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
++ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+ +
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISL 344
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL N + W C C+ + +A++V ++L ++++ L + P +LEE + + A H
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ PG+A +LS L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438
>gi|195496367|ref|XP_002095664.1| GE22530 [Drosophila yakuba]
gi|194181765|gb|EDW95376.1| GE22530 [Drosophila yakuba]
Length = 530
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/455 (31%), Positives = 242/455 (53%), Gaps = 48/455 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +CK +EI ++ +LGR+L+A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCSREHQKEHWKKGHRSECK--CFEIATNEVLGRHLKA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
T D+ G++I +ESPL++GPK+A A P+CLGCH+ L + + +C C WP C
Sbjct: 64 TRDIKIGEQILKESPLVLGPKVASA-PLCLGCHRNLLAPEKSRGNYHKCSSCSWPLCGKE 122
Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKYQ 179
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ + P+ +
Sbjct: 123 CE---DSVHHKAECQLMSGSNFQSKINYVPGEDERKESAYCVIMLLRCMQLKAKDPEAFL 179
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEK--LPGDVLDDTSAKCLHWICGVI 236
+L +E H+ +R T +Y+ + + ++ ++++ K L D+L I ++
Sbjct: 180 KLYTLEDHLKERLE-TPLYQVLRANLITFIKTVLGMKNWLEMDILR---------IAAIL 229
Query: 237 EVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
+ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 230 DTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILSI 289
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+ +N
Sbjct: 290 SYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISLN 345
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
PL N + W C C+ + +A++V ++L ++++ L + P +LEE + + A H
Sbjct: 346 PLLNSAPWKCQLCNYKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHET 403
Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ P + +LS L RK+
Sbjct: 404 NTHVLQAKYALTQLYGSAPRFGMEELSEESLNRKV 438
>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
Length = 468
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 146/408 (35%), Positives = 220/408 (53%), Gaps = 42/408 (10%)
Query: 53 KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
K P+ +K +LGR + AT D+ G+ IA PLIVGPK+A + P+CLGCHK L LA+
Sbjct: 8 KAPFMVKRDEVLGRCVVATRDIRSGEVIAESLPLIVGPKMA-SPPLCLGCHKKLA--LAE 64
Query: 113 NA-RCPRCFWPACSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYK---YEAILPLRC 167
+ C +CFWP C C + H EC I K G + + ++ K Y I PLR
Sbjct: 65 SRYDCSKCFWPLCGQACES---SPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRA 121
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSA 226
L+L++ SP+++ L D ++H+ T+VY+ + S+V + L KL DT+
Sbjct: 122 LLLKKNSPERFNLLLDFQSHLQDH-EKTQVYQVLKKSLVPFFTQ--LLKL------DTNE 172
Query: 227 KCLHWICGVIEVNGVDIGRYTQ------GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
+ IC + + N ++ R TQ GLY I + H+C N KH + V
Sbjct: 173 SEILTICSIFDTNCFEV-RDTQRLVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRLVLTA 231
Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
T ++T YT LWGT RR HL + K+F+C CERC+DPTE GTY SA+ C
Sbjct: 232 TTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTEFGTYLSAVNC---- 287
Query: 334 KDQGD-CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
GD + +PL+ ++DW C +C + D+ L ++++ L ++NPN SLE
Sbjct: 288 SVCGDGSKVTSSDPLNCEADWFCENCGFAITGSDMLWGNEVLQKEINHLNKDNPN--SLE 345
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E L + + H + +C VK++L Q+YG+ G+ + +LSS L++RKI
Sbjct: 346 EFLERYAEVLHGRNSYCLQVKYALCQIYGSLDGFKFEELSSDLIDRKI 393
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 8/71 (11%)
Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 490
S ++ + ++ P K P+CLGCHK L LA++ C +CFWP C C
Sbjct: 31 SGEVIAESLPLIVGP--KMASPPLCLGCHKKLA--LAESRYDCSKCFWPLCGQACES--- 83
Query: 491 AHTHAPECAIL 501
+ H EC I
Sbjct: 84 SPLHEAECEIF 94
>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
Length = 498
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 146/408 (35%), Positives = 220/408 (53%), Gaps = 42/408 (10%)
Query: 53 KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
K P+ +K +LGR + AT D+ G+ IA PLIVGPK+A + P+CLGCHK L LA+
Sbjct: 8 KAPFMVKRDEVLGRCVVATRDIRSGEVIAESLPLIVGPKMA-SPPLCLGCHKKLA--LAE 64
Query: 113 NA-RCPRCFWPACSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYK---YEAILPLRC 167
+ C +CFWP C C + H EC I K G + + ++ K Y I PLR
Sbjct: 65 SRYDCSKCFWPLCGQACES---SPLHEAECEIFAKAGYKPTVKNDNSKQTVYCPIAPLRA 121
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSA 226
L+L++ SP+++ L D ++H+ T+VY+ + S+V + L KL DT+
Sbjct: 122 LLLKKNSPERFNLLLDFQSHLQDH-EKTQVYQVLKKSLVPFFTQ--LLKL------DTNE 172
Query: 227 KCLHWICGVIEVNGVDIGRYTQ------GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
+ IC + + N ++ R TQ GLY I + H+C N KH + V
Sbjct: 173 SEILTICSIFDTNCFEV-RDTQRLVNIRGLYPTISFLSHSCKHNTKHCFNGDNFRLVLTA 231
Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
T ++T YT LWGT RR HL + K+F+C CERC+DPTE GTY SA+ C
Sbjct: 232 TTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTEFGTYLSAVNC---- 287
Query: 334 KDQGD-CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
GD + +PL+ ++DW C +C + D+ L ++++ L ++NPN SLE
Sbjct: 288 SVCGDGSKVTSSDPLNCEADWFCENCGFAITGSDMLWGNEVLQKEINHLNKDNPN--SLE 345
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E L + + H + +C VK++L Q+YG+ G+ + +LSS L++RKI
Sbjct: 346 EFLERYAEVLHGRNSYCLQVKYALCQIYGSLDGFKFEELSSDLIDRKI 393
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 34/71 (47%), Gaps = 8/71 (11%)
Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNA-RCPRCFWPACSARCSGLSD 490
S ++ + ++ P K P+CLGCHK L LA++ C +CFWP C C
Sbjct: 31 SGEVIAESLPLIVGP--KMASPPLCLGCHKKLA--LAESRYDCSKCFWPLCGQACES--- 83
Query: 491 AHTHAPECAIL 501
+ H EC I
Sbjct: 84 SPLHEAECEIF 94
>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
Length = 530
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 145/456 (31%), Positives = 240/456 (52%), Gaps = 50/456 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK H+ +C+ +EI ++ +LGR+L+A
Sbjct: 6 LTSCALCQAKASQLCAACRNVVYCCREHQKEHWKKGHRSECQ--CFEIATNEVLGRHLRA 63
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSA 126
T D+ G++I +E+PL++GPK+A A P+CL CH+ P P + +C C WP C
Sbjct: 64 TRDIKIGEQILKEAPLVLGPKVASA-PLCLSCHRNLLAPEKPR-GNYHKCSSCSWPLCGK 121
Query: 127 RCSGLSDAHTHAPECAILK-LGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKY 178
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ PK +
Sbjct: 122 ECE---DSVHHKAECQLMSGSNFQSKINYVPGEEERKESAYCVIMLLRCMQLKAIDPKAF 178
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGV 235
+L ++E H+ +R T +Y+ LR+N F+ + G + D + I +
Sbjct: 179 LKLYNLEDHLKERLE-TPLYQ-------VLRANLITFINTVLG--MKDWPEMDILRIAAI 228
Query: 236 IEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIS 284
++ N ++ R + LY ++ H+C+PN +H + + +K + + +S
Sbjct: 229 LDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFLAKRKIAKGEILS 288
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR HL K F+CSC RC DP ELG++ A CL + I+
Sbjct: 289 ISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTCLKCKAGK----IISQ 344
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
NPL N + W C C+ + +A++V ++L ++++ L + P +LEE + + A H
Sbjct: 345 NPLLNSAPWKCQLCNFKRSAKEVVTSDAELQQELESLDKTTP--VALEEFIYRHRADLHE 402
Query: 405 HHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L QLYG+ PG+A +LS L RK+
Sbjct: 403 TNTHILQAKYALTQLYGSAPGFAMEELSEESLNRKV 438
>gi|195377589|ref|XP_002047571.1| GJ13517 [Drosophila virilis]
gi|194154729|gb|EDW69913.1| GJ13517 [Drosophila virilis]
Length = 528
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 151/454 (33%), Positives = 239/454 (52%), Gaps = 52/454 (11%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
CA C+ A C+ C+ V YC ++HQKEHWK HK +CK +E+ S+ LGR+L+AT D
Sbjct: 7 CAFCQSHATQLCAGCRSVVYCSREHQKEHWKRGHKRECK--CFELSSNETLGRHLRATRD 64
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNA-RCPRCFWPACSARCSG 130
+ G++I RE+PL+VGPK+A + P+CLGCHK L P A N +C C WP CSA C
Sbjct: 65 IKMGEQIMREAPLVVGPKVA-SVPLCLGCHKNLLPPAKPAQNYYKCSACTWPLCSAECEQ 123
Query: 131 ----LSDAHTHAP---ECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
L++ A EC I E + + Y I+ LRC+ L+ SP + L +
Sbjct: 124 SPYHLAECRLMAGSNFECKINYKAAEQ--EHKESAYCVIMLLRCVQLKLESPAAFARLYE 181
Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEVNG 240
+E H+ +R T +Y+ LR+N F++ + G L++ S + I +++ N
Sbjct: 182 LEDHLKERLE-TPLYQ-------VLRANLITFIKTVLG--LNNCSELEILRIAAILDTNA 231
Query: 241 VDIGRY-----TQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTN 289
++ + +GLY + H+C+PN +H + + +K + + +S YT
Sbjct: 232 FELRQLGGCVKVRGLYPGAAMFAHDCVPNMRHRFDDDMNIVFLAKRPIAKGEVLSISYTQ 291
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC---WILPVNP 346
L T RR HL K F+C+C RC DPTELGT+ A C G C I+ ++P
Sbjct: 292 PLRSTIQRRVHLKQVKCFDCACARCEDPTELGTFAGAHVC-------GKCKVGKIISMDP 344
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
L ++W C C+ + +A++ +++ ++++ L + P SLE+ L + H +
Sbjct: 345 LQLAANWKCEVCNLKRSAKEFLTQDAKIEQELESLDKTTP--LSLEDFLYRHRVELHETN 402
Query: 407 YHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
H K++L QLYG G+A +LS L+RK+
Sbjct: 403 THILQAKYALTQLYGHATGFAMDELSEKSLKRKV 436
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 8/67 (11%)
Query: 438 RKISYVMSPNLKATDEPICLGCHKPLNP--NLADN-ARCPRCFWPACSARCSGLSDAHTH 494
R+ V+ P K P+CLGCHK L P A N +C C WP CSA C + H
Sbjct: 73 REAPLVVGP--KVASVPLCLGCHKNLLPPAKPAQNYYKCSACTWPLCSAEC---EQSPYH 127
Query: 495 APECAIL 501
EC ++
Sbjct: 128 LAECRLM 134
>gi|194751716|ref|XP_001958171.1| GF10789 [Drosophila ananassae]
gi|190625453|gb|EDV40977.1| GF10789 [Drosophila ananassae]
Length = 438
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/442 (31%), Positives = 233/442 (52%), Gaps = 48/442 (10%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQA 70
L CA+C+ A C+AC+ V YC ++HQKEHWK HK +CK YE+ ++ LGR+L+A
Sbjct: 7 LTSCALCQARASQLCAACRNVVYCSREHQKEHWKKGHKAECK--CYEVATNETLGRHLRA 64
Query: 71 TLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---NPNLADNARCPRCFWPACSAR 127
T D+ G++I +E+PLI+GPK+A A P+CLGCH+ L + +C C WP C +
Sbjct: 65 TRDIKIGEQILKEAPLILGPKVASA-PLCLGCHRNLLAPEKRKGNYYKCSSCSWPLCGKQ 123
Query: 128 CSGLSDAHTHAPECAIL-KLGCETLLAY-------NDYKYEAILPLRCLILQRRSPKKYQ 179
C D+ H EC ++ ++ + Y + Y I+ LRC+ L+ + P+ +
Sbjct: 124 C---EDSPHHKAECQLMSSSNFQSKINYVPGEAERKESAYCVIMLLRCMQLKTKDPESFA 180
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVI 236
+L +E H+ +R T +Y+ LR+N F++ + G L D S + I ++
Sbjct: 181 KLYTLEDHLKER-LDTPLYQ-------VLRANLITFIKTVLG--LKDWSELDILRIAAIL 230
Query: 237 EVNGVDIGR-----YTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHIST 285
+ N ++ + + L+ ++ H+C+PN +H + + +K + + +S
Sbjct: 231 DTNTFEVRQPRERIKIRALFPGAAMISHDCVPNMRHRFDDDMNIIFLAKRKISKGEILSI 290
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YT L T RR HL K F+CSC RC DP ELG++ A C+ + I+ +N
Sbjct: 291 SYTQPLRSTIQRRLHLRQAKCFDCSCARCQDPEELGSFAGAQNCVKCKAGK----IVSMN 346
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
PL N ++W C C+ + A+DV +QL ++++ L + P +LE+ + + H
Sbjct: 347 PLQNSANWKCQICNLKRTAKDVLTSDAQLQQELETLDKSTP--VALEDFIYRHRVELHET 404
Query: 406 HYHCYAVKHSLIQLYGTQPGYA 427
+ H K++L QLYG PG+
Sbjct: 405 NTHILQAKYALTQLYGNAPGFG 426
>gi|189239848|ref|XP_973542.2| PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
Length = 829
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 217/425 (51%), Gaps = 31/425 (7%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
RC VC + ALHKC C YC K+HQ+EHWK HK C+ P++I +LGR+L AT
Sbjct: 6 RCEVCEKPALHKCGKCHNAHYCDKKHQREHWKQHKSVCQ--PFKICEDAVLGRHLVATRA 63
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
L PG+ + +E+PLI GP + P+CLGC K ++ + C +C WP CS C
Sbjct: 64 LKPGEVVIQEAPLIWGPSYN-SIPVCLGCGKAVDETCS--RPCQKCGWPVCSDLCE---K 117
Query: 134 AHTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
+ +H PEC E + Y+ I LRCL ++ PK +++L+ ++ H+
Sbjct: 118 SPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLE 177
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---Y 246
+ T + +I +++R F KL ++ K +H + +VNG ++ +
Sbjct: 178 EHKSTTNYESDRVTIAEFIRRFF--KLSATFSEEDIMK-VHGSTLIFQVNGHEVPLTEPF 234
Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREH 300
+YS ++EH+C PN S Q + + H+S Y++ LWGT RR
Sbjct: 235 HVAIYSSASMLEHSCGPNCTKSFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYF 294
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP--VNPLDNDSDWTCGSC 358
L TK+F C CERC DP+E GT FSA+KC K G ++LP L DW C C
Sbjct: 295 LHETKFFWCHCERCEDPSEFGTNFSAIKC--STKSCGG-YLLPATFTELQKLPDWVCDKC 351
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
++ VH + ++G + L + +++ ++ L + + H +HY+C VK++L Q
Sbjct: 352 DKTMSFYSVHDILERIGRDLFDL--DKSSIEDCKDFLNTHQDLLHENHYYCTEVKYTLSQ 409
Query: 419 LYGTQ 423
L G +
Sbjct: 410 LIGQR 414
>gi|170056265|ref|XP_001863952.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876021|gb|EDS39404.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 526
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 140/403 (34%), Positives = 210/403 (52%), Gaps = 34/403 (8%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
PY ++ S + GRYL A+ +L G+ I + PL VGP A ++P+CLGCH+ A
Sbjct: 46 PYVVRHSDVWGRYLIASRNLKAGEVIIQVEPLAVGP-WAESDPVCLGCHRTFEVG-AKTV 103
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKL-GCETLL------AYNDYKYEAILPLRC 167
RCP C W CS C+GL+ H+ EC LK G LL YEAIL LRC
Sbjct: 104 RCPTCNWRICSPTCTGLTTHHSRL-ECIPLKEHGVAKLLETCSSATQIKLAYEAILALRC 162
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEE-IDSIVKYLRSNFLEKLPGDVLDDTSA 226
L+L+R +P +Y++L +M+A +R T +++ ++IV+ +R + ++ S
Sbjct: 163 LLLKRTAPDRYEKLLEMDAMNERRQQITSLWKRNQEAIVQRIRDEW-------KFEEFSE 215
Query: 227 KCLHWICGVIEVNGVDIG---RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
+H +CG+IEVN IG + + LY + H+C PN H++ S R T
Sbjct: 216 AEVHTVCGIIEVNAFQIGPTEVHARALYPEAFYIMHDCTPNTTHTDQPGSHELTVRPTRD 275
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
++ Y+ L GT RR+HL +K+F C C RC DPTE GT+ S +KC K
Sbjct: 276 LKAGEPLTLSYSYTLQGTLKRRQHLCDSKFFWCQCPRCEDPTEFGTHCSTLKCAKCPKG- 334
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
+L NPLD D+ W C +C + A+ V + + L ++++ + NV+ E L
Sbjct: 335 ---LVLSSNPLDQDAPWKCKNCPHTIPAQSVTQLLNGLYKELESI--GGNNVELFENFLL 389
Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
K E + H +HY + KHSL +LYG P Y +L+ + RK
Sbjct: 390 KYEPILHKNHYLFLSAKHSLCELYGKAPNYLMHELTLDQINRK 432
Score = 46.6 bits (109), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 23/53 (43%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
A +P+CLGCH+ A RCP C W CS C+GL+ H+ EC LK
Sbjct: 83 AESDPVCLGCHRTFEVG-AKTVRCPTCNWRICSPTCTGLTTHHSRL-ECIPLK 133
>gi|321449930|gb|EFX62153.1| hypothetical protein DAPPUDRAFT_120473 [Daphnia pulex]
Length = 510
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 149/443 (33%), Positives = 228/443 (51%), Gaps = 54/443 (12%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVCR A C C E++YC K+HQKEHW +HK CK PY+I GR + A+ +L
Sbjct: 10 CAVCRAEASQICGGCGEISYCSKEHQKEHWLVHKSLCK--PYKIVHDEKFGRCIFASKNL 67
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLN-PNLADNARCPRCFWPACSARCSGLSD 133
PG+ I E+ +I GPK P CL C+ L+ A RCP C +P C +C+
Sbjct: 68 KPGEIIFGETAVITGPKQGCT-PCCLKCYASLDRVQEASLFRCPNCNFPFCQEQCA---K 123
Query: 134 AHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH--MSKR 191
+ H EC IL ++ + ND + R P E K +E H + ++
Sbjct: 124 SPEHEAECLILSRA-KSCIVIND-------------VHRIHP----EYKMLENHNDLRRQ 165
Query: 192 GPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI----GRYT 247
+Y+ ++V++LR K+ G + D+ S + +H CGVI+VN +I +Y
Sbjct: 166 SDMWRIYQV--NVVQFLR-----KICG-LADEFSEEEIHASCGVIDVNAFEIRLAGNQYQ 217
Query: 248 Q--GLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
Q G++ + +M HNC+ N +H + +++ + + + I T YT L GT+ RR+
Sbjct: 218 QVLGVFPLASMMSHNCVANTQHVIDANYTMTVRASVPIMKGEQIFTSYTLPLEGTKERRD 277
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN-DSDWTCGSC 358
L +K F C C RCSDPTE TY SA++C N + +LP PLD ++DW C C
Sbjct: 278 VLRHSKLFECDCSRCSDPTECSTYLSALRCPN----CSNGVVLPERPLDEPETDWQCSQC 333
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
+L A V V +L E+++ + E V+ E+ L++ ++ HP+H+ + SL
Sbjct: 334 PYKLTAAVVTQVIDKLKEELETI--EPDQVEKFEDFLSRHASLLHPNHFLLTRARQSLSI 391
Query: 419 LYGTQPGYAYTQLSSSLLERKIS 441
LYG + L+S LERK+S
Sbjct: 392 LYGRNERFLLNTLTSQQLERKVS 414
>gi|270011901|gb|EFA08349.1| hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
Length = 803
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 139/425 (32%), Positives = 215/425 (50%), Gaps = 34/425 (8%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
RC VC + ALHKC C YC K+HQ+EHWK HK C+ P++I +LGR+L AT
Sbjct: 6 RCEVCEKPALHKCGKCHNAHYCDKKHQREHWKQHKSVCQ--PFKICEDAVLGRHLVATRA 63
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
L PG+ + +E+PLI GP + P+CLGC K ++ + C +C WP CS C
Sbjct: 64 LKPGEVVIQEAPLIWGPSYN-SIPVCLGCGKAVDETCS--RPCQKCGWPVCSDLCE---K 117
Query: 134 AHTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
+ +H PEC E + Y+ I LRCL ++ PK +++L+ ++ H+
Sbjct: 118 SPSHIPECRYTSQRGEKITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLE 177
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI---GRY 246
+ T + +I +++R F KL ++ K + G VNG ++ +
Sbjct: 178 EHKSTTNYESDRVTIAEFIRRFF--KLSATFSEEDIMK----VHGSTLVNGHEVPLTEPF 231
Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREH 300
+YS ++EH+C PN S Q + + H+S Y++ LWGT RR
Sbjct: 232 HVAIYSSASMLEHSCGPNCTKSFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYF 291
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP--VNPLDNDSDWTCGSC 358
L TK+F C CERC DP+E GT FSA+KC K G ++LP L DW C C
Sbjct: 292 LHETKFFWCHCERCEDPSEFGTNFSAIKC--STKSCGG-YLLPATFTELQKLPDWVCDKC 348
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
++ VH + ++G + L + +++ ++ L + + H +HY+C VK++L Q
Sbjct: 349 DKTMSFYSVHDILERIGRDLFDL--DKSSIEDCKDFLNTHQDLLHENHYYCTEVKYTLSQ 406
Query: 419 LYGTQ 423
L G +
Sbjct: 407 LIGQR 411
>gi|357603991|gb|EHJ63998.1| hypothetical protein KGM_04190 [Danaus plexippus]
Length = 564
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 141/408 (34%), Positives = 211/408 (51%), Gaps = 49/408 (12%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
YEIK S ++GRY+ AT D+ PG+ I L+VGP ICLGC++ L + +
Sbjct: 81 YEIKHSDVMGRYIVATRDIEPGEVIITAPALVVGPCSGCG-LICLGCYRELEE--SHLTK 137
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYK--------YEAILPLR 166
C C WP CS C GL H+ K CETL DYK Y AI PLR
Sbjct: 138 CSGCKWPLCSNSCFGLGKYTGHS------KYECETLKKIPPDYKNMEDLRDSYHAITPLR 191
Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP--GDVLDDT 224
CL+L++ P K+ L +ME+H + R ++Y D N +++L G DD
Sbjct: 192 CLLLKKEDPSKWCALSEMESHNAIRRSRGDIYPMNDR-------NVVQRLKKWGLGYDDD 244
Query: 225 SAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN---------MMQ 272
+H +CG++EVN ++G + LYS L+ H+C P+ H++ ++
Sbjct: 245 E---VHTVCGILEVNAFEVGASGANARALYSEAYLLAHDCTPSTSHTDHERRPGRPLTIR 301
Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
+ + IS Y L GT RREHLA++K+F CSC RC+DPTE GT S+++C
Sbjct: 302 AAIRHVPGDVISLCYAYTLQGTLKRREHLAVSKFFMCSCVRCADPTERGTQASSLRC--- 358
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
+ +L +PL +D+ W C C+ ++A V L+ +L ++ D++ + +V+ E
Sbjct: 359 --PRCAGSVLSTDPLRSDALWRCAECTYEMSAATVKLLLKRLTDEFDQI--DANDVQGFE 414
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
LTK + P+HY + KHSL QLYG Y +++ LL+RKI
Sbjct: 415 NYLTKYRNVLQPNHYLNLSAKHSLSQLYGKVADYMIHEMTEELLQRKI 462
>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
Length = 527
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 142/457 (31%), Positives = 237/457 (51%), Gaps = 56/457 (12%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
+CA C+ A C+AC+ V YC ++HQKEHWK HK +CK YE+ S+ LLGR+L+AT
Sbjct: 6 KCAYCQAHATQLCAACRNVVYCSREHQKEHWKRGHKLECK--CYELASNELLGRHLRATR 63
Query: 73 DLHPGDRIARESPLIVGPKLALAEPICLGCHK----PLNPNLADNARCPRCFWPACSARC 128
D+ G+ I RE+PL++GPK+A + PICLGCH+ P P L + +C C WP C C
Sbjct: 64 DIKMGELIMREAPLVLGPKVA-SMPICLGCHRNLLPPQKPAL-NYYKCSACSWPLCGPEC 121
Query: 129 SGLSDAHTHAPECAILKLG-CETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQE 180
+ H EC ++ ++ + YN + Y I+ LRC+ L+ P +
Sbjct: 122 E---RSAFHVDECRLMAASKFQSKINYNPGQPVGKESAYCVIMLLRCMQLKESKPAAFAR 178
Query: 181 LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNF-LEKLPG-DVLDDTSAKCLHWICGVIE 237
L ++E H+ +R T +Y+ + + ++ ++++ L + P D+L I +++
Sbjct: 179 LCELEDHLKERLE-TPLYQVLRANLISFIKTVLGLREWPDLDILR---------IAAILD 228
Query: 238 VNGVDIGRY-----TQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTM 286
N ++ + +GL+ + H+C+PN +H + M +K + + +S
Sbjct: 229 TNAFEVRQAGDRIKVRGLFPGGAMFSHDCVPNMRHRFDDDMNIMFLAKRPIAKGEILSIS 288
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI---LP 343
YT L T RR HL K F C+C RC+DPTELGTY A C C + +
Sbjct: 289 YTQQLRSTIQRRLHLKQVKCFECACARCADPTELGTYAGAHMC-------AKCKVGKFIS 341
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
++PL N ++W C C+ A++ + +++ +++ L + P +S E+ L + H
Sbjct: 342 MDPLQNAANWRCEVCNLIRPAKEYLINDAKIEAELESLDKTTP--QSFEDFLYRHRVELH 399
Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ H K++L Q+YG G+ +LS L+RK+
Sbjct: 400 ETNTHILQAKYALTQIYGNVKGFGMDELSDVSLKRKV 436
>gi|157125398|ref|XP_001654321.1| hypothetical protein AaeL_AAEL001920 [Aedes aegypti]
gi|108882685|gb|EAT46910.1| AAEL001920-PA [Aedes aegypti]
Length = 527
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 143/436 (32%), Positives = 223/436 (51%), Gaps = 42/436 (9%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
PY ++ S + GRYL A +L G++I + PL +GP A ++P+CLGC+K +
Sbjct: 46 PYIVRYSDVWGRYLVAARNLKAGEKIIQVEPLAMGP-WAASDPVCLGCYKRFAKG-SKII 103
Query: 115 RCPRCFWPACSARCSGLSDAHTHAP-ECAILKL-GCETLLAYNDYK-----YEAILPLRC 167
RC C W CS CSGL+ H EC LK LL + + YEAI LRC
Sbjct: 104 RCSGCGWRICSHTCSGLTANGAHKRLECNPLKEHNVVNLLETSSTEQIKLMYEAIFALRC 163
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSA 226
L+L++ P++Y L +ME+ E+ ++I S+ K + + ++ + D+ S
Sbjct: 164 LLLKKTDPERYATLLEMESL-------NELRQKISSLWKRNQEMIVGRIRDEWRFDEFSE 216
Query: 227 KCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
+H +CG+IEVN IG+ Y + LY + H+C PN H++ ++ + R T+
Sbjct: 217 GEIHTVCGIIEVNSFQIGQDDVYARALYPEAFYIMHDCTPNTTHTDDPKTHVLSIRLTND 276
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+ Y+ L GT RR+HL +K+F C C+RCSDPTE GT S +KC K
Sbjct: 277 LKAGDPITLSYSYTLQGTLKRRQHLFESKFFWCQCKRCSDPTEFGTNCSTIKCTKCPKG- 335
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
+IL NPLD D+DW C SCS + A+ + + +L +++D + + ++ E+
Sbjct: 336 ---YILTTNPLDQDADWKCKSCSHTMPAKIIVQLLDRLFKELDAI--DGNSIDLYEQFNV 390
Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK---------ISYVMSPN 447
K + + H +HY + KHSL QLYG GY +++S +++K + V P
Sbjct: 391 KYQNILHHNHYLFLSTKHSLCQLYGKISGYLISEMSLQQIKQKEKICRDLLSVVDVYEPG 450
Query: 448 LKATDEPICLGCHKPL 463
L I H P+
Sbjct: 451 LSRLRGVIMYELHAPI 466
Score = 38.9 bits (89), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAILK 502
A +P+CLGC+K + RC C W CS CSGL+ H EC LK
Sbjct: 83 AASDPVCLGCYKRFAKG-SKIIRCSGCGWRICSHTCSGLTANGAHKRLECNPLK 135
>gi|242013801|ref|XP_002427589.1| protein msta, isoform B, putative [Pediculus humanus corporis]
gi|212512004|gb|EEB14851.1| protein msta, isoform B, putative [Pediculus humanus corporis]
Length = 472
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 212/399 (53%), Gaps = 45/399 (11%)
Query: 67 YLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN-ARCPRCFWPACS 125
YL A+ + G+ I + P++VGP ++P+C+GC+ ++ N ++ RC C WP CS
Sbjct: 1 YLVASKSIKQGEIILEDEPIVVGP-CQESKPLCIGCYTDIDLNDSNKYFRCVNCTWPLCS 59
Query: 126 ARCSGLSDAHTHAPE-CAILK---LGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
+C G + H+ E C + + + + ++ Y A+ P RCL+L++ +P+K++ +
Sbjct: 60 NQCKGFFNKFGHSKEECETFSKSFVKNKEIKTFVNFYY-ALTPFRCLLLKKINPEKWKII 118
Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT-------SAKCLHWICG 234
K+ME+H ++R +++E +N EK+ D++ + +H++CG
Sbjct: 119 KNMESHTNERKKDGKLWE----------TN--EKMAVDIIKNVWGLSDDDDDNDIHFVCG 166
Query: 235 VIEVNGVDIG----RYTQGLYSVICLMEHNCLPNAKHSNMMQSKL------FVFRDTHIS 284
++EVN ++ + + LYS L HNC+PN HSN + KL F+ +++ ++
Sbjct: 167 ILEVNSFEVNFQNVQSLRALYSNAFLFSHNCIPNTSHSNNNEFKLIFRTTTFIQKNSAVT 226
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
Y L T RR+HL TK+F C+C+RCSDPTEL T+ +KC + + + LP
Sbjct: 227 ISYAYTLQPTLLRRKHLEKTKFFQCNCQRCSDPTELNTFIGGVKC----QICKNGFYLPE 282
Query: 345 NPLDNDSDWTCGS---CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
N LDN S W C S C A++ A ++ + ++D + +V+S E + K + +
Sbjct: 283 NSLDNKSMWFCNSIKGCKAQIQASNIECTIERFFNELDGI--NGNDVESYEYFIIKCQEI 340
Query: 402 FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
FH +HY C + H L QLYG GY L+ L RKI
Sbjct: 341 FHKNHYLCLSAMHCLSQLYGKINGYLINDLTEEQLLRKI 379
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 20/80 (25%), Positives = 42/80 (52%), Gaps = 16/80 (20%)
Query: 453 EPICLGCHKPLNPNLADN-ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAY 511
+P+C+GC+ ++ N ++ RC C WP CS +C G + H+ E CET
Sbjct: 29 KPLCIGCYTDIDLNDSNKYFRCVNCTWPLCSNQCKGFFNKFGHSKE------ECET---- 78
Query: 512 NDYKSLSRAYLDHQQVRSWT 531
S++++ +++++++
Sbjct: 79 -----FSKSFVKNKEIKTFV 93
>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
Length = 516
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 137/448 (30%), Positives = 214/448 (47%), Gaps = 65/448 (14%)
Query: 32 VAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGP 90
V YC K+HQK WK HK +CK PY+I+ S GR++ AT D+ G+ I +E P ++GP
Sbjct: 2 VYYCSKEHQKFDWKKSHKVECK--PYKIQYSDSCGRFMTATRDIKQGEVILKEKPAVIGP 59
Query: 91 KLALAEPICLGCHKPLNPNLADNA-----RCPRCFWPACSARCSGLSDAHTHAPECAIL- 144
++ + CL C + L P D+ RC C WP C C A H EC I+
Sbjct: 60 RM-YCKVQCLSCGRQLQPIPNDDGNMDFIRCSSCNWPVCGIDCE---KAEVHREECKIMV 115
Query: 145 --KLGCETLLAYNDYKYE----------AILPLRCLILQRRSPKKYQELKDMEAHMSKRG 192
K C D KYE I PLR L+++ +P+ ++ + ++E+H+ +R
Sbjct: 116 QSKYKC-------DIKYECPDKAEAAYCVIAPLRVLLMKDSNPRMFESIMNLESHVDQR- 167
Query: 193 PGTEVYEEIDSIVKYLRSN---FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----G 244
T +Y+ LR N F+ ++ G DD K + I + + N D+
Sbjct: 168 INTPIYQ-------VLRPNLIMFIRQVLGMNFDD---KIILEISSIFDNNSFDVRSADKT 217
Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF-------VFRDTHISTMYTNALWGTQPR 297
+ + +Y + +M H+C PN +H + + K + + I+ YT LWGT R
Sbjct: 218 KRLRAIYLLASMMNHSCRPNTRHIYLGEDKTLALIATVHIAKGEEITATYTQPLWGTLDR 277
Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKC-----LNEHKDQGDCWILPVNPLDNDSD 352
R+ L KYF+C CERC+DPTE GTY + C N + ++ NPLD +
Sbjct: 278 RKFLKTNKYFDCKCERCADPTEFGTYLGNIYCTMCNSTNADVSRKGAMLVSTNPLDETAL 337
Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
W C +C + R + L E++ L + P + E + K + HP ++
Sbjct: 338 WKCENCGYFIQGRQMVWGNKALKEELSSLNKTGP--QDFEHFIEKYKLTLHPRNHLLIQA 395
Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
K +L+++YG GY+ +L +LL RKI
Sbjct: 396 KLALMEIYGNYKGYSLKELPENLLSRKI 423
>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
Length = 497
Score = 211 bits (537), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 130/407 (31%), Positives = 209/407 (51%), Gaps = 33/407 (8%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
PY++ +P LGRY + DL GD I E P +GPK+ + P+CL C P++ + A+
Sbjct: 4 PYKVLDNPELGRYAVSAKDLKAGDVILEEIPFAIGPKVN-SSPLCLECCCPVDGS-ANGP 61
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLI 169
+C C WP C + H ECA+ G D + + I PLR L+
Sbjct: 62 KCSDCGWPMCEECNEDKDNVVYHKKECALFVKGKAKFQNVEDSTASCMQLDCITPLRMLL 121
Query: 170 LQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
+ P+++ EL ME H R GT +++ +IV YLR P + D S +
Sbjct: 122 AKEDHPERWDAELSVMEYHNDARKDGTTWHQDQQNIVGYLRG------PCGLKDRFSEEV 175
Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLF 276
+ + G++++N ++ G +GLY + +M H+C+PN HS + + +
Sbjct: 176 IQQVIGILDINAFEVRTSKGYSARGLYPKLGVMAHSCVPNVVHSIHPSDGYRLIGRVAID 235
Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ + T+YT L GT R+ L +KYF C C RC DPTELGT+FS++KC + D
Sbjct: 236 IPEGEKLYTVYTYTLNGTSTRQAALKSSKYFTCRCPRCLDPTELGTHFSSLKC--QKCDN 293
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENP-NVKSLEE 393
G I+ NPLD++++W C +C + V + ++D+L ++ P + + E+
Sbjct: 294 G--LIVSSNPLDDEAEWRCSNCEFKTRGAAVQKAIQVMQAEIDQLSCLENGPEKLTAFEK 351
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ K ++ HP H+ +++HSLI+LYG PGY +L LLERK+
Sbjct: 352 LYKKYRSILHPLHFIKTSIRHSLIELYGRVPGYEMPELPDILLERKV 398
>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 467
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 137/448 (30%), Positives = 222/448 (49%), Gaps = 50/448 (11%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA 70
E C +C ++HKCSAC+ V YC K+HQKE WK H CK +++ +P LGR+ A
Sbjct: 2 ESQTCVICGGHSVHKCSACENVYYCSKKHQKEDWKKHSKVCKS--FKLAENPSLGRHYVA 59
Query: 71 TLDLHPGDRIARE-SPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS 129
T ++ G+ I R+ PLI G + + P+CL C+ LN +A C +C WP C
Sbjct: 60 TRNIKVGEIILRDDQPLITGL-MYNSVPVCLQCYTMLNQEIA--IPCEKCGWPLCQ---- 112
Query: 130 GLSDAHTHAPECAILKLGCE--------TLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
+ + H EC K C T Y Y+ I +R L L+ +P+ Y++L
Sbjct: 113 ---NCNEHGLEC---KFSCSRRDSKISITEFGYPHPSYQCINIIRALSLKDTNPESYKKL 166
Query: 182 KDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGV 241
+E+H ++ E I ++ ++ DD S + + I G+++VNG
Sbjct: 167 ISLESHCNEINNSKEPLNIAHFIKRFFKA-----------DDISEEEIVTIIGILQVNGH 215
Query: 242 DIGRYTQ---GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALW 292
++ +Y + L+EHNC N S ++++ L + + HIS YT+ LW
Sbjct: 216 EVPLTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGLIIRAALPITKGDHISICYTDPLW 275
Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSD 352
GT RR HL TK+F C C RC DPTE T F+A+KC +K ++LP L+ + D
Sbjct: 276 GTANRRHHLLKTKFFECICNRCQDPTEFQTMFNALKC---NKINCSGYVLPKTFLEQEQD 332
Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
+ C +C + ++ ++ + +G ++ + + ++ + +E L + + HP+HY+ V
Sbjct: 333 YVCKTCESIVSCTEIEKMLEDIGIRLSNM--KKNDIVACKEFLDRYKNNLHPNHYYNIDV 390
Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+L QL G Q G + LL KI
Sbjct: 391 TIALTQLIGQQTG-GLAVIEEDLLIEKI 417
>gi|350420138|ref|XP_003492412.1| PREDICTED: hypothetical protein LOC100743563 [Bombus impatiens]
Length = 442
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/448 (29%), Positives = 228/448 (50%), Gaps = 49/448 (10%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
E CA+C + A+HKCSAC V YC KQHQKE W+ H CK +++ LLGR+
Sbjct: 2 ESQTTCAICGKNAIHKCSACGNVYYCSKQHQKEDWRNHAKACKS--FKLAEDSLLGRHCI 59
Query: 70 ATLDLHPGDRIARES-PLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARC 128
AT ++ G+ + ++ PLI GP A P+CL C+ LN +A C +C WP C
Sbjct: 60 ATRNIKVGEIVLKDDEPLIAGPMYNCA-PVCLRCYMVLNETIA--VACEKCGWPLCQ--- 113
Query: 129 SGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKD 183
D H EC + ++ ++ Y+ I +R L + + + Y++L
Sbjct: 114 ----DCKDHGLECNFSSSRRDHKVSITEFGHPHPSYQCITVIRALASKDVNLESYKKLLS 169
Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI 243
+E+H + E+ + V++++ F DD S + + I G+++VNG ++
Sbjct: 170 LESHCDRINS-----HELSNTVRFIKRFFKT-------DDISEEEMTKIVGILQVNGHEV 217
Query: 244 GRYT---QGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGT 294
+Y + L+EHNC N S ++ + + + + HIS YT+ LWGT
Sbjct: 218 PLTDPPYVAVYELTSLLEHNCKANCSKSFTDTGGLIIHAAVPIAKGDHISICYTDPLWGT 277
Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILPVNPLDNDSD 352
RR HL TK+F C+C+RC DPTE GT F+A++C ++ +C +ILP + + D
Sbjct: 278 ANRRHHLLKTKFFECTCDRCKDPTEFGTMFNALRC-----NRMNCPGYILPKTFFEQEHD 332
Query: 353 WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAV 412
+ C C + + ++ + +G + ++++N ++ + +E +++ ++ HP+H++ V
Sbjct: 333 YICKICESIVPYVEIEKMLEDIGIYL-SIIKKN-DIIACKEFISRYKSTLHPNHFYNIDV 390
Query: 413 KHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+L QL G Q G + + LL KI
Sbjct: 391 TIALAQLIGQQTG-GLAAVETDLLIEKI 417
>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 440
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 136/445 (30%), Positives = 221/445 (49%), Gaps = 44/445 (9%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA 70
E C +C ++HKCSAC+ V YC K+HQKE WK H CK +++ +P LGR+ A
Sbjct: 2 ESQTCVICGGHSVHKCSACENVYYCSKKHQKEDWKKHSKICKS--FKLAENPSLGRHYIA 59
Query: 71 TLDLHPGDRIARE-SPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS 129
T ++ G+ I R+ PLI G P+CL C+ LN +A C +C WP C
Sbjct: 60 TRNIKVGEIILRDDQPLITGLMYNTV-PVCLQCYTVLNQEIA--IPCEKCGWPLCQ---- 112
Query: 130 GLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELKDM 184
+ + H EC ++ ++ ++ Y+ I +R L L+ +P+ Y++L +
Sbjct: 113 ---NCNEHGLECKFSSSRRDSKISITEFGYPHPSYQCINVIRALSLKDTNPESYKKLISL 169
Query: 185 EAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG 244
E+H ++ E I ++ ++ DD S + + I G+++VNG ++
Sbjct: 170 ESHCNEMNNSKEPLNIAHFIKRFFKA-----------DDISEEEIATIIGILQVNGHEVP 218
Query: 245 RYTQ---GLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNALWGTQ 295
+Y + L+EHNC N S ++++ L + + HIS YT+ LWGT
Sbjct: 219 LTDSPYVAVYEMASLIEHNCRANCSKSFTDMGGLIIRAALPITKGDHISICYTDPLWGTA 278
Query: 296 PRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
RR HL TK+F C C RC DPTE T F+A+KC N + +ILP L+ + D+ C
Sbjct: 279 NRRHHLFKTKFFECICNRCQDPTEFQTMFNALKCNNINCSG---YILPKTFLEQEQDYIC 335
Query: 356 GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHS 415
C + ++ ++ V +G +D + ++ + +E L + + HP+HY+ V +
Sbjct: 336 KICESVVSCTEIEKVLEDIG--IDLSNMKKNDIVACKEFLDRYKNDLHPNHYYNIDVTIA 393
Query: 416 LIQLYGTQPGYAYTQLSSSLLERKI 440
L QL G Q G + +LL KI
Sbjct: 394 LAQLIGQQTG-GLAVIEENLLIEKI 417
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 204 bits (519), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 155/507 (30%), Positives = 238/507 (46%), Gaps = 43/507 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + A +CS C YC +HQ++ WK HK C P++I S GRY+ AT D+
Sbjct: 26 CPVCNQPAKKRCSGCSAAYYCSVEHQRQDWKNHKNVCH--PFKICSDERYGRYMMATKDI 83
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
GD + +ESPL+ GP + P+C+GC + L N C RC WP C C D
Sbjct: 84 KAGDVVLKESPLVYGPA-QITNPVCVGCLQGLTEN--QFLECERCGWPVCKRECQ---DH 137
Query: 135 HTHAPECAI-LKLGCETLLA--YNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
+H EC + G + L YN + Y+ ++PLRCL+L R P K+Q L +E+H +
Sbjct: 138 PSHKAECKFTIARGSKMQLRHFYNPHPVYQCLIPLRCLLLAERDPAKWQALIKLESHEEQ 197
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRY---T 247
R + + + + K + F K +D K I G+++VNG ++ +
Sbjct: 198 RRGSEQWRNDREGVAKLIPRFF--KCENKWSEDEILK----IVGILQVNGHEVPLTEPPS 251
Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF-------RDTHISTMYTNALWGTQPRREH 300
+Y+ ++EH+C PN S K VF + +S Y++ LWGT R++H
Sbjct: 252 VAIYNNASMLEHSCRPNLSKS-FTSKKEIVFWAPNPIKQGERLSISYSDVLWGTANRQDH 310
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSCS 359
L TK F C+C RC DPTE GTY SA+KC KD +LP N + + C C
Sbjct: 311 LQQTKLFRCTCVRCLDPTEFGTYLSALKCSGFKKDSNCSGLLLPENLKNWYGGYICNKCR 370
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
++ +++ + + +VD + N + + + P+H+ VK SL Q+
Sbjct: 371 GLVDGKEITNILDRA--RVDHEAMQKDNEQHCHKYIAHYGRWLGPNHHLLVDVKISLSQI 428
Query: 420 YGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC------LGCHK-PLNPNLADNAR 472
G A ++S L K+ + L + +C +G + L+ LA+ AR
Sbjct: 429 MGGGDPNAIQKISDEDLMTKMK--ICQELIDVFQRVCPAEARVIGTTRFELHAALAEFAR 486
Query: 473 CPRCFWPACSARCSGLSDAHTHAPECA 499
R SA S L D+ +A EC
Sbjct: 487 --RGVESMNSAVRSALEDSLYNAEECV 511
>gi|195566137|ref|XP_002106647.1| GD16997 [Drosophila simulans]
gi|194204029|gb|EDX17605.1| GD16997 [Drosophila simulans]
Length = 532
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/457 (29%), Positives = 222/457 (48%), Gaps = 46/457 (10%)
Query: 37 KQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE 96
+Q +KEH + + K+LPY ++ S + GRYL A L G+ + RE PL +GP ++ +
Sbjct: 31 EQQEKEHSRAPTEE-KELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-GD 88
Query: 97 PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAIL--------KLG 147
P+CLGC++P++ D RCP C WP C + C+GL H H EC + KL
Sbjct: 89 PVCLGCYQPVSLK-PDQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYGERRAVAGKLL 147
Query: 148 CETLL-AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IV 205
E A YE ++ +R L+L++ P+++ + ME+H +R ++ + +V
Sbjct: 148 TERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVV 207
Query: 206 KYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCL 262
+ LR + L+D A+ +H +CG+++VN +IG+ + LY L+ H+C
Sbjct: 208 QRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCT 260
Query: 263 PNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
PN H++ +S + R + ++ Y L GT RR + K F C C RCS
Sbjct: 261 PNTAHTDDPRSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCS 320
Query: 316 DPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLG 375
DP ELGT SA+ C + V+PL DW C C+ ++ A +V ++
Sbjct: 321 DPRELGTDCSALVCATCRTGS----VRAVDPLQQSGDWACDRCAHKMGATEVERQLDRIN 376
Query: 376 EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSL 435
++ + + ++ LE L + + P+HY + K+SL Q+YG GY Q+S
Sbjct: 377 NDLEDI--DVHDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPED 434
Query: 436 LERKISY---------VMSPNLKATDEPICLGCHKPL 463
+ RK SY V+ P L I H P+
Sbjct: 435 IARKESYCREFLEIVDVLDPGLTRLRGLIMYELHAPV 471
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS I YG + L R+ + P + + +P+CLGC++P++ D
Sbjct: 49 YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-PD 103
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
RCP C WP C + C+GL H H EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135
>gi|24641162|ref|NP_572675.1| CG11160, isoform A [Drosophila melanogaster]
gi|22832070|gb|AAF47987.2| CG11160, isoform A [Drosophila melanogaster]
Length = 532
Score = 201 bits (512), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 140/460 (30%), Positives = 222/460 (48%), Gaps = 50/460 (10%)
Query: 37 KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
KQ Q++ + P + K+LPY ++ S + GRYL A L G+ + RE PL +GP ++
Sbjct: 29 KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87
Query: 96 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
+P+CLGC+ P++ AD RCP C WP C + C+GL H H EC + + E
Sbjct: 88 DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146
Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEID 202
L A YE ++ +R L+L++ P+++ + ME+H +R + YEE
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE-- 204
Query: 203 SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEH 259
+V+ LR + L+D A+ +H +CG+++VN +IG+ + LY L+ H
Sbjct: 205 KVVQRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAH 257
Query: 260 NCLPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCE 312
+C PN H++ S + R + ++ Y L GT RR + K F C C
Sbjct: 258 DCTPNTAHTDDPSSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCR 317
Query: 313 RCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTS 372
RCSDP ELGT SA+ C + V+PL DW C C+ ++ A +V
Sbjct: 318 RCSDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLD 373
Query: 373 QLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLS 432
++ ++ + + ++ LE L + + P+HY + K+SL Q+YG GY Q+S
Sbjct: 374 RINNDLEDI--DVHDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMS 431
Query: 433 SSLLERKISY---------VMSPNLKATDEPICLGCHKPL 463
+ RK SY V+ P L I H P+
Sbjct: 432 PEDIARKESYCREFLEIVDVLDPGLTRLRGLIMYELHAPV 471
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS I YG + L R+ + P + + +P+CLGC+ P++ AD
Sbjct: 49 YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
RCP C WP C + C+GL H H EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135
>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
[Megachile rotundata]
Length = 523
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 136/436 (31%), Positives = 215/436 (49%), Gaps = 57/436 (13%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC ++ KCSAC V YC K+HQKE WK H C+ +++ S LGRY A ++
Sbjct: 7 CAVCGGESVFKCSACGNVYYCSKKHQKEDWKRHVQFCR--AFKLAESSSLGRYYIAARNI 64
Query: 75 HPGDRIARES-PLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
G+ + ++ PL+ GP + + P+CLGC+ L+ N A C +C WP C +
Sbjct: 65 KAGEIVLKDDRPLVAGP-MHNSVPVCLGCYMVLHENTA--VPCTKCGWPLCQ-------N 114
Query: 134 AHTHAPEC-----------AILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
H EC +I K G Y Y+ I +R L L+ + + Y +L
Sbjct: 115 CKEHGTECDFTSSRRSDKVSITKFG------YPHPSYQCINVIRALSLKDNNIESYNKLL 168
Query: 183 DMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGV 241
+E+H K + T ++EE ++V +++ F +D + + I GV++VNG
Sbjct: 169 TLESHSDKIKNKETFIFEESSNLVNFIKRFFKS-------EDIPEEDIRKIIGVLQVNGH 221
Query: 242 DIGRYT---QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT------HISTMYTNALW 292
++ +Y + L+EHNC N S L + T HIS YT+ LW
Sbjct: 222 EVPLTDPPYVAVYELASLLEHNCKANCSKSFTDTGGLIIHAATPITKGDHISICYTDPLW 281
Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILPVNPL-DN 349
GT RR HL TKYF C+C+RC DPTE G F+A++C + DC ++LP L D+
Sbjct: 282 GTMNRRHHLLETKYFECTCDRCRDPTEFGAMFNAIRC-----SEIDCSGYVLPKTFLGDH 336
Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
D+ C +C+ + + +G + ++ + + +V +E L + + + HP+H++
Sbjct: 337 REDYVCTNCTTVVPLEIIEETLEDIGIHLSEMRKNDIDV--CKEFLNRYKNVLHPNHFYN 394
Query: 410 YAVKHSLIQLYGTQPG 425
V +L QL G Q G
Sbjct: 395 IDVTIALAQLIGQQSG 410
>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 514
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/411 (32%), Positives = 198/411 (48%), Gaps = 40/411 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I + +GR++ A DL PG+ I E P +VGPK A P+CL C+ P P+L +
Sbjct: 20 YKILKNDKVGRFMVANKDLQPGEEIVTEMPFVVGPK-AFTYPLCLSCYAPWPPSLTEKPL 78
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
C RC WP CS C + EC I K + L + + E I PLR
Sbjct: 79 CSRCGWPVCSTECENQPQHKDY--ECQIFVQANEKFNVQAALEEANENGVPQLECITPLR 136
Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
L+ ++P+++ E+K+MEAH R + ++V+YLR+ LD S
Sbjct: 137 LLLESEKNPERWNNEVKNMEAHNKIRSQKAHWKSDQVNVVEYLRNQL-------KLDRFS 189
Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
+ + CG++E+N ++ G + LY + LM H+C+ N HS R T
Sbjct: 190 EELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTDYRIRLRTT 249
Query: 282 -------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
+ YT++L+ T RREHL K+F C+C RCSDPTELGT+ S++KC
Sbjct: 250 VHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC--NKC 307
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN-----PNVK 389
D G +L ++ LD DS W C C + V V + +VD + Q + ++
Sbjct: 308 DNG--IVLSLDSLDPDSSWKCTHCEFSTSGPAVRKVLKIIQAEVDAVEQISGADGADAIQ 365
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E ++ K ++ HP H ++HSL Q+YG Y L +LE KI
Sbjct: 366 ERETVMKKYRSVLHPRHAFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 416
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +V+ P KA P+CL C+ P P+L + C RC WP CS C
Sbjct: 47 EMPFVVGP--KAFTYPLCLSCYAPWPPSLTEKPLCSRCGWPVCSTEC 91
>gi|195479534|ref|XP_002100923.1| GE15901 [Drosophila yakuba]
gi|194188447|gb|EDX02031.1| GE15901 [Drosophila yakuba]
Length = 532
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/444 (30%), Positives = 215/444 (48%), Gaps = 49/444 (11%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA 111
K+ PY I+ S + GRYL A+ L G+ + RE PL +GP ++ +P+CLGC++P++
Sbjct: 45 KEQPYRIEHSDIYGRYLVASRQLEAGETLIREEPLAIGPCVS-GDPVCLGCYQPVSLK-P 102
Query: 112 DNARCPRCFWPACSARCSGLSDAHTHA-PECAIL----KLGCETLL-----AYNDYKYEA 161
D RCP C WP C C+GL H H EC + + E L A YE
Sbjct: 103 DQYRCPGCAWPLCGITCTGLKHRHGHTETECQLFGERRAVAGELLTERAGPAEVRDLYEL 162
Query: 162 ILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNFLEKLPG 218
++ +R L+L++ P+++ + ME+H +R + YEE +V+ LR +
Sbjct: 163 VMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRVTW------ 214
Query: 219 DVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
L+D A+ +H +CG+++VN +IG+ + LY L+ H+C PN H++ +S
Sbjct: 215 -QLEDLEAEQVHDVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPRSFE 273
Query: 276 FVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
+ R + ++ Y L GT RR + K F C C RCSDP ELGT SA+
Sbjct: 274 ILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALV 333
Query: 329 CLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV 388
C + V+PL+ DW C CS ++ A +V ++ ++ + + ++
Sbjct: 334 CATCRTGS----VRAVDPLEQTGDWACDRCSHKMGATEVERQLDRINNDLEDI--DVHDI 387
Query: 389 KSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY------ 442
LE L + + P+HY + K+SL Q+YG GY Q+S + RK SY
Sbjct: 388 PGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPEDIARKESYCREFLE 447
Query: 443 ---VMSPNLKATDEPICLGCHKPL 463
V+ P L I H P+
Sbjct: 448 IVDVLDPGLTRLRGLIMYELHAPV 471
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 46/93 (49%), Gaps = 6/93 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y ++HS +YG + + L R+ + P + + +P+CLGC++P++ D
Sbjct: 49 YRIEHS--DIYGRYLVASRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-PD 103
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAIL 501
RCP C WP C C+GL H H EC +
Sbjct: 104 QYRCPGCAWPLCGITCTGLKHRHGHTETECQLF 136
>gi|156544373|ref|XP_001607398.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 507
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 146/461 (31%), Positives = 224/461 (48%), Gaps = 60/461 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I+ + +GR+L A+ DL PG++I E P +VGPK A P+CL C+ + P D+++
Sbjct: 12 YKIQQNDQVGRFLVASKDLEPGEQILTELPFVVGPKAA-TYPVCLSCYS-VWPATEDDSK 69
Query: 116 --CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETL---LAYNDY------KYEAILP 164
C RC WP C C ++ EC I + E +A ++ + E I P
Sbjct: 70 PLCSRCSWPVCGPECE--NNPQHKDYECPIFEAAKEKFSIDVALSEEHQNGVPQLECITP 127
Query: 165 LRCLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
LR L+ + P++++ E+KDMEAH KR + + + +IV+Y+R LD
Sbjct: 128 LRLLLAAEKDPERWKSEIKDMEAHNKKRAQKNQWHIDHVNIVEYIRKRL-------KLDR 180
Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQ 272
S + + CG+++VN +I G +GLY + +M HNC+ N HS ++
Sbjct: 181 FSEEDIQTACGILDVNSHEIRTAKGFLARGLYPKVAIMNHNCVSNTAHSIDPNDYRIFLR 240
Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
S + V + YT+AL T RREHL +K+F C+C RCSDPTELGT+ S++KC
Sbjct: 241 SAVKVPAAGELFGSYTHALLPTLLRREHLLESKHFACACNRCSDPTELGTHMSSLKC--S 298
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL----VQENPN- 387
D G +LP++ LD +S W C C N V V + ++D++ + P+
Sbjct: 299 KCDNG--VVLPLDSLDAESQWKCTHCEFSTNGLAVAKVQKIIQAEMDQVEMYSAADGPDA 356
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY----- 442
+++ E + K ++ HP H + SL QLYG Y L +LE KI
Sbjct: 357 IQNRETFMKKYHSVLHPRHALLTIPRFSLSQLYGRVEEYFLDDLPDIVLEHKIDMCRLVL 416
Query: 443 ----VMSPNLKATDEPICLGCHKPL--------NPNLADNA 471
V+ P + I H PL N + DNA
Sbjct: 417 QVLDVVEPGKTRSRGMILYELHAPLLFIAKGQWNAGVIDNA 457
>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
Length = 485
Score = 197 bits (502), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 144/453 (31%), Positives = 232/453 (51%), Gaps = 59/453 (13%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATL 72
RCAVC + +CS C++V+YCG+ HQ++HWK H+ +C+ ++I S+ +LGR+L AT
Sbjct: 7 RCAVCDASGGKQCSGCQQVSYCGRDHQRQHWKAGHREQCR--CFKISSNAILGRHLTATR 64
Query: 73 DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNL-----ADNARCPRCFWPACSAR 127
+ G+ I +SPL++GPK+A + P+CLGCH+ L P + D C C WP C
Sbjct: 65 PIRRGELIFNDSPLLLGPKIA-SIPVCLGCHRNLPPLMNSHLQVDFYHCRHCSWPLCGPS 123
Query: 128 CSGLSDAHTHAPECAIL-----KLGCETLLAYND-----YKYEAILPLRCLILQRRSPKK 177
C A +H EC +L +G ++D Y A++PLR L+L+R +P++
Sbjct: 124 CE---TATSHKDECPLLARSSKSIGSRLQFCFSDPTRKESVYCAVVPLRALLLKRSNPER 180
Query: 178 YQE-LKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
+++ +E+H+ R T +Y + S +V ++R + + G+V + + L I +
Sbjct: 181 FRDGFLTLESHLETR-IKTPLYAALRSNLVPFVRD--ILGMGGEV----TERELLEIAAI 233
Query: 236 IEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHIS 284
++ N D+ + LY + +M H+C PN KH + +L V +D IS
Sbjct: 234 LDTNCYDVRLPELDVKVRALYELGAMMSHHCRPNTKHYFDERLRLVVVATVDIPKDAVIS 293
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L T RR + +K F C C+RC DPTELGTY ++ C + K + ++
Sbjct: 294 ISYTQPLLSTIQRRYAIQQSKCFECCCDRCRDPTELGTYLGSIVCPSCRKAK----VVAF 349
Query: 345 NPLDNDSDWTCGS--CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT--KLEA 400
+PLD SDW C S CS R +A D ++ + + E L +LE+
Sbjct: 350 DPLDLRSDWQCESRKCSFRESAVDAVRRNELRQAEIMAVARTVSPRDQYERFLVTHQLES 409
Query: 401 ------MFHPHHYHCYAVKHSLIQLY---GTQP 424
+ HP + + VK++L QLY T+P
Sbjct: 410 SKEELGLLHPWNSNVLQVKYALTQLYRGAATEP 442
>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
Length = 510
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 131/409 (32%), Positives = 202/409 (49%), Gaps = 39/409 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ + +GRY+ A+ +L G+ I E+P IVGPK A P+CL C+ P P+ D
Sbjct: 19 YKVLQNDQVGRYMIASRELQAGEEITSETPFIVGPK-ACTYPLCLSCYTPWPPSPDDKPL 77
Query: 116 CPRCFWPACSARCSGLSDAHTHAP-ECAIL-----KLGCETLLAYNDY-KYEAILPLRCL 168
C +C W C+ C +A H EC I K + +L + + E I PLR L
Sbjct: 78 CSKCGWSVCNQECE---NAPQHKDYECPIFVEANEKFNVDVVLDGDGVPQLECITPLRLL 134
Query: 169 ILQRRSPKKY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
+ R+ +++ +E+KDMEAH R + + +IV YLR LD S +
Sbjct: 135 LESERNVERWNKEIKDMEAHNKIRSEKIQWKSDHVNIVDYLRKRL-------KLDRFSEE 187
Query: 228 CLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLF 276
+ ICG++++N ++ G +GLY + +M H+C+ N HS +++ L
Sbjct: 188 FIQTICGILDINTFEVRTMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLK 247
Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
V + YT++L T RREHL KYF C+C RCSDPTELGT+ S++KC D
Sbjct: 248 VPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRCSDPTELGTHMSSLKC--NKCDN 305
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSL 391
G IL ++ LD+ S W C C + + V + + +VD + +
Sbjct: 306 G--VILSLDSLDSTSTWNCTHCDFSTSGQAVRKIFKIIQAEVDTVEAISGADGADAILER 363
Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E ++ K ++ HPHH ++HSL Q+YG Y L +LE K+
Sbjct: 364 ENVIKKYRSVLHPHHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHKV 412
>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
Length = 514
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 210/419 (50%), Gaps = 36/419 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC E +KCS C +V+YC QHQK+ WK HKP C P++I + LGR+L AT +
Sbjct: 4 CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKAHKPSCH--PFKIAHNEQLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+CLGC + P+ D+ C +C WP C C L +
Sbjct: 62 KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEPD--DHIECEQCGWPLCGPECKSLDE- 117
Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
H EC + K + + ++ Y + +RCL++ SP+K + +D+E+ S
Sbjct: 118 --HKAECRLTK-DRDQRVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQDLESLES 174
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
R + ++ SI +++ F + ++ K + G +++NG ++
Sbjct: 175 TRRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEQIMKAV----GALQINGHEVPTTDPS 227
Query: 249 --GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
++ E++CLPN AK N ++ + + ++ H+S Y++A+WGT R+ H
Sbjct: 228 HVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRH 287
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
L TK F C+CERC D TEL T +SA+KC + Q +LP D + W C C
Sbjct: 288 LMQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADDWNGSWRCRECHK 344
Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
++ V + + G+ + + N ++ + L E P H+H +K L+QL
Sbjct: 345 QVQKHYVERILERAGKDIQSM---EKNAENGLKYLKHYEKWLPPQHFHMSEIKILLVQL 400
Score = 38.9 bits (89), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 5/49 (10%)
Query: 454 PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 502
P+CLGC + P+ D+ C +C WP C C L + H EC + K
Sbjct: 83 PVCLGCLNGIEPD--DHIECEQCGWPLCGPECKSLDE---HKAECRLTK 126
>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 134/456 (29%), Positives = 217/456 (47%), Gaps = 32/456 (7%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC++ A +CS C V YC +HQ++ WK+HK C+ P++I S+ GR+L AT D+
Sbjct: 2 CPVCKKEASKRCSRCAMVYYCCVEHQQQDWKVHKTTCQ--PFKIFSNEQYGRFLVATRDI 59
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
G+ + +ESPL+ GP + P+C+GC + L C RC WP C C D+
Sbjct: 60 KAGEIVLKESPLVHGPA-QITGPVCVGCLQGLEEK--KYLDCERCGWPVCKRSCQ---DS 113
Query: 135 HTHAPECAILKLGCETLLAYNDY----KYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
+H EC + + Y Y+ ++P+RCL+L P +++ L +E+H +
Sbjct: 114 PSHQAECKFTIARGSKISIQHFYVPHPTYQCLMPVRCLLLAESDPARWETLLKLESHEEE 173
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRY---T 247
R + + + + K + F K +D + + G+I+VNG ++ +
Sbjct: 174 RRGSEQWRNDREGVAKLIPRFF--KCENKWDEDEILR----VVGIIQVNGHEVPMTEPSS 227
Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHL 301
+Y++ ++EH+C PN S + ++ ++ R +S YT+ LW T R EHL
Sbjct: 228 VAIYNMASMLEHSCRPNLAKSFTNRGEVVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHL 287
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSCSA 360
TK F C CERCSD TE TYFSA++C KD ++LPV+ +W C CS
Sbjct: 288 QQTKMFRCECERCSDRTEYETYFSAVRCSGFQKDSKCKGYLLPVDNAQWTGEWQCLRCSK 347
Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
++ V + + ++D E + +H + VK SL Q+
Sbjct: 348 EVSGATVCQILERA--RMDLEAMEKHREDHCNRYMKHYSKWLTANHQYMVDVKISLSQVI 405
Query: 421 GTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC 456
G + A ++ L KI + +L A E IC
Sbjct: 406 GGKNPEAIKKIPYEKLMNKIK--ICQDLIALFEKIC 439
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 208/421 (49%), Gaps = 40/421 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + KCS C +V+YC QHQK+ WK HK C P++I + LLGR+L AT +
Sbjct: 4 CNVCEQPTKSKCSNCNQVSYCSVQHQKQDWKAHKSSCH--PFKIAHNDLLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + I +E+PL+ GP + P+C+GC + P D+ C C WP C C L +
Sbjct: 62 KPYEIILKEAPLVRGPA-QITPPVCMGCLNSIEPE--DHIDCDLCGWPLCGPECKSLGE- 117
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPL-------RCLILQRRSPKKYQELKDMEAH 187
H EC + + E N +++ PL RCL++ SP+K + +++E+
Sbjct: 118 --HRAECQLTQ---ERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESL 172
Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
S R + ++ SI +++ F + S + + G +++NG ++
Sbjct: 173 ESTRRGSNQWKADLASIGQFIPKFF-------KTEKFSEEEVMRAVGALQINGHEVPTSD 225
Query: 248 Q---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRR 298
++ EH+C+PN AK N ++ + + +++H+S Y++A+WGT R+
Sbjct: 226 PPHVAVFYTASFTEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQ 285
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
HL TK F C+C+RC+D TELGT +SA+KC + + D +LP D + W C C
Sbjct: 286 RHLVQTKLFKCACDRCTDVTELGTNYSAIKCEDR---RCDGLMLPSKTDDWNGSWKCREC 342
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
++ + + + G+ + + + N + L E P HYH VK L+Q
Sbjct: 343 QKQVQKHYIDQILERAGKDLQSMEKTAENGL---KYLKHYEKWLPPRHYHMSEVKILLVQ 399
Query: 419 L 419
L
Sbjct: 400 L 400
>gi|195060014|ref|XP_001995739.1| GH17917 [Drosophila grimshawi]
gi|193896525|gb|EDV95391.1| GH17917 [Drosophila grimshawi]
Length = 545
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 127/438 (28%), Positives = 215/438 (49%), Gaps = 44/438 (10%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y + S + GRYL A L G+ + E PL +GP ++ EP+CLGC++P+ + + R
Sbjct: 61 YRVAHSDVFGRYLVANRRLEAGELLISEEPLAIGPCVS-CEPVCLGCYQPVQLD-SQQYR 118
Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK---------LGCETLLAYNDYKYEAILPL 165
CP C WP C C G+ H A ECA+ L L YE +L +
Sbjct: 119 CPDCNWPLCGPTCRGIRQRTGHSAEECALYAERRALLADVLTANATLEQRRDLYELVLIV 178
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDT 224
R ++L++ S ++Y ++ ME+H ++R +++ + +V+ LR+++ D L+
Sbjct: 179 RIVLLKQHSAEQYANIRRMESHTAERRENATLWQHYEQKVVQRLRNDWQLANVCDELE-- 236
Query: 225 SAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
LH ICG+++VN +IG+ + LY L+ H+C PN H++ ++++
Sbjct: 237 ----LHEICGILDVNCFEIGQRGAKARTLYPSAFLLAHDCTPNTAHTDDPATFAILLRTS 292
Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
V ++ Y L GT RR + K F C C+RC+DP ELG+ SA+ C +
Sbjct: 293 RRVLEQEPLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSALVCRSCRL 352
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEM 394
I PL+ +DW C C RL+A D+ + ++ + ++++ + ++ +LE
Sbjct: 353 GS----IRATAPLEQTADWACDRCDCRLSAADLERLIDRINDDLEEI--DVHDIPALENF 406
Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------VMS 445
LT+ + P+HY + K+SL Q+YG GY ++S +ERK Y ++
Sbjct: 407 LTRYREVLRPNHYLLLSAKYSLCQIYGRIEGYLLPEMSPQDIERKERYCRDFLEIVDLLD 466
Query: 446 PNLKATDEPICLGCHKPL 463
P L I H P+
Sbjct: 467 PGLTRLRGLIMYELHAPI 484
Score = 46.6 bits (109), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 453 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-APECAI 500
EP+CLGC++P+ + + RCP C WP C C G+ H A ECA+
Sbjct: 100 EPVCLGCYQPVQLD-SQQYRCPDCNWPLCGPTCRGIRQRTGHSAEECAL 147
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 208/421 (49%), Gaps = 40/421 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + KCS C +V+YC QHQK+ WK HK C P++I + LLGR+L AT +
Sbjct: 4 CNVCDQPTKSKCSNCNQVSYCSVQHQKQDWKAHKSSCH--PFKIAHNDLLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + I +E+PL+ GP + P+C+GC + P D+ C C WP C C L +
Sbjct: 62 KPYEIILKEAPLVRGPA-QITPPVCMGCLNSIEPE--DHIDCDLCGWPLCGPECKSLGE- 117
Query: 135 HTHAPECAILKLGCETLLAYNDYKYEAILPL-------RCLILQRRSPKKYQELKDMEAH 187
H EC + + E N +++ PL RCL++ SP+K + +++E+
Sbjct: 118 --HRAECQLTQ---ERGQKVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESL 172
Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
S R + ++ SI +++ F + S + + G +++NG ++
Sbjct: 173 ESTRRGSNQWKADLASIGQFIPKFF-------KTEKFSEEEVMRAVGALQINGHEVPTSD 225
Query: 248 Q---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRR 298
++ EH+C+PN AK N ++ + + +++H+S Y++A+WGT R+
Sbjct: 226 PPHVAVFYTASFTEHSCMPNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQ 285
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
HL TK F C+C+RC+D TELGT +SA+KC + + D +LP D + W C C
Sbjct: 286 RHLVQTKLFKCACDRCTDVTELGTNYSAIKCEDR---RCDGLMLPSKTDDWNGSWKCREC 342
Query: 359 SARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQ 418
++ + + + G+ + + + N + L E P HYH VK L+Q
Sbjct: 343 QKQVQKHYIDQILERAGKDLQSMEKTAENGL---KYLKHYEKWLPPRHYHMSEVKILLVQ 399
Query: 419 L 419
L
Sbjct: 400 L 400
>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
Length = 515
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 124/420 (29%), Positives = 212/420 (50%), Gaps = 38/420 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC E + CS C +V+YC QHQK+ WK HKP C PY+I + +LGR+L AT ++
Sbjct: 4 CQVCAEPTKNNCSNCNQVSYCSVQHQKQDWKSHKPNCH--PYKIAHNEVLGRHLVATRNI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + I +E+PL+ GP ++ P+C+GC + P D+ C +C WP C C L +
Sbjct: 62 KPYEIILKEAPLVRGPA-QISAPVCMGCLNSIEP--TDHIECEQCGWPLCGPECKSLDE- 117
Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
H EC + K + + +++ Y + +RCL+L + ++ D+E+ S
Sbjct: 118 --HRAECKLTKDRGQKV-NVQEFQGPHPLYTCVSTVRCLLLGETNAANAKKFSDLESLES 174
Query: 190 KRGPGTEVYEEIDSIVKYLRSNF-LEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ 248
R ++ ++ SI +++ F +K + + T G +++NG ++
Sbjct: 175 TRRGSSQWKADLTSIGQFIPKFFKTQKFSEEEIMRT--------VGALQINGHEVPTSDP 226
Query: 249 ---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
++ V E++CLPN AK N ++ + + +++H+S Y++A+WGT R+
Sbjct: 227 PHVAVFYVASFTENSCLPNLAKSFNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQR 286
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
HL TK F C+C RC DPTELGT +SA+KC + K G +LP D W C C
Sbjct: 287 HLMQTKLFKCACLRCQDPTELGTNYSAIKC-EDRKCSG--LMLPTKADDWHGGWQCKECG 343
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
++ + V + + G+ + + +++ + L E H+H +K L+Q+
Sbjct: 344 KQVPKQYVEGILERAGQDIQSM---EKTIENGFKYLKHYERWLPSQHFHMSEIKILLVQI 400
>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 514
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 198/411 (48%), Gaps = 40/411 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ S +GRY+ A+ +L G+ I E P +VGPK A P+CL C+ P P+ D
Sbjct: 20 YKVLRSDQVGRYMVASRELQAGEEIMTEMPFVVGPK-ACTYPLCLACYTPWPPSPDDKPL 78
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
C +C WP C C + EC + K L N+ + E I PLR
Sbjct: 79 CSKCGWPVCGEECENSLQHKDY--ECQVFVQANEKFNVTAALDENNENGVPQLECITPLR 136
Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
L+ +S +++ E++DMEAH R ++ + +IV+YLR LD S
Sbjct: 137 LLLESEKSVERWNSEVRDMEAHTKIRCQKSQWKSDHVNIVEYLRKRL-------KLDRFS 189
Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
+C+ CG++E+N +I G + LY + +M H+C+ N H+ +++
Sbjct: 190 EECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDYRIRLRTT 249
Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
L + + YT++L T RREHL K+F C+C RCSDPTEL T+ S++KC
Sbjct: 250 LKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRCSDPTELATHMSSLKC--NKC 307
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
D G +LP++ LD +S W C C +N + V V + +VD + +
Sbjct: 308 DNG--IVLPLDSLDAESAWKCTHCEYSINGQAVRKVFRIIQAEVDNVEAISGADGADAIH 365
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E +L K ++ HP H ++HSL Q+YG Y L +LE K+
Sbjct: 366 ERETVLRKYRSVLHPRHAFLTILRHSLTQMYGRVDEYLLDDLPDVVLEHKV 416
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +V+ P KA P+CL C+ P P+ D C +C WP C C
Sbjct: 47 EMPFVVGP--KACTYPLCLACYTPWPPSPDDKPLCSKCGWPVCGEEC 91
>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
Length = 517
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 125/419 (29%), Positives = 208/419 (49%), Gaps = 36/419 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC+E KCS C +V+YC QHQK+ WK HK C P++I S LGR+L AT +
Sbjct: 6 CHVCQEPTKTKCSNCNQVSYCSVQHQKQDWKAHKTNCH--PFKIAHSEQLGRHLVATRTI 63
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + I RE+PL+ GP ++ P+C+GC + D+ C +C WP C C L +
Sbjct: 64 KPYEIILREAPLVRGPA-QISAPVCMGCLNSIEAE--DHIDCEQCGWPLCGPECKSLEE- 119
Query: 135 HTHAPECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
H EC + K + + N++ Y + +RCL++ SP+K + +++E+ S
Sbjct: 120 --HQAECKLTKDRGQKV-NVNEFNGPHPLYTCVSTVRCLLIGETSPEKAAKFQELESLES 176
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ- 248
R + ++ SI +++ F T + + + G +++NG +I
Sbjct: 177 TRRGSNQWKADLASIGQFIPKFF------KTQKFTEEQIMRAV-GALQINGHEIPTSDPP 229
Query: 249 --GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREH 300
G++ E++CLPN AK N ++ + + ++ H+S Y++A+WGT R+ H
Sbjct: 230 HVGVFYTASFTENSCLPNLAKSFNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRH 289
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
L TK F C+C RC+D TEL T++SA+KC + Q +LP + + W C C
Sbjct: 290 LMQTKLFKCACARCADVTELDTFYSALKCEDR---QCGGLMLPSKSDEWNGSWRCRECHK 346
Query: 361 RLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
++ V + + G+ + + + N + L E P H+H +K L+QL
Sbjct: 347 QVQKHYVEGILERAGKDIQSMEKTAENGL---KYLKHYEKWLPPQHFHMSEIKILLVQL 402
>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
Length = 514
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 122/418 (29%), Positives = 208/418 (49%), Gaps = 34/418 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC + +KCS C +V+YC QHQK+ WK HKP C P++I + LGR+L AT +
Sbjct: 4 CHVCEKPTKNKCSNCNQVSYCSVQHQKQDWKTHKPSCH--PFKIAHNEQLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+CLGC + D+ C +C WP C C L +
Sbjct: 62 KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117
Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC++ K + + Y + +RCL++ SP+K + +++E+ S
Sbjct: 118 --HKAECSLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLEST 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
R + ++ SI +++ F + S + + G +++NG ++
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ-------KFSEEEIMKAVGALQINGHEVPTTDPSH 228
Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
++ E++CLPN AK N ++ + + ++ H+S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F C+CERC D TEL T++SA+KC + Q +LP D + +W C C +
Sbjct: 289 MQTKLFRCACERCVDVTELDTFYSAIKCEDR---QCGGLMLPTKTDDWNGNWRCRECHKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ V + + G+ + + V++ + L E P H+H +K L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSM---EKTVENGLKYLKHYEKWLPPQHFHISEIKILLVQL 400
>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 513
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 130/411 (31%), Positives = 195/411 (47%), Gaps = 40/411 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I + +GRY+ A+ +L G+ I E P +VGPK A P+CL C+ P+L+
Sbjct: 19 YKILQNDKVGRYMVASKELEAGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPL 77
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
C +C WP CS C + EC + K + L + + E I PLR
Sbjct: 78 CSKCGWPVCSEECENQPQHKDY--ECQVFVQANEKFNVQAALEEANENGVPQLECITPLR 135
Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
L+ R+P+++ E+K+MEAH R +IV Y+R LD S
Sbjct: 136 LLLESERNPERWNDEIKNMEAHNKIRSQKKHWQSNQINIVDYIRKQL-------KLDRFS 188
Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-------NMMQSK 274
+ +H CG++E+N +I G + LY + LM H+C+ N HS +++
Sbjct: 189 EEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRLRTT 248
Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
L + + YT++L T RREHL KYF C+C RCSDPTELGT+ S++KC
Sbjct: 249 LKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRCSDPTELGTHVSSLKC--NKC 306
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
D G +L ++ LD DS W C C + V V + +VD + ++
Sbjct: 307 DNG--IVLSLDSLDPDSSWKCTHCEFTTSGSAVRKVLHIIQAEVDAVETISGADGADAIQ 364
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E ++ K ++ HP H ++HSL Q+YG Y L +LE KI
Sbjct: 365 ERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 415
Score = 38.9 bits (89), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +V+ P KA P+CL C+ P+L+ C +C WP CS C
Sbjct: 46 EMPFVVGP--KAFTYPLCLSCYATWPPSLSAKPLCSKCGWPVCSEEC 90
>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 513
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 130/411 (31%), Positives = 195/411 (47%), Gaps = 40/411 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I + +GRY+ A+ +L G+ I E P +VGPK A P+CL C+ P+L+
Sbjct: 19 YKILQNDKVGRYMVASKELEAGEEIVTEMPFVVGPK-AFTYPLCLSCYATWPPSLSAKPL 77
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
C +C WP CS C + EC + K + L + + E I PLR
Sbjct: 78 CSKCGWPVCSEECENQPQHKDY--ECQVFVQAKEKFNVQAALEEANENGVPQLECITPLR 135
Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
L+ R+P+++ E+KDMEAH R +IV Y+R LD S
Sbjct: 136 LLLESERNPERWNNEIKDMEAHNKIRSQKKHWQSNQINIVDYIRKQL-------KLDRFS 188
Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHS-------NMMQSK 274
+ +H CG++E+N +I G + LY + LM H+C+ N HS +++
Sbjct: 189 EEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVRLRTT 248
Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
L + + YT++L T RREHL KYF C+C RCSDPTELGT+ S++KC
Sbjct: 249 LKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRCSDPTELGTHVSSLKC--NKC 306
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
D G +L ++ LD +S W C C + V V + +VD + ++
Sbjct: 307 DNG--IVLSLDSLDPESSWKCTHCEFTTSGSAVRKVLHIIQAEVDAVETISGADGADAIQ 364
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E ++ K ++ HP H ++HSL Q+YG Y L +LE KI
Sbjct: 365 ERETVVKKYRSVLHPRHGFLTMLRHSLTQMYGRVDEYLLDDLPDVVLEHKI 415
Score = 38.9 bits (89), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +V+ P KA P+CL C+ P+L+ C +C WP CS C
Sbjct: 46 EMPFVVGP--KAFTYPLCLSCYATWPPSLSAKPLCSKCGWPVCSEEC 90
>gi|170029518|ref|XP_001842639.1| msta [Culex quinquefasciatus]
gi|167863223|gb|EDS26606.1| msta [Culex quinquefasciatus]
Length = 495
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 202/403 (50%), Gaps = 30/403 (7%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ P +GRY A+ DL G+ ++P VGP + A P+CL C P++ A AR
Sbjct: 6 YKVLECPEMGRYGVASRDLRAGEIAYEDTPFAVGPSVGSA-PLCLECACPVD-GCAGGAR 63
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
CPRC WP C + + + H EC + D + + I PLR L+
Sbjct: 64 CPRCGWPLCEKCGAEIEASVYHKAECELFAKHGSKFQNVEDSSEACVQLDCITPLRVLLA 123
Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ +P + E+ ME H ++R + ++V +LR + K D S + +
Sbjct: 124 KEANPDWWNAEIVMMEDHRAERDGNAFWKADQSNVVAFLRDSCGLK------DRCSEELI 177
Query: 230 HWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSN--------MMQSKLFVFRD 280
+ G+++VN + + +GLY + +M H+C+ N H+ + ++ + +
Sbjct: 178 QKVIGILDVNAFEAHTCSLRGLYPKMGIMAHSCVTNLAHTVHPSKGYRLIARAAVDIEEG 237
Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
+ T YT+ L GT+ R+ L TKYF C C+RC DPTELGT+FS+MKC + D G
Sbjct: 238 AMLCTTYTHLLAGTRTRQAELQRTKYFTCQCKRCLDPTELGTHFSSMKC--QKCDNG--L 293
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKSLEEMLTK 397
+ P D +++W C C +L + + ++D+L ++ P ++S E + K
Sbjct: 294 VESTKPTDEEAEWKCTHCEHKLKGVLIAKAIQVMQAEIDELAYMEYGPERLESFERVFKK 353
Query: 398 LEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ HP H+ +++HSLI+LYG PGY +L LLERKI
Sbjct: 354 FRSVLHPLHFINTSIRHSLIELYGRIPGYMMQELPDILLERKI 396
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+ + P++ + P+CL C P++ A ARCPRC WP C + + + H EC +
Sbjct: 36 FAVGPSVGSA--PLCLECACPVD-GCAGGARCPRCGWPLCEKCGAEIEASVYHKAECELF 92
>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
Length = 501
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 127/413 (30%), Positives = 208/413 (50%), Gaps = 49/413 (11%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
++++ S +GRY A+ DL PGD I E+P GPK + + +CLGCH P++
Sbjct: 8 FKVEKSETVGRYAVASEDLKPGDVIFSETPFAYGPK-SDSSCLCLGCHSPVDCAYL---- 62
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDY--KYEAILPLRCLIL 170
C C WP C C L H EC + K+ +T+ D +YE I PLR L+
Sbjct: 63 CSTCSWPVCGPECEQLK-VHKEN-ECQVFSNAKVKFQTVEDPTDVCLQYECITPLRMLLE 120
Query: 171 QRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKC 228
+ R PK+++E + MEAH +R I ++ + N +E + G +D S +
Sbjct: 121 KERDPKRWEEEISIMEAHNEERKS--------RPIWEFNQHNIVEYIRGPCKMDKYSEEL 172
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICL------MEHNCLPNAKHSN------------- 269
+H ICG++E+N + T Y++ CL + HNC+ N H+
Sbjct: 173 IHLICGILEINAFEA--RTPSCYAIRCLFPKLAILSHNCVSNIHHAVDCNGDGELKDCVV 230
Query: 270 MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+++ + + + + + YT +LW T RRE L +K+F+C+C RCS PTEL T+ S++KC
Sbjct: 231 TVRAAVDIPKGGELFSSYTFSLWPTLVRREFLRESKFFDCTCPRCSSPTELDTHMSSLKC 290
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN 387
+ D G I+ N D+ +W C C + N R V V + + ++D++ +
Sbjct: 291 --QRCDNG--VIISTNTADDACEWKCTHCDYKTNGRAVRKVFAAIQNEIDQVEYIGGPQG 346
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E + K ++ HP++ + ++ +LIQLYG GY L +LERKI
Sbjct: 347 IQQRETIFRKYRSVLHPNNSYMTTLRSALIQLYGKTEGYTLEDLPDVILERKI 399
>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
Length = 513
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 195/413 (47%), Gaps = 40/413 (9%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
+ Y+I + +GRY+ A +L G+ I E P IVGPK A P+CL C+ P+L +
Sbjct: 17 VTYKILQNDKVGRYMVANKELEAGEEIITEMPFIVGPKAA-TYPLCLSCYVSWPPSLNEK 75
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILP 164
C +C WP CS C + EC I K + L + + E I P
Sbjct: 76 PLCSKCGWPVCSTECENQPQHKDY--ECQIFAQANEKFNVQAALEETNENGVPQLECITP 133
Query: 165 LRCLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
LR L+ ++P++++ E+KDMEAH R + + +IV+Y+R LD
Sbjct: 134 LRLLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKLDQVNIVEYIRKQL-------KLDR 186
Query: 224 TSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
S K + +CG++E+N ++ G + +Y + LM H+C+ N HS R
Sbjct: 187 FSEKEIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRLR 246
Query: 280 DT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
T + YT++L T RREHL K+F C+C RCSDPTELGT+ S++KC
Sbjct: 247 TTLKIPPGGELYGSYTHSLLPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC--N 304
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPN 387
D G +L ++ LD +S W C C + V V + +VD +
Sbjct: 305 KCDNG--IVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQIIHMEVDAVEAISGADGADA 362
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E + K ++ HP H ++HSL Q+YG Y L +LE KI
Sbjct: 363 IQERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKI 415
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +++ P KA P+CL C+ P+L + C +C WP CS C
Sbjct: 46 EMPFIVGP--KAATYPLCLSCYVSWPPSLNEKPLCSKCGWPVCSTEC 90
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/418 (29%), Positives = 207/418 (49%), Gaps = 34/418 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC E +KCS C +V+YC QHQK+ WK HKP C P++I + LGR+L AT +
Sbjct: 4 CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKRHKPSCH--PFKIDHNEQLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+CLGC + D+ C +C WP C C L +
Sbjct: 62 KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117
Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC + K + + Y + +RCL++ SP+K + +++E+ S
Sbjct: 118 --HKAECRLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLEST 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ-- 248
R + ++ SI +++ F + ++ K + G +++NG ++
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEEIMKAV----GALQINGHEVPTTDPSH 228
Query: 249 -GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
++ E++CLPN AK N ++ + + ++ H+S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F C+C+RC D TEL T +SA+KC + Q +LP D + W C C +
Sbjct: 289 MQTKLFKCACDRCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADDWNGSWRCRECHKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ V + + G+ + + NV++ + L E P H+H +K L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSM---EKNVENGLKYLKHYEKWLPPQHFHMSEIKILLVQL 400
>gi|195399143|ref|XP_002058180.1| GJ15632 [Drosophila virilis]
gi|194150604|gb|EDW66288.1| GJ15632 [Drosophila virilis]
Length = 527
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 128/440 (29%), Positives = 208/440 (47%), Gaps = 48/440 (10%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+ + S L GRYL A L G+ + E PL +GP ++ EP+CLGC++P+ A R
Sbjct: 43 FRVAHSELFGRYLVANRQLAAGELLITEQPLAIGPCVS-CEPVCLGCYRPVQLT-AQQYR 100
Query: 116 CPRCFWPACSARCSGLSD---AHTHAPECAILK----LGCETLLAYNDYK-----YEAIL 163
C C WP C A C G+ HT EC I L E L + YE +L
Sbjct: 101 CAGCGWPLCGASCRGMQQRRLGHTE-EECEIYGQRRGLAAELLTTHATPDQVRDLYELVL 159
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
+R +L+ +P Y +++ ME+H + R +++ + +V+ LR ++ L
Sbjct: 160 IVRIALLRHLTPDLYAQIRRMESHTAARRENATLWQHYEQKVVRRLRDDW-------QLQ 212
Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQ 272
A LH ICG+++VN +IG++ + LY L+ H+C PN H++ +++
Sbjct: 213 QLEAAELHEICGILDVNCFEIGQHGAKARTLYPSAFLLAHDCSPNTAHTDDPLSYAILLR 272
Query: 273 SKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
+ V ++ Y L GT RR + K F C C RC+DP ELG+ SA+ C +
Sbjct: 273 TSRAVREQETLTLSYAYTLQGTLKRRSFIQGGKLFWCRCRRCADPRELGSDCSALVCRSC 332
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
I ++PLD +DW C C+ RL+A + + ++ + ++ + + ++ +LE
Sbjct: 333 QLGS----IRAIDPLDQSADWACDRCAHRLSAVQLERLLDRINDDLESI--DVHDIPALE 386
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------V 443
L + + P+HY + K+SL Q YG GY +LS + RK Y +
Sbjct: 387 NFLARYREVLRPNHYLLLSAKYSLCQTYGRIEGYLLPELSPQDIARKERYCREFLAVIDL 446
Query: 444 MSPNLKATDEPICLGCHKPL 463
+ P L I H P+
Sbjct: 447 LEPGLTRLRGVIMYELHAPI 466
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 27/51 (52%), Gaps = 5/51 (9%)
Query: 453 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD---AHTHAPECAI 500
EP+CLGC++P+ A RC C WP C A C G+ HT EC I
Sbjct: 82 EPVCLGCYRPVQLT-AQQYRCAGCGWPLCGASCRGMQQRRLGHTE-EECEI 130
>gi|170029520|ref|XP_001842640.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863224|gb|EDS26607.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 498
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 125/406 (30%), Positives = 203/406 (50%), Gaps = 36/406 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+++ P LGRY T L G+ + E P +GPK + + P+CL C P++ + A +
Sbjct: 6 FKVLECPELGRYAVTTKKLKAGEELFEEKPFAIGPK-SDSPPVCLECCVPVDGS-AFGPK 63
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
CP+C WP C C + H EC + +D + + I PLR L+
Sbjct: 64 CPQCGWPLCE-EC--VRSVVYHKAECDLFVKNKVKFQNVDDSTAGCVQLDCITPLRVLLA 120
Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ P+++ E+ M+ H +R T + ++V++LRS K D S + +
Sbjct: 121 KEADPERWNAEISRMQDHREERRGSTFWDADQKNVVEFLRSECGLK------DRCSEELI 174
Query: 230 HWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
+ G++EVN + G +GLY + +M H+C+PN HS ++ + V
Sbjct: 175 QQVIGILEVNAFEARTSSGYTIRGLYPKLAIMAHSCVPNVVHSIHPSKDYRLTARTAIDV 234
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ + T YT L GT R+ L TKYF C C+RC DPTELGT+FS++KC + D G
Sbjct: 235 EEGSKLYTTYTYTLSGTMVRQAALKSTKYFTCQCKRCLDPTELGTHFSSLKC--QKCDNG 292
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKSLEEM 394
I+ P D +++W C C +L + + ++D+L ++ P ++S E +
Sbjct: 293 --VIVSSKPTDEEAEWHCTHCEYKLKGAAMAKAIQVMQAEIDELAYMEYGPERLESFERV 350
Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
K ++ HP H+ ++++SLI+LYG PGY +L LLERKI
Sbjct: 351 FKKYRSVLHPLHFINTSIRNSLIELYGRIPGYMMPELPDILLERKI 396
>gi|170029522|ref|XP_001842641.1| msta [Culex quinquefasciatus]
gi|167863225|gb|EDS26608.1| msta [Culex quinquefasciatus]
Length = 489
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/407 (31%), Positives = 203/407 (49%), Gaps = 36/407 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+ + SP LGRY A +L G+ I E +GPK A P+CL C P++ AD +
Sbjct: 6 FTVLESPELGRYGVARRNLRAGEIIFEEQVFAIGPK-ASTSPLCLECASPVDGG-ADGPK 63
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYND--YKYEAILPLRCLIL 170
CP+C WP C C G H EC + K+ + + + + I PLR L+
Sbjct: 64 CPQCGWPLC-GECVG--SVVYHKGECELFVQHKVKFQNQQNSDGCCAQLDCITPLRVLLA 120
Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ P+++ E+ ME H ++R + +++V+YLR L + D S + +
Sbjct: 121 KEAHPERWNAEICMMEDHRAERCGSVYWNADQNNVVRYLR------LACGLKDRCSEELI 174
Query: 230 HWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
+ G++EVN + G +GLY + +M H+C+ N HS + ++ + V
Sbjct: 175 QQVIGILEVNAFEARTHRGYAVRGLYPKLAIMAHSCVRNVVHSIHPSKDYRLIARTAIDV 234
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ T YT A T R+ L +KYF+C CERC DPTELGTYFS++KC +K
Sbjct: 235 EEGGKLYTSYTFAQDPTHYRQTTLKDSKYFSCQCERCLDPTELGTYFSSLKCRKCNKGS- 293
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEM 394
++ P D +++W C SC L A + +VT + + V K+V ++S E+
Sbjct: 294 ---MVSSKPTDEEAEWHCTSCDRVLEADAMAIVTKVMQDDVGKIVYMESGTERLESYEQA 350
Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
K A+ P HY +++ SLI+LYG P Y +LS + LERKI+
Sbjct: 351 FAKYSAVLEPLHYLLTSIRQSLIELYGRVPEYRLHELSDAKLERKIT 397
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 21/59 (35%), Positives = 29/59 (49%), Gaps = 4/59 (6%)
Query: 443 VMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
V + KA+ P+CL C P++ AD +CP+C WP C C G H EC +
Sbjct: 35 VFAIGPKASTSPLCLECASPVDGG-ADGPKCPQCGWPLC-GECVG--SVVYHKGECELF 89
>gi|195133792|ref|XP_002011323.1| GI16059 [Drosophila mojavensis]
gi|193907298|gb|EDW06165.1| GI16059 [Drosophila mojavensis]
Length = 526
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 126/439 (28%), Positives = 216/439 (49%), Gaps = 43/439 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y ++ S L GRYL A L G+++ RE PL +GP ++ EP+CLGC++P+ + R
Sbjct: 37 YRVEHSELYGRYLVANRALAAGEQLIREQPLAIGPCVS-CEPVCLGCYQPVQLG-PEQCR 94
Query: 116 CPRCFWPACSARCSGLSDAHTHA-----------PECAILKLGCETLLAYNDYKYEAILP 164
C C WP C+A C GL + H+ P+ A L + A D YE +L
Sbjct: 95 CKGCGWPLCAADCRGLGQSSGHSREECELYGRRRPQAAELLKESASPDAVRDL-YELVLI 153
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDD 223
+R L++++ P+KY +++ ME+H +R +++ + +V+ L ++ +L ++
Sbjct: 154 VRILLMRKHQPEKYAKIRLMESHTEQRRENLTLWQHYEKKVVQRLLHDW--QLAQELSGL 211
Query: 224 TSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSN-------MMQS 273
A+ +H ICG+++VN +IG+ + LY L+ H+C PN H++ ++++
Sbjct: 212 EPAE-MHEICGILDVNCFEIGQRGGKARTLYPSAFLLAHDCRPNTSHTDDPLDYSIILRT 270
Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
V ++ Y L GT RR + K F C C+RC+DP ELG+ S + C
Sbjct: 271 SRSVREQETLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSDLVC---- 326
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
K I PL ++DW C C+ R+ + V + + + ++ + + ++ +LE
Sbjct: 327 KICRAGSIRATEPLKQEADWACDRCAHRIESVQVERLLDHINDDLESI--DVHDIPALEN 384
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY---------VM 444
L + + P+HY + K+SL Q+YG GY +LS +ERK Y ++
Sbjct: 385 FLIRYREVLRPNHYLLLSAKYSLCQIYGRVEGYLLPELSPQDIERKERYCREFLAVVDIL 444
Query: 445 SPNLKATDEPICLGCHKPL 463
P L I H P+
Sbjct: 445 EPGLTRLRGLIMYELHAPI 463
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS +LYG + L R+ + P + EP+CLGC++P+ +
Sbjct: 37 YRVEHS--ELYGRYLVANRALAAGEQLIREQPLAIGPCVSC--EPVCLGCYQPVQLG-PE 91
Query: 470 NARCPRCFWPACSARCSGLSDAHTHAPE 497
RC C WP C+A C GL + H+ E
Sbjct: 92 QCRCKGCGWPLCAADCRGLGQSSGHSRE 119
>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
Length = 513
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 126/411 (30%), Positives = 195/411 (47%), Gaps = 40/411 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I + +GRY+ A +L G+ I E P IVGPK A P+CL C+ P L +
Sbjct: 19 YKILQNDKVGRYMVANKELEAGEEIITEMPFIVGPKAA-TYPLCLSCYVSWPPTLNEKPL 77
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLR 166
C +C WP CS C + EC I K + L + + E I PLR
Sbjct: 78 CSKCGWPVCSTECENQPQHKDY--ECQIFAQANEKFNVQAALEETNENGVPQLECITPLR 135
Query: 167 CLILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
L+ ++P++++ E+KDMEAH R + + +IV+Y+R L+ S
Sbjct: 136 LLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKSDQINIVEYIRKQL-------KLERFS 188
Query: 226 AKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSK 274
+ + +CG++E+N ++ G + +Y + LM H+C+ N HS +++
Sbjct: 189 EEQIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIRLRTT 248
Query: 275 LFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
L + + YT++L T RREHL K F C+C RCSDPTELGT+ S++KC
Sbjct: 249 LKIPVGGELYGSYTHSLLPTMLRREHLLEGKNFACACARCSDPTELGTHMSSLKC--NKC 306
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVK 389
D G +L ++ LD +S W C C + V V + +VD + ++
Sbjct: 307 DNG--IVLSLDSLDPESSWKCTHCEFSTSGSAVRKVLQIIQMEVDAVEAISGADGADAIQ 364
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
E + K ++ HP H ++HSL Q+YG Y L +LE KI
Sbjct: 365 ERETVAKKYRSVLHPRHAFLTMLRHSLTQMYGRVEDYLLEDLPDVVLEHKI 415
Score = 39.3 bits (90), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 2/47 (4%)
Query: 439 KISYVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARC 485
++ +++ P KA P+CL C+ P L + C +C WP CS C
Sbjct: 46 EMPFIVGP--KAATYPLCLSCYVSWPPTLNEKPLCSKCGWPVCSTEC 90
>gi|195383960|ref|XP_002050693.1| GJ20068 [Drosophila virilis]
gi|194145490|gb|EDW61886.1| GJ20068 [Drosophila virilis]
Length = 515
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 128/442 (28%), Positives = 212/442 (47%), Gaps = 35/442 (7%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC +KCS C +V+YC QHQK+ WK HK +C P++I + LLGR+L AT ++
Sbjct: 4 CQVCETATKNKCSNCNQVSYCSVQHQKQDWKAHKSQC--YPFKIAQNELLGRHLVATRNI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+C+GC + PN D+ C +C WP C C L +
Sbjct: 62 KPYEIVIKEAPLMRGPS-QISAPVCMGCLNCIEPN--DHITCDKCGWPLCGPECHALDE- 117
Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC + + + + +N Y + +RCL++ +P + K +E
Sbjct: 118 --HRAECQLTQERGQKVNVQEFNGPHPLYTCVSTVRCLLIGETNPANAAKFKQLETLEQT 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
R + ++ SI +++ F + S + + G +++NG ++
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGALQINGHEVPTSDPPH 228
Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
++ E++C+PN AK N M+ + + +++H+S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFNKHGHCMLWAPKEIKKNSHLSICYSDAVWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F CSCERC D TELGT +SA+KC + K G +LP+ D W C C +
Sbjct: 289 MQTKLFKCSCERCLDVTELGTNYSALKC-EDRKCPG--LLLPIKADDWHGSWRCRECQKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
+ + V + + + + + + N + L E HYH VK +QL
Sbjct: 346 VQRKYVDGILERASQDIQSMEKTAENGF---KYLKHYEKWLPTQHYHLSEVKILQVQLIA 402
Query: 422 TQPGYAYTQLSSSLLERKISYV 443
LS L+ K+ Y
Sbjct: 403 KDQQELMI-LSDEHLQLKLKYA 423
>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
Length = 513
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 122/418 (29%), Positives = 205/418 (49%), Gaps = 34/418 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC E +KCS C +V+YC QHQK+ WK+HKP C P++I + LGR+L AT +
Sbjct: 4 CHVCEEPTKNKCSNCNQVSYCSVQHQKQDWKVHKPSCH--PFKIAHNEQLGRHLVATRTI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+CLGC + D+ C +C WP C C L +
Sbjct: 62 KPYEIVLKEAPLVRGPA-QISAPVCLGCLNGIEAE--DHIECEQCGWPLCGPECKSLDE- 117
Query: 135 HTHAPECAILKLGCETL----LAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC + K + + Y + +RCL++ S +K + +D+E+ S
Sbjct: 118 --HKAECGLTKDRGQKVNVQEFGGPHPLYTCLSTVRCLLIGETSTEKASKFQDLESLEST 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
R + ++ SI +++ F + ++ K + G +++NG ++
Sbjct: 176 RRGSNQWKADLVSIGQFIPKFFKTQ---KFTEEEIMKAV----GALQINGHEVPTTDPSH 228
Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
++ E++CLPN AK N ++ + + ++ H+S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F C+CERC D TEL T +SA+KC + Q +LP + + W C C +
Sbjct: 289 MQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLPTKADEWNGSWRCRECHKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ V + + G+ + + + N + L E P H+H +K L+QL
Sbjct: 346 VQKHYVERILERAGKDIQSMEKIAENGL---KYLKHYEKWLPPQHFHMSEIKILLVQL 400
>gi|91088043|ref|XP_966391.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012080|gb|EFA08528.1| hypothetical protein TcasGA2_TC006181 [Tribolium castaneum]
Length = 492
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 122/405 (30%), Positives = 195/405 (48%), Gaps = 39/405 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+ IK + +GR+ A+ L PGD I E+P GPK + + P+CLGC P++ +
Sbjct: 5 FTIKRNEKVGRFAVASRALKPGDLIFTETPFAYGPK-SDSPPLCLGCFAPVDCTVL---- 59
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
C C WP C C EC + A D +YE I PLR L++
Sbjct: 60 CSHCSWPVCGPECEAKPSHRDF--ECGVFASAGVKFQAVEDPTNVCLQYECITPLRVLLM 117
Query: 171 QRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKC 228
+ + PK++QE ++ ME+H R + I ++ + N ++ L G LD + +
Sbjct: 118 KEKDPKRWQEEIEAMESHNDVR--------KTKPIWEFNQVNIVDYLKGPCKLDRFADEL 169
Query: 229 LHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFV 277
+H +CG++E+N + G + L+ + ++ HNC+ N HS + + + V
Sbjct: 170 IHTVCGILEINAFEARASSGYLIRCLFPKLAILSHNCVSNVHHSVDCDTFQVSVSASVEV 229
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ + YT +LW T RRE L +KYF C C+RC D TELGT+ +KC D G
Sbjct: 230 PESGELFSSYTYSLWPTLVRREFLKESKYFECMCDRCKDKTELGTHLGTLKC--NKCDNG 287
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPNVKSLEEML 395
IL +PL +W C C + +A V V + ++D +++ ++ E +
Sbjct: 288 --VILSTDPLSFTCEWKCTHCEFKTSAIAVRKVYEAVQTEIDAVEMLSGPEGIEQREAIF 345
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
K ++FHP + + ++ SL QLYG GY L LLERK+
Sbjct: 346 RKYRSVFHPKNAYMTILRTSLSQLYGRAEGYTVEDLPDLLLERKV 390
>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 483
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 125/399 (31%), Positives = 188/399 (47%), Gaps = 40/399 (10%)
Query: 68 LQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSAR 127
+ A +L G+ I E P ++GPK A P+CL C P D + C +C WP C
Sbjct: 1 MVANRELRAGEEIITEMPFVIGPK-ACTYPLCLSCFTPWPLEPDDKSLCSKCGWPVCGEE 59
Query: 128 CSGLSDAHTHAPECAIL-----KLGCETLLAYNDY----KYEAILPLRCLILQRRSPKKY 178
C + EC + K + L N + E I PLR L+ R+ +++
Sbjct: 60 CENAPQHKDY--ECQVFAQANEKFNVDAALDGNSENGVPQLECITPLRLLLESERNVERW 117
Query: 179 -QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
+E+KDMEAH R ++ + +IV YLR LD S K + ICG++E
Sbjct: 118 NKEVKDMEAHNKTRCQKSQWKSDQINIVDYLRKRL-------KLDRFSEKYIQTICGILE 170
Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHISTM 286
+N ++ G +GLY + +M H+C+ N HS +++ L + +
Sbjct: 171 INTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLKIPAGGELYAS 230
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YT++L T RREHL K+F C+C RCSDPTELGT+ S++KC D G +LP++
Sbjct: 231 YTHSLLPTILRREHLLEGKHFACACPRCSDPTELGTHMSSLKC--NKCDNG--IVLPLDS 286
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSLEEMLTKLEAM 401
LD++S W C C N + V + + QVD + E ++ K +
Sbjct: 287 LDSESTWKCTHCDFSTNGQAVRKILRIIQAQVDAAEAISGADGADAIYKRETVMKKYRLV 346
Query: 402 FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
HPHH ++HSL Q+YG Y L +LE K+
Sbjct: 347 LHPHHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKV 385
>gi|194762638|ref|XP_001963441.1| GF20287 [Drosophila ananassae]
gi|190629100|gb|EDV44517.1| GF20287 [Drosophila ananassae]
Length = 528
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 203/409 (49%), Gaps = 36/409 (8%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P+ I+ S + GRYL A+ L G+ + E PL +GP ++ +P+CLGC++P++ + +
Sbjct: 44 PFRIEHSDIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDPVCLGCYQPVSLD-STQY 101
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPE----------CAILKLGCETLLAYNDYKYEAILP 164
RCP C WP C C G+ H H E A L T A YE ++
Sbjct: 102 RCPGCQWPLCRDSCPGIHQPHGHTSEECQLYGERRAVAAGLLTDRTGPAEVRDLYELVMI 161
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDD 223
+R L+L++R P+++ ++ ME+H +R +++ + +V+ LR + L+D
Sbjct: 162 VRILLLRQRDPEQFAAIERMESHTEERRKNQVLWQHYEQKVVQRLREVW-------QLED 214
Query: 224 TSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
+ +H +CG+++VN +IG+ + LY L+ H+C PN H++ +S + R
Sbjct: 215 LQPEQIHQVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCSPNTAHTDHPESYAIILRT 274
Query: 281 TH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
+ ++ Y L GT RR + K F C C+RCSDP ELG+ SA+ C
Sbjct: 275 SRRVREREALTLSYAYTLQGTLKRRSFMHEGKLFWCVCQRCSDPRELGSDCSALVC--PV 332
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
QG + PL+ + W C C ++ A+D+ ++ ++ + + ++ LE
Sbjct: 333 CRQGS--LRAQEPLNQEGSWGCDKCDHKMLAKDLESRLDRINNDLEAI--DVHDIPGLEN 388
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
L + + P+HY + K+SL Q+YG GY Q+S + RK Y
Sbjct: 389 FLVRYREVLRPNHYLLLSAKYSLCQIYGRIEGYLLPQMSPEDIARKERY 437
Score = 45.8 bits (107), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 1/48 (2%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPE 497
+ +P+CLGC++P++ + + RCP C WP C C G+ H H E
Sbjct: 81 VSGDPVCLGCYQPVSLD-STQYRCPGCQWPLCRDSCPGIHQPHGHTSE 127
>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 519
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 128/437 (29%), Positives = 202/437 (46%), Gaps = 56/437 (12%)
Query: 8 EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
E EE+ CAVC+ A KC CK V YCG++HQK HW+ H KCK Y++ + LGR+
Sbjct: 6 EREEV--CAVCKVPAKQKCGGCKSVFYCGREHQKAHWREHSAKCKS--YKLVENEKLGRH 61
Query: 68 LQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSAR 127
A + G+ + RE +V PICLGC+KPL + A C C WP C A
Sbjct: 62 YVAIRRIEAGEVVIREDDALVQAPQQETVPICLGCYKPLESDTAKP--CELCGWPLC-AS 118
Query: 128 CSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQRRSPKKYQELK 182
C+ +H EC K +T + + Y+ I +R L L+ +P+ Y+
Sbjct: 119 CT------SHGAECDFTKRHRDTKVTITGFGIAHPTYKCIGVVRALALRETNPEAYERFT 172
Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNG-- 240
+ + EE I ++++ F + L D S + G++++NG
Sbjct: 173 KL---------ASGELEEPREIARFIKRFF------NKLHDFSEDEIARAAGIMQINGHE 217
Query: 241 ------VDIGRYTQGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYT 288
V I Y + Y +EH+C N S ++++ + + + HI YT
Sbjct: 218 VPISEPVLIAVYDESSY-----VEHSCRANCSKSFTSSGGIVIRAAMEIEKGEHIMICYT 272
Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLD 348
+ LWGT RR HL TK+F C+C RCSDP E G+ F+A++C D C + +
Sbjct: 273 DPLWGTANRRHHLLRTKFFECTCPRCSDPAEFGSMFNAIEC--SKSDCSGCMLPKSFTVQ 330
Query: 349 NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYH 408
D+ C C ++ V + Q+G+++ + + + + K + H +HY+
Sbjct: 331 EIPDYRCNKCDNSRSSNSVDEMLEQIGKEMASMRKNDAG--ACRSFARKHASKLHDNHYY 388
Query: 409 CYAVKHSLIQLYGTQPG 425
VK +L Q+ G Q G
Sbjct: 389 MTDVKLALAQMIGQQTG 405
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/420 (28%), Positives = 203/420 (48%), Gaps = 34/420 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC+ +KCS C +V+YC QHQK+ WK HK C P+ + + LLGR+L AT ++
Sbjct: 4 CNVCQAETKNKCSNCNQVSYCSVQHQKQDWKSHKSNCH--PFRVAHNELLGRHLVATRNI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+C+GC + PN D+ C +C WP C C L +
Sbjct: 62 KPYEIVLKEAPLMRGPA-QISVPVCMGCLNAIEPN--DHITCDKCGWPLCGPECQALDE- 117
Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC + + + + +N Y + +RCL+L +P ++ + +E+
Sbjct: 118 --HKAECQLTEARGQKVNVQEFNGPHPLYTCVSTVRCLLLSETNPANAEKFQQLESLEQT 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
R + ++ SI +++ F + S + + G +++NG ++
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGALQINGHEVPTSDPPH 228
Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHL 301
++ E++C+PN S ++ + ++S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F C+CERC D TELGTY+SA+KC + K G +LP+ D + W C C +
Sbjct: 289 MQTKLFKCTCERCVDVTELGTYYSALKC-EDRKCTG--LLLPIKADDWNGCWRCRDCQKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
+ + V + + + + + + N + L E P HYH VK +QL
Sbjct: 346 VQRKYVDGILERAIKDIQSMEKTAENGF---KYLKHYEKWLPPQHYHLSEVKILQVQLIA 402
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 5/47 (10%)
Query: 454 PICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAI 500
P+C+GC + PN D+ C +C WP C C L + H EC +
Sbjct: 83 PVCMGCLNAIEPN--DHITCDKCGWPLCGPECQALDE---HKAECQL 124
>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
Length = 515
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 214/441 (48%), Gaps = 35/441 (7%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C VC +KCS C +VAYC QHQKE WK HK C P++I + +LGR+L AT ++
Sbjct: 4 CHVCETPTKNKCSNCNQVAYCSVQHQKEDWKAHKIHCH--PFKIAKNEVLGRHLVATRNI 61
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDA 134
P + + +E+PL+ GP ++ P+C+GC + N D+ C +C WP C C L++
Sbjct: 62 KPYEIVLKEAPLMRGPA-QISAPVCMGCLNCIEAN--DHISCEKCGWPLCGPECPSLTE- 117
Query: 135 HTHAPECAILKLGCE--TLLAYNDYK--YEAILPLRCLILQRRSPKKYQELKDMEAHMSK 190
H EC + + + + +N Y + +RCL+L +P ++ +E+
Sbjct: 118 --HRAECQLTEERGQKVNVTEFNGPHPLYTCVSTVRCLLLNTTNPANAEKFNQLESLEQT 175
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--- 247
R + ++ SI +++ F + S + + G++++NG ++
Sbjct: 176 RRGSNQWKADLASIGQFIPKFFRTQ-------KFSEEEIMRAVGILQINGHEVPISDPPH 228
Query: 248 QGLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHL 301
++ E++C+PN AK N ++ + + +++H+S Y++A+WGT R+ HL
Sbjct: 229 VAVFYTASFTENSCVPNLAKSFNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHL 288
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSAR 361
TK F C+CERC D TELGT +SA+KC + K G +LP D + W C C +
Sbjct: 289 MQTKLFKCACERCLDVTELGTNYSAIKC-EDRKCTGQ--LLPSKADDWNGSWRCLDCQKQ 345
Query: 362 LNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
+ + + + + + + + + N + L E P HYH VK +QL
Sbjct: 346 VQRKYIDGILERASKDIQNMEKTAENGF---KYLKHYEKWLPPQHYHLSEVKILQVQLIA 402
Query: 422 TQPGYAYTQLSSSLLERKISY 442
LS L+ K+ Y
Sbjct: 403 KDQKELMI-LSDEQLQLKLKY 422
>gi|242019692|ref|XP_002430293.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212515408|gb|EEB17555.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 501
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 132/428 (30%), Positives = 209/428 (48%), Gaps = 55/428 (12%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
++I + GRY A D PGD I +ESP GPK + + P+CLGCH+ ++ +
Sbjct: 8 FDILKNDKFGRYAVALQDYEPGDLIIQESPFTYGPK-SDSPPVCLGCHRKVDGKIL---- 62
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK--------YEAILPLRC 167
C C WP C +C H+ EC I + + K YE I PLR
Sbjct: 63 CSTCGWPVCDEKCEK-EPIHSEN-ECIIFS---SVKVRFTKVKNPQGNCPQYECITPLRV 117
Query: 168 LILQRRSPKKYQ-ELKDMEAHMSKRGPGTEVY--EEIDSIVKYLRSNFLEKLPGDVLDDT 224
L+ + + P+ + ++ +E+H +R T+++ E I+++ FL K+ + +
Sbjct: 118 LLGKEKFPEIWNSQISKLESHNEERQK-TDIWKVESINTV------GFLHKV-CKLQERF 169
Query: 225 SAKCLHWICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKH-----------SN 269
S+ +H+ CG++E N G + LY L H+C+PN H SN
Sbjct: 170 SSDLIHFACGILETNAFSSPSSYGYDMRYLYPKSSLFLHSCIPNVGHYIFKNSNDGERSN 229
Query: 270 MM--QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
++ ++ L + + + YT LW T RR HL K+F+C C+RCSDPTEL T FS +
Sbjct: 230 VIHVRASLKIKKGQEFTLSYTQTLWPTLLRRSHLKEGKFFDCCCQRCSDPTELKTNFSTI 289
Query: 328 KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN-- 385
KC N + + WIL +PL D+ W C SC+ +L V + S + ++V+ L + +
Sbjct: 290 KCSNCY----NGWILNTDPLVVDAPWNCTSCNWKLTGDSVSKLFSNIQQEVENLNKLDNY 345
Query: 386 -PNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS--Y 442
V+ E + K +++ HP++ VK+SL QLYG Y L LLE+K+ Y
Sbjct: 346 LDPVEIRENFIKKYKSLLHPNNGFLIIVKYSLAQLYGQAKEYFLDDLPDILLEKKLKLCY 405
Query: 443 VMSPNLKA 450
+ LK
Sbjct: 406 TLDEVLKT 413
>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
Length = 504
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 126/410 (30%), Positives = 202/410 (49%), Gaps = 47/410 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ S + GR+L A + G+ + +E PL +GPK +CL C++P RC
Sbjct: 14 VECSRIYGRFLVANEPVQRGELLVQEQPLAIGPKCN-GYVLCLACYRP-----DVQERCS 67
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLL-----AYNDYKYEAILPLRCLILQR 172
C WP CS C G + ECA+L+L + + + + I+PLR L+ +
Sbjct: 68 LCGWPLCSL-CDGNASNQHLELECAVLQLAKARFYEMPAGSLHWPQLDCIMPLRVLLAKE 126
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
P+++ E+ ME H R +V+ + +I +YLR P + + S + +
Sbjct: 127 AQPERWLNEVAPMEHHEQARRENADVWHADRVNIAQYLRG------PCKLAERFSEELIM 180
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSN--------MMQSKLFV 277
+ GV+EVN + R +QG LY ++ HNC+PN + +++ + +
Sbjct: 181 QVVGVLEVNAFE-ARTSQGYPLRCLYPYTGILAHNCVPNTARTIHPSEGYKIRLRAMVAL 239
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ YT L GT R+ HL KYF CSCERC D +EL T+FS+MKC G
Sbjct: 240 EAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFSSMKC-------G 292
Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE- 393
C W+LP +P +S W C +C ++ +V + + L +V+ + + K LEE
Sbjct: 293 QCTEGWLLPKHPTAMNSSWNCRACDHSTSSEEVRQIVAALQAEVNAVQALDMGPKRLEES 352
Query: 394 --MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
+L K +++ HP HY +++ LI++YG GY QL+ LLERK S
Sbjct: 353 ERLLRKYKSLLHPSHYIATSLRQLLIEMYGRVQGYEMVQLTDQLLERKAS 402
>gi|158297510|ref|XP_317734.4| AGAP007779-PA [Anopheles gambiae str. PEST]
gi|157015230|gb|EAA12332.4| AGAP007779-PA [Anopheles gambiae str. PEST]
Length = 498
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 129/404 (31%), Positives = 200/404 (49%), Gaps = 31/404 (7%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ P LGRY A DL G+ + E+P VGPKL + P+CL C P++ R
Sbjct: 6 YKVLECPELGRYGVAARDLKAGELLFEETPFAVGPKLD-SPPLCLECCCPVDGG-EGGPR 63
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----YKYEAILPLRCLIL 170
C RC WP C +G ++ H EC + D + + I PLR L+
Sbjct: 64 CSRCGWPLCEDCSAGGAELVYHRGECEVFAANGVRFRPVEDSTAGCVQLDCITPLRVLLA 123
Query: 171 QRRSPKKYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ +++ E++ ME H ++R G + ++IV +LR P + SA+ +
Sbjct: 124 KESDEARWKREIEPMEYHDAERREGANWKVDENNIVAFLRG------PCGLGKRFSAELV 177
Query: 230 HWICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKHS----NMMQSKLFV--FR 279
G+++VN + G +GLY + +M HNC+PN HS N M +++ V
Sbjct: 178 QRAIGLLDVNAFEGRTGSGYSCRGLYPQLAIMAHNCVPNVVHSIHPSNGMVARVAVDVKE 237
Query: 280 DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC 339
+ T YT L GT R+ L +K+F C C RC DPTELGT+FS + C K G
Sbjct: 238 GDKLYTTYTYTLTGTVARQSILKASKFFTCRCTRCLDPTELGTHFSTLLC---SKCSGG- 293
Query: 340 WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEMLT 396
I +PLD +++W CG C + + V + ++D+L E +++ E +
Sbjct: 294 LITSTDPLDENAEWKCGQCGFKTSGAAVQKAVMTIHNEIDELACFEYEAGRLEAYETVYK 353
Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
K ++ HP H+ +++HSLI+LYG PGY +L LLERK+
Sbjct: 354 KYRSVLHPLHFINTSIRHSLIELYGRIPGYEMAELPDVLLERKV 397
>gi|198467928|ref|XP_001354556.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
gi|198146172|gb|EAL31609.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 203/410 (49%), Gaps = 36/410 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+ I S + GRYL A+ L G+ + E PL +GP ++ + +CLGC++P+ + R
Sbjct: 46 FHIAQSEIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDAVCLGCYQPVGLG-REQYR 103
Query: 116 CPRCFWPACSARCSGLSDAHTHAPE-CAIL---KLGCETLLAYNDYK--------YEAIL 163
CP C WP C A C GL H E CA+ + LL ND YE ++
Sbjct: 104 CPGCGWPLCGASCPGLGRRIGHTTEECALYGERRPAAAELL--NDRTSPPRVRDLYELVM 161
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
+R L+L+ +P++Y + ME+H +R +++ + +V+ LR + KL D+
Sbjct: 162 IVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEW--KL-DDLAA 218
Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
+ A+ +H ICG+++VN +IG+ + LY L+ H+C PN H++ +S + R
Sbjct: 219 EEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPNTAHTDDPRSYAILLR 278
Query: 280 DTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
+ ++ Y L GT RR + K F C C+RC+DP ELG+ SA+ C +
Sbjct: 279 TSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALVCRSC 338
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
K + V PL+ +DW C C R A D+ ++ ++ + + +V LE
Sbjct: 339 LKGT----VRAVTPLEQTADWACDRCDHRTTALDLENQLDRINNDLEAI--DVHDVPGLE 392
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
L + P+HY + K+SL Q+YG GY ++S + + RK Y
Sbjct: 393 NFLLRYRETLRPNHYLLLSAKYSLCQIYGRIKGYLLPEMSLNDIVRKERY 442
Score = 42.7 bits (99), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 500
+ + +CLGC++P+ + RCP C WP C A C GL H ECA+
Sbjct: 82 VSGDAVCLGCYQPVGLG-REQYRCPGCGWPLCGASCPGLGRRIGHTTEECAL 132
>gi|195117408|ref|XP_002003239.1| GI23580 [Drosophila mojavensis]
gi|193913814|gb|EDW12681.1| GI23580 [Drosophila mojavensis]
Length = 507
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 129/415 (31%), Positives = 197/415 (47%), Gaps = 58/415 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A+ + G+ + E P +GPK +CL C++P + RC
Sbjct: 14 VEWSPIYGRYLTASEPVKRGELLIEEQPFALGPKCN-GYVLCLNCYRP-----EVHERCA 67
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA------------ILPL 165
C WP C G + L+L C L A YE I+PL
Sbjct: 68 LCGWPLCGPCSDGEEEEENQH-----LQLECSVLRAARARFYEMSAGSKYWPQLDCIMPL 122
Query: 166 RCLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDD 223
R L+ + P+++ E+ ME H +R +V+ + +I +YLR P + D
Sbjct: 123 RVLLAKEAQPERWLNEVAPMEHHELERRENADVWHADRVNIAQYLRG------PCKLADR 176
Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSN--------M 270
S + + + GV+EVN + R +QG L+ ++ HNC+PN +
Sbjct: 177 FSEELIMQVVGVLEVNAFE-ARTSQGYPLRCLFPYTGILAHNCVPNTARTIHPSEGYKIR 235
Query: 271 MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
+++ + + + YT L GT R+ HL KYF CSCERC D +EL T+FS++KC
Sbjct: 236 LRAMVALEEGQQLQHSYTYTLDGTAQRQAHLREGKYFVCSCERCLDASELQTHFSSLKC- 294
Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
G C W+LP P DS W+C SC +A DV + L +V+ + +
Sbjct: 295 ------GQCTEGWLLPKQPTVVDSSWSCRSCDYSTSAEDVRQIVDALQAEVNAVQALDMG 348
Query: 388 VKSLEE---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
K LEE +L K +++ HP HY +++ LI++YG GY QL+ LLERK
Sbjct: 349 PKRLEEAERLLRKYKSLLHPSHYIATSLRQLLIEMYGRVQGYEMVQLTDQLLERK 403
>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
Length = 510
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 123/401 (30%), Positives = 189/401 (47%), Gaps = 41/401 (10%)
Query: 66 RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACS 125
RY+ + +L G+ + E P +VGPK A P+CL C+ P P D C C WP C
Sbjct: 27 RYMVTSRELQAGEELVTEMPFVVGPK-ACTYPLCLSCYTPWPPEPDDKQLCSTCGWPVCG 85
Query: 126 ARCSGLSDAHTHAP-ECAIL-----KLGCETLLAYNDY---KYEAILPLRCLILQRRSPK 176
C +A H EC + K + L N+ + E I PLR L+ R+ +
Sbjct: 86 QECE---NALQHKDYECQVFAQANEKFNIDAALQNNENGVPQLECITPLRLLLESERNVE 142
Query: 177 KY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
++ +E+KDME H R + + +IV YLR LD S + + +CG+
Sbjct: 143 RWNKEIKDMEPHNKIRCQKAQWKSDHVNIVDYLRKRL-------KLDRFSEEHIQTVCGI 195
Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHIS 284
+E+N ++ G +GLY + +M H+C+ N HS +++ L + +
Sbjct: 196 LEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIRLRTTLKIPAGGELY 255
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT++L T RREHL K+F C+C RCSDPTELGT+ S++KC D G +L +
Sbjct: 256 ASYTHSLLPTMLRREHLLEGKHFACACPRCSDPTELGTHMSSLKC--NKCDNG--IVLSL 311
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL-----VQENPNVKSLEEMLTKLE 399
+ LD++S W C C + V V + +VD + E ++ K
Sbjct: 312 DSLDSESKWKCTHCDFSTTGQAVRKVFRIIQAEVDAAEAISGADGADAIHERETIMKKYR 371
Query: 400 AMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ HP H ++HSL Q+YG Y L +LE K+
Sbjct: 372 SVLHPQHAFLSMLRHSLTQMYGRVDEYLLDDLPDVVLEHKV 412
>gi|195169991|ref|XP_002025797.1| GL18252 [Drosophila persimilis]
gi|194110650|gb|EDW32693.1| GL18252 [Drosophila persimilis]
Length = 533
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 202/410 (49%), Gaps = 36/410 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+ I S + GRYL A+ L G+ + E PL +GP ++ + +CLGC++P+ + R
Sbjct: 46 FHIAQSEIYGRYLVASRQLEAGELLITEEPLAIGPCVS-GDAVCLGCYQPVGLG-REQYR 103
Query: 116 CPRCFWPACSARCSGLSDAHTHAPE-CAIL---KLGCETLLAYNDYK--------YEAIL 163
CP C WP C A C GL H E CA+ + LL ND YE ++
Sbjct: 104 CPGCGWPLCGASCPGLGRRIGHTTEECALYGERRPAAAELL--NDRTSPPRVRDLYELVM 161
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD 222
+R L+L+ +P++Y + ME+H +R +++ + +V+ LR + KL D+
Sbjct: 162 IVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEW--KL-DDLAA 218
Query: 223 DTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
+ A+ +H ICG+++VN +IG+ + LY L+ H+C PN H++ +S + R
Sbjct: 219 EEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPNTAHTDDPRSYAILLR 278
Query: 280 DTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
+ ++ Y L GT RR + K F C C+RC+DP ELG+ SA+ C +
Sbjct: 279 TSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALVCRSC 338
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLE 392
K + V PL+ +DW C C R A D+ ++ ++ + + +V LE
Sbjct: 339 LKGT----VRAVTPLEQTADWACDRCDHRTKALDLENQLDRINNDLEAI--DVHDVPGLE 392
Query: 393 EMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
L + P+HY + K+SL Q+YG GY ++S + RK Y
Sbjct: 393 NFLLRYRETLRPNHYLLLSAKYSLCQIYGRIKGYLLPEMSLHDIVRKERY 442
Score = 42.7 bits (99), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 28/52 (53%), Gaps = 2/52 (3%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAP-ECAI 500
+ + +CLGC++P+ + RCP C WP C A C GL H ECA+
Sbjct: 82 VSGDAVCLGCYQPVGLG-REQYRCPGCGWPLCGASCPGLGRRIGHTTEECAL 132
>gi|195050973|ref|XP_001993006.1| GH13590 [Drosophila grimshawi]
gi|193900065|gb|EDV98931.1| GH13590 [Drosophila grimshawi]
Length = 504
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 135/417 (32%), Positives = 200/417 (47%), Gaps = 61/417 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ S L GRYL A+ G+ + E PL +GPK +CL C++P P RC
Sbjct: 14 VEWSRLYGRYLVASGSATHGELLVEELPLAIGPKCN-GPVVCLVCYRPDVPE-----RCI 67
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE------------AILPL 165
C WP CSA C+ ++A+ HA +L C LLA YE +I+PL
Sbjct: 68 HCGWPLCSA-CNE-TEANLHA------QLECPLLLASRARFYEMPTGSRYWPQLDSIMPL 119
Query: 166 RCLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDD 223
R L+ + P+++ E+ ME H R +++ + +I +YLR P + +
Sbjct: 120 RVLLAKEAQPERWLNEVAPMEHHEEARRENADIWHADRVNIAQYLRG------PCKLAER 173
Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHS----NMMQSK 274
S + + + GV+EVN + R G LY ++ HNC+PN S + +
Sbjct: 174 FSEELIMQVVGVLEVNAFE-ARTINGYPLRCLYPYTGILAHNCVPNTARSIHPSEGYKVR 232
Query: 275 LFVFRD----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
L D + YT L GT R+ HL KYF CSC+RC D +EL T+FS++KC
Sbjct: 233 LRAMVDLEAGQQLQHSYTYTLDGTVQRQAHLQQGKYFRCSCKRCLDASELQTHFSSLKC- 291
Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
G C W+LP P +S W C +C +++DV + L +V+ +
Sbjct: 292 ------GQCTEGWLLPKQPTALESSWQCNACDNTTSSQDVREIVENLQAEVNGVQALEMG 345
Query: 388 VKSLEE---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKIS 441
K LEE +L K +++ HP HY ++ LI++YG GY QLS LLERK S
Sbjct: 346 AKRLEESERLLRKYKSLLHPSHYIATGLRQLLIEMYGRVQGYEMVQLSDQLLERKSS 402
>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
Length = 519
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 201/435 (46%), Gaps = 43/435 (9%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQATLD 73
C VC AL +C+ C++V YCG+ HQ+ WK H+ +C++ +++ + LGR+L AT
Sbjct: 6 CRVCSAPALQRCAGCQQVGYCGRDHQRADWKAQHRDQCRR--FKVVRNDRLGRHLVATRH 63
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHKPL------NPNLADNARCPRCFWPACSAR 127
+ G+ I R+ P VGPK+A P+CLGC++ L L C RC WP C
Sbjct: 64 IKQGEIIYRDEPYAVGPKIANV-PLCLGCNRNLMAGWDATRGLDRFHECSRCGWPLCGPG 122
Query: 128 CSGLSDAHTHAPECAILK-LGCETLLAYN-------DYKYEAILPLRCLILQRRSPKKYQ 179
C ++ H PEC++L G + N + Y I+PLR L+L+R +P++Y
Sbjct: 123 CEEVAQ---HRPECSVLAGSGYRPNIRPNPSNPEQRESAYCVIVPLRVLLLERIAPERYA 179
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV-LDDTSAKCLHWICGVIEV 238
++ E+H+ +R + LRSN + L + L S + + + +++
Sbjct: 180 TVQGFESHLDER--------LASPLYGVLRSNLVPFLRQVLRLQQYSEQTVLKLSAILDT 231
Query: 239 NGVDIGRYTQ-----GLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDTH----ISTMY 287
N +I Q GLY + ++ H+C PN KH + + L D I Y
Sbjct: 232 NCYEIRLPEQHVKVRGLYPLGAMLSHDCRPNTKHYFDDRLHMVLVATVDIPAGGVIHASY 291
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
T L GT RR L K F+C CERC+DPTE GT S +C N + +L P
Sbjct: 292 TQPLLGTVQRRLALRQAKCFDCCCERCADPTEYGTSASGFRCPNCRRTPS--LVLAAEPT 349
Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
+ + W C + R V Q + L + EE L + HP +
Sbjct: 350 NYRTVWRCQNQRCAYQERPDQYVARCERMQQELLALDRTEPAGYEEFLARHATTLHPWNA 409
Query: 408 HCYAVKHSLIQLYGT 422
+ K++L QL G+
Sbjct: 410 YMLQAKYALTQLLGS 424
>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 499
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 126/404 (31%), Positives = 188/404 (46%), Gaps = 36/404 (8%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
++ P +GRY+ A +DL E PLI GP + P+CLGC K + + A RC
Sbjct: 7 KVAEDPTVGRYVAANVDLAAWQVALVERPLIRGPS-QVTRPVCLGCLKSITADSA--VRC 63
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQ 171
RC WP C S SD H EC + + D+ Y++I +RCL +
Sbjct: 64 ERCGWPMCPD--SNCSDDEWHKAECDWTVTRRKQKIEIKDFTNPHPSYQSITVIRCLYQK 121
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
+P+ + +L +E+H + R G++ + I +L+ F + P + D +
Sbjct: 122 HNNPEVWAKLTKLESHCASRRGGSKYETDRIWIADHLQ-RFFKLDPTEWPVDEILR---- 176
Query: 232 ICGVIEVNGVDIGRYTQ---GLYSVICLMEHNCLPNAKHSNMMQSKLF-------VFRDT 281
+CG+++VNG ++ +Y ++EH+C+PN S L V
Sbjct: 177 VCGIVQVNGHEVPLTDPPYVAIYDAGSMLEHSCVPNCSKSFTRDGHLLIRTAAAAVESGG 236
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC-- 339
H+S YT+ LWGT R HLA TK+F C C RCSDPTELGTYFS +KC E DC
Sbjct: 237 HLSISYTDVLWGTAQRLAHLADTKFFVCKCPRCSDPTELGTYFSGVKCATE-----DCMG 291
Query: 340 WILP-VNPLDNDS---DWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
+ LP V+P ND +W C C + + L + +G + + + +P +
Sbjct: 292 YSLPNVHPSTNDPFDVEWMCNFCFTVADPNRIELSLAHVGRDMAAMNRRDPKHCHMFLDH 351
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
HP+HY+ V+ +L QLYG G LS L K
Sbjct: 352 YVSAGDLHPNHYYFTEVRLNLAQLYGQLEGQPLDTLSPEQLSHK 395
Score = 38.5 bits (88), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 46/112 (41%), Gaps = 7/112 (6%)
Query: 428 YTQLSSSLLERKISYVMSPNLKATDE---PICLGCHKPLNPNLADNARCPRCFWPACSAR 484
Y + L +++ V P ++ + P+CLGC K + + A RC RC WP C
Sbjct: 17 YVAANVDLAAWQVALVERPLIRGPSQVTRPVCLGCLKSITADSA--VRCERCGWPMCPD- 73
Query: 485 CSGLSDAHTHAPECAILKLGCETLLAYNDYKSLSRAYLDHQQVRSWTQSSDN 536
S SD H EC + + D+ + +Y +R Q +N
Sbjct: 74 -SNCSDDEWHKAECDWTVTRRKQKIEIKDFTNPHPSYQSITVIRCLYQKHNN 124
>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
Length = 507
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 132/441 (29%), Positives = 202/441 (45%), Gaps = 53/441 (12%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + E P VGPK +CLGC++ + + + C
Sbjct: 14 VQWSPVCGRYLVARGAAVGHGLLIEELPFAVGPKCN-GPVVCLGCYQ-FDAD-TEEEYCS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
C WP CS C+ H EC +L+ + + ILPLR L+ +
Sbjct: 71 ECGWPLCS-ECAQDEGNHHFRAECRVLQDARARFFPLPKGSSHCPQLDCILPLRVLLAKE 129
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
P+++ +++ ME H +R +V+ + +I +YLR P + S + +
Sbjct: 130 TDPERWDSQVEPMEHHEEERKEDADVWHADRVNIAQYLRG------PCKLSSRFSEELIM 183
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHS-------NMMQSKLFVF 278
+ G++EVN + R T+G LY ++ HNC+PN S + +
Sbjct: 184 QVVGILEVNAFE-ARTTRGFPLRCLYPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVDL 242
Query: 279 RDTH-ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+D + YT L GT R+ HL KYF+C+CERC DPTELGT+FS++KC G
Sbjct: 243 KDGQPLHHSYTYTLDGTAQRQRHLKQGKYFSCTCERCQDPTELGTHFSSLKC-------G 295
Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE- 393
C + +P P D+DS W C +C A + +V + L +V+K+ K LEE
Sbjct: 296 QCTEGYQVPRQPTDSDSSWNCNNCGASSSQAEVETLLQSLQSEVNKVQGLEMGAKRLEEA 355
Query: 394 --MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY--------- 442
+L K +++ HP H+ ++ LI++YG GY QL LERK+
Sbjct: 356 ERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPEHQLERKVELCRQVLGVLN 415
Query: 443 VMSPNLKATDEPICLGCHKPL 463
V P L T H PL
Sbjct: 416 VFEPGLSRTRAMNLYEMHVPL 436
>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
Length = 486
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 152/526 (28%), Positives = 236/526 (44%), Gaps = 100/526 (19%)
Query: 7 IEDEELMR----CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
+EDE+ R C VC+ A +CS C V YC +HQ++HWK HK C P+++ S+
Sbjct: 2 LEDEDSDRIEKICPVCQVEATKRCSRCALVYYCSVEHQQQHWKKHKATCN--PFKVFSNE 59
Query: 63 LLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
GR+L AT ++ G+ + +E PL+ GP + PI +
Sbjct: 60 QFGRFLVATREIRAGEVVIKEPPLVHGPA-QITGPIAI---------------------- 96
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
H + P Y+ ++PLRCL+L P K++ L+
Sbjct: 97 -----------QHFYVPHPM----------------YQCLMPLRCLLLAESDPAKWELLQ 129
Query: 183 DMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD 242
+E+H +R G+E++ V L F + GD D+ + I G+I+VNG +
Sbjct: 130 QLESHEEERR-GSELWRNDREGVAKLIPRFFKT--GDRWDEDE---ILRIVGIIQVNGHE 183
Query: 243 IGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWG 293
I + +Y+ ++EH+C PN S + ++ ++ R +S YT+A+W
Sbjct: 184 IPLSVPSSVAIYNQASMLEHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWA 243
Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG-DCWILPVNPLDN--D 350
T R EHL TK F C CERCSDPTE TYFSA++C KD +ILPV+ DN
Sbjct: 244 TGNRLEHLQQTKMFRCECERCSDPTEYETYFSALRCSGFQKDSKCKGYILPVDS-DNWTT 302
Query: 351 SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE-----NPNVKSLEEMLTKLEAMFHPH 405
DW+C C + V + + ++ + + N +K + LT P+
Sbjct: 303 GDWSCQKCRGVVTGAAVGQILERARMDLEAMQKHREDHCNKYMKHYSKWLT-------PN 355
Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPIC------LGC 459
H + VK SL Q+ G + A ++ L KI + +L E IC G
Sbjct: 356 HQYLVDVKISLSQVIGGKAPEAIKKIPYEKLMSKIK--ICQDLLTLFEKICPAEARAFGA 413
Query: 460 HK-PLNPNLADNA-RCPRCFWPACSARCSGLSDAHTHAPECAILKL 503
+ L+ LA+ A R PAC+ S L ++ +A EC + L
Sbjct: 414 TRFELHAALAELARRSTESNNPACT---SQLEESLYNANECVRMLL 456
>gi|194889892|ref|XP_001977181.1| GG18383 [Drosophila erecta]
gi|190648830|gb|EDV46108.1| GG18383 [Drosophila erecta]
Length = 402
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 189/379 (49%), Gaps = 37/379 (9%)
Query: 37 KQHQKEHWKLHKPKC--KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLAL 94
KQ Q++ + P K+ PY ++ S + GRYL A+ L G+ + RE PL +GP ++
Sbjct: 27 KQEQQDKERSASPTAEGKEQPYRVEHSDIYGRYLVASRQLEAGETLIREEPLAIGPCVS- 85
Query: 95 AEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAIL----KLGCE 149
+P+CLGC++P++ +D RCP C WP C + C+GL H H EC + + E
Sbjct: 86 GDPVCLGCYQPVSLK-SDQYRCPGCAWPLCGSACAGLKQRHGHTETECQLYGERRAVAGE 144
Query: 150 TLL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS- 203
L A YE I+ +R L+L++ P+++ + ME+H +R ++ +
Sbjct: 145 LLTERAGPAEVRDLYELIMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEK 204
Query: 204 IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHN 260
+V LR + L+D A+ +H +CG+++VN +IG+ + LY L+ H+
Sbjct: 205 VVHRLRVTWQ-------LEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHD 257
Query: 261 CLPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCER 313
C PN H++ +S + R + ++ Y L GT RR + K F C C R
Sbjct: 258 CTPNTAHTDDPRSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRR 317
Query: 314 CSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ 373
CSDP ELGT SA+ C + V+PL+ DW C CS ++ A +V +
Sbjct: 318 CSDPRELGTDCSALVCATCRTGS----VRAVDPLEQTGDWACDRCSHKMGATEVERQLDR 373
Query: 374 LGEQVDKL-VQENPNVKSL 391
+ ++ + V + P +++
Sbjct: 374 INNDLEDIDVHDIPGLENF 392
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 6/92 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS I YG + + L R+ + P + + +P+CLGC++P++ +D
Sbjct: 48 YRVEHSDI--YGRYLVASRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYQPVSLK-SD 102
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
RCP C WP C + C+GL H H EC +
Sbjct: 103 QYRCPGCAWPLCGSACAGLKQRHGHTETECQL 134
>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
Length = 458
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/414 (29%), Positives = 200/414 (48%), Gaps = 59/414 (14%)
Query: 63 LLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
L GRYL A DL G + E P++ GP + +CLGC ++ + RC C WP
Sbjct: 4 LTGRYLVANGDLPMGHLVISEEPILTGPSYTTS-ALCLGCFDSVDGSY----RCKGCNWP 58
Query: 123 ACSARCSGLSDAH----TH---APECAILKLGCETLLAYNDY-----KYEAILPLRCLIL 170
C+ +C TH + CA+LK T + D+ Y+ I PLRCL+L
Sbjct: 59 LCNRKCETAGSHQLEVCTHDYSSLSCAMLKT---TDIDITDFVGPNTSYQFITPLRCLLL 115
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ R P +++ + ME+H + E+ ++V+ +R +D+ S +
Sbjct: 116 KSRDPARWKLISGMESHTEDQTREYRWGMEMGRNVVRVIRQTL-------GMDEFSEHEI 168
Query: 230 HWICGVIEVNGVDIGRYTQG-------------LYSVICLMEHNCLPN------AKHSNM 270
+CG++ VN ++ +T G +Y LM H+CL N +K+
Sbjct: 169 MKVCGILTVNAFEL--FTTGPNKDRDSTPRAWAVYPTTYLMNHDCLANTLTSIDSKNKMH 226
Query: 271 MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
+++++ V + ++ Y+ LWGT+ RR L KYF CSC+RC DPTELG++ S+ +C
Sbjct: 227 VRTRMPVKKGEALTAEYSECLWGTEIRRHQLHRYKYFWCSCQRCRDPTELGSFISSHRCT 286
Query: 331 NEHKDQGDCWILPVNPLD-NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQ---ENP 386
N + ++ PLD N + + C SC + + Q+ E+V +L E P
Sbjct: 287 NCGGN-----VVATKPLDFNSTGFRCESCRVIIRGEVLKGTNHQVEEEVFELESKELETP 341
Query: 387 NVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E+++ + E++ HP+HY K L YG QPGY+ +LS L +K+
Sbjct: 342 -IEEYEDLVERHESILHPNHYAMLRCKKFLAHAYGRQPGYSLAELSEDRLNKKM 394
>gi|24641164|ref|NP_727478.1| CG11160, isoform B [Drosophila melanogaster]
gi|22832071|gb|AAN09278.1| CG11160, isoform B [Drosophila melanogaster]
Length = 403
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 188/378 (49%), Gaps = 36/378 (9%)
Query: 37 KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
KQ Q++ + P + K+LPY ++ S + GRYL A L G+ + RE PL +GP ++
Sbjct: 29 KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87
Query: 96 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
+P+CLGC+ P++ AD RCP C WP C + C+GL H H EC + + E
Sbjct: 88 DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146
Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-I 204
L A YE ++ +R L+L++ P+++ + ME+H +R ++ + +
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKV 206
Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNC 261
V+ LR + L+D A+ +H +CG+++VN +IG+ + LY L+ H+C
Sbjct: 207 VQRLRVTWQ-------LEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDC 259
Query: 262 LPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
PN H++ S + R + ++ Y L GT RR + K F C C RC
Sbjct: 260 TPNTAHTDDPSSFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRC 319
Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
SDP ELGT SA+ C + V+PL DW C C+ ++ A +V ++
Sbjct: 320 SDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRI 375
Query: 375 GEQVDKL-VQENPNVKSL 391
++ + V + P +++
Sbjct: 376 NNDLEDIDVHDIPGLENF 393
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS I YG + L R+ + P + + +P+CLGC+ P++ AD
Sbjct: 49 YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
RCP C WP C + C+GL H H EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135
>gi|20152089|gb|AAM11404.1| RE25548p [Drosophila melanogaster]
Length = 403
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 188/378 (49%), Gaps = 36/378 (9%)
Query: 37 KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
KQ Q++ + P + K+LPY ++ S + GRYL A L G+ + RE PL +GP ++
Sbjct: 29 KQEQQDKEQSPSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87
Query: 96 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA-PECAILK----LGCET 150
+P+CLGC+ P++ AD RCP C WP C + C+GL H H EC + + E
Sbjct: 88 DPVCLGCYHPVSLK-ADQYRCPGCAWPLCGSTCAGLKHRHGHTETECQLYAERRAVAGEL 146
Query: 151 LL-----AYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-I 204
L A YE ++ +R L+L++ P+++ + ME+H +R ++ + +
Sbjct: 147 LTERAGPAEVRDLYELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKV 206
Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNC 261
V+ LR + L+D A+ +H +CG+++VN +IG+ + LY L+ H+C
Sbjct: 207 VQRLRVTW-------QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDC 259
Query: 262 LPNAKHSNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
PN H++ S + R + ++ Y L GT RR + K F C C RC
Sbjct: 260 TPNTAHTDDPSSFEILLRTSRRVREREALTLGYAYTLQGTLKRRAFMHEGKLFWCCCRRC 319
Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
SDP ELGT SA+ C + V+PL DW C C+ ++ A +V ++
Sbjct: 320 SDPRELGTDCSALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRI 375
Query: 375 GEQVDKL-VQENPNVKSL 391
++ + V + P +++
Sbjct: 376 NNDLEDIDVHDIPGLENF 393
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 46/92 (50%), Gaps = 6/92 (6%)
Query: 410 YAVKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLAD 469
Y V+HS I YG + L R+ + P + + +P+CLGC+ P++ AD
Sbjct: 49 YRVEHSDI--YGRYLVANRQLEAGETLIREEPLAIGPCV--SGDPVCLGCYHPVSLK-AD 103
Query: 470 NARCPRCFWPACSARCSGLSDAHTHA-PECAI 500
RCP C WP C + C+GL H H EC +
Sbjct: 104 QYRCPGCAWPLCGSTCAGLKHRHGHTETECQL 135
>gi|118776986|ref|XP_307207.3| AGAP011523-PA [Anopheles gambiae str. PEST]
gi|116133074|gb|EAA03232.3| AGAP011523-PA [Anopheles gambiae str. PEST]
Length = 473
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/396 (31%), Positives = 180/396 (45%), Gaps = 40/396 (10%)
Query: 66 RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACS 125
RY+ A DL GD I P +VGP A EP+CLGCH P + RC C W CS
Sbjct: 1 RYIVAARDLKAGDVIIETPPFVVGP-CAETEPVCLGCHSAFQPG-SVVYRCEICNWRICS 58
Query: 126 ARCSGLSDAHTHAP-ECAILKLGC------ETLLAYNDYKYEAILPLRCLILQRRSPKKY 178
C GL A +H EC L+ + A YEAIL LRC++L+ +Y
Sbjct: 59 PSCQGLVPAGSHRQLECIPLRDKAVHNHLQTSPAAQVKLMYEAILTLRCMLLKTVDRPQY 118
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD-VLDDTSAKCLHWICGVIE 237
L M+ P +V ++I + + +E++ + S LH ICG+IE
Sbjct: 119 DRLLGMD-------PLNDVRQQIPKLWNRNQKEIVERIRNQWGFAEYSELELHTICGIIE 171
Query: 238 VNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSNMMQS---KLFVFRDTH----ISTMY 287
VN ++G+ + L+ L+ H+C PN H++ Q+ + V RD ++ Y
Sbjct: 172 VNAFEVGQEPTKARALFPEAYLLMHDCTPNTGHTDAPQTHHLTVRVLRDVKAGEPLTLTY 231
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL----NEHKDQGDCWILP 343
+ L GT RR+HL K+F+C+C RC+DP+ELGT SA++C H G
Sbjct: 232 AHILQGTLKRRQHLREEKFFSCNCRRCTDPSELGTNCSALRCTKCPKGTHTADGSAQ--- 288
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
C CS ++ V + +V LE L A+ H
Sbjct: 289 ----PRKRKCHCRQCSTVMSCEAV--ILLLEQLSQQLETIGGNDVDGLETFLRVQGAVLH 342
Query: 404 PHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+HY +VKHSL +LYG G+ QLS L+RK
Sbjct: 343 DNHYLLLSVKHSLCELYGKIEGFLIPQLSREQLKRK 378
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%), Gaps = 3/53 (5%)
Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH 494
+V+ P A EP+CLGCH P + RC C W CS C GL A +H
Sbjct: 21 FVVGPC--AETEPVCLGCHSAFQPG-SVVYRCEICNWRICSPSCQGLVPAGSH 70
>gi|357606572|gb|EHJ65115.1| hypothetical protein KGM_22194 [Danaus plexippus]
Length = 482
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 184/385 (47%), Gaps = 43/385 (11%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLA-LAEPICLGCHKPLNPNLADNA 114
Y++K + LGRYL A +L GD I + P VGPK P+CL C+ P+ ++
Sbjct: 7 YDVKRNVKLGRYLVANAELGCGDLIFTDYPFAVGPKPGKYTPPLCLSCYCPI-----ESK 61
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA-----ILPLRCLI 169
C RC WP CSA C + H PEC++ D+ A + PLR L+
Sbjct: 62 YCSRCSWPICSAECEL---SPNHQPECSVFSKAKIKFQPVEDWTVSAPQLDCVTPLRLLV 118
Query: 170 LQRRSPKKY-QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
+ + P ++ +E++ ME H +R + + +IV YL +D C
Sbjct: 119 AKEQDPDRWSKEVQAMETHTEQRRKRSTWKADQINIVNYL------------VDHCKLNC 166
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-------MQSKLFVFRDT 281
V +V G + +Y + + H+C+PN HS + +++ + + +
Sbjct: 167 RFSKELVEQVCASRGGFSIRAVYPRLAIAAHSCVPNIVHSILQPDYRVEVRAAVPLQKGQ 226
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
+ YT+ L T RREHL +K+F+C C RC+DPTELGT+ S KC K +
Sbjct: 227 RLHLSYTHVLSPTILRREHLRESKFFDCDCPRCTDPTELGTHLSTFKCSKCKKG----IV 282
Query: 342 LPVNPLDNDSDWTCG--SCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE---MLT 396
L NPLD ++ W C C R ++ +H + S L +++D L + +++E+ +
Sbjct: 283 LSKNPLDKEASWNCSENDCDFRTSSAVMHKLLSDLQDELDSLDSLDTASEAVEQREAFIN 342
Query: 397 KLEAMFHPHHYHCYAVKHSLIQLYG 421
K + + H H VKH+L+QLYG
Sbjct: 343 KYQLILHHRHSFMLCVKHALVQLYG 367
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 39/93 (41%), Gaps = 24/93 (25%)
Query: 442 YVMSPNLKATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL 501
+ + P P+CL C+ P+ ++ C RC WP CSA C + H PEC++
Sbjct: 37 FAVGPKPGKYTPPLCLSCYCPI-----ESKYCSRCSWPICSAECEL---SPNHQPECSV- 87
Query: 502 KLGCETLLAYNDYKSLSRAYLDHQQVRSWTQSS 534
S+A + Q V WT S+
Sbjct: 88 ---------------FSKAKIKFQPVEDWTVSA 105
>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
Length = 507
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 123/410 (30%), Positives = 197/410 (48%), Gaps = 48/410 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + + E P VGPK +CLGC++P NP+ + C
Sbjct: 14 VQWSPVCGRYLVAKGAIRGHGLLIEELPFAVGPKCN-GPVVCLGCYEP-NPD-PEEELCS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
C WP C C+ +D EC+ LK + + I+PLR L+ +
Sbjct: 71 ECGWPLC-VECAQQADNAHFRLECSQLKDARARFFRLPSGSRHCPQLDCIMPLRVLLAKE 129
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+P+++ E+ ME H +R +V+ + +I +YLR P + + S + +
Sbjct: 130 ANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRG------PCQLANRFSEELIM 183
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G L+ ++ HNC+PN S + S+ + R +
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R++HL K+F C CERC DPTELGT+FS++KC
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC------- 294
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
G C + +P P + D+ W C +C + + D + L +V+ VQ P ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWNCANCGSDTSNADALAMLQSLQSEVNA-VQALPMAAKRLE 353
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+E +L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 354 EIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403
>gi|195432146|ref|XP_002064087.1| GK19981 [Drosophila willistoni]
gi|194160172|gb|EDW75073.1| GK19981 [Drosophila willistoni]
Length = 390
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/398 (28%), Positives = 187/398 (46%), Gaps = 49/398 (12%)
Query: 36 GKQHQKEHWKLHKPKCKKL----------------PYEIKSSPLLGRYLQATLDLHPGDR 79
GK+HQ+ H +P K + I+ S + GRYL A L PG+
Sbjct: 6 GKKHQRHHQTQRQPHQKAEVDEVQVEEEEEPLVIPAFRIEKSEIYGRYLVANRQLEPGEL 65
Query: 80 IARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-A 138
+ E PL +GP ++ +P+CLGC+ P+ + A RC C WP C ++C GL A H A
Sbjct: 66 LITEQPLAIGPCVS-GDPVCLGCYSPVTLD-ASQYRCSACGWPLCGSKCKGLDTAVGHTA 123
Query: 139 PECAILK---------LGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMS 189
EC +L L T YE I+ +R L+L++ +P++Y + ME+H
Sbjct: 124 AECKVLAGRRSLAGDLLTLGTAPGQVRDLYELIMIVRILLLRQNAPEQYALVMRMESHSE 183
Query: 190 KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR---Y 246
+R +++ + V + GD+ + A+ LH ICG+++VN +IG+
Sbjct: 184 ERRKNATLWQHYEEKVVQRLIELWQM--GDLAE---AQQLHEICGILDVNCFEIGQNGAK 238
Query: 247 TQGLYSVICLMEHNCLPNAKHSN-------MMQSKLFVFRDTHISTMYTNALWGTQPRRE 299
+ LY L+ H+C PN H++ ++++ V ++ Y L GT RR
Sbjct: 239 ARTLYPSAFLLAHDCSPNTAHTDDPGTYAILLRTSRRVHEQEALTLSYAYTLQGTLKRRS 298
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
+ K F C C RC+DP ELGT SA+ C + I V PL +DW C C
Sbjct: 299 FMHEGKLFWCQCRRCADPKELGTDCSALACRSCQLGS----IRAVQPLKQTADWACDRCK 354
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
+++A++V ++ ++ + + ++ LE L +
Sbjct: 355 RQVSAQEVEHQLDRINNDLEAI--DVHDIPGLENFLIR 390
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 450 ATDEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTH-APECAIL 501
+ +P+CLGC+ P+ + A RC C WP C ++C GL A H A EC +L
Sbjct: 78 VSGDPVCLGCYSPVTLD-ASQYRCSACGWPLCGSKCKGLDTAVGHTAAECKVL 129
>gi|170061020|ref|XP_001866058.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879295|gb|EDS42678.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 494
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 124/413 (30%), Positives = 196/413 (47%), Gaps = 46/413 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-- 115
+ ++ LGR+L AT + G+ I RESP ++GPK A + P+CLGCH+ L+P D +
Sbjct: 8 LSTNATLGRHLVATRPIRSGEVIFRESPTVLGPKTA-SVPLCLGCHRNLDPITTDAGKKY 66
Query: 116 --CPRCFWPACSARCSGLSDAHTHAPECAIL-------KLGCETLLAYNDYK-YEAILPL 165
C C WP CS C + H EC + ++ + L+ + Y AI+PL
Sbjct: 67 YNCQHCGWPLCSPSCE---TSCYHRDECQLFASKSYRPQIRFDELVPSKKHSGYCAIVPL 123
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLDDT 224
R ++L+R+ P ++ +L +E+H+ R T +Y + S +V ++R ++ ++
Sbjct: 124 RAILLKRKEPARFAKLATLESHVETRQ-STPLYAAVQSNLVPFVRDVL------NLRNEV 176
Query: 225 SAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKH------SNMMQS 273
S L I G+ + N +I G + LY + +M H C PN KH + +M +
Sbjct: 177 SVGQLMEIAGIFDTNSYEIRIPERGIKIRALYELGAMMAHCCQPNTKHFFDEELNLVMIA 236
Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
+ + +D IS Y L T RR + K F C C RCSDPTE TY ++ C
Sbjct: 237 AVDIPKDEMISISYAQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCPQCS 296
Query: 334 KDQGDCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
K ++ VNPLD SDW C CS R +A+ ++ +L
Sbjct: 297 KS----MVVAVNPLDFRSDWRCEDKKCSYRESAQQYITRNEAFRAEIGQL-SSGAGPDGY 351
Query: 392 EEMLTKLEA--MFHPHHYHCYAVKHSLIQLYGTQPGYAYT--QLSSSLLERKI 440
E +L + ++ + HP + + VK++L QLYG LS L RK+
Sbjct: 352 ECLLGRFQSTDVLHPWNTNVLQVKYALTQLYGGGGPGGINLKDLSEKQLRRKV 404
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 29/58 (50%), Gaps = 6/58 (10%)
Query: 432 SSSLLERKISYVMSPNLKATDEPICLGCHKPLNPNLADNAR----CPRCFWPACSARC 485
S ++ R+ V+ P K P+CLGCH+ L+P D + C C WP CS C
Sbjct: 26 SGEVIFRESPTVLGP--KTASVPLCLGCHRNLDPITTDAGKKYYNCQHCGWPLCSPSC 81
>gi|195472006|ref|XP_002088293.1| GE18493 [Drosophila yakuba]
gi|194174394|gb|EDW88005.1| GE18493 [Drosophila yakuba]
Length = 507
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/410 (29%), Positives = 196/410 (47%), Gaps = 48/410 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GR+L A + E PL VGPK +CLGC++P +P+ + C
Sbjct: 14 VQWSPVCGRFLVAKGATLGHGLLIEELPLAVGPKCN-GPVVCLGCYQP-DPDPVEGL-CS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
C WP C CS +D + EC L+ + + I+PLR L+ +
Sbjct: 71 ECGWPVCE-ECSQQADNTHYQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE 129
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+P+++ E+ ME H +R +V+ + +I +YLR P + S + +
Sbjct: 130 ANPERWDNEVAPMEHHEEERKKDADVWHADRVNIAQYLRG------PCKLASRFSEELIM 183
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G L+ ++ HNC+PN S + S+ + R +
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R++HL K+F C C+RC DPTELGT+FS++KC
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSLKC------- 294
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
G C + +P P + D+ W C SC + + + + L +V+ VQ P ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWNCASCGSDTSNVEALTLLQSLQSEVNA-VQALPMAAKRLE 353
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+E +L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 354 EVERLLRKYKSLLHPQHFIATGLRQLLIEMYGRVKGYEMVQLPDHQLERK 403
>gi|225713426|gb|ACO12559.1| msta, isoform A [Lepeophtheirus salmonis]
Length = 505
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 136/447 (30%), Positives = 211/447 (47%), Gaps = 47/447 (10%)
Query: 15 CAVCRETALHKCSACKEVAYCGK-QHQKEHWKLHKPKCKK-LPYEIKSSPLLGRYLQATL 72
C C A C+ C V++C + H+ H+ K + P+ +K + LGR+L AT
Sbjct: 7 CFYCGNPATASCTTCGLVSFCSEPSHKALHYGHKKDETSFCFPFAVKRNSQLGRHLVATR 66
Query: 73 DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
D+ + I ESPL++GP ++ + CL C K L+ C C +P CS CS
Sbjct: 67 DIKALEVILEESPLVIGPYIS-SNNQCLECFKTLD---TSPHSCISCNYPFCSQECS--- 119
Query: 133 DAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE-LKDMEAHMSK 190
H EC A K G + ND+ Y I LR L LQ P+ + +KDM H S
Sbjct: 120 QGPWHKIECSAFQKYGHKRDENDNDF-YSIITTLRLLSLQESKPEFWNTIMKDMLDHNSD 178
Query: 191 RGPGT----EVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI--- 243
T YEE ++VK ++S L K G+ DT L + GVI+ NG D+
Sbjct: 179 IQEYTPEKWTSYEE--TVVKKIQS--LTK--GNADTDT----LTRLIGVIDTNGADLNLP 228
Query: 244 --GRYTQG--LYSVICLMEHNCLPNAKHSNM------MQSKLFVFRDTHISTMYTNALWG 293
G++ +G LY + + H+C+ N K N M++ + + + I Y
Sbjct: 229 HDGKHGKGTGLYLIYSNLNHSCVCNTKTINFQDRRIEMRASVNISKGEQICNQYVRPNKC 288
Query: 294 TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDW 353
T RR + YF+C C+RC+DPTE G+Y A+ C +LP+NPL+ +S+W
Sbjct: 289 TLARRTTIKNKWYFDCCCKRCADPTECGSYLGALLC-----SSCKSVVLPINPLETESNW 343
Query: 354 TCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVK 413
C CS + V V +GE +++ +PN S E++L + + HP+HY +
Sbjct: 344 ACKGCSKEYDVEYVMGVLISVGESLEEATPYDPN--SYEKILYDTQDLLHPNHYLRIEAE 401
Query: 414 HSLIQLYGTQPGYAYTQ-LSSSLLERK 439
+ L G P ++++ + +ERK
Sbjct: 402 LKVAALIGNIPTSSFSKGMPRPAVERK 428
>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
Length = 507
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 123/409 (30%), Positives = 194/409 (47%), Gaps = 46/409 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + E PL VGPK +CLGC++ +PN + C
Sbjct: 14 VQWSPVCGRYLVAKGATLGHGLLIEELPLAVGPKCN-GPVVCLGCYE-TDPN-PEEELCS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
C WP C A CS +D EC L+ + + I+PLR L+ +
Sbjct: 71 ECGWPLC-AECSEQADNAHFQLECRELRDARARFFPLPSGSSHCPQLDCIMPLRVLLAKE 129
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+P+++ E+ ME H +R +V+ + +I +YLR P + S + +
Sbjct: 130 ANPERWDSEVAPMEHHEEERKKDADVWHADRVNIAQYLRG------PCKLASRFSEELIM 183
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G L+ ++ HNC+PN S + S+ + R +
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIRLRAMVD 241
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R++HL K+F C C+RC DPTELGT+FS++KC
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSLKC------- 294
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
G C + +P P + D+ W+C SC + + + + L +V+ + K LEE
Sbjct: 295 GQCAEGYQVPRQPTEADTSWSCASCGSDTSNVEALAMLQSLQSEVNAVQALPMGAKRLEE 354
Query: 394 ---MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 355 VERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403
>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
Length = 507
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 120/410 (29%), Positives = 195/410 (47%), Gaps = 48/410 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + E P +GPK +CLGC++ +P+ + C
Sbjct: 14 VQWSPVCGRYLVAKGATRGHGLLIEELPFAIGPKCN-GPVVCLGCYE-TDPD-PEEELCS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQR 172
C WP C C+ +D EC L+ + + I+PLR L+ +
Sbjct: 71 ECGWPLC-VECAQQADNAHFQLECRELRDARARFFRLPSGSSHCPQLDCIMPLRVLLAKE 129
Query: 173 RSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+P+++ E+ ME H +R +V+ + +I +YLR P + + S + +
Sbjct: 130 ANPERWDNEVAPMEHHKEERQKDADVWHADRVNIAQYLRG------PCQLANRFSEELIM 183
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G L+ ++ HNC+PN S + S+ + R +
Sbjct: 184 QVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTARS-IYPSEGYKIRLRAMVD 241
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R++HL K+F C CERC DPTELGT+FS++KC
Sbjct: 242 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC------- 294
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVK 389
G C + +P P + D+ W+C SC + + D + L +V+ VQ P ++
Sbjct: 295 GQCAEGFQVPRQPTEPDTSWSCASCGSETSNVDALAMLQSLQSEVNA-VQALPMAAKRLE 353
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+E +L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 354 EIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 403
>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
Length = 503
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 120/416 (28%), Positives = 197/416 (47%), Gaps = 48/416 (11%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ +LGR L + + G+ + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLQEDAVLGRCLATSSSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLA--YNDY---KYEAILPLR 166
+ RC C WP C + C+ D H EC + T +D + + I LR
Sbjct: 61 SLDRCELCDWPLCGS-CADDEDVTEHRGECQVFSAARVTFAGNVSDDGVCPQLDCITILR 119
Query: 167 CLILQRRSPKKY-QELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDT 224
L+ + +P+++ E+ ME H +R +V+ + +I +YLR P + +
Sbjct: 120 VLLAKEANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRG------PCQLANRF 173
Query: 225 SAKCLHWICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
S + + + GV+EVN + R +G L+ ++ HNC+PN S + S+ + R
Sbjct: 174 SEELIMQVVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTSRS-IYPSEGYKIR 231
Query: 280 DTHISTM---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
+ + YT L GT R++HL K+F C CERC DPTELGT+FS++KC
Sbjct: 232 LRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC- 290
Query: 331 NEHKDQGDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP- 386
G C + +P P + D+ W C +C + + D + L +V+ VQ P
Sbjct: 291 ------GQCAEGFQVPRQPTEPDTSWNCANCGSDTSNADALAMLQSLQSEVNA-VQALPM 343
Query: 387 ---NVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
++ +E +L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 344 AAKRLEEIERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 399
>gi|125986939|ref|XP_001357232.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
gi|54645563|gb|EAL34301.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 124/409 (30%), Positives = 195/409 (47%), Gaps = 45/409 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ S + GRYL A + E P +GPK +CLGC +P P C
Sbjct: 14 VQWSAVCGRYLVAQESTPARSLLIEELPFAIGPKCN-GPVVCLGCCRP-APATDAEELCV 71
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCET---LLAYNDY--KYEAILPLRCLILQR 172
C WP C CS D+ EC ++ L + + Y + + ILPLR L+ +
Sbjct: 72 LCGWPLCE-ECSENEDSPHVQLECREFQISGARFYRLPSGSSYCPQLDCILPLRVLLAKE 130
Query: 173 RSPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+P+++ E+ ME H S+R +V+ ++ +I +YLR P + S + +
Sbjct: 131 ANPRRWDLEVAPMEHHESERRLNEDVWHADLVNIAQYLRG------PCKLASRFSEELIM 184
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G LY ++ H+C+PN S + S+ F R +
Sbjct: 185 QVVGVLEVNAFE-ARTARGYPLRCLYPYTGILAHSCVPNTARS-IYPSEGFKIRLRAMVD 242
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R+ HL K+F C+CERC DPTEL T+FS++KC
Sbjct: 243 LDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLKC------- 295
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKS 390
G C ++ P P + D+ W C SC A + DV + +V + ++ P+ ++
Sbjct: 296 GQCAEGFLAPKQPTEVDTSWNCCSCGASTSNVDVETTIKSIQSEVSGVQALEMGPHRLEE 355
Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
E +L K +++ HP H+ ++ LI++YG Y QLS LLERK
Sbjct: 356 AERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQRYEMVQLSDDLLERK 404
>gi|195146636|ref|XP_002014290.1| GL19122 [Drosophila persimilis]
gi|194106243|gb|EDW28286.1| GL19122 [Drosophila persimilis]
Length = 508
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/409 (30%), Positives = 195/409 (47%), Gaps = 45/409 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ S + GRYL A + E P +GPK +CLGC +P P C
Sbjct: 14 VQWSAVCGRYLVAQESTPARSLLIEELPFAIGPKCN-GPVVCLGCCRP-APATDVEELCV 71
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCET---LLAYNDY--KYEAILPLRCLILQR 172
C WP C CS D+ EC ++ L + + Y + + ILPLR L+ +
Sbjct: 72 LCGWPLCE-ECSENEDSPHVQLECREFQISGARFYRLPSGSSYCPQLDCILPLRVLLAKE 130
Query: 173 RSPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+PK+++ E+ ME H S+R +V+ ++ +I +YLR P + S + +
Sbjct: 131 ANPKRWEVEVAPMEHHESERRLNEDVWHADLVNIAQYLRG------PCKLASRFSEELIM 184
Query: 231 WICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
+ GV+EVN + R +G LY ++ H+C+PN S + S+ F R +
Sbjct: 185 QVVGVLEVNAFE-ARTARGYPLRCLYPYTGILAHSCVPNTARS-IYPSEGFKIRLRAMVD 242
Query: 286 M---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT L GT R+ HL K+F C+CERC DPTEL T+FS++KC
Sbjct: 243 LDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLKC------- 295
Query: 337 GDC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL--VQENPN-VKS 390
G C ++ P + D+ W C SC A + DV + +V + ++ P+ ++
Sbjct: 296 GQCAEGFLASKQPTEVDTSWNCCSCGASTSNVDVETTIKSIQSEVSGVQALEMGPHRLEE 355
Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
E +L K +++ HP H+ ++ LI++YG Y QLS LLERK
Sbjct: 356 AERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQRYEMVQLSDDLLERK 404
>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
Length = 423
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 108/329 (32%), Positives = 167/329 (50%), Gaps = 47/329 (14%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL---------- 106
++ + LGRY+ AT ++ G+ I R++P +VGPK+A + P+CLGC++ L
Sbjct: 13 QVVENACLGRYVVATRNIRQGEVIFRDAPAVVGPKMA-SVPVCLGCNRDLMATATVRQQG 71
Query: 107 NPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLG--------CETLLAYNDYK 158
+PN + +C RC WP C A C A H EC+I+ A +
Sbjct: 72 DPNFRFH-QCSRCGWPVCGAECEA---ADQHRSECSIMAGSSYRPKICPAREEQARRESA 127
Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLP 217
Y I+PLR L+LQ SP Y +L+ +E+H+++R + +YE + S +V ++RS
Sbjct: 128 YCVIVPLRVLLLQSHSPDTYAQLQRLESHVAERL-KSPLYEVVRSNLVPFIRSVL----- 181
Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIG---RYTQ--GLYSVICLMEHNCLPNAKH----- 267
L SA+ + +C +++ N +I R T+ LY + ++ H+C PN KH
Sbjct: 182 --GLQQYSAETILEVCAMLDTNCFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDA 239
Query: 268 -SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
++ + + + R IST YT L GT RR L +K+F+C CERC+DPTELGT S
Sbjct: 240 LRMVLVATVDIERGATISTSYTQPLLGTLHRRLALKQSKHFDCQCERCTDPTELGTNLSG 299
Query: 327 MKCLNEHKDQGDCWILPVNPLDNDSDWTC 355
+C K +L P S W C
Sbjct: 300 FRCPVCRKG----IVLSAQPCSPQSVWRC 324
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 16/77 (20%)
Query: 435 LLERKISYVMSPNLKATDEPICLGCHKPL----------NPNLADNARCPRCFWPACSAR 484
++ R V+ P + + P+CLGC++ L +PN + +C RC WP C A
Sbjct: 35 VIFRDAPAVVGPKMASV--PVCLGCNRDLMATATVRQQGDPNFRFH-QCSRCGWPVCGAE 91
Query: 485 CSGLSDAHTHAPECAIL 501
C A H EC+I+
Sbjct: 92 CEA---ADQHRSECSIM 105
>gi|195350806|ref|XP_002041929.1| GM11450 [Drosophila sechellia]
gi|194123734|gb|EDW45777.1| GM11450 [Drosophila sechellia]
Length = 462
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 121/447 (27%), Positives = 190/447 (42%), Gaps = 94/447 (21%)
Query: 37 KQHQKEHWKLHKP-KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA 95
KQ Q+E + P + K+LPY ++ S + GRYL A L G+ + RE PL +GP ++
Sbjct: 29 KQEQQEKEQSTSPTEEKELPYRVEHSDIYGRYLVANRQLEAGETLIREEPLAIGPCVS-G 87
Query: 96 EPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN 155
+P+CLGC++P++ D RCP C WP
Sbjct: 88 DPVCLGCYQPVSLQ-PDQYRCPGCAWP--------------------------------- 113
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNF 212
+P P+++ + ME+H +R + YEE +V+ LR +
Sbjct: 114 -------IP---------DPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRGTW 155
Query: 213 LEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQ 272
L+D A+ +H I IG + LY L+ H+C PN H++ +
Sbjct: 156 -------QLEDLEAEQVHEIG--------QIGVKVRTLYPSAFLLAHDCTPNTAHTDDPR 200
Query: 273 SKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
S + R + ++ Y L GT RR + K F C C RCSDP ELGT S
Sbjct: 201 SFEILLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCS 260
Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
A+ C + V+PL DW C C+ ++ A +V ++ ++ + +
Sbjct: 261 ALVCATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRINNDLEDI--DV 314
Query: 386 PNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY--- 442
++ LE L + + P+HY + K+SL Q+YG GY Q++ + RK SY
Sbjct: 315 HDIPGLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMTPEDIARKESYCRE 374
Query: 443 ------VMSPNLKATDEPICLGCHKPL 463
V+ P L I H P+
Sbjct: 375 FLEIVDVLDPGLTRLRGLIMYELHAPV 401
>gi|239791604|dbj|BAH72247.1| ACYPI001902 [Acyrthosiphon pisum]
Length = 241
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 72/154 (46%), Positives = 103/154 (66%), Gaps = 3/154 (1%)
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
MYTNALWGT+ RR HL TKYF C C+RCSD TELGT FS + C + +G+ + P++
Sbjct: 1 MYTNALWGTRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVCNVKGFCKGN--LTPIH 58
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
PLD+ S+W C C +++ + + ++L VDK +Q NP++ SLE+ +KL+ H +
Sbjct: 59 PLDDSSEWECDRCPNTVSSDKIDAILTELNHIVDKALQ-NPSINSLEDAFSKLKTRIHSN 117
Query: 406 HYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
HY C+ VKH+LIQLYG GY ++ L+ LL+RK
Sbjct: 118 HYLCFNVKHTLIQLYGHSMGYKHSMLTDDLLKRK 151
>gi|322792382|gb|EFZ16366.1| hypothetical protein SINV_10209 [Solenopsis invicta]
Length = 513
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 188/382 (49%), Gaps = 32/382 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ S LGRYL ++ PG+ I RE P+ VGP + + C C + L P + +
Sbjct: 7 YKVAHSNKLGRYLMTAKNIAPGEVIIREEPIAVGPMMYKKDCFCFACMRTL-PKIGKGRQ 65
Query: 116 --CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
C RC F P CS C ++ HT+ EC I K + L +L LR +++
Sbjct: 66 YVCSRCNFAPLCSVACEVRTNHHTNE-ECQIFKDNRDLLTESGIDVARILLALRLWLIKH 124
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
++P ++++ MEAH+ KR GT V+ + + ++V +R ++P ++ + SA+ +
Sbjct: 125 KNPAVWEKIDRMEAHLDKR-IGTPVWNDREVNVVNVIRR---LRIPAEI-EPPSAELMQK 179
Query: 232 ICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVF 278
+CG+++VN ++ G +GLY + LM H+C N K + + + +
Sbjct: 180 LCGILDVNAFELRSPGALDGFLLRGLYIEVTLMAHDCRGNTHVTVDDKFQLTVYASVAIK 239
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM---KCLNEHKD 335
+D I YT++L GT R+++L KYF C C C+DP EL ++ S++ +C
Sbjct: 240 QDETIFFNYTSSLLGTADRKDYLREGKYFECECSLCADPYELQSHLSSVLCPRCKKGFVG 299
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
D I V+P + S W C C R + + +D+ E+ ++K ++ +L
Sbjct: 300 MQDTSI--VDPYER-SRWQCQMCKKTYGGRLIRATLNICRTLIDEC--EDVDIKRIDGLL 354
Query: 396 TKLEAMFHPHHYHCYAVKHSLI 417
+L H +H+ ++K L+
Sbjct: 355 KRLSRSLHTNHFLMLSLKQKLL 376
>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 554
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 186/390 (47%), Gaps = 37/390 (9%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L Y+I S LGRYLQA+ DL G+ I RE P+ VGP + PIC C L P++ +
Sbjct: 15 LKYKIAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCMKNPICFECLSLL-PDIEQD 73
Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
C C C C + H+ EC I+K E + D + LR +L
Sbjct: 74 FHYICSGCNVVTLCGFSCEERAYYHSMY-ECEIIKNNMELSIENTDTLAAVLFVLRLWLL 132
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+++ P+ ++ + +E+H++KR T V+E+ + +IV L+S L +P + D + ++ +
Sbjct: 133 KQKDPELWKRVLSLESHLNKR-RNTIVWEDREINIVNVLKS--LNFVPEN--DPSVSEII 187
Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
+CG+++VN ++ G +GLY LM H+C N + L V+
Sbjct: 188 QQLCGILDVNSFELRSPGGMDGLLLRGLYLEASLMAHDCRGNVHVTADDNFHLTVYASIP 247
Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I YT++L GT RRE+L KYF C C+ C DP E+G+Y S + C Q
Sbjct: 248 IKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDLCKDPYEMGSYMSCILC--PRCRQ 305
Query: 337 GDCWI---LPVNPLDNDSDWTCGSCSARLNA---RDVHLVTSQLGEQVDKLVQENPNVKS 390
G + L P +S W C C ++ + ++ L + VD+ N+K
Sbjct: 306 GYVGMQDPLTKFPYSKNSKWQCNKCKKIISGCLIKTSLSISKNLVDDVDEY-----NIKV 360
Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
+E + KL H +H+ ++K L+ +Y
Sbjct: 361 MENLKEKLSKSLHKNHFLMLSLKQKLLNVY 390
>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
Length = 530
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 183/385 (47%), Gaps = 27/385 (7%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L Y++ S GRYLQA +L G+ I RE P+ VGP + + +C C + L P L
Sbjct: 13 LKYKVACSEKFGRYLQAAKNLQAGEVILREKPIAVGPITSSNDYLCFACLRLL-PKLKKG 71
Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLI 169
+ C +C P C C HT + EC I K E L A N IL PLR +
Sbjct: 72 TQYVCSKCNIAPLCGTACQEQPKHHT-SYECEIFKTN-ENLSADNIETLIGILLPLRLWL 129
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+++ P+ + ++ MEAHM +R + + +++ +++ L +P D D + + L
Sbjct: 130 VKQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVMKA--LHLIPED--DTSIPEYL 185
Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
+CG+++VN ++ G +GLY M H+C N + +L V+
Sbjct: 186 QQLCGILDVNTFELRSPGGLDGLLLRGLYVEASFMAHDCRGNTHLTVDDHFQLTVYASIP 245
Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD- 335
I YT++L GT RREHL KYF C C CSDP E+G+Y S++ C K
Sbjct: 246 IKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLCSDPYEMGSYMSSILCPRCRKGY 305
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
G L + + + W C C + ++ R + + +D + ++ ++K +E +
Sbjct: 306 MGIRTPLTIYAYEKSTKWQCDKCRSCISGRLIRATVNITKTLIDDM--DDYDIKGMETLT 363
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLY 420
KL FHP+HY ++K L+ Y
Sbjct: 364 KKLLGSFHPNHYLMLSLKQKLLAAY 388
>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 517
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 187/381 (49%), Gaps = 31/381 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ S LGRYL A ++ G+ I RE P+ VGP + +C C + L+ +
Sbjct: 7 YKVTHSGRLGRYLVAAKNIAAGEVIIREEPIAVGPMTYRKDRLCFACLRSLS-KIGRQYV 65
Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLILQRR 173
C RC P CS C + HT + EC IL LLA N IL LR +++ R
Sbjct: 66 CSRCNLAPLCSVACETRTKHHT-SDECLILGEN-RDLLAKNGIDATGILLALRLWLIKCR 123
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
+P ++++ ME H+ +R GT V+++ + ++V +R + ++P D ++ SA+ + +
Sbjct: 124 NPAVWEQVDHMEDHLHERM-GTPVWKDREVNVVNVIR---MLRVP-DGIEPPSAELMQKL 178
Query: 233 CGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST 285
CG+++VN ++ G +GLY LM H+C N + +L V+ I
Sbjct: 179 CGILDVNTFELRSPGALDGLLLRGLYVEAALMAHDCRGNTHLTVDDDFRLTVYASVAIGE 238
Query: 286 M------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM---KCLNEHKDQ 336
YT++L GT RREHL KYF C C+ C D EL ++ S++ +C +
Sbjct: 239 NEPILFNYTSSLLGTADRREHLREGKYFECECDLCKDSCELQSHLSSLLCPRCKKGFIEM 298
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
D I V+P +S W C +C R + + +++ E+ ++KSL+ +L
Sbjct: 299 QDTSI--VDPYARESRWQCQTCKRTYGGRLIRATLNICRTLIEEC--EDIDIKSLDGLLK 354
Query: 397 KLEAMFHPHHYHCYAVKHSLI 417
KL H +H+ ++K L+
Sbjct: 355 KLSRSLHTNHFLMLSLKQKLL 375
>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
Length = 513
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 189/408 (46%), Gaps = 53/408 (12%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHK---PLNPNLA 111
+ I+ SPL GR + AT D+ G+ I + PL++GP+ P+C+ C+K PL P
Sbjct: 39 WSIEYSPLGGRGMFATRDIQAGELIFTDVPLLIGPRCYRKCFPMCVVCYKNNCPLFPCDH 98
Query: 112 DNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETL-LAYNDYKYEAILPLRCLIL 170
D C P CS C ++ H+ EC +L+ T ++++ A++P+R L L
Sbjct: 99 D------CGLPICSTECE---NSTMHSQECRLLREWMPTCGSSWSNELLLAVIPIRGLTL 149
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
S ++ + L E H+ T +E + L+ N ++K P + + +
Sbjct: 150 ---SKEQRKLLYAFECHLEL----TRTHE-----IDLLKRN-VKKFP----NKEQLELMK 192
Query: 231 WICGVIEVNGVDI--------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-- 280
ICGV N ++ +GLY + L H C+PN +H Q +L V
Sbjct: 193 RICGVFNTNSFEVLVASNKDCTTSLRGLYPLGALQNHCCVPNTRHHFDDQQQLHVTATLP 252
Query: 281 ----THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+ YT+ LW T RR+ L TK+F+C+C RCSDP+E G+ A+ C KD
Sbjct: 253 IAAGEEITMSYTDLLWDTSSRRQFLKATKHFSCNCSRCSDPSEFGSQLGALLC---AKDD 309
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--VKSLEEM 394
+LP NPL+ S W C C +N + + + SQL V + P+ VK +E
Sbjct: 310 CSGHLLPSNPLNFISPWICDKCQISVNYKQIEFIHSQLNALVLDAMYMTPHEIVKFIETA 369
Query: 395 LTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
L+KL ++ VK +I +G P + L+ + L K Y
Sbjct: 370 LSKL---VPTSNFILIDVKFRIISYFGRTPNLKWEDLTDAELRIKKMY 414
>gi|332018227|gb|EGI58832.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 518
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 184/382 (48%), Gaps = 32/382 (8%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I S LGRYL ++ PG+ + RE P+ +GP + + C C + L P + + +
Sbjct: 7 YKIAHSKKLGRYLVTAKNIVPGEILIREEPIAIGPMVYKKDCFCFACMRTL-PKIGEGRQ 65
Query: 116 --CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL-PLRCLILQ 171
C RC F P CS C ++ HT+ EC I K LL N IL LR I++
Sbjct: 66 YACSRCKFVPLCSVACEIRTNHHTNE-ECQIFK-DNRDLLTENRIDVAGILLALRLWIIK 123
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
++P ++++ MEAH+ KR GT V+ + + ++V +R + P ++ + + + +
Sbjct: 124 HKNPAVWEKIDCMEAHLDKR-IGTSVWRDREVNVVDVIRR---LRWPAEI-ESPNTELMQ 178
Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
+CG+++VN ++ G +GLY LM H+C N + +L V+ I
Sbjct: 179 KLCGILDVNTFELRSPSALDGLLLRGLYIEATLMAHDCRGNTHLTVDDNFQLTVYASVAI 238
Query: 284 STM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
YT++L GT R++HL KYF C C C DP EL ++ S++ C K G
Sbjct: 239 KQNEPILFNYTSSLLGTVDRKDHLREGKYFECECSLCMDPYELQSHLSSVLCPRCKK--G 296
Query: 338 DCWILPVNPLD--NDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
I ++ +D S W C C R + + +D+ E+ ++K ++ +L
Sbjct: 297 FIGIQDISTIDPYEQSRWQCQMCKKTYGGRLIRATLNICRTLIDEC--EDVDIKKIDSLL 354
Query: 396 TKLEAMFHPHHYHCYAVKHSLI 417
KL H +H+ ++K L+
Sbjct: 355 KKLSRSLHTNHFLMLSLKQKLL 376
>gi|321474905|gb|EFX85869.1| hypothetical protein DAPPUDRAFT_237516 [Daphnia pulex]
Length = 595
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 129/464 (27%), Positives = 204/464 (43%), Gaps = 77/464 (16%)
Query: 35 CGKQHQ-KEHWKLHKPKCKKL-------PYEIKSSPLLGRYLQATLDLHPGDRIARESPL 86
CG + Q +E+ + +P +K + SSP LGRY A+ PGD + +E PL
Sbjct: 43 CGSRTQGRENRGMTQPNNRKSSSGGSKPAIHLDSSPELGRYFVASRAFKPGDIVMKEFPL 102
Query: 87 IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL-K 145
P L+E +C CH+ A C C P CS C + H EC L K
Sbjct: 103 ASAPG-PLSEHVCFACHRLF----ARAHICSDCRSPLCSRNCQV---SPEHQRECRYLTK 154
Query: 146 LGCETLLAYNDYKYEAILPLRCLILQR---------RSPKKYQELKDMEAHMSKRGPGTE 196
+ E + K + + P++ +L P K+ +L D+E+H + R +
Sbjct: 155 IIAEIENRQRNGKIQRV-PVQLGMLMLPLRLLMLRREQPDKWNQLMDLESHTTARQATSI 213
Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG------RYT--- 247
+I +L + ++++P + DD + +CG ++VN +I R+T
Sbjct: 214 WGFNQRNITPFLETMLIDEMPDIINDD----LVQQMCGAVDVNSFEIRIETPDRRHTNDL 269
Query: 248 ------------QGLYSVICLMEHNCLPNAKHSNMMQSKLFV-----FRDTH-ISTMYTN 289
+G + + LM H+C+ NA+ S S++ V R+ I YT+
Sbjct: 270 PQQPMPNSQEVLRGTFYMASLMAHSCVVNAQISIGNDSQMTVKATVPIREGEGIFVSYTD 329
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV-NPLD 348
L T RR L K+F C C RC DPTE GT+ SA++C + W+LP +P
Sbjct: 330 PLQTTLQRRTFLEKGKHFICRCRRCQDPTEFGTFASAVRC----RSCTSGWVLPNDDPAS 385
Query: 349 NDS---------DWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL----EEML 395
S W C CS+ L + ++ T ++ + + K +Q P + L EE+
Sbjct: 386 IASAAAVEGVALRWKCSECSSELKSEEIGR-TEEVIQNIVKNIQRLPVDRHLIDRCEELF 444
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
L H +H ++ SLI LYG PG+ +LS LL+RK
Sbjct: 445 VTLPKKIHVNHAFLLQLRISLIHLYGNVPGFLIPELSVGLLQRK 488
>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
Length = 518
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 176/384 (45%), Gaps = 51/384 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y+I S LGR++ A DL G++I E P + GP A+ CL C +
Sbjct: 8 YQILESEQLGRFVVAKRDLTRGEQILLEEPYVTGPYWD-ADVSCLNCFRDSCSTCRQCGV 66
Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK-LGCETLLAYNDYKYEAILPLRCLILQRR 173
P C+ D H EC+ + + YN + ++P+RCL+L R
Sbjct: 67 APLCY------------DCKIHDEAECSFYRESSLDRHFLYNHFN--VVMPVRCLMLYRS 112
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSN--FLEKLPGDVLDDTSAKCLH 230
+ ++Y E+ ME+ + +R GTE+++ + +VK L + F EK G + + + +
Sbjct: 113 NRERYDEVMTMESRLEER-RGTEIWDIHEKFVVKPLMESGIFAEKFEGL---EVTGELIQ 168
Query: 231 WICGVIEVNGVDI-----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
ICG+++ N +I G +GLY LM H+C+PN S S + VF
Sbjct: 169 RICGILDANTFEIRGNVDSRESQMGNLARGLYPKTALMMHSCMPNTLLSVDGYSNVRVFT 228
Query: 280 DTHI------STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
+ + YT +L+GT R+ HL KYF CSC RC DPTELGT+ S+++C
Sbjct: 229 SAPVKMGEILNISYTRSLFGTYDRQTHLRKGKYFICSCRRCKDPTELGTHLSSIRC--SE 286
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
D G C P P W C C L V+ + L D ++ V+ LE
Sbjct: 287 CDDGLCSFYPDEP-----RWECNKCHKLLKREYVNEI---LCAARDDIMACPLEVRDLER 338
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLI 417
+++K + +PHH K +L+
Sbjct: 339 VISKHSKILNPHHSLVLEAKQTLM 362
>gi|312373518|gb|EFR21239.1| hypothetical protein AND_17329 [Anopheles darlingi]
Length = 445
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 171/358 (47%), Gaps = 42/358 (11%)
Query: 137 HAPECAILKLGCETLLAYND-----YKYEAILPLRCLILQRRSPKKYQ-ELKDMEAHMSK 190
H ECA+ + D + + I PLR L+ + +P++++ E+ ME H K
Sbjct: 30 HGAECAVFAEKRVRFQSVEDSTAGCVQLDCITPLRMLLAKEANPERWEREIVPMEYHDEK 89
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI----GRY 246
R G + +IV +LR+ + D S + + GV++VN + G
Sbjct: 90 RREGANWKVDDHNIVSFLRNAC------GLADRFSVDLIQQVIGVLDVNAFEGRTCNGYS 143
Query: 247 TQGLYSVICLMEHNCLPNAKHSNMMQSKLF---------VFRDTHISTMYTNALWGTQPR 297
+GLY + +M H+C+PN HS + S+ F V + T YT L GT R
Sbjct: 144 VRGLYPQLAIMAHSCVPNVVHS-IHPSEAFRLEARVAVDVAEGEKLYTTYTYTLTGTVAR 202
Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGS 357
+ L +K+F C C RC DP+ELGT+FS++ C +K G I+ +PLD +++W CG
Sbjct: 203 QAILKASKFFTCMCPRCVDPSELGTHFSSLLC---NKCVGGI-IISSDPLDENAEWKCGD 258
Query: 358 CSARLNARDVHLVTSQLGEQVDKLV---QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKH 414
C + + V + ++D+L + +++ E + K ++ HP H+ +++H
Sbjct: 259 CGFKTSGAAVQKAILTMHNELDELACLEYDAGRLEAYERLYKKFRSVLHPRHFINTSIRH 318
Query: 415 SLIQLYGTQPGYAYTQLSSSLLERKISY---------VMSPNLKATDEPICLGCHKPL 463
SLI+LYG PGY +L LLERK+ V+ P L + + H PL
Sbjct: 319 SLIELYGRIPGYEMAELPDVLLERKVDLCRDVLRVLDVVEPGLSRSRAMVLYELHAPL 376
>gi|158289967|ref|XP_559081.3| AGAP010377-PA [Anopheles gambiae str. PEST]
gi|157018413|gb|EAL41037.3| AGAP010377-PA [Anopheles gambiae str. PEST]
Length = 503
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 165/372 (44%), Gaps = 49/372 (13%)
Query: 66 RYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF-WPAC 124
R+L A D+ PG+ I E P++VGP A+ CLGC P + C C P C
Sbjct: 1 RFLVANRDIKPGELILAEDPIVVGPYWD-ADISCLGCLGPASRT------CRVCLKGPLC 53
Query: 125 SARCSGLSDAHTHAP-ECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
D H P EC+ + + + + I PLRCL+L R KY E+
Sbjct: 54 R-------DCSRHDPVECSFYEQASKLNKNFLFDHFNIITPLRCLLLYRSDRDKYNEMMA 106
Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD--TSAKCLHWICGVIEVN-- 239
ME+H R TE++ + +Y+ L + +DD + + L ICG+++VN
Sbjct: 107 MESHFESR-RDTEIWHIHN---QYVVEPMLNEKDFQQIDDLTVTGELLQRICGILDVNTF 162
Query: 240 ---------GVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST----- 285
GV + +GLY LM HNC N + SKL ++ I
Sbjct: 163 EIRGNMDSQGVQMNNLARGLYPKTSLMTHNCQTNTLIAVDGMSKLRLYSSIGIKAGELLY 222
Query: 286 -MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
YT L+ T R+ HL KYF C+C RCSDPTELGT+ S++KC D G C P
Sbjct: 223 YNYTRVLFSTFERQTHLRKGKYFICNCARCSDPTELGTHLSSLKC--TACDDGLCVFHPE 280
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHP 404
P W C +C +L V+ + + + + ++ N LE+++ K +P
Sbjct: 281 TP-----KWECLTCGHQLPREYVNQIMCEARDDISSCALDSRN---LEKVIGKHSKTLNP 332
Query: 405 HHYHCYAVKHSL 416
HH K SL
Sbjct: 333 HHSLVLEAKQSL 344
>gi|328713664|ref|XP_003245146.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 464
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 180/404 (44%), Gaps = 53/404 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
PY + SP LGR+ A D+ G+ + E PL+VGPK A + P+CL C+ P
Sbjct: 17 PYAVVQSPELGRHWVAVRDIVAGEVLLEERPLVVGPK-AGSPPVCLTCYAP-----TAGY 70
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC-LILQRR 173
+C +C WP C RC + H A EC ++ ++ Y ++PLRC L+L +R
Sbjct: 71 KCSKCGWPVCGPRCEA-APVHRDA-ECPLIDGHYDS---RRSAAYCFVMPLRCMLLLHQR 125
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
++ E + +++H+ R T +Y ++ ++ + G + SA
Sbjct: 126 DGRRAAEFRSLQSHLDDR-LNTPLYRAYAVNVAAFVLDRLGLRSAGYGHNHRSAL---EA 181
Query: 233 CGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM-----------MQSKLFV 277
V++ N ++ GR + +Y +M H C PN KH + + + + +
Sbjct: 182 AAVLDTNAFEVRRPGGRKFRAVYGRASMMAHCCTPNTKHVFIGDETDGQPTIRVVAAVPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
R I+ YT LW T+ RR HL+ K F C+C RC+DP ELGT+ ++ C G
Sbjct: 242 ARGYPITATYTQTLWCTRDRRRHLSAAKCFECACARCADPVELGTHLGSVAC-----GGG 296
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
C P W C +C RL A DV + Q + + + E L +
Sbjct: 297 QC---PGGRATAAGQWLCTTC-GRL-ATDVEAAHAL---QTVGALSKTRDCAGFERFLER 348
Query: 398 LEAM----FHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLE 437
+ H +H+ VK++L QLY + + LS+ LE
Sbjct: 349 VRDGTMPPLHANHHVTVGVKYALAQLYADR----ISDLSTKQLE 388
>gi|321473703|gb|EFX84670.1| hypothetical protein DAPPUDRAFT_314752 [Daphnia pulex]
Length = 497
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 116/407 (28%), Positives = 196/407 (48%), Gaps = 40/407 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIA-RESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
+ I S +G++L AT ++ + I E L VGPK A P+CL C++ ++ +
Sbjct: 54 FTIGKSSDIGKFLVATRNIRADEIIIDAEEALAVGPKQA-THPVCLNCYRRVDGSYL--- 109
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILK----LGCETLLAYNDYK-YEAILPLRCLI 169
CP C WP CS C+ + + EC +L + L D + Y+ I PLR ++
Sbjct: 110 -CPLCQWPLCSEVCAQMENGAHQKYECQLLANREAYPTKEELEQEDCRLYDCIAPLRIVL 168
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPG--DVLDDTSAK 227
R+P++ + +E H +Y + I R +E + + ++T+ +
Sbjct: 169 DNHRNPERSRLWLGLEDH--------RLYRQKIGIWHVDRKTVVEPVLNKWQLGENTNEE 220
Query: 228 CLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPN-------AKHSNMMQSKL 275
L CG++EVN ++ + +YS CLM HNC+PN A +++ +
Sbjct: 221 ELQRACGILEVNAFEVCDDDKTISARAVYSHACLMAHNCVPNTACCVNPATQRMTVRAAV 280
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ I+T YT L GT RR HL TK+F+C C RC DPTELG++ S++ C +
Sbjct: 281 DIPAGQMITTSYTFTLDGTLRRRSHLKETKFFDCCCPRCCDPTELGSHLSSLAC---PRC 337
Query: 336 QGDCWILPVNPLDNDSDWTCGS--CSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
G + +P +PL+ S W C C R+ + V+ + + L + D+L + + + LE
Sbjct: 338 SGRNYTVPEDPLNPASFWKCQGEKCGNRIGSDQVNQLLACLQRKADELNYD--DFQGLEI 395
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ ++ HP+H +K+ L YG PG+ ++S L RK+
Sbjct: 396 FIETWSSLLHPNHSLLIGIKYYLCHFYGNAPGFLLKEMSWRFLTRKM 442
>gi|350403683|ref|XP_003486875.1| PREDICTED: hypothetical protein LOC100747756 [Bombus impatiens]
Length = 814
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 192/392 (48%), Gaps = 41/392 (10%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L Y + S LGRYLQA +L G+ I RE P+ VGP + +P C C + L P +
Sbjct: 13 LKYRVAYSEKLGRYLQAAKNLTAGEVILREEPIAVGPITSSKDPACFACLRLL-PKIKKE 71
Query: 114 AR--CPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
+ C +C P CS C S HT EC I K + L+ + +LPLR +L
Sbjct: 72 LQYVCSKCNIAPLCSTACEERSKHHT-PDECEIFKRNKDLLINNTENIIGVLLPLRLWLL 130
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCL 229
++R + +++++ MEAH KR T V+++ + ++V ++S L +P D + ++ L
Sbjct: 131 RQRDAELWKQVESMEAHTDKRR-NTSVWKDRELNVVNVIKS--LHLVPDG--DPSVSELL 185
Query: 230 HWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF---- 278
+CG+++VN ++ G +GLY LM H+C N + +L V+
Sbjct: 186 QLLCGILDVNCFELRSPGGLDGLLLRGLYVEASLMAHDCRGNTHLTADDNFQLTVYASLP 245
Query: 279 --RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I YT++L GT RREHL KYF C C CSDP E+G+Y S++ C
Sbjct: 246 IKEGDEIYFNYTSSLLGTLGRREHLRGGKYFECECPLCSDPYEMGSYMSSILCPRCR--- 302
Query: 337 GDCWILPVNPL-----DNDSDWTCGSCSARLNARDVHL---VTSQLGEQVDKLVQENPNV 388
+ +I NPL + + W C C + + R V +T L + VD + ++
Sbjct: 303 -EGYIGMQNPLTKFPYEKGTRWQCNKCKSSIGGRLVRATLNITKTLIDDVD-----DYDI 356
Query: 389 KSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
K LE ++ KL FH +H+ ++K L+ Y
Sbjct: 357 KGLETLMAKLSKSFHRNHFLMLSLKQKLLAAY 388
>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
Length = 490
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 118/407 (28%), Positives = 184/407 (45%), Gaps = 46/407 (11%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK---PLNPNLADN 113
+ SPL GR L AT D+ G+ I + PL++GP+ P+C+ C+K PL P D
Sbjct: 29 VDRSPLGGRGLFATRDIQAGELIFTDVPLLMGPRCYNKHLPMCVVCYKNNCPLFP--CDY 86
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLA-YNDYKYEAILPLRCLILQR 172
C P CSA C S H EC L+ T + ++ ++ +R L L
Sbjct: 87 G----CGLPICSAECEN-SAMHVQG-ECQFLREWAPTCGSTWSKDLLLVVVLIRGLAL-- 138
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
S ++ L E H + EV++ I L+ + L++ ++ + K + +
Sbjct: 139 -SEEQRTLLYAFECH-ANLNRNYEVHQFI------LKIDLLKRNVANLPSEEQMKLMQRV 190
Query: 233 CGVIEVNGVDI---------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV------ 277
C V N ++ +GLY + L H C+PN++H Q +L V
Sbjct: 191 CDVFNTNSFEVVVPSRNNDHTMSLRGLYPMGALQNHCCVPNSRHHFDDQQRLHVSAALPI 250
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
I+ YT+ LW T RR+ L ITK F+C+C RCSDP E G+ SA+ C KD+
Sbjct: 251 AAGEEITMSYTDLLWDTSTRRQFLRITKRFSCNCNRCSDPLEFGSRLSALLC---AKDEC 307
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--VKSLEEML 395
+LP NPL+ +S W C C +N R + + S L V ++ + P + E +L
Sbjct: 308 LGHLLPRNPLNYESSWICDKCQTSVNHRQIECIHSGLNTFVSNVMYKTPREIQRFTEAIL 367
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
KL +Y +K+ +I +G P + L+ L K +Y
Sbjct: 368 PKLVPAI---NYIMLDIKYCIISYFGRVPDLKWEDLTDVELRIKRTY 411
>gi|350424241|ref|XP_003493732.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 498
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 116/429 (27%), Positives = 195/429 (45%), Gaps = 55/429 (12%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHKPLNPNLADN 113
P+ I+SS L GR L AT D+ + I +SPL+VGPK + +C+ C+K N
Sbjct: 32 PWTIRSSLLGGRGLFATRDIKQNELIFIDSPLLVGPKYIGKYFEMCVSCYK----NECAL 87
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETL-LAYNDYKYEAILPLRCLILQ 171
C R C P CS +C S H + EC LK T ++ A++P+R L L
Sbjct: 88 FPCDRGCGLPVCSTQCEN-SPKHVNY-ECEYLKSLVPTCGTDWSPNLLLAVVPIRALFLT 145
Query: 172 RRSPKK------------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
+ K Y E++ ++ +++ P E E + + + L +N E
Sbjct: 146 EQQRKCLATLQSDKNLTCYPEIEQLKKNVTN-SPSEEDMELMKHMCRILNTNSFE----- 199
Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV-- 277
I V+ + +GLYS+ H C+PN +H + +++V
Sbjct: 200 ---------------TIMVHDKEHSVSLRGLYSIASFQNHCCVPNTRHHFDGEFRMYVSA 244
Query: 278 ----FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
I++ YT+ W T RR+ L+ITK+F+C C+RCSDPTE + A+ C +
Sbjct: 245 ALPIAAGEEITSTYTSLFWDTTLRRKFLSITKHFSCMCKRCSDPTEFKSRLGALLCAS-- 302
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
D+ +LP++PL+ + W C C+ LN R + + S + + +++ + P + + +
Sbjct: 303 -DKCSGELLPLDPLNMTTPWICDKCAITLNHRQICSIRSGIAGIIKEVLCKTP--REIYK 359
Query: 394 MLTKLEAMFHPHHYHCYA-VKHSLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATD 452
L K ++ P +C +K ++ YG G + LS + L K + + L D
Sbjct: 360 FLQKELSLLTPWSNYCIVDMKFRIVSYYGRSDGAKWEDLSDTELNLKAKFC-NDLLSVLD 418
Query: 453 EPICLGCHK 461
+ C C K
Sbjct: 419 DLNCGDCRK 427
>gi|345480417|ref|XP_001607394.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
Length = 557
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 187/407 (45%), Gaps = 49/407 (12%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
YE+ S +LGRYL A +L G+ I RE L+VGP + + C G + A NA
Sbjct: 15 YELAYSDVLGRYLVAAKNLSAGEVIFREDALVVGPAMFANDVFCFGLNPAKLATRAPNAA 74
Query: 116 CPRCFWPACS-----------ARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAIL 163
PR + L D+ H+ EC +L+ + +L + ++
Sbjct: 75 SPRSAIELARYIRILHVSHSLFQRDALQDSGLHSTAECDLLRAKRDVMLENMEDLMHTLV 134
Query: 164 PLRC-LILQRRSPKKYQE-LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL 221
L+ L++ S + +E L MEAHM++R G+EV+ E ++ V + F G V
Sbjct: 135 HLKLWLVMSAGSDQDVRERLSKMEAHMNER-RGSEVWFEREANVVDVFRRF-----GLVR 188
Query: 222 DDTSAKCLHWICGVIEVNGVDIGR------------YTQGLYSVICLMEHNCLPNAKHSN 269
+ A+ + W+CGV++VN ++ +G++ LM H C A +
Sbjct: 189 SENEAELVQWLCGVLDVNSFELRTPVPGSNGNNGSPLLRGIFLEAALMAHACRGTAHIAV 248
Query: 270 MMQSKLFVFRDT------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
+ ++ V+ I+ YT++L GT R+EHL + KYF C C C DP E G++
Sbjct: 249 DDRFQMTVYAAVPIPAGETIAFNYTSSLLGTIERQEHLQVGKYFRCECSMCVDPLEHGSF 308
Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDND-----SDWTCGSCSARLNARDVHLVTSQLGEQV 378
S + C KD ++ N D D + W C C R+ + T ++G+ +
Sbjct: 309 VSCILCPRCKKD----YVAIQNARDKDPYGRKTKWQCRKCK-RIFRGCLVKSTIEIGKDL 363
Query: 379 DKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
+ V + N K++E +L +L + FH +H+ +K ++ +Y + G
Sbjct: 364 IENVGDG-NSKAMESLLVRLSSTFHTNHFVMLTLKQKVLAVYRKEVG 409
>gi|328785466|ref|XP_001121590.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 526
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 182/391 (46%), Gaps = 68/391 (17%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L Y++ S LGRYLQA+ DL G+ I RE P+ VGP + +PIC C L PN+ ++
Sbjct: 15 LKYKVAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCVKDPICFECLSIL-PNIEED 73
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKL---GCETLLAYNDYKYEAILPLRCLIL 170
+ CSG C ++ L CE Y+ + +L
Sbjct: 74 V----------NYVCSG----------CNVVTLCGVTCEERGIYHSALW---------LL 104
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRS-NFLEKLPGDVLDDTSAKC 228
+++ P+ ++ + +E+H++KR T V+E+ + +IV L+S NFLE D + ++
Sbjct: 105 KQKDPELWKRVLSLESHLNKR-RNTIVWEDREINIVNVLKSLNFLEN------DPSVSEM 157
Query: 229 LHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
+ +CG+++VN ++ G +GLY +M H+C N + L V+
Sbjct: 158 IQQLCGILDVNSFELRSPGGMDGLLLRGLYLEASMMAHDCRGNVHVTADDNFHLTVYASI 217
Query: 282 HIST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
I YT++L GT RRE+L KYF C C+ C DP E+G+Y S + C
Sbjct: 218 PIKEGDTIFFNYTSSLLGTTGRREYLRTGKYFECECDLCKDPYEMGSYMSCIVC--PRCR 275
Query: 336 QGDCWI---LPVNPLDNDSDWTCGSCSARLNA---RDVHLVTSQLGEQVDKLVQENPNVK 389
QG + L P + W C C ++ + ++ L + VD+ N+K
Sbjct: 276 QGYVGMQDPLTKFPYTKSAKWQCNKCKKIISGCLIKTSLSISKNLVDDVDEY-----NIK 330
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLY 420
+E + KL H +H+ ++K L+ +Y
Sbjct: 331 VMENLKDKLSKSLHKNHFLMLSLKQKLLNVY 361
>gi|170044116|ref|XP_001849704.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867385|gb|EDS30768.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 502
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 168/379 (44%), Gaps = 56/379 (14%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+++ + L R++ A DL G++I E PL+ GP A+ CL C + + +
Sbjct: 8 FQLLHTEQLDRFVVAKRDLPRGEQILLEEPLVTGPYWD-ADVSCLSCLRDSCSTCSKCGK 66
Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK--LGCETLLAYNDYKYEAILPLRCLILQR 172
P C+ D H A EC + + YN + I P+RCL+L R
Sbjct: 67 APLCY------------DCTEHDATECDFFRESTALDKHFLYNHFN--VITPVRCLLLYR 112
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD----TSAKC 228
++ + E+ ME+H+ +R G+E+++ D KY+ + LE D L+D + +
Sbjct: 113 KNRDLFDEMMRMESHVEER-RGSEIWQIHD---KYVVTPLLEAGILDSLNDGELEVTGEL 168
Query: 229 LHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
+ ICG+++VN +I +GLY LM HNC+PN S L VF
Sbjct: 169 IQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLSIDGVGNLRVF 228
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGD 338
+ GT R+ HL KYF C C RC DPTELGT+ S++KC D+G
Sbjct: 229 ----------TSAPGTFERQTHLRQGKYFTCYCRRCKDPTELGTHLSSIKCT--ECDEGL 276
Query: 339 CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL 398
C P P W C C L V+ V ++ + E V+ LE ++ K
Sbjct: 277 CSFYPSEP-----RWECNKCRKLLKREYVNEVMCAARSEIMECALE---VRDLERVIGKH 328
Query: 399 EAMFHPHHYHCYAVKHSLI 417
+PHH K +L+
Sbjct: 329 SRTLNPHHSLVLEAKQTLL 347
>gi|91088045|ref|XP_966477.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012081|gb|EFA08529.1| hypothetical protein TcasGA2_TC006182 [Tribolium castaneum]
Length = 481
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 107/407 (26%), Positives = 183/407 (44%), Gaps = 53/407 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P+ IK + R++ A + G+ I E+P + GPK + +CLGC + ++ A
Sbjct: 5 PFTIKQTDQ-NRHVVANRLIKSGEIILTETPFVYGPKPG-SPVLCLGCFECVDSKTA--- 59
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--------KYEAILPLR 166
C +C WP C C LS + EC I + + +ND +YE I PLR
Sbjct: 60 -CSKCSWPVCGPECEFLSSHKDN--ECEIF---AKNRVKFNDVDDPYDVCLQYEGITPLR 113
Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
L+ + + PK++ E G ++ + ++V ++R N + +D
Sbjct: 114 MLLAKEKDPKRWLE----------EIGGLKIRKNNVNVVDFVR-NVCK------MDQFCD 156
Query: 227 KCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
+H +CG++++N +++ G + LY + + HNC+ N + + R
Sbjct: 157 DLIHTVCGILDLNAIEVRAPSGYLIRCLYPKLSQISHNCVTNLFQTVTPEDFKITLRASG 216
Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ + Y LW T RR+ L K +C C+RC D TEL T+ S +KC D
Sbjct: 217 NISENQELFYNYVYPLWPTLIRRDFLKENKNLDCCCKRCGDKTELRTHLSTLKC--SKCD 274
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPNVKSLEE 393
G +L + L + DW C C + NA V V + +++ ++V ++ E
Sbjct: 275 NG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEGIEQREA 332
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
+ K ++FHP + + ++ L QLYG PGY +L L E K+
Sbjct: 333 IFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 379
>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 490
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 127/441 (28%), Positives = 199/441 (45%), Gaps = 71/441 (16%)
Query: 30 KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVG 89
KEV H KE+ + + + Y S L GR + AT D+ + I ++PLIVG
Sbjct: 12 KEVEEVFISHLKENKIFQEQQLWTVGY----SSLGGRGMFATRDIKQNELIFIDAPLIVG 67
Query: 90 PK-LALAEPICLGCHK---PLNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAIL 144
PK L +C+ C+K PL P C + C P CS +C ++ H EC L
Sbjct: 68 PKCLNKQTKMCICCYKNECPLFP-------CDKGCGLPVCSIQCE---NSPNHVNECEYL 117
Query: 145 KL---GCETLLAYNDYKYEAILPLRCLIL---QRRSPKKYQELKDM----EAHMSKRG-- 192
+ C T N A++P+R L + QR+ Q K++ E + K+
Sbjct: 118 RSLIPTCGTDWCLN--LLLAMIPIRGLFMTKMQRKCLATLQYNKNLFSKYEIELLKKNVM 175
Query: 193 --PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT--Q 248
P E E ++ + + +N E + C+H D +T +
Sbjct: 176 NSPSDEDIEMMERVCRAFNTNSFETI-----------CVH-----------DKDHFTNLR 213
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS-----TM-YTNALWGTQPRREHLA 302
GLY + L H C+PN +H + +L+V IS TM YT+ W T RR+ L
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273
Query: 303 ITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARL 362
+TK F+C C+RCSDPTE + SA+ C + D+ +LP NPL+ + W C CS +
Sbjct: 274 VTKNFSCMCKRCSDPTEFNSKLSALLCAS---DKCSGELLPKNPLNMRTSWICNKCSITV 330
Query: 363 NARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH-HYHCYAVKHSLIQLYG 421
N R + + S L +++++ + P + + + L K ++ P +Y VK +I YG
Sbjct: 331 NHRQICSIRSGLAAIIEEIIYKTP--REIFKFLEKDLSIIVPQSNYLIMDVKFRIISYYG 388
Query: 422 TQPGYAYTQLSSSLLERKISY 442
+ LS L+ K +Y
Sbjct: 389 RSNDLQWKDLSDIELDIKANY 409
>gi|170056789|ref|XP_001864189.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876476|gb|EDS39859.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 508
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 170/387 (43%), Gaps = 55/387 (14%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
YEI S LGRYL A DL G+ I E P +VGP ++ CL C + R
Sbjct: 8 YEILESDRLGRYLVAKRDLARGELILVERPTVVGPHWD-SDVCCLNCFRNSCTICRLCRR 66
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQRR 173
P C+ C+G +A EC + E+ L N + + +RCL+L R
Sbjct: 67 APLCY------DCTGHDEA-----ECRFYR---ESNLDINFLFNHFNVVTSVRCLLLYRT 112
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL---- 229
++ E++ MEAH+ KR G+E++ D + KY+ L+ D + D + L
Sbjct: 113 DRDRFWEMRRMEAHLEKR-RGSEIW---DILSKYVVEPLLQNGAFDRVGDGEGEKLTVNE 168
Query: 230 ---HWICGVIEVNGVDIGR-----------YTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
CGV +VN +I + +GLY LM H+C+PNA S +S L
Sbjct: 169 DLIQHCCGVFDVNAFEIRGNGTSQSVQQNCFVRGLYPRAALMAHDCVPNAFISVDGESNL 228
Query: 276 FVF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
V+ + + YT L+GT RR HL KYF C+C RC DPTELGT+ +++C
Sbjct: 229 RVYAAVPIEKGQMVLFNYTRCLFGTFERRAHLRKGKYFLCTCARCEDPTELGTHLCSVRC 288
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
N C + + W C C L V V + + + + +
Sbjct: 289 TN-------CSAGLCSYYSKVTKWKCDKCPHELTREYVKQVFTN---ARNDAMARSFEIP 338
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSL 416
LEE +T+ A +P + +K +L
Sbjct: 339 ELEEAITRHSATLNPRNSLVLELKQTL 365
>gi|170065144|ref|XP_001867817.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167882269|gb|EDS45652.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 506
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 166/379 (43%), Gaps = 59/379 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+++ + L R++ A DL G++I E PL+ GP A+ CL C + + +
Sbjct: 8 FQLLHTEQLDRFVVAKRDLPRGEQILLEEPLVTGPYWD-ADVSCLSCLRDSCSTCSKCGK 66
Query: 116 CPRCFWPACSARCSGLSDAHTH-APECAILK--LGCETLLAYNDYKYEAILPLRCLILQR 172
P C+ D H EC + + YN + I P+RCL+L R
Sbjct: 67 APLCY------------DCSEHDTTECDFFRETTALDKHFLYNHFN--VITPVRCLLLYR 112
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD----TSAKC 228
++ + E+ ME+H+ +R G+E+++ D KY+ + LE D L+D + +
Sbjct: 113 KNRDLFDEMMRMESHVEER-RGSEIWQIHD---KYVVTPLLEAGILDSLNDGELEVTGEL 168
Query: 229 LHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
+ ICG+++VN +I +GLY LM HNC+PN S L VF
Sbjct: 169 IQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLSIDGVGNLRVF 228
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGD 338
+ GT R+ HL KYF C C RC DPTELGT+ S++KC D+G
Sbjct: 229 ----------TSAPGTFERQTHLRQGKYFTCYCRRCKDPTELGTHLSSIKCT--ECDEGL 276
Query: 339 CWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKL 398
C P P W C C L V+ V ++ + V+ LE ++ K
Sbjct: 277 CSFYPSEP-----RWECNKCRKLLKREYVNEVMCAARSEIMEC------VRDLERVIGKH 325
Query: 399 EAMFHPHHYHCYAVKHSLI 417
+PHH K +L+
Sbjct: 326 SRTLNPHHSLVLEAKQTLV 344
>gi|442615898|ref|NP_001259436.1| CG11160, isoform C [Drosophila melanogaster]
gi|440216646|gb|AGB95279.1| CG11160, isoform C [Drosophila melanogaster]
Length = 369
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 151/323 (46%), Gaps = 37/323 (11%)
Query: 163 LPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEV---YEEIDSIVKYLRSNFLEKLPGD 219
+ +R L+L++ P+++ + ME+H +R + YEE +V+ LR +
Sbjct: 1 MIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEE--KVVQRLRVTW------- 51
Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGR---YTQGLYSVICLMEHNCLPNAKHSN------- 269
L+D A+ +H +CG+++VN +IG+ + LY L+ H+C PN H++
Sbjct: 52 QLEDLEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPSSFEI 111
Query: 270 MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
++++ V ++ Y L GT RR + K F C C RCSDP ELGT SA+ C
Sbjct: 112 LLRTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 171
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
+ V+PL DW C C+ ++ A +V ++ ++ + + ++
Sbjct: 172 ATCRTGS----VRAVDPLQQTGDWACDRCAHKMGATEVERQLDRINNDLEDI--DVHDIP 225
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY------- 442
LE L + + P+HY + K+SL Q+YG GY Q+S + RK SY
Sbjct: 226 GLENFLLRYRDVLRPNHYLLLSAKYSLCQIYGRTEGYLLPQMSPEDIARKESYCREFLEI 285
Query: 443 --VMSPNLKATDEPICLGCHKPL 463
V+ P L I H P+
Sbjct: 286 VDVLDPGLTRLRGLIMYELHAPV 308
>gi|237651911|gb|ACR08644.1| SET and MYND domain containing 1, partial [Drosophila silvestris]
Length = 246
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 135/248 (54%), Gaps = 31/248 (12%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC A C+ CK V YC +HQK+HW HK CK P++++ P LGRYL AT ++
Sbjct: 8 CAVCNVAAPLTCTRCKLVRYCNGEHQKQHWPAHKTCCK--PFKVQDDPQLGRYLVATQNI 65
Query: 75 HPGDRIARESPLIVGPKLALAEP-------ICLGCHKPLNPNLADNARCPRCFWPACSAR 127
I E PL+VGPK L + C+ C+ P N C C WP CS
Sbjct: 66 QAKQIIFVEEPLVVGPKWFLTDDEKSANIVPCVACYTPCRIN---KHLCRSCRWPVCSIS 122
Query: 128 CSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPKKYQEL 181
C EC+IL LG + ++ Y+++A+L L+CL+LQR+ P+++ L
Sbjct: 123 CEH------EKFECSILSLGSPPSGKSDARGLHDYYRHDALLVLKCLLLQRQHPERWTAL 176
Query: 182 KDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPG-----DVLDDTSAKCLHWICGV 235
+M++H +R GTE+++E + IV YL+ FL+++ G +L + A+ LH +CG+
Sbjct: 177 LEMQSHEEERV-GTELHQEAEQRIVSYLQQRFLQRIKGAKNRESLLSEYRAELLHRLCGI 235
Query: 236 IEVNGVDI 243
IE N + I
Sbjct: 236 IETNYMVI 243
>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 501
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 115/428 (26%), Positives = 196/428 (45%), Gaps = 57/428 (13%)
Query: 37 KQHQKEHWKLH--KPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LA 93
++H H L +P C ++I++S L GR + A+ D+ + + ++ ++ GP+ +
Sbjct: 11 RRHLVRHGLLEAAQPTC----WKIENSKLSGRGVIASRDIAVDELVYADAAVVQGPRCYS 66
Query: 94 LAEPICLGCHKP---LNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCE 149
P+C+ C+K L P C R C P CS C +D H ECA LK G E
Sbjct: 67 KYLPLCVSCYKTGCVLFP-------CDRGCGLPVCSEVCQ--NDPHHVEAECAYLK-GLE 116
Query: 150 TLLA--YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKY 207
++ +A++P+R L+L + + + L+ EA R E+D
Sbjct: 117 PTCGSGWSLELLQAVVPIRALMLPKELTRVFYSLQCHEAAQHGR--------EVD----M 164
Query: 208 LRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHN 260
L+SN + GD+ D + + +C ++ N + G+ T + L+ V L H
Sbjct: 165 LKSN----VKGDI-DADDEELMRRVCRAMDTNSFETAVIRGGKSTSLRALFPVGALTNHQ 219
Query: 261 CLPNAKHSNMMQSKLFVF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
C+PN +H + +L V+ I+ Y + LW TQ RR L TK+F C C RC
Sbjct: 220 CVPNTRHIVNAEGELLVYAAVPIAEGQEITMSYADVLWDTQMRRHFLLATKHFACQCPRC 279
Query: 315 SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQL 374
+D TE G+ A+ C ++ QG+ +LP PL + WTC C + R + + + +
Sbjct: 280 TDVTECGSLLGALSCAIDNC-QGN--LLPNTPLSLNCPWTCNECGLIIKGRQISSIRAGI 336
Query: 375 GEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSS 434
++ ENP K L + +L + +Y + ++ L+G G +++L+
Sbjct: 337 IAATEEFKYENPR-KVLRFVNKELSLLVPETNYVMMEMMFRIVSLFGRAEGCTWSELTDE 395
Query: 435 LLERKISY 442
L KI Y
Sbjct: 396 ELNIKILY 403
>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
Length = 492
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 173/390 (44%), Gaps = 56/390 (14%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
Y++ + + GR++ A+ ++ G I +E+P+++ P++ IC C L
Sbjct: 8 YQVAKNEIFGRFIVASKNIKQGQLILKENPVLLCPQVG-GPIICFNCCAQLKKLFF---- 62
Query: 116 CPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDY--KYEAILPLRCLILQR 172
CP C C+ C + + EC LK + ++D E + PLRCL+ ++
Sbjct: 63 CPDCRIAVLCNPNCK---NQFHNLQECTALK---SLPIKHDDVLSNPEIVTPLRCLLWRQ 116
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPG-DVLDDTSAK--CL 229
+ + + +EAH++ R I + R N + L +LD++ K +
Sbjct: 117 FNKNQLENFLQLEAHLNFRRDS--------QIWRRNRVNIEDVLFKLKILDESDLKEEVV 168
Query: 230 HWICGVIEVNGVDI------------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
ICG++++N D+ +GLY LM H+C+ N + L+V
Sbjct: 169 QKICGILDINTFDVRQPQRNRLGFNQAENLRGLYLRAALMAHDCVANTHLAVDDDFVLYV 228
Query: 278 F------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
+ I YT+ L G R+ L K+F C C RC D TE GT S++KC
Sbjct: 229 HAAVDIPEGSPIYFNYTDVLQGNDERKRRLLNAKHFECQCSRCRDSTEFGTEMSSLKCPR 288
Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
HK +L NP W C SC + + + ++ ++D+ E P +++L
Sbjct: 289 CHKG-----LLRPNP------WQCSSCQRIFHDSLIKMTVTECRRRIDEF--ETPTIENL 335
Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
E+ L KL FHP HY V+ +L++LY
Sbjct: 336 EKFLKKLSFTFHPQHYLVLEVEQNLVRLYA 365
>gi|195479030|ref|XP_002086547.1| GE22790 [Drosophila yakuba]
gi|194186337|gb|EDW99948.1| GE22790 [Drosophila yakuba]
Length = 395
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 150/296 (50%), Gaps = 30/296 (10%)
Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN---FLEK 215
Y I+ LRC+ L+ + P+ + +L +E H+ +R T +Y+ LR+N F++
Sbjct: 24 YCVIMLLRCMQLKAKDPEAFLKLYTLEDHLKERLE-TPLYQ-------VLRANLITFIKT 75
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIG-----RYTQGLYSVICLMEHNCLPNAKH--- 267
+ G + D + I +++ N ++ R + LY ++ H+C+PN +H
Sbjct: 76 VLG--MKDWLEMDILRIAAILDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFD 133
Query: 268 ---SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYF 324
+ + +K + + +S YT L T RR HL K F+CSC RC DP ELG++
Sbjct: 134 DDMNIVFLAKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFA 193
Query: 325 SAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE 384
A CL + I+ +NPL N + W C C+ + +A++V ++L ++++ L +
Sbjct: 194 GAQTCLKCKAGK----IISLNPLLNSAPWKCQLCNYKRSAKEVVTSDAELQQELESLDKT 249
Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
P +LEE + + A H + H K++L QLYG+ PG+ +LS L RK+
Sbjct: 250 TP--VALEEFIYRHRADLHETNTHVLQAKYALTQLYGSAPGFGMEELSEESLNRKV 303
>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 485
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 196/444 (44%), Gaps = 77/444 (17%)
Query: 30 KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVG 89
KEV H KE+ + + + Y S L GR + AT D+ + I ++PLIVG
Sbjct: 10 KEVEEVFVSHLKENKIFQEQQLWTVGY----SSLGGRGMFATRDIKQNELIFIDAPLIVG 65
Query: 90 PKLALA-EPICLGCHK---PLNPNLADNARCPR-CFWPACSARCSGLSDAHTHAPECAIL 144
PK + +C+ C+K PL P C + C P CS +C ++ H EC L
Sbjct: 66 PKCSNKHTKMCVCCYKNECPLFP-------CDKGCGLPVCSTQCE---NSPNHVNECEYL 115
Query: 145 KL--------GCETLLAYNDYKYEAILPLRCLIL---QRRSPKKYQELKDM----EAHMS 189
+ C LL A +P+R L + QR+ Q K++ E +
Sbjct: 116 RSLVPICGTDWCLNLLL-------ATIPIRGLFMTKMQRKCLATLQYNKNLFSKYEIELL 168
Query: 190 KRG----PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGR 245
K+ P E E ++ + + +N E I V+ D
Sbjct: 169 KKNVMNSPSDEDIEMMERVCRAFNTNSFE--------------------TIRVHDKDHST 208
Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST------MYTNALWGTQPRRE 299
+GLY + L H C+PN +H + +L+V IST YT+ W T RR+
Sbjct: 209 NLRGLYPLGSLQNHCCIPNTRHYFDEEFQLYVRAALPISTGEEITMSYTSLFWDTTLRRQ 268
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
L +TK F+C C+RCSDPTE + A+ C ++ K G+ +LP NPL+ + W C C+
Sbjct: 269 FLNVTKNFSCMCKRCSDPTEFNSKLGALLCASD-KCCGE--LLPKNPLNMRTSWICNKCT 325
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH-HYHCYAVKHSLIQ 418
+N R + + S L +++++ + P + + + L K ++ P +Y VK +I
Sbjct: 326 ISVNHRQICSIRSGLAAIIEEVIYKTP--REIFKFLQKDLSIIVPQSNYLIMDVKFRIIS 383
Query: 419 LYGTQPGYAYTQLSSSLLERKISY 442
YG + L+ + L+ K +Y
Sbjct: 384 YYGRSDDLQWKDLTDTELDIKANY 407
>gi|157124069|ref|XP_001660316.1| hypothetical protein AaeL_AAEL009751 [Aedes aegypti]
gi|108874117|gb|EAT38342.1| AAEL009751-PA [Aedes aegypti]
Length = 500
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 166/380 (43%), Gaps = 48/380 (12%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EI S LGRYL A DL G +I E P++VGP + CL C + R
Sbjct: 9 EILESDELGRYLVAKRDLARGQQILVEQPIVVGPYWD-SGIRCLNCFRNSCTICRKCKRA 67
Query: 117 PRCFWPACSARCSGLSDAHTH-APECAI-LKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
P CF D TH EC L G ++ + + P+RCL+L +
Sbjct: 68 PLCF------------DCTTHNETECEFYLNSGLNINFLFDHFN--VVTPVRCLLLASSN 113
Query: 175 PKKYQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSNFLEKLPGDVLD-DTSAKCLHWI 232
K+ E+ MEAH +R T+++ + +V+ L N + G + + +++ + I
Sbjct: 114 RAKFDEMMTMEAHTEER-RDTDIWNAHEKKVVRPLLEN--KAFDGKMENLKVTSELIQRI 170
Query: 233 CGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH 282
CG +VN +I +GLY LM HNC+ N S + L ++ T
Sbjct: 171 CGFFDVNSFEIRGNMDDAQQEESLIRGLYPKAALMAHNCVSNTLISVDGDANLRLYVTTP 230
Query: 283 IST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+ YT +L+GT RR HL + KYF C+C RC DPTELGT+ S++KC D
Sbjct: 231 VKKGEMLFYNYTRSLFGTFERRTHLKLGKYFVCTCSRCEDPTELGTHLSSVKCTG--CDD 288
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
G C ++ W C C+ L V+ V S + + N+ LE ++
Sbjct: 289 GLCSYYA-----EENKWRCNKCAKELEETYVNNVLSDARKDAFECAN---NIDELERVIA 340
Query: 397 KLEAMFHPHHYHCYAVKHSL 416
K HP + +K SL
Sbjct: 341 KHSETLHPCNSIILELKQSL 360
>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 494
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 183/409 (44%), Gaps = 54/409 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK-LALAEPICLGCHK---PLNPNLA 111
+ I+ S L GR + AT D+ + I ++PLIVGP+ + +C+ C+K PL P
Sbjct: 33 WTIRYSNLGGRGMFATRDIAANELIFIDAPLIVGPRCVGKHLEMCVCCYKSECPLFP--- 89
Query: 112 DNARCPR-CFWPACSARCSGLSDAHTHAPECAILKL---GCETLLAYNDYKYEAILPLRC 167
C R C P CS C S H + EC L+ C T + N A++P+R
Sbjct: 90 ----CDRGCGLPICSLECEN-SPKHVNY-ECNYLRSLVPTCGTDWSLN--LLLAVIPIRA 141
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
L + + K L+ + S+ YE + L+ N +EK P + +
Sbjct: 142 LFMTEQQRKCLAALQCDQTLTSQ-------YE-----INLLQRN-VEKSPSE----ADVE 184
Query: 228 CLHWICGVIEVNGV--------DIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
+ +CG N D +GLY + L HNC PN +H +L+V
Sbjct: 185 LMKRVCGAFNSNSFETVSVLDKDHSTSLRGLYPLGALQNHNCAPNTRHHFDENYRLYVSA 244
Query: 280 DTHIS-----TM-YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
IS TM YT+ W T RR+ L+ITK+F+C+C+RCSDPTE G+ A+ C
Sbjct: 245 TVPISAGEELTMSYTSLFWDTTLRRQFLSITKHFSCTCKRCSDPTESGSMLCALLC---A 301
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
D +LP +PL + W C C +N R + + S L ++ + ++P + +
Sbjct: 302 FDNCSGNLLPRDPLRIGTPWVCDMCGKSINYRQICSIRSGLAAITEEALYKSPR-EIFKF 360
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
M +L + +Y +K +I YG G + L+ L+ K Y
Sbjct: 361 MQKELFRIVPLSNYLIIDMKFRIISYYGRTEGLEWADLTDVELDTKSEY 409
>gi|170061022|ref|XP_001866059.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879296|gb|EDS42679.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 495
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 112/400 (28%), Positives = 172/400 (43%), Gaps = 64/400 (16%)
Query: 87 IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAIL-- 144
+VGPK A + P+CLGCH+ L+ N C C WP C + C + H EC +L
Sbjct: 1 MVGPKTA-SVPVCLGCHRNLDATKQFN-NCQHCGWPMCGSSCE---TSRYHREECQLLAA 55
Query: 145 -----KLGCETLLAYNDYK-YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
++G + Y Y I+PLR ++L+R P+ + ++ ME+H+ R T +Y
Sbjct: 56 KNYLPQIGFDECNPGKKYSAYCTIVPLRAILLKRTDPQLWAKVVSMESHLETRK-NTPLY 114
Query: 199 EEIDS-IVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYS 252
I S +V ++R + + + S + L + G+ + N +I + LY
Sbjct: 115 AAIRSNVVPFVR------MVLGLRTEASEEELMEVTGIFDTNSFEIRIPEKSIKIRALYE 168
Query: 253 VICLMEHNCLPNAKHSNMMQSKLFVF-------------------------RDTHISTMY 287
+ LM H C PN +H + L + +D IS Y
Sbjct: 169 LGALMNHCCQPNTRHYFDEEFNLVMIAAGRILEQLQDRTNSYFELLSVDIPKDEMISISY 228
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
L T RR + K F C C RCSDPTE TY ++ C K ++ VNPL
Sbjct: 229 AQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCPQCSKS----MVVAVNPL 284
Query: 348 DNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLV-QENPNVKSLEEMLTKLEA--MF 402
D SDW C CS R +A+ ++ +L P+ E +L + ++ +
Sbjct: 285 DFRSDWRCEDKKCSYRESAQQYITRNEAFRAEIGQLSCGAGPD--GYECLLGRFQSTDVL 342
Query: 403 HPHHYHCYAVKHSLIQLYGTQPGYAYT--QLSSSLLERKI 440
HP + + VK++L QLYG LS L RK+
Sbjct: 343 HPWNTNVLQVKYALTQLYGGGGPGGINLKDLSEKQLRRKV 382
>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
Length = 487
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 172/384 (44%), Gaps = 52/384 (13%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-CPRC-FWP 122
G ++ A+ DL PG+ I +E PLI+GP + + +CLGC + + L + + C C P
Sbjct: 3 GTFMVASKDLKPGEIILKEQPLILGPSIN-SNILCLGCFRRI---LFEKFKFCRECQIAP 58
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY---KYEAILPLRCLILQ-------- 171
C CS L H+ EC K E ++ ++ +LPL+C L+
Sbjct: 59 VCGEDCS-LKSQHSKY-ECKYYKKLFENDKIDKNFIVSNFQIVLPLKCFSLKLFLFPDDD 116
Query: 172 RRSPKKYQELKDMEAHMSKRG--PGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ +ME+H+ +R P Y+E +I+K L S ++L+D +
Sbjct: 117 DDDDHLWDNFINMESHVDERQNEPIWNYYQE--NIIKVLNS-------VNILNDEECLLM 167
Query: 230 HWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHS------NMMQS 273
ICG+++VN ++ G +G+Y M H+C+ N S + +
Sbjct: 168 QKICGILDVNSFEMRCPDDEQISAGNQLRGIYREAAFMSHDCIGNTHVSLNDFFEMTIHA 227
Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
+ + I Y N T R++ L KYFNC C+RC+DPTELGTY ++ C H
Sbjct: 228 SVPIKSGQAIFFNYANPFECTAARQQRLKKGKYFNCFCKRCTDPTELGTYSGSIICPRCH 287
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
+ I+ NPL + S W C C+ + + T+ G+ + V E N LE
Sbjct: 288 EG----CIISQNPLSDSSGWNCEKCNHFFYNQLIK-ATNVEGQNLLSDVDET-NSTQLEN 341
Query: 394 MLTKLEAMFHPHHYHCYAVKHSLI 417
+L K F ++Y +K LI
Sbjct: 342 LLKKCLKSFPQNNYVMTEIKQKLI 365
>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
Length = 497
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/416 (25%), Positives = 183/416 (43%), Gaps = 58/416 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
P+EIK S L G + AT + G+ I ++ P+I+GP+ A P+ C+ C+ + L
Sbjct: 27 PWEIKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGK 86
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL---PLRCLIL 170
+C CS +C + H EC +++ ++ D + + P+R L+L
Sbjct: 87 ----KCGLLVCSEKCE---KSLGHQKECKVVRQWQSKPIS-EDLSVQLVRVLSPIRSLLL 138
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+ LK AH S + E+D + L N + + K +
Sbjct: 139 GEEDKSVVKCLK---AHKSDQHGF-----EVDVMKDALALN---------VKEDEEKFMR 181
Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFV 277
++C V++ N ++ +GLY + L H+C PN K ++++ F+
Sbjct: 182 FVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ + I Y+ +W T RR HL TK+F C C+RC DPTE G+Y ++ C K
Sbjct: 242 PQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSILC----KVCK 297
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV--KSLEEML 395
++P N L D W C C + + +V +T LG ++ ++ V + L + L
Sbjct: 298 TGKVIPTNSLQTDK-WQCEVCGSLIKKEEVANITFVLGSALNSFDDDDVAVMLRLLHQKL 356
Query: 396 ---------TKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKISY 442
TK + ++ +K+ LI + G Q G+A+ LS L K+++
Sbjct: 357 PNLSCCDYSTKTAKLVPENNQTVVELKYKLIWILGHQQGFAWNDLSDEYLSLKVNF 412
>gi|198458396|ref|XP_001361024.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
gi|198136331|gb|EAL25600.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
Length = 489
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 166/377 (44%), Gaps = 47/377 (12%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A +++ G+ + E PL+V P A+ C C + C +C +P
Sbjct: 16 LGRHLVANINIDAGETLLEERPLLVAPHWECAQLKCAQCMEE------SYVMCRKCQVFP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
C CS +D EC G C+ LL N I L L+L P
Sbjct: 70 LC-MDCSQHNDF-----ECEFFASGAGSRICKDLLVKN----YGICALLKLLLLLEDPAT 119
Query: 178 -------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL--DDTSAKC 228
+Q D+E + G E E++ + L S +E LP L D A C
Sbjct: 120 RADCQTLFQRSVDLEDYRDGEGMWQEHEEQV--VRPLLASGLVEALPTQQLTADVLHAHC 177
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTH 282
+ C EV G D G +G++ + H+C+PN + NM + + + +
Sbjct: 178 VRIDCNAFEVTGRD-GETLKGVFICGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDV 236
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
I YTN L GT R+ L ++++ +CSC RC D TE+GT+ S+MKC C I
Sbjct: 237 IYNSYTNPLMGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRECSAGFSICEIG 296
Query: 343 PVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
P D +W C C+A L A DVH V ++ G D LV ++ E +L + + +
Sbjct: 297 P----DGSGNWRCVDCNAILPAVDVHEVLAEAG---DALVDAKGELQVYESLLVRYQQLL 349
Query: 403 HPHHYHCYAVKHSLIQL 419
HP+H+ +K ++ +
Sbjct: 350 HPNHFLLLDIKQNIASI 366
>gi|195154475|ref|XP_002018147.1| GL17551 [Drosophila persimilis]
gi|194113943|gb|EDW35986.1| GL17551 [Drosophila persimilis]
Length = 489
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 168/372 (45%), Gaps = 37/372 (9%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A +++ G+ + E PL+V P A+ C C + C +C +P
Sbjct: 16 LGRHLVANMNIDAGETLLEERPLLVAPHWECAQLKCAQCMEE------SYVMCRKCQVFP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPKK 177
C CS +D EC G C+ LL N Y A+L L L+ +
Sbjct: 70 LC-MDCSQHNDF-----ECEFFASGAGSRICKDLLVKN-YGICALLKLLLLLEDPATRAD 122
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVK--YLRSNFLEKLPGDVL--DDTSAKCLHWIC 233
Q L + ++ G +++E + V L S +E LP L D A C+ C
Sbjct: 123 CQTLFERSVNLEDYRDGEGMWQEHEEQVVGPLLASGLVEALPTQQLTADVLHAHCVRIDC 182
Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTHISTMY 287
EV G D G +G++ + H+C+PN + NM + + + + I Y
Sbjct: 183 NAFEVTGRD-GETLKGVFVCGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDVIYNSY 241
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
TN L GT R+ L ++++ +CSC RC D TE+GT+ S+MKC C I P
Sbjct: 242 TNPLMGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRECSAGFSICEIGP---- 297
Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
D +W C C+A L A DVH V ++ G D LV ++ E +L + + + HP+H+
Sbjct: 298 DGSGNWRCVDCNAILPAVDVHEVLAEAG---DALVDAKGELQVYESLLVRYQQLLHPNHF 354
Query: 408 HCYAVKHSLIQL 419
+K ++ +
Sbjct: 355 LLLDIKQNIASI 366
>gi|270011595|gb|EFA08043.1| hypothetical protein TcasGA2_TC005637 [Tribolium castaneum]
Length = 319
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 75/247 (30%), Positives = 122/247 (49%), Gaps = 18/247 (7%)
Query: 205 VKYLRSNFL-------EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLM 257
++YL FL E+ +L+D S + L I ++ N + Y L+
Sbjct: 8 IEYLVQKFLTPLKKLEEETHQIILEDKSEETLGKIFNFLQSN-THHNKQLISFYPNASLL 66
Query: 258 EHNCLPNAKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFN--CS 310
+H+C PN + +++ V T ++ YT+ GT R ++L K N C
Sbjct: 67 KHSCTPNTIFTTENFEITLRAISGVSEGTPLTCSYTDLFSGTHVRLKNL---KKLNKTCF 123
Query: 311 CERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
C RC+DPTE GT+FSA+KC+ +D LPVNP + ++ W C C L + +
Sbjct: 124 CARCTDPTEFGTFFSALKCMGTKQDPCGGNQLPVNPTNEETLWVCDKCKIELPHKQIVKF 183
Query: 371 TSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQ 430
LG QV K + +++ LEE L KL HP+H++ +AVK++L++LYG + Q
Sbjct: 184 VKHLGGQVGKTLARKVSIEELEEFLGKLLMFLHPNHFYIFAVKNALVELYGEESLATKIQ 243
Query: 431 LSSSLLE 437
+ L++
Sbjct: 244 ICEELID 250
>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
Length = 325
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 115/229 (50%), Gaps = 31/229 (13%)
Query: 232 ICGVIEVNGVDIGRYTQG-----LYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
+ GV+EVN + R +G L+ ++ HNC+PN S + S+ + R + +
Sbjct: 3 VVGVLEVNAFE-ARSPKGYPLRCLFPYTGILAHNCVPNTARS-IYPSEGYKIRLRAMVDL 60
Query: 287 ---------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
YT L GT R++HL K+F C CERC DPTELGT+FS++KC G
Sbjct: 61 EEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKC-------G 113
Query: 338 DC---WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP----NVKS 390
C + +P P + D+ W+C C + + D + L +V+ VQ P ++
Sbjct: 114 QCAEGFQVPRQPTEPDTSWSCAGCGSDTSNADALAMLQSLQSEVNA-VQALPMAAKRLEE 172
Query: 391 LEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERK 439
+E +L K +++ HP H+ ++ LI++YG GY QL LERK
Sbjct: 173 IERLLRKYKSLLHPLHFIATGLRQLLIEMYGRVQGYEMVQLPDHQLERK 221
>gi|307211936|gb|EFN87850.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 331
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 96/338 (28%), Positives = 156/338 (46%), Gaps = 47/338 (13%)
Query: 49 PKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL-ALAEPICLGCHK--- 104
P+ + + I+ SPL GR + A D+ PG+ I + PL++GP+ + P+C+ C+K
Sbjct: 20 PQDESRSWRIERSPLGGRGMFAMRDIQPGELIFMDVPLLLGPRCHSKYLPMCVVCYKNDC 79
Query: 105 PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK-LGCETLLAYNDYKYEAIL 163
PL P D+ C P CS +C S H A EC IL+ L ++ ++
Sbjct: 80 PLFP--CDHG----CGLPICSTKCEN-SLMHARA-ECRILRELEPTCGSTWSKDLLLTVV 131
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
+R L L S ++ + L E H++ P E+ + L++ + D+
Sbjct: 132 LIRALAL---SSEQRRLLFAFECHLNL-APNYEI-------------DLLKRNITHLPDE 174
Query: 224 TSAKCLHWICGVIEVNGVDIGRYT--------QGLYSVICLMEHNCLPNAKHSNMMQSKL 275
+ ICG+ N ++ +G+Y + L H C+PN +H Q +L
Sbjct: 175 EQLALMKRICGIFNTNSFEVVAVRDKDRVVSLRGIYPIGALQNHCCVPNTRHHFDDQQRL 234
Query: 276 F------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+ + I+ YT+ LW T+ RR L +TK+F+C C RCSDP E G+ A+ C
Sbjct: 235 YMSAALPIAAGQEITMSYTDLLWDTRSRRCFLRVTKHFSCDCNRCSDPLEFGSQLGALLC 294
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV 367
E+ +LP + L +S WTC C ++ R V
Sbjct: 295 AKENCPGH---LLPCDSLSYESSWTCSECQTSVSHRQV 329
>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
Length = 515
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 175/393 (44%), Gaps = 61/393 (15%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL-NPNLAD 112
+ Y+++ S LGRYL A D+ G+RI + P ++GP + +C C+ PL N L
Sbjct: 3 VKYKVEKSEKLGRYLVAAKDIKAGERILSDQPFVLGPS-SDTSLVCFNCYLPLINKFLV- 60
Query: 113 NARCPRC-FWPACSARCSGLSDAHT---HAPECAI---LKL--GCETLLAYNDYKYEAIL 163
C C P C G D T + EC + LKL G + + ++L
Sbjct: 61 ---CKNCAVAPLCPG--DGCPDEFTKYHNRQECDVFRNLKLTKGISPMTMVQNVGSLSVL 115
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD 223
R L+ + + +++ ++E H+ +R + V++ D+ VK+++S G + +
Sbjct: 116 --RALLKKETNLLEWKLFMELETHLERRRE-SNVWQYYDNTVKFIQS------LGLLENG 166
Query: 224 TSAKCLHWICGVIEVNGVDI----------GRYTQGLYSVICLMEHNCLPNAKHS----N 269
+ + IC I+VN ++ +G+Y L+ H+C+ N S N
Sbjct: 167 QNQDLVQKICAAIDVNSFEVRGPPIPAIGCAEILRGVYLQAALLSHDCIANTHMSIDDNN 226
Query: 270 MM--QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
M+ + + + + I+ YT+ L GT PR++HL + KYF C+C RC+D TELGT+ S+
Sbjct: 227 MLVCHASVDIKKGESINYNYTDPLKGTIPRQQHLIVGKYFKCTCTRCTDNTELGTFMSSA 286
Query: 328 KCLNEHK----DQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQ 383
C K GD WI C C + + D+ ++++ V
Sbjct: 287 ICPGCKKGYITKNGDVWI-------------CFDCKKEVESMDIEPKIKCCSDKLE--VI 331
Query: 384 ENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
+ K LEE + + + P HY K L
Sbjct: 332 NKKDEKELEEYIKYVSLVLAPGHYLLLDAKQRL 364
>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 46/353 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNP-NLAD 112
P++I + GR L AT D+ + + + P++VGP++ E I C C L L
Sbjct: 61 PWDIATFEGYGRGLVATRDIAVNELVFVDRPVLVGPRVNNYEVIFCASCCSILRRLQLCT 120
Query: 113 NARCPRCFWPACSARCS-GLSDAHTHAPECAILK-LGCETLLAYNDYKYEAILPLRCLIL 170
C P CS RC + + HA ECA+++ + Y+ A+ +R +L
Sbjct: 121 GG----CRLPICS-RCDYSVGASSPHAAECALIQSWQPKDAGRYSKNILYALTSIRGFLL 175
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
R + + M P E+ EI+ + L+ + L G+ + + L
Sbjct: 176 SERD-------RSIVLQMEGHPPRKEMTTEIE---RLLKDGYFWNLDGE---GPAVRYLR 222
Query: 231 WICGVIEVNGVDIGRYT------------QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
+ ++ N + R +GLY + LM H C PN ++ + ++ V
Sbjct: 223 QVVNILNTNAFETSRIVADEENNQHEIILRGLYILGALMNHCCRPNVRYVFDGELRMRVH 282
Query: 279 ------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
+ I Y+ LWG+Q R HL +K F C C+RC DPTE GTY A+KC+ E
Sbjct: 283 ASRPIKKGEQIMNNYSKILWGSQHRIIHLCFSKNFLCCCDRCKDPTEFGTYLGALKCVRE 342
Query: 333 HKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
+G +LPVNPL S W C CS +++ ++ S++ E +++ N
Sbjct: 343 SCPEGR--LLPVNPLKISSVWRCDRCSLKMD----NIKASKIQEIASRMIMNN 389
>gi|290462845|gb|ADD24470.1| Protein msta, isoform A [Lepeophtheirus salmonis]
Length = 486
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/423 (22%), Positives = 189/423 (44%), Gaps = 58/423 (13%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK--LPYEIKSSPLLGRYLQ 69
+ C +C + CS CK+ C H++LH + + LP++++ + GR L
Sbjct: 1 MQTCFLCGSPVSNICSYCKDCYVCDT-----HFELHHQRELRICLPFDVEYDSVKGRILV 55
Query: 70 ATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR-----CPRCFWPAC 124
A+ D+ P + + + + GP L P+CLGC + + N CP+CF+P+C
Sbjct: 56 ASRDIKPEEVVLFDKAAVFGPNLL---PVCLGCLEKIKSTRGKNGELHYSVCPKCFFPSC 112
Query: 125 SARCSGLSDAHTHAP-ECAILKLGCETLLAYNDYK-----YEAILPLRCLILQRRSPKKY 178
+ +S+ H+ EC+I + D++ Y I LR ++ +R+
Sbjct: 113 GNK-ECISETSVHSKYECSIFA-ATGFKPKFKDFESPHFIYNVITILRLILEMQRNEDNR 170
Query: 179 QELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
+ + H +R + E++ +K + S ++ L + + + + GV++
Sbjct: 171 ILVNQLMTHHDQR---VNKFPEVEIGIKRITSFLMDDLN---IPNVNGDDVIQAFGVMKT 224
Query: 239 NGVDIGRYT-QGLYSVICLMEHNCLPNAKHSN------MMQSKLFVFRDTHISTMYTNAL 291
N + G +GLY + ++ H+C+ N + +++SK+ + + ++ Y +
Sbjct: 225 NAMTFGDGNGRGLYPIGSIISHSCVSNLEPLTEPMECLVLRSKISIKKGNELTIRYM-SF 283
Query: 292 WGTQPR-REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDND 350
W + + ++ L +FN CERC DP+++ T S+ C K + ++ P+N L+ +
Sbjct: 284 WSHRTQLKDSLNNQWFFNYECERCQDPSDMKTMLSSELC---SKCKQKSYLKPINSLNLN 340
Query: 351 SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE------MLTKLEAMFHP 404
+DW C C + +QV L+ E P +S + K E +FH
Sbjct: 341 TDWQCDFCQE-----------VRAKKQVCDLLAEFPEPESFTSTDEITFFIKKSEKVFHA 389
Query: 405 HHY 407
HH+
Sbjct: 390 HHF 392
>gi|195431056|ref|XP_002063564.1| GK21977 [Drosophila willistoni]
gi|194159649|gb|EDW74550.1| GK21977 [Drosophila willistoni]
Length = 487
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 169/396 (42%), Gaps = 43/396 (10%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
GR+L A LD+ GD + E PL+V P + C C + C C +P
Sbjct: 16 FGRHLVAQLDIEAGDTLVEERPLLVAPHWECGQMKCAKCLQE------SYVMCRSCKMFP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLG------CETLLAYNDYKYEAILPLRCLILQRRSPK 176
C C G D EC G C+ LL N Y A++ L+ + K
Sbjct: 70 LC-MDCQGHDDF-----ECQFFASGGPGGYICKDLLVKN-YGVCALMKFLLLLEDPSTRK 122
Query: 177 KYQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGD---VLDDTSAKCLHW 231
+ L + ++ G +++E + ++ + S L L DD A C+
Sbjct: 123 DCERLLETPINLDDYRDGDGMWQEHEELAVRPLMASGLLMSLTQAQSLTADDLHAYCIRI 182
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN--AKHSNMMQSKLFVFRDTH----IST 285
EV D G +G++ V + H+C+PN KL+ D I
Sbjct: 183 DSYSFEVTARD-GDTLKGVF-VCGSLPHHCVPNTVVALDEEFNLKLYAAVDLKSGEVIYN 240
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YTN L GT R+ L +T+ CSC RC DPTE GT S++KC ++ D + L
Sbjct: 241 SYTNPLMGTSQRQHQLRLTRRLECSCSRCLDPTECGTNMSSVKC----RECPDGYAL--C 294
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPH 405
L + DW C CSA L A DVH + ++LG LV+ +++ E +L + + + HP+
Sbjct: 295 ELSSGGDWRCSKCSAILPAADVHELLAELG---GALVEAKGELQAFESLLAQYKTLLHPN 351
Query: 406 HYHCYAVKHSLIQ-LYGTQPGYAYTQLSSSLLERKI 440
H+ +K ++ L T Q S LL R++
Sbjct: 352 HFLMLDIKQNIASILRATALMNCMDQPSKKLLARRV 387
>gi|194756636|ref|XP_001960582.1| GF11432 [Drosophila ananassae]
gi|190621880|gb|EDV37404.1| GF11432 [Drosophila ananassae]
Length = 489
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 164/370 (44%), Gaps = 32/370 (8%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A +++ PGD + E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVAGMNIEPGDTLLEERPLLVTPHWECGQLKCAQCFQ------ESYVICRRCQVYP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE-- 180
C CS D K C+ +L N I L L+L P +
Sbjct: 70 LC-MDCSQHDDFECDFFASGAGKGLCKDILVKN----YGICGLLKLLLLLEDPATRPDCA 124
Query: 181 -LKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWICGV 235
L + ++ G +++E + V + S +E L L D A C+
Sbjct: 125 TLVETPVNLDDYRDGDGMWQEHEEQVVRPLMASGLVEVLSSQQLTCDSLHAHCIRIDSNA 184
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIST------MYTN 289
EV D G +G++ + HNC+PN + Q + ++ + T YTN
Sbjct: 185 FEVTARD-GDTLKGVFVCGASLPHNCVPNTVVALDEQFNMKLYAAVPLQTGDIIYNSYTN 243
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
L GT R+ L +++ +CSC RC DPTE+GT+ S+MKC + C ++P
Sbjct: 244 PLMGTSQRQHQLRLSRRLDCSCSRCLDPTEMGTHMSSMKCRECPQGYAVC---EMDPDGR 300
Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
DW C C A ++A +VH + +++G LV+ +++ E +LT+ ++M HP+H+
Sbjct: 301 PGDWRCLECKASMSAANVHELLAEVG---GALVEARGDLQVYESLLTQYKSMVHPNHFLL 357
Query: 410 YAVKHSLIQL 419
+K ++ +
Sbjct: 358 LDIKQNIASI 367
>gi|312384987|gb|EFR29584.1| hypothetical protein AND_01294 [Anopheles darlingi]
Length = 366
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/339 (28%), Positives = 153/339 (45%), Gaps = 36/339 (10%)
Query: 104 KPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK----Y 159
KPL N C RC WP C C D H EC + + + Y Y
Sbjct: 5 KPLEEN--KYLECERCGWPVCKRSCQ---DQRGHRAECELTVARGSKVSIQHFYVPHPLY 59
Query: 160 EAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
+ +LPLRCL+L P +++ L +++H + G+E + V L F K
Sbjct: 60 QCLLPLRCLLLAESDPARWKALMRLQSH-DEEWRGSEQWRTDRRGVAQLIPRFF-KCDNR 117
Query: 220 VLDDTSAKCLHWICGVIEVNGVDIGRY---TQGLYSVICLMEHNCLPNAKHSNMMQSKLF 276
+D + + G+I+VNG ++ + +YSV ++EH+C PN S Q ++
Sbjct: 118 WQEDDILQAI----GIIQVNGHEVPLTEPASVAIYSVASMLEHSCRPNLAKSFSDQGEII 173
Query: 277 VF------RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
++ T +S YT+ LW T RR HL TK F+C+C+RC DPTE GT+FSA++C
Sbjct: 174 LWAPVAIAEGTRLSISYTDVLWTTAHRRAHLLQTKLFDCACDRCLDPTEFGTFFSALRCS 233
Query: 331 N-EHKDQGD----CWILPVN-PLDNDSDWTCGSCSARLNARDVHLVTSQLGE--QVDKLV 382
+ DQ ++LP + + D DW CG C + + V +Q+ E ++D
Sbjct: 234 GFRNSDQSSQMCKGYLLPCDISVQGDDDWVCGRC----HGKQPGTVVAQILERARIDIQA 289
Query: 383 QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYG 421
+ + L P+H + K +L Q G
Sbjct: 290 MQKQREDHCVKYLRHYAKWLTPNHQYMVEAKMALSQAIG 328
>gi|195335119|ref|XP_002034223.1| GM21750 [Drosophila sechellia]
gi|194126193|gb|EDW48236.1| GM21750 [Drosophila sechellia]
Length = 488
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 168/378 (44%), Gaps = 49/378 (12%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + PGD I E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVASIGIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69
Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
C D + H EC G C+ +L N I L L+L +P
Sbjct: 70 LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN----FGICGLLKLLLLLENPS 118
Query: 177 KY---QELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHW 231
Q L D+ ++S G +++E + +V + S + LP L S+ LH
Sbjct: 119 TKADCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQGL---SSDALHA 175
Query: 232 ICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT------ 281
C I+ N ++ G +G++ + H+C+PN + Q + ++
Sbjct: 176 HCIRIDSNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGD 235
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
I YTN L GT R+ L +++ C+C RC DPTE+GT+ S++KC K+ G +
Sbjct: 236 IIYNSYTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKC----KECGGFSV 291
Query: 342 LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
++ DW C C A L A +VH + +++G LV +++ E +LT+ +
Sbjct: 292 CEIDSNGKLGDWRCPDCRALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPL 348
Query: 402 FHPHHYHCYAVKHSLIQL 419
HP+H+ +K ++ +
Sbjct: 349 LHPNHFMLLDIKQNIASI 366
>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
Length = 397
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 160/361 (44%), Gaps = 53/361 (14%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
+ + + SPL GR L AT D+ G+ + + PL+ GP+ + + GC N +
Sbjct: 28 VKWAVYESPLGGRGLVATQDIKAGEVLFVDYPLVYGPRSGII--VQRGCTVCKNIDSDIF 85
Query: 114 ARCPRCFWPACSARCSGLSDAH-------THAPECAILKLGCETLLAYNDYKYEAILPLR 166
+C +C CS +C SD H +H P ++ +TLL+ A+ +R
Sbjct: 86 FKCSKCALILCSVQCQN-SDFHSGDCSIISHWPNKVPIEEVDDTLLS------RALTAIR 138
Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
L+L +KY + + G EI + +Y D+
Sbjct: 139 ALLLNE--DQKYLLTSLQANKLPQYG------SEIRDLKQYF----------DIPLHEEE 180
Query: 227 KCLHWICGVIEVNGVDIGR-------YTQGLYSVICLMEHNCLPNAKH------SNMMQS 273
+ IC ++ N + +GLY V ++ HNC+PN ++ +++
Sbjct: 181 FMILVIC-ILNTNAFQMATPYGKKEMSLRGLYPVASILNHNCVPNTRNCFNGDLQMTVKA 239
Query: 274 KLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEH 333
+ + I T Y+ LWGT RR +L +K+F C CERC+DPTE GT +A+KC +
Sbjct: 240 TKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHFLCDCERCADPTERGTLLAALKCFST- 298
Query: 334 KDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE 393
+ QG +LP+ PL + W C C R+ ++ ++ S LG + L + NV LE
Sbjct: 299 ECQGS--LLPIQPLKTTTAWRCLECGMRVPNDNICVIQSALGSLMGSL--DLKNVDELEN 354
Query: 394 M 394
Sbjct: 355 F 355
>gi|225717680|gb|ACO14686.1| msta, isoform A [Caligus clemensi]
Length = 521
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 119/493 (24%), Positives = 204/493 (41%), Gaps = 65/493 (13%)
Query: 15 CAVCRETALHKCSAC---KEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQAT 71
C +C + + C + V +C K+H++ H C PY + S +GR+L A+
Sbjct: 14 CFICHQPSSDSCRKSNHDETVYFCSKEHERLHSGDKTSSC--FPYRVTKSHQVGRFLVAS 71
Query: 72 LDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCS-- 129
+ G+ I +E P IVGP ++ PIC+ C+K + P + CP+C +P C C
Sbjct: 72 RKILRGEIIFQELPGIVGPNPVVSSPICVACYKQVGP----RSSCPKCGFPLCEESCERD 127
Query: 130 -GLSDAHTHAPECAIL-KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAH 187
G H H + + C++ + + AILPL+ L+ + +P Y L + H
Sbjct: 128 HGFICEHLHRFGIDLTPEEMCDSSILH------AILPLKLLLSKESNPHVYNCLSILMDH 181
Query: 188 MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT 247
R E E I + + L+S LE+ D K L G++EVN ++ T
Sbjct: 182 REDRLNDPEYLEGIQTPI--LKS--LERSLSTYSTDELLKAL----GIVEVNSYEVYNLT 233
Query: 248 -----QGLYSVICLMEHNCLPNAK--------HSNMMQSKLFVFRDTHISTMYTNALWGT 294
+GL+ + L+ H+C+P + + M S + +S Y +
Sbjct: 234 GHSGFRGLFCLTSLLSHDCVPKRRPIIGHETPYGLTMISTRNINEGEILSINYVHTQKPN 293
Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT 354
+ RR L YF CSC+RC D E G + A+ C C + P + +
Sbjct: 294 RIRRRTLKDNWYFECSCKRCEDEAEFGLHPDAIWC-------DSCQEGLIKP-SHKGIYC 345
Query: 355 CGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKH 414
C CS + + + L Q ++ ++ +V E+ + + +HY + +
Sbjct: 346 CDKCSREKSQGCIEDLIESLESQKAEIDRQ--DVNQYLELYRRSSVLLGENHYFLNSPRR 403
Query: 415 SLIQLYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKPLN---PNLADNA 471
+I L+ +P + +L V SP++ +C G + LN P L+ N
Sbjct: 404 WIIPLF-CRPLTSSPRL-----------VPSPDMLRDKIQLCEGYLRLLNKIEPGLSKNR 451
Query: 472 RCPRCFWPACSAR 484
W CS +
Sbjct: 452 GKVLYEWAECSLK 464
>gi|194882337|ref|XP_001975268.1| GG20654 [Drosophila erecta]
gi|190658455|gb|EDV55668.1| GG20654 [Drosophila erecta]
Length = 488
Score = 108 bits (270), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 167/372 (44%), Gaps = 37/372 (9%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + PG+ + E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVASISIEPGETVLEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKY---Q 179
C CS + K C+ +L N I L L+L +P Q
Sbjct: 70 LC-MDCSQHDEFECEFFASGAGKALCKDILVKN----FGICGLLKLLLLLENPNTQADCQ 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
L D+ ++S G +++E + +V + S ++ LP L ++ LH C I+
Sbjct: 125 MLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLVDVLPTQQL---TSDVLHAHCIRID 181
Query: 238 VNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTMY 287
N ++ G +G++ + H+C+PN + Q + ++ I Y
Sbjct: 182 SNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNSY 241
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPL 347
TN L GT R+ L +++ CSC RC DPTE+GT+ S++KC K+ G + ++
Sbjct: 242 TNPLMGTSQRQHQLRLSRRLECSCSRCLDPTEMGTHMSSIKC----KECGGFSVCEIDLN 297
Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHY 407
DW C C A L A +VH + +++G LV +++ E +LT+ + HP+H+
Sbjct: 298 GKLGDWRCPDCRALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNHF 354
Query: 408 HCYAVKHSLIQL 419
+K ++ +
Sbjct: 355 MLLDIKQNIASI 366
>gi|195584108|ref|XP_002081857.1| GD11241 [Drosophila simulans]
gi|194193866|gb|EDX07442.1| GD11241 [Drosophila simulans]
Length = 488
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 98/373 (26%), Positives = 165/373 (44%), Gaps = 39/373 (10%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + PGD I E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVASIGIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69
Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
C D + H EC G C+ +L N + +L L L+ +
Sbjct: 70 LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN-FGICGLLKLLLLLENPSTKA 121
Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLP--GDVLDDTSAKCLHWI 232
Q L D+ ++S G +++E + +V + S + LP G D A C+
Sbjct: 122 DCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQGLTSDALHAHCIRID 181
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTM 286
EV D G +G++ + H+C+PN + Q + ++ I
Sbjct: 182 SNSFEVTAKD-GDTLKGVFIWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNS 240
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YTN L GT R+ L +++ C+C RC DPTE+GT+ S++KC K+ G + ++
Sbjct: 241 YTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKC----KECGGFSVCEIDS 296
Query: 347 LDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHH 406
DW C C A + A +VH + +++G LV +++ E +LT+ + HP+H
Sbjct: 297 NGKLGDWRCPDCRALITAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNH 353
Query: 407 YHCYAVKHSLIQL 419
+ +K ++ +
Sbjct: 354 FMLLDIKQNIASI 366
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 143/328 (43%), Gaps = 47/328 (14%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK--PLNP 108
KK + + +S + GR + AT D+ G+ I RE L+VGP + C+ CH+ P
Sbjct: 378 KKPNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRLLPTQG 437
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L RC P C+A CSG + TH EC + D + E + PL
Sbjct: 438 FLCKQ----RCTLPVCAA-CSG---SVTHRAECEHFR---RWQPKDADAEEEQVNPLSLR 486
Query: 169 ILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
IL K+ + L D ++RG +EI + R NF D
Sbjct: 487 ILTAVRVFHLGKEQRHLVDAMQANAERG----FRQEIIKAAQCFR-NF------PTTDKP 535
Query: 225 SAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKL 275
L I GV+ N + GR T +GL+ + +M H C PNA H N + +
Sbjct: 536 FMDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAIV 595
Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
RD I+T YT LWG R L +TK+F C C RC+D TE GTY SA+ C
Sbjct: 596 RAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFC-- 653
Query: 332 EHKDQG-DCWILPVNPLDNDSDWTCGSC 358
++QG ++PV DW C SC
Sbjct: 654 --REQGCRGLVIPVQTRTLQPDWRCLSC 679
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 108/477 (22%), Positives = 170/477 (35%), Gaps = 93/477 (19%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPN 109
K+ + + SP+ GR + AT D+ G+ + RE ++VGP + + C C++ +
Sbjct: 49 KEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVPGQ 108
Query: 110 LADNARC-PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---- 164
A C C P C G S H K C+ L + E I P
Sbjct: 109 TDAEALCLAGCGLPVCGE--CGKSPRH---------KTECDLFLKWKPKVTERIDPRSLR 157
Query: 165 ----LRCLIL---QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP 217
+RC L QRR Q D +M + +E+ + L +
Sbjct: 158 ILSVVRCFFLDEQQRRLLYAMQANAD-RYYMKEVERAAACFEDFPREQEMLDFFYRTICA 216
Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
+ S C V G ++ + L+ + L+ H C PNA H + V
Sbjct: 217 FNTNAFESRSC---------VEGHEV--LIRALFPLAGLLNHQCTPNAGHHFQDGETIIV 265
Query: 278 FR------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT-----YFSA 326
I+ Y LW T R+ L +TK F C C RC DPT L + + A
Sbjct: 266 CATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRCQDPTSLTSSGIIGHHRA 325
Query: 327 MKCLNEHKDQGDCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQE 384
+ E + + W N ++ + C + L+ LGE L ++
Sbjct: 326 SSGIAEQRQRTTTW---TNRMNTTATMLAHMAPCRDTTPEQLAQLIDMHLGE----LREK 378
Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQL-SSSLLERKISYV 443
PN + + S + G +A + + L+ R+ + V
Sbjct: 379 KPN----------------------WTLAASTVAGRGV---FATRDIVAGELIFRERALV 413
Query: 444 MSPNLKATDEPICLGCHK--PLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPEC 498
+ P + C+ CH+ P L RC P C+A CSG + TH EC
Sbjct: 414 VGPTARKGQLSTCVCCHRLLPTQGFLCKQ----RCTLPVCAA-CSG---SVTHRAEC 462
>gi|24654325|ref|NP_611181.1| CG9640, isoform A [Drosophila melanogaster]
gi|442623987|ref|NP_001261039.1| CG9640, isoform B [Drosophila melanogaster]
gi|7302840|gb|AAF57914.1| CG9640, isoform A [Drosophila melanogaster]
gi|54650638|gb|AAV36898.1| RE22408p [Drosophila melanogaster]
gi|440214465|gb|AGB93571.1| CG9640, isoform B [Drosophila melanogaster]
Length = 488
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 165/376 (43%), Gaps = 45/376 (11%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + PGD I E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVASIAIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69
Query: 123 ACSARCSGLSDAHTHAP-ECAILKLG-----CETLLAYNDYKYEAILPLRCLILQRRSPK 176
C D + H EC G C+ +L N + +L L L+ R+
Sbjct: 70 LCM-------DCNQHDEFECEFFTSGAGKALCKDILVKN-FGICGLLKLLLLLENPRTKG 121
Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWI 232
Q L D+ ++S G +++E + +V + S + LP L D A C+
Sbjct: 122 DCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQELTSDALHAHCIRID 181
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTM 286
EV D G +G++ + H+C+PN + Q + ++ I
Sbjct: 182 SNSFEVTAKD-GDTLKGIFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNS 240
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNP 346
YTN L GT R+ L +++ C C RC DPTE+GT+ S++KC +C V
Sbjct: 241 YTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKC-------KECPGFSVCE 293
Query: 347 LDND---SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
+D++ DW C C A L A +VH + + +G LV +++ E +LT+ + H
Sbjct: 294 IDSNGKLGDWRCPDCRALLTAAEVHELQAAVG---SALVDAMGDLQVYEALLTQYGPLLH 350
Query: 404 PHHYHCYAVKHSLIQL 419
P+H+ +K ++ +
Sbjct: 351 PNHFMLLDIKQNIASI 366
>gi|170067842|ref|XP_001868639.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863902|gb|EDS27285.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 504
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 165/385 (42%), Gaps = 74/385 (19%)
Query: 32 VAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK 91
VA+ G+ KL+ P+ P++I GR L AT D+ + I + P++VGP+
Sbjct: 29 VAFLGE-------KLYSPE---QPWDIAVFEGYGRGLVATRDIAVNELIFLDRPILVGPR 78
Query: 92 LALAEPI-CLGCHKPLNPNL---ADNARCPRCFWPACSARCSGLSDAHT-HAPECAIL-- 144
+ + I C C + + L + R P C A C + T H EC ++
Sbjct: 79 VNNYDVIFCASCCR-IQKRLTLCSGGCRLPIC------ADCDRSAAPETPHTAECTVITS 131
Query: 145 ------KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVY 198
K C+T+L A+ +R ++L + + +ME H P ++
Sbjct: 132 WQPKDQKRYCKTILY-------ALTSIRAMLLNDLESRI---VFNMEGH----APRKDMI 177
Query: 199 EEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ---------- 248
EID + L+ LP + T L I V+ N + R Q
Sbjct: 178 TEID---RLLKDQIFANLPEGSANLTF---LRRIVNVLNTNAFETFRIVQDEENNDHEII 231
Query: 249 --GLYSVICLMEHNCLPNAKHSNMMQSKLF------VFRDTHISTMYTNALWGTQPRREH 300
GLY + LM H+CL N ++ ++ + + I Y+ LWGTQ R H
Sbjct: 232 LRGLYILGALMNHHCLANVRYVFDENQVMWCHASRPIRKGEQIFNNYSKVLWGTQHRIIH 291
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSA 360
L +K+F C CERC D TELGTY A+KC+ E G ++ +NPL S W C C
Sbjct: 292 LWFSKHFLCECERCRDVTELGTYLGALKCVRERCPNGR--LVSLNPLKIASAWQCDKCGL 349
Query: 361 RLNARDVHLVTSQLGEQVDKLVQEN 385
++ +L +++ E ++V N
Sbjct: 350 QMG----NLKVTKIQEIAGRMVLNN 370
>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 478
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 39/326 (11%)
Query: 60 SSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-----PICLGCHKPLNPNLADNA 114
S+ + GR + AT D GD I + PLIV P+ + P+C C+ +
Sbjct: 33 SATVAGRGVVATRDYSVGDVIFVDVPLIVSPRAMGMDGTGSWPVCAVCYAVMMTPTG--- 89
Query: 115 RCPR-CFWPACSARCSGLSDAHTHAPECA-ILKLGCETLLAYNDYK---YEAILPLRCLI 169
CP C P C +C+ D H EC + +L +T + Y AI +R L
Sbjct: 90 -CPGGCRLPVCGRQCA---DRPEHRDECRYVRQLRPKTKSDGQTWSVGIYNAITAVRGLS 145
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL--PGDVLDDTSAK 227
++ KY+ D+ + K+ +E V+ L+ N +L + + T+ K
Sbjct: 146 IR---DGKYRYFLDV---LQKKSTDKLTFE-----VEELKRNISMELDKEDEQIMVTTCK 194
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS------NMMQSKLFVFRDT 281
+ C VN + +GLY V M H C+PN H+ + ++ L +++
Sbjct: 195 IMDANCFETIVNRSNKLTSLKGLYPVAAFMNHCCVPNTMHNFNEKLQMIAKASLPIYKGQ 254
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWI 341
I+T YT ++W T R+ HL +K F C+C RC D E GT +A+ C+ ++ D I
Sbjct: 255 EITTSYTYSIWPTSLRQNHLLTSKQFICTCSRCCDTEEFGTKLAALNCIVKNCDGR---I 311
Query: 342 LPVNPLDNDSDWTCGSCSARLNARDV 367
LP+NPLD S W C C+ +++ ++
Sbjct: 312 LPINPLDKKSIWQCKMCTKLVSSTEM 337
>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
Length = 387
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 156/351 (44%), Gaps = 47/351 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
P+EIK S L G + AT + G+ I ++ P+I+GP+ A P+ C+ C+ + L
Sbjct: 27 PWEIKKSELGGFGVFATRAIEVGELIFKDFPVILGPRAAPTCPLSCVSCYSRRDLRLCGK 86
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL---PLRCLIL 170
+C CS +C + H EC +++ ++ D + + P+R L+L
Sbjct: 87 ----KCGLLVCSEKCE---KSLGHQKECKVVRQWQSKPIS-EDLSVQLVRVLSPIRSLLL 138
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+ LK AH S + G EV D++ ++ + K +
Sbjct: 139 GEEDKSVVKCLK---AHKSDQH-GFEVDVMKDALALNVKED-------------EEKFMR 181
Query: 231 WICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFV 277
++C V++ N ++ +GLY + L H+C PN K ++++ F+
Sbjct: 182 FVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCCPNTCHVFDDKQHMVVRASKFI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+ + I Y+ +W T RR HL TK+F C C+RC DPTE G+Y ++ C K
Sbjct: 242 PQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCEDPTEFGSYIGSILC----KVCK 297
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNV 388
++P N L D W C C + + +V +T LG ++ ++ V
Sbjct: 298 TGKVIPTNSLQTDK-WQCEVCGSLIKKEEVANITFVLGSALNSFDDDDVAV 347
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 143/328 (43%), Gaps = 47/328 (14%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHK--PLNP 108
KK + + +S + GR + AT D+ G+ I RE L+VGP + C+ CH+ P
Sbjct: 435 KKPNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRLLPTQG 494
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L RC P C+A CSG + TH EC + D + E + PL
Sbjct: 495 FLCKQ----RCTLPVCAA-CSG---SVTHRAECEHFR---RWQPKDADAEEEQVNPLSLR 543
Query: 169 ILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
IL K+ + L D ++RG +EI + R NF D
Sbjct: 544 ILTAVRVFHLGKEQRHLVDAMQANAERG----FRQEIIKAAQCFR-NF------PTTDKP 592
Query: 225 SAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKL 275
L I GV+ N + GR T +GL+ + +M H C PNA H N + +
Sbjct: 593 FMDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAIV 652
Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
RD I+T YT LWG R L +TK+F C C RC+D TE GTY SA+ C
Sbjct: 653 RAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFC-- 710
Query: 332 EHKDQG-DCWILPVNPLDNDSDWTCGSC 358
++QG ++PV DW C SC
Sbjct: 711 --REQGCRGLVIPVQTRTLQPDWRCLSC 736
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 106/287 (36%), Gaps = 43/287 (14%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPN 109
K+ + + SP+ GR + AT D+ G+ + RE ++VGP + + C C++ +
Sbjct: 49 KEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVPGQ 108
Query: 110 LADNARC-PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---- 164
A C C P C G S H K C+ L + E I P
Sbjct: 109 TDAEALCLAGCGLPVCGE--CGKSPRH---------KTECDLFLKWKPKVTERIDPRSLR 157
Query: 165 ----LRCLIL---QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP 217
+RC L QRR Q D +M + +E+ + L +
Sbjct: 158 ILSVVRCFFLDEQQRRLLYAMQANAD-RYYMKEVERAAACFEDFPREQEMLDFFYRTICA 216
Query: 218 GDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
+ S C V G ++ + L+ + L+ H C PNA H + V
Sbjct: 217 FNTNAFESRSC---------VEGHEV--LIRALFPLAGLLNHQCTPNAGHHFQDGETIIV 265
Query: 278 FR------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ Y LW T R+ L +TK F C C RC DPT
Sbjct: 266 CATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRCQDPT 312
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/325 (30%), Positives = 140/325 (43%), Gaps = 49/325 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ IKSS + GR + AT D+ G+ I RE L+ GP + C+ CH+ L P
Sbjct: 40 WTIKSSKVAGRGVFATRDIAAGELIFRERALVTGPTARKGQLSSCVCCHQML-PQSGFLC 98
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDYKYEAILPLRCLI 169
R RC P C A CS DA H EC K E L N + +R
Sbjct: 99 R-HRCTLPVC-ADCS---DAEIHRAECEHFRRWQPKDADEELEQVNPLSLRILTAVRVFH 153
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN--FLEKLPGDVLDDTSAK 227
L K+ + L D ++RG E+ + + ++ F+++L
Sbjct: 154 LS----KEQRHLVDAMQANAQRGYRQEIIKAAQCFRNFPTTDREFMDQL----------- 198
Query: 228 CLHWICGVIEVNGVDI----GRYT---QGLYSVICLMEHNCLPNAKH--SNMMQSKLFVF 278
I GV+ N + G + +GL+ + +M H C PNA H N + +
Sbjct: 199 --FRIVGVLNTNAFEAPCRSGGHEVLLRGLFPLTAIMNHECTPNASHYFDNGTLAVVRAA 256
Query: 279 RDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
RD I+T YT LWG R L +TK FNC C RC+D TE GTY SA+ C +
Sbjct: 257 RDIPKGGEITTTYTKILWGNLTRGIFLKMTKNFNCDCVRCNDNTENGTYLSALFC----R 312
Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
+QG ++PV DW C SC
Sbjct: 313 EQGCRGLVIPVQTRTLQPDWKCLSC 337
>gi|91088051|ref|XP_966825.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012083|gb|EFA08531.1| hypothetical protein TcasGA2_TC006184 [Tribolium castaneum]
Length = 331
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)
Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
+D +H +CG++++N +++ G + LY + + HNC+ N +
Sbjct: 1 MDQFCDNLIHTVCGILDLNAIEVRASSGYLIRCLYPKLSQISHNCVTNLFQTVTPEDFKI 60
Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+++ + + + + Y LW T RR+ L K +C C+RC D TEL T+ S +KC
Sbjct: 61 TLKASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 120
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
D G +L +PL + DW C C + NA V V + +++ ++V
Sbjct: 121 --SKCDNG--ILLSSDPLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 176
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E + K ++FHP + + ++ L QLYG PGY +L L E K+
Sbjct: 177 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 229
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 141/339 (41%), Gaps = 57/339 (16%)
Query: 52 KKLP-YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNP 108
++LP + + SP+ GR + AT D+ G+ I RE L+VGP + C+ CH+ L
Sbjct: 39 EELPNWTLAPSPVAGRGVFATRDIAAGELIFRERALVVGPTARKGSQLNTCVCCHRLLG- 97
Query: 109 NLADNARCP-RCFWPACSARCSGLSDAHTH-------------APECAILKLGCETLLAY 154
A CP RC P C C+ S+AH + + EC L E +
Sbjct: 98 --AKQFLCPHRCTLPVC-GECAD-SEAHHNECAHFQRWQPKDLSEECKELSPTGEPVDVV 153
Query: 155 NDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE 214
N + +R L K+ + L D ++RG E+ + K+
Sbjct: 154 NPLSLRILTAVRVFYLS----KEQRALVDAMQANAERGYRQEIIKAAQCFRKF------- 202
Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKH 267
D L + GV+ N + +GL+ + +M H C PNA H
Sbjct: 203 ----PTTDKPFMDQLFRVVGVLNTNAFEAPCRVDGHESLLRGLFPLTAIMNHECTPNASH 258
Query: 268 SNMMQSKLFVFRDTH-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
+L V R I+T YT LW R L +TKYF C+C+RC+D +E
Sbjct: 259 Y-FENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSEN 317
Query: 321 GTYFSAMKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSC 358
GTY SA+ C ++QG ++PV DW C SC
Sbjct: 318 GTYLSALFC----REQGCKGLVIPVQTKTLQPDWRCLSC 352
>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 421
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 119/257 (46%), Gaps = 30/257 (11%)
Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-------GRYT---QGLYSVI 254
+ L+ N + KLP + + + CGV N ++ YT +GLY +
Sbjct: 98 IDLLKRN-VAKLPSE----EQMELMRRTCGVFNTNSFEVVVVPSRDNDYTTSLRGLYPMG 152
Query: 255 CLMEHNCLPNAKHSNMMQSKLFV------FRDTHISTMYTNALWGTQPRREHLAITKYFN 308
L H C+PN +H Q +L+V I+ YT+ LW T RR+ L +TK F+
Sbjct: 153 ALQNHCCVPNTRHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLKVTKRFS 212
Query: 309 CSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV- 367
C+C RCSDP E G+ S + C KD +LP NPL+++S W C C +N R V
Sbjct: 213 CNCNRCSDPLEFGSQLSTLLC---AKDNCSGCLLPRNPLNHESSWICDKCQISVNYRQVI 269
Query: 368 HLVTSQLGEQVDKLVQENPN--VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
+ S L V + + P ++ E +L++L +Y +K+ +I +G P
Sbjct: 270 ECIHSGLNTFVSDAMYKTPREILRFTEAILSRL---VPATNYIMMDMKYRIISYFGRVPD 326
Query: 426 YAYTQLSSSLLERKISY 442
+ L+ + L K Y
Sbjct: 327 LKWENLTDTELRIKKMY 343
>gi|157117568|ref|XP_001658830.1| hypothetical protein AaeL_AAEL008032 [Aedes aegypti]
gi|108876001|gb|EAT40226.1| AAEL008032-PA [Aedes aegypti]
Length = 527
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 150/353 (42%), Gaps = 56/353 (15%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-CLGCHKPLNPNLADN 113
P+++ GR + AT D+ + I + P++VGP++ + I C C + L L
Sbjct: 66 PWDLGVFEGSGRGMVATRDIAENELIFMDRPILVGPRVNNYDVIFCASCCR-LQKKLTLC 124
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILK--------LGCETLLAYNDYKYEAILPL 165
+ C P CS + +H EC + C+T+L A+ +
Sbjct: 125 S--GGCRLPVCSECDISAAPETSHTAECRRINSWEPKDQNRYCKTILY-------ALTSI 175
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTS 225
R +L S + + +M P ++ E+D I L+ KLP D D
Sbjct: 176 RGFLLDDLS-------RSIVLNMQAHPPRKDMTTEVDRI---LKDQVFNKLPEDGSD--- 222
Query: 226 AKCLHWICGVIEVNGVDIGRYTQ------------GLYSVICLMEHNCLPNAKHSNMMQS 273
+ L + V+ N + R + GLY + LM H C PN ++ +
Sbjct: 223 VQYLRKVIDVLNTNAFETFRMVKDEDNNDHEIVLRGLYILGALMNHCCRPNIRY--VFDE 280
Query: 274 KLF--------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
KL + + I Y+ +LWGTQ R HL +K+F C C+RC DPTE+GTY
Sbjct: 281 KLVMHCYASKPIRKGEQIFNNYSKSLWGTQHRIIHLCFSKHFLCGCDRCLDPTEMGTYLG 340
Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
A+KC+ + QG ++ +NP S W C +C ++N + + G +
Sbjct: 341 ALKCVRDSCPQGR--MVSMNPQKISSAWQCDTCGLKMNNLKITRIQEIAGRMI 391
>gi|195488851|ref|XP_002092488.1| GE11639 [Drosophila yakuba]
gi|194178589|gb|EDW92200.1| GE11639 [Drosophila yakuba]
Length = 488
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 168/392 (42%), Gaps = 34/392 (8%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + PGD I E PL+V P + C C + C RC +P
Sbjct: 16 LGRHLVASISIEPGDTILEERPLLVAPHWECHQLKCAQCLQ------ESYVICRRCQVFP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKY---Q 179
C CS + K C+ +L N I L L+L +P Q
Sbjct: 70 LC-MDCSQHDEFECEFFASGAGKALCKDILVKN----FGICGLLKLLLLLENPNTQADCQ 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKCLHWICGV 235
L + ++ G +++E + +V + ++ LP L D A C+
Sbjct: 125 MLIGVPINLGDYRDGEGMWQEHEELVVRPLMECGLVDCLPTQQLTSDILHAHCIRIDSNS 184
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ISTMYTN 289
EV D G +G++ + H+C+PN + Q + ++ I YTN
Sbjct: 185 FEVTAKD-GDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDIIYNSYTN 243
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDN 349
L GT R+ L +++ C+C RC DPTE+GT+ S++KC K+ G + ++
Sbjct: 244 PLMGTSQRQHQLRLSRRLECTCTRCLDPTEMGTHMSSLKC----KECGGFSVCEIDSNGK 299
Query: 350 DSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHC 409
DW C C A L A +VH + +++G LV +++ E +LT+ + HP+H+
Sbjct: 300 LGDWRCPDCKALLTAAEVHELQAEVG---SALVDAMGDLQVYEALLTQYGPLLHPNHFML 356
Query: 410 YAVKHSLIQ-LYGTQPGYAYTQLSSSLLERKI 440
+K ++ L A Q LL R++
Sbjct: 357 LDIKQNIASILRAAALMTAMDQPCKKLLARRV 388
>gi|195056335|ref|XP_001995066.1| GH22836 [Drosophila grimshawi]
gi|193899272|gb|EDV98138.1| GH22836 [Drosophila grimshawi]
Length = 489
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 167/384 (43%), Gaps = 61/384 (15%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A+L + GD + E PL+V P + C C + C +C +P
Sbjct: 16 LGRHLVASLSIEAGDTLLEERPLLVTPHWECGQLKCAQCLQE------SYVMCRKCQVYP 69
Query: 123 ACSARCSGLSDAHTH-APECAILKLGC-----ETLLAYN----------DYKYEAILPLR 166
C D H A EC G + LL N + L +
Sbjct: 70 LCM-------DCSEHDAFECEFFASGAGCRIGKDLLVKNYSICGLLKLLLLLEQPELRAQ 122
Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVK-YLRSNFLEKLPGDVL--DD 223
C +L R P K + ++ EA M + E + +V+ + S E LPG L D
Sbjct: 123 CQLLVSR-PIKLDDYRN-EAGMWQ--------EHEEHVVRPLMASGLAEALPGQQLSSDV 172
Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
A C+ EV G D G +G+Y + H+C+PN + Q + ++ +
Sbjct: 173 LHAHCIRIDSNSFEVTGRD-GEQLKGVYVCGAGLPHHCVPNTVVALDDQFNMKLYAAVPL 231
Query: 284 ST------MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
+T YTN L GT R+ L ++++ C+C RC DPTELGT+ S+MKC ++
Sbjct: 232 ATGDIIYASYTNPLMGTSQRQHQLRLSRHMECACTRCLDPTELGTHMSSMKC----RECT 287
Query: 338 DCWILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEML 395
D + +P L DW C C + A DVH + +++G +V+ + + E +L
Sbjct: 288 DGYAVP--DLSGTGDWHCLGVDCEGVIPAADVHELLAEVG---GAMVEAKGELNAYEALL 342
Query: 396 TKLEAMFHPHHYHCYAVKHSLIQL 419
+ + + HP+H+ +K ++ +
Sbjct: 343 AQHKPLLHPNHFLLLDIKQNIASI 366
>gi|125809191|ref|XP_001361025.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
gi|54636198|gb|EAL25601.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
Length = 501
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 172/396 (43%), Gaps = 74/396 (18%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+K + LGR++ A DL G+ + E P++V P + + RC
Sbjct: 13 ELKQNETLGRFVVARCDLSAGETLLLEQPIVVLPHIG-------------------DRRC 53
Query: 117 PRCFWPACS--ARCSGLS---DAHTH-APECAILK---LGCETLLAYNDYKYEAILP-LR 166
+CF + +C LS D H +C L+ L E + + + P L
Sbjct: 54 SKCFKLTATFCGKCRLLSLCEDCAGHDERDCQRLRELQLSDEQVQILQEKANTEVQPALA 113
Query: 167 CLIL--QRRSPKKYQELKDMEAHMSKRGPGTEVYE----------EIDSIVKYLRSNFLE 214
CL+L + S ++E+ ME H+ R GTE++ E +VK L S
Sbjct: 114 CLLLGEHQHSRPLFEEMSQMETHLEAR-RGTEIWNRYQAHAYAPLEATGVVKQLHSA--- 169
Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSN 269
G+ +D+ + + G+++VN +I G +GLY L H+C+PN + +
Sbjct: 170 --SGNAVDEI---LVQRLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAI 224
Query: 270 MMQSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
+ ++ VF + I+ YT+ L G + RR L K F+CSC RC DPTELGT+
Sbjct: 225 DDERRIKVFANRQITAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTH 284
Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV---HLVTSQLGEQVDK 380
S+ C G +I V L+ + W C LN V++ L ++
Sbjct: 285 MSSFICSGCACSGG--YI--VRQLETGT-WQC-----LLNPEHTLKPEFVSNMLERAKEE 334
Query: 381 LVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
+ ++ LE +L K+ + H +HY +K ++
Sbjct: 335 IFHARGDIYRLELLLVKMSRLLHGNHYLMLDLKQNI 370
>gi|312380816|gb|EFR26711.1| hypothetical protein AND_07035 [Anopheles darlingi]
Length = 368
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 91/183 (49%), Gaps = 35/183 (19%)
Query: 83 ESPLIVGPKLALAE-------PICLGCHKPLNPNLAD----NARCPRCFWPACSARCSGL 131
E+PL++GPK L E C+GC AD RC C WPAC C GL
Sbjct: 15 ENPLVIGPKWNLDEYEQRSTIVPCVGC-------FADCHLGQFRCEHCHWPACKPDCPGL 67
Query: 132 SDAHTHAPECAILKLGC--------ETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
++A+ H EC IL+ G ET Y Y+Y+A+L L+CL +Q RSP ++ ++ +
Sbjct: 68 TNANLHGIECGILRFGRAPKPTDDPETFFDY--YRYDALLVLKCLAIQIRSPTQFDQMMN 125
Query: 184 MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE-------KLPGDVLDDTSAKCLHWICGVI 236
+E+H +R + D +V YL NFL+ K VL K LH I G++
Sbjct: 126 LESHYEERKKTHYYRDTDDRVVSYLLRNFLDPLQKLEKKQDKQVLKVCDRKALHKIAGIL 185
Query: 237 EVN 239
EVN
Sbjct: 186 EVN 188
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 53/76 (69%)
Query: 366 DVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPG 425
+V+ + S++GE+VD+L+ +VK E+++ KL+ HP+H+H ++HSLIQ+YG G
Sbjct: 186 EVNFLLSKIGEEVDELMGRKSSVKEFEDLIQKLQNFLHPNHFHLLTLQHSLIQMYGHFNG 245
Query: 426 YAYTQLSSSLLERKIS 441
Y LS +LL+RK+S
Sbjct: 246 YQLQVLSDALLQRKLS 261
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 33/69 (47%), Gaps = 8/69 (11%)
Query: 472 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGC--------ETLLAYNDYKSLSRAYLD 523
RC C WPAC C GL++A+ H EC IL+ G ET Y Y +L
Sbjct: 51 RCEHCHWPACKPDCPGLTNANLHGIECGILRFGRAPKPTDDPETFFDYYRYDALLVLKCL 110
Query: 524 HQQVRSWTQ 532
Q+RS TQ
Sbjct: 111 AIQIRSPTQ 119
>gi|195124463|ref|XP_002006712.1| GI21214 [Drosophila mojavensis]
gi|193911780|gb|EDW10647.1| GI21214 [Drosophila mojavensis]
Length = 489
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 166/382 (43%), Gaps = 57/382 (14%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A+L + PG+ + E PL+V P C C + C RC +P
Sbjct: 16 LGRHLVASLSIEPGETLLEEQPLLVAPHWECDRLKCAQCLQE------SYVICRRCQVYP 69
Query: 123 ACSARCSGLSDAHTHAPECAILKLGCETLLAYN----DYKYEAILPL-----------RC 167
C CS AH A EC G + + + +Y A+L L +C
Sbjct: 70 LC-MDCS----AHD-AFECEFFASGAGSGIGKDLLIKNYGICALLKLLLLLEQPDQRAQC 123
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVL--DDTS 225
IL R K +E + ++ G E E++ + + S + LP L D
Sbjct: 124 QILLERPVK-------LEDYRNEDGVWQEHEEQV--VRPIMESGLAQVLPDQQLTSDALH 174
Query: 226 AKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF------R 279
A C+ EV D G +G+Y + H+C+PN + Q L ++
Sbjct: 175 AHCIRIDTSAYEVTARD-GDTLKGVYVRGANLPHSCVPNTVVALDEQFNLKLYAAVPLES 233
Query: 280 DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC 339
I T Y N L GT R+ L +T++ C+C RC DPTELGT+ S+MKC +D D
Sbjct: 234 GQIIFTSYANPLMGTSQRQHQLRMTRHMECACVRCLDPTELGTHMSSMKC----RDCSDG 289
Query: 340 WILPVNPLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTK 397
+ V L +W C SC A + A DVH + +++G LV + + E +L +
Sbjct: 290 Y--AVCDLGGTGNWRCLDVSCGAVIPAADVHELLAEVG---GALVDAKGELSAYESLLAQ 344
Query: 398 LEAMFHPHHYHCYAVKHSLIQL 419
+ + HP+H+ +K ++ +
Sbjct: 345 NKQLLHPNHFLLLDIKQNIASI 366
>gi|195441983|ref|XP_002068740.1| GK17864 [Drosophila willistoni]
gi|194164825|gb|EDW79726.1| GK17864 [Drosophila willistoni]
Length = 504
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 168/387 (43%), Gaps = 53/387 (13%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
++K + LGR++ A ++ G+ + E P+++ P L E C C NL N C
Sbjct: 13 QVKHNDTLGRFVVAVDNIRAGETLLVEEPIVILPHLG--ERRCARC-----LNLTTNF-C 64
Query: 117 PRC-FWPACSA-------RCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RC 167
+C P C C L+D L+L E + D + + + +C
Sbjct: 65 SKCQLLPLCEECKDHEERDCQRLAD----------LQLSEEQVQQLQDSQRTEVQSMVKC 114
Query: 168 LILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEIDSIVK--YLRSNFLEKLPGDVLDD 223
L+L+ + + Y+E+ M+A + R GTEV+ + V S L KL +
Sbjct: 115 LLLREHTETRSLYEEMYQMDAQLDAR-RGTEVWNNYQANVANPLESSGILGKLWQGSDLE 173
Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNA-----KHSNMMQS 273
+ + + + ++N +I G +G+Y L H+C+PN SN Q
Sbjct: 174 KDVELVQRLLAIFDINAFEIRAPESGGSMRGIYRRASLFPHSCMPNLVSAIDDASNGRQL 233
Query: 274 KLFVFRDTHISTM----YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
K++ R + YTN L GTQ RRE L K FNC+C RC DPTELGT+ S+ C
Sbjct: 234 KMYASRFIAAGEILYNCYTNVLLGTQERREILKQGKCFNCNCPRCVDPTELGTHMSSFMC 293
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVK 389
N G + +NP N W C R+ V + L +++ N+
Sbjct: 294 NNCSSTGG---YIVLNPETN--QWQCLLNPEHTLKRE--FVANMLERAKEEVFHARDNIY 346
Query: 390 SLEEMLTKLEAMFHPHHYHCYAVKHSL 416
+E +L KL + H +HY +K ++
Sbjct: 347 RMELLLAKLSRLLHGNHYVMLDLKQNI 373
>gi|194882335|ref|XP_001975267.1| GG20655 [Drosophila erecta]
gi|190658454|gb|EDV55667.1| GG20655 [Drosophila erecta]
Length = 498
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 140/297 (47%), Gaps = 47/297 (15%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EIK + LGR++ A +L G+ + E+P++V P++ + C C K + C
Sbjct: 13 EIKQNDTLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCSKCFK------LTQSFC 64
Query: 117 PRCFWPACSARCSG--------LSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRC 167
+C A CS L++ H A + +L+ ++ I P L+C
Sbjct: 65 RKCHLLALCEDCSNHDERDCRRLAEMHFSADQVELLQ----------RKEHTEIQPVLKC 114
Query: 168 LILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDD 223
L+L+ Y+E+ ME+ ++ R GT++++ + L++L G +D
Sbjct: 115 LLLREHEETLPLYEEMSQMESQLATR-RGTDIWKNYQEHAFAPLDAGGVLKQLRGSADED 173
Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
+ + G++++N +I G +GLY L H+C+PN S + ++ V+
Sbjct: 174 L----VQGLLGILDINAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDEKRIKVY 229
Query: 279 RDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+ I+ YTN L GT+ RR+ L K F+C+C RC DPTELGT+ S+ C
Sbjct: 230 ANRFIAAGEILYNCYTNVLLGTEERRQILKEGKCFDCTCARCQDPTELGTHMSSFMC 286
>gi|195154477|ref|XP_002018148.1| GL17552 [Drosophila persimilis]
gi|194113944|gb|EDW35987.1| GL17552 [Drosophila persimilis]
Length = 501
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/396 (26%), Positives = 172/396 (43%), Gaps = 74/396 (18%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+K + LGR++ A DL G+ + E ++V P + + RC
Sbjct: 13 ELKQNETLGRFVVARCDLSAGETLLLEQSIVVLPHIG-------------------DRRC 53
Query: 117 PRCFWPACS--ARCSGLS---DAHTH-APECAILK---LGCETLLAYNDYKYEAILP-LR 166
+CF + +C LS D H +C L+ L E + + + P L
Sbjct: 54 SKCFKLTATFCGKCRLLSLCEDCAGHDERDCQRLRELQLSDEQVQILQEKANTEVQPALA 113
Query: 167 CLIL--QRRSPKKYQELKDMEAHMSKRGPGTEVYE----------EIDSIVKYLRSNFLE 214
CL+L + S ++E+ ME H+ R GTE+++ E +VK L S
Sbjct: 114 CLLLGEHQHSRPLFEEMSQMETHLEAR-RGTEIWKRYQAHAYAPLEATGVVKQLHSA--- 169
Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSN 269
G+ +D+ + + G+++VN +I G +GLY L H+C+PN + +
Sbjct: 170 --SGNAVDEI---LVQRLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAI 224
Query: 270 MMQSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY 323
+ ++ VF + I+ YT+ L G + RR L K F+CSC RC DPTELGT+
Sbjct: 225 DDERRIKVFANRQIAAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTH 284
Query: 324 FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDV---HLVTSQLGEQVDK 380
S+ C G +I V L+ + W C LN V++ L ++
Sbjct: 285 MSSFICSGCACSGG--YI--VRQLETGT-WQC-----LLNPEHTLKPEFVSNMLERAKEE 334
Query: 381 LVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
+ ++ LE +L K+ + H +HY +K ++
Sbjct: 335 IFHARGDIYRLELLLVKMSRLLHGNHYLMLDLKQNI 370
>gi|195584110|ref|XP_002081858.1| GD11242 [Drosophila simulans]
gi|194193867|gb|EDX07443.1| GD11242 [Drosophila simulans]
Length = 498
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 138/289 (47%), Gaps = 31/289 (10%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EIK + +LGR++ A +L G+ + E+P++V P++ + C C NL ++ C
Sbjct: 13 EIKHNDMLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCTKCF-----NLTESF-C 64
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
+C A CS + A + L + + ++ I P L+CL+L+
Sbjct: 65 RKCRLLALCEDCSDHDE--RDCKRLAEMNLSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122
Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
Y+E+ ME+ + R GT++++ + L +L G +D +
Sbjct: 123 TLPLYEEMSQMESQLMTR-RGTDIWKNYQEHAFTPLNSGGVLAQLRGAADEDL----VQG 177
Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
+ G++++N +I G +GLY L H+C+PN S + ++ V+ + I+
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYANRFIAAG 237
Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
YTN L GT+ RR+ L K F+CSC RC DPTELGT+ S+ C
Sbjct: 238 EILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ I SS + GR + AT D+ G+ I +E L+ GP + C+ CH+ L P
Sbjct: 57 WTISSSTVAGRGVFATRDISAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
R RC P C SD+ H EC + + D + E + P+ IL
Sbjct: 116 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 167
Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K+ + L D ++R E+ + + D L
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216
Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
I GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+T YT LWG R L +TK+F C C RC D TE GTY SA+ C ++Q
Sbjct: 276 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 331
Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
G ++PV DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ I SS + GR + AT D+ G+ I +E L+ GP + C+ CH+ L P
Sbjct: 57 WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
R RC P C SD+ H EC + + D + E + P+ IL
Sbjct: 116 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 167
Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K+ + L D ++R E+ + + D L
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216
Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
I GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+T YT LWG R L +TK+F C C RC D TE GTY SA+ C ++Q
Sbjct: 276 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 331
Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
G ++PV DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 132/322 (40%), Gaps = 43/322 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ I SS + GR + AT D+ G+ I +E L+ GP + C+ CH+ L P
Sbjct: 90 WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 148
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
R RC P C SD+ H EC + + D + E + P+ IL
Sbjct: 149 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 200
Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K+ + L D ++R E+ + + D L
Sbjct: 201 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 249
Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT 281
I GV+ N + G T +GL+ + +M H C PNA H N + + RD
Sbjct: 250 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHYFENGRLAVVRAARDI 309
Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
I+T YT LWG R L +TK+F C C RC D TE GTY SA+ C ++QG
Sbjct: 310 PKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQG 365
Query: 338 -DCWILPVNPLDNDSDWTCGSC 358
++PV DW C +C
Sbjct: 366 CRGLVIPVQTRTLQPDWRCITC 387
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 131/323 (40%), Gaps = 45/323 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ I SS + GR + AT D+ G+ I +E L+ GP + C+ CH+ L P
Sbjct: 586 WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 644
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
R RC P C SD+ H EC + + D + E + P+ IL
Sbjct: 645 R-HRCTLPVCET----CSDSEEHQAECEHFRRWQPKDV---DAEQEQVNPMSLRILTAVR 696
Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K+ + L D ++R E+ + + D L
Sbjct: 697 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 745
Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
I GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 746 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 804
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+T YT LWG R L +TK+F C C RC D TE GTY SA+ C ++Q
Sbjct: 805 IPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCHDNTENGTYLSALFC----REQ 860
Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
G ++PV DW C +C
Sbjct: 861 GCRGLVIPVQTRTLQPDWRCITC 883
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/321 (24%), Positives = 124/321 (38%), Gaps = 58/321 (18%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL--ALAEPICLGCHKPLNPNLADN 113
+ + SP+ GR + AT ++ G+ + RE L+VGP ++ C C++ + +
Sbjct: 44 WRVADSPISGRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSA 103
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
A CP C P CS D+ H EC + + + + + I P
Sbjct: 104 ALCPAGCGLPVCSE----CRDSTRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+RC L S K L M+A+M R EV D + R D+LD
Sbjct: 153 VRCFFLDEASRKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201
Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
+ E V+G ++ + L+ + L+ H C PNA H + V
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259
Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----------- 323
I+ Y LW T R+ L +TK+F C C RC DPT + +
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPTFMASITNRKQTSVKGA 319
Query: 324 --FSAMKCLNEHKDQGDCWIL 342
FS + E D G C ++
Sbjct: 320 GGFSHSEVFGEQIDSGKCSLM 340
>gi|195335121|ref|XP_002034224.1| GM21751 [Drosophila sechellia]
gi|194126194|gb|EDW48237.1| GM21751 [Drosophila sechellia]
Length = 498
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 137/289 (47%), Gaps = 31/289 (10%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EIK + +LGR++ A +L G+ + E+P++V P++ + C C NL ++ C
Sbjct: 13 EIKHNDMLGRFVVALCNLRAGETLLLENPIVVLPQMG--DRRCSKCF-----NLTESF-C 64
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
+C A CS A + + + ++ I P L+CL+L+
Sbjct: 65 RKCRLLALCEDCS--YHDERDCKRLAEMNFSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122
Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
Y+E+ ME+ + R GT++++ + L +L G +D +
Sbjct: 123 TLPLYEEMSQMESQLMTR-RGTDIWKNYQEHAFTPLNSGGVLAQLRGAADEDL----VQG 177
Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
+ G++++N +I G +GLYS L H+C+PN S + ++ V+ + I+
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYSRAGLFAHSCMPNLVISIDDERRIKVYANRFIAAG 237
Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
YTN L GT+ RR+ L K F+CSC RC DPTELGT+ S+ C
Sbjct: 238 EILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286
>gi|321474757|gb|EFX85721.1| hypothetical protein DAPPUDRAFT_98275 [Daphnia pulex]
Length = 222
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 4/116 (3%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVCR A C C E++YC K+HQK+HW HK +CK PY+I GR + A+ ++
Sbjct: 16 CAVCRSKASQICGGCGEISYCSKEHQKQHWATHKSQCK--PYKIVFDQKYGRIMVASKNI 73
Query: 75 HPGDRIARESPLIVGPKLALAEPICLGCHKPL-NPNLADNARCPRCFWPACSARCS 129
PG+ I RE + GPK P CL C+ L N + A RCP C +P C +C+
Sbjct: 74 KPGEIIFREKAIFTGPKQGCL-PCCLACYTSLENVDEASLFRCPGCNFPFCQEKCA 128
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 130/323 (40%), Gaps = 45/323 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAE-PICLGCHKPLNPNLADNA 114
+ I SS + GR + AT D+ G+ I +E L+ GP + C+ CH+ L P
Sbjct: 57 WTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETL-PQTGFLC 115
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
R RC P C +D+ H EC + + D + E + PL IL
Sbjct: 116 R-HRCTLPVCET----CADSEEHQAECEHFRRWQPKDV---DAEQEQVNPLSLRILTAVR 167
Query: 175 ----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K+ + L D ++R E+ + + D L
Sbjct: 168 VFHLGKEQRHLVDAMQANAERAYRREIIQAAQCFRNF-----------PTTDRVFMDQLF 216
Query: 231 WICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH- 282
I GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 217 RIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAARD 275
Query: 283 ------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
I+T YT LW R L +TK+F C C RC D TE GTY SA+ C ++Q
Sbjct: 276 IPKGGEITTTYTKILWSNLTRNIFLKMTKHFACDCARCHDNTENGTYLSALFC----REQ 331
Query: 337 G-DCWILPVNPLDNDSDWTCGSC 358
G ++PV DW C +C
Sbjct: 332 GCRGLVIPVQTRTLQPDWRCITC 354
>gi|24654327|ref|NP_611182.1| CG9642 [Drosophila melanogaster]
gi|7302839|gb|AAF57913.1| CG9642 [Drosophila melanogaster]
Length = 498
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 134/289 (46%), Gaps = 31/289 (10%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EIK + LGR+ A ++ G+ + E+P++V P + E C C NL ++ C
Sbjct: 13 EIKQNDTLGRFAVALCNVRAGETLLLENPIVVLP--LMGERRCSKCF-----NLTESF-C 64
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRSP 175
+C A CS + A + + + ++ I P L+CL+L+
Sbjct: 65 RKCRLLALCEDCSDHDE--RDCKRLAEMNFSDDQVELLQKKEHTEIQPVLKCLLLREHEE 122
Query: 176 KK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
Y+E+ M++ + R GTEV++ + L +L G +D +
Sbjct: 123 TLPLYEEMSQMDSQLMTR-RGTEVWKNYQEHAFTPLDYGGVLAQLRGAADEDL----VQG 177
Query: 232 ICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTM 286
+ G++++N +I G +GLY L H+C PN S + ++ V+ + I+
Sbjct: 178 LLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCTPNLVISIDDEQRIKVYANRFIAAG 237
Query: 287 ------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
YTN L GT+ RR+ L + K F+CSC RC DPTELGT+ S+ C
Sbjct: 238 EILYNCYTNVLLGTEERRKILKVGKCFDCSCPRCQDPTELGTHMSSFIC 286
>gi|195488849|ref|XP_002092487.1| GE11640 [Drosophila yakuba]
gi|194178588|gb|EDW92199.1| GE11640 [Drosophila yakuba]
Length = 497
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 137/296 (46%), Gaps = 45/296 (15%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EIK + LGR++ A +L G+ + E+P++V P++ + C C K + C
Sbjct: 13 EIKQNDTLGRFVVALCNLRAGETLLLENPIVVLPQIG--DRRCSKCFK------LTQSFC 64
Query: 117 PRCFWPACSARCSG--------LSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
+C A CS L++ + + +L+ T E L+CL
Sbjct: 65 RKCRLLALCEDCSDHDGRDCRRLAEMNLSNDQVELLQRKEHT---------EIQSALKCL 115
Query: 169 ILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID--SIVKYLRSNFLEKLPGDVLDDT 224
+L+ Y+E+ ME+ ++ R GT++++ + L++L G +D
Sbjct: 116 LLREHEETLPLYEEMSQMESQLASR-RGTDIWKSYQEHAFAPLDSGGVLKQLRGSADEDL 174
Query: 225 SAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR 279
+ + G+++VN +I G +GLY L H+C+PN S + ++ V+
Sbjct: 175 ----VQGLLGILDVNAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYA 230
Query: 280 DTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+ I+ YTN L GT+ R++ L K F+C+C RC DPTELGT+ S+ C
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERQQILKEGKCFDCTCPRCKDPTELGTHMSSFMC 286
>gi|194756634|ref|XP_001960581.1| GF11433 [Drosophila ananassae]
gi|190621879|gb|EDV37403.1| GF11433 [Drosophila ananassae]
Length = 497
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 137/300 (45%), Gaps = 49/300 (16%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPK---------LALAEPICLGCHKPLN 107
++K + LGR++ A ++ G+ + E P++V P L E C CH
Sbjct: 13 QVKQNETLGRFVIALGNIRAGETLLLEEPILVLPHAGDRRCSKCFKLTEEFCRKCHLL-- 70
Query: 108 PNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKY-EAILPLR 166
A C C C LSD LKL E +L + K+ E L+
Sbjct: 71 ------ALCGDCSEHD-ERDCKRLSD----------LKLSEEQVLLLQEKKHSEVQSALK 113
Query: 167 CLILQRRSPKK--YQELKDMEAHMSKRGPGTEVYE--EIDSIVKYLRSNFLEKLPGDVLD 222
C++L+ + Y+E+ ME ++ R GT++++ + + +S L++L + +
Sbjct: 114 CILLREHLETQPLYEEISQMENQLTAR-RGTDIWKNNQEHAYNPLEKSGVLKQLNSEANE 172
Query: 223 DTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
D + + G+++VN +I G +GLY L H+C+PN + + ++ V
Sbjct: 173 DL----VQGLLGMLDVNAFEIRAPESGGSMRGLYRRAGLYPHSCMPNLVTAIDDERRIKV 228
Query: 278 FRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
+ + I+ YTN L GT RR L K F+C+C RC DPTELGT+ S+ C N
Sbjct: 229 YANRFIAAGEILYNCYTNILLGTDERRHILKAGKCFDCTCSRCQDPTELGTHMSSFVCSN 288
>gi|270012082|gb|EFA08530.1| hypothetical protein TcasGA2_TC006183 [Tribolium castaneum]
Length = 331
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)
Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
+D +H +CG++++N +++ G T+ LY + + HNC+ N +
Sbjct: 1 MDQFCDDLIHTVCGILDLNAIEVRAPSGYLTRCLYPKLSQISHNCVTNLFQTVTPEDFKI 60
Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+++ + + + + Y LW T RR+ L K +C C+RC D TEL T+ S +KC
Sbjct: 61 TLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 120
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
D G +L + L + DW C C + NA V V + +++ ++V
Sbjct: 121 --SKCDNG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 176
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E + K ++FHP + + ++ L QLYG PGY +L L E K+
Sbjct: 177 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 229
>gi|189239865|ref|XP_966657.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 356
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 111/233 (47%), Gaps = 17/233 (7%)
Query: 221 LDDTSAKCLHWICGVIEVNGVDI----GRYTQGLYSVICLMEHNCLPNAKHSNM------ 270
+D +H +CG++++N +++ G T+ LY + + HNC+ N +
Sbjct: 26 MDQFCDDLIHTVCGILDLNAIEVRAPSGYLTRCLYPKLSQISHNCVTNLFQTVTPEDFKI 85
Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+++ + + + + Y LW T RR+ L K +C C+RC D TEL T+ S +KC
Sbjct: 86 TLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 145
Query: 330 LNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVD--KLVQENPN 387
D G +L + L + DW C C + NA V V + +++ ++V
Sbjct: 146 --SKCDNG--ILLSSDSLSDSCDWNCTHCEFKTNASSVKKVYRIVQSEIEAIQMVSGAEG 201
Query: 388 VKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGYAYTQLSSSLLERKI 440
++ E + K ++FHP + + ++ L QLYG PGY +L L E K+
Sbjct: 202 IEQREAIFRKYRSVFHPKNAYMTILRVDLTQLYGRAPGYTIHELPDLLQEHKV 254
>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
Length = 448
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 134/325 (41%), Gaps = 50/325 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKL--ALAEPICLGCHKPLNPNLADN 113
+ + SP+ GR + AT ++ G+ + RE L+VGP ++ C C++ + +
Sbjct: 44 WRVADSPISGRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIPGSTDSA 103
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
A CP C P CS D+ H EC + + + + + I P
Sbjct: 104 ALCPAGCGLPVCSE----CRDSTRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+RC L S K L M+A+M R EV D + R D+LD
Sbjct: 153 VRCFFLDEASRKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201
Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
+ E V+G ++ + L+ + L+ H C PNA H + V
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259
Query: 281 ------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
I+ Y LW T R+ L +TK+F C C RC DPTE GTY SA+ C +
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPTENGTYLSALFC----R 315
Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
+QG ++PV DW C +C
Sbjct: 316 EQGCRGLVIPVQTRTLQPDWRCITC 340
>gi|21430220|gb|AAM50788.1| LD23906p [Drosophila melanogaster]
Length = 417
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 119/260 (45%), Gaps = 24/260 (9%)
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVL--DDTSAKC 228
R+ Q L D+ ++S G +++E + +V + S + LP L D A C
Sbjct: 47 RTKGDCQMLIDVPINLSDYRDGEGMWQEHEELVVRPLMESGLADVLPTQELTSDALHAHC 106
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH------ 282
+ EV D G +G++ + H+C+PN + Q + ++
Sbjct: 107 IRIDSNSFEVTAKD-GDTLKGIFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDI 165
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
I YTN L GT R+ L +++ C C RC DPTE+GT+ S++KC +C
Sbjct: 166 IYNSYTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKC-------KECPGF 218
Query: 343 PVNPLDND---SDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLE 399
V +D++ DW C C A L A +VH + + +G LV +++ E +LT+
Sbjct: 219 SVCEIDSNGKLGDWRCPDCRALLTAAEVHELQAAVGS---ALVDAMGDLQVYEALLTQYG 275
Query: 400 AMFHPHHYHCYAVKHSLIQL 419
+ HP+H+ +K ++ +
Sbjct: 276 PLLHPNHFMLLDIKQNIASI 295
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 136/333 (40%), Gaps = 58/333 (17%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + +S + GR + AT D+ G+ I RE L+VGP + C+ CH L A
Sbjct: 354 WTLAASVVAGRGVFATRDIARGELIFRERALVVGPTARKGAQLNTCVCCHCLLP---AQG 410
Query: 114 ARCP-RCFWPACSARCSGLSDAHTHAPECAILKL---------GCETLLAYNDYKYEAIL 163
C RC P C SD+ TH EC + G E A E I
Sbjct: 411 FLCAHRCTLPVCGP----CSDSETHREECQHFQRWQPIDAVPEGAEPSGA------EQIN 460
Query: 164 PLRCLILQRRS----PKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGD 219
PL IL ++ + L D ++RG +EI + R NF
Sbjct: 461 PLSLRILTAVRVFHLGQQQRSLVDAMQANAERG----YRQEIIKAAQCFR-NF------P 509
Query: 220 VLDDTSAKCLHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKH--SNM 270
D L I GV+ N + G T +GL+ + +M H C PNA H N
Sbjct: 510 TTDKPFMDQLFRIVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHYFDNG 569
Query: 271 MQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
+ + RD I+T YT LW R L +TK F C C RC+D TE GTY SA
Sbjct: 570 RMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKNFMCDCPRCNDNTENGTYLSA 629
Query: 327 MKCLNEHKDQG-DCWILPVNPLDNDSDWTCGSC 358
+ C ++QG ++PV DW C SC
Sbjct: 630 LFC----REQGCKGLVIPVQTKTMQPDWRCLSC 658
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 117/287 (40%), Gaps = 50/287 (17%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHK-PLNPNLAD 112
+++ SP+ GR + A D+ G+ + RE L+VGP + + C C+ P +
Sbjct: 37 WQVADSPISGRGIFAIRDIEQGEELFREHTLLVGPTAHRSSNLRTCTICYNLPNGCDTDA 96
Query: 113 NARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP------- 164
A CP C P C + H EC + + + E I P
Sbjct: 97 QALCPSGCGLPVCPS----CRPTERHQLECKLFR-------KWKPKDTERIEPRALRILS 145
Query: 165 -LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLP--GDVL 221
+RC L + K+ Q L M+A+ + R EV D E P D+L
Sbjct: 146 VVRCFFL---TEKQRQLLYAMQAN-ADRYYMKEVQRAADC---------FEHFPHEQDML 192
Query: 222 DDTSAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMM 271
D + E V G +I + L+ + L+ H+C PNA H + ++
Sbjct: 193 DYFYRTICAFNTNAFESRSRVGGNEI--LVRALFPLAGLLNHHCTPNAAHHFEDGETIVV 250
Query: 272 QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
+ + + T I+ Y LW T R+ LA+TK+F C C RC DPT
Sbjct: 251 SATERIPKGTEITMTYAKLLWSTLARKLFLAMTKHFICQCPRCQDPT 297
>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
Length = 1033
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 97/196 (49%), Gaps = 15/196 (7%)
Query: 233 CGVIEVNGVDIGRYTQ---GLYSVICLMEHNCLPN-AKHSN-----MMQSKLFVFRDTHI 283
G +++NG ++ ++ E++CLPN AK N ++ + + ++ H+
Sbjct: 730 VGALQINGHEVPTTDPPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHL 789
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
S Y++A+WGT R+ HL TK F C+CERC D TEL T +SA+KC + Q +LP
Sbjct: 790 SICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTELDTNYSAIKCEDR---QCGGLMLP 846
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
D + W C C ++ V + + G+ + + + N ++ + L E
Sbjct: 847 TKADDWNGSWRCRECHKQVQKHYVERILERAGKDIQSMEK---NAENGLKYLKHYEKWLP 903
Query: 404 PHHYHCYAVKHSLIQL 419
P H+H +K L+QL
Sbjct: 904 PQHFHMSEIKILLVQL 919
>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
Length = 495
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 139/292 (47%), Gaps = 37/292 (12%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+K + LGRY+ A ++ G+ + E P+++ + C CH+ ++ C
Sbjct: 13 EVKQNETLGRYVVAATNIKAGETVLLEQPVLLLANNG--DRRCCNCHQ------LTSSFC 64
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK--YEAILPLRCLILQRR 173
+C P C A C H P+C L + E L+ L+L+
Sbjct: 65 GKCRLMPLC-ADC-----VEHHGPDCRRLAELQLQEQQVEQLQAHTEVCSALKYLLLREH 118
Query: 174 SPKK--YQELKDMEAHMSKRGPGTEVYEEIDS-IVKYLRSN--FLEKLPGDVLDDTSAKC 228
+ K+ Y++L +EAH+++R T+++ + +++ L+++ L+ G +D+ +
Sbjct: 119 AEKRSHYEQLLQLEAHLARR-RDTDIWRSYRAEVIEPLQASGLLLQLRNGAEIDE---EL 174
Query: 229 LHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI 283
L + + ++NG +I G +G+Y L H+CLPN + Q ++ ++ + I
Sbjct: 175 LQRLLAIADINGFEIRAPESGGAMRGVYMTAALFAHSCLPNLVTAVDEQRRIKLYANRII 234
Query: 284 STM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+ YT+ L GT RR L K F+C C RC+DPTELGT+ S+ C
Sbjct: 235 AAGEILYNCYTDILLGTDERRRILKTGKCFDCQCARCTDPTELGTHMSSFVC 286
>gi|195381153|ref|XP_002049319.1| GJ20817 [Drosophila virilis]
gi|194144116|gb|EDW60512.1| GJ20817 [Drosophila virilis]
Length = 489
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 163/376 (43%), Gaps = 45/376 (11%)
Query: 64 LGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC-FWP 122
LGR+L A++ + G+ + E PL+V P + C C + C +C +P
Sbjct: 16 LGRHLVASMSIEAGETLLDERPLLVAPHWECGQLKCAQCLQE------SYVMCRKCQVYP 69
Query: 123 ACSARCSGLSDAHTH-APECAILKLGCETLLAYN----DYKYEAILPLRCLILQRRSPKK 177
C D H A EC G + ++ + +Y A+L L L+ Q
Sbjct: 70 LCM-------DCSEHDAFECEFFASGAGSRISKDLLVKNYGICALLKLLLLLEQPDQRAL 122
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIV--KYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
Q L D + +++E D V + S ++ P L +A+ LH C
Sbjct: 123 CQTLLDTPVKLEDYRNEDGLWQERDEQVVRPLMSSGLVQAFPDQQL---TAEVLHAHCIR 179
Query: 236 IEVNGVDI----GRYTQGLYSVICLMEHNCLPNA-----KHSNM-MQSKLFVFRDTHIST 285
I+ + ++ G +G+Y + H+C+PN + NM + + + + I T
Sbjct: 180 IDNSAYEVTARDGDTLKGVYVRGASLPHSCVPNTVVALDEQFNMKLYAAVPLESGDIIYT 239
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
YTN L GT R+ L ++++ C+C RC D TEL T+ S+MKC C
Sbjct: 240 SYTNPLMGTSQRQHQLRLSRHKECACTRCLDATELNTHMSSMKCRGCPDGYAVCQ----- 294
Query: 346 PLDNDSDWTC--GSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
L DW C SC A + A +VH + +++G LV+ + + E +L + + + H
Sbjct: 295 -LSGSGDWRCLNASCGAVIPATEVHELLAEVG---SALVEARGELSAYETLLDQHKQLLH 350
Query: 404 PHHYHCYAVKHSLIQL 419
P+H+ +K ++ +
Sbjct: 351 PNHFLLLDIKQNIASI 366
>gi|195056333|ref|XP_001995065.1| GH22837 [Drosophila grimshawi]
gi|193899271|gb|EDV98137.1| GH22837 [Drosophila grimshawi]
Length = 495
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 135/291 (46%), Gaps = 35/291 (12%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+K + +GRY+ A+ ++ G+ + E P+++ + C C+K N+ C
Sbjct: 13 EVKENETMGRYVIASTNIREGETVLCEQPILLLSNNT--DRRCSSCYK------LTNSFC 64
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL--PLRCLILQ-- 171
+C P C C H A +C L + +A + L L+L+
Sbjct: 65 GKCRLLPLC-GEC-----VHHDARDCKRLAELQLEEEQVEKLQAQAEVNNALTYLVLREH 118
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSN-FLEKLPGDVLDDTSAKCL 229
+ + +Y++L ME+H++ R TE++ + +V L+S L++L + + L
Sbjct: 119 KETQPQYEQLLQMESHLALR-RNTEIWRTYREQVVLPLQSTGLLQQLRNGA--EIDEELL 175
Query: 230 HWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS 284
+ G +VNG +I G +GLY L H+C PN + + ++ V+ + +I
Sbjct: 176 QHLLGTTDVNGFEIRAPESGGTLRGLYIRAALFPHSCTPNLVTAIDEERRIKVYANRYIP 235
Query: 285 TM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
YTN L GT+ RR L K F+C C RC DPTELGT+ S+ C
Sbjct: 236 AGEILYNCYTNILLGTEERRLILKQGKCFDCQCARCQDPTELGTHMSSFVC 286
>gi|195124465|ref|XP_002006713.1| GI21216 [Drosophila mojavensis]
gi|193911781|gb|EDW10648.1| GI21216 [Drosophila mojavensis]
Length = 491
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 138/297 (46%), Gaps = 47/297 (15%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+K + +GRYL A +++ G+ + E PL++ A + C CH+ + C
Sbjct: 9 ELKENETMGRYLVAAVNIKAGETLLLEQPLLLLANNA--DRRCCNCHQ------LTSTFC 60
Query: 117 PRC-FWPACSA-------RCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
+C P C+ C L+ H + L+ E N K+ L
Sbjct: 61 GKCRLMPLCAECVDHHGLDCLRLAKLQLHVEQVQQLQANAEI---ANVLKF--------L 109
Query: 169 ILQRRSPKK--YQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSN--FLEKLPGDVLDD 223
+L+ + Y+EL MEAH+++R TE++ +++ L+S L+ G +D+
Sbjct: 110 LLREHAEMHSWYEELLQMEAHLARR-RDTEIWRTYQKQVIEPLQSAGLLLQLRNGSEIDE 168
Query: 224 TSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF 278
L + G+ ++NG +I G +G+Y L H+C+PN + Q ++ ++
Sbjct: 169 ---DLLQRMLGIADINGFEIRAPESGSSMRGVYLRAGLFAHSCVPNVVAAIDEQHRIKLY 225
Query: 279 RDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
+ I+ YTN + GT RR+ L I K F+C C RC DPTELGT+ S+ C
Sbjct: 226 ANRSIAAGEILYNCYTNIMLGTDERRQILKIGKCFDCQCVRCLDPTELGTHMSSFVC 282
>gi|321472908|gb|EFX83877.1| hypothetical protein DAPPUDRAFT_239494 [Daphnia pulex]
Length = 291
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/202 (29%), Positives = 96/202 (47%), Gaps = 29/202 (14%)
Query: 260 NCLPNAKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
N + +AK+ +++ + + R Y AL G + R+ L +K F C C RCSDPTE
Sbjct: 2 NHVIDAKYQMTVRASVPIMRGD-----YALALEGLRERQSLLRQSKLFECDCSRCSDPTE 56
Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNPLD-NDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
TY SA +C + ++LP+ PLD +++W CS +L A V+ V +L E+
Sbjct: 57 CSTYLSAQRC----QKCPTGFVLPIRPLDEKETEWKGNLCSHKLTAVVVNRVVDKLKEEF 112
Query: 379 DKL-----------------VQENPNVKS--LEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ + V P + + E L + ++ HP+H+ + + SL QL
Sbjct: 113 ETIGPNEVEKYFEILYLIIKVAIKPFLGTGRFEGFLKRHASLVHPNHFLFTSARQSLSQL 172
Query: 420 YGTQPGYAYTQLSSSLLERKIS 441
YG Y L+ LERK++
Sbjct: 173 YGRDEKYLVNTLTMEQLERKVA 194
>gi|195454533|ref|XP_002074283.1| GK18438 [Drosophila willistoni]
gi|194170368|gb|EDW85269.1| GK18438 [Drosophila willistoni]
Length = 232
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 110/230 (47%), Gaps = 23/230 (10%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + + E P VGPK + +CLGC+ P N CP
Sbjct: 15 VEWSPIYGRYLVAQQNTSGHSLLIEELPFAVGPK-SQGGVVCLGCYSPELENC-----CP 68
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY----KYEAILPLRCLILQRR 173
+C WP C CS + D EC I K + + + I+PLR L+ + +
Sbjct: 69 QCGWPLCE-ECSKIEDNVHKQMECRIFKEAKARFYRIANGGQCPQLDCIMPLRVLLAKEQ 127
Query: 174 SPKKYQ-ELKDMEAHMSKRGPGTEVYE-EIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
P++++ ++ ME H ++R ++++ + +I +YLR P + S + +
Sbjct: 128 QPERWEKDVAPMEHHEAERRENSDIWHADRVNIAQYLRG------PCKLASRFSEELIMQ 181
Query: 232 ICGVIEVNGVD----IGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFV 277
+ G++EVN + +G + LY ++ HNC+PN ++ Q +L V
Sbjct: 182 VVGILEVNAFEARTTVGYPLRCLYPYTGILAHNCVPNTARIDISQRELQV 231
>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
Length = 600
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 70/145 (48%), Gaps = 20/145 (13%)
Query: 229 LHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
L + GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 353 LFRVVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 411
Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
I+T YT LW R L +TKYF C+C+RC+D +E GTY SA+ C +
Sbjct: 412 RDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLSALFC----R 467
Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
+QG ++PV DW C SC
Sbjct: 468 EQGCKGLVIPVQTKTLQPDWRCLSC 492
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/279 (23%), Positives = 114/279 (40%), Gaps = 32/279 (11%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + SP+ GR + A D+ G+ + RE L+VGP + C C++ P D+
Sbjct: 28 WRVADSPISGRGIFAVRDIAQGELLFRERTLLVGPTAHRGRNLRTCTQCYR--QPAGEDS 85
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RCLIL-- 170
C C G + HA EC + + + D + IL + RC L
Sbjct: 86 DAAALCTAGCGLPVCGGCVASPRHAVECQLFRKWQPKDPSRIDPRALRILSVVRCFFLNE 145
Query: 171 -QRRSPKKYQELKDMEAHMSKRGPGTEVYEEI---DSIVKYLRSNFLEKLPGDVLDDTSA 226
QR+ Q D +M + E +E +++Y F + + +
Sbjct: 146 TQRKLLYAMQANAD-RYYMREVERAAECFEHFPREQDMLEY----FYRTVCAFNTNAFES 200
Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRD 280
+C +V+G ++ + L+ + ++ H C PNA H + ++ + +
Sbjct: 201 RC--------QVDGREV--VARALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLG 250
Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
I+ Y LW T R+ L +TK+F C C RC DPT+
Sbjct: 251 AEITMSYAKLLWSTLARKMFLGMTKHFMCQCPRCQDPTQ 289
>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
Length = 302
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 69/145 (47%), Gaps = 20/145 (13%)
Query: 229 LHWICGVIEVNGVDI-----GRYT--QGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
L I GV+ N + G T +GL+ + +M H C PNA H +L V R
Sbjct: 58 LFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 116
Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
I+T YT LWG R L +TK+F C C RC+D TE GTY SA+ C +
Sbjct: 117 RDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRCNDNTENGTYLSALFC----R 172
Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
+QG ++PV DW C +C
Sbjct: 173 EQGCRGLVIPVQTRTLQPDWRCITC 197
>gi|307172339|gb|EFN63827.1| Protein msta, isoform A [Camponotus floridanus]
Length = 352
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/215 (24%), Positives = 95/215 (44%), Gaps = 22/215 (10%)
Query: 219 DVLDDTSAKCLHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMM 271
D + SA+ + +CG++++N ++ G +GLY LM H+C N +
Sbjct: 3 DKTEPPSAELIQKLCGILDINTFELRSPSILDGLLLRGLYIEASLMSHDCRGNTYLTMDD 62
Query: 272 QSKLFVFRDTHISTM------YTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
+L V+ I YT++L GT RR+HL KY C C C DP EL ++ S
Sbjct: 63 NFQLTVYASVPIKQNEPILFNYTSSLLGTAERRQHLREGKYLECECSLCKDPYELQSHLS 122
Query: 326 AM---KCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLV 382
++ +C D ++ + P + S W C C + R + + +DK
Sbjct: 123 SVLCPRCKEGFVGMQDTFV--IAPYERASRWQCQMCKKTYSGRLIRATLNICKTLIDKC- 179
Query: 383 QENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLI 417
+ ++++L +L H +H+ ++K L+
Sbjct: 180 ---EDFDGIDDLLKRLSRSLHTNHFLMLSLKQKLL 211
>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
Length = 611
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 20/145 (13%)
Query: 229 LHWICGVIEVNGVDI-------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT 281
L + GV+ N + +GL+ + +M H C PNA H +L V R
Sbjct: 364 LFRVVGVLNTNAFEAPCRVDTHETLLRGLFPLTAIMNHECTPNASHY-FENGRLAVVRAA 422
Query: 282 H-------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHK 334
I+T YT LW R L +TKYF C+C+RC+D +E GTY +A+ C +
Sbjct: 423 RAIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLAALFC----R 478
Query: 335 DQG-DCWILPVNPLDNDSDWTCGSC 358
+QG ++PV DW C SC
Sbjct: 479 EQGCKGLVIPVQTKTLQPDWRCLSC 503
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 114/276 (41%), Gaps = 27/276 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
Y + SP+ GR + A D+ G+ I RE L+VGP + C C+ + +
Sbjct: 39 YRVADSPISGRGIFALRDIAKGELIFREHTLLVGPTAHRGRNLRTCTQCYGQPPGDSDAS 98
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL---RCLI 169
A CP C P C + + H PEC + + + +A+ L RC
Sbjct: 99 ALCPAGCGLPVCG----NCASSTRHTPECQLFRKWQPKEATKAGIEPQALRILSVVRCFF 154
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS-NFLEKLPGDVLDDTSAKC 228
L P++ + L M+A+ + R EV + ++ R LE V A
Sbjct: 155 LN--EPQR-KLLYAMQAN-ADRYYIREVERAAECFEQFPRDPELLEYFYRTV----CAFN 206
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTH 282
+ ++G ++ + L+ + L+ H C PNA H + ++ + +
Sbjct: 207 TNAFESRFHIDGQEV--VARALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAE 264
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ Y LW T R+ L +TK+F C C RC DPT
Sbjct: 265 ITMSYAKLLWSTLARKMFLGMTKHFMCRCPRCQDPT 300
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 121/296 (40%), Gaps = 47/296 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+EI S + GR + T L GD I R+ P+++G + + C C K L P+
Sbjct: 25 WEIAVSKIAGRGVFTTRKLKRGDIILRDIPILIGLAARQEDTLNSCSICFKVL-PDTKFM 83
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
R C P CS + H +CA+ K E + D I+ L C+ R
Sbjct: 84 CR-QGCALPVCSV----CAKKKQHKDDCALFK-SWEPIEP--DVANSVIIRLICIA---R 132
Query: 174 SPKKYQELKDM----EAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
+ +E +D+ +A++ TEV + K P D + +
Sbjct: 133 AINLSKEQRDLIYCLQANLDN-NHRTEVRNA---------AKCFNKFPTD---KKLIEIM 179
Query: 230 HWICGVIEVNGVDIGR---------YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
+ V+ NG D + + L+ + L+ H+C+PN+ ++ ++ V R
Sbjct: 180 NRTVAVLRTNGFDETADRTNDNQEFFYRALFPLFALVNHDCVPNSYYTFEEKTNYMVLRA 239
Query: 280 ------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
T I+T YT G R L + K F C C RCSDPTE G + SA+ C
Sbjct: 240 SVDLPEGTEITTTYTKLFTGNIARHLFLKMKKNFTCKCSRCSDPTEKGAFISAVYC 295
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 72.4 bits (176), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 99/417 (23%), Positives = 163/417 (39%), Gaps = 67/417 (16%)
Query: 53 KLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLAD 112
+LP+ + SS LGR+ A D+ PGD + E+P+ + + +C C++ L P LA
Sbjct: 94 RLPWSLASSARLGRHALAKRDISPGDLVLIEAPVAAVVRSQFVKTVCHTCYREL-PALAP 152
Query: 113 NARCPRCFWPA-----CSARCSGLSD---AHTHAPECAILKLGCET---------LLAYN 155
P A CS C+ L+D A T+ I K+ ET +L +
Sbjct: 153 PGSVPDRAAAAPNKQYCSRACA-LADTLAAVTNPIHAKIDKMAVETQVDPQLLRLILELD 211
Query: 156 DYKYEAILPLR-------CLILQRRSPKKYQ------ELKDMEAHMSKRGPGTEVYEE-I 201
+ + P + +L+ K Q L+DME S E + + +
Sbjct: 212 SKRATSAPPGKPTPTPEGTWVLKFSRTKAGQLEAVHCTLEDMETLTSPWDKSPEAWRKSV 271
Query: 202 DSIVKYLRSNFLEK---LPGDV--LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICL 256
+ + L+ + + PG + L D +A + G+ N + GLY + +
Sbjct: 272 RAGCEALQQAIIAEGAYQPGSLQQLLDMAAVINNNAHGMGAANSTNTD-SALGLYPALSM 330
Query: 257 MEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSC 311
+ H+CLPN A + M + V ++ Y N + + R L TK+F C+C
Sbjct: 331 LNHSCLPNCVFASCGSDMHVRAIRPVAAGEQLTVTYINIMEPRRIRARELMDTKHFACAC 390
Query: 312 ERCSDPTEL--GTYFSAMKCLNEHKDQGDC--WILP--VNPLDNDSDWTCGSCSARLNAR 365
ERC P E + A+KC G C W+L + L W C C+ + +R
Sbjct: 391 ERCVSPLETHPDRFLEAVKC-----GAGGCDGWLLEQRADNLSEGDTWKCTVCATAVPSR 445
Query: 366 DVHLVTS--QLGEQVDKLVQENPNV----------KSLEEMLTKLEAMFHPHHYHCY 410
V T + E + N+ E +L + E + HP H +
Sbjct: 446 SVDGRTGPQDITESFRAAWHQAANILRYKGHAAGRPVFEAVLARAEGVLHPMHVSAF 502
>gi|194913085|ref|XP_001982623.1| GG12637 [Drosophila erecta]
gi|190648299|gb|EDV45592.1| GG12637 [Drosophila erecta]
Length = 309
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 114/284 (40%), Gaps = 45/284 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + SP+ GR + AT ++ PG+ + RE L+VGP + + C C++ + +
Sbjct: 44 WRVADSPISGRGIFATREIAPGEELFREHTLLVGPTAHRSTNLRTCTLCYRLIPGSTDSA 103
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
A CP C P CS D+ H EC + + + + + I P
Sbjct: 104 ALCPAGCGLPVCSE----CRDSSRHDLECKLFR-------KWKPLESQRIEPRALRILSV 152
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+RC L + K L M+A+M R EV D + R D+LD
Sbjct: 153 VRCFFLDEAARKL---LYAMQANMD-RYYMQEVQRAADCFEHFPREQ-------DMLDYF 201
Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
+ E V+G ++ + L+ + L+ H C PNA H + V
Sbjct: 202 YRTICAFNTNAFESRSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 259
Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ Y LW T R+ L +TK+F C C RC DPT
Sbjct: 260 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPT 303
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 97/419 (23%), Positives = 167/419 (39%), Gaps = 68/419 (16%)
Query: 62 PLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADNARCPRC 119
P GR L AT D +PG+ I + P + P + P C GC +N N+ +RC RC
Sbjct: 19 PEKGRSLLATRDFYPGEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVL--SRCSRC 76
Query: 120 ---FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL-------- 168
F+ C C E + +L CE L + + YK +++ P L
Sbjct: 77 QLAFY--CGTAC--------QRSEWKLHRLECEVLSSLHKYKRKSLTPSIRLMLRLYLRR 126
Query: 169 ------ILQRRSPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVL 221
I+ + Y ++ + AHMS V Y ++ ++V N + + PG +
Sbjct: 127 KLQNDKIIPSTAMDNYNLVEALVAHMSDITEEQLVLYAQMANLV-----NSILEWPGINI 181
Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLF 276
+ + + C + ++ GLY VI ++ H+CLPN+ S ++++
Sbjct: 182 KEIAENFSKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNSVLVFEGSSALVRAVQH 241
Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCLN 331
+ T + Y T R++ L F C+C RCS D + KC +
Sbjct: 242 IPSGTEVLISYIETAESTMTRQKALKEQYLFTCTCPRCSKVGQYDDIQESAILEGYKCKS 301
Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDK----LVQE 384
E C + D + C C + ++ +T++ L E K QE
Sbjct: 302 E-----KCGGFLLRTTDGKG-FQCQGCGLIRDKEEIKRITTEIKLLSEDASKPSATYYQE 355
Query: 385 NPNV-KSLEEMLTKLEAMFHPHHYHCYAVKH----SLIQLYGTQPGYAYTQLSSSLLER 438
++ K +E++ T+L FHP + + SL++L AY +L+ +R
Sbjct: 356 AISIYKRIEKLQTEL---FHPLSINLMHTREKILKSLMELEHWTEALAYCKLTIPFYQR 411
>gi|195477726|ref|XP_002100289.1| GE16967 [Drosophila yakuba]
gi|194187813|gb|EDX01397.1| GE16967 [Drosophila yakuba]
Length = 306
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 114/284 (40%), Gaps = 45/284 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + SP+ GR + AT ++ PG+ + RE L+VGP + + C C++ + +
Sbjct: 45 WRVADSPISGRGIFATREIAPGEELFREHTLLVGPTAHRSTNLRTCTLCYRLIPGSTDSA 104
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
A CP C P CS D+ H EC + + + + + I P
Sbjct: 105 ALCPAGCGLPVCSE----CRDSSRHDLECKLFR-------KWKPLESQRIEPRALRILSV 153
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+RC L + K L M+A+M R EV D + R D+LD
Sbjct: 154 VRCFFLDEAARKL---LYAMQANMD-RYYMKEVQRAADCFKHFPREQ-------DMLDYF 202
Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
+ E V+G ++ + L+ + L+ H C PNA H + V
Sbjct: 203 YRTICAFNTNAFEARSNVDGHEV--LVRALFPLAGLLNHQCTPNAAHHFENGETIVVCAT 260
Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ Y LW T R+ L +TK+F C C RC DPT
Sbjct: 261 ERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRCQDPT 304
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 155/390 (39%), Gaps = 63/390 (16%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLN 107
K K +E+ S + GR L AT + G+ I ++ PL++ + + C C + L
Sbjct: 19 KDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLREL- 77
Query: 108 PNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
P+ R C P CS C L + PE A I++L C A N
Sbjct: 78 PDTRFMCRIG-CGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRLLC-VARALN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ +++E ++ V LR+N +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHRV-EVRNAAKCFKNFPTDKKIFEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
D+ E N + LY + +M H+C+PN+ ++ ++ +
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKTNI 234
Query: 276 FVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
+ R I+T YT G R L I + F C C RCSDPTE G Y S +
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLY 294
Query: 329 CLNEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVT 371
C + + + G +C + + D+ G+ +A+ N+ + +
Sbjct: 295 CRDTNCSGLVVPETTGLPHPSYNCLVCKQKSTHAQMMKSQDFASGAINAKANSNSLRTLV 354
Query: 372 SQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
L E+ D + N N ++ L+ L+ +
Sbjct: 355 QYLNEKSDNFI-PNSNYVVIDAKLSVLQRL 383
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 155/390 (39%), Gaps = 63/390 (16%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLN 107
K K +E+ S + GR L AT + G+ I ++ PL++ + + C C + L
Sbjct: 19 KDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLREL- 77
Query: 108 PNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
P+ R C P CS C L + PE A I++L C A N
Sbjct: 78 PDTRFMCRMG-CGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRLLC-VARALN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ +++E ++ V LR+N +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHRV-EVRNAAKCFKNFPTDKKIFEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKL 275
D+ E N + LY + +M H+C+PN+ ++ ++ +
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKTNI 234
Query: 276 FVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
+ R I+T YT G R L I + F C C RCSDPTE G Y S +
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLY 294
Query: 329 CLNEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVT 371
C + + + G +C + + D+ G+ +A+ N+ + +
Sbjct: 295 CRDTNCSGLVVPETTGLPHPSYNCLVCKQKSTHAQMMKSQDFASGAINAKANSNSLRTLV 354
Query: 372 SQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
L E+ D + N N ++ L+ L+ +
Sbjct: 355 QYLNEKSDNFI-PNSNYVVIDAKLSVLQRL 383
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/382 (21%), Positives = 157/382 (41%), Gaps = 49/382 (12%)
Query: 62 PLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW 121
P GR L T + PG+ I + P + P + E C C +N L + C ++
Sbjct: 19 PEKGRSLFTTRNFRPGEEIISQEPYVCVPNNSPVESRCDRCFASIN--LKKCSACKVAWY 76
Query: 122 PACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR---- 173
CS+ C E + +L C+ L + +++++ P +R +I ++R+
Sbjct: 77 --CSSVC--------QKSEWKLHRLECDALARLDKDRHKSVTPSIRLMIKLFIRRKLQSE 126
Query: 174 ------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
+ Y+ ++++ HMS V Y ++ ++V NF+ + PG + + +
Sbjct: 127 KVIPMAATDNYKLVEELIDHMSGIDEKQLVLYAQMGTLV-----NFILQWPGMNVKEIAV 181
Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
C + ++ GLY VI ++ H+CLPNA + ++++ +
Sbjct: 182 NFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLPNAVLVFEGRTAVVRAVQHIPAGA 241
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHKDQ 336
+S Y T R++ L F C+C RC D + +C N DQ
Sbjct: 242 EVSISYIETAGSTMTRQKTLKENYLFTCTCSRCVKVAQEDEIKESAILEGYRCRN---DQ 298
Query: 337 GDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLT 396
D ++L + +D+ +TC C + D+ + S++ D+ + + +S E L
Sbjct: 299 CDGFLLRNS---DDTGFTCQQCGLVRSKEDIKNIASKIKSISDE-ASTSLSSQSYAEALF 354
Query: 397 KLEAMFHPHHYHCYAVKHSLIQ 418
E + C+ SL+Q
Sbjct: 355 MYEKVEKLQRILCHPYSISLMQ 376
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 123/301 (40%), Gaps = 45/301 (14%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGISKIAGRGVVATRHLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCMIL 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS C + PE A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKTDCDLFRSWGPNEPEVANSVIIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ K +V E ++ V +R+N +K
Sbjct: 136 LNKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKKFPTDKKVIEIMNRTVAVIRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + +M H+C+PN+ K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVMNHDCIPNSYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + I+T YT G R L + K F C C RCSDPTE G Y S +
Sbjct: 235 MIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKNFTCKCPRCSDPTEKGAYISGLY 294
Query: 329 C 329
C
Sbjct: 295 C 295
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 91/390 (23%), Positives = 159/390 (40%), Gaps = 59/390 (15%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +E+ S + GR + AT L G+ I ++SPL++G LA E L
Sbjct: 19 KDKDPAWEVGVSKIAGRGVMATRQLKRGEIIFQDSPLLIG--LAAQEEDTLNACSICLKM 76
Query: 110 LADNARCPR--CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC 167
L D R C P CS + H +C + K D I+ L C
Sbjct: 77 LPDTRFMCRQGCGLPICSL----CAKKKQHKSDCDMFKSWGPN---EPDVANSVIIRLLC 129
Query: 168 LILQRRSPKKYQELKD-MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
+ K ++L ++A++ E+ + K ++ +K ++++ T A
Sbjct: 130 VARAINLTKDQRDLIYCLQANLDNNH-----RTEVRNAAKCFKNFPTDKKLVEIMNRTVA 184
Query: 227 KCLHWICGVIEVNGVD--IGRYT-------QGLYSVICLMEHNCLPNA------KHSNMM 271
V+ NG D R T + LY + +M H+C+PN+ K +NM+
Sbjct: 185 --------VLRTNGFDKTTDRTTDNQEFSYRALYPLFAVMNHDCIPNSYYTFEEKTNNMI 236
Query: 272 -QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
++ + + I+T YT G R +L + K F C C RCSDPTE G Y S + C
Sbjct: 237 VRAAVDIAEGEEITTTYTKLFTGNIARHLYLKMKKGFTCKCPRCSDPTEKGAYISGLYCR 296
Query: 331 NEH-------KDQG------DCWILPVNP----LDNDSDWTCGSCSARLNARDVHLVTSQ 373
+ + G +C + + D+ G+ +A++N+ + +
Sbjct: 297 DTKCSGLVVPETTGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNSLRTLVHY 356
Query: 374 LGEQVDKLVQENPNVKSLEEMLTKLEAMFH 403
L E+ D + + N ++ ++ L+ + H
Sbjct: 357 LNEKSDSFI-PSSNYVVIDAKMSVLQRLAH 385
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 124/301 (41%), Gaps = 45/301 (14%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C L + P+ A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLLKSWGPNEPDVANSVIIRLLC-VARALN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKIIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEDKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 C 329
C
Sbjct: 295 C 295
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 93/395 (23%), Positives = 155/395 (39%), Gaps = 73/395 (18%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C + P+ A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
C + +C L V + D+ G+ +A++N+
Sbjct: 295 CRDT-----NCTGLVVPEISGLPHPNWNCLECKQKSTHAQMMKSQDFASGAINAKVNSNS 349
Query: 367 VHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
+ + L E+ D + N N ++ ++ L+ +
Sbjct: 350 LRTLVQYLNEKSDSFI-PNSNYVVIDAKMSVLQRL 383
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 68.9 bits (167), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 127/303 (41%), Gaps = 49/303 (16%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
K K +E+ S + GR + AT +L G+ I ++SPL++G LA E C C K
Sbjct: 19 KDKDPAWEVGVSKIAGRGVMATRNLKRGEIIFQDSPLLIG--LAAHEEDTLNACSVCFKM 76
Query: 106 LNPNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLA 153
L P+ R C P CS C ++ P+ A I++L C A
Sbjct: 77 L-PDTRFMCR-QGCGLPVCSLCAKKKQHKTDCDMFKAWGSNEPDVANSVIIRLLC-VARA 133
Query: 154 YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL 213
N K + L + CL + + E+++ ++ E ++ V LR+N
Sbjct: 134 INLSKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKIVEIMNRTVAVLRTNGF 191
Query: 214 EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KH 267
+K D+ E N + LY + +M H+C+PN+ K
Sbjct: 192 DKTTDRTTDNQ------------EFN-------YRALYPLFAVMNHDCIPNSYYTFEEKT 232
Query: 268 SNMM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
+NM+ ++ + + I+T YT G R L + K F C C RC DPTE G++ S
Sbjct: 233 NNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCLDPTEKGSFISG 292
Query: 327 MKC 329
+ C
Sbjct: 293 LYC 295
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 128/304 (42%), Gaps = 51/304 (16%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
K K +EI S + GR + AT +L G+ I ++ PL++G LA E C C K
Sbjct: 19 KVKDPAWEIGISKIAGRGVMATRNLKRGEIIFQDMPLLIG--LAAQEEDSLNACSICLKL 76
Query: 106 LNPNLADNARCPRCFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLA 153
L P+ R C P CS C + P+ A I++L C A
Sbjct: 77 L-PDTRFMCR-QGCGLPICSLCGKKRQHKTDCDMFKSWGPNEPDVANSVIIRLLC-VARA 133
Query: 154 YNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL 213
N K + L + CL + + E+++ ++ E ++ V LR+N
Sbjct: 134 INLTKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLVEIMNRTVAVLRTNGF 191
Query: 214 EKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNA------K 266
+K T+ +C D ++ + LY + +M H+C+PN+ K
Sbjct: 192 DK--------TTDRC------------TDNQEFSYRALYPLFGVMNHDCIPNSYYTFEEK 231
Query: 267 HSNMM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
+NM+ ++ + + I+T YT G R L + K F C C RCSDPTE G Y S
Sbjct: 232 TNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCSDPTEKGAYIS 291
Query: 326 AMKC 329
+ C
Sbjct: 292 GLYC 295
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C + P+ A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
C + +C L V + D+ G+ +A++N+
Sbjct: 295 CRDT-----NCTGLVVPEISGLPHPNWNCLECKQKSTHAQMMKSQDFASGAINAKVNSNS 349
Query: 367 VHLVTSQLGEQVDKLV 382
+ + L E+ D +
Sbjct: 350 LRTLVQYLNEKSDSFI 365
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C + P+ A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
C + +C L V + D+ G+ +A++N+
Sbjct: 295 CRDT-----NCTGLVVPEITGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNS 349
Query: 367 VHLVTSQLGEQVDKLV 382
+ + L E+ D +
Sbjct: 350 LRTLVQYLNEKSDSFI 365
>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
Length = 302
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 116/278 (41%), Gaps = 33/278 (11%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + SP+ GR + A ++ G+ + RE L+VGP + C+ C+ ++
Sbjct: 42 WRVGDSPISGRGVFAARNIEKGEELFREHTLLVGPTAHRGRNLRTCIHCYHQPPGESEES 101
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL-RCLILQ 171
A C C P C + + HA EC + + L+ + + IL + RC L
Sbjct: 102 ALCSAGCGLPVCP----DCAPSERHAVECQLFRKWQPKHLSIIEPRALRILSVVRCFFL- 156
Query: 172 RRSPKKYQELKDMEA-----HMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
S + Q L M+A +M + E +E L F + + +
Sbjct: 157 --SEPQRQLLYAMQANTDRYYMREVQRAAECFEHFPRDPDML-EYFYRTVCAFNTNAFES 213
Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRD 280
+C ++G ++ T+ L+ + L+ H C PNA H + ++ + +
Sbjct: 214 RC--------HIDGQEV--VTRALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAG 263
Query: 281 THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ YT LW T R+ L +TK+F C C RC DPT
Sbjct: 264 AEITMSYTKLLWSTLARKIFLGMTKHFMCQCPRCQDPT 301
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 89/376 (23%), Positives = 146/376 (38%), Gaps = 72/376 (19%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C + P+ A +++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVVIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LSKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 CLNEHKDQGDCWILPVNPLDN----------------------DSDWTCGSCSARLNARD 366
C + +C L V + D+ G+ +A++N+
Sbjct: 295 CRDT-----NCAGLVVPEITGLPHPNWNCLVCKQKSTHAQMMKSQDFASGAINAKVNSNS 349
Query: 367 VHLVTSQLGEQVDKLV 382
+ + L E+ D +
Sbjct: 350 LRTLVQYLNEKSDSFI 365
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 123/302 (40%), Gaps = 47/302 (15%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEP----ICLGCHKP 105
K K +EI S + GR + AT L G+ I ++SPL++G LA E C C K
Sbjct: 19 KDKDPAWEIGVSKIAGRGVMATRSLKRGEIIFKDSPLLIG--LAAHEEDALNACTVCLKE 76
Query: 106 LNPNLADNAR--CPRCFWPACSAR------CSGLSDAHTHAPECA---ILKLGCETLLAY 154
L P+ R C P C+ + C + PE A I++L C A
Sbjct: 77 L-PDTKFMCRQGCGLPVCPLCAKKKQHKTDCDMFKSWGPNEPEVANSVIIRLLC-IARAI 134
Query: 155 NDYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE 214
N K + L + CL + + E+++ ++ E ++ V +R+N +
Sbjct: 135 NLTKDQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVMRTNGFD 192
Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSK 274
K D+ E N + LY + +M H+C+PN+ ++ ++
Sbjct: 193 KTTDRTNDNQ------------EFN-------YRALYPLFGVMNHDCIPNSYYTFDEKTN 233
Query: 275 LFVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAM 327
V R I+T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 234 HMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCSDPTEKGAFISGL 293
Query: 328 KC 329
C
Sbjct: 294 YC 295
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 123/301 (40%), Gaps = 45/301 (14%)
Query: 50 KCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPN 109
K K +EI S + GR + AT L G+ I R+SPL++G LA E L
Sbjct: 19 KDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLMG--LAAHEEDSLNACSVCLKM 76
Query: 110 LADNARCPR--CFWPACS---------ARCSGLSDAHTHAPECA---ILKLGCETLLAYN 155
L D R C P CS + C + P+ A I++L C A N
Sbjct: 77 LPDTRFMCRQGCGLPVCSLCAKKKQHKSDCDLFKSWGPNEPDVANSVIIRLLC-VARAIN 135
Query: 156 DYKYEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
K + L + CL + + E+++ ++ E ++ V LR+N +K
Sbjct: 136 LTKEQRDL-IYCLQANLDNNHR-TEVRNAAKCFKNFPTDKKLIEIMNRTVAVLRTNGFDK 193
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSN 269
D+ E N + LY + ++ H+C+PNA K +N
Sbjct: 194 TTDRTNDNQ------------EFN-------YRALYPLFGVVNHDCIPNAYYTFEEKTNN 234
Query: 270 MM-QSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK 328
M+ ++ + + ++T YT G R L + K F C C RCSDPTE G + S +
Sbjct: 235 MIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLY 294
Query: 329 C 329
C
Sbjct: 295 C 295
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 142/359 (39%), Gaps = 38/359 (10%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P EI+ P GR L AT PG I +PL++ P +A +C C +P P L
Sbjct: 6 PVEIRPHPSKGRALHATKSFPPGAVIFSFTPLLLLPTVACLTSVCSHCLRPGEPRL---- 61
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAIL--PLRCLILQ 171
C RC A C A C + H+ EC L +T+ D + +L P R LI
Sbjct: 62 -CSRCHAAAYCDATCQAAAWKAVHSRECKAL---TKTI---RDEERRRMLPTPTRALIQA 114
Query: 172 RRSPKKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
K L D+E H + KR G E + +I+ ++ F GD +A+ L
Sbjct: 115 LLWGKIRDGLADLEGHVLEKRAEGDE-WRDIE-MMAVAGCAFSGLGRGDQDVRMAAEMLC 172
Query: 231 WICGVIEVNGVDIGRYTQGLY--SVICLMEHNCLPNAKHSNMMQSKLFVFRD-----THI 283
I D GLY + + H+C+PNA + ++ L + + I
Sbjct: 173 KIQNN-SFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRNALLIAENPIRAGDEI 231
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
YT RRE LA K F C C RC D + + C+N + +
Sbjct: 232 EIAYTFYTDPLPKRREALAHYK-FTCQCLRCRDNLNVYQVAAISPCVNLN---SQSLVPD 287
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMF 402
V+ L N + + A +N Q E V+ +V ++SL E L++ F
Sbjct: 288 VSKLRNHPATSSAAKQAIVN---------QHSESVEHIVDALAVIESLSERRKLLKSEF 337
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 65.5 bits (158), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 75/381 (19%), Positives = 151/381 (39%), Gaps = 53/381 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ + P GR L T + +PG+ I R+ P + P N + +RC
Sbjct: 15 VSNLPEKGRCLLTTKNFNPGEVILRQEPYVCVPN-----------------NSSTVSRCD 57
Query: 118 RCFWPACSARCSGLS-----DAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
CF +CS + E + +L C L K +A+ P L+++
Sbjct: 58 GCFASESLKKCSACQVVWYCGSTCQKSEWKLHRLECNALSRLEKEKRKAVTPSIRLMVRL 117
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
+K Q D++ +Y ++ ++V +F+ + P L + +
Sbjct: 118 YLRRKLQNEMDLDEKQ------LVLYAQMANLV-----HFILQWPEINLKEIAENFSKLA 166
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMY 287
C + ++ GLY V+ ++ H+C+PNA S+++++ + +S Y
Sbjct: 167 CNAHTICDCELRPLGTGLYPVVSIINHSCMPNAVLTFEGKSSVVRAVEHIPEGAEVSIAY 226
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC---SDPTELGTYFSAMKCLNEHKDQGDCWILPV 344
+ T R++ L +F C+C RC D + +C + D+ + ++L
Sbjct: 227 IDTAGSTMTRQKALKEQYFFTCTCPRCIKVYDDIQESAILEGYRCKD---DRCNGFLLRD 283
Query: 345 NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENP-----NVKSLEEMLTKLE 399
+ D + C +C R + +V + ++ DK ++ V SL +M+ KL+
Sbjct: 284 S---EDKGFICQTCGLRRSKEEVKRIVCEITAISDKKLKSTSPGNHEEVISLYKMIEKLQ 340
Query: 400 A-MFHPHHYHCYAVKHSLIQL 419
+ HP + L+++
Sbjct: 341 MELCHPFSISLMRTQEELLKI 361
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 14 RCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C + +L KCSAC+ V YCG QK WKLH+ +C L
Sbjct: 55 RCDGCFASESLKKCSACQVVWYCGSTCQKSEWKLHRLECNAL 96
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 89/412 (21%), Positives = 170/412 (41%), Gaps = 55/412 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L A PG+ + + P P C C + NL + C
Sbjct: 19 VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A + + + + P + L+L+R+
Sbjct: 76 RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + +F +L +
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
T +K + C + ++ GLY V+ ++ H+C+PNA + +++ +
Sbjct: 185 HTFSK---FACNAHTICDPELRSLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
++ +S Y T R++ L YF C+C RC +E +C N+ K G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQ---ENPNVKSL 391
++LP + +TC CS + ++ + S L ++V LV +N V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEVGSM 354
Query: 392 EEMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
+ + +LE ++HP + +L+++Y Q Y +L+ + ER
Sbjct: 355 YKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 406
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 65.1 bits (157), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 89/411 (21%), Positives = 169/411 (41%), Gaps = 54/411 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L A PG+ + + P P C C + NL + C
Sbjct: 19 VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A + + + + P + L+L+R+
Sbjct: 76 RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + +F +L +
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
T +K + C + ++ GLY V+ ++ H+C+PNA + +++ +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
++ +S Y T R++ L YF C+C RC +E +C N+ K G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQE--NPNVKSLE 392
++LP + +TC CS + ++ + S L ++V LV N V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVGSMY 354
Query: 393 EMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
+ + +LE ++HP + +L+++Y Q Y +L+ + ER
Sbjct: 355 KTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 405
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/412 (21%), Positives = 170/412 (41%), Gaps = 55/412 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L A PG+ + + P P C C + NL + C
Sbjct: 19 VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A + + + + P + L+L+R+
Sbjct: 76 RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + +F +L +
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
T +K + C + ++ GLY V+ ++ H+C+PNA + +++ +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
++ +S Y T R++ L YF C+C RC +E +C N+ K G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQ---ENPNVKSL 391
++LP + +TC CS + ++ + S L ++V LV +N V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGIDNSEVGSM 354
Query: 392 EEMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
+ + +LE ++HP + +L+++Y Q Y +L+ + ER
Sbjct: 355 YKTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 406
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/411 (21%), Positives = 169/411 (41%), Gaps = 54/411 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L A PG+ + + P P C C + NL + C
Sbjct: 19 VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A + + + + P + L+L+R+
Sbjct: 76 RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + +F +L +
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
T +K + C + ++ GLY V+ ++ H+C+PNA + +++ +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
++ +S Y T R++ L YF C+C RC +E +C N+ K G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQE--NPNVKSLE 392
++LP + +TC CS + ++ + S L ++V LV N V S+
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLVSSGNNSEVGSMY 354
Query: 393 EMLTKLE-AMFHPHHYHCYAVKHSLIQLYGT----QPGYAYTQLSSSLLER 438
+ + +LE ++HP + +L+++Y Q Y +L+ + ER
Sbjct: 355 KTIEELERKLYHPLSITLLHTRETLLKIYMELQDWQTALMYCRLTIPVYER 405
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L AT PGD I + P P L C C + NL + C
Sbjct: 19 VASIPGKGRGLIATFTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
R W CS+ C E + +L C + A + + + + P +R ++ L+R+
Sbjct: 76 RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTENRKKMLTPTIRLMVRLALKRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
S Y + +E+H+SK V Y ++ ++V + LP LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDALESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
+ + + C + ++ GLY VI ++ H+C+PNA + +++
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ +D +S Y T+ R L +F C+C RC + +C N+
Sbjct: 240 PIDKDEEVSISYIETAAVTKKRNNDLK-QYFFTCTCPRCVKGFDEDPLLEGFRCKNQ--- 295
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
D ++LP + +TC C A + ++ + S++ + DK
Sbjct: 296 TCDGFLLPSS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 113/273 (41%), Gaps = 34/273 (12%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P+++K L GR L A+ PGD I +P I+ P L+ A +C C KP P
Sbjct: 6 PFQVKRH-LKGRCLIASRSFTPGDTILTFTPTILIPSLSHASTVCSHCFKPGEPR----- 59
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAIL-KLGCETLLAYNDYKYEAILPLRCLILQR 172
C RC A C A C + H+ EC +L K+ + + P+R ++
Sbjct: 60 SCSRCHAVAYCDASCQSAAWKAIHSKECKVLQKVSAQG-------RPGLPTPVRAVVQAL 112
Query: 173 RSPKKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
+ L+D+E + +S RG E+ ++ + + K L
Sbjct: 113 VKAEVGAALEDLEGNDLSWRGSDKWADMEMMAMGASAFAGL------GTTQEAVKKALSL 166
Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAK-----HSNMMQSKLFVFRDT 281
+C I+ N DIG+ L + + H+C+PNA +++++ + D
Sbjct: 167 LCK-IQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGRRAILRAERPIKADE 225
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
I YT+ + R+E LA +F C C RC
Sbjct: 226 EIEISYTDYNYPLSKRKEALA-PYFFTCECTRC 257
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/391 (20%), Positives = 157/391 (40%), Gaps = 54/391 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ ++P GR L + D PG+ I + P + P + C GC + + NL + C
Sbjct: 15 VSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFR--SSNLKKCSACH 72
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
++ C + C SD H +L C L + +++ P L+++ +K
Sbjct: 73 VVWY--CGSTCQK-SDWKLH-------RLECNALSRLEKERQKSLTPSIRLMVKLYMRRK 122
Query: 178 YQELKDM-----------EAHMSKRGPGTE----VYEEIDSIVKYLRSNFLEKLPGDVLD 222
Q K M EA +S E +Y ++ ++V N + + P +
Sbjct: 123 LQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLV-----NLILQWPDINVK 177
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
+ + C + ++ GLY VI ++ H+CLPN+ + + +L V R
Sbjct: 178 EIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNSVL--VFEERLAVVRAVQ 235
Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCL 330
T + Y T R++ L F C+C RC D + +C
Sbjct: 236 HIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYRCK 295
Query: 331 NEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK-LVQENPNVK 389
+ D+ D ++L + +D + C C N ++ + S+L DK + +
Sbjct: 296 D---DRCDGFLLRDS---DDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSSSQAT 349
Query: 390 SLEEMLTKLEA-MFHPHHYHCYAVKHSLIQL 419
S+ +M+ KL+ +FHP + + +++++
Sbjct: 350 SIYKMIEKLQTKLFHPFSINLMRTREAILKI 380
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C R + L KCSAC V YCG QK WKLH+ +C L
Sbjct: 55 RCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLECNAL 96
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L AT PGD I + P P L C C + NL + C
Sbjct: 19 VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
R W CS+ C E + +L C + A + + + + P +R ++ L+R+
Sbjct: 76 RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
S Y + +E+H+SK V Y ++ ++V + LP LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
+ + + C + ++ GLY VI ++ H+C+PNA + +++
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ +D +S Y T+ R L +F C+C RC + +C N+
Sbjct: 240 PINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ--- 295
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
D ++LP + +TC C A + ++ + S++ + DK
Sbjct: 296 ACDGFLLPNS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 79/345 (22%), Positives = 139/345 (40%), Gaps = 48/345 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L AT PGD I + P P L C C + NL + C
Sbjct: 19 VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLI---LQRR 173
R W CS+ C E + +L C + A + + + + P +R ++ L+R+
Sbjct: 76 RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
S Y + +E+H+SK V Y ++ ++V + LP LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDGLESHISKVDENQLVLYAQMANLVSLI-------LPFIELD 179
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
+ + + C + ++ GLY VI ++ H+C+PNA + +++
Sbjct: 180 LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ +D +S Y T+ R L +F C+C RC + +C N+
Sbjct: 240 PINKDEEVSISYIETATVTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ--- 295
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
D ++LP + +TC C A + ++ + S++ + DK
Sbjct: 296 ACDGFLLPNS---GKKAYTCQKCGASRDVEEIKNMRSEILQLSDK 337
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/413 (21%), Positives = 161/413 (38%), Gaps = 58/413 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L AT PGD I + P P L C C + NL + C
Sbjct: 19 VASIPGKGRGLIATRTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMCQ 76
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
++ CS C E + +L C + A + + + + P + L+L+R+
Sbjct: 77 VTWY--CSTNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLVLKRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
S Y + +E+H+ K V Y ++ ++V + LP LD
Sbjct: 127 LQNEKVIPSSSIDNYNLVDALESHIWKVDENHLVLYAQMANLVSLI-------LPLIELD 179
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
+ + + C + ++ GLY VI ++ H+C+PNA + +++
Sbjct: 180 LKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQ 239
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ ++ +S Y T+ R L +F CSC RC +E +C N+
Sbjct: 240 PIGKNEEVSISYIETAAVTKKRHNDLK-QYFFTCSCPRCVKGSEEDALLEGFRCKNQ--- 295
Query: 336 QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKL---------VQENP 386
D ++LP + +TC CS + ++ + S++ + DK +
Sbjct: 296 TCDGFLLPDS---GKKAYTCQKCSVSRDEEEIQKMRSEILQLSDKASSFLSSGNKAEAGS 352
Query: 387 NVKSLEEMLTKLEAMFHPHHYH-CYAVKHSLIQLYGTQPGYAYTQLSSSLLER 438
K++E++ L F H C + ++L + AY +L+ + ER
Sbjct: 353 VYKTIEQLEQNLYHAFSTTLLHTCETLLKIYMELQDWRTALAYCRLTVPVYER 405
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 62.8 bits (151), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 75/281 (26%), Positives = 112/281 (39%), Gaps = 40/281 (14%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ SP GR L+AT +L GD + E P +LA+ IC C + + RC
Sbjct: 5 EVFDSPGKGRGLRATKELLSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDK----LQRC 60
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR--- 172
+C F C C + H EC +K + E I L I+ R
Sbjct: 61 GQCKFAQYCDKTCQS-AGWKEHKLECGAIKAFGAIGDNFGKAPNENI-RLAARIMWRLDK 118
Query: 173 ----RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDV----LDDT 224
S + L D+E H+S P ++ E +K NFL+ P + +DD
Sbjct: 119 EGGVMSDMQMTSLDDLENHISDM-PEDDLKE-----LKVDIHNFLDYWPRNSKQHRVDDI 172
Query: 225 SAKCLHWICGVIEVNGVDIGR------YTQGLYSVICLMEHNCLPN----AKHSNMMQSK 274
S I GVI NG + GL+ +CL+ H+C PN H +
Sbjct: 173 SH-----IFGVINCNGFTVSDQRGLNAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRA 227
Query: 275 LFVFR-DTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
L ++ Y + L T+ R+ L + +F+C+CE C
Sbjct: 228 LGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHC 268
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/394 (20%), Positives = 157/394 (39%), Gaps = 57/394 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ ++P GR L + D PG+ I + P + P + C GC + + NL + C
Sbjct: 15 VSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFR--SSNLKKCSACH 72
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
++ C + C SD H +L C L + +++ P L+++ +K
Sbjct: 73 VVWY--CGSTCQK-SDWKLH-------RLECNALSRLEKERQKSLTPSIRLMVKLYMRRK 122
Query: 178 YQELKDM-----------EAHMSKRGPGTE----VYEEIDSIVKYLRSNFLEKLPGDVLD 222
Q K M EA +S E +Y ++ ++V N + + P +
Sbjct: 123 LQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLYAQMANLV-----NLILQWPDINVK 177
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR--- 279
+ + C + ++ GLY VI ++ H+CLPN+ + + +L V R
Sbjct: 178 EIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLPNS--VLVFEERLAVVRAVQ 235
Query: 280 ----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS-----DPTELGTYFSAMKCL 330
T + Y T R++ L F C+C RC D + +C
Sbjct: 236 HIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRMGQYDDIQESAILEGYRCK 295
Query: 331 NEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN--- 387
+ D+ D ++L + +D + C C N ++ + S+L DK + +
Sbjct: 296 D---DRCDGFLLRDS---DDIGFICQQCGLVRNKEEIKRLASELKPLSDKATMSSSSHYV 349
Query: 388 -VKSLEEMLTKLEA-MFHPHHYHCYAVKHSLIQL 419
S+ +M+ KL+ +FHP + + +++++
Sbjct: 350 EATSIYKMIEKLQTKLFHPFSINLMRTREAILKI 383
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C R + L KCSAC V YCG QK WKLH+ +C L
Sbjct: 55 RCEGCFRSSNLKKCSACHVVWYCGSTCQKSDWKLHRLECNAL 96
>gi|281208900|gb|EFA83075.1| hypothetical protein PPL_03863 [Polysphondylium pallidum PN500]
Length = 470
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 129/295 (43%), Gaps = 42/295 (14%)
Query: 136 THAPECA-ILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ----ELKDMEAHMSK 190
H EC+ ++K+ T+ ND + ++L LR L+ + + Y L + K
Sbjct: 127 VHQLECSFVVKVAQATVDVQNDLSH-SLLVLRILLKKNILSELYNGGVGTLSVQQQEFEK 185
Query: 191 RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDD-------TSAKCLHWICGVIEVNGVDI 243
R P + ++ ++ F+E L D D T + ++ C + VN
Sbjct: 186 RAP-----QWLEQTSRFA-DRFIELLQEDKHIDPTLLSFFTKQEIINTACATM-VNSFAS 238
Query: 244 GR----YTQGLYSVICLMEHNCLPNAKHS---NMMQSKLF--VFRDTHISTMYTNALWGT 294
+ + G + L+ H+C PN +S N +Q + + + I Y + L T
Sbjct: 239 TKNSIIVSNGFFYEAALLNHSCQPNTFYSIQNNQLQMRCIQDIEQGGDIFDSYVDLLEPT 298
Query: 295 QPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV---NPLDN-- 349
R++ L +K+F C CERC DPTE G Y S+ KC + + W+ P+ N DN
Sbjct: 299 FERQQVLLQSKHFYCRCERCQDPTECGRYLSSFKCPSCQEG----WVSPIISYNEKDNVL 354
Query: 350 DSDWTCGSCSARLNARDVHLV--TSQLGEQV-DKLVQENPNVKSLEEMLTKLEAM 401
W C C N R + + QL + + +K + +PN LE++ +K+E +
Sbjct: 355 ARLWRCLGCRQEQNDRIIQQLKGIDQLKQTINEKKWRVDPNA-YLEQIGSKMEEV 408
>gi|194768767|ref|XP_001966483.1| GF21974 [Drosophila ananassae]
gi|190617247|gb|EDV32771.1| GF21974 [Drosophila ananassae]
Length = 364
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/284 (25%), Positives = 113/284 (39%), Gaps = 45/284 (15%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI--CLGCHKPLNPNLADN 113
+ + SP+ GR + AT D+ G+ + RE L+VGP + + C C+ + + +
Sbjct: 49 WRVADSPISGRGIFATRDIAAGEELFREHTLLVGPTAHRSTNLRTCTLCYCLIPGDTDLD 108
Query: 114 ARCPR-CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-------- 164
A CP C P C SD H +L C+ + + + I P
Sbjct: 109 ALCPAGCGLPVCPD--CRTSDRH---------QLECKLFRKWKPLETQRIEPRALRILSV 157
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+RC L P++ + L M+A+ + R EV + + + D+LD
Sbjct: 158 VRCFFLD--EPQR-KLLYAMQAN-ADRYYMKEVQRAAECFQHFPKEQ-------DMLDYF 206
Query: 225 SAKCLHWICGVIE----VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR- 279
+ E V G ++ + L+ + L+ H C PNA H + V
Sbjct: 207 YRTICAFNTNAFESRSHVGGHEV--LVRALFPLAGLLNHQCTPNAAHHFEDGETIVVCST 264
Query: 280 -----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
I+ Y LW T R+ L +TK+F C C RC DPT
Sbjct: 265 ERIPCGAEITMSYAKLLWSTLARKIFLGMTKHFMCQCVRCQDPT 308
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 116/278 (41%), Gaps = 36/278 (12%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ SP GR L A H G+ I ++ P + + C C++ + NL + C
Sbjct: 21 VSVSPAKGRCLIANRAFHRGEVILQQEPYVCALDDSSQALRCDRCYRQ-SSNLKRCSACK 79
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL---ILQRRS 174
F+ C A C S H ECAIL + + K LR + +++RR
Sbjct: 80 TVFY--CCANCQR-SGWGLHKFECAIL------VRLLKEKKRSPTPSLRLVMRFLIKRRL 130
Query: 175 PKKYQELKD----------MEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDV-LD 222
+ +LK + HMS+ V Y ++ SIV+ + + P +V +
Sbjct: 131 QAERVQLKAAVDNYDVCEVLPTHMSETSEDRLVMYAQMASIVQQMMA------PDEVNVK 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAK-HSNMMQSKLFVFRDT 281
+ + + C + ++ GLY VI ++ H+C+PNA H + ++ L DT
Sbjct: 185 EVTQMICRFACNAHTICDEEVRPLGTGLYPVISIVNHSCVPNAVLHFDGNRAALRALEDT 244
Query: 282 H----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
I+ Y T RR+ L YF+C+C RCS
Sbjct: 245 QEGTEITISYVELAASTNTRRKALRDQYYFDCNCIRCS 282
Score = 38.9 bits (89), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 15/78 (19%)
Query: 8 EDEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK---KLPYEIKSSP 62
+ + +RC C + + L +CSACK V YC Q+ W LHK +C +L E K SP
Sbjct: 55 DSSQALRCDRCYRQSSNLKRCSACKTVFYCCANCQRSGWGLHKFECAILVRLLKEKKRSP 114
Query: 63 ----------LLGRYLQA 70
L+ R LQA
Sbjct: 115 TPSLRLVMRFLIKRRLQA 132
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 138/340 (40%), Gaps = 50/340 (14%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR L A D + G + + P + C C K +P L + C F+ C
Sbjct: 25 GRCLVADRDFYAGQTVLEQDPYVAVLDGDSRGNRCDACFKQ-SPALRRCSACKFVFY--C 81
Query: 125 SARCSGLSDAHTHAPECAIL-KLGCE----------TLLAYNDYKYEAILPLRCLILQRR 173
S C S H EC +L +L E LL + + +LP+ L
Sbjct: 82 SPTCQ-RSQWKIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATGVLPVTAL----- 135
Query: 174 SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVK-YLRSNFLEKLPGDVLDDTSAKC-LH 230
Y+ ++ + H S+ G V Y ++ ++K L + + E + D C H
Sbjct: 136 --DNYEIVRALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRISCNGH 193
Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHIS 284
IC E+ V IG L+ V+ ++ H+C N+ KH+ ++++ + R ++
Sbjct: 194 TICD-DELRPVGIG-----LFPVVSIINHSCSSNSLLLFDGKHA-VVRALGTISRGCEVT 246
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDCW 340
Y T RRE L+ YF+C C RC+D +E G Y A+ CL D
Sbjct: 247 VSYIELGASTNSRREALSDQYYFHCKCPRCTDDSEAGLYKDDVLEAVACL-------DPA 299
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
L N S W C SC + +V+ ++++ ++K
Sbjct: 300 CESFMRLSNGS-WRCMSCGSSREPNEVNKLSTEAEGMIEK 338
Score = 42.0 bits (97), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 9 DEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
D RC C + AL +CSACK V YC Q+ WK+H+ +CK L
Sbjct: 53 DSRGNRCDACFKQSPALRRCSACKFVFYCSPTCQRSQWKIHQEECKVL 100
>gi|358388868|gb|EHK26461.1| hypothetical protein TRIVIDRAFT_35412 [Trichoderma virens Gv29-8]
Length = 535
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 110/269 (40%), Gaps = 18/269 (6%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P EI+ P GR L AT PG I +PL++ P ++ +C C +P NP
Sbjct: 6 PLEIRPHPSKGRGLYATKSFSPGAVIFPFTPLLLLPTVSCLSSVCSYCLRPGNPRA---- 61
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
C RC + C A C + H+ EC L+ G + + + P R L+
Sbjct: 62 -CSRCHAASYCDAACQAAAWKAVHSRECKALRQGIKD----EGRRRQLPTPTRALMQALL 116
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI- 232
+ LKD+E H+ ++ + + +I+ ++ F K + L +A+ L I
Sbjct: 117 CGEIGDGLKDLEGHVLEKKAEGDEWRDIE-MMAMAACAFSGKGTAEELVRRAAEMLCKIQ 175
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-----THISTMY 287
+ D+G L + + H+C+PNA + ++ L V I Y
Sbjct: 176 NNSFQRFDPDLGVVGLFLEPTLAMANHSCIPNAAVQFIGRNTLLVAESPIRAGDEIELAY 235
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSD 316
T RRE LA K F C C RC D
Sbjct: 236 TFYTDPLLKRREALAHYK-FVCQCLRCRD 263
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 74/274 (27%), Positives = 109/274 (39%), Gaps = 39/274 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+ T + GD I E L +LAE IC C + +C
Sbjct: 6 IFDSPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEK----LQKCS 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
+C F C C A H EC +K AY E I ++
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKQECGAIK-------AYGKAPNENI-----RVVSHMQLI 108
Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVI 236
+EL+D A M + +EI + K NFL+ P + T H I GVI
Sbjct: 109 TVEELEDHVADMQE--------DEIKEL-KVDIHNFLDYWPRNSKQHTIDDISH-IFGVI 158
Query: 237 EVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM-MQSKLFVFRDTHIST 285
NG + R Q GL+ +C++ HNC PN H + ++S + ++
Sbjct: 159 NCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGKIELRSLGKIAEGEELTV 218
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
Y + L ++ RR L +F+C C+ C + T+
Sbjct: 219 AYVDFLNLSEERRRLLKTQYFFDCQCDYCKNGTK 252
>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
Length = 598
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 129/331 (38%), Gaps = 53/331 (16%)
Query: 47 HKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPL 106
H+P L + + P+ GR + A D+ G+ I E PL+ A
Sbjct: 112 HQPHNSSLAWHVGHCPVKGRIMVAARDIEAGEIILEEEPLVAASWHA------------- 158
Query: 107 NPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLR 166
RC C P S+ CS + H + P A E L E + +
Sbjct: 159 -------HRCIECHEPHRSSECSRVQ--HKYPPAVAANMEEIERFL-------EGLESIE 202
Query: 167 CLILQRRSPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEK--LPGDVLDD 223
L RR K + A M P T E V +R + + K +P V D+
Sbjct: 203 ELDSARRFIKLLSKSSSDPAVMVHLSPCTAANMESCRKAVSAVRKHKVAKAIIPPTVSDE 262
Query: 224 TSAKCLHWICGVIEVNGVDIGRY-TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVF---- 278
+A+ L ++ N ++G + GL+ + CLMEH+C PN S ++L+V
Sbjct: 263 DAARML----SILNTNSHELGEWGGSGLFLLGCLMEHSCAPNCNFST-YGTQLWVTALNS 317
Query: 279 --RDTHISTMYTNALWG-TQPRREHLAITKYFNCSCERCSD-PTELGTYFSAMKCLNEHK 334
+ +S Y ++ + T RR+ L + F C C RC+ P + + +K
Sbjct: 318 IKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRCTTLPDVMRAFICPVK------ 371
Query: 335 DQGDCWILPVNPLDNDSDWTCGSCSARLNAR 365
Q + P+ SDW C C ++L+ +
Sbjct: 372 -QCKGKVCPLGEGSKPSDWVCEKCKSQLDKK 401
>gi|440790376|gb|ELR11659.1| peptidylprolyl cis-trans isomerase, FKBP-type domain containing
protein [Acanthamoeba castellanii str. Neff]
Length = 165
Score = 58.2 bits (139), Expect = 1e-05, Method: Composition-based stats.
Identities = 24/45 (53%), Positives = 30/45 (66%), Gaps = 4/45 (8%)
Query: 15 CAVC----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
CAVC RE L +C+AC+ V YC +HQK+HWK HK CK +P
Sbjct: 14 CAVCQVSSREKELKRCAACQAVWYCDAEHQKQHWKTHKAICKLIP 58
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 116/287 (40%), Gaps = 43/287 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+A + GD I E+ L +LAE +C C + RC
Sbjct: 6 IFDSPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEK----LQRCG 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
+C F C C A H EC+ +K AY E I L IL R
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKQECSAIK-------AYGKVPNENI-RLVARILWRLDKD 112
Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
S + L ++E H+ T++ E+ +K NFL+ P T
Sbjct: 113 GSTVSDMQLTTLDELEDHI------TDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDIS 166
Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM-MQSKLFVF 278
H I GVI NG + R Q GL+ +CL+ H+C PN H + ++S +
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRSLGKIA 225
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC----SDPTELG 321
++ Y + + ++ R+ L +F+C+CE C D +LG
Sbjct: 226 EGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLG 272
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 121/300 (40%), Gaps = 42/300 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ + P GR L T D+ PG+ I + P I P E C GC K NL + C
Sbjct: 15 VSNLPHKGRSLFTTRDIRPGEVILSQKPYICVPNNTSLESRCDGCFK--TNNLKKCSGCQ 72
Query: 118 RCFWPACSARCSGLSDAHTHAPEC-AILKLGCE-------------TLLAYNDYKYEAIL 163
++ C + C S+ H EC A+ +L E L + + E +L
Sbjct: 73 VVWY--CGSFCQK-SEWKLHRHECKALTRLEKEKRKFVTPTIRLMVKLYIKRNLQNEKVL 129
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE-VYEEIDSIVKYLRSNFLEKLPGDVLD 222
P+ S Y ++ + +HMS+ +Y ++ ++V N + + P L
Sbjct: 130 PI-------TSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLV-----NLILQFPSVDLR 177
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
+ + + C + ++ GL+ ++ ++ H+C PNA + ++++ +
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRAMDNI 237
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD-----PTELGTYFSAMKCLNE 332
+D+ I+ Y T R++ L F+C C RCS+ E +C NE
Sbjct: 238 SKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNVGKPHDIEESAILEGYRCSNE 297
Score = 42.0 bits (97), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C +T L KCS C+ V YCG QK WKLH+ +CK L
Sbjct: 55 RCDGCFKTNNLKKCSGCQVVWYCGSFCQKSEWKLHRHECKAL 96
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/300 (21%), Positives = 120/300 (40%), Gaps = 42/300 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ + P GR L D PG+ I + P I P +E C GC K NL + C
Sbjct: 15 VSNLPQKGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCFK--TNNLKKCSACQ 72
Query: 118 RCFWPACSARCSGLSDAHTHAPEC-AILKLGCET-------------LLAYNDYKYEAIL 163
++ C + C S+ H EC A+ +L E L + + E +L
Sbjct: 73 VVWY--CGSSCQK-SEWKLHRDECKALTRLEKEKRKFVTPTIRLMVRLYIKRNLQNEKVL 129
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE-VYEEIDSIVKYLRSNFLEKLPGDVLD 222
P+ + Y ++ + +HMS+ +Y ++ ++V N + + P L
Sbjct: 130 PI-------TTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLV-----NLILQFPSVDLR 177
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
+ + + C + ++ GL+ ++ ++ H+C PNA + ++++ +
Sbjct: 178 EIAENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVVRAMDNI 237
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD-----PTELGTYFSAMKCLNE 332
+D+ I+ Y T R++ L F+C C RCS+ E +C NE
Sbjct: 238 SKDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSNFGKPHDIEESAILEGYRCANE 297
Score = 45.1 bits (105), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C +T L KCSAC+ V YCG QK WKLH+ +CK L
Sbjct: 55 RCDGCFKTNNLKKCSACQVVWYCGSSCQKSEWKLHRDECKAL 96
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 113/278 (40%), Gaps = 39/278 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+AT + GD I E+ + +LAE IC C + RC
Sbjct: 6 IFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEK----LQRCG 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
+C F C C A H EC+ +K AY E I L I+ R
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKLECSAIK-------AYGKVPNENI-RLVARIMWRLDKE 112
Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
S + L ++E H++ P ++ E +K NFL+ P + T
Sbjct: 113 GSTVSDMQLTTLDELEDHIADM-PEDDLKE-----LKVDIHNFLDYWPHNSKQHTIDDIS 166
Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNMMQSKLF-VF 278
H I GVI NG + R Q GL+ +CL+ H+C PN H + L +
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGKIELRALGKIA 225
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
++ Y + L ++ R+ L +F+C+CE C +
Sbjct: 226 EGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 263
>gi|321448192|gb|EFX61351.1| hypothetical protein DAPPUDRAFT_273988 [Daphnia pulex]
Length = 326
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 30/44 (68%), Gaps = 2/44 (4%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
CAVCR A C C E++YC K+HQK+HW HK +CK PY+I
Sbjct: 19 CAVCRSKASQICGGCGEISYCSKEHQKQHWATHKSQCK--PYKI 60
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/276 (19%), Positives = 106/276 (38%), Gaps = 49/276 (17%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR L T D +PG+ I + P + P N A +RC RCF +
Sbjct: 22 GRSLVTTKDFNPGEVIISQEPYVCVPN-----------------NSATESRCDRCFSSSN 64
Query: 125 SARCSGLSDAHTHAPECAIL-----KLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
+CS A C L ++ CE L + + + + P L+++ +K Q
Sbjct: 65 VKKCSACQVAWYCGSSCQKLEWKLHRIECEALSKLDKDRRKYVTPSIRLMVRLLIRRKLQ 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS----------------NFLEKLPGDVLDD 223
K + + T+ Y+ ++++V +++ N + + P + +
Sbjct: 125 SEKIISS------TATDNYDLVEALVAHMKDIDEKQLVLYAQMANLVNLILRWPDVNIKE 178
Query: 224 TSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVF 278
+ C + ++ GLY V+ ++ H+CLPNA ++ + +
Sbjct: 179 IAENFSKLACNAHTICDSELRPLGTGLYPVVSIINHSCLPNAVLVFDGRLAVVHTVQHIP 238
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
+ + + Y T R++ L +F C+C RC
Sbjct: 239 KGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRC 274
Score = 38.9 bits (89), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C ++ + KCSAC+ YCG QK WKLH+ +C+ L
Sbjct: 55 RCDRCFSSSNVKKCSACQVAWYCGSSCQKLEWKLHRIECEAL 96
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 118/298 (39%), Gaps = 44/298 (14%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
+++ SP GR L + + G + P + C C P +P RC
Sbjct: 35 DLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPERPQLKRC 94
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYKYEAILPL-RCLILQRR 173
+C CS +C +D H PEC L+ + + EA+ L R L ++R
Sbjct: 95 AKCHVVHYCSQKCQN-ADWKAHKPECDALQRWATAAPSPDLAVPEEAVRCLGRMLWQKQR 153
Query: 174 SPKKY--QELKDMEAHMSKRGPGTEVYEE----IDSIVKYLRSNFLEKLP-------GDV 220
+P +E+ M++H S P E +E ++V+YL + EKL GDV
Sbjct: 154 NPSSIWSREIDSMQSHRSSLRP--ESFESHAYLAHALVRYLGLDAPEKLASFGLRSAGDV 211
Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----------KHSN 269
+D S + I + + + + L+ H+C PNA
Sbjct: 212 VDLIS----RFTTNAITLAAPSLTPLGVSVSPAVALVNHSCAPNAVVVFPRVSKTVDQEP 267
Query: 270 MMQSKLFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTE 319
+MQ + RD H ++T Y + + R++ L T +F CSC C+ DP E
Sbjct: 268 VMQ--VIALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCSCSLCAAEDPDPRE 323
>gi|281204009|gb|EFA78205.1| MYND-type zinc finger-containing protein [Polysphondylium
pallidum PN500]
Length = 854
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 37/66 (56%), Gaps = 6/66 (9%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
MAS+ EEL CA+C T + C+ C V YCG +HQ +HW HK +C L K
Sbjct: 1 MASLT---TEELHHCAICSSTQVQICTGCLSVYYCGSEHQLQHWPQHKSQCSGL---NKR 54
Query: 61 SPLLGR 66
S LLGR
Sbjct: 55 SDLLGR 60
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 76/290 (26%), Positives = 114/290 (39%), Gaps = 44/290 (15%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+AT + GD I E L +LAE IC C + +C
Sbjct: 6 IFDSPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEK----LQKCS 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
+C F C C A H EC +K AY E I + L+ R K
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKQECGAIK-------AYGKAPNENIRLVARLMW--RLDK 111
Query: 177 KYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS---NFLEKLPGDVLDDTSAKCLHWIC 233
+ + DM+ ++ V + + +K L+ NFL+ P + T H I
Sbjct: 112 EGSVVSDMQL-ITVEELEDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDDISH-IF 169
Query: 234 GVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM------------- 270
GVI NG + R Q GL+ +C++ HNC PN H N
Sbjct: 170 GVINCNGFTVSDQRGLQAVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRI 229
Query: 271 -MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
++S + ++ Y + L ++ RR L +F+C C+ C + T+
Sbjct: 230 ELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCKNGTK 279
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/336 (21%), Positives = 131/336 (38%), Gaps = 51/336 (15%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLA 111
++ P E+ GR + A+ DL G + R SP + E C C K +N ++
Sbjct: 209 QQFPIELVWRDGFGRCVLASEDLPKGTMVLRVSPFASVLEDHKIEKNCGFCFKKINKSIR 268
Query: 112 DNARCPRC----FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRC 167
N C C P CS L + H EC IL + + + + +
Sbjct: 269 INQTCKNCKNHLLCPQCSVDEYSL---NYHKDECDILNFLKQYYPSSQTRDFRFMFRVLL 325
Query: 168 LILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--DTS 225
+++ + K + + + ++ + P +++ ++ R+ L+K+ + ++ S
Sbjct: 326 NVIKDKKNKSFSKENQSKQWLNHQNPF--IFDSYKYLINLSRT--LDKVQPEQMEAFKRS 381
Query: 226 AKCLHWI------------CGV-----IEVNGVDIGR------------YTQGLYSVICL 256
A+ + I CGV IE+ + + Y G++
Sbjct: 382 AQSVIAIFNKLRGPKFFDECGVTIDEIIEIYSIVLSNGHEMLHPLNCHTYGLGIFPTGSY 441
Query: 257 MEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRREHLAITKYFNC 309
+ H+CLPNA N Q + VFR I T YT+ RR+HL +F C
Sbjct: 442 LNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFC 500
Query: 310 SCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
C++C ++L T S + C N+ + +P N
Sbjct: 501 QCQQCKFQSKL-TDQSCLSCKNQLSSSNTSFKIPSN 535
>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
Length = 128
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 46/87 (52%), Gaps = 8/87 (9%)
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNAL 291
V G +I + L+ + L+ H+C PNA H + ++ + + + T I+ Y L
Sbjct: 42 VGGNEI--LVRALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLL 99
Query: 292 WGTQPRREHLAITKYFNCSCERCSDPT 318
W T R+ LA+TK+F C C RC DPT
Sbjct: 100 WSTLARKLFLAMTKHFICQCPRCQDPT 126
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 116/300 (38%), Gaps = 56/300 (18%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+A + GD I E+ L +LAE +C C + RC
Sbjct: 6 IFDSPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEK----LQRCG 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
+C F C C A H EC+ +K AY E I L IL R
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKQECSAIK-------AYGKVPNENI-RLVARILWRLDKD 112
Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
S + L ++E H+ T++ E+ +K NFL+ P T
Sbjct: 113 GSTVSDMQLTTLDELEDHI------TDMQEDELKELKVDIHNFLDFWPRTSKQHTVDDIS 166
Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSNM--------- 270
H I GVI NG + R Q GL+ +CL+ H+C PN H N
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHS 225
Query: 271 -----MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC----SDPTELG 321
++S + ++ Y + + ++ R+ L +F+C+CE C D +LG
Sbjct: 226 QRRIELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLG 285
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/348 (21%), Positives = 141/348 (40%), Gaps = 47/348 (13%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L A PG+ + + P P C C + NL + C
Sbjct: 19 VASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFA--SRNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A + + + + P + L+L+R+
Sbjct: 76 RVAW-YCGSAC--------QREEWKLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + +F +L +
Sbjct: 127 LQDDKAIPSSGTDNYNLVDALESHISEVDKNQLVLYAQMANLVQLILPSF--ELDLKEIT 184
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFV 277
T +K + C + ++ GLY V+ ++ H+C+PNA + +++ +
Sbjct: 185 HTFSK---FACNAHTICDPELRPLGTGLYPVLSIINHSCVPNAVLIFEGRTAYVRALQPI 241
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQG 337
++ +S Y T R++ L YF C+C RC +E +C N+ K G
Sbjct: 242 SKNEEVSISYIETAATTMKRQDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC-NDQKCDG 299
Query: 338 DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLV 382
++LP + +TC CS + ++ + S L ++V LV
Sbjct: 300 --FLLPNA---GNKGYTCQKCSTSRDGEELQKMASDVLLLSDKVSSLV 342
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 115/291 (39%), Gaps = 52/291 (17%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I SP GR L+AT + GD I E+ + +LAE IC C + RC
Sbjct: 6 IFDSPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEK----LQRCG 61
Query: 118 RC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR---- 172
+C F C C A H EC+ +K AY E I L I+ R
Sbjct: 62 QCKFAHYCDRTCQRAGWAE-HKLECSAIK-------AYGKVPNENI-RLVARIMWRLDKE 112
Query: 173 ---RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
S + L ++E H++ P ++ E +K NFL+ P + T
Sbjct: 113 GSTVSDMQLTTLDELEDHIADM-PEDDLKE-----LKVDIHNFLDYWPHNSKQHTIDDIS 166
Query: 230 HWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN----AKHSN------MMQS 273
H I GVI NG + R Q GL+ +CL+ H+C PN H N M S
Sbjct: 167 H-IFGVINCNGFSVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHS 225
Query: 274 KLF--------VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
+ + ++ Y + L ++ R+ L +F+C+CE C +
Sbjct: 226 QRRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 276
>gi|353238271|emb|CCA70222.1| hypothetical protein PIIN_04161 [Piriformospora indica DSM 11827]
Length = 311
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 29/40 (72%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+C VC+E A +CS CK V YCG +HQ++HW HK +C+K
Sbjct: 121 KCVVCKEYAPKQCSNCKTVQYCGAEHQRQHWSEHKAQCRK 160
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 117/295 (39%), Gaps = 44/295 (14%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCH-KPLNPN--LADN 113
E+ + GR L+AT G + ++ P + + +C C P P + D
Sbjct: 7 ELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGAPVEDL 66
Query: 114 ARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
RC C F C+ C + H ECA +K + + L R L ++
Sbjct: 67 HRCTGCKFAQYCTKECQKKAWPE-HKQECAAIKR-----ITPGKPVDQTRLVGRILWRRK 120
Query: 173 RSPKKYQELKD----------------MEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL 216
R E KD +E H+SKR E E ID V +F +
Sbjct: 121 REENLNGEKKDGKENDEKKVELVKIEELEDHLSKRN--AEEKEAIDEKV----YSFGDYF 174
Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-AKHSN 269
D + D+ + H + +I+ N + + R Q G+Y I ++ H+C PN +N
Sbjct: 175 TYDEMPDSDEEMAH-LFAIIDCNAIGLNDHRGVQTIGVGIYPGISMLNHDCSPNCVAMNN 233
Query: 270 MMQSKLFVFR----DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
+ ++ R + Y ++L T+ RRE L + YF+C C+ C+ EL
Sbjct: 234 GPRLEVRALRVIQPGEELCISYIDSLETTEKRREKLKLQYYFDCECDTCTKGEEL 288
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 88/392 (22%), Positives = 155/392 (39%), Gaps = 56/392 (14%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
I S P GR L T PG+ I P P L C C + NL + C
Sbjct: 19 IASIPGKGRGLVTTRRFFPGEVILCGEPYASTPNKILVGSNCDHCFT--SSNLRKCSVC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP----LRCLILQRR 173
R W C + C E + +L C + A D + + + P + LIL+R+
Sbjct: 76 RVAW-YCGSVC--------QKEEWKLHQLECRAISALTDDRKKMLTPTIRLMVRLILRRK 126
Query: 174 ----------SPKKYQELKDMEAHMSKRGPGTEV-YEEIDSIVKYLRSNFLEKLPGDVLD 222
Y + +E+H+S+ V Y ++ ++V+ + LP LD
Sbjct: 127 LQNEKVIPSTGTDNYGLVDALESHISEVDDKQLVLYAQMANLVQLI-------LPAIELD 179
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKL 275
+T+ + C + ++ GL+ I + H+C+PNA + +++
Sbjct: 180 LKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTINHSCVPNAVLLFEGRTAYVRALQ 239
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKD 335
+ +T +S Y T R L +F C+C RC +E +C KD
Sbjct: 240 PLSNNTEVSISYIETAATTLKRHNDLK-HYFFTCTCPRCIKGSEEDPLLEGYRC----KD 294
Query: 336 QG-DCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQ---LGEQVDKLVQENPN--VK 389
Q D ++LP + +TC CS + +V ++S+ L ++ V N
Sbjct: 295 QNCDGFLLPDS---GKKAYTCQKCSMCRDGEEVKKLSSEILLLSDKASSFVSSGNNNEAG 351
Query: 390 SLEEMLTKLE-AMFHPHHYHCYAVKHSLIQLY 420
S+ +++ +LE ++H + +L++LY
Sbjct: 352 SVYKIIEQLERKLYHSFSITLLHTRETLLKLY 383
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG QKE WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSVCQKEEWKLHQLECRAI 100
>gi|440800401|gb|ELR21440.1| hypothetical protein ACA1_183760 [Acanthamoeba castellanii str.
Neff]
Length = 454
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 145/381 (38%), Gaps = 86/381 (22%)
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI---L 170
RC +C A CS C + H EC +K L + E +L LR +I L
Sbjct: 11 RCNKCHTIAFCSPACRAEGEV-LHRVECFYVKK-----LGHLIVNLEILLLLRIVIARWL 64
Query: 171 QRRSPKK--------YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS--NFLEKLPGDV 220
+ S +K + + M+AH +ID + Y + + L LP ++
Sbjct: 65 ETLSSQKRGVEGTPVFDRMMHMQAHAES---------QIDLLQTYAPAVEDMLRVLPAEM 115
Query: 221 LDDTSAKCLHWICGVIEVNGVDIGR---------------YTQGLYSVICLMEHNCLPNA 265
+ + + I + VN +GR GL+ ++ H+C NA
Sbjct: 116 --KVAKEAMLIIVCICSVNQFAVGRPPAENESGEGQSKDGEGSGLFLTSAIVSHSCRENA 173
Query: 266 KHSN-----MMQSKLFVFRDTHISTMYT-NALWGTQPRREHLAITKYFNCSCERCSDPTE 319
H M + + + I+ Y N L TQ R+ L TK F C CERC D TE
Sbjct: 174 MHRTKGRRLAMLALEPIAAGSEITWSYLGNPLLPTQIRQRELYQTKVFLCQCERCLDLTE 233
Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNP--LDNDSD-WTCGSCS---ARLNARDVHLVTSQ 373
A K NE G LP+ ++ + D W C + + A ARDV + +
Sbjct: 234 CHLRAFACKACNE----GLLVPLPLTQAGIETEEDRWRCDNAACGRAAGFARDVERLEAG 289
Query: 374 LGEQVDKLVQENPNVKSLEEMLTKLEAMF--------HPHHYHCYAVKHSLIQLYGTQPG 425
L + ++ + L L +LEA+ HPHHY L+Q+Y
Sbjct: 290 LVRRFEEAHAALGATQDLAAALQQLEAVIADHAGPVMHPHHY-------LLLQIY----- 337
Query: 426 YAYTQLSSSLLERKISYVMSP 446
L LL RK+ + P
Sbjct: 338 ----LLHGELLRRKLDGRLPP 354
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 148/364 (40%), Gaps = 50/364 (13%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P+ +KS GR + +T PGD I +P I+ P L+ IC C K A+
Sbjct: 6 PFRVKSHRK-GRGIFSTKSFAPGDVILPFTPTILIPSLSHINTICSHCFKQ-----AEVR 59
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
C RC + C A C + H+ EC +L+ E + P+R ++
Sbjct: 60 ACSRCHAVSYCDAACQAANWTAVHSKECKVLRKVTE------QGRPGLPTPIRAVVQALV 113
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWIC 233
P+ ++D+E + + +E + +++ ++ S F G + K L +C
Sbjct: 114 KPEIGNAIEDLEGNAAS-WRKSEKWADME-MMAMGASAFAGLGTG---QEDVQKALTILC 168
Query: 234 GVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHI 283
I+ N D+G+ L + + + H+C+PNA +++++ + D I
Sbjct: 169 K-IQTNAFHRYDTDLGQVGIFLGTKLAMANHSCIPNAMVQFIGRRAILRAEKPIKIDDEI 227
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDC 339
YT+ + R+ LA +F+C C RC +L Y S + +N H D
Sbjct: 228 EISYTDYTFPLSNRKLALA-PYFFDCMCLRCE--KDLNVYQVCAGSPIIDMNRHSLVADV 284
Query: 340 WILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLE 399
L +P D + +AR + E++ +L+ + + SLE L+
Sbjct: 285 GKLGRHPGATDPNKV---STARRQS-----------EELSELLDKKDSAGSLETRRASLQ 330
Query: 400 AMFH 403
+H
Sbjct: 331 KRYH 334
>gi|341883759|gb|EGT39694.1| hypothetical protein CAEBREN_28295 [Caenorhabditis brenneri]
Length = 582
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 91/423 (21%), Positives = 153/423 (36%), Gaps = 69/423 (16%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIAR 82
L +C C YC K+ Q W +HKP+CK+L I +P + L ++ + +
Sbjct: 62 LLRCGGCNFSMYCSKECQATAWLVHKPECKRLK-AIFETPFATQVLNPKVNEFCANCLRG 120
Query: 83 ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECA 142
+P G KL C GC+ F CS C + H PEC
Sbjct: 121 PAP---GEKLLR----CGGCN----------------FSMYCSKECQATA-WLVHKPECK 156
Query: 143 ILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK-------KYQELKDMEAHMSKRGPGT 195
LK L + + + + R L++ K K+ L D + + +
Sbjct: 157 RLKASFPNL-PLTEVLFLSKVIDRIQFLEKNGDKLGIEAERKFSSLVDHKTDIREDEEKM 215
Query: 196 EVYEEIDSIVKYLR-SNFLEKLPGDVLD-----DTSAKCLHWICGVIEVNGVDIGRYTQG 249
+E++ + + R + +EK G+ D ++ +H G +D+G
Sbjct: 216 THFEKVFTKMGLFRKDDMIEK--GEFFDIFCKASINSHSIHTNAGNEVGMALDLG----- 268
Query: 250 LYSVICLMEHNCLPNAKHSNMMQSKLFVFR---------DTHISTM-YTNALWGTQPRRE 299
+ H+C P S + R DT + + Y + RR
Sbjct: 269 ----VSKYNHSCRPTC--SMVFDGYRVCLRPLVPGVDAEDTEQAFISYIDVGRSKFVRRR 322
Query: 300 HLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCS 359
L YFNC C RC DP + +A++C N D IL + + + C C
Sbjct: 323 DLKSRWYFNCECTRCMDPED--DALTAIRCANP---SCDAPIL-TSETEEPMNIACDKCK 376
Query: 360 ARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQL 419
+ DV + + P L ++L K E + HP + + ++ +L +
Sbjct: 377 TIVEENDVKAAQEYMKSLPASFDPKCP-ADILRDLLAKAEQVLHPSNVYVARLRTALFHV 435
Query: 420 YGT 422
GT
Sbjct: 436 TGT 438
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 116/293 (39%), Gaps = 36/293 (12%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P E + P R L AT P I PLI+ P L+ +C C +P +P +
Sbjct: 3 PIETRPHPSKHRALHATQPFRPSQTIHTFHPLILHPSLSHLASVCTHCLRPGSPRAC--S 60
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDYKYEAILPLRCLILQ 171
RC ++ C+ C + H+ EC L KLG T + P+R L+
Sbjct: 61 RCHAAYY--CNTSCQQAAWTAVHSKECKALQQRKLGSRT-------GADLPTPVRALLQT 111
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
+ + ++ H +R T+ + +++ + + G + K +
Sbjct: 112 LLIKEIEDGVAALDGHTEERR-KTKSWPDLEMMALAACA-----FAGRQGESNVRKAVEL 165
Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
+C I+ N VD+G+ L + + H+C+PNA +++++ +
Sbjct: 166 LCK-IQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILRAERPIQAGD 224
Query: 282 HISTMYTNALWGTQPRREHLAITKY-FNCSCERCSDPTELGTYFSAMKCLNEH 333
I YT+ RRE A+ +Y F C+C RC +L Y C N +
Sbjct: 225 EIEISYTDYTMPLSARRE--ALEQYSFECTCARCKG--DLNVYQVRTACPNTY 273
>gi|339235237|ref|XP_003379173.1| MYND finger protein [Trichinella spiralis]
gi|316978237|gb|EFV61247.1| MYND finger protein [Trichinella spiralis]
Length = 451
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 125/320 (39%), Gaps = 43/320 (13%)
Query: 99 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK 158
C+ C + L N +C R W A C L A+ + + LG
Sbjct: 41 CMRCRRALFCN----EQCQRLGWWDHQAECGALKRAYPVVFDVEVRLLGRIAARHMQISS 96
Query: 159 YEAILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK-LP 217
+ + L R+S + +L D+ H+ E + I L + K LP
Sbjct: 97 GKDAVDRENFYLNRQSER---QLGDLWHHVGSMRENVEEMRKFRRIQASLERFYKAKYLP 153
Query: 218 GDV----LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQ 272
+ L + H I + +N +IG+ GLY +C +H+C PNA ++
Sbjct: 154 NEQTLFKLHSRNYINRHAIGDQLYLN--EIGK---GLYLDLCAYDHSCRPNAIYTCEGFV 208
Query: 273 SKLFVFRDT-------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS 325
++L D+ T Y L RR L + YF+C CERC+DP + +
Sbjct: 209 ARLKALDDSVNLSDRKRTHTCYITPLDSRAERRRLLKESWYFDCFCERCTDPND--HLLT 266
Query: 326 AMKCLNEHKDQGDCWILPVNPLDNDSD------WTCGSCSARLNAR-DVHLVTSQL---G 375
A+KC N +P++ DND D +TC C R +R DV ++
Sbjct: 267 ALKCNNCST------AVPIHNADNDKDLIDDSFFTCPKCEQRCFSRPDVEQALFKMIDIE 320
Query: 376 EQVDKLVQENPNVKSLEEML 395
+Q+ K E + L+ ML
Sbjct: 321 KQLAKTGSETLSKSELDSML 340
>gi|390597639|gb|EIN07038.1| hypothetical protein PUNSTDRAFT_104486 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 440
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
A + + +EL C VC + L +CS C+ YCG + QK HW+ HKP C+ +
Sbjct: 253 FADVARLNADELKGCIVCGKQTLKECSRCRSAKYCGPECQKSHWREHKPVCRSI 306
>gi|195123617|ref|XP_002006300.1| GI20970 [Drosophila mojavensis]
gi|193911368|gb|EDW10235.1| GI20970 [Drosophila mojavensis]
Length = 574
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 74/307 (24%), Positives = 119/307 (38%), Gaps = 52/307 (16%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E++ +P GR++ T DL GD +A E P + C C + L C
Sbjct: 188 EMRETPDEGRFIVTTQDLVVGDIVATEEPFCSSLLAPMRYIRCSNCKEERYLTLIPCDNC 247
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKL-----------GCETLLAYNDYKYEAILPL 165
C CS C +++ H EC I+ L T L+ D Y +I L
Sbjct: 248 --CSVMFCSEECKKRANSTFHKYECPIIDLLHRMFNKIHCIALRTTLSALDL-YGSIEEL 304
Query: 166 RCLILQRRSPKK------YQELKDMEAHMSKRGPGT--------EVYE------------ 199
Q ++ K Y +L E + + G T ++++
Sbjct: 305 MAFCEQPQNQHKSVFDLDYTQLTPQEHYRAIHGLVTNQHLRSVSDLFQRSVVCAVLKHFI 364
Query: 200 -EIDSIVKYLRSNFLEKLPGDVLD---DTSAKCLHWICGVIEVNGV-DIGRYTQGLYSVI 254
E + +YL D+L TS +H I V + N D ++ G Y+ +
Sbjct: 365 MEYTPLKEYLGGEEGRNFFTDLLFRHLQTSPSNMHGIDLVEQANETKDDQSHSSGAYAFL 424
Query: 255 CLMEHNCLPNA-KHSNMMQSKLFVFRDTHIS-TMYTN-----ALWGTQPRREHLAITKYF 307
L+ H+C PN + +++ LFV R +Y N A++G + R+E L++ F
Sbjct: 425 SLLNHSCAPNTLRIYEGVKAYLFVLRPIKAGGVLYDNYGAHFAVFGKKQRQETLSMQYRF 484
Query: 308 NCSCERC 314
NC CE C
Sbjct: 485 NCKCEAC 491
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 52.4 bits (124), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 141/362 (38%), Gaps = 56/362 (15%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P+ +KS GR + +T PGD I +P I+ P L+ IC C K A+
Sbjct: 6 PFRVKSHHR-GRGIFSTKSFVPGDVILPFTPRILIPSLSHINNICSHCFKQ-----AEVR 59
Query: 115 RCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
C RC + C A C + H+ EC +L+ E + P+R ++
Sbjct: 60 ACSRCHAVSYCDAACQAANWTAVHSKECKVLRKVTE------QGRPGLPTPIRAVVQALV 113
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEE---IDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
P+ ++D+E G GT + D + + ++ L G +D H
Sbjct: 114 KPEIGNAIEDLE------GNGTSWRKSEKWADMEMMAMGASAFAGL-GTGQEDIQTNAFH 166
Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHIST 285
D+G+ L + + H+C+PNA +++++ + D I
Sbjct: 167 RY-------DTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGRKAILRAEKPIKIDDEIEI 219
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTY----FSAMKCLNEHKDQGDCWI 341
YT+ + R+ LA +F+C C RC +L Y S + +N H D
Sbjct: 220 SYTDYTFPLSKRKLALA-PYFFDCMCLRCE--KDLNVYQVCAGSPIIDMNRHSLVADVGK 276
Query: 342 LPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEAM 401
L +P DS+ +++A H E + +L+++ + SLE L
Sbjct: 277 LGRHPGATDSN--------KVSAARRH------SEGLSELLEDKDSAGSLETRRASLRKR 322
Query: 402 FH 403
+H
Sbjct: 323 YH 324
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 111/274 (40%), Gaps = 34/274 (12%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNA 114
P E + P R L AT P I PLI+ P L+ +C C +P +P +
Sbjct: 3 PIETRPHPSKHRALHATQPFKPSQIIHTFHPLILHPSLSQLASVCTHCLRPGSPRAC--S 60
Query: 115 RCPRCFWPACSARCSGLSDAHTHAPECAIL---KLGCETLLAYNDYKYEAILPLRCLILQ 171
RC ++ C+ C + H+ EC L KLG T + P+R L+
Sbjct: 61 RCHAAYY--CNTSCQQAAWTAVHSKECKALQQRKLGSRT-------GADLPTPVRALLQT 111
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
+ + + ++ H +R T+ + +++ + + G + + +
Sbjct: 112 LLNKETEDGIAALDGHTEERR-KTKSWPDLEMMAMAACAFA-----GRQGESNIRRAVEL 165
Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
+C I+ N VD+G+ L + + H+C+PNA +++++ +
Sbjct: 166 LCK-IQTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGRKAILRAERPIHAGD 224
Query: 282 HISTMYTNALWGTQPRREHLAITKY-FNCSCERC 314
I YT+ RRE A+ +Y F C+C RC
Sbjct: 225 EIEISYTDYTMPLSTRRE--ALEQYSFECTCARC 256
>gi|348689633|gb|EGZ29447.1| hypothetical protein PHYSODRAFT_473651 [Phytophthora sojae]
Length = 500
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 73/306 (23%), Positives = 119/306 (38%), Gaps = 51/306 (16%)
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
L+CL L R + E+ + +R S LR + P D+
Sbjct: 124 LKCLALYERDARSLDEVLGLTCAGQQRAM---------SAALQLRRQLPDIFPAGFSDEQ 174
Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVF 278
A + GV+ N ++ GL+ C MEH+CLPN + M + +
Sbjct: 175 MAT----LIGVLNTNSHELENLGGSGLFLSACRMEHSCLPNCSFTTFDSTLWMTAIRPIA 230
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKY-FNCSCERC---SDPTELGTYFSAMKCLNEHK 334
+S Y N + P R+ + Y F C+CE C DPT A++CL+ +
Sbjct: 231 PGDALSIDYGNFFYRPTPERQESLLESYGFVCTCEACVSMPDPTR------AVRCLSGNC 284
Query: 335 DQGDCWILP---------VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
QG P + P +W C +C +A + H T L + D + EN
Sbjct: 285 PQGVMLPCPIRTNAPSSSIGPNQLQFEWRCQTCGKVADAAE-H--TRILAAEQD--LMEN 339
Query: 386 PNVKSLEEMLTKL-EAMFHPHHYHCY-AVKHSLIQLYGTQPGYAYTQLS------SSLLE 437
+SLEE+ + + H HY + A+ + + T A Q++ + + E
Sbjct: 340 GFPESLEEVDAVVRRGVLHERHYLLFWALDNIGCEAAATTAFQAEAQMAQHRRALAQVWE 399
Query: 438 RKISYV 443
R I+Y+
Sbjct: 400 RIIAYM 405
>gi|403332639|gb|EJY65354.1| Zinc finger MYND domain-containing protein 17 [Oxytricha
trifallax]
Length = 370
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 34/52 (65%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
C +C++ A + CS CK V YC + HQ E WK+HK CK+L +++++ L +
Sbjct: 8 CNICKKPATNLCSKCKSVYYCNRDHQLEDWKIHKQVCKQLFEQLQNNAALSQ 59
>gi|393247959|gb|EJD55466.1| hypothetical protein AURDEDRAFT_155706 [Auricularia delicata
TFB-10046 SS5]
Length = 191
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 13 MRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
M CA C H C+ CK VAYC K HQ+ HWK+HKP C++
Sbjct: 140 MACAFCNSNEKTHLCADCKAVAYCSKAHQRAHWKVHKPTCRE 181
>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
Length = 515
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 6/84 (7%)
Query: 243 IGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKL----FVFRDTHISTMYTNALWGTQP 296
I + + LY + + HNC PN ++ +N ++ K+ F+ +D + T Y N L G
Sbjct: 335 INQLSGQLYFLYSFLNHNCEPNVRYDINNKLELKVYARKFIKKDEELVTTYVNPLHGVSL 394
Query: 297 RREHLAITKYFNCSCERCSDPTEL 320
RR L + F C+C+RC+ EL
Sbjct: 395 RRRELRVNWGFICNCDRCAKEIEL 418
>gi|384254219|gb|EIE27693.1| cysteine proteinase [Coccomyxa subellipsoidea C-169]
Length = 728
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
CAVC E A H+C CK YC ++HQ++HW HK C+KL I +
Sbjct: 49 CAVCSEPAPHRCGGCKASRYCSREHQQQHWAAGHKQVCRKLALGIAAG 96
>gi|66815955|ref|XP_641994.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
gi|60470035|gb|EAL68016.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
Length = 563
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/262 (26%), Positives = 108/262 (41%), Gaps = 37/262 (14%)
Query: 124 CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKD 183
CS CS + TH EC+ + E+ + +L LR +I + KYQE
Sbjct: 89 CSEICS---NDMTHQLECSFINKLIESSIKNECDVSTCLLALRIMIRNKIESDKYQET-- 143
Query: 184 MEAHMSKRGPGTEVYEEID-SIVKYLRSNF------LEKLPGD--VLDDTSAKCLHWICG 234
+ K E + EI+ S ++ NF L++ D +L + I
Sbjct: 144 ----VGKLSNQLEKFIEINKSFIEKYDINFQQFINQLQQENSDPSLLAIFNKDEFLQIIS 199
Query: 235 VIEVNGV-----DIGR--YTQGLYSVICLMEHNCLPNA----KHSNM-MQSKLFVFRDTH 282
I +N D R + G + L+ H+C PN K N+ M++ + +D
Sbjct: 200 SIYINSFSGLSNDFNRKPISNGYFYKPALLNHSCEPNIFFTIKDKNLEMRACKKIEKDEE 259
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
I Y + L T R++ L +K F C C RCSD TE Y S++ C H + G +I
Sbjct: 260 IVDSYVDLLLPTIERQKILYNSKNFLCKCSRCSDSTEDERYLSSIYCF--HCNDGVEFIS 317
Query: 343 PVNPLDNDS-----DWTCGSCS 359
P + D+ +W C + S
Sbjct: 318 PEIYFNKDTKKLQENWKCSNTS 339
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 109/272 (40%), Gaps = 28/272 (10%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
IK GR L AT + G +A SPLI+ P L+ +C C + + P +RC
Sbjct: 11 IKPHSTRGRALHATKTVAAGGVLAVFSPLILVPSLSHLTIVCSFCLRGVTPR--PCSRCR 68
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI----LQRR 173
++ C A+C + + HA ECA L + + E P+R L+ L R
Sbjct: 69 AAYY--CDAQCQAAAWSSGHAAECAALT----GAIPPARRRREIPAPVRALVQLLLLLRS 122
Query: 174 SPKKYQELKD-MEAHMS--KRGPGTEVYEEIDSIVKYLRSNF--LEKLPGDVLDDTSAKC 228
++L D +E H + +R PG E ++ F +E G V + C
Sbjct: 123 GGGHTRDLLDGLEGHAAQRRRAPGWADME----LMAMAGCAFAGMETSEGAVREAAEILC 178
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHI 283
D+G L + + H+CLPNA + +++++ + I
Sbjct: 179 -KLQTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVLRAEQRIQSGDEI 237
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS 315
YT+ R+ LA +F C C RC+
Sbjct: 238 EISYTDYTSSLSKRQAALA-PYHFECRCRRCT 268
>gi|440801743|gb|ELR22748.1| MYND finger domain containing protein, partial [Acanthamoeba
castellanii str. Neff]
Length = 320
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 5/47 (10%)
Query: 11 ELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+L +C +C R C ACK+VAYC ++HQK+HW+ HKP C+
Sbjct: 269 QLAKCGLCEAKEARRGDFPVCGACKQVAYCTREHQKQHWRTHKPHCR 315
>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
Length = 555
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 120/295 (40%), Gaps = 34/295 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGP--KLALAEPICLGCHKPLNPNLADNARCPR---- 118
GRY++AT L PGD + E+ P + C C K + P+
Sbjct: 16 GRYVEATDILKPGDSLFSENSFASIPLENFLYKKDYCFNCCKYQISTEQQTSEQPKSTPG 75
Query: 119 --------CFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLIL 170
C CS C+ D H EC+ + E L ++ ++L LR ++
Sbjct: 76 FPFQCKYGCNMWFCSNECA---DGMVHQLECSFISKFVELSLKHDCDASISLLALRVVMK 132
Query: 171 QRRSPKKYQE----LKDMEAHMSKRGPG-----TEVYEEIDSIVKYLRSNFLEKL-PGDV 220
+ + + +++ L + ++ + +++++++ IVK S +
Sbjct: 133 NKVASEVFEQTVGKLTNQKSIFIQNNKSFLDKYSKLFDQVYLIVKSENSELSSIFNKEEY 192
Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFR 279
L+ + ++ G+ N + + G + L+ H+C PN S + ++ V +
Sbjct: 193 LEVVCSIFVNSFAGL--SNNFNRTPISNGYFYKAALLNHSCEPNTFFSIQNSTLEMRVVK 250
Query: 280 DTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCL 330
D + I Y + L T R++ L +K F C C RC+D TE + S++ C+
Sbjct: 251 DINKGEQIFDSYVDLLLPTIERQKILLKSKNFVCDCSRCTDITENSRFVSSIHCI 305
>gi|403341486|gb|EJY70048.1| MYND finger family protein [Oxytricha trifallax]
Length = 425
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 32/47 (68%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C +C++ A + CS CK V YC + HQ E WK+HK CK+L +++++
Sbjct: 8 CNICKKPATNLCSKCKSVYYCNRDHQLEDWKIHKQVCKQLFEQLQNN 54
>gi|389739118|gb|EIM80312.1| hypothetical protein STEHIDRAFT_125885 [Stereum hirsutum FP-91666
SS1]
Length = 718
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/44 (54%), Positives = 27/44 (61%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
EL RC+ CR LHKC C +V YC QK+HWK HKP CK
Sbjct: 674 ELYRCSWCRNPSAVLHKCRNCGKVRYCDSVCQKQHWKAHKPACK 717
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 106/268 (39%), Gaps = 56/268 (20%)
Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSV---------------ICLMEHNCLPNA 265
LD+ S+ + I G++++ + R+T ++V I L H+C PNA
Sbjct: 263 LDEFSSYGIDSIAGLVDL----VSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNA 318
Query: 266 ----------KHSNMMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSC 311
+N + + RD + Y + + R++ L T YF CSC
Sbjct: 319 VPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCSC 378
Query: 312 ERCSDPTELGTYFSAMKCLNEHKD-QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
C P + G Y + + L K+ G CW+ P ++D C C + D L
Sbjct: 379 SLCEKPRD-GAYVDSRESLYCPKNCGGLCWL----PTEDDPLARCAKCGTVVKDSDAILD 433
Query: 371 TSQLG----EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGY 426
++G ++V+K+ +P K++ ++ T L + + TQP
Sbjct: 434 VIRIGFVALKKVNKIQISDPQ-KAI-QLTTNLIRILASARVCPF-----------TQPLL 480
Query: 427 AYTQLSSSLLERKISYVMSPNLKATDEP 454
A ++L+++LL PN+ P
Sbjct: 481 ALSRLNTTLLISNYRSPFDPNVVEISSP 508
>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
Length = 453
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 134/333 (40%), Gaps = 52/333 (15%)
Query: 74 LHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSD 133
+PG + + P + C C K +P L + C F+ CS C S
Sbjct: 1 FYPGQTVLEQDPYVAVLDGDSRGNRCDACFKQ-SPALKRCSACKFVFY--CSPTCQR-SQ 56
Query: 134 AHTHAPECAIL-KLGCE----------TLLAYNDYKYEAILPLRCLILQRRSPKKYQELK 182
H EC +L +L E LL + + +LP+ L Y+ ++
Sbjct: 57 WKIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATGVLPVTAL-------DNYEIVR 109
Query: 183 DMEAHMSKRGPGTEV-YEEIDSIVK-YLRSNFLEKLPGDVLDDTSAKC-LHWICGVIEVN 239
+ H S+ G V Y ++ ++K L + + E + D C H IC E+
Sbjct: 110 ALPTHFSETGDERLVMYAQMAVLIKTILNARYAEDVKEITKDICRISCNGHTICD-DELR 168
Query: 240 GVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTNALWG 293
V IG L+ V+ ++ H+C N+ KH+ ++++ + R ++ Y
Sbjct: 169 PVGIG-----LFPVVSVINHSCSSNSLLLFDGKHA-VVRALGTISRGCEVTVSYIELGAS 222
Query: 294 TQPRREHLAITKYFNCSCERCSD--PTELGTY----FSAMKCLNEHKDQGDCWILPVNPL 347
T RRE L+ YF+C C RC+D P G Y A+ CL+ + L
Sbjct: 223 TNSRREALSDQYYFHCKCPRCTDVSPQAHGLYKDDVLEAVACLDPACES-------FMRL 275
Query: 348 DNDSDWTCGSCSARLNARDVHLVTSQLGEQVDK 380
N S W C SC + +V+ ++++ ++K
Sbjct: 276 SNGS-WRCMSCGSSREPNEVNKLSTEAEGMIEK 307
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 9 DEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
D RC C + AL +CSACK V YC Q+ WK+H+ +CK L
Sbjct: 20 DSRGNRCDACFKQSPALKRCSACKFVFYCSPTCQRSQWKIHQEECKVL 67
>gi|170041205|ref|XP_001848363.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864728|gb|EDS28111.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 563
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 69/317 (21%), Positives = 119/317 (37%), Gaps = 88/317 (27%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ GRYL DL GD +A E P ++P C GC +
Sbjct: 191 ELVQDEKFGRYLITNRDLKAGDVVALEKPF--------SKP-CKGCTIAM---------- 231
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAIL---------------KLGCETLLAYNDYKYEA 161
CS C + H EC IL +L +TL +N + EA
Sbjct: 232 ------FCSDECQQKAMEEYHRFECPILQDIHRLVEHVLVPALRLTIKTLTTFN-FDLEA 284
Query: 162 ILPLRCLILQ--RRSP-------------KKYQELKDMEAHMSKRGPGTEVYEEIDSIVK 206
LR L+ + +P + Y+ + ++E+H KR P I++I+
Sbjct: 285 ---LRLLMESDGKNTPNALDMDWTSIDPKETYEIVYNLESHRDKRSPDHLGQYAINAII- 340
Query: 207 YLRSNFLEKLP-------GDVLDDTSAKCLHWICGVIEVNGVDIGR------------YT 247
+ L++ + L + + + ++ VN ++
Sbjct: 341 -MSEILLQRTKIRDNCKNSNALRNITELLIMHHAMILPVNSFELSFEDYEDKACKREVLM 399
Query: 248 QGLYSVICLMEHNCLPNAKHSNMMQSKL--FVFRDTHISTMYT------NALWGTQPRRE 299
+G+Y+++ ++ H+C PN++ N L +V R + T A+ Q RRE
Sbjct: 400 EGIYTILSMINHSCAPNSQPMNGSDDNLALYVLRPIKTGSPITIKYCVKFAITPLQERRE 459
Query: 300 HLAITKYFNCSCERCSD 316
+L+ YF C CE C++
Sbjct: 460 YLSENYYFECQCEACAN 476
>gi|307187760|gb|EFN72732.1| Zinc finger MYND domain-containing protein 10 [Camponotus
floridanus]
Length = 434
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
EL RC++C+E + +CS CKEV YC +Q Q + W+ HK CKK
Sbjct: 391 ELKRCSLCQEVSKKRCSKCKEVWYCSRQCQIKDWEEHKKICKK 433
>gi|332018517|gb|EGI59107.1| Zinc finger MYND domain-containing protein 17 [Acromyrmex
echinatior]
Length = 400
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 63/150 (42%), Gaps = 16/150 (10%)
Query: 15 CAVCRETA----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP-----YEIKSSPLLG 65
C VC+ L KC C+ +AYC +QHQK+HW HK CK + ++++ L
Sbjct: 2 CHVCKRFGDGVCLKKCGGCRMIAYCNRQHQKQHWSQHKSLCKVIQDVLRDFKMEDHGTLE 61
Query: 66 RYLQATLDLHPGDRIARESPL-IVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
+ L+L E PL I ++ C CH+ L D C C A
Sbjct: 62 EWDDQKLNLILLISFKLERPLEIYEKEMFQFLRECTVCHEQDGQLLED---CQSC---AS 115
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAY 154
+ C ++ H C L+L + L++
Sbjct: 116 VSFCKDHKNSIKHKDTCVSLELCLRSNLSF 145
>gi|440799553|gb|ELR20597.1| MYND finger domain containing protein, partial [Acanthamoeba
castellanii str. Neff]
Length = 140
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 30/47 (63%), Gaps = 5/47 (10%)
Query: 11 ELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+L +C +C R C ACK+VAYC ++HQK+HW+ HKP C+
Sbjct: 89 QLAKCGLCEAKEARRGDFPVCGACKQVAYCTREHQKQHWRTHKPHCR 135
>gi|380025446|ref|XP_003696485.1| PREDICTED: uncharacterized protein LOC100869161, partial [Apis
florea]
Length = 414
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 61/148 (41%), Gaps = 22/148 (14%)
Query: 15 CAVCR----ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL-LGRYLQ 69
C +C+ E +L +C C ++YC K+HQK HW HK C + +K + + + ++
Sbjct: 5 CHICKRFGKEISLKRCGNCSMISYCSKEHQKIHWSQHKDLCNAICSVLKDNTISMTLNIK 64
Query: 70 ATLDLHPGDRIARESPLIVGPKL----------ALAEP-ICLGCHKPLNPNLADNARCPR 118
+DL ++ L+V KL P C+ CH L D + CP
Sbjct: 65 QNVDLKTWVQMKMNFMLLVAIKLDRKLEYYEEQMFKFPRSCVTCHDQNIKILEDCSNCPN 124
Query: 119 CFWPACSARCSGLSDAHTHAPECAILKL 146
+ C D H C +LKL
Sbjct: 125 I------SFCIKHKDNIIHKKFCNLLKL 146
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 70/314 (22%), Positives = 115/314 (36%), Gaps = 43/314 (13%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+++ + GR + A+ DL G+ + RE P + + A C C + + A
Sbjct: 74 FKVTEDSVSGRCVIASRDLKAGELVLREPPFVKVVRRDCASRQCAYCCQQVTERGKIEAD 133
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP------LRCLI 169
P F CS C DA A A+ KL + A D + + R
Sbjct: 134 VP--FAVYCSRACQAREDALRAAEASALGKLA--GISAARDVDIDLLRMLLRLLITRAKA 189
Query: 170 LQRRSPK----KYQELKDMEAHMSKRGPGTEVYEEID-----------------SIVKYL 208
L R P D E G G + ++ + S+V+
Sbjct: 190 LGLREPSGDSDSVSRGVDEEGEDGTMGEGLFLRQQWENLYALMHHREAMAPDWISVVREA 249
Query: 209 RSNFLEKLPGDVLDDTSAKCLHWICGV-IEVNGV--DIGR---YTQGLYSVICLMEHNCL 262
+ L+ LP V D + + C V + +G+ D G G++ + ++ H C
Sbjct: 250 GEDLLQLLPEWVRFDVE-EVVQLACRVNVNAHGLRDDSGANLVIGVGMFPLTAMINHACR 308
Query: 263 PNAKH----SNMMQSKLF-VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
PN N+ L V +S Y + L T RR+ L +K+F C C RC +P
Sbjct: 309 PNCTFVYFGGNLEVRTLEPVSAGAELSVYYIDLLQSTAARRQELLTSKHFLCKCSRCENP 368
Query: 318 TELGTYFSAMKCLN 331
+ + Y + C +
Sbjct: 369 SSMDDYLDGVCCTD 382
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 89/443 (20%), Positives = 166/443 (37%), Gaps = 94/443 (21%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR L T D HPGD I + P + P + C GC NL+ +RC ++ C
Sbjct: 22 GRSLFTTRDFHPGDVIISQEPYVCVP----TQKRCDGCFS--TTNLSKCSRCQVVWY--C 73
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
C E + +L C+ L + K +++ P L+L+ +K Q+ K +
Sbjct: 74 GTPC--------QKSEWKLHRLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKII 125
Query: 185 EAH---------------------MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGD--- 219
+ +S R E + S+V L + + + D
Sbjct: 126 PSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKE 185
Query: 220 ---VLDDTSAKCLHWICGVIEVNGVDIGRYTQ-------------------GLYSVICLM 257
VL A +H I E+N +I + GLY V+ ++
Sbjct: 186 EQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSII 245
Query: 258 EHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCE 312
H+CLPN+ +++ + + T + Y T R++ L F C C
Sbjct: 246 NHSCLPNSVLVFDGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYLFQCVCP 305
Query: 313 RCSDPTELGTY--------FSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNA 364
CS ++G Y +C NE D ++L + + C C +
Sbjct: 306 LCS---KVGQYEDVRENAILEGYRCKNE---TCDGFLLRTT---DGKAFQCQECGLVRDK 356
Query: 365 RDVHLVTSQLGEQVDKLVQENPN----VKSLEEMLTKLEA-MFHPHHYHCYAVKH----S 415
++ + +++ +++ + + N S+ +M+ KL+ ++HP + + S
Sbjct: 357 EEIKQIATEIKFLLEEASKPSSNDSHEAISIHKMIEKLQTKLYHPFSINLMQTRETILKS 416
Query: 416 LIQLYGTQPGYAYTQLSSSLLER 438
L++L + AY +L+ + +R
Sbjct: 417 LMKLEYWREALAYCKLTIPIYQR 439
>gi|291223160|ref|XP_002731579.1| PREDICTED: egl nine homolog 1-like [Saccoglossus kowalevskii]
Length = 615
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 27/39 (69%), Gaps = 1/39 (2%)
Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
RCAVC + L +CS C V YC ++HQK+HWK+HK C
Sbjct: 11 RCAVCEKIEDLKRCSRCHVVHYCSREHQKQHWKVHKTSC 49
>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 674
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 117/333 (35%), Gaps = 93/333 (27%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW--- 121
GR+L AT ++ G + ESP + CL CH L N D+ + P C++
Sbjct: 234 GRHLVATRNIKAGYILIVESPFAFSTNNEAFDRNCLNCHVTLKLN--DSVKIP-CYFCRT 290
Query: 122 -PACSARCSGLSDAHTHAPECAILKLGCET------------LLAY-------------- 154
CS +C + H EC+I + C LLAY
Sbjct: 291 VSFCSEKCRNEAWKLYHRYECSIFDIFCGNDSEQPQRQSSYLLLAYRMTIAGCLLSNTDK 350
Query: 155 -NDYKYEAILPLRCLILQRR-----------------SPKKYQELKDMEAHMSKRGPGTE 196
N+ I L L+ SP+ Y+ + ++ H K P
Sbjct: 351 VNNMDKTEIPVLNDNFLRYHGTNTNQECSDLGINEVYSPRDYRTILKLKTHCEKIEPNIN 410
Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-----------CGVIEVNGVDIG- 244
+ I++I FL K VL CL I N +I
Sbjct: 411 LIRAIEAI-------FLTKCFTFVLSKMDVVCLKETFISLAVAMLHHLQAINCNAYEIVE 463
Query: 245 ------------RYTQG-LYSVICLMEHNCLPNA-KHSN-----MMQSKLFVFRDTHIST 285
RY G +Y + L+ H+C PN +HS +++S F+ + T I
Sbjct: 464 NLYDKKAHVWEPRYVGGAIYPSVSLVNHSCYPNVVRHSYPSGVVVVRSLRFIGKGTEIVD 523
Query: 286 MYTNALW---GTQPRREHLAITKYFNCSCERCS 315
Y W G PRRE+L F C+CE C+
Sbjct: 524 CY-GPHWLSEGRLPRREYLWKKYRFLCACEACT 555
>gi|302692314|ref|XP_003035836.1| hypothetical protein SCHCODRAFT_84443 [Schizophyllum commune H4-8]
gi|300109532|gb|EFJ00934.1| hypothetical protein SCHCODRAFT_84443 [Schizophyllum commune H4-8]
Length = 283
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 43/109 (39%), Gaps = 9/109 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC +C+ C +AYCG+ QK HWK HK CK + + G++ Q +
Sbjct: 109 CAVCGNKTTSRCTGCLSIAYCGQACQKAHWKEHKGFCKTIRGGTWRTMTFGQHFQVGGQV 168
Query: 75 HPGDRIARESPLIVGPKLALAEP---------ICLGCHKPLNPNLADNA 114
I S P EP + +PL P+L A
Sbjct: 169 MSAVSINHSSGKANTPINKKNEPPANVHGDKLFLVKIQRPLVPDLTQQA 217
>gi|238601033|ref|XP_002395301.1| hypothetical protein MPER_04666 [Moniliophthora perniciosa FA553]
gi|215465804|gb|EEB96231.1| hypothetical protein MPER_04666 [Moniliophthora perniciosa FA553]
Length = 129
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 2/54 (3%)
Query: 6 EIEDEELMRCAVCRETALH--KCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
++ +EE+ C VC E +H +CS CK VAYC +HQK WK HK C K +E
Sbjct: 42 DLWEEEIKSCVVCGEKDIHLSRCSRCKIVAYCNAEHQKADWKQHKKHCIKPVWE 95
>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 643
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/268 (21%), Positives = 105/268 (39%), Gaps = 56/268 (20%)
Query: 221 LDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSV---------------ICLMEHNCLPNA 265
LD+ S+ + I G++++ + R+T ++V I L H+C PNA
Sbjct: 230 LDELSSYGIDSIAGLVDL----VSRFTTNAFTVASPTLTPIGVSISPIIALFNHSCAPNA 285
Query: 266 ----------KHSNMMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSC 311
+N + + RD + Y + + R++ L T YF C+C
Sbjct: 286 VPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTTLTKRERQKALKETYYFTCNC 345
Query: 312 ERCSDPTELGTYFSAMKCLNEHKD-QGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLV 370
C P + G Y + + L K+ G CW+ P ++D C C + D L
Sbjct: 346 SLCEKPRD-GAYVDSRESLYCPKNCGGLCWL----PTEDDPLARCAKCGTVVKESDAILD 400
Query: 371 TSQLG----EQVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGTQPGY 426
++G ++V+K+ +P K++ ++ T L + + TQP
Sbjct: 401 VIRIGFVALKKVNKVQISDPQ-KAI-QLTTNLIRILASARVCPF-----------TQPLL 447
Query: 427 AYTQLSSSLLERKISYVMSPNLKATDEP 454
A ++L ++LL PN+ P
Sbjct: 448 ALSRLHTTLLISNYRSPFDPNVAEISSP 475
>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
Length = 680
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 78/343 (22%), Positives = 131/343 (38%), Gaps = 49/343 (14%)
Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGD 78
RE L KE C K+ K + + P E+ S GR + A+ DL G
Sbjct: 157 REIQLKIIKRNKEEKDC-----KDRIKRDFDQFNQFPVELVWSDQFGRRVLASEDLPKGS 211
Query: 79 RIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS--DAHT 136
+ R SP + C C+K + + D +C C RCS +
Sbjct: 212 MLLRVSPFGSCLEDDKIFKNCGFCYKKIEKSKRDQCKCKICNNFLLCERCSTTDPFASEY 271
Query: 137 HAPECAILKLGCETL--LAYNDYKYEAILPLRCLILQR-RSPKK---------------- 177
H EC IL+ E D+++ + L L ++ +S KK
Sbjct: 272 HKEECDILQFLKEYYPSTETKDFRFMFRVVLNALKEKKFQSFKKENCSSNWKKHEHPFIF 331
Query: 178 --YQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGD--VLDDTSAKCLHWI 232
YQ+L+++ + K P E ++ S + + + KL G+ +L + + +
Sbjct: 332 DEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAV----ISKLRGEDAILKYITLNEIIEL 387
Query: 233 CGVIEVNGVDI------GRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD------ 280
++ NG ++ Y G++ + H+C PNA N Q + VFR
Sbjct: 388 YSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQG-MMVFRSLRPLKK 446
Query: 281 -THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT 322
+ T YT+ + RR++L +F C C +C + L T
Sbjct: 447 GEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQCQYQSNLVT 489
>gi|195024336|ref|XP_001985854.1| GH20862 [Drosophila grimshawi]
gi|193901854|gb|EDW00721.1| GH20862 [Drosophila grimshawi]
Length = 574
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 115/308 (37%), Gaps = 54/308 (17%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E++ +P GR++ DL GD IA E P + C C + L C
Sbjct: 188 EMRETPDEGRFIVTNRDLVVGDVIAIEQPFCSSLLAPMRYIRCATCKEERYLTLIPCDSC 247
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKL------------------------GCETLL 152
C CS C + + H EC I+ L E L+
Sbjct: 248 --CSAMFCSEECKERATSTFHKYECPIIDLLHRMFNKIHGIALRTTLSALDLHGSIEELM 305
Query: 153 AYNDYKYEAILPLRCLILQRRSPKKY----------QELKDMEAHMSKRGPGTEVYE--- 199
AY D + L + +P+++ Q L+ + + + +R V +
Sbjct: 306 AYCDQPQNQHKSVFDLDYTQLTPQEHYRAIHGLVTNQHLRSV-SDLFQRSVVCAVLKHFI 364
Query: 200 -EIDSIVKYLRS----NFLEKLPGDVLDDTSAKCLHWICGVIEVNGV-DIGRYTQGLYSV 253
E + +YL NF L L TS +H I V + N D Y G Y+
Sbjct: 365 IEYTPLKEYLGGEDGRNFFTDLLFRHLQ-TSPSNMHGIDLVEQPNETKDDQSYASGAYAF 423
Query: 254 ICLMEHNCLPNAKHSNM-MQSKLFVFRDTHIS-TMYTN-----ALWGTQPRREHLAITKY 306
+ L+ H+C PN N + + LFV R +Y N A++ + R+E L++
Sbjct: 424 LSLINHSCAPNTMRINQGVHAYLFVLRPIKAGDVLYDNYGAHFAVFSKKQRQETLSMQYR 483
Query: 307 FNCSCERC 314
FNC CE C
Sbjct: 484 FNCKCEAC 491
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 120/301 (39%), Gaps = 54/301 (17%)
Query: 52 KKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGC-HKPLNPNL 110
K P E+ + L GR L+ T DL G+ + E+ +L+ +C C + +NP+
Sbjct: 5 KTDPVEVFGAGLKGRGLRGTRDLSAGEVVFAEASFAAVVLDSLSLQVCHSCFRRQVNPH- 63
Query: 111 ADNARCPRCFWPACSARCSGLSDAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLI 169
RC +C + R + H EC AI +G A N+ + L I
Sbjct: 64 ----RCAQCKFAHYCDRTCQRAAWDEHRKECSAIRNIG----KAPNEN-----VRLVARI 110
Query: 170 LQR-------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD 222
L R S + L +E H+S+ P E +E+ + VK + + +K D
Sbjct: 111 LWRIQKHTGLVSDSQLTTLDMLEDHLSRMTP--EDLKELKADVKTFYTYWPKKSKAVGED 168
Query: 223 DTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-----------A 265
S + GVI NG + R Q GL+ +CL+ H+C PN A
Sbjct: 169 YVSH-----LFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHDCWPNCTVILNHGDQSA 223
Query: 266 KHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
++ S+ R ++ Y + L + R+ L YF+C CE C + T
Sbjct: 224 LDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHCVNGT 283
Query: 319 E 319
+
Sbjct: 284 K 284
>gi|343980823|gb|AEM76787.1| hypothetical protein [Armillaria mellea]
Length = 267
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/39 (56%), Positives = 27/39 (69%), Gaps = 2/39 (5%)
Query: 15 CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C R+ L KCS CK+V+YCG + Q+ HWK HKP C
Sbjct: 226 CAACEKRDMRLQKCSRCKKVSYCGPECQRAHWKKHKPVC 264
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 111/279 (39%), Gaps = 39/279 (13%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ SP GR L+AT + GD + E P + A IC C + RC
Sbjct: 5 EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFR----RQEKLQRC 60
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
+C F C C + H ECA +K Y E + L IL R
Sbjct: 61 GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111
Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
S + L+D+E H+ ++ E+ K NFL+ P + T
Sbjct: 112 QGSVVSDNQLTTLEDLEDHI------CDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
H I GVI NG + R Q GL+ +CL+ H+C PN ++ ++ + +
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGKIELRALGKI 224
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
++ Y + L + R+ L +F+C+C+ C++
Sbjct: 225 SAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTE 263
>gi|328865944|gb|EGG14330.1| hypothetical protein DFA_12100 [Dictyostelium fasciculatum]
Length = 521
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 99/241 (41%), Gaps = 38/241 (15%)
Query: 247 TQGLYSVICLMEHNCLPNAKHS---NMMQSKLF--VFRDTHISTMYTNALWGTQPRREHL 301
T G + L+ H+C PNA S + +Q ++ + +D I Y + L T R+ L
Sbjct: 212 TNGFFYQAALLNHSCQPNAFFSFNGDKLQMRVVKDMDKDDSIYDSYVDLLLPTYERQLGL 271
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDC--WILP-VNPLDNDSD------ 352
+K F C C+RCS T + S++ C+ + DC WI P + +N D
Sbjct: 272 LKSKNFFCQCKRCS--TREDCHLSSINCVGK-----DCQSWIAPLIGKKENKEDGSTSLT 324
Query: 353 --WTCGSCSARLNARDVHLVTSQLGEQVDKL----VQENPN--VKSLEEMLTKLEAMFHP 404
W C C+ + V + QL L +EN + V+ + +LE H
Sbjct: 325 VTWGCDDCTGTASESHVKNIQDQLVSFTKYLEKRSYRENADRFVEQASVTMAQLENKIHK 384
Query: 405 HHYHCYAVKHSLIQ--LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLGCHKP 462
+H H+ +Q L+ + Q + S+ K+ + N+ D + C+K
Sbjct: 385 NH-------HARLQYHLHMSNAYIKLQQFNESIYHAKVVKRLVKNVLQCDRGELVDCYKT 437
Query: 463 L 463
+
Sbjct: 438 I 438
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 111/280 (39%), Gaps = 29/280 (10%)
Query: 58 IKSSPLLGRYLQATLDLH--PGDRIARESPLI--VGPKLALAEPICLGCHKPLNPNLADN 113
I+SS + GR L A H PGD IA P++ V P +A A C CH LN +
Sbjct: 11 IESSSVGGRGLFAGEGSHYIPGDSIATYRPIVYCVTPTMAKA---C--CHWCLNSDAPKY 65
Query: 114 ARCPRCFWPA-CSARCSGLS-DAHTHAPECAILK-LGCETLLAYNDYKYEAILPLRCLIL 170
+C C + CS C+ + TH EC ++K L E + + A L L+
Sbjct: 66 YQCSGCRYAVYCSKECAKAAYKLGTHRRECQLIKKLPAEHASSAPLTTFLAAAKLHWLV- 124
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
++ + ++ M + ++ E L S +L+ D++ +
Sbjct: 125 -----QEDEAVRHSVDEMCRHADSSDTLEADSGSTAILLSRYLDGTQADLIYSPPVPVML 179
Query: 231 WICGVIEVNGVDIGR-----YTQGLYSVICLMEHNCLPNA----KHSNMMQSKLFVFRD- 280
+ V+ N V I GLY+ + M H+C PN S + + D
Sbjct: 180 DLLRVLRYNAVTITNDSLQDVALGLYTEVSAMNHSCAPNVVLIFSGSEVTLRTIRAVEDG 239
Query: 281 THISTMYTNALWGTQPRR-EHLAITKYFNCSCERCSDPTE 319
+ Y + + +R + L F+CSCERC+ E
Sbjct: 240 AELFISYVDVCISPKAKRCQRLRDQYKFDCSCERCTREDE 279
>gi|407405659|gb|EKF30538.1| hypothetical protein MOQ_005654 [Trypanosoma cruzi marinkellei]
Length = 585
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 77/338 (22%), Positives = 127/338 (37%), Gaps = 63/338 (18%)
Query: 107 NPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPL 165
P A +C RC F CS C+ D EC + E L + +A +P
Sbjct: 203 EPTAAALRKCRRCQFVSYCSDTCASCHDRQHEEYECRLFFRLREMLRSMQ--SCDAAVP- 259
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLE---------KL 216
+ M H G +V +E V L S+ +E +L
Sbjct: 260 -------------DDFVTMATHCITTVSGVKVRKEGHEAVLRLESHEVEVSQGLTPLLRL 306
Query: 217 PGDVLDDTSAKCLHWICGVIEVN--------GVDIGRYTQGLYSVICLMEHNCLPN-AKH 267
D+ ++ + I GV+ N G+ +G+ + SV H+CLPN A
Sbjct: 307 VQDLFNEEDPTFITRILGVVRCNALAVCDASGLPVGQALH-VASVTSYFNHSCLPNCAIE 365
Query: 268 SNMMQSKLFVFRDTHISTMYTNAL-WGTQPRREHLAITKYFNCSCERCSDPTE--LGTYF 324
++ + + + ++ Y L W RRE LA +F+C C RC D +
Sbjct: 366 ADAIVTTRAIRPGEELTISYLPQLYWPAWLRREELAERYFFDCRCMRCGDGDRHPFESAL 425
Query: 325 SAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQE 384
SA L +++ +I V L CG R+ A+DV V +G++
Sbjct: 426 SATLRLGGSREKEREYISSVQIL-------CG----RVRAKDVGDV--DVGDR------- 465
Query: 385 NPNVKSLEEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
+L +L + P HY C+ ++++L +Y
Sbjct: 466 ----DALLHLLQECRQHLFPFHYLCHELQNTLSFVYAV 499
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 110/291 (37%), Gaps = 44/291 (15%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
+ S P GR L AT PGD I + P P L C C + NL + C
Sbjct: 19 VASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFT--SGNLRKCSMC- 75
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKK 177
R W CS+ C E + +L C + A + + + + P L+++ +K
Sbjct: 76 RVTW-YCSSNC--------QKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRK 126
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
Q K M +S P + E+D L + + + C
Sbjct: 127 LQNEK-MANLVSLILP----FIELD------------------LKEIAQTFSKFACNAHT 163
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALW 292
+ ++ GLY VI ++ H+C+PNA + +++ + +D +S Y
Sbjct: 164 ICDPELRPLGTGLYPVISIINHSCVPNAVLIFDGRTAYVRALQPINKDEEVSISYIETAT 223
Query: 293 GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
T+ R L +F C+C RC + +C N+ D ++LP
Sbjct: 224 VTKKRNNDLK-QYFFTCTCPRCVKGFDEDALLEGFRCKNQ---ACDGFLLP 270
>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
206040]
Length = 543
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 67/274 (24%), Positives = 112/274 (40%), Gaps = 31/274 (11%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
+I+ P GR L AT PG I +PL++ P ++ +C C +P NP C
Sbjct: 12 QIRPHPTKGRGLYATQSFSPGQVILPFTPLLLLPTVSYLNIVCSYCLRPGNPRA-----C 66
Query: 117 PRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
RC + C A C + H+ EC +L+ G + D + P R LI
Sbjct: 67 SRCHAASYCDATCQAAAWKAVHSRECKVLRQGIKD----EDRRRRLPTPTRALIQAVLRK 122
Query: 176 KKYQELKDMEAH-MSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICG 234
+ L+ +E H + K+ + +++I+ I+ F K G +D K +C
Sbjct: 123 EIGDGLEGLEGHVLEKKAAEGDEWKDIE-IMAMAACAFSGK--GTAEEDIR-KAAEMLCK 178
Query: 235 V----IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------ 284
+ + D+G L + + H+C+PNA + ++ L + + +
Sbjct: 179 IQNNSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLIAENPIRAGDEMEL 238
Query: 285 --TMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
T YT+ L P+R+ F C C RC D
Sbjct: 239 AYTFYTDPL----PKRKEALAHYKFVCQCLRCRD 268
>gi|299116928|emb|CBN75038.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 457
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 114/290 (39%), Gaps = 65/290 (22%)
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
++C+++ R+ P L + A R E + SI + + F P + +D
Sbjct: 102 IQCMLMYRKDPLSLDPLAPLTAANLPR-----CLESVVSIRELIPGLF----PDGMTNDR 152
Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNMMQSKLFVF----- 278
+A+ I G + N ++ + GL+ + C+MEH+C PN + KL+V
Sbjct: 153 AAR----ILGTLNTNSHELEQVEGSGLFLMACIMEHDCTPNCSFTT-FGDKLWVTAIRQV 207
Query: 279 -RDTHISTMYTNALW-GTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQ 336
+S Y N + T R+ L F C+C C T L A +C ++
Sbjct: 208 GEGERLSIDYGNNFYLPTAERKAELEDIYGFVCTCRAC---TVLPDRCRAFRCPSK---- 260
Query: 337 GDCWILPVNPLDNDSD------WTCGSCSARLNARDVHLV-------------TSQLG-E 376
DC V P + D W C +C + + ++ ++ +LG E
Sbjct: 261 -DCRTGKVCPHGDGGDESAERSWGCLACGHQCDPEEIEVLLEAEESLSEALEMADELGVE 319
Query: 377 QVDKLVQENPNVKSLEEMLTKLEAMFHPHHYHCY-AVKHSLIQLYGTQPG 425
Q+D LVQ+ V HP HY + A++ + +L G G
Sbjct: 320 QIDALVQDARVV--------------HPTHYVVFWALEATAKRLSGESAG 355
>gi|393243929|gb|EJD51442.1| hypothetical protein AURDEDRAFT_181799 [Auricularia delicata
TFB-10046 SS5]
Length = 394
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 36/65 (55%), Gaps = 3/65 (4%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP-LLGRYLQAT 71
MRCAVC E CS+CK V YC + QK HW H+ C+ P +I SS L+ L+
Sbjct: 1 MRCAVCPEPGTKLCSSCKSVRYCSAKCQKAHWAEHRFTCR--PDKISSSDRLVHAVLKNA 58
Query: 72 LDLHP 76
+ HP
Sbjct: 59 MPTHP 63
>gi|301105252|ref|XP_002901710.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100714|gb|EEY58766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 500
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 97/263 (36%), Gaps = 44/263 (16%)
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
L+CL + P + D+ +R + + LR + P D
Sbjct: 124 LKCLAMYECDPHALDGMMDLTCVGQQR---------VTNATLQLRRQLPDIFPAGFTDQQ 174
Query: 225 SAKCLHWICGVIEVNGVDIGRYT-QGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVF 278
A + GV+ N ++ GL+ C MEHNC PN + M + +
Sbjct: 175 MAT----LIGVLNTNSHELENLGGSGLFLSACRMEHNCKPNCSFTTFDSTLWMTAIRPIA 230
Query: 279 RDTHISTMYTNALWGTQPRREHLAITKY-FNCSCERC---SDPTELGTYFSAMKCLNEHK 334
+S Y N + P R+ + Y F C+CE C DPT A++CL+ +
Sbjct: 231 PGDALSIDYGNFFYRPTPERQECLLESYGFICTCEACVSLPDPTR------AVRCLSRNC 284
Query: 335 DQGDCWILP---------VNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQEN 385
QG P + P +W C +C + + + + ++ + EN
Sbjct: 285 PQGVMLPCPARTNTASSSIQPKKMQFEWRCQTCGTTADTAEHSRIFA-----AEQELLEN 339
Query: 386 PNVKSLEEMLTKL-EAMFHPHHY 407
+SLEE+ + + H HY
Sbjct: 340 GFPESLEEVDAVVSRGVLHESHY 362
>gi|341874291|gb|EGT30226.1| CBN-SET-30 protein [Caenorhabditis brenneri]
Length = 555
Score = 48.1 bits (113), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 99/231 (42%), Gaps = 38/231 (16%)
Query: 124 CSARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAIL-----PLRCLILQR 172
CS C S A H EC LK C E +L +Y+ I + R
Sbjct: 62 CSKECQA-SGAFDHKYECGALK-KCSDLNTDERMLIRIIGRYKEIQDGNDKKIEGFYKNR 119
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
S + ++ + A M K V+++I D + ++ N+L +D+ +A LH
Sbjct: 120 ESIRSVMQIWEHCADMKKDESAMNVFKKIYDRVKEFGDKNYL-------VDEETAFQLHS 172
Query: 232 --ICGVIEVNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HI 283
++ VD R +GLY +C +H+C PNA +S N +KL D ++
Sbjct: 173 RNFINRHSISNVDYLREIGKGLYLDLCRYDHSCRPNAIYSCNGTIAKLRALHDNVDLENV 232
Query: 284 STMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
T T+ + P RR L T YF C C+RC DPT+ + +A+ C
Sbjct: 233 ET--THYTYIELPPCKIQRRHMLKETWYFECHCDRCDDPTD--NWLTAVIC 279
>gi|392568936|gb|EIW62110.1| hypothetical protein TRAVEDRAFT_27483 [Trametes versicolor
FP-101664 SS1]
Length = 399
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 32/58 (55%), Gaps = 7/58 (12%)
Query: 3 SIIEIEDEELMRC--AVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
S E++ L RC A C R +C CKEV YCG+ QKE WK HKP+CK+
Sbjct: 299 SDAEVKSYVLKRCVRAGCDTREVRVAEFKRCGGCKEVVYCGQACQKEDWKAHKPRCKE 356
>gi|342179951|emb|CCC89425.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 433
Score = 47.8 bits (112), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 60/263 (22%), Positives = 103/263 (39%), Gaps = 22/263 (8%)
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL-EKLPGDVLDDTSAKC 228
++ R P L +++ H EV + + +V+ + E+LP +V A+
Sbjct: 110 IKSRRPGYRVVLNELQGH------AAEVSQNLSHMVRMVGELLRDEELPLEV-----ARL 158
Query: 229 LHWI-CGVIEVNGVDIGRYTQGLY--SVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHIS 284
+ I C IEVN Q L+ ++ H+C PN A + + + ++
Sbjct: 159 IGIIRCNTIEVNNELSLGVGQALHATTITSYFNHSCSPNCAIQGEFIVTTRVIAAGEELT 218
Query: 285 TMYTNAL-WGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILP 343
Y L W RRE LA T YF+CSCERC D + + L ++
Sbjct: 219 ISYMPQLYWPVALRREELANTYYFHCSCERCRDNENDPFEAALVAVLRSGCEETGGGGGA 278
Query: 344 VNPLDNDSDWTCGSCSARLNARDVHLVTSQL----GEQVDKLVQENPNVKSLEEMLTKLE 399
+ + + CG L A+ H+ Q +D + L+ +L K
Sbjct: 279 QDGNNAKAARVCG-LENHLEAQRKHITDVQTLCNRIRSMDPSDITTTDRNELQRLLQKCC 337
Query: 400 AMFHPHHYHCYAVKHSLIQLYGT 422
+P HY C+ ++++L Y
Sbjct: 338 EELYPFHYLCHELRNALSFTYAA 360
>gi|323449643|gb|EGB05529.1| hypothetical protein AURANDRAFT_72192 [Aureococcus anophagefferens]
Length = 1848
Score = 47.8 bits (112), Expect = 0.014, Method: Composition-based stats.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 9 DEELMR-CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
D+ + R CA C A C +CK YC ++ QK+HW LHK C
Sbjct: 185 DDRMRRMCAACGARAKFSCGSCKAARYCSRECQKDHWGLHKDDC 228
>gi|393213478|gb|EJC98974.1| hypothetical protein FOMMEDRAFT_31532 [Fomitiporia mediterranea
MF3/22]
Length = 374
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 24/43 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
C +C A C AC+ YCG HQ+ WK HK C++ PYE
Sbjct: 326 CELCPLEATKTCGACRGAWYCGDAHQRRDWKFHKSWCRQHPYE 368
>gi|290990111|ref|XP_002677680.1| predicted protein [Naegleria gruberi]
gi|284091289|gb|EFC44936.1| predicted protein [Naegleria gruberi]
Length = 1179
Score = 47.8 bits (112), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 34/57 (59%), Gaps = 5/57 (8%)
Query: 10 EELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
+E+ +C C E++L +CSAC V YC + QK+ WK HK KCKK+ E K +
Sbjct: 997 KEIGKCNTCGKGSSSESSLKRCSACNSVRYCSVECQKKDWKNHKEKCKKIQEESKQT 1053
>gi|393243935|gb|EJD51448.1| hypothetical protein AURDEDRAFT_159180 [Auricularia delicata
TFB-10046 SS5]
Length = 395
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK-SSPLLGRYLQAT 71
MRC VC E + +C++C+ V YC ++ Q+ HW HK C+ P +I + L+ L+ +
Sbjct: 1 MRCEVCSEPGVKRCASCQSVRYCSQKCQQAHWAEHKFTCR--PDKITDADQLVHAVLKGS 58
Query: 72 LDLHPGDRIA 81
+ HP ++A
Sbjct: 59 MPTHPDVQLA 68
>gi|168063352|ref|XP_001783636.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664826|gb|EDQ51531.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 582
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 7/78 (8%)
Query: 2 ASIIEIEDEELMR---CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
A + I ELM CA C++ A+ +CS CK YCG++ Q WK+HK C L ++
Sbjct: 508 AQTVPIWGIELMEQPTCAACKKAAVQRCSRCKSEWYCGRECQVSVWKMHKGACDILAGKV 567
Query: 59 KSSPLLGRYLQATLDLHP 76
+ S + QA + L P
Sbjct: 568 EKSGV----AQADISLDP 581
>gi|118387165|ref|XP_001026696.1| MYND finger family protein [Tetrahymena thermophila]
gi|89308463|gb|EAS06451.1| MYND finger family protein [Tetrahymena thermophila SB210]
Length = 1283
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 30/49 (61%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
E +C +C++ A +CS CK V YC ++ Q HWK HK CKK+ E K
Sbjct: 1044 EGFKCELCKKEATKRCSQCKTVWYCTRECQVAHWKDHKIACKKIVEENK 1092
>gi|66815713|ref|XP_641873.1| hypothetical protein DDB_G0279115 [Dictyostelium discoideum AX4]
gi|60469913|gb|EAL67896.1| hypothetical protein DDB_G0279115 [Dictyostelium discoideum AX4]
Length = 1045
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 12 LMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKC 51
L RC C++T + C CK VAYC K QK+HW +HKP C
Sbjct: 808 LYRCWSCKQTLMKPLICGYCKTVAYCSKDCQKDHWPIHKPSC 849
>gi|406867546|gb|EKD20584.1| hypothetical protein MBM_01266 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 205
Score = 47.4 bits (111), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 2 ASIIEIEDEELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
A IEI L CAVC ++ + CS C++ AYC K+ QK HW HK C+ + +K+
Sbjct: 132 APGIEILKPGLKSCAVCGQSDTVKPCSGCRDFAYCSKECQKTHWPFHKKHCRHV-QAVKA 190
Query: 61 SPLLGRYLQATL 72
+ G A L
Sbjct: 191 ATEAGSMTAAAL 202
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 47.4 bits (111), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 33/219 (15%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAILPLRCLILQ 171
N C + W S C+ L + P+ A +L ++ N E I
Sbjct: 55 NQSCQQMSWTIHSKECASLKRFSSKVIPDVA--RLMARIIIKLNQGGGEEI--------G 104
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
S KY++ KD+ +H S + E + L + GD+ SA+ L
Sbjct: 105 YYSKTKYRKFKDLMSHYSDIKKDEKKMEHFVCVC-----GVLYEFLGDMSIPNSAE-LMG 158
Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT----- 281
I G I +N +D+ G+Y +++H+C PNA + + + R T
Sbjct: 159 IYGRIYINSFNISDLDMNNIGAGIYLGPSILDHSCKPNAVAT--FEGTTIIIRTTEDLPC 216
Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
I Y + + T+ RRE L + YF C+C++C +
Sbjct: 217 LDLSQIRISYIDVIKTTKDRREELQSSYYFWCNCKKCEE 255
>gi|395325441|gb|EJF57863.1| hypothetical protein DICSQDRAFT_149470 [Dichomitus squalens
LYAD-421 SS1]
Length = 424
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 47/104 (45%), Gaps = 21/104 (20%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQAT 71
RCA C +A L C+AC + YC KQ Q+ HW LHK +C +P+ S+ L + +
Sbjct: 238 RCAACGSSAGTLKFCNACHAIRYCSKQCQRSHWSLHKTRC--IPFSADSTVTLTPFYED- 294
Query: 72 LDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+GP ++LA+ P+ A NAR
Sbjct: 295 ----------------IGPVMSLADVARSAFGFPVEKQPARNAR 322
>gi|366998786|ref|XP_003684129.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
gi|357522425|emb|CCE61695.1| hypothetical protein TPHA_0B00230 [Tetrapisispora phaffii CBS 4417]
Length = 523
Score = 47.4 bits (111), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 12/89 (13%)
Query: 243 IGRY-----TQGLYSVICLMEHNCLPNAK-----HSNMMQ--SKLFVFRDTHISTMYTNA 290
IGRY + LYS+ L+ HNC PN + H+ ++ ++ + +++ + T Y N
Sbjct: 347 IGRYNLNHKSAQLYSIYSLVNHNCEPNVRVEVDYHTKELKLYARKSISKNSELLTTYINP 406
Query: 291 LWGTQPRREHLAITKYFNCSCERCSDPTE 319
L + RR+ L I F C+CERC + +
Sbjct: 407 LHDVELRRKILLINYGFACNCERCENEID 435
>gi|261329994|emb|CBH12977.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 443
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 53/116 (45%), Gaps = 12/116 (10%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHP 76
C L +CS+CK V YC ++ Q+EHW H+ CK Y+ ++ +L Y+ A ++
Sbjct: 326 CGGEGLLRCSSCKAVYYCSEECQREHWSAAHRVPCKN--YKQRAETILAEYIAANKNVVG 383
Query: 77 GDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
G R R S L E + PL P+L R R + A SG+S
Sbjct: 384 GKRGKRMS-------LQQQEELVAVLEVPLEPSLFYETR--RYLYDHRDASFSGVS 430
>gi|195029123|ref|XP_001987424.1| GH19974 [Drosophila grimshawi]
gi|193903424|gb|EDW02291.1| GH19974 [Drosophila grimshawi]
Length = 346
Score = 47.4 bits (111), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 21/37 (56%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C H CS CK + YC HQ+ HW LHKP C
Sbjct: 128 CAACGCLGPHTCSRCKSINYCSSTHQRAHWPLHKPNC 164
>gi|242216123|ref|XP_002473871.1| predicted protein [Postia placenta Mad-698-R]
gi|220726971|gb|EED80904.1| predicted protein [Postia placenta Mad-698-R]
Length = 1108
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + RC VCR + C CK+V YC QK+HWK+HK +CK
Sbjct: 1060 RDPDAHRCFVCRGKGKPRIRACGGCKKVRYCSAACQKKHWKIHKSQCK 1107
>gi|307171609|gb|EFN63394.1| Egl nine-like protein 1 [Camponotus floridanus]
Length = 491
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 3/85 (3%)
Query: 3 SIIEIEDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
+I+ + CAVC R L +CS CK V YC K+HQ+ WK H+ C P +
Sbjct: 26 AIVGDASSRVTVCAVCDRTDKLLRCSRCKAVFYCTKEHQRRDWKRHREFCATHP--ARED 83
Query: 62 PLLGRYLQATLDLHPGDRIARESPL 86
P + + +P D +R+ P+
Sbjct: 84 PTESAHPSSPGSRNPRDTPSRQRPV 108
>gi|328872812|gb|EGG21179.1| MYND-type zinc finger-containing protein [Dictyostelium
fasciculatum]
Length = 961
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 29/52 (55%), Gaps = 3/52 (5%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
C VC T + C+ C V YCG+ HQ HW +HK KC L K + L+GR
Sbjct: 28 CEVCGSTNVQICTGCLLVYYCGENHQLLHWPVHKNKCSALD---KRTDLIGR 76
>gi|242215015|ref|XP_002473326.1| predicted protein [Postia placenta Mad-698-R]
gi|220727553|gb|EED81468.1| predicted protein [Postia placenta Mad-698-R]
Length = 1161
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + RC VCR + C CK+V YC QK+HWK+HK +CK
Sbjct: 1113 RDPDAHRCFVCRGKGKPRIRACGGCKKVRYCSAACQKKHWKIHKSQCK 1160
>gi|72392066|ref|XP_846327.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175451|gb|AAX69592.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70802863|gb|AAZ12768.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 443
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/116 (31%), Positives = 53/116 (45%), Gaps = 12/116 (10%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHP 76
C L +CS+CK V YC ++ Q+EHW H+ CK Y+ ++ +L Y+ A ++
Sbjct: 326 CGGEGLLRCSSCKAVYYCSEECQREHWSAAHRVPCKN--YKQRAETILAEYIAANKNVVG 383
Query: 77 GDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLS 132
G R R S L E + PL P+L R R + A SG+S
Sbjct: 384 GKRGKRMS-------LQQQEELVAVLEVPLEPSLFYETR--RYLYDHRDASFSGVS 430
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 6/75 (8%)
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFV--FRDTH----ISTMYTNALWGTQPRREHLA 302
GL+S+ L+ H+C PN SN S + V RD I+ Y + R++ L
Sbjct: 164 GLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQKELR 223
Query: 303 ITKYFNCSCERCSDP 317
TK+F+C CERCS P
Sbjct: 224 DTKHFDCQCERCSTP 238
>gi|194771646|ref|XP_001967706.1| GF20518 [Drosophila ananassae]
gi|190617390|gb|EDV32914.1| GF20518 [Drosophila ananassae]
Length = 288
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 69 CAACGCLAPHTCSKCKSIHYCSSEHQRAHWPQHKPNC 105
>gi|344301790|gb|EGW32095.1| hypothetical protein SPAPADRAFT_61174 [Spathaspora passalidarum
NRRL Y-27907]
Length = 415
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 68/300 (22%), Positives = 111/300 (37%), Gaps = 56/300 (18%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNA----R 115
G+ L A ++ G+ I E PL P LA I C C + L + +
Sbjct: 122 GKGLYAKHNIAKGELIWEEEPLFFIPPLANINLIKSGTACSHCGRLLQQSEGNTVLRGLD 181
Query: 116 CPRCFWPACSARCS-------GLSDAHTHAPECAILKL-GCETLLAYNDYKYE------- 160
C C C +C GL + + P KL E L DY E
Sbjct: 182 CQVCAEVWCGTKCKKLDGVLHGLLKHNVYNPNAKRSKLFDPEAFLHMQDYCLEDQWNALY 241
Query: 161 AILPLRCLILQRRSPKKYQELKDME----------------AHMSKRGPGTEVYEEIDSI 204
AI + IL +S K ++ K M A + G V E+ +++
Sbjct: 242 AITLIYADILNDKSGTKAKQFKAMARVSQDIRYKALNSSAGAFDTLSGGALFVQEQQENL 301
Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN 264
K FL P + +D + ++ G +N +D ++ + + HNC PN
Sbjct: 302 WKLGYEKFLRIFPSNPID---YQEFMYMMGTYNINNLD-----SNVFLIQSHLNHNCNPN 353
Query: 265 AK-HSNMMQS---KLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
+ ++++ K++ RD ++T Y N Q R+ L + F C CE+C +
Sbjct: 354 TSVETELLRTDGLKVYAARDIRAGEELTTTYVNPKNTVQQRQRELRVNWGFLCGCEKCKE 413
>gi|169865882|ref|XP_001839538.1| hypothetical protein CC1G_08917 [Coprinopsis cinerea okayama7#130]
gi|116499410|gb|EAU82305.1| hypothetical protein CC1G_08917 [Coprinopsis cinerea okayama7#130]
Length = 1180
Score = 47.0 bits (110), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 28/42 (66%), Gaps = 3/42 (7%)
Query: 14 RCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C+VCR + L CSAC+ V YC + QK+ WK HKPKC+
Sbjct: 1136 KCSVCRGQGKPKLKACSACQRVRYCSTECQKKDWKAHKPKCQ 1177
>gi|290988356|ref|XP_002676887.1| hypothetical protein NAEGRDRAFT_58086 [Naegleria gruberi]
gi|284090492|gb|EFC44143.1| hypothetical protein NAEGRDRAFT_58086 [Naegleria gruberi]
Length = 499
Score = 47.0 bits (110), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 25/38 (65%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
CAVC++ A KC+ CK V YC QK+HW +HK CK
Sbjct: 459 CAVCKKAASKKCANCKAVYYCSADCQKKHWSVHKKLCK 496
>gi|194756844|ref|XP_001960680.1| GF11375 [Drosophila ananassae]
gi|190621978|gb|EDV37502.1| GF11375 [Drosophila ananassae]
Length = 347
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 128 CAACGCLAPHTCSKCKSIHYCSSEHQRAHWPQHKPNC 164
>gi|336364484|gb|EGN92841.1| hypothetical protein SERLA73DRAFT_116652 [Serpula lacrymans var.
lacrymans S7.3]
gi|336388527|gb|EGO29671.1| hypothetical protein SERLADRAFT_445456 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1163
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D E RC VCR + L C+ CK+V YC + QK+ WK+HK +CK
Sbjct: 1115 RDPEAYRCFVCRGKGKPKLKMCTVCKKVRYCSSECQKKDWKVHKLRCK 1162
>gi|299741263|ref|XP_001834348.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
gi|298404633|gb|EAU87325.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
Length = 434
Score = 47.0 bits (110), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 43/80 (53%), Gaps = 19/80 (23%)
Query: 18 CRET---ALHKCSACKEVAYCGKQHQKEHWKLHKPKCK---KLPYEIKSSPLLGRYLQAT 71
RET +CSAC+ V YCG++ QKE WK HKP+CK KL IK
Sbjct: 352 ARETEVAQFKRCSACRLVVYCGQECQKEDWKRHKPECKQHLKLKSTIK------------ 399
Query: 72 LDLHPGDRIARESPLIVGPK 91
+ + G +A ++P+++ P+
Sbjct: 400 -NWNNGRPVAADAPMLLSPE 418
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 47.0 bits (110), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 87/219 (39%), Gaps = 33/219 (15%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDYKYEAILPLRCLILQ 171
N C + W S C+ L + P+ A +L ++ N E I
Sbjct: 55 NQSCQQMSWTIHSKECASLKRFSSKVIPDVA--RLMARIIIKLNQGGGEEI--------G 104
Query: 172 RRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
S KY++ KD+ +H S + E + L + GD+ SA+ L
Sbjct: 105 YYSKTKYRKFKDLMSHYSDIKKDEKKMEHFVCVC-----GVLYEFLGDMSIPNSAE-LMG 158
Query: 232 ICGVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDT----- 281
I G I +N +D+ G+Y +++H+C PNA + + + R T
Sbjct: 159 IYGRIYINSFNISDLDMNNIGAGIYLGPSILDHSCKPNAVAT--FEGTTIIIRTTEDLPC 216
Query: 282 ----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
I Y + + T+ RRE L + YF C+C++C +
Sbjct: 217 LDLSQIRISYIDVIKTTKDRREELQNSYYFWCNCKKCEE 255
>gi|145523317|ref|XP_001447497.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415008|emb|CAK80100.1| unnamed protein product [Paramecium tetraurelia]
Length = 143
Score = 47.0 bits (110), Expect = 0.028, Method: Composition-based stats.
Identities = 21/49 (42%), Positives = 26/49 (53%)
Query: 4 IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
II E + C +C++ +C CK V YCG QK HWK HK CK
Sbjct: 95 IILYEPAKQTICGICKKNGELRCGQCKLVYYCGVDCQKTHWKQHKGFCK 143
>gi|401417428|ref|XP_003873207.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489435|emb|CBZ24694.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 622
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 83/202 (41%), Gaps = 40/202 (19%)
Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
C +EV +G+ + + S+ H+C PN + ++ + R H+ S
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCAPNCA-IDTVRHAIVTTRTIHVGEELSI 436
Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
Y L W T+ RRE L+ YF C C+RC DP E +M+ KD +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSGTKDPFERAL---SMELPTARKDATKHF 493
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
+P+ + C + +R+ VD + Q++ + L ++L+ L
Sbjct: 494 ----HPIVQAA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529
Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
P HY C+ V++ L +Y
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551
>gi|169846118|ref|XP_001829775.1| hypothetical protein CC1G_05984 [Coprinopsis cinerea okayama7#130]
gi|116509102|gb|EAU91997.1| hypothetical protein CC1G_05984 [Coprinopsis cinerea okayama7#130]
Length = 254
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 14 RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C R+ L KCSACK V YC QK +W+ HK +CK+L
Sbjct: 207 RCKACGKRDAKLMKCSACKSVTYCSADCQKANWRDHKARCKEL 249
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 112/292 (38%), Gaps = 52/292 (17%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ SP GR L+AT + GD + E P + A IC C + RC
Sbjct: 5 EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKL----QRC 60
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
+C F C C + H ECA +K Y E + L IL R
Sbjct: 61 GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111
Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
S + L+D+E H+ ++ E+ K NFL+ P + T
Sbjct: 112 QGSVVSDNQLTTLEDLEDHIC------DISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN--AKHSNMMQSKL-FVFR 279
H I GVI NG + R Q GL+ +CL+ H+C PN +N QS + VF
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNNGNQSAIDTVFH 224
Query: 280 DT---------------HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
++ Y + L + R+ L +F+C+C+ C++
Sbjct: 225 SQKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTE 276
>gi|397640191|gb|EJK73983.1| hypothetical protein THAOC_04367, partial [Thalassiosira oceanica]
Length = 922
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 5/71 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ + CA C A L KC+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 84 SCVPVADDGIETCANCGTIASDTVKLKKCTACQLVKYCGVDCQRAHRKQHKKACKQRAAE 143
Query: 58 IKSSPLLGRYL 68
+K L G+ L
Sbjct: 144 LKDEQLYGQGL 154
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 41/89 (46%), Gaps = 7/89 (7%)
Query: 5 IEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
++ + E CA C E L C+AC+ V YCG Q+ H K HK CK+ E+K
Sbjct: 675 VDRKSETGEACAKCGEHGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELK 734
Query: 60 SSPLLGRYLQATLDLHPGDRIARESPLIV 88
L A + R+A++ P+ +
Sbjct: 735 DEQLYAN--DAGILAMVRARVAKKDPVAI 761
>gi|170118408|ref|XP_001890383.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634653|gb|EDQ98981.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1168
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + +C VCR + +CS C++V YC + QK+ WK HKPKCK
Sbjct: 1120 RDPDARKCRVCRGKGKPKIKECSVCQKVRYCSPECQKKDWKAHKPKCK 1167
>gi|397602865|gb|EJK58288.1| hypothetical protein THAOC_21605, partial [Thalassiosira oceanica]
Length = 245
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ + CA C A L KC+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 93 SCVPVADDGIETCANCGTIASDTVKLKKCTACQLVKYCGVDCQRAHRKQHKKACKQRAAE 152
Query: 58 IKSSPLLGRYLQ 69
+K L G+ L+
Sbjct: 153 LKDEQLYGQGLE 164
>gi|339897286|ref|XP_003392344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399152|emb|CBZ08492.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 622
Score = 46.6 bits (109), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 48/202 (23%), Positives = 82/202 (40%), Gaps = 40/202 (19%)
Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
C +EV +G+ + + S+ H+C PN + ++ + R H+ S
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIITTRTIHVGEELSI 436
Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
Y L W T+ RRE L+ YF C C+RC DP E M+ KD +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFERAL---VMELPTARKDATKHF 493
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
+P+ + C + +R+ VD + Q++ + L ++L+ L
Sbjct: 494 ----HPIVQTA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529
Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
P HY C+ V++ L +Y
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551
>gi|428177962|gb|EKX46839.1| hypothetical protein GUITHDRAFT_107195 [Guillardia theta CCMP2712]
Length = 504
Score = 46.6 bits (109), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 52/124 (41%), Gaps = 21/124 (16%)
Query: 15 CAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKC--------KKLPYEIKSS 61
CA C++T L KC C++ YCG + QKEHW H+ C K E+K+
Sbjct: 14 CANCKKTRDMVETLLKCPWCRKFFYCGSECQKEHWPEHRNFCPPRSSTSSKGEKLELKAK 73
Query: 62 PLLGRYLQATLDLHPGDRIARESPL---IVGPKLALAEPICLGCHKPLNPNLADNARCPR 118
PL + L+ + RI R L + L +C C NP L CP
Sbjct: 74 PLEQKILETETEEQR--RIRRRKELAENVDDDDLPEGWTMCSSCQ---NPVLIAWLECPS 128
Query: 119 CFWP 122
CF P
Sbjct: 129 CFRP 132
>gi|307108397|gb|EFN56637.1| hypothetical protein CHLNCDRAFT_144433 [Chlorella variabilis]
Length = 142
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 32/50 (64%), Gaps = 4/50 (8%)
Query: 15 CAVCRETALH--KCSACKEVAYCGKQHQKEHWKL--HKPKCKKLPYEIKS 60
CA C +TALH KCS C+ VAYC + Q +HWK HK +CK+L E S
Sbjct: 92 CAQCDQTALHLSKCSGCRAVAYCSRACQVKHWKEGGHKKECKRLAAEQSS 141
>gi|440800582|gb|ELR21618.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 254
Score = 46.2 bits (108), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 30/47 (63%), Gaps = 3/47 (6%)
Query: 11 ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
E+ +CAVC+ E L +CS CKEV YC ++HQ E WK H CK L
Sbjct: 2 EVKKCAVCQKSEGELQRCSRCKEVHYCSREHQVEDWKKGHNKACKAL 48
>gi|397567676|gb|EJK45717.1| hypothetical protein THAOC_35652 [Thalassiosira oceanica]
Length = 892
Score = 46.2 bits (108), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 7/61 (11%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPL------LGR 66
+CA C E A C CK V YCG+ Q++HW+ HK C K+P ++ + + LG+
Sbjct: 6 KCAACGEEASKICQGCKLVGYCGRDCQQQHWREGHKVICSKVPKAVQKAEVKIDNFSLGQ 65
Query: 67 Y 67
Y
Sbjct: 66 Y 66
>gi|397575310|gb|EJK49633.1| hypothetical protein THAOC_31473 [Thalassiosira oceanica]
Length = 1054
Score = 46.2 bits (108), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 29/47 (61%), Gaps = 1/47 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
CAVC T +L CS C VAYC + HQ +HW HK CK+L E K+
Sbjct: 1006 CAVCGITVSLKLCSLCSSVAYCCRDHQVKHWPNHKADCKRLKKESKT 1052
>gi|195586319|ref|XP_002082925.1| GD11837 [Drosophila simulans]
gi|194194934|gb|EDX08510.1| GD11837 [Drosophila simulans]
Length = 347
Score = 46.2 bits (108), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164
>gi|21064263|gb|AAM29361.1| GM13546p [Drosophila melanogaster]
Length = 347
Score = 46.2 bits (108), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164
>gi|194886157|ref|XP_001976561.1| GG22945 [Drosophila erecta]
gi|190659748|gb|EDV56961.1| GG22945 [Drosophila erecta]
Length = 345
Score = 46.2 bits (108), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 127 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 163
>gi|290985106|ref|XP_002675267.1| predicted protein [Naegleria gruberi]
gi|284088862|gb|EFC42523.1| predicted protein [Naegleria gruberi]
Length = 1914
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA C + + KC AC+ V YC K+ Q HWK+HK +CKKL
Sbjct: 1874 CACCGKPNSKKKCGACQAVVYCSKECQASHWKVHKTQCKKL 1914
>gi|255548628|ref|XP_002515370.1| cysteine-type endopeptidase, putative [Ricinus communis]
gi|223545314|gb|EEF46819.1| cysteine-type endopeptidase, putative [Ricinus communis]
Length = 632
Score = 46.2 bits (108), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 13/80 (16%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL-----------PYEIKSSP 62
CA C KCS CK V YC + Q+ HWK HK KCK L + +K+S
Sbjct: 68 CANCGNLGTKKCSRCKSVRYCSTECQEVHWKSGHKSKCKNLVRVNSVENAKTNFGVKASG 127
Query: 63 LLGRYLQATLDLHPGDRIAR 82
L GR + + + L P + I +
Sbjct: 128 LGGR-ISSGIALVPSNGITK 146
>gi|398012025|ref|XP_003859207.1| hypothetical protein, conserved [Leishmania donovani]
gi|322497420|emb|CBZ32495.1| hypothetical protein, conserved [Leishmania donovani]
Length = 622
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 82/202 (40%), Gaps = 40/202 (19%)
Query: 233 CGVIEV---NGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHI----ST 285
C +EV +G+ + + S+ H+C PN + ++ + R H+ S
Sbjct: 378 CNALEVTDPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIITTRTIHVGEELSI 436
Query: 286 MYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLNEHKDQGDCW 340
Y L W T+ RRE L+ YF C C+RC DP E M+ +D +
Sbjct: 437 AYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFERAL---GMELPTARRDATKHF 493
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
+P+ + C + +R+ VD + Q++ + L ++L+ L
Sbjct: 494 ----HPIVQTA---CATVRSRM---------------VDDVSQKDAD--ELSKLLSDLST 529
Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
P HY C+ V++ L +Y
Sbjct: 530 HLFPFHYLCHEVRNCLSFVYAV 551
>gi|195341850|ref|XP_002037518.1| GM18310 [Drosophila sechellia]
gi|194132368|gb|EDW53936.1| GM18310 [Drosophila sechellia]
Length = 347
Score = 45.8 bits (107), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSEHQRAHWPQHKPNC 164
>gi|343475522|emb|CCD13111.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 433
Score = 45.8 bits (107), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 19/154 (12%)
Query: 170 LQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL-EKLPGDVLDDTSAKC 228
++ R P L +++ H EV + + +V+ + E+LP +V A+
Sbjct: 110 IKSRRPGYRVVLNELQGH------AAEVSQNLSHMVRMVGELLRDEELPLEV-----ARL 158
Query: 229 LHWI-CGVIEV-NGVDIGRYTQGLY--SVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
+ I C IEV N + +G Q L+ ++ H+C PN A + + + +
Sbjct: 159 IGIIRCNTIEVSNELSLG-VGQALHATTITSYFNHSCSPNCAIQGEFIVTTRVIAAGEEL 217
Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERCSD 316
+ Y L W RRE LA T YF+CSCERC D
Sbjct: 218 TISYMPQLYWPVALRREELANTYYFHCSCERCRD 251
>gi|145499622|ref|XP_001435796.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402931|emb|CAK68399.1| unnamed protein product [Paramecium tetraurelia]
Length = 539
Score = 45.8 bits (107), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
CAVC CS CK V YC +HQ++HW +H+ C+ +
Sbjct: 8 CAVCSLKTTFGCSQCKSVFYCSVEHQRQHWSVHQQSCQSMK 48
>gi|195387832|ref|XP_002052596.1| GJ20766 [Drosophila virilis]
gi|194149053|gb|EDW64751.1| GJ20766 [Drosophila virilis]
Length = 126
Score = 45.8 bits (107), Expect = 0.052, Method: Composition-based stats.
Identities = 30/93 (32%), Positives = 43/93 (46%), Gaps = 6/93 (6%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ P+ GR L + + GD + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLEHDPVYGRCLATSEAVESGDVVVEELPFAYGPKRE-SGIVCLGCYRFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C S + DA H EC I
Sbjct: 61 SLDRCELCDWPICG---SCMDDADDHRGECEIF 90
>gi|452978501|gb|EME78264.1| hypothetical protein MYCFIDRAFT_80693 [Pseudocercospora fijiensis
CIRAD86]
Length = 466
Score = 45.8 bits (107), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 37/161 (22%), Positives = 61/161 (37%), Gaps = 25/161 (15%)
Query: 197 VYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICL 256
VY + Y+ + LE L G+V+ G+ N + QG++++
Sbjct: 188 VYVAMSPRYHYVSNEALETLKGEVVK---------AMGIFSANNFILAGGNQGVFALASR 238
Query: 257 MEHNCLPNAKHS---NMMQSKLFVFRDTHIS----TMYTNALWGTQP---RREHLAITKY 306
+ H+C+PN H+ N+ + + RD Y A P R E L
Sbjct: 239 INHSCVPNVHHTNNPNIRRETVHAMRDIEAGEELLANYLGAGATYDPRLTRMEALRNNHG 298
Query: 307 FNCSCERCSDPT------ELGTYFSAMKCLNEHKDQGDCWI 341
F C C+ C DP + + F + EH +G +I
Sbjct: 299 FICQCQACMDPNSDERRHSISSIFWGLNAYMEHSTEGHPYI 339
>gi|444320821|ref|XP_004181067.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
gi|387514110|emb|CCH61548.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
Length = 644
Score = 45.8 bits (107), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 8/85 (9%)
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPN----AKHSNMMQSKLFVFRDTHIS----TMYTN 289
+ +I + LY + L+ HNC PN SN + L R+ +I T Y N
Sbjct: 415 IGRFNINQINGQLYPIASLINHNCQPNIHIEVSDSNSLSLTLITRRNINIDEELLTTYIN 474
Query: 290 ALWGTQPRREHLAITKYFNCSCERC 314
L G + RR L + F C CERC
Sbjct: 475 PLHGVKLRRRELLVNWGFLCHCERC 499
>gi|24762574|ref|NP_611890.1| zinc finger protein RP-8 [Drosophila melanogaster]
gi|7291749|gb|AAF47170.1| zinc finger protein RP-8 [Drosophila melanogaster]
Length = 347
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSPEHQRAHWPQHKPNC 164
>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
Length = 485
Score = 45.8 bits (107), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 74/310 (23%), Positives = 110/310 (35%), Gaps = 56/310 (18%)
Query: 53 KLPYEIKSSPLLG---RYLQATLDLHPGDRIARESPLIVGP---KLALAE-----PICLG 101
KLP ++ G R L AT L GD I + PL+V P KL L + P+C
Sbjct: 104 KLPRSVRIGACSGDKQRGLFATNALKAGDMIFEDLPLVVVPPMDKLVLMQSGKACPLCGS 163
Query: 102 CHKPLNPNLADNA-RCPRCFWPACSARCSGLSDAHTH----APECAILKLG--------C 148
+ N C C CS C HT C + L C
Sbjct: 164 SVSHSQHTIIMNGLDCGECGAVWCSKNCKKNDLTHTSLRHTRSRCRQINLNGWKKYESFC 223
Query: 149 ETLLAYNDYKYEAILPLRCLILQRRSPK----KYQELKDMEAHMSK-RGPGTEVYEEIDS 203
+ + Y I +IL + K+Q L D+ + K T + +D+
Sbjct: 224 KQYVFVAAYSIGIIYA--SMILDKNGKNGVHPKFQLLADVSQRIRKDSSDSTNLGGTLDA 281
Query: 204 IVKYLRSN------------FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLY 251
S+ F E P LDD + G +N V + ++
Sbjct: 282 SSGAFTSDDPEPMWTQALDLFKEAFPE--LDDMDMEAYLNCIGKFNINQV-----SGQIF 334
Query: 252 SVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNALWGTQPRREHLAITK 305
V + H+C PN ++ + +L ++ HI T Y N L G + RR L +
Sbjct: 335 PVYSFINHDCEPNVRYEIDDKLRLKLYARKHIKKGEELLTTYVNPLHGVKLRRRELRVNW 394
Query: 306 YFNCSCERCS 315
F C C+RC+
Sbjct: 395 GFLCHCDRCN 404
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 115/282 (40%), Gaps = 39/282 (13%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
EI S GR L+AT + GD I E L+ +C C K L RC
Sbjct: 5 EIFDSEGKGRGLRATRESWAGDVIFAEPAYSAVVFDNLSHCVCHSCFKRQEKLL----RC 60
Query: 117 PRC-FWPACSARCSGLSDAHTHAPEC-AILKLGCETLLAYNDYKYEAILPLRCLILQRRS 174
+C F C C S + H EC AI K G A N+ L R L R
Sbjct: 61 GQCKFAHYCDRTCQKESWVN-HKHECLAIKKAG----KAPNE---NIRLAARILWRIERE 112
Query: 175 PKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL 229
E + D++ H+ K G G E S+++ ++ F++ P + +
Sbjct: 113 GGGLTEGCLVSIDDLQNHLDKFGDG-----EKSSLMEDVQK-FMDFWPSQS-QQFGMQYI 165
Query: 230 HWICGVIEVNGVDIGRYTQGLYSV-------ICLMEHNCLPNAK---HSNMMQSKLF--V 277
I VI NG + +GL +V +CL H+C PN ++ ++ + +
Sbjct: 166 SHIFSVISCNGFTLSD-QRGLQAVGVAIFPNLCLTNHDCWPNCTVIFNNGKIELRALGKI 224
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
+ ++ Y + L T+ R E L YF+C+CE C++ T+
Sbjct: 225 NKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCTNKTK 266
>gi|255732233|ref|XP_002551040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240131326|gb|EER30886.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 474
Score = 45.8 bits (107), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 66/300 (22%), Positives = 107/300 (35%), Gaps = 56/300 (18%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNA----R 115
G+ L A D+ GD + E PL P LA + C C K L + A
Sbjct: 118 GKGLYAKKDIKKGDLMWSEEPLFFVPPLANVHLMKNASACTYCGKLLQQSEASAVLKGLD 177
Query: 116 CPRCFWPACSARCS-------GLSDAHTHAPECAILK-LGCETLLAYNDY----KYEAIL 163
C C CS +C GL + + P K + E L +Y ++ A+
Sbjct: 178 CNVCAEVWCSKKCKQLEGTLHGLLKHNVYNPSSKRSKTIDAEAFLEIQEYCLEEQWNALY 237
Query: 164 PLRCL----ILQRRSPK--------------KYQELKDMEAHM-SKRGPGTEVYEEIDSI 204
+ + ++ R K +Y+ L S G V E+ +S+
Sbjct: 238 AITLIYANCVIDRTGVKQKQFNAMARVSQDIRYKALNSSAGTFDSLNGGALFVQEQQESL 297
Query: 205 VKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN 264
+ FL P + +D + ++ G +N +D ++ + + HNC PN
Sbjct: 298 WRIGYEKFLRVFPKNPVD---YREFLYMMGTYNINNLD-----SNVFLIQSHLNHNCSPN 349
Query: 265 AKHSNMMQS----KLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCSD 316
MQ K+F RD ++T Y N R+ L + F C C +C +
Sbjct: 350 TSVETEMQRTDGLKVFAARDIKSGEELTTTYVNPSHTVHQRQRELRVNWGFVCGCAKCKE 409
>gi|201066195|gb|ACH92507.1| FI09725p [Drosophila melanogaster]
Length = 350
Score = 45.8 bits (107), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 22/37 (59%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC +HQ+ HW HKP C
Sbjct: 131 CAACGCHAPHACSKCKAIHYCSPEHQRAHWPQHKPNC 167
>gi|345560264|gb|EGX43389.1| hypothetical protein AOL_s00215g125 [Arthrobotrys oligospora ATCC
24927]
Length = 484
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 48/98 (48%), Gaps = 9/98 (9%)
Query: 232 ICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNAKHS---NMMQSKLFVFRDTHISTM 286
+ G+ + N + G + G+Y ICL+ H+C+PN H+ N+ Q + R +
Sbjct: 239 LSGIFKTNAIPCGYNSSIAGIYLTICLINHSCIPNTYHNWNENLEQETIHAIRPINAGEE 298
Query: 287 YTNALWG---TQPRREHLAITKYFNCSCERC-SDPTEL 320
T + + PRR+ L F+C C+ C SDP +L
Sbjct: 299 ITISYISESMSNPRRKRLQECFGFDCQCQLCTSDPKQL 336
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 48/216 (22%), Positives = 85/216 (39%), Gaps = 37/216 (17%)
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTNALWGTQPRREHL 301
GLY VI ++ H+C PNA + + + V R + ++ Y T R++ L
Sbjct: 160 GLYPVISIINHSCFPNA--VLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKKSL 217
Query: 302 AITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCG 356
+F+C C RC D +C ++H C ++ D+ C
Sbjct: 218 KEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRCSSDH-----CEGFLLHDPDDAQSLVCQ 272
Query: 357 SCSARLNARDVHLVTSQLGEQVDKLVQENPNV---------KSLEEMLTKLEA-MFHPHH 406
C N + T + +VDKL +E + +SL E + +L+ ++HP+
Sbjct: 273 LCGCGRNEEE----TKKQARKVDKLGKEASKLLSSGNYSEARSLYEQIQQLQTQLWHPYS 328
Query: 407 YHCYAVKHSLI----QLYGTQPGYAYTQLSSSLLER 438
+L+ +LY + Y +L+ ER
Sbjct: 329 VILLRTGDTLLKICMELYDWKQALKYCRLTIPAYER 364
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 14 RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC VC T L +CS CK YCG Q+ W+LH+ +CK +
Sbjct: 68 RCDVCFRLCTNLKRCSVCKTTWYCGGTCQRNGWRLHQHECKAI 110
>gi|195146634|ref|XP_002014289.1| GL19121 [Drosophila persimilis]
gi|194106242|gb|EDW28285.1| GL19121 [Drosophila persimilis]
Length = 142
Score = 45.8 bits (107), Expect = 0.062, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 3/93 (3%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ + GR L + GD + E P GPK + +CLGCH+ L +
Sbjct: 2 LKARLEHDAVFGRCLAVAEPVQRGDLLVEELPFAFGPKRD-SGIVCLGCHQYLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C G D H EC +
Sbjct: 61 SLDRCELCDWPLCGVCAGGDDDGIHHRSECVVF 93
>gi|170113033|ref|XP_001887717.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164637355|gb|EDR01641.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1168
Score = 45.8 bits (107), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 9 DEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + +C VCR + C+ C++V YC + QK+ WK HKPKCK
Sbjct: 1121 DPDAHKCRVCRRKGKPKIKVCAGCQKVRYCSSECQKKDWKTHKPKCK 1167
>gi|302692316|ref|XP_003035837.1| hypothetical protein SCHCODRAFT_105332 [Schizophyllum commune H4-8]
gi|300109533|gb|EFJ00935.1| hypothetical protein SCHCODRAFT_105332, partial [Schizophyllum
commune H4-8]
Length = 543
Score = 45.8 bits (107), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA+C + +C+ C V+YCG+ QK HWK HK C+ +
Sbjct: 366 CAICGKKTTSRCTGCLSVSYCGQACQKAHWKEHKEFCRTI 405
>gi|302681111|ref|XP_003030237.1| hypothetical protein SCHCODRAFT_236142 [Schizophyllum commune H4-8]
gi|300103928|gb|EFI95334.1| hypothetical protein SCHCODRAFT_236142 [Schizophyllum commune H4-8]
Length = 565
Score = 45.4 bits (106), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
L +C+ C+ YCG+QHQ+ HW HK +C K+ +
Sbjct: 532 LLRCAKCRSTVYCGQQHQRAHWSQHKQRCFKVEW 565
>gi|195489535|ref|XP_002092780.1| GE14382 [Drosophila yakuba]
gi|194178881|gb|EDW92492.1| GE14382 [Drosophila yakuba]
Length = 347
Score = 45.4 bits (106), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 21/37 (56%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC HQ+ HW HKP C
Sbjct: 128 CAACGCHAPHACSKCKAIHYCSSDHQRAHWPQHKPNC 164
>gi|195121108|ref|XP_002005063.1| GI20261 [Drosophila mojavensis]
gi|193910131|gb|EDW08998.1| GI20261 [Drosophila mojavensis]
Length = 349
Score = 45.4 bits (106), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 25/49 (51%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C A CS CK + YC HQ+ HWK HKP C S+PL
Sbjct: 128 CAACGCLAPLACSRCKNINYCSSSHQRAHWKQHKPSCATGQSAQPSAPL 176
>gi|383849521|ref|XP_003700393.1| PREDICTED: uncharacterized protein LOC100881996, partial
[Megachile rotundata]
Length = 402
Score = 45.4 bits (106), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 15 CAVCRE----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C VC++ +L +C C ++YC K+HQKEHW HK CK +
Sbjct: 6 CHVCKKFGEWISLKRCGNCTMISYCSKEHQKEHWSQHKDLCKAI 49
>gi|198476005|ref|XP_001357231.2| GA14318 [Drosophila pseudoobscura pseudoobscura]
gi|198137505|gb|EAL34300.2| GA14318 [Drosophila pseudoobscura pseudoobscura]
Length = 142
Score = 45.4 bits (106), Expect = 0.071, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 3/93 (3%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ + GR L + GD + E P GPK + +CLGCH+ L +
Sbjct: 2 LKARLEHDAVFGRCLAVAEPVQRGDLLVEELPFAFGPKRD-SGIVCLGCHQYLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C G D H EC +
Sbjct: 61 SLDRCELCDWPLCGVCAGGDDDGIHHRSECVVF 93
>gi|397567519|gb|EJK45634.1| hypothetical protein THAOC_35746 [Thalassiosira oceanica]
Length = 351
Score = 45.4 bits (106), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 46/98 (46%), Gaps = 10/98 (10%)
Query: 234 GVIEVNGVDIG--RYTQGLYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDTHIS---- 284
GV + N +G GL+ + + H+C PN H + Q+ +F RD I
Sbjct: 145 GVYDTNSYQLGGDESHGGLFLTLARINHSCRPNVVHCWRPGLQQTLVFALRDIEIGEELF 204
Query: 285 TMYT-NALWGTQPRREHLAITKYFNCSCERCSDPTELG 321
T+Y + T+ RRE+L FNC+CE C++ G
Sbjct: 205 TIYGPSECMDTKGRREYLHDRFAFNCTCEMCTEGNNNG 242
>gi|170036771|ref|XP_001846235.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879678|gb|EDS43061.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 446
Score = 45.4 bits (106), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 38/87 (43%), Gaps = 22/87 (25%)
Query: 14 RCAVCRETA-----------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
RC VC A L C C+ + YCG+QHQKE WK HK C+ + +
Sbjct: 13 RCNVCFTRATEHRLTLVGIKLRYCKGCRLIGYCGEQHQKEDWKHHKDFCR-----VVTKV 67
Query: 63 LLGRYLQATLDLHPGDRIARESPLIVG 89
L R + LD + E P IVG
Sbjct: 68 LTIRKIDHILD------VRTECPRIVG 88
>gi|449669894|ref|XP_002157768.2| PREDICTED: uncharacterized protein LOC100209808 [Hydra
magnipapillata]
Length = 519
Score = 45.4 bits (106), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
M S I + + C V L +C CK + YC ++HQK+HW +HK CK+ E+ S
Sbjct: 1 MTSSISLNELYCFYCGV-DNIPLLRCVQCKSIWYCSREHQKKHWSIHKSFCKQKKKELVS 59
>gi|195383402|ref|XP_002050415.1| GJ20214 [Drosophila virilis]
gi|194145212|gb|EDW61608.1| GJ20214 [Drosophila virilis]
Length = 354
Score = 45.4 bits (106), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 24/51 (47%), Positives = 30/51 (58%), Gaps = 3/51 (5%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC--KKLPYEIKSSPL 63
CA C A H CS CKE+ YC HQ+ HW+ HK C +LP E+ + PL
Sbjct: 128 CAACGCLAPHACSRCKEINYCCAAHQRAHWQQHKQHCGGGRLP-EVANKPL 177
>gi|397600303|gb|EJK57632.1| hypothetical protein THAOC_22302 [Thalassiosira oceanica]
Length = 300
Score = 45.4 bits (106), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 40/82 (48%), Gaps = 6/82 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ I ++D CA C +T L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSRCIAVDDGSAGVCANCGKTGSDTVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 60
Query: 56 YEIKSSPLLGRYLQ-ATLDLHP 76
E+K L + L+ A D P
Sbjct: 61 AELKDEQLYSQGLERAAGDFCP 82
>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
Length = 410
Score = 45.4 bits (106), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 59/296 (19%), Positives = 110/296 (37%), Gaps = 37/296 (12%)
Query: 62 PLLGRYLQATLDLHPGDRIARESPLIVGP-----KLALAEPICLGCHKPLNPNLADNARC 116
P+ G+ L A+ D+ + + E P + P LA + +C C KP+ C
Sbjct: 102 PVTGKGLFASRDIKQDEILFTEEPYVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHC 161
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAY-NDYKYEAILPLRCLILQRRSP 175
C CS C + H EC L + + +++A + + + +
Sbjct: 162 KHCDMYYCSKSCRATAWDQFHQLECTRLNPAITPFMNFCEGERWQAPMAVARIYARLILA 221
Query: 176 KKYQELKDM------------EAHMSKRG-------PGTEVYEEIDSIVK--YLRSNFLE 214
+ +EL D+ E +K P E++ + +++ Y +
Sbjct: 222 HQREELNDVIGRLDAFATVSQEERQAKETEWIFMEQPTRELWNKARDLLRKAYKQPPAKC 281
Query: 215 KLPGDVLDDTSAKCLHWICGVIEVNG-VDIGRYTQGLYSVICLMEHNCLPN--------- 264
K+ D+ + K + G +I G+Y V + HNC PN
Sbjct: 282 KITQDLPEALKEKLFEDEETFLNFMGKFNINNQNGGMYLVHSHINHNCYPNVSIDYPNPK 341
Query: 265 AKHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
+++ +++ + + + Y N W + R+ +L + FNC CERC EL
Sbjct: 342 SQYKIAVRAVRDIKQGEQLFETYVNPRWNKETRQTYLDKSYLFNCKCERCEKDLEL 397
>gi|396484759|ref|XP_003842008.1| hypothetical protein LEMA_P077680.1 [Leptosphaeria maculans JN3]
gi|312218584|emb|CBX98529.1| hypothetical protein LEMA_P077680.1 [Leptosphaeria maculans JN3]
Length = 382
Score = 45.4 bits (106), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%)
Query: 5 IEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+++ + L CA C A KCS C+ + YCGK Q HWK HK C++
Sbjct: 1 MQMTPQPLTTCAACGNPATDKCSGCRSINYCGKFCQTTHWKSHKLTCEQ 49
>gi|356518197|ref|XP_003527768.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 734
Score = 45.4 bits (106), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
CAVCR A KCS CK V YC + Q+ HWK HK +CK+
Sbjct: 57 CAVCRNPANKKCSRCKSVRYCSQACQQAHWKSEHKMRCKEF 97
>gi|260835866|ref|XP_002612928.1| hypothetical protein BRAFLDRAFT_278662 [Branchiostoma floridae]
gi|229298310|gb|EEN68937.1| hypothetical protein BRAFLDRAFT_278662 [Branchiostoma floridae]
Length = 287
Score = 45.4 bits (106), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 36/69 (52%), Gaps = 2/69 (2%)
Query: 14 RCAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE-IKSSPLLGRYLQAT 71
+C C E A KCSACK+V YCG+ QK HW HK C L E ++ L + Q +
Sbjct: 219 QCFTCGELQAEKKCSACKKVKYCGQACQKLHWFTHKKVCATLKAEYLQEQELAQKMKQQS 278
Query: 72 LDLHPGDRI 80
L+ G I
Sbjct: 279 LEGQEGTDI 287
>gi|452980714|gb|EME80475.1| hypothetical protein MYCFIDRAFT_156218 [Pseudocercospora
fijiensis CIRAD86]
Length = 235
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 3/51 (5%)
Query: 7 IEDEELM-RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
I EEL +CA C +T L KC+ C+ + YC ++ Q HWK+HK +C +L
Sbjct: 9 INAEELQQKCARCSKTGGPLKKCAKCRSILYCDRECQTLHWKMHKKECSRL 59
>gi|356509862|ref|XP_003523663.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 742
Score = 45.1 bits (105), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 22/41 (53%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
CAVCR A KCS CK V YC + Q+ HWK HK +CK+
Sbjct: 60 CAVCRNPANKKCSRCKSVRYCSQACQQMHWKSEHKVRCKEF 100
>gi|332018918|gb|EGI59464.1| Egl nine-like protein 1 [Acromyrmex echinatior]
Length = 500
Score = 45.1 bits (105), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 26/42 (61%), Gaps = 1/42 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
CAVC +T L +CS CK V YC K+HQ+ WK H+ C P
Sbjct: 28 CAVCDKTDKLSRCSRCKVVFYCTKEHQRRDWKRHREFCATHP 69
>gi|321478511|gb|EFX89468.1| hypothetical protein DAPPUDRAFT_40294 [Daphnia pulex]
Length = 464
Score = 45.1 bits (105), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 19/33 (57%), Positives = 21/33 (63%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
E L CS C+ +AYCG HQKEHW HK CK
Sbjct: 46 EIKLLSCSGCRMIAYCGLAHQKEHWAQHKDLCK 78
>gi|397634138|gb|EJK71301.1| hypothetical protein THAOC_07282, partial [Thalassiosira oceanica]
Length = 1664
Score = 45.1 bits (105), Expect = 0.093, Method: Composition-based stats.
Identities = 25/60 (41%), Positives = 29/60 (48%), Gaps = 5/60 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
CA C E L C+AC V YCG Q+ H HK CKK EIK L G+ L+
Sbjct: 1256 CANCGEEGSDAVKLRNCTACFLVKYCGVDCQRAHRTRHKKACKKRAAEIKDEKLYGQGLE 1315
>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 45.1 bits (105), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 70/335 (20%), Positives = 122/335 (36%), Gaps = 73/335 (21%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
+++ + GR+L A D++PG+ I E P + L C C P + C
Sbjct: 203 DVQYNEKYGRHLVAKRDINPGEIIFIEEPYMHCLDLVRGYTYCFHCLTPCLITIP----C 258
Query: 117 PRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQ---- 171
C W CS C + H ECA+ E + + K+ A+ L C + +
Sbjct: 259 EHCGWAMFCSEGCKQQAWVKYHDLECAVYDFAKENVDG-DGVKHMAVKSLICAVREAGGV 317
Query: 172 ---RRSPKKYQELKDMEAHMSKRGP-GTEVYEEI----------------DSIVKYLR-- 209
R K + D K G + ++ I ++ + LR
Sbjct: 318 DQLRDELKAFDSCTDKLKGFVKDGKIQSSGFKSIYALSSNTSDKAEPIHKNNTIMILRAL 377
Query: 210 ---SNFLEKLPG-----DVLDDTSAKCL----HWICGVIEVNG--VDIGR--YTQGLYSV 253
+ + K PG ++ D L + + + ++N + IGR YT GL +
Sbjct: 378 VKNTKYFGKKPGFEKTEELKKDDKVLFLGSLVYKLSKIFQLNSRIIPIGRDFYTTGLDAR 437
Query: 254 IC-----------------LMEHNCLPNAKHSNMMQSKLFVF------RDTHISTMYTNA 290
+C L+ H+C+PN K + V+ + + + Y
Sbjct: 438 MCENKQCCTTGLYIAPITSLLNHSCIPNVKRCFSNNYSVIVYAVQPIKKGSQLFDCYQQE 497
Query: 291 LW--GTQPRREHLAITKYFNCSCERCSDPTELGTY 323
+ PR++HL T FNC C+ C + ++ Y
Sbjct: 498 FYEYNISPRQKHLKKTYNFNCDCKACKEKWDIVEY 532
>gi|397588547|gb|EJK54309.1| hypothetical protein THAOC_26078, partial [Thalassiosira
oceanica]
Length = 314
Score = 45.1 bits (105), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 38/78 (48%), Gaps = 5/78 (6%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + I D+ + CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSCVPVIMDDGIKTCANCGKQGSDSVVLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 60
Query: 56 YEIKSSPLLGRYLQATLD 73
E+K L G+ + D
Sbjct: 61 AELKDEQLFGQGKRGMKD 78
>gi|156408512|ref|XP_001641900.1| predicted protein [Nematostella vectensis]
gi|156229041|gb|EDO49837.1| predicted protein [Nematostella vectensis]
Length = 430
Score = 45.1 bits (105), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
C VC E ++ KC+ACK+V YC QK HW HK C++L E + L+
Sbjct: 315 CYVCGERRSVKKCAACKKVGYCSVSCQKLHWSTHKKHCQRLAKEYEQELLM 365
>gi|198438401|ref|XP_002126749.1| PREDICTED: similar to programmed cell death 2 [Ciona intestinalis]
Length = 340
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
C +C + KCS C+EV YC K HQ HWK LHK +C K
Sbjct: 129 CGLCGNHGVKKCSRCREVQYCSKDHQVFHWKLLHKYQCSK 168
>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
rerio]
Length = 231
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 86/222 (38%), Gaps = 34/222 (15%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ SP GR L+AT + GD + E P + A IC C + RC
Sbjct: 5 EVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFR----RQEKLQRC 60
Query: 117 PRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR-- 173
+C F C C + H ECA +K Y E + L IL R
Sbjct: 61 GQCRFAQYCDKTCQ-RAGWEEHKLECAAIK-------TYGKPPSENV-RLAARILWRMDK 111
Query: 174 -----SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
S + L+D+E H+ ++ E+ K NFL+ P + T
Sbjct: 112 QGSVVSDNQLTTLEDLEDHI------CDISEDDLKDFKVDIHNFLDYWPRNSKPHTVDSV 165
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN 264
H I GVI NG + R Q GL+ +CL+ H+C PN
Sbjct: 166 SH-ILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWPN 206
>gi|389747799|gb|EIM88977.1| hypothetical protein STEHIDRAFT_52815 [Stereum hirsutum FP-91666 SS1]
Length = 1144
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Query: 7 IEDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+ D E +C VCR + L C CK+V YC QK WK+HK +CK
Sbjct: 1095 LRDPEARKCFVCRGKGKPKLMTCKGCKKVRYCSDACQKRDWKVHKKRCK 1143
>gi|195429543|ref|XP_002062817.1| GK19653 [Drosophila willistoni]
gi|194158902|gb|EDW73803.1| GK19653 [Drosophila willistoni]
Length = 574
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 49/99 (49%), Gaps = 8/99 (8%)
Query: 224 TSAKCLHWICGVIEVNGV-DIGRYTQGLYSVICLMEHNCLPNA-KHSNMMQSKLFVFRDT 281
TS +H I V +VN D ++ G Y+ + L+ H+C PN + ++ LFV R
Sbjct: 393 TSPSNMHGIDLVEQVNETKDDQTHSSGAYAFLSLLNHSCAPNTLRIYEGTKAYLFVLRPI 452
Query: 282 HIS-TMYTN-----ALWGTQPRREHLAITKYFNCSCERC 314
+Y N A++ + R+E L++ FNC CE C
Sbjct: 453 KAGDVLYDNYGAHFAIFSKKERQETLSMQYRFNCKCEAC 491
>gi|449541346|gb|EMD32330.1| hypothetical protein CERSUDRAFT_118710 [Ceriporiopsis subvermispora
B]
Length = 446
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 5/49 (10%)
Query: 9 DEELMR-CAVCR----ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
DE L+R C CR + L +C+AC V YC KQ QKE W+ HK CK
Sbjct: 211 DERLLRRCETCRVREPQKTLFRCAACGMVWYCSKQCQKEGWQTHKEICK 259
>gi|308512211|ref|XP_003118288.1| CRE-SET-30 protein [Caenorhabditis remanei]
gi|308238934|gb|EFO82886.1| CRE-SET-30 protein [Caenorhabditis remanei]
Length = 548
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 61/230 (26%), Positives = 97/230 (42%), Gaps = 36/230 (15%)
Query: 124 CSARCSGLSDAHTHAPECAILKLGC-----ETLLAYNDYKYEAIL-----PLRCLILQRR 173
CS +C A H EC ++ E +L +Y+ IL + R
Sbjct: 62 CSKQCQVFG-AFDHKYECGAIQKCADLNTDERMLIRIIGRYKDILDGNDKKIDGFYTNRE 120
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEI-DSIVKYLRSNFLEKLPGDVLDDTSAKCLHW- 231
S + ++ + A M K V+++I D + ++ N+L +D+ A LH
Sbjct: 121 SGRTVMQIWEHCADMKKDEHAMNVFKKIYDRVKQFGDKNYL-------VDEEVAFQLHSR 173
Query: 232 -ICGVIEVNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HIS 284
++ VD R +GLY +C +H+C PNA +S N +KL D ++
Sbjct: 174 NFINRHSISNVDYLREIGKGLYLDLCRYDHSCRPNAIYSCNGTVAKLRALHDNVDLENVE 233
Query: 285 TMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
T T+ + P RR L T YF C CERC DP + + +A+ C
Sbjct: 234 T--THYTYIELPPCKIQRRHMLKETWYFECHCERCEDPED--NWLTAVIC 279
>gi|397600839|gb|EJK57744.1| hypothetical protein THAOC_22182 [Thalassiosira oceanica]
Length = 415
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 5/74 (6%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + + DE CA C A L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 111 MSCFVPVADEGNETCANCGTAASDSVKLRDCAACRLVKYCGVDCQRAHRKQHKKACKQRA 170
Query: 56 YEIKSSPLLGRYLQ 69
E+K L + L+
Sbjct: 171 AELKDEELYSQGLE 184
>gi|302828706|ref|XP_002945920.1| hypothetical protein VOLCADRAFT_86388 [Volvox carteri f.
nagariensis]
gi|300268735|gb|EFJ52915.1| hypothetical protein VOLCADRAFT_86388 [Volvox carteri f.
nagariensis]
Length = 1263
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 28/43 (65%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+ CA+C + AL +C+ CK YC ++ Q++HW HK +C +L
Sbjct: 1 MASCAICDKPALFRCARCKVERYCSRECQRDHWPHHKTRCVQL 43
>gi|402587997|gb|EJW81931.1| MYND finger family protein, partial [Wuchereria bancrofti]
Length = 348
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 19/98 (19%)
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLF---------VFRDTH 282
CG IE +IG+ GLY +C +H+C PN ++ N + L TH
Sbjct: 59 CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCNSFVATLRGLTAGVDLRNLNSTH 111
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
S Y + + TQ RR+ L T YF C C RC DP ++
Sbjct: 112 YS--YIDLINTTQQRRKLLKDTWYFECHCTRCDDPDDV 147
>gi|156551085|ref|XP_001602963.1| PREDICTED: egl nine homolog 1-like [Nasonia vitripennis]
Length = 440
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C VC R L +CS CK + YC K+HQK WK HK C +L
Sbjct: 26 CVVCGRNDKLLRCSRCKSIFYCTKEHQKLDWKHHKSVCSRL 66
>gi|125810725|ref|XP_001361596.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
gi|195154214|ref|XP_002018017.1| GL16991 [Drosophila persimilis]
gi|54636772|gb|EAL26175.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
gi|194113813|gb|EDW35856.1| GL16991 [Drosophila persimilis]
Length = 574
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 124/325 (38%), Gaps = 57/325 (17%)
Query: 44 WKLHKPKCKKLPY-----EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPI 98
+KL ++PY E++ SP GR++ DL GD +A E P +
Sbjct: 170 FKLSFDSHAQVPYIADCLELRESPDEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIR 229
Query: 99 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK-----------LG 147
C C + L C C CS C + + H EC I+ +
Sbjct: 230 CATCKRENYLTLIPCDSC--CSVMFCSEECKLRATSSFHRFECPIIDFLHRMFNKIHGIA 287
Query: 148 CETLLAYNDYKYEAILPLRCLILQRRSPKK------YQELKDMEAHMSKRGPGTEVYEEI 201
T +A D Y +I L Q ++ K Y +L E + + G T +
Sbjct: 288 LRTTMAALDL-YPSIEELMAFCEQPQNQHKCAFDLDYSQLSAQEHYRAIHGLVTNQHLRS 346
Query: 202 DS---------------IVKYL-RSNFLEKLPG-----DVLD---DTSAKCLHWICGVIE 237
S I++Y +FL G D+L TS +H I V +
Sbjct: 347 VSDLFQRSVVCAVLKHFIIEYTPLKDFLGGEDGRNFFTDLLFRHLQTSPSNMHGIDLVEQ 406
Query: 238 VNGV-DIGRYTQGLYSVICLMEHNCLPNA-KHSNMMQSKLFVFRDTHI-STMYTN----- 289
VN D ++ G Y+ + L+ H+C PN + ++ LFV R + +Y N
Sbjct: 407 VNETKDDQTHSSGAYAFLSLLNHSCAPNTLRIYEGTKAYLFVLRPIKAGNVLYDNYGAHF 466
Query: 290 ALWGTQPRREHLAITKYFNCSCERC 314
A++ Q R + L++ F+C CE C
Sbjct: 467 AVFSKQQRLDTLSMQYRFDCKCEGC 491
>gi|389743859|gb|EIM85043.1| hypothetical protein STEHIDRAFT_81876 [Stereum hirsutum FP-91666
SS1]
Length = 171
Score = 44.7 bits (104), Expect = 0.12, Method: Composition-based stats.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 2/41 (4%)
Query: 15 CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CA C RE + +C+ C +V YCG ++HW HKP CK+
Sbjct: 128 CAGCGKREVKMKQCNKCHKVLYCGAVCLEQHWPTHKPICKQ 168
>gi|158289961|ref|XP_311564.4| AGAP010381-PA [Anopheles gambiae str. PEST]
gi|157018410|gb|EAA07249.4| AGAP010381-PA [Anopheles gambiae str. PEST]
Length = 91
Score = 44.7 bits (104), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
C C A KC+ C++V YC HQK+HWK PYE+ S +GR+ AT +
Sbjct: 5 CGYCGVPAKLKCAGCQQVYYCNPDHQKKHWKAKHKHECVKPYELTKSDEIGRHFVATKTI 64
Query: 75 HPGDRIARESPLIVGPKLALAE 96
+ E+PL++GPK LA+
Sbjct: 65 EKDTILFSENPLVIGPKWNLAD 86
>gi|340714736|ref|XP_003395881.1| PREDICTED: hypothetical protein LOC100649411 [Bombus terrestris]
Length = 376
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C +C T L +CS CK V+YC K+HQ+ WK HK C
Sbjct: 24 CVICNRTDKLLRCSRCKAVSYCTKEHQRRDWKRHKEFC 61
>gi|321462665|gb|EFX73687.1| hypothetical protein DAPPUDRAFT_227055 [Daphnia pulex]
Length = 1025
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 30/52 (57%), Gaps = 4/52 (7%)
Query: 7 IEDEELMR--CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
IEDE CAVC++T L +CS C+ YC + Q HW +HK CKKL
Sbjct: 974 IEDEPADEKCCAVCKKTPEKLKRCSRCRSTVYCSVECQHSHWPMHKLICKKL 1025
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 44.7 bits (104), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 63/266 (23%), Positives = 102/266 (38%), Gaps = 40/266 (15%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR + AT DL G E P + A C GC + L ++ + C + C
Sbjct: 17 GRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQTCSGCFR-LAAHMQQCSSCKVVQY--C 73
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQE---- 180
S C SD H PEC K I P +L R K+ ++
Sbjct: 74 SQTCQR-SDWSIHKPECEGFK-----------AVQPRIPPSPVRLLGRMMFKRAKDCNEF 121
Query: 181 ---LKDMEAHMSKRGPGTEVYEEIDSIVKYLR--SNFLEKLPGDVLDDTSAKCLHWICGV 235
+ +E+H KR ++I+ I L+ S F+ P +L T+A + +C
Sbjct: 122 ERVVGQLESHRDKRAS-----KDIEHIAAMLQMASGFI---PPALLLSTTADMIA-LCCK 172
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMYTN 289
I+VN + R +Y + + H+C+PNA + + I+ Y +
Sbjct: 173 IQVNTMTTERGV-AIYDRLSTVNHSCVPNACLTFGIGGIARLSPMTAIASGDQINISYVD 231
Query: 290 ALWGTQPRREHLAITKYFNCSCERCS 315
+ R+ L YF+C+C C+
Sbjct: 232 VFQSCETRQRQLKEQYYFDCTCRLCT 257
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 7/61 (11%)
Query: 15 CAVCRETALH--KCSACKEVAYCGKQHQKEHWKLHKPKC---KKLPYEIKSSP--LLGRY 67
C+ C A H +CS+CK V YC + Q+ W +HKP+C K + I SP LLGR
Sbjct: 51 CSGCFRLAAHMQQCSSCKVVQYCSQTCQRSDWSIHKPECEGFKAVQPRIPPSPVRLLGRM 110
Query: 68 L 68
+
Sbjct: 111 M 111
>gi|170595415|ref|XP_001902374.1| MYND finger family protein [Brugia malayi]
gi|158589993|gb|EDP28779.1| MYND finger family protein [Brugia malayi]
Length = 464
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
CG IE +IG+ GLY +C +H+C PNA ++ + + + + +
Sbjct: 231 CGYIE----EIGK---GLYLDLCAYDHSCRPNAIYTCNSFVATLRGLTANVDLRNLSSTY 283
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
Y + + TQ RR+ L T YF C C RC DP +
Sbjct: 284 YSYIDLINTTQQRRKLLKDTWYFECHCTRCDDPDD 318
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 107/279 (38%), Gaps = 34/279 (12%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPN 109
L E S+ G L+A L PG+ + R PL G + + + LG K +
Sbjct: 4 LKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--- 60
Query: 110 LADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
RC +C CSA+C + H EC LK C+ + + + ++ +
Sbjct: 61 -----RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLM 113
Query: 169 ILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD------ 222
+K D+E+++SK + EE ++ L F + ++ D
Sbjct: 114 EETPSESEKLYSFYDLESNISK------LTEEKKEGIRQLAMTFQHFMREEIQDVSQLLP 167
Query: 223 --DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFR 279
D C + ++ GLY + L+ H+C PN + N L R
Sbjct: 168 AFDIFEAFAKVFCNAFTICNAEMQGVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVR 227
Query: 280 DTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
D ++ Y + L ++ RR+HL F+C C RC
Sbjct: 228 DIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRC 266
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 110/279 (39%), Gaps = 31/279 (11%)
Query: 65 GRYLQATLDLHPGDRIAR-ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPA 123
GR + AT PGD IAR ++P +V P A C C K P C C A
Sbjct: 19 GRSIYATRRFKPGDVIARFDNPAVVLPPGHRALEYCNHCVKKQRPAGVKLRACTGCKTVA 78
Query: 124 -CSARCSGLSDAHTHAPECAILKLGCETLLAYN-DYKYEAILPLRCLILQRRSPKKYQEL 181
C C + + H EC ++ E A+ D+ P+R P+
Sbjct: 79 YCGPACQRANWSLVHKLECKAIQRLHEAKPAHQPDW---VPTPIRAAAQVMLRPQVLARF 135
Query: 182 KDMEAHMS--KRGPGTEVYEEIDSIVK--YLRSNFLEKLPG--DVLDDTSAKCLHWICGV 235
+++E H+ ++ GT++ + +VK L + E+L VL
Sbjct: 136 EELEGHVEQWRKKDGTDLQLQSHGVVKCLGLDTVTFERLETSFQVLCKLQTNAFSRTEEY 195
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNA 290
E GV L + + ++ H+C+PNA + +++ F+ I Y +
Sbjct: 196 YETGGV-------FLDTTLAMINHSCVPNALVQFGGRTATLRATSFLDPGDEIEISYIDQ 248
Query: 291 LWGTQPR-REHLAITKY-FNCSCERCSDPTELGTYFSAM 327
TQPR + H + Y F CSC +C +L Y AM
Sbjct: 249 ---TQPRGKRHGELDLYHFECSCYKCQ--KDLDEYQVAM 282
>gi|357613780|gb|EHJ68714.1| putative Egl nine-like protein 1 [Danaus plexippus]
Length = 473
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
L CAVC + +C C V YC +HQ++ WK HK C
Sbjct: 7 LASCAVCNQQTQRRCGRCFSVYYCNTEHQRQDWKRHKINC 46
>gi|453089339|gb|EMF17379.1| hypothetical protein SEPMUDRAFT_35708 [Mycosphaerella populorum
SO2202]
Length = 240
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
CA C T L KC+ CK V YC ++ Q+ HWK HK +C +L E P
Sbjct: 10 CAKCNNTEGELKKCTKCKSVLYCNRECQRAHWKKHKRQCSRLAGEGAPPP 59
>gi|395327455|gb|EJF59854.1| hypothetical protein DICSQDRAFT_137978 [Dichomitus squalens
LYAD-421 SS1]
Length = 443
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC + A +C+ C+ +AYCG + Q+ W HK C+ L
Sbjct: 258 CAVCGKRATSRCAGCQSIAYCGSECQRSDWPDHKITCRSL 297
>gi|406988119|gb|EKE08230.1| hypothetical protein ACD_17C00276G0002 [uncultured bacterium]
Length = 361
Score = 44.3 bits (103), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 15 CAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
C+ C + +L +C C YCGKQHQ++HW HKP C+
Sbjct: 23 CSQCEKYNQSLLRCKQCLSTLYCGKQHQRQHWPTHKPVCQ 62
>gi|328778894|ref|XP_397368.4| PREDICTED: egl nine homolog 1-like [Apis mellifera]
Length = 486
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/38 (55%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C VC T L +CS CK V YC K+HQK WK HK C
Sbjct: 24 CVVCNRTDKLLRCSRCKAVFYCTKEHQKRDWKRHKEFC 61
>gi|323451400|gb|EGB07277.1| hypothetical protein AURANDRAFT_64957 [Aureococcus anophagefferens]
Length = 1781
Score = 44.3 bits (103), Expect = 0.15, Method: Composition-based stats.
Identities = 18/33 (54%), Positives = 18/33 (54%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
CS CK V YC QK HW HK C K P EI
Sbjct: 1747 CSRCKAVVYCDAACQKAHWATHKKHCAKKPAEI 1779
>gi|242208886|ref|XP_002470292.1| predicted protein [Postia placenta Mad-698-R]
gi|220730599|gb|EED84453.1| predicted protein [Postia placenta Mad-698-R]
Length = 380
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 27/38 (71%), Gaps = 3/38 (7%)
Query: 19 RETALHK---CSACKEVAYCGKQHQKEHWKLHKPKCKK 53
RET + + C ACK+ YCG++ QK+ WK HKPKC++
Sbjct: 325 RETTVAQFKCCGACKDAVYCGQECQKKDWKAHKPKCQE 362
>gi|51535049|dbj|BAD37420.1| putative ubiquitin-specific protease 15 [Oryza sativa Japonica
Group]
gi|51535599|dbj|BAD37542.1| putative ubiquitin-specific protease 15 [Oryza sativa Japonica
Group]
Length = 1069
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSP 62
CA C A KCS CK V YC + Q +HW+ HK KCK++ KSSP
Sbjct: 105 CAACGYIATKKCSGCKRVRYCSQGCQSKHWQSGHKFKCKQMK---KSSP 150
>gi|350414632|ref|XP_003490373.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
[Bombus impatiens]
Length = 439
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 30/49 (61%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
++ C++C E A +CS CKEV YCG++ Q + W HK C K+ I+
Sbjct: 387 DVKECSLCHEEAKKRCSKCKEVWYCGRECQVKDWVKHKNICDKITKSIE 435
>gi|336375477|gb|EGO03813.1| hypothetical protein SERLA73DRAFT_119451 [Serpula lacrymans var.
lacrymans S7.3]
Length = 98
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 22/48 (45%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D E RC VCR + L C+ CK+V YC + QK+ WK+HK +CK
Sbjct: 50 RDPEAYRCFVCRGKGKPKLKMCTVCKKVRYCSSECQKKDWKVHKLRCK 97
>gi|393911885|gb|EJD76488.1| CBR-EGL-9 protein [Loa loa]
Length = 483
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 43/92 (46%), Gaps = 9/92 (9%)
Query: 15 CAVCRE------TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
CAVC AL +C C VAYCG++H+ WK HKP CK + ++ + L
Sbjct: 10 CAVCGAPKSIDGNALSQCKQCSMVAYCGEEHRNFDWKRHKPLCKTVQSQLTRTG--ATTL 67
Query: 69 QATLDLHPGDRIA-RESPLIVGPKLALAEPIC 99
AT+ D++ + +V L + +C
Sbjct: 68 SATIGQRSVDKLTDSDGYTVVTAAKWLGDKLC 99
>gi|342320836|gb|EGU12774.1| Microtubule-associated protein, putative [Rhodotorula glutinis
ATCC 204091]
Length = 560
Score = 44.3 bits (103), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 29/62 (46%), Gaps = 3/62 (4%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDL 74
CAVC + A +C CK V +C + QK W HK CK P PL R L DL
Sbjct: 12 CAVCDQPAKSRCGGCKRVPFCSPRCQKLIWSTHKALCKADPDVFHPPPLSARELG---DL 68
Query: 75 HP 76
P
Sbjct: 69 RP 70
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 29/54 (53%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
E+ ++CAVC + +CS C + +CG QK W HK CK P S+PL
Sbjct: 281 EKTVKCAVCDKDTKLRCSGCGILPFCGAACQKLLWPTHKTLCKVDPEVFHSAPL 334
>gi|195472004|ref|XP_002088292.1| GE18492 [Drosophila yakuba]
gi|194174393|gb|EDW88004.1| GE18492 [Drosophila yakuba]
Length = 151
Score = 44.3 bits (103), Expect = 0.16, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ LLGR L + + G+ + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLQEDALLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C + C+ D H EC +
Sbjct: 61 SLDRCELCDWPLCGS-CADDEDVTEHRGECQVF 92
>gi|321469693|gb|EFX80672.1| hypothetical protein DAPPUDRAFT_103160 [Daphnia pulex]
Length = 2032
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CA C++T L +CS C+ ++YC + Q++HWK HK C K
Sbjct: 1991 CAACKQTTQNLKRCSRCRSISYCSVECQRQHWKQHKVFCTK 2031
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 29/54 (53%), Gaps = 2/54 (3%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
C C++ L +CS C+ YC + Q++HW HK K L + SP+LG+
Sbjct: 953 CNACQKVPQNLKRCSRCRTAVYCSVECQRQHWAQHKKASKMLRTPSRGSPVLGK 1006
>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
Length = 535
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
CG IE +IG+ GLY +C +H+C PN ++ + + + + +
Sbjct: 232 CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAH 284
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
Y + + TQ RR+ L T YF C C RCSDP +
Sbjct: 285 YSYIDLINTTQQRRKLLKDTWYFECHCTRCSDPDD 319
>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 499
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 11/83 (13%)
Query: 243 IGRY-----TQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNAL 291
IGRY + ++ + L+ HNC PN + + ++ V+ +IS T Y N L
Sbjct: 330 IGRYNINQISDQMFFLPSLINHNCEPNVRFEVVSNKEIRVYARKNISAGQELLTNYINPL 389
Query: 292 WGTQPRREHLAITKYFNCSCERC 314
G + RR L + F C C+RC
Sbjct: 390 HGVKLRRRELRVNYGFLCHCDRC 412
>gi|71654175|ref|XP_815712.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70880787|gb|EAN93861.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 584
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 127/322 (39%), Gaps = 49/322 (15%)
Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAIL-----KLG----CETLLAYNDYKYEAILPL 165
C RC F CS RC+ EC +L LG CE + +D+ A +
Sbjct: 211 CRRCQFVSYCSDRCASCHGRQHEEYECRLLFRLREMLGSMRSCEAAVP-DDFFTMATHCI 269
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+ + + ++ + +E+H + G T + + + F+ ++ G V +
Sbjct: 270 TTMSGVKMRKEGHEAVLRLESHEVEVSQGLTPLVRLVQDLFSGEDPTFVTRILGVVRCNA 329
Query: 225 SAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
A C + +G+ +G+ SV H+CLPN A + + + + +
Sbjct: 330 LAVC--------DASGLPVGQALHA-ASVTSYFNHSCLPNCAIEAGAIVTTRAIRPGEEL 380
Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERC--SDPTELGTYFSAMKCLNEHKDQGDCW 340
+ Y L W RRE LA +F+C C RC D + + SA L +++ +
Sbjct: 381 TISYLPQLYWPAWLRREELAERYFFDCRCVRCDDGDRSPFESALSATLRLGGSREKEREY 440
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
I V L CG R+ A+DV V +G++ +L +L +
Sbjct: 441 ISSVQIL-------CG----RVRAKDVGDV--DVGDR-----------DALLHLLQECRQ 476
Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
P HY C+ + ++L +Y
Sbjct: 477 HLFPFHYLCHELHNTLSFVYAV 498
>gi|340714716|ref|XP_003395871.1| PREDICTED: egl nine homolog 1-like [Bombus terrestris]
Length = 486
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 24/38 (63%), Gaps = 1/38 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C +C T L +CS CK V+YC K+HQ+ WK HK C
Sbjct: 24 CVICNRTDKLLRCSRCKAVSYCTKEHQRRDWKRHKEFC 61
>gi|194862042|ref|XP_001969907.1| GG23679 [Drosophila erecta]
gi|190661774|gb|EDV58966.1| GG23679 [Drosophila erecta]
Length = 207
Score = 44.3 bits (103), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 43/94 (45%), Gaps = 4/94 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ LLGR L + + G+ + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLQEDALLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILK 145
+ RC C WP C + C+ D H EC +
Sbjct: 61 SLDRCELCDWPLCGS-CADDEDVTEHRGECQVFS 93
>gi|397616736|gb|EJK64110.1| hypothetical protein THAOC_15187 [Thalassiosira oceanica]
Length = 299
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 10/74 (13%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ I E+E CA C + A L +C+AC+ V YCG Q+ H KLHK CK+
Sbjct: 1 MSGIQEVET-----CANCGKEASEAVKLKQCTACRLVKYCGVDCQRAHRKLHKKACKQRA 55
Query: 56 YEIKSSPLLGRYLQ 69
E+K L + L+
Sbjct: 56 AELKDEQLYSQGLE 69
>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
Length = 545
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 15/95 (15%)
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS--------NMMQSKLFVFRDTHIS 284
CG IE +IG+ GLY +C +H+C PN ++ + + + + +
Sbjct: 232 CGYIE----EIGK---GLYLDLCAYDHSCRPNTIYTCDGFVATLRGLTASVDLRNLSSAH 284
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
Y + + TQ RR+ L T YF C C RCSDP +
Sbjct: 285 YSYIDLINTTQQRRKLLKDTWYFECHCTRCSDPDD 319
>gi|125598076|gb|EAZ37856.1| hypothetical protein OsJ_22202 [Oryza sativa Japonica Group]
Length = 995
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 24/49 (48%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSP 62
CA C A KCS CK V YC + Q +HW+ HK KCK++ KSSP
Sbjct: 31 CAACGYIATKKCSGCKRVRYCSQGCQSKHWQSGHKFKCKQMK---KSSP 76
>gi|157866065|ref|XP_001681739.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68125037|emb|CAJ03122.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 622
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 48/211 (22%), Positives = 83/211 (39%), Gaps = 45/211 (21%)
Query: 229 LHWICGVI--------EVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD 280
L+ I GV+ + +G+ + + S+ H+C PN + ++ + R
Sbjct: 369 LYTIIGVLCCNALELADPSGLGVAQALHAGNSIASFFNHSCTPNCA-IDTVRHAIVTTRT 427
Query: 281 THI----STMYTNAL-WGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLN 331
H+ S Y L W T+ RRE L+ + YF C C+RC DP E L
Sbjct: 428 IHVGEELSIAYIPQLYWPTRLRRERLSESYYFVCRCQRCESSNKDPFERALSME----LP 483
Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
+ + P+ TC + +R+ V V+ + +++ KL++E
Sbjct: 484 TARREATKHFHPI------VQTTCATVRSRM----VGDVSQKDADELSKLLRE------- 526
Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSLIQLYGT 422
L P HY C+ V++ L +Y
Sbjct: 527 ------LSTHLFPFHYLCHEVRNCLSFVYAV 551
>gi|336375469|gb|EGO03805.1| hypothetical protein SERLA73DRAFT_158294 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1147
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D RC VCR + L C+ CK+V YC QK+ WK+HK +CK
Sbjct: 1099 RDPAAYRCFVCRGKGKPKLKMCTVCKKVRYCSSGCQKKDWKIHKLRCK 1146
>gi|336388523|gb|EGO29667.1| hypothetical protein SERLADRAFT_445454 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1163
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D RC VCR + L C+ CK+V YC QK+ WK+HK +CK
Sbjct: 1115 RDPAAYRCFVCRGKGKPKLKMCTVCKKVRYCSSGCQKKDWKIHKLRCK 1162
>gi|440799318|gb|ELR20373.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 362
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
E++ CA C A +CSACK V YC HQKEHW + HK +C+++
Sbjct: 107 EKVFFCAECGMRAEKRCSACKMVHYCTVTHQKEHWGRGHKEECEQI 152
>gi|326525078|dbj|BAK07809.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 372
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 27/50 (54%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C +C +C VAYC HQ HWK+HK +C +L ++ +L ++
Sbjct: 13 CTSGTHRRCGSCGAVAYCSNDHQFIHWKVHKEECARLATQMSRIDMLSQF 62
>gi|224100385|ref|XP_002311856.1| predicted protein [Populus trichocarpa]
gi|222851676|gb|EEE89223.1| predicted protein [Populus trichocarpa]
Length = 391
Score = 43.9 bits (102), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 28/54 (51%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
R C +A +C C VAYC HQ HW HK +C++L ++K +L +
Sbjct: 8 RGTRCMGSARRRCGRCGAVAYCSVSHQMSHWNEHKEECERLEQQMKRVDVLNDF 61
>gi|358382303|gb|EHK19975.1| hypothetical protein TRIVIDRAFT_47985 [Trichoderma virens Gv29-8]
Length = 448
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RCA C T L +C+ C V YC ++HQ HW LHK C K+
Sbjct: 21 RCAHCLSTGPKLLRCTGCLAVHYCSREHQAAHWPLHKSTCIKI 63
>gi|383853834|ref|XP_003702427.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
[Megachile rotundata]
Length = 441
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 31/50 (62%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
++ C +C E A ++CS CKE YCG++ Q + W+ HK C K+ +K+
Sbjct: 388 DVKECFLCHEEAKNRCSKCKEAWYCGRECQVKDWENHKHICNKITESVKN 437
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 43.9 bits (102), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 79/208 (37%), Gaps = 39/208 (18%)
Query: 124 CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ---- 179
CSA C +D H PECA LK + ++P ++ R +K +
Sbjct: 93 CSAACQK-ADWKDHKPECAALK------------RISPVVPATFVMFLARILRKMERNTG 139
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLR------SNFLEKLPGDVLDDTSAKCLHWIC 233
E+ ++ HM + + +I+++LR L K VL TSA
Sbjct: 140 EMDVLQLHMPGEPSDPQQQRGLFAILEHLRHFLPDAEKHLLKSAYPVLRITSANSF---- 195
Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------ISTM 286
G+ V G ++G GLY + + H+C PN S R H ++
Sbjct: 196 GISGVEGNNLG---VGLYDTVSYINHSCAPNC--SITFSGVYARVRSVHDLPPNQELTIA 250
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + RR HL F+C C RC
Sbjct: 251 YIDPCDPRAKRRAHLKSQFMFDCECSRC 278
>gi|397595780|gb|EJK56578.1| hypothetical protein THAOC_23507, partial [Thalassiosira oceanica]
Length = 564
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L GR L+
Sbjct: 384 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGRGLE 430
>gi|392559211|gb|EIW52396.1| hypothetical protein TRAVEDRAFT_75375 [Trametes versicolor
FP-101664 SS1]
Length = 178
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 45/114 (39%), Gaps = 22/114 (19%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
M C VC E A CSACK YC + QK +WK HK C+ + + L
Sbjct: 3 MPCEVCYEDATRVCSACKYTRYCSEACQKANWKTHKKGCE-----------IQQMLNRMN 51
Query: 73 DLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSA 126
+ H AR +P C GC+ + + CP C + AC +
Sbjct: 52 EEHAAAPRARPNPK-----------RCTGCNVRFTEDYPCDGECPDCGYVACES 94
>gi|324503373|gb|ADY41469.1| Egl nine 1 [Ascaris suum]
Length = 727
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
L +C C VAYCG++HQK WK HKP CK + ++
Sbjct: 146 LLRCQHCDMVAYCGEEHQKFDWKRHKPLCKTVQSQL 181
>gi|194762120|ref|XP_001963206.1| GF19729 [Drosophila ananassae]
gi|190616903|gb|EDV32427.1| GF19729 [Drosophila ananassae]
Length = 186
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ + GR L + GD + E P GPK + +CL C++ L +
Sbjct: 2 LKARLEQDEVFGRCLAVAEPVQRGDLVLEELPFAQGPKRD-SGIVCLACYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C+A C+ SD H+ EC +
Sbjct: 61 SLDRCEICDWPLCAA-CAIGSDVTDHSGECEVF 92
>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
Length = 614
Score = 43.9 bits (102), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 50/237 (21%), Positives = 91/237 (38%), Gaps = 33/237 (13%)
Query: 249 GLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAI 303
GL+ + ++ H+C PN A ++M + + + + Y N + R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 304 TKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCWILPVNPLDND----SDWTCGS 357
TK+F+C+C+RC +P T + + C H L L D + W C
Sbjct: 356 TKHFDCTCDRCVEPLNTSIDRFLEGCVC---HVRGCGGVFLRTAALHEDQASSTTWECDL 412
Query: 358 CSARLNARDVHL------VTSQLGEQVDKLV------QENPNVKSLEEMLTKLEAMFHPH 405
CS L+ + V ++ E++ V + K LE L++ HP
Sbjct: 413 CSRVLDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472
Query: 406 HYHCYAVKHSLIQ----LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLG 458
H + L+ + + G +L +E+ + P+L+ + CLG
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLP---GPSLELANFYFCLG 526
>gi|407844697|gb|EKG02089.1| hypothetical protein TCSYLVIO_006900 [Trypanosoma cruzi]
Length = 584
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 72/322 (22%), Positives = 127/322 (39%), Gaps = 49/322 (15%)
Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAIL-----KLG----CETLLAYNDYKYEAILPL 165
C RC F CS RC+ EC +L LG CE + +D+ A +
Sbjct: 211 CRRCQFVSYCSDRCASCHGRQHEEYECRLLFRLREMLGSMRSCEAAVP-DDFFTMATHCI 269
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPG-TEVYEEIDSIVKYLRSNFLEKLPGDVLDDT 224
+ + + ++ + +E+H + G T + + + F+ ++ G V +
Sbjct: 270 TTVSGVKMRKEGHEAVLRLESHEVEVSQGLTPLVRLVQDLFSGEDPTFVTRILGVVRCNA 329
Query: 225 SAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI 283
A C + +G+ +G+ SV H+CLPN A + + + + +
Sbjct: 330 LAVC--------DASGLPVGQALHA-ASVTSYFNHSCLPNCAIEAGAIVTTRAIRPGEEL 380
Query: 284 STMYTNAL-WGTQPRREHLAITKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCW 340
+ Y L W RRE LA +F+C C RC D + + SA L +++ +
Sbjct: 381 TISYLPQLYWPAWLRREELAERYFFDCRCVRCGDGDRSPFESALSATLRLGGSREKEREY 440
Query: 341 ILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEMLTKLEA 400
I V L CG R+ A+DV V +G++ +L +L +
Sbjct: 441 ISSVQIL-------CG----RVRAKDVGDV--DVGDR-----------DALLHLLQECRQ 476
Query: 401 MFHPHHYHCYAVKHSLIQLYGT 422
P HY C+ + ++L +Y
Sbjct: 477 HLFPFHYLCHELHNTLSFVYAV 498
>gi|308509934|ref|XP_003117150.1| CRE-SET-14 protein [Caenorhabditis remanei]
gi|308242064|gb|EFO86016.1| CRE-SET-14 protein [Caenorhabditis remanei]
Length = 429
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 24/32 (75%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCSAC++VAYC + Q+ WKLHK +CK +
Sbjct: 36 LKKCSACRKVAYCSAECQRADWKLHKRECKAI 67
>gi|209879507|ref|XP_002141194.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
gi|209556800|gb|EEA06845.1| MYND finger domain-containing protein [Cryptosporidium muris RN66]
Length = 232
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 27/47 (57%), Gaps = 2/47 (4%)
Query: 8 EDEELMRCAVCRETAL--HKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
E +L RC CR T+L +C CK+V YC QKE WK HK CK
Sbjct: 186 EKADLTRCENCRITSLPLMRCGRCKKVLYCSVTCQKEDWKYHKRICK 232
>gi|357122838|ref|XP_003563121.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
[Brachypodium distachyon]
Length = 372
Score = 43.9 bits (102), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 28/50 (56%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C A +C +C VAYC + HQ HW LHK +C +L ++ +L ++
Sbjct: 13 CAGGAHRRCGSCGAVAYCSRAHQFIHWTLHKEECARLATQMSHIDVLSQF 62
>gi|397571596|gb|EJK47869.1| hypothetical protein THAOC_33384 [Thalassiosira oceanica]
Length = 1255
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 32/60 (53%), Gaps = 5/60 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
CA C + A L C+AC+ V YCG QK H KLHK CKK E+K L + L+
Sbjct: 18 CANCGKFASNTVKLKDCTACRLVKYCGVDCQKAHRKLHKKACKKRAAELKDEQLYSQGLE 77
>gi|389745230|gb|EIM86411.1| hypothetical protein STEHIDRAFT_121370 [Stereum hirsutum FP-91666
SS1]
Length = 307
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 21/39 (53%), Positives = 23/39 (58%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
C +E L CS CK V YC K+ QKEHW HK CKK
Sbjct: 263 CEAIKEKDLATCSRCKLVYYCSKECQKEHWPRHKIWCKK 301
>gi|170036329|ref|XP_001846017.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878894|gb|EDS42277.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 43.9 bits (102), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 7/98 (7%)
Query: 53 KLPY-----EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLN 107
KLP+ +K P GR L A D + GD I E P+++ + L +C C
Sbjct: 189 KLPFVAKGIAMKYYPEFGRGLVAERDFNTGDVILDEKPMLMAVEHGLRYSVCSYCTASFR 248
Query: 108 PNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILK 145
+L C C + CS +C + TH EC I++
Sbjct: 249 KSLIPCPNCVSCMY--CSEKCLEMDHRFTHRFECGIME 284
>gi|391224601|gb|AFM37574.1| HIF prolyl hydroxylase [Trichoplax adhaerens]
Length = 300
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 15 CAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
C++C + + KCS C+ YC K+HQ+ HWK+H+ C G L+ T
Sbjct: 26 CSLCFKKLISVKCSLCQSAYYCSKEHQQSHWKIHRENCMDERRPKIGCDKTGLALEKTWQ 85
Query: 74 LHPGDRIA 81
L +R++
Sbjct: 86 LQKNERLS 93
>gi|170086215|ref|XP_001874331.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651883|gb|EDR16123.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 98
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + +C VCR + +C+ C++V YC + QK+ WK HKPKCK
Sbjct: 50 RDPDARKCRVCRGKGKPRIKECAGCQKVRYCSPECQKKDWKAHKPKCK 97
>gi|392562241|gb|EIW55422.1| hypothetical protein TRAVEDRAFT_131337 [Trametes versicolor FP-101664
SS1]
Length = 1177
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 10 EELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
E RCA C + + KCS CK V+YC + QK WK HKP C
Sbjct: 1129 EHATRCAKCSWPDNKMRKCSRCKTVSYCSEGCQKADWKKHKPAC 1172
>gi|307203234|gb|EFN82389.1| Egl nine-like protein 1 [Harpegnathos saltator]
Length = 473
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
C VC T L +CS CK V YC K+HQ+ WK H+ C P
Sbjct: 24 CVVCERTDKLLRCSRCKAVFYCTKEHQRSDWKRHRDFCATHP 65
>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
Length = 486
Score = 43.9 bits (102), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 6/84 (7%)
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNAL 291
V ++ + + +Y + + H+C PN ++ + +L VF I T Y N L
Sbjct: 324 VGKFNLNQISGQIYPIYSFINHDCEPNVRYEIDDKLRLKVFARKPIKAGEELLTTYVNPL 383
Query: 292 WGTQPRREHLAITKYFNCSCERCS 315
G + RR L + F C CERC+
Sbjct: 384 HGVKLRRRELRVNWGFLCQCERCT 407
>gi|330944524|ref|XP_003306390.1| hypothetical protein PTT_19525 [Pyrenophora teres f. teres 0-1]
gi|311316122|gb|EFQ85513.1| hypothetical protein PTT_19525 [Pyrenophora teres f. teres 0-1]
Length = 414
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C +C + KC CK VAYCG++HQ++ W HK +C++L
Sbjct: 369 CKICGKYEGARKCVRCKTVAYCGEEHQRQDWPSHKRECRQL 409
>gi|383849148|ref|XP_003700208.1| PREDICTED: egl nine homolog 1-like [Megachile rotundata]
Length = 486
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C VC T L +CS CK V YC K+HQ+ WK HK C
Sbjct: 24 CVVCNRTDKLLRCSRCKAVFYCTKEHQRRDWKRHKEFC 61
>gi|330792277|ref|XP_003284216.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
gi|325085913|gb|EGC39312.1| hypothetical protein DICPUDRAFT_147973 [Dictyostelium purpureum]
Length = 507
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 81/170 (47%), Gaps = 20/170 (11%)
Query: 247 TQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTH----ISTMYTNALWGTQPRREHL 301
T GLY ++ H+C PN + +N + K+ RD + + Y+ ++ + RRE L
Sbjct: 228 TIGLYPLMVFFNHSCKPNLSILNNKKELKIICNRDINENEELFINYSPSICYSNERREML 287
Query: 302 AITKYFNCSCERC-SDPTE--LGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSC 358
+F+C CE C D E + Y KC +K + + W + + C +C
Sbjct: 288 KQCFFFDCKCELCLKDELEKSMDLYILCNKC---NKGRVNIWY----DENKNRIVKCLNC 340
Query: 359 SARLNARDV---HLVTSQLGEQVDKLVQENP--NVKSLEEMLTKLEAMFH 403
S+ NA ++ +L+T++L + +D+ + +N N+ + +L + F+
Sbjct: 341 SSDHNADEILKKNLITTKLQKCLDQFINQNSELNIDEFKLLLNQYCKQFY 390
>gi|326426901|gb|EGD72471.1| hypothetical protein PTSG_11593 [Salpingoeca sp. ATCC 50818]
Length = 457
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+CAVC E A +CS CK YC +Q Q EHW HK C +
Sbjct: 408 KCAVCGEPATKRCSKCKNEWYCRRQCQVEHWPKHKKLCATI 448
>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
Length = 391
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 107/282 (37%), Gaps = 36/282 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAE---PICLGCHKPLNPNLADNARCPRC-F 120
GR L +T L G + +SP+++ L L + L C K + + CP C
Sbjct: 20 GRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCFKTIQSASVSCPTCSH 79
Query: 121 WPACSARCSGLSDAHTHAP-ECAILKL--GCETLLAYN--DYKYEAILPLRCLILQRRSP 175
CS C + A +H P C L C+ L ++ + + +A + L SP
Sbjct: 80 QRFCSPTCLSAALASSHTPWVCQSLSRLRDCQDFLQHHSVERQIQAQFLVAAYNLAFVSP 139
Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYL--------RSNFLEKLPGDVLDDT-SA 226
+Q L ++ P + + + S++ L S LE + V D +A
Sbjct: 140 SDFQILLSLQGRAEDEDPA--IVQSLHSVISSLCPPPPIEGFSFSLELIAALVAKDRFNA 197
Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDT 281
L + E NG G+Y L H+CLPNA +N + V R
Sbjct: 198 FGLMEPLNLNEENGGQRSVRAYGIYPKASLFNHDCLPNACRFDYVDTNNSGNTDIVVRMI 257
Query: 282 H---------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
H +S N+ + T RR+ L F C C+RC
Sbjct: 258 HDVPQGREICLSYFPVNSNYST--RRKRLLEDYGFTCDCDRC 297
>gi|307109858|gb|EFN58095.1| hypothetical protein CHLNCDRAFT_142413 [Chlorella variabilis]
Length = 318
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 47/99 (47%), Gaps = 9/99 (9%)
Query: 1 MASIIEIEDEELMRCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI- 58
MA++ + E + + CA C T L KCS C +CG + QK +W H+ +CK+ +
Sbjct: 1 MAAMHQDELPDEVECANCGTTQNLLKCSRCHTAWFCGVKCQKAYWPFHRTQCKRNEFADA 60
Query: 59 --KSSPLLGRYL-----QATLDLHPGDRIARESPLIVGP 90
+S P R++ QA L DR+ R GP
Sbjct: 61 IEESEPKFARWMRKHGKQAVLKDDEVDRLERAGAAASGP 99
>gi|170113176|ref|XP_001887788.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164637149|gb|EDR01436.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 420
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 9 DEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D + +C CR A + C+ C++V YC + QK+ WK HKPKCK
Sbjct: 373 DPDAHKCRSCRRKAKPKIKVCAGCQKVRYCSSECQKKDWKAHKPKCK 419
>gi|168060010|ref|XP_001781992.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666565|gb|EDQ53216.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 388
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 16/47 (34%), Positives = 28/47 (59%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
C A +C AC + YC ++HQK HW H C ++ ++++ +P+L
Sbjct: 12 CSGPATLRCGACGAIRYCSRKHQKAHWDEHALVCSRMAHQMQLAPVL 58
>gi|405978745|gb|EKC43109.1| Egl nine-like protein 1 [Crassostrea gigas]
Length = 319
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 3/60 (5%)
Query: 1 MASIIEIEDEE--LMRCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
MA + + D M C +C + L CS C+ YC K+HQK WK HK CK++ E
Sbjct: 1 MAEVKSVRDRPNISMACQLCSSSERLMVCSGCRRTWYCSKEHQKLDWKYHKKNCKRMREE 60
>gi|189197141|ref|XP_001934908.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980856|gb|EDU47482.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 240
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 20/39 (51%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CA C KCS CK + YC Q W LHKP CK+
Sbjct: 31 CATCHTPTPRKCSTCKNIRYCSATCQTTDWPLHKPVCKR 69
>gi|413943501|gb|AFW76150.1| putative ubiquitin carboxyl-terminal hydrolase superfamily protein
[Zea mays]
Length = 944
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
CA C KCS CK V YC ++ Q +HW++ HK KCK++
Sbjct: 91 CAACGVVTSKKCSGCKRVRYCSQECQTKHWQIDHKFKCKQM 131
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 110/283 (38%), Gaps = 31/283 (10%)
Query: 46 LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLG 101
+ PK +K SS G L+A+ L PG+ + R PL G + + + LG
Sbjct: 1 MEPPKVEKF-----SSASRGNGLRASAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLG 55
Query: 102 CHKPLNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE 160
K + RC +C CSA+C + H EC LK C+ + +
Sbjct: 56 KEKLM--------RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLL 105
Query: 161 AILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTE--VYEEIDSIVKYLRSNFLE--KL 216
+ + + +K D+E++++K + + + + + ++R + +L
Sbjct: 106 GRVVFKLMEETPSESEKLYSFYDLESNINKLTEDKKDGLRQLVMTFQHFMREEIQDASQL 165
Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKL 275
P D IC + ++ GLY + L+ H+C PN + N L
Sbjct: 166 PSSF--DIFEAFAKVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLL 223
Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
RD ++ Y + L ++ RR+ L F C C RC
Sbjct: 224 RAVRDIEAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRC 266
>gi|268532182|ref|XP_002631219.1| C. briggsae CBR-SET-14 protein [Caenorhabditis briggsae]
Length = 428
Score = 43.5 bits (101), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
T L KC+ C E YC ++ QK WK+HKP+CK +
Sbjct: 34 TRLKKCAGCNEFFYCSQECQKNDWKVHKPECKSI 67
>gi|398389124|ref|XP_003848023.1| hypothetical protein MYCGRDRAFT_50206 [Zymoseptoria tritici
IPO323]
gi|339467897|gb|EGP82999.1| hypothetical protein MYCGRDRAFT_50206 [Zymoseptoria tritici
IPO323]
Length = 543
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 27/53 (50%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
+E CA C + KC AC V YCG++ Q HWK HK CK +I P
Sbjct: 2 KEGYECASCHQYGNLKCKACHLVVYCGRRCQTAHWKEHKKYCKSPFMKIDWQP 54
>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
Length = 453
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%), Gaps = 1/42 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C +ET + KCS C V YCG+ QKE W HK +C+KL
Sbjct: 30 RCDRCFKETKVMKCSNCLYVRYCGRSCQKEAWSDHKEECEKL 71
>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
Length = 382
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 12/120 (10%)
Query: 211 NFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN 264
NFL+ P + T H + GVI NG + R Q GL+ +CL+ H+C PN
Sbjct: 54 NFLDYWPRNSKQHTIDDISH-LFGVINCNGFTVSDQRGLQAVGVGLFPNLCLVNHDCWPN 112
Query: 265 ----AKHSNM-MQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
H + ++S + ++ Y + L ++ RR L +F+C CE C + T+
Sbjct: 113 CTVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCEHCRNRTK 172
>gi|345483923|ref|XP_001603005.2| PREDICTED: zinc finger MYND domain-containing protein 10-like
[Nasonia vitripennis]
Length = 333
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 2/50 (4%)
Query: 5 IEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+EI D + +C C+E A ++CS CKE YCG++ Q + W HK CK +
Sbjct: 275 LEIVDSK--KCFRCKEVAKNRCSRCKEAWYCGRECQVKDWTNHKTICKNI 322
>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
Length = 614
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 50/237 (21%), Positives = 91/237 (38%), Gaps = 33/237 (13%)
Query: 249 GLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAI 303
GL+ + ++ H+C PN A ++M + + + + Y N + R+ L
Sbjct: 296 GLFPYVSILNHSCRPNCCFASEGSVMYVRAVQDIPKGAELCLSYINLYESRRVRKTLLVA 355
Query: 304 TKYFNCSCERCSDP--TELGTYFSAMKCLNEHKDQGDCWILPVNPLDND----SDWTCGS 357
TK+F+C+C+RC +P T + + C H L L D + W C
Sbjct: 356 TKHFDCTCDRCVEPLNTSIDRFLEGCVC---HVRGCGGVFLRTAALHEDQASSTTWECDL 412
Query: 358 CSARLNARDVHL------VTSQLGEQVDKLV------QENPNVKSLEEMLTKLEAMFHPH 405
CS L+ + V ++ E++ V + K LE L++ HP
Sbjct: 413 CSRILDPESMRFKDPPWEVIAKAEERLVAAVRIYSERRFKEARKLLESYLSEFTGKLHPL 472
Query: 406 HYHCYAVKHSLIQ----LYGTQPGYAYTQLSSSLLERKISYVMSPNLKATDEPICLG 458
H + L+ + + G +L +E+ + P+L+ + CLG
Sbjct: 473 HVFLFDALTPLMNCCRAMGDAEEGLRVCRLILQCMEKVLP---GPSLELANFYFCLG 526
>gi|323452529|gb|EGB08403.1| hypothetical protein AURANDRAFT_64147 [Aureococcus anophagefferens]
Length = 1455
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CAVC A L C+ C YC K HQ+ HWK H+ CKK
Sbjct: 1410 CAVCGVRANLQACARCGAAFYCSKAHQESHWKEHRRDCKK 1449
>gi|169600053|ref|XP_001793449.1| hypothetical protein SNOG_02856 [Phaeosphaeria nodorum SN15]
gi|111068467|gb|EAT89587.1| hypothetical protein SNOG_02856 [Phaeosphaeria nodorum SN15]
Length = 414
Score = 43.5 bits (101), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/45 (46%), Positives = 24/45 (53%), Gaps = 2/45 (4%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
EEL CA+C +H CS C + YC K QK W LHK CK
Sbjct: 8 EEL--CAMCNNMGVHACSGCHSIRYCSKLCQKTDWSLHKLLCKSF 50
>gi|195454531|ref|XP_002074282.1| GK18437 [Drosophila willistoni]
gi|194170367|gb|EDW85268.1| GK18437 [Drosophila willistoni]
Length = 129
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 42/94 (44%), Gaps = 4/94 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L +++ + GR L T +H G+ + E P GPK + +CLGC+ L
Sbjct: 2 LKARLENDSVFGRCLATTEQVHRGEIVVEELPFAYGPKRE-SGIVCLGCYCYLQFGEDGE 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAILK 145
+ RC C WP C A C+ + H EC I
Sbjct: 61 SLDRCELCDWPLC-AMCASNQYENDHQGECVIFS 93
>gi|330928344|ref|XP_003302230.1| hypothetical protein PTT_13958 [Pyrenophora teres f. teres 0-1]
gi|311322550|gb|EFQ89675.1| hypothetical protein PTT_13958 [Pyrenophora teres f. teres 0-1]
Length = 249
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 29/48 (60%), Gaps = 7/48 (14%)
Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+CA C++TA L +C+ CK + YCG+ QK H+K HK C +L
Sbjct: 3 QCAKCKKTAADCGVASLKRCAKCKSIVYCGRDCQKAHYKAHKAACNQL 50
>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 521
Score = 43.5 bits (101), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 113/318 (35%), Gaps = 66/318 (20%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGC--HKPLNPNL-- 110
P + S P+ GRYL AT DL I R+ P A + +C C PLN +
Sbjct: 123 PIHVYSHPINGRYLVATKDLDEQTVILRDLPYTWAVDHATCDSVCQHCFLEVPLNQQILP 182
Query: 111 ADNARCPRCFWPA-CSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI 169
D C C CSA C + D H EC I K + +Y L +
Sbjct: 183 TDFYMCEGCQRVGYCSANCRCI-DYSQHRFECQIFK-------ELDTEEYSPFLMSEIKL 234
Query: 170 LQRRSPKKYQELK-------DMEAHMSKRGPGTEVYEEIDSIV---KYLRSNFLEKLPGD 219
L R +K+ E D+ K+ Y+ S++ LR N +L +
Sbjct: 235 LVRTLSRKWLEDSITQTAGIDINDETIKKQNTYNQYKNPQSLIPQDNGLRYNDYAELVSN 294
Query: 220 V--LDDTSAKCL-HWICGVIEVNGVDIGR-------------------YTQ--------- 248
V +++ + L +WIC + +G+ Y Q
Sbjct: 295 VENYNESLKESLSYWICKYVVKLSAKLGKIEDEFDLLNILLRNRCNAFYIQGRPRDGSSG 354
Query: 249 -----GLYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDT----HISTMYTNALWGTQP 296
G+Y H+C PN + +N ++ + + ++ ++ Y +
Sbjct: 355 ESRGCGVYVRNSFFNHSCDPNVNYWVVNNTLEVECTLLKNVKEGDELTISYIDTTSPLNK 414
Query: 297 RREHLAITKYFNCSCERC 314
RRE L FNC C +C
Sbjct: 415 RREKLLEGYLFNCLCTKC 432
>gi|389749892|gb|EIM91063.1| hypothetical protein STEHIDRAFT_91033 [Stereum hirsutum FP-91666
SS1]
Length = 739
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 29/55 (52%), Gaps = 3/55 (5%)
Query: 2 ASIIEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKK 53
A ++++ EL RC+ CR L +C C YCG Q EHWK HK CKK
Sbjct: 673 ARSVDMKAVELYRCSWCRNPSAVLKRCKGCGGTTYCGASCQMEHWKTGHKKVCKK 727
>gi|397616553|gb|EJK64030.1| hypothetical protein THAOC_15280 [Thalassiosira oceanica]
Length = 323
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L C+AC+ V YCG QK H K HK CK+ E
Sbjct: 2 SCVPVVDDGAEVCANCGKQGSDTVKLRNCTACRLVKYCGVDCQKAHRKQHKNACKRRAAE 61
Query: 58 IKSSPLLGRYLQ 69
+K L R L+
Sbjct: 62 LKDEQLYSRGLE 73
>gi|443702950|gb|ELU00773.1| hypothetical protein CAPTEDRAFT_224225 [Capitella teleta]
Length = 1175
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 6/56 (10%)
Query: 14 RCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
RC+ CR L +CS C + AYC +Q Q + W HKP CK +P E+ P +
Sbjct: 674 RCSCCRRDCTANMQLKRCSKCFKTAYCDRQCQTQDWVRHKPNCKFVP-ELIGQPFM 728
>gi|321474624|gb|EFX85589.1| hypothetical protein DAPPUDRAFT_237743 [Daphnia pulex]
Length = 908
Score = 43.1 bits (100), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 15 CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C VC++ L +CS C+ +AYC + Q HW +HK CKKL
Sbjct: 867 CTVCKKIPEKLKRCSRCRYIAYCSVECQHSHWPIHKAICKKL 908
>gi|403413554|emb|CCM00254.1| predicted protein [Fibroporia radiculosa]
Length = 336
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%), Gaps = 4/43 (9%)
Query: 15 CAVCRE---TALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
CA C AL CS CK++ YC QHQK HWK +HK +C +
Sbjct: 290 CAACGNPDAKALKVCSGCKQIMYCSPQHQKAHWKTVHKNQCSR 332
>gi|410904557|ref|XP_003965758.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Takifugu rubripes]
Length = 419
Score = 43.1 bits (100), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E K CSACK V YC K QK HW HK CKKL
Sbjct: 318 CTSCGEKGAQKRCSACKMVVYCDKACQKLHWFTHKKVCKKL 358
>gi|325182386|emb|CCA16839.1| ubiquitinspecific protease putative [Albugo laibachii Nc14]
Length = 1031
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 26/49 (53%), Gaps = 2/49 (4%)
Query: 8 EDEELMRCAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
E +L C VCR T A+ +C C+ V YC Q HWK H+ C KL
Sbjct: 293 EGVQLKSCIVCRSTSEAVKRCGRCRRVYYCTVDCQGSHWKYHRTVCSKL 341
>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
Length = 358
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 73/290 (25%), Positives = 111/290 (38%), Gaps = 39/290 (13%)
Query: 61 SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALA---EPIC--LGCHKPLNPNLADNAR 115
+PL GR L + G+ + + P V P +A+ + IC LGC + ++ + A++
Sbjct: 59 NPLKGRTLSTNGPIRAGELLMADLPYAVLPTVAVGGGDDSICSNLGCSRRVSRHAANSVT 118
Query: 116 CPR-----CFWPACSARCSGLSDAHTHAPECAILKLGCETLLA-YNDYKYEAILPLRCLI 169
C +W C+ C D H ECA LK L D+ Y + + L+
Sbjct: 119 CETGCSLDIWW--CNESCKD-EDQARHDVECAWLKKYSPVLRQELGDHDYYMLWIIIRLL 175
Query: 170 LQRRSPKKYQEL---KDMEAHMSKRG--PGTEVYEEIDSIVKYLRSNFL--EKLPGDVLD 222
R S K L K ++ S R P +++ KYL + L E L D L
Sbjct: 176 AARNSHSKIASLAAGKGVQLLRSNRDSWPTSQLQHWSTLAEKYLLQSSLLPETLDLDTLV 235
Query: 223 D----TSAKCLHWICGVIEV-----NGVDIG-RYTQGLYSVICLMEHNCLPNAKHSNMMQ 272
D G E+ GV+ G +Y ++ I L H+C PN H +
Sbjct: 236 DLICAEETNVFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSCAPNVTHQADDR 295
Query: 273 SKLFV--FRDTH------ISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
++ V RD S + Q RR+ F C+C RC
Sbjct: 296 GRMMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTFTCTCPRC 345
>gi|393232939|gb|EJD40515.1| hypothetical protein AURDEDRAFT_69933 [Auricularia delicata
TFB-10046 SS5]
Length = 351
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/34 (52%), Positives = 23/34 (67%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+ L +CS CKEV YC +QHQK W +HK C+K
Sbjct: 317 SSGLIRCSRCKEVKYCSEQHQKYDWGIHKLVCEK 350
>gi|390598125|gb|EIN07524.1| hypothetical protein PUNSTDRAFT_136202 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 214
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+A+ E+E+ +T+L +CS CK YCGK Q+E W++HK CK L
Sbjct: 141 LAACASCEEEKTADINTPGKTSLMRCSGCKVTRYCGKACQREDWRVHKEFCKIL 194
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 62/268 (23%), Positives = 103/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A+ L PGD + R PL G + A+ + LG K L RC +C
Sbjct: 15 GNGLRASAALRPGDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLL--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-NCRPRYPPDSVRLLGRVVCKLMEGTPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++++K + E+ ++ L F + + ++ D D
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLAMTFQDFMREEIQDASQLPPSFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
IC + ++ GLY + L+ H+C PN + N L RD + +
Sbjct: 179 ICNSFTICDAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|260818611|ref|XP_002604476.1| hypothetical protein BRAFLDRAFT_280521 [Branchiostoma floridae]
gi|229289803|gb|EEN60487.1| hypothetical protein BRAFLDRAFT_280521 [Branchiostoma floridae]
Length = 379
Score = 43.1 bits (100), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 20/33 (60%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
L +C C+ YC K+HQ+ HWK HK CK P
Sbjct: 4 LLRCGGCRTAYYCSKEHQRTHWKKHKRVCKSKP 36
>gi|302692318|ref|XP_003035838.1| hypothetical protein SCHCODRAFT_84444 [Schizophyllum commune H4-8]
gi|300109534|gb|EFJ00936.1| hypothetical protein SCHCODRAFT_84444 [Schizophyllum commune H4-8]
Length = 530
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C VC + +C+ C VAYCG + Q+ HWK HK C+ +
Sbjct: 355 CVVCGKRTTSRCTGCLAVAYCGPECQRAHWKDHKSFCRAI 394
>gi|310800311|gb|EFQ35204.1| MYND finger [Glomerella graminicola M1.001]
Length = 344
Score = 43.1 bits (100), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 34/65 (52%), Gaps = 7/65 (10%)
Query: 14 RCAVCR-ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
+CAVC T L +C+ C+ V YCG+ HQ H K HK C ++ E + RYLQ
Sbjct: 13 QCAVCHSRTKLSRCAGCQVVYYCGRDHQVRHRKAHKSDCNQIK-EYRE-----RYLQEEN 66
Query: 73 DLHPG 77
L G
Sbjct: 67 ALRNG 71
>gi|328771148|gb|EGF81188.1| hypothetical protein BATDEDRAFT_87837 [Batrachochytrium
dendrobatidis JAM81]
Length = 391
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 2/85 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKKLPYEIKSSPLLGRYLQATL 72
C +C KCSACK V YC K+H W++ HK C + + + + + +
Sbjct: 157 CHLCGLKGSLKCSACKNVYYCSKEHSVLDWQIGGHKEHCGNSISDDQIAIVRAKAIFPEF 216
Query: 73 DLHPGDRIARESPLIVGPKLALAEP 97
+L D + E P V L L EP
Sbjct: 217 ELQEEDEMEFELPATVTDSLVLEEP 241
>gi|350412393|ref|XP_003489630.1| PREDICTED: egl nine homolog 1-like [Bombus impatiens]
Length = 486
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C +C T L +CS CK V YC K+HQ+ WK HK C
Sbjct: 24 CVICNRTDKLLRCSRCKAVFYCTKEHQRRDWKRHKEFC 61
>gi|221505568|gb|EEE31213.1| hypothetical protein TGVEG_082100 [Toxoplasma gondii VEG]
Length = 858
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 9/89 (10%)
Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
+Y + H+CLPNA H+ +++S+ + ++ Y + + T RR+
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRSRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
L+ TK F C+CERCS P + F C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256
>gi|397635097|gb|EJK71718.1| hypothetical protein THAOC_06815 [Thalassiosira oceanica]
Length = 300
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 3 SIIEIEDEELMRCAVCRE------TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
S I ++D CA C + L C+AC+ V YCG Q+ H KLHK CK+
Sbjct: 2 SCIAVDDGSTDVCANCGKGCEGGAIKLKSCTACRLVKYCGVDCQRAHRKLHKKACKQRVA 61
Query: 57 EIKSSPLLGRYLQ 69
E+K L + L+
Sbjct: 62 ELKDEELYSQGLE 74
>gi|397579542|gb|EJK51248.1| hypothetical protein THAOC_29597 [Thalassiosira oceanica]
Length = 365
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 5/73 (6%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + + D+ CA C + L C+AC+ V YC Q+ H K HK CK+
Sbjct: 65 MSCFVSVADDGYEACANCGKQGSEAVKLRNCTACRLVKYCSVDCQRAHRKQHKKACKQRA 124
Query: 56 YEIKSSPLLGRYL 68
E+K L G+ L
Sbjct: 125 AELKDEQLYGQGL 137
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 106/280 (37%), Gaps = 43/280 (15%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ + GR L+A + GD I E +L +C C K A RC
Sbjct: 5 EVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFK----RHAKLHRC 60
Query: 117 PRC-FWPACSARCSGLSDAHT-HAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
+C F C C DA H ECA +K + E I L R + R
Sbjct: 61 GQCKFAHYCDRTCQ--KDAWVNHKNECAAIK-------KHGKAPNENIRLAARIMWRIER 111
Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
E + ++ H+ G EE ++ +FLE P D + +
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFG------EEEKKELRLDLESFLEFWPHDG-KQFAMQY 164
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPN-------AKHSNMMQSKL 275
+ I GVI NG + R Q G++ +CL+ H+C PN K K+
Sbjct: 165 ISHILGVINCNGFTLSDQRGLQAVGVGIFPNLCLVNHDCWPNCTVIFNNGKIELRALGKI 224
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ + +S Y + L + RR+ L YF+C+CE C
Sbjct: 225 SIGEELTVS--YVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 262
>gi|115472083|ref|NP_001059640.1| Os07g0481000 [Oryza sativa Japonica Group]
gi|113611176|dbj|BAF21554.1| Os07g0481000 [Oryza sativa Japonica Group]
gi|215701040|dbj|BAG92464.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222637030|gb|EEE67162.1| hypothetical protein OsJ_24245 [Oryza sativa Japonica Group]
Length = 380
Score = 43.1 bits (100), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C A +C C VAYC + HQ HW HK +C +L ++ +L ++
Sbjct: 13 CAGEARRRCGGCGAVAYCSRAHQTVHWGFHKEECARLAEQMSRIDMLKQF 62
>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
Length = 429
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 24/32 (75%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCSAC+ +AYC ++ Q+ WKLHK +CK +
Sbjct: 36 LKKCSACRRLAYCSQECQRADWKLHKVECKAI 67
>gi|340715205|ref|XP_003396109.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
isoform 1 [Bombus terrestris]
gi|340715207|ref|XP_003396110.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
isoform 2 [Bombus terrestris]
Length = 440
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 27/44 (61%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
++ C +C E A +CS CKEV YCG++ Q + W HK C K+
Sbjct: 388 DVKECPLCHEEAKKRCSKCKEVWYCGRECQVKDWVKHKNICDKI 431
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 39/289 (13%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ +S GR L+AT + D I E +L +C C K RC
Sbjct: 5 EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQEKLHRC 60
Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
+C F C C DA H ECA +K Y E I L R + R
Sbjct: 61 GQCKFAHYCDRTCQ--KDAWLNHKNECAAIK-------KYGKVPNENIRLAARIMWRVER 111
Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
E + D++ H+ G EE ++ FL+ P S +
Sbjct: 112 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 164
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
+ I GVI NG + R Q G++ + L+ H+C PN ++ ++ + +
Sbjct: 165 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 224
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
++ Y + L ++ RR L YF+CSCE C + + +A
Sbjct: 225 SEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAA 273
>gi|403411611|emb|CCL98311.1| predicted protein [Fibroporia radiculosa]
Length = 675
Score = 42.7 bits (99), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 24/38 (63%)
Query: 16 AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
++C +C+ACK +YC K+ QK+HWK H+ CK
Sbjct: 112 SICPREGSMRCAACKLTSYCSKECQKDHWKTHRRDCKN 149
>gi|390599653|gb|EIN09049.1| hypothetical protein PUNSTDRAFT_102596, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 722
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 5 IEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+ I+ L C+ C AL KC C + YC QK HWK HKP C++
Sbjct: 670 VTIDSVSLYLCSFCENPSAALKKCGGCGKTRYCDAACQKGHWKAHKPLCQR 720
>gi|348689068|gb|EGZ28882.1| hypothetical protein PHYSODRAFT_353603 [Phytophthora sojae]
Length = 175
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCK 52
E +RC VC+ + CS CK V YCG + QK+ WK HK KCK
Sbjct: 128 EAALRCRVCKAPSTLVCSRCKVVRYCGAECQKQDWKASHKQKCK 171
>gi|297825441|ref|XP_002880603.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
lyrata subsp. lyrata]
gi|297326442|gb|EFH56862.1| ubiquitin carboxyl-terminal hydrolase family protein [Arabidopsis
lyrata subsp. lyrata]
Length = 650
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 41/84 (48%), Gaps = 2/84 (2%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSSPLLGRYLQ 69
+ + C+VC +T+ KCS CK V YC Q WK HK KCK SSP+ +
Sbjct: 60 DTVECSVCGKTSTKKCSRCKSVRYCSADCQTSDWKSGHKLKCKVF-RSTDSSPVGRDDID 118
Query: 70 ATLDLHPGDRIARESPLIVGPKLA 93
L +++S + + P+L+
Sbjct: 119 YKASLFGNMSASKKSKVALVPQLS 142
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 42.7 bits (99), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 5/104 (4%)
Query: 234 GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFRDT----HISTMYT 288
G +NGV GL+ + H+C PN S Q + V +D ++ Y
Sbjct: 184 GFSIMNGVTNEPVGIGLFLQGSMFNHDCDPNCVVSFQGQEMNVHVIKDVKEGQELTISYV 243
Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNE 332
L T+ R++ L + +F C C RC+ T Y ++C N+
Sbjct: 244 EVLQSTKKRQKILKDSYFFECQCSRCTTETTDDWYLDGLQCGNK 287
>gi|302676301|ref|XP_003027834.1| hypothetical protein SCHCODRAFT_237843 [Schizophyllum commune
H4-8]
gi|300101521|gb|EFI92931.1| hypothetical protein SCHCODRAFT_237843 [Schizophyllum commune
H4-8]
Length = 224
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 24/51 (47%)
Query: 1 MASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
M++ + I D RC VC +CS C YC K H WK HK +C
Sbjct: 1 MSAAVPIPDGFETRCGVCAAPTSMRCSRCGSAYYCSKDHSAMDWKYHKKQC 51
>gi|156552215|ref|XP_001606321.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Nasonia vitripennis]
Length = 411
Score = 42.7 bits (99), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A KCS CK+V YC ++ Q+ HW +HK C +L
Sbjct: 339 CVTCGEEKATKKCSKCKQVQYCDRECQRLHWFIHKKTCSRL 379
>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 541
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 61/267 (22%), Positives = 107/267 (40%), Gaps = 35/267 (13%)
Query: 65 GRYLQATLDLHPGDRIAR-ESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFW-- 121
GR L+A D PG +A + PL+ P A ++ C C L NA+ C
Sbjct: 25 GRCLRAARDYKPGQLMAVFDDPLVAFPDAAGSKTTCHHC-------LVHNAKVFGCTGCD 77
Query: 122 ---PACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI-LQRRSPKK 177
CS C ++ H+ EC + + + ++ N P+R L+ + R +
Sbjct: 78 KAVSYCSDECQKVNWKLIHSKECKVFR-KVQAAVSKNWLP----TPVRALVQILLRLSEV 132
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
++ + +E H K E++E + L++ G DD + I I+
Sbjct: 133 HEVVTGLEGHGDKFRARKELWENMK-----LQAYAGIHYAGRKEDDANLNLAAEILCKIQ 187
Query: 238 VNG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMY 287
N D G+ L ++ ++ H+C+PNA K ++++ V I+ Y
Sbjct: 188 TNSFDRFDADTGQSGTFLDPLLAMVNHSCIPNAVVLFWKRKAYLRAETPVKAGEDITISY 247
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ RR+ L + +F C C RC
Sbjct: 248 IDYTKPLSFRRQDLELY-HFQCGCLRC 273
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 14/102 (13%)
Query: 240 GVDIGRYTQGLYSVICLMEHNCLPN---AKHSNMMQSKLF--VFRDTHISTMYTNALWGT 294
V IG Y +G L H+C PN + MQ ++ + D ++ Y L T
Sbjct: 196 AVGIGLYPEG-----ALFNHDCDPNCVVSFKGREMQVRVVRDIEVDEELTVSYVELLQST 250
Query: 295 QPRREHLAITKYFNCSCERCSDPTELGT----YFSAMKCLNE 332
+ RR L + +F+C C+RC T + Y ++C N+
Sbjct: 251 KARRRELKESYFFDCECKRCKAATNGQSNEDWYLDGLRCSNK 292
>gi|256075329|ref|XP_002573972.1| hypothetical protein [Schistosoma mansoni]
gi|360044826|emb|CCD82374.1| hypothetical protein Smp_135930 [Schistosoma mansoni]
Length = 478
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
RC C E A +CS C++ YCG++ Q +HW HK C L I S
Sbjct: 427 RCPTCGEIASKRCSRCRQEWYCGRECQVKHWLKHKDACDLLTEAITS 473
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 42.7 bits (99), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 102/265 (38%), Gaps = 20/265 (7%)
Query: 61 SPLLGRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
SP GR L+A GD + A S L VG + + E C L+ RC
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFACPAYASVLTVGERGSHCEH-CFA----RKEGLSKCGRC 68
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPK 176
+ F+ C+ C D H EC+ + + E + A + + I R+P
Sbjct: 69 KQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPS 125
Query: 177 KYQ-ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
+ +K+ E+H+ K E + I S + L + + L D C
Sbjct: 126 EMLLAVKEFESHLDKLD--NEKKDLIQSDIAMLHQFYSKHLEFPDHDSLVVLFAQVNCNG 183
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHISTMYTNA 290
+ ++ ++ + LM H+C PN + +++ V + T Y +
Sbjct: 184 FTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEVLPGEEVFTSYIDL 243
Query: 291 LWGTQPRREHLAITKYFNCSCERCS 315
L+ T+ R + L + +F C C+ C+
Sbjct: 244 LYPTEDRNDRLRDSYFFTCECQECT 268
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 42.7 bits (99), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 44/166 (26%), Positives = 71/166 (42%), Gaps = 25/166 (15%)
Query: 178 YQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIE 237
Y++ KD+ +H + + E S+ + L +F+E G L +++ + + G I
Sbjct: 106 YRKFKDLMSHYTDIKNDPKRIEHFVSLCQVLE-DFME---GTTLPNSAE--ILGLYGRIC 159
Query: 238 VNGV-----DIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD---------THI 283
VN D+ G+Y +++H+C PNA + + + R + I
Sbjct: 160 VNSYNILDPDMNSIGVGIYLGPSVIDHSCKPNA--VAVFEGTTILIRALEDIPRLDWSQI 217
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
Y + L T R L T YF C CERC DP TY +A C
Sbjct: 218 HISYIDVLNTTSTRCTELQNTYYFLCECERCKDPE---TYATAAIC 260
>gi|117650662|gb|ABK54276.1| Egln3 [Branchiostoma belcheri]
Length = 804
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 7/67 (10%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK------KLPYEIKSSPLLGRY 67
CAVC ++ L +CS C+ V YC +HQ ++WK HK C+ + + +S+ G
Sbjct: 83 CAVCGAKSNLKRCSRCQGVWYCSSEHQSQNWKQHKKICRAKSAAQSVQSDTQSAAGKGHV 142
Query: 68 LQATLDL 74
Q+ +DL
Sbjct: 143 SQSAVDL 149
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 39/289 (13%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ +S GR L+AT + D I E +L +C C K RC
Sbjct: 10 EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQEKLHRC 65
Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
+C F C C DA H ECA +K Y E I L R + R
Sbjct: 66 GQCKFAHYCDRTCQ--KDAWLNHKNECAAIK-------KYGKVPNENIRLAARIMWRVER 116
Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
E + D++ H+ G EE ++ FL+ P S +
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 169
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
+ I GVI NG + R Q G++ + L+ H+C PN ++ ++ + +
Sbjct: 170 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 229
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
++ Y + L ++ RR L YF+CSCE C + + +A
Sbjct: 230 SEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAA 278
>gi|389604014|ref|XP_003723152.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504894|emb|CBZ14681.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 623
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 46/205 (22%), Positives = 80/205 (39%), Gaps = 45/205 (21%)
Query: 229 LHWICGVIEVNGVDIGR-----YTQGLY---SVICLMEHNCLPN-----AKHSNMMQSKL 275
L+ I GV+ N +++ Q L+ ++ H+C+PN +H+ + +
Sbjct: 370 LYIIIGVLRCNALEVTDPSGLGVAQALHVGNTIASFFNHSCMPNCAIDTVRHAIVTTRII 429
Query: 276 FVFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTELGTYFSAMKCLN 331
+V + I+ + W T+ RRE L+ YF C C RC DP EL L
Sbjct: 430 YVGEELSIAYI-PQLYWPTRLRRERLSEGYYFVCHCPRCQRCDKDPFELALSME----LP 484
Query: 332 EHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSL 391
+ P+ C + +R+ V + QE+ + L
Sbjct: 485 AARRGATTHFHPI------VQTACATIRSRM---------------VGDVSQEDAD--KL 521
Query: 392 EEMLTKLEAMFHPHHYHCYAVKHSL 416
++L+ L P HY C+ V++ L
Sbjct: 522 RKLLSDLLKHLFPFHYLCHEVRNCL 546
>gi|307213312|gb|EFN88764.1| Ankyrin repeat and MYND domain-containing protein 2 [Harpegnathos
saltator]
Length = 406
Score = 42.7 bits (99), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A+ KCS CK V YC ++ Q+ HW +HK C +L
Sbjct: 335 CVTCGEDKAIKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375
>gi|427778123|gb|JAA54513.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 468
Score = 42.7 bits (99), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C C E A +CSACK V YCG QK HW HK C +L E K +
Sbjct: 374 CTACGEPQADKRCSACKSVQYCGAPCQKLHWFTHKRHCARLAEEYKKA 421
>gi|242045648|ref|XP_002460695.1| hypothetical protein SORBIDRAFT_02g033310 [Sorghum bicolor]
gi|241924072|gb|EER97216.1| hypothetical protein SORBIDRAFT_02g033310 [Sorghum bicolor]
Length = 165
Score = 42.7 bits (99), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 28/47 (59%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
A +C C VAYC + HQ HW++HK +C++ +++ +L ++
Sbjct: 17 VADRRCGGCGAVAYCSRAHQIIHWRVHKEECERFAEQMRRVDVLNQF 63
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 65/287 (22%), Positives = 110/287 (38%), Gaps = 39/287 (13%)
Query: 46 LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLG 101
+ PK +K S+ G L+A L PG+ + R PL G + + + LG
Sbjct: 1 MEPPKVEKF-----STANRGNGLRALAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLG 55
Query: 102 CHKPLNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYE 160
K + RC +C CSA+C + H EC LK C+ + +
Sbjct: 56 KEKLM--------RCSQCRVAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLL 105
Query: 161 AILPLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK----- 215
+ + + +K D+E++++K E E + +V + E+
Sbjct: 106 GRVVFKLMEETPSESEKLYSFYDLESNINKLT--EEKKEGLRQLVMTFQHFMREEIQDAS 163
Query: 216 ---LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMM 271
LP D+ + + IC + ++ GLY + L+ H+C PN + N
Sbjct: 164 QLPLPFDIFEAFA----KVICNAFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGP 219
Query: 272 QSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
L RD ++ Y + L ++ RR+ L F C C RC
Sbjct: 220 HLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 109/281 (38%), Gaps = 27/281 (9%)
Query: 56 YEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNAR 115
+E +S G L+AT ++H G+ + PL A+ +C C L R
Sbjct: 5 FERFASAEKGNGLRATHEIHAGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLL----R 60
Query: 116 CPRC-FWPACSARCS--GLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
C +C C C SD H EC L+ + + + A + R L +
Sbjct: 61 CSQCKVARYCDVTCQKQAWSD---HKRECKCLR-SLHPRIPTDSVRLAARIIFRLLSPSQ 116
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
SP L++ E+H+ G E E + + L+ +++P + + L +
Sbjct: 117 TSPG-LCSLEEHESHLCDMG--EERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLL 173
Query: 233 ----CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQS-KLFVFRDTH----I 283
C ++ ++ GLY + L+ H+C P+ ++ L RD +
Sbjct: 174 AKLTCNCFTISDGELREIGVGLYPSMSLLNHDCRPSCVMLFEGETLHLRAVRDMQPAEEL 233
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
+ Y L T+ RR L +F C C+RC+ DP L
Sbjct: 234 TISYIGTLAPTRDRRTQLEEQYHFTCQCQRCTTADMDPVML 274
>gi|270011013|gb|EFA07461.1| HIF prolyl hydroxylase [Tribolium castaneum]
Length = 478
Score = 42.7 bits (99), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI-KSSPLLG 65
+ CAVC +T L +C+ CK YC K+HQ++ W H+ +C +L + K SP G
Sbjct: 5 LSCAVCGTKTNLLRCARCKTRIYCSKEHQRKDWPNHRRECAQLALALDKQSPKSG 59
>gi|427784613|gb|JAA57758.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 430
Score = 42.7 bits (99), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C C E A +CSACK V YCG QK HW HK C +L E K +
Sbjct: 336 CTACGEPQADKRCSACKSVQYCGAPCQKLHWFTHKRHCARLAEEYKKA 383
>gi|282158095|ref|NP_001164091.1| HIF prolyl hydroxylase [Tribolium castaneum]
Length = 356
Score = 42.7 bits (99), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI-KSSPLLG 65
+ CAVC +T L +C+ CK YC K+HQ++ W H+ +C +L + K SP G
Sbjct: 5 LSCAVCGTKTNLLRCARCKTRIYCSKEHQRKDWPNHRRECAQLALALDKQSPKSG 59
>gi|72003678|ref|NP_508850.2| Protein SET-30 [Caenorhabditis elegans]
gi|351064639|emb|CCD73128.1| Protein SET-30 [Caenorhabditis elegans]
Length = 560
Score = 42.7 bits (99), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 238 VNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HISTMYTNAL 291
++ VD R +GLY +C +H+C PNA +S N + +KL D ++ T T+
Sbjct: 181 ISNVDYLREIGKGLYLDLCKYDHSCRPNAIYSCNGIVAKLRALHDNVDLENVET--THYT 238
Query: 292 WGTQP-----RREHLAITKYFNCSCERCSDPTE 319
+ P RR L T YF C CERC DP +
Sbjct: 239 YIELPPCKIQRRHMLKETWYFECHCERCDDPDD 271
>gi|195339973|ref|XP_002036591.1| GM18750 [Drosophila sechellia]
gi|194130471|gb|EDW52514.1| GM18750 [Drosophila sechellia]
Length = 141
Score = 42.7 bits (99), Expect = 0.52, Method: Composition-based stats.
Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ +LGR L + + G+ + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLQEDAVLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C + C+ D H EC +
Sbjct: 61 SLDRCELCDWPLCGS-CADDEDVTEHRGECQVF 92
>gi|348680121|gb|EGZ19937.1| hypothetical protein PHYSODRAFT_298276 [Phytophthora sojae]
Length = 423
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 2 ASIIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKK 53
A+ ++ D+++ CA+C + A CSAC YC K HQK+HW HK C +
Sbjct: 136 AAQLKQVDDKVDLCALCGQRATFTCSACHVAQYCSKAHQKDHWTAGGHKQTCGQ 189
>gi|242096572|ref|XP_002438776.1| hypothetical protein SORBIDRAFT_10g026010 [Sorghum bicolor]
gi|241916999|gb|EER90143.1| hypothetical protein SORBIDRAFT_10g026010 [Sorghum bicolor]
Length = 979
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
CA C A KCS CK V YC ++ Q HW+ HK KCK++
Sbjct: 85 CAACGGVASKKCSRCKRVRYCSQECQTNHWQSDHKFKCKQM 125
>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
Length = 485
Score = 42.7 bits (99), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 6/87 (6%)
Query: 235 VIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT----HISTMYT 288
+I + ++ + ++++ + HNC PN ++ KLF +D + T Y
Sbjct: 321 MILIGKFNLNQREGQIFTLYSYLNHNCEPNVRYELDGKTGLKLFARKDIKKGEELLTTYV 380
Query: 289 NALWGTQPRREHLAITKYFNCSCERCS 315
N L G RR L + F C CERC+
Sbjct: 381 NPLHGVTLRRRELLVNWGFLCDCERCN 407
>gi|393236698|gb|EJD44245.1| hypothetical protein AURDEDRAFT_137472 [Auricularia delicata
TFB-10046 SS5]
Length = 222
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 29/85 (34%), Positives = 38/85 (44%), Gaps = 16/85 (18%)
Query: 7 IEDEELMRCAVCRET----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS- 61
+E + CAV T L CS CK VAYC K+ Q+ HW LHK C+ I +S
Sbjct: 1 MEQRQCDNCAVTVSTEAGVVLGICSNCKFVAYCSKECQRAHWALHKDVCRLTAAIISASR 60
Query: 62 -----------PLLGRYLQATLDLH 75
P L +YL+ +H
Sbjct: 61 AHGVASGQYPTPNLAKYLRRFCSIH 85
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 34/63 (53%), Gaps = 8/63 (12%)
Query: 13 MRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP-----LLG 65
+RCA C + L +C+ CK YC K QK WK+HK +CK L E KS P LG
Sbjct: 46 VRCAQCCVFKPKLLRCAGCKLQYYCSKDCQKSAWKIHKEECKMLKSE-KSMPDDITLFLG 104
Query: 66 RYL 68
R L
Sbjct: 105 RIL 107
>gi|126646557|ref|XP_001388057.1| MYND finger domain protein [Cryptosporidium parvum Iowa II]
gi|126117145|gb|EAZ51245.1| MYND finger domain protein [Cryptosporidium parvum Iowa II]
gi|323509519|dbj|BAJ77652.1| cgd4_1070 [Cryptosporidium parvum]
Length = 234
Score = 42.4 bits (98), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
ED+ L RC CR E L CS CK+VAYC Q++ W HK CK
Sbjct: 187 EDKILNRCESCRKESEKTLLICSRCKKVAYCNVDCQRKDWSYHKQFCK 234
>gi|301118408|ref|XP_002906932.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262108281|gb|EEY66333.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 175
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 26/44 (59%), Gaps = 1/44 (2%)
Query: 10 EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCK 52
E +RC VC+ + CS CK V YCG QK+ WK HK KCK
Sbjct: 128 EAALRCRVCKAPSTLVCSRCKAVRYCGADCQKQDWKASHKQKCK 171
>gi|397570679|gb|EJK47406.1| hypothetical protein THAOC_33872 [Thalassiosira oceanica]
Length = 306
Score = 42.4 bits (98), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 28/49 (57%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
T L C+AC+ V YCG Q+ H KLHK CK+ E+K L + L+
Sbjct: 28 TKLKNCTACRLVKYCGVDCQRSHRKLHKKACKQRVAELKDEQLYSQGLE 76
>gi|397593190|gb|EJK55891.1| hypothetical protein THAOC_24316 [Thalassiosira oceanica]
Length = 304
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H KLHK CK+ E+K L G+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYGQ 72
>gi|390331847|ref|XP_003723365.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like
[Strongylocentrotus purpuratus]
Length = 1640
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 30/49 (61%), Gaps = 5/49 (10%)
Query: 12 LMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
L+ C+ C+++ +L +C+ C VAYC + QK +W H+P C++ P
Sbjct: 1063 LLHCSHCKKSCEKGRSLKRCTKCYRVAYCDQACQKSNWNTHRPTCRRFP 1111
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 85/358 (23%), Positives = 136/358 (37%), Gaps = 59/358 (16%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
+I++S GR + A L G + P + E C C P + RC
Sbjct: 26 DIRTSSTDGRGIWAREALSAGATLMALKPHVHVLSTRNLELYCSACTSPAP--VTGLKRC 83
Query: 117 PRC--FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP---LRCL--I 169
RC W CS C +D H PECA ++ A N + +P +RCL +
Sbjct: 84 TRCRVIW-YCSTACQN-NDWTMHKPECAAIQR-----WAANAPSPDVSIPAEPIRCLGRL 136
Query: 170 LQRR-----SPKKYQELKDMEAHMSKRGPGTEVYEE----IDSIVKYLRSNFLEKL---- 216
L R+ S +E+ +++H P T YE ++VKYL + +L
Sbjct: 137 LWRKQKLRSSSTWAKEVDGLQSHRKDLQPST--YESHTHLAHAVVKYLGLDAPGQLGAYG 194
Query: 217 ---PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNA-------- 265
D++D S +I I + + + ++ L+ H+C PNA
Sbjct: 195 ITTAADLVDLIS----RFITNSITLTAPSLTPLGVSVSPLVALINHSCDPNAVVVYPRST 250
Query: 266 ----KHSNMMQSKLF--VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
+ MQ + D I T Y + R+ L T F C C CS P
Sbjct: 251 SDPSQQEPQMQVVAIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFTCKCSLCSQPDG 310
Query: 320 LGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQ 377
+ A+ C K+ G LP + ++ CG C A +++ D L ++G++
Sbjct: 311 MD-LREAIWC---PKNCGGTCPLPT---EENTLSRCGKCKAIVSSTDSVLDALRVGQE 361
>gi|397619755|gb|EJK65389.1| hypothetical protein THAOC_13754, partial [Thalassiosira oceanica]
Length = 234
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L GR L+
Sbjct: 93 LKNCTACRLVKYCGVDCQRTHRKQHKKACKQRAAELKDEQLYGRGLE 139
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 91/242 (37%), Gaps = 50/242 (20%)
Query: 115 RCPRCFWPACSARCSGLSDAH------------THAPECAILK-LGCETLLAYNDYKYEA 161
RC +CF P +CS H EC LK +G T+ +A
Sbjct: 30 RCDKCFKPGKVLKCSNCLYVRYCNRFCQKEAWPDHQDECGKLKAIGTRTVP-------DA 82
Query: 162 ILPLRCLI--LQR--------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSN 211
L + +I LQ+ + K Y+ D+ H E S++ LRS
Sbjct: 83 ALMISRIIRKLQKGGDYQKGYYTAKFYRRFHDLMTHEDDIKNDAPRMEHFQSLLVVLRS- 141
Query: 212 FLEKLPGDVLDDTSAKCLHWICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAK 266
L + + A+ L I G + +N +I G+Y I +++H+C PNA
Sbjct: 142 ----LVEEAAMPSKAELLQ-IFGKMCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNAL 196
Query: 267 HS-NMMQSKLFVFRDTH--------ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
+ + L + D H I Y + + + R+ L YF C+CERC D
Sbjct: 197 ATFDGTTIHLRLLEDYHGSEVDFSKIFISYIDLMNPAEERKARLRAQYYFECNCERCRDE 256
Query: 318 TE 319
E
Sbjct: 257 QE 258
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 42.4 bits (98), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 30/52 (57%), Gaps = 5/52 (9%)
Query: 13 MRCAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
+RC C L KCS CK +AYCGK Q WK+HK +CK L KS+P
Sbjct: 49 IRCDYCYSEPEKLLKCSKCKFIAYCGKVCQASDWKMHKYECKCLT---KSAP 97
>gi|348687379|gb|EGZ27193.1| hypothetical protein PHYSODRAFT_468296 [Phytophthora sojae]
Length = 430
Score = 42.4 bits (98), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 26/45 (57%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
E L C C+ VAYC ++ QK+ WK HKP CK + + + + L
Sbjct: 250 EGKLMTCGGCRSVAYCSRECQKKAWKKHKPDCKAIRAQREQAEAL 294
>gi|380011327|ref|XP_003689760.1| PREDICTED: zinc finger MYND domain-containing protein 10-like,
partial [Apis florea]
Length = 436
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
++ C +C E A +CS CKE YCG++ Q + W HK C K+ I+
Sbjct: 388 DIKECFLCHEEAKKRCSKCKEAWYCGRECQIKDWIKHKDICDKITKNIE 436
>gi|299470253|emb|CBN79557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 734
Score = 42.4 bits (98), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 28/41 (68%), Gaps = 1/41 (2%)
Query: 14 RCAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+CA C ++ +CS C++VAYC K+ Q+ WK HKP C++
Sbjct: 691 KCAQCGKSGKPLQCSRCRKVAYCSKECQRRDWKSHKPSCRE 731
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 88/233 (37%), Gaps = 35/233 (15%)
Query: 113 NARCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYE 160
N RC CF RCS + H+ ECA LK C N +
Sbjct: 30 NVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPMHSKECARLK-KCSPWGISNVARLM 88
Query: 161 A--ILPLRCLILQRR---SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK 215
A I+ L + R + Y++ KD+ +H S+ + E + L FLE
Sbjct: 89 ARIIIKLNQGGDEERGYYNETSYRKFKDLMSHCSEIEKDEKKMEHFVCLCNVLH-KFLED 147
Query: 216 LPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ----GLYSVICLMEHNCLPNAKHSNMM 271
+P S L I G I +N I G+Y +++H+C PNA +
Sbjct: 148 MP-----IPSTAELLGIYGRITINSFSIFNSDMNIGVGIYLGPSILDHSCKPNAVATFEG 202
Query: 272 QS-KLFVFRD------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
+ + D + I Y + + T+ RR L + YF C CERC P
Sbjct: 203 TTITVKAIEDLPSLDLSQIRIPYIDVIKTTRDRRAELQSSYYFWCDCERCEKP 255
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 61/268 (22%), Positives = 101/268 (37%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRALARLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVIFKLMEETPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++M+K + EE ++ L F + ++ D D
Sbjct: 125 SFYDLESNMNK------LTEEKKEGLRQLVLTFQHFMREEIQDASQLPPSFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
IC + ++ GLY + L+ H+C PN + N L RD ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|330790441|ref|XP_003283305.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
gi|325086730|gb|EGC40115.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
Length = 754
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 12 LMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKC 51
L +C C+ T + C CK VAYC K+ QK+HW +HK +C
Sbjct: 668 LYKCWSCKLTLIKPLICGYCKTVAYCSKECQKDHWLVHKEQC 709
>gi|397597652|gb|EJK57057.1| hypothetical protein THAOC_22945, partial [Thalassiosira oceanica]
Length = 439
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
CA C ET L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 150 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRTHRKQHKKACKERVAELKDEKLYGQ 206
>gi|328793905|ref|XP_001122085.2| PREDICTED: zinc finger MYND domain-containing protein 10-like [Apis
mellifera]
Length = 373
Score = 42.4 bits (98), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 19/50 (38%), Positives = 29/50 (58%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
++ C +C E A +CS CKE YCG++ Q + W HK C K+ I++
Sbjct: 323 DIKECFLCHEEAKKRCSKCKEAWYCGRECQIKDWIKHKNICDKITKNIEN 372
>gi|443687503|gb|ELT90474.1| hypothetical protein CAPTEDRAFT_222728 [Capitella teleta]
Length = 128
Score = 42.4 bits (98), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 24/39 (61%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
++C VC A C++CK+ AYCG+Q Q + W HK C
Sbjct: 87 VKCDVCTNLAAFLCASCKKAAYCGRQCQMQGWDQHKSTC 125
>gi|50550485|ref|XP_502715.1| YALI0D11792p [Yarrowia lipolytica]
gi|74634629|sp|Q6C9E7.1|SET5_YARLI RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49648583|emb|CAG80903.1| YALI0D11792p [Yarrowia lipolytica CLIB122]
Length = 438
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 105/277 (37%), Gaps = 35/277 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPI-----CLGCHKPLNPNLADNARCPRC 119
G+ L A D+ + + E ++ P L + C C +P + + CP C
Sbjct: 118 GKGLYAKKDIKKDEELWNEQAFLLVPPLEHVGIMRKGMGCAFCSRPFQSH--NGVECPSC 175
Query: 120 FWPACSARCSGLSDAHTHA-PECAILKLGCETLLAYNDY----KYEAILPLRCLILQR-R 173
C +C H E K+ A+ D+ + A+ L L L+ R
Sbjct: 176 AAKWCDNQCKKKDVTHVAMWHESRHGKVTRVEWRAFEDFCVENGWMALYALGLLWLRVIR 235
Query: 174 SPKKYQELKDMEAHMSKRGPGT--EVYEEIDSIVKYLRSNFLEKLPGDVLD----DTS-- 225
PKK + K M A ++ G + E+ +S+ +S L K ++LD DT
Sbjct: 236 DPKKDEVQKQMMA-FARVGQDERHKAVEQSNSLFASEQSEVLWKKGYEMLDKLLLDTEFS 294
Query: 226 -AKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR----- 279
K G+ +N VD +Y + H+C PN N+ +++ R
Sbjct: 295 YEKDFLPGLGMFNINNVD-----GNMYLTQSHLNHSCEPNVDVKNVGRTQGISVRAKRDI 349
Query: 280 --DTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
+ T Y N RR +L + FNC+C RC
Sbjct: 350 KTGEELFTTYVNPEHQLDDRRYNLRVNWGFNCNCTRC 386
>gi|397586045|gb|EJK53487.1| hypothetical protein THAOC_27074, partial [Thalassiosira
oceanica]
Length = 150
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H KLHK CK+ E+K L G+
Sbjct: 35 LKDCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYGQ 78
>gi|330795484|ref|XP_003285803.1| hypothetical protein DICPUDRAFT_97170 [Dictyostelium purpureum]
gi|325084267|gb|EGC37699.1| hypothetical protein DICPUDRAFT_97170 [Dictyostelium purpureum]
Length = 1002
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC+ + C+ C V YCG +HQ + W HK C L
Sbjct: 71 CAVCKSKNVQICTGCLLVYYCGTEHQNQDWPNHKDFCSGL 110
>gi|189205749|ref|XP_001939209.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975302|gb|EDU41928.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 327
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+ KCS C +V YCGKQHQK W H+ C K
Sbjct: 296 VSKCSGCGQVYYCGKQHQKLAWGFHRGYCAK 326
>gi|281210969|gb|EFA85135.1| hypothetical protein PPL_02134 [Polysphondylium pallidum PN500]
Length = 819
Score = 42.4 bits (98), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKCK 52
+L RC C+ + + C CK V+YC K Q+EHW HK CK
Sbjct: 760 QLYRCGKCKLSLMKPLICGNCKSVSYCSKICQQEHWDFHKDICK 803
>gi|195578357|ref|XP_002079032.1| GD23737 [Drosophila simulans]
gi|194191041|gb|EDX04617.1| GD23737 [Drosophila simulans]
Length = 141
Score = 42.4 bits (98), Expect = 0.65, Method: Composition-based stats.
Identities = 27/93 (29%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 54 LPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN 113
L ++ +LGR L + + G+ + E P GPK + +CLGC++ L +
Sbjct: 2 LKARLQEDAVLGRCLATSCSVERGELVLEELPFARGPKRD-SGIVCLGCYQFLQFGEDGD 60
Query: 114 A--RCPRCFWPACSARCSGLSDAHTHAPECAIL 144
+ RC C WP C + C+ D H EC +
Sbjct: 61 SLDRCELCDWPLCGS-CADDEDVTEHRGECHVF 92
>gi|169601910|ref|XP_001794377.1| hypothetical protein SNOG_03831 [Phaeosphaeria nodorum SN15]
gi|111067916|gb|EAT89036.1| hypothetical protein SNOG_03831 [Phaeosphaeria nodorum SN15]
Length = 354
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC + A KC+ C+ AYC K+ Q WK HK C L
Sbjct: 9 CAVCEKPASDKCARCRASAYCSKECQAADWKTHKTACADL 48
>gi|397632769|gb|EJK70688.1| hypothetical protein THAOC_07931, partial [Thalassiosira
oceanica]
Length = 283
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H KLHK CK+ E+K L R
Sbjct: 7 LKNCAACRLVKYCGADCQRGHRKLHKTACKQRAAELKDEQLYSR 50
>gi|321464470|gb|EFX75478.1| hypothetical protein DAPPUDRAFT_214409 [Daphnia pulex]
Length = 355
Score = 42.4 bits (98), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CA C +E KCS CK V YC + QK HW +HK +C K
Sbjct: 316 CATCGQEKVGSKCSICKSVQYCNRDCQKLHWPIHKKECDK 355
>gi|410077102|ref|XP_003956133.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
gi|372462716|emb|CCF56998.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
Length = 484
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 47/108 (43%), Gaps = 11/108 (10%)
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKH--SNMMQSKLFVFRDT----HISTMYTNAL 291
+ +I + + +Y + L H+C PNA++ + ++ K++ R I Y N L
Sbjct: 324 IGKFNINQISNQVYQLTSLANHSCEPNARYEIDSKLELKVYARRKIKPGEEIFLTYVNPL 383
Query: 292 WGTQPRREHLAITKYFNCSCERC-----SDPTELGTYFSAMKCLNEHK 334
G RR L + F C C RC + + S +K L+E K
Sbjct: 384 HGVNLRRRELRVNWGFLCKCSRCLKELKYSKLRINSETSQLKVLDEKK 431
>gi|193676233|ref|XP_001943351.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 636
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 8/91 (8%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICL--GCHKPLNPNLADNA 114
E+K S +GR + AT D++PGD +A + P I GP ++ +C GC LN +LA
Sbjct: 230 ELKMSKNMGRGVYATRDINPGDVVAIDEPYICGP-ISDHTGVCQYSGC---LNLDLAL-I 284
Query: 115 RCPRCFWP-ACSARCSGLSDAHTHAPECAIL 144
CP+C C+ C ++ H EC I+
Sbjct: 285 PCPKCLLVYYCNKDCMNKANEDGHYLECPIM 315
>gi|397593560|gb|EJK55991.1| hypothetical protein THAOC_24202 [Thalassiosira oceanica]
Length = 302
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+S + + D+ CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSSCVPVADDGDETCANCGKQGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKTACKQRA 60
Query: 56 YEIKSSPL 63
E+K L
Sbjct: 61 AELKDEQL 68
>gi|327292924|ref|XP_003231159.1| hypothetical protein TERG_08982 [Trichophyton rubrum CBS 118892]
gi|326466578|gb|EGD92031.1| hypothetical protein TERG_08982 [Trichophyton rubrum CBS 118892]
Length = 1458
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C VC C CK+VAYCG +HQ+ W HK CK
Sbjct: 1416 KCVVCSAPG-KLCLGCKKVAYCGTEHQRHDWANHKTTCK 1453
>gi|301610418|ref|XP_002934751.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 2-like [Xenopus (Silurana) tropicalis]
Length = 430
Score = 42.4 bits (98), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/153 (21%), Positives = 68/153 (44%), Gaps = 13/153 (8%)
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+R + +++ +KD E+H+SK E E I + + L + + L D +A+
Sbjct: 119 ERTASERFLSVKDFESHLSKLD--NEKLELIQNDIAALHRFYSKNLH---YSDNAAQVFL 173
Query: 231 WI---CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
+ C + ++ ++ + LM H+C PN + +++ ++ H
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGDE 233
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + + NC C CS
Sbjct: 234 VITSYIDLLYPTEDRNDRLIDSXFCNCDCRECS 266
>gi|328854461|gb|EGG03593.1| hypothetical protein MELLADRAFT_117274 [Melampsora
larici-populina 98AG31]
Length = 340
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 34/74 (45%), Gaps = 8/74 (10%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATL 72
RCA C L C+ CK YC + HQ+E W HK CK IK S + L+ +
Sbjct: 10 RCARCPSNQDLKSCTGCKVQQYCSETHQREDWTNHKKDCK----SIKKSRM---RLEEAI 62
Query: 73 DLHPGDRIARESPL 86
HP D +PL
Sbjct: 63 QEHPNDPSFNTAPL 76
>gi|406698010|gb|EKD01257.1| hypothetical protein A1Q2_04414 [Trichosporon asahii var. asahii
CBS 8904]
Length = 359
Score = 42.4 bits (98), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 45/98 (45%), Gaps = 10/98 (10%)
Query: 230 HWICGVIEVNGVDIGRYT-QG-LYSVICLMEHNCLPNAKH---SNMMQSKLFVFRDTHIS 284
+W + N + +G + QG +Y +C + H+C+PNA H N+ + RD
Sbjct: 149 YW--STFDTNALPLGPNSPQGAIYPTVCYINHSCIPNANHCWNKNLKMETIHALRDIKKG 206
Query: 285 TMYTNA--LWGTQP-RREHLAITKYFNCSCERCSDPTE 319
T A L GT+ RR H+ F C CE C P +
Sbjct: 207 EEITIAYILEGTRAERRVHVHERLGFECQCEACGAPDD 244
>gi|403417395|emb|CCM04095.1| predicted protein [Fibroporia radiculosa]
Length = 1169
Score = 42.4 bits (98), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 8 EDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
D +C VCR L C C +V YC + QK+HWK HK CK+
Sbjct: 1121 RDPSAHKCFVCRGKGKPRLLACKGCMKVRYCSEACQKKHWKAHKANCKR 1169
>gi|195347882|ref|XP_002040480.1| GM18905 [Drosophila sechellia]
gi|194121908|gb|EDW43951.1| GM18905 [Drosophila sechellia]
Length = 71
Score = 42.4 bits (98), Expect = 0.69, Method: Composition-based stats.
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCSDPTELG 321
Y LW T R+ L +TK+F C C RC DPT G
Sbjct: 3 YAKLLWSTLARKIFLGMTKHFICKCVRCQDPTGAG 37
>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
Length = 384
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 45/211 (21%), Positives = 85/211 (40%), Gaps = 17/211 (8%)
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
RC + F+ C+ C D H EC+ + + E + A + + I R
Sbjct: 17 GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPER 73
Query: 174 SP-KKYQELKDMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHW 231
+P +K +K+ E+H+ K ++ + + + + S LE L D L A+
Sbjct: 74 TPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFLDNDSLVVVFAQV--- 130
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
C + ++ ++ + LM H+C PN + + L R +
Sbjct: 131 NCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPGEEVF 188
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCS 315
T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 189 TSYIDLLYPTEDRNDRLRDSYFFTCECQECT 219
>gi|449541748|gb|EMD32730.1| hypothetical protein CERSUDRAFT_118469 [Ceriporiopsis
subvermispora B]
Length = 523
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 35/73 (47%), Gaps = 9/73 (12%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHK----PKCKKLPYEIKSSPLL---GRYLQATL 72
E CS CK +YC K+ QK+HWK+HK PK K SP L Y+ A
Sbjct: 23 EQPFSICSVCKRASYCSKECQKKHWKVHKLLCQPKGSKPGERPNPSPHLPADSPYVVAET 82
Query: 73 D--LHPGDRIARE 83
D +H GD I E
Sbjct: 83 DPNVHTGDEIRTE 95
>gi|358056892|dbj|GAA97242.1| hypothetical protein E5Q_03919 [Mixia osmundae IAM 14324]
Length = 1165
Score = 42.4 bits (98), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 22/38 (57%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
CA C A H+CS C YC K+ Q W LHKP+C+
Sbjct: 927 CAHCHLRASHRCSLCNTERYCSKECQAARWPLHKPRCQ 964
>gi|169846112|ref|XP_001829772.1| hypothetical protein CC1G_05981 [Coprinopsis cinerea okayama7#130]
gi|116509099|gb|EAU91994.1| hypothetical protein CC1G_05981 [Coprinopsis cinerea okayama7#130]
Length = 277
Score = 42.4 bits (98), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
Query: 14 RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+C C R+ L KC ACK V YCG QK W HK CK L
Sbjct: 233 KCKACGKRDAKLMKCGACKSVNYCGSTCQKADWPAHKAICKSL 275
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 42.0 bits (97), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 44/222 (19%), Positives = 89/222 (40%), Gaps = 33/222 (14%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
N C + WP CS + + + ++L IL + +R
Sbjct: 73 NVDCQKGDWPMHKLECSSMCSSGQNWCPSETVRLTAR------------ILAKQKTQTER 120
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
+ +++ +K+ E+H+SK E E I++ + + L + + + L ++
Sbjct: 121 TASERFMSVKEFESHLSKLD--NEKKELIENDI-----SALHRFYSKNVHNCDNAALEFL 173
Query: 233 CGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
+ NG I ++ + LM H+C PN + +++ ++ H
Sbjct: 174 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTVAEVRAVQEIHAGEE 233
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
+ T Y + L+ T+ R + L + +F+C C CS DP +L
Sbjct: 234 VFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECSTKQKDPAKL 275
>gi|324509870|gb|ADY44135.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 442
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 62/289 (21%), Positives = 112/289 (38%), Gaps = 43/289 (14%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
+ +C + WPA C L P ++ +L ++ N + ++
Sbjct: 48 DTQCQKGGWPAHKEECFFLVKVQPRVP-TSMARLLARVIIRTNQKE----------TIRA 96
Query: 173 RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIV-KYLRSNFLEKLPGDVLDDTSAKCLHW 231
+ + + L + + G +E + + ++ +Y+ +++L P L K +
Sbjct: 97 FNGRTFDSLLSHSDEIKEDGEKSEFFVTLSHVLFEYMGADYLP--PASELLTIFGKVM-- 152
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR---------DTH 282
V ++ D+ GLY + +++H+C P+A + V R D+
Sbjct: 153 -VNVFTISNDDLNTIGLGLYLGLSVLDHSCDPDA--FVLFNGTKAVLRPLKQYITAYDSS 209
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWIL 342
+ Y + L T RR L +F C C C D T S ++C H G C
Sbjct: 210 LRIAYCDLLDLTSMRRNQLKQQFFFTCECSACLDLEREKTARS-VRC--RHCVDGYC--- 263
Query: 343 PVNPLDNDSD----WTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPN 387
PLD + + W CG+ S VHL+ Q+ +V +L QE N
Sbjct: 264 ---PLDVNENSLVCWQCGATSEVHVDEAVHLM-QQVEWEVKRL-QERRN 307
>gi|449549245|gb|EMD40211.1| hypothetical protein CERSUDRAFT_122273 [Ceriporiopsis subvermispora
B]
Length = 704
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 4 IIEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+I EL RC+ CR AL +C C + YC QK HW HK CK
Sbjct: 652 LIRTNSVELYRCSWCRNPSAALRRCGRCGKTRYCDGACQKAHWGEHKAACK 702
>gi|170101224|ref|XP_001881829.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164643184|gb|EDR07437.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 621
Score = 42.0 bits (97), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 24/42 (57%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
E+ + C CS CK V+YC K+ QK HW++HK C+
Sbjct: 90 EVEKYKACDNPGTMACSGCKLVSYCSKECQKYHWRVHKRDCR 131
>gi|299743144|ref|XP_001835571.2| hypothetical protein CC1G_03353 [Coprinopsis cinerea
okayama7#130]
gi|298405520|gb|EAU86142.2| hypothetical protein CC1G_03353 [Coprinopsis cinerea
okayama7#130]
Length = 465
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 2 ASIIEIEDEELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
A + + ++L +C+VC++T L CSAC E YCG + Q W HK +C K
Sbjct: 15 ARTVMLPLKKLEKCSVCQKTDGLRLCSACGEKVYCGSECQTNDWAEHKKQCGK 67
>gi|323452238|gb|EGB08113.1| hypothetical protein AURANDRAFT_71676 [Aureococcus anophagefferens]
Length = 1696
Score = 42.0 bits (97), Expect = 0.74, Method: Composition-based stats.
Identities = 19/58 (32%), Positives = 27/58 (46%), Gaps = 5/58 (8%)
Query: 2 ASIIEIEDEELMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
A + +E E C T L +C+ CK Y K+HQ+ HW LHK C+ +
Sbjct: 1177 AQLARLEPVEFATTRCCARTYATGVPLLRCARCKRTFYASKEHQEAHWPLHKRSCRHV 1234
>gi|221484463|gb|EEE22759.1| hypothetical protein TGGT1_035070 [Toxoplasma gondii GT1]
Length = 859
Score = 42.0 bits (97), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 9/89 (10%)
Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
+Y + H+CLPNA H+ +++++ + ++ Y + + T RR+
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
L+ TK F C+CERCS P + F C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256
>gi|237837951|ref|XP_002368273.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
gi|211965937|gb|EEB01133.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
Length = 859
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 43/89 (48%), Gaps = 9/89 (10%)
Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
+Y + H+CLPNA H+ +++++ + ++ Y + + T RR+
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKV 227
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
L+ TK F C+CERCS P + F C
Sbjct: 228 LSETKDFICTCERCSAPVDFSRGFRCPAC 256
>gi|393229006|gb|EJD36638.1| hypothetical protein AURDEDRAFT_74033 [Auricularia delicata
TFB-10046 SS5]
Length = 239
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 23/42 (54%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C CK AYC K QKEHW HKP C+ + +S GR+
Sbjct: 44 CGGCKFAAYCSKACQKEHWPSHKPICRFTVATVGASRAEGRF 85
>gi|302688639|ref|XP_003033999.1| hypothetical protein SCHCODRAFT_107214 [Schizophyllum commune
H4-8]
gi|300107694|gb|EFI99096.1| hypothetical protein SCHCODRAFT_107214, partial [Schizophyllum
commune H4-8]
Length = 216
Score = 42.0 bits (97), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 21/38 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
C VC +CS C+ V YC K+H + WK HKP C
Sbjct: 16 CRVCGSPTSMRCSRCRVVYYCSKEHIAQDWKNHKPGCN 53
>gi|321471928|gb|EFX82900.1| hypothetical protein DAPPUDRAFT_316237 [Daphnia pulex]
Length = 1216
Score = 42.0 bits (97), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 15 CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC++ L +CS C+ YC + Q HW +HK CKKL
Sbjct: 1175 CAVCKKIPEKLKRCSRCRSTVYCSVECQHSHWPMHKLICKKL 1216
>gi|323449447|gb|EGB05335.1| hypothetical protein AURANDRAFT_66543 [Aureococcus
anophagefferens]
Length = 380
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 1/54 (1%)
Query: 8 EDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
+D+++ RCA C RE A C+ C+ V YC Q+ WK HK C ++S
Sbjct: 11 DDDDVKRCAACGREAARLVCTRCRAVRYCDGDCQRRAWKAHKASCPTRDQRVES 64
>gi|169868772|ref|XP_001840957.1| hypothetical protein CC1G_03186 [Coprinopsis cinerea okayama7#130]
gi|116498115|gb|EAU81010.1| hypothetical protein CC1G_03186 [Coprinopsis cinerea okayama7#130]
Length = 445
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGD 78
E AL C+ CK AYCG++ Q+E WK +HK +C L E + R++ T L GD
Sbjct: 272 ELALKICTNCKVTAYCGERCQREDWKQIHKRECGDLAKEYRKLVDAKRWVSPTTKL--GD 329
Query: 79 RIARESPLIVGPKLA--LAEPICLGCHKPLNPN 109
+A S L + + IC + N N
Sbjct: 330 LLAINSTFRAHDTLVPQILQQICRAGPQATNKN 362
>gi|15930025|gb|AAH15453.1| ANKMY2 protein, partial [Homo sapiens]
Length = 161
Score = 42.0 bits (97), Expect = 0.80, Method: Composition-based stats.
Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A +CS CK V YC + QK HW HK CK L
Sbjct: 40 CTTCGEKGASKRCSVCKMVIYCDQTCQKTHWFTHKKICKNL 80
>gi|449540948|gb|EMD31935.1| hypothetical protein CERSUDRAFT_119267 [Ceriporiopsis subvermispora
B]
Length = 214
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+L +CS C++ YCG +HQ WK HK +CK L
Sbjct: 162 SLSRCSGCRKAMYCGPEHQSGTWKDHKSECKAL 194
>gi|326524007|dbj|BAJ97014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1026
Score = 42.0 bits (97), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
EL CA C KCS CK + YC + Q +HW+ HK KCK++
Sbjct: 26 ELGPCAQCGAAGTKKCSGCKRMRYCSGECQSKHWQSDHKFKCKQM 70
>gi|390599611|gb|EIN09007.1| hypothetical protein PUNSTDRAFT_143642 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 729
Score = 42.0 bits (97), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 12 LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
L RC+ C AL KC C +YC QK HWK HK +C+K
Sbjct: 685 LYRCSYCGNPSAALKKCGGCGNTSYCDASCQKNHWKAHKNQCQK 728
>gi|268579277|ref|XP_002644621.1| C. briggsae CBR-SET-30 protein [Caenorhabditis briggsae]
Length = 634
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 100/238 (42%), Gaps = 35/238 (14%)
Query: 116 CPRC-FWPACSARCSGLSDAHTHAPECAILKLGC------ETLLAYNDYKYEAIL----- 163
C +C CS +C S A H EC L C E +L +Y+ IL
Sbjct: 136 CDKCKVAKFCSKQCQT-SGAFDHKYECGPLH-NCKDLNTDERMLLRIIGRYKDILDGNDK 193
Query: 164 PLRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYL--RSNFLEKLPGDVL 221
+ R S + ++ + A M K V+++I VK +++ ++K L
Sbjct: 194 KIDGFYTNRESGRSVMQIWEHCADMKKDENAMNVFKKIYDRVKQYGDKTHLVDKDTAFQL 253
Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRD 280
+ H I V + +IG+ GLY +C +H+C PNA +S N +KL D
Sbjct: 254 HSRNFINRHSISNVDYLR--EIGK---GLYLDLCRYDHSCRPNAIYSCNGTVAKLRALHD 308
Query: 281 T----HISTMYTNALWGTQP-----RREHLAITKYFNCSCERCSDPTELGTYFSAMKC 329
++ T T+ + P RR L T YF C CERC DP + + +A+ C
Sbjct: 309 NVDLENVET--THYTYIELPPCKIQRRHMLKETWYFECHCERCDDPED--NWLTAVIC 362
>gi|302656038|ref|XP_003019776.1| hypothetical protein TRV_06153 [Trichophyton verrucosum HKI 0517]
gi|291183546|gb|EFE39152.1| hypothetical protein TRV_06153 [Trichophyton verrucosum HKI 0517]
Length = 1485
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C VC C CK+VAYCG +HQ+ W HK CK
Sbjct: 1443 KCVVCSAPG-KLCLGCKKVAYCGTEHQRRDWVNHKTTCK 1480
>gi|397585818|gb|EJK53401.1| hypothetical protein THAOC_27174, partial [Thalassiosira
oceanica]
Length = 144
Score = 42.0 bits (97), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
CA C ET L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 16 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRAHRKEHKKACKQRVAELKDEKLYGQ 72
>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 445
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 64/162 (39%), Gaps = 22/162 (13%)
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEK-LPGDVLDDTSAKCLHWI 232
+ K Y+ D+ H + E S+ LRS E +PG+ + L I
Sbjct: 105 TDKCYRRFWDLMPHEEDIKKDEKRMEHFQSLTVVLRSLIDEAAMPGN-------QELLRI 157
Query: 233 CGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKH-SNMMQSKLFVFRDTH---- 282
G + +N +I G+Y +M+H+C PNA + + + D H
Sbjct: 158 FGKMCINSFNILDDEMNSIGTGMYLGASIMDHSCRPNAVAIFDGCNLNVRLLEDYHGAEI 217
Query: 283 ----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTEL 320
I Y + L T RR+ L YF C CERC D EL
Sbjct: 218 DFSKIFISYIDLLNPTDVRRDMLRKRYYFECGCERCRDEQEL 259
>gi|147838237|emb|CAN78307.1| hypothetical protein VITISV_005598 [Vitis vinifera]
Length = 494
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
C VC KCS CK V YC + Q HWK HK KCK K++
Sbjct: 90 CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 137
>gi|7510854|pir||T30001 hypothetical protein ZC8.3 - Caenorhabditis elegans
Length = 306
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 47/93 (50%), Gaps = 13/93 (13%)
Query: 238 VNGVDIGR-YTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDT----HISTMYTNAL 291
++ VD R +GLY +C +H+C PNA +S N + +KL D ++ T T+
Sbjct: 159 ISNVDYLREIGKGLYLDLCKYDHSCRPNAIYSCNGIVAKLRALHDNVDLENVET--THYT 216
Query: 292 WGTQP-----RREHLAITKYFNCSCERCSDPTE 319
+ P RR L T YF C CERC DP +
Sbjct: 217 YIELPPCKIQRRHMLKETWYFECHCERCDDPDD 249
>gi|170101132|ref|XP_001881783.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164643138|gb|EDR07391.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 428
Score = 42.0 bits (97), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C +C + KC+ C V YCGK+ Q WK HK CK L
Sbjct: 249 CELCGKREASKCTGCFSVVYCGKECQTADWKDHKKTCKTL 288
>gi|198458079|ref|XP_001360905.2| GA17017 [Drosophila pseudoobscura pseudoobscura]
gi|198136215|gb|EAL25480.2| GA17017 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 42.0 bits (97), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC HQ+ HW HK C
Sbjct: 128 CAACGCYAPHACSRCKTIYYCSSAHQRAHWSDHKANC 164
>gi|340718013|ref|XP_003397467.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
domain-containing protein 2-like [Bombus terrestris]
Length = 407
Score = 42.0 bits (97), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C C E A KCS CK V YC ++ Q+ HW +HK C +L I S
Sbjct: 335 CITCGEDKAXQKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNITPS 382
>gi|397624310|gb|EJK67342.1| hypothetical protein THAOC_11640 [Thalassiosira oceanica]
Length = 304
Score = 42.0 bits (97), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+ L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQLYGQGLE 75
>gi|431908963|gb|ELK12554.1| Ankyrin repeat and MYND domain-containing protein 2 [Pteropus
alecto]
Length = 441
Score = 42.0 bits (97), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C CRE A +CS CK V YC + QK HW HK CK L
Sbjct: 320 CTTCREKGASKRCSVCKMVIYCDQTCQKTHWFAHKKMCKNL 360
>gi|195151327|ref|XP_002016599.1| GL11671 [Drosophila persimilis]
gi|194110446|gb|EDW32489.1| GL11671 [Drosophila persimilis]
Length = 348
Score = 42.0 bits (97), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 20/37 (54%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK + YC HQ+ HW HK C
Sbjct: 128 CAACGCYAPHACSRCKTIYYCSSAHQRAHWSDHKANC 164
>gi|397644046|gb|EJK76226.1| hypothetical protein THAOC_02027 [Thalassiosira oceanica]
Length = 271
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG QK H KLHK CK+ E+K L R
Sbjct: 31 LKSCTACRLVKYCGVDCQKAHRKLHKKACKQRAAELKDEQLYSR 74
>gi|297838801|ref|XP_002887282.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297333123|gb|EFH63541.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 392
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 25/50 (50%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C +C C VAYC HQ HW HK +C++L +++ LL +
Sbjct: 12 CVGPPTRRCGQCGSVAYCSVSHQSSHWSYHKEECERLEEQMRRVDLLNDF 61
>gi|405974013|gb|EKC38689.1| Ankyrin repeat and MYND domain-containing protein 2 [Crassostrea
gigas]
Length = 420
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 24/50 (48%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 8 EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
ED+ C R A KCSACK V YC ++ QK HW HK CK+L E
Sbjct: 314 EDDVCSTCGSLR--AKKKCSACKMVCYCDQRCQKLHWSTHKKLCKQLAEE 361
>gi|428180679|gb|EKX49545.1| hypothetical protein GUITHDRAFT_93215 [Guillardia theta CCMP2712]
Length = 483
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 32/55 (58%), Gaps = 1/55 (1%)
Query: 1 MASIIEIED-EELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
M ++ +ED E +CAVC A +CS CK YCG++ Q WK HKP C+ +
Sbjct: 395 MYDMLNVEDFMEDPKCAVCGAPAEKRCSKCKMEWYCGRKCQVAAWKKHKPFCEMI 449
>gi|198425974|ref|XP_002127497.1| PREDICTED: similar to EGL nine homolog 1 [Ciona intestinalis]
Length = 329
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
C VC L CS C+ V YC ++HQ+ WK+HK CK
Sbjct: 11 CQVCNTPEQLRTCSECRTVWYCSREHQRSDWKVHKTNCK 49
>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
Length = 432
Score = 41.6 bits (96), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 101/265 (38%), Gaps = 20/265 (7%)
Query: 61 SPLLGRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
SP GR L+A H GD + A L V + E C L+ RC
Sbjct: 14 SPGKGRGLRALQPFHVGDLLFSCPAYAYVLTVNERGNHCE-YCFA----RKEGLSKCGRC 68
Query: 117 PRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP- 175
+ F+ C+ C D H EC+ + + E + A + + I R+P
Sbjct: 69 KQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPERTPS 125
Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGV 235
+K +K+ E+H+ K E + I S V L + + L D C
Sbjct: 126 EKLLAVKEFESHLEKLD--NEKRDLIQSDVAALHHFYSKHLEFPDTDSLVVLFAQVNCNG 183
Query: 236 IEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHISTMYTNA 290
+ ++ ++ + LM H+C PN + +++ + + T Y +
Sbjct: 184 FTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEEVFTSYIDL 243
Query: 291 LWGTQPRREHLAITKYFNCSCERCS 315
L+ T+ R + L + +F C C+ C+
Sbjct: 244 LYPTEDRNDRLRDSYFFTCQCQECT 268
>gi|67585983|ref|XP_665160.1| MYND finger domain protein [Cryptosporidium hominis TU502]
gi|54655649|gb|EAL34930.1| MYND finger domain protein [Cryptosporidium hominis]
Length = 123
Score = 41.6 bits (96), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 23/48 (47%), Positives = 27/48 (56%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
ED+ L RC CR E L CS CK+VAYC Q++ W HK CK
Sbjct: 76 EDKILNRCESCRKESEKTLLICSRCKKVAYCNVDCQRKDWSYHKQFCK 123
>gi|410912118|ref|XP_003969537.1| PREDICTED: programmed cell death protein 2-like [Takifugu rubripes]
Length = 365
Score = 41.6 bits (96), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 25/57 (43%), Positives = 28/57 (49%), Gaps = 2/57 (3%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK-LPYEIKSSPLLGR 66
L C VC CS C V YCGK HQ HWK HK +C +K+SP L R
Sbjct: 141 LKLCWVCGCLGNKACSRCHMVTYCGKHHQTVHWKHTHKKECSSPETTTVKTSPFLFR 197
>gi|390340692|ref|XP_791789.3| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Strongylocentrotus purpuratus]
Length = 452
Score = 41.6 bits (96), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
C+ C E A KCSACK+V YC QK W HK +CK++ E K
Sbjct: 321 CSTCGEAKASKKCSACKQVNYCDAVCQKLEWFTHKKQCKRIAEEHK 366
>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
mutus]
Length = 383
Score = 41.6 bits (96), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 44/212 (20%), Positives = 86/212 (40%), Gaps = 11/212 (5%)
Query: 110 LADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLI 169
L+ RC + F+ C+ C D H EC+ + + E + A + + I
Sbjct: 11 LSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKI 67
Query: 170 LQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 68 HPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVLF 125
Query: 229 LHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----I 283
C + ++ ++ + LM H+C PN + +++ ++ H +
Sbjct: 126 AQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEEV 185
Query: 284 STMYTNALWGTQPRREHLAITKYFNCSCERCS 315
T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 186 FTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 217
>gi|397647092|gb|EJK77557.1| hypothetical protein THAOC_00606 [Thalassiosira oceanica]
Length = 396
Score = 41.6 bits (96), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC+ V YCG Q+ H KLHK CK+ EIK L
Sbjct: 122 LKDCTACRLVKYCGVDCQRAHRKLHKKACKRRAAEIKDEQL 162
>gi|397646698|gb|EJK77388.1| hypothetical protein THAOC_00784 [Thalassiosira oceanica]
Length = 290
Score = 41.6 bits (96), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG QK H KLH+ CK+ E+K L + L+
Sbjct: 27 LKDCTACRLVKYCGVDCQKAHRKLHRKACKQRAAELKDEKLYSQGLE 73
>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
Length = 441
Score = 41.6 bits (96), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 43/193 (22%), Positives = 82/193 (42%), Gaps = 39/193 (20%)
Query: 237 EVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFR-------DTHISTMYTN 289
E+N + +G Y + + +++H+C PNA + + R + + Y +
Sbjct: 171 EMNPIGVGVYIRQIKPC-SVLDHSCRPNA--VAVFEGTTLRIRCVEPVDSEQDLRISYID 227
Query: 290 ALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFS----AMKCLNEHKDQGDCWILPVN 345
L T RR +L YFNC C C D + FS +++C +G +P
Sbjct: 228 TLDDTTTRRRNLQQQYYFNCLCGECKDSEKDLIKFSFNCTSVEC------KGHVTQVPA- 280
Query: 346 PLDNDSDWTCGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEEM----------L 395
D ++C C ++ DV+L +Q E+ K+++ +K E +
Sbjct: 281 ----DDRFSCDICGTTVD--DVNL--NQAAEKAQKIIKHLNELKKQREHESIRMKGLSCV 332
Query: 396 TKLEAMFHPHHYH 408
+++ A+ HP++ H
Sbjct: 333 SEMRAILHPYNVH 345
>gi|397634288|gb|EJK71362.1| hypothetical protein THAOC_07207, partial [Thalassiosira
oceanica]
Length = 117
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 2 SCVPVMDDGDEACANCGKQGSDTVVLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 61
Query: 58 IKSSPLLGRYLQ 69
++ L G+ L+
Sbjct: 62 LEDEQLYGQGLE 73
>gi|359476454|ref|XP_002271840.2| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Vitis
vinifera]
Length = 940
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
C VC KCS CK V YC + Q HWK HK KCK K++
Sbjct: 87 CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 134
>gi|342881956|gb|EGU82734.1| hypothetical protein FOXB_06742 [Fusarium oxysporum Fo5176]
Length = 498
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 1/43 (2%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
C+ + C+ C+ V YCG + QK HW +HK CK PY K+
Sbjct: 18 CQNIGKYSCANCRLVVYCGSECQKAHWLIHKVDCKS-PYTKKT 59
>gi|393238670|gb|EJD46206.1| hypothetical protein AURDEDRAFT_151705 [Auricularia delicata
TFB-10046 SS5]
Length = 741
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 3/40 (7%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKL-HKPKC 51
CA C E + L KC+AC++V YC KQ QK+ WK H+ +C
Sbjct: 693 CAYCGEESSLLRKCAACRDVRYCSKQCQKKDWKFTHREQC 732
>gi|323447944|gb|EGB03849.1| hypothetical protein AURANDRAFT_67642 [Aureococcus anophagefferens]
Length = 1642
Score = 41.6 bits (96), Expect = 1.0, Method: Composition-based stats.
Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 14 RCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL-PYEIKSSPLLGRY 67
RCA C +HKC+ C+ V YC + QK HW HK C P E K + + +Y
Sbjct: 1555 RCAHCARLPGPTVRVHKCAQCRRVYYCSRDCQKSHWPKHKAFCSTSGPTEPKPNYIPPKY 1614
Query: 68 LQATLD 73
+ D
Sbjct: 1615 HAHSYD 1620
>gi|320162592|gb|EFW39491.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1600
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 24/38 (63%)
Query: 17 VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
VC +TA +C+ CK YC ++ Q WK HKP CK++
Sbjct: 1466 VCGKTAKFRCTMCKSQYYCSRECQTSDWKTHKPACKRM 1503
>gi|397563830|gb|EJK43958.1| hypothetical protein THAOC_37548 [Thalassiosira oceanica]
Length = 300
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L C+AC+ V YCG Q+ H KLHK CK+ E
Sbjct: 2 SCVPVVDDSGEACANCGKQGSDTVKLRNCAACRLVKYCGVDCQRAHRKLHKKACKQRAAE 61
Query: 58 IKSSPLLGRYLQ 69
++ L + L+
Sbjct: 62 LEDEKLYSQGLE 73
>gi|170115743|ref|XP_001889065.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164636013|gb|EDR00313.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 633
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 19/35 (54%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
CR CSACK V+YC K Q HW HK CK
Sbjct: 52 CRNPGTMACSACKLVSYCSKDCQTTHWHRHKKDCK 86
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + ++ + +K
Sbjct: 67 VAKYCSAKCQKKA-WQDHKQECKCLK-SCKPRYPPDSVRLLGRVVIKLMEKTPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++++K + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
IC + ++ GLY + L+ H+C PN + N L RD + +
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|367055708|ref|XP_003658232.1| hypothetical protein THITE_2124766 [Thielavia terrestris NRRL
8126]
gi|347005498|gb|AEO71896.1| hypothetical protein THITE_2124766 [Thielavia terrestris NRRL
8126]
Length = 447
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C T L +CS C+ V YCG+ HQ H +HK C K+
Sbjct: 21 RCPQCTATGPKLLRCSGCRAVRYCGRDHQVAHRTMHKSACTKI 63
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 41.6 bits (96), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 7/101 (6%)
Query: 249 GLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
GLY + L+ H+C PN N +++ V + Y + R+ L
Sbjct: 255 GLYPLASLINHSCQPNCGFYNRGPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLE 314
Query: 304 TKYFNCSCERCSDPT--ELGTYFSAMKCLNEHKDQGDCWIL 342
TK+F+C C RCS P + Y S +C N+ K D ++
Sbjct: 315 TKHFDCLCNRCSPPITDSVDRYLSGFQCPNKAKTSCDGLLV 355
>gi|407408418|gb|EKF31865.1| hypothetical protein MOQ_004294 [Trypanosoma cruzi marinkellei]
Length = 744
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 24/126 (19%)
Query: 239 NGVDIGRYTQGL--YSVICLMEHNCLPNA----KHSNMMQSKLFV-FRDT-------HIS 284
NGV +G G+ + I L+EH+C PNA +H +S +F R T I+
Sbjct: 362 NGVSLGTLETGIGVFGGISLIEHSCHPNALVVFRHGCTSESIVFAELRATRPIGIGERIT 421
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCS---DPTELGTYFSAMKCLNEH-----KDQ 336
Y + R++ L +F+C+C+ C+ D T L F++ L E K +
Sbjct: 422 IAYVPTFLPKEERKKRLRAKFFFSCACDHCTAGHDTTRL--MFASCGSLEERVVMCPKGR 479
Query: 337 GDCWIL 342
G W L
Sbjct: 480 GSEWTL 485
>gi|397602518|gb|EJK58199.1| hypothetical protein THAOC_21697 [Thalassiosira oceanica]
Length = 309
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C +T L C+AC+ V YCG Q+ H KLHK CK+ E+K L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRAHRKLHKRACKQRAAELKDEEL 68
>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
dendrobatidis JAM81]
Length = 578
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 68/325 (20%), Positives = 110/325 (33%), Gaps = 60/325 (18%)
Query: 55 PYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADN- 113
P +K++ GR+ A L PG + ESP A C C K N A
Sbjct: 16 PITLKATKSKGRHFVAKSQLQPGTTVLVESPCAFVLFKDNARDYCSYCMKRFNRMTAATR 75
Query: 114 ---------------------ARCPRC----FWPACSARCSGLSDAH------------- 135
C C FW CS C AH
Sbjct: 76 PASTETSLFSLSSQSDSSNTCIVCTDCRNQTFW--CSESCKKADQAHQLICAAIVELPGI 133
Query: 136 --THAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDMEAHM-SKRG 192
H+ + A+++L + L + EAI + +P + D+ +H S +G
Sbjct: 134 AGMHSVDYALVRLIVQVL---ANTAVEAIWQMDSSCRVESTPMAC--ISDLLSHQKSYQG 188
Query: 193 PGTEVY----EEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ 248
P E+I + + +L+ N K +++ + + GV + G G
Sbjct: 189 PWLNAIDCAAEDIAACLSHLKINDSLKSKQEIIK-LACRINSNSHGVYDATGNKSGEIGV 247
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYTNALWGTQPRREHLA 302
G++ + ++ H+C PN ++ V T +S Y + RR L
Sbjct: 248 GMFPFVAMLNHSCAPNCAFVTSQSGQMLVRTLTQVSPGTELCVGYIDLFTPRWERRGKLL 307
Query: 303 ITKYFNCSCERCSDPTELGTYFSAM 327
TK F C+C RC E + +
Sbjct: 308 TTKLFWCTCIRCGPRAEETVVLTTL 332
>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
Length = 333
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRR 298
Y G++ + H+CLPNA N Q + VFR I T YT+ RR
Sbjct: 68 YGLGIFPTGSYLNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERR 126
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
+HL +F C C++C +L S + C N+ + +P N
Sbjct: 127 KHLLKQYFFFCQCQQCKFQKKLADQ-SCLSCKNQLSSSNTSFKIPSN 172
>gi|397643232|gb|EJK75731.1| hypothetical protein THAOC_02539 [Thalassiosira oceanica]
Length = 295
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 31/63 (49%), Gaps = 12/63 (19%)
Query: 13 MRCAVCRET------------ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
M+C+ C ET L C+AC+ V YCG + QK H K HK CK+ EIK
Sbjct: 1 MKCSPCEETCANCGGRGSDSLKLKDCTACRLVKYCGVECQKVHRKQHKTICKQRAAEIKD 60
Query: 61 SPL 63
L
Sbjct: 61 EQL 63
>gi|397575638|gb|EJK49803.1| hypothetical protein THAOC_31288 [Thalassiosira oceanica]
Length = 342
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 34/66 (51%), Gaps = 5/66 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C TA L C+AC+ V YCG Q+ H KLHK CK+ E
Sbjct: 42 SCVPVTDDCDEACANCGLTASDTAKLKNCNACRLVKYCGVDCQRAHRKLHKKACKQRAAE 101
Query: 58 IKSSPL 63
+K L
Sbjct: 102 LKDEQL 107
>gi|392588951|gb|EIW78282.1| hypothetical protein CONPUDRAFT_167323 [Coniophora puteana
RWD-64-598 SS2]
Length = 381
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 32/61 (52%), Gaps = 7/61 (11%)
Query: 9 DEELMR---CAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
+E LM CA C R + +CSAC YC + Q+ HWK HKP C P+ K++
Sbjct: 189 EEGLMEDKNCANCHKRRERMKQCSACHTTMYCSTECQQSHWKAHKPSCN--PFSGKNTVT 246
Query: 64 L 64
L
Sbjct: 247 L 247
>gi|290975218|ref|XP_002670340.1| predicted protein [Naegleria gruberi]
gi|284083898|gb|EFC37596.1| predicted protein [Naegleria gruberi]
Length = 700
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 14 RCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+CA C +T L +CS C++V YC QK HW HK CKK
Sbjct: 652 KCANCNKTGTKSAPLKRCSGCQKVFYCSGNCQKTHWSSHKTACKK 696
>gi|301107195|ref|XP_002902680.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262098554|gb|EEY56606.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 337
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 24/39 (61%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
L KC+ C++ YC + QKE WKLHK C K EI +S
Sbjct: 293 LSKCARCRDAVYCSRDCQKEAWKLHKRLCGKTTEEITAS 331
>gi|395330558|gb|EJF62941.1| hypothetical protein DICSQDRAFT_83701 [Dichomitus squalens LYAD-421
SS1]
Length = 398
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 20/36 (55%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
R +CS CKEV YC QKE W HK KCK+
Sbjct: 320 VRAAEFKRCSGCKEVIYCSPSCQKEDWPAHKKKCKE 355
>gi|440794150|gb|ELR15321.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 974
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 3/50 (6%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
RCA+C R L CS C+ V YCG +HQ HW++ H+ C+ P+ + S
Sbjct: 887 RCAICSRVDQLRVCSGCRLVHYCGPEHQNLHWRVSHRGSCEG-PFRLLSG 935
>gi|189199084|ref|XP_001935879.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187982978|gb|EDU48466.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 276
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 232 ICGVIEVNGVDIGRYTQ--GLYSVICLMEHNCLPNA-----KHSNMMQSKLF--VFRDTH 282
G+ + N + G T+ +Y+ + L+ H+C+PN+ +H NM +
Sbjct: 38 FSGIFQTNALPCGSETKLYAVYATVSLINHDCMPNSSPNWNRHLNMQTIHALCDIKAGEE 97
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
I+ YTNA+ TQ R+ +L F+C+C CS
Sbjct: 98 ITVCYTNAV-TTQERQVYLKQHFGFDCACSICS 129
>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
rotundata]
Length = 438
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 68/156 (43%), Gaps = 18/156 (11%)
Query: 174 SPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWIC 233
+ K++ KD+ +H S + E + L FL GD SA+ L I
Sbjct: 107 TKTKFRRFKDLMSHYSDIKKDEKKMEHFMFLCGVLFG-FL----GDTPMPNSAE-LMGIY 160
Query: 234 GVIEVNG-----VDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRD------T 281
G I +N +D+ G+Y +++H+C+PNA + + + D +
Sbjct: 161 GRICINSFNIFDLDMNSIGVGIYLAPSVVDHSCVPNAVATFEGITLNIRTIEDLPSLDWS 220
Query: 282 HISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
I Y + L T+ RR L + YF C+C++C +P
Sbjct: 221 QIRISYIDVLKTTKERRSELQSSYYFWCNCKKCEEP 256
>gi|339236015|ref|XP_003379562.1| Egl nine protein [Trichinella spiralis]
gi|316977745|gb|EFV60808.1| Egl nine protein [Trichinella spiralis]
Length = 395
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 30/54 (55%), Gaps = 5/54 (9%)
Query: 6 EIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
++ + ++ C+ C + + L KCS C +V YC + HQ++HW HK C +
Sbjct: 4 QMVNRSVLLCSFCGKPSSDNVPLFKCSGCLKVCYCNRLHQQQHWNDHKKHCDNM 57
>gi|449686514|ref|XP_002164124.2| PREDICTED: ankyrin repeat and MYND domain-containing protein
2-like, partial [Hydra magnipapillata]
Length = 317
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/58 (36%), Positives = 30/58 (51%), Gaps = 2/58 (3%)
Query: 3 SIIEIEDEELMRCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
SI ++ + C C + +KCS+CK V YC + QK HW HK C++L E
Sbjct: 227 SINGLQSADFTECCSCCGNRRSFNKCSSCKMVKYCNQSCQKLHWGTHKKFCERLKEEF 284
>gi|397630366|gb|EJK69745.1| hypothetical protein THAOC_08966 [Thalassiosira oceanica]
Length = 716
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+ L+
Sbjct: 30 LKSCTACRLVKYCGVDCQRAHRKQHKKTCKQRAAELKDEELYGQGLE 76
>gi|307213998|gb|EFN89205.1| hypothetical protein EAI_07708 [Harpegnathos saltator]
Length = 128
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESP 85
C CK ++YCG++H++ H+ H C + +KS P Y ATL+L+ D I
Sbjct: 3 CDQCKMISYCGEKHKQMHYTQHMEFCAVIQKLLKSYP----YFWATLELNLEDWIQSRKE 58
Query: 86 LIVGPKLALAEPI 98
L+ K L+ +
Sbjct: 59 LVHLTKQELSRAL 71
>gi|440792097|gb|ELR13325.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 480
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 31/67 (46%), Gaps = 14/67 (20%)
Query: 7 IEDEELMRCAVCRETALHK------------CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+ D + RCA+C + + CS C+EV YC Q+EHW HKP C
Sbjct: 407 LPDAGMQRCAMCAKQVKGRNTDGKCGVQVKWCSRCREVGYCSPACQREHWPAHKPACA-- 464
Query: 55 PYEIKSS 61
P I +S
Sbjct: 465 PASITAS 471
>gi|302500465|ref|XP_003012226.1| hypothetical protein ARB_01486 [Arthroderma benhamiae CBS 112371]
gi|291175783|gb|EFE31586.1| hypothetical protein ARB_01486 [Arthroderma benhamiae CBS 112371]
Length = 1480
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 22/39 (56%), Gaps = 1/39 (2%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C VC C CK+VAYCG +HQ+ W HK CK
Sbjct: 1438 KCVVCSAPG-KLCLGCKKVAYCGTEHQRLDWANHKATCK 1475
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 IAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMQETPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++++K + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLALTFQHFMREEIQDASQLPPSFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
IC + ++ GLY + L+ H+C PN + N L RD ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRTVRDVEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F+C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFDCDCFRC 266
>gi|321474405|gb|EFX85370.1| hypothetical protein DAPPUDRAFT_237975 [Daphnia pulex]
Length = 842
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/42 (45%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 15 CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC++ L +CS C+ YC + Q HW +HK CKKL
Sbjct: 801 CAVCKKIPEKLKRCSRCRSTFYCSVECQHSHWPMHKILCKKL 842
>gi|432113963|gb|ELK36028.1| SET and MYND domain-containing protein 3 [Myotis davidii]
Length = 239
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 16 AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK-----KLPYEIKSSPLLGRYLQA 70
AV R+ L +CS C+ YCG + QK+ W HK +CK K Y S LLGR +
Sbjct: 51 AVRRKEKLMRCSQCRVAKYCGAKCQKKAWPDHKRECKCLKSCKPRYPPDSVRLLGRVVVK 110
Query: 71 TLDLHPGD 78
++ P +
Sbjct: 111 LMEETPSE 118
>gi|307214000|gb|EFN89207.1| hypothetical protein EAI_07710 [Harpegnathos saltator]
Length = 149
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESP 85
C CK ++YCG++H++ H+ H C + +KS P Y ATL+L+ D I
Sbjct: 1 CDQCKMISYCGEKHKQMHYTQHMEFCAVIQKLLKSYP----YFWATLELNLEDWIQSRKE 56
Query: 86 LIVGPKLALAEPI 98
L+ K L+ +
Sbjct: 57 LVHLTKQELSRAL 69
>gi|346324100|gb|EGX93697.1| Zinc finger domain-containing protein, MYND-type [Cordyceps
militaris CM01]
Length = 429
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 14 RCAVC--RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC+ C R L +C C V YC ++HQ HW HKP C ++
Sbjct: 18 RCSHCQARAAKLLRCVGCSAVRYCNREHQTAHWPHHKPLCVQI 60
>gi|356498052|ref|XP_003517868.1| PREDICTED: uncharacterized protein LOC100808017 [Glycine max]
Length = 380
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 26/46 (56%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
A C+ C+ VAYC HQ HW HK +C +L ++KS +L +
Sbjct: 20 ATRLCARCEAVAYCSLSHQIAHWSRHKHECDRLQQQLKSVEVLNNF 65
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
C + ++ ++ + LM H+C PN + +++ ++ H
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 235
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
Length = 395
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 23 GLSKCGRCKQAFY--CNVECQR-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 79
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 80 IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDNDSLVVL 137
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
C + ++ ++ + LM H+C PN + +++ ++ H
Sbjct: 138 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 197
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 198 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 230
>gi|345330186|ref|XP_001505544.2| PREDICTED: annexin A7-like [Ornithorhynchus anatinus]
Length = 584
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+LH+C C+ V YCG + Q+ +W +HK CKKL
Sbjct: 171 SLHRCKRCQNVYYCGPECQRANWPVHKRFCKKL 203
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 106/278 (38%), Gaps = 39/278 (14%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARC 116
E+ +S GR L+AT + D I E +L +C C K RC
Sbjct: 10 EVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFK----RQERLHRC 65
Query: 117 PRC-FWPACSARCSGLSDAH-THAPECAILKLGCETLLAYNDYKYEAI-LPLRCLILQRR 173
+C F C C DA H EC+ +K Y E I L R + R
Sbjct: 66 GQCKFAHYCDRTCQ--KDAWLNHKNECSAIK-------RYGKVPNENIRLAARIMWRVER 116
Query: 174 SPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKC 228
E + D++ H+ G EE ++ FL+ P S +
Sbjct: 117 EGTGLTEGCLVSVDDLQNHVEHFG------EEEQKELRVDVDTFLQYWPPQS-QQFSMQY 169
Query: 229 LHWICGVIEVNGVDIG--RYTQ----GLYSVICLMEHNCLPNAK---HSNMMQSKLF--V 277
+ I GVI NG + R Q G++ + L+ H+C PN ++ ++ + +
Sbjct: 170 ISHIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGKIELRALGKI 229
Query: 278 FRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
++ Y + L ++ RR+ L YF+CSCE C
Sbjct: 230 SEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQ 267
>gi|242002976|ref|XP_002422563.1| ankyrin repeat and mynd domain-containing protein, putative
[Pediculus humanus corporis]
gi|212505353|gb|EEB09825.1| ankyrin repeat and mynd domain-containing protein, putative
[Pediculus humanus corporis]
Length = 331
Score = 41.2 bits (95), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query: 9 DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
D ++ CA C E KCS CK V YC K QK HW +HK C
Sbjct: 250 DSDVDSCATCGEEKRSKKCSRCKSVQYCDKDCQKYHWFVHKKVC 293
>gi|440490913|gb|ELQ70409.1| hypothetical protein OOW_P131scaffold00034g31 [Magnaporthe oryzae
P131]
Length = 462
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC E A L C+ C+ V YCG++HQ++H HK C +
Sbjct: 12 CAVCGEKAGLLFCTGCRVVHYCGRKHQEDHRPAHKSACNAI 52
>gi|357613394|gb|EHJ68480.1| pcdc2/rp-8 [Danaus plexippus]
Length = 351
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 23/40 (57%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C +C A CS CK+V YC +QHQ WK HK +C L
Sbjct: 131 CYICGARAPSHCSKCKKVYYCSRQHQILDWKNHKIECPNL 170
>gi|440794148|gb|ELR15319.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 715
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 30/50 (60%), Gaps = 3/50 (6%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
RCA+C R L CS C+ V YCG +HQ HW++ H+ C+ P+ + S
Sbjct: 628 RCAICSRVDQLRVCSGCRLVHYCGPEHQNLHWRVSHRGSCEG-PFRLLSG 676
>gi|397615238|gb|EJK63308.1| hypothetical protein THAOC_16041 [Thalassiosira oceanica]
Length = 397
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 26/44 (59%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L KC+AC+ V YC + QK H K HK CKK E++ L G+
Sbjct: 27 LKKCNACQLVKYCSVECQKAHRKQHKKACKKRSAELRDETLFGQ 70
>gi|321460587|gb|EFX71628.1| hypothetical protein DAPPUDRAFT_255583 [Daphnia pulex]
Length = 981
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA C++ + L +CS C+ YC ++ Q++HW HK CKK+
Sbjct: 940 CAACKKVSQSLKRCSRCRSTVYCCEECQRQHWAQHKMDCKKV 981
>gi|390596497|gb|EIN05899.1| hypothetical protein PUNSTDRAFT_145782 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 665
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/35 (45%), Positives = 21/35 (60%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
C + L CS C+ V YC K+ Q +HWK+H CK
Sbjct: 100 CNKEGLFSCSTCRLVKYCSKECQAKHWKIHSRDCK 134
>gi|389742410|gb|EIM83597.1| hypothetical protein STEHIDRAFT_141295 [Stereum hirsutum FP-91666
SS1]
Length = 678
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 25/48 (52%), Gaps = 5/48 (10%)
Query: 15 CAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
C VC +T +L +CS CK YC + QK WK HK C LP E
Sbjct: 453 CQVCSKTPEGDESLKRCSICKNRFYCSSRCQKRDWKKHKVDCSLLPSE 500
>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 635
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 110/320 (34%), Gaps = 70/320 (21%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
I S GR+L AT + PGD + E+P + + CH L+ NL +
Sbjct: 246 ISFSEKYGRHLVATKEFKPGDVVTIENPYAY---VIYTQRYYTHCHHCLSRSYNLIPCSY 302
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
CP + CS +C L+ H EC I+ L L D +R LI+
Sbjct: 303 CPVAQY--CSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTKIIRFLIVVTSKG 360
Query: 176 KKYQELK-DMEAHMS---KRGPGTEVYEEIDS---------------------------- 203
KK+ EL+ DME S R G +DS
Sbjct: 361 KKFDELRVDMELAESNPDNRSAGFTDEGILDSTSARSALSLATNMTTRPLIGISAFACIS 420
Query: 204 ----IVKYLRSNFL-EKLPGDVLDDT--------SAKCLHWICGVIEVN--------GVD 242
I+ ++NF K D L D S + C ++ N G+
Sbjct: 421 ALAAILLATQTNFFCNKYEVDQLKDISNYPKIIFSGSLMFRACVIMSSNCFSVQQEPGIK 480
Query: 243 IGRYTQGLYSVICLMEHNCLPNA----KHSNMMQSKLF-VFRDTHISTMY--TNALWGTQ 295
G GLY L H+C PN + M+ L ++ I T Y A
Sbjct: 481 TG---SGLYVTHSLYNHSCAPNTFRHFEELTMITRALRPIYPGDQIFTNYGAAYAYMTKS 537
Query: 296 PRREHLAITKYFNCSCERCS 315
RRE + +F C C C+
Sbjct: 538 ERREKIIQDYFFECDCIACA 557
>gi|296083782|emb|CBI23999.3| unnamed protein product [Vitis vinifera]
Length = 892
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 24/48 (50%), Gaps = 1/48 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPYEIKSS 61
C VC KCS CK V YC + Q HWK HK KCK K++
Sbjct: 11 CLVCGNLGTKKCSGCKAVRYCSQTCQATHWKSGHKTKCKDFQLSGKAN 58
>gi|321449733|gb|EFX62041.1| hypothetical protein DAPPUDRAFT_302174 [Daphnia pulex]
Length = 211
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA C++ + L +CS C+ YC ++ Q++HW HK CKK+
Sbjct: 170 CAACKKVSQSLKRCSRCRSTVYCCEECQRQHWAQHKMDCKKV 211
>gi|397629349|gb|EJK69320.1| hypothetical protein THAOC_09432 [Thalassiosira oceanica]
Length = 872
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 22/37 (59%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
L C AC+ V YC Q+ HWK HK CKKL E++
Sbjct: 24 LKICKACRSVKYCSVGCQRAHWKQHKAACKKLAKELE 60
>gi|326430459|gb|EGD76029.1| hypothetical protein PTSG_00738 [Salpingoeca sp. ATCC 50818]
Length = 1663
Score = 41.2 bits (95), Expect = 1.4, Method: Composition-based stats.
Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 12 LMRCAVCRETALH------KCSACKEVAYCGKQHQKEHWKLHKPKCK 52
L +C C A H +C+ CK+V YC Q+ HW HK C+
Sbjct: 1206 LAKCHACATLAEHATGALKRCARCKQVCYCSTACQQSHWPQHKSSCR 1252
>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
Length = 433
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 44/213 (20%), Positives = 86/213 (40%), Gaps = 11/213 (5%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKRDLIQSDIAALHHFYSKHLEFPDSDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH---- 282
C + ++ ++ + LM H+C PN + +++ ++ H
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSXIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGEE 235
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|397624311|gb|EJK67343.1| hypothetical protein THAOC_11642 [Thalassiosira oceanica]
Length = 1256
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG QK H KLH+ CK+ E+K L + L+
Sbjct: 982 LKDCTACRLVKYCGVDCQKAHRKLHREACKQRAAELKDEQLYSQGLE 1028
>gi|397636925|gb|EJK72461.1| hypothetical protein THAOC_06008, partial [Thalassiosira
oceanica]
Length = 328
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 5/68 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + + D+ CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 24 MSCFVPVADDGYEACANCGKQGSDTVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 83
Query: 56 YEIKSSPL 63
E+K L
Sbjct: 84 AELKDEQL 91
>gi|397625914|gb|EJK67961.1| hypothetical protein THAOC_10918, partial [Thalassiosira oceanica]
Length = 383
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S I ++D CA C + L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 80 SCIAVDDGSAGVCANCGKHGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 139
Query: 58 IKSSPLLGRYLQ 69
+K L + L+
Sbjct: 140 LKDERLYSQGLE 151
>gi|397620610|gb|EJK65805.1| hypothetical protein THAOC_13300 [Thalassiosira oceanica]
Length = 876
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 5/71 (7%)
Query: 1 MASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M S + D+ + CA C + L C+AC+ V YCG Q+ H K HK C +
Sbjct: 505 MMSCAPVADDGVEACANCGKQGSGTVTLKNCTACRLVKYCGVDCQRAHRKQHKRACNQRV 564
Query: 56 YEIKSSPLLGR 66
E+K+ L G+
Sbjct: 565 EELKNEQLYGQ 575
>gi|392562230|gb|EIW55411.1| hypothetical protein TRAVEDRAFT_23090 [Trametes versicolor FP-101664
SS1]
Length = 1167
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D +C+VCR + L CS C +V YC + QK+ W HK KCK
Sbjct: 1119 RDPAAHKCSVCRGRGQPKLKACSGCSKVRYCSQACQKKDWPRHKSKCK 1166
>gi|390599613|gb|EIN09009.1| hypothetical protein PUNSTDRAFT_143643 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 729
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 12 LMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
L C+ C+ + L KC C YC QK+HWKLHK C++
Sbjct: 684 LYECSYCQNASAVLKKCGGCGNTRYCNSSCQKKHWKLHKASCRR 727
>gi|195455340|ref|XP_002074677.1| GK23035 [Drosophila willistoni]
gi|194170762|gb|EDW85663.1| GK23035 [Drosophila willistoni]
Length = 348
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 20/37 (54%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C A H CS CK V YC HQ+ HW HK C
Sbjct: 128 CAACGCHAPHACSRCKAVHYCSSAHQRVHWTQHKATC 164
>gi|170093361|ref|XP_001877902.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
[Laccaria bicolor S238N-H82]
gi|164647761|gb|EDR12005.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
[Laccaria bicolor S238N-H82]
Length = 227
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 14 RCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL-PYEIKSSPLL 64
RC+ C+ ++ CS CK YC K+ QK HW++HKP C+ P E+ LL
Sbjct: 5 RCSKCQFGHNEVNACSKCKTAWYCSKECQKAHWEIHKPLCRPYSPNEVWGIKLL 58
>gi|397582294|gb|EJK52236.1| hypothetical protein THAOC_28513, partial [Thalassiosira
oceanica]
Length = 69
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + + D++ CA C A L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSCFVPVADDDNEVCANCGTAASDSVKLRDCAACRLVKYCGVDCQRAHRKQHKKVCKQRV 60
Query: 56 YEIKSSPL 63
E+K L
Sbjct: 61 AELKDEEL 68
>gi|302692188|ref|XP_003035773.1| hypothetical protein SCHCODRAFT_105219 [Schizophyllum commune
H4-8]
gi|300109469|gb|EFJ00871.1| hypothetical protein SCHCODRAFT_105219, partial [Schizophyllum
commune H4-8]
Length = 213
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 28/56 (50%)
Query: 8 EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
+ +E CA+C A +CS C + YC K+HQ W HK CK + + P+
Sbjct: 5 DTQERRCCAICSYPAPSQCSGCGQAFYCSKEHQTLAWSKHKRLCKIYQRQARGEPV 60
>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
Length = 403
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 46/107 (42%), Gaps = 9/107 (8%)
Query: 246 YTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRD-------THISTMYTNALWGTQPRR 298
Y G++ + H+CLPNA N Q + VFR I T YT+ RR
Sbjct: 138 YGLGIFPTGSYLNHSCLPNAFWYNDDQG-MMVFRTLRPIKKGEEILTSYTDITTECSERR 196
Query: 299 EHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVN 345
+HL +F C C++C +L S + C N+ + +P N
Sbjct: 197 KHLLKQYFFFCQCQQCKFQKKLADQ-SCLSCKNQLSSSNTSFKIPSN 242
>gi|397582835|gb|EJK52436.1| hypothetical protein THAOC_28284 [Thalassiosira oceanica]
Length = 377
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 7/70 (10%)
Query: 1 MASIIEIEDEELMRCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
M+ I ++D CA C + + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSRCIAVDDGSAGVCANCGKGSDAAGTVKLRNCNACRLVKYCGVDCQRAHRKQHKKACKQ 60
Query: 54 LPYEIKSSPL 63
E+K L
Sbjct: 61 RVAELKDEQL 70
>gi|449435374|ref|XP_004135470.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
[Cucumis sativus]
gi|449478708|ref|XP_004155398.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
[Cucumis sativus]
Length = 376
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
R C +A +C C VAYC HQ +HW HK +CK+ +++ L +
Sbjct: 8 RGTRCTGSATRRCGRCGVVAYCSVDHQVKHWNDHKDECKRFELQMERIDALNEF 61
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 25/99 (25%), Positives = 43/99 (43%), Gaps = 5/99 (5%)
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTH----ISTMY 287
C ++ ++ GLY + L+ H+C PN N + +L +D + + Y
Sbjct: 243 CNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFNGTELQLRAVQDINPGEELRISY 302
Query: 288 TNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
L T+ R++ L +F C C+RCS + G S
Sbjct: 303 IETLSLTEDRQKQLEEQYHFTCHCQRCSSKEQDGVMLSG 341
>gi|397563809|gb|EJK43951.1| hypothetical protein THAOC_37556 [Thalassiosira oceanica]
Length = 299
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 41/86 (47%), Gaps = 7/86 (8%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR-YLQATLDLHPGDRIA 81
L C+AC+ V YCG Q+ H K HK CK+ E+K L + Y + +D P
Sbjct: 27 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSQGYERPEVDFCP--ICT 84
Query: 82 RESPLIVGPKLALA----EPICLGCH 103
PLI+G A + IC GC+
Sbjct: 85 LPIPLIMGEHSVFAVCCMKQICNGCN 110
>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
Length = 536
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 11/213 (5%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 164 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 220
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 221 IYPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 278
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTH 282
C + ++ ++ + LM H+C PN + +++ +
Sbjct: 279 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIKTGEE 338
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 339 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 371
>gi|440475247|gb|ELQ43941.1| hypothetical protein OOU_Y34scaffold00119g4 [Magnaporthe oryzae
Y34]
Length = 459
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CAVC E A L C+ C+ V YCG++HQ++H HK C +
Sbjct: 12 CAVCGEKAGLLFCTGCRVVHYCGRKHQEDHRPAHKSACNAI 52
>gi|397597649|gb|EJK57054.1| hypothetical protein THAOC_22942 [Thalassiosira oceanica]
Length = 270
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C ET L C+AC+ V YCG Q+ H K HK CK+ E+K L
Sbjct: 13 CANCGETGSDAVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKEEQL 66
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 26/39 (66%), Gaps = 1/39 (2%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
RC +CR+ L +CS CK YCG + QK+ W+ HK +CK
Sbjct: 51 RC-LCRKEKLLRCSQCKVARYCGSECQKQAWQDHKRECK 88
>gi|321462662|gb|EFX73684.1| hypothetical protein DAPPUDRAFT_307653 [Daphnia pulex]
Length = 1021
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 23/52 (44%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 7 IEDEELMR--CAVCRET--ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
IEDE CAVC++ L +CS C+ YC + Q HW HK CKKL
Sbjct: 970 IEDEPADEKCCAVCKKIPEKLKRCSRCRSAVYCSVECQHSHWPTHKHICKKL 1021
>gi|307189954|gb|EFN74190.1| Ankyrin repeat and MYND domain-containing protein 2 [Camponotus
floridanus]
Length = 405
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C C E A KCS CK V YC ++ Q+ HW +HK C +L + +S
Sbjct: 333 CVTCGEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARLDPSLANS 380
>gi|326673176|ref|XP_002664281.2| PREDICTED: egl nine homolog 1-like [Danio rerio]
Length = 328
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 9 DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
D + C +C + L KC C+ YC K+HQKEHWK HK C++
Sbjct: 11 DGDKQYCELCGKMENLLKCGRCRISFYCSKEHQKEHWKKHKLTCRE 56
>gi|322800676|gb|EFZ21610.1| hypothetical protein SINV_05901 [Solenopsis invicta]
Length = 407
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A KCS CK V YC ++ Q+ HW +HK C +L
Sbjct: 335 CVTCSEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375
>gi|170102677|ref|XP_001882554.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164642451|gb|EDR06707.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 645
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 111/285 (38%), Gaps = 55/285 (19%)
Query: 203 SIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVD-IGRYTQGLYSV-------- 253
S+V+YL + LE+L + +TS + VD I R+T +++
Sbjct: 242 SLVRYLGLSSLEELAAFGMQNTS-------------DLVDLISRFTTNTFTITSPTLAPL 288
Query: 254 -------ICLMEHNCLPNA---------KHSNMMQ--SKLFVFRDTHISTMYTNALWGTQ 295
+ L+ H+C PNA + +MQ + ++ D I T Y + T
Sbjct: 289 GACVSPSVALINHSCDPNAAVVFPRSAKEGEPLMQVVALKYIGPDEEILTAYIDTTLPTG 348
Query: 296 PRREHLAITKYFNCSCERCS-DPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT 354
R++ L T +F C C CS PT + AM C K G C + P + +S
Sbjct: 349 LRQQALKETYHFVCECPLCSPPPTSSVDWREAMWC--PKKCGGVCRV----PTEGNSLTR 402
Query: 355 CGSCSARLNARDVHLVTSQLG-EQVDKL--VQENPNVKSLEEMLTKLEAMFHPHHYHCYA 411
C C A + D L ++G E +DK VQ + K+L+ + +
Sbjct: 403 CEKCKAPVKDTDAVLDALRVGQEGLDKAEKVQFSDPAKALQLTTNLIPILI---SAGLVP 459
Query: 412 VKHSLIQLYGTQPGYAYTQLSSSLLERKIS--YVMSPNLKATDEP 454
H L+ L + L SS L S + SP ++A P
Sbjct: 460 ASHPLLGLSRLHTSFLIANLGSSPLSAAGSDEEIQSPQVQAEATP 504
>gi|432910654|ref|XP_004078459.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Oryzias latipes]
Length = 415
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E K CS CK V YC + QK HW HK CKKL
Sbjct: 320 CTTCGEKGAQKRCSICKTVIYCDQTCQKMHWFTHKKVCKKL 360
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTALRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGTPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|397574117|gb|EJK49043.1| hypothetical protein THAOC_32119, partial [Thalassiosira oceanica]
Length = 1069
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K+HK CK+ EIK L + L+
Sbjct: 415 LKNCNACRLVKYCGVDCQRAHRKVHKKACKQRAAEIKDEQLYSQGLK 461
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|397567134|gb|EJK45413.1| hypothetical protein THAOC_35971 [Thalassiosira oceanica]
Length = 262
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 23/41 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC+ V YCG QK H K HK CKK EIK L
Sbjct: 37 LKSCTACRLVKYCGVDCQKAHRKQHKKACKKRAAEIKDEQL 77
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|397588711|gb|EJK54371.1| hypothetical protein THAOC_26007, partial [Thalassiosira
oceanica]
Length = 284
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H KLHK CK+ E+K L + L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQLYSQGLE 75
>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
Length = 477
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 14 RCAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C T L KCS C+ V YCG QK WKLH+ +C+ L
Sbjct: 51 RCDGCFSTTNLSKCSRCQVVWYCGTPCQKSEWKLHRLECQAL 92
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 56/248 (22%), Positives = 91/248 (36%), Gaps = 63/248 (25%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR L T D HPGD I + P + P + C GC NL+ +RC ++ C
Sbjct: 22 GRSLFTTRDFHPGDVIISQEPYVCVP----TQKRCDGCFS--TTNLSKCSRCQVVWY--C 73
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
C E + +L C+ L + K +++ P L+L+ +K Q+ K +
Sbjct: 74 GTPC--------QKSEWKLHRLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKII 125
Query: 185 EAH---------------------MSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGD--- 219
+ +S R E + S+V L + + + D
Sbjct: 126 PSTAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKE 185
Query: 220 ---VLDDTSAKCLHWICGVIEVNGVDIGRYTQ-------------------GLYSVICLM 257
VL A +H I E+N +I + GLY V+ ++
Sbjct: 186 EQLVLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSII 245
Query: 258 EHNCLPNA 265
H+CLPN+
Sbjct: 246 NHSCLPNS 253
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 40.8 bits (94), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 17 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 68
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 69 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 126
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 127 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 181
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 182 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 241
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 242 LDMLMTSEERRKQLRDQYCFECDCFRC 268
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDKK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQ--SKLFVFRDT----HISTMYTNALWGTQPRREHLA 302
GL+ + L+ H+C PN SN +++ RD I+ Y + RR+ L
Sbjct: 210 GLFPICGLINHSCQPNCTWSNAGDGIAEVRALRDIKEGDEITLSYIDIDKERAERRKELR 269
Query: 303 ITKYFNCSCERCSDP 317
TK+F+C C RC+ P
Sbjct: 270 ETKHFDCQCGRCAAP 284
>gi|334183803|ref|NP_177172.6| zinc ion binding protein [Arabidopsis thaliana]
gi|332196904|gb|AEE35025.1| zinc ion binding protein [Arabidopsis thaliana]
Length = 374
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 25/50 (50%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C +C C VAYC HQ HW HK +C++L +++ LL +
Sbjct: 12 CVGPPTRRCGQCGAVAYCSVSHQISHWSYHKEECERLEEQMRRVDLLNDF 61
>gi|195996589|ref|XP_002108163.1| hypothetical protein TRIADDRAFT_52356 [Trichoplax adhaerens]
gi|190588939|gb|EDV28961.1| hypothetical protein TRIADDRAFT_52356 [Trichoplax adhaerens]
Length = 389
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/45 (48%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
C C E A KCSACK+V YC ++ QK HW HK CK E
Sbjct: 308 CYTCGEKRANMKCSACKKVKYCDQRCQKFHWFTHKKFCKYFAEEF 352
>gi|397623675|gb|EJK67098.1| hypothetical protein THAOC_11912, partial [Thalassiosira
oceanica]
Length = 330
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 33/70 (47%), Gaps = 6/70 (8%)
Query: 1 MASIIEIEDEELMRCAVCRETA------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
M I ++D CA C + + L C+AC+ V YCG QK H K HK CK+
Sbjct: 29 MNRCIAVDDGSTDVCANCGKGSEGGAIKLKNCTACRLVKYCGVDCQKAHRKQHKKACKQR 88
Query: 55 PYEIKSSPLL 64
E+K L
Sbjct: 89 AAELKDERLF 98
>gi|397571653|gb|EJK47895.1| hypothetical protein THAOC_33352, partial [Thalassiosira oceanica]
Length = 597
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 5/69 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 224 SCVPVVDDGAEVCANCGKQGSDAVKLKNCTACRLVKYCGVDCQRAHRKQHKQACKQRAAE 283
Query: 58 IKSSPLLGR 66
+K L G+
Sbjct: 284 LKDEQLYGQ 292
>gi|390599621|gb|EIN09017.1| hypothetical protein PUNSTDRAFT_102560 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 675
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 2/46 (4%)
Query: 12 LMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
L RC+ C AL KC C++ YC QK+HWK HK C P
Sbjct: 630 LYRCSFCGNPSAALKKCGRCEKTRYCDASCQKKHWKAHKTDCHTSP 675
>gi|389745836|gb|EIM87016.1| hypothetical protein STEHIDRAFT_95997 [Stereum hirsutum FP-91666
SS1]
Length = 286
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 24/43 (55%), Gaps = 1/43 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKLPY 56
C VC E A CS CK V YCGK Q + WK HK KC K Y
Sbjct: 244 CPVCMEEAELLCSKCKTVRYCGKVCQTKDWKAGHKTKCFKALY 286
>gi|357117231|ref|XP_003560376.1| PREDICTED: uncharacterized protein LOC100846465 [Brachypodium
distachyon]
Length = 1046
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
E CA C ++ +CS CK + YC + Q +HW+ HK KCK++
Sbjct: 101 ETAPCAECGRASIKRCSGCKRMRYCSGECQSKHWQSDHKFKCKQM 145
>gi|397581200|gb|EJK51849.1| hypothetical protein THAOC_28941, partial [Thalassiosira oceanica]
Length = 174
Score = 40.8 bits (94), Expect = 1.7, Method: Composition-based stats.
Identities = 23/61 (37%), Positives = 30/61 (49%), Gaps = 5/61 (8%)
Query: 11 ELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLG 65
E+ CA C + A L C+AC+ V YC QK H K HK CK+ E+K L
Sbjct: 103 EIEACANCGKEASDAVKLKNCTACRLVKYCSVDCQKTHRKKHKKACKQRAAELKDERLFS 162
Query: 66 R 66
+
Sbjct: 163 Q 163
>gi|299472616|emb|CBN78268.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1372
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 13/27 (48%), Positives = 22/27 (81%)
Query: 25 KCSACKEVAYCGKQHQKEHWKLHKPKC 51
+C+ C++V+YC K+ Q++HWK+HK C
Sbjct: 1345 QCTRCRKVSYCSKECQRQHWKIHKAAC 1371
>gi|157125230|ref|XP_001654253.1| hypothetical protein AaeL_AAEL010138 [Aedes aegypti]
gi|108873686|gb|EAT37911.1| AAEL010138-PA [Aedes aegypti]
Length = 445
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 11/52 (21%)
Query: 14 RCAVCRETALHK-----------CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC +C A + C C+ + YCG+QHQK+ WK HK C+ +
Sbjct: 13 RCNICFTRATDRRLALADQKVRFCKGCRLIGYCGEQHQKDDWKNHKDFCRAV 64
>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
Length = 455
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 100/291 (34%), Gaps = 62/291 (21%)
Query: 115 RCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYEAI 162
RC RCF +CSG H EC LKL TL +
Sbjct: 30 RCDRCFKLGKVLKCSGCLYVRYCNRACQQEAWPDHQAECEKLKLLPATL----------V 79
Query: 163 LPLRCLILQR---------------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKY 207
+P L+L R + K Y+ D+ H + E ++
Sbjct: 80 VPSAALMLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIRKDVKRIEHFHTLNVV 139
Query: 208 LRSNFLEKLPGDVLDDTSAKCLHWIC-GVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAK 266
L+ E P D + +C V ++ G+Y +++H+C PNA
Sbjct: 140 LQRLLDE--PAIPPRDELLRIFGKMCINSFNVCDDEMNSIGTGMYLGASILDHSCRPNAV 197
Query: 267 HSNM-MQSKLFVFRD--------THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDP 317
+ + Q +L + D + I Y + + + RRE L+ YF C C RC D
Sbjct: 198 ATFVGEQLQLRLLEDFAGPELDFSRIFISYIDLIDPSDTRREQLSERYYFRCECVRCRDE 257
Query: 318 TELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLN--ARD 366
E A C N D+ P+ D C +C A + ARD
Sbjct: 258 AE-RELMGAAACQNRKCDE------PI----RDGQTLCSACEAPFDQSARD 297
>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
Length = 716
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)
Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
R Q +YSV L H+C PN H+ + LF+ H++ L W +
Sbjct: 457 RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 515
Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
R++ L F C C CS+ +A +C+N
Sbjct: 516 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 550
>gi|225424795|ref|XP_002267535.1| PREDICTED: zinc finger MYND domain-containing protein 15-like
[Vitis vinifera]
Length = 369
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C A +C C VAYC HQ HW HK +C +L ++K +L +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEECGRLEQQMKLVDVLNDF 61
>gi|397646650|gb|EJK77370.1| hypothetical protein THAOC_00804 [Thalassiosira oceanica]
Length = 303
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC+ V YCG + Q+ H KLHK CK+ E+K L
Sbjct: 27 LKNCTACRLVKYCGVECQRAHRKLHKKACKQRVAELKDEQL 67
>gi|397573252|gb|EJK48616.1| hypothetical protein THAOC_32572 [Thalassiosira oceanica]
Length = 475
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C +T L C+AC+ V YCG Q+ H K HK CKK E+K L
Sbjct: 187 CANCGKTGSDTVKLKDCTACRLVKYCGVDCQRAHRKQHKKACKKRAAELKDERL 240
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 40.8 bits (94), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 59/269 (21%), Positives = 101/269 (37%), Gaps = 34/269 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A + L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVVALRPGELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLM--------RCSQCQ 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVIFKLMEETPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++ K + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNIKK------LTEDKKEGLRQLAMTFQHFMREEIQDVSQLPPSFDVFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
IC + ++ GLY + L+ H+C PN + N L RD ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERCS 315
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCLRCQ 267
>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
Length = 799
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)
Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
R Q +YSV L H+C PN H+ + LF+ H++ L W +
Sbjct: 483 RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 541
Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
R++ L F C C CS+ +A +C+N
Sbjct: 542 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 576
>gi|332028331|gb|EGI68378.1| Ankyrin repeat and MYND domain-containing protein 2 [Acromyrmex
echinatior]
Length = 407
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A KCS CK V YC ++ Q+ HW +HK C +L
Sbjct: 335 CVTCGEDKATKKCSKCKAVQYCDRECQRLHWFMHKKACARL 375
>gi|397647767|gb|EJK77844.1| hypothetical protein THAOC_00296 [Thalassiosira oceanica]
Length = 304
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 29 LKDCTACRLVKYCGVDCQRAHRKQHKKACKRRAAELKDEQLYGQ 72
>gi|393233651|gb|EJD41221.1| hypothetical protein AURDEDRAFT_138760 [Auricularia delicata
TFB-10046 SS5]
Length = 222
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 19/30 (63%)
Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHK 48
RE L KCS C+ YC +HQ+ HW LHK
Sbjct: 163 REENLKKCSKCETAKYCSTEHQRAHWSLHK 192
>gi|397601218|gb|EJK57846.1| hypothetical protein THAOC_22072 [Thalassiosira oceanica]
Length = 636
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 3 SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S I ++D CA C + L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 335 SCIAVDDGSAGVCANCGKHGSDTVKLKDCAACRLVKYCGVDCQRAHRKQHKKACKQRASE 394
Query: 58 IKSSPL 63
+K L
Sbjct: 395 LKDEKL 400
>gi|301106410|ref|XP_002902288.1| programmed cell death protein 2, putative [Phytophthora infestans
T30-4]
gi|262098908|gb|EEY56960.1| programmed cell death protein 2, putative [Phytophthora infestans
T30-4]
Length = 390
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKCKK 53
CA+C + A CSAC YC K HQK+HW HK C +
Sbjct: 148 CALCGQRATFTCSACHVAQYCSKAHQKDHWTAGGHKQTCAQ 188
>gi|389738208|gb|EIM79409.1| hypothetical protein STEHIDRAFT_126513 [Stereum hirsutum FP-91666
SS1]
Length = 262
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/33 (54%), Positives = 20/33 (60%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
E C ACKEV YC K+ QK HW LHK C+
Sbjct: 42 EEPYQACGACKEVIYCSKECQKAHWPLHKGPCR 74
>gi|296086488|emb|CBI32077.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C A +C C VAYC HQ HW HK +C +L ++K +L +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEECGRLEQQMKLVDVLNDF 61
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|323450761|gb|EGB06641.1| hypothetical protein AURANDRAFT_65492 [Aureococcus anophagefferens]
Length = 478
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 13 MRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
++CA C E + + +C+ CK +YC + QK+HW HKP C +P
Sbjct: 427 LKCAQCGEVSSVRVTVSRCARCKTASYCSRHCQKKHWLKHKPTCVPVP 474
>gi|299740056|ref|XP_001840441.2| hypothetical protein CC1G_12970 [Coprinopsis cinerea okayama7#130]
gi|298404065|gb|EAU81374.2| hypothetical protein CC1G_12970 [Coprinopsis cinerea okayama7#130]
Length = 788
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 5 IEIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+ ++ L RC+ C AL KCS C++ YC QK HW HK CK
Sbjct: 724 VRMDRVSLYRCSWCGNPSAALQKCSGCQKARYCDSTCQKSHWPDHKKPCK 773
>gi|307102719|gb|EFN50988.1| hypothetical protein CHLNCDRAFT_55450 [Chlorella variabilis]
gi|307105437|gb|EFN53686.1| hypothetical protein CHLNCDRAFT_53614 [Chlorella variabilis]
Length = 272
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 14 RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
+C+ C + L C+AC V+YC Q+ HWK HKP CK E++
Sbjct: 220 QCSGCGKHSIGLRLCAACHSVSYCSPACQRSHWKQHKPNCKVAQKEMRG 268
>gi|397578508|gb|EJK50901.1| hypothetical protein THAOC_29984 [Thalassiosira oceanica]
Length = 118
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C A L C+AC+ V YCG Q+ H K HK CKK E
Sbjct: 45 SCVPVVDDGAEVCANCGAMASDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKACKKRAAE 104
Query: 58 IKSSPL 63
+K L
Sbjct: 105 LKDEEL 110
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 82/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CDVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E E I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKELIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 16 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 67
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 68 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 125
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 126 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 180
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 181 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 240
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 241 LDMLMTSEERRKQLRDQYCFECDCFRC 267
>gi|299749521|ref|XP_001836166.2| hypothetical protein CC1G_10947 [Coprinopsis cinerea okayama7#130]
gi|298408476|gb|EAU85675.2| hypothetical protein CC1G_10947 [Coprinopsis cinerea okayama7#130]
Length = 450
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 1/40 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
C + AL +C C++V YC +HQK HW+ +HK +C +
Sbjct: 404 CGKPGDEALKQCGGCRQVLYCSPEHQKFHWRAMHKNQCSR 443
>gi|66357248|ref|XP_625802.1| SET domain protein [Cryptosporidium parvum Iowa II]
gi|46226909|gb|EAK87875.1| SET domain protein [Cryptosporidium parvum Iowa II]
Length = 583
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 42/192 (21%), Positives = 85/192 (44%), Gaps = 24/192 (12%)
Query: 250 LYSVICLMEHNCLPN------AKHSNMMQSK--LFVFRDTHISTMYTNALWGTQPRREHL 301
LY+VI +M HNC + ++ ++++K L V + IS + + L+ R L
Sbjct: 314 LYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEVGDEITISYISDDDLFKCSKTRREL 373
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT----CGS 357
F C CERC++PT+L +KC C + + ++++ T C
Sbjct: 374 LSNWLFYCQCERCNNPTDLS---RGLKC-------ASCGVGSMFFKEDENGVTRSSRCSV 423
Query: 358 CSARLNARDVHLVTSQLGEQVDKLVQENPN-VKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
C ++ + + T + V +L + + N V+ +E + + + +F H+ Y + L
Sbjct: 424 CRSQPDREIIDSYTDLESQYVARLEETDKNDVEDVEAVYQEAQKVF-TQHWIMYELDTML 482
Query: 417 IQLYGTQPGYAY 428
+ Y +++
Sbjct: 483 FEAYKESQQFSF 494
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 12 LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
L RCA C + T L C CK VAYCG+ Q H+K HK C+
Sbjct: 419 LARCANCGKASTQLSLCGRCKSVAYCGRPCQVAHYKGHKATCR 461
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Meleagris gallopavo]
Length = 427
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
R LH+CS CK YCGK QKE W HK +CK
Sbjct: 55 RNEHLHRCSQCKVAKYCGKSCQKEGWLDHKRECK 88
>gi|398394643|ref|XP_003850780.1| hypothetical protein MYCGRDRAFT_110262 [Zymoseptoria tritici
IPO323]
gi|339470659|gb|EGP85756.1| hypothetical protein MYCGRDRAFT_110262 [Zymoseptoria tritici
IPO323]
Length = 422
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+C+ C + A L +C+ CK +YC + Q HWK+HK C KL
Sbjct: 195 KCSQCDKPASSLKRCAKCKSTSYCDRACQTAHWKVHKKICAKL 237
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus
gallus]
Length = 428
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
R LH+CS CK YCGK QKE W HK +CK
Sbjct: 55 RNEHLHRCSQCKVAKYCGKSCQKEAWLDHKRECK 88
>gi|397570025|gb|EJK47111.1| hypothetical protein THAOC_34195, partial [Thalassiosira oceanica]
Length = 1544
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
L C+AC+ V YCG Q+ H KLHK CK+ E+K+
Sbjct: 807 LKNCTACRLVKYCGADCQRAHRKLHKKACKQRAAELKA 844
Score = 39.3 bits (90), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 23/42 (54%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
L C+AC+ V YC Q+ H KLHK CK+ E+K L
Sbjct: 1255 LKNCTACRLVKYCSADCQRAHRKLHKKACKQRAAELKDEQLF 1296
>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
Length = 508
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 6/88 (6%)
Query: 235 VIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHIS------TMYT 288
++++ +I + +Y + + HNC PN ++ + L VF I T Y
Sbjct: 331 LLDIGKFNINQLEGQIYHLYSFINHNCEPNIRYEIDSKLCLRVFARKPIQPGEELLTTYV 390
Query: 289 NALWGTQPRREHLAITKYFNCSCERCSD 316
N L G + RR L + F C C RC +
Sbjct: 391 NPLHGVKLRRRALKVNWGFLCQCARCEN 418
>gi|195493207|ref|XP_002094318.1| GE21757 [Drosophila yakuba]
gi|194180419|gb|EDW94030.1| GE21757 [Drosophila yakuba]
Length = 403
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
C+ C E KCS CK V YC ++ Q+ HW +HK C +L + ++ P
Sbjct: 333 CSTCGAEKPDKKCSKCKAVQYCDRECQRLHWFMHKKNCARLLAQSQTQP 381
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 14 RCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC C E + L +CS CK YCG Q+ W++HK +C++L
Sbjct: 50 RCDFCLEKLSDLQRCSRCKFARYCGASCQRAAWRIHKSECERL 92
>gi|397611228|gb|EJK61234.1| hypothetical protein THAOC_18316 [Thalassiosira oceanica]
Length = 282
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 47/105 (44%), Gaps = 9/105 (8%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR-YLQATLDLHPGDRIA 81
L C+AC+ V YCG Q+ H KLHK CK+ E+K L + +++ +D P +
Sbjct: 22 LKNCTACRFVKYCGVDCQRAHRKLHKKACKQRAAELKEEQLYSQGHVRPEVDFCPICTLP 81
Query: 82 RESPLIVGPKLAL----AEPICLGCHKPLNPNLADNARCPRCFWP 122
PL +G + IC GC D+ CP C P
Sbjct: 82 --IPLPMGHHSVFRACCMQKICNGCDIAAKKRGMDD--CPFCRTP 122
>gi|56755307|gb|AAW25833.1| SJCHGC05428 protein [Schistosoma japonicum]
Length = 131
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 27/48 (56%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
RC C E A +CS C++ YCG++ Q +HW HK C L + ++
Sbjct: 80 RCPTCGEVASKRCSRCRQEWYCGRECQVKHWPRHKKACDLLTEAVSTN 127
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 125 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 179
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 180 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 239
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 240 LDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|226505212|ref|NP_001151267.1| MYND finger family protein [Zea mays]
gi|195645400|gb|ACG42168.1| MYND finger family protein [Zea mays]
Length = 372
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
A +C C VAYC + HQ HW++HK +C++ +++ L ++
Sbjct: 17 VADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHIDALSQF 63
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC+ LK C+ + + + ++ + + +K
Sbjct: 67 IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++SK + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
IC + ++ GLY + L+ H+C PN ++++ + ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC+ LK C+ + + + ++ + + +K
Sbjct: 67 IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++SK + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
IC + ++ GLY + L+ H+C PN ++++ + ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266
>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 647
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 26/51 (50%)
Query: 4 IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+E +D EL + + +C CK +AYCG Q+ W HKP+C L
Sbjct: 100 FLEKQDFELDNPSTPSNRTIRRCGKCKVIAYCGSNCQQLDWASHKPECTAL 150
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 25/43 (58%), Gaps = 2/43 (4%)
Query: 12 LMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
L RCA C + T L C CK VAYCG+ Q H+K HK C+
Sbjct: 410 LARCANCGKASTQLSLCGRCKSVAYCGRPCQVAHYKGHKATCR 452
>gi|348673334|gb|EGZ13153.1| hypothetical protein PHYSODRAFT_510777 [Phytophthora sojae]
Length = 560
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 7 IEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
+E+ EL RCA C + A L +C+ CK YC Q Q+ HWK HK C
Sbjct: 319 MENPELRRCAKCDKRAAPGKELKRCTRCKCTFYCDAQCQRGHWKDHKVSC 368
>gi|336388071|gb|EGO29215.1| hypothetical protein SERLADRAFT_359379 [Serpula lacrymans var.
lacrymans S7.9]
Length = 709
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
EL RC+ C L KCS C + YC QK HW HK CK
Sbjct: 665 ELYRCSYCGNPSAVLRKCSGCAKARYCDNSCQKSHWSEHKIACK 708
>gi|405970379|gb|EKC35288.1| Ubiquitin carboxyl-terminal hydrolase 19 [Crassostrea gigas]
Length = 1310
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 6/66 (9%)
Query: 14 RCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
RCA CR L +C+ C +V YC QK HW +HK C P + SP +
Sbjct: 685 RCASCRRLCPDNCKLKRCTRCFKVGYCDHSCQKNHWIMHKNHCNMTPEPV-GSPFIVSIP 743
Query: 69 QATLDL 74
+ ++D+
Sbjct: 744 EYSVDV 749
>gi|389747838|gb|EIM89016.1| hypothetical protein STEHIDRAFT_53749 [Stereum hirsutum FP-91666 SS1]
Length = 1186
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 3/49 (6%)
Query: 7 IEDEELMRCAVCRETA---LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
I D E +C CR L +C C++V YC + QK +W+ HK +CK
Sbjct: 1137 IRDPEARKCFACRGKGNPKLMECKGCRKVRYCSVECQKFNWEEHKKRCK 1185
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 40/210 (19%), Positives = 79/210 (37%), Gaps = 22/210 (10%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQR 172
N C R WP H EC + E + + + ++ +
Sbjct: 75 NVDCQRGDWP-------------MHKLECVAMCTHGENWCPSETVRLVSRIIMKQRVTTE 121
Query: 173 RSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
R+P ++ LK+ E+H+ K + + D + + Y S + LP D + +
Sbjct: 122 RTPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD--QELTELFAQ 179
Query: 231 WICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTHIST 285
C + ++ ++ + LM H+C PN + +++ + I
Sbjct: 180 VNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQEINPGEEIFN 239
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
Y + L+ T+ R+E L + +F C C C+
Sbjct: 240 SYIDLLYPTEDRKERLLDSYFFGCQCTECT 269
>gi|350417592|ref|XP_003491497.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Bombus impatiens]
Length = 407
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
C C E A KCS CK V YC ++ Q+ HW +HK C +L I S
Sbjct: 335 CITCGEDKANKKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNITPS 382
>gi|332020173|gb|EGI60617.1| Zinc finger MYND domain-containing protein 10 [Acromyrmex
echinatior]
Length = 437
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 28/44 (63%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
E +C++C+E + +CS CKEV YC ++ Q + W+ HK C +
Sbjct: 389 ETKKCSLCQEPSKKRCSRCKEVWYCSRECQVKDWENHKNICNTI 432
>gi|198465840|ref|XP_001353783.2| GA20758 [Drosophila pseudoobscura pseudoobscura]
gi|198150338|gb|EAL29517.2| GA20758 [Drosophila pseudoobscura pseudoobscura]
Length = 404
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)
Query: 10 EELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+E C+ C +E KCS CK V YC ++ Q+ HW +HK C +L
Sbjct: 327 DETSYCSTCGQENPDKKCSKCKAVQYCDRECQRLHWFMHKKSCARL 372
>gi|397631988|gb|EJK70365.1| hypothetical protein THAOC_08280, partial [Thalassiosira
oceanica]
Length = 217
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L R L+
Sbjct: 22 LKNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSRGLE 68
>gi|354801524|gb|AER39528.1| egl nine 1-like protein [Carassius carassius]
Length = 359
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 1/49 (2%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
C +C + L KC C+ YC K+HQ++ WK HK CK+ + ++ P
Sbjct: 18 CELCGKMENLMKCGRCRSSFYCSKEHQRQDWKKHKRMCKEADKQQQTPP 66
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 56/268 (20%), Positives = 104/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC+ LK C+ + + + ++ + + +K
Sbjct: 67 IAKYCSAKCQKKA-WPDHRRECSCLK-SCKPRYPPDSVRLLGRVIVKLMDEKPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++SK + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSN-----MMQSKLFVFRDTHISTM 286
IC + ++ GLY + L+ H+C PN ++++ + ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCIRC 266
>gi|290985293|ref|XP_002675360.1| predicted protein [Naegleria gruberi]
gi|284088956|gb|EFC42616.1| predicted protein [Naegleria gruberi]
Length = 618
Score = 40.4 bits (93), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 31/54 (57%), Gaps = 6/54 (11%)
Query: 6 EIEDEELMRCAVCRET------ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
E E+E+ CAVC T L C CK+V YC ++ Q++ WK HK +C+K
Sbjct: 565 EAENEKPKVCAVCGATQAEGSKKLLVCGKCKKVHYCSRECQEKDWKTHKKECQK 618
>gi|397624983|gb|EJK67613.1| hypothetical protein THAOC_11329 [Thalassiosira oceanica]
Length = 307
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 33/67 (49%), Gaps = 5/67 (7%)
Query: 2 ASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
+S + I DE CA C + L C+AC+ V YCG QK H K HK C+K
Sbjct: 52 SSCVPIVDEGDEVCANCGKHGSDIVKLKHCNACRIVKYCGVDCQKAHRKQHKKACRKRAA 111
Query: 57 EIKSSPL 63
E+K L
Sbjct: 112 ELKDEQL 118
>gi|397617240|gb|EJK64346.1| hypothetical protein THAOC_14932 [Thalassiosira oceanica]
Length = 289
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 26/49 (53%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L KC+AC V YCG QK H K HK CKK E+K L + L+
Sbjct: 15 VTLKKCNACHLVKYCGVDCQKAHRKEHKKVCKKRAAELKEERLYSQGLE 63
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 70/164 (42%), Gaps = 21/164 (12%)
Query: 171 QRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLH 230
+R + + +K+ E+H+SK E E I+S + L + + L T L
Sbjct: 119 ERTPSETFLSVKEFESHLSKLD--NEKKELIESDIAALHRFYSKNL-----HYTDNAALV 171
Query: 231 WICGVIEVNGVDI-----GRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH-- 282
++ + NG I ++ + LM H+C PN + +++ ++ H
Sbjct: 172 FLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTFKGTVAEIRAVQEIHAG 231
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS----DPTEL 320
+ T Y + L+ T+ R + L + +F C C CS DP +L
Sbjct: 232 DEVFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECSTKQKDPAKL 275
>gi|336375120|gb|EGO03456.1| hypothetical protein SERLA73DRAFT_83458 [Serpula lacrymans var.
lacrymans S7.3]
Length = 712
Score = 40.4 bits (93), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
EL RC+ C L KCS C + YC QK HW HK CK
Sbjct: 668 ELYRCSYCGNPSAVLRKCSGCAKARYCDSSCQKSHWSDHKAVCK 711
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/159 (20%), Positives = 65/159 (40%), Gaps = 9/159 (5%)
Query: 164 PLRCLILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKLPGDVL 221
P + R+P ++ LK+ E+H+ K + + D + + Y S + LP D
Sbjct: 102 PSETRVTTERTPSERLLLLKEFESHLDKMDSEKDEMNQADIAALHYFYSKHISDLPDD-- 159
Query: 222 DDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLF 276
+ + C + ++ ++ + LM H+C PN + +++
Sbjct: 160 QELTELFAQVNCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAEVRAVQE 219
Query: 277 VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ I Y + L+ T+ R+E L + +F C C C+
Sbjct: 220 INPGEEIFNSYIDLLYPTEDRKERLLDSYFFGCQCTECT 258
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L +F K G +D+
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174
Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
+ C + ++ ++ + LM H+C PN + + L R
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232
Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|407915489|gb|EKG09079.1| Zinc finger MYND-type protein [Macrophomina phaseolina MS6]
Length = 309
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 35/64 (54%), Gaps = 10/64 (15%)
Query: 7 IEDEELMR--CAVCR--------ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY 56
I+DE + CA CR + +L++CS C V+YC K+ Q HW HK CK+ Y
Sbjct: 3 IDDEWAVESGCAGCRRQQSDLAGDQSLNRCSRCHCVSYCSKECQTAHWTTHKSNCKRPNY 62
Query: 57 EIKS 60
I++
Sbjct: 63 VIEA 66
>gi|397627358|gb|EJK68445.1| hypothetical protein THAOC_10375, partial [Thalassiosira oceanica]
Length = 352
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+ L
Sbjct: 75 LKSCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQGL 120
>gi|336370221|gb|EGN98562.1| hypothetical protein SERLA73DRAFT_183631 [Serpula lacrymans var.
lacrymans S7.3]
Length = 432
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 10 EELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
++L++CAVC ++ L CS+C E YC + QK W HK C K
Sbjct: 15 KDLVKCAVCSKKKDLRLCSSCGEQTYCSAECQKNDWPTHKTSCGK 59
>gi|193785783|dbj|BAG51218.1| unnamed protein product [Homo sapiens]
Length = 441
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E A +CS CK V YC + QK HW HK CK L
Sbjct: 320 CTTCGEKGASKRCSVCKMVIYCDQTRQKTHWFTHKKICKNL 360
>gi|449545771|gb|EMD36741.1| hypothetical protein CERSUDRAFT_50971 [Ceriporiopsis subvermispora
B]
Length = 425
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C +C +CS C+ V+YCG++ Q+ W HK C+ L
Sbjct: 247 CVLCGSKTASRCSQCQSVSYCGQECQRADWPSHKQTCRSL 286
>gi|397624427|gb|EJK67380.1| hypothetical protein THAOC_11597 [Thalassiosira oceanica]
Length = 305
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 30 LRDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 73
>gi|348507084|ref|XP_003441087.1| PREDICTED: egl nine homolog 1-like [Oreochromis niloticus]
Length = 359
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 26/46 (56%), Gaps = 1/46 (2%)
Query: 9 DEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
D + C +C R L KC C+ YC K+HQK+ WK HK CK+
Sbjct: 9 DRDRQYCELCGRFDNLLKCGRCRSSFYCSKEHQKQDWKKHKLICKE 54
>gi|324508446|gb|ADY43564.1| SET domain-containing protein 14 [Ascaris suum]
Length = 421
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
T L KC+ACK + YC + QK W +HK +CK
Sbjct: 35 TKLEKCAACKSIFYCNRSCQKADWPMHKVECK 66
>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe]
Length = 483
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 2 ASIIEIEDEELMR-CAVCRETAL--HKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
I+ ++ L R C+ C E + +C+ACK + YC K QK W HK +CK L
Sbjct: 35 VDILSLDSANLTRTCSTCTEEKVKTQRCAACKIIHYCSKGCQKADWPFHKLECKAL 90
>gi|340369460|ref|XP_003383266.1| PREDICTED: zinc finger MYND domain-containing protein 10-like
[Amphimedon queenslandica]
Length = 442
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
+CA+C A +CS C YC ++ Q ++WK HKP C L Y+ KSS
Sbjct: 396 KCALCGSEATKRCSRCCSEWYCSRECQVKNWKKHKPSCDIL-YQSKSS 442
>gi|225709842|gb|ACO10767.1| Programmed cell death protein 2 [Caligus rogercresseyi]
Length = 374
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 33/63 (52%), Gaps = 6/63 (9%)
Query: 8 EDEELMRCAVCRETALHK-CSACKEVAYCGKQHQKEHWK--LHKPKCKKLPYEIKSSPLL 64
++ L C +C A +K CS CKE YC K+HQ WK HK +C+ KS PL+
Sbjct: 133 QERHLQLCRLCGAGASNKTCSGCKEAKYCSKEHQTADWKSGSHKAECQS---PSKSVPLI 189
Query: 65 GRY 67
Y
Sbjct: 190 SLY 192
>gi|348680577|gb|EGZ20393.1| hypothetical protein PHYSODRAFT_313078 [Phytophthora sojae]
Length = 250
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 27/43 (62%), Gaps = 3/43 (6%)
Query: 15 CAVCRET--ALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKL 54
C+ C+ T AL +CS CK AYC K Q++HWK H+ C KL
Sbjct: 5 CSACQSTDKALSRCSQCKWTAYCSKTCQRDHWKNGHRLTCGKL 47
>gi|195622642|gb|ACG33151.1| MYND finger family protein [Zea mays]
gi|414886684|tpg|DAA62698.1| TPA: MYND finger family protein [Zea mays]
Length = 372
Score = 40.4 bits (93), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
A +C C VAYC + HQ HW++HK +C++ +++ L ++
Sbjct: 17 VADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHINALSQF 63
>gi|397589365|gb|EJK54635.1| hypothetical protein THAOC_25719 [Thalassiosira oceanica]
Length = 147
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D CA C TA L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 2 SCVSVVDGSNEVCANCGTTASDIVKLKNCTACRLVKYCGVDCQRAHRKRHKKACKQRAAE 61
Query: 58 IKSSPLLGRYLQ 69
++ L + L+
Sbjct: 62 LEDERLYSQGLK 73
>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 635
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 56/131 (42%), Gaps = 8/131 (6%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
I S GR+L AT + PGD + E+P + + CH L+ NL +
Sbjct: 246 ISFSEKYGRHLVATKEFKPGDVVTIENPYAY---VIYTQRYYTHCHHCLSRSYNLIPCSH 302
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
CP + CS +C L+ H EC I+ L L D +R LI+
Sbjct: 303 CPVAQY--CSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTKIIRFLIVITSKG 360
Query: 176 KKYQELK-DME 185
KK+ EL+ DME
Sbjct: 361 KKFDELRVDME 371
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)
Query: 65 GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
GR L+A H GD + A L VG + E C L+ RC + F
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72
Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEA-ILPLRCLILQRRSPKKYQ 179
+ C C D H EC+ + + E + A IL + + +R +K
Sbjct: 73 Y--CDVECQK-EDWPLHKLECSSMVVFGENWNPSETVRLTARILAKQKMHPERTPSEKLL 129
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
+++ E+H+ K E + I S + L + + L + D +S L C
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKHL--EFPDHSSLVVLFAQVNCNGFT 185
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
+ ++ ++ + LM H+C PN + +++ ++ H + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245
Query: 293 GTQPRREHLAITKYFNCSCERCS 315
T+ R + L + +F C C C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268
Score = 38.9 bits (89), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C R+ L KC CK+ YC + QKE W LHK +C +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 104/283 (36%), Gaps = 31/283 (10%)
Query: 46 LHKPKCKKLPYEIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKP 105
+ PK +K SS G L+A + L PG+ + R PL +C+ C
Sbjct: 1 MEPPKVEKF-----SSANRGNGLRALVQLRPGELLFRSDPLAYTVCKGSRGVVCVRCIIG 55
Query: 106 LNPNLADNARCPRC-FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP 164
L C RC C A+C + H EC LK C+ + + +
Sbjct: 56 RKYLLY----CSRCRLLAQCGAKCRKKA-WQDHKRECKCLK-SCKPRYPPDSVRLLGRVV 109
Query: 165 LRCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD-- 222
+ + +K D+E++++K + E+ ++ L F + ++ D
Sbjct: 110 FKLMEETPSESEKLYSFYDLESNINK------LTEDKKEGLRQLVMTFQHFMREEIQDAS 163
Query: 223 ------DTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKL 275
D IC V ++ GLY + L+ H+C PN + N L
Sbjct: 164 QLPPSFDIFEAFAKVICNSFTVCNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLL 223
Query: 276 FVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
RD ++ Y + L ++ RR+ L F C C RC
Sbjct: 224 RAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC 266
>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
Length = 554
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/136 (22%), Positives = 56/136 (41%), Gaps = 22/136 (16%)
Query: 249 GLYSVICLM-EHNCLPNAKHSNMMQSKLF------VFRDTHISTMYTNALWGTQPRREHL 301
GL+ + L H C PN + +L V +D + Y + RR+ L
Sbjct: 256 GLFPLGALFFNHGCNPNTAFVGLPNGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQEL 315
Query: 302 AITKYFNCSCERCSDPTE--LGTYFSAMKCLNEHKDQGDCWILPVNPLD----------N 349
TK+F C C+RC+ P E + + + C ++ + D +++P +D
Sbjct: 316 LTTKHFWCKCKRCASPLEKSIDRFLQGVVC---NQCEKDVYMIPATNIDLLMKGERSLMI 372
Query: 350 DSDWTCGSCSARLNAR 365
++ + C SC L +
Sbjct: 373 ENTFKCASCGHELEGQ 388
>gi|167522379|ref|XP_001745527.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775876|gb|EDQ89498.1| predicted protein [Monosiga brevicollis MX1]
Length = 386
Score = 40.4 bits (93), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
+C VC A+++CS C+ YC +Q Q EHW HK C
Sbjct: 338 KCVVCGADAVNRCSRCQSEWYCRRQCQVEHWPKHKAAC 375
>gi|348681660|gb|EGZ21476.1| hypothetical protein PHYSODRAFT_557520 [Phytophthora sojae]
Length = 341
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/38 (47%), Positives = 23/38 (60%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKS 60
L KC+ C++ YC + QK WKLHK C K P EI +
Sbjct: 297 LSKCARCRDAVYCSRDCQKAAWKLHKRLCGKTPEEISA 334
>gi|328872636|gb|EGG21003.1| hypothetical protein DFA_00872 [Dictyostelium fasciculatum]
Length = 596
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
RC C + T L KC CK+V YC K+ Q ++W H+ C
Sbjct: 554 RCQTCFKSTNLKKCGRCKQVFYCSKECQIKNWAFHQSIC 592
>gi|397642774|gb|EJK75446.1| hypothetical protein THAOC_02829, partial [Thalassiosira oceanica]
Length = 365
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
CA C +T L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 90 CANCGKTGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHKKACKQRVAELKDEQLYGQ 146
>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
terrestris]
Length = 439
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 71/329 (21%), Positives = 119/329 (36%), Gaps = 64/329 (19%)
Query: 113 NARCPRCFWPACSARCSGLSDAH------------THAPECAILKLGCETLLAYNDYKYE 160
N RC CF RCS + H+ ECA LK ++ +
Sbjct: 30 NVRCDYCFKSGKLFRCSACQYVYYCNQSCQQMSWPMHSKECARLK-------KFSPWGIS 82
Query: 161 AILPLRCLILQR-----------RSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLR 209
+ L I+ + + Y++ KD+ +H S+ + E + L
Sbjct: 83 NVARLMARIIIKLNQGGDEEKGYYNETNYRKFKDLMSHCSEIKKDEKKMEHFVCLCNVLH 142
Query: 210 SNFLEKLPGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQ----GLYSVICLMEHNCLPNA 265
FLE +P S L I G I +N I G+Y +++H+C PNA
Sbjct: 143 -KFLEDMP-----IPSTAELLGIYGRITINSFSIFNLDMNIGVGIYLGPSILDHSCKPNA 196
Query: 266 KHS------NMMQSKLFVFRD-THISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPT 318
+ N+ + D + I Y + + RR L + YF C CE+C P
Sbjct: 197 VATFEGTTINVKAIEDLPSLDLSQIRIPYIDVIKTAGDRRAELQSSYYFWCDCEKCEKPE 256
Query: 319 ELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQV 378
+ A C N+ L P D ++D C C+ + + + ++ E
Sbjct: 257 PMA---EAAACPNK---------LCTYPCDPNAD-LCEKCNTKF-PENFKEIFDEISEFT 302
Query: 379 DKLVQENPNVKSLEEM---LTKLEAMFHP 404
++ N+ L+ L++ E + HP
Sbjct: 303 AYHLENMKNIAYLDVSKMCLSRQEGILHP 331
>gi|397607285|gb|EJK59631.1| hypothetical protein THAOC_20114, partial [Thalassiosira oceanica]
Length = 340
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 5/74 (6%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M + +E + CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 62 MIKCVPVEGDGDDACANCGKLGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHKKACKQRV 121
Query: 56 YEIKSSPLLGRYLQ 69
E+K L + L+
Sbjct: 122 AEVKDEQLYSQGLE 135
>gi|397594234|gb|EJK56180.1| hypothetical protein THAOC_23985 [Thalassiosira oceanica]
Length = 303
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 29 LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 72
>gi|397565442|gb|EJK44625.1| hypothetical protein THAOC_36821, partial [Thalassiosira oceanica]
Length = 447
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 70/174 (40%), Gaps = 39/174 (22%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L KC+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 148 SCVPVTDDGDEACANCGKHGNDTIKLKKCTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 207
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPIC-LGCHKPLNPNLADNARC 116
+K L Y Q L GD PIC L H P++ + N C
Sbjct: 208 LKDEEL---YSQGHERLE-GDFC----------------PICTLPVHIPIHEHSGFNTCC 247
Query: 117 PRCFWPAC--SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
+ C +A+ G++D CA C T L ND A++ R L
Sbjct: 248 MKQICHGCNLAAQKRGMND-------CAF----CRTPLPKNDADGMAMVQSRVL 290
>gi|357484667|ref|XP_003612621.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
gi|355513956|gb|AES95579.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
Length = 384
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 110/291 (37%), Gaps = 51/291 (17%)
Query: 62 PLLGRYLQATLDLHPGDRIARESPLIVGPKLAL----AEPICLGCHKPLNPNLADNARCP 117
P GR L AT DL G I ESPL++ L P C C + L P + +CP
Sbjct: 43 PNKGRGLIATQDLKAGQIILTESPLLLYSATPLFSHTPSPYCHNCFRTLPP--SQTFQCP 100
Query: 118 RCF-WPACSARCSGLSDAHTHAPECAILKLGCETLLAYND-----------YKYEAILPL 165
C + CS +C ++ +H+ C+TL + + +A L +
Sbjct: 101 SCSNYLFCSQKCLSIALNSSHS------SWTCQTLSHLQNPTSPLSEKPCELQVQARLIV 154
Query: 166 RCLILQRRSPKKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRS---NFLEKLPGDVLD 222
L +P K Q L + + + + + + + S++ S NF +L ++
Sbjct: 155 AAYNLAIHTPSKLQTLLSLHGNPNDQDSIVDNAKFLHSLISPFCSPHMNFSAELAAKII- 213
Query: 223 DTSAKCLHWICGV--IEVNGVDIGRYTQGLYSVICLMEHNCLPNA-------------KH 267
+ + L+ C + G G+Y H C+PNA +H
Sbjct: 214 --AKERLNSFCLMEPYSQKGPQRSIKAYGIYQKATFFNHGCIPNACRFDYVESGEPGDEH 271
Query: 268 SNMMQSKLF----VFRDTHISTMYTNALWGTQPRREHLAITKYFNCSCERC 314
+ + +L V + IS N + T R++ L F C C+RC
Sbjct: 272 NTDIVIRLIKDVGVGSEICISYFRINKDYST--RKKILMEDYGFTCECDRC 320
>gi|195578359|ref|XP_002079033.1| GD23738 [Drosophila simulans]
gi|194191042|gb|EDX04618.1| GD23738 [Drosophila simulans]
Length = 161
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 53/119 (44%), Gaps = 9/119 (7%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCP 117
++ SP+ GRYL A + E P VGPK +CLGC++P +P+ + C
Sbjct: 14 VQWSPVCGRYLVAKGATRGHGLLIEELPFAVGPKCN-GPVVCLGCYEP-DPD-PEEELCS 70
Query: 118 RCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDY-----KYEAILPLRCLILQ 171
C WP C A C+ +D EC LK + + I+PLR L+ +
Sbjct: 71 ECGWPLC-AECAQQADNAHFQLECRELKDARARFFRLPSGSSHCPQLDCIMPLRVLLAK 128
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 60/275 (21%), Positives = 105/275 (38%), Gaps = 40/275 (14%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + ++ + + +K
Sbjct: 67 IAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLMDGKPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++SK + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
IC + ++ GLY + L+ H+C PN S + + R ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNC--SIVFNGPHLLLRAVREIEAGEELT 236
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERCSDPTE 319
Y + L ++ RR+ L F C C RC PT+
Sbjct: 237 ICYLDMLMTSEERRKQLRDQYCFECDCIRC--PTQ 269
>gi|397610576|gb|EJK60907.1| hypothetical protein THAOC_18674 [Thalassiosira oceanica]
Length = 325
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG QK H KLH+ CK+ E+K L + L+
Sbjct: 103 LKDCTACRLVKYCGVDCQKAHRKLHRKACKQRAAELKDEKLYSQGLE 149
>gi|167533770|ref|XP_001748564.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773083|gb|EDQ86728.1| predicted protein [Monosiga brevicollis MX1]
Length = 500
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 24/39 (61%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C+VC + A + CS C + YCG+ QK HW +H+ C
Sbjct: 58 QCSVCMKPASYLCSKCCQAWYCGRDCQKNHWSVHRQDCS 96
>gi|66810001|ref|XP_638724.1| hypothetical protein DDB_G0284069 [Dictyostelium discoideum AX4]
gi|60467344|gb|EAL65375.1| hypothetical protein DDB_G0284069 [Dictyostelium discoideum AX4]
Length = 426
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
C C A CS CK+V YCGK+HQ+ W+L H +CK L
Sbjct: 133 CEYCGCFAKSNCSGCKKVHYCGKEHQQLDWQLGHSEQCKLL 173
>gi|449473708|ref|XP_002189720.2| PREDICTED: zinc finger MYND domain-containing protein 10
[Taeniopygia guttata]
Length = 452
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 22/38 (57%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
RC VC A +CS C+ YC + Q +HW+ HKP C
Sbjct: 402 RCRVCGVEATKRCSRCRNEWYCTRACQVQHWQKHKPAC 439
>gi|397614171|gb|EJK62639.1| hypothetical protein THAOC_16738 [Thalassiosira oceanica]
Length = 302
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 39/84 (46%), Gaps = 8/84 (9%)
Query: 1 MASIIEIEDEELMRCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
M+ I ++D CA C + + L C+AC+ V YC QK H K HK CK+
Sbjct: 1 MSRCIAVDDGSAGVCANCGKGSDAEGTVKLKNCTACRLVKYCSVDCQKAHRKQHKKACKQ 60
Query: 54 LPYEIKSSPLLGRYLQ-ATLDLHP 76
E+K L + L+ A D P
Sbjct: 61 RAAELKDEQLYSQGLERAAGDFCP 84
>gi|397591586|gb|EJK55412.1| hypothetical protein THAOC_24860 [Thalassiosira oceanica]
Length = 213
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 26/44 (59%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H +LHK CK+ E+K L G+
Sbjct: 139 LKNCTACRLVKYCGVDCQRAHRQLHKKACKQRVAELKDEELYGQ 182
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 101/268 (37%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVTPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CS++C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSSKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMEGSPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++++K + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNINK------LTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STM 286
IC + ++ GLY I L+ H+C PN + N L RD + +
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSVVFNGPHLLLRAVRDVEVGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|403340343|gb|EJY69455.1| MYND finger family protein [Oxytricha trifallax]
Length = 442
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 22/38 (57%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
RC C + A +CS CK YC ++ Q + WK HKP C
Sbjct: 362 RCNECGKNATQRCSRCKNQWYCSRECQLKQWKAHKPLC 399
>gi|405971699|gb|EKC36522.1| Egl nine-like protein 1 [Crassostrea gigas]
Length = 420
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 13 MRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
+RC VC + L +C+ CK V YC +QHQ + W HK C
Sbjct: 11 LRCVVCDSKENLKRCTRCKGVVYCCRQHQIQDWPSHKVAC 50
>gi|219123565|ref|XP_002182093.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406694|gb|EEC46633.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1183
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 28/55 (50%), Gaps = 5/55 (9%)
Query: 15 CAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPY----EIKSSPLL 64
C+ C R A +C C V YC + Q+ HW+ HK CK+L E SP+L
Sbjct: 497 CSACYRAGATMRCKRCMSVYYCQRSCQESHWQFHKAPCKRLAATNTEETVGSPVL 551
>gi|397644789|gb|EJK76551.1| hypothetical protein THAOC_01680 [Thalassiosira oceanica]
Length = 309
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 28/54 (51%), Gaps = 5/54 (9%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C +T L C+AC+ V YCG Q+ H K HK CK+ E+K L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQL 68
>gi|397634136|gb|EJK71299.1| hypothetical protein THAOC_07280 [Thalassiosira oceanica]
Length = 311
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 23/41 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC V YCG QK H KLHK CKK E+K L
Sbjct: 29 LKNCTACFLVKYCGVDCQKSHRKLHKRACKKRAAELKDEEL 69
>gi|406861729|gb|EKD14782.1| ectomycorrhiza-upregulated zf-MYND domain-containing protein
[Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 472
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C +C+ + +CSACK YC + QK+ W LHKP CK L
Sbjct: 8 CTICQNPDSSKRCSACKVTRYCSHECQKKGWPLHKPVCKFL 48
>gi|348525586|ref|XP_003450303.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Oreochromis niloticus]
Length = 426
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E K CS CK V YC + QK HW HK CKKL
Sbjct: 320 CTTCGEKGAEKRCSICKMVIYCDQACQKMHWFTHKKICKKL 360
>gi|291222899|ref|XP_002731455.1| PREDICTED: EGL nine homolog 1-like [Saccoglossus kowalevskii]
Length = 440
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCK-KLPYEIKSS 61
CS C++ YC ++HQK HWK HK +C K P+ + +S
Sbjct: 27 CSRCRDTWYCCREHQKTHWKTHKKECGPKNPFILGTS 63
>gi|440793753|gb|ELR14928.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 252
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 9/54 (16%)
Query: 7 IEDEELMRCAVCR--------ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+ +E +RCA + ET ++CS CK+V YCG++ Q+ W HKP CK
Sbjct: 200 VVEERAIRCANGKCGRSEEDGET-FNRCSRCKKVYYCGRECQRADWPKHKPTCK 252
>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
Length = 386
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 45/210 (21%), Positives = 81/210 (38%), Gaps = 15/210 (7%)
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
RC + F+ C+ C D H EC+ + + E + A + + I R
Sbjct: 19 GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHPER 75
Query: 174 SP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
+P +K +K+ E+H+ K E + I S V L + + L D
Sbjct: 76 TPSEKLLAVKEFESHLDKLD--NEKKDLIQSDVAALHHFYSKHLEFPDSDSLVVLFAQVN 133
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IST 285
C + ++ ++ + LM H+C PN + + L R + T
Sbjct: 134 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPGEEVFT 191
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
Y + L+ T+ R + L + +F C C+ C+
Sbjct: 192 SYIDLLYPTEDRNDRLRDSYFFTCQCQECT 221
>gi|193656959|ref|XP_001949267.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
isoform 1 [Acyrthosiphon pisum]
gi|328709997|ref|XP_003244132.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
isoform 2 [Acyrthosiphon pisum]
Length = 384
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 15 CAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
C C E A KC+ CK V YC ++ Q+ HW +HK +C +
Sbjct: 318 CGTCAENNATKKCAKCKAVQYCDRECQRIHWFVHKKECNR 357
>gi|321474337|gb|EFX85302.1| hypothetical protein DAPPUDRAFT_314029 [Daphnia pulex]
Length = 420
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 2/42 (4%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+CA C++T+ + +CS C+ + YC K+ Q HW HK C+K
Sbjct: 378 QCAKCKKTSQNVKRCSRCRSIFYCSKECQFSHWSNHKTVCQK 419
>gi|398409022|ref|XP_003855976.1| hypothetical protein MYCGRDRAFT_29456, partial [Zymoseptoria
tritici IPO323]
gi|339475861|gb|EGP90952.1| hypothetical protein MYCGRDRAFT_29456 [Zymoseptoria tritici
IPO323]
Length = 61
Score = 40.0 bits (92), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 26/50 (52%), Gaps = 2/50 (4%)
Query: 14 RCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
+C C + A L C+ CK YC ++ Q HW HK CK+L Y K S
Sbjct: 9 QCLSCSQIAAGLQTCARCKAAKYCSRECQAAHWTAHKSACKRLNYVFKVS 58
>gi|397643623|gb|EJK75984.1| hypothetical protein THAOC_02275 [Thalassiosira oceanica]
Length = 388
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 26/47 (55%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YC Q+ H K HK CKK E+K L G+ L+
Sbjct: 123 LKDCTACRIVKYCSVDCQRAHRKQHKKACKKRAAELKDEKLYGQGLE 169
>gi|397641165|gb|EJK74510.1| hypothetical protein THAOC_03807, partial [Thalassiosira oceanica]
Length = 560
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 8 EDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSP 62
ED + CA C + L KC+AC+ V YC QK H K HK CK+ E+K
Sbjct: 148 EDACIETCANCGKPESDTVTLKKCNACRLVKYCSVDCQKAHRKQHKKACKQRAEELKDEE 207
Query: 63 LLGR 66
L R
Sbjct: 208 LFSR 211
>gi|397640514|gb|EJK74163.1| hypothetical protein THAOC_04176 [Thalassiosira oceanica]
Length = 385
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 9/81 (11%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D++ CA C + L C+AC+ V YCG Q+ H K HK CK+ E
Sbjct: 2 SCVPVMDDDDEVCANCGKQGSDTIKLKICTACRLVKYCGVDCQRAHRKQHKKACKQRAAE 61
Query: 58 IKSSPLLGRYLQATLDLHPGD 78
+K L Y Q LD GD
Sbjct: 62 LKDEQL---YSQG-LDRPEGD 78
>gi|397579340|gb|EJK51173.1| hypothetical protein THAOC_29678, partial [Thalassiosira
oceanica]
Length = 167
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 5/74 (6%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M + +E + CA C + L C+AC+ V YCG Q+ H KLHK CK+
Sbjct: 2 MMRCVPVEGDGDEACANCGKLGSDAVKLKNCTACRLVKYCGVDCQRAHRKLHKKVCKQRV 61
Query: 56 YEIKSSPLLGRYLQ 69
E++ L + L+
Sbjct: 62 AELRDEQLYSQGLK 75
>gi|380016914|ref|XP_003692413.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
domain-containing protein 2-like [Apis florea]
Length = 407
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
C C E A KCS CK V YC ++ Q+ HW +HK C +L I
Sbjct: 335 CITCGEDKANKKCSKCKAVQYCDRECQRLHWFMHKKACTRLGQNI 379
>gi|401401559|ref|XP_003881041.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115453|emb|CBZ51008.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 946
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 250 LYSVICLMEHNCLPNA------KHSNMMQSKLFVFRDTHISTMY---TNALWGTQPRREH 300
+Y + H+CLPNA H+ +++++ + ++ Y + + T RR
Sbjct: 168 IYFMPSFSSHSCLPNALWFTDEDHTFVLRTRADIAAGDEVTLTYLSEEDLMRPTLHRRRV 227
Query: 301 LAITKYFNCSCERCSDPTELGTYFSAMKC 329
L+ TK F C+CERC+ P + F C
Sbjct: 228 LSETKDFICACERCAAPVDFSRGFRCPSC 256
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 25 KCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+C+ACK+V YC K QK+ W +HK +CK L
Sbjct: 60 RCAACKQVHYCSKICQKQDWSMHKLECKAL 89
>gi|406864160|gb|EKD17206.1| hypothetical protein MBM_04783 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 450
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 59/129 (45%), Gaps = 21/129 (16%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWK-LHKPKCKKL--------PYEIKSSPLLGRYLQATLD 73
L+KCS CK + YC ++ QK+ W HK CK L P E ++ +Y Q L
Sbjct: 30 LNKCSRCKRIRYCSQKCQKKDWNDGHKKLCKVLLGLSDFYPPVEAQT---WQQYRQQVLS 86
Query: 74 LHPGDRI--ARESPLIVG-PKLALAEPICLGCHK-PLNPNLADNAR---CPRCFWPACSA 126
R+ A+ P+ + ++ L + C C++ PL LA + + CP CF A
Sbjct: 87 SVAALRVCGAQVKPITLNDERILLYQAYCSNCYRTPL--QLAPDTKLKLCPNCFIAATCT 144
Query: 127 RCSGLSDAH 135
CS + H
Sbjct: 145 SCSSTPETH 153
>gi|397639864|gb|EJK73806.1| hypothetical protein THAOC_04549 [Thalassiosira oceanica]
Length = 175
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + ED++ CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 1 MSCVPVGEDDDAEACANCGKQGSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKTCKQRA 60
Query: 56 YEIKSSPL 63
E+K L
Sbjct: 61 AELKDEQL 68
>gi|397563864|gb|EJK43971.1| hypothetical protein THAOC_37538 [Thalassiosira oceanica]
Length = 316
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 5/72 (6%)
Query: 3 SIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S I ++D CA C + L C+AC+ V YCG Q+ H K HK C++ E
Sbjct: 2 SCIAVDDGSAGVCANCGKHETDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACRQRAAE 61
Query: 58 IKSSPLLGRYLQ 69
+K L + L+
Sbjct: 62 LKDERLYSQGLE 73
>gi|342319761|gb|EGU11708.1| Serine/arginine repetitive matrix protein 2 [Rhodotorula glutinis
ATCC 204091]
Length = 437
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 5/48 (10%)
Query: 9 DEELMR-CAVCRE----TALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
D+ ++R C CR+ T +CS CK + YCG Q Q+EHW HK C
Sbjct: 53 DQPILRMCMTCRKMDLITKRSRCSGCKMIFYCGPQCQREHWPEHKIDC 100
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 103/267 (38%), Gaps = 32/267 (11%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 51 GNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 102
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 103 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMDGAPSESEKLY 160
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKL-------PGDVLDDTSAKCLHWI 232
D+E++++K + E + +V + E++ P L + AK I
Sbjct: 161 SFYDLESNINKLTEDRK--EGLRQLVMTFQHFMREEIQDASQLPPAFDLFEAFAKV---I 215
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHI----STMY 287
C + ++ GLY I L+ H+C PN + N L RD + + Y
Sbjct: 216 CNSFTICNAEMQEVGVGLYPSISLLNHSCDPNCSIVFNGPHLLLRAVRDIEVGEELTICY 275
Query: 288 TNALWGTQPRREHLAITKYFNCSCERC 314
+ L ++ RR+ L F C C RC
Sbjct: 276 LDMLMTSEERRKQLRDQYCFECDCFRC 302
>gi|397647582|gb|EJK77767.1| hypothetical protein THAOC_00381, partial [Thalassiosira oceanica]
Length = 307
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 65 LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYGQ 108
>gi|159482787|ref|XP_001699449.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272900|gb|EDO98695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 318
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 20/29 (68%)
Query: 26 CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CS CK V YCG++ Q HWK HK +C KL
Sbjct: 271 CSRCKSVRYCGRECQSAHWKEHKQECPKL 299
>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 282
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 100
>gi|322795587|gb|EFZ18266.1| hypothetical protein SINV_00088 [Solenopsis invicta]
Length = 445
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 25/41 (60%)
Query: 11 ELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
E +C VC+E + +CS CKE YC ++ Q + W+ HK C
Sbjct: 391 EAKKCLVCQEASKKRCSKCKEAWYCSRECQVKDWENHKNIC 431
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDSDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|260799513|ref|XP_002594740.1| hypothetical protein BRAFLDRAFT_224157 [Branchiostoma floridae]
gi|229279976|gb|EEN50751.1| hypothetical protein BRAFLDRAFT_224157 [Branchiostoma floridae]
Length = 915
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 5/52 (9%)
Query: 12 LMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
+ RC C++ + + +C+ C + YC +Q QK+HW HK CK P I
Sbjct: 481 ITRCTSCQKISPDGCNMKRCTKCFKAGYCNQQCQKKHWPTHKAICKYSPEPI 532
>gi|118383381|ref|XP_001024845.1| TPR Domain containing protein [Tetrahymena thermophila]
gi|89306612|gb|EAS04600.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
Length = 999
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 24/40 (60%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+C +C + +C ACK YC ++H ++ W HK KCK+
Sbjct: 7 KCVICGKQTNQRCQACKIPYYCSQEHLEQDWGNHKEKCKQ 46
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)
Query: 65 GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
GR L+A H GD + A L VG + E C L+ RC + F
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72
Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
+ C C D H EC+ + + E + A + + I R+P +K
Sbjct: 73 Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
+++ E+H+ K E + I S + L + + L + D +S L C
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
+ ++ ++ + LM H+C PN + +++ ++ H + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245
Query: 293 GTQPRREHLAITKYFNCSCERCS 315
T+ R + L + +F C C C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268
Score = 38.9 bits (89), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C R+ L KC CK+ YC + QKE W LHK +C +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93
>gi|393239660|gb|EJD47190.1| hypothetical protein AURDEDRAFT_163797 [Auricularia delicata
TFB-10046 SS5]
Length = 722
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 3/49 (6%)
Query: 8 EDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWK-LHKPKCKK 53
ED +L CA C L +CS C+EV YCG++ Q E W+ H+ +C +
Sbjct: 671 EDLDLHACAHCELESAVLRRCSLCREVRYCGRECQAEDWQNQHREECSR 719
>gi|307171148|gb|EFN63141.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 641
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 56/131 (42%), Gaps = 8/131 (6%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
I S GR+ AT + PGD I+ E P + E CH L+ NL A+
Sbjct: 251 ISFSERYGRHYIATREFQPGDIISIEDPY---AHVIYEERYYTHCHYCLSKSYNLIPCAK 307
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
CP + CS +C L H EC ILKL L D +R LI+ +
Sbjct: 308 CPIAQY--CSEKCRKLGWEACHKTECPILKLLTNLLNVDKDKIRMISKIIRLLIVVTENG 365
Query: 176 KKYQEL-KDME 185
K +EL +DM+
Sbjct: 366 IKIKELEQDMK 376
>gi|198429203|ref|XP_002125038.1| PREDICTED: similar to ankyrin repeat and MYND domain containing 2
[Ciona intestinalis]
Length = 362
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 27/46 (58%), Gaps = 2/46 (4%)
Query: 15 CAVCRETALH-KCSACKEVAYCGKQHQKEHWKLHKPKC-KKLPYEI 58
C C +T+ +CS CKE YC K+ QK HW +HK C K YEI
Sbjct: 316 CDACGDTSSKLRCSVCKEALYCSKRCQKFHWFVHKKFCGKSKQYEI 361
>gi|397632503|gb|EJK70587.1| hypothetical protein THAOC_08041, partial [Thalassiosira oceanica]
Length = 471
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC V YCG QK H K+HK CK+ E+K L G+
Sbjct: 192 LKNCTACLLVKYCGVDCQKAHRKIHKKACKERAAELKDERLYGQ 235
>gi|397571081|gb|EJK47614.1| hypothetical protein THAOC_33653 [Thalassiosira oceanica]
Length = 350
Score = 40.0 bits (92), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L + L+
Sbjct: 75 LKDCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLFSQGLE 121
>gi|397605791|gb|EJK59116.1| hypothetical protein THAOC_20701 [Thalassiosira oceanica]
Length = 369
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 5 IEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIK 59
+ + D+ CA C + L C+AC+ V YCG Q+ H K HK CKK E+K
Sbjct: 74 VPVADDGAEACANCGKHGSDAVKLKDCTACRLVKYCGVDCQRAHRKQHKKTCKKRAAELK 133
Query: 60 SSPL 63
L
Sbjct: 134 DEEL 137
>gi|414886685|tpg|DAA62699.1| TPA: hypothetical protein ZEAMMB73_171774 [Zea mays]
Length = 83
Score = 40.0 bits (92), Expect = 3.1, Method: Composition-based stats.
Identities = 15/46 (32%), Positives = 27/46 (58%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
A +C C VAYC + HQ HW++HK +C++ +++ L ++
Sbjct: 18 ADRRCGGCGAVAYCSRVHQIIHWRVHKEECERFAEQMRHINALSQF 63
>gi|336365142|gb|EGN93494.1| hypothetical protein SERLA73DRAFT_115908 [Serpula lacrymans var.
lacrymans S7.3]
gi|336377712|gb|EGO18873.1| hypothetical protein SERLADRAFT_454099 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1207
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 15 CAVCRETALHK---CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA C + K CS C++V+YC + Q+ WK HKP+C L
Sbjct: 1162 CAACEKEGKPKLLLCSKCRKVSYCSVECQRREWKKHKPQCAVL 1204
>gi|300121800|emb|CBK22374.2| unnamed protein product [Blastocystis hominis]
Length = 57
Score = 40.0 bits (92), Expect = 3.1, Method: Composition-based stats.
Identities = 17/40 (42%), Positives = 21/40 (52%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+C C + +CS CK V YC KQ Q W +HK C K
Sbjct: 4 KCQFCGKEGALRCSKCKRVYYCSKQCQVRDWPMHKSVCGK 43
>gi|397626218|gb|EJK68075.1| hypothetical protein THAOC_10788 [Thalassiosira oceanica]
Length = 307
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC+ V YCG Q+ H KLHK CK+ E+K L
Sbjct: 31 LKNCNACRLVKYCGVDCQRAHRKLHKKACKQRAAELKDEQL 71
>gi|326436236|gb|EGD81806.1| hypothetical protein PTSG_02518 [Salpingoeca sp. ATCC 50818]
Length = 531
Score = 40.0 bits (92), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 2/41 (4%)
Query: 15 CAVCRETALHK--CSACKEVAYCGKQHQKEHWKLHKPKCKK 53
CA C+ T CS C+EVAYC Q+ HW +HK C +
Sbjct: 465 CAHCKRTLSRPLVCSGCREVAYCSGGCQRRHWAVHKQSCTR 505
>gi|328721096|ref|XP_003247209.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 611
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 75/175 (42%), Gaps = 21/175 (12%)
Query: 57 EIKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHK-PLNPNLADNAR 115
E+KSS +GR + T D++PGD +A + P I G L L C C K P+ P +
Sbjct: 204 ELKSSKNMGRGVYTTRDINPGDVVAIDEPYIYG-GLQLHNSCCRNCLKSPVVP--IPCTK 260
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP 175
C ++ CS C S H EC I+ +L D A+ L L +
Sbjct: 261 CSLVYY--CSENCLHKSMKDGHNLECPII-FFIRSLPGITDLNQLALKWL-SKELSKMGL 316
Query: 176 KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFL---------EKLPGDVL 221
KKY L D + SK P T + E +Y NFL E++P +VL
Sbjct: 317 KKYCSLVDTFSK-SKIDPKTRGFNENG---QYKSDNFLTAYSLDSCEEEMPIEVL 367
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 19/43 (44%), Positives = 26/43 (60%), Gaps = 2/43 (4%)
Query: 13 MRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+RCA C + + L C CK V YCG+ Q H+K HK KC++
Sbjct: 419 VRCANCGKNSVRLMMCGRCKAVMYCGRTCQVAHYKEHKAKCQR 461
>gi|41054031|ref|NP_956188.1| ankyrin repeat and MYND domain-containing protein 2 [Danio rerio]
gi|27882498|gb|AAH44410.1| Zgc:55491 [Danio rerio]
Length = 420
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHK-CSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C C E K CS CK V YCG+ QK HW HK CK L
Sbjct: 320 CTTCGEKGAEKRCSICKMVIYCGQACQKLHWFSHKKVCKML 360
>gi|290982213|ref|XP_002673825.1| predicted protein [Naegleria gruberi]
gi|284087411|gb|EFC41081.1| predicted protein [Naegleria gruberi]
Length = 492
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 25/42 (59%)
Query: 13 MRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+CA C + A H C CK YC + K +WK+HKP C+++
Sbjct: 427 FKCANCGKKANHYCGGCKIERYCSVECMKSNWKVHKPFCQEI 468
>gi|242822562|ref|XP_002487912.1| checkpoint protein kinase, putative [Talaromyces stipitatus ATCC
10500]
gi|218712833|gb|EED12258.1| checkpoint protein kinase, putative [Talaromyces stipitatus ATCC
10500]
Length = 1088
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 4 IIEIEDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+IE+ D + C +C E +C C+ V YC K +Q++ WK +K +CK
Sbjct: 160 LIELSD---LSCVICGEPG-RRCQGCQLVTYCSKDYQRKDWKNYKRECK 204
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
++ L+KCS CK + YC K QKE W+ HK +CK +
Sbjct: 61 QSELYKCSRCKIIMYCCKSCQKEDWQWHKYECKSI 95
>gi|452843846|gb|EME45781.1| hypothetical protein DOTSEDRAFT_34222 [Dothistroma septosporum
NZE10]
Length = 226
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 19/32 (59%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
AL C C E YC K+ QK WK+HK CKK
Sbjct: 194 ALFLCGVCGEGRYCSKECQKREWKVHKSSCKK 225
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)
Query: 65 GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
GR L+A H GD + A L VG + E C L+ RC + F
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72
Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
+ C C D H EC+ + + E + A + + I R+P +K
Sbjct: 73 Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
+++ E+H+ K E + I S + L + + L + D +S L C
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
+ ++ ++ + LM H+C PN + +++ ++ H + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245
Query: 293 GTQPRREHLAITKYFNCSCERCS 315
T+ R + L + +F C C C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268
Score = 38.9 bits (89), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C R+ L KC CK+ YC + QKE W LHK +C +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93
>gi|397596181|gb|EJK56701.1| hypothetical protein THAOC_23362 [Thalassiosira oceanica]
Length = 518
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H + HK CK+ E+K L G+
Sbjct: 245 LKDCTACRLVKYCGVDCQRAHRRQHKKACKRRAAELKDEQLYGQ 288
>gi|392558232|gb|EIW51426.1| hypothetical protein TRAVEDRAFT_25089 [Trametes versicolor
FP-101664 SS1]
Length = 288
Score = 40.0 bits (92), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 17/50 (34%), Positives = 26/50 (52%), Gaps = 1/50 (2%)
Query: 3 SIIEIEDEELMRCAVCRETALHKCSACKE-VAYCGKQHQKEHWKLHKPKC 51
S++ + D + CAVC ++ +C+ C AYC H +EHW H C
Sbjct: 2 SLVAVLDPPMPSCAVCELPSMLRCAYCNSGPAYCSSSHFREHWVTHAVVC 51
>gi|397599887|gb|EJK57538.1| hypothetical protein THAOC_22404, partial [Thalassiosira oceanica]
Length = 440
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 5/71 (7%)
Query: 1 MASIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M+ + E + + CA C + L C+AC+ V YCG Q+ H K HK CK+
Sbjct: 165 MSCVPVAEVDVIEACANCGKQGSDTVKLKNCTACRLVKYCGVDCQRAHRKQHKKACKQRA 224
Query: 56 YEIKSSPLLGR 66
E+K L G+
Sbjct: 225 AELKDEQLYGQ 235
>gi|397590002|gb|EJK54879.1| hypothetical protein THAOC_25455 [Thalassiosira oceanica]
Length = 302
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC V YCG QK H K+HK CK+ E+K L G+
Sbjct: 28 LKNCTACLLVKYCGVDCQKAHRKIHKKACKERAAELKDERLYGQ 71
>gi|354801520|gb|AER39526.1| egl nine 1-like protein [Oncorhynchus mykiss]
Length = 390
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 1/53 (1%)
Query: 2 ASIIEIEDEELMRCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
++++ D + C +C + L KC C+ YC K+HQK WK HK CK+
Sbjct: 9 SAVVTDLDRDQQYCELCGKMENLLKCGRCRNSFYCSKEHQKSDWKKHKQFCKE 61
>gi|397618435|gb|EJK64892.1| hypothetical protein THAOC_14321, partial [Thalassiosira oceanica]
Length = 297
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
CA C +T L C+AC+ V YCG Q+ H K H+ CK+ E+K L G+
Sbjct: 90 CANCGKTGSDAVKLKNCTACRLVKYCGVDCQRAHRKKHQKACKERAAELKDEQLYGQ 146
>gi|307207016|gb|EFN84839.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 638
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 68/152 (44%), Gaps = 13/152 (8%)
Query: 58 IKSSPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNP--NLADNAR 115
+ S GR+ AT D PGD ++ E P + E CH L+ NL ++
Sbjct: 248 VSFSKRYGRHYIATRDFKPGDIVSIEDPY---AHVIYEERYYTHCHYCLSRSYNLIPCSK 304
Query: 116 CPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILP-LRCLILQRRS 174
CP + CS +C L+ H EC ILKL +LL + K I +R LI+ +
Sbjct: 305 CPIAQY--CSEKCRKLAWEVCHMTECPILKL-LTSLLNVDKDKIRMISKIIRLLIIVTAN 361
Query: 175 PKKYQEL-KDM---EAHMSKRGPGTEVYEEID 202
K +EL +DM E++ R G E +D
Sbjct: 362 GSKIEELQQDMKIAESNPDNRTAGFTDTEILD 393
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 40.0 bits (92), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 249 GLYSVICLMEHNCLPNAKHSNMMQSK---LFVFRDTH----ISTMYTNALWGTQPRREHL 301
G+Y V L+ H C+ N + ++SK + RD I+ +T+A T R+ HL
Sbjct: 272 GIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEITHAFTDAASPTVVRKAHL 331
Query: 302 AITKYFNCSCERCSD 316
F+C+C RC+D
Sbjct: 332 KSLYAFDCNCSRCND 346
>gi|397618815|gb|EJK65055.1| hypothetical protein THAOC_14145 [Thalassiosira oceanica]
Length = 434
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 21/41 (51%), Positives = 23/41 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC V YCG QK H KLHK CKK E+K L
Sbjct: 152 LKNCTACFLVKYCGVDCQKSHRKLHKRACKKRAAELKDEEL 192
>gi|397563281|gb|EJK43740.1| hypothetical protein THAOC_37790, partial [Thalassiosira oceanica]
Length = 976
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 5/63 (7%)
Query: 21 TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA-----TLDLH 75
L C+AC+ V YCG Q+ H K HK CK+ E+K L G + LDLH
Sbjct: 315 VGLKNCTACRLVEYCGVDCQRAHRKKHKNVCKERAAELKDEELYGLGHERPEGGFLLDLH 374
Query: 76 PGD 78
D
Sbjct: 375 AAD 377
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L +F K G +D+
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174
Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
+ C + ++ ++ + LM H+C PN + + L R
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232
Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|397612058|gb|EJK61586.1| hypothetical protein THAOC_17901 [Thalassiosira oceanica]
Length = 309
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 22/54 (40%), Positives = 29/54 (53%), Gaps = 5/54 (9%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
CA C +T L C+AC+ V YCG Q+ H K HK +CK+ E+K L
Sbjct: 15 CANCGKTGSDTVKLKSCTACRLVKYCGVDCQRVHRKQHKKECKQRAAELKDEQL 68
>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
Length = 433
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVV 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|298714742|emb|CBJ25641.1| pcdc2/rp-8 (programmed cell death protein 2) [Ectocarpus
siliculosus]
Length = 462
Score = 40.0 bits (92), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 18/39 (46%), Positives = 23/39 (58%), Gaps = 2/39 (5%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL--HKPKC 51
C VC ++A CS C+ YCG++HQ HWK HK C
Sbjct: 173 CCVCGQSAKSACSKCRIARYCGREHQALHWKTGGHKKAC 211
>gi|170118378|ref|XP_001890368.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164634638|gb|EDQ98966.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 532
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 14 RCAVCRET---ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+C V + + A CS CK V YC ++HQ + WK HK CK+
Sbjct: 26 KCHVVKPSGAPAFSVCSGCKIVRYCSREHQVQDWKTHKAFCKR 68
>gi|116179140|ref|XP_001219419.1| hypothetical protein CHGG_00198 [Chaetomium globosum CBS 148.51]
gi|88184495|gb|EAQ91963.1| hypothetical protein CHGG_00198 [Chaetomium globosum CBS 148.51]
Length = 786
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RCAVC + L +C C + YCG +HQ H + HK C +
Sbjct: 10 RCAVCPVRSGLLRCGGCNVIRYCGPEHQSAHRREHKKACTTI 51
>gi|449301926|gb|EMC97935.1| hypothetical protein BAUCODRAFT_407676 [Baudoinia compniacensis
UAMH 10762]
Length = 137
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 20/37 (54%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
C E L CS CK YC K+ QK HWK HKP C
Sbjct: 85 CGAASEIGLLTCSKCKTRKYCCKECQKAHWKNHKPVC 121
>gi|405968412|gb|EKC33486.1| Zinc finger MYND domain-containing protein 10 [Crassostrea gigas]
Length = 451
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 22/38 (57%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
+CAVC A +CS C+ YC ++ Q HWK HK C
Sbjct: 400 KCAVCGAEATKRCSRCQNEWYCRRECQVSHWKKHKTAC 437
>gi|321460122|gb|EFX71168.1| hypothetical protein DAPPUDRAFT_112030 [Daphnia pulex]
Length = 1050
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 26/42 (61%), Gaps = 2/42 (4%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
CA C++ + L +CS C+ YC + Q++HW HK CKK+
Sbjct: 1009 CAACKKVSQSLKRCSRCRSTVYCCVECQRQHWAQHKLVCKKV 1050
>gi|147817677|emb|CAN75589.1| hypothetical protein VITISV_042880 [Vitis vinifera]
Length = 816
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 40/95 (42%), Gaps = 9/95 (9%)
Query: 245 RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTHISTMYTNAL--------WGTQP 296
R Q +YSV L H+C PN H+ + LF+ H++ L W +
Sbjct: 78 RVGQAIYSVASLFNHSCQPNI-HAYFLSRTLFLRATEHVAVGCPLELSYGPQVGQWDCKD 136
Query: 297 RREHLAITKYFNCSCERCSDPTELGTYFSAMKCLN 331
R++ L F C C CS+ +A +C+N
Sbjct: 137 RQKFLKDEYSFRCECSGCSELNVSDLVLNAFRCVN 171
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 127/302 (42%), Gaps = 39/302 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GR L T + GD IA +PL++ P L+ +C C + P +RC ++ C
Sbjct: 18 GRALHTTKTVAAGDVIAVFTPLLLLPSLSHLTTVCSFCLRAGTPR--PCSRCRAAYY--C 73
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELK-- 182
ARC + + H+ ECA L ++ + + E P+R L+ S + ++L
Sbjct: 74 DARCQAAAWSGGHSLECAALVRAVKS----SKKRREIPTPVRALVKVLLSCGQPEDLSKN 129
Query: 183 --DMEAHMS--KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWICGVIEV 238
+E H++ +R PG E ++ F + + +++ + + +C ++
Sbjct: 130 MDGLEGHVAERRREPGWADME----MMAMGGCAFAGR---ETSEESVRQAVEILCK-LQT 181
Query: 239 NG-----VDIGRYTQGLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYT 288
N D+G L + + H+CLPNA + +++++ + + I YT
Sbjct: 182 NAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVLRAESRIQKGDEIEISYT 241
Query: 289 NALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK----CLNEHKDQGDCWILPV 344
+ R+ LA F C C RC+ +L Y ++ LNE GD L
Sbjct: 242 DYTSSLGKRKAALAPYN-FECRCRRCTQ--DLCVYQASAHYRDYMLNEDSILGDGTKLQT 298
Query: 345 NP 346
+P
Sbjct: 299 HP 300
>gi|397646269|gb|EJK77200.1| hypothetical protein THAOC_00984 [Thalassiosira oceanica]
Length = 316
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L KC+AC V YCG QK H K HK CK+ E+K L G+
Sbjct: 31 LKKCTACLLVKYCGVDCQKAHRKQHKKLCKQRVAELKDERLYGQ 74
>gi|397637923|gb|EJK72864.1| hypothetical protein THAOC_05566 [Thalassiosira oceanica]
Length = 305
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 32/66 (48%), Gaps = 5/66 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C TA L C+AC+ V YCG Q H K HK CK+ E
Sbjct: 2 SCVPVVDDGNEVCANCGTTASNTVKLKNCTACRLVKYCGVDCQGAHRKHHKKACKQRAAE 61
Query: 58 IKSSPL 63
+K L
Sbjct: 62 LKDEQL 67
>gi|397601150|gb|EJK57830.1| hypothetical protein THAOC_22092 [Thalassiosira oceanica]
Length = 284
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 26/49 (53%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
C L KC+AC+ V YC Q+ H K HK CKK E++ L G+
Sbjct: 22 CDAVKLKKCTACQLVKYCSVDCQRAHRKHHKGACKKRAAELEDEQLYGQ 70
>gi|397590805|gb|EJK55166.1| hypothetical protein THAOC_25129, partial [Thalassiosira oceanica]
Length = 273
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L C+AC+ V YCG Q+ H KLHK CK+ E+K L
Sbjct: 232 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQL 272
>gi|397564868|gb|EJK44379.1| hypothetical protein THAOC_37083 [Thalassiosira oceanica]
Length = 684
Score = 39.7 bits (91), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H KLHK CK+ E+K + + L+
Sbjct: 29 LKNCTACRLVKYCGVDCQRAHRKLHKKACKQRVAELKDEQMYSQGLE 75
>gi|356546765|ref|XP_003541793.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
max]
Length = 990
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL-PYEIKSSPL 63
CA C A +CS CK V YC Q HW+L HK +C++L P++ S PL
Sbjct: 76 CARCSAPARTRCSRCKIVRYCSGNCQIIHWRLIHKQECQQLEPHKSSSFPL 126
>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
Length = 417
Score = 39.7 bits (91), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 11/91 (12%)
Query: 257 MEHNCLPNAKH-------SNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAITKYFNC 309
+ H C PNA++ S+ ++ + I+ YT+ L T+ R+ HLA +F C
Sbjct: 242 LNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHLAQGFHFTC 301
Query: 310 SCERCSDPTELGTYFSAMKCLNEHKDQGDCW 340
+C RC+ P T + +N+ + Q + W
Sbjct: 302 TCPRCTSPITDIT----LSIINDLQAQLNDW 328
>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=HSKM-B; AltName: Full=Histone methyltransferase
SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
AltName: Full=SET and MYND domain-containing protein 2
gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
Length = 433
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 47/216 (21%), Positives = 87/216 (40%), Gaps = 17/216 (7%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L +F K G +D+
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALH-HFYSKHLGFPDNDSLVV 174
Query: 228 CLHWI-CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
+ C + ++ ++ + LM H+C PN + + L R
Sbjct: 175 LFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232
Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|449486818|ref|XP_004157412.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Cucumis
sativus]
Length = 711
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
C VC KCS CK V YC Q+ HWK HK KCK
Sbjct: 78 CVVCGNLGSKKCSRCKAVRYCSPTCQEIHWKAGHKTKCKDF 118
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 39.7 bits (91), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 58/270 (21%), Positives = 102/270 (37%), Gaps = 38/270 (14%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + ++ + + +K
Sbjct: 67 IAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVVKLMDGKPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E+++SK + E+ ++ L F + ++ D D
Sbjct: 125 SFYDLESNISK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IS 284
IC + ++ GLY + L+ H+C PN S + + R ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNC--SIVFNGPHLLLRAVREIEAGEELT 236
Query: 285 TMYTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 237 ICYLDMLMTSEERRKQLRDQYCFECDCIRC 266
>gi|451996998|gb|EMD89464.1| hypothetical protein COCHEDRAFT_1205568 [Cochliobolus
heterostrophus C5]
Length = 437
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 53/127 (41%), Gaps = 15/127 (11%)
Query: 23 LHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKLPY---EIKSSPLLGRYLQATLDLHPGD 78
L +C ACK V+YC + QK+ W + HK CKKL + + P GR+
Sbjct: 37 LKRCIACKRVSYCSVECQKQDWNRNHKQICKKLVTLNKQKRYVPSAGRHW---------- 86
Query: 79 RIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHA 138
R RE + K +P LG L D A C + ++ C+ H A
Sbjct: 87 REYREEQELEFSKSGTMDPCELGTLIGRESQLEDGANLNACQYCHLTSFCTSCQQTHPSA 146
Query: 139 PECAILK 145
ECA L+
Sbjct: 147 -ECATLQ 152
>gi|397607252|gb|EJK59617.1| hypothetical protein THAOC_20130 [Thalassiosira oceanica]
Length = 312
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 46/103 (44%), Gaps = 5/103 (4%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQA-TLDLHPGDRIA 81
L C+AC+ V YCG Q+ H K HK CK+ E++ L G+ L+ D P +
Sbjct: 37 LKDCAACRLVKYCGVDCQRAHRKQHKKACKQRAAELEDEKLYGQGLERPKGDFCPICTLP 96
Query: 82 RESPLIVGP--KLALAEPICLGCHKPLNPNLADNARCPRCFWP 122
P+ K ++ IC+GC D+ CP C P
Sbjct: 97 IPLPMDKHSYFKTCCSKMICIGCSMATQKRGMDD--CPFCRTP 137
>gi|397575063|gb|EJK49512.1| hypothetical protein THAOC_31605 [Thalassiosira oceanica]
Length = 309
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/46 (47%), Positives = 25/46 (54%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYL 68
L KC+AC V YC QK H K HK CKK E+K L G+ L
Sbjct: 23 LKKCNACHLVKYCSVGCQKAHRKQHKGACKKRAAELKDEQLYGQGL 68
>gi|397568200|gb|EJK46011.1| hypothetical protein THAOC_35344 [Thalassiosira oceanica]
Length = 425
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 5/62 (8%)
Query: 7 IEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSS 61
+ D + CA C + L C+AC+ V YCG Q+ H K HK CK+ E+K
Sbjct: 149 VADNGIEACANCGKEGSITVKLRNCTACRLVKYCGVDCQRAHRKQHKKACKQRAAELKDE 208
Query: 62 PL 63
L
Sbjct: 209 KL 210
>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
Length = 597
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 107/297 (36%), Gaps = 60/297 (20%)
Query: 113 NARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL--IL 170
N+ C W C+ L + HAP + +A+ RCL +L
Sbjct: 139 NSECQNKDWATHKRECTALQEWAKHAPAAEV------------SVPSDAV---RCLGRLL 183
Query: 171 QRRSPKKY-----QELKDMEAHMSKRGP-GTEVYEEID-SIVKYLR-------SNFLEKL 216
+R K Q++ M++ P G E + + S+V+YL +F
Sbjct: 184 WKRQKKGLDSTWAQQIAAMQSSRKSLQPSGYEFHTHLSHSLVRYLGLSSPDGLQDFGISS 243
Query: 217 PGDVLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSK-- 274
PGD+LD TS + V D+ + ++ L+ H+C PNA SK
Sbjct: 244 PGDLLDLTS----RFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPRASKTP 299
Query: 275 --------LFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGT 322
+ RD I T Y + RR L F C C C P +
Sbjct: 300 STDEPLISVVAIRDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLCKTPPPV-- 357
Query: 323 YFSAMKCLNEHKDQGDCWILPV--NPLDNDSDWTCGSCSARLNARDVHLVTSQLGEQ 377
A + L K G LP+ NPL C +C A + D L +++G++
Sbjct: 358 --DAREALWCPKSCGGMCPLPIEENPLAQ-----CANCKAVVKDTDAVLDATRVGQE 407
>gi|297722479|ref|NP_001173603.1| Os03g0704200 [Oryza sativa Japonica Group]
gi|255674817|dbj|BAH92331.1| Os03g0704200 [Oryza sativa Japonica Group]
Length = 174
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG Q+E WKLH+ +C+ +
Sbjct: 90 LRKCSVCRVAWYCGSACQREEWKLHQLECRAI 121
>gi|449447371|ref|XP_004141442.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Cucumis
sativus]
Length = 711
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
C VC KCS CK V YC Q+ HWK HK KCK
Sbjct: 78 CVVCGNLGSKKCSRCKAVRYCSPTCQEIHWKAGHKTKCKDF 118
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 52/230 (22%), Positives = 93/230 (40%), Gaps = 26/230 (11%)
Query: 99 CLGCHKPLNPNLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYK 158
C GC L+ RC + F+ C+ C D H ECA + N +
Sbjct: 55 CDGCF-ARKEGLSKCGRCKQAFY--CNVECQK-EDWPMHKLECASM-----CTFGQNWHP 105
Query: 159 YEAI-LPLRCLILQRRSPKKYQE-----LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNF 212
E + L R L Q+ P++ Q +K+ E+H+ K E E I + + L +
Sbjct: 106 SETVRLTARILAKQKTHPERTQSEKLLAVKEFESHLDKLD--NEKRELIQNDIAALHHFY 163
Query: 213 LEKLPGDVLDDTSAKCL--HWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-N 269
+ L D D+ + L C + ++ ++ + LM H+C PN +
Sbjct: 164 SKHL--DYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYK 221
Query: 270 MMQSKLFVFRDT----HISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+++ R+ I + Y + L+ T+ R + L + +F+C C C+
Sbjct: 222 GTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCRECT 271
>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
Length = 433
Score = 39.7 bits (91), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 43/213 (20%), Positives = 84/213 (39%), Gaps = 11/213 (5%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNM-----MQSKLFVFRDTH 282
C + ++ ++ + LM H+C PN + +++ +
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEISPGEE 235
Query: 283 ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 236 VFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|393238191|gb|EJD45729.1| hypothetical protein AURDEDRAFT_114035 [Auricularia delicata
TFB-10046 SS5]
Length = 249
Score = 39.7 bits (91), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 22 ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLG 65
AL C+ CK VAYC K+ Q+ HWK H+ C+ I +S G
Sbjct: 40 ALSICAGCKFVAYCSKECQRAHWKSHQGVCRLTAATIGASRAQG 83
>gi|397646369|gb|EJK77239.1| hypothetical protein THAOC_00940, partial [Thalassiosira oceanica]
Length = 616
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Query: 15 CAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
CA C +T L C+AC+ V YCG Q+ H K HK CK+ E+K L + L+
Sbjct: 222 CANCGKTGSDTVKLKHCTACRIVKYCGVDCQRAHRKQHKKACKQRAAELKDEQLYSQGLE 281
>gi|358388034|gb|EHK25628.1| hypothetical protein TRIVIDRAFT_32886 [Trichoderma virens Gv29-8]
Length = 599
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%)
Query: 17 VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
+C+E C +C+ VAYCG Q+ HW+ HK CK
Sbjct: 18 LCKEIGKLPCKSCRMVAYCGISCQQSHWEEHKKHCK 53
>gi|397597611|gb|EJK57048.1| hypothetical protein THAOC_22954 [Thalassiosira oceanica]
Length = 781
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S + + D+ CA C + L C+AC+ V YCG Q++H K HK CK+ E
Sbjct: 483 SCVPVADDGDEACANCGKQGGDTVKLKSCTACRLVKYCGVDCQRDHRKQHKKTCKQRAAE 542
Query: 58 IKSSPL 63
+K L
Sbjct: 543 LKDEEL 548
>gi|195064659|ref|XP_001996607.1| GH23322 [Drosophila grimshawi]
gi|193899819|gb|EDV98685.1| GH23322 [Drosophila grimshawi]
Length = 466
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%)
Query: 8 EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
++E++ +C C + A KC CKE+ YC ++ Q + W HK C
Sbjct: 417 KEEKIYKCGACYKVAKKKCGNCKEMYYCSRECQLKDWPRHKESC 460
>gi|170108840|ref|XP_001885628.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164639504|gb|EDR03775.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 247
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
ET L C+ CK V YC +Q Q + WK HKP C
Sbjct: 36 ETTLKVCARCKAVKYCSRQCQTKDWKNHKPAC 67
>gi|406862909|gb|EKD15958.1| hypothetical protein MBM_05969 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 276
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 22/42 (52%)
Query: 12 LMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
++ C + L C CK YCGK+HQK WK HK C K
Sbjct: 228 VVSCTLRGADKLTVCGVCKIQRYCGKEHQKADWKYHKHICAK 269
>gi|389750668|gb|EIM91741.1| hypothetical protein STEHIDRAFT_164225 [Stereum hirsutum FP-91666
SS1]
Length = 442
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/42 (47%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 13 MRCAVCRE---TALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
M C R+ L CS CK V YCGK Q+ HWK HK C
Sbjct: 399 MSCGKGRDGEPVDLQCCSRCKTVKYCGKDCQRLHWKFHKITC 440
>gi|330935391|ref|XP_003304944.1| hypothetical protein PTT_17678 [Pyrenophora teres f. teres 0-1]
gi|311318134|gb|EFQ86881.1| hypothetical protein PTT_17678 [Pyrenophora teres f. teres 0-1]
Length = 327
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 19/31 (61%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
+ +CS C + YCGKQHQK W H+ C K
Sbjct: 296 VSRCSGCGQAYYCGKQHQKLAWGFHRGYCAK 326
>gi|71991003|ref|NP_001023832.1| Protein EGL-9, isoform b [Caenorhabditis elegans]
gi|22265838|emb|CAD44114.1| Protein EGL-9, isoform b [Caenorhabditis elegans]
Length = 363
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 20/37 (54%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C + L C C VAYC K+HQ+ W HK CK L
Sbjct: 46 CTSSQLQTCLFCGTVAYCSKEHQQLDWLTHKMICKSL 82
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 45/216 (20%), Positives = 86/216 (39%), Gaps = 17/216 (7%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSK-RGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSA 226
I R+P +K +K+ E+H+ K ++ + + + + S LE D L A
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVVFA 177
Query: 227 KCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH---- 282
+ C + ++ ++ + LM H+C PN + + L R
Sbjct: 178 QV---NCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKP 232
Query: 283 ---ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 233 GEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|392567157|gb|EIW60332.1| hypothetical protein TRAVEDRAFT_45581 [Trametes versicolor
FP-101664 SS1]
Length = 704
Score = 39.7 bits (91), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 24/49 (48%), Gaps = 2/49 (4%)
Query: 6 EIEDEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
E+ EL RC+ C L KC+ C + YC QK HW HK CK
Sbjct: 653 EMTSYELYRCSWCGNPSAVLRKCAGCGKTKYCDSGCQKSHWTEHKADCK 701
>gi|397642097|gb|EJK75025.1| hypothetical protein THAOC_03266 [Thalassiosira oceanica]
Length = 286
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H + HK CK+ E+K L G+
Sbjct: 31 LKDCTACRLVKYCGVDCQRAHRRQHKKSCKRRAAELKDEQLYGQ 74
>gi|397575636|gb|EJK49801.1| hypothetical protein THAOC_31286 [Thalassiosira oceanica]
Length = 490
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 34/66 (51%), Gaps = 7/66 (10%)
Query: 3 SIIEIEDEELMRCAVC-----RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
+++ +DE + CA C L C+AC+ V YCG Q+ H K HK +CK+ E
Sbjct: 4 TLVATDDEAV--CANCGLVSSDTVKLKSCTACRLVKYCGVDCQRAHRKQHKKECKRRATE 61
Query: 58 IKSSPL 63
+K L
Sbjct: 62 LKDEQL 67
>gi|330914309|ref|XP_003296581.1| hypothetical protein PTT_06719 [Pyrenophora teres f. teres 0-1]
gi|311331195|gb|EFQ95321.1| hypothetical protein PTT_06719 [Pyrenophora teres f. teres 0-1]
Length = 461
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 7/47 (14%)
Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
RCA C ++A L +CS CK YCG+ QK WK HK C +
Sbjct: 7 RCANCNKSAAECSLANLQQCSRCKTTEYCGRDCQKADWKTHKKICAQ 53
>gi|392567689|gb|EIW60864.1| hypothetical protein TRAVEDRAFT_71120 [Trametes versicolor
FP-101664 SS1]
Length = 322
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Query: 11 ELMRCAVCRET-----ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+L RC+ C E L KC+AC+ V YC KQ QK W H+ C+ +
Sbjct: 13 DLRRCSWCGEAEKPSQKLKKCAACEYVMYCSKQCQKSAWSDHRGSCQYM 61
>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
mulatta]
Length = 433
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
Length = 384
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 44/210 (20%), Positives = 81/210 (38%), Gaps = 15/210 (7%)
Query: 114 ARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRR 173
RC + F+ C+ C D H EC+ + + E + A + + I R
Sbjct: 17 GRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQKIHAER 73
Query: 174 SP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI 232
+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 74 TPSEKLLAVKEFESHLEKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVN 131
Query: 233 CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH-------IST 285
C + ++ ++ + LM H+C PN + + L R + T
Sbjct: 132 CNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEINPGEEVFT 189
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCS 315
Y + L+ T+ R + L + +F C C+ C+
Sbjct: 190 SYIDLLYPTEDRNDRLRDSYFFTCECQECT 219
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 39.7 bits (91), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 111/281 (39%), Gaps = 22/281 (7%)
Query: 61 SPLLGRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
SP G L+A + G+ + PL LA+ +C C L RC +C
Sbjct: 11 SPGKGNGLRAAGRIRKGELVISAEPLACCVSNRLAKEVCHHCFTRHESLL----RCSQCK 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
C+ C + + H EC L+ L + + A L L + S ++
Sbjct: 67 MARYCNTTCQKQAWS-GHKRECKCLQ-SLLPRLPTDSVRLAAKLIFALLSPSKSSSEELY 124
Query: 180 ELKDMEAHMS-----KRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-- 232
L++ E+H++ K+ +++ ++ ++ + L+++ G L + + L I
Sbjct: 125 TLEEHESHLTSMPEQKKQGLSQLASMLELYLQQEAPDLLQEM-GSALPPSCREPLSLIAK 183
Query: 233 --CGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HIST 285
C ++ ++ GLY + L+ H+C PN + +L RD ++
Sbjct: 184 VTCNCFTISDGELQEIGVGLYPSLSLLNHDCRPNCVMVFEGTKLELRAVRDIDPEDELTI 243
Query: 286 MYTNALWGTQPRREHLAITKYFNCSCERCSDPTELGTYFSA 326
Y L T+ R+ L +F C C+RC + G S
Sbjct: 244 SYIETLSLTEDRQRQLEEQYHFTCHCQRCDSRDKDGLMLSG 284
>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
Length = 433
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|397569322|gb|EJK46672.1| hypothetical protein THAOC_34653 [Thalassiosira oceanica]
Length = 392
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L G+
Sbjct: 22 LKDCTACRLVKYCGVDCQRAHRKRHKKACKQRAAELKDERLYGQ 65
>gi|321463445|gb|EFX74461.1| hypothetical protein DAPPUDRAFT_251928 [Daphnia pulex]
Length = 790
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 2/39 (5%)
Query: 15 CAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C++ + L +CS C+ V YC + Q+ HW LHK C
Sbjct: 746 CASCKKISENLKRCSRCRTVVYCSVECQRGHWTLHKTVC 784
>gi|383860124|ref|XP_003705541.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 2-like
[Megachile rotundata]
Length = 407
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 1/45 (2%)
Query: 15 CAVCRE-TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEI 58
C C E A KCS CK V YC ++ Q+ HW +HK C +L I
Sbjct: 335 CITCGEDKANKKCSKCKIVQYCDRECQRLHWFMHKKACTRLGQSI 379
>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
Length = 391
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 18 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 74
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 75 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 132
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 133 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 190
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 191 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 225
>gi|443698784|gb|ELT98594.1| hypothetical protein CAPTEDRAFT_229252 [Capitella teleta]
Length = 351
Score = 39.7 bits (91), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 15/30 (50%), Positives = 21/30 (70%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
L C AC++ YC ++HQK HW+ HK +CK
Sbjct: 28 LSLCGACRQTWYCCREHQKSHWRQHKKECK 57
>gi|401883282|gb|EJT47497.1| hypothetical protein A1Q1_03609 [Trichosporon asahii var. asahii
CBS 2479]
Length = 358
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 44/101 (43%), Gaps = 10/101 (9%)
Query: 223 DTSAKCLHWICGVIEVNGVDIGRYT--QGLYSVICLMEHNCLPNAKHS---NMMQSKLFV 277
D S C ++ + + N + +G + +Y +C + H+C+PNA HS N+ +
Sbjct: 141 DPSHPCANF--AIFDGNALPLGANSPKGAVYPTLCRINHSCIPNAHHSWNENLEMETIHA 198
Query: 278 FRDTHISTMYTNALWGT---QPRREHLAITKYFNCSCERCS 315
RD T + RR HL F C+CE CS
Sbjct: 199 LRDISKGEEITIGYFLDGCRAERRAHLQNKFGFECNCEACS 239
>gi|397633519|gb|EJK71016.1| hypothetical protein THAOC_07582, partial [Thalassiosira oceanica]
Length = 430
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L KC+AC+ V YCG Q+ H K HK CK+ E+K L
Sbjct: 158 LRKCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQL 198
>gi|395323289|gb|EJF55768.1| hypothetical protein DICSQDRAFT_113934 [Dichomitus squalens LYAD-421
SS1]
Length = 1176
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 25/48 (52%), Gaps = 3/48 (6%)
Query: 8 EDEELMRCAVCR---ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D L +C+VCR + L C C+ V YC QK+ W HK +CK
Sbjct: 1127 RDPTLHKCSVCRGKGKPKLMTCKGCQRVRYCSTACQKKDWPRHKAQCK 1174
>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
griseus]
Length = 404
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 22/36 (61%)
Query: 19 RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
R+ L KC CK+ YCG + QKE W LHK +C +
Sbjct: 29 RKEGLSKCGRCKQAFYCGVECQKEDWPLHKLECSSM 64
Score = 38.5 bits (88), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 87/215 (40%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C C D H EC+ + + E + A + +
Sbjct: 32 GLSKCGRCKQAFY--CGVECQK-EDWPLHKLECSSMVVFGENWNPSETVRLTARILAKQK 88
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + L + D +S
Sbjct: 89 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHQFYSRHL--EFPDHSSLV 144
Query: 228 CL--HWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH-- 282
L C + ++ ++ + LM H+C PN + +++ ++ H
Sbjct: 145 VLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPG 204
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C C+
Sbjct: 205 DEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECT 239
>gi|299740414|ref|XP_001838814.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
gi|298404233|gb|EAU83041.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
Length = 1320
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 26 CSACKEVAYCGKQHQKEHWK-LHKPKCKKLPYEIKS 60
CS CK VAYC ++ QK+ WK +H+ +CK L + KS
Sbjct: 458 CSGCKSVAYCSRECQKQDWKRVHREECKLLATQRKS 493
>gi|389739146|gb|EIM80340.1| hypothetical protein STEHIDRAFT_68351 [Stereum hirsutum FP-91666
SS1]
Length = 652
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 16 AVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
A C + C +CK V YC K Q +HWK+HK CK
Sbjct: 115 AACPKRGSSLCGSCKLVQYCSKICQAQHWKIHKQDCK 151
>gi|384250513|gb|EIE23992.1| hypothetical protein COCSUDRAFT_62519 [Coccomyxa subellipsoidea
C-169]
Length = 620
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 43/187 (22%), Positives = 75/187 (40%), Gaps = 25/187 (13%)
Query: 244 GRYTQGLYSVICLMEHNCLPNAKH------SNMMQSKLFVFRDTHISTMYTNALWGTQPR 297
G+ +YS+I + H+C N + + ++++ + + D + Y + L R
Sbjct: 123 GQPPAAMYSLISRLNHSCRANTAYHFKAGGAVVVRAVVDIDPDEELCISYLDPLQPLSRR 182
Query: 298 REHLAITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPV---NPLDNDSDWT 354
+E L F CSCERC+ ++ ++ + W L N + + D
Sbjct: 183 QEDLQQRFCFTCSCERCTLEQQM---LASTGLQDGTPAHAPTWFLSAIAPNAVGTEVD-- 237
Query: 355 CGSCSARLNARDVHLVTSQLGEQVDKLVQENPNVKSLEE-MLTKLEAMFHPHHYHCYAVK 413
SC RL T Q E+ + +LE +L +E HP+HY C A+
Sbjct: 238 --SCLTRL--------TRQAEEEFVRQGNATAAWTALEAGILQAVEEGAHPYHYRCLALY 287
Query: 414 HSLIQLY 420
L+ Y
Sbjct: 288 ACLVSAY 294
>gi|147768947|emb|CAN69258.1| hypothetical protein VITISV_026383 [Vitis vinifera]
Length = 502
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 18/50 (36%), Positives = 24/50 (48%)
Query: 18 CRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRY 67
C A +C C VAYC HQ HW HK C +L ++K +L +
Sbjct: 12 CIGPATRRCGRCGAVAYCSVSHQVSHWSDHKEXCGRLEQQMKLVDVLNDF 61
>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 742
Score = 39.7 bits (91), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 117/341 (34%), Gaps = 57/341 (16%)
Query: 65 GRYLQATLDLHPGDRIARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCFWPAC 124
GRY A+ + G+ + E+P C C PL C C C
Sbjct: 257 GRYFVASTKIEYGELLIFENPFAFVLLPEYYNSFCYNCCVPLKYYSIPCDNC--CTILFC 314
Query: 125 SARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQELKDM 184
+C + H EC K G A L LR I ++ ++ ++
Sbjct: 315 GDKCLQEARNSYHRWEC---KQGTSIFKCIGI----AHLALRLTIETSQANSNNDQIYNL 367
Query: 185 EAHMSKRGPGTEVYE---EIDSIVKYL--RSNFLEKLPGDVLDDTSAKCLHWICGVIEVN 239
H+ E+Y+ ++ YL +++F EK P VLD + LH + ++ N
Sbjct: 368 LTHIDDL-KSLELYQYSLTATLLLIYLQKKTDFFEKHPNLVLDSVGNELLHHMTRLV-CN 425
Query: 240 GVDIG---------------------RYTQGLYSVICLMEHNCLPNAKHSNMMQSKLF-V 277
G I R ++ L+ H+C PN SN+++ +
Sbjct: 426 GNAISTHMLSDYDSGSRTPIIDESQPRIGTAIFPTSSLLNHSCDPNIFSSNILKYVVIKA 485
Query: 278 FRDTHISTMYTNALWG-------TQPRREHLAITKYFNCSCERCSDPTELGTYFSAMK-- 328
RD TN +G R+ L +F+C C C DP +F +
Sbjct: 486 SRDISEGEEITNC-YGPNFLRMRVVDRQASLKNQYHFDCECNTCLDPQADDLFFKTFEGL 544
Query: 329 -CLNEHKDQGDCWILPVNPLDNDSDWT--CGSCSARLNARD 366
CL H +P D D + T CG C R D
Sbjct: 545 VCLRCHNG------IPATLSDLDVNETVYCGLCYVRFRTLD 579
>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
Length = 520
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
L KCS C+ YCG QKE WKLH+ +C+ +
Sbjct: 69 LRKCSVCRVAWYCGSVCQKEEWKLHQLECQAI 100
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|330934901|ref|XP_003304751.1| hypothetical protein PTT_17418 [Pyrenophora teres f. teres 0-1]
gi|311318512|gb|EFQ87155.1| hypothetical protein PTT_17418 [Pyrenophora teres f. teres 0-1]
Length = 557
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/47 (42%), Positives = 25/47 (53%), Gaps = 7/47 (14%)
Query: 14 RCAVCRETA-------LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
RCA C ++A L +CS CK YCG+ QK WK HK C +
Sbjct: 7 RCANCNKSAAECSLANLQQCSRCKTTEYCGRDCQKADWKTHKKICAQ 53
>gi|169848916|ref|XP_001831162.1| hypothetical protein CC1G_09865 [Coprinopsis cinerea okayama7#130]
gi|116507730|gb|EAU90625.1| hypothetical protein CC1G_09865 [Coprinopsis cinerea okayama7#130]
Length = 443
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 11 ELMRCAVCRET-ALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
+L C VC T +CS C V YCG + Q E W HK CK L
Sbjct: 254 DLSGCEVCGSTKGASRCSGCLSVLYCGTKCQTEDWPRHKATCKSL 298
>gi|405959392|gb|EKC25437.1| hypothetical protein CGI_10023056 [Crassostrea gigas]
Length = 475
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 18/37 (48%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
CA C+ T +CS CK + YC + QKEHW H+P C
Sbjct: 436 CARCKTTGSLQCSRCK-MPYCSHECQKEHWPEHRPHC 471
>gi|440790393|gb|ELR11676.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 669
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 9 DEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKKL 54
D++ CA C +TA C CK V YC ++ Q +HW+ HK C K+
Sbjct: 617 DKKKQSCARCGKTANKACGGCKLVWYCSRECQSDHWQQSHKAACSKV 663
>gi|397631833|gb|EJK70303.1| hypothetical protein THAOC_08347, partial [Thalassiosira oceanica]
Length = 351
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 24/44 (54%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H K HK CK+ E+K L R
Sbjct: 73 LKNCTACRLVKYCGVDCQRAHRKKHKKACKQRAAELKDEQLYSR 116
>gi|24644165|ref|NP_730906.1| HIF prolyl hydroxylase, isoform C [Drosophila melanogaster]
gi|20151779|gb|AAM11249.1| RE70727p [Drosophila melanogaster]
gi|23170431|gb|AAF52052.2| HIF prolyl hydroxylase, isoform C [Drosophila melanogaster]
gi|220948766|gb|ACL86926.1| Hph-PB [synthetic construct]
gi|220958252|gb|ACL91669.1| Hph-PB [synthetic construct]
Length = 478
Score = 39.7 bits (91), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 17/42 (40%), Positives = 25/42 (59%), Gaps = 1/42 (2%)
Query: 14 RCAVC-RETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
RC++C + L +C+ CK V YC HQ HW H+ +C+ L
Sbjct: 29 RCSICGTQQQLLRCAKCKAVYYCSPAHQHLHWPDHRTECRLL 70
>gi|397582820|gb|EJK52429.1| hypothetical protein THAOC_28291, partial [Thalassiosira
oceanica]
Length = 337
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 28/57 (49%)
Query: 17 VCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQATLD 73
C L C+AC+ V YC Q+ H K+HK C+ E++ L G+ L+ D
Sbjct: 17 ACGAAKLKTCTACRLVKYCSVDCQRAHRKIHKKACRARAAELRDERLYGQGLERPED 73
>gi|397569320|gb|EJK46670.1| hypothetical protein THAOC_34651 [Thalassiosira oceanica]
Length = 303
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ + YCG Q+ H + HK CKK E+K L G+
Sbjct: 23 LKDCTACRLIKYCGVDCQRAHRRQHKKACKKRAAELKDERLYGQ 66
>gi|67613152|ref|XP_667280.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658400|gb|EAL37050.1| hypothetical protein Chro.40234 [Cryptosporidium hominis]
Length = 582
Score = 39.3 bits (90), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 41/192 (21%), Positives = 84/192 (43%), Gaps = 24/192 (12%)
Query: 250 LYSVICLMEHNCLPN------AKHSNMMQSK--LFVFRDTHISTMYTNALWGTQPRREHL 301
LY+VI +M HNC + ++ ++++K L + + IS + + L+ R L
Sbjct: 313 LYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEIGDEITISYISDDDLFKCSKTRREL 372
Query: 302 AITKYFNCSCERCSDPTELGTYFSAMKCLNEHKDQGDCWILPVNPLDNDSDWT----CGS 357
F C CERC++PT+L +KC C + + ++++ T C
Sbjct: 373 LSNWLFYCQCERCNNPTDLS---RGLKC-------ASCGVGSMFFKEDENGVTRSSRCSV 422
Query: 358 CSARLNARDVHLVTSQLGEQVDKLVQENPN-VKSLEEMLTKLEAMFHPHHYHCYAVKHSL 416
C + + + T + V +L + + N V+ +E + + + +F H+ Y + L
Sbjct: 423 CRSHPDREIIDSYTDLESQYVARLEETDKNDVEDVEAVYQEAQKVF-TQHWIMYELDTML 481
Query: 417 IQLYGTQPGYAY 428
+ Y +++
Sbjct: 482 FEAYKESQQFSF 493
>gi|397568499|gb|EJK46168.1| hypothetical protein THAOC_35179 [Thalassiosira oceanica]
Length = 374
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 27/56 (48%)
Query: 14 RCAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
R L +C+AC+ V YC QK H K HK CKK E+K L + L+
Sbjct: 99 RTKAGEAVVLKRCTACRLVKYCSVDCQKAHRKYHKQACKKRAAELKDEQLYEQGLE 154
>gi|397568098|gb|EJK45952.1| hypothetical protein THAOC_35408, partial [Thalassiosira oceanica]
Length = 868
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L KC+AC+ V YCG Q+ H K HK CK+ E+K L
Sbjct: 156 LRKCTACRIVKYCGVDCQRAHRKQHKKACKQRAAELKDERL 196
>gi|323454349|gb|EGB10219.1| hypothetical protein AURANDRAFT_62835 [Aureococcus anophagefferens]
Length = 3543
Score = 39.3 bits (90), Expect = 4.7, Method: Composition-based stats.
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
L KC+ CK YC ++ QK W +HK C+K
Sbjct: 3509 LLKCARCKRTHYCSRECQKADWSVHKLTCRK 3539
>gi|397568360|gb|EJK46098.1| hypothetical protein THAOC_35255 [Thalassiosira oceanica]
Length = 307
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGR 66
L C+AC+ V YCG Q+ H + HK CK+ E+K L G+
Sbjct: 30 LKDCTACRLVKYCGVDCQRAHRRQHKKACKQRAAELKDEQLYGQ 73
>gi|302677903|ref|XP_003028634.1| expressed protein [Schizophyllum commune H4-8]
gi|300102323|gb|EFI93731.1| expressed protein [Schizophyllum commune H4-8]
Length = 219
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 9 DEELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
D EL RC+ CR L KC C + YC + QK W HK CK
Sbjct: 171 DFELYRCSYCRNPSAVLRKCGGCSKARYCDAECQKLGWAAHKKTCK 216
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 39.3 bits (90), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 59/268 (22%), Positives = 102/268 (38%), Gaps = 34/268 (12%)
Query: 65 GRYLQATLDLHPGDRIARESPLIV----GPKLALAEPICLGCHKPLNPNLADNARCPRC- 119
G L+A + L PG+ + R PL G + + + LG K + RC +C
Sbjct: 15 GNGLRAVVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLM--------RCSQCR 66
Query: 120 FWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSPKKYQ 179
CSA+C + H EC LK C+ + + + + + +K
Sbjct: 67 VAKYCSAKCQKKA-WPDHKRECKCLK-SCKPRYPPDSVRLLGRVVFKLMEEIPSESEKLY 124
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLD--------DTSAKCLHW 231
D+E++++K + E+ ++ L F + ++ D D
Sbjct: 125 TFYDLESNINK------LTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDIFEAFAKV 178
Query: 232 ICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDT----HISTM 286
IC + ++ GLY + L+ H+C PN + N L RD ++
Sbjct: 179 ICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVRDIEAGEELTIC 238
Query: 287 YTNALWGTQPRREHLAITKYFNCSCERC 314
Y + L ++ RR+ L F C C RC
Sbjct: 239 YLDMLMTSEERRKQLRDQYCFECDCFRC 266
>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR1-like [Glycine max]
Length = 401
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 22/76 (28%), Positives = 35/76 (46%), Gaps = 5/76 (6%)
Query: 249 GLYSVICLMEHNCLPNA-----KHSNMMQSKLFVFRDTHISTMYTNALWGTQPRREHLAI 303
GLY VI ++ H+CLPN+ S ++ + + T + Y T ++ L
Sbjct: 128 GLYPVISIINHSCLPNSVLVFEGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNALKS 187
Query: 304 TKYFNCSCERCSDPTE 319
+ F C+C RCS E
Sbjct: 188 STXFTCTCPRCSKVGE 203
>gi|281204212|gb|EFA78408.1| hypothetical protein PPL_09059 [Polysphondylium pallidum PN500]
Length = 414
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)
Query: 8 EDEELMRCAVCRETALHKCSACKEVAYCGKQHQKEHWKL-HKPKCKK 53
+D +C C A +C C+ V YCGK HQ W+L H+ +CK+
Sbjct: 135 QDARQPQCTFCGCAAPTRCGNCRRVHYCGKDHQSRDWELGHREQCKQ 181
>gi|397611439|gb|EJK61326.1| hypothetical protein THAOC_18212 [Thalassiosira oceanica]
Length = 306
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 58/138 (42%), Gaps = 22/138 (15%)
Query: 1 MASIIEIEDEELMRCAVCRE-----TALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLP 55
M + + D++ CA C + L +C+AC+ V YCG QK H K HK C +
Sbjct: 1 MMKCLPVMDDDDKVCANCGKHGSDTVKLKRCTACRLVKYCGVDCQKAHRKQHKKACNQRV 60
Query: 56 YEIKSSPLLGRYLQA-TLDLHPGDRIARESPLIVGPKLALAEPICL---GCHKPLNPNLA 111
E+K L + L+ D P + P+ K +L P C+ GC N+A
Sbjct: 61 AELKDEQLYSQGLERPEGDFCPICTLPIPFPI---DKHSLFHPCCMTRIGC------NVA 111
Query: 112 DNAR----CPRCFWPACS 125
D CP C P C
Sbjct: 112 DTKTGIKPCPFCRAPLCD 129
>gi|356563968|ref|XP_003550229.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 735
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
CAVC A KCS CK V YC + Q+ HW HK CK
Sbjct: 71 CAVCATPAPKKCSRCKAVRYCSQACQQSHWMSGHKTVCKNF 111
>gi|449549246|gb|EMD40212.1| hypothetical protein CERSUDRAFT_112416 [Ceriporiopsis subvermispora
B]
Length = 659
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 21/44 (47%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCRE--TALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
EL RC+ C+ L KC C YC QK HW HK CK
Sbjct: 614 ELYRCSWCKNPSAVLRKCGGCGRTRYCDGVCQKSHWSEHKAACK 657
>gi|397623696|gb|EJK67106.1| hypothetical protein THAOC_11904 [Thalassiosira oceanica]
Length = 309
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK +CK+ E+K L + L+
Sbjct: 35 LKNCTACRLVKYCGVDCQRAHRKQHKKECKQRAAELKDEQLYSQGLE 81
>gi|397573939|gb|EJK48939.1| hypothetical protein THAOC_32222 [Thalassiosira oceanica]
Length = 392
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 24/41 (58%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPL 63
L KC+AC+ V YCG Q+ H K HK CK+ E+K L
Sbjct: 119 LRKCTACRLVKYCGVDCQRAHRKQHKKACKQRVAELKDEQL 159
>gi|392560004|gb|EIW53187.1| hypothetical protein TRAVEDRAFT_133378 [Trametes versicolor
FP-101664 SS1]
Length = 431
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 21/40 (52%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C VC ++ +CS C+ YC QK W HKP C+ L
Sbjct: 255 CEVCGKSGTSRCSQCQSATYCSPACQKIDWPHHKPTCRSL 294
>gi|167519084|ref|XP_001743882.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777844|gb|EDQ91460.1| predicted protein [Monosiga brevicollis MX1]
Length = 549
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 20/49 (40%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
Query: 5 IEIEDEELMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKC 51
+ IE + CA C T L +C+AC AYC + QK+HW+ HK +C
Sbjct: 1 MSIEALVHLVCANCGGTGERLKRCTACSMTAYCNAECQKQHWRHHKKEC 49
>gi|427794897|gb|JAA62900.1| Putative egl nine log 1, partial [Rhipicephalus pulchellus]
Length = 427
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 23/39 (58%), Gaps = 1/39 (2%)
Query: 15 CAVCRETA-LHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
C C T+ L +C+ C + YC K+HQ+ HW HK CK
Sbjct: 63 CQFCGSTSGLLRCAQCHGIYYCSKEHQRLHWPAHKHICK 101
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 58/263 (22%), Positives = 104/263 (39%), Gaps = 24/263 (9%)
Query: 65 GRYLQATLDLHPGDRI----ARESPLIVGPKLALAEPICLGCHKPLNPNLADNARCPRCF 120
GR L+A H GD + A L VG + E C L+ RC + F
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCE-----CCFARKEGLSKCGRCKQAF 72
Query: 121 WPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCLILQRRSP-KKYQ 179
+ C C D H EC+ + + E + A + + I R+P +K
Sbjct: 73 Y--CDVECQK-EDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQKIHPERTPSEKLL 129
Query: 180 ELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCL--HWICGVIE 237
+++ E+H+ K E + I S + L + + L + D +S L C
Sbjct: 130 AVREFESHLDKLD--NEKKDLIQSDIAALHQFYSKYL--EFPDHSSLVVLFAQVNCNGFT 185
Query: 238 VNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSKLFVFRDTH----ISTMYTNALW 292
+ ++ ++ + LM H+C PN + +++ ++ H + T Y + L+
Sbjct: 186 IEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTLAEVRAVQEIHPGDEVFTSYIDLLY 245
Query: 293 GTQPRREHLAITKYFNCSCERCS 315
T+ R + L + +F C C C+
Sbjct: 246 PTEDRNDRLRDSYFFTCECRECT 268
Score = 38.9 bits (89), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKL 54
C R+ L KC CK+ YC + QKE W LHK +C +
Sbjct: 54 CCFARKEGLSKCGRCKQAFYCDVECQKEDWPLHKLECSSM 93
>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
Length = 433
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 45/215 (20%), Positives = 83/215 (38%), Gaps = 15/215 (6%)
Query: 109 NLADNARCPRCFWPACSARCSGLSDAHTHAPECAILKLGCETLLAYNDYKYEAILPLRCL 168
L+ RC + F+ C+ C D H EC+ + + E + A + +
Sbjct: 61 GLSKCGRCKQAFY--CNVECQK-EDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK 117
Query: 169 ILQRRSP-KKYQELKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAK 227
I R+P +K +K+ E+H+ K E + I S + L + + L D
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLD--NEKKDLIQSDIAALHHFYSKHLEFPDNDSLIVL 175
Query: 228 CLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHSNMMQSKLFVFRDTH----- 282
C + ++ ++ + LM H+C PN + + L R
Sbjct: 176 FAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT--YKGTLAEVRAVQEIKPG 233
Query: 283 --ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ T Y + L+ T+ R + L + +F C C+ C+
Sbjct: 234 EEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECT 268
>gi|301123801|ref|XP_002909627.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100389|gb|EEY58441.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 429
Score = 39.3 bits (90), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 20 ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLL 64
E L C C+ VAYC ++ QK+ WK HK CK + + + + L
Sbjct: 249 EGKLMTCGGCRSVAYCSRECQKKAWKKHKTDCKAIRAQREQAEAL 293
>gi|397628595|gb|EJK68977.1| hypothetical protein THAOC_09810 [Thalassiosira oceanica]
Length = 481
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 26/47 (55%)
Query: 23 LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYEIKSSPLLGRYLQ 69
L C+AC+ V YCG Q+ H K HK CK+ E+K L + L+
Sbjct: 207 LKDCTACRLVKYCGVDCQRAHRKKHKKACKQRAAELKDEQLFSQGLE 253
>gi|260800903|ref|XP_002595336.1| hypothetical protein BRAFLDRAFT_87573 [Branchiostoma floridae]
gi|229280581|gb|EEN51348.1| hypothetical protein BRAFLDRAFT_87573 [Branchiostoma floridae]
Length = 1307
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 26/44 (59%), Gaps = 2/44 (4%)
Query: 12 LMRCAVCRETA--LHKCSACKEVAYCGKQHQKEHWKLHKPKCKK 53
L +C+ C + +CS+C + YC + QK+ WK HKP C++
Sbjct: 15 LHKCSTCSRVSRDAKRCSSCDKAWYCNQTCQKDDWKRHKPTCQE 58
>gi|356554161|ref|XP_003545417.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 19-like [Glycine
max]
Length = 729
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 15 CAVCRETALHKCSACKEVAYCGKQHQKEHW-KLHKPKCKKL 54
CAVC A KCS CK V YC + Q+ HW HK CK
Sbjct: 70 CAVCATPAPKKCSRCKAVRYCSQACQQSHWMSGHKTVCKDF 110
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 39.3 bits (90), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 37/165 (22%), Positives = 73/165 (44%), Gaps = 15/165 (9%)
Query: 163 LPLRCLILQR----RSP-KKYQELKDMEAHMSKRGPGTEVYEEID-SIVKYLRSNFLEKL 216
L R ++ QR R+P ++ LK+ EAH+ K E + D + + + S + L
Sbjct: 108 LVARIIMKQRATTERTPSERLLLLKEFEAHLDKMDSEKEEMNQTDIAALHHFYSRHISNL 167
Query: 217 PGD-VLDDTSAKCLHWICGVIEVNGVDIGRYTQGLYSVICLMEHNCLPNAKHS-NMMQSK 274
P + L + A+ C + ++ ++ + LM H+C PN + ++
Sbjct: 168 PDEQALTELFAQV---NCNGFTIEDEELSHLGSAVFPDVALMNHSCSPNVIVTYKGTVAE 224
Query: 275 LFVFRDTH----ISTMYTNALWGTQPRREHLAITKYFNCSCERCS 315
+ ++ + I Y + L+ T+ R+E L + +F C C C+
Sbjct: 225 VRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECT 269
>gi|397570850|gb|EJK47491.1| hypothetical protein THAOC_33780 [Thalassiosira oceanica]
Length = 338
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 5/69 (7%)
Query: 3 SIIEIEDEELMRCAVCRETA-----LHKCSACKEVAYCGKQHQKEHWKLHKPKCKKLPYE 57
S I ++ CA C + L C+AC+ V YCG Q+ H + HK CK+ E
Sbjct: 49 SCITVDGSPADVCANCGKQGSDAVKLKNCTACRLVKYCGVDCQRAHRRQHKKACKQRAAE 108
Query: 58 IKSSPLLGR 66
+K L G+
Sbjct: 109 LKDEQLYGQ 117
>gi|340052290|emb|CCC46562.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 667
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 20/142 (14%)
Query: 181 LKDMEAHMSKRGPGTEVYEEIDSIVKYLRSNFLEKLPGDVLDDTSAKCLHWI-CGVIEV- 238
L+D A MS+R + + +V+ L L G+ + + L I C +EV
Sbjct: 334 LQDHAAEMSQR------FAPLVRLVEGL-------LEGEETPVSVVRLLGVIRCNALEVC 380
Query: 239 --NGVDIGRYTQGLYSVICLMEHNCLPN-AKHSNMMQSKLFVFRDTHISTMYTNAL-WGT 294
G+ IG+ ++ H+C PN A N + + V ++ Y L W
Sbjct: 381 DSTGLGIGQALHA-STITSYFNHSCAPNCAMDGNAIVTICEVLPGEELTIAYIPQLYWPV 439
Query: 295 QPRREHLAITKYFNCSCERCSD 316
+ RRE LA +F+C C RC D
Sbjct: 440 ELRREELAGRYFFHCCCTRCGD 461
>gi|449549234|gb|EMD40200.1| hypothetical protein CERSUDRAFT_112406 [Ceriporiopsis subvermispora
B]
Length = 704
Score = 39.3 bits (90), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 22/44 (50%), Gaps = 2/44 (4%)
Query: 11 ELMRCAVCR--ETALHKCSACKEVAYCGKQHQKEHWKLHKPKCK 52
EL RC+ CR L KC C + YC QK HW HK CK
Sbjct: 659 ELYRCSWCRNPSAMLRKCGGCGKTRYCDAICQKSHWGDHKIACK 702
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,791,966,964
Number of Sequences: 23463169
Number of extensions: 365933993
Number of successful extensions: 789832
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1569
Number of HSP's successfully gapped in prelim test: 929
Number of HSP's that attempted gapping in prelim test: 785455
Number of HSP's gapped (non-prelim): 3600
length of query: 541
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 393
effective length of database: 8,886,646,355
effective search space: 3492452017515
effective search space used: 3492452017515
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 79 (35.0 bits)