BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy14480
(190 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
Length = 544
Score = 237 bits (604), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 101/187 (54%), Positives = 133/187 (71%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ IS + PHP+Y+C+ LRCLY++ H ++W+KL LE H E+ K + K+ENDR+ V
Sbjct: 134 KVSISTYGIPHPNYQCVTVLRCLYQRDHNMKLWNKLQALETHTEDRKGTDKWENDRKMVA 193
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ +FFKL F +EEI+K CGIIQ+N HE+PL EP Y+A+FD+ S +EHNC N KS
Sbjct: 194 QFIWSFFKLERLFNEEEIMKCCGIIQINGHEVPLLEPEYVAVFDKISMVEHNCRANCNKS 253
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT +G+++L A I G H+S+CYTDPLWGT RRHHL SK+F C CERC D TE T
Sbjct: 254 FTSNGEIILSAGVAIPRGSHISVCYTDPLWGTEARRHHLSDSKFFECSCERCSDVTEFGT 313
Query: 182 FYDGVKC 188
+ VKC
Sbjct: 314 MFSAVKC 320
>gi|193683573|ref|XP_001946512.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 499
Score = 218 bits (554), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 98/189 (51%), Positives = 131/189 (69%), Gaps = 2/189 (1%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI+I +FT PHPSY+ I +RCLY+K + P++W KL +LE+H + KYE DR +
Sbjct: 96 KIEIKDFTNPHPSYQSITVIRCLYQKHNNPEVWAKLTKLESHCASRRGGSKYETDRIWIA 155
Query: 62 QFLLNFFKLN-EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
L FFKL+ E+ +EIL++CGI+QVN HE+PLT+P Y+AI+D S +EH+C PN K
Sbjct: 156 DHLQRFFKLDPTEWPVDEILRVCGIVQVNGHEVPLTDPPYVAIYDAGSMLEHSCVPNCSK 215
Query: 121 SFTDSGQVLLR-AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
SFT G +L+R A + G HLSI YTD LWGT R HL +K+F+C+C RC DPTEL
Sbjct: 216 SFTRDGHLLIRTAAAAVESGGHLSISYTDVLWGTAQRLAHLADTKFFVCKCPRCSDPTEL 275
Query: 180 NTFYDGVKC 188
T++ GVKC
Sbjct: 276 GTYFSGVKC 284
>gi|189239848|ref|XP_973542.2| PREDICTED: similar to CG8503 CG8503-PA [Tribolium castaneum]
Length = 829
Score = 213 bits (541), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 96/190 (50%), Positives = 132/190 (69%), Gaps = 3/190 (1%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI + F PHP Y+CI LRCLY+K P+ W KL L+ H+EE+K++ YE+DR +
Sbjct: 134 KITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLEEHKSTTNYESDRVTIA 193
Query: 62 QFLLNFFKLNEEFTQEEILKICG---IIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL 118
+F+ FFKL+ F++E+I+K+ G I QVN HE+PLTEP ++AI+ AS +EH+C PN
Sbjct: 194 EFIRRFFKLSATFSEEDIMKVHGSTLIFQVNGHEVPLTEPFHVAIYSSASMLEHSCGPNC 253
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
KSFT G +++ A K I GDHLSICY+DPLWGT +RR+ L +K+F C CERC DP+E
Sbjct: 254 TKSFTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPSE 313
Query: 179 LNTFYDGVKC 188
T + +KC
Sbjct: 314 FGTNFSAIKC 323
>gi|270011901|gb|EFA08349.1| hypothetical protein TcasGA2_TC005992 [Tribolium castaneum]
Length = 803
Score = 213 bits (541), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 94/187 (50%), Positives = 130/187 (69%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI + F PHP Y+CI LRCLY+K P+ W KL L+ H+EE+K++ YE+DR +
Sbjct: 134 KITVKIFGVPHPIYKCITVLRCLYQKQFLPKNWKKLETLQNHLEEHKSTTNYESDRVTIA 193
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
+F+ FFKL+ F++E+I+K+ G VN HE+PLTEP ++AI+ AS +EH+C PN KS
Sbjct: 194 EFIRRFFKLSATFSEEDIMKVHGSTLVNGHEVPLTEPFHVAIYSSASMLEHSCGPNCTKS 253
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT G +++ A K I GDHLSICY+DPLWGT +RR+ L +K+F C CERC DP+E T
Sbjct: 254 FTKQGHIVISAAKSIQEGDHLSICYSDPLWGTPSRRYFLHETKFFWCHCERCEDPSEFGT 313
Query: 182 FYDGVKC 188
+ +KC
Sbjct: 314 NFSAIKC 320
>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 211 bits (536), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 130/187 (69%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI I +F PHP+Y+C++ +RCL P W+ L++LE+H EE + S ++ NDR V
Sbjct: 129 KISIQHFYVPHPTYQCLMPVRCLLLAESDPARWETLLKLESHEEERRGSEQWRNDREGVA 188
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
+ + FFK ++ ++EIL++ GIIQVN HE+P+TEPS +AI++ AS +EH+C PNL KS
Sbjct: 189 KLIPRFFKCENKWDEDEILRVVGIIQVNGHEVPMTEPSSVAIYNMASMLEHSCRPNLAKS 248
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT+ G+V++ A PI GD LSICYTD LW T NR HLQ +K F C+CERC D TE T
Sbjct: 249 FTNRGEVVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCSDRTEYET 308
Query: 182 FYDGVKC 188
++ V+C
Sbjct: 309 YFSAVRC 315
>gi|383858836|ref|XP_003704905.1| PREDICTED: SET and MYND domain-containing protein DDB_G0273589-like
[Megachile rotundata]
Length = 523
Score = 209 bits (532), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 93/190 (48%), Positives = 134/190 (70%), Gaps = 2/190 (1%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYE-NDRRNV 60
K+ I+ F PHPSY+CI +R L K + + ++KL+ LE+H ++ KN + + N+
Sbjct: 132 KVSITKFGYPHPSYQCINVIRALSLKDNNIESYNKLLTLESHSDKIKNKETFIFEESSNL 191
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V F+ FFK +E+ +E+I KI G++QVN HE+PLT+P Y+A+++ AS +EHNC N K
Sbjct: 192 VNFIKRFFK-SEDIPEEDIRKIIGVLQVNGHEVPLTDPPYVAVYELASLLEHNCKANCSK 250
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
SFTD+G +++ A PI GDH+SICYTDPLWGT+NRRHHL +KYF C C+RCRDPTE
Sbjct: 251 SFTDTGGLIIHAATPITKGDHISICYTDPLWGTMNRRHHLLETKYFECTCDRCRDPTEFG 310
Query: 181 TFYDGVKCPE 190
++ ++C E
Sbjct: 311 AMFNAIRCSE 320
>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
Length = 486
Score = 205 bits (522), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 91/186 (48%), Positives = 126/186 (67%)
Query: 3 IKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQ 62
I I +F PHP Y+C++ LRCL P W+ L QLE+H EE + S + NDR V +
Sbjct: 94 IAIQHFYVPHPMYQCLMPLRCLLLAESDPAKWELLQQLESHEEERRGSELWRNDREGVAK 153
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF 122
+ FFK + + ++EIL+I GIIQVN HE+PL+ PS +AI+++AS +EH+C PNL KSF
Sbjct: 154 LIPRFFKTGDRWDEDEILRIVGIIQVNGHEIPLSVPSSVAIYNQASMLEHSCRPNLSKSF 213
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
T+ G+++L A I G+ LSICYTD +W T NR HLQ +K F C+CERC DPTE T+
Sbjct: 214 TNRGEIVLWAPNTIKRGERLSICYTDAMWATGNRLEHLQQTKMFRCECERCSDPTEYETY 273
Query: 183 YDGVKC 188
+ ++C
Sbjct: 274 FSALRC 279
>gi|380018360|ref|XP_003693098.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 467
Score = 203 bits (516), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 127/187 (67%), Gaps = 5/187 (2%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI I+ F PHPSY+CI +R L K P+ + KL+ LE+H E NS + N+
Sbjct: 131 KISITEFGYPHPSYQCINIIRALSLKDTNPESYKKLISLESHCNEINNS----KEPLNIA 186
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
F+ FFK ++ ++EEI+ I GI+QVN HE+PLT+ Y+A+++ AS IEHNC N KS
Sbjct: 187 HFIKRFFKA-DDISEEEIVTIIGILQVNGHEVPLTDSPYVAVYEMASLIEHNCRANCSKS 245
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FTD G +++RA PI GDH+SICYTDPLWGT NRRHHL +K+F C C RC+DPTE T
Sbjct: 246 FTDMGGLIIRAALPITKGDHISICYTDPLWGTANRRHHLLKTKFFECICNRCQDPTEFQT 305
Query: 182 FYDGVKC 188
++ +KC
Sbjct: 306 MFNALKC 312
>gi|328784417|ref|XP_624539.3| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 440
Score = 202 bits (513), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
KI I+ F PHPSY+CI +R L K P+ + KL+ LE+H E NS + N+
Sbjct: 131 KISITEFGYPHPSYQCINVIRALSLKDTNPESYKKLISLESHCNEMNNS----KEPLNIA 186
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
F+ FFK ++ ++EEI I GI+QVN HE+PLT+ Y+A+++ AS IEHNC N KS
Sbjct: 187 HFIKRFFKA-DDISEEEIATIIGILQVNGHEVPLTDSPYVAVYEMASLIEHNCRANCSKS 245
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FTD G +++RA PI GDH+SICYTDPLWGT NRRHHL +K+F C C RC+DPTE T
Sbjct: 246 FTDMGGLIIRAALPITKGDHISICYTDPLWGTANRRHHLFKTKFFECICNRCQDPTEFQT 305
Query: 182 FYDGVKC 188
++ +KC
Sbjct: 306 MFNALKC 312
>gi|350420138|ref|XP_003492412.1| PREDICTED: hypothetical protein LOC100743563 [Bombus impatiens]
Length = 442
Score = 200 bits (508), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 129/187 (68%), Gaps = 6/187 (3%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ I+ F PHPSY+CI +R L K + + KL+ LE+H + + ++ N V
Sbjct: 132 KVSITEFGHPHPSYQCITVIRALASKDVNLESYKKLLSLESHCDRINS-----HELSNTV 186
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
+F+ FFK ++ ++EE+ KI GI+QVN HE+PLT+P Y+A+++ S +EHNC N KS
Sbjct: 187 RFIKRFFK-TDDISEEEMTKIVGILQVNGHEVPLTDPPYVAVYELTSLLEHNCKANCSKS 245
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FTD+G +++ A PIA GDH+SICYTDPLWGT NRRHHL +K+F C C+RC+DPTE T
Sbjct: 246 FTDTGGLIIHAAVPIAKGDHISICYTDPLWGTANRRHHLLKTKFFECTCDRCKDPTEFGT 305
Query: 182 FYDGVKC 188
++ ++C
Sbjct: 306 MFNALRC 312
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 199 bits (507), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 125/187 (66%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+++ +F PHP Y+C+I LRCL P W L++LE+H E+ + S ++ NDR V
Sbjct: 153 KMQLRHFYNPHPVYQCLIPLRCLLLAERDPAKWQALIKLESHEEQRRGSEQWRNDREGVA 212
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
+ + FFK +++++EILKI GI+QVN HE+PLTEP +AI++ AS +EH+C PNL KS
Sbjct: 213 KLIPRFFKCENKWSEDEILKIVGILQVNGHEVPLTEPPSVAIYNNASMLEHSCRPNLSKS 272
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT +++ A PI G+ LSI Y+D LWGT NR+ HLQ +K F C C RC DPTE T
Sbjct: 273 FTSKKEIVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRCLDPTEFGT 332
Query: 182 FYDGVKC 188
+ +KC
Sbjct: 333 YLSALKC 339
>gi|312384987|gb|EFR29584.1| hypothetical protein AND_01294 [Anopheles darlingi]
Length = 366
Score = 187 bits (475), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 125/187 (66%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ I +F PHP Y+C++ LRCL P W LM+L++H EE++ S ++ DRR V
Sbjct: 46 KVSIQHFYVPHPLYQCLLPLRCLLLAESDPARWKALMRLQSHDEEWRGSEQWRTDRRGVA 105
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
Q + FFK + + +++IL+ GIIQVN HE+PLTEP+ +AI+ AS +EH+C PNL KS
Sbjct: 106 QLIPRFFKCDNRWQEDDILQAIGIIQVNGHEVPLTEPASVAIYSVASMLEHSCRPNLAKS 165
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F+D G+++L A IA G LSI YTD LW T +RR HL +K F C C+RC DPTE T
Sbjct: 166 FSDQGEIILWAPVAIAEGTRLSISYTDVLWTTAHRRAHLLQTKLFDCACDRCLDPTEFGT 225
Query: 182 FYDGVKC 188
F+ ++C
Sbjct: 226 FFSALRC 232
>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
Length = 514
Score = 176 bits (445), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 120/189 (63%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL P+ K +LE+ + S +++ D ++
Sbjct: 131 KVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLESTRRGSNQWKADLVSIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++F++EEI+K G +Q+N HE+P T+PS++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFK-TQKFSEEEIMKAVGALQINGHEVPTTDPSHVAVFYTASFTENSCLPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C CERC D TEL+T
Sbjct: 250 FNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFRCACERCVDVTELDT 309
Query: 182 FYDGVKCPE 190
FY +KC +
Sbjct: 310 FYSAIKCED 318
>gi|156550498|ref|XP_001601757.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 519
Score = 175 bits (444), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 118/187 (63%), Gaps = 9/187 (4%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ I+ F HP+Y+CI +R L + P+ +++ +L + E + R +
Sbjct: 136 KVTITGFGIAHPTYKCIGVVRALALRETNPEAYERFTKLASGELE---------EPREIA 186
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
+F+ FF +F+++EI + GI+Q+N HE+P++EP IA++D +S++EH+C N KS
Sbjct: 187 RFIKRFFNKLHDFSEDEIARAAGIMQINGHEVPISEPVLIAVYDESSYVEHSCRANCSKS 246
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT SG +++RA I G+H+ ICYTDPLWGT NRRHHL +K+F C C RC DP E +
Sbjct: 247 FTSSGGIVIRAAMEIEKGEHIMICYTDPLWGTANRRHHLLRTKFFECTCPRCSDPAEFGS 306
Query: 182 FYDGVKC 188
++ ++C
Sbjct: 307 MFNAIEC 313
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 175 bits (443), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 79/189 (41%), Positives = 121/189 (64%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL P +K QLE+ + + S +++ D ++
Sbjct: 131 KVNVQEFNGPHPLYTCVSTVRCLLLSETNPANAEKFQQLESLEQTRRGSNQWKADLASIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FF+ ++F++EEI++ G +Q+N HE+P ++P ++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFR-TQKFSEEEIMRAVGALQINGHEVPTSDPPHVAVFYTASFTENSCVPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
FT +G +L A K I G +LSICY+D +WGT +R+ HL +K F C CERC D TEL T
Sbjct: 250 FTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERCVDVTELGT 309
Query: 182 FYDGVKCPE 190
+Y +KC +
Sbjct: 310 YYSALKCED 318
>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
Length = 515
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 79/189 (41%), Positives = 121/189 (64%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ ++ F PHP Y C+ +RCL + P +K QLE+ + + S +++ D ++
Sbjct: 131 KVNVTEFNGPHPLYTCVSTVRCLLLNTTNPANAEKFNQLESLEQTRRGSNQWKADLASIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FF+ ++F++EEI++ GI+Q+N HE+P+++P ++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFR-TQKFSEEEIMRAVGILQINGHEVPISDPPHVAVFYTASFTENSCVPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F G +L A K I HLSICY+D +WGT +R+ HL +K F C CERC D TEL T
Sbjct: 250 FNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCACERCLDVTELGT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 172 bits (435), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 78/189 (41%), Positives = 122/189 (64%), Gaps = 9/189 (4%)
Query: 9 TTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF 68
+ P +Y+ ++ LRCL + P+ WD+LMQLEAHV+E + + D+ V+F+
Sbjct: 136 SLPVQAYQSVMVLRCLALRDQNPERWDELMQLEAHVQERRQKGMEDVDQATAVRFIRETL 195
Query: 69 KLNEEFTQEEILKICGIIQVNAHEMP----LTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
L + +E IL++CGI+ VN+ E P ++ +A++ AS +EH+C N K+FT+
Sbjct: 196 GL--QIPEELILQLCGILMVNSFEQPPMKGNSQHGLVAVYSTASLLEHDCVANAIKTFTN 253
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
G +++RA PI G+ +++CYT+PLWGT+NR+ HL +K+F C CERC+DPTEL+TF
Sbjct: 254 KGDIVIRAAVPIPKGEKIALCYTEPLWGTMNRQRHLSQTKFFQCVCERCKDPTELDTFTS 313
Query: 185 GV---KCPE 190
G KCP+
Sbjct: 314 GFYCQKCPQ 322
>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
Length = 514
Score = 171 bits (434), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 79/189 (41%), Positives = 118/189 (62%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
++ + F PHP Y C+ +RCL P+ K LE+ + S +++ D ++
Sbjct: 131 RVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQDLESLESTRRGSNQWKADLVSIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++FT+E+I+K G +Q+N HE+P T+PS++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFK-TQKFTEEQIMKAVGALQINGHEVPTTDPSHVAVFYTASFTENSCLPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C CERC D TEL+T
Sbjct: 250 FNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTELDT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 171 bits (433), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 118/189 (62%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL P+ K +LE+ + S +++ D ++
Sbjct: 131 KVNVQEFGGPHPLYTCLSTVRCLLIGETSPEKASKFQELESLESTRRGSNQWKADLVSIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++FT+EEI+K G +Q+N HE+P T+PS++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFK-TQKFTEEEIMKAVGALQINGHEVPTTDPSHVAVFYTASFTENSCLPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A I HLSICY+D +WGT +R+ HL +K F C C+RC D TEL+T
Sbjct: 250 FNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRCVDVTELDT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
Length = 513
Score = 171 bits (432), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 117/189 (61%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL + K LE+ + S +++ D ++
Sbjct: 131 KVNVQEFGGPHPLYTCLSTVRCLLIGETSTEKASKFQDLESLESTRRGSNQWKADLVSIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++FT+EEI+K G +Q+N HE+P T+PS++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFK-TQKFTEEEIMKAVGALQINGHEVPTTDPSHVAVFYTASFTENSCLPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C CERC D TEL+T
Sbjct: 250 FNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTELDT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
Length = 517
Score = 169 bits (429), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 118/189 (62%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ ++ F PHP Y C+ +RCL P+ K +LE+ + S +++ D ++
Sbjct: 133 KVNVNEFNGPHPLYTCVSTVRCLLIGETSPEKAAKFQELESLESTRRGSNQWKADLASIG 192
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++FT+E+I++ G +Q+N HE+P ++P ++ +F ASF E++C PNL KS
Sbjct: 193 QFIPKFFK-TQKFTEEQIMRAVGALQINGHEIPTSDPPHVGVFYTASFTENSCLPNLAKS 251
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C C RC D TEL+T
Sbjct: 252 FNKNGHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCADVTELDT 311
Query: 182 FYDGVKCPE 190
FY +KC +
Sbjct: 312 FYSALKCED 320
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 169 bits (427), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 116/189 (61%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F HP Y C+ +RCL P+ K +LE+ + S +++ D ++
Sbjct: 131 KVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESLESTRRGSNQWKADLASIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK E+F++EE+++ G +Q+N HE+P ++P ++A+F ASF EH+C PNL KS
Sbjct: 191 QFIPKFFK-TEKFSEEEVMRAVGALQINGHEVPTSDPPHVAVFYTASFTEHSCMPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C C+RC D TEL T
Sbjct: 250 FNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDVTELGT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 168 bits (426), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 116/189 (61%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F HP Y C+ +RCL P+ K +LE+ + S +++ D ++
Sbjct: 131 KVNVHEFNGSHPLYTCVSTVRCLLIGETSPEKASKFQELESLESTRRGSNQWKADLASIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK E+F++EE+++ G +Q+N HE+P ++P ++A+F ASF EH+C PNL KS
Sbjct: 191 QFIPKFFK-TEKFSEEEVMRAVGALQINGHEVPTSDPPHVAVFYTASFTEHSCMPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F +G +L A + I HLSICY+D +WGT +R+ HL +K F C C+RC D TEL T
Sbjct: 250 FNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCTDVTELGT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|195383960|ref|XP_002050693.1| GJ20068 [Drosophila virilis]
gi|194145490|gb|EDW61886.1| GJ20068 [Drosophila virilis]
Length = 515
Score = 168 bits (426), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 78/189 (41%), Positives = 115/189 (60%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL P K QLE + + S +++ D ++
Sbjct: 131 KVNVQEFNGPHPLYTCVSTVRCLLIGETNPANAAKFKQLETLEQTRRGSNQWKADLASIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FF+ ++F++EEI++ G +Q+N HE+P ++P ++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFR-TQKFSEEEIMRAVGALQINGHEVPTSDPPHVAVFYTASFTENSCVPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F G +L A K I HLSICY+D +WGT +R+ HL +K F C CERC D TEL T
Sbjct: 250 FNKHGHCMLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCSCERCLDVTELGT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSALKCED 318
>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
Length = 515
Score = 165 bits (418), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 76/189 (40%), Positives = 114/189 (60%), Gaps = 1/189 (0%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K+ + F PHP Y C+ +RCL K LE+ + S +++ D ++
Sbjct: 131 KVNVQEFQGPHPLYTCVSTVRCLLLGETNAANAKKFSDLESLESTRRGSSQWKADLTSIG 190
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKS 121
QF+ FFK ++F++EEI++ G +Q+N HE+P ++P ++A+F ASF E++C PNL KS
Sbjct: 191 QFIPKFFK-TQKFSEEEIMRTVGALQINGHEVPTSDPPHVAVFYVASFTENSCLPNLAKS 249
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
F G +L A + I HLSICY+D +WGT +R+ HL +K F C C RC+DPTEL T
Sbjct: 250 FNKHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRCQDPTELGT 309
Query: 182 FYDGVKCPE 190
Y +KC +
Sbjct: 310 NYSAIKCED 318
>gi|195583324|ref|XP_002081472.1| GD11032 [Drosophila simulans]
gi|194193481|gb|EDX07057.1| GD11032 [Drosophila simulans]
Length = 1033
Score = 135 bits (339), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 59/118 (50%), Positives = 82/118 (69%)
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
+FT+EEI+K G +Q+N HE+P T+P ++A+F ASF E++C PNL KSF +G +L A
Sbjct: 720 KFTEEEIMKAVGALQINGHEVPTTDPPHVAVFYTASFTENSCLPNLAKSFNKNGHCILWA 779
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
+ I HLSICY+D +WGT +R+ HL +K F C CERC D TEL+T Y +KC +
Sbjct: 780 PREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERCVDVTELDTNYSAIKCED 837
>gi|260800041|ref|XP_002594945.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
gi|229280183|gb|EEN50956.1| hypothetical protein BRAFLDRAFT_174600 [Branchiostoma floridae]
Length = 458
Score = 132 bits (332), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 106/195 (54%), Gaps = 10/195 (5%)
Query: 3 IKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYEND-RRNVV 61
I I++F P+ SY+ I LRCL KS P W + +E+H E+ ++ + RNVV
Sbjct: 92 IDITDFVGPNTSYQFITPLRCLLLKSRDPARWKLISGMESHTEDQTREYRWGMEMGRNVV 151
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS--------YIAIFDRASFIEHN 113
+ + ++E F++ EI+K+CGI+ VNA E+ T P+ A++ + H+
Sbjct: 152 RVIRQTLGMDE-FSEHEIMKVCGILTVNAFELFTTGPNKDRDSTPRAWAVYPTTYLMNHD 210
Query: 114 CYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
C N S ++ +R P+ G+ L+ Y++ LWGT RRH L KYF C C+RC
Sbjct: 211 CLANTLTSIDSKNKMHVRTRMPVKKGEALTAEYSECLWGTEIRRHQLHRYKYFWCSCQRC 270
Query: 174 RDPTELNTFYDGVKC 188
RDPTEL +F +C
Sbjct: 271 RDPTELGSFISSHRC 285
>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
Length = 423
Score = 132 bits (331), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 104/178 (58%), Gaps = 3/178 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LR L +SH P + +L +LE+HV E SP YE R N+V F+ + L ++
Sbjct: 127 AYCVIVPLRVLLLQSHSPDTYAQLQRLESHVAERLKSPLYEVVRSNLVPFIRSVLGL-QQ 185
Query: 74 FTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
++ E IL++C ++ N E+ L E A++ + + H+C PN F D+ +++L
Sbjct: 186 YSAETILEVCAMLDTNCFEIRLPERRTKVRALYPLGAMLSHDCRPNTKHYFDDALRMVLV 245
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
A I G +S YT PL GT++RR L+ SK+F CQCERC DPTEL T G +CP
Sbjct: 246 ATVDIERGATISTSYTQPLLGTLHRRLALKQSKHFDCQCERCTDPTELGTNLSGFRCP 303
>gi|198463716|ref|XP_001352921.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
gi|198151380|gb|EAL30422.2| GA14810 [Drosophila pseudoobscura pseudoobscura]
Length = 532
Score = 129 bits (325), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 68/177 (38%), Positives = 100/177 (56%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ KS P+ + +L LE H+EE +P Y+ R N++ F+ + +
Sbjct: 157 AYCVIMLLRCMQLKSSDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRD- 215
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+++ +IL+I I+ N E+ P A+F A+ I H+C PN+ F D ++
Sbjct: 216 WSEVDILRIAAILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFL 275
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A +PIA G+ LSI YT PL TI RR HL+ +K F C C RC+DPTEL TF C
Sbjct: 276 AKRPIAKGEILSISYTQPLRSTIQRRLHLRQAKCFDCACARCQDPTELGTFAGAQTC 332
>gi|170034227|ref|XP_001844976.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875488|gb|EDS38871.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 548
Score = 128 bits (322), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 71/177 (40%), Positives = 99/177 (55%), Gaps = 4/177 (2%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
Y+ ++ L+CL + H ++D++MQLE+H E K S Y + VQ+LL F L
Sbjct: 162 YDAMLVLKCLALQIHNRPLFDQMMQLESHYEARKGSQYYRDADDRTVQYLLKNF-LAPLK 220
Query: 75 TQEEILK--ICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
QEEI ICGI++VNA +PLT I ++ +EH C PN + + T ++
Sbjct: 221 KQEEIQGKTICGILEVNAMVIPLTNGREICGLYPIGCMLEHCCMPNCFYTSTFLNLLVYD 280
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G+HLS YT LWGT RR HL+T+KYF C C RC DPTEL T+ ++C
Sbjct: 281 LTNSFLSGEHLSTTYTHALWGTQQRRDHLKTNKYFSCSCARCADPTELGTYLSALRC 337
>gi|195171538|ref|XP_002026562.1| GL21959 [Drosophila persimilis]
gi|194111478|gb|EDW33521.1| GL21959 [Drosophila persimilis]
Length = 532
Score = 128 bits (322), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 101/177 (57%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + +L LE H+EE +P Y+ R N++ F+ + +
Sbjct: 157 AYCVIMLLRCMQLKASDPEGFARLSALEDHLEERLATPLYQVLRANLITFIKTILGMRD- 215
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+++ +IL+I I+ N E+ P A+F A+ I H+C PN+ F D ++
Sbjct: 216 WSEVDILRIAAILDTNTFELRQPRERRKVRALFPGAAMISHDCAPNMRHRFDDDMNIIFL 275
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A +PIA G+ L+I YT PL TI RR HL+ +K F C C+RC+DPTEL TF C
Sbjct: 276 AKRPIAKGEILTISYTQPLRSTIQRRLHLRQAKCFDCACDRCQDPTELGTFAGAQTC 332
>gi|193683766|ref|XP_001950081.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 533
Score = 125 bits (313), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 100/176 (56%), Gaps = 2/176 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
Y+ + L+CL + W++LM+L++H+E + + +V +L N + EEF
Sbjct: 161 YDALTPLKCLLLQFTDKNKWNRLMELKSHMEYRGPESEVYEEINSVYNYLKNNYLSGEEF 220
Query: 75 --TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
+ + I ICGI+ VNA ++ + AI+ S +EHNC PN SF G++ + A
Sbjct: 221 ETSSDLIHTICGILDVNALDVQVAGLELTAIYPTVSKLEHNCLPNTGISFDKCGRIYVYA 280
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ I G+H++ YT+ LWGT RR HL ++KYF C+C+RC D TEL T + + C
Sbjct: 281 SRKITKGEHITTMYTNALWGTRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVC 336
>gi|195440652|ref|XP_002068154.1| GK12548 [Drosophila willistoni]
gi|194164239|gb|EDW79140.1| GK12548 [Drosophila willistoni]
Length = 534
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 66/184 (35%), Positives = 102/184 (55%), Gaps = 3/184 (1%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN 66
N +Y I+ LRC+ K+ QP+ + +L LE H++E +P Y+ R N++ F+
Sbjct: 152 NQERKESAYCVIMLLRCMLLKTSQPEDFVRLFSLEDHLKERLETPLYQVLRANLITFIKT 211
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
L E +++ EIL+I I+ N E+ P A++ A+ I H+C PN+ F D
Sbjct: 212 VLGLRE-WSEMEILRIAAILDTNTFEVRQPKERRKVRALYPGAAMISHDCVPNMRHRFDD 270
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++ A + I+ G+ L+I YT PL TI RR HL+ +K F C C+RC DP+EL TF
Sbjct: 271 NMNIIFLAKRAISKGEILNISYTQPLRSTIQRRLHLRQAKCFDCACQRCSDPSELGTFAA 330
Query: 185 GVKC 188
C
Sbjct: 331 AQMC 334
>gi|194762123|ref|XP_001963207.1| GF15829 [Drosophila ananassae]
gi|190616904|gb|EDV32428.1| GF15829 [Drosophila ananassae]
Length = 507
Score = 121 bits (304), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 103/183 (56%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQ-LEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H EE K ++ + DR N+ Q+L KL
Sbjct: 114 PQLDCILPLRVLLAKETDPERWDSQVEPMEHHEEERKEDADVWHADRVNIAQYLRGPCKL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
+ F++E I+++ GI++VNA E T + ++ + HNC PN +S ++ +
Sbjct: 174 SSRFSEELIMQVVGILEVNAFEARTTRGFPLRCLYPYTGILAHNCVPNTARSIYPSEGYK 233
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ LRAM + G L YT L GT R+ HL+ KYF C CERC+DPTEL T + +K
Sbjct: 234 IRLRAMVDLKDGQPLHHSYTYTLDGTAQRQRHLKQGKYFSCTCERCQDPTELGTHFSSLK 293
Query: 188 CPE 190
C +
Sbjct: 294 CGQ 296
>gi|194751716|ref|XP_001958171.1| GF10789 [Drosophila ananassae]
gi|190625453|gb|EDV40977.1| GF10789 [Drosophila ananassae]
Length = 438
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 100/177 (56%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + KL LE H++E ++P Y+ R N++ F+ L ++
Sbjct: 159 AYCVIMLLRCMQLKTKDPESFAKLYTLEDHLKERLDTPLYQVLRANLITFIKTVLGL-KD 217
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+++ +IL+I I+ N E+ P A+F A+ I H+C PN+ F D ++
Sbjct: 218 WSELDILRIAAILDTNTFEVRQPRERIKIRALFPGAAMISHDCVPNMRHRFDDDMNIIFL 277
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + I+ G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 278 AKRKISKGEILSISYTQPLRSTIQRRLHLRQAKCFDCSCARCQDPEELGSFAGAQNC 334
>gi|195020683|ref|XP_001985247.1| GH16955 [Drosophila grimshawi]
gi|193898729|gb|EDV97595.1| GH16955 [Drosophila grimshawi]
Length = 529
Score = 120 bits (301), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 95/177 (53%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K P+ + +L LE H++E +P Y+ R N++ F+ +
Sbjct: 156 AYCVIMLLRCMQLKRSNPEAFARLAALEDHLKERIETPLYQVLRANLITFIKTVLG-QRD 214
Query: 74 FTQEEILKICGIIQVNAHEMPLT--EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + EIL+I I+ NA E+ AIF + I H+C PNL F D+ +++
Sbjct: 215 WPELEILRIAAILDTNAFEVRQNGERRKVRAIFPGGAMIAHDCVPNLRHRFDDNMRIIFL 274
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A +PI G LSI YT PL T+ RR HL+ K F C CERC DPTEL TF C
Sbjct: 275 AKRPIPKGAILSISYTQPLRSTVQRRVHLKQVKCFDCACERCSDPTELGTFAGAHLC 331
>gi|195479030|ref|XP_002086547.1| GE22790 [Drosophila yakuba]
gi|194186337|gb|EDW99948.1| GE22790 [Drosophila yakuba]
Length = 395
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 98/177 (55%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + KL LE H++E +P Y+ R N++ F+ + ++
Sbjct: 23 AYCVIMLLRCMQLKAKDPEAFLKLYTLEDHLKERLETPLYQVLRANLITFIKTVLGM-KD 81
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 82 WLEMDILRIAAILDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFL 141
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 142 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 198
>gi|157135679|ref|XP_001663543.1| hypothetical protein AaeL_AAEL003295 [Aedes aegypti]
gi|108881205|gb|EAT45430.1| AAEL003295-PA [Aedes aegypti]
Length = 565
Score = 119 bits (298), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 97/173 (56%), Gaps = 16/173 (9%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF----KLNEEFTQEEILKIC---- 83
Q++D+++QLE+H E K +P Y+ VQ+LL F K E+ + +L +C
Sbjct: 179 QLFDQMLQLESHYEARKGTPFYKEADERTVQYLLKNFLNPLKKMEKQQGKTVLPVCDVKT 238
Query: 84 -----GIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQVLLRAMKP 135
GI++VNA +PL+ I ++ A +EH C PN + +F T ++ +A +
Sbjct: 239 LHKISGILEVNAMVIPLSNGREICGLYPIACMLEHCCMPNCFYTFNCTKGMKLTFKAGRN 298
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
I G+HL+ YT LWGT RR HL+ +KYF C C RC DPTEL T+ +KC
Sbjct: 299 IKKGEHLTTTYTHSLWGTQQRRDHLKANKYFSCSCARCADPTELGTYLSALKC 351
>gi|242397545|gb|ACS92862.1| MIP11718p [Drosophila melanogaster]
Length = 514
Score = 119 bits (298), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 97/177 (54%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC++ K P + KL LE H++E +P Y+ R N++ F+ + ++
Sbjct: 158 AYCVIMLLRCMHLKDKDPDAFLKLYNLEDHLKERLETPLYQVLRANLITFIKTVLGM-KD 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D +L
Sbjct: 217 WPEMDILRIAAILDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNILFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|195117408|ref|XP_002003239.1| GI23580 [Drosophila mojavensis]
gi|193913814|gb|EDW12681.1| GI23580 [Drosophila mojavensis]
Length = 507
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 104/183 (56%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAH-VEEYKNSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K QP+ W +++ +E H +E +N+ + DR N+ Q+L KL
Sbjct: 114 PQLDCIMPLRVLLAKEAQPERWLNEVAPMEHHELERRENADVWHADRVNIAQYLRGPCKL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
+ F++E I+++ G+++VNA E ++ + +F + HNC PN ++ ++ +
Sbjct: 174 ADRFSEELIMQVVGVLEVNAFEARTSQGYPLRCLFPYTGILAHNCVPNTARTIHPSEGYK 233
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ LRAM + G L YT L GT R+ HL+ KYF+C CERC D +EL T + +K
Sbjct: 234 IRLRAMVALEEGQQLQHSYTYTLDGTAQRQAHLREGKYFVCSCERCLDASELQTHFSSLK 293
Query: 188 CPE 190
C +
Sbjct: 294 CGQ 296
>gi|195472006|ref|XP_002088293.1| GE18493 [Drosophila yakuba]
gi|194174394|gb|EDW88005.1| GE18493 [Drosophila yakuba]
Length = 507
Score = 119 bits (297), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/182 (37%), Positives = 101/182 (55%), Gaps = 7/182 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H EE K ++ + DR N+ Q+L KL
Sbjct: 114 PQLDCIMPLRVLLAKEANPERWDNEVAPMEHHEEERKKDADVWHADRVNIAQYLRGPCKL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSG 126
F++E I+++ G+++VNA E P P +F + HNC PN +S ++
Sbjct: 174 ASRFSEELIMQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTSRSIYPSEGY 232
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ LRAM + G L YT L GT R+ HL+ K+F CQC+RC DPTEL T + +
Sbjct: 233 KIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSL 292
Query: 187 KC 188
KC
Sbjct: 293 KC 294
>gi|194862038|ref|XP_001969906.1| GG23681 [Drosophila erecta]
gi|190661773|gb|EDV58965.1| GG23681 [Drosophila erecta]
Length = 507
Score = 119 bits (297), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/182 (37%), Positives = 101/182 (55%), Gaps = 7/182 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H EE K ++ + DR N+ Q+L KL
Sbjct: 114 PQLDCIMPLRVLLAKEANPERWDSEVAPMEHHEEERKKDADVWHADRVNIAQYLRGPCKL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSG 126
F++E I+++ G+++VNA E P P +F + HNC PN +S ++
Sbjct: 174 ASRFSEELIMQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTSRSIYPSEGY 232
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ LRAM + G L YT L GT R+ HL+ K+F CQC+RC DPTEL T + +
Sbjct: 233 KIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCDRCLDPTELGTHFSSL 292
Query: 187 KC 188
KC
Sbjct: 293 KC 294
>gi|195377589|ref|XP_002047571.1| GJ13517 [Drosophila virilis]
gi|194154729|gb|EDW69913.1| GJ13517 [Drosophila virilis]
Length = 528
Score = 119 bits (297), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 94/177 (53%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K P + +L +LE H++E +P Y+ R N++ F+ LN
Sbjct: 156 AYCVIMLLRCVQLKLESPAAFARLYELEDHLKERLETPLYQVLRANLITFIKTVLGLNN- 214
Query: 74 FTQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
++ EIL+I I+ NA E+ ++ A+ H+C PN+ F D ++
Sbjct: 215 CSELEILRIAAILDTNAFELRQLGGCVKVRGLYPGAAMFAHDCVPNMRHRFDDDMNIVFL 274
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A +PIA G+ LSI YT PL TI RR HL+ K F C C RC DPTEL TF C
Sbjct: 275 AKRPIAKGEVLSISYTQPLRSTIQRRVHLKQVKCFDCACARCEDPTELGTFAGAHVC 331
>gi|195339975|ref|XP_002036592.1| GM18755 [Drosophila sechellia]
gi|194130472|gb|EDW52515.1| GM18755 [Drosophila sechellia]
Length = 507
Score = 119 bits (297), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 68/182 (37%), Positives = 101/182 (55%), Gaps = 7/182 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEE-YKNSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H EE K++ + DR N+ Q+L +L
Sbjct: 114 PQLDCIMPLRVLLAKEANPERWDNEVAPMEHHKEERQKDADVWHADRVNIAQYLRGPCQL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSG 126
F++E I+++ G+++VNA E P P +F + HNC PN +S ++
Sbjct: 174 ANRFSEELIMQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTARSIYPSEGY 232
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ LRAM + G L YT L GT R+ HL+ K+F CQCERC DPTEL T + +
Sbjct: 233 KIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSL 292
Query: 187 KC 188
KC
Sbjct: 293 KC 294
>gi|357620843|gb|EHJ72884.1| hypothetical protein KGM_13875 [Danaus plexippus]
Length = 516
Score = 118 bits (296), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 102/176 (57%), Gaps = 6/176 (3%)
Query: 17 CIIA-LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFT 75
C+IA LR L K P++++ +M LE+HV++ N+P Y+ R N++ F+ +N F
Sbjct: 136 CVIAPLRVLLMKDSNPRMFESIMNLESHVDQRINTPIYQVLRPNLIMFIRQVLGMN--FD 193
Query: 76 QEEILKICGIIQVNAHEMPLTEPS--YIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRA 132
+ IL+I I N+ ++ + + AI+ AS + H+C PN + + L L A
Sbjct: 194 DKIILEISSIFDNNSFDVRSADKTKRLRAIYLLASMMNHSCRPNTRHIYLGEDKTLALIA 253
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
IA G+ ++ YT PLWGT++RR L+T+KYF C+CERC DPTE T+ + C
Sbjct: 254 TVHIAKGEEITATYTQPLWGTLDRRKFLKTNKYFDCKCERCADPTEFGTYLGNIYC 309
>gi|91087991|ref|XP_973574.1| PREDICTED: similar to CG11160 CG11160-PA [Tribolium castaneum]
Length = 528
Score = 118 bits (296), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 104/186 (55%), Gaps = 3/186 (1%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPK-YENDRRNVVQFLL 65
NF P++ I+ LRCL K + + LM +E H E +N P+ ++ ++R VV +
Sbjct: 137 NFEEMRPNFNAIVPLRCLLLKETDARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIR 196
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
+ L E++QEEI +CGI++VN+ E+ + ++ A + H+C PN +S
Sbjct: 197 IDWGL-REYSQEEIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEES 255
Query: 126 G-QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
++ +RA I PG+ +++ Y L T+ RR HL +K+F CQC RC DPTEL TF
Sbjct: 256 TFRLTVRASTRIEPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTS 315
Query: 185 GVKCPE 190
+ CP+
Sbjct: 316 ALICPK 321
>gi|270012059|gb|EFA08507.1| hypothetical protein TcasGA2_TC006159 [Tribolium castaneum]
Length = 529
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 104/186 (55%), Gaps = 3/186 (1%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPK-YENDRRNVVQFLL 65
NF P++ I+ LRCL K + + LM +E H E +N P+ ++ ++R VV +
Sbjct: 138 NFEEMRPNFNAIVPLRCLLLKETDARSYQHLMTMETHNEIRRNIPELWQTNQRTVVDKIR 197
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
+ L E++QEEI +CGI++VN+ E+ + ++ A + H+C PN +S
Sbjct: 198 IDWGL-REYSQEEIHSVCGILEVNSFEIGQQGVNIRGLYPSAFLMSHDCVPNTNHIDEES 256
Query: 126 G-QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
++ +RA I PG+ +++ Y L T+ RR HL +K+F CQC RC DPTEL TF
Sbjct: 257 TFRLTVRASTRIEPGEMITLSYAYTLQSTLKRREHLLENKFFECQCRRCSDPTELGTFTS 316
Query: 185 GVKCPE 190
+ CP+
Sbjct: 317 ALICPK 322
>gi|170056265|ref|XP_001863952.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876021|gb|EDS39404.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 526
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 111/187 (59%), Gaps = 7/187 (3%)
Query: 6 SNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKN-SPKYENDRRNVVQFL 64
S+ T +YE I+ALRCL K P ++KL++++A E + + ++ ++ +VQ +
Sbjct: 145 SSATQIKLAYEAILALRCLLLKRTAPDRYEKLLEMDAMNERRQQITSLWKRNQEAIVQRI 204
Query: 65 LNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
+ +K EEF++ E+ +CGII+VNA ++ TE A++ A +I H+C PN + TD
Sbjct: 205 RDEWKF-EEFSEAEVHTVCGIIEVNAFQIGPTEVHARALYPEAFYIMHDCTPN--TTHTD 261
Query: 125 ---SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
S ++ +R + + G+ L++ Y+ L GT+ RR HL SK+F CQC RC DPTE T
Sbjct: 262 QPGSHELTVRPTRDLKAGEPLTLSYSYTLQGTLKRRQHLCDSKFFWCQCPRCEDPTEFGT 321
Query: 182 FYDGVKC 188
+KC
Sbjct: 322 HCSTLKC 328
>gi|24666583|ref|NP_649084.1| CG18136 [Drosophila melanogaster]
gi|7293833|gb|AAF49199.1| CG18136 [Drosophila melanogaster]
Length = 530
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 97/177 (54%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC++ K P + KL LE H++E +P Y+ R N++ F+ + ++
Sbjct: 158 AYCVIMLLRCMHLKDKDPDAFLKLYNLEDHLKERLETPLYQVLRANLITFIKTVLGM-KD 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 217 WPEMDILRIAAILDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|195496367|ref|XP_002095664.1| GE22530 [Drosophila yakuba]
gi|194181765|gb|EDW95376.1| GE22530 [Drosophila yakuba]
Length = 530
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 97/177 (54%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + KL LE H++E +P Y+ R N++ F+ + +
Sbjct: 158 AYCVIMLLRCMQLKAKDPEAFLKLYTLEDHLKERLETPLYQVLRANLITFIKTVLGM-KN 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 217 WLEMDILRIAAILDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|242007168|ref|XP_002424414.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507814|gb|EEB11676.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 563
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 107/195 (54%), Gaps = 21/195 (10%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFL-LNFFK--- 69
++ I LRCL + P W + +E+H+++ N+ Y+ +V++L NFFK
Sbjct: 155 FDVIFPLRCLLLQKKNPNGWKIITGMESHMKKRGPNTEIYKQTNEKIVKYLNDNFFKFLN 214
Query: 70 -------------LNEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCY 115
L +E T E + KICGII VN+ E+ + + A+++ A +EHNC
Sbjct: 215 DSGNREPDTEMILLPDERTPEILHKICGIIDVNSLEIQSPKGGTLNALYETAFLMEHNCL 274
Query: 116 PNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PN F S + ++ A I PG+H+S YT LWGT R+ HL+++K+FIC+C RC
Sbjct: 275 PNTRHIFQPSRRFKIKVFANNDIQPGNHISTMYTHALWGTQQRQDHLRSTKHFICRCRRC 334
Query: 174 RDPTELNTFYDGVKC 188
DP+E+ T G+KC
Sbjct: 335 GDPSEMGTNLSGLKC 349
>gi|19921122|ref|NP_609464.1| CG43129, isoform D [Drosophila melanogaster]
gi|442627404|ref|NP_001260366.1| CG43129, isoform E [Drosophila melanogaster]
gi|7297774|gb|AAF53024.1| CG43129, isoform D [Drosophila melanogaster]
gi|17945207|gb|AAL48662.1| RE12806p [Drosophila melanogaster]
gi|440213690|gb|AGB92901.1| CG43129, isoform E [Drosophila melanogaster]
Length = 507
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 101/182 (55%), Gaps = 7/182 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEE-YKNSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H EE +++ + DR N+ Q+L +L
Sbjct: 114 PQLDCIMPLRVLLAKEANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRGPCQL 173
Query: 71 NEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSG 126
F++E I+++ G+++VNA E P P +F + HNC PN +S ++
Sbjct: 174 ANRFSEELIMQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTSRSIYPSEGY 232
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ LRAM + G L YT L GT R+ HL+ K+F CQCERC DPTEL T + +
Sbjct: 233 KIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSL 292
Query: 187 KC 188
KC
Sbjct: 293 KC 294
>gi|195591423|ref|XP_002085440.1| GD12310 [Drosophila simulans]
gi|194197449|gb|EDX11025.1| GD12310 [Drosophila simulans]
Length = 530
Score = 117 bits (294), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 97/177 (54%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P + KL LE H++E +P Y+ R N++ F+ + ++
Sbjct: 158 AYCVIMLLRCMQLKAMDPDAFLKLYNLEDHLKERLETPLYQVLRANLITFIKTVLGM-KD 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 217 WPEMDILRIAAILDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|357620844|gb|EHJ72885.1| hypothetical protein KGM_13877 [Danaus plexippus]
Length = 517
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 104/193 (53%), Gaps = 19/193 (9%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVE-EYKNSPKYENDRRNVVQFLLNFF--KLN 71
++ ++ LRC + W KL++L++H+E + Y+ V++L+N F K++
Sbjct: 112 HDALLPLRCALLQYTDDDKWKKLLELQSHMECRVPGTDAYDEADEFTVKYLMNVFINKID 171
Query: 72 EE--------FTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF 122
+ + E + KICGII NA E+ L S + A++ +EH+C PN F
Sbjct: 172 KNVKNKYLNLISGELLHKICGIIDTNALEIRLPNGSELNALYATTCMMEHSCVPNTKHLF 231
Query: 123 TDSGQ-------VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
SG+ + ++ + PI GDH++ Y+ LWGT RR HL+ +KYF C+C RC D
Sbjct: 232 NTSGKDVKDKYKITVKVVVPINKGDHVATMYSHALWGTQARRQHLKDTKYFSCKCIRCSD 291
Query: 176 PTELNTFYDGVKC 188
PTEL T+ +KC
Sbjct: 292 PTELGTYLSAMKC 304
>gi|242019692|ref|XP_002430293.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212515408|gb|EEB17555.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 501
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 108/200 (54%), Gaps = 12/200 (6%)
Query: 1 MKIKISNFTTPH---PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYEND 56
+K++ + P P YECI LR L K P+IW+ ++ +LE+H EE + + ++ +
Sbjct: 92 VKVRFTKVKNPQGNCPQYECITPLRVLLGKEKFPEIWNSQISKLESHNEERQKTDIWKVE 151
Query: 57 RRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIA-IFDRASFIEHNCY 115
N V FL KL E F+ + I CGI++ NA P + + ++ ++S H+C
Sbjct: 152 SINTVGFLHKVCKLQERFSSDLIHFACGILETNAFSSPSSYGYDMRYLYPKSSLFLHSCI 211
Query: 116 PN----LYKSFTD---SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
PN ++K+ D S + +RA I G ++ YT LW T+ RR HL+ K+F C
Sbjct: 212 PNVGHYIFKNSNDGERSNVIHVRASLKIKKGQEFTLSYTQTLWPTLLRRSHLKEGKFFDC 271
Query: 169 QCERCRDPTELNTFYDGVKC 188
C+RC DPTEL T + +KC
Sbjct: 272 CCQRCSDPTELKTNFSTIKC 291
>gi|195352295|ref|XP_002042648.1| GM14901 [Drosophila sechellia]
gi|194124532|gb|EDW46575.1| GM14901 [Drosophila sechellia]
Length = 530
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/177 (35%), Positives = 98/177 (55%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + KL LE H++E +P Y+ R N++ F+ + ++
Sbjct: 158 AYCVIMLLRCMQLKAIDPKAFLKLYNLEDHLKERLETPLYQVLRANLITFINTVLGM-KD 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 217 WPEMDILRIAAILDTNTFEVRQPRERRKIRALYPEAAMISHDCVPNMRHRFDDDMNIVFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRKIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|386769408|ref|NP_001245962.1| CG43129, isoform C [Drosophila melanogaster]
gi|383291417|gb|AFH03636.1| CG43129, isoform C [Drosophila melanogaster]
Length = 503
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 101/184 (54%), Gaps = 7/184 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEE-YKNSPKYENDRRNVVQFLLNFFKL 70
P +CI LR L K P+ WD ++ +E H EE +++ + DR N+ Q+L +L
Sbjct: 110 PQLDCITILRVLLAKEANPERWDNEVAPMEHHKEERQRDADVWHADRVNIAQYLRGPCQL 169
Query: 71 NEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSG 126
F++E I+++ G+++VNA E P P +F + HNC PN +S ++
Sbjct: 170 ANRFSEELIMQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTSRSIYPSEGY 228
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ LRAM + G L YT L GT R+ HL+ K+F CQCERC DPTEL T + +
Sbjct: 229 KIRLRAMVDLEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSL 288
Query: 187 KCPE 190
KC +
Sbjct: 289 KCGQ 292
>gi|194873875|ref|XP_001973295.1| GG13432 [Drosophila erecta]
gi|190655078|gb|EDV52321.1| GG13432 [Drosophila erecta]
Length = 530
Score = 117 bits (292), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 98/177 (55%), Gaps = 3/177 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K+ P+ + +L LE H++E +P Y+ R N++ F+ + ++
Sbjct: 158 AYCVIMLLRCMQLKAKDPEAFLRLDTLEDHLKERLETPLYQVLRANLITFIKTVLGM-KD 216
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + +IL+I I+ N E+ P A++ A+ I H+C PN+ F D ++
Sbjct: 217 WLEMDILRIAAILDTNTFEVRQPRERRKIRALYPGAAMISHDCVPNMRHRFDDDMNIVFL 276
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ LSI YT PL TI RR HL+ +K F C C RC+DP EL +F C
Sbjct: 277 AKRRIAKGEILSISYTQPLRSTIQRRVHLRQAKCFDCSCARCQDPEELGSFAGAQTC 333
>gi|195387830|ref|XP_002052595.1| GJ20777 [Drosophila virilis]
gi|194149052|gb|EDW64750.1| GJ20777 [Drosophila virilis]
Length = 504
Score = 115 bits (289), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 102/183 (55%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K QP+ W +++ +E H + + N+ + DR N+ Q+L KL
Sbjct: 111 PQLDCIMPLRVLLAKEAQPERWLNEVAPMEHHEQARRENADVWHADRVNIAQYLRGPCKL 170
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
E F++E I+++ G+++VNA E ++ + ++ + HNC PN ++ ++ +
Sbjct: 171 AERFSEELIMQVVGVLEVNAFEARTSQGYPLRCLYPYTGILAHNCVPNTARTIHPSEGYK 230
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ LRAM + G L YT L GT R+ HL+ KYF C CERC D +EL T + +K
Sbjct: 231 IRLRAMVALEAGQQLQHSYTYTLDGTAQRQAHLREGKYFTCSCERCLDASELQTHFSSMK 290
Query: 188 CPE 190
C +
Sbjct: 291 CGQ 293
>gi|242007166|ref|XP_002424413.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212507813|gb|EEB11675.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 540
Score = 115 bits (287), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 98/175 (56%), Gaps = 3/175 (1%)
Query: 18 IIALRC-LYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ 76
I+ LRC L K + + K+M LE+H ++ N P ++ ++NV F+ K+ ++ +
Sbjct: 160 IVPLRCILLKGKKNSEKYKKIMNLESHYDDRINKPLFDLYKKNVADFIRERLKM-DDVDE 218
Query: 77 EEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRAMKP 135
+EIL I GI+ NA E+ +++ +S + H+C PN SF +++ A +
Sbjct: 219 KEILTITGILDTNAFEIVKNNVKLRGLYNVSSMLSHDCKPNTKHSFLQPDMTIVVTATRD 278
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
I G+ ++ YT W T++RR HL+T K F C CERC DP+EL T+ ++C +
Sbjct: 279 IKSGELITATYTQTFWNTLSRREHLKTIKCFDCVCERCSDPSELGTYAGAIRCSK 333
>gi|195127975|ref|XP_002008442.1| GI11814 [Drosophila mojavensis]
gi|193920051|gb|EDW18918.1| GI11814 [Drosophila mojavensis]
Length = 527
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 93/179 (51%), Gaps = 3/179 (1%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y I+ LRC+ K +P + +L +LE H++E +P Y+ R N++ F+ L E
Sbjct: 156 AYCVIMLLRCMQLKESKPAAFARLCELEDHLKERLETPLYQVLRANLISFIKTVLGLRE- 214
Query: 74 FTQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ +IL+I I+ NA E+ +F + H+C PN+ F D ++
Sbjct: 215 WPDLDILRIAAILDTNAFEVRQAGDRIKVRGLFPGGAMFSHDCVPNMRHRFDDDMNIMFL 274
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
A +PIA G+ LSI YT L TI RR HL+ K F C C RC DPTEL T+ C +
Sbjct: 275 AKRPIAKGEILSISYTQQLRSTIQRRLHLKQVKCFECACARCADPTELGTYAGAHMCAK 333
>gi|357603991|gb|EHJ63998.1| hypothetical protein KGM_04190 [Danaus plexippus]
Length = 564
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 97/189 (51%), Gaps = 11/189 (5%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPK-YENDRRNVVQFLL 65
N SY I LRCL K P W L ++E+H ++ Y + RNVVQ L
Sbjct: 176 NMEDLRDSYHAITPLRCLLLKKEDPSKWCALSEMESHNAIRRSRGDIYPMNDRNVVQRL- 234
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
K + +E+ +CGI++VNA E+ + + A++ A + H+C P+ S TD
Sbjct: 235 --KKWGLGYDDDEVHTVCGILEVNAFEVGASGANARALYSEAYLLAHDCTPS--TSHTDH 290
Query: 126 GQ-----VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
+ + +RA PGD +S+CY L GT+ RR HL SK+F+C C RC DPTE
Sbjct: 291 ERRPGRPLTIRAAIRHVPGDVISLCYAYTLQGTLKRREHLAVSKFFMCSCVRCADPTERG 350
Query: 181 TFYDGVKCP 189
T ++CP
Sbjct: 351 TQASSLRCP 359
>gi|157138251|ref|XP_001664198.1| hypothetical protein AaeL_AAEL003788 [Aedes aegypti]
gi|108880683|gb|EAT44908.1| AAEL003788-PA, partial [Aedes aegypti]
Length = 485
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 100/187 (53%), Gaps = 3/187 (1%)
Query: 6 SNFTTPHPSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFL 64
S+ T Y ++ LR L K P+ + D + LE+H+E +P Y R N+V F+
Sbjct: 153 SDPTRKESVYCAVVPLRALLLKRSNPERFRDGFLTLESHLETRIKTPLYAALRSNLVPFV 212
Query: 65 LNFFKLNEEFTQEEILKICGIIQVNAHE--MPLTEPSYIAIFDRASFIEHNCYPNLYKSF 122
+ + E T+ E+L+I I+ N ++ +P + A+++ + + H+C PN F
Sbjct: 213 RDILGMGGEVTERELLEIAAILDTNCYDVRLPELDVKVRALYELGAMMSHHCRPNTKHYF 272
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
+ ++++ A I +SI YT PL TI RR+ +Q SK F C C+RCRDPTEL T+
Sbjct: 273 DERLRLVVVATVDIPKDAVISISYTQPLLSTIQRRYAIQQSKCFECCCDRCRDPTELGTY 332
Query: 183 YDGVKCP 189
+ CP
Sbjct: 333 LGSIVCP 339
>gi|195050973|ref|XP_001993006.1| GH13590 [Drosophila grimshawi]
gi|193900065|gb|EDV98931.1| GH13590 [Drosophila grimshawi]
Length = 504
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 100/183 (54%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P + I+ LR L K QP+ W +++ +E H E + N+ + DR N+ Q+L KL
Sbjct: 111 PQLDSIMPLRVLLAKEAQPERWLNEVAPMEHHEEARRENADIWHADRVNIAQYLRGPCKL 170
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
E F++E I+++ G+++VNA E + ++ + HNC PN +S ++ +
Sbjct: 171 AERFSEELIMQVVGVLEVNAFEARTINGYPLRCLYPYTGILAHNCVPNTARSIHPSEGYK 230
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
V LRAM + G L YT L GT+ R+ HLQ KYF C C+RC D +EL T + +K
Sbjct: 231 VRLRAMVDLEAGQQLQHSYTYTLDGTVQRQAHLQQGKYFRCSCKRCLDASELQTHFSSLK 290
Query: 188 CPE 190
C +
Sbjct: 291 CGQ 293
>gi|189240267|ref|XP_971538.2| PREDICTED: similar to CG18136 CG18136-PA [Tribolium castaneum]
Length = 468
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 5/190 (2%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K + N + Y I LR L K + P+ ++ L+ ++H+++++ + Y+ ++++V
Sbjct: 100 KPTVKNDNSKQTVYCPIAPLRALLLKKNSPERFNLLLDFQSHLQDHEKTQVYQVLKKSLV 159
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLY 119
F KL+ + EIL IC I N E+ T+ + ++ SF+ H+C N
Sbjct: 160 PFFTQLLKLDT--NESEILTICSIFDTNCFEVRDTQRLVNIRGLYPTISFLSHSCKHNTK 217
Query: 120 KSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
F D+ +++L A PI GD ++ YT LWGT++RR HL+ +K+F C CERC DPTE
Sbjct: 218 HCFNGDNFRLVLTATTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTE 277
Query: 179 LNTFYDGVKC 188
T+ V C
Sbjct: 278 FGTYLSAVNC 287
>gi|270012346|gb|EFA08794.1| hypothetical protein TcasGA2_TC006485 [Tribolium castaneum]
Length = 498
Score = 112 bits (281), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 5/190 (2%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV 61
K + N + Y I LR L K + P+ ++ L+ ++H+++++ + Y+ ++++V
Sbjct: 100 KPTVKNDNSKQTVYCPIAPLRALLLKKNSPERFNLLLDFQSHLQDHEKTQVYQVLKKSLV 159
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLY 119
F KL+ + EIL IC I N E+ T+ + ++ SF+ H+C N
Sbjct: 160 PFFTQLLKLDT--NESEILTICSIFDTNCFEVRDTQRLVNIRGLYPTISFLSHSCKHNTK 217
Query: 120 KSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
F D+ +++L A PI GD ++ YT LWGT++RR HL+ +K+F C CERC DPTE
Sbjct: 218 HCFNGDNFRLVLTATTPIKKGDLVTTTYTQTLWGTLSRRSHLKMAKHFDCLCERCTDPTE 277
Query: 179 LNTFYDGVKC 188
T+ V C
Sbjct: 278 FGTYLSAVNC 287
>gi|198461482|ref|XP_001362030.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
gi|198137355|gb|EAL26610.2| GA13100 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 112 bits (279), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 94/170 (55%), Gaps = 13/170 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P+ W L+++++H EE K + ++ +V +L F E +++++L +
Sbjct: 168 PKKWTDLLEMQSHEEERKGTELHQEAEDGIVSYLSQRFLQRLEQSKKDVLEHCEPELLHR 227
Query: 82 ICGIIQVN--AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAP 138
+CGII+ N E+P T +F +A +EH C PN Y F S Q+ +RA +
Sbjct: 228 LCGIIETNYMVVELP-TGIELSGLFRQACMMEHACQPNCYFQFDGSTMQIEVRAGGDLKK 286
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C CERC DPTE ++ + C
Sbjct: 287 GDHLRITYTNILWGTHLRQHHLRLTKHFKCGCERCLDPTEFGSYVSALTC 336
>gi|195431128|ref|XP_002063600.1| GK21998 [Drosophila willistoni]
gi|194159685|gb|EDW74586.1| GK21998 [Drosophila willistoni]
Length = 557
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 57/174 (32%), Positives = 95/174 (54%), Gaps = 16/174 (9%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF--------------KLNEEFTQ 76
P+ W L+++++H EE K + ++ + +V +L F +L E+
Sbjct: 168 PERWSALLEMQSHEEERKGTELHQEAEKRIVNYLHQKFLHRLKQATKQSKESRLIAEYDP 227
Query: 77 EEILKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSF-TDSGQVLLRAMK 134
E + ++CGI++ N + L T +F +A +EH C PN Y +F + Q+ +RA
Sbjct: 228 ELLHRLCGIVETNYMVIELPTGVELSGLFRQACMMEHACQPNCYFTFDAATQQIAVRAGC 287
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ GDHL I YT+ LWGT R+HHL+ +K+F+C+C RC DPTE ++ + C
Sbjct: 288 NLKKGDHLKITYTNILWGTQLRQHHLKMTKHFVCRCSRCLDPTEYGSYVSAMAC 341
>gi|170061022|ref|XP_001866059.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879296|gb|EDS42679.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 495
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
+ +Y I+ LR + K PQ+W K++ +E+H+E KN+P Y R NVV F+ L
Sbjct: 73 YSAYCTIVPLRAILLKRTDPQLWAKVVSMESHLETRKNTPLYAAIRSNVVPFVRMVLGLR 132
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E ++EE++++ GI N+ E+ + E S A+++ + + H C PN F + ++
Sbjct: 133 TEASEEELMEVTGIFDTNSFEIRIPEKSIKIRALYELGALMNHCCQPNTRHYFDEEFNLV 192
Query: 130 LRAMKPI------------------APGDHL-SICYTDPLWGTINRRHHLQTSKYFICQC 170
+ A I P D + SI Y PL TI RR ++ +K F C C
Sbjct: 193 MIAAGRILEQLQDRTNSYFELLSVDIPKDEMISISYAQPLQATIQRRFTIKQAKCFECGC 252
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC DPTE T+ + CP+
Sbjct: 253 HRCSDPTEFRTYAGSIVCPQ 272
>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
Length = 501
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 68/185 (36%), Positives = 104/185 (56%), Gaps = 14/185 (7%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQ-LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
YECI LR L +K P+ W++ + +EAH EE K+ P +E ++ N+V+++ K+++
Sbjct: 109 YECITPLRMLLEKERDPKRWEEEISIMEAHNEERKSRPIWEFNQHNIVEYIRGPCKMDK- 167
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAI---FDRASFIEHNCYPNLYKSFTDSGQ--- 127
+++E I ICGI+++NA E PS AI F + + + HNC N++ + +G
Sbjct: 168 YSEELIHLICGILEINAFEA--RTPSCYAIRCLFPKLAILSHNCVSNIHHAVDCNGDGEL 225
Query: 128 ----VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFY 183
V +RA I G L YT LW T+ RR L+ SK+F C C RC PTEL+T
Sbjct: 226 KDCVVTVRAAVDIPKGGELFSSYTFSLWPTLVRREFLRESKFFDCTCPRCSSPTELDTHM 285
Query: 184 DGVKC 188
+KC
Sbjct: 286 SSLKC 290
>gi|170029520|ref|XP_001842640.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863224|gb|EDS26607.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 498
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 105/194 (54%), Gaps = 7/194 (3%)
Query: 2 KIKISNF---TTPHPSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDR 57
K+K N T +CI LR L K P+ W+ ++ +++ H EE + S ++ D+
Sbjct: 93 KVKFQNVDDSTAGCVQLDCITPLRVLLAKEADPERWNAEISRMQDHREERRGSTFWDADQ 152
Query: 58 RNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYP 116
+NVV+FL + L + ++E I ++ GI++VNA E + I ++ + + + H+C P
Sbjct: 153 KNVVEFLRSECGLKDRCSEELIQQVIGILEVNAFEARTSSGYTIRGLYPKLAIMAHSCVP 212
Query: 117 NLYKSFTDSGQVLLRAMKPI--APGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
N+ S S L A I G L YT L GT+ R+ L+++KYF CQC+RC
Sbjct: 213 NVVHSIHPSKDYRLTARTAIDVEEGSKLYTTYTYTLSGTMVRQAALKSTKYFTCQCKRCL 272
Query: 175 DPTELNTFYDGVKC 188
DPTEL T + +KC
Sbjct: 273 DPTELGTHFSSLKC 286
>gi|195171611|ref|XP_002026597.1| GL11805 [Drosophila persimilis]
gi|194111523|gb|EDW33566.1| GL11805 [Drosophila persimilis]
Length = 528
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 94/170 (55%), Gaps = 13/170 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P+ W L+++++H EE K + ++ +V +L F E +++++L +
Sbjct: 147 PKKWTDLLEMQSHEEERKGTELHQEAEDGIVSYLSQRFLQRLEQSKKDVLEHCEPELLHR 206
Query: 82 ICGIIQVN--AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAP 138
+CGII+ N E+P T +F +A +EH C PN Y F + Q+ +RA +
Sbjct: 207 LCGIIETNYMVVELP-TGIELSGLFRQACMMEHACQPNCYFQFDGTTMQIEVRAGGDLKK 265
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C CERC DPTE ++ + C
Sbjct: 266 GDHLRITYTNILWGTHLRQHHLRLTKHFKCGCERCLDPTEFGSYVSALTC 315
>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 554
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 104/197 (52%), Gaps = 13/197 (6%)
Query: 1 MKIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNV 60
M++ I N T + LR K P++W +++ LE+H+ + +N+ +E+ N+
Sbjct: 110 MELSIENTDTLAA---VLFVLRLWLLKQKDPELWKRVLSLESHLNKRRNTIVWEDREINI 166
Query: 61 VQFL--LNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI------AIFDRASFIEH 112
V L LNF N+ E I ++CGI+ VN+ E L P + ++ AS + H
Sbjct: 167 VNVLKSLNFVPENDPSVSEIIQQLCGILDVNSFE--LRSPGGMDGLLLRGLYLEASLMAH 224
Query: 113 NCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCER 172
+C N++ + D+ + + A PI GD + YT L GT RR +L+T KYF C+C+
Sbjct: 225 DCRGNVHVTADDNFHLTVYASIPIKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDL 284
Query: 173 CRDPTELNTFYDGVKCP 189
C+DP E+ ++ + CP
Sbjct: 285 CKDPYEMGSYMSCILCP 301
>gi|195023574|ref|XP_001985712.1| GH20950 [Drosophila grimshawi]
gi|193901712|gb|EDW00579.1| GH20950 [Drosophila grimshawi]
Length = 557
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 93/171 (54%), Gaps = 13/171 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF-----------KLNEEFTQEEI 79
P+ W L+++++H EE + ++ + +V +L F L E++ E +
Sbjct: 170 PERWAALLEMQSHEEERLGTELHQEAEQRIVSYLQQRFLQRIKAAKNRESLLSEYSAELL 229
Query: 80 LKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIA 137
++CGII+ N + L T +F +A +EH C PN Y F + Q + +RA +
Sbjct: 230 HRLCGIIETNYMVIELATGVELSGVFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLK 289
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DP+E T+ ++C
Sbjct: 290 KGDHLKITYTNILWGTQMRQHHLRMTKHFNCRCSRCEDPSEYGTYVSAMRC 340
>gi|335892814|ref|NP_001229486.1| uncharacterized protein LOC724300 [Apis mellifera]
Length = 479
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 96/181 (53%), Gaps = 5/181 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W+ + +L++H + + N+ + N
Sbjct: 140 PRCDVLLVIRMLILWCKKSKYWNSIQKLQSHEDSRGPGTSVYEETMNIYYHIQRLLPDNS 199
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQ--VL 129
T++ + KICG+I +NA E E +AI++ A +EH+C N SFT D G+ +
Sbjct: 200 S-TKDIVSKICGLIDINALETVPPE-GCVAIYETACLLEHSCLANTRHSFTIDKGRPRIT 257
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
++A+ I GDHLS YT LW T RR HL +KYF C C+RC DPTEL T +KCP
Sbjct: 258 VKALCSIQKGDHLSTMYTHALWATRVRRSHLLETKYFSCHCKRCADPTELGTHLGTLKCP 317
Query: 190 E 190
+
Sbjct: 318 Q 318
>gi|170061020|ref|XP_001866058.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879295|gb|EDS42678.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 494
Score = 110 bits (275), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 95/181 (52%), Gaps = 2/181 (1%)
Query: 12 HPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
H Y I+ LR + K +P + KL LE+HVE +++P Y + N+V F+ + L
Sbjct: 114 HSGYCAIVPLRAILLKRKEPARFAKLATLESHVETRQSTPLYAAVQSNLVPFVRDVLNLR 173
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E + ++++I GI N++E+ + E A+++ + + H C PN F + ++
Sbjct: 174 NEVSVGQLMEIAGIFDTNSYEIRIPERGIKIRALYELGAMMAHCCQPNTKHFFDEELNLV 233
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ A I + +SI Y PL TI RR ++ +K F C C RC DPTE T+ + CP
Sbjct: 234 MIAAVDIPKDEMISISYAQPLQATIQRRFTIKQAKCFECGCHRCSDPTEFRTYAGSIVCP 293
Query: 190 E 190
+
Sbjct: 294 Q 294
>gi|357606572|gb|EHJ65115.1| hypothetical protein KGM_22194 [Danaus plexippus]
Length = 482
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 94/186 (50%), Gaps = 11/186 (5%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQ-LEAHVEEYKNSPKYENDRRNVVQF 63
+ ++T P +C+ LR L K P W K +Q +E H E+ + ++ D+ N+V +
Sbjct: 98 VEDWTVSAPQLDCVTPLRLLVAKEQDPDRWSKEVQAMETHTEQRRKRSTWKADQINIVNY 157
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT 123
L++ KLN F++E + ++C S A++ R + H+C PN+ S
Sbjct: 158 LVDHCKLNCRFSKELVEQVCA---------SRGGFSIRAVYPRLAIAAHSCVPNIVHSIL 208
Query: 124 DSG-QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
+V +RA P+ G L + YT L TI RR HL+ SK+F C C RC DPTEL T
Sbjct: 209 QPDYRVEVRAAVPLQKGQRLHLSYTHVLSPTILRREHLRESKFFDCDCPRCTDPTELGTH 268
Query: 183 YDGVKC 188
KC
Sbjct: 269 LSTFKC 274
>gi|157124668|ref|XP_001654144.1| hypothetical protein AaeL_AAEL001857 [Aedes aegypti]
gi|108882773|gb|EAT46998.1| AAEL001857-PA [Aedes aegypti]
Length = 497
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 101/188 (53%), Gaps = 4/188 (2%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQF 63
+ + T +CI LR L K P+ WD +L +E H + K+ + D++N+V +
Sbjct: 101 VEDSTASCMQLDCITPLRMLLAKEDHPERWDAELSVMEYHNDARKDGTTWHQDQQNIVGY 160
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSF 122
L L + F++E I ++ GI+ +NA E+ ++ S ++ + + H+C PN+ S
Sbjct: 161 LRGPCGLKDRFSEEVIQQVIGILDINAFEVRTSKGYSARGLYPKLGVMAHSCVPNVVHSI 220
Query: 123 --TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
+D +++ R I G+ L YT L GT R+ L++SKYF C+C RC DPTEL
Sbjct: 221 HPSDGYRLIGRVAIDIPEGEKLYTVYTYTLNGTSTRQAALKSSKYFTCRCPRCLDPTELG 280
Query: 181 TFYDGVKC 188
T + +KC
Sbjct: 281 THFSSLKC 288
>gi|380026797|ref|XP_003697129.1| PREDICTED: uncharacterized protein LOC100866234 [Apis florea]
Length = 480
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 96/182 (52%), Gaps = 6/182 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W+ + +L++H + + N+ + N
Sbjct: 140 PRCDVLLVIRMLILWRKKSKHWNSIQKLQSHEDSRGPGTSAYEETMNIYYHIQRLLPDNS 199
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT--DSGQ--V 128
T++ + KICG+I VNA E E +AI++ A +EH+C N SFT D G+ +
Sbjct: 200 S-TKDIVSKICGLIDVNALETVPPE-GCVAIYETACLLEHSCLANTRHSFTIDDKGRPRI 257
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
++A+ I GDHLS YT LW T RR HL +KYF C C+RC DPTEL T +KC
Sbjct: 258 TVKAVCSIQKGDHLSTMYTHALWATRARRSHLLETKYFSCHCKRCADPTELGTHLGTLKC 317
Query: 189 PE 190
P+
Sbjct: 318 PQ 319
>gi|242013801|ref|XP_002427589.1| protein msta, isoform B, putative [Pediculus humanus corporis]
gi|212512004|gb|EEB14851.1| protein msta, isoform B, putative [Pediculus humanus corporis]
Length = 472
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 92/176 (52%), Gaps = 2/176 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPK-YENDRRNVVQFLLNFFKLNEE 73
Y + RCL K P+ W + +E+H E K K +E + + V + N + L+++
Sbjct: 96 YYALTPFRCLLLKKINPEKWKIIKNMESHTNERKKDGKLWETNEKMAVDIIKNVWGLSDD 155
Query: 74 FTQEEILKICGIIQVNAHEMPLTE-PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
+I +CGI++VN+ E+ S A++ A HNC PN S + +++ R
Sbjct: 156 DDDNDIHFVCGILEVNSFEVNFQNVQSLRALYSNAFLFSHNCIPNTSHSNNNEFKLIFRT 215
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
I ++I Y L T+ RR HL+ +K+F C C+RC DPTELNTF GVKC
Sbjct: 216 TTFIQKNSAVTISYAYTLQPTLLRRKHLEKTKFFQCNCQRCSDPTELNTFIGGVKC 271
>gi|28573973|ref|NP_610202.3| CG14590 [Drosophila melanogaster]
gi|19528095|gb|AAL90162.1| AT24727p [Drosophila melanogaster]
gi|28380690|gb|AAF57282.2| CG14590 [Drosophila melanogaster]
gi|220949792|gb|ACL87439.1| CG14590-PA [synthetic construct]
Length = 553
Score = 109 bits (272), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 93/175 (53%), Gaps = 15/175 (8%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL------ 80
+ P W L+++++H EE K + YE + VV +L F + T +L
Sbjct: 165 QRQSPTKWSALLEMQSHEEERKGTDLYEEAEKRVVTYLQKRFLCRLKQTNPNLLTDCGPE 224
Query: 81 ---KICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAM 133
++CGII+ N M + PS + +F +A +EH C PN F + + QV +RA
Sbjct: 225 MLHRLCGIIETNF--MVIELPSGVELSGLFRQACMMEHACQPNCDFQFDNKTQQVAVRAG 282
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DPTE T+ + C
Sbjct: 283 CDLRKGDHLRITYTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALTC 337
>gi|157124069|ref|XP_001660316.1| hypothetical protein AaeL_AAEL009751 [Aedes aegypti]
gi|108874117|gb|EAT38342.1| AAEL009751-PA [Aedes aegypti]
Length = 500
Score = 108 bits (271), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 94/194 (48%), Gaps = 14/194 (7%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN 66
NF H + + +RCL S +D++M +EAH EE +++ + + VV+ LL
Sbjct: 92 NFLFDH--FNVVTPVRCLLLASSNRAKFDEMMTMEAHTEERRDTDIWNAHEKKVVRPLLE 149
Query: 67 FFKL-----NEEFTQEEILKICGIIQVNAHEM------PLTEPSYI-AIFDRASFIEHNC 114
N + T E I +ICG VN+ E+ E S I ++ +A+ + HNC
Sbjct: 150 NKAFDGKMENLKVTSELIQRICGFFDVNSFEIRGNMDDAQQEESLIRGLYPKAALMAHNC 209
Query: 115 YPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
N S + L P+ G+ L YT L+GT RR HL+ KYF+C C RC
Sbjct: 210 VSNTLISVDGDANLRLYVTTPVKKGEMLFYNYTRSLFGTFERRTHLKLGKYFVCTCSRCE 269
Query: 175 DPTELNTFYDGVKC 188
DPTEL T VKC
Sbjct: 270 DPTELGTHLSSVKC 283
>gi|125986939|ref|XP_001357232.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
gi|54645563|gb|EAL34301.1| GA14319 [Drosophila pseudoobscura pseudoobscura]
Length = 508
Score = 108 bits (271), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 97/183 (53%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ WD ++ +E H E + N + D N+ Q+L KL
Sbjct: 115 PQLDCILPLRVLLAKEANPRRWDLEVAPMEHHESERRLNEDVWHADLVNIAQYLRGPCKL 174
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
F++E I+++ G+++VNA E + ++ + H+C PN +S ++ +
Sbjct: 175 ASRFSEELIMQVVGVLEVNAFEARTARGYPLRCLYPYTGILAHSCVPNTARSIYPSEGFK 234
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ LRAM + G L YT L GT R+ HL+ K+F C CERC DPTEL T + +K
Sbjct: 235 IRLRAMVDLDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLK 294
Query: 188 CPE 190
C +
Sbjct: 295 CGQ 297
>gi|195380866|ref|XP_002049182.1| GJ20889 [Drosophila virilis]
gi|194143979|gb|EDW60375.1| GJ20889 [Drosophila virilis]
Length = 553
Score = 108 bits (270), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 96/176 (54%), Gaps = 14/176 (7%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF--KLNEEFTQE------- 77
+ QP+ W L+++++H EE + + + +V +L F +L + +E
Sbjct: 166 QRQQPERWQALLEMQSHAEERLGTELQQEAEQRIVSYLEQRFLQRLKQSKNRECCQLSDY 225
Query: 78 --EIL-KICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRA 132
E+L ++CGII+ N + L T +F +A +EH C PN Y F + Q + +RA
Sbjct: 226 EPELLHRLCGIIETNYMVIELATGVELSGLFRQACMMEHACQPNCYFQFDSATQRIAVRA 285
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DPTE ++ ++C
Sbjct: 286 GCDMKRGDHLKITYTNILWGTQMRQHHLRMTKHFSCRCARCEDPTEYGSYVSAMRC 341
>gi|321449930|gb|EFX62153.1| hypothetical protein DAPPUDRAFT_120473 [Daphnia pulex]
Length = 510
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/153 (39%), Positives = 82/153 (53%), Gaps = 3/153 (1%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS 99
LE H + + S + + NVVQFL L +EF++EEI CG+I VNA E+ L
Sbjct: 156 LENHNDLRRQSDMWRIYQVNVVQFLRKICGLADEFSEEEIHASCGVIDVNAFEIRLAGNQ 215
Query: 100 Y---IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
Y + +F AS + HNC N + + +RA PI G+ + YT PL GT R
Sbjct: 216 YQQVLGVFPLASMMSHNCVANTQHVIDANYTMTVRASVPIMKGEQIFTSYTLPLEGTKER 275
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
R L+ SK F C C RC DPTE +T+ ++CP
Sbjct: 276 RDVLRHSKLFECDCSRCSDPTECSTYLSALRCP 308
>gi|91088043|ref|XP_966391.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012080|gb|EFA08528.1| hypothetical protein TcasGA2_TC006181 [Tribolium castaneum]
Length = 492
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 96/177 (54%), Gaps = 4/177 (2%)
Query: 15 YECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
YECI LR L K P+ W +++ +E+H + K P +E ++ N+V +L KL+
Sbjct: 106 YECITPLRVLLMKEKDPKRWQEEIEAMESHNDVRKTKPIWEFNQVNIVDYLKGPCKLDR- 164
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFT-DSGQVLLR 131
F E I +CGI+++NA E + I +F + + + HNC N++ S D+ QV +
Sbjct: 165 FADELIHTVCGILEINAFEARASSGYLIRCLFPKLAILSHNCVSNVHHSVDCDTFQVSVS 224
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + L YT LW T+ RR L+ SKYF C C+RC+D TEL T +KC
Sbjct: 225 ASVEVPESGELFSSYTYSLWPTLVRREFLKESKYFECMCDRCKDKTELGTHLGTLKC 281
>gi|195580894|ref|XP_002080269.1| GD10336 [Drosophila simulans]
gi|194192278|gb|EDX05854.1| GD10336 [Drosophila simulans]
Length = 531
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 92/171 (53%), Gaps = 15/171 (8%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P W L+++++H EE K + +E + VV +L F + T +L +
Sbjct: 147 PSKWSALLEMQSHEEERKGTELHEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHR 206
Query: 82 ICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+CGII+ N M + PS + +F +A +EH C PN F + + QV +RA +
Sbjct: 207 LCGIIETNF--MVIELPSGVELSGLFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLR 264
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DPTE T+ + C
Sbjct: 265 KGDHLRITYTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALTC 315
>gi|328785466|ref|XP_001121590.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 526
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 96/171 (56%), Gaps = 11/171 (6%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFL--LNFFKLNEEFTQEEILKICG 84
K P++W +++ LE+H+ + +N+ +E+ N+V L LNF + N+ E I ++CG
Sbjct: 105 KQKDPELWKRVLSLESHLNKRRNTIVWEDREINIVNVLKSLNFLE-NDPSVSEMIQQLCG 163
Query: 85 IIQVNAHEMPLTEPSYI------AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
I+ VN+ E L P + ++ AS + H+C N++ + D+ + + A PI
Sbjct: 164 ILDVNSFE--LRSPGGMDGLLLRGLYLEASMMAHDCRGNVHVTADDNFHLTVYASIPIKE 221
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
GD + YT L GT RR +L+T KYF C+C+ C+DP E+ ++ + CP
Sbjct: 222 GDTIFFNYTSSLLGTTGRREYLRTGKYFECECDLCKDPYEMGSYMSCIVCP 272
>gi|307187820|gb|EFN72772.1| Protein msta, isoform B [Camponotus floridanus]
Length = 484
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/182 (36%), Positives = 95/182 (52%), Gaps = 9/182 (4%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P E ++ +R L + + W L L++H ++ ++ Q L
Sbjct: 142 PRCEVLLVIRMLILWRTKSKRWTSLANLQSHEGSRGPGTDAHDEVTSISQHLGPLLM--- 198
Query: 73 EFTQEEIL-KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSGQ-- 127
F ++IL KICG+I VNA E E S +AI++ AS +EH+C N SF D G+
Sbjct: 199 AFDCQDILPKICGLIDVNALETMPPEGS-MAIYENASLLEHSCVANTRHSFRIDDKGRPR 257
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ + A+ I G+HLS YT LW T RR HL +KYF C+C+RC DPTEL T +K
Sbjct: 258 ITVYAVTSIKKGEHLSTMYTHALWSTRIRREHLLVTKYFACRCKRCADPTELGTHLGTLK 317
Query: 188 CP 189
CP
Sbjct: 318 CP 319
>gi|332017349|gb|EGI58093.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 481
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 86/160 (53%), Gaps = 6/160 (3%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
W L L+ HVE + ++ +V Q L + + ++ + ICG+I VNA E
Sbjct: 162 WTSLANLQNHVESRGPGTEAHDEVSSVSQRLGPLLLMASDC-KDVLPMICGLIDVNALET 220
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSGQ--VLLRAMKPIAPGDHLSICYTDP 149
E S +AI++ AS +EH+C N SF D G+ + + A+ I G HLS YT
Sbjct: 221 APPEGS-VAIYEHASLLEHSCIANTRHSFRIDDKGRPRITIYAVTSIKKGQHLSTMYTHA 279
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
LW T RR HL T+KYF C+C+RC DPTEL T + CP
Sbjct: 280 LWSTRARREHLLTTKYFACRCKRCADPTELGTHLGTLNCP 319
>gi|345483721|ref|XP_001601934.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
Length = 480
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/163 (38%), Positives = 85/163 (52%), Gaps = 6/163 (3%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA 90
P+ W L +L++H E + + V Q+L + L++ + E + K+CG+I VNA
Sbjct: 158 PKKWRVLSKLQSHEESRGPGTEAHEEMEAVKQYLGSLLSLDQ-GSAEALPKVCGLIDVNA 216
Query: 91 HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR----AMKPIAPGDHLSICY 146
E E S AI++ A +EH C N SF Q R A+ I DHLS Y
Sbjct: 217 LETNPPEGS-AAIYETACLLEHRCVANTRYSFELDEQGRPRINVYAVTDIKKADHLSTTY 275
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
T LW T RR HL +KYF C+CERC DPTEL + +KCP
Sbjct: 276 THVLWATRVRREHLLATKYFSCRCERCADPTELGSHLGTLKCP 318
>gi|195353810|ref|XP_002043396.1| GM16483 [Drosophila sechellia]
gi|194127519|gb|EDW49562.1| GM16483 [Drosophila sechellia]
Length = 486
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 91/171 (53%), Gaps = 15/171 (8%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P W L+++++H EE K + +E + VV +L F + T +L +
Sbjct: 147 PTKWSALLEMQSHEEERKGTELHEEAEKRVVSYLQKRFLCRLKQTNPSLLTDCGPEMLHR 206
Query: 82 ICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+CGII+ N M + PS + +F +A +EH C PN F + + QV +RA +
Sbjct: 207 LCGIIETNF--MVIELPSGVELSGLFRQACMMEHACQPNCDFQFDNKTQQVAVRAGCDLR 264
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C C RC DPTE T+ + C
Sbjct: 265 KGDHLRITYTNILWGTQLRQHHLRLTKHFSCHCNRCLDPTEYGTYISALTC 315
>gi|157125398|ref|XP_001654321.1| hypothetical protein AaeL_AAEL001920 [Aedes aegypti]
gi|108882685|gb|EAT46910.1| AAEL001920-PA [Aedes aegypti]
Length = 527
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 100/176 (56%), Gaps = 3/176 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVE-EYKNSPKYENDRRNVVQFLLNFFKLNEE 73
YE I ALRCL K P+ + L+++E+ E K S ++ ++ +V + + ++ +E
Sbjct: 155 YEAIFALRCLLLKKTDPERYATLLEMESLNELRQKISSLWKRNQEMIVGRIRDEWRF-DE 213
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRA 132
F++ EI +CGII+VN+ ++ + A++ A +I H+C PN + VL +R
Sbjct: 214 FSEGEIHTVCGIIEVNSFQIGQDDVYARALYPEAFYIMHDCTPNTTHTDDPKTHVLSIRL 273
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ GD +++ Y+ L GT+ RR HL SK+F CQC+RC DPTE T +KC
Sbjct: 274 TNDLKAGDPITLSYSYTLQGTLKRRQHLFESKFFWCQCKRCSDPTEFGTNCSTIKC 329
>gi|195146636|ref|XP_002014290.1| GL19122 [Drosophila persimilis]
gi|194106243|gb|EDW28286.1| GL19122 [Drosophila persimilis]
Length = 508
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 97/183 (53%), Gaps = 5/183 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K P+ W+ ++ +E H E + N + D N+ Q+L KL
Sbjct: 115 PQLDCILPLRVLLAKEANPKRWEVEVAPMEHHESERRLNEDVWHADLVNIAQYLRGPCKL 174
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSF--TDSGQ 127
F++E I+++ G+++VNA E + ++ + H+C PN +S ++ +
Sbjct: 175 ASRFSEELIMQVVGVLEVNAFEARTARGYPLRCLYPYTGILAHSCVPNTARSIYPSEGFK 234
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ LRAM + G L YT L GT R+ HL+ K+F C CERC DPTEL T + +K
Sbjct: 235 IRLRAMVDLDDGQPLHHSYTYTLDGTAQRQTHLREGKFFTCNCERCLDPTELQTHFSSLK 294
Query: 188 CPE 190
C +
Sbjct: 295 CGQ 297
>gi|332373498|gb|AEE61890.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 74/120 (61%), Gaps = 4/120 (3%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ---V 128
++ ++E I +ICGI+ VNA E+ + A++ A +EHNC N F ++ + +
Sbjct: 14 KDISKEIIHQICGIVDVNALEIN-QDAEVTALYPTAFLMEHNCLSNTKHMFDNADKGYTI 72
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+RA PIA GDH+S YT LWGT RR HL+ +KYF C+C RC DPTEL T+ +KC
Sbjct: 73 TIRASLPIAKGDHISTMYTHALWGTQARREHLRETKYFSCKCLRCADPTELGTYLSALKC 132
>gi|380022725|ref|XP_003695189.1| PREDICTED: protein msta, isoform B-like [Apis florea]
Length = 513
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 96/179 (53%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + P+ W +++ +EAH + +++ D+ N+V+++ KL+
Sbjct: 126 PQLECITPLRLLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKLDQVNIVEYIRKQLKLD 185
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F+++EI +CGI+++N E+ + S AI+ + + H+C N S + + ++
Sbjct: 186 R-FSEKEIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIR 244
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I PG L YT L T+ RR HL K+F C C RC DPTEL T +KC
Sbjct: 245 LRTTLKIPPGGELYGSYTHSLLPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC 303
>gi|347964965|ref|XP_309220.5| AGAP001025-PA [Anopheles gambiae str. PEST]
gi|333466560|gb|EAA04943.5| AGAP001025-PA [Anopheles gambiae str. PEST]
Length = 519
Score = 106 bits (264), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 95/186 (51%), Gaps = 3/186 (1%)
Query: 6 SNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLL 65
SN +Y I+ LR L + P+ + + E+H++E SP Y R N+V FL
Sbjct: 150 SNPEQRESAYCVIVPLRVLLLERIAPERYATVQGFESHLDERLASPLYGVLRSNLVPFLR 209
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPNLYKSFT 123
+L ++++++ +LK+ I+ N +E+ L E ++ + + H+C PN F
Sbjct: 210 QVLRL-QQYSEQTVLKLSAILDTNCYEIRLPEQHVKVRGLYPLGAMLSHDCRPNTKHYFD 268
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFY 183
D ++L A I G + YT PL GT+ RR L+ +K F C CERC DPTE T
Sbjct: 269 DRLHMVLVATVDIPAGGVIHASYTQPLLGTVQRRLALRQAKCFDCCCERCADPTEYGTSA 328
Query: 184 DGVKCP 189
G +CP
Sbjct: 329 SGFRCP 334
>gi|270011596|gb|EFA08044.1| hypothetical protein TcasGA2_TC005638 [Tribolium castaneum]
Length = 575
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 98/175 (56%), Gaps = 18/175 (10%)
Query: 31 PQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFF-------------KLNEEFTQ 76
P+ + +LM++EAH+++ ++ Y+ + +L + F ++ ++ ++
Sbjct: 167 PKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISK 226
Query: 77 EEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT---DSGQVLLRAM 133
E + KICGII VNA E+ + ++ A +EH+C N SF + ++ +RA
Sbjct: 227 ETVHKICGIIDVNALEIN-QDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAA 285
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
PI GDH+S YT LWGT RR HL+ +KYF C C+RC+DPTE+ ++ ++C
Sbjct: 286 LPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRC 340
>gi|91088857|ref|XP_971416.1| PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Length = 563
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 98/175 (56%), Gaps = 18/175 (10%)
Query: 31 PQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFF-------------KLNEEFTQ 76
P+ + +LM++EAH+++ ++ Y+ + +L + F ++ ++ ++
Sbjct: 167 PKKFAQLMEMEAHLDKRGPDTEIYKQVQERTFDYLDDAFFTPLKVLQARTGKEVLQDISK 226
Query: 77 EEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT---DSGQVLLRAM 133
E + KICGII VNA E+ + ++ A +EH+C N SF + ++ +RA
Sbjct: 227 ETVHKICGIIDVNALEIN-QDAEISVLYPTAYLMEHDCLCNTVHSFDTEENGYKITVRAA 285
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
PI GDH+S YT LWGT RR HL+ +KYF C C+RC+DPTE+ ++ ++C
Sbjct: 286 LPIKKGDHISTMYTHALWGTQARREHLKETKYFSCTCKRCKDPTEMGSYLSALRC 340
>gi|321474905|gb|EFX85869.1| hypothetical protein DAPPUDRAFT_237516 [Daphnia pulex]
Length = 595
Score = 105 bits (263), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 90/179 (50%), Gaps = 20/179 (11%)
Query: 30 QPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF--KLNEEFTQEEILKICGIIQ 87
QP W++LM LE+H + + + ++RN+ FL ++ + + + ++CG +
Sbjct: 191 QPDKWNQLMDLESHTTARQATSIWGFNQRNITPFLETMLIDEMPDIINDDLVQQMCGAVD 250
Query: 88 VNAHE------------------MPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
VN+ E MP ++ F AS + H+C N S + Q+
Sbjct: 251 VNSFEIRIETPDRRHTNDLPQQPMPNSQEVLRGTFYMASLMAHSCVVNAQISIGNDSQMT 310
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
++A PI G+ + + YTDPL T+ RR L+ K+FIC+C RC+DPTE TF V+C
Sbjct: 311 VKATVPIREGEGIFVSYTDPLQTTLQRRTFLEKGKHFICRCRRCQDPTEFGTFASAVRC 369
>gi|194864122|ref|XP_001970781.1| GG10833 [Drosophila erecta]
gi|190662648|gb|EDV59840.1| GG10833 [Drosophila erecta]
Length = 553
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 91/170 (53%), Gaps = 13/170 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P W L+++++H +E K + +E + VV +L F + T +L +
Sbjct: 169 PTKWSALLEMQSHEDERKGTELHEEAEQRVVSYLQQRFLCRLKQTNPNLLHDCGPEMLHR 228
Query: 82 ICGIIQVN--AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAP 138
+CGII+ N E+P T +F +A +EH C PN F + + Q+ +RA +
Sbjct: 229 LCGIIETNFMVIELP-TGVELSGLFRQACMMEHACQPNCDFQFDNKTQQIAVRAGCDLRK 287
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DPTE T+ + C
Sbjct: 288 GDHLRITYTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISALAC 337
>gi|156544373|ref|XP_001607398.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 507
Score = 105 bits (262), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 94/180 (52%), Gaps = 6/180 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L P+ W ++ +EAH ++ ++ D N+V+++ KL+
Sbjct: 120 PQLECITPLRLLLAAEKDPERWKSEIKDMEAHNKKRAQKNQWHIDHVNIVEYIRKRLKLD 179
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSG-QV 128
F++E+I CGI+ VN+HE+ T ++A ++ + + + HNC N S + ++
Sbjct: 180 R-FSEEDIQTACGILDVNSHEI-RTAKGFLARGLYPKVAIMNHNCVSNTAHSIDPNDYRI 237
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR+ + L YT L T+ RR HL SK+F C C RC DPTEL T +KC
Sbjct: 238 FLRSAVKVPAAGELFGSYTHALLPTLLRREHLLESKHFACACNRCSDPTELGTHMSSLKC 297
>gi|194743442|ref|XP_001954209.1| GF16861 [Drosophila ananassae]
gi|190627246|gb|EDV42770.1| GF16861 [Drosophila ananassae]
Length = 550
Score = 105 bits (262), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 91/171 (53%), Gaps = 15/171 (8%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL---------K 81
P W L+++++H EE + + + VV FL F + T+ ++L +
Sbjct: 166 PAKWTALLEMQSHEEERQGTELHVEAEMRVVDFLQQRFLKRLKQTKPDLLADCGSELLHR 225
Query: 82 ICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
ICGII+ N M + PS + +F +A +EH C PN Y F + Q+ +RA +
Sbjct: 226 ICGIIETNY--MVIELPSGVELSGLFRQACMMEHACQPNCYFQFDAFTQQITVRAGCDLQ 283
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R+HHL+ +K+F C C RC DPTE ++ + C
Sbjct: 284 KGDHLRIAYTNILWGTQLRQHHLRLTKHFSCHCARCLDPTENGSYVSALAC 334
>gi|195475636|ref|XP_002090090.1| GE19428 [Drosophila yakuba]
gi|194176191|gb|EDW89802.1| GE19428 [Drosophila yakuba]
Length = 526
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 93/174 (53%), Gaps = 13/174 (7%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL------ 80
+ P W L+++++H +E + + ++ + VV +L N F + T +L
Sbjct: 143 QRQSPTKWLALLEMQSHEDERRGTELHDEAEKRVVSYLQNRFLCRLKQTNPSLLHDCEPE 202
Query: 81 ---KICGIIQVN--AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMK 134
++CGII+ N E+P T +F +A +EH C PN F + + QV +RA
Sbjct: 203 MLHRLCGIIETNFMVIELP-TGLELSGLFRQACMMEHACQPNCDFQFDNKTHQVAVRAGC 261
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ GDHL I YT+ LWGT R+HHL+ +K+F C+C RC DPTE T+ + C
Sbjct: 262 DLRKGDHLQITYTNILWGTQLRQHHLRLTKHFSCRCSRCLDPTEYGTYISAMAC 315
>gi|170029518|ref|XP_001842639.1| msta [Culex quinquefasciatus]
gi|167863223|gb|EDS26606.1| msta [Culex quinquefasciatus]
Length = 495
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 92/176 (52%), Gaps = 5/176 (2%)
Query: 16 ECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
+CI LR L K P W+ +++ +E H E + ++ D+ NVV FL + L +
Sbjct: 113 DCITPLRVLLAKEANPDWWNAEIVMMEDHRAERDGNAFWKADQSNVVAFLRDSCGLKDRC 172
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG--QVLLRA 132
++E I K+ GI+ VNA E S ++ + + H+C NL + S +++ RA
Sbjct: 173 SEELIQKVIGILDVNAFEAHTC--SLRGLYPKMGIMAHSCVTNLAHTVHPSKGYRLIARA 230
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
I G L YT L GT R+ LQ +KYF CQC+RC DPTEL T + +KC
Sbjct: 231 AVDIEEGAMLCTTYTHLLAGTRTRQAELQRTKYFTCQCKRCLDPTELGTHFSSMKC 286
>gi|383853714|ref|XP_003702367.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 484
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 97/183 (53%), Gaps = 10/183 (5%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W L +L++H + + + N+ + + F
Sbjct: 141 PRSDVLLVIRMLLLWRKKSKQWMALEKLQSHDDSRGPGTQAYEEAMNMCEHIQRFLPAIG 200
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPNLYKSFT--DSG-- 126
+E + KICG+I VNA E T P +AI++ A +EH+C N SF+ D G
Sbjct: 201 A-GKEIVAKICGLIDVNALE---TAPPRGCVAIYNTACLLEHSCVANTRHSFSIDDKGRP 256
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ ++A+ I G+HLS YT LW T RR HL +KYF C+C+RC DPTEL + +
Sbjct: 257 KITVKALCSIKKGEHLSTMYTHALWATRARRAHLLETKYFSCRCKRCADPTELGSHLGTL 316
Query: 187 KCP 189
+CP
Sbjct: 317 RCP 319
>gi|66521464|ref|XP_396314.2| PREDICTED: protein msta, isoform B-like [Apis mellifera]
Length = 513
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 95/179 (53%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + P+ W +++ +EAH + ++++D+ N+V+++ KL
Sbjct: 126 PQLECITPLRLLLESEKNPERWKNEVKDMEAHNKIRSQKKQWKSDQINIVEYIRKQLKL- 184
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
E F++E+I +CGI+++N E+ + S AI+ + + H+C N S + + ++
Sbjct: 185 ERFSEEQIQTVCGILEINTFEVRTAKGFSARAIYPTVALMNHSCISNTCHSISPTDYKIR 244
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I G L YT L T+ RR HL K F C C RC DPTEL T +KC
Sbjct: 245 LRTTLKIPVGGELYGSYTHSLLPTMLRREHLLEGKNFACACARCSDPTELGTHMSSLKC 303
>gi|340722924|ref|XP_003399849.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 513
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 93/179 (51%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIW-DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + P+ W D++ +EAH + +++++ N+V ++ KL+
Sbjct: 126 PQLECITPLRLLLESERNPERWNDEIKNMEAHNKIRSQKKHWQSNQINIVDYIRKQLKLD 185
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F++E+I CGI+++N E+ + S A++ + + H+C N S + + +V
Sbjct: 186 R-FSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVR 244
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I G L YT L T+ RR HL KYF C C RC DPTEL T +KC
Sbjct: 245 LRTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACSRCSDPTELGTHVSSLKC 303
>gi|383858987|ref|XP_003704980.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 514
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 94/179 (52%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + P+ W+ ++ +EAH + +++D+ NVV++L N KL+
Sbjct: 127 PQLECITPLRLLLESEKNPERWNNEVKNMEAHNKIRSQKAHWKSDQVNVVEYLRNQLKLD 186
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F++E I CGI+++N E+ + A++ + + H+C N S + + ++
Sbjct: 187 R-FSEELIQTACGILEINTFEVRTSRGYGARALYPTVALMNHSCVSNTCHSISPTDYRIR 245
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR + P L YT L+ T+ RR HL K+F C C RC DPTEL T +KC
Sbjct: 246 LRTTVHVPPSGELYGSYTHSLYPTMLRREHLLEGKHFACACARCSDPTELGTHMSSLKC 304
>gi|195124734|ref|XP_002006842.1| GI21287 [Drosophila mojavensis]
gi|193911910|gb|EDW10777.1| GI21287 [Drosophila mojavensis]
Length = 551
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 89/171 (52%), Gaps = 13/171 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF-----------KLNEEFTQEEI 79
P W L+++++H EE + ++ + VV +L F + ++ + +
Sbjct: 170 PDRWRALLEMQSHEEERLGTELHQEAEQRVVGYLQQRFLQRIKQSKQRDSMPSDYEPQLL 229
Query: 80 LKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIA 137
++CGII+ N + L T +F +A +EH C PN Y F + Q + +RA +
Sbjct: 230 HRLCGIIETNYMVIELATGIELSGLFRQACMMEHACQPNCYFQFDAATQRIAVRAGCDLK 289
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GDHL I YT+ LWGT R HHL +K+F C+C RC DPTE ++ ++C
Sbjct: 290 KGDHLKITYTNILWGTQMRHHHLLMTKHFSCRCARCEDPTEYGSYVSAMRC 340
>gi|322799089|gb|EFZ20542.1| hypothetical protein SINV_07869 [Solenopsis invicta]
Length = 491
Score = 103 bits (256), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 87/160 (54%), Gaps = 6/160 (3%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
W L L++HVE + ++ +V Q L + + ++ + ICG++ VNA E
Sbjct: 170 WTSLANLQSHVESRGPGTEAYDEVSSVSQRLGPLLLMASD-CKDILPMICGLVDVNALET 228
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSGQ--VLLRAMKPIAPGDHLSICYTDP 149
E S +AI++ AS +EH+C N +F D G+ + + A+ I G+HLS YT
Sbjct: 229 APPEGS-MAIYENASLLEHSCIANTRHNFRVDDKGRPRITVYAVTSIKRGEHLSTMYTHA 287
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
LW T RR HL +KYF C+C+RC DPTEL T + CP
Sbjct: 288 LWSTRARREHLLATKYFACRCKRCADPTELGTHLGTLNCP 327
>gi|242019690|ref|XP_002430292.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
gi|212515407|gb|EEB17554.1| histone-lysine N-methyltransferase ASHR1, putative [Pediculus
humanus corporis]
Length = 487
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/166 (34%), Positives = 90/166 (54%), Gaps = 12/166 (7%)
Query: 33 IWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL--KICGIIQVNA 90
+WD + +E+HV+E +N P + + N+++ L + LN+E E +L KICGI+ VN+
Sbjct: 122 LWDNFINMESHVDERQNEPIWNYYQENIIKVLNSVNILNDE---ECLLMQKICGILDVNS 178
Query: 91 HEMPLTEPSYIA-------IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLS 143
EM + I+ I+ A+F+ H+C N + S D ++ + A PI G +
Sbjct: 179 FEMRCPDDEQISAGNQLRGIYREAAFMSHDCIGNTHVSLNDFFEMTIHASVPIKSGQAIF 238
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
Y +P T R+ L+ KYF C C+RC DPTEL T+ + CP
Sbjct: 239 FNYANPFECTAARQQRLKKGKYFNCFCKRCTDPTELGTYSGSIICP 284
>gi|118776986|ref|XP_307207.3| AGAP011523-PA [Anopheles gambiae str. PEST]
gi|116133074|gb|EAA03232.3| AGAP011523-PA [Anopheles gambiae str. PEST]
Length = 473
Score = 102 bits (255), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 99/178 (55%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYEN-DRRNVVQFLLNFFKLNEE 73
YE I+ LRC+ K+ +D+L+ ++ + + PK N +++ +V+ + N + E
Sbjct: 99 YEAILTLRCMLLKTVDRPQYDRLLGMDPLNDVRQQIPKLWNRNQKEIVERIRNQWGF-AE 157
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ---VLL 130
+++ E+ ICGII+VNA E+ A+F A + H+C PN TD+ Q + +
Sbjct: 158 YSELELHTICGIIEVNAFEVGQEPTKARALFPEAYLLMHDCTPN--TGHTDAPQTHHLTV 215
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R ++ + G+ L++ Y L GT+ RR HL+ K+F C C RC DP+EL T ++C
Sbjct: 216 RVLRDVKAGEPLTLTYAHILQGTLKRRQHLREEKFFSCNCRRCTDPSELGTNCSALRC 273
>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
Length = 518
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 96/189 (50%), Gaps = 13/189 (6%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
+ ++ +RCL + +D++M +E+ +EE + + ++ + VV+ L+ E+F
Sbjct: 98 FNVVMPVRCLMLYRSNRERYDEVMTMESRLEERRGTEIWDIHEKFVVKPLMESGIFAEKF 157
Query: 75 -----TQEEILKICGIIQVNAHEMPLTEPS--------YIAIFDRASFIEHNCYPNLYKS 121
T E I +ICGI+ N E+ S ++ + + + H+C PN S
Sbjct: 158 EGLEVTGELIQRICGILDANTFEIRGNVDSRESQMGNLARGLYPKTALMMHSCMPNTLLS 217
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
V + P+ G+ L+I YT L+GT +R+ HL+ KYFIC C RC+DPTEL T
Sbjct: 218 VDGYSNVRVFTSAPVKMGEILNISYTRSLFGTYDRQTHLRKGKYFICSCRRCKDPTELGT 277
Query: 182 FYDGVKCPE 190
++C E
Sbjct: 278 HLSSIRCSE 286
>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
Length = 515
Score = 102 bits (254), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 88/177 (49%), Gaps = 8/177 (4%)
Query: 20 ALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEI 79
LR L KK W M+LE H+E + S ++ N V+F+ + L Q+ +
Sbjct: 114 VLRALLKKETNLLEWKLFMELETHLERRRESNVWQY-YDNTVKFIQSLGLLENGQNQDLV 172
Query: 80 LKICGIIQVNAHE-----MPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
KIC I VN+ E +P + I ++ +A+ + H+C N + S D+ ++ A
Sbjct: 173 QKICAAIDVNSFEVRGPPIPAIGCAEILRGVYLQAALLSHDCIANTHMSIDDNNMLVCHA 232
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
I G+ ++ YTDPL GTI R+ HL KYF C C RC D TEL TF CP
Sbjct: 233 SVDIKKGESINYNYTDPLKGTIPRQQHLIVGKYFKCTCTRCTDNTELGTFMSSAICP 289
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 93/176 (52%), Gaps = 6/176 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
++CI LR L + D+L+ +H+E+ + Y +N+ FL + L ++
Sbjct: 141 FQCITPLRYLTLPDSDRERLDELV---SHLEQRRGMDIYRLVEQNISSFL-RYRLLLTQY 196
Query: 75 TQEEILKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRA 132
E I ++CGI++ N E+ + S ++ AS + H+C N F + ++ + A
Sbjct: 197 DSESIQRVCGILETNCFEIRIQGRVSVRGLYPTASLMNHDCVANTRHVFDPADFRIRILA 256
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
K I GD +S YT LW T++RR HL+++K+F CQC RC DP EL T VKC
Sbjct: 257 TKDIPAGDKISATYTRSLWNTLDRRLHLKSTKHFWCQCSRCADPRELGTLLSAVKC 312
>gi|307197684|gb|EFN78851.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 487
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 93/181 (51%), Gaps = 6/181 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W L L++H + + + ++ Q+L +
Sbjct: 145 PRCDVLLVIRMLILWRTKSKRWTSLTNLQSHEDSRGPGTEAYEEVMSISQYLEPIVAMVP 204
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSGQ--V 128
+ + KICG+I VNA E E S +AI++ S +EH+C N SF D G+ +
Sbjct: 205 -CCKSVLPKICGLIDVNALETMPPEGS-VAIYENMSLLEHSCMANTRHSFRLDDKGRPRI 262
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ A+ I G+HLS YT LW T RR HL +KYF C+CERC DPTEL ++C
Sbjct: 263 TVNAITFIKKGEHLSTMYTHALWSTRARREHLLATKYFSCRCERCADPTELGAHLGSLQC 322
Query: 189 P 189
P
Sbjct: 323 P 323
>gi|350403520|ref|XP_003486827.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 513
Score = 101 bits (251), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 93/179 (51%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + P+ W+ ++ +EAH + +++++ N+V ++ KL+
Sbjct: 126 PQLECITPLRLLLESERNPERWNNEIKDMEAHNKIRSQKKHWQSNQINIVDYIRKQLKLD 185
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F++E+I CGI+++N E+ + S A++ + + H+C N S + + +V
Sbjct: 186 R-FSEEQIHTACGILEINTFEIRTAKGCSARALYPTVALMNHSCISNTCHSISPTDYRVR 244
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I G L YT L T+ RR HL KYF C C RC DPTEL T +KC
Sbjct: 245 LRTTLKIPAGGELYGSYTHSLLPTLLRREHLLEGKYFACACPRCSDPTELGTHVSSLKC 303
>gi|340710128|ref|XP_003393648.1| PREDICTED: protein msta, isoform B-like [Bombus terrestris]
Length = 480
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 92/181 (50%), Gaps = 6/181 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W + +L++H + + N+ + N
Sbjct: 140 PRCDVLLVIRMLILWRKKLKHWYSIEKLQSHQDSRGQGTNAYEETMNIYHHIERLLPDNP 199
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT--DSGQ--V 128
+++ + KICG+I VNA E + +AI++ A +EH+C N SF D G+ +
Sbjct: 200 S-SKDIVHKICGLIDVNALET-IPPEGCVAIYETACLLEHSCLANTRHSFKIDDKGRPRI 257
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ A+ PI DHLS YT LW T RR HL +KYF C C+RC DPTEL T +KC
Sbjct: 258 TVIAVCPIQKWDHLSTMYTHALWSTRARRAHLSETKYFSCHCKRCADPTELGTHLGTLKC 317
Query: 189 P 189
P
Sbjct: 318 P 318
>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
Length = 530
Score = 100 bits (248), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 92/178 (51%), Gaps = 6/178 (3%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ- 76
++ LR K P++ ++ +EAH+EE +N+ + + NV+ + + E+ T
Sbjct: 122 LLPLRLWLVKQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVMKALHLIPEDDTSI 181
Query: 77 -EEILKICGIIQVNAHEM----PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
E + ++CGI+ VN E+ L ++ ASF+ H+C N + + D Q+ +
Sbjct: 182 PEYLQQLCGILDVNTFELRSPGGLDGLLLRGLYVEASFMAHDCRGNTHLTVDDHFQLTVY 241
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
A PI G+ + YT L GT+ RR HL+ KYF C+C C DP E+ ++ + CP
Sbjct: 242 ASIPIKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLCSDPYEMGSYMSSILCP 299
>gi|350413549|ref|XP_003490027.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 480
Score = 99.0 bits (245), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 92/181 (50%), Gaps = 6/181 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
P + ++ +R L + + W + +L++H + + N+ Q + N
Sbjct: 140 PRCDVLLVIRMLILWRKKLKYWYSIEKLQSHQDSRGQGTNAYEETMNIYQHIERLLPDNP 199
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT--DSGQ--V 128
+++ + KICG+I VNA E E +AI++ A +EH+C N SF D G+ +
Sbjct: 200 S-SKDIVHKICGLIDVNALETIPPE-GCVAIYETACLLEHSCLANTRHSFKIDDKGRPRI 257
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ A+ I DHLS YT LW T RR HL +KYF C C+RC DPTEL T +KC
Sbjct: 258 TVIAVCSIQKWDHLSTMYTHALWSTRARRAHLLETKYFSCHCKRCADPTELGTHLGTLKC 317
Query: 189 P 189
P
Sbjct: 318 P 318
>gi|332018228|gb|EGI58833.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 483
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 91/179 (50%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + + W+K + +EAH + ++++D+ N+V +L KL+
Sbjct: 96 PQLECITPLRLLLESERNVERWNKEVKDMEAHNKTRCQKSQWKSDQINIVDYLRKRLKLD 155
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F+++ I ICGI+++N E+ + S ++ + + H+C N S + ++
Sbjct: 156 R-FSEKYIQTICGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIR 214
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I G L YT L TI RR HL K+F C C RC DPTEL T +KC
Sbjct: 215 LRTTLKIPAGGELYASYTHSLLPTILRREHLLEGKHFACACPRCSDPTELGTHMSSLKC 273
>gi|195133792|ref|XP_002011323.1| GI16059 [Drosophila mojavensis]
gi|193907298|gb|EDW06165.1| GI16059 [Drosophila mojavensis]
Length = 526
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 97/181 (53%), Gaps = 9/181 (4%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + HQP+ + K+ +E+H E+ + N +++ + VVQ LL+ ++L +E
Sbjct: 148 YELVLIVRILLMRKHQPEKYAKIRLMESHTEQRRENLTLWQHYEKKVVQRLLHDWQLAQE 207
Query: 74 FTQ---EEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG---Q 127
+ E+ +ICGI+ VN E+ ++ A + H+C PN S TD
Sbjct: 208 LSGLEPAEMHEICGILDVNCFEIGQRGGKARTLYPSAFLLAHDCRPN--TSHTDDPLDYS 265
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
++LR + + + L++ Y L GT+ RR +Q K F CQC+RC DP EL + +
Sbjct: 266 IILRTSRSVREQETLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSDLV 325
Query: 188 C 188
C
Sbjct: 326 C 326
>gi|195432146|ref|XP_002064087.1| GK19981 [Drosophila willistoni]
gi|194160172|gb|EDW75073.1| GK19981 [Drosophila willistoni]
Length = 390
Score = 96.7 bits (239), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/177 (30%), Positives = 93/177 (52%), Gaps = 4/177 (2%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEE 73
YE I+ +R L + + P+ + +M++E+H EE KN+ +++ VVQ L+ +++ +
Sbjct: 153 YELIMIVRILLLRQNAPEQYALVMRMESHSEERRKNATLWQHYEEKVVQRLIELWQMGDL 212
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG--QVLLR 131
+++ +ICGI+ VN E+ ++ A + H+C PN + D G +LLR
Sbjct: 213 AEAQQLHEICGILDVNCFEIGQNGAKARTLYPSAFLLAHDCSPNTAHT-DDPGTYAILLR 271
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ + + L++ Y L GT+ RR + K F CQC RC DP EL T + C
Sbjct: 272 TSRRVHEQEALTLSYAYTLQGTLKRRSFMHEGKLFWCQCRRCADPKELGTDCSALAC 328
>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
Length = 495
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 89/170 (52%), Gaps = 11/170 (6%)
Query: 26 KKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQ-----FLLNFFKLNEEFTQEEIL 80
K+SH +++L+QLEAH+ +++ + + R V++ LL + E +E +
Sbjct: 121 KRSH----YEQLLQLEAHLARRRDTDIWRSYRAEVIEPLQASGLLLQLRNGAEIDEELLQ 176
Query: 81 KICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++ I +N E+ P + + ++ A+ H+C PNL + + ++ L A + IA
Sbjct: 177 RLLAIADINGFEIRAPESGGAMRGVYMTAALFAHSCLPNLVTAVDEQRRIKLYANRIIAA 236
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G+ L CYTD L GT RR L+T K F CQC RC DPTEL T C
Sbjct: 237 GEILYNCYTDILLGTDERRRILKTGKCFDCQCARCTDPTELGTHMSSFVC 286
>gi|307172338|gb|EFN63826.1| Protein msta, isoform B [Camponotus floridanus]
Length = 510
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 60/180 (33%), Positives = 90/180 (50%), Gaps = 6/180 (3%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + + W+K + +EAH + ++++D N+V +L KL+
Sbjct: 123 PQLECITPLRLLLESERNVERWNKEIKDMEAHNKIRSEKIQWKSDHVNIVDYLRKRLKLD 182
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSG-QV 128
F++E I ICGI+ +N E+ T + A ++ + + H+C N S + ++
Sbjct: 183 R-FSEEFIQTICGILDINTFEVR-TMKGFCARGLYPTVAMMNHSCVSNTSHSISPIDYRI 240
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR + G L YT L T+ RR HL KYF C C RC DPTEL T +KC
Sbjct: 241 RLRTTLKVPAGGELYGSYTHSLLPTMLRREHLLEGKYFACACPRCSDPTELGTHMSSLKC 300
>gi|322792366|gb|EFZ16350.1| hypothetical protein SINV_08812 [Solenopsis invicta]
Length = 510
Score = 95.9 bits (237), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 89/179 (49%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + + W+K + +E H + ++++D N+V +L KL+
Sbjct: 123 PQLECITPLRLLLESERNVERWNKEIKDMEPHNKIRCQKAQWKSDHVNIVDYLRKRLKLD 182
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F++E I +CGI+++N E+ + S ++ + + H+C N S + ++
Sbjct: 183 R-FSEEHIQTVCGILEINTFEVRTAKGFSARGLYPTVAMMNHSCVSNTSHSISPIDYRIR 241
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I G L YT L T+ RR HL K+F C C RC DPTEL T +KC
Sbjct: 242 LRTTLKIPAGGELYASYTHSLLPTMLRREHLLEGKHFACACPRCSDPTELGTHMSSLKC 300
>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
Length = 492
Score = 95.5 bits (236), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 90/184 (48%), Gaps = 10/184 (5%)
Query: 16 ECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFT 75
E + LRCL + + +QLEAH+ ++S + +R N+ L L+E
Sbjct: 104 EIVTPLRCLLWRQFNKNQLENFLQLEAHLNFRRDSQIWRRNRVNIEDVLFKLKILDESDL 163
Query: 76 QEEIL-KICGIIQVNAHEMPLTEPSYIA---------IFDRASFIEHNCYPNLYKSFTDS 125
+EE++ KICGI+ +N ++ + + + ++ RA+ + H+C N + + D
Sbjct: 164 KEEVVQKICGILDINTFDVRQPQRNRLGFNQAENLRGLYLRAALMAHDCVANTHLAVDDD 223
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
+ + A I G + YTD L G R+ L +K+F CQC RCRD TE T
Sbjct: 224 FVLYVHAAVDIPEGSPIYFNYTDVLQGNDERKRRLLNAKHFECQCSRCRDSTEFGTEMSS 283
Query: 186 VKCP 189
+KCP
Sbjct: 284 LKCP 287
>gi|345480417|ref|XP_001607394.2| PREDICTED: protein msta, isoform B-like [Nasonia vitripennis]
Length = 557
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 81/166 (48%), Gaps = 9/166 (5%)
Query: 33 IWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHE 92
+ ++L ++EAH+ E + S + NVV F + E E + +CG++ VN+ E
Sbjct: 150 VRERLSKMEAHMNERRGSEVWFEREANVVDVFRRFGLVRSENEAELVQWLCGVLDVNSFE 209
Query: 93 M--PL-------TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLS 143
+ P+ P IF A+ + H C + + D Q+ + A PI G+ ++
Sbjct: 210 LRTPVPGSNGNNGSPLLRGIFLEAALMAHACRGTAHIAVDDRFQMTVYAAVPIPAGETIA 269
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
YT L GTI R+ HLQ KYF C+C C DP E +F + CP
Sbjct: 270 FNYTSSLLGTIERQEHLQVGKYFRCECSMCVDPLEHGSFVSCILCP 315
>gi|194756634|ref|XP_001960581.1| GF11433 [Drosophila ananassae]
gi|190621879|gb|EDV37403.1| GF11433 [Drosophila ananassae]
Length = 497
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 94/177 (53%), Gaps = 9/177 (5%)
Query: 20 ALRCLYKKSH---QPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN---FFKLNEE 73
AL+C+ + H QP +++++ Q+E + + + ++N++ + L +LN E
Sbjct: 111 ALKCILLREHLETQP-LYEEISQMENQLTARRGTDIWKNNQEHAYNPLEKSGVLKQLNSE 169
Query: 74 FTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
++ + + G++ VNA E+ P + S ++ RA H+C PNL + D ++ +
Sbjct: 170 ANEDLVQGLLGMLDVNAFEIRAPESGGSMRGLYRRAGLYPHSCMPNLVTAIDDERRIKVY 229
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A + IA G+ L CYT+ L GT RRH L+ K F C C RC+DPTEL T C
Sbjct: 230 ANRFIAAGEILYNCYTNILLGTDERRHILKAGKCFDCTCSRCQDPTELGTHMSSFVC 286
>gi|307198438|gb|EFN79380.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 514
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 91/179 (50%), Gaps = 4/179 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
P ECI LR L + + W+ ++ +EAH + ++++D N+V++L KL+
Sbjct: 127 PQLECITPLRLLLESEKSVERWNSEVRDMEAHTKIRCQKSQWKSDHVNIVEYLRKRLKLD 186
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSG-QVL 129
F++E I CGI+++N E+ + S A++ + + H+C N+ + + ++
Sbjct: 187 R-FSEECIQMACGILEINTFEIRTAKGYSARALYPTVAMMNHSCVSNMSHTISPIDYRIR 245
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LR I L YT L T+ RR HL K+F C C RC DPTEL T +KC
Sbjct: 246 LRTTLKIPAAGELYASYTHSLLPTLLRREHLLEGKHFACACPRCSDPTELATHMSSLKC 304
>gi|170056789|ref|XP_001864189.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876476|gb|EDS39859.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 508
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 98/200 (49%), Gaps = 21/200 (10%)
Query: 7 NFTTPHPSYECIIALRCLYK-KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLL 65
NF H + + ++RCL ++ + + W+ + ++EAH+E+ + S ++ + VV+ LL
Sbjct: 92 NFLFNH--FNVVTSVRCLLLYRTDRDRFWE-MRRMEAHLEKRRGSEIWDILSKYVVEPLL 148
Query: 66 NFFKLN-------EEFTQEE--ILKICGIIQVNAHEMPLTEPS--------YIAIFDRAS 108
+ E+ T E I CG+ VNA E+ S ++ RA+
Sbjct: 149 QNGAFDRVGDGEGEKLTVNEDLIQHCCGVFDVNAFEIRGNGTSQSVQQNCFVRGLYPRAA 208
Query: 109 FIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
+ H+C PN + S + + A PI G + YT L+GT RR HL+ KYF+C
Sbjct: 209 LMAHDCVPNAFISVDGESNLRVYAAVPIEKGQMVLFNYTRCLFGTFERRAHLRKGKYFLC 268
Query: 169 QCERCRDPTELNTFYDGVKC 188
C RC DPTEL T V+C
Sbjct: 269 TCARCEDPTELGTHLCSVRC 288
>gi|350403683|ref|XP_003486875.1| PREDICTED: hypothetical protein LOC100747756 [Bombus impatiens]
Length = 814
Score = 92.8 bits (229), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 85/164 (51%), Gaps = 6/164 (3%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFL--LNFFKLNEEFTQEEILKICGIIQVN 89
++W ++ +EAH ++ +N+ +++ NVV + L+ + E + +CGI+ VN
Sbjct: 136 ELWKQVESMEAHTDKRRNTSVWKDRELNVVNVIKSLHLVPDGDPSVSELLQLLCGILDVN 195
Query: 90 AHEM----PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC 145
E+ L ++ AS + H+C N + + D+ Q+ + A PI GD +
Sbjct: 196 CFELRSPGGLDGLLLRGLYVEASLMAHDCRGNTHLTADDNFQLTVYASLPIKEGDEIYFN 255
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
YT L GT+ RR HL+ KYF C+C C DP E+ ++ + CP
Sbjct: 256 YTSSLLGTLGRREHLRGGKYFECECPLCSDPYEMGSYMSSILCP 299
>gi|332018227|gb|EGI58832.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 518
Score = 92.4 bits (228), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 91/180 (50%), Gaps = 10/180 (5%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFL--LNFFKLNEEFT 75
++ALR K P +W+K+ +EAH+++ + + + NVV + L + E
Sbjct: 114 LLALRLWIIKHKNPAVWEKIDCMEAHLDKRIGTSVWRDREVNVVDVIRRLRWPAEIESPN 173
Query: 76 QEEILKICGIIQVNAHEMPLTEPSYI------AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E + K+CGI+ VN E L PS + ++ A+ + H+C N + + D+ Q+
Sbjct: 174 TELMQKLCGILDVNTFE--LRSPSALDGLLLRGLYIEATLMAHDCRGNTHLTVDDNFQLT 231
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ A I + + YT L GT++R+ HL+ KYF C+C C DP EL + V CP
Sbjct: 232 VYASVAIKQNEPILFNYTSSLLGTVDRKDHLREGKYFECECSLCMDPYELQSHLSSVLCP 291
>gi|322792382|gb|EFZ16366.1| hypothetical protein SINV_10209 [Solenopsis invicta]
Length = 513
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 53/179 (29%), Positives = 93/179 (51%), Gaps = 8/179 (4%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF--- 74
++ALR K P +W+K+ ++EAH+++ +P + NDR V ++ ++ E
Sbjct: 114 LLALRLWLIKHKNPAVWEKIDRMEAHLDKRIGTPVW-NDREVNVVNVIRRLRIPAEIEPP 172
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
+ E + K+CGI+ VNA E+ P ++ ++ + + H+C N + + D Q+ +
Sbjct: 173 SAELMQKLCGILDVNAFELRSPGALDGFLLRGLYIEVTLMAHDCRGNTHVTVDDKFQLTV 232
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
A I + + YT L GT +R+ +L+ KYF C+C C DP EL + V CP
Sbjct: 233 YASVAIKQDETIFFNYTSSLLGTADRKDYLREGKYFECECSLCADPYELQSHLSSVLCP 291
>gi|194762638|ref|XP_001963441.1| GF20287 [Drosophila ananassae]
gi|190629100|gb|EDV44517.1| GF20287 [Drosophila ananassae]
Length = 528
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 92/177 (51%), Gaps = 3/177 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + P+ + + ++E+H EE KN +++ + VVQ L ++L E+
Sbjct: 156 YELVMIVRILLLRQRDPEQFAAIERMESHTEERRKNQVLWQHYEQKVVQRLREVWQL-ED 214
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPN-LYKSFTDSGQVLLRA 132
E+I ++CGI+ VN E+ ++ A + H+C PN + +S ++LR
Sbjct: 215 LQPEQIHQVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCSPNTAHTDHPESYAIILRT 274
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ + + L++ Y L GT+ RR + K F C C+RC DP EL + + CP
Sbjct: 275 SRRVREREALTLSYAYTLQGTLKRRSFMHEGKLFWCVCQRCSDPRELGSDCSALVCP 331
>gi|195154477|ref|XP_002018148.1| GL17552 [Drosophila persimilis]
gi|194113944|gb|EDW35987.1| GL17552 [Drosophila persimilis]
Length = 501
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 92/185 (49%), Gaps = 19/185 (10%)
Query: 20 ALRCLYKKSHQPQ--IWDKLMQLEAHVEEYKNSPKYEN---------DRRNVVQFL--LN 66
AL CL HQ +++++ Q+E H+E + + ++ + VV+ L +
Sbjct: 111 ALACLLLGEHQHSRPLFEEMSQMETHLEARRGTEIWKRYQAHAYAPLEATGVVKQLHSAS 170
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
++E Q ++ GI+ VN +E+ P + ++ RA+ H+C PNL + D
Sbjct: 171 GNAVDEILVQ----RLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAIDD 226
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
++ + A + IA G+ L ICYT L G RRH L+T K F C C RC+DPTEL T
Sbjct: 227 ERRIKVFANRQIAAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTHMS 286
Query: 185 GVKCP 189
C
Sbjct: 287 SFICS 291
>gi|195169991|ref|XP_002025797.1| GL18252 [Drosophila persimilis]
gi|194110650|gb|EDW32693.1| GL18252 [Drosophila persimilis]
Length = 533
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 96/181 (53%), Gaps = 9/181 (4%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L ++H P+ + + ++E+H EE KN + + + VV+ L + +KL++
Sbjct: 157 YELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDDL 216
Query: 74 FTQEE---ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQ 127
+EE + +ICGI+ VN E+ ++ A + H+C PN + TD S
Sbjct: 217 AAEEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPN--TAHTDDPRSYA 274
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+LLR + + + L++ Y L GT+ RR + K F C+C+RC DP EL + +
Sbjct: 275 ILLRTSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALV 334
Query: 188 C 188
C
Sbjct: 335 C 335
>gi|198467928|ref|XP_001354556.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
gi|198146172|gb|EAL31609.2| GA10806 [Drosophila pseudoobscura pseudoobscura]
Length = 533
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/181 (30%), Positives = 96/181 (53%), Gaps = 9/181 (4%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L ++H P+ + + ++E+H EE KN + + + VV+ L + +KL++
Sbjct: 157 YELVMIVRILLLRTHAPEQYAMITRMESHTEERRKNEKLWSHYEQKVVRRLRDEWKLDDL 216
Query: 74 FTQEE---ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQ 127
+EE + +ICGI+ VN E+ ++ A + H+C PN + TD S
Sbjct: 217 AAEEEAQAVHEICGILDVNCFEIGQKGAKARTLYPSAFLLAHDCSPN--TAHTDDPRSYA 274
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+LLR + + + L++ Y L GT+ RR + K F C+C+RC DP EL + +
Sbjct: 275 ILLRTSRRVREQEALTLSYAYTLQGTLKRRSFMYEGKLFWCRCQRCADPQELGSDCSALV 334
Query: 188 C 188
C
Sbjct: 335 C 335
>gi|170029522|ref|XP_001842641.1| msta [Culex quinquefasciatus]
gi|167863225|gb|EDS26608.1| msta [Culex quinquefasciatus]
Length = 489
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 92/178 (51%), Gaps = 6/178 (3%)
Query: 16 ECIIALRCLYKKSHQPQIWD-KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
+CI LR L K P+ W+ ++ +E H E S + D+ NVV++L L +
Sbjct: 110 DCITPLRVLLAKEAHPERWNAEICMMEDHRAERCGSVYWNADQNNVVRYLRLACGLKDRC 169
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSG--QVLL 130
++E I ++ GI++VNA E T Y ++ + + + H+C N+ S S +++
Sbjct: 170 SEELIQQVIGILEVNAFE-ARTHRGYAVRGLYPKLAIMAHSCVRNVVHSIHPSKDYRLIA 228
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + G L YT T R+ L+ SKYF CQCERC DPTEL T++ +KC
Sbjct: 229 RTAIDVEEGGKLYTSYTFAQDPTHYRQTTLKDSKYFSCQCERCLDPTELGTYFSSLKC 286
>gi|357611158|gb|EHJ67334.1| hypothetical protein KGM_20152 [Danaus plexippus]
Length = 397
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 75/142 (52%), Gaps = 9/142 (6%)
Query: 51 PKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPL----TEPSYIAIFDR 106
P+Y ++ R++ Q+ + EEF IL IC I+ NA +M E S ++
Sbjct: 159 PQYGSEIRDLKQYF-DIPLHEEEFM---ILVIC-ILNTNAFQMATPYGKKEMSLRGLYPV 213
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
AS + HNC PN F Q+ ++A K I G + CY+ LWGT RR +L SK+F
Sbjct: 214 ASILNHNCVPNTRNCFNGDLQMTVKATKTINAGSEIFTCYSGMLWGTPARRLYLYKSKHF 273
Query: 167 ICQCERCRDPTELNTFYDGVKC 188
+C CERC DPTE T +KC
Sbjct: 274 LCDCERCADPTERGTLLAALKC 295
>gi|170065144|ref|XP_001867817.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167882269|gb|EDS45652.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 506
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/189 (31%), Positives = 86/189 (45%), Gaps = 29/189 (15%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE-- 72
+ I +RCL ++D++M++E+HVEE + S ++ + VV LL L+
Sbjct: 99 FNVITPVRCLLLYRKNRDLFDEMMRMESHVEERRGSEIWQIHDKYVVTPLLEAGILDSLN 158
Query: 73 ----EFTQEEILKICGIIQVNAHEMPLTEPS-------YIAIFDRASFIEHNCYPNLYKS 121
E T E I +ICGI+ VN E+ S ++ + S + HNC PN S
Sbjct: 159 DGELEVTGELIQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLS 218
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
G + + P GT R+ HL+ KYF C C RC+DPTEL T
Sbjct: 219 IDGVGNLRVFTSAP----------------GTFERQTHLRQGKYFTCYCRRCKDPTELGT 262
Query: 182 FYDGVKCPE 190
+KC E
Sbjct: 263 HLSSIKCTE 271
>gi|170044116|ref|XP_001849704.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867385|gb|EDS30768.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 502
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 88/189 (46%), Gaps = 29/189 (15%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE-- 72
+ I +RCL ++D++M++E+HVEE + S ++ + VV LL L+
Sbjct: 99 FNVITPVRCLLLYRKNRDLFDEMMRMESHVEERRGSEIWQIHDKYVVTPLLEAGILDSLN 158
Query: 73 ----EFTQEEILKICGIIQVNAHEM-----PLTEPSYIA--IFDRASFIEHNCYPNLYKS 121
E T E I +ICGI+ VN E+ S +A ++ + S + HNC PN S
Sbjct: 159 DGELEVTGELIQRICGILDVNTFEIRGDVDSSQSGSNLARGLYPKTSLMVHNCVPNTLLS 218
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
G + + P GT R+ HL+ KYF C C RC+DPTEL T
Sbjct: 219 IDGVGNLRVFTSAP----------------GTFERQTHLRQGKYFTCYCRRCKDPTELGT 262
Query: 182 FYDGVKCPE 190
+KC E
Sbjct: 263 HLSSIKCTE 271
>gi|195124465|ref|XP_002006713.1| GI21216 [Drosophila mojavensis]
gi|193911781|gb|EDW10648.1| GI21216 [Drosophila mojavensis]
Length = 491
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 85/163 (52%), Gaps = 7/163 (4%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQ-----FLLNFFKLNEEFTQEEILKICGIIQV 88
+++L+Q+EAH+ +++ + ++ V++ LL + E ++ + ++ GI +
Sbjct: 121 YEELLQMEAHLARRRDTEIWRTYQKQVIEPLQSAGLLLQLRNGSEIDEDLLQRMLGIADI 180
Query: 89 NAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
N E+ P + S ++ RA H+C PN+ + + ++ L A + IA G+ L CY
Sbjct: 181 NGFEIRAPESGSSMRGVYLRAGLFAHSCVPNVVAAIDEQHRIKLYANRSIAAGEILYNCY 240
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
T+ + GT RR L+ K F CQC RC DPTEL T C
Sbjct: 241 TNIMLGTDERRQILKIGKCFDCQCVRCLDPTELGTHMSSFVCS 283
>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 517
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 89/180 (49%), Gaps = 10/180 (5%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQF--LLNFFKLNEEFT 75
++ALR K P +W+++ +E H+ E +P +++ NVV +L E +
Sbjct: 112 LLALRLWLIKCRNPAVWEQVDHMEDHLHERMGTPVWKDREVNVVNVIRMLRVPDGIEPPS 171
Query: 76 QEEILKICGIIQVNAHEMPLTEPSYI------AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E + K+CGI+ VN E L P + ++ A+ + H+C N + + D ++
Sbjct: 172 AELMQKLCGILDVNTFE--LRSPGALDGLLLRGLYVEAALMAHDCRGNTHLTVDDDFRLT 229
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ A I + + YT L GT +RR HL+ KYF C+C+ C+D EL + + CP
Sbjct: 230 VYASVAIGENEPILFNYTSSLLGTADRREHLREGKYFECECDLCKDSCELQSHLSSLLCP 289
>gi|195060014|ref|XP_001995739.1| GH17917 [Drosophila grimshawi]
gi|193896525|gb|EDV95391.1| GH17917 [Drosophila grimshawi]
Length = 545
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 93/178 (52%), Gaps = 6/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R + K H + + + ++E+H E + N+ +++ + VVQ L N ++L
Sbjct: 172 YELVLIVRIVLLKQHSAEQYANIRRMESHTAERRENATLWQHYEQKVVQRLRNDWQLANV 231
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
+ E+ +ICGI+ VN E+ ++ A + H+C PN + TD + +LL
Sbjct: 232 CDELELHEICGILDVNCFEIGQRGAKARTLYPSAFLLAHDCTPN--TAHTDDPATFAILL 289
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR +Q K F CQC+RC DP EL + + C
Sbjct: 290 RTSRRVLEQEPLTLSYAYTLQGTLKRRTFIQGGKLFWCQCQRCADPRELGSDCSALVC 347
>gi|195056333|ref|XP_001995065.1| GH22837 [Drosophila grimshawi]
gi|193899271|gb|EDV98137.1| GH22837 [Drosophila grimshawi]
Length = 495
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 8/169 (4%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVV-----QFLLNFFKLNEEFTQEEILK 81
K QPQ +++L+Q+E+H+ +N+ + R VV LL + E +E +
Sbjct: 119 KETQPQ-YEQLLQMESHLALRRNTEIWRTYREQVVLPLQSTGLLQQLRNGAEIDEELLQH 177
Query: 82 ICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPG 139
+ G VN E+ P + + ++ RA+ H+C PNL + + ++ + A + I G
Sbjct: 178 LLGTTDVNGFEIRAPESGGTLRGLYIRAALFPHSCTPNLVTAIDEERRIKVYANRYIPAG 237
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L CYT+ L GT RR L+ K F CQC RC+DPTEL T C
Sbjct: 238 EILYNCYTNILLGTEERRLILKQGKCFDCQCARCQDPTELGTHMSSFVC 286
>gi|158289967|ref|XP_559081.3| AGAP010377-PA [Anopheles gambiae str. PEST]
gi|157018413|gb|EAL41037.3| AGAP010377-PA [Anopheles gambiae str. PEST]
Length = 503
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 59/194 (30%), Positives = 94/194 (48%), Gaps = 14/194 (7%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN 66
NF H + I LRCL ++++M +E+H E +++ + + VV+ +LN
Sbjct: 76 NFLFDH--FNIITPLRCLLLYRSDRDKYNEMMAMESHFESRRDTEIWHIHNQYVVEPMLN 133
Query: 67 ---FFKLNEEFTQEEIL-KICGIIQVNAHEMPLTEPS--------YIAIFDRASFIEHNC 114
F ++++ E+L +ICGI+ VN E+ S ++ + S + HNC
Sbjct: 134 EKDFQQIDDLTVTGELLQRICGILDVNTFEIRGNMDSQGVQMNNLARGLYPKTSLMTHNC 193
Query: 115 YPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
N + ++ L + I G+ L YT L+ T R+ HL+ KYFIC C RC
Sbjct: 194 QTNTLIAVDGMSKLRLYSSIGIKAGELLYYNYTRVLFSTFERQTHLRKGKYFICNCARCS 253
Query: 175 DPTELNTFYDGVKC 188
DPTEL T +KC
Sbjct: 254 DPTELGTHLSSLKC 267
>gi|195566137|ref|XP_002106647.1| GD16997 [Drosophila simulans]
gi|194204029|gb|EDX17605.1| GD16997 [Drosophila simulans]
Length = 532
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 92/178 (51%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+
Sbjct: 160 YELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-ED 218
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
E++ ++CGI+ VN E+ ++ A + H+C PN + TD S ++LL
Sbjct: 219 LEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPN--TAHTDDPRSFEILL 276
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 277 RTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 334
>gi|24641162|ref|NP_572675.1| CG11160, isoform A [Drosophila melanogaster]
gi|22832070|gb|AAF47987.2| CG11160, isoform A [Drosophila melanogaster]
Length = 532
Score = 89.4 bits (220), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 92/178 (51%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+
Sbjct: 160 YELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-ED 218
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
E++ ++CGI+ VN E+ ++ A + H+C PN + TD S ++LL
Sbjct: 219 LEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPN--TAHTDDPSSFEILL 276
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 277 RTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 334
>gi|24641164|ref|NP_727478.1| CG11160, isoform B [Drosophila melanogaster]
gi|22832071|gb|AAN09278.1| CG11160, isoform B [Drosophila melanogaster]
Length = 403
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 90/176 (51%), Gaps = 3/176 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+
Sbjct: 160 YELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-ED 218
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRA 132
E++ ++CGI+ VN E+ ++ A + H+C PN + S ++LLR
Sbjct: 219 LEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPSSFEILLRT 278
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 279 SRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 334
>gi|125809191|ref|XP_001361025.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
gi|54636198|gb|EAL25601.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
Length = 501
Score = 89.0 bits (219), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 92/184 (50%), Gaps = 19/184 (10%)
Query: 20 ALRCLYKKSHQPQ--IWDKLMQLEAHVEEYKNSP---KYEN------DRRNVVQFL--LN 66
AL CL HQ +++++ Q+E H+E + + +Y+ + VV+ L +
Sbjct: 111 ALACLLLGEHQHSRPLFEEMSQMETHLEARRGTEIWNRYQAHAYAPLEATGVVKQLHSAS 170
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
++E Q ++ GI+ VN +E+ P + ++ RA+ H+C PNL + D
Sbjct: 171 GNAVDEILVQ----RLLGILDVNVYEIRAPEQGGAMRGLYRRAALFAHSCMPNLETAIDD 226
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
++ + A + I G+ L ICYT L G RRH L+T K F C C RC+DPTEL T
Sbjct: 227 ERRIKVFANRQITAGEILYICYTSVLLGNEERRHILKTGKCFDCSCARCQDPTELGTHMS 286
Query: 185 GVKC 188
C
Sbjct: 287 SFIC 290
>gi|20152089|gb|AAM11404.1| RE25548p [Drosophila melanogaster]
Length = 403
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 90/176 (51%), Gaps = 3/176 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+
Sbjct: 160 YELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-ED 218
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRA 132
E++ ++CGI+ VN E+ ++ A + H+C PN + S ++LLR
Sbjct: 219 LEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPNTAHTDDPSSFEILLRT 278
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 279 SRRVREREALTLGYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 334
>gi|195479534|ref|XP_002100923.1| GE15901 [Drosophila yakuba]
gi|194188447|gb|EDX02031.1| GE15901 [Drosophila yakuba]
Length = 532
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 91/178 (51%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+
Sbjct: 160 YELVMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-ED 218
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
E++ +CGI+ VN E+ ++ A + H+C PN + TD S ++LL
Sbjct: 219 LEAEQVHDVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPN--TAHTDDPRSFEILL 276
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 277 RTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 334
>gi|158297510|ref|XP_317734.4| AGAP007779-PA [Anopheles gambiae str. PEST]
gi|157015230|gb|EAA12332.4| AGAP007779-PA [Anopheles gambiae str. PEST]
Length = 498
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 90/186 (48%), Gaps = 2/186 (1%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQ-LEAHVEEYKNSPKYENDRRNVVQF 63
+ + T +CI LR L K W + ++ +E H E + ++ D N+V F
Sbjct: 102 VEDSTAGCVQLDCITPLRVLLAKESDEARWKREIEPMEYHDAERREGANWKVDENNIVAF 161
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSF 122
L L + F+ E + + G++ VNA E S ++ + + + HNC PN+ S
Sbjct: 162 LRGPCGLGKRFSAELVQRAIGLLDVNAFEGRTGSGYSCRGLYPQLAIMAHNCVPNVVHSI 221
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
S ++ R + GD L YT L GT+ R+ L+ SK+F C+C RC DPTEL T
Sbjct: 222 HPSNGMVARVAVDVKEGDKLYTTYTYTLTGTVARQSILKASKFFTCRCTRCLDPTELGTH 281
Query: 183 YDGVKC 188
+ + C
Sbjct: 282 FSTLLC 287
>gi|328713664|ref|XP_003245146.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 464
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 85/187 (45%), Gaps = 14/187 (7%)
Query: 14 SYECIIALRC-LYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE 72
+Y ++ LRC L + + L++H+++ N+P Y NV F+L+ L
Sbjct: 109 AYCFVMPLRCMLLLHQRDGRRAAEFRSLQSHLDDRLNTPLYRAYAVNVAAFVLDRLGLRS 168
Query: 73 E---FTQEEILKICGIIQVNAHEMPLTEPS---YIAIFDRASFIEHNCYPNLYKSFTDS- 125
L+ ++ NA E + P + A++ RAS + H C PN F
Sbjct: 169 AGYGHNHRSALEAAAVLDTNAFE--VRRPGGRKFRAVYGRASMMAHCCTPNTKHVFIGDE 226
Query: 126 --GQVLLR--AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
GQ +R A PIA G ++ YT LW T +RR HL +K F C C RC DP EL T
Sbjct: 227 TDGQPTIRVVAAVPIARGYPITATYTQTLWCTRDRRRHLSAAKCFECACARCADPVELGT 286
Query: 182 FYDGVKC 188
V C
Sbjct: 287 HLGSVAC 293
>gi|312373518|gb|EFR21239.1| hypothetical protein AND_17329 [Anopheles darlingi]
Length = 445
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 98/188 (52%), Gaps = 4/188 (2%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWDK-LMQLEAHVEEYKNSPKYENDRRNVVQF 63
+ + T +CI LR L K P+ W++ ++ +E H E+ + ++ D N+V F
Sbjct: 47 VEDSTAGCVQLDCITPLRMLLAKEANPERWEREIVPMEYHDEKRREGANWKVDDHNIVSF 106
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSF 122
L N L + F+ + I ++ G++ VNA E S ++ + + + H+C PN+ S
Sbjct: 107 LRNACGLADRFSVDLIQQVIGVLDVNAFEGRTCNGYSVRGLYPQLAIMAHSCVPNVVHSI 166
Query: 123 --TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
+++ ++ R +A G+ L YT L GT+ R+ L+ SK+F C C RC DP+EL
Sbjct: 167 HPSEAFRLEARVAVDVAEGEKLYTTYTYTLTGTVARQAILKASKFFTCMCPRCVDPSELG 226
Query: 181 TFYDGVKC 188
T + + C
Sbjct: 227 THFSSLLC 234
>gi|194882335|ref|XP_001975267.1| GG20655 [Drosophila erecta]
gi|190658454|gb|EDV55667.1| GG20655 [Drosophila erecta]
Length = 498
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/182 (29%), Positives = 91/182 (50%), Gaps = 19/182 (10%)
Query: 20 ALRCLYKKSHQPQI--WDKLMQLEAHVEEYKNSPKYEN---------DRRNVVQFLLNFF 68
L+CL + H+ + ++++ Q+E+ + + + ++N D V++ L
Sbjct: 111 VLKCLLLREHEETLPLYEEMSQMESQLATRRGTDIWKNYQEHAFAPLDAGGVLKQLRG-- 168
Query: 69 KLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG 126
+E+ Q + GI+ +NA+E+ P + + ++ RA H+C PNL S D
Sbjct: 169 SADEDLVQ----GLLGILDINAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDEK 224
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ + A + IA G+ L CYT+ L GT RR L+ K F C C RC+DPTEL T
Sbjct: 225 RIKVYANRFIAAGEILYNCYTNVLLGTEERRQILKEGKCFDCTCARCQDPTELGTHMSSF 284
Query: 187 KC 188
C
Sbjct: 285 MC 286
>gi|194889892|ref|XP_001977181.1| GG18383 [Drosophila erecta]
gi|190648830|gb|EDV46108.1| GG18383 [Drosophila erecta]
Length = 402
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 91/178 (51%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE I+ +R L + H P+ + + ++E+H EE + N+ + + VV L ++L E+
Sbjct: 159 YELIMIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVHRLRVTWQL-ED 217
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
E++ ++CGI+ VN E+ ++ A + H+C PN + TD S ++LL
Sbjct: 218 LEAEQVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPN--TAHTDDPRSFEILL 275
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 276 RTSRRVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 333
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 86.3 bits (212), Expect = 5e-15, Method: Composition-based stats.
Identities = 50/163 (30%), Positives = 81/163 (49%), Gaps = 7/163 (4%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAH 91
Q W+ L L H E +P + + R + LL F EE++++ + VNAH
Sbjct: 223 QQWENLYALMHHREAM--APDWISVVREAGEDLLQLLPEWVRFDVEEVVQLACRVNVNAH 280
Query: 92 EMPLTEPSYIAI----FDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
+ + + I F + I H C PN + G + +R ++P++ G LS+ Y
Sbjct: 281 GLRDDSGANLVIGVGMFPLTAMINHACRPNCTFVYF-GGNLEVRTLEPVSAGAELSVYYI 339
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
D L T RR L TSK+F+C+C RC +P+ ++ + DGV C +
Sbjct: 340 DLLQSTAARRQELLTSKHFLCKCSRCENPSSMDDYLDGVCCTD 382
>gi|91088045|ref|XP_966477.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012081|gb|EFA08529.1| hypothetical protein TcasGA2_TC006182 [Tribolium castaneum]
Length = 481
Score = 86.3 bits (212), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 62/192 (32%), Positives = 94/192 (48%), Gaps = 16/192 (8%)
Query: 2 KIKISNFTTPHP---SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRR 58
++K ++ P+ YE I LR L K P+ W +EE K +
Sbjct: 90 RVKFNDVDDPYDVCLQYEGITPLRMLLAKEKDPKRW---------LEEI-GGLKIRKNNV 139
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPN 117
NVV F+ N K+ ++F + I +CGI+ +NA E+ I ++ + S I HNC N
Sbjct: 140 NVVDFVRNVCKM-DQFCDDLIHTVCGILDLNAIEVRAPSGYLIRCLYPKLSQISHNCVTN 198
Query: 118 LYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
L+++ T + ++ LRA I+ L Y PLW T+ RR L+ +K C C+RC D
Sbjct: 199 LFQTVTPEDFKITLRASGNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCCCKRCGDK 258
Query: 177 TELNTFYDGVKC 188
TEL T +KC
Sbjct: 259 TELRTHLSTLKC 270
>gi|195335121|ref|XP_002034224.1| GM21751 [Drosophila sechellia]
gi|194126194|gb|EDW48237.1| GM21751 [Drosophila sechellia]
Length = 498
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 90/176 (51%), Gaps = 7/176 (3%)
Query: 20 ALRCLYKKSHQPQI--WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN---FFKLNEEF 74
L+CL + H+ + ++++ Q+E+ + + + ++N + + L + +L
Sbjct: 111 VLKCLLLREHEETLPLYEEMSQMESQLMTRRGTDIWKNYQEHAFTPLNSGGVLAQLRGAA 170
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
++ + + GI+ +NA+E+ P + ++ RA H+C PNL S D ++ + A
Sbjct: 171 DEDLVQGLLGILDINAYEIRAPEVGGAMRGLYSRAGLFAHSCMPNLVISIDDERRIKVYA 230
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ IA G+ L CYT+ L GT RR L+ K F C C RC+DPTEL T C
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286
>gi|195488849|ref|XP_002092487.1| GE11640 [Drosophila yakuba]
gi|194178588|gb|EDW92199.1| GE11640 [Drosophila yakuba]
Length = 497
Score = 85.9 bits (211), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 92/176 (52%), Gaps = 7/176 (3%)
Query: 20 ALRCLYKKSHQPQI--WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN---FFKLNEEF 74
AL+CL + H+ + ++++ Q+E+ + + + +++ + + L + +L
Sbjct: 111 ALKCLLLREHEETLPLYEEMSQMESQLASRRGTDIWKSYQEHAFAPLDSGGVLKQLRGSA 170
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
++ + + GI+ VNA+E+ P + + ++ RA H+C PNL S D ++ + A
Sbjct: 171 DEDLVQGLLGILDVNAYEIRAPESGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYA 230
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ IA G+ L CYT+ L GT R+ L+ K F C C RC+DPTEL T C
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERQQILKEGKCFDCTCPRCKDPTELGTHMSSFMC 286
>gi|195584110|ref|XP_002081858.1| GD11242 [Drosophila simulans]
gi|194193867|gb|EDX07443.1| GD11242 [Drosophila simulans]
Length = 498
Score = 85.5 bits (210), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 90/176 (51%), Gaps = 7/176 (3%)
Query: 20 ALRCLYKKSHQPQI--WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLN---FFKLNEEF 74
L+CL + H+ + ++++ Q+E+ + + + ++N + + L + +L
Sbjct: 111 VLKCLLLREHEETLPLYEEMSQMESQLMTRRGTDIWKNYQEHAFTPLNSGGVLAQLRGAA 170
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
++ + + GI+ +NA+E+ P + ++ RA H+C PNL S D ++ + A
Sbjct: 171 DEDLVQGLLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCMPNLVISIDDERRIKVYA 230
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ IA G+ L CYT+ L GT RR L+ K F C C RC+DPTEL T C
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERRQILKEGKCFDCSCPRCQDPTELGTHMSSFIC 286
>gi|24654327|ref|NP_611182.1| CG9642 [Drosophila melanogaster]
gi|7302839|gb|AAF57913.1| CG9642 [Drosophila melanogaster]
Length = 498
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 91/178 (51%), Gaps = 7/178 (3%)
Query: 20 ALRCLYKKSHQPQI--WDKLMQLEAHVEEYKNSPKYENDRRNV---VQFLLNFFKLNEEF 74
L+CL + H+ + ++++ Q+++ + + + ++N + + + + +L
Sbjct: 111 VLKCLLLREHEETLPLYEEMSQMDSQLMTRRGTEVWKNYQEHAFTPLDYGGVLAQLRGAA 170
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
++ + + GI+ +NA+E+ P + ++ RA H+C PNL S D ++ + A
Sbjct: 171 DEDLVQGLLGILDINAYEIRAPEVGGAMRGLYRRAGLFAHSCTPNLVISIDDEQRIKVYA 230
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
+ IA G+ L CYT+ L GT RR L+ K F C C RC+DPTEL T C +
Sbjct: 231 NRFIAAGEILYNCYTNVLLGTEERRKILKVGKCFDCSCPRCQDPTELGTHMSSFICSQ 288
>gi|442615898|ref|NP_001259436.1| CG11160, isoform C [Drosophila melanogaster]
gi|440216646|gb|AGB95279.1| CG11160, isoform C [Drosophila melanogaster]
Length = 369
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 89/175 (50%), Gaps = 7/175 (4%)
Query: 19 IALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
+ +R L + H P+ + + ++E+H EE + N+ + + VVQ L ++L E+ E
Sbjct: 1 MIVRILLLRQHDPEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRVTWQL-EDLEAE 59
Query: 78 EILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLLRAMK 134
++ ++CGI+ VN E+ ++ A + H+C PN + TD S ++LLR +
Sbjct: 60 QVHEVCGILDVNCFEIGQNGAKARTLYPSAFLLAHDCTPN--TAHTDDPSSFEILLRTSR 117
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ + L++ Y L GT+ RR + K F C C RC DP EL T + C
Sbjct: 118 RVREREALTLSYAYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVCA 172
>gi|195441983|ref|XP_002068740.1| GK17864 [Drosophila willistoni]
gi|194164825|gb|EDW79726.1| GK17864 [Drosophila willistoni]
Length = 504
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 14/182 (7%)
Query: 21 LRCLYKKSHQP--QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE------ 72
++CL + H +++++ Q++A ++ + + + N + NV L + L +
Sbjct: 112 VKCLLLREHTETRSLYEEMYQMDAQLDARRGTEVWNNYQANVANPLESSGILGKLWQGSD 171
Query: 73 -EFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG--- 126
E E + ++ I +NA E+ P + S I+ RAS H+C PNL + D+
Sbjct: 172 LEKDVELVQRLLAIFDINAFEIRAPESGGSMRGIYRRASLFPHSCMPNLVSAIDDASNGR 231
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
Q+ + A + IA G+ L CYT+ L GT RR L+ K F C C RC DPTEL T
Sbjct: 232 QLKMYASRFIAAGEILYNCYTNVLLGTQERREILKQGKCFNCNCPRCVDPTELGTHMSSF 291
Query: 187 KC 188
C
Sbjct: 292 MC 293
>gi|321473703|gb|EFX84670.1| hypothetical protein DAPPUDRAFT_314752 [Daphnia pulex]
Length = 497
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 93/178 (52%), Gaps = 3/178 (1%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
Y+CI LR + P+ + LE H + + DR+ VV+ +LN ++L E
Sbjct: 158 YDCIAPLRIVLDNHRNPERSRLWLGLEDHRLYRQKIGIWHVDRKTVVEPVLNKWQLGENT 217
Query: 75 TQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLR 131
+EE+ + CGI++VNA E+ + S A++ A + HNC PN + Q + +R
Sbjct: 218 NEEELQRACGILEVNAFEVCDDDKTISARAVYSHACLMAHNCVPNTACCVNPATQRMTVR 277
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
A I G ++ YT L GT+ RR HL+ +K+F C C RC DPTEL + + CP
Sbjct: 278 AAVDIPAGQMITTSYTFTLDGTLRRRSHLKETKFFDCCCPRCCDPTELGSHLSSLACP 335
>gi|195399143|ref|XP_002058180.1| GJ15632 [Drosophila virilis]
gi|194150604|gb|EDW66288.1| GJ15632 [Drosophila virilis]
Length = 527
Score = 82.4 bits (202), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 92/178 (51%), Gaps = 7/178 (3%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEE 73
YE ++ +R + P ++ ++ ++E+H + N+ +++ + VV+ L + ++L ++
Sbjct: 155 YELVLIVRIALLRHLTPDLYAQIRRMESHTAARRENATLWQHYEQKVVRRLRDDWQL-QQ 213
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLL 130
E+ +ICGI+ VN E+ ++ A + H+C PN + TD S +LL
Sbjct: 214 LEAAELHEICGILDVNCFEIGQHGAKARTLYPSAFLLAHDCSPN--TAHTDDPLSYAILL 271
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R + + + L++ Y L GT+ RR +Q K F C+C RC DP EL + + C
Sbjct: 272 RTSRAVREQETLTLSYAYTLQGTLKRRSFIQGGKLFWCRCRRCADPRELGSDCSALVC 329
>gi|195578361|ref|XP_002079034.1| GD23739 [Drosophila simulans]
gi|194191043|gb|EDX04619.1| GD23739 [Drosophila simulans]
Length = 325
Score = 82.0 bits (201), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 64/115 (55%), Gaps = 5/115 (4%)
Query: 80 LKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSF--TDSGQVLLRAMKP 135
+++ G+++VNA E P P +F + HNC PN +S ++ ++ LRAM
Sbjct: 1 MQVVGVLEVNAFEARSPKGYP-LRCLFPYTGILAHNCVPNTARSIYPSEGYKIRLRAMVD 59
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
+ G L YT L GT R+ HL+ K+F CQCERC DPTEL T + +KC +
Sbjct: 60 LEEGQPLHHSYTYTLDGTAQRQKHLKQGKFFTCQCERCLDPTELGTHFSSLKCGQ 114
>gi|307173919|gb|EFN64667.1| Protein msta, isoform B [Camponotus floridanus]
Length = 513
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 89/181 (49%), Gaps = 22/181 (12%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYEND--RRNVVQFLLNFFKLNEEFT 75
+I +R L Q ++ L E H+E + +E D +RNV +F N+E
Sbjct: 141 VIPIRGLTLSKEQRKL---LYAFECHLELTRT---HEIDLLKRNVKKFP------NKE-- 186
Query: 76 QEEILK-ICGIIQVNAHEMPLTE-----PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
Q E++K ICG+ N+ E+ + S ++ + H C PN F D Q+
Sbjct: 187 QLELMKRICGVFNTNSFEVLVASNKDCTTSLRGLYPLGALQNHCCVPNTRHHFDDQQQLH 246
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ A PIA G+ +++ YTD LW T +RR L+ +K+F C C RC DP+E + + C
Sbjct: 247 VTATLPIAAGEEITMSYTDLLWDTSSRRQFLKATKHFSCNCSRCSDPSEFGSQLGALLCA 306
Query: 190 E 190
+
Sbjct: 307 K 307
>gi|307172339|gb|EFN63827.1| Protein msta, isoform A [Camponotus floridanus]
Length = 352
Score = 80.5 bits (197), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 62/121 (51%), Gaps = 8/121 (6%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYI------AIFDRASFIEHNCYPNLYKSFTDSGQV 128
+ E I K+CGI+ +N E L PS + ++ AS + H+C N Y + D+ Q+
Sbjct: 9 SAELIQKLCGILDINTFE--LRSPSILDGLLLRGLYIEASLMSHDCRGNTYLTMDDNFQL 66
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ A PI + + YT L GT RR HL+ KY C+C C+DP EL + V C
Sbjct: 67 TVYASVPIKQNEPILFNYTSSLLGTAERRQHLREGKYLECECSLCKDPYELQSHLSSVLC 126
Query: 189 P 189
P
Sbjct: 127 P 127
>gi|345480361|ref|XP_003424136.1| PREDICTED: protein msta, isoform A-like [Nasonia vitripennis]
Length = 501
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 64/133 (48%), Gaps = 5/133 (3%)
Query: 61 VQFLLNFFKLNEEFTQEEILK-ICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCY 115
V L + K + + EE+++ +C + N+ E + S A+F + H C
Sbjct: 162 VDMLKSNVKGDIDADDEELMRRVCRAMDTNSFETAVIRGGKSTSLRALFPVGALTNHQCV 221
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
PN G++L+ A PIA G +++ Y D LW T RRH L +K+F CQC RC D
Sbjct: 222 PNTRHIVNAEGELLVYAAVPIAEGQEITMSYADVLWDTQMRRHFLLATKHFACQCPRCTD 281
Query: 176 PTELNTFYDGVKC 188
TE + + C
Sbjct: 282 VTECGSLLGALSC 294
>gi|307211936|gb|EFN87850.1| Protein msta, isoform B [Harpegnathos saltator]
Length = 331
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 82/180 (45%), Gaps = 20/180 (11%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYEND--RRNVVQFLLNFFKLNEEFT 75
++ +R L S Q ++ L E H+ +P YE D +RN+ L +E
Sbjct: 130 VVLIRALALSSEQRRL---LFAFECHLNL---APNYEIDLLKRNITH-------LPDEEQ 176
Query: 76 QEEILKICGIIQVNAHEMPLTEP-----SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
+ +ICGI N+ E+ S I+ + H C PN F D ++ +
Sbjct: 177 LALMKRICGIFNTNSFEVVAVRDKDRVVSLRGIYPIGALQNHCCVPNTRHHFDDQQRLYM 236
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
A PIA G +++ YTD LW T +RR L+ +K+F C C RC DP E + + C +
Sbjct: 237 SAALPIAAGQEITMSYTDLLWDTRSRRCFLRVTKHFSCDCNRCSDPLEFGSQLGALLCAK 296
>gi|270007661|gb|EFA04109.1| hypothetical protein TcasGA2_TC014347 [Tribolium castaneum]
Length = 497
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 69/133 (51%), Gaps = 5/133 (3%)
Query: 61 VQFLLNFFKLNEEFTQEEILK-ICGIIQVNAHEMPL----TEPSYIAIFDRASFIEHNCY 115
V + + LN + +E+ ++ +C ++ NA E+ + + S ++ S H+C
Sbjct: 161 VDVMKDALALNVKEDEEKFMRFVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCC 220
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
PN F D +++RA K I G + Y+ +W T RR HL +K+F+C+C+RC D
Sbjct: 221 PNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCED 280
Query: 176 PTELNTFYDGVKC 188
PTE ++ + C
Sbjct: 281 PTEFGSYIGSILC 293
>gi|189237463|ref|XP_967982.2| PREDICTED: similar to msta CG33548-PB [Tribolium castaneum]
Length = 387
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 69/133 (51%), Gaps = 5/133 (3%)
Query: 61 VQFLLNFFKLNEEFTQEEILK-ICGIIQVNAHEMPL----TEPSYIAIFDRASFIEHNCY 115
V + + LN + +E+ ++ +C ++ NA E+ + + S ++ S H+C
Sbjct: 161 VDVMKDALALNVKEDEEKFMRFVCSVMDSNAFEVLVGFEDNQASVKGLYPLGSLANHSCC 220
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
PN F D +++RA K I G + Y+ +W T RR HL +K+F+C+C+RC D
Sbjct: 221 PNTCHVFDDKQHMVVRASKFIPQGSEIFHSYSRLIWSTSARRFHLYRTKHFLCKCQRCED 280
Query: 176 PTELNTFYDGVKC 188
PTE ++ + C
Sbjct: 281 PTEFGSYIGSILC 293
>gi|332016864|gb|EGI57673.1| Protein msta, isoform A [Acromyrmex echinatior]
Length = 421
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 63/132 (47%), Gaps = 7/132 (5%)
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLT-------EPSYIAIFDRASFIEHNCYPNL 118
N KL E E + + CG+ N+ E+ + S ++ + H C PN
Sbjct: 104 NVAKLPSEEQMELMRRTCGVFNTNSFEVVVVPSRDNDYTTSLRGLYPMGALQNHCCVPNT 163
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
F D ++ + A+ PIA G+ +++ YTD LW T +RR L+ +K F C C RC DP E
Sbjct: 164 RHHFDDQQRLYVSAVLPIAAGEEITMSYTDLLWDTSSRRQFLKVTKRFSCNCNRCSDPLE 223
Query: 179 LNTFYDGVKCPE 190
+ + C +
Sbjct: 224 FGSQLSTLLCAK 235
>gi|195347880|ref|XP_002040479.1| GM18906 [Drosophila sechellia]
gi|194121907|gb|EDW43950.1| GM18906 [Drosophila sechellia]
Length = 302
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 34 RREIIQAAQCFRNFPTTDRMFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 93
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LWG + R L+ +K+F C C
Sbjct: 94 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDC 153
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 154 VRCNDNTENGTYLSALFCRE 173
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 720 RREIIQAAQCFRNFPTTDRVFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 779
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LWG + R L+ +K+F C C
Sbjct: 780 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDC 839
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 840 VRCHDNTENGTYLSALFCRE 859
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
V+ HE+ + A+F A + H C PN F + +++ A + I G +++ Y
Sbjct: 219 VDGHEVLVR-----ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYA 273
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPT 177
LW T+ R+ L +K+FIC+C RC+DPT
Sbjct: 274 KLLWSTLARKIFLGMTKHFICKCVRCQDPT 303
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 191 RREIIQAAQCFRNFPTTDRVFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 250
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LWG + R L+ +K+F C C
Sbjct: 251 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDC 310
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 311 VRCHDNTENGTYLSALFCRE 330
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 76.6 bits (187), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 191 RREIIQAAQCFRNFPTTDRVFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 250
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LWG + R L+ +K+F C C
Sbjct: 251 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDC 310
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 311 VRCHDNTENGTYLSALFCRE 330
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 65/140 (46%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 224 RREIIQAAQCFRNFPTTDRVFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 283
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LWG + R L+ +K+F C C
Sbjct: 284 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDC 343
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 344 VRCHDNTENGTYLSALFCRE 363
>gi|195040117|ref|XP_001991004.1| GH12314 [Drosophila grimshawi]
gi|193900762|gb|EDV99628.1| GH12314 [Drosophila grimshawi]
Length = 611
Score = 76.3 bits (186), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R Y Q + D + +A+ E Y + Q F ++
Sbjct: 308 SLRILTAVRVFYLSKEQRALVDAM---QANAER-----GYRQEIIKAAQCFRKFPTTDKP 359
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ ++ G++ NA E P E +F + + H C PN F + +
Sbjct: 360 F-MDQLFRVVGVLNTNAFEAPCRVDTHETLLRGLFPLTAIMNHECTPNASHYFENGRLAV 418
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LW + R L+ +KYF+C C+RC D +E T+ + C
Sbjct: 419 VRAARAIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLAALFCR 478
Query: 190 E 190
E
Sbjct: 479 E 479
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 43/76 (56%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H C PN F D +++ A + I G +++ Y LW T+ R+ L
Sbjct: 225 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPMGAEITMSYAKLLWSTLARKMFLG 284
Query: 162 TSKYFICQCERCRDPT 177
+K+F+C+C RC+DPT
Sbjct: 285 MTKHFMCRCPRCQDPT 300
>gi|195397183|ref|XP_002057208.1| GJ16480 [Drosophila virilis]
gi|194146975|gb|EDW62694.1| GJ16480 [Drosophila virilis]
Length = 600
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R Y Q + D + +A+ E Y + Q F ++
Sbjct: 297 SLRILTAVRVFYLSKEQRALVDAM---QANAER-----GYRQEIIKAAQCFRKFPTTDKP 348
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ ++ G++ NA E P E +F + + H C PN F + +
Sbjct: 349 F-MDQLFRVVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHYFENGRLAV 407
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LW + R L+ +KYF+C C+RC D +E T+ + C
Sbjct: 408 VRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLSALFCR 467
Query: 190 E 190
E
Sbjct: 468 E 468
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H C PN F D +++ A + I G +++ Y LW T+ R+ L
Sbjct: 213 ALFPLAGMLNHQCTPNAAHHFEDGETIVVTATERIPLGAEITMSYAKLLWSTLARKMFLG 272
Query: 162 TSKYFICQCERCRDPTELNT 181
+K+F+CQC RC+DPT+ +
Sbjct: 273 MTKHFMCQCPRCQDPTQTDA 292
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 79/181 (43%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R Y Q + D + +A+ E Y + Q F ++
Sbjct: 157 SLRILTAVRVFYLSKEQRALVDAM---QANAER-----GYRQEIIKAAQCFRKFPTTDKP 208
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ ++ G++ NA E P E +F + + H C PN F + +
Sbjct: 209 F-MDQLFRVVGVLNTNAFEAPCRVDGHESLLRGLFPLTAIMNHECTPNASHYFENGRLAV 267
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LW + R L+ +KYF+C C+RC D +E T+ + C
Sbjct: 268 VRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCNDNSENGTYLSALFCR 327
Query: 190 E 190
E
Sbjct: 328 E 328
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 80/181 (44%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R + Q + D + +A+ E + + Q NF ++
Sbjct: 484 SLRILTAVRVFHLGKEQRHLVDAM---QANAER-----GFRQEIIKAAQCFRNFPTTDKP 535
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ +I G++ NA E P E +F + + H C PN F + +
Sbjct: 536 F-MDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAI 594
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LWG + R L+ +K+F+C C RC D TE T+ + C
Sbjct: 595 VRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFCR 654
Query: 190 E 190
E
Sbjct: 655 E 655
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H C PN F D +++ A + IA G +++ Y LW T+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 162 TSKYFICQCERCRDPTELNT 181
+K F+CQC RC+DPT L +
Sbjct: 297 MTKGFMCQCPRCQDPTSLTS 316
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 75.9 bits (185), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 80/181 (44%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R + Q + D + +A+ E + + Q NF ++
Sbjct: 541 SLRILTAVRVFHLGKEQRHLVDAM---QANAER-----GFRQEIIKAAQCFRNFPTTDKP 592
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ +I G++ NA E P E +F + + H C PN F + +
Sbjct: 593 F-MDQLFRIVGVLNTNAFEAPCRTDGRETLLRGLFPLTAIMNHECTPNASHYFDNGRLAI 651
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LWG + R L+ +K+F+C C RC D TE T+ + C
Sbjct: 652 VRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCNDNTENGTYLSALFCR 711
Query: 190 E 190
E
Sbjct: 712 E 712
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 46/85 (54%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H C PN F D +++ A + IA G +++ Y LW T+ R+ L
Sbjct: 237 ALFPLAGLLNHQCTPNAGHHFQDGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLG 296
Query: 162 TSKYFICQCERCRDPTELNTFYDGV 186
+K F+CQC RC+DPT +GV
Sbjct: 297 MTKGFMCQCPRCQDPTLRLMLSEGV 321
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 76/181 (41%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R + Q + D MQ A Y + Q NF + E
Sbjct: 142 SLRILTAVRVFHLSKEQRHLVDA-MQANAQ-------RGYRQEIIKAAQCFRNFPTTDRE 193
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ +I G++ NA E P + +F + + H C PN F + +
Sbjct: 194 F-MDQLFRIVGVLNTNAFEAPCRSGGHEVLLRGLFPLTAIMNHECTPNASHYFDNGTLAV 252
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LWG + R L+ +K F C C RC D TE T+ + C
Sbjct: 253 VRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKNFNCDCVRCNDNTENGTYLSALFCR 312
Query: 190 E 190
E
Sbjct: 313 E 313
>gi|195381153|ref|XP_002049319.1| GJ20817 [Drosophila virilis]
gi|194144116|gb|EDW60512.1| GJ20817 [Drosophila virilis]
Length = 489
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 82/165 (49%), Gaps = 6/165 (3%)
Query: 30 QPQIWDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFKL----NEEFTQEEILKICG 84
Q + L+ +E+Y+N +R VV+ L++ + +++ T E + C
Sbjct: 119 QRALCQTLLDTPVKLEDYRNEDGLWQERDEQVVRPLMSSGLVQAFPDQQLTAEVLHAHCI 178
Query: 85 IIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLS 143
I +A+E+ + + ++ R + + H+C PN + + + L A P+ GD +
Sbjct: 179 RIDNSAYEVTARDGDTLKGVYVRGASLPHSCVPNTVVALDEQFNMKLYAAVPLESGDIIY 238
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
YT+PL GT R+H L+ S++ C C RC D TELNT +KC
Sbjct: 239 TSYTNPLMGTSQRQHQLRLSRHKECACTRCLDATELNTHMSSMKC 283
>gi|189239865|ref|XP_966657.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 356
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 74/135 (54%), Gaps = 5/135 (3%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYP 116
NVV F+ K+ ++F + I +CGI+ +NA E+ Y+ ++ + S I HNC
Sbjct: 15 NVVDFVRIVCKM-DQFCDDLIHTVCGILDLNAIEVR-APSGYLTRCLYPKLSQISHNCVT 72
Query: 117 NLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
NL+++ T + ++ LRA I+ L Y PLW T+ RR L+ +K C+C+RC D
Sbjct: 73 NLFQTVTPEDFKITLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGD 132
Query: 176 PTELNTFYDGVKCPE 190
TEL T +KC +
Sbjct: 133 KTELRTHLSTLKCSK 147
>gi|21430220|gb|AAM50788.1| LD23906p [Drosophila melanogaster]
Length = 417
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+E T + + C I N+ E+ + + IF + + H+C PN + + +
Sbjct: 94 TQELTSDALHAHCIRIDSNSFEVTAKDGDTLKGIFVWGATLPHHCVPNTVVALDEQFNMK 153
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ PGD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 154 LYAAVPLQPGDIIYNSYTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKCK 213
Query: 190 E 190
E
Sbjct: 214 E 214
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/108 (35%), Positives = 60/108 (55%), Gaps = 3/108 (2%)
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
T +EI++I I+ N HEM PL +Y + IF S++ H+C PN + D G ++ R
Sbjct: 404 TIDEIIEIYSIVLSNGHEMLHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQGMMVFR 463
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
++PI G+ + YTD RR HL +F CQC++C+ ++L
Sbjct: 464 TLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQCKFQSKL 511
>gi|24654325|ref|NP_611181.1| CG9640, isoform A [Drosophila melanogaster]
gi|442623987|ref|NP_001261039.1| CG9640, isoform B [Drosophila melanogaster]
gi|7302840|gb|AAF57914.1| CG9640, isoform A [Drosophila melanogaster]
gi|54650638|gb|AAV36898.1| RE22408p [Drosophila melanogaster]
gi|440214465|gb|AGB93571.1| CG9640, isoform B [Drosophila melanogaster]
Length = 488
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+E T + + C I N+ E+ + + IF + + H+C PN + + +
Sbjct: 165 TQELTSDALHAHCIRIDSNSFEVTAKDGDTLKGIFVWGATLPHHCVPNTVVALDEQFNMK 224
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ PGD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 225 LYAAVPLQPGDIIYNSYTNPLMGTSQRQHQLRLSRRLECICSRCLDPTEMGTHMSSLKCK 284
Query: 190 E 190
E
Sbjct: 285 E 285
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 64/140 (45%), Gaps = 6/140 (4%)
Query: 57 RRNVVQFLLNF--FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFI 110
RR ++Q F F + +++ +I G++ NA E P + +F + +
Sbjct: 191 RREIIQAAQCFRNFPTTDRVFMDQLFRIVGVLNTNAFEAPCRSGGHETLLRGLFPLTAIM 250
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQC 170
H C PN F + ++RA + I G ++ YT LW + R L+ +K+F C C
Sbjct: 251 NHECTPNASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRNIFLKMTKHFACDC 310
Query: 171 ERCRDPTELNTFYDGVKCPE 190
RC D TE T+ + C E
Sbjct: 311 ARCHDNTENGTYLSALFCRE 330
>gi|194882337|ref|XP_001975268.1| GG20654 [Drosophila erecta]
gi|190658455|gb|EDV55668.1| GG20654 [Drosophila erecta]
Length = 488
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
++ T + + C I N+ E+ + + +F + + H+C PN + + +
Sbjct: 165 TQQLTSDVLHAHCIRIDSNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMK 224
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ PGD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 225 LYAAVPLQPGDIIYNSYTNPLMGTSQRQHQLRLSRRLECSCSRCLDPTEMGTHMSSIKCK 284
Query: 190 E 190
E
Sbjct: 285 E 285
>gi|330841743|ref|XP_003292851.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
gi|325076863|gb|EGC30616.1| hypothetical protein DICPUDRAFT_89992 [Dictyostelium purpureum]
Length = 680
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 83/168 (49%), Gaps = 11/168 (6%)
Query: 25 YKKSHQPQIWDKLMQLE--AHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE------FTQ 76
+KK P I+D+ LE + + + + E +R+ + KL E T
Sbjct: 322 WKKHEHPFIFDEYQDLENLSRTLDKVDPQQMEAFKRSASSIIAVISKLRGEDAILKYITL 381
Query: 77 EEILKICGIIQVNAHEM--PLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAM 133
EI+++ I+ N HEM PLT SY + IF S++ H+C PN + + G ++ R++
Sbjct: 382 NEIIELYSIVLSNGHEMLHPLTCRSYGMGIFPTGSYLNHSCSPNAFWYNDEQGMMVFRSL 441
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+P+ G+ L YTD +RR +L +F CQC +C+ + L T
Sbjct: 442 RPLKKGEELLTSYTDVTNPLEDRRKYLMKQYFFFCQCNQCQYQSNLVT 489
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/181 (25%), Positives = 78/181 (43%), Gaps = 13/181 (7%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
S + A+R + Q + D + +A+ E Y + Q NF ++
Sbjct: 463 SLRILTAVRVFHLGQQQRSLVDAM---QANAER-----GYRQEIIKAAQCFRNFPTTDKP 514
Query: 74 FTQEEILKICGIIQVNAHEMPLT----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
F +++ +I G++ NA E P E +F + + H C PN F + +
Sbjct: 515 F-MDQLFRIVGVLNTNAFEAPCRVDGHETLLRGLFPLTAIMNHECTPNASHYFDNGRMAV 573
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RA + I G ++ YT LW + R L+ +K F+C C RC D TE T+ + C
Sbjct: 574 VRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKNFMCDCPRCNDNTENGTYLSALFCR 633
Query: 190 E 190
E
Sbjct: 634 E 634
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 44/76 (57%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H+C PN F D +++ A + I G +++ Y LW T+ R+ L
Sbjct: 222 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 281
Query: 162 TSKYFICQCERCRDPT 177
+K+FICQC RC+DPT
Sbjct: 282 MTKHFICQCPRCQDPT 297
>gi|12007319|gb|AAG45134.1|AF310894_3 BOP [Dictyostelium discoideum]
Length = 403
Score = 73.9 bits (180), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
T +EI++I I+ N HEM PL +Y + IF S++ H+C PN + D G ++ R
Sbjct: 111 TIDEIIEIYSIVLSNGHEMLHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQGMMVFR 170
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
++PI G+ + YTD RR HL +F CQC++C+
Sbjct: 171 TLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQCK 213
>gi|4324419|gb|AAD16880.1| unknown [Dictyostelium discoideum]
Length = 333
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 57/103 (55%), Gaps = 3/103 (2%)
Query: 75 TQEEILKICGIIQVNAHEM--PLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
T +EI++I I+ N HEM PL +Y + IF S++ H+C PN + D G ++ R
Sbjct: 41 TIDEIIEIYSIVLSNGHEMLHPLNCHTYGLGIFPTGSYLNHSCLPNAFWYNDDQGMMVFR 100
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
++PI G+ + YTD RR HL +F CQC++C+
Sbjct: 101 TLRPIKKGEEILTSYTDITTECSERRKHLLKQYFFFCQCQQCK 143
>gi|195056335|ref|XP_001995066.1| GH22836 [Drosophila grimshawi]
gi|193899272|gb|EDV98138.1| GH22836 [Drosophila grimshawi]
Length = 489
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 49/89 (55%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
++ + + H+C PN + D + L A P+A GD + YT+PL GT R+H L+
Sbjct: 197 GVYVCGAGLPHHCVPNTVVALDDQFNMKLYAAVPLATGDIIYASYTNPLMGTSQRQHQLR 256
Query: 162 TSKYFICQCERCRDPTELNTFYDGVKCPE 190
S++ C C RC DPTEL T +KC E
Sbjct: 257 LSRHMECACTRCLDPTELGTHMSSMKCRE 285
>gi|194756636|ref|XP_001960582.1| GF11432 [Drosophila ananassae]
gi|190621880|gb|EDV37404.1| GF11432 [Drosophila ananassae]
Length = 489
Score = 72.8 bits (177), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+++ T + + C I NA E+ + + +F + + HNC PN + + +
Sbjct: 165 SQQLTCDSLHAHCIRIDSNAFEVTARDGDTLKGVFVCGASLPHNCVPNTVVALDEQFNMK 224
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ GD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 225 LYAAVPLQTGDIIYNSYTNPLMGTSQRQHQLRLSRRLDCSCSRCLDPTEMGTHMSSMKCR 284
Query: 190 E 190
E
Sbjct: 285 E 285
>gi|195584108|ref|XP_002081857.1| GD11241 [Drosophila simulans]
gi|194193866|gb|EDX07442.1| GD11241 [Drosophila simulans]
Length = 488
Score = 72.4 bits (176), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 60/121 (49%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+ T + + C I N+ E+ + + +F + + H+C PN + + +
Sbjct: 165 TQGLTSDALHAHCIRIDSNSFEVTAKDGDTLKGVFIWGATLPHHCVPNTVVALDEQFNMK 224
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ PGD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 225 LYAAVPLQPGDIIYNSYTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKCK 284
Query: 190 E 190
E
Sbjct: 285 E 285
>gi|195335119|ref|XP_002034223.1| GM21750 [Drosophila sechellia]
gi|194126193|gb|EDW48236.1| GM21750 [Drosophila sechellia]
Length = 488
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 56/109 (51%), Gaps = 1/109 (0%)
Query: 83 CGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
C I N+ E+ + + +F + + H+C PN + + + L A P+ PGD
Sbjct: 177 CIRIDSNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMKLYAAVPLQPGDI 236
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
+ YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC E
Sbjct: 237 IYNSYTNPLMGTSQRQHQLRLSRRLECTCSRCLDPTEMGTHMSSLKCKE 285
>gi|195488851|ref|XP_002092488.1| GE11639 [Drosophila yakuba]
gi|194178589|gb|EDW92200.1| GE11639 [Drosophila yakuba]
Length = 488
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 61/121 (50%), Gaps = 1/121 (0%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL 129
++ T + + C I N+ E+ + + +F + + H+C PN + + +
Sbjct: 165 TQQLTSDILHAHCIRIDSNSFEVTAKDGDTLKGVFVWGATLPHHCVPNTVVALDEQFNMK 224
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L A P+ PGD + YT+PL GT R+H L+ S+ C C RC DPTE+ T +KC
Sbjct: 225 LYAAVPLQPGDIIYNSYTNPLMGTSQRQHQLRLSRRLECTCTRCLDPTEMGTHMSSLKCK 284
Query: 190 E 190
E
Sbjct: 285 E 285
>gi|270012082|gb|EFA08530.1| hypothetical protein TcasGA2_TC006183 [Tribolium castaneum]
Length = 331
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 67/122 (54%), Gaps = 4/122 (3%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFT-DSGQV 128
++F + I +CGI+ +NA E+ Y+ ++ + S I HNC NL+++ T + ++
Sbjct: 2 DQFCDDLIHTVCGILDLNAIEVR-APSGYLTRCLYPKLSQISHNCVTNLFQTVTPEDFKI 60
Query: 129 LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LRA I+ L Y PLW T+ RR L+ +K C+C+RC D TEL T +KC
Sbjct: 61 TLRASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKC 120
Query: 189 PE 190
+
Sbjct: 121 SK 122
>gi|91088051|ref|XP_966825.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270012083|gb|EFA08531.1| hypothetical protein TcasGA2_TC006184 [Tribolium castaneum]
Length = 331
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 66/121 (54%), Gaps = 2/121 (1%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFT-DSGQVL 129
++F I +CGI+ +NA E+ + I ++ + S I HNC NL+++ T + ++
Sbjct: 2 DQFCDNLIHTVCGILDLNAIEVRASSGYLIRCLYPKLSQISHNCVTNLFQTVTPEDFKIT 61
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L+A I+ L Y PLW T+ RR L+ +K C+C+RC D TEL T +KC
Sbjct: 62 LKASVNISENQELFYNYVYPLWPTLIRRDFLKENKNLDCRCKRCGDKTELRTHLSTLKCS 121
Query: 190 E 190
+
Sbjct: 122 K 122
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 76/160 (47%), Gaps = 5/160 (3%)
Query: 26 KKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGI 85
K + + K L +H + KN PK ++ Q L +F + EIL + G
Sbjct: 98 KDYYAKNAYRKFKDLMSHYTDIKNDPKRIEHFVSLCQVLEDFMEGTTLPNSAEILGLYGR 157
Query: 86 IQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--H 141
I VN++ + P + I+ S I+H+C PN F + +L+RA++ I D
Sbjct: 158 ICVNSYNILDPDMNSIGVGIYLGPSVIDHSCKPNAVAVF-EGTTILIRALEDIPRLDWSQ 216
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ I Y D L T R LQ + YF+C+CERC+DP T
Sbjct: 217 IHISYIDVLNTTSTRCTELQNTYYFLCECERCKDPETYAT 256
>gi|170067842|ref|XP_001868639.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863902|gb|EDS27285.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 504
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 56/100 (56%), Gaps = 5/100 (5%)
Query: 89 NAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
N HE+ L ++ + + H+C N+ F ++ + A +PI G+ + Y+
Sbjct: 226 NDHEIILR-----GLYILGALMNHHCLANVRYVFDENQVMWCHASRPIRKGEQIFNNYSK 280
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LWGT +R HL SK+F+C+CERCRD TEL T+ +KC
Sbjct: 281 VLWGTQHRIIHLWFSKHFLCECERCRDVTELGTYLGALKC 320
>gi|198458396|ref|XP_001361024.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
gi|198136331|gb|EAL25600.2| GA21934 [Drosophila pseudoobscura pseudoobscura]
Length = 489
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 77/160 (48%), Gaps = 6/160 (3%)
Query: 37 LMQLEAHVEEYKNSPK-YENDRRNVVQFLLNFFKL----NEEFTQEEILKICGIIQVNAH 91
L Q +E+Y++ ++ VV+ LL + ++ T + + C I NA
Sbjct: 126 LFQRSVDLEDYRDGEGMWQEHEEQVVRPLLASGLVEALPTQQLTADVLHAHCVRIDCNAF 185
Query: 92 EMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
E+ + + +F + + H+C PN + + + L A P+ GD + YT+PL
Sbjct: 186 EVTGRDGETLKGVFICGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDVIYNSYTNPL 245
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
GT R+H L+ S++ C C RC+D TE+ T +KC E
Sbjct: 246 MGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRE 285
>gi|328715119|ref|XP_003245540.1| PREDICTED: protein msta, isoform A-like [Acyrthosiphon pisum]
Length = 478
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/184 (27%), Positives = 81/184 (44%), Gaps = 26/184 (14%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQ------LEAHVEEYKNSPKYENDRRNVVQFLLNFF 68
Y I A+R L + + + + ++Q L VEE K + E D+ +
Sbjct: 135 YNAITAVRGLSIRDGKYRYFLDVLQKKSTDKLTFEVEELKRNISMELDKED--------- 185
Query: 69 KLNEEFTQEEILKICGIIQVNAHEMPLTEP----SYIAIFDRASFIEHNCYPNLYKSFTD 124
++ ++ C I+ N E + S ++ A+F+ H C PN +F +
Sbjct: 186 -------EQIMVTTCKIMDANCFETIVNRSNKLTSLKGLYPVAAFMNHCCVPNTMHNFNE 238
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
Q++ +A PI G ++ YT +W T R++HL TSK FIC C RC D E T
Sbjct: 239 KLQMIAKASLPIYKGQEITTSYTYSIWPTSLRQNHLLTSKQFICTCSRCCDTEEFGTKLA 298
Query: 185 GVKC 188
+ C
Sbjct: 299 ALNC 302
>gi|383850313|ref|XP_003700740.1| PREDICTED: protein msta, isoform B-like [Megachile rotundata]
Length = 494
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 66/143 (46%), Gaps = 10/143 (6%)
Query: 56 DRRNVVQFLLNFFKLNEEFTQEE-----ILKICGIIQVNAHEMPLT-----EPSYIAIFD 105
D+ Q+ +N + N E + E + ++CG N+ E S ++
Sbjct: 158 DQTLTSQYEINLLQRNVEKSPSEADVELMKRVCGAFNSNSFETVSVLDKDHSTSLRGLYP 217
Query: 106 RASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
+ HNC PN F ++ ++ + A PI+ G+ L++ YT W T RR L +K+
Sbjct: 218 LGALQNHNCAPNTRHHFDENYRLYVSATVPISAGEELTMSYTSLFWDTTLRRQFLSITKH 277
Query: 166 FICQCERCRDPTELNTFYDGVKC 188
F C C+RC DPTE + + C
Sbjct: 278 FSCTCKRCSDPTESGSMLCALLC 300
>gi|281208900|gb|EFA83075.1| hypothetical protein PPL_03863 [Polysphondylium pallidum PN500]
Length = 470
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 91/188 (48%), Gaps = 22/188 (11%)
Query: 18 IIALRCLYKKSHQPQIWDK-LMQLEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEE-- 73
++ LR L KK+ ++++ + L +E+ K +P++ +F F +L +E
Sbjct: 153 LLVLRILLKKNILSELYNGGVGTLSVQQQEFEKRAPQW---LEQTSRFADRFIELLQEDK 209
Query: 74 ---------FTQEEILKICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKS 121
FT++EI+ VN+ T+ S I F A+ + H+C PN + S
Sbjct: 210 HIDPTLLSFFTKQEIINTACATMVNS--FASTKNSIIVSNGFFYEAALLNHSCQPNTFYS 267
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
++ Q+ +R ++ I G + Y D L T R+ L SK+F C+CERC+DPTE
Sbjct: 268 IQNN-QLQMRCIQDIEQGGDIFDSYVDLLEPTFERQQVLLQSKHFYCRCERCQDPTECGR 326
Query: 182 FYDGVKCP 189
+ KCP
Sbjct: 327 YLSSFKCP 334
>gi|195124463|ref|XP_002006712.1| GI21214 [Drosophila mojavensis]
gi|193911780|gb|EDW10647.1| GI21214 [Drosophila mojavensis]
Length = 489
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 76/158 (48%), Gaps = 6/158 (3%)
Query: 37 LMQLEAHVEEYKNSPKY-----ENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAH 91
L++ +E+Y+N E R +++ L +++ T + + C I +A+
Sbjct: 126 LLERPVKLEDYRNEDGVWQEHEEQVVRPIMESGLAQVLPDQQLTSDALHAHCIRIDTSAY 185
Query: 92 EMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
E+ + + ++ R + + H+C PN + + + L A P+ G + Y +PL
Sbjct: 186 EVTARDGDTLKGVYVRGANLPHSCVPNTVVALDEQFNLKLYAAVPLESGQIIFTSYANPL 245
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
GT R+H L+ +++ C C RC DPTEL T +KC
Sbjct: 246 MGTSQRQHQLRMTRHMECACVRCLDPTELGTHMSSMKC 283
>gi|322786609|gb|EFZ13004.1| hypothetical protein SINV_09986 [Solenopsis invicta]
Length = 490
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 60/131 (45%), Gaps = 6/131 (4%)
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP------SYIAIFDRASFIEHNCYPNLY 119
N L E + + ++C + N+ E+ + S ++ + H C PN
Sbjct: 174 NVANLPSEEQMKLMQRVCDVFNTNSFEVVVPSRNNDHTMSLRGLYPMGALQNHCCVPNSR 233
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
F D ++ + A PIA G+ +++ YTD LW T RR L+ +K F C C RC DP E
Sbjct: 234 HHFDDQQRLHVSAALPIAAGEEITMSYTDLLWDTSTRRQFLRITKRFSCNCNRCSDPLEF 293
Query: 180 NTFYDGVKCPE 190
+ + C +
Sbjct: 294 GSRLSALLCAK 304
>gi|195154475|ref|XP_002018147.1| GL17551 [Drosophila persimilis]
gi|194113943|gb|EDW35986.1| GL17551 [Drosophila persimilis]
Length = 489
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 77/160 (48%), Gaps = 6/160 (3%)
Query: 37 LMQLEAHVEEYKNSPK-YENDRRNVVQFLLNFFKL----NEEFTQEEILKICGIIQVNAH 91
L + ++E+Y++ ++ VV LL + ++ T + + C I NA
Sbjct: 126 LFERSVNLEDYRDGEGMWQEHEEQVVGPLLASGLVEALPTQQLTADVLHAHCVRIDCNAF 185
Query: 92 EMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
E+ + + +F + + H+C PN + + + L A P+ GD + YT+PL
Sbjct: 186 EVTGRDGETLKGVFVCGAGLPHHCVPNTVVALDEQFNMKLYAAVPLNAGDVIYNSYTNPL 245
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
GT R+H L+ S++ C C RC+D TE+ T +KC E
Sbjct: 246 MGTSQRQHQLRLSRHLDCSCSRCQDATEMGTHMSSMKCRE 285
>gi|62473432|ref|NP_001014718.1| msta, isoform B [Drosophila melanogaster]
gi|76363525|sp|P83501.2|MSTAB_DROME RecName: Full=Protein msta, isoform B
gi|61677866|gb|AAF45751.3| msta, isoform B [Drosophila melanogaster]
Length = 448
Score = 70.1 bits (170), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 55/103 (53%), Gaps = 5/103 (4%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
V+ HE+ + A+F A + H C PN F + +++ A + I G +++ Y
Sbjct: 219 VDGHEVLVR-----ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYA 273
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
LW T+ R+ L +K+FIC+C RC+DPTE T+ + C E
Sbjct: 274 KLLWSTLARKIFLGMTKHFICKCVRCQDPTENGTYLSALFCRE 316
>gi|157117568|ref|XP_001658830.1| hypothetical protein AaeL_AAEL008032 [Aedes aegypti]
gi|108876001|gb|EAT40226.1| AAEL008032-PA [Aedes aegypti]
Length = 527
Score = 69.7 bits (169), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 89 NAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
N HE+ L ++ + + H C PN+ F + + A KPI G+ + Y+
Sbjct: 250 NDHEIVLR-----GLYILGALMNHCCRPNIRYVFDEKLVMHCYASKPIRKGEQIFNNYSK 304
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LWGT +R HL SK+F+C C+RC DPTE+ T+ +KC
Sbjct: 305 SLWGTQHRIIHLCFSKHFLCGCDRCLDPTEMGTYLGALKC 344
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 69.3 bits (168), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 93/199 (46%), Gaps = 20/199 (10%)
Query: 7 NFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYEN--DRRNVVQFL 64
+ T P C +A + + ++++ L H++++ +PK + + +V L
Sbjct: 89 DATLPTVLLVCRLAAKLFLSSKVDQEDKNRILNLRHHLDDH-TAPKQQQFCETTPLVHLL 147
Query: 65 LNFFKLN-----------EEFTQEEILKICGIIQVNAHEM---PLTEPSYIAIFDRASFI 110
L+ +K++ +E + EILK+ G + N + EP I +F + S
Sbjct: 148 LSEYKVDKQEQTPSFAELQESLKPEILKLFGQVSCNGFSIMNGVTNEPVGIGLFLQGSMF 207
Query: 111 EHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQ 169
H+C PN SF GQ + + +K + G L+I Y + L T R+ L+ S +F CQ
Sbjct: 208 NHDCDPNCVVSF--QGQEMNVHVIKDVKEGQELTISYVEVLQSTKKRQKILKDSYFFECQ 265
Query: 170 CERCRDPTELNTFYDGVKC 188
C RC T + + DG++C
Sbjct: 266 CSRCTTETTDDWYLDGLQC 284
>gi|350424241|ref|XP_003493732.1| PREDICTED: protein msta, isoform B-like [Bombus impatiens]
Length = 498
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 57/117 (48%), Gaps = 6/117 (5%)
Query: 78 EILK-ICGIIQVNAHEMPLT-----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
E++K +C I+ N+ E + S ++ ASF H C PN F ++ +
Sbjct: 184 ELMKHMCRILNTNSFETIMVHDKEHSVSLRGLYSIASFQNHCCVPNTRHHFDGEFRMYVS 243
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A PIA G+ ++ YT W T RR L +K+F C C+RC DPTE + + C
Sbjct: 244 AALPIAAGEEITSTYTSLFWDTTLRRKFLSITKHFSCMCKRCSDPTEFKSRLGALLC 300
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 70/139 (50%), Gaps = 7/139 (5%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HVE + K D R V+ L F+ +++F + I I G+I NA +
Sbjct: 127 LQNHVESFDEEEK--KDLRVDVESFLEFWPAQSQQFGMQYISHIFGVINCNAFTLSDQRG 184
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ + IF H+C+PN F ++G++ LRA+ I+PGD L++ Y D L +
Sbjct: 185 LQAVGVGIFPNLCQANHDCWPNCTVIF-NNGKIELRALSKISPGDELTVSYVDFLNVSEE 243
Query: 156 RRHHLQTSKYFICQCERCR 174
RR L+ YF C CE C+
Sbjct: 244 RRKQLKKQYYFDCTCEHCK 262
>gi|380022845|ref|XP_003695246.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 485
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 45/88 (51%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
++ S H C PN F + Q+ +RA PI+ G+ +++ YT W T RR L
Sbjct: 212 GLYPLGSLQNHCCIPNTRHYFDEEFQLYVRAALPISTGEEITMSYTSLFWDTTLRRQFLN 271
Query: 162 TSKYFICQCERCRDPTELNTFYDGVKCP 189
+K F C C+RC DPTE N+ + C
Sbjct: 272 VTKNFSCMCKRCSDPTEFNSKLGALLCA 299
>gi|347963044|ref|XP_566389.3| AGAP000041-PA [Anopheles gambiae str. PEST]
gi|333467386|gb|EAL41307.3| AGAP000041-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/100 (34%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 89 NAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
N HE+ L ++ + + H C PN+ F ++ + A +PI G+ + Y+
Sbjct: 245 NQHEIILR-----GLYILGALMNHCCRPNVRYVFDGELRMRVHASRPIKKGEQIMNNYSK 299
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
LWG+ +R HL SK F+C C+RC+DPTE T+ +KC
Sbjct: 300 ILWGSQHRIIHLCFSKNFLCCCDRCKDPTEFGTYLGALKC 339
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 57/111 (51%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E SY A++ + + H+C PN Y +F + + +++RA IA
Sbjct: 185 VLRTNGFDKTTDRTTDNQEFSYRALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIA 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G+ ++ YT G I R +L+ K F C+C RC DPTE + G+ C
Sbjct: 245 EGEEITTTYTKLFTGNIARHLYLKMKKGFTCKCPRCSDPTEKGAYISGLYC 295
>gi|195421076|ref|XP_002060836.1| GK23578 [Drosophila willistoni]
gi|194156921|gb|EDW71822.1| GK23578 [Drosophila willistoni]
Length = 128
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 44/76 (57%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H+C PN F D +++ A + I G +++ Y LW T+ R+ L
Sbjct: 51 ALFPLAGLLNHHCTPNAAHHFEDGETIVVSATERIPKGTEITMTYAKLLWSTLARKLFLA 110
Query: 162 TSKYFICQCERCRDPT 177
+K+FICQC RC+DPT
Sbjct: 111 MTKHFICQCPRCQDPT 126
>gi|328785542|ref|XP_001122203.2| PREDICTED: protein msta, isoform A-like [Apis mellifera]
Length = 490
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 45/88 (51%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
++ S H C PN F + ++ +RA PI+ G+ +++ YT W T RR L
Sbjct: 214 GLYPLGSLQNHCCIPNTRHYFDEKFRLYVRAALPISAGEEITMSYTSLFWDTTLRRQFLN 273
Query: 162 TSKYFICQCERCRDPTELNTFYDGVKCP 189
+K F C C+RC DPTE N+ + C
Sbjct: 274 VTKNFSCMCKRCSDPTEFNSKLSALLCA 301
>gi|195132931|ref|XP_002010893.1| GI21458 [Drosophila mojavensis]
gi|193907681|gb|EDW06548.1| GI21458 [Drosophila mojavensis]
Length = 302
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 44/76 (57%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F A + H C PN F D +++ A + I G +++ YT LW T+ R+ L
Sbjct: 226 ALFPLAGLLNHQCTPNAAHHFEDGETIVVTATERIPAGAEITMSYTKLLWSTLARKIFLG 285
Query: 162 TSKYFICQCERCRDPT 177
+K+F+CQC RC+DPT
Sbjct: 286 MTKHFMCQCPRCQDPT 301
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 58/113 (51%), Gaps = 1/113 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + + H+C PN Y +F + + +++RA IA
Sbjct: 185 VLRTNGFDKTTDRTTDNQEFNYRALYPLFAVMNHDCIPNSYYTFEEKTNNMIVRAAVDIA 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
G+ ++ YT G I R L+ K F C+C RC DPTE +F G+ C +
Sbjct: 245 EGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCLDPTEKGSFISGLYCRD 297
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C+ + + F GVK
Sbjct: 250 KRQLKKQYYFDCACEHCQKKLKDDLFL-GVK 279
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 67.0 bits (162), Expect = 3e-09, Method: Composition-based stats.
Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 7/105 (6%)
Query: 77 EEILKICGIIQVNAHEMPLTEPSYI-----AIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
E + ++ + Q NA + LT+PS + ++ A+ + H+C PN S G++ +R
Sbjct: 334 ETVARLFAVAQANA--LTLTDPSSLRPIGQGLYASAALLNHSCLPNANWSVDGEGRLCVR 391
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
A++PI G+ +++ Y DP R+ LQ +F C+C +CR P
Sbjct: 392 AVRPIEAGEEVTVAYVDPTLPYHARQQALQDHFFFACRCLQCRPP 436
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 80/171 (46%), Gaps = 12/171 (7%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
P+ +A R L+K + + + +L+ HV+ + K D R V+ L F
Sbjct: 100 PTENIRLAARILWKMEREGSGLSEGCLVAIEELQNHVDSFGEEEK--KDLRADVESFLEF 157
Query: 68 FKLN-EEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFT 123
+ ++F + I I G+I N + + + IF H+C+PN F
Sbjct: 158 WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVVF- 216
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
++G++ LRA+ I+PGD L++ Y D L + RR L+ YF C CE C+
Sbjct: 217 NNGKIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCK 267
>gi|270011595|gb|EFA08043.1| hypothetical protein TcasGA2_TC005637 [Tribolium castaneum]
Length = 319
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 62/117 (52%), Gaps = 6/117 (5%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
E+ ++E + KI +Q N H I+ + AS ++H+C PN + T++ ++ LR
Sbjct: 32 EDKSEETLGKIFNFLQSNTHH----NKQLISFYPNASLLKHSCTPNTIFT-TENFEITLR 86
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
A+ ++ G L+ YTD GT R +L+ C C RC DPTE TF+ +KC
Sbjct: 87 AISGVSEGTPLTCSYTDLFSGTHVRLKNLKKLNK-TCFCARCTDPTEFGTFFSALKC 142
>gi|195454533|ref|XP_002074283.1| GK18438 [Drosophila willistoni]
gi|194170368|gb|EDW85269.1| GK18438 [Drosophila willistoni]
Length = 232
Score = 67.0 bits (162), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 3/111 (2%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-LMQLEAH-VEEYKNSPKYENDRRNVVQFLLNFFKL 70
P +CI+ LR L K QP+ W+K + +E H E +NS + DR N+ Q+L KL
Sbjct: 111 PQLDCIMPLRVLLAKEQQPERWEKDVAPMEHHEAERRENSDIWHADRVNIAQYLRGPCKL 170
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYK 120
F++E I+++ GI++VNA E T + ++ + HNC PN +
Sbjct: 171 ASRFSEELIMQVVGILEVNAFEARTTVGYPLRCLYPYTGILAHNCVPNTAR 221
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 70/139 (50%), Gaps = 7/139 (5%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HVE + K D R V+ L F+ +++F + I I G+I NA +
Sbjct: 132 LQNHVENFDEEEK--KDLRIDVESFLEFWPAQSQQFGMQYISHIFGVINCNAFTLSDQRG 189
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ + IF H+C+PN F ++G++ LRA+ I+PGD L++ Y D L +
Sbjct: 190 LQAVGVGIFPNLCQANHDCWPNCTVIF-NNGKIELRALSKISPGDELTVSYVDFLNVSEE 248
Query: 156 RRHHLQTSKYFICQCERCR 174
R+ L+ YF C CE C+
Sbjct: 249 RQKQLKKQYYFDCTCEHCK 267
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 76/154 (49%), Gaps = 5/154 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLHLSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
R L+ YF C CE C+ + + F + P+
Sbjct: 250 RRQLKKQYYFDCSCEHCQKGLKDDLFLAAKEDPK 283
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 66.2 bits (160), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 76/154 (49%), Gaps = 5/154 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 127 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 186 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLHLSEER 244
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
R L+ YF C CE C+ + + F + P+
Sbjct: 245 RRQLKKQYYFDCSCEHCQKGLKDDLFLAAKEDPK 278
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 66.2 bits (160), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + + R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVENFGEEEQ-KQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C+ + + F GVK
Sbjct: 250 KKQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 279
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 73/146 (50%), Gaps = 5/146 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLHLSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTF 182
R L+ YF C CE C+ + + F
Sbjct: 250 RQQLKKQYYFDCSCEHCQKGLKDDLF 275
>gi|261330294|emb|CBH13278.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 713
Score = 65.9 bits (159), Expect = 8e-09, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++CG +Q N + P + +++ AS+ H+C PNL + D G A++ I
Sbjct: 552 RLCGAVQCNNFGLFDPKEDCIGVSVIPEASYFNHSCLPNLCRVMCDGGIAAFYALREIRK 611
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L TS F CQC+RC
Sbjct: 612 GEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC 647
>gi|72392607|ref|XP_847104.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175618|gb|AAX69751.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803134|gb|AAZ13038.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 713
Score = 65.9 bits (159), Expect = 8e-09, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++CG +Q N + P + +++ AS+ H+C PNL + D G A++ I
Sbjct: 552 RLCGAVQCNNFGLFDPKEDCIGVSVIPEASYFNHSCLPNLCRVMCDGGIAAFYALREIRK 611
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L TS F CQC+RC
Sbjct: 612 GEPLTICYVDVQEVSTAERRRTLLTSYRFFCQCKRC 647
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + H+C PN Y +F D + +++RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEDKTNNMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ K F C+C RC DPTE F G+ C
Sbjct: 245 EGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLYC 295
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HV+ + + E R +V FL + ++ F+ + I I G+I N +
Sbjct: 132 LQNHVDHFGEEEQKEL-RADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
R L+ YF C CE C+ + + F GVK
Sbjct: 250 RRQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 279
>gi|384250973|gb|EIE24451.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 606
Score = 65.1 bits (157), Expect = 1e-08, Method: Composition-based stats.
Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 9/119 (7%)
Query: 77 EEILKICGIIQVNAHEM----PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LR 131
+++L + +I NAH M S + ++ S + H+C PN F G + +R
Sbjct: 294 QQLLDMAAVINNNAHGMGAANSTNTDSALGLYPALSMLNHSCLPNCV--FASCGSDMHVR 351
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL--NTFYDGVKC 188
A++P+A G+ L++ Y + + R L +K+F C CERC P E + F + VKC
Sbjct: 352 AIRPVAAGEQLTVTYINIMEPRRIRARELMDTKHFACACERCVSPLETHPDRFLEAVKC 410
>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
Length = 417
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 53/98 (54%), Gaps = 1/98 (1%)
Query: 80 LKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAP 138
+++ I++ N ++ L +++A++ S + H C PN Y TD +R +PIA
Sbjct: 212 VQVQDIVKANTFQVLLNGVNHLAVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAE 271
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
G+ ++I YT PL T R+HHL +F C C RC P
Sbjct: 272 GEEITISYTSPLEPTETRQHHLAQGFHFTCTCPRCTSP 309
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 77/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C+ + + F GVK
Sbjct: 250 KRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 279
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISVGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C+ + + F GVK
Sbjct: 250 KRQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 279
>gi|66801407|ref|XP_629629.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851021|sp|Q54D67.1|Y2454_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292454
gi|60463010|gb|EAL61206.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 343
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 51/96 (53%), Gaps = 2/96 (2%)
Query: 80 LKICGI-IQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIA 137
L GI I N ++ P S I ++ SFI H+C PN + F D + L +KPI
Sbjct: 238 LNTIGIDIDPNQQSTKMSSPESGIGLYLLTSFINHDCDPNAFIHFPDDHTMHLSPLKPIN 297
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PGD ++I YTD ++RR L + F C+C++C
Sbjct: 298 PGDEITISYTDTTKDLVDRRSQLFENYGFNCECKKC 333
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 57/119 (47%), Gaps = 6/119 (5%)
Query: 77 EEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
EEIL I I N++ M + + + +F I H+C PN S G +RA
Sbjct: 181 EEILVIAAQINENSYSMDALDEKHLVAAVGLFPICGLINHSCQPNCTWSNAGDGIAEVRA 240
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP--TELNTFYDGVKCP 189
++ I GD +++ Y D RR L+ +K+F CQC RC P ++ + +G CP
Sbjct: 241 LRDIKEGDEITLSYIDIDKERAERRKELRETKHFDCQCGRCAAPLSESVDRYLEGFCCP 299
>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 367
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 56/111 (50%), Gaps = 8/111 (7%)
Query: 72 EEFTQEEILKICGIIQVN--AHEMPLTEP------SYIAIFDRASFIEHNCYPNLYKSFT 123
+ F + +++ I+ +N E+ TEP S I +F SFI H+C PN Y +F
Sbjct: 239 KRFNYDWFVRVMQILYINTLGIEVGATEPKAASTSSGIGLFYLTSFINHSCDPNCYLAFP 298
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
L A+KP+ GD L I Y DP I+R+ HL + F C C +C+
Sbjct: 299 TDHTAHLTALKPLKAGDELLIAYGDPNKDYIDRQSHLFDNYGFSCNCSKCQ 349
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 48/99 (48%), Gaps = 4/99 (4%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
T +++ +C IQVN +T +AI+DR S + H+C PN +F G L M
Sbjct: 162 TTADMIALCCKIQVNT----MTTERGVAIYDRLSTVNHSCVPNACLTFGIGGIARLSPMT 217
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
IA GD ++I Y D R+ L+ YF C C C
Sbjct: 218 AIASGDQINISYVDVFQSCETRQRQLKEQYYFDCTCRLC 256
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 75/151 (49%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKE-LRVDVDTFLQYWPPQSQQFGMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEER 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C+ + + F GVK
Sbjct: 250 KRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 279
>gi|389750946|gb|EIM92019.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 462
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 83/182 (45%), Gaps = 30/182 (16%)
Query: 9 TTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF 68
TT +CII+ R L ++ V E K P N + +VQ +N
Sbjct: 147 TTARQPGDCIISERPLL-----------ILPRSMMVPELK--PGVPNVQAQMVQGAINRM 193
Query: 69 KLNEEFTQEEILKIC--------GIIQVNAHEM-PLTEPSY----IAIFDRASFIEHNCY 115
+L++ + LK C GI+ NA ++ PL PSY +AI S I H+C
Sbjct: 194 RLDDR-ARFLALKNCKSGSDVLTGILDTNAMQIGPL--PSYPVDHVAICATVSRINHSCS 250
Query: 116 PN-LYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
PN ++ +S +RA+KPIA GD ++I YT PL R+ L+ F C CE C
Sbjct: 251 PNAVFAWDVNSFTCFIRALKPIAAGDEITITYTYPLAPRTERQEDLRQKYAFTCTCETCS 310
Query: 175 DP 176
P
Sbjct: 311 LP 312
>gi|301105252|ref|XP_002901710.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262100714|gb|EEY58766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 500
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 77/161 (47%), Gaps = 14/161 (8%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
L+CL P D +M L ++ + + R+ L + F FT +++
Sbjct: 124 LKCLAMYECDPHALDGMMDLTCVGQQRVTNATLQLRRQ-----LPDIFPAG--FTDQQMA 176
Query: 81 KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
+ G++ N+HE+ S +F A +EHNC PN + DS + + A++PIAPGD
Sbjct: 177 TLIGVLNTNSHELENLGGS--GLFLSACRMEHNCKPNCSFTTFDS-TLWMTAIRPIAPGD 233
Query: 141 HLSICYTDPLWG-TINRRHHLQTSKYFICQCERCR---DPT 177
LSI Y + + T R+ L S FIC CE C DPT
Sbjct: 234 ALSIDYGNFFYRPTPERQECLLESYGFICTCEACVSLPDPT 274
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + H+C PN Y +F + + +++RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ K F C+C RC DPTE F G+ C
Sbjct: 245 EGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLYC 295
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
E SY A++ + H+C PN Y +F + + +++RA I G+ ++ YT G I
Sbjct: 203 EFSYRALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDILEGEEITTTYTKLFTGNIA 262
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R L+ K F C+C RC DPTE + G+ C
Sbjct: 263 RHLFLKMKKGFTCKCPRCSDPTEKGAYISGLYC 295
>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 740
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/107 (37%), Positives = 56/107 (52%), Gaps = 4/107 (3%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEM--PLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSG 126
L++ FT++EI + I NAHE PLT I+ AS+I H C PN D G
Sbjct: 443 LSDPFTKQEIFDLYPKILFNAHEYIDPLTHYEIGRGIYPTASYINHTCLPNTTWYNDDHG 502
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+L R+ + I G+ ++ Y D L + RR L+ F+CQCERC
Sbjct: 503 LILYRSSRDILKGEEITTSYLDILKPKLQRRKDLKQYS-FVCQCERC 548
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + H+C PN Y +F + + +++RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ K F C+C RC DPTE F G+ C
Sbjct: 245 EGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLYC 295
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + H+C PN Y +F + + +++RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ K F C+C RC DPTE F G+ C
Sbjct: 245 EGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLYC 295
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIA 137
+L+ G + E +Y A++ + H+C PN Y +F + + +++RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEEKTNNMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ K F C+C RC DPTE F G+ C
Sbjct: 245 EGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFISGLYC 295
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 47/92 (51%), Gaps = 1/92 (1%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRH 158
Y A+F + + H+C PN Y +F + + ++LRA + G ++ YT G I R
Sbjct: 206 YRALFPLFALVNHDCVPNSYYTFEEKTNYMVLRASVDLPEGTEITTTYTKLFTGNIARHL 265
Query: 159 HLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
L+ K F C+C RC DPTE F V C +
Sbjct: 266 FLKMKKNFTCKCSRCSDPTEKGAFISAVYCRD 297
>gi|225717680|gb|ACO14686.1| msta, isoform A [Caligus clemensi]
Length = 521
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 84/176 (47%), Gaps = 8/176 (4%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
I+ L+ L K P +++ L L H E+ N P+Y + + L + + ++ +
Sbjct: 156 ILPLKLLLSKESNPHVYNCLSILMDHREDRLNDPEYLEGIQTPI--LKSLERSLSTYSTD 213
Query: 78 EILKICGIIQVNAHEM-PLTEPS-YIAIFDRASFIEHNCYPN---LYKSFTDSGQVLLRA 132
E+LK GI++VN++E+ LT S + +F S + H+C P + T G ++ +
Sbjct: 214 ELLKALGIVEVNSYEVYNLTGHSGFRGLFCLTSLLSHDCVPKRRPIIGHETPYGLTMI-S 272
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ I G+ LSI Y RR L+ + YF C C+RC D E D + C
Sbjct: 273 TRNINEGEILSINYVHTQKPNRIRRRTLKDNWYFECSCKRCEDEAEFGLHPDAIWC 328
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/184 (26%), Positives = 82/184 (44%), Gaps = 11/184 (5%)
Query: 11 PHPSYECIIALRCLYKKSH-QPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF-- 67
P + II L C+ + + + D + L+A+++ + + RN + NF
Sbjct: 117 PDVANSVIIRLLCVARAINLSKEQRDLIYCLQANLDN-----NHRTEVRNAAKCFKNFPT 171
Query: 68 -FKLNEEFTQE-EILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD- 124
KL E + +L+ G + E +Y A++ + H+C PN Y +F +
Sbjct: 172 DKKLIEIMNRTVAVLRTNGFDKTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYTFEEK 231
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ +++RA I G ++ YT G I R L+ K F C+C RC DPTE F
Sbjct: 232 TNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCSDPTEKGAFIS 291
Query: 185 GVKC 188
G+ C
Sbjct: 292 GLYC 295
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 63.5 bits (153), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 68/138 (49%), Gaps = 5/138 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + K E R ++ FL + ++F + I I G+I N +
Sbjct: 127 LQNHVEHFGEEEKKEL-RLDLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L +R
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIF-NNGKIELRALGKISIGEELTVSYVDFLNVCEDR 244
Query: 157 RHHLQTSKYFICQCERCR 174
R L+ YF C CE C+
Sbjct: 245 RQQLKKQYYFDCTCEHCQ 262
>gi|395329416|gb|EJF61803.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 566
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 54/115 (46%), Gaps = 5/115 (4%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNA---HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT 123
FF L QEE + GII NA +P Y + + S I H+C PN F
Sbjct: 304 FFGLMNA-QQEEPSLVKGIIDTNALYVGSLPGAPHQYAGVCEHISRINHSCSPNAAYRFD 362
Query: 124 DSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT 177
+ +RA+ PI PG+ +SI YTDP R+H L +S F C C C P+
Sbjct: 363 LAAFAFEVRALFPIRPGEQVSIAYTDPALPRAARQHALLSSYGFTCACPACSLPS 417
>gi|401837625|gb|EJT41530.1| SET5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 524
Score = 63.2 bits (152), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF 122
F F K +EE E+ L + G +N + + Y+ I SFI H+C PN+Y
Sbjct: 324 FCGAFPKASEEIDLEKFLTMIGTFNINQYNGQM----YLWI----SFINHDCEPNVYIEQ 375
Query: 123 TDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
T+ + L L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 376 TEEHEELKLHARKPIKKGEQICITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 433
Query: 182 F 182
F
Sbjct: 434 F 434
>gi|84998620|ref|XP_954031.1| hypothetical protein [Theileria annulata]
gi|65305029|emb|CAI73354.1| hypothetical protein, conserved [Theileria annulata]
Length = 490
Score = 63.2 bits (152), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY---TDPLWG 152
+EP + I+D+ S++ H+C P+ T++ + +LRA K + PGD ++I Y TD L
Sbjct: 190 SEPDGLVIYDKISYVAHSCNPSCCWHHTENDEFVLRARKKLVPGDEITISYLGETDLLSP 249
Query: 153 TINRRHHLQTSKYFICQCERCRDPTE 178
T RR LQ +F C CERC P +
Sbjct: 250 TFRRRTLLQNWHFF-CTCERCSIPID 274
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 76/151 (50%), Gaps = 6/151 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HV+ + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVDSFGEEEQKEL-RLDVESFLHFWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + +F + H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + +R
Sbjct: 191 QAVGVGVFPNLCLVNHDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSADR 249
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ L+ YF C CE C + + F GVK
Sbjct: 250 KKLLKKQYYFDCTCEHCEKGIKDDLFL-GVK 279
>gi|348689633|gb|EGZ29447.1| hypothetical protein PHYSODRAFT_473651 [Phytophthora sojae]
Length = 500
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 82/174 (47%), Gaps = 15/174 (8%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
L+CL + D+++ L ++ S + R+ L + F F+ E++
Sbjct: 124 LKCLALYERDARSLDEVLGLTCAGQQRAMSAALQLRRQ-----LPDIFPAG--FSDEQMA 176
Query: 81 KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
+ G++ N+HE+ S +F A +EH+C PN + DS + + A++PIAPGD
Sbjct: 177 TLIGVLNTNSHELENLGGS--GLFLSACRMEHSCLPNCSFTTFDS-TLWMTAIRPIAPGD 233
Query: 141 HLSICYTDPLWG-TINRRHHLQTSKYFICQCERCR---DPTELNTFYDGVKCPE 190
LSI Y + + T R+ L S F+C CE C DPT G CP+
Sbjct: 234 ALSIDYGNFFYRPTPERQESLLESYGFVCTCEACVSMPDPTRAVRCLSG-NCPQ 286
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 85/170 (50%), Gaps = 13/170 (7%)
Query: 13 PSYECIIALRCLYKKSHQPQ--IWDKLM---QLEAHVEEYKNSPK--YENDRRNVVQFLL 65
PS + R L KK Q +KL+ ++++H E+ N + E D + QF
Sbjct: 101 PSEITRLVARILTKKKMQKDRCASEKLLLLGEMQSHTEDMDNERRETMEADVAGLHQF-- 158
Query: 66 NFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFT 123
F K E +++L + + N + E S++ A++ + I H+C P++ +++
Sbjct: 159 -FSKHLEIPGHKDLLTLFSQVACNGFTIEDEELSHLGTAVYPDVALINHSCRPSVIVTYS 217
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +RA++ + PGD + I Y D L+ T +R + L+ S YF CQCE C
Sbjct: 218 GT-SAHVRALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEEC 266
>gi|194913085|ref|XP_001982623.1| GG12637 [Drosophila erecta]
gi|190648299|gb|EDV45592.1| GG12637 [Drosophila erecta]
Length = 309
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
V+ HE+ + A+F A + H C PN F + +++ A + I G +++ Y
Sbjct: 219 VDGHEVLVR-----ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYA 273
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPT 177
LW T+ R+ L +K+FIC+C RC+DPT
Sbjct: 274 KLLWSTLARKIFLGMTKHFICKCVRCQDPT 303
>gi|195350806|ref|XP_002041929.1| GM11450 [Drosophila sechellia]
gi|194123734|gb|EDW45777.1| GM11450 [Drosophila sechellia]
Length = 462
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 78/162 (48%), Gaps = 18/162 (11%)
Query: 31 PQIWDKLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVN 89
P+ + + ++E+H EE + N+ + + VVQ L ++L E+ E++ +I G I V
Sbjct: 117 PEQFALIARMESHTEERRQNAVLWRHYEEKVVQRLRGTWQL-EDLEAEQVHEI-GQIGVK 174
Query: 90 AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD---SGQVLLRAMKPIAPGDHLSICY 146
++ A + H+C PN + TD S ++LLR + + + L++ Y
Sbjct: 175 VR----------TLYPSAFLLAHDCTPN--TAHTDDPRSFEILLRTSRRVREREALTLSY 222
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
L GT+ RR + K F C C RC DP EL T + C
Sbjct: 223 AYTLQGTLKRRAFMHEGKLFWCCCRRCSDPRELGTDCSALVC 264
>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
Length = 555
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 13/128 (10%)
Query: 69 KLNEEFTQEEILKICGIIQVNA--------HEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
+L+ F +EE L++ I VN+ + P++ F +A+ + H+C PN +
Sbjct: 182 ELSSIFNKEEYLEVVCSIFVNSFAGLSNNFNRTPISN----GYFYKAALLNHSCEPNTFF 237
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
S +S + +R +K I G+ + Y D L TI R+ L SK F+C C RC D TE +
Sbjct: 238 SIQNST-LEMRVVKDINKGEQIFDSYVDLLLPTIERQKILLKSKNFVCDCSRCTDITENS 296
Query: 181 TFYDGVKC 188
F + C
Sbjct: 297 RFVSSIHC 304
>gi|194768767|ref|XP_001966483.1| GF21974 [Drosophila ananassae]
gi|190617247|gb|EDV32771.1| GF21974 [Drosophila ananassae]
Length = 364
Score = 62.8 bits (151), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 5/90 (5%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
V HE+ + A+F A + H C PN F D +++ + + I G +++ Y
Sbjct: 224 VGGHEVLVR-----ALFPLAGLLNHQCTPNAAHHFEDGETIVVCSTERIPCGAEITMSYA 278
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPT 177
LW T+ R+ L +K+F+CQC RC+DPT
Sbjct: 279 KLLWSTLARKIFLGMTKHFMCQCVRCQDPT 308
>gi|221505568|gb|EEE31213.1| hypothetical protein TGVEG_082100 [Toxoplasma gondii VEG]
Length = 858
Score = 62.4 bits (150), Expect = 8e-08, Method: Composition-based stats.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 9/133 (6%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V+ ++ F+L T E++ + I N E +P I+ SF H+C PN
Sbjct: 127 VKKIIRAFRLENRTTPEDLELLLQIWIHNCFEQ-FEDPVGYVIYFMPSFSSHSCLPNAL- 184
Query: 121 SFTDSGQV-LLRAMKPIAPGDHLSICY---TDPLWGTINRRHHLQTSKYFICQCERCRDP 176
FTD +LR+ IA GD +++ Y D + T+ RR L +K FIC CERC P
Sbjct: 185 WFTDEDHTFVLRSRADIAAGDEVTLTYLSEDDLMRPTLQRRKVLSETKDFICTCERCSAP 244
Query: 177 TELNTFYDGVKCP 189
+ F G +CP
Sbjct: 245 VD---FSRGFRCP 254
>gi|195477726|ref|XP_002100289.1| GE16967 [Drosophila yakuba]
gi|194187813|gb|EDX01397.1| GE16967 [Drosophila yakuba]
Length = 306
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
V+ HE+ + A+F A + H C PN F + +++ A + I G +++ Y
Sbjct: 220 VDGHEVLVR-----ALFPLAGLLNHQCTPNAAHHFENGETIVVCATERIPAGAEITMSYA 274
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCRDPT 177
LW T+ R+ L +K+FIC+C RC+DPT
Sbjct: 275 KLLWSTLARKIFLGMTKHFICKCVRCQDPT 304
>gi|365760270|gb|EHN02002.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 524
Score = 62.4 bits (150), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 65/121 (53%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF 122
F F K +EE E+ L + G +N + + Y+ I SFI H+C PN Y
Sbjct: 324 FCGAFPKASEEIDLEKFLTMIGTFNINQYNGQM----YLWI----SFINHDCEPNAYIEQ 375
Query: 123 TDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
T+ + L L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 376 TEEHEELKLHARKPIKKGEQICITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 433
Query: 182 F 182
F
Sbjct: 434 F 434
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 49/92 (53%), Gaps = 3/92 (3%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
I ++ S + H+C PN +F Q+ LR ++ GD L I Y DP+ +R++ L
Sbjct: 191 IGVYPGLSLVNHSCSPNCSATFRGK-QMQLRIIENTKIGDELLISYIDPMQVLSSRQNQL 249
Query: 161 QTSKYFICQCERCRDPTE--LNTFYDGVKCPE 190
Q+ F C CERC D T+ N D V+CP+
Sbjct: 250 QSQYCFKCICERCIDTTKDSCNNLMDSVRCPK 281
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E KN N+VQ +L F+L+ +EI NAH + P
Sbjct: 147 LESHISEVDKNQLVLYAQMANLVQLILPSFELD----LKEITHTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F + +RA++PI+ + +SI Y + T+ R
Sbjct: 203 RPLGTGLYPVLSIINHSCVPNAVLIF-EGRTAYVRALQPISKNEEVSISYIETAATTMKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ YF C C RC +E + +G +C
Sbjct: 262 QDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC 292
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E KN N+VQ +L F+L+ +EI NAH + P
Sbjct: 147 LESHISEVDKNQLVLYAQMANLVQLILPSFELD----LKEITHTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F + +RA++PI+ + +SI Y + T+ R
Sbjct: 203 RPLGTGLYPVLSIINHSCVPNAVLIF-EGRTAYVRALQPISKNEEVSISYIETAATTMKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ YF C C RC +E + +G +C
Sbjct: 262 QDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC 292
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E KN N+VQ +L F+L+ +EI NAH + P
Sbjct: 147 LESHISEVDKNQLVLYAQMANLVQLILPSFELD----LKEITHTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F + +RA++PI+ + +SI Y + T+ R
Sbjct: 203 RPLGTGLYPVLSIINHSCVPNAVLIF-EGRTAYVRALQPISKNEEVSISYIETAATTMKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ YF C C RC +E + +G +C
Sbjct: 262 QDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC 292
>gi|237837951|ref|XP_002368273.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
gi|211965937|gb|EEB01133.1| hypothetical protein TGME49_088330 [Toxoplasma gondii ME49]
Length = 859
Score = 62.0 bits (149), Expect = 1e-07, Method: Composition-based stats.
Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 9/133 (6%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V+ ++ F+L T E++ + I N E +P I+ SF H+C PN
Sbjct: 127 VKKIIRAFRLENRTTPEDLELLLQIWIHNCFEQ-FEDPVGYVIYFMPSFSSHSCLPNAL- 184
Query: 121 SFTDSGQV-LLRAMKPIAPGDHLSICY---TDPLWGTINRRHHLQTSKYFICQCERCRDP 176
FTD +LR IA GD +++ Y D + T+ RR L +K FIC CERC P
Sbjct: 185 WFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKVLSETKDFICTCERCSAP 244
Query: 177 TELNTFYDGVKCP 189
+ F G +CP
Sbjct: 245 VD---FSRGFRCP 254
>gi|221484463|gb|EEE22759.1| hypothetical protein TGGT1_035070 [Toxoplasma gondii GT1]
Length = 859
Score = 62.0 bits (149), Expect = 1e-07, Method: Composition-based stats.
Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 9/133 (6%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V+ ++ F+L T E++ + I N E +P I+ SF H+C PN
Sbjct: 127 VKKIIRAFRLENRTTPEDLELLLQIWIHNCFEQ-FEDPVGYVIYFMPSFSSHSCLPNAL- 184
Query: 121 SFTDSGQV-LLRAMKPIAPGDHLSICY---TDPLWGTINRRHHLQTSKYFICQCERCRDP 176
FTD +LR IA GD +++ Y D + T+ RR L +K FIC CERC P
Sbjct: 185 WFTDEDHTFVLRTRADIAAGDEVTLTYLSEDDLMRPTLQRRKVLSETKDFICTCERCSAP 244
Query: 177 TELNTFYDGVKCP 189
+ F G +CP
Sbjct: 245 VD---FSRGFRCP 254
>gi|401401559|ref|XP_003881041.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115453|emb|CBZ51008.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 946
Score = 62.0 bits (149), Expect = 1e-07, Method: Composition-based stats.
Identities = 44/133 (33%), Positives = 64/133 (48%), Gaps = 9/133 (6%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V+ ++ F L + + EE+ + I N E +P I+ SF H+C PN
Sbjct: 127 VKKIIRSFDLEKRTSPEELELLLQIWIHNCFEQ-FEDPVGYVIYFMPSFSSHSCLPNAL- 184
Query: 121 SFTDSGQV-LLRAMKPIAPGDHLSICY---TDPLWGTINRRHHLQTSKYFICQCERCRDP 176
FTD +LR IA GD +++ Y D + T++RR L +K FIC CERC P
Sbjct: 185 WFTDEDHTFVLRTRADIAAGDEVTLTYLSEEDLMRPTLHRRRVLSETKDFICACERCAAP 244
Query: 177 TELNTFYDGVKCP 189
+ F G +CP
Sbjct: 245 VD---FSRGFRCP 254
>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
niloticus]
Length = 440
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 87/173 (50%), Gaps = 9/173 (5%)
Query: 13 PSYECIIALRCLYKKSHQPQ--IWDKLM---QLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
PS + R L KK Q + I +K++ ++++H E+ N K E ++ +
Sbjct: 109 PSELSRLVARILAKKKMQKERCICEKMLLIGEMQSHTEDEDNE-KREMTEADIAGLHRFY 167
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDS 125
K + +E+L + + N + E S++ A++ + I H+C P++ +F +
Sbjct: 168 SKHLDVPDHKELLTLFSQVACNGFTIEDDELSHLGTAVYPDMALINHSCLPSVIVTFKGT 227
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+RA++ + PGD + I Y D L+ T +R + L+ S YFIC C+ C+ ++
Sbjct: 228 -SAEVRAVQDMKPGDEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKSQSK 279
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 76/158 (48%), Gaps = 10/158 (6%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ H++++ + K + VQ ++F+ +++F + I I +I N +
Sbjct: 127 LQNHLDKFGDGEK--SSLMEDVQKFMDFWPSQSQQFGMQYISHIFSVISCNGFTLSDQRG 184
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ +AIF H+C+PN F ++G++ LRA+ I GD L++ Y D L T +
Sbjct: 185 LQAVGVAIFPNLCLTNHDCWPNCTVIF-NNGKIELRALGKINKGDELTVSYVDFLNLTED 243
Query: 156 RRHHLQTSKYFICQCERCRDPTE---LNTFYDGVKCPE 190
R L+ YF C CE C + T+ L DG PE
Sbjct: 244 RMEQLKKQYYFDCTCEHCTNKTKDALLLAVNDGESKPE 281
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 68/151 (45%), Gaps = 18/151 (11%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HVE + K D R V+ L F+ +++F + I I G+I NA +
Sbjct: 127 LQNHVESFDEEEK--KDLRVDVESFLEFWPAQSQQFGMQYISHIFGVINCNAFTLSDQRG 184
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLS 143
+ + IF H+C+PN F + ++ LRA+ I+PGD L+
Sbjct: 185 LQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELT 244
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+ Y D L + RR L+ YF C CE C+
Sbjct: 245 VSYVDFLNVSEERRKQLKKQYYFDCTCEHCK 275
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 78/157 (49%), Gaps = 10/157 (6%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPL 95
+LE HV + + E + ++ FL + + +++ T ++I I G+I N +
Sbjct: 112 ELEDHVADMQEDEIKEL-KVDIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRG 170
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ + +F + HNC+PN + G++ LR++ IA G+ L++ Y D L +
Sbjct: 171 LQAVGVGLFPNLCMVNHNCWPNC-TVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEE 229
Query: 156 RRHHLQTSKYFICQCERCRDPTE-----LNTFYDGVK 187
RR L+T +F CQC+ C++ T+ DGVK
Sbjct: 230 RRRLLKTQYFFDCQCDYCKNGTKDDLKLAGREVDGVK 266
>gi|389604014|ref|XP_003723152.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322504894|emb|CBZ14681.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 623
Score = 61.6 bits (148), Expect = 1e-07, Method: Composition-based stats.
Identities = 56/178 (31%), Positives = 81/178 (45%), Gaps = 28/178 (15%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y CI L + K Q + +++L +HV+E S V + + FK EE
Sbjct: 316 AYHCITTLAGMKAKR---QGHENVLKLTSHVDEVIQS------LHPVGPLMCDLFKGEEE 366
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR-------ASFIEHNCYPNLYKSFTDSG 126
T I I G+++ NA E +T+PS + + ASF H+C PN D+
Sbjct: 367 TTTLYI--IIGVLRCNALE--VTDPSGLGVAQALHVGNTIASFFNHSCMPNCA---IDTV 419
Query: 127 QVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERC----RDPTEL 179
+ + + I G+ LSI Y L W T RR L YF+C C RC +DP EL
Sbjct: 420 RHAIVTTRIIYVGEELSIAYIPQLYWPTRLRRERLSEGYYFVCHCPRCQRCDKDPFEL 477
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E KN N+VQ +L F+L+ +EI NAH + P
Sbjct: 147 LESHISEVDKNQLVLYAQMANLVQLILPSFELD----LKEITHTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F + +RA++PI+ + +SI Y + T+ R
Sbjct: 203 RPLGTGLYPVLSIINHSCVPNAVLIF-EGRTAYVRALQPISKNEEVSISYIETAATTMKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ YF C C RC +E + +G +C
Sbjct: 262 QDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC 292
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 48/93 (51%), Gaps = 1/93 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
E +Y A++ + H+C PN Y +F + + +++RA I G ++ YT G I
Sbjct: 203 EFNYRALYPLFGVMNHDCIPNSYYTFDEKTNHMVVRAAVDIPEGFEITTTYTKLFTGNIA 262
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R L+ K F C+C RC DPTE F G+ C
Sbjct: 263 RHLFLKMKKGFTCKCPRCSDPTEKGAFISGLYC 295
>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 497
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 79/169 (46%), Gaps = 12/169 (7%)
Query: 11 PHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFK 69
P PS LR L K HQ D+L + ++ + + + D +N+V +
Sbjct: 117 PTPSRA---VLRILLLKKHQEDQGDRLQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHE 173
Query: 70 LNE-EFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFIEHNCYPNLYKSFTD 124
++ E + +E++ G + +NA LT P Y A+ AS H+C PN F D
Sbjct: 174 YSKTELSLQEVVDCFGKLDINA--FTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDF-D 230
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
G+ LRA++ I G+ + + Y +P ++R+ L YF C+C RC
Sbjct: 231 KGKTWLRALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRC 279
>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 712
Score = 61.2 bits (147), Expect = 2e-07, Method: Composition-based stats.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++CG +Q N + ++ S I +IF AS+ H+C PNL + A++ I
Sbjct: 552 RMCGAVQCNNFGLFSSKESCIGVSIFPEASYFNHSCLPNLCRVMYRGNIAAFYALQSIRK 611
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L TS F C+C RC
Sbjct: 612 GEPLTICYVDVQEASTAERRRTLLTSYRFFCECRRC 647
>gi|66815955|ref|XP_641994.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
gi|60470035|gb|EAL68016.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
Length = 563
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 82/187 (43%), Gaps = 18/187 (9%)
Query: 17 CIIALRCLYKKS-----HQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN 71
C++ALR + + +Q + QLE +E N E N QF+ + N
Sbjct: 123 CLLALRIMIRNKIESDKYQETVGKLSNQLEKFIE--INKSFIEKYDINFQQFINQLQQEN 180
Query: 72 EE------FTQEEILKICGIIQVNAHEMPLTE----PSYIAIFDRASFIEHNCYPNLYKS 121
+ F ++E L+I I +N+ + P F + + + H+C PN++ +
Sbjct: 181 SDPSLLAIFNKDEFLQIISSIYINSFSGLSNDFNRKPISNGYFYKPALLNHSCEPNIFFT 240
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
D + +RA K I + + Y D L TI R+ L SK F+C+C RC D TE
Sbjct: 241 IKDKN-LEMRACKKIEKDEEIVDSYVDLLLPTIERQKILYNSKNFLCKCSRCSDSTEDER 299
Query: 182 FYDGVKC 188
+ + C
Sbjct: 300 YLSSIYC 306
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 52/102 (50%), Gaps = 2/102 (1%)
Query: 80 LKICGI-IQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIA 137
L GI I N ++ P S I ++ SFI H+C PN Y F + +R +KPI
Sbjct: 236 LNTIGIDIDPNQTTTKMSTPESSIGLYFLTSFINHSCDPNAYVQFPNDHTAEIRLLKPIN 295
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
PG+ ++I Y D I+RR L + F C+C +C + +L
Sbjct: 296 PGEEITISYADTSKDIIDRRSQLFENYGFNCECPKCLNELKL 337
>gi|47227584|emb|CAG09581.1| unnamed protein product [Tetraodon nigroviridis]
Length = 382
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/143 (26%), Positives = 74/143 (51%), Gaps = 5/143 (3%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPL 95
+LE H+ + + E + ++ FL + + +++ T ++I + G+I N +
Sbjct: 32 ELEDHIADLQEDELKEL-KVDIHNFLDYWPRNSKQHTIDDISHLFGVINCNGFTVSDQRG 90
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ + +F + H+C+PN + G++ LR++ IA G+ L++ Y D L +
Sbjct: 91 LQAVGVGLFPNLCLVNHDCWPNC-TVILNHGKIELRSLGKIAEGEELTVAYVDFLNLSEE 149
Query: 156 RRHHLQTSKYFICQCERCRDPTE 178
RR L+T +F CQCE CR+ T+
Sbjct: 150 RRRLLKTQYFFDCQCEHCRNRTK 172
>gi|432089341|gb|ELK23292.1| N-lysine methyltransferase SMYD2 [Myotis davidii]
Length = 391
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 86/161 (53%), Gaps = 18/161 (11%)
Query: 24 LYKKSHQPQI-WDKLM---QLEAHVEEYKNSPKYENDRRNVVQ----FLLNFFKLNEEF- 74
+ KKSH + +KL+ + E+H++ K +N++++++Q L +F+ + EF
Sbjct: 23 MRKKSHPERTPSEKLLTVKEFESHLD------KLDNEKKDLIQSDISALHHFYSKHLEFP 76
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
QE ++ + + N + E S++ AIF + + H+C PN+ ++ + +RA
Sbjct: 77 DQESLVALFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGT-LAEVRA 135
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ I PGD + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 136 VQEIGPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 176
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 59/118 (50%), Gaps = 8/118 (6%)
Query: 78 EILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
EILK+ G + NA + E I ++ + H+C PN SF ++ +R ++
Sbjct: 173 EILKLFGRVNCNAFSVANEFTNEAVGIGLYPEGALFNHDCDPNCVVSFKGR-EMQVRVVR 231
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT----FYDGVKC 188
I + L++ Y + L T RR L+ S +F C+C+RC+ T + + DG++C
Sbjct: 232 DIEVDEELTVSYVELLQSTKARRRELKESYFFDCECKRCKAATNGQSNEDWYLDGLRC 289
>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
Length = 497
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 79/169 (46%), Gaps = 12/169 (7%)
Query: 11 PHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFK 69
P PS LR L K HQ D++ + ++ + + + D +N+V +
Sbjct: 117 PTPSRA---VLRILLLKKHQEDQGDRMQRFDSLTSHLTETIRTKPDHFQNLVLCARAIHE 173
Query: 70 LNE-EFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFIEHNCYPNLYKSFTD 124
++ E + +E++ G + +NA LT P Y A+ AS H+C PN F D
Sbjct: 174 YSKTELSLQEVVDCFGKLDINA--FTLTTPFYDQIGAAVEPLASLCNHSCDPNAAVDF-D 230
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
G+ LRA++ I G+ + + Y +P ++R+ L YF C+C RC
Sbjct: 231 KGKTWLRALQHIEEGEQIFVSYVEPTDACLHRQAELSKRYYFECECPRC 279
>gi|401422457|ref|XP_003875716.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322491955|emb|CBZ27228.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 872
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 34/102 (33%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQV-LLRAMKPIA 137
++C Q N+ + T+ I I+ AS+ H+C PNL + G++ A++ IA
Sbjct: 671 RLCAAAQCNSFGVYDTQEHCIGFGIYPEASYFNHSCVPNLCRVMHHGGRIAAFYALRAIA 730
Query: 138 PGDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERCRDPTE 178
P + L+ICYTD + RR +L + F C CERC E
Sbjct: 731 PQEPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKAE 772
>gi|403221111|dbj|BAM39244.1| uncharacterized protein TOT_010000703 [Theileria orientalis strain
Shintoku]
Length = 501
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 4/88 (4%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY---TDPLWG 152
++P + I+D+ S++ H+C P+ TD +LRA K + PGD ++I Y +D L
Sbjct: 201 SDPDGLVIYDKISYLAHSCNPSCCWHHTDDENFVLRARKKLIPGDEITISYLGESDLLSP 260
Query: 153 TINRRHHLQTSKYFICQCERCRDPTELN 180
T RR LQ + +F C C+RC P +L+
Sbjct: 261 TFKRRTLLQ-NWHFFCTCDRCSLPVDLS 287
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL-RAMKPIA 137
+L+ G + E +Y A++ + + H+C PN Y +F + +++ RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ + F C+C RC DPTE + G+ C
Sbjct: 245 EGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLYC 295
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPLT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N E
Sbjct: 132 LQNHVEHFGEEEQKE-LRMDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSEQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + HNC+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQTRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNFSEERKKQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 1/111 (0%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL-RAMKPIA 137
+L+ G + E +Y A++ + + H+C PN Y +F + +++ RA I
Sbjct: 185 VLRTNGFDKTTDRTNDNQEFNYRALYPLFAVMNHDCIPNSYYTFEEKTNIMIVRAAVDIP 244
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
G ++ YT G I R L+ + F C+C RC DPTE + G+ C
Sbjct: 245 EGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCSDPTEKGAYISGLYC 295
>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
Length = 432
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 82/171 (47%), Gaps = 11/171 (6%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
PS + R L KK Q D + ++++H+E+ N K E ++ F
Sbjct: 101 PSETSRLVARILAKKKMQKGRCDSEKILLIGEMQSHLEDEDNEKK-ERTELDIAGLGRFF 159
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDS 125
K E + +++L + + N + E S++ A++ + I H+C P++ +F +
Sbjct: 160 SKHLEVPSHKDLLTLFSQVACNGFTIEDDELSHMGTAVYPDVALINHSCLPSVIVTFNGT 219
Query: 126 -GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
QV RA++ + PGD + I Y D L+ T +R L+ S YF C C C++
Sbjct: 220 LAQV--RAVQDMKPGDEVLISYIDLLYPTDDRNSRLRESYYFTCNCLECQN 268
>gi|323354728|gb|EGA86563.1| Set5p [Saccharomyces cerevisiae VL3]
Length = 526
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|225713426|gb|ACO12559.1| msta, isoform A [Lepeophtheirus salmonis]
Length = 505
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 82/182 (45%), Gaps = 11/182 (6%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQ----LEAHVEEYKNSPKYENDRRNVVQFLLNFFKL 70
Y I LR L + +P+ W+ +M+ + ++EY K+ + VV+ + + K
Sbjct: 146 YSIITTLRLLSLQESKPEFWNTIMKDMLDHNSDIQEY-TPEKWTSYEETVVKKIQSLTKG 204
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSY----IAIFDRASFIEHNCYPNLYKSFTDSG 126
N + + + ++ G+I N ++ L ++ S + H+C N
Sbjct: 205 NAD--TDTLTRLIGVIDTNGADLNLPHDGKHGKGTGLYLIYSNLNHSCVCNTKTINFQDR 262
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
++ +RA I+ G+ + Y P T+ RR ++ YF C C+RC DPTE ++ +
Sbjct: 263 RIEMRASVNISKGEQICNQYVRPNKCTLARRTTIKNKWYFDCCCKRCADPTECGSYLGAL 322
Query: 187 KC 188
C
Sbjct: 323 LC 324
>gi|6322001|ref|NP_012077.1| Set5p [Saccharomyces cerevisiae S288c]
gi|731577|sp|P38890.1|SET5_YEAST RecName: Full=Putative protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|458924|gb|AAB69736.1| Yhr207cp [Saccharomyces cerevisiae]
gi|207344478|gb|EDZ71606.1| YHR207Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256272756|gb|EEU07728.1| Set5p [Saccharomyces cerevisiae JAY291]
gi|285810113|tpg|DAA06900.1| TPA: Set5p [Saccharomyces cerevisiae S288c]
gi|392299021|gb|EIW10116.1| Set5p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 526
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|349578759|dbj|GAA23924.1| K7_Set5p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 526
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|323304666|gb|EGA58429.1| Set5p [Saccharomyces cerevisiae FostersB]
Length = 524
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|171704581|sp|A6ZTB4.1|SET5_YEAS7 RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|151944152|gb|EDN62445.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 526
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|190405982|gb|EDV09249.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|259146968|emb|CAY80224.1| Set5p [Saccharomyces cerevisiae EC1118]
gi|323348221|gb|EGA82471.1| Set5p [Saccharomyces cerevisiae Lalvin QA23]
gi|365765161|gb|EHN06674.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 526
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 63/121 (52%), Gaps = 11/121 (9%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLY-KS 121
F F K +EE E+ L + G +N + ++ SFI H+C PN Y +
Sbjct: 326 FCGAFPKASEEIDFEKFLTMIGTFNINQYNGQ--------VYHWISFINHDCEPNAYIEQ 377
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
+ ++ L A KPI G+ + I Y +PL G RR L+ + F+CQC+RC++ EL+T
Sbjct: 378 VEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRRRELRVNWGFLCQCDRCQN--ELST 435
Query: 182 F 182
F
Sbjct: 436 F 436
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 1/93 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
E +Y A++ + H+C PN Y +F + + +++RA I G ++ YT G I
Sbjct: 203 EFNYRALYPLFGVMNHDCIPNSYYTFEEKTNNMIVRAAVDIPEGFEITTTYTKLFTGNIA 262
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
R L+ K F C+C RC DPTE + G+ C
Sbjct: 263 RHLFLKMKKNFTCKCPRCSDPTEKGAYISGLYC 295
>gi|290975196|ref|XP_002670329.1| predicted protein [Naegleria gruberi]
gi|284083887|gb|EFC37585.1| predicted protein [Naegleria gruberi]
Length = 626
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 49/98 (50%), Gaps = 9/98 (9%)
Query: 102 AIFDRASFIEHNCYPNL-YKSFTD-------SGQVLL-RAMKPIAPGDHLSICYTDPLWG 152
+F S +H+C PN +++F D SG V+L + +K + G+ L I Y D +
Sbjct: 295 GLFPMVSMFDHSCSPNCSFQTFDDMKSRNSYSGNVILVQTVKKVKKGEELCISYIDIMNP 354
Query: 153 TINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
T RR L SKYF+C+C RC TE N C E
Sbjct: 355 TCIRRRELWYSKYFVCRCSRCMSETEENRMVRAYSCGE 392
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|332239335|ref|XP_003268859.1| PREDICTED: SET and MYND domain-containing protein 1 [Nomascus
leucogenys]
Length = 512
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 154 LQNHVEHFGEEEQKE-LRVDVDTFLQYWLPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 212
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 213 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 272
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 273 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 314
>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
dendrobatidis JAM81]
Length = 578
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 6/115 (5%)
Query: 65 LNFFKLNEEF-TQEEILKICGIIQVNAHEM-----PLTEPSYIAIFDRASFIEHNCYPNL 118
L+ K+N+ +++EI+K+ I N+H + + + +F + + H+C PN
Sbjct: 205 LSHLKINDSLKSKQEIIKLACRINSNSHGVYDATGNKSGEIGVGMFPFVAMLNHSCAPNC 264
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ SGQ+L+R + ++PG L + Y D RR L T+K F C C RC
Sbjct: 265 AFVTSQSGQMLVRTLTQVSPGTELCVGYIDLFTPRWERRGKLLTTKLFWCTCIRC 319
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
P+ +A R L+K + + + +L+ HV+ + K D R V+ L F
Sbjct: 100 PTENIRLAARILWKMEREGSGLSEGCLVAIEELQNHVDSFGEEEK--KDLRADVESFLEF 157
Query: 68 FKLN-EEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFT 123
+ ++F + I I G+I N + + + IF H+C+PN F
Sbjct: 158 WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVVFN 217
Query: 124 DSG------------QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
+ ++ LRA+ I+PGD L++ Y D L + RR L+ YF C CE
Sbjct: 218 NGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCE 277
Query: 172 RCR 174
C+
Sbjct: 278 HCK 280
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
P+ +A R L+K + + + +L+ HV+ + K D R V+ L F
Sbjct: 100 PTENIRLAARILWKMEREGSGLSEGCLVAIEELQNHVDSFGEEEK--KDLRADVESFLEF 157
Query: 68 FKLN-EEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFT 123
+ ++F + I I G+I N + + + IF H+C+PN F
Sbjct: 158 WPPQCQQFGMQLISHIFGVISCNGFTLSDQRGLQAVGVGIFPNLCQANHDCWPNCTVVFN 217
Query: 124 DSG------------QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
+ ++ LRA+ I+PGD L++ Y D L + RR L+ YF C CE
Sbjct: 218 NGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCE 277
Query: 172 RCR 174
C+
Sbjct: 278 HCK 280
>gi|195431056|ref|XP_002063564.1| GK21977 [Drosophila willistoni]
gi|194159649|gb|EDW74550.1| GK21977 [Drosophila willistoni]
Length = 487
Score = 60.1 bits (144), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 41/79 (51%)
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN + + + L A + G+ + YT+PL GT R+H L+ ++ C C
Sbjct: 208 HHCVPNTVVALDEEFNLKLYAAVDLKSGEVIYNSYTNPLMGTSQRQHQLRLTRRLECSCS 267
Query: 172 RCRDPTELNTFYDGVKCPE 190
RC DPTE T VKC E
Sbjct: 268 RCLDPTECGTNMSSVKCRE 286
>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
Length = 486
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 38/66 (57%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
SFI H+C PN+ D ++ + A KPI G+ L Y +PL G RR L+ + F+
Sbjct: 341 SFINHDCEPNVRYEIDDKLRLKVFARKPIKAGEELLTTYVNPLHGVKLRRRELRVNWGFL 400
Query: 168 CQCERC 173
CQCERC
Sbjct: 401 CQCERC 406
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 68/151 (45%), Gaps = 18/151 (11%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HVE + K D R V+ L F+ +++F + I I G+I NA +
Sbjct: 127 LQNHVENFDEEEK--KDLRIDVESFLEFWPAQSQQFGMQYISHIFGVINCNAFTLSDQRG 184
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLS 143
+ + IF H+C+PN F + ++ LRA+ I+PGD L+
Sbjct: 185 LQAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELT 244
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+ Y D L + R+ L+ YF C CE C+
Sbjct: 245 VSYVDFLNVSEERQKQLKKQYYFDCTCEHCK 275
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 59.7 bits (143), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 75/164 (45%), Gaps = 17/164 (10%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---L 95
+L+ HVE + + E R +V FL + +++F+ + I + G+I N +
Sbjct: 131 ELQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHVFGVINCNGFTLSDQRG 189
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLS 143
+ + IF + HNC+PN F + ++ LRA+ I+ G+ L+
Sbjct: 190 LQAVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 250 VSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 82/190 (43%), Gaps = 13/190 (6%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF---- 68
P+ +A R +++ + + + E S E+D + + + NF
Sbjct: 102 PNENIRLAARIMWRLDKEGGVMSDMQMTSLDDLENHISDMPEDDLKELKVDIHNFLDYWP 161
Query: 69 KLNEEFTQEEILKICGIIQVNA---HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
+ +++ ++I I G+I N + + +F + H+C+PN +
Sbjct: 162 RNSKQHRVDDISHIFGVINCNGFTVSDQRGLNAVGVGLFPNLCLVNHDCWPNC-TVILNH 220
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-----DPTELN 180
G++ LRA+ I PG+ +++ Y D L T +R+ L+ +F C CE C+ D
Sbjct: 221 GKIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHCKGHIKDDLKMGG 280
Query: 181 TFYDGVKCPE 190
DGVK PE
Sbjct: 281 KVVDGVKVPE 290
>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
Length = 432
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H+E K +N++R+++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLE------KLDNEKRDLIQSDVA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDTDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F CQC+ C
Sbjct: 215 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQEC 267
>gi|146087252|ref|XP_001465771.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134069871|emb|CAM68198.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 857
Score = 59.3 bits (142), Expect = 7e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIA 137
++C Q N+ + T+ I ++ AS+ H+C PNL + G++ A++ IA
Sbjct: 656 RLCAAAQCNSFGVYDTQDHCIGFGMYPEASYFNHSCVPNLCRVMHHGGRIAAFYALRAIA 715
Query: 138 PGDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERCRDPTE 178
P + L+ICYTD + RR +L + F C CERC E
Sbjct: 716 PQEPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKAE 757
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 74/151 (49%), Gaps = 7/151 (4%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNE-EFTQEEILKICGIIQVNA---H 91
K+ +LE H+ + +N+ + E V F ++F +E + EE+ + II NA +
Sbjct: 144 KIEELEDHLSK-RNAEEKEAIDEKVYSFG-DYFTYDEMPDSDEEMAHLFAIIDCNAIGLN 201
Query: 92 EMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW 151
+ + + I+ S + H+C PN + + ++ +RA++ I PG+ L I Y D L
Sbjct: 202 DHRGVQTIGVGIYPGISMLNHDCSPNCV-AMNNGPRLEVRALRVIQPGEELCISYIDSLE 260
Query: 152 GTINRRHHLQTSKYFICQCERCRDPTELNTF 182
T RR L+ YF C+C+ C EL
Sbjct: 261 TTEKRREKLKLQYYFDCECDTCTKGEELENL 291
>gi|398015614|ref|XP_003860996.1| hypothetical protein, conserved [Leishmania donovani]
gi|322499220|emb|CBZ34291.1| hypothetical protein, conserved [Leishmania donovani]
Length = 857
Score = 59.3 bits (142), Expect = 8e-07, Method: Composition-based stats.
Identities = 33/102 (32%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIA 137
++C Q N+ + T+ I ++ AS+ H+C PNL + G++ A++ IA
Sbjct: 656 RLCAAAQCNSFGVYDTQDHCIGFGMYPEASYFNHSCVPNLCRVMHHGGRIAAFYALRAIA 715
Query: 138 PGDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERCRDPTE 178
P + L+ICYTD + RR +L + F C CERC E
Sbjct: 716 PQEPLTICYTDVEQLNSAERRRNLLGTYRFFCMCERCSGKAE 757
>gi|398012025|ref|XP_003859207.1| hypothetical protein, conserved [Leishmania donovani]
gi|322497420|emb|CBZ32495.1| hypothetical protein, conserved [Leishmania donovani]
Length = 622
Score = 59.3 bits (142), Expect = 8e-07, Method: Composition-based stats.
Identities = 53/177 (29%), Positives = 82/177 (46%), Gaps = 28/177 (15%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y CI L + K+ +P ++++ L AHV+E S + + + FK EE
Sbjct: 315 AYHCITTLAGI--KAKRPG-HERVLTLTAHVDEVIQS------LHPIGPLMSDLFKGEEE 365
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR-------ASFIEHNCYPNLYKSFTDSG 126
+ I G++ NA E +T+PS + + ASF H+C PN D+
Sbjct: 366 --TATLYTIIGVLCCNALE--VTDPSGLGVAQALHAGNSIASFFNHSCTPNCA---IDTV 418
Query: 127 QVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERC----RDPTE 178
+ + + I G+ LSI Y L W T RR L YF+C+C+RC +DP E
Sbjct: 419 RHAIITTRTIHVGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFE 475
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 59.3 bits (142), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 74/166 (44%), Gaps = 16/166 (9%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
Y D L + RR L+ YF C CE C+ + + F + P+
Sbjct: 251 SYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAAKEDPK 296
>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
Y34]
Length = 581
Score = 59.3 bits (142), Expect = 8e-07, Method: Composition-based stats.
Identities = 39/144 (27%), Positives = 73/144 (50%), Gaps = 10/144 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF-FKLNEEFTQEEILKICGIIQVNAHE 92
W +MQLE+H EE + E+ R +++ + + ++ + + ++ II N+
Sbjct: 176 WKAIMQLESHFEE-----QNEDLRNDILAKVHGWGRRMGTDVDFMTVWELLCIIFTNSMN 230
Query: 93 MPLTEP--SYIAIFDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+ E S+ D S I H+C PN + F + QV +R++K IA G +++ Y DP
Sbjct: 231 VRPAEADVSFGTSLDLVVSMINHDCSPNAH-VFFEGSQVRVRSLKAIAAGGEITVSYCDP 289
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
+ R+ L+ +++F C+C C
Sbjct: 290 RLDVLLRQEILRQTQFFHCECTTC 313
>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
P131]
Length = 582
Score = 59.3 bits (142), Expect = 8e-07, Method: Composition-based stats.
Identities = 39/144 (27%), Positives = 73/144 (50%), Gaps = 10/144 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF-FKLNEEFTQEEILKICGIIQVNAHE 92
W +MQLE+H EE + E+ R +++ + + ++ + + ++ II N+
Sbjct: 176 WKAIMQLESHFEE-----QNEDLRNDILAKVHGWGRRMGTDVDFMTVWELLCIIFTNSMN 230
Query: 93 MPLTEP--SYIAIFDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+ E S+ D S I H+C PN + F + QV +R++K IA G +++ Y DP
Sbjct: 231 VRPAEADVSFGTSLDLVVSMINHDCSPNAH-VFFEGSQVRVRSLKAIAAGGEITVSYCDP 289
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
+ R+ L+ +++F C+C C
Sbjct: 290 RLDVLLRQEILRQTQFFHCECTTC 313
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 59.3 bits (142), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/179 (25%), Positives = 78/179 (43%), Gaps = 12/179 (6%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
IIALR L + H+ Q +M + H +E ++ + + ++ +
Sbjct: 170 IIALRSL-SEGHRVQFERGVMAMMGHEDECPQ--QWVDSITETAKLVMPLLPKPARLSAR 226
Query: 78 EILKICGIIQVNAH-EMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRA 132
E +K+C I N+H + + P I ++ AS I H+C PN F + G L +R
Sbjct: 227 EFVKVCARINTNSHRQHHMFVPQRILGVGLYPLASLINHSCQPNC--GFYNRGPTLYIRT 284
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT--ELNTFYDGVKCP 189
+ + G+ L Y D R+ L +K+F C C RC P ++ + G +CP
Sbjct: 285 LCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRCSPPITDSVDRYLSGFQCP 343
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 59.3 bits (142), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 74/166 (44%), Gaps = 16/166 (9%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 127 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 186 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 245
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
Y D L + RR L+ YF C CE C+ + + F + P+
Sbjct: 246 SYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAAKEDPK 291
>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
Length = 508
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 8/105 (7%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
+E F+ ++ L G +N E I+ SFI HNC PN+ + +
Sbjct: 322 DEGFSLDDFLLDIGKFNINQLEG--------QIYHLYSFINHNCEPNIRYEIDSKLCLRV 373
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
A KPI PG+ L Y +PL G RR L+ + F+CQC RC +
Sbjct: 374 FARKPIQPGEELLTTYVNPLHGVKLRRRALKVNWGFLCQCARCEN 418
>gi|1809322|gb|AAC53020.1| t-BOP [Mus musculus]
Length = 456
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 74/166 (44%), Gaps = 16/166 (9%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 98 LQNHVEHFGEEEQKE-LRVDVDTFLQYWPLQSQQFSMQYISHIFGVINCNGFTLSDQRGL 156
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 157 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 216
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
Y D L + RR L+ YF C CE C+ + + F + P+
Sbjct: 217 SYIDFLHLSEERRRQLKKQYYFDCSCEHCQKGLKDDLFLAAKEDPK 262
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 70/152 (46%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E KN N+VQ +L F+L+ +EI NAH + P
Sbjct: 147 LESHISEVDKNQLVLYAQMANLVQLILPSFELD----LKEITHTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
++ S I H+C PN F + +RA++PI+ + +SI Y + T+ R
Sbjct: 203 RSLGTGLYPVLSIINHSCVPNAVLIF-EGRTAYVRALQPISKNEEVSISYIETAATTMKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ YF C C RC +E + +G +C
Sbjct: 262 QDDLK-HYYFTCTCPRCVKDSEEDALLEGYRC 292
>gi|66362640|ref|XP_628286.1| protein with MYND plus SET domains plus 5 Ank repeats
[Cryptosporidium parvum Iowa II]
gi|46229755|gb|EAK90573.1| protein with MYND plus SET domains plus 5 Ank repeats
[Cryptosporidium parvum Iowa II]
Length = 1560
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 51/97 (52%), Gaps = 1/97 (1%)
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWG- 152
P+ + + F++AS +H+C P+ D ++ +R+ I +L+I Y L+
Sbjct: 401 PVPVSTGLGFFEKASKFQHSCIPSCVYYMNDEHKLCIRSAYNIPENANLTISYILDLYQP 460
Query: 153 TINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
T R+ + + K F C C RC D +E N F +G++CP
Sbjct: 461 TFKRKSIINSPKVFACMCSRCLDDSENNLFLEGIRCP 497
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 70/158 (44%), Gaps = 16/158 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + ++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKE-LRVDVDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
Y D L + RR L+ YF C CE C+ + + F
Sbjct: 251 SYIDFLHLSEERRRQLRKQYYFDCSCEHCQKGLKDDLF 288
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 71/158 (44%), Gaps = 16/158 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
Y D L + RR L+ YF C CE C+ + + F
Sbjct: 251 SYIDFLHLSEERRQQLKKQYYFDCSCEHCQKGLKDDLF 288
>gi|290462845|gb|ADD24470.1| Protein msta, isoform A [Lepeophtheirus salmonis]
Length = 486
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/190 (23%), Positives = 83/190 (43%), Gaps = 4/190 (2%)
Query: 2 KIKISNFTTPHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNS-PKYENDRRNV 60
K K +F +PH Y I LR + + + QL H ++ N P+ E + +
Sbjct: 138 KPKFKDFESPHFIYNVITILRLILEMQRNEDNRILVNQLMTHHDQRVNKFPEVEIGIKRI 197
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
FL++ + +++++ G+++ NA M + + ++ S I H+C NL
Sbjct: 198 TSFLMDDLNI-PNVNGDDVIQAFGVMKTNA--MTFGDGNGRGLYPIGSIISHSCVSNLEP 254
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELN 180
++LR+ I G+ L+I Y + L +F +CERC+DP+++
Sbjct: 255 LTEPMECLVLRSKISIKKGNELTIRYMSFWSHRTQLKDSLNNQWFFNYECERCQDPSDMK 314
Query: 181 TFYDGVKCPE 190
T C +
Sbjct: 315 TMLSSELCSK 324
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + RR L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNLSEERRKQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 70/158 (44%), Gaps = 16/158 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + ++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDAFLQYWPPQGQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
Y D L + RR L+ YF C CE C+ + + F
Sbjct: 251 SYIDFLHLSEERRRQLRKQYYFDCSCEHCQKGLKDDLF 288
>gi|325188357|emb|CCA22894.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 528
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/116 (29%), Positives = 58/116 (50%), Gaps = 4/116 (3%)
Query: 78 EILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMK 134
+I K+ ++Q N + L EP I ++ +A+ + H+C PN F Q ++ ++
Sbjct: 202 QITKLFCMVQCNVFTICNTLNEPLGIGLYPKAAMLNHSCTPNCIVRFDLTRKQAVIHTIE 261
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
I G L+I Y D T R+ L++S +F C C+RC E + F +G C +
Sbjct: 262 TIHSGQELTISYVDIFDPTTIRKKRLRSSYFFDCTCQRCTSSQE-DAFLEGFLCTQ 316
>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
Length = 491
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 67/150 (44%), Gaps = 16/150 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HV+ + K E R +V FL + +++F + I I G+I N +
Sbjct: 132 LQNHVDSFDEEEKKEL-RADVESFLEFWPPHSQQFGMQYISHIFGVISCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF H+C+PN F + ++ LRA+ I+PGD L++
Sbjct: 191 QAVGVGIFPNLCQANHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALTKISPGDELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCR 174
Y D L + RR L+ YF C CE C+
Sbjct: 251 SYVDFLNVSEERRKQLKKQYYFDCTCEHCK 280
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 72/159 (45%), Gaps = 18/159 (11%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HV+ + + + D R V LNF+ +++F+ + I I G+I NA +
Sbjct: 132 LQNHVDNF--GEEEQKDLRVDVDSFLNFWPAQSQQFSMQYISHIFGVINCNAFTLSDQRG 189
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLS 143
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L+
Sbjct: 190 LQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
+ Y D L + R+ L+ YF C CE C + + F
Sbjct: 250 VSYVDFLNVSQERKKQLKKQYYFDCTCEHCEKGIKDDLF 288
>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 647
Score = 58.5 bits (140), Expect = 1e-06, Method: Composition-based stats.
Identities = 49/198 (24%), Positives = 84/198 (42%), Gaps = 38/198 (19%)
Query: 22 RCLYKKSHQ----PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF--- 74
R ++KK + P W L +L+ H+ EY ++ K + +V L+ + N+E
Sbjct: 209 RLIWKKRKEEKLNPDWWTGLSELQHHLNEYNSTQKESLMQLSVT---LSRYVGNQELLAV 265
Query: 75 --TQEEILKICGIIQVNAHEMP--LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ--- 127
+ +L +C N+ + + + + A+FI H+C PN F + G+
Sbjct: 266 FESASALLPLCSRFIDNSFTLTSIILDQIGVVFVPSAAFINHSCNPNAVVVFPEGGEGAG 325
Query: 128 -------VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR------ 174
V + A+KPI PG+ + Y D RR+ L F+C C+ C+
Sbjct: 326 STAGKEWVKVIAIKPIEPGEEIVTSYIDSAGTRQERRNELVKRYKFVCDCQACQSDWNDQ 385
Query: 175 --DPTELNTFYDGVKCPE 190
DP E +KCP+
Sbjct: 386 QVDPRE------ALKCPK 397
>gi|345802872|ref|XP_537149.3| PREDICTED: N-lysine methyltransferase SMYD2 [Canis lupus
familiaris]
Length = 523
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 191 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 244
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF E L + + N + E S++ AIF + + H+C PN+
Sbjct: 245 ALHHFYSKHLEFPDSESLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 304
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F CQC+ C
Sbjct: 305 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQEC 357
>gi|428672870|gb|EKX73783.1| conserved hypothetical protein [Babesia equi]
Length = 492
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY---TDPLWG 152
T+P + I+DR SF H+C + TD +LRA K + PGD ++I Y +D L
Sbjct: 191 TDPDGLVIYDRISFTAHSCDASCCWYHTDQDYFVLRARKRLLPGDEITISYLGESDLLAA 250
Query: 153 TINRRHHLQTSKYFICQCERCRDPTELN 180
T RR L+ +F CQC RC + +++
Sbjct: 251 TYKRRELLENWHFF-CQCNRCSESLDVS 277
>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
Length = 433
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K++ P +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQTSHPDRTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIS 154
Query: 63 FLLNFFKLNEEF-TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ + EF E ++ + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHRFYSKHLEFPDNESLVTLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQAISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRH 158
+ + +F I H+C PN S + +RA++ I G+ +++ Y D R+
Sbjct: 161 AAVGLFSICGLINHSCQPNCTWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSERQK 220
Query: 159 HLQTSKYFICQCERCRDP--TELNTFYDGVKCP 189
L+ +K+F CQCERC P ++ DG +CP
Sbjct: 221 ELRDTKHFDCQCERCSTPLSESVDRVLDGFRCP 253
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + ++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RLDVDTFLQYWPPQGQQFSMQYISHIFGVISCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + +R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNLSEDRKKQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|401417428|ref|XP_003873207.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489435|emb|CBZ24694.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 622
Score = 58.5 bits (140), Expect = 1e-06, Method: Composition-based stats.
Identities = 50/173 (28%), Positives = 79/173 (45%), Gaps = 24/173 (13%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y CI L + K Q ++++ L AHV+E S + + + FK E+
Sbjct: 315 AYHCITTLAGIKAKR---QGHERVLTLTAHVDEVIQS------LHPIGPLMCDLFKGEED 365
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR-------ASFIEHNCYPNLYKSFTDSG 126
+ I G++ NA E +T+PS + + ASF H+C PN D+
Sbjct: 366 --TPTLYTIIGVLCCNALE--VTDPSGLGVAQALHAGNSIASFFNHSCAPNCA---IDTV 418
Query: 127 QVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERCRDPTE 178
+ + + I G+ LSI Y L W T RR L YF+C+C+RC+ T+
Sbjct: 419 RHAIVTTRTIHVGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSGTK 471
>gi|401625526|gb|EJS43529.1| set5p [Saccharomyces arboricola H-6]
Length = 526
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 73/152 (48%), Gaps = 21/152 (13%)
Query: 42 AHVEEYKNSPKYENDRRNVVQ----------FLLNFFKLNEEFTQEEILKICGIIQVNAH 91
H E+ ++ + D +N+ F F K +EE ++ L + G +N +
Sbjct: 295 VHTEDGSDNSSNKGDEKNIDDEIVWEKCYELFCNAFPKASEEIDMDKFLTMIGTFNINQY 354
Query: 92 EMPLTEPSYIAIFDRASFIEHNCYPNLY-KSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
I+ SF+ H+C PN Y + + ++ L A +PI G+ + + Y +PL
Sbjct: 355 NEQ--------IYHWHSFMNHDCEPNAYIEQAEEHDELKLHARRPIKKGEQICVTYVNPL 406
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
G RR L+ + F+CQC+RC++ EL+TF
Sbjct: 407 HGVRLRRRELRVNWGFLCQCDRCQN--ELSTF 436
>gi|255074237|ref|XP_002500793.1| set domain protein [Micromonas sp. RCC299]
gi|226516056|gb|ACO62051.1| set domain protein [Micromonas sp. RCC299]
Length = 701
Score = 58.5 bits (140), Expect = 1e-06, Method: Composition-based stats.
Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 9/88 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ---------VLLRAMKPIAPGDHLSICYTDPLWG 152
A++ S H+C PN SF + V +RA+KP+ GD L+I Y DP G
Sbjct: 273 AVYRTLSRFNHSCAPNCTWSFVHESESKPGGGAVSVNVRAIKPVKSGDELTISYVDPTVG 332
Query: 153 TINRRHHLQTSKYFICQCERCRDPTELN 180
RR L F C C C P ++N
Sbjct: 333 RAERREQLWAKYRFECACHLCVGPRDVN 360
>gi|330812969|ref|XP_003291388.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
gi|325078448|gb|EGC32099.1| hypothetical protein DICPUDRAFT_98991 [Dictyostelium purpureum]
Length = 824
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 63/139 (45%), Gaps = 13/139 (9%)
Query: 61 VQFLLNFFK-------LNEEFTQEEILKICGIIQVNAHEMP--LTEPSYIAIFDRASFIE 111
V L+N F+ L + FT+EEI +I I N H + + + +
Sbjct: 485 VTTLMNLFEDIYGIDFLGDSFTREEIYEIYTIFLSNGHTVSNIFGKGYGHGFYPLGCYFN 544
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN+ D+G + RA + I+ G+ L + Y D +RR HL F CQC
Sbjct: 545 HSCVPNVTWYVDDNGMLAFRATRTISQGEELFLTYYDMFSTLDDRRPHLLDQYRFFCQCP 604
Query: 172 RCRDPTELNTFYDGVKCPE 190
+C ++L+ G+ C +
Sbjct: 605 QCEFQSKLS----GLTCNQ 619
>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
Length = 527
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
A+ H+C PN SF D G+ +RA KPIA G+ + I Y D + RRH L+ F
Sbjct: 221 AARFNHSCNPNASYSF-DKGKCYIRATKPIAKGEQIFIPYIDTTYSVGTRRHELKERYKF 279
Query: 167 ICQCERC 173
CQC RC
Sbjct: 280 DCQCPRC 286
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HV+ + + E R +V FL + ++ F+ + I I G+I N +
Sbjct: 132 LQNHVDHFGEEEQKEL-RADVDTFLQYWPPQSQHFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + RR L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERRRQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|339897286|ref|XP_003392344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321399152|emb|CBZ08492.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 622
Score = 58.2 bits (139), Expect = 2e-06, Method: Composition-based stats.
Identities = 52/177 (29%), Positives = 79/177 (44%), Gaps = 28/177 (15%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y CI L + K + +++ L AHV+E S + + + FK EE
Sbjct: 315 AYHCITTLAGIKAKRAGHE---RVLTLTAHVDEVIQS------LHPIGPLMSDLFKGEEE 365
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR-------ASFIEHNCYPNLYKSFTDSG 126
+ I G++ NA E +T+PS + + ASF H+C PN D+
Sbjct: 366 --TATLYTIIGVLCCNALE--VTDPSGLGVAQALHAGNSIASFFNHSCTPNCA---IDTV 418
Query: 127 QVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERC----RDPTE 178
+ + + I G+ LSI Y L W T RR L YF+C+C+RC +DP E
Sbjct: 419 RHAIITTRTIHVGEELSIAYIPQLYWPTRLRRERLSEGYYFVCRCQRCQSSNKDPFE 475
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVISCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 126 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 184
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 185 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 244
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 245 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 286
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 74/157 (47%), Gaps = 8/157 (5%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ H++++ + K +V +FL + +++F + I I +I N +
Sbjct: 127 LQNHIDKFDEAEKGLL-MEDVQKFLEYWPSQSQQFGMQYISHIFSVISCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + IF H+C+PN F ++G++ LRA+ I G+ L++ Y D L T +R
Sbjct: 186 QAVGVGIFPNLCLANHDCWPNCTVIF-NNGKIELRALGKINKGEELTVSYVDFLNLTEDR 244
Query: 157 RHHLQTSKYFICQCERCRDPTE---LNTFYDGVKCPE 190
+ L+ YF C CE C T+ L DG PE
Sbjct: 245 KAQLKKQYYFDCTCEHCTKKTKDALLLAVKDGEDKPE 281
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKE-LRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKKQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 127 LQNHVEHFGEEEQKEL-RVDVDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 186 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 245
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 246 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKRLKDDLFL-GVK 287
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 65/137 (47%), Gaps = 15/137 (10%)
Query: 57 RRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHN 113
+ +V F + K ++ ++ + + G+I N + + I +F + H+
Sbjct: 148 KADVKTFYTYWPKKSKAVGEDYVSHLFGVISCNGFTLSDQRGLQSVGIGLFPNLCLVNHD 207
Query: 114 CYPN------------LYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
C+PN L SF S ++ LRA++PI+ G L++ Y D L + +R+ LQ
Sbjct: 208 CWPNCTVILNHGDQSALDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQ 267
Query: 162 TSKYFICQCERCRDPTE 178
YF C+CE C + T+
Sbjct: 268 QQYYFDCKCEHCVNGTK 284
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 72/159 (45%), Gaps = 18/159 (11%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFK-LNEEFTQEEILKICGIIQVNAHEMP---L 95
L+ HV+ + + + D R V LNF+ +++F+ + I I G+I NA +
Sbjct: 132 LQNHVDNF--GEEEQKDLRMDVDSFLNFWPPQSQQFSMQYISHIFGVINCNAFTLSDQRG 189
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLS 143
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L+
Sbjct: 190 LQAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELT 249
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
+ Y D L + R+ L+ YF C CE C + + F
Sbjct: 250 VSYIDFLNISQERKKQLKKQYYFDCTCEHCEKGIKDDLF 288
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/199 (23%), Positives = 89/199 (44%), Gaps = 16/199 (8%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRN 59
I + P PS +A R L++ Q + L LE H+ + + ++ + +
Sbjct: 88 IKTYGKP-PSENVRLAARILWRMDKQGSVVSDNQLTTLEDLEDHICDI-SEDDLKDFKVD 145
Query: 60 VVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPLTEPSYIAIFDRASFIEHNCYP 116
+ FL + + ++ T + + I G+I N + + + +F + H+C+P
Sbjct: 146 IHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWP 205
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N ++G++ LRA+ I+ G+ +++ Y D L + +R+ L+ +F C C+ C
Sbjct: 206 NC-TVILNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTEK 264
Query: 175 ---DPTELNTFYDGVKCPE 190
D DGVK PE
Sbjct: 265 IKDDLKMAGAEVDGVKVPE 283
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISVGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 74/161 (45%), Gaps = 10/161 (6%)
Query: 25 YKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFKLN------EEFTQE 77
+ K+ P I+D L + N + + + Q ++N FKL E T +
Sbjct: 848 WSKNGVPFIFDSYDDLTHLTTDTSNIDRKQMESFETATQSIINVFKLAKGPKSLEPLTNQ 907
Query: 78 EILKICGIIQVNAHEM--PLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+IL + + NAHE PL I+ A+++ H+C PN ++G + R+++
Sbjct: 908 QILDLYPKMLFNAHEYIDPLYHSEVARGIYPTAAYLNHSCEPNTVWHNDNNGMIAYRSIR 967
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
I G+ ++ Y D R+ +L + F+CQC RC+D
Sbjct: 968 DIKAGEEITTTYIDITKYKSTRQLNLLSQYAFLCQCARCQD 1008
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKKQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RLDVDMFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + +F + + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKKQLKRQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|239791604|dbj|BAH72247.1| ACYPI001902 [Acyrthosiphon pisum]
Length = 241
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 30/43 (69%)
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
YT+ LWGT RR HL ++KYF C+C+RC D TEL T + + C
Sbjct: 2 YTNALWGTRERRAHLLSTKYFKCKCKRCSDATELGTNFSTIVC 44
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQREL-RVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + +F + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGVFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|157869726|ref|XP_001683414.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68126479|emb|CAJ04438.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 856
Score = 57.8 bits (138), Expect = 2e-06, Method: Composition-based stats.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 4/102 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIA 137
++C +Q N+ + T+ I I+ AS+ H+C PNL + ++ A++ IA
Sbjct: 655 RLCAAVQCNSFGVYDTQDHCIGFGIYPEASYFNHSCVPNLCRVMHHGSRIAAFYALRAIA 714
Query: 138 PGDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ L+ICYTD + RR +L ++ F C CERC E
Sbjct: 715 AQEPLTICYTDVEQLNSAERRRNLLSTYRFFCMCERCSGKAE 756
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 674
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 6/82 (7%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW--- 151
EP Y+ AI+ S + H+CYPN+ + SG V++R+++ I G + CY P W
Sbjct: 474 EPRYVGGAIYPSVSLVNHSCYPNVVRHSYPSGVVVVRSLRFIGKGTEIVDCY-GPHWLSE 532
Query: 152 GTINRRHHLQTSKYFICQCERC 173
G + RR +L F+C CE C
Sbjct: 533 GRLPRREYLWKKYRFLCACEAC 554
>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
Length = 505
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
SF+ H+C PN + F S Q+ +R++KPI GD +++ YT+ G + R+ L F
Sbjct: 222 GSFVNHSCDPNAFVFFEGS-QLRMRSLKPINAGDEITLTYTELREGVLMRKRKLHRGYSF 280
Query: 167 ICQCERCR 174
C+C RC+
Sbjct: 281 FCRCNRCK 288
>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
Length = 583
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 77/168 (45%), Gaps = 18/168 (10%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
Y +I +R KK PQ L +LE V ++ K R +LN + +E
Sbjct: 204 YRLLILMR---KKVLLPQQVKALARLENEVANFEKRTKKSWPR------VLNLAREAKER 254
Query: 75 TQEE-----ILKICGIIQVNAHEMPLT---EPSYIAIFDRASFIEHNCYPNLYKSFTDSG 126
T+ E +L + GI++ N+ + T P IA+ + + H C PN+ F +S
Sbjct: 255 TKSELSIGEVLMLYGIVRCNSLPVDQTFRNAPLGIALDMGGALLNHCCDPNVVIVF-NST 313
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
QV +RA++ I G+ L CY D + R + F CQC+RC+
Sbjct: 314 QVQVRALRKIKDGEELLHCYRDIAYDFTFRNPRITARYQFNCQCDRCK 361
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 64/137 (46%), Gaps = 10/137 (7%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V F+L + ++N + E K+ NAH + E P ++ S I H+C P
Sbjct: 142 NLVHFILQWPEINLKEIAENFSKL----ACNAHTICDCELRPLGTGLYPVVSIINHSCMP 197
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC--- 173
N +F V+ RA++ I G +SI Y D T+ R+ L+ +F C C RC
Sbjct: 198 NAVLTFEGKSSVV-RAVEHIPEGAEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRCIKV 256
Query: 174 RDPTELNTFYDGVKCPE 190
D + + +G +C +
Sbjct: 257 YDDIQESAILEGYRCKD 273
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RLDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + +F + + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKKQLKRQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + ++ R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQ-KDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNVSEERKRQLKKQYYFDCTCEHCQKKLKDDLFL-GVK 292
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 75/163 (46%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RLDVDTFLQYWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + +F + + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGVFPNLALVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + R+ L+ YF C CE C+ + + F GVK
Sbjct: 251 SYIDFLNISEERKKQLKRQYYFDCTCEHCQKGLKDDLFL-GVK 292
>gi|308804341|ref|XP_003079483.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116057938|emb|CAL54141.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 630
Score = 57.4 bits (137), Expect = 3e-06, Method: Composition-based stats.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 101 IAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHLSICYTDP--LWGTIN-R 156
+ +FD H+C PN + +D G+V L +++PIA G+ ++ICY P W R
Sbjct: 228 MVMFDAIGACNHSCDPNAEVSAISDQGEVTLYSLRPIAAGEEITICYGKPSLRWLPARCR 287
Query: 157 RHHLQTSKYFICQCERCR 174
R L +F CQCE+C+
Sbjct: 288 RRSLLKEWHFECQCEKCK 305
>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
Length = 492
Score = 57.4 bits (137), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 65/148 (43%), Gaps = 7/148 (4%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQF-LLNFFKLNEEFTQEEILKICGIIQVNAHE 92
W L +L AH++E + P + D+ V+ L + ++F+ I G + N+
Sbjct: 133 WKNLQELPAHLDELRQEPDW--DKHAVLALGALKYSMAEDKFSSNIATGIYGRVFTNSLT 190
Query: 93 M--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
+ EP I A H+C PN + D Q+ RA+KPIA + + I Y D
Sbjct: 191 LIGSAMEPLGICFDPLACSANHSCDPNAF-VIMDGAQLSFRALKPIAKDEEILISYIDGT 249
Query: 151 WGTINRRHHLQTSKYFICQCERC-RDPT 177
R+ L YF C C +C + PT
Sbjct: 250 LPFERRQTDLTRRYYFTCSCTKCAQGPT 277
>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
Length = 464
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 69/138 (50%), Gaps = 18/138 (13%)
Query: 60 VVQFLLNFFKLN--------EEFTQEEILKICGIIQVNAHEMPLTEPSYIAI--FDRASF 109
+V+ LL F +L+ E F + + ++++NAH + +E + + + F +ASF
Sbjct: 143 LVEDLLRFIRLSHYPQHEVSELFEDNFVASLIRMLEMNAHTIYDSELNTLGVGFFPKASF 202
Query: 110 IEHNCYPNLYKSFTD-----SGQ---VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
+ H+C PN FT SG+ + +R ++PI G+ + I Y D ++R L+
Sbjct: 203 MNHDCRPNCVALFTGGFHSVSGKPISIHIRCIRPIEAGEEIVISYLDVCLSWMDRLEWLK 262
Query: 162 TSKYFICQCERCRDPTEL 179
F C C RC++ + L
Sbjct: 263 EHYQFECCCSRCKEESSL 280
>gi|301771175|ref|XP_002921007.1| PREDICTED: SET and MYND domain-containing protein 2-like
[Ailuropoda melanoleuca]
Length = 385
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 53 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 106
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 107 ALHHFYSKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 166
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F CQC+ C
Sbjct: 167 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQEC 219
>gi|281342967|gb|EFB18551.1| hypothetical protein PANDA_009836 [Ailuropoda melanoleuca]
Length = 386
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 48 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 101
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 102 ALHHFYSKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 161
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F CQC+ C
Sbjct: 162 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQEC 214
>gi|157866065|ref|XP_001681739.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68125037|emb|CAJ03122.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 622
Score = 57.0 bits (136), Expect = 3e-06, Method: Composition-based stats.
Identities = 52/177 (29%), Positives = 78/177 (44%), Gaps = 28/177 (15%)
Query: 14 SYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEE 73
+Y CI L + K + +++ L AHV+E S + + + FK E
Sbjct: 315 AYHCITTLAGIKAKRAGHE---RVLTLTAHVDEVIQS------LHPIGPLMCDLFKGEER 365
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR-------ASFIEHNCYPNLYKSFTDSG 126
+ I G++ NA E L +PS + + ASF H+C PN D+
Sbjct: 366 --PATLYTIIGVLCCNALE--LADPSGLGVAQALHAGNSIASFFNHSCTPNCA---IDTV 418
Query: 127 QVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERC----RDPTE 178
+ + + I G+ LSI Y L W T RR L S YF+C+C+RC +DP E
Sbjct: 419 RHAIVTTRTIHVGEELSIAYIPQLYWPTRLRRERLSESYYFVCRCQRCESSNKDPFE 475
>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
Length = 463
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 65/143 (45%), Gaps = 9/143 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK-LNEEFTQEEILKICGIIQVNAHE 92
W+ +M LE+H E +S K +V+ L + E + +L + I +N+
Sbjct: 136 WNTIMDLESHRSEIMSSEK-----GDVIWMLSKGIQNFTNEMNETFVLDLFCKIMINSFS 190
Query: 93 MPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
+ + I AI S I H+CYPN F D V LR+++ I P +++ Y D
Sbjct: 191 LMTSSYDTIGTAIDPIISRINHSCYPNTALVF-DRNTVALRSLQKILPNQEITVSYIDIY 249
Query: 151 WGTINRRHHLQTSKYFICQCERC 173
NR L + YF C+C RC
Sbjct: 250 NTQKNRHDELLSRYYFSCKCTRC 272
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 70/158 (44%), Gaps = 16/158 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + + E R +V FL + ++ F+ + I I G+I N +
Sbjct: 132 LQNHVEHFGEEEQKEL-RVDVDTFLQYWPPQSQPFSIQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEAVKAMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
Y D L T RR L+ YF C CE C+ + + F
Sbjct: 251 SYIDFLNVTEERRKQLKKQYYFDCACEHCQKKLKDDLF 288
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 57.0 bits (136), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ S I H+C PN + F + ++++A++ IAPG+ + I YTD +RR +L
Sbjct: 200 ALYLNHSLINHSCKPNAFPVFNGT-NLVIKALEKIAPGEEIKIAYTDTKAVIQDRRDYLN 258
Query: 162 TSKYFICQCERCRDPTELN 180
F+CQC+ C + E++
Sbjct: 259 DIYRFVCQCQGCTNDDEVD 277
>gi|169867645|ref|XP_001840401.1| hypothetical protein CC1G_05287 [Coprinopsis cinerea okayama7#130]
gi|116498562|gb|EAU81457.1| hypothetical protein CC1G_05287 [Coprinopsis cinerea okayama7#130]
Length = 442
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 66/144 (45%), Gaps = 10/144 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
W K QL V+ +K PK D+R + + L L EE E G ++ N +M
Sbjct: 268 WKKAHQLV--VQAFKE-PKNAMDQRKLARILKK--PLPEEIDNEFFTYDPGFLK-NLGKM 321
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR----AMKPIAPGDHLSICYTDP 149
L ++ ++ S + H+C PN+ D L R A + I+PGD L++ Y +P
Sbjct: 322 SLNLEAHGGLYTLHSHLNHSCRPNISVRHNDKRTALSRISVIARRAISPGDELTVTYVNP 381
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
R+ LQ + C+CERC
Sbjct: 382 ELPYKTRQEQLQAWGFGSCRCERC 405
>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
dendrobatidis JAM81]
Length = 503
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 5/86 (5%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT 147
++++ P+ E +Y AS H+C+PN F D +V+++ ++ IA GD L I Y
Sbjct: 249 IDSNLFPVGEGTY----PLASLFNHDCWPNCIAIF-DGSRVVIQTIRDIAKGDELCISYI 303
Query: 148 DPLWGTINRRHHLQTSKYFICQCERC 173
DP+ +RR L+T F CQC C
Sbjct: 304 DPILDHDSRRMSLETKYCFNCQCSVC 329
>gi|402083183|gb|EJT78201.1| hypothetical protein GGTG_03303 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 419
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 104 FDRASFIEHNCYPNLYKSFTDSGQVLLRA--MKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
F AS I H+C PNL S TD G ++ RA + IAPG+ L+I Y D L R+ H +
Sbjct: 235 FPSASRINHDCRPNLV-SHTD-GNLVFRAYAARAIAPGEELTISYIDSLAPAAERQAHTR 292
Query: 162 TSKYFICQCERCRDPT 177
F+C CE CR T
Sbjct: 293 AVWGFVCGCEHCRLST 308
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 57/113 (50%), Gaps = 4/113 (3%)
Query: 78 EILKICGIIQVNAHEMPLTEPSYIAI--FDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
EI ++ G + N + E YI I + + S H+C PN F + ++ +RA++
Sbjct: 162 EIFELFGRMTCNTFSICDPEMQYIGIGIYPKMSLFNHSCEPNCVAVF-NGLRMEVRAIQN 220
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
I PG+ L I Y + L + R+ L YF C+C RC+D T+ + VKC
Sbjct: 221 IQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRCQDQTK-DGMMMAVKC 272
>gi|340513811|gb|EGR44092.1| predicted protein [Trichoderma reesei QM6a]
Length = 549
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 79/168 (47%), Gaps = 10/168 (5%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEE 73
Y +I +R KK P+ L +LE+ + ++ K + V++ L + E
Sbjct: 170 YRLLILMR---KKVLLPEQVQGLSRLESEMANFERRAKKTWPK--VLELALEAKERTKSE 224
Query: 74 FTQEEILKICGIIQVNAHEMPLT---EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
EIL + GII+ NA + T P IA+ + + H+C PN+ F ++ QV +
Sbjct: 225 LGIYEILILYGIIRCNALPVDQTYRNAPLGIALDLGGALLNHDCDPNVAIVFNNT-QVQV 283
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
RA++ + G+ L CY D + R + + F C CERCR ++
Sbjct: 284 RALRKLKAGEELLHCYRDIAYDFTFRNPRITSRYQFRCHCERCRQESD 331
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 10/162 (6%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP-- 94
L +LE H+ + + E + ++ FL + + +++ T ++I I G+I N +
Sbjct: 124 LDELEDHITDMQEDELKEL-KVDIHNFLDFWPRTSKQHTVDDISHIFGVINCNGFSVSDQ 182
Query: 95 -LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153
+ + +F + H+C+PN + G++ LR++ IA G+ L++ Y D + +
Sbjct: 183 RGLQAVGVGLFPNLCLVNHDCWPNC-TVILNHGKIELRSLGKIAEGEELTVAYVDYMNLS 241
Query: 154 INRRHHLQTSKYFICQCERC----RDPTELNTF-YDGVKCPE 190
R+ L+T +F C CE C +D +L DGVK E
Sbjct: 242 EERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREVDGVKPSE 283
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 56.6 bits (135), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 72/163 (44%), Gaps = 7/163 (4%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRN---VVQFLLNFFKLNEEFTQEEILKICGIIQV 88
++W Q + N K ++D + V +LN + L E + E L++ I
Sbjct: 109 RVWQLKSQNGWYDSLVSNVEKIDSDAKEDFVSVLMVLNEY-LGSEISPPEGLELFSKISC 167
Query: 89 NAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
N+ + E I IF A + H+C PN F + + ++A++ I G+ L+I Y
Sbjct: 168 NSFAICDGEMQAIGTGIFPNAVCLNHSCAPNSVAVFNGTN-IYIKALEEIPVGEELTISY 226
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L R+ LQT F CQC RC D ++ N + CP
Sbjct: 227 IQQLHPRETRQEELQTQFCFYCQCHRCLDASDNNKMLTSLICP 269
>gi|303278994|ref|XP_003058790.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459950|gb|EEH57245.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 771
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 27/72 (37%), Positives = 39/72 (54%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+F AS + H+C PN ++ D +VL+ A + + PG+ L+I Y D RR L+
Sbjct: 405 AVFPLASSLNHSCDPNCEVAYVDDARVLVVARRTLKPGEELTIAYVDVDADVGERRDELR 464
Query: 162 TSKYFICQCERC 173
F C CERC
Sbjct: 465 EVYGFECVCERC 476
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/172 (25%), Positives = 79/172 (45%), Gaps = 21/172 (12%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPL 95
+LE HV + + E + ++ FL + + +++ T ++I I G+I N +
Sbjct: 126 ELEDHVADMQEDEIKEL-KVDIHNFLDYWPRNSKQHTIDDISHIFGVINCNGFTVSDQRG 184
Query: 96 TEPSYIAIFDRASFIEHNCYPN------------LYKSFTDSGQVLLRAMKPIAPGDHLS 143
+ + +F + HNC+PN + F ++ LR++ IA G+ L+
Sbjct: 185 LQAVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEELT 244
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE-----LNTFYDGVKCPE 190
+ Y D L + RR L+T +F CQC+ C++ T+ DGVK E
Sbjct: 245 VAYVDFLNLSEERRRLLKTQYFFDCQCDYCKNGTKDDLKLAGREVDGVKPSE 296
>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
Length = 427
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 72/141 (51%), Gaps = 10/141 (7%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK-LNEEFTQEEILKICGIIQVNAHE 92
W +MQLE+H EE + E+ R +++ + + + + + + ++ II N+
Sbjct: 64 WKAIMQLESHFEE-----QNEDLRNDILAKVHGWGRRMGTDVDFMTVWELLCIIFTNSMN 118
Query: 93 MPLTEP--SYIAIFDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+ E S+ D S I H+C PN + F S QV +R++K IA G +++ Y DP
Sbjct: 119 VRPAEADVSFGTSLDLVVSMINHDCSPNAHVFFEGS-QVRVRSLKAIAAGGEITVSYCDP 177
Query: 150 LWGTINRRHHLQTSKYFICQC 170
+ R+ L+ +++F C+C
Sbjct: 178 RLDVLLRQEILRQTQFFHCEC 198
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 16/150 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HVE + K E R ++ FL + ++F + I I G+I N +
Sbjct: 127 LQNHVEHFGEEEKKEL-RLDLESFLEFWPHDGKQFAMQYISHILGVINCNGFTLSDQRGL 185
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + IF + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 186 QAVGVGIFPNLCLVNHDCWPNCTVIFNNGNHEAVRSMFHTQMRIELRALGKISIGEELTV 245
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCR 174
Y D L +RR L+ YF C CE C+
Sbjct: 246 SYVDFLNVCEDRRQQLKKQYYFDCTCEHCQ 275
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 56.2 bits (134), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
IF AS H+C PN +F + + +RA+KPIA G+ L + Y L + R+ L+
Sbjct: 189 GIFSSASLFNHSCDPNCVATF-NGRDISIRAIKPIAEGEELMLSYISILATSDVRQLELR 247
Query: 162 TSKYFICQCERCRDPTELNTFYDGVKCPE 190
S F C+C C ++ VKC +
Sbjct: 248 ESYMFTCKCTVCSRKEVNDSLMKSVKCSQ 276
>gi|237841865|ref|XP_002370230.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211967894|gb|EEB03090.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|221503107|gb|EEE28813.1| hypothetical protein TGVEG_013340 [Toxoplasma gondii VEG]
Length = 1648
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 36/109 (33%), Positives = 60/109 (55%), Gaps = 9/109 (8%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHL 142
G +++NAH LT A++ RA+ I+H+C PN+ Y++ G +++ A++ IA G +
Sbjct: 476 GKMKLNAH--ALTRGRGWALYPRAARIKHSCRPNVTYRNL--DGLLVVFALEDIAEGAPI 531
Query: 143 SICYTDPLW-GTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
++ Y D L+ T RR + +K CQC RC D + + CPE
Sbjct: 532 TMSYIDQLYMPTEERRKRVMATKRIFCQCMRCTDVCQKER---KILCPE 577
>gi|95007131|emb|CAJ20352.1| hypothetical protein TgIa.0990 [Toxoplasma gondii RH]
Length = 1737
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 36/109 (33%), Positives = 60/109 (55%), Gaps = 9/109 (8%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHL 142
G +++NAH LT A++ RA+ I+H+C PN+ Y++ G +++ A++ IA G +
Sbjct: 565 GKMKLNAH--ALTRGRGWALYPRAARIKHSCRPNVTYRNL--DGLLVVFALEDIAEGAPI 620
Query: 143 SICYTDPLW-GTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
++ Y D L+ T RR + +K CQC RC D + + CPE
Sbjct: 621 TMSYIDQLYMPTEERRKRVMATKRIFCQCMRCTDVCQKER---KILCPE 666
>gi|221482699|gb|EEE21037.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 1648
Score = 56.2 bits (134), Expect = 6e-06, Method: Composition-based stats.
Identities = 36/109 (33%), Positives = 60/109 (55%), Gaps = 9/109 (8%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHL 142
G +++NAH LT A++ RA+ I+H+C PN+ Y++ G +++ A++ IA G +
Sbjct: 476 GKMKLNAH--ALTRGRGWALYPRAARIKHSCRPNVTYRNL--DGLLVVFALEDIAEGAPI 531
Query: 143 SICYTDPLW-GTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
++ Y D L+ T RR + +K CQC RC D + + CPE
Sbjct: 532 TMSYIDQLYMPTEERRKRVMATKRIFCQCMRCTDVCQKER---KILCPE 577
>gi|303280161|ref|XP_003059373.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459209|gb|EEH56505.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 740
Score = 56.2 bits (134), Expect = 7e-06, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ------VLLRAMKPIAPGDHLSICYTDPLWGTIN 155
A++ + S H+C PN F G V ++A++PI GD ++CY D G
Sbjct: 335 AVYAKLSRFNHSCAPNCTWHFAPPGSADSPVSVTVKAIRPIKKGDEATVCYLDASGGRDA 394
Query: 156 RRHHLQTSKYFICQCERCR 174
RR L F C+C+RC+
Sbjct: 395 RRRRLWAEYRFACECDRCQ 413
>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
heterostrophus C5]
Length = 508
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRR---NVVQFLLNFFKLNEEFTQEEILKICGIIQVNA 90
W+ L QL +H++++K + Y+N QF L +++F + + G + NA
Sbjct: 134 WELLCQLPSHIDDFKRNATYQNIEMMAMGAAQFSLPQNVFDKDF----VAAMYGRVMSNA 189
Query: 91 HEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
+ P +P I + + H+C PN Y D V +R ++PI + I Y D
Sbjct: 190 LTLITPTLDPLGIIVDPTLCQMNHSCDPNAY-IMMDGPLVSIRTLRPIRKDKEIFISYID 248
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRD 175
R+ LQ +F C+C +C+D
Sbjct: 249 TTHPYQKRQDELQARWFFTCRCAKCQD 275
>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 481
Score = 56.2 bits (134), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+++ A+ H+C+P++ + F D + LRA++P+ PGD L++ Y T R+ LQ
Sbjct: 338 ALYNCAAMFNHSCFPSVVRQF-DGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQ 396
Query: 162 TSKYFICQCERC 173
+F C+C+RC
Sbjct: 397 QHYHFACRCQRC 408
>gi|410986126|ref|XP_003999363.1| PREDICTED: N-lysine methyltransferase SMYD2 [Felis catus]
Length = 386
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 54 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDVA 107
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 108 ALHHFYSKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 167
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F CQC+ C
Sbjct: 168 VTYKGT-LAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCQCQEC 220
>gi|358388868|gb|EHK26461.1| hypothetical protein TRIVIDRAFT_35412 [Trichoderma virens Gv29-8]
Length = 535
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 77/176 (43%), Gaps = 28/176 (15%)
Query: 11 PHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFK 69
P P+ + AL C +I D L LE HV E K E D R++ +
Sbjct: 105 PTPTRALMQALLC-------GEIGDGLKDLEGHVLEKKA----EGDEWRDIEMMAMAACA 153
Query: 70 LNEEFTQEEILK-----ICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSF 122
+ + T EE+++ +C I Q N+ + + + +F + H+C PN F
Sbjct: 154 FSGKGTAEELVRRAAEMLCKI-QNNSFQRFDPDLGVVGLFLEPTLAMANHSCIPNAAVQF 212
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYT---DPLWGTINRRHHLQTSKYFICQCERCRD 175
LL A PI GD + + YT DPL + RR L K F+CQC RCRD
Sbjct: 213 IGR-NTLLVAESPIRAGDEIELAYTFYTDPL---LKRREALAHYK-FVCQCLRCRD 263
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 75/166 (45%), Gaps = 10/166 (6%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLN--EEFTQEE 78
+R + + +P+I + + LE + ++ S K+ D + F L +E Q+
Sbjct: 105 IRAVVQALVKPEIGNAIEDLEGNAASWRKSEKWA-DMEMMAMGASAFAGLGTGQEDVQKA 163
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPI 136
+ +C I Q NA T+ + IF + + H+C PN F + +LRA KPI
Sbjct: 164 LTILCKI-QTNAFHRYDTDLGQVGIFLGTKLAMANHSCIPNAMVQFIGR-RAILRAEKPI 221
Query: 137 APGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
D + I YTD + NR+ L +F C C RC +LN +
Sbjct: 222 KIDDEIEISYTDYTFPLSNRKLAL-APYFFDCMCLRCE--KDLNVY 264
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 5/121 (4%)
Query: 70 LNEEFTQE--EILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDS 125
LN + ++ EI K I N H + E P I +F S I H+C N F D
Sbjct: 170 LNARYAEDVKEITKDICRISCNGHTICDDELRPVGIGLFPVVSIINHSCSSNSLLLF-DG 228
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
++RA+ I+ G +++ Y + T +RR L YF C+C RC D +E + D
Sbjct: 229 KHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTDDSEAGLYKDD 288
Query: 186 V 186
V
Sbjct: 289 V 289
>gi|119613764|gb|EAW93358.1| SET and MYND domain containing 2, isoform CRA_d [Homo sapiens]
Length = 230
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 17 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 70
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 71 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 130
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 131 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 183
>gi|431915909|gb|ELK16163.1| SET and MYND domain-containing protein 2, partial [Pteropus alecto]
Length = 390
Score = 55.8 bits (133), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLMQL---EAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ + P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 48 NPSETVRLTARILAKQKNHPETTPSEKLLAVKDFESHLD------KLDNEKKDLIQSDIS 101
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 102 ALHHFYSKHLEFPDNKSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 161
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 162 VTYKGT-LAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 214
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 74/163 (45%), Gaps = 17/163 (10%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP---LT 96
L+ HV+ + + E R +V FL + +++F+ + I I G+I N +
Sbjct: 132 LQNHVDSFGEEEQKEL-RLDVESFLHFWPPQSQQFSMQYISHIFGVINCNGFTLSDQRGL 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSG------------QVLLRAMKPIAPGDHLSI 144
+ + +F + H+C+PN F + ++ LRA+ I+ G+ L++
Sbjct: 191 QAVGVGVFPNLCLVNHDCWPNCTVIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTV 250
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
Y D L + +R+ L+ YF C CE C + + F GVK
Sbjct: 251 SYIDFLNVSADRKKLLKKQYYFDCTCEHCEKGIKDDLFL-GVK 292
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R+++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKRDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|299472634|emb|CBN78286.1| set domain protein [Ectocarpus siliculosus]
Length = 445
Score = 55.8 bits (133), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQV--LLRAMKPIAPGDHLSICYTDP---LWGTINR 156
++++R S H+C N + D G V ++P+A G+ L++ Y + GT+ R
Sbjct: 199 SLYERGSRFNHSCTANCTRVTLDGGGVERAFMTLRPVAAGEELTLSYLPSGIEVMGTVVR 258
Query: 157 RHHLQTSKYFICQCERCRDP 176
R HL S+ F+C C+RC P
Sbjct: 259 RRHLWLSRGFLCSCDRCSRP 278
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 71/146 (48%), Gaps = 13/146 (8%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKL----NEEFTQEEILKICGIIQVNAHE 92
L +LE H+ + E+D + + + NF +++ T ++I I G+I N
Sbjct: 124 LDELEDHIADMP-----EDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFS 178
Query: 93 MP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+ + + +F + H+C+PN + G++ LRA+ IA G+ L++ Y D
Sbjct: 179 VSDQRGLQAVGVGLFPNLCLVNHDCWPNC-TVILNHGKIELRALGKIAEGEELTVAYVDF 237
Query: 150 LWGTINRRHHLQTSKYFICQCERCRD 175
L + R+ L+T +F C CE C++
Sbjct: 238 LNLSEERQRLLKTQYFFDCTCEHCKN 263
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 67/148 (45%), Gaps = 6/148 (4%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS 99
L+ H+ + P+ + +++ L +F E+ + + I N+ + E +
Sbjct: 144 LQLHMPGEPSDPQQQRGLFAILEHLRHFLPDAEKHLLKSAYPVLRITSANSFGISGVEGN 203
Query: 100 --YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
+ ++D S+I H+C PN +F+ +R++ + P L+I Y DP RR
Sbjct: 204 NLGVGLYDTVSYINHSCAPNCSITFSGV-YARVRSVHDLPPNQELTIAYIDPCDPRAKRR 262
Query: 158 HHLQTSKYFICQCERC---RDPTELNTF 182
HL++ F C+C RC RD L T
Sbjct: 263 AHLKSQFMFDCECSRCERERDDDPLLTL 290
>gi|331241122|ref|XP_003333210.1| hypothetical protein PGTG_14130 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312200|gb|EFP88791.1| hypothetical protein PGTG_14130 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 284
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 18/135 (13%)
Query: 51 PKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFI 110
+ + RN +F+ N +N ++ ++ G + VNA S
Sbjct: 62 ARLHGEGRNEDEFISNLIDINTFTSKLSTFELLGAVVVNA-----------------SRF 104
Query: 111 EHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQ 169
H+C PN+ + Q+L +RA KPIA G+ L+I Y RR ++ F C
Sbjct: 105 NHDCRPNMVYNLDSRTQILRMRAFKPIAKGEELTISYRSLEMNGKERRESIKREYGFDCA 164
Query: 170 CERCRDPTELNTFYD 184
C CR +EL D
Sbjct: 165 CSHCRMSSELQEQSD 179
>gi|440800401|gb|ELR21440.1| hypothetical protein ACA1_183760 [Acanthamoeba castellanii str.
Neff]
Length = 454
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 73/164 (44%), Gaps = 27/164 (16%)
Query: 33 IWDKLMQLEAHVEEYKN-----SPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ 87
++D++M ++AH E + +P E+ +L + +E +L I I
Sbjct: 79 VFDRMMHMQAHAESQIDLLQTYAPAVED--------MLRVLPAEMKVAKEAMLIIVCICS 130
Query: 88 VN------------AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
VN + E + +F ++ + H+C N T ++ + A++P
Sbjct: 131 VNQFAVGRPPAENESGEGQSKDGEGSGLFLTSAIVSHSCRENAMHR-TKGRRLAMLALEP 189
Query: 136 IAPGDHLSICY-TDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
IA G ++ Y +PL T R+ L +K F+CQCERC D TE
Sbjct: 190 IAAGSEITWSYLGNPLLPTQIRQRELYQTKVFLCQCERCLDLTE 233
>gi|291402401|ref|XP_002717452.1| PREDICTED: SET and MYND domain containing 2, partial [Oryctolagus
cuniculus]
Length = 394
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 88/172 (51%), Gaps = 15/172 (8%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLMQLEAHVEEYKNS-PKYENDRRNVVQ----FL 64
+PS + R L K+ P+ +KL+ VEE+++ K +N++++++Q L
Sbjct: 62 NPSETVRLTARILAKQKSHPERTPSEKLLA----VEEFESHLDKLDNEKKDLIQSDISAL 117
Query: 65 LNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKS 121
+F+ + EF + L + + N + E S++ AIF + + H+C PN+ +
Sbjct: 118 HHFYSKHLEFPDHDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVT 177
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ + +RA++ I+PG+ + Y D L+ T +R L+ S +F C+C C
Sbjct: 178 YKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNERLRDSYFFTCECLEC 228
>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
Length = 419
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 22/68 (32%), Positives = 42/68 (61%), Gaps = 1/68 (1%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
+ I H+C PN + + + Q+ +R++KPIA G+ +++ Y DP NR+ L+ + +F
Sbjct: 123 AMINHSCAPNAFVTL-EGCQLRVRSLKPIAAGEEITVSYADPTLPVFNRQKFLKETYFFD 181
Query: 168 CQCERCRD 175
C+C+ C +
Sbjct: 182 CRCKGCEN 189
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 70/168 (41%), Gaps = 25/168 (14%)
Query: 28 SHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ 87
SH +I +K M L A + N+V +L F ++ E K
Sbjct: 145 SHMSEIDEKQMLLYAQMA-------------NLVNLILQFPSVDLREIAENFSKF----S 187
Query: 88 VNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC 145
NAH + +E P I +F S I H+C PN F + V+ RAM I+ ++I
Sbjct: 188 CNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVV-RAMDNISKDSEITIS 246
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRD-----PTELNTFYDGVKC 188
Y + T+ R+ L+ F CQC RC + E + +G +C
Sbjct: 247 YIETAGSTLTRQKSLKEQYLFHCQCARCSNVGKPHDIEESAILEGYRC 294
>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
ND90Pr]
Length = 529
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 10/147 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRR---NVVQFLLNFFKLNEEFTQEEILKICGIIQVNA 90
W+ L QL +H++++K + Y+N QF L +++F + + G + NA
Sbjct: 155 WELLCQLPSHIDDFKRNGTYQNIEMMAMGAAQFALPQNIFDKDF----VAAMYGRVMSNA 210
Query: 91 HEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
+ P +P I + + H+C PN Y D + +R ++PI + I Y D
Sbjct: 211 LTLITPTLDPLGIIVDPTLCQMNHSCDPNAY-IMMDGPLISIRTLRPIRKDKEIFISYID 269
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRD 175
R+ LQ +F C+C +C+D
Sbjct: 270 TTHPYQKRQDELQARWFFTCRCAKCQD 296
>gi|440901481|gb|ELR52415.1| SET and MYND domain-containing protein 2, partial [Bos grunniens
mutus]
Length = 383
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R+++Q
Sbjct: 50 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKRDLIQSDIA 103
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 104 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 163
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 164 VTYKGT-LAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 216
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 70/168 (41%), Gaps = 25/168 (14%)
Query: 28 SHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ 87
SH +I +K M L A + N+V +L F ++ E K
Sbjct: 145 SHMSEIDEKQMLLYAQMA-------------NLVNLILQFPSVDLREIAENFSKF----S 187
Query: 88 VNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC 145
NAH + +E P I +F S I H+C PN F + V+ RAM I+ ++I
Sbjct: 188 CNAHSICDSELRPQGIGLFPLVSIINHSCSPNAVLVFEEQMAVV-RAMDNISKDSEITIS 246
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRD-----PTELNTFYDGVKC 188
Y + T+ R+ L+ F CQC RC + E + +G +C
Sbjct: 247 YIETAGSTLTRQKSLKEQYLFHCQCARCSNFGKPHDIEESAILEGYRC 294
>gi|426240539|ref|XP_004014156.1| PREDICTED: N-lysine methyltransferase SMYD2 [Ovis aries]
Length = 395
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R+++Q
Sbjct: 63 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKRDLIQSDIA 116
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 117 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 176
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 177 VTYKGT-LAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 229
>gi|342874438|gb|EGU76450.1| hypothetical protein FOXB_13043 [Fusarium oxysporum Fo5176]
Length = 896
Score = 55.5 bits (132), Expect = 1e-05, Method: Composition-based stats.
Identities = 48/167 (28%), Positives = 74/167 (44%), Gaps = 12/167 (7%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
+R + + +P I L LE +VE ++ S K+ D + F + QEE+
Sbjct: 471 VRAVIQALVKPGIGAALENLEGNVESWRKSDKWA-DMEMMAMGATAF--TGQGTGQEELQ 527
Query: 81 KICGI---IQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
K + IQ NA + + IF + + H+C PN F + +LRA KP
Sbjct: 528 KTLALLCKIQTNAFHRYDADLGQVGIFLEPKLAMANHSCIPNAMVQFVGR-KAILRAEKP 586
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
I + + I YTD + R+H L +F CQC RC +LN +
Sbjct: 587 IKVDEEIEISYTDYTFPRSKRKHAL-APYFFDCQCPRCE--KDLNVY 630
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/171 (25%), Positives = 71/171 (41%), Gaps = 7/171 (4%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
+R L + +I D + L+ H EE + + + + + + E ++ +
Sbjct: 105 VRALLQTLLIKEIEDGVAALDGHTEERRKTKSWPDLEMMALAACAFAGRQGESNVRKAVE 164
Query: 81 KICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
+C I Q NA + + +F + H+C PN F + +LRA +PI
Sbjct: 165 LLCKI-QTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGR-KAILRAERPIQA 222
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
GD + I YTD RR L+ F C C RC+ +LN + CP
Sbjct: 223 GDEIEISYTDYTMPLSARREALEQYS-FECTCARCKG--DLNVYQVRTACP 270
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 58/132 (43%), Gaps = 8/132 (6%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYP 116
N+V +L +L+ +EI NAH + P P ++ S I H+C P
Sbjct: 167 NLVSLILPLIELD----LKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVP 222
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
N F D +RA++PI + +SI Y + +RH+ +F C C RC
Sbjct: 223 NAVLIF-DGRTAYVRALQPIGKNEEVSISYIETA-AVTKKRHNDLKQYFFTCSCPRCVKG 280
Query: 177 TELNTFYDGVKC 188
+E + +G +C
Sbjct: 281 SEEDALLEGFRC 292
>gi|255947380|ref|XP_002564457.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591474|emb|CAP97706.1| Pc22g04180 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 351
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 41/74 (55%), Gaps = 2/74 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT--INRRHH 159
A++ A+ H+C PN+ D G+++ A + I PG+ I Y D T +RR H
Sbjct: 259 AVYPTAAIANHSCLPNVIHKADDKGRMVFTASRDIFPGEECCISYFDLTQYTDLASRREH 318
Query: 160 LQTSKYFICQCERC 173
L+ S F+CQCERC
Sbjct: 319 LRKSFRFVCQCERC 332
>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
Length = 433
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 78/171 (45%), Gaps = 12/171 (7%)
Query: 25 YKKSHQP---QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILK 81
YK + P ++W LM +H + K+ K + + L + K +++
Sbjct: 105 YKAFYTPTSFRVWKDLM---SHYSDLKSDKKRMDHFSTLSMVLFEYLKDISLPNTADLMG 161
Query: 82 ICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPG 139
+ G + +N+ + E + I I+ +S I+H+C PN F D + +RA+K +
Sbjct: 162 LYGRMVINSFTILDIEMNSIGTGIYLASSVIDHSCNPNAVAVF-DGKTINIRALKDMNCL 220
Query: 140 D--HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
D + I Y D + R+ L+ S YF+CQC+RC D + + KC
Sbjct: 221 DWKKIRISYIDLMKTPYERQMELRQSYYFLCQCDRCLDENRIK-YVHAAKC 270
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 55/114 (48%), Gaps = 8/114 (7%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
V LL F + E F + C + EM + + ++ S + H+C PN
Sbjct: 162 VSQLLPAFDIFEAFAK----VFCNAFTICNAEM---QGVGVGLYPSMSLLNHSCDPNCSI 214
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
F + +LLRA++ I G+ L++CY D L + RR HL++ F C C RC+
Sbjct: 215 VF-NGPHLLLRAVRDIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRCQ 267
>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 496
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 5/85 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
++ +FI H+C PNL T G++L L+A++PI G+ +I Y D G I R L
Sbjct: 243 GLYPGLTFINHSCDPNL--QVTQIGKILTLKAVRPIKKGEEFTISYIDRTEGAIQRNEEL 300
Query: 161 QTSKYFICQCERCRD--PTELNTFY 183
+ +F CQC +C + N FY
Sbjct: 301 METFFFECQCHKCASVRSEKRNDFY 325
>gi|393227148|gb|EJD34840.1| hypothetical protein AURDEDRAFT_75805 [Auricularia delicata
TFB-10046 SS5]
Length = 123
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 5/94 (5%)
Query: 84 GIIQVNAHEMP----LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPG 139
GI++ N E L E ++IA+ S H+C PN KS Q A + IAPG
Sbjct: 28 GILRTNGFEATFDGILGERAFIAVMQIMSRANHSCKPNT-KSCRQKYQATYTASRDIAPG 86
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +++ Y D RR L+T +F C CE C
Sbjct: 87 EEITVTYIDETRPKAERRKELKTKYFFTCTCELC 120
>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
Length = 384
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 52 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 105
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 106 ALHHFYSKHLEFLDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 165
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 166 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 218
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEF-TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + ++ + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLIVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
Length = 554
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 41/84 (48%), Gaps = 2/84 (2%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
A F H C PN +GQ+ R ++ + + L + Y D RR L T+K+F
Sbjct: 262 ALFFNHGCNPNTAFVGLPNGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQELLTTKHF 321
Query: 167 ICQCERCRDPTE--LNTFYDGVKC 188
C+C+RC P E ++ F GV C
Sbjct: 322 WCKCKRCASPLEKSIDRFLQGVVC 345
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/170 (25%), Positives = 85/170 (50%), Gaps = 11/170 (6%)
Query: 12 HPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNS-PKYENDRRNVVQ----FLLN 66
+PS + R L K+ P+ M L V+E+++ K +N++++++Q L
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEMLLA--VKEFESHLDKLDNEKKDLIQSDIAMLHQ 158
Query: 67 FFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFT 123
F+ + EF + L + + N + E S++ AIF + + H+C PN+ ++
Sbjct: 159 FYSKHLEFPDHDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYK 218
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +RA++ + PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 219 GT-LAEVRAVQEVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|358391549|gb|EHK40953.1| hypothetical protein TRIATDRAFT_29913 [Trichoderma atroviride IMI
206040]
Length = 554
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 58/121 (47%), Gaps = 6/121 (4%)
Query: 69 KLNEEFTQEEILKICGIIQVNAHEMPLT---EPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
++ E ++LK+ +I+ NA + T P AI A+ + HNC PN+ F +S
Sbjct: 222 RMKSELDVADVLKLYCLIRCNAVPVDQTFRNSPLGSAIDLGAAMLNHNCEPNIVIVF-NS 280
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
+V RA++ I G+ L CY D + R + F CQC+RC E N Y+G
Sbjct: 281 TRVEARAVRSIKAGEELQHCYRDIAYDCTFRSPRIAARYQFKCQCDRC--IRETNRHYEG 338
Query: 186 V 186
Sbjct: 339 A 339
>gi|332811913|ref|XP_003308794.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan troglodytes]
Length = 433
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEF-TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + ++ + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVVFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|345329543|ref|XP_001510161.2| PREDICTED: N-lysine methyltransferase SMYD2-like [Ornithorhynchus
anatinus]
Length = 412
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 85/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 80 NPSETVRLTARILAKQKAHPERTPSEKLLAVKEFESHLD------KLDNEKRELIQSDIA 133
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 134 ALHHFYSKHIEYPDNASLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 193
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 194 VTYKGT-LAEVRAVQEINPGDEIFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 246
>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
Length = 433
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSRHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 85/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKMHPERTPSEKLLAVREFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ + EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHQFYSKHLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 90/175 (51%), Gaps = 21/175 (12%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLT--EPSYI--AIFDRASFIEHNCYPNL 118
L +F+ + EF + L I QVN + + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSL-IVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNV 213
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 214 IVTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|119613761|gb|EAW93355.1| SET and MYND domain containing 2, isoform CRA_a [Homo sapiens]
Length = 349
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 17 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 70
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 71 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 130
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 131 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 183
>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|119613763|gb|EAW93357.1| SET and MYND domain containing 2, isoform CRA_c [Homo sapiens]
Length = 391
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 58 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 111
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 112 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 171
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 172 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 224
>gi|426333764|ref|XP_004028440.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gorilla gorilla
gorilla]
Length = 384
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 52 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 105
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 106 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 165
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 166 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 218
>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
mulatta]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
Length = 485
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 38/71 (53%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
IF SFI H+C PN+ D ++ L A K I G+ L Y +PL G RR L+
Sbjct: 333 IFPVYSFINHDCEPNVRYEIDDKLRLKLYARKHIKKGEELLTTYVNPLHGVKLRRRELRV 392
Query: 163 SKYFICQCERC 173
+ F+C C+RC
Sbjct: 393 NWGFLCHCDRC 403
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 85/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKMHPERTPSEKLLAVREFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ + EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHQFYSKHLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|355745920|gb|EHH50545.1| hypothetical protein EGM_01398, partial [Macaca fascicularis]
Length = 380
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/175 (26%), Positives = 90/175 (51%), Gaps = 21/175 (12%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 48 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 101
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLT--EPSYI--AIFDRASFIEHNCYPNL 118
L +F+ + EF + L I QVN + + E S++ AIF + + H+C PN+
Sbjct: 102 ALHHFYSKHLEFPDNDSL-IVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNV 160
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 161 IVTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 214
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
Length = 410
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 108 SFIEHNCYPNL---YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSK 164
S I HNCYPN+ Y + ++ +RA++ I G+ L Y +P W R+ +L S
Sbjct: 323 SHINHNCYPNVSIDYPNPKSQYKIAVRAVRDIKQGEQLFETYVNPRWNKETRQTYLDKSY 382
Query: 165 YFICQCERCRDPTELN 180
F C+CERC EL
Sbjct: 383 LFNCKCERCEKDLELT 398
>gi|355558772|gb|EHH15552.1| hypothetical protein EGK_01662, partial [Macaca mulatta]
Length = 380
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 88/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 48 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 101
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 102 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 161
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 162 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 214
>gi|354465172|ref|XP_003495054.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cricetulus
griseus]
Length = 404
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 85/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 72 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 125
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ + EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 126 ALHQFYSRHLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 185
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 186 VTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 238
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/169 (24%), Positives = 77/169 (45%), Gaps = 10/169 (5%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-----KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
PS +A R L++ I + QL+ HV + + ++ R +V +FL +
Sbjct: 100 PSENIRLAARVLWRMHKDTGIASDSQLISVDQLQEHVADLA-AEDFKQLRADVHKFLQYW 158
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
+ + + I I GII+ N + + + +F + H+C+PN +
Sbjct: 159 SYGTRQHSVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNC-TVILN 217
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
G++ LRA+ I G+ L++ Y D L + +R+ L+ +F C CE C
Sbjct: 218 HGKIELRALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHC 266
>gi|71834584|ref|NP_001025394.1| N-lysine methyltransferase SMYD2-B isoform 1 [Danio rerio]
Length = 437
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 74/159 (46%), Gaps = 6/159 (3%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF-TQ 76
IIA K+ +I L ++EAH+E+ N + + + L F+ + +F
Sbjct: 113 IIARLKAQKERSPSEILLLLGEMEAHLEDMDNEKREMTEAH--IAGLHQFYSKHLDFPDH 170
Query: 77 EEILKICGIIQVNAHEMPLTEPS--YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+ +L + + N + E S +AIF + + H+C PN+ ++ +RA+K
Sbjct: 171 QALLTLFSQVHCNGFTVEDEELSNLGLAIFPDIALLNHSCSPNVIVTYRGI-NAEVRAVK 229
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I+PG + Y D L+ T +R L+ YF C C+ C
Sbjct: 230 DISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKEC 268
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 7/118 (5%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAM 133
+ E+L I G I +N+ + ++ + + I+ S ++H+C PN +F + + ++A+
Sbjct: 152 STAELLGIYGRITINSFSIFNSDMNIGVGIYLGPSILDHSCKPNAVATF-EGTTITVKAI 210
Query: 134 KPIAPGD--HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ + D + I Y D + T +RR LQ+S YF C CERC P + + CP
Sbjct: 211 EDLPSLDLSQIRIPYIDVIKTTRDRRAELQSSYYFWCDCERCEKPEPMA---EAAACP 265
>gi|325530307|sp|Q5RGL7.2|SMY2B_DANRE RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
Length = 434
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 74/159 (46%), Gaps = 6/159 (3%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF-TQ 76
IIA K+ +I L ++EAH+E+ N + + + L F+ + +F
Sbjct: 113 IIARLKAQKERSPSEILLLLGEMEAHLEDMDNEKREMTEAH--IAGLHQFYSKHLDFPDH 170
Query: 77 EEILKICGIIQVNAHEMPLTEPS--YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+ +L + + N + E S +AIF + + H+C PN+ ++ +RA+K
Sbjct: 171 QALLTLFSQVHCNGFTVEDEELSNLGLAIFPDIALLNHSCSPNVIVTYRGI-NAEVRAVK 229
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I+PG + Y D L+ T +R L+ YF C C+ C
Sbjct: 230 DISPGQEIYTSYIDLLYPTADRLERLRDMYYFSCDCKEC 268
>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
Length = 433
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R+++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKRDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ IF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDSDSLVVLFAQVNCNGFTIEDEELSHLGSXIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|299116928|emb|CBN75038.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 457
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 71/159 (44%), Gaps = 15/159 (9%)
Query: 17 CIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ 76
C I +Y+K P D L L A N P+ ++ + + F + T
Sbjct: 100 CFIQCMLMYRK--DPLSLDPLAPLTA-----ANLPRCLESVVSIRELIPGLFP--DGMTN 150
Query: 77 EEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKP 135
+ +I G + N+HE+ E S +F A +EH+C PN SFT G L + A++
Sbjct: 151 DRAARILGTLNTNSHELEQVEGS--GLFLMACIMEHDCTPNC--SFTTFGDKLWVTAIRQ 206
Query: 136 IAPGDHLSICYTDPLW-GTINRRHHLQTSKYFICQCERC 173
+ G+ LSI Y + + T R+ L+ F+C C C
Sbjct: 207 VGEGERLSIDYGNNFYLPTAERKAELEDIYGFVCTCRAC 245
>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 628
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 51/94 (54%), Gaps = 7/94 (7%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHH-- 159
AI+ AS + H+C PN+ SF D +++RAMK I G+ +S CY P + ++R H
Sbjct: 347 AIYPSASMMNHSCDPNIITSFVDQ-YLIVRAMKDIQAGEEVSNCY-GPNFRRMSREHRQE 404
Query: 160 -LQTSKYFICQCERCRDPTELNTF--YDGVKCPE 190
L+ F C+C+ C P N + +KCPE
Sbjct: 405 ILKNQYGFECRCDPCVMPEYENFMERFHAIKCPE 438
>gi|68531945|ref|XP_723657.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23478021|gb|EAA15222.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 1088
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN-RRHH 159
I +F A F+ H+C+PN F D+ ++ AM+ I D +SI + + L+ +I R+
Sbjct: 536 ILLFKTAYFLNHSCFPNSSYCFMDNNKICFIAMRKINMYDEISISFINELYASIEYRKKK 595
Query: 160 LQTSKYFICQCERC 173
L K IC C RC
Sbjct: 596 LNDIKNIICSCNRC 609
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 15/127 (11%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG- 126
+++F+ + I I G+I N + + + IF + H+C+PN F +
Sbjct: 162 SQQFSMQYISNIFGVINCNGFTLSDQRGLQAVGVGIFPNLGLVNHDCWPNCTVIFNNGNH 221
Query: 127 -----------QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
++ LRA+ IA G+ L++ Y D L + R+ L+ YF C CE C+
Sbjct: 222 EAVKSMFHTQMRIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK 281
Query: 176 PTELNTF 182
+ + F
Sbjct: 282 GLKDDLF 288
>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
boliviensis]
Length = 433
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 89/174 (51%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ ++L+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSERLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I+PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|126307073|ref|XP_001374886.1| PREDICTED: n-lysine methyltransferase SMYD2 [Monodelphis domestica]
Length = 386
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 86/174 (49%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKS--HQPQIWDKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ H+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 54 NPSETVRLTARILAKQKTHHERTSSEKLLAVKEFESHLD------KLDNEKRELIQSDIA 107
Query: 63 FLLNFFKLNEEFTQE-EILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ ++ + + N + E S++ AIF + + H+C PN+
Sbjct: 108 ALHHFYSKHLEYPDNCSLVTLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 167
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 168 VTYKGT-LAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTCECREC 220
>gi|149708613|ref|XP_001488507.1| PREDICTED: n-lysine methyltransferase SMYD2 [Equus caballus]
Length = 384
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 74/142 (52%), Gaps = 14/142 (9%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQ----FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEM 93
+ E+H+E K +N++++++Q L +F+ + EF + L + + N +
Sbjct: 84 EFESHLE------KLDNEKKDLIQSDIAALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTI 137
Query: 94 PLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW 151
E S++ AIF + + H+C PN+ ++ + +RA++ I PG+ + Y D L+
Sbjct: 138 EDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGT-LAEVRAVQEINPGEEVFTSYIDLLY 196
Query: 152 GTINRRHHLQTSKYFICQCERC 173
T +R L+ S +F C+C+ C
Sbjct: 197 PTEDRNDRLRDSYFFTCECQEC 218
>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
Length = 241
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 21 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIQAGEELTICYLDMLMTSAERREQL 79
Query: 161 QTSKYFICQCERC 173
+ F C C RC
Sbjct: 80 RNQYCFDCDCARC 92
>gi|395531367|ref|XP_003767751.1| PREDICTED: N-lysine methyltransferase SMYD2 [Sarcophilus harrisii]
Length = 402
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 85/173 (49%), Gaps = 19/173 (10%)
Query: 13 PSYECIIALRCLYKKS--HQPQIWDKLM---QLEAHVEEYKNSPKYENDRRNVVQ----F 63
PS + R L K+ H+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 71 PSETVRLTARILAKQKTHHERTSSEKLLAVKEFESHLD------KLDNEKRELIQSDISA 124
Query: 64 LLNFFKLNEEF-TQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYK 120
L +F+ + E+ ++ + + N + E S++ AIF + + H+C PN+
Sbjct: 125 LHHFYSKHIEYPDNASLVTLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIV 184
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 185 TYKGT-LAEVRAVQEINPGDEVFTSYIDLLYPTEDRNDRLKDSYFFTCECREC 236
>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
AFUA_2G10080) [Aspergillus nidulans FGSC A4]
Length = 497
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 67/140 (47%), Gaps = 5/140 (3%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS 99
LE H+++ N + +R + + + + +E+I+ + +N+ + +
Sbjct: 150 LETHIDDILNRNAPQAERIALTSRAVKEYS-KTDMEEEKIVAYHARLDLNSFNLTNDDDI 208
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHH 159
I + A+ I H+C N F D ++ ++A++PIA G+ + I Y D + T R+
Sbjct: 209 GIYLHPYAALINHSCDYNAVVGF-DGSEIFVKAIRPIATGEQIFISYIDTTYPTRIRQKE 267
Query: 160 LQTSKYFICQCERC---RDP 176
LQ +F C C +C +DP
Sbjct: 268 LQERYFFTCNCAKCLCAKDP 287
>gi|443720661|gb|ELU10312.1| hypothetical protein CAPTEDRAFT_158133 [Capitella teleta]
Length = 441
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 77 EEILKICGIIQVNAHEM--PLTEPSYIAIFDR----ASFIEHNCYPNLYKSFTDSGQVLL 130
EE+ I G + VN+ + P P + ++ R S ++H+C PN F + + +
Sbjct: 151 EELFVIFGRVCVNSFSICDPEMNPIGVGVYIRQIKPCSVLDHSCRPNAVAVF-EGTTLRI 209
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
R ++P+ L I Y D L T RR +LQ YF C C C+D
Sbjct: 210 RCVEPVDSEQDLRISYIDTLDDTTTRRRNLQQQYYFNCLCGECKD 254
>gi|298705753|emb|CBJ49061.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 401
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
T P+ + +F + I H+C PN + S Q+ +R KP+A G+ L + Y +
Sbjct: 46 TGPTGLGLFPSGAMINHSCSPNCQAWWRGS-QLEIRCTKPVATGEELCLSYIPIDQPSTV 104
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
RR L+ S +F C+C RC + + G++CP
Sbjct: 105 RRAQLRHSWFFACRCRRCVS-RQWDAELVGLRCP 137
>gi|344236460|gb|EGV92563.1| SET and MYND domain-containing protein 2 [Cricetulus griseus]
Length = 326
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 66/132 (50%), Gaps = 8/132 (6%)
Query: 49 NSPKYENDRRNVVQ----FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI-- 101
N K +N++++++Q L F+ + EF L + + N + E S++
Sbjct: 24 NLDKLDNEKKDLIQSDIAALHQFYSRHLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGS 83
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
AIF + + H+C PN+ ++ + +RA++ I PGD + Y D L+ T +R L+
Sbjct: 84 AIFPDVALMNHSCCPNVIVTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLR 142
Query: 162 TSKYFICQCERC 173
S +F C+C C
Sbjct: 143 DSYFFTCECREC 154
>gi|124506749|ref|XP_001351972.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23505000|emb|CAD51783.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 1114
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ S I +F ASF+ H+C+PN F D + L AM+ I D ++I + L+ +I
Sbjct: 491 SNKSKIKLFKHASFLNHSCFPNASYCFIDDKNICLLAMRTINMYDEITISLINELYTSIQ 550
Query: 156 -RRHHLQTSKYFICQCERC 173
R L K C C RC
Sbjct: 551 YRNEKLNKIKNITCSCNRC 569
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 69/154 (44%), Gaps = 9/154 (5%)
Query: 40 LEAHVEEYKNSP-KYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ E + N+VQ +L +L+ +E I NAH + P
Sbjct: 147 LESHISEVDDKQLVLYAQMANLVQLILPAIELD----LKETAHIFSKFSCNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P +F S I H+C PN F + +RA++P++ +SI Y + T+ R
Sbjct: 203 RPVGTGLFPAISTINHSCVPNAVLLF-EGRTAYVRALQPLSNNTEVSISYIETAATTLKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
+ L+ +F C C RC +E + +G +C +
Sbjct: 262 HNDLK-HYFFTCTCPRCIKGSEEDPLLEGYRCKD 294
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ + +N N+V +L F +L+ +EI + NAH + P
Sbjct: 147 LESHISKVDENQLVLYAQMANLVSLILPFIELD----LKEIAQTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F D +RA++PI + +SI Y + T R
Sbjct: 203 RPLGTGLYPVISIINHSCVPNAVLIF-DGRTAYVRALQPINKDEEVSISYIETATVTKKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ +F C C RC + + +G +C
Sbjct: 262 NNDLK-QYFFTCTCPRCVKGFDEDALLEGFRC 292
>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
Length = 436
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 104 NPSETVRLTARILAKQKIHPERTQSEKLLAVKEFESHLD------KLDNEKRELIQNDIA 157
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 158 ALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 217
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA+K I PG+ + Y D L+ T +R L+ S +F C C C
Sbjct: 218 VTYKGT-LAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCREC 270
>gi|391340382|ref|XP_003744521.1| PREDICTED: son of sevenless homolog 2-like [Metaseiulus occidentalis]
Length = 1624
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 79/167 (47%), Gaps = 18/167 (10%)
Query: 12 HPSYECIIALRCLYKK--SHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK 69
+PS E ++ R K SH +I ++ A VE N+++ L +
Sbjct: 1291 NPSSETLLGTRRQLKDCMSHLQEIKRDRVRGGAFVEML-----------NLLKQFLTAAQ 1339
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSY-IAIFDRASFIEHNCYPNLYKSFTDSGQV 128
L E + +IL+I GI+ VN + + S+ A++ S I+H+CYPNL +F ++
Sbjct: 1340 L-EGVSDLDILEIFGIMCVNTIHISNDDDSFGCALYLAPSLIDHSCYPNLTATFK-GQKI 1397
Query: 129 LLRAMKPIAPGD--HLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+L+ ++P P LS+ Y RR L+ YF C+CE C
Sbjct: 1398 VLKVLRPCEPKTVADLSLAYMPVCTTKERRRKTLREEYYFTCECEMC 1444
>gi|47224323|emb|CAG09169.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 42/72 (58%), Gaps = 1/72 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ + I H+C P++ ++ + +RA++ + PGD + I Y D L+ T +R L+
Sbjct: 125 AVYPDVALINHSCLPSVIVTYNGT-SADVRAVRDMNPGDEVLISYIDVLYPTEDRNTRLR 183
Query: 162 TSKYFICQCERC 173
S YF CQC+ C
Sbjct: 184 ESYYFTCQCQEC 195
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 104 NPSETVRLTARILAKQKIHPERTQSEKLLAVKEFESHLD------KLDNEKRELIQNDIA 157
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 158 ALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 217
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA+K I PG+ + Y D L+ T +R L+ S +F C C C
Sbjct: 218 VTYKGT-LAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCREC 270
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 53.9 bits (128), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ + +N N+V +L F +L+ +EI + NAH + P
Sbjct: 147 LESHISKVDENQLVLYAQMANLVSLILPFIELD----LKEIAQTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F D +RA++PI + +SI Y + T R
Sbjct: 203 RPLGTGLYPVISIINHSCVPNAVLIF-DGRTAYVRALQPINKDEEVSISYIETATVTKKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ +F C C RC + + +G +C
Sbjct: 262 NNDLK-QYFFTCTCPRCVKGFDEDALLEGFRC 292
>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
Length = 485
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 35/71 (49%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
IF S++ HNC PN+ + L A K I G+ L Y +PL G RR L
Sbjct: 336 IFTLYSYLNHNCEPNVRYELDGKTGLKLFARKDIKKGEELLTTYVNPLHGVTLRRRELLV 395
Query: 163 SKYFICQCERC 173
+ F+C CERC
Sbjct: 396 NWGFLCDCERC 406
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 1/85 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + ++ LRA++ I PG+ L I Y + L T +R+ L
Sbjct: 258 VGLYPSLSLLNHDCRPNCVMVFNGT-ELQLRAVQDINPGEELRISYIETLSLTEDRQKQL 316
Query: 161 QTSKYFICQCERCRDPTELNTFYDG 185
+ +F C C+RC + G
Sbjct: 317 EEQYHFTCHCQRCSSKEQDGVMLSG 341
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N+++ ++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKELIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|113674346|ref|NP_001038756.1| N-lysine methyltransferase SMYD2-B isoform 2 [Danio rerio]
gi|95132413|gb|AAI16607.1| SET and MYND domain containing 2b [Danio rerio]
Length = 423
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 6/151 (3%)
Query: 26 KKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF-TQEEILKICG 84
K+ +I L ++EAH+E+ N + + + L F+ + +F + +L +
Sbjct: 110 KERSPSEILLLLGEMEAHLEDMDNEKREMTEAH--IAGLHQFYSKHLDFPDHQALLTLFS 167
Query: 85 IIQVNAHEMPLTEPS--YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHL 142
+ N + E S +AIF + + H+C PN+ ++ +RA+K I+PG +
Sbjct: 168 QVHCNGFTVEDEELSNLGLAIFPDIALLNHSCSPNVIVTYRGI-NAEVRAVKDISPGQEI 226
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERC 173
Y D L+ T +R L+ YF C C+ C
Sbjct: 227 YTSYIDLLYPTADRLERLRDMYYFSCDCKEC 257
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 9/140 (6%)
Query: 52 KYENDR-RNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRAS 108
K +N++ N+V +L F +L+ +EI + NAH + P P ++ S
Sbjct: 126 KLQNEKMANLVSLILPFIELD----LKEIAQTFSKFACNAHTICDPELRPLGTGLYPVIS 181
Query: 109 FIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
I H+C PN F D +RA++PI + +SI Y + T R + L+ +F C
Sbjct: 182 IINHSCVPNAVLIF-DGRTAYVRALQPINKDEEVSISYIETATVTKKRNNDLK-QYFFTC 239
Query: 169 QCERCRDPTELNTFYDGVKC 188
C RC + + +G +C
Sbjct: 240 TCPRCVKGFDEDALLEGFRC 259
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
HPS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 104 HPSETVRLTARILAKQKTHPERTQSEKLLAVKEFESHLD------KLDNEKRELIQNDIA 157
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + ++ L + + N + E S++ AIF + + H+C PN+
Sbjct: 158 ALHHFYSKHLDYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 217
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C C C
Sbjct: 218 VTYKGT-LAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCREC 270
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 53/107 (49%), Gaps = 8/107 (7%)
Query: 89 NAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
NAH + +E P ++ S I H+C+PN F + Q ++RA++PI G L++ Y
Sbjct: 145 NAHTICDSELRPMGTGLYPVISIINHSCFPNAVLLF-EGRQAVVRAVEPIREGSELTVSY 203
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCR-----DPTELNTFYDGVKC 188
+ T +R+ L+ +F C+C RC D + +G +C
Sbjct: 204 IEIAASTASRKKSLKEQYFFDCKCLRCLKVDTPDGLHEDAILEGFRC 250
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/167 (24%), Positives = 77/167 (46%), Gaps = 12/167 (7%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYE-NDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVN 89
P W++L+ L++H ++ N+ D + Q + + +E+ + IL++ + VN
Sbjct: 163 PGEWEQLLALQSHQQDLANAGGQRWQDLLIMSQGIKGYSGTDED--DDLILRLSCAVIVN 220
Query: 90 AHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSF----TDS-GQVLLRAMKPIAPGDHL 142
+ + I + + + + H+C PN Y F TD+ G + + A++ IA + +
Sbjct: 221 SFTLSNATFDSIGVILHPKPALLNHSCDPNAYVRFDVSETDTLGSISVHALRDIAKDEEI 280
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+I Y D R+ L +F CQC C +P +T DG P
Sbjct: 281 TISYIDTTVPCKRRQQQLSERYFFTCQCHLCANPD--DTPQDGYYWP 325
>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
Length = 532
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 69/153 (45%), Gaps = 10/153 (6%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVV---QFLLNFFKLNEEFTQEEILKICGIIQV 88
Q W+ + +L +HV+++K + Y N + QF L + +F I + +
Sbjct: 153 QDWEMVCRLPSHVDDFKRNGTYGNIEMMAMGAPQFALPPNMFDRDF----IAAMYARVMS 208
Query: 89 NAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
NA + P +P I + + H+C PN + D V +R ++PI + I Y
Sbjct: 209 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAF-IMMDGPSVSIRTLRPIRKDKEIFISY 267
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
D + R+ LQT +F C+C +C++ L
Sbjct: 268 IDTTYPYHKRQEELQTRWFFTCRCAKCQEKATL 300
>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 454
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 69/153 (45%), Gaps = 10/153 (6%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVV---QFLLNFFKLNEEFTQEEILKICGIIQV 88
Q W+ + +L +HV+++K Y N + QF L +++F I + +
Sbjct: 75 QDWEMVCRLPSHVDDFKRKGTYGNIEMMAMGAPQFALPPTMFDKDF----IAAMYARVMS 130
Query: 89 NAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
NA + P +P I + + H+C PN + D V +R ++PI + I Y
Sbjct: 131 NALTIITPTLDPLGIILDPTLCSLNHSCDPNAF-IMMDGPSVSIRTLRPIRKDKEIFISY 189
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
D + R+ LQT +F C+C +C++ L
Sbjct: 190 IDTTYPYHKRQEELQTRWFFTCRCAKCQEKATL 222
>gi|392561695|gb|EIW54876.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 342
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 52/100 (52%), Gaps = 6/100 (6%)
Query: 78 EILKICGIIQVNAHEMPLTEPSY---IAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAM 133
++L GI + NA +P + +F A+ + H+C PNL +S+ + S Q+ RA+
Sbjct: 121 KVLPALGIFETNA--LPCGKARSGRKAGVFLTAARLNHSCRPNLARSWDEASQQMTFRAL 178
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +A G+ L + Y D + R LQ++ F C CE C
Sbjct: 179 RDVAEGEELCLNYVDVIGTRAQRTEELQSAYGFECVCEAC 218
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 53.5 bits (127), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 50/209 (23%), Positives = 86/209 (41%), Gaps = 40/209 (19%)
Query: 15 YECIIALRCLYKKSHQP--------------QIWDKLMQLEAHVEEYK-------NSPKY 53
+EC I +R L +K P ++ + +QL+A V+ Y + +
Sbjct: 98 FECAILVRLLKEKKRSPTPSLRLVMRFLIKRRLQAERVQLKAAVDNYDVCEVLPTHMSET 157
Query: 54 ENDR-------RNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIF 104
DR ++VQ ++ +E +E+ ++ NAH + E P ++
Sbjct: 158 SEDRLVMYAQMASIVQQMMA----PDEVNVKEVTQMICRFACNAHTICDEEVRPLGTGLY 213
Query: 105 DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSK 164
S + H+C PN F D + LRA++ G ++I Y + T RR L+
Sbjct: 214 PVISIVNHSCVPNAVLHF-DGNRAALRALEDTQEGTEITISYVELAASTNTRRKALRDQY 272
Query: 165 YFICQCERC-----RDPTELNTFYDGVKC 188
YF C C RC R+ + + F +G C
Sbjct: 273 YFDCNCIRCSRLVTREGSREDAFLEGYGC 301
>gi|302774176|ref|XP_002970505.1| hypothetical protein SELMODRAFT_411158 [Selaginella moellendorffii]
gi|300162021|gb|EFJ28635.1| hypothetical protein SELMODRAFT_411158 [Selaginella moellendorffii]
Length = 837
Score = 53.1 bits (126), Expect = 5e-05, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
S I H+C PN Y ++ + VL +R KP++ GD +SI Y + L R L+
Sbjct: 232 SLINHSCSPNAYVAYLANSTVLCVRTTKPVSAGDEISIAYINVLAPAFQRHQELRPFGIK 291
Query: 167 ICQCERCRDPTELNT 181
C C RC++ L T
Sbjct: 292 SCYCSRCKEEDTLPT 306
>gi|321474757|gb|EFX85721.1| hypothetical protein DAPPUDRAFT_98275 [Daphnia pulex]
Length = 222
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 49/92 (53%), Gaps = 3/92 (3%)
Query: 30 QPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVN 89
Q + + LE H E + S + + NVV+FL +L+++FTQEEI CG + VN
Sbjct: 124 QEKCAKQFNGLENHNELRRTSDYWSVYQVNVVEFLHKVCELSDQFTQEEIHSACGALDVN 183
Query: 90 AHEM---PLTEPSYIAIFDRASFIEHNCYPNL 118
A+E+ + + +F AS + HNC N+
Sbjct: 184 AYEIRPPNSGQQRILGLFPLASMMSHNCVSNM 215
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 81/182 (44%), Gaps = 21/182 (11%)
Query: 13 PSYECIIALRCLYKKSHQ-----PQIWDKLMQLEAHVEEYKNSPK--YENDRRNVVQFLL 65
PS + R L K+ Q + + + + E+H+ + N K E+D + L
Sbjct: 100 PSETVRLTARILAKQKTQTERTPSETFLSVKEFESHLSKLDNEKKELIESD----IAALH 155
Query: 66 NFFKLNEEFTQEEILK-ICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSF 122
F+ N +T L + + N + E S++ AIF + + H+C PN+ +F
Sbjct: 156 RFYSKNLHYTDNAALVFLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNIIVTF 215
Query: 123 TDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC----RDPT 177
G V +RA++ I GD + Y D L+ T +R L S +F C C C +DP
Sbjct: 216 --KGTVAEIRAVQEIHAGDEVFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECSTKQKDPA 273
Query: 178 EL 179
+L
Sbjct: 274 KL 275
>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
Length = 265
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 73/150 (48%), Gaps = 6/150 (4%)
Query: 19 IALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEE 78
+A R L++++ I + L +LE H +E + K + VV + + + F
Sbjct: 120 LAARLLWRRARCGGI-NGLWRLEHHWDELDDRRKQLYAQMAVVTWWVARWGTWPGF--RT 176
Query: 79 ILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPI 136
+ ++ ++ N H + E P +A++ + + H+C P+ ++F S + LRA++ +
Sbjct: 177 VAQLLSLLSCNCHTVCDEELRPLGVALYPTGALVNHSCSPSTVQTFHGS-TLELRALRQL 235
Query: 137 APGDHLSICYTDPLWGTINRRHHLQTSKYF 166
APGD ++I Y + RR L S +F
Sbjct: 236 APGDEITIAYIELAATRQERRETLADSYFF 265
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 83/173 (47%), Gaps = 19/173 (10%)
Query: 13 PSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ----F 63
PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 102 PSETVRLTARILAKQKIHPERTPSEKLLAVREFESHLD------KLDNEKKDLIQSDIAA 155
Query: 64 LLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYK 120
L F+ EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 156 LHQFYSKYLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIV 215
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 216 TYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVREFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHQFYSKYLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 9/152 (5%)
Query: 40 LEAHVEEY-KNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--PLT 96
LE+H+ + +N N+V +L F +L+ +EI + NAH + P
Sbjct: 147 LESHISKVDENQLVLYAQMANLVSLILPFIELD----LKEIAQTFSKFACNAHTICDPEL 202
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P ++ S I H+C PN F D +RA++PI + +SI Y + T R
Sbjct: 203 RPLGTGLYPVISIINHSCVPNAVLIF-DGRTAYVRALQPIDKDEEVSISYIETAAVTKKR 261
Query: 157 RHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ L+ +F C C RC + + +G +C
Sbjct: 262 NNDLK-QYFFTCTCPRCVKGFDEDPLLEGFRC 292
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 53.1 bits (126), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RCR
Sbjct: 254 RDQYCFECDCFRCR 267
>gi|406868025|gb|EKD21062.1| SET and MYND domain-containing protein 2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 570
Score = 53.1 bits (126), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 65/142 (45%), Gaps = 16/142 (11%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLN------FFKLNEEFTQEEILKICGIIQVNAHE- 92
LE+HV K + DR+ + LL F K E +E I+ +C ++ NA
Sbjct: 149 LESHV------GKLQGDRKRWEEVLLQARAGIEFTKSRVERMEEAIMYLC-VLSTNAFRI 201
Query: 93 -MPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW 151
+P P + + H+C PN ++S + LRA++PI + + I Y DP
Sbjct: 202 TLPDNTPFGMCFSPTLALANHSCKPNAI-IVSNSRSISLRALRPIKKNEQIFISYIDPTE 260
Query: 152 GTINRRHHLQTSKYFICQCERC 173
+R+ L+ +F C+C+ C
Sbjct: 261 DLPSRQSKLKERYFFTCKCDSC 282
>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
grubii H99]
Length = 447
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 74/169 (43%), Gaps = 35/169 (20%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDR-RNVVQFLLNFFKLNEEF------------------ 74
WDKL +E+HV K +P+ + + + VQ L ++ ++
Sbjct: 51 WDKLSLMESHV---KQAPEEDIMKFASQVQHLEHYLSASKPLLPGEDPAVLHPSYMSDYG 107
Query: 75 --TQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VL 129
+E+L +C IQVNA + P P ++I + H+C PN F G+ +
Sbjct: 108 FNGVDEVLNLCSAIQVNAFTLTSPSLAPIGMSISPLLALANHSCEPNAIAVFPKGGRDIF 167
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT---SKY-FICQCERCR 174
L A+ I PG+ + Y D T HH Q+ S+Y F+C C C+
Sbjct: 168 LVALNDIPPGEEILTSYID----TSTPYHHRQSELLSRYRFVCHCSLCQ 212
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVREFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L F+ EF L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHQFYSKYLEFPDHSSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+C C
Sbjct: 215 VTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECREC 267
>gi|268637641|ref|XP_002649106.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|256012848|gb|EEU04054.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 449
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 46/117 (39%), Gaps = 7/117 (5%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA----SFIEHNCYPNLYKSFTDS 125
L T EIL+ + NAH+ I R + H+C PN S D
Sbjct: 174 LKNTITIHEILECFSSVLTNAHQFSYATSKEIG---RGVCPTGYFNHSCMPNTTWSLDDQ 230
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
G +L + GD LS+ Y + NRR L YF CQC C + L+ +
Sbjct: 231 GMLLFSTSSNVKKGDELSLGYLANEYPLKNRRRELLDGYYFFCQCPLCEFQSNLSGY 287
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 25/138 (18%)
Query: 54 ENDRRNVVQFLLNFFKLNEEFTQEEILK-----------------ICGIIQVNAHEMPLT 96
E+ + + Q L F + FT+EEI IC + EM
Sbjct: 138 EDKKEGLRQLALTF----QHFTREEIQDASQLPPSFDLFEAFAKVICNSFTICNAEM--- 190
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
+ + ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + R
Sbjct: 191 QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEER 249
Query: 157 RHHLQTSKYFICQCERCR 174
R L+ F C C RC+
Sbjct: 250 RKQLRDQYCFECDCIRCQ 267
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + F + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLGFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|449283344|gb|EMC90014.1| SET and MYND domain-containing protein 2, partial [Columba livia]
Length = 385
Score = 52.8 bits (125), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 84/174 (48%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 46 NPSETVRLTARILAKQKTHPERTQSEKLLAVKEFESHLD------KLDNEKRELIQNDIA 99
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 100 ALHHFYSKHLEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 159
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA+K I PG+ + Y D L+ T +R L+ S +F C C C
Sbjct: 160 VTYKGT-LAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFNCDCREC 212
>gi|281202062|gb|EFA76267.1| autophagy protein 9 [Polysphondylium pallidum PN500]
Length = 1064
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 48/88 (54%), Gaps = 7/88 (7%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
AS+ H+C PNL + V+ +A+ I+ G +SI Y D T R+ +L+T YF
Sbjct: 934 ASYFNHSCCPNL-ADVRGTTVVVFKALHFISKGTPISISYLDLDQPTHERQSYLKTFYYF 992
Query: 167 ICQCERCRDPTE-----LNTFY-DGVKC 188
CQC RC+D T+ ++ FY D KC
Sbjct: 993 TCQCLRCKDQTDESDNWISRFYCDRFKC 1020
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 254 RDQYCFECDCLRCQ 267
>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
Length = 439
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 68/160 (42%), Gaps = 8/160 (5%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFT--QEEILKICGIIQVNAHEM 93
K L +H E KN PK ++ L + + + E++ I G + N +
Sbjct: 111 KFRDLMSHYAEIKNDPKRLEHLDSLHAVLTDMMADSPSTVPNKSELMSIYGRLITNGFNV 170
Query: 94 PLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDP 149
E + IA I+ S +H+C PN +F + ++ + A++ P + I Y D
Sbjct: 171 LDAEMNSIATAIYLGVSITDHSCQPNAVATF-EGNELHIHALEDMPCLDWSKIFISYIDL 229
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L RR L+ YF+C C +CRDP E CP
Sbjct: 230 LNTPEQRRQDLKDHYYFLCVCSKCRDPKEARQM-TAAACP 268
>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
Length = 352
Score = 52.4 bits (124), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCSRCQ 78
>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 673
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 6/82 (7%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT- 153
EP +I AI+ S I H+CYPN+ + SG V+LR ++ I G + CY P W +
Sbjct: 474 EPRHIGGAIYPSISLINHSCYPNVVRHSYPSGTVVLRTLRFIGKGTEILDCY-GPHWLSE 532
Query: 154 --INRRHHLQTSKYFICQCERC 173
++R +L F+C CE C
Sbjct: 533 KRLSRLEYLWKKYRFLCACEAC 554
>gi|326915197|ref|XP_003203906.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Meleagris
gallopavo]
Length = 575
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 83/174 (47%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++R ++Q
Sbjct: 243 NPSETVRLTARILAKQKIHPERTQSEKLLAVKEFESHLD------KLDNEKRELIQNDIA 296
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 297 ALHHFYSKHMEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 356
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ +RA+K I PG+ + Y D L+ T +R L+ S +F C C C
Sbjct: 357 VTYKGI-LAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCDCREC 409
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 1/62 (1%)
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN SF G+ +RA++PIA + + I Y D + RRH LQ F CQC
Sbjct: 244 HSCDPNASYSFA-KGKCYIRAIRPIAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCP 302
Query: 172 RC 173
+C
Sbjct: 303 KC 304
>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=HSKM-B; AltName: Full=Histone methyltransferase
SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
AltName: Full=SET and MYND domain-containing protein 2
gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
Length = 433
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + F + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLGFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 101 NPSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 154
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + F + L + + N + E S++ AIF + + H+C PN+
Sbjct: 155 ALHHFYSKHLGFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 214
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 215 VTYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 267
>gi|403413538|emb|CCM00238.1| predicted protein [Fibroporia radiculosa]
Length = 275
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 56/113 (49%), Gaps = 6/113 (5%)
Query: 67 FFKLNEEF-TQEEILKICGIIQVNAHEMPLTEPS----YIAIFDRASFIEHNCYPNLYKS 121
FF L+ ++ + ++L GI + NA P+ + + + +F A+ + H+C PN+ +
Sbjct: 100 FFSLSNKYKSGRKVLPGLGIFRTNAVPCPIMKANCAEERLGLFFAAARLNHSCTPNVSRV 159
Query: 122 FTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ QVL A++ I PG+ L I Y D L RR L + F C C C
Sbjct: 160 WDSEEQVLKFYALRHIGPGEELCISYLDVLETRDERRAELWSHFGFECACSVC 212
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 30/92 (32%), Positives = 45/92 (48%), Gaps = 4/92 (4%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC + EM + + ++ S + H+C PN F + +LLRA++ I G+
Sbjct: 179 ICNAFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEE 234
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
L+ICY D L + RR L+ F C C RC
Sbjct: 235 LTICYLDMLMTSEERRKQLKDQYCFECDCVRC 266
>gi|116283746|gb|AAH28104.1| SMYD2 protein [Homo sapiens]
Length = 417
Score = 52.4 bits (124), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/173 (24%), Positives = 86/173 (49%), Gaps = 19/173 (10%)
Query: 13 PSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ----F 63
PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 86 PSETVRLTARILAKQKIHPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIAA 139
Query: 64 LLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYK 120
L +F+ + F + L + + N + E S++ AIF + + H+C PN+
Sbjct: 140 LHHFYSKHLGFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIV 199
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I PG+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 200 TYKGT-LAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 251
>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
[Equus caballus]
Length = 358
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 125 VGLYPSMSLLNHSCEPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 183
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 184 RDQYCFECDCFRCQ 197
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
Length = 515
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 37/72 (51%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
SF+ HNC PN+ + ++ + A K I + L Y +PL G RR L+ + FI
Sbjct: 347 SFLNHNCEPNVRYDINNKLELKVYARKFIKKDEELVTTYVNPLHGVSLRRRELRVNWGFI 406
Query: 168 CQCERCRDPTEL 179
C C+RC EL
Sbjct: 407 CNCDRCAKEIEL 418
>gi|443728057|gb|ELU14532.1| hypothetical protein CAPTEDRAFT_199484 [Capitella teleta]
Length = 374
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 47/93 (50%), Gaps = 5/93 (5%)
Query: 87 QVNAHEMPLTEPSYIAI----FDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHL 142
Q++ + + + P + + + R S ++H+C PN + +F + +V +R ++PI L
Sbjct: 101 QISVNSITINHPQAVGLGVGLYLRFSAVDHSCRPNAFANF-EGTKVCVRCIEPIEDEKDL 159
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
I Y PL T RR +L F C CE C D
Sbjct: 160 RISYVSPLDDTATRRKNLLQKYLFECTCEACLD 192
>gi|343475522|emb|CCD13111.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 433
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 70/154 (45%), Gaps = 18/154 (11%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGII 86
KS +P L +L+ H E + + +V+ + + +EE E + ++ GII
Sbjct: 111 KSRRPGYRVVLNELQGHAAEVSQNLSH------MVRMVGELLR-DEELPLE-VARLIGII 162
Query: 87 QVN----AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHL 142
+ N ++E+ L + S+ H+C PN G+ ++ + IA G+ L
Sbjct: 163 RCNTIEVSNELSLGVGQALHATTITSYFNHSCSPNC----AIQGEFIV-TTRVIAAGEEL 217
Query: 143 SICYTDPL-WGTINRRHHLQTSKYFICQCERCRD 175
+I Y L W RR L + YF C CERCRD
Sbjct: 218 TISYMPQLYWPVALRREELANTYYFHCSCERCRD 251
>gi|378727601|gb|EHY54060.1| hypothetical protein HMPREF1120_02237 [Exophiala dermatitidis
NIH/UT8656]
Length = 578
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 27/90 (30%), Positives = 47/90 (52%), Gaps = 1/90 (1%)
Query: 85 IIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQV-LLRAMKPIAPGDHLS 143
+++ NA EM + ++A+F AS I H C PN + + A + I+P + ++
Sbjct: 312 VVKANAFEMQVGGRMHLAVFPEASRINHACGPNAQYHLQPTLLTHYVYAARSISPDEEIT 371
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I Y PL +R +L+++ +F C C RC
Sbjct: 372 IAYAPPLKFRSDRMAYLESTFHFRCTCSRC 401
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
Length = 239
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCIRCQ 78
>gi|340966897|gb|EGS22404.1| mannose-6-phosphate isomerase-like protein [Chaetomium thermophilum
var. thermophilum DSM 1495]
Length = 859
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHH 159
+A+F + I HNC PN + F+D+ + L + + I PG+ ++I Y+ + R+
Sbjct: 618 MALFTNLARINHNCRPNAFIHFSDTTLAMTLWSAREIQPGEEITITYSPAGRTSKERQRM 677
Query: 160 LQTSKYFICQCERCRDPTEL 179
L+++ F C CE C P E+
Sbjct: 678 LKSTWGFECHCELCTSPPEV 697
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 10/111 (9%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC PT+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRC--PTQ 269
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC V EM + + ++ S + H+C PN F + +LLRA++ I G+
Sbjct: 179 ICNSFTVCNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEE 234
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
L+ICY D L + RR L+ F C C RC+
Sbjct: 235 LTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
norvegicus]
Length = 239
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCRDPTE 178
+ F C C RC PT+
Sbjct: 65 RDQYCFECDCIRC--PTQ 80
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQ 267
>gi|452847803|gb|EME49735.1| hypothetical protein DOTSEDRAFT_68494 [Dothistroma septosporum
NZE10]
Length = 364
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/157 (26%), Positives = 75/157 (47%), Gaps = 12/157 (7%)
Query: 31 PQIWDKLMQL-EAHVEEYKNSPKYENDRRNVV--QFLLNFFKLNEEFTQEEILKICGIIQ 87
P+++ K L ++ E+Y + +EN R+ + LL EE E+ ++ GI+Q
Sbjct: 69 PELYQKFSSLSQSEQEKYLDLSYHENPARDALLKDKLLQRGVGKEEL--HEMAQVAGIMQ 126
Query: 88 VNAHEMPLTEPS---YIAIFDRASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDH 141
NA + L + Y A+F + I H+C PN + F SG++++ A+K + G+
Sbjct: 127 NNAFNVDLGDGQGSCYRALFPNVARINHSCSPNAHVCFYPKGSSGRMMVHALKTLDAGEE 186
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ I Y L R+ Q F C+C C + ++
Sbjct: 187 IRISYFSILQAFAERQAKAQKWG-FSCRCSGCDERSD 222
>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
Length = 239
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCIRCQ 78
>gi|328868687|gb|EGG17065.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 333
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+A++ ASF+ H+C+PN + ++G + + A++ I D ++ICY + RR L
Sbjct: 211 LAMYATASFLNHSCFPNCARVQRNAG-IDIVAIRDIEENDEITICYINARDNDTARRMIL 269
Query: 161 QTSKYFICQCERC 173
+ YF CQC RC
Sbjct: 270 KGCYYFDCQCIRC 282
>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
Length = 369
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 136 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 194
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 195 RDQYCFECDCFRCQ 208
>gi|302753848|ref|XP_002960348.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
gi|300171287|gb|EFJ37887.1| hypothetical protein SELMODRAFT_437450 [Selaginella moellendorffii]
Length = 614
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 35/119 (29%), Positives = 60/119 (50%), Gaps = 9/119 (7%)
Query: 77 EEILKICGIIQVNAHEMPLT--EPSYIAI--FDRASFIEHNCYPNLYKSFTDSGQVL-LR 131
EE+ + +I N+H M + +++A+ F S + H+C PN F G V+ +R
Sbjct: 267 EELEMLATLINTNSHGMGAQNLQNTHVALGLFPYVSILNHSCRPNC--CFASEGSVMYVR 324
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP--TELNTFYDGVKC 188
A++ I G L + Y + R+ L +K+F C C+RC +P T ++ F +G C
Sbjct: 325 AVQDIPKGAELCLSYINLYESRRVRKTLLVATKHFDCTCDRCVEPLNTSIDRFLEGCVC 383
>gi|68076465|ref|XP_680152.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56501046|emb|CAH95429.1| conserved hypothetical protein [Plasmodium berghei]
Length = 1047
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN-RRHH 159
I +F +A F+ H+C+PN F D+ ++ AM+ I D +SI + + L+ +I R+
Sbjct: 515 ILLFKKAYFLNHSCFPNSSYCFMDNNKICFIAMRKINMYDEISISFINELYASIEYRKKK 574
Query: 160 LQTSKYFICQCERC 173
L K C C RC
Sbjct: 575 LSDIKNTSCSCNRC 588
>gi|389750923|gb|EIM91996.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 477
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 61/140 (43%), Gaps = 18/140 (12%)
Query: 55 NDRRNVVQFLLNFFK---LNEEFTQEEILKIC---------GIIQVNA---HEMP-LTEP 98
ND RN+ Q ++N L +Q L C GI+ NA +P L
Sbjct: 183 NDGRNLGQTIVNHVVDRILPPGDSQYRALSNCKPNEPNPLKGIMDTNALPIGPLPGLMTQ 242
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTD-PLWGTINR 156
++ + AS I H+C PN S+ DS +RA+KPIA G+ ++I Y PL R
Sbjct: 243 DWLGVAAIASRINHSCCPNACVSWDLDSFTFTVRALKPIAAGEQVTISYLGWPLESRTKR 302
Query: 157 RHHLQTSKYFICQCERCRDP 176
R L F C C C P
Sbjct: 303 RAELLDKYSFTCTCPTCSLP 322
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 1/85 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + ++LLRA++ ++PG+ L+I Y + L +R+ L
Sbjct: 198 VGLYPSLSLLNHDCRPNCVMVF-EGTKLLLRAVRGLSPGEELTISYIETLSLNEDRQQRL 256
Query: 161 QTSKYFICQCERCRDPTELNTFYDG 185
+ F C C+ C P G
Sbjct: 257 EDQYCFTCHCQCCNSPDNDKLMLSG 281
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 254 RDQYCFECDCFRCQ 267
>gi|295664524|ref|XP_002792814.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278928|gb|EEH34494.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 365
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ G I+ NA MP + AIF +A+ I H+C PN ++ + G++ ++A K I G
Sbjct: 124 VIGTIKTNA--MPFGARGAEGAIFPQAARINHSCQPNSQNTWNRNLGKLTIQAFKDIDKG 181
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
+ ++I Y D R+ + + F C+CE C P E DG
Sbjct: 182 EEITIAYIDGTELYETRQECFENAFGFRCRCEVCAIPAEATKKRDG 227
>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
Length = 239
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCIRCQ 78
>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
dendrobatidis JAM81]
Length = 227
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLS 143
GI + N E + A+F AS+ H+C+PN +S ++ R +K ++ G+ L+
Sbjct: 85 GIWKPNGKEACMGR----AVFPAASYFNHSCFPNC-QSIKHDHKMAFRTLKDVSKGEMLT 139
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I Y D RR L +F C CERC
Sbjct: 140 ISYIDTNMPVSARRARLMDDYFFECMCERC 169
>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
Length = 251
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC + EM + + ++ S + H+C PN F + +LLRA++ I G+
Sbjct: 2 ICNAFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEE 57
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
L+ICY D L + RR L+ F C C RC+
Sbjct: 58 LTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 90
>gi|145346226|ref|XP_001417594.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577821|gb|ABO95887.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 253
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 42/90 (46%), Gaps = 1/90 (1%)
Query: 85 IIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLS 143
++ + S A++ S + H+C PN + + + L +RA++ IAPG+ L+
Sbjct: 150 VLSAGLDAITQGTASGSAVYKYVSLLNHSCAPNCHTHWENGDSSLTIRALREIAPGEELT 209
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I Y D RR L S F C C RC
Sbjct: 210 ITYVDADSPRDARRARLANSYAFDCACSRC 239
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC + EM + + ++ S + H+C PN F + +LLRA++ I G+
Sbjct: 179 ICNAFTICNAEM---QEVGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEE 234
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
L+ICY D L + RR L+ F C C RC+
Sbjct: 235 LTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|302767894|ref|XP_002967367.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
gi|300165358|gb|EFJ31966.1| hypothetical protein SELMODRAFT_439878 [Selaginella moellendorffii]
Length = 614
Score = 52.0 bits (123), Expect = 1e-04, Method: Composition-based stats.
Identities = 35/119 (29%), Positives = 60/119 (50%), Gaps = 9/119 (7%)
Query: 77 EEILKICGIIQVNAHEMPLT--EPSYIAI--FDRASFIEHNCYPNLYKSFTDSGQVL-LR 131
EE+ + +I N+H M + +++A+ F S + H+C PN F G V+ +R
Sbjct: 267 EELEMLATLINTNSHGMGAQNLQNTHVALGLFPYVSILNHSCRPNC--CFASEGSVMYVR 324
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP--TELNTFYDGVKC 188
A++ I G L + Y + R+ L +K+F C C+RC +P T ++ F +G C
Sbjct: 325 AVQDIPKGAELCLSYINLYESRRVRKTLLVATKHFDCTCDRCVEPLNTSIDRFLEGCVC 383
>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
Length = 409
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 72/175 (41%), Gaps = 28/175 (16%)
Query: 12 HPSYECIIAL----RCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
HP+ II L L K S M L +H +E + +D ++N
Sbjct: 13 HPTVRGIIQLVSNPAALAKDS-------PFMDLASHEDELRRVQSKWDDISLQAHAVVNL 65
Query: 68 FKLNEEFTQEEILKICGI------IQVNAHEMPLT---EPSYIAIFDRASFIEHNCYPNL 118
+ EF + + +C + ++ N P+ +P + RA+ H+C PN
Sbjct: 66 LQRKPEFVKYALESLCRLSTNGFRVESNVGNGPIGLCLDP----LLARAN---HSCRPNA 118
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+F D + LRA+ PIA G+ + I Y D RR L + +F C C RC
Sbjct: 119 AITF-DGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRC 172
>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 369
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 136 VGLYPSISLLNHSCDPNCSVVF-NGPHLLLRAVRDIEVGEELTICYLDTLMTSEERRKQL 194
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 195 RDQYCFECDCFRCQ 208
>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
mutus]
Length = 221
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ + G+ L+ICY D L + RR L
Sbjct: 21 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDVEAGEELTICYLDMLMTSEERRKQL 79
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 80 RGQYCFDCDCFRCQ 93
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 1/73 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERC 173
+ F C C RC
Sbjct: 254 RDQYCFECDCFRC 266
>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
abelii]
gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
Length = 258
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 76 QEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
+EE IC + EM + + ++ S + H+C PN F + +LLRA++
Sbjct: 3 EEEEKVICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRD 58
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 59 IEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 97
>gi|168035501|ref|XP_001770248.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678465|gb|EDQ64923.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 691
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 30/109 (27%), Positives = 51/109 (46%), Gaps = 10/109 (9%)
Query: 74 FTQEEILKICGIIQVNA---------HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
F Q EI ++CGI+Q NA + +T ++ SF+ H+C P++
Sbjct: 386 FPQNEIARLCGIVQCNAFGHTQVTRKNARDITRSWSCGLWMLPSFMNHSCTPSVATVVIG 445
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +++ A + + GD L++ Y D RR + S F+C C RC
Sbjct: 446 NAMIIV-AARDLKCGDELTVAYFDIFRPLQERRASMLHSWNFMCSCPRC 493
>gi|115395962|ref|XP_001213620.1| predicted protein [Aspergillus terreus NIH2624]
gi|114193189|gb|EAU34889.1| predicted protein [Aspergillus terreus NIH2624]
Length = 425
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 13/169 (7%)
Query: 26 KKSHQPQIWDKLMQLEAHVEE--YKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKIC 83
++ + PQ D QLE H +E ++N+P++E R + + + + T E+ +
Sbjct: 92 RRKYTPQELDLFQQLETHEKEIRHENAPQWE--RIALSSKAVKAYSQTD--TPEDTISAF 147
Query: 84 GI-IQVNAHEMPLTEPSYIAIFDR--ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
G + VN+ M I ++ A+ I H+C N F D ++ A++PI +
Sbjct: 148 GAKLDVNSFNMTTALADRIGLYLHPYAALINHSCAYNAVIGF-DGAELFATALRPITRDE 206
Query: 141 HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ I Y D RR+ L+ +F C+C +C E++ DG + P
Sbjct: 207 QIFISYVDATNPVAVRRNELRERYFFDCRCAKC--AAEMDA-PDGGRAP 252
>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
CCMP2712]
Length = 98
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 45/73 (61%), Gaps = 3/73 (4%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN-RRHHL 160
++ ASFI H+C PN SF D ++++RA++ I G ++I Y + L+ ++ RR L
Sbjct: 28 GLYPLASFINHSCEPNAIISF-DGNKLVVRALENIPRGTEITIAYVE-LYAPLDVRRDAL 85
Query: 161 QTSKYFICQCERC 173
+ K F+C+C RC
Sbjct: 86 LSRKGFLCRCSRC 98
>gi|113206679|gb|ABI34490.1| SET and MYND domain containing 1b [Danio rerio]
Length = 180
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 45/87 (51%), Gaps = 6/87 (6%)
Query: 109 FIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
+ H+C+PN ++G++ LRA+ I+ G+ +++ Y D L + +R+ L+ +F C
Sbjct: 2 LVNHDCWPNC-TVILNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDC 60
Query: 169 QCERCR-----DPTELNTFYDGVKCPE 190
C+ C D DGVK PE
Sbjct: 61 TCKHCTEKIKDDLKMAGAEVDGVKVPE 87
>gi|342179951|emb|CCC89425.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 433
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 68/159 (42%), Gaps = 28/159 (17%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEE-----ILK 81
KS +P L +L+ H E V Q L + ++ E ++E + +
Sbjct: 111 KSRRPGYRVVLNELQGHAAE-------------VSQNLSHMVRMVGELLRDEELPLEVAR 157
Query: 82 ICGIIQVNA----HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIA 137
+ GII+ N +E+ L + S+ H+C PN G+ ++ + IA
Sbjct: 158 LIGIIRCNTIEVNNELSLGVGQALHATTITSYFNHSCSPNC----AIQGEFIV-TTRVIA 212
Query: 138 PGDHLSICYTDPL-WGTINRRHHLQTSKYFICQCERCRD 175
G+ L+I Y L W RR L + YF C CERCRD
Sbjct: 213 AGEELTISYMPQLYWPVALRREELANTYYFHCSCERCRD 251
>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
Length = 697
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++C +Q N+ + + + I A++ AS+ H+C PN+ + A++ I
Sbjct: 547 RLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRK 606
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L +S F C+C RC
Sbjct: 607 GEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 51.6 bits (122), Expect = 1e-04, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++C +Q N+ + + + I A++ AS+ H+C PN+ + A++ I
Sbjct: 547 RLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRK 606
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L +S F C+C RC
Sbjct: 607 GEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/211 (21%), Positives = 88/211 (41%), Gaps = 27/211 (12%)
Query: 5 ISNFTTPHPSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRN 59
I + P PS +A R L++ Q + L LE H+ + + ++ + +
Sbjct: 88 IKTYGKP-PSENVRLAARILWRMDKQGSVVSDNQLTTLEDLEDHICDI-SEDDLKDFKVD 145
Query: 60 VVQFLLNFFKLNEEFTQEEILKICGIIQVNA---HEMPLTEPSYIAIFDRASFIEHNCYP 116
+ FL + + ++ T + + I G+I N + + + +F + H+C+P
Sbjct: 146 IHNFLDYWPRNSKPHTVDSVSHILGVINCNGFMVSDQRGLQAVGVGLFPNLCLVNHDCWP 205
Query: 117 N------------LYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSK 164
N + F ++ LRA+ I+ G+ +++ Y D L + +R+ L+
Sbjct: 206 NCTVILNNGNQSAIDTVFHSQKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQY 265
Query: 165 YFICQCERCR-----DPTELNTFYDGVKCPE 190
+F C C+ C D DGVK PE
Sbjct: 266 FFDCTCKHCTEKIKDDLKMAGAEVDGVKVPE 296
>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
anubis]
Length = 258
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 76 QEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
+EE IC + EM + + ++ S + H+C PN F + +LLRA++
Sbjct: 3 EEEEKVICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRD 58
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 59 IEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 97
>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
glaber]
Length = 252
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F+ +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 19 VGLYPSMSLLNHSCDPNCSIVFSGP-HLLLRAVRDIEVGEELTICYLDMLMTSEERRKQL 77
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 78 RDQYCFDCDCFRCQ 91
>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Ornithorhynchus anatinus]
Length = 415
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 4/93 (4%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC ++ EM + + ++ S + H+C PN F + + LRA++ I G+
Sbjct: 153 ICNGFTISNGEM---QEVGVGLYPSMSLLNHSCDPNCVIVF-EGTSLFLRAVREIQKGEE 208
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
L+ICY D L + R+ L+ F C C RC+
Sbjct: 209 LTICYLDVLLPSQERQKQLKEQYCFACDCIRCK 241
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 207 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQL 265
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 266 RDQYCFECDCFRCQ 279
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 67/164 (40%), Gaps = 7/164 (4%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
+R L + + D + L+ H EE + + + + + + E + +
Sbjct: 105 VRALLQTLLNKETEDGIAALDGHTEERRKTKSWPDLEMMAMAACAFAGRQGESNIRRAVE 164
Query: 81 KICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
+C I Q NA + + +F + H+C PN F + +LRA +PI
Sbjct: 165 LLCKI-QTNAFHRWDVDLGQVGVFLEPTLAMANHSCVPNAVVQFVGR-KAILRAERPIHA 222
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
GD + I YTD RR L+ F C C RC+ +LN +
Sbjct: 223 GDEIEISYTDYTMPLSTRREALEQYS-FECTCARCKG--DLNVY 263
>gi|328865944|gb|EGG14330.1| hypothetical protein DFA_12100 [Dictyostelium fasciculatum]
Length = 521
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 58/120 (48%), Gaps = 10/120 (8%)
Query: 55 NDRRNVVQFLLNFFKLNEEFTQEEILKI-CGIIQVNAHEMPLTEPSYIAIFDRASFIEHN 113
ND+ V + LL+FF +EI+ C II + E F +A+ + H+
Sbjct: 174 NDK--VEESLLSFFD------HDEIVNTACTIITNSFGETSNNTTITNGFFYQAALLNHS 225
Query: 114 CYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
C PN + SF + ++ +R +K + D + Y D L T R+ L SK F CQC+RC
Sbjct: 226 CQPNAFFSF-NGDKLQMRVVKDMDKDDSIYDSYVDLLLPTYERQLGLLKSKNFFCQCKRC 284
>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++C +Q N+ + + + I A++ AS+ H+C PN+ + A++ I
Sbjct: 547 RLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRK 606
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L +S F C+C RC
Sbjct: 607 GEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 1/73 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + ++ LRA++ I P D L+I Y + L T +R+ L
Sbjct: 201 VGLYPSLSLLNHDCRPNCVMVF-EGTKLELRAVRDIDPEDELTISYIETLSLTEDRQRQL 259
Query: 161 QTSKYFICQCERC 173
+ +F C C+RC
Sbjct: 260 EEQYHFTCHCQRC 272
>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
leucogenys]
Length = 369
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 110 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 161
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 162 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQ 208
>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 499
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 35/66 (53%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
S I HNC PN+ + ++ + A K I+ G L Y +PL G RR L+ + F+
Sbjct: 347 SLINHNCEPNVRFEVVSNKEIRVYARKNISAGQELLTNYINPLHGVKLRRRELRVNYGFL 406
Query: 168 CQCERC 173
C C+RC
Sbjct: 407 CHCDRC 412
>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
Length = 697
Score = 51.6 bits (122), Expect = 2e-04, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 3/96 (3%)
Query: 81 KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
++C +Q N+ + + + I A++ AS+ H+C PN+ + A++ I
Sbjct: 547 RLCAALQCNSFGLFNADGNCIGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPK 606
Query: 139 GDHLSICYTD-PLWGTINRRHHLQTSKYFICQCERC 173
G+ L+ICY D T RR L +S F C+C RC
Sbjct: 607 GEPLTICYVDVQETSTAERRRTLFSSYRFFCECARC 642
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/170 (25%), Positives = 80/170 (47%), Gaps = 13/170 (7%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDK-----LMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
PS + R + K+ HQ + L +LEAH+++ N ND + L +F
Sbjct: 102 PSETVRLVARIILKQKHQTERTPSERVLTLRELEAHLDKLDNEKNEMND--TDIAALHHF 159
Query: 68 FKLNEEFTQEEIL-KICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTD 124
+ + +F L ++ + N + E S++ A+F + + H+C PN+ ++
Sbjct: 160 YSRHLDFPDNAALTELIAQVNCNGFTIEDEELSHLGSALFPDVALMNHSCSPNVIVTY-- 217
Query: 125 SGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
G V +RA++ I P + + Y D L+ T +R L+ S +F C C+ C
Sbjct: 218 KGTVAEVRAVQEINPEEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKEC 267
>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
terrestris]
Length = 439
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 70/159 (44%), Gaps = 7/159 (4%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
+ K L +H E K K + L F + + E+L I G I +N+ +
Sbjct: 111 YRKFKDLMSHCSEIKKDEKKMEHFVCLCNVLHKFLEDMPIPSTAELLGIYGRITINSFSI 170
Query: 94 -PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--HLSICYTDPL 150
L + I+ S ++H+C PN +F + + ++A++ + D + I Y D +
Sbjct: 171 FNLDMNIGVGIYLGPSILDHSCKPNAVATF-EGTTINVKAIEDLPSLDLSQIRIPYIDVI 229
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+RR LQ+S YF C CE+C P + + CP
Sbjct: 230 KTAGDRRAELQSSYYFWCDCEKCEKPEPMA---EAAACP 265
>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 459
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/138 (26%), Positives = 58/138 (42%), Gaps = 3/138 (2%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS 99
L+ +VE +K + D + + L +F+ +I IQ NA +
Sbjct: 56 LDGNVEGFKKHDQVWKDFEIQATGAVVYAGLENDFSIPRAREILCKIQTNAFNRLDPDAG 115
Query: 100 YIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
I+ + H+C P+ + SF D +LRA + I GD ++ICY D R
Sbjct: 116 MTGIYLDPVLAMANHSCMPSAFVSF-DQRNAVLRAWRDIKEGDEITICYVDVTLPNKAAR 174
Query: 158 HHLQTSKYFICQCERCRD 175
+F C+C RC+D
Sbjct: 175 QEALKLYHFECRCPRCKD 192
>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
Length = 170
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCFRCQ 78
>gi|403218380|emb|CCK72870.1| hypothetical protein KNAG_0M00170 [Kazachstania naganishii CBS
8797]
Length = 491
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 50/112 (44%), Gaps = 8/112 (7%)
Query: 63 FLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF 122
F+ F L EE E L+ G +NA + E Y SFI HNC PN+
Sbjct: 308 FIDTFPSLKEEVDYERFLEYIGRFNLNA----IAEQLYYI----PSFINHNCEPNVRFEK 359
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+ A K I G+ + + Y +PL RR L+ + F+C C+RCR
Sbjct: 360 DSRLHINFYARKDIKKGEQIFMTYCNPLHEVNLRRRELRVNYGFLCFCDRCR 411
>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
Length = 453
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRA 132
T+ E+L+I G + +N + E S I ++ AS I+H+C PN+ SF G+ L
Sbjct: 151 TKAELLRIYGKMCINTFNILDAEMSTIGTGMYIGASIIDHSCRPNVVVSF--DGETLRMR 208
Query: 133 MKPIAPGDHLS-----ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
+ P L I Y D + R+ L YF C CERCRD E +
Sbjct: 209 LLEDYPEQELDFGKLFISYIDLIDTAEVRQEQLAERYYFHCACERCRDEQEQKRM-NAAA 267
Query: 188 CP 189
CP
Sbjct: 268 CP 269
>gi|48476977|gb|AAT44536.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9C [Homo sapiens]
Length = 273
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 2/76 (2%)
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C+PN F ++G++ LRA+ I+ G+ L++ Y D L + R+ L+ YF C CE
Sbjct: 2 HDCWPNCTVIF-NNGKIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCE 60
Query: 172 RCRDPTELNTFYDGVK 187
C+ + + F GVK
Sbjct: 61 HCQKKLKDDLFL-GVK 75
>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
Length = 448
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 69/161 (42%), Gaps = 9/161 (5%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ---EEILKICGIIQVNAHE 92
K L +H E KN PK ++ L + T E++ I G + N
Sbjct: 119 KFRDLMSHYAEIKNDPKRREHLESLHAVLTEMMTDSSGSTVPNPTELMSIYGRLITNGFN 178
Query: 93 MPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTD 148
+ E + IA I+ S +H+C PN +F + ++ + A++ P + I Y D
Sbjct: 179 ILDAEMNSIATAIYLGVSITDHSCQPNAVATF-EGNELHVHAIEDLPCLDWSKIYISYID 237
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L RR L+ YF+C C +C DP E++ V CP
Sbjct: 238 LLNTPEQRRADLKEHYYFLCVCSKCIDPQEMHEMTAAV-CP 277
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 5/119 (4%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYI---AIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
F +E+L + +N+ M I A++ RAS H+C PN F S ++ +
Sbjct: 174 FRSQEVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFVFEGS-RLSV 232
Query: 131 RAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
RA+K I G+ I Y L + R+ L++ F CQC RC D N VKCP
Sbjct: 233 RAIKRIEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRCLDSARDNLML-CVKCP 290
>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 413
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+A+ +S+ H+C PN +S D + +++ PI GD ++I Y T RR +L
Sbjct: 266 MAVSPSSSYFNHSCIPNC-ESVRDGSDMTFKSLFPIKKGDQINISYLALDKSTKRRRDYL 324
Query: 161 QTSKYFICQCERCR----DPT 177
+ YF CQC RC DPT
Sbjct: 325 KFGYYFHCQCPRCNSTDIDPT 345
>gi|444320821|ref|XP_004181067.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
gi|387514110|emb|CCH61548.1| hypothetical protein TBLA_0E04990 [Tetrapisispora blattae CBS 6284]
Length = 644
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 16/105 (15%)
Query: 74 FTQEEILKICG---IIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
T E+ L + G I Q+N P+ AS I HNC PN++ +DS + L
Sbjct: 406 LTYEKYLNLIGRFNINQINGQLYPI-----------ASLINHNCQPNIHIEVSDSNSLSL 454
Query: 131 RAM--KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ + I + L Y +PL G RR L + F+C CERC
Sbjct: 455 TLITRRNINIDEELLTTYINPLHGVKLRRRELLVNWGFLCHCERC 499
>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
Length = 373
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 114 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 165
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 166 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 212
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 71/141 (50%), Gaps = 7/141 (4%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL-KICGIIQVNAHEMPL 95
L + EAH+++ +S K E ++ ++ + + E+ L ++ + N +
Sbjct: 131 LKEFEAHLDKM-DSEKEEMNQTDIAALHHFYSRHISNLPDEQALTELFAQVNCNGFTIED 189
Query: 96 TEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWG 152
E S++ A+F + + H+C PN+ ++ G V +RA+K I PG+ + Y D L+
Sbjct: 190 EELSHLGSAVFPDVALMNHSCSPNVIVTY--KGTVAEVRAVKEINPGEEIFNSYIDLLYP 247
Query: 153 TINRRHHLQTSKYFICQCERC 173
T +R+ L S +F CQC C
Sbjct: 248 TEDRKERLLDSYFFTCQCTEC 268
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 74/164 (45%), Gaps = 10/164 (6%)
Query: 19 IALRCLYKK--SHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ 76
+ LR K S + +++D L E H+ E S + + DR N+ + + E +
Sbjct: 124 VVLRTARNKYDSEESKVFDGL---ETHINEISES-QGQLDRINLTARAVKNYS-GTEMDE 178
Query: 77 EEILKICGIIQVNAHEMPLTEPSYIAIFDR--ASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+ + +N+ + + I ++ A I H+C N F D ++ ++AM+
Sbjct: 179 GTVASYAAKLDLNSFNLTTSMYDRIGLYMHPYAGLINHSCDYNSTVGF-DGEELYVKAMR 237
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
PI G+ + I Y D RR+ L+ +F CQC +C+ TE
Sbjct: 238 PIKKGEQIFISYIDTTTPYDIRRNELKERYFFDCQCTKCKMGTE 281
>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
Length = 369
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 110 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 161
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 162 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 208
>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
Length = 352
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 93 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 144
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 145 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 191
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 171 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 222
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 223 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 269
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
Length = 612
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 70/166 (42%), Gaps = 20/166 (12%)
Query: 20 ALRCLYKKSHQPQIWDK---LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ 76
A+R + + P K M L +H +E + +D ++N + EF +
Sbjct: 178 AVRGIIQLVSNPAALAKDSPFMDLASHEDELRRVQSKWDDISLQAHAVVNLLQRKPEFVK 237
Query: 77 EEILKICGI------IQVNAHEMPLT---EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
+ +C + ++ N P+ +P + RA+ H+C PN +F D +
Sbjct: 238 YALESLCRLSTNGFRVESNVGNGPIGLCLDP----LLARAN---HSCRPNAAITF-DGKR 289
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
LRA+ PIA G+ + I Y D RR L + +F C C RC
Sbjct: 290 ATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCSRC 335
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 254 RDQYCFECDCFRCQ 267
>gi|302824578|ref|XP_002993931.1| hypothetical protein SELMODRAFT_137937 [Selaginella moellendorffii]
gi|300138203|gb|EFJ04978.1| hypothetical protein SELMODRAFT_137937 [Selaginella moellendorffii]
Length = 290
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ S H+C N+ + ++ V L+A++PI PG L I Y D G RR LQ
Sbjct: 215 AVYILPSMFNHSCDANVNIYWRENAFVQLKALQPIEPGKELCITYIDASMGCEARRALLQ 274
Query: 162 TSKYFICQCERCRD 175
+ F C+C RC D
Sbjct: 275 DAYGFHCKCPRCLD 288
>gi|66357248|ref|XP_625802.1| SET domain protein [Cryptosporidium parvum Iowa II]
gi|46226909|gb|EAK87875.1| SET domain protein [Cryptosporidium parvum Iowa II]
Length = 583
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T + I +++ S + HNC + + +LRA + GD ++I Y D L+
Sbjct: 307 TSNNGIVLYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEVGDEITISYISDDDLFKC 366
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
R L ++ F CQCERC +PT+L+ G+KC
Sbjct: 367 SKTRRELLSNWLFYCQCERCNNPTDLSR---GLKCA 399
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
Length = 287
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 28 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 79
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 80 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 126
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 169 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 221 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 267
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 170 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 221
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 222 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 268
>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
Length = 536
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/174 (24%), Positives = 87/174 (50%), Gaps = 19/174 (10%)
Query: 12 HPSYECIIALRCLYKKSHQPQIW--DKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ P+ +KL+ + E+H++ K +N++++++Q
Sbjct: 204 NPSETVRLTARILAKQKIYPERTPSEKLLAVKEFESHLD------KLDNEKKDLIQSDIA 257
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + EF + L + + N + E S++ AIF + + H+C PN+
Sbjct: 258 ALHHFYSKHLEFPDNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 317
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ + +RA++ I G+ + Y D L+ T +R L+ S +F C+C+ C
Sbjct: 318 VTYKGT-LAEVRAVQEIKTGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQEC 370
>gi|255082145|ref|XP_002508291.1| predicted protein [Micromonas sp. RCC299]
gi|226523567|gb|ACO69549.1| predicted protein [Micromonas sp. RCC299]
Length = 311
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 14/113 (12%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPS--------------YIAIFDRASFIEHNCYPNLYK 120
+ +E ++ G +Q+N EM +EP + +F AS+ H+C PN +
Sbjct: 194 SMDECVRTLGRLQLNGFEMTASEPEEGADEAEGGGHRPVGVGVFPSASYTNHSCAPNCAQ 253
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
F G +++ + + G+ L+I Y D G RR L+ + F C CERC
Sbjct: 254 RFDGHGCIVVETARDVRGGEELTIPYVDVRLGRRERRERLRKNFAFDCACERC 306
>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
Length = 311
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
S A++ AS H+C PN + S+ + + +R ++P+ G+ +I Y D + +RR
Sbjct: 231 SGTAVYFTASLFNHSCAPNAHVSWENGDAAITIRTLRPVRAGEEFNITYVDANERSASRR 290
Query: 158 HHLQTSKYFICQCERC 173
L+ F C CERC
Sbjct: 291 ARLKEWYGFDCACERC 306
>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
Length = 239
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 6 VGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQL 64
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 65 RDQYCFECDCFRCQ 78
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 16/147 (10%)
Query: 35 DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
D+ +QLE H+++ ++ +R ++ + + E EE++ G +++ +
Sbjct: 131 DQFLQLETHIKDIRDESASHWERISLSSKAIKAYSGTE--MSEEVISAMGA-KLDLNSFN 187
Query: 95 LTEPSYIAIFDR--------ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
LT A++DR A+ H+C N SF D + ++A++PI + + I Y
Sbjct: 188 LTN----AVYDRLGVYLHPYAAIFNHSCDHNAAVSF-DGPNLHIKAIRPIQKDEQIFITY 242
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERC 173
D R+H+LQ+ YF C C +C
Sbjct: 243 IDVTDPYPIRQHNLQSRYYFTCHCSKC 269
>gi|317038856|ref|XP_001402319.2| set and mynd domain containing protein [Aspergillus niger CBS
513.88]
gi|350631791|gb|EHA20161.1| hypothetical protein ASPNIDRAFT_195107 [Aspergillus niger ATCC
1015]
Length = 357
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 6/90 (6%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN---RRH 158
A++ RA+ H+C PN+ G+++ A K IA G+ I Y D L ++ RR
Sbjct: 265 AVYPRAAIANHSCSPNIMHKPDHHGRMVFTASKDIAAGEECCISYFD-LSKRVDLKSRRD 323
Query: 159 HLQTSKYFICQCERC--RDPTELNTFYDGV 186
HLQ F+C C+RC +P+E + + G+
Sbjct: 324 HLQGLFRFVCGCDRCTAEEPSEKESDWVGL 353
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 68/140 (48%), Gaps = 7/140 (5%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ-EEILKICGIIQVNAHEMPLTE 97
QLE+H E+ ++ K D + F+L + Q ++K+ G N+ + +
Sbjct: 138 QLESHHEKLSDARK---DHFESLWFVLQQCIEEDVLPQPSSLVKMFGATICNSFSICDND 194
Query: 98 PSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ IA I+ RAS + H+CYPN F D ++ LR ++ + GD +I Y D +
Sbjct: 195 LNGIAVGIYLRASMLNHSCYPNCVVVF-DERKLQLRTVRDVKEGDVCTISYVDVINPAKE 253
Query: 156 RRHHLQTSKYFICQCERCRD 175
R+ L+ +F C C +C D
Sbjct: 254 RQTELEERYHFSCNCVKCID 273
>gi|47217177|emb|CAG11013.1| unnamed protein product [Tetraodon nigroviridis]
Length = 473
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 41/73 (56%), Gaps = 3/73 (4%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
A+F + + H+C PN+ ++ G V +RA++ I PGD + Y D L+ T +R+ L
Sbjct: 234 AVFPDVALMNHSCSPNVIVTY--KGTVAEVRAVQEINPGDEIFNSYIDLLYPTEDRKERL 291
Query: 161 QTSKYFICQCERC 173
S +F CQC C
Sbjct: 292 LDSYFFTCQCAEC 304
>gi|67613152|ref|XP_667280.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658400|gb|EAL37050.1| hypothetical protein Chro.40234 [Cryptosporidium hominis]
Length = 582
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T + I +++ S + HNC + + +LRA + GD ++I Y D L+
Sbjct: 306 TSNNGIVLYNVISMMAHNCGASCCWHYGVDNTFVLRAKTRLEIGDEITISYISDDDLFKC 365
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
R L ++ F CQCERC +PT+L+ G+KC
Sbjct: 366 SKTRRELLSNWLFYCQCERCNNPTDLSR---GLKCA 398
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
F L E F + IC + EM + + ++ S + H+C PN F +
Sbjct: 205 FDLFEAFAK----VICNSFTICNAEM---QEVGVGLYPSISLLNHSCDPNCSIVF-NGPH 256
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+LLRA++ I G+ L+ICY D L + RR L+ F C C RC+
Sbjct: 257 LLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 303
>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
Length = 554
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/178 (25%), Positives = 72/178 (40%), Gaps = 19/178 (10%)
Query: 13 PSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRN------VVQFLLN 66
P E + R ++ + H P L V+ + N K RR V Q L N
Sbjct: 123 PLTEARLLARMIWAQKHDPS--QHRYDLHGDVDCFDNKEKALWARRIQEASVLVGQSLFN 180
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLY---KS 121
N + ++ K+ Q+NAH + P E +++ + I H+C PN +
Sbjct: 181 EVLGNAKAALLQMAKV----QMNAHTLCTPFGEACGVSLSSSFALINHSCAPNTFAMSSH 236
Query: 122 FTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT 177
+ D LR A +PI GD ++I Y D + RR ++ + F C C C T
Sbjct: 237 WPDEKPKYLRVAACRPIKAGDEITIAYVDVEEENLQRRQTIKATYGFDCDCRLCEAAT 294
>gi|327262509|ref|XP_003216066.1| PREDICTED: n-lysine methyltransferase SMYD2-like [Anolis
carolinensis]
Length = 437
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/175 (26%), Positives = 84/175 (48%), Gaps = 21/175 (12%)
Query: 12 HPSYECIIALRCLYKKSHQPQI--WDKLM---QLEAHVEEYKNSPKYENDRRNVVQ---- 62
+PS + R L K+ Q + +KL+ + E+H++ K +N++ ++Q
Sbjct: 105 NPSETVRLTARILAKQKTQTERCESEKLLSVKEFESHLD------KLDNEKMELIQSDIS 158
Query: 63 FLLNFFKLNEEFTQEEILKIC-GIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLY 119
L +F+ + E+ L + + N + E S++ AIF + + H+C PN+
Sbjct: 159 ALHHFYSKHIEYPDNAALVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVI 218
Query: 120 KSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++ G V +RA++ I PGD + Y D L+ T +R L+ S +F C C C
Sbjct: 219 VTY--KGTVAEVRAVQEIEPGDEIFTSYIDLLYPTEDRNDRLKDSYFFTCDCREC 271
>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
Length = 518
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
A++I H+C PN Y F GQV+ L+A++ IAP + + I YTD R+ LQ +
Sbjct: 216 AAYINHSCEPNAYIGF--DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYF 273
Query: 166 FICQCERCRDPT 177
F C+C +C T
Sbjct: 274 FECKCPKCLKGT 285
>gi|401395860|ref|XP_003879698.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325114105|emb|CBZ49663.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 1764
Score = 50.8 bits (120), Expect = 3e-04, Method: Composition-based stats.
Identities = 33/109 (30%), Positives = 58/109 (53%), Gaps = 9/109 (8%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHL 142
G ++VNAH L+ A++ RA+ ++H+C PN+ Y++ G +++ A++ I +
Sbjct: 576 GKMKVNAH--ALSRGRGWALYPRAARVKHSCRPNVTYRNL--DGLLVVFALEDIEEDAPI 631
Query: 143 SICYTDPLW-GTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
++ Y D L+ T RR + +K CQC RC D + + CPE
Sbjct: 632 TMSYIDQLYVPTEERRKRVMATKRIFCQCTRCMDVCQKER---KILCPE 677
>gi|170064095|ref|XP_001867383.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167881524|gb|EDS44907.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 445
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 72/164 (43%), Gaps = 12/164 (7%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAH 91
+ WD L H E+ K K +++ L + +E+L+I G + +N+
Sbjct: 111 RFWD----LMPHEEDIKKDEKRMEHFQSLTVVLRSLIDEAAMPGNQELLRIFGKMCINSF 166
Query: 92 EMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD----HLSIC 145
+ E + I ++ AS ++H+C PN F D + +R ++ + + I
Sbjct: 167 NILDDEMNSIGTGMYLGASIMDHSCRPNAVAIF-DGCNLNVRLLEDYHGAEIDFSKIFIS 225
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
Y D L T RR L+ YF C CERCRD EL +G C
Sbjct: 226 YIDLLNPTDVRRDMLRKRYYFECGCERCRDEQELK-LMNGAACA 268
>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
Length = 518
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
A++I H+C PN Y F GQV+ L+A++ IAP + + I YTD R+ LQ +
Sbjct: 216 AAYINHSCEPNAYIGF--DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYF 273
Query: 166 FICQCERCRDPT 177
F C+C +C T
Sbjct: 274 FECKCPKCLKGT 285
>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 404
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 5/122 (4%)
Query: 60 VVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP--SYIAIFDRASFIEHNCYPN 117
VV ++ + + T+++++++ I Q NA + E + +A++ S H+C PN
Sbjct: 218 VVAASISLPRGARKVTKDDVVRLLCIQQCNAFGLSNGEGEMTGVALYPALSLFNHSCMPN 277
Query: 118 LYKSFTDSGQVLLRAMK---PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+G + A+K + PG+ L+I Y D R+ L+ S F C C RCR
Sbjct: 278 CAAVDDGTGSKRVCAIKTLVAVPPGEELTISYIDLDLTRELRQDKLEESYAFRCTCARCR 337
Query: 175 DP 176
P
Sbjct: 338 AP 339
>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
Length = 479
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/72 (38%), Positives = 41/72 (56%), Gaps = 3/72 (4%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
A++I H+C PN Y F GQV+ L+A++ IAP + + I YTD R+ LQ +
Sbjct: 177 AAYINHSCEPNAYIGF--DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQTELQLRYF 234
Query: 166 FICQCERCRDPT 177
F C+C +C T
Sbjct: 235 FECKCPKCLKGT 246
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 69/162 (42%), Gaps = 16/162 (9%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
+R + + + ++ L LE + ++ S K+ D + F L TQE +
Sbjct: 105 VRAVVQALVKAEVGAALEDLEGNDLSWRGSDKWA-DMEMMAMGASAFAGLGT--TQEAVK 161
Query: 81 KICGI---IQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
K + IQ NA + + IF R + H+C PN F + +LRA +P
Sbjct: 162 KALSLLCKIQTNAFHRYDADIGQVGIFLEPRLAMANHSCIPNATVQFVGR-RAILRAERP 220
Query: 136 IAPGDHLSICYTD---PLWGTINRRHHLQTSKYFICQCERCR 174
I + + I YTD PL ++R +F C+C RCR
Sbjct: 221 IKADEEIEISYTDYNYPL----SKRKEALAPYFFTCECTRCR 258
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ + G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSISLLNHSCDPNCSVVF-NGPHLLLRAVRDVEVGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 254 RDQYCFECDCFRCQ 267
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/174 (24%), Positives = 79/174 (45%), Gaps = 21/174 (12%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP-- 94
L +LE H+ + + E + ++ FL + + +++ T ++I I G+I N +
Sbjct: 124 LDELEDHITDMQEDELKEL-KVDIHNFLDFWPRTSKQHTVDDISHIFGVINCNGFSVSDQ 182
Query: 95 -LTEPSYIAIFDRASFIEHNCYPN------------LYKSFTDSGQVLLRAMKPIAPGDH 141
+ + +F + H+C+PN + F ++ LR++ IA G+
Sbjct: 183 RGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRSLGKIAEGEE 242
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERC----RDPTELNTF-YDGVKCPE 190
L++ Y D + + R+ L+T +F C CE C +D +L DGVK E
Sbjct: 243 LTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKNKIKDDIKLGGREVDGVKPSE 296
>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 673
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 6/82 (7%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT- 153
EP I AI+ S I H+CYPN+ + SG V+LR ++ I G + CY P W +
Sbjct: 474 EPRNIGGAIYPSISLINHSCYPNVVRHSYPSGTVVLRTLRFIGKGTEILDCY-GPHWLSE 532
Query: 154 --INRRHHLQTSKYFICQCERC 173
++R +L F+C CE C
Sbjct: 533 KRLSRLEYLWKKYCFLCTCEAC 554
>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
Length = 495
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 73/164 (44%), Gaps = 18/164 (10%)
Query: 21 LRCL-YKKSHQPQIWDKL---MQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ 76
LR L + S QP + L L +H +E + K + +R ++ K +E++
Sbjct: 121 LRILKLRSSKQPDVQQDLGMFRSLRSHFKEIAETNKEQYER------IMLCAKAEKEYSH 174
Query: 77 -----EEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
E I + I+VN P +P + I A F+ H+C PN F D G++
Sbjct: 175 SDLDIETIAEYLAKIEVNGFTFTTPFGDPLGLCIQPFACFVNHSCDPNAVVGF-DEGRIT 233
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
++A++ I P + + I Y D R+ L +F C+C +C
Sbjct: 234 VKALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSKC 277
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L
Sbjct: 196 VGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDXLXTSEERRKQL 254
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 255 RDQYCFECDCFRCQ 268
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 5/147 (3%)
Query: 30 QPQIWDKLMQ-LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQV 88
QP+ K M LE HV E + P + + + + E + + ++I +Q
Sbjct: 122 QPEDLSKNMDGLEGHVAERRREPGWADMEMMAMGGCAFAGRETSEESVRQAVEILCKLQT 181
Query: 89 NAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
NA + ++ IF + H+C PN + F VL RA I GD + I Y
Sbjct: 182 NAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRAAVL-RAESRIQKGDEIEISY 240
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERC 173
TD ++ +R F C+C RC
Sbjct: 241 TD-YTSSLGKRKAALAPYNFECRCRRC 266
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 56/122 (45%), Gaps = 10/122 (8%)
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSY---------IAIFDRASFIEHNCYPNLYKSFT 123
E +E + K+ +I+ H++ + +AI +SF H+C PN +
Sbjct: 220 ELKKEAVEKVISVIRELIHKVRCNQFGIWTKNDKCIAVAISPSSSFFNHSCIPNCI-NIR 278
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFY 183
D ++ +A+ P+ G+ L+I Y D +R+ +L+ YF C C RC + T +
Sbjct: 279 DGNKMTFKALYPVKKGEPLAISYLDLDLPVESRKEYLKYGYYFDCGCPRCDEKTNQDECM 338
Query: 184 DG 185
D
Sbjct: 339 DN 340
>gi|115481972|ref|NP_001064579.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|78708616|gb|ABB47591.1| SET domain protein 123, putative, expressed [Oryza sativa Japonica
Group]
gi|113639188|dbj|BAF26493.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|215767244|dbj|BAG99472.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767272|dbj|BAG99500.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612812|gb|EEE50944.1| hypothetical protein OsJ_31491 [Oryza sativa Japonica Group]
Length = 298
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 18/118 (15%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA------------------SFIEHNCY 115
T+E + + I++NA + L SY + A SF H+C
Sbjct: 177 LTKEWYINVLARIRINAFRIELVASSYENLLSSAVASVSCDAAVGNAVYMLPSFYNHDCD 236
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PN + + S L+A++ I G+ L ICY D R+ L F C+C+RC
Sbjct: 237 PNTHIVWLASADARLKALRNIEEGEELRICYIDASMDVDARQRILAEGFGFECRCQRC 294
>gi|171688102|ref|XP_001908991.1| hypothetical protein [Podospora anserina S mat+]
gi|170944013|emb|CAP70123.1| unnamed protein product [Podospora anserina S mat+]
Length = 289
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 37/63 (58%), Gaps = 1/63 (1%)
Query: 109 FIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
I H+C PN + +F + G++ +R+++ IA G+ ++ICY DP +RR L +F C
Sbjct: 1 MINHSCEPNAF-AFLEKGEIRVRSLRKIAAGEEITICYIDPTIDVKSRREILMDEHFFEC 59
Query: 169 QCE 171
E
Sbjct: 60 DYE 62
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 75/170 (44%), Gaps = 8/170 (4%)
Query: 26 KKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGI 85
K + + + + L H ++ KN ++++ L + + ++ E+L+I G
Sbjct: 101 KGYYTAKFYRRFHDLMTHEDDIKNDAPRMEHFQSLLVVLRSLVEEAAMPSKAELLQIFGK 160
Query: 86 IQVNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--- 140
+ +N+ + E + I ++ S I+H+C PN +F D + LR ++ +
Sbjct: 161 MCINSFNILDDEMNSIGTGMYLGISIIDHSCRPNALATF-DGTTIHLRLLEDYHGSEVDF 219
Query: 141 -HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ I Y D + R+ L+ YF C CERCRD E G CP
Sbjct: 220 SKIFISYIDLMNPAEERKARLRAQYYFECNCERCRDEQEQQLMIAGA-CP 268
>gi|392569190|gb|EIW62364.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 338
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 16/124 (12%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPLT-----------EPSYIAIFDRASFIEHNCY 115
FF L + E + I+ N E+ ++ E SY +F S + H+C
Sbjct: 100 FFALASAYPPSEFDVVPSILNTNCLEIRMSDAEPDIHHDPVEDSYAGLFPTLSRVNHSCA 159
Query: 116 P--NLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCER 172
P N Y +F SGQ L A + I G+ ++I YT I RR L +++F+C C
Sbjct: 160 PSANYYFAFPAFSGQ--LWAARDIGAGEEITITYTPLAAPRIERRAFLARTRFFVCACPA 217
Query: 173 CRDP 176
C P
Sbjct: 218 CIVP 221
>gi|358395915|gb|EHK45302.1| hypothetical protein TRIATDRAFT_79702 [Trichoderma atroviride IMI
206040]
Length = 543
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 69/166 (41%), Gaps = 16/166 (9%)
Query: 32 QIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGI---IQV 88
+I D L LE HV E K + E ++ F + +E+I K + IQ
Sbjct: 123 EIGDGLEGLEGHVLEKKAAEGDEWKDIEIMAMAACAFS-GKGTAEEDIRKAAEMLCKIQN 181
Query: 89 NAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
N+ + ++ I +F + H+C PN F +L+ A PI GD + + Y
Sbjct: 182 NSFQRFDSDLGVIGLFLEPTLAMANHSCIPNAAVQFIGRNALLI-AENPIRAGDEMELAY 240
Query: 147 T---DPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
T DPL +R F+CQC RCRD LN + P
Sbjct: 241 TFYTDPL----PKRKEALAHYKFVCQCLRCRD--NLNVYQVAAVSP 280
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHH 159
+ ++ S + H+C P+ F G+ L LRA++ + P + L+I Y L T +RR
Sbjct: 193 VGLYPSMSLLNHDCRPSCVMLF--EGETLHLRAVRDMQPAEELTISYIGTLAPTRDRRTQ 250
Query: 160 LQTSKYFICQCERC 173
L+ +F CQC+RC
Sbjct: 251 LEEQYHFTCQCQRC 264
>gi|146413731|ref|XP_001482836.1| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
6260]
Length = 637
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 63/161 (39%), Gaps = 9/161 (5%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA 90
P L +L H EE + + + L K +Q+ +LKI + +N+
Sbjct: 143 PSYRKNLERLTTHREEVEKNATLVRLSEKIAPLLPEKIKA----SQQTVLKILCLAAINS 198
Query: 91 HEMPLTEPSYIAI-FDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
M + + FD S I H+C PNLY + A PI G + Y
Sbjct: 199 STMMNENFEQVGMAFDPTFSLINHSCVPNLYSIPISLTVISFVATSPIKAGTEVFTSYCF 258
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T RR L+T YF C+C CR+ F+ CP
Sbjct: 259 NGYPTEVRRRTLETRFYFTCKCSICRNK---KNFFFSYNCP 296
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 74/141 (52%), Gaps = 7/141 (4%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK-LNEEFTQEEILKICGIIQVNAHEMPL 95
L + E+H+++ +S K E ++ ++ + K +++ +E+ ++ + N +
Sbjct: 120 LKEFESHLDKM-DSEKDEMNQADIAALHYFYSKHISDLPDDQELTELFAQVNCNGFTIED 178
Query: 96 TEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWG 152
E S++ A+F + + H+C PN+ ++ G V +RA++ I PG+ + Y D L+
Sbjct: 179 EELSHLGSAVFPDVALMNHSCSPNVIVTY--KGTVAEVRAVQEINPGEEIFNSYIDLLYP 236
Query: 153 TINRRHHLQTSKYFICQCERC 173
T +R+ L S +F CQC C
Sbjct: 237 TEDRKERLLDSYFFGCQCTEC 257
>gi|224001052|ref|XP_002290198.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973620|gb|EED91950.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 601
Score = 50.4 bits (119), Expect = 4e-04, Method: Composition-based stats.
Identities = 35/121 (28%), Positives = 56/121 (46%), Gaps = 13/121 (10%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQV-LL 130
+E ++E+ K+ I NA E I+ R S H+C PN D+ V ++
Sbjct: 219 DESGEDELTKLALIYSCNAFEGG-------RIYHRLSRANHSCNPNAVVVEGDTADVSVV 271
Query: 131 RAMKPIAPGDHLSICYTDPLW--GTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
+A I GD ++I Y G R+ L+ K+F+C+CERC +L + + C
Sbjct: 272 KAACSIKEGDEITISYLGKFLFGGYAVRQRKLRVEKHFVCRCERCSSSGDLAS---RIPC 328
Query: 189 P 189
P
Sbjct: 329 P 329
>gi|190348269|gb|EDK40693.2| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
6260]
Length = 637
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 63/161 (39%), Gaps = 9/161 (5%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNA 90
P L +L H EE + + + L K +Q+ +LKI + +N+
Sbjct: 143 PSYRKNLERLTTHREEVEKNATLVRLSEKIAPLLPEKIKA----SQQTVLKILCLAAINS 198
Query: 91 HEMPLTEPSYIAI-FDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
M + + FD S I H+C PNLY + A PI G + Y
Sbjct: 199 STMMNENFEQVGMAFDPTFSLINHSCVPNLYSIPISLTVISFVATSPIKAGTEVFTSYCF 258
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T RR L+T YF C+C CR+ F+ CP
Sbjct: 259 NGYPTEVRRRTLETRFYFTCKCSICRNK---KNFFFSYNCP 296
>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 471
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 15/113 (13%)
Query: 77 EEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLY-------KSFTDSG 126
+EI I GII+ N + + + +F + H+C+PN S T+S
Sbjct: 168 DEIAHIFGIIKCNGFTLSDQRGLKAVGVGLFPNLCLVNHDCWPNCSVVLNHGNHSATNSA 227
Query: 127 -----QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
++ LRA++ I G+ L++ Y D L + R+ L+ YF C CE CR
Sbjct: 228 LHSKRRIELRALRKICEGEELTVSYVDFLDTSAERQRKLKEHFYFECTCEHCR 280
>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
Length = 455
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 67/142 (47%), Gaps = 17/142 (11%)
Query: 53 YENDRRNVVQFLLNFFKLNEEFTQ----------EEILKICGIIQVNAHEMPLTEPSYIA 102
+E D R V+ + +F LN + +E+L+I G + +N+ + E + I
Sbjct: 119 HEEDIRKDVKRIEHFHTLNVVLQRLLDEPAIPPRDELLRIFGKMCINSFNVCDDEMNSIG 178
Query: 103 --IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD----HLSICYTDPLWGTINR 156
++ AS ++H+C PN +F Q+ LR ++ A + + I Y D + + R
Sbjct: 179 TGMYLGASILDHSCRPNAVATFVGE-QLQLRLLEDFAGPELDFSRIFISYIDLIDPSDTR 237
Query: 157 RHHLQTSKYFICQCERCRDPTE 178
R L YF C+C RCRD E
Sbjct: 238 REQLSERYYFRCECVRCRDEAE 259
>gi|238493359|ref|XP_002377916.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
gi|220696410|gb|EED52752.1| SET and MYND domain protein, putative [Aspergillus flavus NRRL3357]
Length = 415
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 73/164 (44%), Gaps = 8/164 (4%)
Query: 21 LRCLYKKSHQPQIWDKL---MQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
LR + + +H+ +L QLE H+ E ++ + +R + + + + +E
Sbjct: 37 LRMIVRTAHKKYTNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYS-GTDMKEE 95
Query: 78 EILKICGIIQVNAHEMPLTEPSYIAIFDR--ASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
I +++N+ I ++ A+ I H+C N +F DS + ++A +P
Sbjct: 96 TISAFGAKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTF-DSDNLYIKATRP 154
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC-RDPTE 178
I GD + I Y D RR L+ YF C C +C +D TE
Sbjct: 155 IQKGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAKDLTE 198
>gi|389583694|dbj|GAB66428.1| hypothetical protein PCYB_092140 [Plasmodium cynomolgi strain B]
Length = 763
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T+ + +++R S + H+C + + +LRA +A GD L+I Y D L+ +
Sbjct: 471 TDNEGLVLYNRISMLAHSCNSTACWHYGSNDSFVLRARVKLAVGDELTISYLGDDDLYKS 530
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
N R T+ F+C C RC +P + + G KC
Sbjct: 531 SNIRREKLTNWLFVCMCSRCTNPIDKSR---GFKCS 563
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 74/141 (52%), Gaps = 7/141 (4%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK-LNEEFTQEEILKICGIIQVNAHEMPL 95
L + E+H+++ +S K E ++ ++ + K +++ +E+ ++ + N +
Sbjct: 131 LKEFESHLDKM-DSEKDEMNQADIAALHYFYSKHISDLPDDQELTELFAQVNCNGFTIED 189
Query: 96 TEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWG 152
E S++ A+F + + H+C PN+ ++ G V +RA++ I PG+ + Y D L+
Sbjct: 190 EELSHLGSAVFPDVALMNHSCSPNVIVTY--KGTVAEVRAVQEINPGEEIFNSYIDLLYP 247
Query: 153 TINRRHHLQTSKYFICQCERC 173
T +R+ L S +F CQC C
Sbjct: 248 TEDRKERLLDSYFFGCQCTEC 268
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 61/139 (43%), Gaps = 12/139 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + +N + E K+ NAH + E P ++ S I H+C P
Sbjct: 163 NLVNLILQWPDINVKEIAENFSKLA----CNAHTICDGELRPLGTGLYPVISIINHSCLP 218
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N F + V+ RA++ I G + I Y + TI R+ L+ F C C RCR
Sbjct: 219 NSVLVFEERLAVV-RAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRM 277
Query: 175 ---DPTELNTFYDGVKCPE 190
D + + +G +C +
Sbjct: 278 GQYDDIQESAILEGYRCKD 296
>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 522
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 52/107 (48%), Gaps = 3/107 (2%)
Query: 69 KLNEEFTQEEILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG 126
+L F +++LK+ I+ N + E A+F ASF H+C P+ + + D
Sbjct: 314 ELMAGFPLKDVLKLVSQIESNGFGLWDGKGECYARALFPSASFFNHSCDPSCDR-YQDKF 372
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ + +PIA G+ LSI Y D R+H L S +F C C RC
Sbjct: 373 LLSIATRRPIAAGEELSISYIDVNAPCRTRQHELLDSYHFQCSCTRC 419
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/162 (23%), Positives = 72/162 (44%), Gaps = 5/162 (3%)
Query: 18 IIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
I+ LR + H + + L +H++E +S + + +R + + E E
Sbjct: 126 IVKLRS-SNEDHVKSDLNMFLTLRSHLDEITSSNQEQYERIMLCAKAEKEYS-GSELDVE 183
Query: 78 EILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
I + I+VN+ +P + I A ++ H+C PN F D G ++++A++
Sbjct: 184 TIAEYLARIEVNSFTFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGF-DGGLIIVKALRE 242
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT 177
I P + + I Y D + R+ L +F C+C +C T
Sbjct: 243 IKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKCAQGT 284
>gi|317157122|ref|XP_001826232.2| SET and MYND domain protein [Aspergillus oryzae RIB40]
Length = 435
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/164 (25%), Positives = 73/164 (44%), Gaps = 8/164 (4%)
Query: 21 LRCLYKKSHQPQIWDKL---MQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE 77
LR + + +H+ +L QLE H+ E ++ + +R + + + + +E
Sbjct: 57 LRMIVRTAHKKYTNGELELFSQLETHISEIRDQSPEQWERIALSSKAVKAYS-GTDMKEE 115
Query: 78 EILKICGIIQVNAHEMPLTEPSYIAIFDR--ASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
I +++N+ I ++ A+ I H+C N +F DS + ++A +P
Sbjct: 116 TISAFGAKLELNSFNFTNILSDRIGLYLHPYAALINHSCNYNAAVTF-DSDNLYIKATRP 174
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC-RDPTE 178
I GD + I Y D RR L+ YF C C +C +D TE
Sbjct: 175 IQKGDQIFISYIDATNPVKLRRSELRERYYFDCHCAKCAKDLTE 218
>gi|403158231|ref|XP_003307547.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163733|gb|EFP74541.2| hypothetical protein PGTG_00497 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 552
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 39/79 (49%), Gaps = 4/79 (5%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTIN--- 155
+ AI+ AS H C PNL+ Q+L LRA +PI+ G L+I Y +
Sbjct: 379 FGAIYLNASLFNHACRPNLFGFLNYETQLLYLRAFEPISMGQELTISYLSQEMDATDLES 438
Query: 156 RRHHLQTSKYFICQCERCR 174
RR LQ S F+C C C+
Sbjct: 439 RRQKLQGSYGFLCTCSHCQ 457
>gi|343475229|emb|CCD13316.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 382
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 5/93 (5%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSF-TDSGQVLLRAMKPIAPGDHLSICYTDP---LWGTIN 155
Y F + H+C N F S V L A++P+ G+ +++ Y L G N
Sbjct: 284 YHGFFRACALANHSCVANAAMKFDAASNTVTLIAVRPVEAGEFVNVKYLSDAQLLMGVGN 343
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
RR +L+ S F C CERC + N F++ V+C
Sbjct: 344 RREYLR-SWLFWCSCERCTSDNDPNAFHEHVQC 375
>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
S + H+C PN F + +LLR +K I G+ L+I Y D T RR LQ F+
Sbjct: 2 SLLNHSCDPNCVIVFEGTC-LLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60
Query: 168 CQCERC--RDPTE 178
C C+RC RD E
Sbjct: 61 CDCQRCLLRDKDE 73
>gi|294921829|ref|XP_002778734.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239887454|gb|EER10529.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 586
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT--DPLWGTINRRH 158
+ ++D+ S + H+C + +LRA PI PGD L+I Y + L+ + N R
Sbjct: 239 LIMYDKISMLSHSCEATCCWHYGPEDSFVLRARVPIQPGDELTISYIGDEELFKSTNVRR 298
Query: 159 HLQTSKYFICQCERCRDPTELNTFYDGVKC 188
F C C RC +P + + G++C
Sbjct: 299 QRLQGWLFTCHCHRCDEPVD---YARGLRC 325
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 68/160 (42%), Gaps = 8/160 (5%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
+ K L +H + K K V L F E++ I G I +N+ +
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFLGDMSIPNSAELMGIYGRIYINSFNI 170
Query: 94 PLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDP 149
+ + I I+ S ++H+C PN +F + +++R + P + I Y D
Sbjct: 171 SDLDMNNIGAGIYLGPSILDHSCKPNAVATF-EGTTIIIRTTEDLPCLDLSQIRISYIDV 229
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T +RR LQ+S YF C C++C + + + CP
Sbjct: 230 IKTTKDRREELQSSYYFWCNCKKCEESEPM---VEAAACP 266
>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
Length = 453
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%)
Query: 70 LNEEFTQE--EILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDS 125
LN + ++ EI K I N H + E P I +F S I H+C N F D
Sbjct: 137 LNARYAEDVKEITKDICRISCNGHTICDDELRPVGIGLFPVVSVINHSCSSNSLLLF-DG 195
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
++RA+ I+ G +++ Y + T +RR L YF C+C RC D
Sbjct: 196 KHAVVRALGTISRGCEVTVSYIELGASTNSRREALSDQYYFHCKCPRCTD 245
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 61/139 (43%), Gaps = 12/139 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + +N + E K+ NAH + E P ++ S I H+C P
Sbjct: 163 NLVNLILQWPDINVKEIAENFSKLA----CNAHTICDGELRPLGTGLYPVISIINHSCLP 218
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N F + V+ RA++ I G + I Y + TI R+ L+ F C C RCR
Sbjct: 219 NSVLVFEERLAVV-RAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRCRRM 277
Query: 175 ---DPTELNTFYDGVKCPE 190
D + + +G +C +
Sbjct: 278 GQYDDIQESAILEGYRCKD 296
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 50.1 bits (118), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 55/106 (51%), Gaps = 4/106 (3%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
+++ + E I I GII+ N + + + +F + + H+C+PN + + G+
Sbjct: 162 SKQHSVEYISHIFGIIKCNGFTLTDQRGLQAVGVGLFPNLALVNHDCWPNC-TAILNHGK 220
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +R + I+ G+ L+I Y D L + +R+ L+ +F C C+ C
Sbjct: 221 IEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHC 266
>gi|406867180|gb|EKD20219.1| xanthine dehydrogenase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 654
Score = 50.1 bits (118), Expect = 4e-04, Method: Composition-based stats.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 1/85 (1%)
Query: 90 AHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTD 148
+H P E I++ A H+C PN + F+D G+ ++LRA I G+ ++I Y
Sbjct: 430 SHAAPGDEYLGPFIYEIAGRFNHSCTPNAARGFSDVGELIVLRAFVDIQEGEEITIEYFS 489
Query: 149 PLWGTINRRHHLQTSKYFICQCERC 173
T RR HL F C C C
Sbjct: 490 DGGTTTERRQHLYKQYRFNCDCRAC 514
>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 322
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%)
Query: 98 PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
P + ++ A+ H+C PN + F G V + + + G+ L+I Y D + G RR
Sbjct: 242 PIGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREERR 301
Query: 158 HHLQTSKYFICQCERC 173
L+ + F C C RC
Sbjct: 302 GKLRKNFAFECACARC 317
>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
Length = 431
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 54/113 (47%), Gaps = 11/113 (9%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFD---RASFIEHNCYPNL--------YK 120
E + +++ ++ +I +N+ + + + R +F H+C PN +
Sbjct: 141 EPLSLDDLRQLLAVIHLNSFALSSERFPGMTTYGFYLRMAFCNHSCRPNACQYIDPNSTR 200
Query: 121 SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ +S ++LRA+ IA G+ + I Y + + T RR LQ YF CQC RC
Sbjct: 201 ARLNSPSIVLRAVSDIAEGEEVCISYIELMDTTPERREALQELYYFTCQCPRC 253
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 71/158 (44%), Gaps = 24/158 (15%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKL----NEEFTQEEILKICGIIQVNAHE 92
L +LE H+ + E+D + + + NF +++ T ++I I G+I N
Sbjct: 124 LDELEDHIADMP-----EDDLKELKVDIHNFLDYWPHNSKQHTIDDISHIFGVINCNGFS 178
Query: 93 MP---LTEPSYIAIFDRASFIEHNCYPN------------LYKSFTDSGQVLLRAMKPIA 137
+ + + +F + H+C+PN + F ++ LRA+ IA
Sbjct: 179 VSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMFHSQRRIELRALGKIA 238
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
G+ L++ Y D L + R+ L+T +F C CE C++
Sbjct: 239 EGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 276
>gi|238588335|ref|XP_002391696.1| hypothetical protein MPER_08835 [Moniliophthora perniciosa FA553]
gi|215456713|gb|EEB92626.1| hypothetical protein MPER_08835 [Moniliophthora perniciosa FA553]
Length = 422
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 11/121 (9%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEP--------SYIAIFDRASFIEHNCYPNLYKSFT-DS 125
+ +++ I G + N E+ P Y +F S I H+C PN++ F +S
Sbjct: 220 SHKDVGPIIGKKRTNGLELKDRRPRYGIEDLGKYSGVFKDVSRINHSCRPNVHTKFVMES 279
Query: 126 GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP-TELNTFYD 184
V +RA++ I G+ L I Y D T NR+ L+ F C C CR+P +++N
Sbjct: 280 FSVQVRALRDIKKGEELFISYIDIQTITQNRQKELEPYG-FRCACLACRNPRSDMNLLGV 338
Query: 185 G 185
G
Sbjct: 339 G 339
>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
aries]
Length = 369
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLR ++ + G+ L+ICY D L + RR L
Sbjct: 136 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRTVRDVEAGEELTICYLDMLMTSEERRKQL 194
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 195 RDQYCFDCDCFRCQ 208
>gi|342185633|emb|CCC95118.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 697
Score = 50.1 bits (118), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 5/93 (5%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSF-TDSGQVLLRAMKPIAPGDHLSICYTDP---LWGTIN 155
Y F + H+C N F S V L A++P+ G+ +++ Y L G N
Sbjct: 328 YHGFFRACALANHSCVANAAMKFDAASNTVTLIAVRPVEAGEFVNVKYLSDAQLLMGVGN 387
Query: 156 RRHHLQTSKYFICQCERCRDPTELNTFYDGVKC 188
RR +L+ S F C CERC + N F++ V+C
Sbjct: 388 RREYLR-SWLFWCGCERCTSDNDPNAFHEHVQC 419
>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 770
Score = 50.1 bits (118), Expect = 5e-04, Method: Composition-based stats.
Identities = 25/89 (28%), Positives = 42/89 (47%), Gaps = 2/89 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT--DPLWGTINRRHH 159
AI+ AS + H+C PN++ SF +++RA++ I G+ + CY R+
Sbjct: 491 AIYPSASLMNHSCDPNIFSSFRCGSTLVVRAIRRIQEGEEVLNCYGPHHRRMSFAERQQL 550
Query: 160 LQTSKYFICQCERCRDPTELNTFYDGVKC 188
LQ +F+C C C + +KC
Sbjct: 551 LQEQYFFVCSCTACSSGEDAEQRLQALKC 579
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 10/116 (8%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSY-----IAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+ E I + GI+ N M +T+ +AI + I H+C PN+ + + +
Sbjct: 271 SDEYIDHLFGIVSCNG--MSITDMRGLQYLGVAIHPTLNLINHDCNPNVV-AVSCGPNIF 327
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR--DPTELNTFY 183
+RA+KPI GD L I Y D + R++ L+ YF C C+ C + EL + Y
Sbjct: 328 VRAIKPIKEGDELFISYIDTSATSETRKNILKDQYYFDCTCKMCESGEKDELKSAY 383
>gi|317144906|ref|XP_003189640.1| set and mynd domain containing protein [Aspergillus oryzae RIB40]
Length = 372
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 19/132 (14%)
Query: 60 VVQFLLNFFKLNEEFTQEEILK-IC-------GIIQVNAHEMPLTEPSY-------IAIF 104
V ++L N L T E+L IC G+ PL E + ++
Sbjct: 216 VKKYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAGVY 275
Query: 105 DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI---NRRHHLQ 161
A+ H+C PN+ D +++ A K IA G+ I Y D L + +RR HLQ
Sbjct: 276 PTAALANHSCSPNIIHKPDDQSRMVFVASKDIATGEECCISYFD-LSKKVELKDRRDHLQ 334
Query: 162 TSKYFICQCERC 173
S F+C+C+RC
Sbjct: 335 GSFRFVCKCDRC 346
>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
Length = 391
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 40/76 (52%), Gaps = 4/76 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
A++ R S I H+C PN F S +L LRA + I PG+ + I Y D +R
Sbjct: 290 ALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDECNLQRSRHSRQ 349
Query: 161 QTSK---YFICQCERC 173
+T K FICQCE+C
Sbjct: 350 KTLKDYYLFICQCEKC 365
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ S + H+C PN F + +LLR ++ + G+ L+ICY D L + RR L
Sbjct: 195 VGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRTVRDVEAGEELTICYLDMLMTSEERRKQL 253
Query: 161 QTSKYFICQCERCR 174
+ F C C RC+
Sbjct: 254 RDQYCFDCDCFRCQ 267
>gi|258597255|ref|XP_001347831.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
gi|254832627|gb|AAN35744.2| conserved Plasmodium protein [Plasmodium falciparum 3D7]
Length = 793
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T+ + +++R S + H+C + ++ +LRA + PGD ++I Y D L+ +
Sbjct: 501 TDNEGLVLYNRISMLAHSCISTACWHYGENDSFVLRARINLNPGDEITISYLGDDDLYKS 560
Query: 154 INRRHHLQTSKYFICQCERCRDPTE 178
N R T+ F+C C RC P +
Sbjct: 561 SNIRREKLTNWLFVCMCSRCTHPVD 585
>gi|221056024|ref|XP_002259150.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193809221|emb|CAQ39923.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 778
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T+ + +++R S + H+C + + +LRA +A GD L+I Y D L+ +
Sbjct: 486 TDNEGLVLYNRISMLAHSCNSTACWHYGTNDSFVLRARVKLAVGDELTISYLGDDDLYKS 545
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
N R T+ F+C C RC +P + + G KC
Sbjct: 546 SNIRREKLTNWLFVCMCSRCTNPIDKSR---GFKCS 578
>gi|332023172|gb|EGI63428.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 624
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 4/82 (4%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW--G 152
EP I AI+ S + H+CYPN+ + +G V++RA++ I G + CY
Sbjct: 431 EPRNIGAAIYSTVSLVNHSCYPNMVRHSYPNGIVVVRALRFIGKGCEIFDCYGPHFLSES 490
Query: 153 TINRRHHLQTSKYFICQCERCR 174
+NRR L F+C C+ C+
Sbjct: 491 KLNRREFLWKKYRFLCGCDACK 512
>gi|156098478|ref|XP_001615271.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148804145|gb|EDL45544.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 772
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGT 153
T+ + +++R S + H+C + + +LRA +A GD L+I Y D L+ +
Sbjct: 480 TDNEGLVLYNRISMLAHSCNSTACWHYGSNDSFVLRARVKLAVGDELTISYLGDDDLYKS 539
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
N R T+ F+C C RC +P + + G KC
Sbjct: 540 SNIRREKLTNWLFVCMCSRCTNPIDNSR---GFKCS 572
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 70/148 (47%), Gaps = 11/148 (7%)
Query: 40 LEAHVEEYKNSPKYEND-RRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP 98
LE++VE +K D ++ L + T ++ +++ IQ NA +
Sbjct: 136 LESNVEGFKTDEGLWGDFELQATGAVVYAGLLQSDETLKQAMEVLCKIQTNAFNRFDADT 195
Query: 99 SYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
I+ S + H+C PN Y +F + + +L+A + + PGD + I Y D T+ R
Sbjct: 196 GQAGIYLHPSLSMVNHSCVPNAYITF-EKRKAVLKAERDLEPGDEILISYIDH---TMPR 251
Query: 157 RHHLQTSK--YFICQCERCRDPTELNTF 182
R ++ + +F C C RC+D +LN +
Sbjct: 252 RARQESLRLYHFQCNCIRCKD--DLNAY 277
>gi|302758304|ref|XP_002962575.1| hypothetical protein SELMODRAFT_78536 [Selaginella moellendorffii]
gi|300169436|gb|EFJ36038.1| hypothetical protein SELMODRAFT_78536 [Selaginella moellendorffii]
Length = 290
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ S H+C N+ + ++ L+A++PI PG L I Y D G RR LQ
Sbjct: 215 AVYILPSMFNHSCDANVNIYWRENAFAQLKALQPIEPGKELCITYIDASMGCEARRALLQ 274
Query: 162 TSKYFICQCERCRD 175
+ F C+C RC D
Sbjct: 275 DAYGFHCKCPRCLD 288
>gi|294892061|ref|XP_002773875.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239879079|gb|EER05691.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 140
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 38/72 (52%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
++ A+ H+C+PN+ +F + RA++ I+PG+ + Y + + RR L
Sbjct: 60 GVYPNAALFNHSCHPNVIPAFGHGSTLSFRAIRDISPGEEICHSYVELTLPSWKRRDVLL 119
Query: 162 TSKYFICQCERC 173
F+C+CERC
Sbjct: 120 RDYEFLCECERC 131
>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
Length = 562
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 50/108 (46%), Gaps = 4/108 (3%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA--SFIEHNCYPNLYKSFTDSGQ 127
L E T E+ ++ IQ NA + IF + + H+C PN + F D
Sbjct: 186 LESEETLEKAREVLCKIQTNAFNRLDADTGLSGIFLDVGLAMVNHSCVPNAFIGF-DRRT 244
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
+LRA +PI G+ ++I Y D R+ L+ +F C C RCRD
Sbjct: 245 AVLRAERPIQEGEEITISYIDNALPKSARQEALRL-YHFRCDCPRCRD 291
>gi|225684542|gb|EEH22826.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 365
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 4/105 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ G I+ NA MP + AIF +A+ I H+C PN ++ + G++ ++A K I G
Sbjct: 124 VTGTIKTNA--MPFGARGAEGAIFPQAARINHSCQPNTQNTWNRNLGKLTIQAFKDIDKG 181
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++I Y D R+ + + F C+CE C P E D
Sbjct: 182 EEITIAYVDCTELYDTRQECFENAFGFRCRCEVCAIPAEATKKRD 226
>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 393
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 1/69 (1%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
S+ H+C+PN + ++ + + ++ PI GD LSI Y D +R HL+ YF
Sbjct: 274 SYFNHSCFPNCVR-VQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332
Query: 168 CQCERCRDP 176
C+C+RC P
Sbjct: 333 CKCKRCTLP 341
>gi|238485368|ref|XP_002373922.1| set and mynd domain containing protein, putative [Aspergillus
flavus NRRL3357]
gi|220698801|gb|EED55140.1| set and mynd domain containing protein, putative [Aspergillus
flavus NRRL3357]
Length = 372
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 19/132 (14%)
Query: 60 VVQFLLNFFKLNEEFTQEEILK-IC-------GIIQVNAHEMPLTEPSY-------IAIF 104
V ++L N L T E+L IC G+ PL E + ++
Sbjct: 216 VKKYLRNSQSLPHGLTSSEVLVLICQEEANSFGLYPRETGAFPLPELAVDRGEQFAAGVY 275
Query: 105 DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI---NRRHHLQ 161
A+ H+C PN+ D +++ A K IA G+ I Y D L + +RR HLQ
Sbjct: 276 PTAALANHSCSPNIIHKPDDQSRMVFVASKDIATGEECCISYFD-LSKKVELKDRRDHLQ 334
Query: 162 TSKYFICQCERC 173
S F+C+C+RC
Sbjct: 335 GSFRFVCKCDRC 346
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 60/149 (40%), Gaps = 4/149 (2%)
Query: 27 KSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGII 86
+S D L LE H + + +P + + + E E +I +
Sbjct: 121 RSGGGHTRDLLDGLEGHAAQRRRAPGWADMELMAMAGCAFAGMETSEGAVREAAEILCKL 180
Query: 87 QVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSI 144
Q NA + ++ IF + H+C PN + F VL RA + I GD + I
Sbjct: 181 QTNAFHRFDADLGHVGIFLEPTLAMANHSCLPNAFVQFVGRTAVL-RAEQRIQSGDEIEI 239
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERC 173
YTD ++++R +F C+C RC
Sbjct: 240 SYTD-YTSSLSKRQAALAPYHFECRCRRC 267
>gi|326429665|gb|EGD75235.1| hypothetical protein PTSG_06889 [Salpingoeca sp. ATCC 50818]
Length = 836
Score = 49.7 bits (117), Expect = 6e-04, Method: Composition-based stats.
Identities = 29/96 (30%), Positives = 46/96 (47%), Gaps = 6/96 (6%)
Query: 87 QVNAHEMPL---TEPSYI---AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
Q+N + PL T P+ + IF + H+C PN G++ + ++ I
Sbjct: 286 QMNCNGYPLRLSTAPNTLVGMGIFPAVAMANHSCSPNCAVVTRPGGRLAVVTLQRIRKHQ 345
Query: 141 HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
L++ Y D L +RR +L SK F C+C RC+ P
Sbjct: 346 ELTVSYVDLLRPRAHRRQYLLASKNFHCRCLRCQHP 381
>gi|226294200|gb|EEH49620.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 365
Score = 49.7 bits (117), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 52/105 (49%), Gaps = 4/105 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ G I+ NA MP + AIF +A+ I H+C PN ++ + G++ ++A K I G
Sbjct: 124 VTGTIKTNA--MPFGARGAEGAIFPQAARINHSCQPNTQNTWNRNLGKLTIQAFKDIDKG 181
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++I Y D R+ + + F C+CE C P E D
Sbjct: 182 EEITIAYVDCTELYDTRQECFENAFGFRCRCEVCAIPAEATKKRD 226
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 11/148 (7%)
Query: 40 LEAHVEEYKNSPKYEND-RRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEP 98
LE++VE +K + D ++ L + T ++ +++ IQ NA +
Sbjct: 156 LESNVEGFKTDEELWKDFELQATGAVVYAGLLQSDETLKQAMEVLCKIQTNAFNRFDADT 215
Query: 99 SYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
I+ S + H+C PN Y +F + + L+A + I PGD + I Y D T R
Sbjct: 216 GQAGIYLHPSLSMVNHSCVPNAYITF-EKRKAFLKAERDIEPGDEILISYIDH---TAPR 271
Query: 157 RHHLQTSK--YFICQCERCRDPTELNTF 182
R ++ + +F C C RC+D +LN +
Sbjct: 272 RARQESLRLYHFQCNCVRCKD--DLNVY 297
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/137 (26%), Positives = 62/137 (45%), Gaps = 12/137 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
+V F+L + +N +EI + NAH + +E P ++ S I H+C P
Sbjct: 163 TLVNFILQWPGMN----VKEIAVNFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLP 218
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC--- 173
N F + ++RA++ I G +SI Y + T+ R+ L+ + F C C RC
Sbjct: 219 NAVLVF-EGRTAVVRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRCVKV 277
Query: 174 --RDPTELNTFYDGVKC 188
D + + +G +C
Sbjct: 278 AQEDEIKESAILEGYRC 294
>gi|393214406|gb|EJC99899.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 335
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 7/96 (7%)
Query: 84 GIIQVNAHEMPLTEPS----YIAIFDRASFIEHNCYPNLYKSFTDSGQVL--LRAMKPIA 137
GI++ N E+ +E Y A+ S + H+C PN + +DS ++ LRA + I
Sbjct: 111 GILRTNTIELSFSESDICTQYSAVCREISLVNHSCSPNTSQ-HSDSRTLICDLRAKRDIV 169
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PG+ ++I Y D + T R+ L+ F C C C
Sbjct: 170 PGEEITISYIDIVRPTTERKAELKIKYDFDCTCSVC 205
>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 465
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 107 ASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
+ + H+C PN F S Q+ L AMK I + ++I Y DP RR +LQ +
Sbjct: 205 GAMVNHSCDPNCVTIFARGSAQLELWAMKSIGKDEEVTISYVDPANCMNKRRKYLQKRYH 264
Query: 166 FICQCERC 173
F C+C+RC
Sbjct: 265 FDCRCQRC 272
>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
Length = 125
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
S + H+C PN F + +LLRA++ I G+ L+ICY D L + RR L+ F
Sbjct: 17 SLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75
Query: 168 CQCERCR 174
C C RC+
Sbjct: 76 CDCFRCQ 82
>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ER-3]
Length = 519
Score = 49.3 bits (116), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
AS+I H+C PN Y F D + L+A++ IAP + + I Y D R+ LQ +F
Sbjct: 217 ASYINHSCQPNAYIGF-DGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFF 275
Query: 167 ICQCERCRDPT 177
C+C +C + T
Sbjct: 276 ECKCPKCLEGT 286
>gi|159480142|ref|XP_001698143.1| hypothetical protein CHLREDRAFT_151604 [Chlamydomonas reinhardtii]
gi|158273641|gb|EDO99428.1| predicted protein [Chlamydomonas reinhardtii]
Length = 770
Score = 49.3 bits (116), Expect = 8e-04, Method: Composition-based stats.
Identities = 25/62 (40%), Positives = 33/62 (53%), Gaps = 7/62 (11%)
Query: 132 AMKPIAPGDHLSICY----TDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVK 187
A++PI PGD ++ CY L T R+ L +K F C CERCR P + Y V+
Sbjct: 170 AVRPIRPGDVITTCYLGWDEQTLMSTRMRQKLLYRTKLFHCTCERCRSPVDP---YRAVR 226
Query: 188 CP 189
CP
Sbjct: 227 CP 228
>gi|345486427|ref|XP_001607097.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 596
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 52/122 (42%), Gaps = 20/122 (16%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPS------------------YIAIFDRASFIEHNCY 115
FT +L++ I VN+H + T + + I AS H+C
Sbjct: 378 FTGSLLLRLSKISNVNSHAIANTNDACKYSNDSFTCRKNWCCVKGVCIVPLASLTNHSCN 437
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN--RRHHLQTSKYFICQCERC 173
PN + FTD + ++ A++PI GD + Y + N RR L+ + F C C+ C
Sbjct: 438 PNASRCFTDDLEFIMYALQPIKKGDQICDSYNSNFYEAPNPYRRDILRETYSFDCDCQAC 497
Query: 174 RD 175
+
Sbjct: 498 EN 499
>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ATCC 18188]
Length = 519
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
AS+I H+C PN Y F D + L+A++ IAP + + I Y D R+ LQ +F
Sbjct: 217 ASYINHSCQPNAYIGF-DGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQSELQLQYFF 275
Query: 167 ICQCERCRDPT 177
C+C +C + T
Sbjct: 276 ECKCPKCLEGT 286
>gi|209877585|ref|XP_002140234.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555840|gb|EEA05885.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 505
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT--DPLWGT 153
T + I +++R S + H+C + + +LRA + GD ++I Y D L+
Sbjct: 230 TSNNGIVLYNRISMMAHSCAASCCWHYGTENTFVLRAKTRLEIGDEITISYIADDDLFKC 289
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
R +L + F CQC RC DP +L+ G++C
Sbjct: 290 SKVRRNLLLNWLFYCQCSRCSDPVDLSR---GLRCA 322
>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 22/79 (27%), Positives = 41/79 (51%), Gaps = 1/79 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ + + H+C PN Y F + ++ +A++ I PG+ ++ YTD + RR +L
Sbjct: 53 AVYIDHALVNHSCRPNAYPVFNKTN-MIFKALRKIEPGEEITHAYTDTISPIQERREYLN 111
Query: 162 TSKYFICQCERCRDPTELN 180
F+C C C E++
Sbjct: 112 DVWRFMCNCPGCTKSNEID 130
>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
rotundata]
Length = 438
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 64/145 (44%), Gaps = 5/145 (3%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPL 95
+ L +H + K K + L F E++ I G I +N+ +
Sbjct: 113 RFKDLMSHYSDIKKDEKKMEHFMFLCGVLFGFLGDTPMPNSAELMGIYGRICINSFNIFD 172
Query: 96 TEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--HLSICYTDPLW 151
+ + I I+ S ++H+C PN +F + + +R ++ + D + I Y D L
Sbjct: 173 LDMNSIGVGIYLAPSVVDHSCVPNAVATF-EGITLNIRTIEDLPSLDWSQIRISYIDVLK 231
Query: 152 GTINRRHHLQTSKYFICQCERCRDP 176
T RR LQ+S YF C C++C +P
Sbjct: 232 TTKERRSELQSSYYFWCNCKKCEEP 256
>gi|407037947|gb|EKE38859.1| tetratricopeptide repeat-containing protein [Entamoeba nuttalli
P19]
Length = 425
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+F + + H+C PN + + TD L L A +PI+PGD L+I Y D R+ L
Sbjct: 349 GLFKYLNTLNHSCSPNCFLANTDDSCALSLIASRPISPGDELTISYIDNTLPYSQRQSLL 408
Query: 161 QTSKYFICQCERC 173
S +F C C +C
Sbjct: 409 YDSYHFYCHCPKC 421
>gi|299470965|emb|CBN79949.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 556
Score = 49.3 bits (116), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 92 EMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPL 150
E L ++ A+ H+C PN +SF SG+ L LR +PI G+ ++I T
Sbjct: 202 ESALNGEVGCGLYLEAAAANHSCNPNAAQSF--SGKTLSLRCTRPIRKGEEITIGITQIQ 259
Query: 151 WGTINRRHHLQTSKYFICQCERCRDP 176
RR L+ + +F C+CERC P
Sbjct: 260 KPGPARRESLRKTYFFECRCERCESP 285
>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 644
Score = 49.3 bits (116), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+A++ AS + H+C PN+ + D Q+ +RA + I G + CY P + R L
Sbjct: 374 VAVYGTASMLNHSCTPNVIAGY-DGNQLTIRATEMIKKGGEVLHCY-GPRVSDMFRDERL 431
Query: 161 QTSK---YFICQCERCRDPTELNTFYDGVKCP 189
+ + YF C+C C P E T ++CP
Sbjct: 432 KVLRDQYYFTCKCMFCGIPQEAITPSGALRCP 463
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 4/104 (3%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC ++ EM + + ++ S + H+C PN F + + LRA++ I G+
Sbjct: 179 ICNGFTISNGEM---QEVGVGLYPSMSLLNHSCDPNCVIVF-EGPSLFLRAIRNIPLGEE 234
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
L+ICY D L T R+ L+ F C C C+ ++ G
Sbjct: 235 LTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKTQSKDADMLAG 278
>gi|389738165|gb|EIM79366.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 395
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 50/116 (43%), Gaps = 7/116 (6%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS-----YIAIFDRASFIEHNCYPNLYKS 121
F L F E K+ G + N + E S Y+A+ + + + H+C P
Sbjct: 147 FLSLGNAFESSE-GKVFGRVLTNGIGIGTLEGSKMEGGYVAVAEHIARVNHSCRPATSWR 205
Query: 122 FTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
F + L LRA++PI G ++ Y D L + +R+ L+ F C C C P
Sbjct: 206 FDSTNFTLQLRALQPITSGSEITTSYIDILLPSSSRQSELRRKYGFTCSCTACTLP 261
>gi|346976798|gb|EGY20250.1| Mcg1p [Verticillium dahliae VdLs.17]
Length = 413
Score = 48.9 bits (115), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 34/118 (28%), Positives = 52/118 (44%), Gaps = 9/118 (7%)
Query: 81 KICGIIQVNAHEMPLTEPS------YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAM- 133
+ GI+ NA ++ + + +F S + H+C PN ++TD ++ RA
Sbjct: 203 SVRGIVATNAFQVDVGGDVAGGHHHHFGVFPAISRLNHDCGPNA-AAWTDKTGLVHRAFA 261
Query: 134 -KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCPE 190
K IA G+ +SI Y D L RR + S F C C RC E + +K E
Sbjct: 262 SKDIAAGEEISISYVDALAPRAERRARMAGSWGFECGCRRCAAGDESDARVAEIKALE 319
>gi|158299652|ref|XP_319721.4| AGAP008973-PA [Anopheles gambiae str. PEST]
gi|157013620|gb|EAA14846.4| AGAP008973-PA [Anopheles gambiae str. PEST]
Length = 668
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD--PLWGTINRR 157
+ AIF R S H+C PN+ F + + + A +PI G + CY L R+
Sbjct: 412 FTAIFPRISMFNHSCDPNIRNHF-ERATLTVHATRPIGAGGEVFNCYGPHYRLMAAAERK 470
Query: 158 HHLQTSKYFICQCERCRD-PTELNTFYDGVKCP 189
L+ F C CERCR+ ++ ++CP
Sbjct: 471 MLLRAQYCFECGCERCREGDATFEQRFNMIRCP 503
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 67/160 (41%), Gaps = 8/160 (5%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM 93
+ K L +H + K K V L F E++ I G I +N+ +
Sbjct: 111 YRKFKDLMSHYSDIKKDEKKMEHFVCVCGVLYEFLGDMSIPNSAELMGIYGRIYINSFNI 170
Query: 94 PLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDP 149
+ + I I+ S ++H+C PN +F + +++R + P + I Y D
Sbjct: 171 SDLDMNNIGAGIYLGPSILDHSCKPNAVATF-EGTTIIIRTTEDLPCLDLSQIRISYIDV 229
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T +RR LQ S YF C C++C + + + CP
Sbjct: 230 IKTTKDRREELQNSYYFWCNCKKCEESEPM---VEAAACP 266
>gi|331241142|ref|XP_003333220.1| hypothetical protein PGTG_14140 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312210|gb|EFP88801.1| hypothetical protein PGTG_14140 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 433
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 2/105 (1%)
Query: 82 ICGIIQVNAHEMPLTEPSYI-AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPG 139
I +I VN +++ + + A+ AS + H C PN+ + Q L +RA KPIA G
Sbjct: 224 ISNLIDVNTFNSEISKFNPLGAVMVDASRLNHACRPNVVYTLDSRTQTLRMRAFKPIAKG 283
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ L+I Y RR L+ F C C C+ ++L D
Sbjct: 284 EELTISYRSLEMTGKTRRESLKDDYGFDCTCSHCQMSSDLQEKSD 328
>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
Length = 400
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 40/75 (53%), Gaps = 1/75 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
I ++ +AS I H+C N F Q+ +RA + I PG+ + Y PL T R+ L
Sbjct: 194 IGVYPQASMINHSCKSNCIGMFYGP-QIQIRANEFIRPGEQIFHGYIPPLLPTAKRQEKL 252
Query: 161 QTSKYFICQCERCRD 175
+ +F+CQC CR+
Sbjct: 253 LKTYHFLCQCADCRN 267
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 8/77 (10%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSK--Y 165
S + H+C PN Y +F + + L+A + I PGD + I Y D T+ RR ++ + +
Sbjct: 227 SMVNHSCVPNAYIAF-EKRKAFLKAERDIEPGDEILISYID---HTMPRRARQESLRLYH 282
Query: 166 FICQCERCRDPTELNTF 182
F C C RC+D +LN +
Sbjct: 283 FQCNCIRCKD--DLNAY 297
>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
Length = 420
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 42/160 (26%), Positives = 67/160 (41%), Gaps = 8/160 (5%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFT--QEEILKICGIIQVNAHEM 93
K L +H E KN K ++ L + + + E++ I G + N +
Sbjct: 111 KFRDLMSHYAEIKNDRKRLEHLDSLHAVLTDMMADSPSTVPNKSELMSIYGRLITNGFNV 170
Query: 94 PLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDP 149
E + IA I+ S +H+C PN +F + ++ + A++ P + I Y D
Sbjct: 171 LDAEMNSIATAIYLGVSITDHSCQPNAVATF-EGNELHIHALEDMPCLDWSKIFISYIDL 229
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L RR L+ YF+C C +CRDP E CP
Sbjct: 230 LNTPEQRRQDLKDHYYFLCVCSKCRDPKEARQM-TAAACP 268
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 4/104 (3%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
IC ++ EM + + ++ S + H+C PN F + + LRA++ I G+
Sbjct: 192 ICNGFAISNGEM---QEVGVGLYPSMSLLNHSCDPNCVIVF-EGPSLFLRAVRDIQQGEE 247
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
L+ICY D L + R+ L+ F C C C ++ + G
Sbjct: 248 LTICYLDVLMPSAERQKQLKEQYCFDCDCPGCETRSKDDDMLSG 291
>gi|393243208|gb|EJD50723.1| hypothetical protein AURDEDRAFT_182119 [Auricularia delicata
TFB-10046 SS5]
Length = 434
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 45/108 (41%), Gaps = 15/108 (13%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
L+EE + + +I ++VN ++ S + H+C P + D L
Sbjct: 283 LSEEGFRRGLARINLNLEVNG-----------GLYALHSHLNHSCAPTVAARHMDPRTAL 331
Query: 130 LR----AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
R + P+ PG L+I Y DP G + RR LQ C C RC
Sbjct: 332 ARLAVVPLWPLKPGQELTITYVDPKMGVVARRAELQAWGIARCDCTRC 379
>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
Length = 448
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 71/166 (42%), Gaps = 14/166 (8%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFT--------QEEILKICGIIQ 87
K L +H E KN PK ++ L + + + + E++ I G +
Sbjct: 114 KFRDLMSHYAEIKNDPKRIEHLESLHAVLTDMMAESGKGSLTGSLVPNMNELMSIYGRLI 173
Query: 88 VNAHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLS 143
N + E + IA I+ S +H+C PN +F + ++ + A++ P +
Sbjct: 174 TNGFNILDAEMNSIATGIYLGVSITDHSCQPNAVATF-EGNELHVHAIEDLPCLDWSKIY 232
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
I Y D L RR L+ YF+C C +C DP E++ V CP
Sbjct: 233 ISYIDLLNTPEQRRADLKEHYYFLCVCSKCIDPQEMHEMTAAV-CP 277
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/143 (24%), Positives = 63/143 (44%), Gaps = 4/143 (2%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEM--P 94
+ L +H+ E NS + + +R + + E E I + I+VN+
Sbjct: 144 FLTLRSHLVEITNSNQEQYERIMLCAKAEKEYS-GSELDVETIAEYLARIEVNSFTFTTA 202
Query: 95 LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI 154
+P + I A ++ H+C PN F D G + ++A++ I P + + I Y D +
Sbjct: 203 FGDPLGLCIQPFACYMNHSCEPNAVVGF-DGGLITVKALREIKPDEQVFISYIDNTYPLE 261
Query: 155 NRRHHLQTSKYFICQCERCRDPT 177
R+ L +F C+C +C T
Sbjct: 262 VRQKQLTERYFFTCKCSKCAQGT 284
>gi|410077102|ref|XP_003956133.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
gi|372462716|emb|CCF56998.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
Length = 484
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 36/71 (50%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
++ S H+C PN ++ + A + I PG+ + + Y +PL G RR L+
Sbjct: 336 VYQLTSLANHSCEPNARYEIDSKLELKVYARRKIKPGEEIFLTYVNPLHGVNLRRRELRV 395
Query: 163 SKYFICQCERC 173
+ F+C+C RC
Sbjct: 396 NWGFLCKCSRC 406
>gi|428176655|gb|EKX45538.1| hypothetical protein GUITHDRAFT_138770 [Guillardia theta CCMP2712]
Length = 598
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 9/117 (7%)
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAM 133
+ E+ ++ I+ N+HE L E +F +EH+C PN +F+ G Q+ + A+
Sbjct: 260 SDEDAARMLSILNTNSHE--LGEWGGSGLFLLGCLMEHSCAPNC--NFSTYGTQLWVTAL 315
Query: 134 KPIAPGDHLSICYTDPLWG-TINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
I G+ LSI Y D + T RR L+ S F+C C RC T L CP
Sbjct: 316 NSIKKGESLSIDYNDSTFEPTAVRRKLLKDSYDFLCLCPRC---TTLPDVMRAFICP 369
>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 356
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 11/140 (7%)
Query: 39 QLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEI---LKICGIIQVNAHEMPL 95
QLE+H E+ RR+ + L+ K E+ E+ LK+ G N+ +
Sbjct: 129 QLESHHEKLSGQI-----RRDAFEIWLSLLKDCEDGVLPELSSWLKMFGATICNSISICD 183
Query: 96 TEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153
+ IA I+ RAS + H+C PN D ++ + +K + GD +I Y D +
Sbjct: 184 NDLIDIAVGIYLRASMLNHSCDPNC-AWVCDGRKLQIMTVKDVKEGDECTISYVDAMKPA 242
Query: 154 INRRHHLQTSKYFICQCERC 173
R+ L+ S +F C+C +C
Sbjct: 243 KVRQADLKESYHFTCKCVKC 262
>gi|401402655|ref|XP_003881302.1| putative ankyrin repeat-containing protein, conserved [Neospora
caninum Liverpool]
gi|325115714|emb|CBZ51269.1| putative ankyrin repeat-containing protein, conserved [Neospora
caninum Liverpool]
Length = 1567
Score = 48.9 bits (115), Expect = 0.001, Method: Composition-based stats.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 4/89 (4%)
Query: 103 IFDR--ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWG-TINRRHH 159
+F R A+F +H+C P + + G V +RA+ I G L I + L+ T+ R+
Sbjct: 562 VFSRGLANF-QHSCVPTCTYNLDEDGYVSVRALCHIPVGGRLCISMVEDLFAPTVARKGL 620
Query: 160 LQTSKYFICQCERCRDPTELNTFYDGVKC 188
+ F C C RC DPTE GV+C
Sbjct: 621 DALPRVFGCGCVRCSDPTEGGRMLRGVRC 649
>gi|311978082|ref|YP_003987202.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|81999695|sp|Q5UNT8.1|YL678_MIMIV RecName: Full=Putative SET domain-containing protein L678
gi|55417289|gb|AAV50939.1| unknown [Acanthamoeba polyphaga mimivirus]
gi|308204535|gb|ADO18336.1| putative SET domain-containing protein [Acanthamoeba polyphaga
mimivirus]
gi|339061617|gb|AEJ34921.1| hypothetical protein MIMI_L678 [Acanthamoeba polyphaga mimivirus]
Length = 255
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 54/128 (42%), Gaps = 13/128 (10%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL 118
NV QF+ N FT +EIL NA E AI + H+C PN+
Sbjct: 99 NVYQFI------NNNFTSDEILLYGAKYMCNAFEFNNGS----AILINGAKFNHSCVPNV 148
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+D + ++ I G+ L+ Y D + T R++ L F CQCERC +
Sbjct: 149 I-FVSDENYMYFYTVRNIKTGEELTDNYVDIMSNTKTRKNRLFNQYGFDCQCERCIGSDK 207
Query: 179 LNTFYDGV 186
L FY V
Sbjct: 208 L--FYQEV 213
>gi|256071345|ref|XP_002572001.1| set and mynd domain containing [Schistosoma mansoni]
gi|350645143|emb|CCD60151.1| set and mynd domain containing, putative [Schistosoma mansoni]
Length = 527
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 42/81 (51%), Gaps = 6/81 (7%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW----GTINRR 157
+F S + H+C PN +F +S ++LR +KPI PG + CY P + ++ R
Sbjct: 395 GLFPCVSLLNHSCDPNTAHNFEESF-LILRCLKPILPGTEVFHCY-GPHYLHYPSSVQRV 452
Query: 158 HHLQTSKYFICQCERCRDPTE 178
LQ +FIC CE C P +
Sbjct: 453 TLLQQQYFFICDCEHCSKPVQ 473
>gi|398257480|gb|EJN41088.1| hypothetical protein lvs_L585 [Acanthamoeba polyphaga
lentillevirus]
Length = 237
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 13/129 (10%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL 118
NV QF+ N FT +EIL NA E AI + H+C PN+
Sbjct: 81 NVYQFI------NNNFTSDEILLYGAKYMCNAFEFNNGS----AILINGAKFNHSCVPNV 130
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+D + ++ I G+ L+ Y D + T R++ L F CQCERC +
Sbjct: 131 I-FVSDENYMYFYTVRNIKTGEELTDNYVDIMSNTKTRKNRLFNQYGFDCQCERCIGSDK 189
Query: 179 LNTFYDGVK 187
L FY V+
Sbjct: 190 L--FYQEVE 196
>gi|240274726|gb|EER38242.1| SET domain-containing protein 5 [Ajellomyces capsulatus H143]
gi|325091062|gb|EGC44372.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 363
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 50/99 (50%), Gaps = 4/99 (4%)
Query: 82 ICGIIQVNAHEMPLT-EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPG 139
+ G I+ NA MP + S AIF RA+ I H+C PN ++ + + L + + K I G
Sbjct: 122 VIGTIKTNA--MPFSARGSEGAIFPRAARINHSCKPNSQNTWNQNLERLTIHSFKDIVEG 179
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ ++I Y D R+ + + F CQCE C P E
Sbjct: 180 EEVTIAYVDGTELYDERQACFEEAFGFRCQCEVCAVPRE 218
>gi|32398686|emb|CAD98646.1| hypothetical predicted protein, unknown function [Cryptosporidium
parvum]
Length = 467
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 44/79 (55%), Gaps = 14/79 (17%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT------DPLWGTIN 155
+++ ++SF++H+C PN+ + + + V RA++PI + LSICY DPLW
Sbjct: 202 SLYFKSSFLDHSCSPNVARFYIGTVAV-FRALRPIKRKETLSICYIESEYIQDPLWV--- 257
Query: 156 RRHHLQTSKYFICQCERCR 174
R L F C+CE C+
Sbjct: 258 RSSELN----FFCRCELCQ 272
>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/126 (23%), Positives = 55/126 (43%), Gaps = 20/126 (15%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDR------------------ASFIEHNCY 115
F + K+ I Q+N+ +P+ Y D S + H+C
Sbjct: 404 FLGSLVYKLSKIFQLNSRIIPIGRDFYTTGLDARMCENKQCCTTGLYIAPITSLLNHSCI 463
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL--WGTINRRHHLQTSKYFICQCERC 173
PN+ + F+++ V++ A++PI G L CY + R+ HL+ + F C C+ C
Sbjct: 464 PNVKRCFSNNYSVIVYAVQPIKKGSQLFDCYQQEFYEYNISPRQKHLKKTYNFNCDCKAC 523
Query: 174 RDPTEL 179
++ ++
Sbjct: 524 KEKWDI 529
>gi|66475308|ref|XP_627470.1| SET domain containing protein [Cryptosporidium parvum Iowa II]
gi|46228934|gb|EAK89783.1| SET domain containing protein [Cryptosporidium parvum Iowa II]
Length = 467
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 44/79 (55%), Gaps = 14/79 (17%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT------DPLWGTIN 155
+++ ++SF++H+C PN+ + + + V RA++PI + LSICY DPLW
Sbjct: 202 SLYFKSSFLDHSCSPNVARFYIGTVAV-FRALRPIKRKETLSICYIESEYIQDPLWV--- 257
Query: 156 RRHHLQTSKYFICQCERCR 174
R L F C+CE C+
Sbjct: 258 RSSELN----FFCRCELCQ 272
>gi|159463378|ref|XP_001689919.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283907|gb|EDP09657.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 765
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 28/93 (30%), Positives = 48/93 (51%), Gaps = 5/93 (5%)
Query: 77 EEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAM 133
+E+ ++ + VN H + E P A++ + H+C P+ + F G+ L LRA+
Sbjct: 98 KEVAQLLARLAVNCHTVCDEELRPLGTALYPSGALANHSCRPSTVQVF--RGRTLQLRAL 155
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
+P+APG +++CY +P RR L S F
Sbjct: 156 RPLAPGQEVTLCYLEPAATAQERREALWDSYCF 188
>gi|225561626|gb|EEH09906.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 363
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPG 139
+ G I+ NA MP S AIF RA+ I H+C PN ++ + + L + + K I G
Sbjct: 122 VIGTIKTNA--MPFGARGSEGAIFPRAARINHSCKPNSQNTWNQNLERLTIHSFKDIEEG 179
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ ++I Y D R+ + + F CQCE C P E
Sbjct: 180 EEVTIAYVDGTELYDERQACFEEAFGFRCQCEVCAAPRE 218
>gi|351737851|gb|AEQ60886.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
Length = 263
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 55/129 (42%), Gaps = 13/129 (10%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL 118
NV QF+ N FT +EIL NA E AI + H+C PN+
Sbjct: 99 NVYQFI------NNNFTSDEILLYGAKYMCNAFEFNNGS----AILINGAKFNHSCVPNV 148
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+D + ++ I G+ L+ Y D + T R++ L F CQCERC +
Sbjct: 149 I-FVSDENYMYFYTVRNIKTGEELTDNYVDIMSNTKTRKNRLFNQYGFDCQCERCIGSDK 207
Query: 179 LNTFYDGVK 187
L FY V+
Sbjct: 208 L--FYQEVE 214
>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 668
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW--- 151
EP I AI+ S I H+CYPN+ + SG V++R ++ + G + CY P W
Sbjct: 469 EPRQIGGAIYPSVSLINHSCYPNVVRHTYPSGIVVVRTLRFVGKGTEILDCY-GPHWFSE 527
Query: 152 GTINRRHHLQTSKYFICQCERC 173
++R +L F+C C+ C
Sbjct: 528 NKLSRIEYLWKKYRFLCTCDAC 549
>gi|326510051|dbj|BAJ87242.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 293
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/158 (23%), Positives = 66/158 (41%), Gaps = 30/158 (18%)
Query: 35 DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
D ++++E E K++ + + + FL T++ + + I+VNA +
Sbjct: 143 DTIIEMEEEFELLKSTFRKAGFQEELTTFL----------TKQWYINVLARIRVNAFRIE 192
Query: 95 LTEPSYIAIFDRA------------------SFIEHNCYPNLYKSFTDSGQVLLRAMKPI 136
L SY + A SF H+C PN + + ++ L ++ I
Sbjct: 193 LVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWLENADAKLNTLRDI 252
Query: 137 APGDHLSICYTDPLWGTINRRHHLQTSKY-FICQCERC 173
G+ L ICY D +N R + T + F C+C+RC
Sbjct: 253 DEGEELRICYIDTSM-NVNARQKILTEGFGFQCRCQRC 289
>gi|302686742|ref|XP_003033051.1| hypothetical protein SCHCODRAFT_54270 [Schizophyllum commune H4-8]
gi|300106745|gb|EFI98148.1| hypothetical protein SCHCODRAFT_54270 [Schizophyllum commune H4-8]
Length = 482
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 4/96 (4%)
Query: 93 MPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLW 151
MP +Y +FD+ S + H+C PN ++ + S LRA++ I P + ++I Y +P
Sbjct: 201 MPGGGATYTGVFDKLSRMNHSCTPNAVFTWANMSFSGALRAIRDIQPDEEITITYCNPSL 260
Query: 152 GTINRRHHLQTSKYFI-CQCERCRDPTELNTFYDGV 186
T R L + Y I C C C+D + + D +
Sbjct: 261 STSERAAAL--APYDIECDCPACKDGSASDARRDDI 294
>gi|67623599|ref|XP_668082.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54659279|gb|EAL37864.1| hypothetical protein Chro.60128 [Cryptosporidium hominis]
Length = 467
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 44/79 (55%), Gaps = 14/79 (17%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT------DPLWGTIN 155
+++ ++SF++H+C PN+ + + + V RA++PI + LSICY DPLW
Sbjct: 202 SLYFKSSFLDHSCSPNVARFYIGTVAV-FRALRPIKRKETLSICYIESEYIQDPLWV--- 257
Query: 156 RRHHLQTSKYFICQCERCR 174
R L F C+CE C+
Sbjct: 258 RSSELN----FFCRCELCQ 272
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
+F +SF H+C PN+Y +F ++ A++ I G+ L I Y D +R+ HL
Sbjct: 279 GLFFLSSFFNHSCDPNVYMAFPHDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLF 338
Query: 162 TSKYFICQCERC 173
+ F C C +C
Sbjct: 339 DNYGFNCDCPKC 350
>gi|402077091|gb|EJT72440.1| hypothetical protein GGTG_09306 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 455
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 5/121 (4%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
FF L F ++E + GI + NA +P+ + + IF AS I H C N ++ +
Sbjct: 72 FFSLRNSF-EDEATRELGITRTNA--LPIGSNAATGGIFLEASRINHACLQNAQNTWNEG 128
Query: 126 GQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
Q L + A++ I G+ ++I Y + R+ LQ F C C+ C P L D
Sbjct: 129 LQQLTIHAIRDINQGEEITIMYIEDRANRAARQRTLQRDFRFTCSCQLCLLPPSLRASSD 188
Query: 185 G 185
Sbjct: 189 A 189
>gi|154282997|ref|XP_001542294.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150410474|gb|EDN05862.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 282
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 4/99 (4%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPG 139
+ G I+ NA MP S AIF A+ I H+C PN ++ + + L + + K I G
Sbjct: 41 VIGTIKTNA--MPFGARGSEGAIFPSAARINHSCKPNSQNTWNRNLERLTIHSFKDIEEG 98
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ L+I Y D R+ + + + F CQCE C P E
Sbjct: 99 EELTIAYVDGTELYDERQAYFEEAFGFRCQCEVCAVPRE 137
>gi|315040696|ref|XP_003169725.1| hypothetical protein MGYG_07892 [Arthroderma gypseum CBS 118893]
gi|311345687|gb|EFR04890.1| hypothetical protein MGYG_07892 [Arthroderma gypseum CBS 118893]
Length = 538
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 65/126 (51%), Gaps = 6/126 (4%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS 125
F L+ F E+ +++ GI++ NA +PL + + +F ++ I H+C PN ++ ++
Sbjct: 71 FLSLHNAFEDEDGVEL-GIVRTNA--LPLGSNATEGGMFLDSARINHSCNPNAQNTWNEN 127
Query: 126 GQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
Q L + A++ I + ++ICY + R+ LQ+ F+C C C P + + D
Sbjct: 128 LQKLTIHAIRDIPRDEEITICYLSTIQNRSARQCALQSCFRFVCSCSLCALPVDQSRASD 187
Query: 185 GVKCPE 190
+C E
Sbjct: 188 K-RCDE 192
>gi|302697297|ref|XP_003038327.1| hypothetical protein SCHCODRAFT_103387 [Schizophyllum commune H4-8]
gi|300112024|gb|EFJ03425.1| hypothetical protein SCHCODRAFT_103387, partial [Schizophyllum
commune H4-8]
Length = 360
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
+ +Y+ +F+RAS I H+C PN F S +++RA++ I+ G+ + I Y++ L
Sbjct: 170 DSAYVGVFERASRINHSCSPNAVYHFDLQSFCLVVRAIRHISKGEEIFISYSETLLQAAT 229
Query: 156 RRHHLQTSKYFICQCERC 173
RR + F C C C
Sbjct: 230 RRQNSLQDYGFRCACPAC 247
>gi|407408418|gb|EKF31865.1| hypothetical protein MOQ_004294 [Trypanosoma cruzi marinkellei]
Length = 744
Score = 48.5 bits (114), Expect = 0.001, Method: Composition-based stats.
Identities = 24/79 (30%), Positives = 40/79 (50%), Gaps = 6/79 (7%)
Query: 101 IAIFDRASFIEHNCYPN---LYKSFTDSGQVL---LRAMKPIAPGDHLSICYTDPLWGTI 154
I +F S IEH+C+PN +++ S ++ LRA +PI G+ ++I Y
Sbjct: 373 IGVFGGISLIEHSCHPNALVVFRHGCTSESIVFAELRATRPIGIGERITIAYVPTFLPKE 432
Query: 155 NRRHHLQTSKYFICQCERC 173
R+ L+ +F C C+ C
Sbjct: 433 ERKKRLRAKFFFSCACDHC 451
>gi|209878147|ref|XP_002140515.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209556121|gb|EEA06166.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 573
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 43/93 (46%), Gaps = 6/93 (6%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ----VLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
+F AS I H+C PN D + +R+ I GD ++I Y + R
Sbjct: 271 GLFPYASLINHSCEPNCSYLGNDESTRYPYIEIRSTTNIPKGDEITISYIELYESRKQRI 330
Query: 158 HHLQTSKYFICQCERCRDP--TELNTFYDGVKC 188
L +KYFIC+C RC+ ++ F +G+ C
Sbjct: 331 SQLYKTKYFICRCTRCKQDLYKSMDYFIEGILC 363
>gi|356533089|ref|XP_003535101.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Glycine
max]
Length = 313
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + D+ L+A++ I G+ L ICY D R+ L
Sbjct: 238 AVYLLPSFYNHDCDPNAHIIWIDNADAKLKALRDIVEGEELRICYIDASLDRNARQELLS 297
Query: 162 TSKYFICQCERC 173
F C C RC
Sbjct: 298 RGFGFQCNCSRC 309
>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 631
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 45/91 (49%), Gaps = 6/91 (6%)
Query: 89 NAHE--MPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSI 144
N HE + EP I AI+ S + H+CYPN+ + SG V++RA++ I G +
Sbjct: 421 NVHEEVTRVWEPRNIGGAIYTTVSLVNHSCYPNVVRHSYPSGIVVVRALRFIGKGCEILD 480
Query: 145 CYTDPLW--GTINRRHHLQTSKYFICQCERC 173
CY + RR L F+C+C+ C
Sbjct: 481 CYGPQFLSESRMARRELLWKKYRFLCECDAC 511
>gi|50303021|ref|XP_451448.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74637076|sp|Q6CX91.1|SET5_KLULA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49640579|emb|CAH03036.1| KLLA0A10241p [Kluyveromyces lactis]
Length = 492
Score = 48.5 bits (114), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 35/71 (49%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
++ + I H+C PN+ + L A K I G+ L + Y +PL G RR L+
Sbjct: 340 VYPLVAHINHSCEPNVRYELEPKHGIKLYARKDIKKGEQLRLTYVNPLHGVTLRRRELRV 399
Query: 163 SKYFICQCERC 173
+ F+C C RC
Sbjct: 400 NYGFLCHCPRC 410
>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
Length = 391
Score = 48.1 bits (113), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 1/88 (1%)
Query: 102 AIFDRASFIEHNCYPNLYK-SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
++D+ S+ H+C PN++K + T G++++ A++ I G+ L Y R L
Sbjct: 273 GLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEARIKKL 332
Query: 161 QTSKYFICQCERCRDPTELNTFYDGVKC 188
+ S +F CQC C++ F D C
Sbjct: 333 KESYFFNCQCPGCKNAPSHKQFLDKYLC 360
>gi|353234341|emb|CCA66367.1| hypothetical protein PIIN_00053 [Piriformospora indica DSM 11827]
Length = 527
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 42/74 (56%), Gaps = 2/74 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPGDHLSICYTDPLWGTIN-RRHH 159
+F + S ++C PN ++ + G++ + A++PI GD L+ICY PL+ N RR +
Sbjct: 135 GVFLQLSRFNNSCTPNASITWDERRGRMTVHALEPIRAGDELTICYGQPLFAVRNERREY 194
Query: 160 LQTSKYFICQCERC 173
L+ + F C C C
Sbjct: 195 LRHLRNFTCTCACC 208
>gi|363755786|ref|XP_003648109.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
gi|356892145|gb|AET41292.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
Length = 487
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
I+ S I H+C PN Y F G + A K IA G+ L Y +PL +RR L
Sbjct: 339 IYPLFSHINHSCEPNTYFEFDKHG-IKAFARKDIAAGEELLTTYVNPLHDVNSRRRELCV 397
Query: 163 SKYFICQCERCR 174
+ F+C C RC+
Sbjct: 398 NWGFLCNCRRCK 409
>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/181 (22%), Positives = 77/181 (42%), Gaps = 21/181 (11%)
Query: 13 PSYECIIALRCLYKKSHQPQIWD-----KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF 67
PS +A R L++ I + QL+ HV + + ++ R +V +FL +
Sbjct: 100 PSENIRLAARVLWRMHKDTGIASDSQLISVDQLQEHVADLA-AEDFKQLRADVHKFLQYW 158
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPN------- 117
+ + + I I GII+ N + + + +F + H+C+PN
Sbjct: 159 SYGTRQHSVDYISHIFGIIKCNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCTVILNH 218
Query: 118 -----LYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCER 172
+ + ++ LRA+ I G+ L++ Y D L + +R+ L+ +F C CE
Sbjct: 219 GNQSAVSSALHSQRRIELRALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEH 278
Query: 173 C 173
C
Sbjct: 279 C 279
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 12/137 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + +N + E K NAH + +E P ++ S I H+C P
Sbjct: 168 NLVNSILEWPGINIKEIAENFSKFA----CNAHTICDSELRPVGTGLYPVISIINHSCLP 223
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N F S L+RA++ I G + I Y + T+ R+ L+ F C C RC
Sbjct: 224 NSVLVFEGSS-ALVRAVQHIPSGTEVLISYIETAESTMTRQKALKEQYLFTCTCPRCSKV 282
Query: 175 ---DPTELNTFYDGVKC 188
D + + +G KC
Sbjct: 283 GQYDDIQESAILEGYKC 299
>gi|383849683|ref|XP_003700474.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 686
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 40/166 (24%), Positives = 73/166 (43%), Gaps = 20/166 (12%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPL 95
+ +Q E N + N+R V LL+ +F E+ + H
Sbjct: 423 RCLQRVGFFERTSNDAETPNEREIAVAALLSRHLQLLQFNAHEVFET---RHGTEHRFRG 479
Query: 96 TEPSY--IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT 153
++P Y +A++ + H+CYP + + F +++RA++ + PGD ++ Y P++
Sbjct: 480 SKPVYLGVAVYPTVARFNHDCYPAVTRYFVGR-SIVVRAIRRLRPGDVVAENY-GPIFTK 537
Query: 154 I---NRRHHLQTSKYFICQCERCRDPTELNTFYDGV-------KCP 189
I RR L +F C+C CR+ L +DG+ +CP
Sbjct: 538 IPLKKRRDTLAGRYWFRCECTACREDWPL---FDGLTNDLVRFRCP 580
>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
Length = 439
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 59/122 (48%), Gaps = 8/122 (6%)
Query: 68 FKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY------IAIFDRASFIEHNCYPNLYK- 120
+L +E Q +L + ++ N HE+ +T Y A++++AS H+C PN+ +
Sbjct: 243 LQLIQEKQQRRMLNLMCALECNTHEIGITVDEYNYCSIGSAVYEKASLFNHSCQPNVCRI 302
Query: 121 -SFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
+ G + + ++ IA G L Y T +R+ L + +F C+C C + T +
Sbjct: 303 NRAGEWGALEMISLTDIAAGTELVYNYIQISLPTEDRQSKLSENYFFECKCNGCVNNTSI 362
Query: 180 NT 181
++
Sbjct: 363 SS 364
>gi|209879634|ref|XP_002141257.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209556863|gb|EEA06908.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 1424
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTS-KYFICQ 169
+H+C P +SG + +R+ + L+I Y L+ +R + S K F C
Sbjct: 406 QHSCIPTCVYYLNESGNLCIRSAYNLPENMPLTISYLMDLFIPTQKRKSISNSMKVFSCM 465
Query: 170 CERCRDPTELNTFYDGVKC 188
C RC D TE N F +G++C
Sbjct: 466 CARCMDDTENNCFLEGIRC 484
>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
Length = 457
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 69/164 (42%), Gaps = 12/164 (7%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE------EILKICGIIQVN 89
K L +H E KN PK ++ L + + + E++ I G + N
Sbjct: 125 KFRDLMSHYAEIKNDPKRREHLESLHAVLTDMISDSNSGSSSIVPNITELMSIYGRLITN 184
Query: 90 AHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSIC 145
+ E + IA I+ S +H+C PN +F + ++ + ++ P + I
Sbjct: 185 GFNILDAEMNSIATAIYLGVSITDHSCQPNAVATF-EGNELHVHVIEDLPCLDWSKIYIS 243
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
Y D L RR L+ YF+C C +C DP E++ V CP
Sbjct: 244 YIDLLNTPEQRRADLREHYYFLCVCSKCIDPQEMHEMTAAV-CP 286
>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
Length = 392
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI GD + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIVVLKALAPIQEGDEICISYLDECMLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YVFICQCSKCR 364
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 60/131 (45%), Gaps = 14/131 (10%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKIC---GIIQVNA----HEMPLTEPSYI----AIFDRA 107
+V Q + NF KL E + +C I ++N HE L I AI+ A
Sbjct: 173 DVQQLVTNFNKLTPEDAFSYGILVCNGHAITKINVTADDHENKLLIEEQIRIATAIYPSA 232
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL--WGTINRRHHLQTSKY 165
S + H+C PN+ SF D ++++A++ I G+ + CY R+ L++ Y
Sbjct: 233 SMMNHSCDPNIINSFLDQ-TLIVKAIQDIKEGEEILHCYGVDFRRMPKAERQESLESQYY 291
Query: 166 FICQCERCRDP 176
F C C+ C P
Sbjct: 292 FTCNCKACTMP 302
>gi|449437548|ref|XP_004136554.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 341
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 37/72 (51%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + ++ L+A++ + P + L ICY D R+ L
Sbjct: 266 AVYMLPSFYNHDCDPNTHIIWINNANAKLKALRDVDPDEELRICYIDASMDYDARQTLLH 325
Query: 162 TSKYFICQCERC 173
FIC+C RC
Sbjct: 326 RGFGFICKCARC 337
>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
Length = 457
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 69/164 (42%), Gaps = 12/164 (7%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE------EILKICGIIQVN 89
K L +H E KN PK ++ L + + + E++ I G + N
Sbjct: 125 KFRDLMSHYAEIKNDPKRREHLESLHAVLTDMISDSNSGSSSIVPNITELMSIYGRLITN 184
Query: 90 AHEMPLTEPSYIA--IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSIC 145
+ E + IA I+ S +H+C PN +F + ++ + ++ P + I
Sbjct: 185 GFNILDAEMNSIATAIYLGVSITDHSCQPNAVATF-EGNELHVHVIEDLPCLDWSKIYIS 243
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
Y D L RR L+ YF+C C +C DP E++ V CP
Sbjct: 244 YIDLLNTPEQRRADLREHYYFLCVCSKCIDPQEMHEMTAAV-CP 286
>gi|195454342|ref|XP_002074198.1| GK12748 [Drosophila willistoni]
gi|194170283|gb|EDW85184.1| GK12748 [Drosophila willistoni]
Length = 394
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI GD + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIVVLKALSPIQEGDEICISYLDECQLERSRHSRHKILCGN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YIFICQCPKCR 364
>gi|414871458|tpg|DAA50015.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 299
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/122 (26%), Positives = 48/122 (39%), Gaps = 18/122 (14%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA------------------SFIE 111
EE T + + I++NA + L SY + A SF
Sbjct: 174 FQEEITTYWYINVLARIRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYN 233
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN + + + L+A++ I G+ L ICY D R+ L F C+C
Sbjct: 234 HDCDPNAHIVWLQNADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCL 293
Query: 172 RC 173
RC
Sbjct: 294 RC 295
>gi|294463315|gb|ADE77193.1| unknown [Picea sitchensis]
Length = 288
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ S H+C PN + + ++ + L+A++ I PG+ L ICY D R+ L
Sbjct: 213 AVYMLPSMYNHDCDPNAHILWIENAKARLKALRNIEPGEELRICYIDASMDYEARQSLLY 272
Query: 162 TSKYFICQCERCR 174
F CQC RC+
Sbjct: 273 QGFGFRCQCLRCK 285
>gi|440463292|gb|ELQ32885.1| hypothetical protein OOU_Y34scaffold01018g15 [Magnaporthe oryzae
Y34]
gi|440490415|gb|ELQ69973.1| hypothetical protein OOW_P131scaffold00096g15 [Magnaporthe oryzae
P131]
Length = 428
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 47/97 (48%), Gaps = 2/97 (2%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAPGDHL 142
GI+ N M L+ + +F R S I H+C PN+Y +T S ++ + A + I+ G L
Sbjct: 225 GIMDSNTFGMTLSGVPHSGLFPRISRINHSCRPNVYVRYTRSTLELEVVAYRDISAGTEL 284
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
+ YT PL R + F C C+ C P+ +
Sbjct: 285 GVSYT-PLNMLSRDRRQVLLGWGFNCTCQLCSAPSHI 320
>gi|322693095|gb|EFY84969.1| lysine methyltransferase [Metarhizium acridum CQMa 102]
Length = 563
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 35/80 (43%), Gaps = 3/80 (3%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSK 164
A + H+C PN F D + R A+ IAPG +SI Y PL R L+ S
Sbjct: 377 AQRLNHDCRPNAVY-FFDEATLTQRVHALTDIAPGAEISITYISPLQSRAERLSRLRASW 435
Query: 165 YFICQCERCRDPTELNTFYD 184
F C C C P EL D
Sbjct: 436 GFNCSCSACSQPPELAALSD 455
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 71/164 (43%), Gaps = 16/164 (9%)
Query: 21 LRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEIL 80
+R + + +P+I + + LE + ++ S K+ D + F L QE+I
Sbjct: 105 IRAVVQALVKPEIGNAIEDLEGNGTSWRKSEKWA-DMEMMAMGASAFAGLGT--GQEDI- 160
Query: 81 KICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAP 138
Q NA T+ + IF + + H+C PN F + +LRA KPI
Sbjct: 161 ------QTNAFHRYDTDLGQVGIFLEPKLAMANHSCIPNAMVQFIGR-KAILRAEKPIKI 213
Query: 139 GDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
D + I YTD + +++R +F C C RC +LN +
Sbjct: 214 DDEIEISYTDYTF-PLSKRKLALAPYFFDCMCLRCE--KDLNVY 254
>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
rubripes]
Length = 434
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 73/142 (51%), Gaps = 9/142 (6%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK--LNEEFTQEEILKICGIIQVNAHEMP 94
L E+H+++ +S K E ++ ++ L +F+ +N+ + + ++ + N +
Sbjct: 131 LKDFESHLDKM-DSIKDEMNQADIAA-LHHFYSRYINDIPDDQSLTELFAQVNCNGFTIE 188
Query: 95 LTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLW 151
E S++ AIF + + H+C PN+ ++ G V +RA++ + PGD + Y D L+
Sbjct: 189 DEELSHLGSAIFPDVALMNHSCNPNVIVTY--KGTVAEVRAVQELNPGDEILNSYIDLLY 246
Query: 152 GTINRRHHLQTSKYFICQCERC 173
T +R+ L S +F C C C
Sbjct: 247 PTEDRKERLLDSYFFTCLCSEC 268
>gi|350632678|gb|EHA21045.1| hypothetical protein ASPNIDRAFT_45712 [Aspergillus niger ATCC 1015]
Length = 367
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 50/102 (49%), Gaps = 8/102 (7%)
Query: 79 ILKIC-GIIQVNAHEMPLTEPSY---IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+ ++C G ++ H+ P E +A+F R + H+C PN+ D G++++ A++
Sbjct: 259 VFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSCAPNVTHQADDRGRMMVTALR 318
Query: 135 PIAPGDHLSICYTDPLWGTIN---RRHHLQTSKYFICQCERC 173
IAPG+ Y D L ++ RR Q F C C RC
Sbjct: 319 DIAPGEECCTSYFD-LSEYVDLQARRKKTQELFTFTCTCPRC 359
>gi|401412321|ref|XP_003885608.1| hypothetical protein NCLIV_060040 [Neospora caninum Liverpool]
gi|325120027|emb|CBZ55580.1| hypothetical protein NCLIV_060040 [Neospora caninum Liverpool]
Length = 703
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT--DPLWGT 153
TE + +++R S + H+C + + +LRA + GD L+I Y D L+ +
Sbjct: 425 TEQQGLVLYNRISMMAHSCRATACWHYGEDDAFVLRARVKLQAGDELTISYIGDDDLFKS 484
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
N R F CQC RC P + G +CP
Sbjct: 485 TNVRREKVQGWLFTCQCVRCAAPVDNAR---GFRCP 517
>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
Length = 433
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 73/142 (51%), Gaps = 9/142 (6%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFK--LNEEFTQEEILKICGIIQVNAHEMP 94
L E+H+++ +S K E ++ ++ L +F+ +N+ + + ++ + N +
Sbjct: 131 LKDFESHLDKM-DSIKDEMNQADIAA-LHHFYSRYINDIPDDQSLTELFAQVNCNGFTIE 188
Query: 95 LTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLW 151
E S++ AIF + + H+C PN+ ++ G V +RA++ + PGD + Y D L+
Sbjct: 189 DEELSHLGSAIFPDVALMNHSCNPNVIVTY--KGTVAEVRAVQELNPGDEILNSYIDLLY 246
Query: 152 GTINRRHHLQTSKYFICQCERC 173
T +R+ L S +F C C C
Sbjct: 247 PTEDRKERLLDSYFFTCLCSEC 268
>gi|366991437|ref|XP_003675484.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
gi|342301349|emb|CCC69117.1| hypothetical protein NCAS_0C01270 [Naumovozyma castellii CBS 4309]
Length = 374
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAM-KPIAPGDHLSICYTDPLWGTIN-RRHHL 160
IF RAS+ H+C PNL K T G+ + + + I G L I Y+ L + RR L
Sbjct: 296 IFPRASYFNHSCDPNLTK--TRKGRTMFFTLNRDIDVGSELDIDYSGVLSLPVKERRKFL 353
Query: 161 QTSKYFICQCERCRDPTEL 179
S +F CQC+RC+ +L
Sbjct: 354 HDSWFFDCQCDRCKLEMQL 372
>gi|332024084|gb|EGI64301.1| SET and MYND domain-containing protein 3 [Acromyrmex echinatior]
Length = 441
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 41/82 (50%), Gaps = 9/82 (10%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--------HLSICYTDPLWG 152
+ I+ AS I+H+C PN+ F + +++R + + D + I Y D L
Sbjct: 176 VGIYLGASVIDHSCKPNVIVVF-EGTTIIVRTLTDLPSLDWSQASIDKDIRISYVDLLNS 234
Query: 153 TINRRHHLQTSKYFICQCERCR 174
+RR L +S YF C CERC+
Sbjct: 235 NKDRREELHSSYYFWCDCERCK 256
>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
Length = 909
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 10/95 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI---NRRH 158
A+F + + H+C PN++ SF +++RA+ I PG+ L CY P G + R+
Sbjct: 642 AVFPTEALLNHSCRPNVFVSF-QGKTLIVRAVSHIKPGEELLHCY-GPHAGRMVYGERQA 699
Query: 159 HLQTSKYFICQCERCRD----PTELNTFYDGVKCP 189
L+ +F C C+ C++ P ++ F KCP
Sbjct: 700 ALKEQYFFSCSCDACQEQVGNPNTVDMF-SAYKCP 733
>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
Length = 358
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 8/105 (7%)
Query: 76 QEEILKIC-GIIQVNAHEMPLTEPSY---IAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ + ++C G ++ H+ P E +A+F R + H+C PN+ D G++++
Sbjct: 242 ETNVFELCPGPTEIIPHQSPGVERGTQYGLAVFLRITLANHSCAPNVTHQADDRGRMMVT 301
Query: 132 AMKPIAPGDHLSICYTDPLWGTIN---RRHHLQTSKYFICQCERC 173
A++ IAPG+ Y D L ++ RR Q F C C RC
Sbjct: 302 ALRDIAPGEECCTSYFD-LSEYVDLQARRKKTQELFTFTCTCPRC 345
>gi|440483321|gb|ELQ63734.1| hypothetical protein OOW_P131scaffold00952g22 [Magnaporthe oryzae
P131]
Length = 492
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQV-LLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
A+F AS + H+C PN + S+ + L+ A+ ++PGD L++ Y GT +
Sbjct: 161 AVFRNASRLNHSCVPNCHFSWDRVTRTCLVHAITDVSPGDELTVAYDSR--GTAQTTAEM 218
Query: 161 QTSKYFICQCERCRDPTE 178
+T+ F C C RC P++
Sbjct: 219 ETTMGFRCACPRCSLPSD 236
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 82/182 (45%), Gaps = 21/182 (11%)
Query: 13 PSYECIIALRCLYKKSHQPQ--IWDKLM---QLEAHVEEYKNSPK--YENDRRNVVQFLL 65
PS + R L K+ Q + ++ M + E+H+ + N K END + L
Sbjct: 100 PSETVRLTARILAKQKTQTERTASERFMSVKEFESHLSKLDNEKKELIEND----ISALH 155
Query: 66 NFFKLNEEFTQEEILK-ICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSF 122
F+ N L+ + + N + E S++ AIF + + H+C PN+ ++
Sbjct: 156 RFYSKNVHNCDNAALEFLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTY 215
Query: 123 TDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC----RDPT 177
G V +RA++ I G+ + Y D L+ T +R L+ S +F C C C +DP
Sbjct: 216 --KGTVAEVRAVQEIHAGEEVFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECSTKQKDPA 273
Query: 178 EL 179
+L
Sbjct: 274 KL 275
>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
Length = 704
Score = 47.8 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 4/90 (4%)
Query: 88 VNAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC 145
V E + EP + AI+ S H+CYPN+ + +G V++ +++ I G + C
Sbjct: 503 VRDDETKILEPRNVGGAIYTTVSLTNHSCYPNIVRHSFPNGTVVVTSLRYIPEGSEILDC 562
Query: 146 YTDPLWGTI--NRRHHLQTSKYFICQCERC 173
Y +RR L YF CQCE C
Sbjct: 563 YGQHFLENKRDSRRRLLAEKYYFDCQCEPC 592
>gi|170116896|ref|XP_001889637.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164635352|gb|EDQ99660.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 453
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 54/121 (44%), Gaps = 10/121 (8%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPL--------TEPSYIAIFDRASFIEHNCYPNL 118
F L TQ+ +I GII+ N +++P +Y A+ + S I H+C PN
Sbjct: 214 FMALANSHTQDGSGRILGIIRTNGYKVPGLYDGDEDDNAGAYTAVLNVMSRINHSCSPNT 273
Query: 119 YKSF-TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT 177
+F S LRA++ I G+ L Y D ++ T + R + IC C C T
Sbjct: 274 AYNFHMASLSFELRAVRDIKQGEELFTSYCD-IFRTKSERAEDLATYSIICACPGCVGAT 332
Query: 178 E 178
+
Sbjct: 333 K 333
>gi|389646263|ref|XP_003720763.1| hypothetical protein MGG_02943 [Magnaporthe oryzae 70-15]
gi|351638155|gb|EHA46020.1| hypothetical protein MGG_02943 [Magnaporthe oryzae 70-15]
Length = 336
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 42/78 (53%), Gaps = 3/78 (3%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQV-LLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
A+F AS + H+C PN + S+ + L+ A+ ++PGD L++ Y GT +
Sbjct: 92 AVFRNASRLNHSCVPNCHFSWDRVTRTCLVHAITDVSPGDELTVAYDS--RGTAQTTAEM 149
Query: 161 QTSKYFICQCERCRDPTE 178
+T+ F C C RC P++
Sbjct: 150 ETTMGFRCACPRCSLPSD 167
>gi|307195197|gb|EFN77181.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 693
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 41/81 (50%), Gaps = 4/81 (4%)
Query: 97 EPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW--G 152
EP I AI+ S + H+CYPN+ + +G +++RA++ I+ G + CY
Sbjct: 493 EPRNIGGAIYTTVSLVNHSCYPNVVRHSYPNGMIVVRALRSISKGCEIFDCYGPQFLSES 552
Query: 153 TINRRHHLQTSKYFICQCERC 173
+ RR L F+C+C C
Sbjct: 553 RLTRREFLWKKYRFLCECNAC 573
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 46/88 (52%), Gaps = 1/88 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
+++ +++ H+C PN+ F QV + A++ I G+ + Y D RR L+
Sbjct: 155 GLYEEVNYMNHSCTPNVICVFNKLPQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRRRFLK 214
Query: 162 TSKYFICQCERC-RDPTELNTFYDGVKC 188
+ +F+C+C+RC ++ E +F KC
Sbjct: 215 QNYFFLCECKRCIKEQNEGVSFVRCQKC 242
>gi|395328144|gb|EJF60538.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 384
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTD-PLWGTINR 156
S IF +A+ + H+C PN+ +++ Q + RA++ +APG+ L + Y D + GT
Sbjct: 166 SREGIFLQAARLNHSCRPNVARTWDVGAQEMAFRALRDVAPGEELCMSYVDVDILGTREE 225
Query: 157 RH-HLQTSKYFICQCERC 173
R ++ + F+C CE C
Sbjct: 226 RGAEIEGAFGFVCACEAC 243
>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 742
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 6/92 (6%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL--WGTINRRHH 159
AIF +S + H+C PN++ S V+++A + I+ G+ ++ CY ++R+
Sbjct: 456 AIFPTSSLLNHSCDPNIFSS-NILKYVVIKASRDISEGEEITNCYGPNFLRMRVVDRQAS 514
Query: 160 LQTSKYFICQCERCRDPTELNTF---YDGVKC 188
L+ +F C+C C DP + F ++G+ C
Sbjct: 515 LKNQYHFDCECNTCLDPQADDLFFKTFEGLVC 546
>gi|170058970|ref|XP_001865157.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877852|gb|EDS41235.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 556
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 37/67 (55%), Gaps = 6/67 (8%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
A+ + H+C PN +F DSG + + ++PI GD +SI Y P+W H + + F
Sbjct: 398 ATLVNHSCDPNA-AAFVDSGTIKIAVLRPIQRGDQISIAYP-PVWWNA----HDGSEQAF 451
Query: 167 ICQCERC 173
+C+C C
Sbjct: 452 LCKCAVC 458
>gi|449540254|gb|EMD31248.1| hypothetical protein CERSUDRAFT_60339 [Ceriporiopsis subvermispora
B]
Length = 467
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 3/97 (3%)
Query: 82 ICGIIQVNAHEMPLTE-PSYIAIFDRASFIEHNCYPNLY-KSFTDSGQVLLRAMKPIAPG 139
+ G+++ ++P E AI + S + H+C PN + + T S + L A++PIA G
Sbjct: 168 VTGVLRKVVPDLPKGEIHELTAIGEVLSRVNHSCRPNAHVRMDTHSLSLQLVALRPIASG 227
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
+ +++ YTD L R+ L F C C C DP
Sbjct: 228 EQVTVAYTDILAPYTTRKRKL-APYGFTCSCLSCADP 263
>gi|255074845|ref|XP_002501097.1| predicted protein [Micromonas sp. RCC299]
gi|226516360|gb|ACO62355.1| predicted protein [Micromonas sp. RCC299]
Length = 375
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 3/68 (4%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
S H+C PN+ + SG +RA +A G L+ICYTD RR L+ +
Sbjct: 304 SLFNHSCDPNVDVDWV-SGDAGMRARTRVDVAKGTELTICYTDGGAPVDARRGALEHAYG 362
Query: 166 FICQCERC 173
F+C+CERC
Sbjct: 363 FVCRCERC 370
>gi|321472908|gb|EFX83877.1| hypothetical protein DAPPUDRAFT_239494 [Daphnia pulex]
Length = 291
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 32/62 (51%), Gaps = 5/62 (8%)
Query: 127 QVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
Q+ +RA PI GD Y L G R+ L+ SK F C C RC DPTE +T+
Sbjct: 10 QMTVRASVPIMRGD-----YALALEGLRERQSLLRQSKLFECDCSRCSDPTECSTYLSAQ 64
Query: 187 KC 188
+C
Sbjct: 65 RC 66
>gi|67465651|ref|XP_648998.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56465330|gb|EAL43612.1| hypothetical protein, conserved domain containing [Entamoeba
histolytica HM-1:IMSS]
gi|449704953|gb|EMD45105.1| set and mynd domain containing protein [Entamoeba histolytica KU27]
Length = 426
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+F + + H+C PN + + TD L L A PI+PGD L+I Y D R+ L
Sbjct: 350 GLFKYLNTLNHSCSPNCFLASTDDSCALSLIASCPISPGDELTISYIDNTLPYSQRQSLL 409
Query: 161 QTSKYFICQCERCR 174
S +F C C +C
Sbjct: 410 YDSYHFYCHCPKCN 423
>gi|339236741|ref|XP_003379925.1| MYND finger protein [Trichinella spiralis]
gi|316977375|gb|EFV60485.1| MYND finger protein [Trichinella spiralis]
Length = 476
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 111 EHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--HLSICYTDPLWGTINRRHHLQTSKYFIC 168
+H+C PN F VL + I D I Y D RR LQ+ YF C
Sbjct: 194 DHSCRPNTAMVFNGFRAVLRPLVNGIDTADPSQCFIAYVDVGRSRYQRRKELQSKWYFWC 253
Query: 169 QCERCRDPTE 178
+CERCRDP++
Sbjct: 254 ECERCRDPSD 263
>gi|328850125|gb|EGF99294.1| hypothetical protein MELLADRAFT_68704 [Melampsora larici-populina
98AG31]
Length = 467
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 50/104 (48%), Gaps = 1/104 (0%)
Query: 82 ICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPN-LYKSFTDSGQVLLRAMKPIAPGD 140
+ IQ N+ ++ L Y+A+F AS H+C PN +Y + + ++ + A++ I G+
Sbjct: 254 MVSAIQRNSFQLMLEGEPYVALFLDASRANHDCRPNVVYHTDGATLKLNMYAVREIEVGE 313
Query: 141 HLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
L Y + T RR +L+ F CQC C P L D
Sbjct: 314 ELVAAYLNIQQPTETRRAYLRQHYGFECQCSVCSLPEHLIELSD 357
>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/164 (24%), Positives = 79/164 (48%), Gaps = 39/164 (23%)
Query: 36 KLMQLEAHVEEYKNSPKYENDRRNVVQFLL-NFFKLNEE----------------FTQEE 78
+L++L++H P+ ++D R + + N +L E+ + EE
Sbjct: 151 RLLKLQSH------EPEVDSDEREWLSGIARNTLRLMEQDVDEGVKVFGEKDAPGYGVEE 204
Query: 79 ILKICGIIQVNAH------EMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LR 131
++++ ++ N+H E P +EP A++ + S H+C P+ F + G L +R
Sbjct: 205 LVRLMCCVRCNSHTLYASEEWP-SEPVGTAVYLKGSAFNHSCLPS--AEFYNEGTSLRVR 261
Query: 132 AMKPIAPGDHLSICY---TDPLWGTINRRHHLQTSKYFICQCER 172
+++ I+ G+ ++I Y T+ LW +RR L F C+C+R
Sbjct: 262 SVRDISAGEEVTISYVPVTETLW---DRRQALWRQYKFDCECDR 302
>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
Length = 452
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 71/159 (44%), Gaps = 10/159 (6%)
Query: 40 LEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQE------EILKICGIIQVNAHEM 93
L + EE + + D + + F+ LNE + E E+L I G + +N+ +
Sbjct: 164 LNNYREEVIYRKEIKEDTKRMEYFMTICGVLNEYLSNEILPNSVELLGIYGRMCINSFNI 223
Query: 94 PLTEPSYIA--IFDRASFIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPL 150
E I I+ S ++H+C PN +F ++ L P D + I Y D +
Sbjct: 224 LNGEMQAIGTGIYLAPSILDHSCSPNAVATFDGFKLRIQLTQELPKLEWDSIRISYIDLM 283
Query: 151 WGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+R+ L+ YF C C RC++ +++ ++ KCP
Sbjct: 284 NSKSHRKKELKDRYYFDCDCPRCKN-DDIDCYHYAAKCP 321
>gi|449524736|ref|XP_004169377.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 346
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + ++ L+A++ + P + L ICY D R+ L
Sbjct: 266 AVYMLPSFYNHDCDPNTHIIWINNANAKLKALRDVDPDEELRICYIDTSMDYDARQTLLH 325
Query: 162 TSKYFICQCERC 173
FIC C RC
Sbjct: 326 RGFGFICNCARC 337
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 47/102 (46%), Gaps = 4/102 (3%)
Query: 79 ILKICGIIQVNAHEM---PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
+L++C I+ N+ + L ++ SFI H+C PN F + + +RA+K
Sbjct: 170 LLELCARIKNNSFAICNEELQSDVGTGVYLNCSFINHSCEPNCVAEF-NMRTLKIRAVKN 228
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPT 177
I G+ + I Y D + R+ L + +F C C C T
Sbjct: 229 ITAGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSCNAKT 270
>gi|389624275|ref|XP_003709791.1| hypothetical protein MGG_09180 [Magnaporthe oryzae 70-15]
gi|351649320|gb|EHA57179.1| hypothetical protein MGG_09180 [Magnaporthe oryzae 70-15]
gi|440467268|gb|ELQ36499.1| hypothetical protein OOU_Y34scaffold00656g6 [Magnaporthe oryzae
Y34]
gi|440488608|gb|ELQ68324.1| hypothetical protein OOW_P131scaffold00255g26 [Magnaporthe oryzae
P131]
Length = 443
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/113 (24%), Positives = 52/113 (46%), Gaps = 4/113 (3%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG 126
FF L ++ + ++ ++ N + + + Y+ ++ + I H C PN + F S
Sbjct: 216 FFDLARSYSGGDAVE--DVLNTNTFAVGVGDGDYMGLYPEIARINHACSPNSFSRFHPSD 273
Query: 127 QVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+ + AM+ I PG+ ++I Y PL + R + F C C+ C P E
Sbjct: 274 LTMDVGAMRDIMPGEEITISYI-PLGLPSSHRAGQIRAWGFACTCQLCSTPAE 325
>gi|83774976|dbj|BAE65099.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 323
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
A+ I H+C N +F DS + ++A +PI GD + I Y D RR L+ YF
Sbjct: 35 AALINHSCNYNAAVTF-DSDNLYIKATRPIQKGDQIFISYIDATNPVKLRRSELRERYYF 93
Query: 167 ICQCERC-RDPTE 178
C C +C +D TE
Sbjct: 94 DCHCAKCAKDLTE 106
>gi|302774507|ref|XP_002970670.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
gi|300161381|gb|EFJ27996.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
Length = 292
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 64/163 (39%), Gaps = 18/163 (11%)
Query: 28 SHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNF-FKLNEEFTQEEILKICGII 86
+ + + +L+ L+ + N P+ V+ L F+L + + E L +
Sbjct: 32 ARNAEDFQRLLLLDGQASVFANDPRPAGVAELVLALLRTHCFELAKGLSLEMFLNLLAKD 91
Query: 87 QVNAHEMPLTEPS---------YIAIFDRASFIEHNCYPN------LYKSFTDSGQVLLR 131
++NA L PS A + +AS + H+C PN L K + + +R
Sbjct: 92 KINA--FGLMAPSSGTAQRKVRAYARYAQASMLNHDCLPNACRFEYLDKPGASNTDIYIR 149
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+ + G + I Y W RR L F C CERCR
Sbjct: 150 LLHDVPQGSEICISYFPVNWNYKERRERLVEDYGFECNCERCR 192
>gi|391868965|gb|EIT78172.1| hypothetical protein Ao3042_05555 [Aspergillus oryzae 3.042]
Length = 324
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
A+ I H+C N +F DS + ++A +PI GD + I Y D RR L+ YF
Sbjct: 36 AALINHSCNYNAAVTF-DSDNLYIKATRPIQKGDQIFISYIDATNPVKLRRSELRERYYF 94
Query: 167 ICQCERC-RDPTE 178
C C +C +D TE
Sbjct: 95 DCHCAKCAKDLTE 107
>gi|326518712|dbj|BAJ92517.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 166
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 48/118 (40%), Gaps = 18/118 (15%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA------------------SFIEHNCY 115
T++ + + I+VNA + L SY + A SF H+C
Sbjct: 45 LTKQWYINVLARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCD 104
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PN + + ++ L ++ I G+ L ICY D R+ L F C+C+RC
Sbjct: 105 PNTHIVWLENADAKLNTLRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRC 162
>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 387
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL---WGTINRRH 158
A+F S I H+C+PN F + + L A++ I PGD ++I Y P R+
Sbjct: 291 ALFQNHSSINHSCFPNAASVFDGNHVLRLVAIRMIEPGDEINISYIAPCELDHSRHTRQK 350
Query: 159 HLQTSKYFICQCERCRD 175
+LQ + F C+C +C +
Sbjct: 351 YLQENYVFTCRCIKCEE 367
>gi|449544639|gb|EMD35612.1| hypothetical protein CERSUDRAFT_124865 [Ceriporiopsis subvermispora
B]
Length = 406
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/100 (26%), Positives = 44/100 (44%), Gaps = 6/100 (6%)
Query: 80 LKICGIIQVNAHEMPLTEP-----SYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAM 133
L GI NA +P T+ + + +F S H+C PN+ +++ + + + R +
Sbjct: 182 LPALGIFDTNALPIPTTDAKGNPQTDLGLFLLGSRFNHSCTPNVARTWDAAAKAMAFRVL 241
Query: 134 KPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ + PG+ L Y D L R L+ F C C C
Sbjct: 242 RAVEPGEELCTNYVDVLGTRAERAAELEEKYAFACLCSVC 281
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 47.4 bits (111), Expect = 0.003, Method: Composition-based stats.
Identities = 33/108 (30%), Positives = 48/108 (44%), Gaps = 4/108 (3%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA--SFIEHNCYPNLYKSFTDSGQ 127
L E E+ +I IQ NA + IF + + H+C PN + F D
Sbjct: 197 LQGEEVLEKAREILCKIQTNAFNRLDADTGMAGIFLDVGLAMVNHSCVPNAFIGF-DKRT 255
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
+LRA +PI G+ ++I Y D R L+ +F C C RC+D
Sbjct: 256 AILRAERPIQEGEEITISYIDNTLPKAARYEALRLY-HFQCDCVRCKD 302
>gi|367046502|ref|XP_003653631.1| hypothetical protein THITE_2144701 [Thielavia terrestris NRRL 8126]
gi|347000893|gb|AEO67295.1| hypothetical protein THITE_2144701 [Thielavia terrestris NRRL 8126]
Length = 399
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 32/53 (60%), Gaps = 1/53 (1%)
Query: 122 FTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
F G+ L +R++K IAPG+ ++ICY DP + R+ L+ +F C C RC
Sbjct: 122 FCSKGRELRVRSLKKIAPGEEITICYIDPTFDVAARQEVLKREYFFDCSCARC 174
>gi|380477969|emb|CCF43859.1| hypothetical protein CH063_13445 [Colletotrichum higginsianum]
Length = 463
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 39/76 (51%), Gaps = 3/76 (3%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRR 157
+ A+F S + H+C PN+ F D + R A++ I PG+ L++ Y DP+ RR
Sbjct: 250 FYAVFVNTSRMNHDCRPNVDYWF-DPRTLTQRTVAIRDIIPGEELTLSYIDPMQSRAARR 308
Query: 158 HHLQTSKYFICQCERC 173
L T+ F C C C
Sbjct: 309 ERLHTTWGFHCSCNHC 324
>gi|389739036|gb|EIM80231.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 445
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 68/158 (43%), Gaps = 13/158 (8%)
Query: 22 RCLYKKSHQPQ--IWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEI 79
+ ++ K +P W K QL + +SP+ D++ + + L E +E+
Sbjct: 256 KSVWYKGAEPDRPTWTKAHQLCVQAFQTPSSPQ---DQKKLAKILRKPL---PEHIAKEL 309
Query: 80 LKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR----AMKP 135
+ G ++ +M L ++ ++ S + H+C PNL D L R A +
Sbjct: 310 FEYGGFLR-GLGKMNLNLEAHGGLYTLHSHLNHSCDPNLSIRHLDPRTSLSRITVIAKRD 368
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
I G+ L++ Y DP RR L + +C+C RC
Sbjct: 369 IDAGEELTVTYVDPGLRLKRRREELGGWGFGVCKCARC 406
>gi|237843241|ref|XP_002370918.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|211968582|gb|EEB03778.1| hypothetical protein, conserved [Toxoplasma gondii ME49]
gi|221481883|gb|EEE20253.1| conserved hypothetical protein [Toxoplasma gondii GT1]
gi|221502380|gb|EEE28113.1| hypothetical protein, conserved [Toxoplasma gondii VEG]
Length = 709
Score = 47.0 bits (110), Expect = 0.003, Method: Composition-based stats.
Identities = 28/96 (29%), Positives = 43/96 (44%), Gaps = 5/96 (5%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYT--DPLWGT 153
TE + +++R S + H+C + + +LRA + GD L+I Y D L+ +
Sbjct: 431 TEQHGLVLYNRISMMAHSCRATACWHYGEDDAFILRARVKLQAGDELTISYIGDDDLFKS 490
Query: 154 INRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
N R F CQC RC P + G +CP
Sbjct: 491 TNVRREKVYGWLFTCQCVRCAAPVDNAR---GFRCP 523
>gi|57970742|ref|XP_564258.1| AGAP011234-PA [Anopheles gambiae str. PEST]
gi|55244729|gb|EAL41557.1| AGAP011234-PA [Anopheles gambiae str. PEST]
Length = 646
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 39/78 (50%), Gaps = 6/78 (7%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHL----SICYTDPLWGTINR 156
IAI+ S + H+C PN+ G++ + A +PIA G+ L DP R
Sbjct: 472 IAIYPLFSMVNHSCIPNVAPIHLLDGRLAMVATRPIAAGEQLYNINGFSTFDP--DDSAR 529
Query: 157 RHHLQTSKYFICQCERCR 174
RH LQ S +F C+C C+
Sbjct: 530 RHALQLSHFFKCRCASCQ 547
>gi|393236329|gb|EJD43878.1| hypothetical protein AURDEDRAFT_88438 [Auricularia delicata
TFB-10046 SS5]
Length = 385
Score = 47.0 bits (110), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 8/101 (7%)
Query: 82 ICGIIQVNAHEMPLTEPS-----YIAIFDRASFIEHNCYPNLYKSF-TDSGQVLLRAMKP 135
+ GI N++ + + PS Y +F S + H+C PN + +++ + LRA++P
Sbjct: 151 LTGIYNTNSYPL-VGLPSDNIIEYSGVFPTLSRLNHSCRPNANPQWNSETLTIELRALRP 209
Query: 136 IAPGDHLSICYT-DPLWGTINRRHHLQTSKYFICQCERCRD 175
I G+ ++I Y+ D L RR L+ + +F C C C D
Sbjct: 210 IRAGEEVTITYSPDLLIPAYQRRASLREAYHFTCTCTACSD 250
>gi|167377998|ref|XP_001734627.1| set and mynd domain containing protein [Entamoeba dispar SAW760]
gi|165903780|gb|EDR29210.1| set and mynd domain containing protein, putative [Entamoeba dispar
SAW760]
Length = 426
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+F + + H+C PN + + TD L L A PI+PGD L+I Y D R+ L
Sbjct: 350 GLFKYLNTLNHSCSPNCFLANTDDSFALSLIASCPISPGDELTISYIDNTLPYSQRQSLL 409
Query: 161 QTSKYFICQCERC 173
S +F C C +C
Sbjct: 410 YDSYHFYCHCPKC 422
>gi|440801440|gb|ELR22460.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 545
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A F S + H+C PN ++ + GQ L+ A++ IA G+ LSI Y R+ L+
Sbjct: 408 AFFPLLSTLNHSCEPNCQVAYLEDGQALVFALRDIAAGEELSISYIYRHLPLAERQQQLR 467
Query: 162 TSKYFICQCERC 173
S F+C C RC
Sbjct: 468 -SYGFVCACPRC 478
>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
Length = 393
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S ++ L+A+ PI GD + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIIVLKALAPIQQGDEICISYLDECMLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YVFICQCPKCR 364
>gi|146413184|ref|XP_001482563.1| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 11/117 (9%)
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL--- 118
+FL F +++ EE + + G +N + IF S + H+C+PN
Sbjct: 309 KFLKVFPSAHDQVPFEEFMMMMGTYNINNLDS--------CIFLTQSHLNHSCHPNTDVQ 360
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
+ + +G + + A + I G+ L+ Y +P R+ L+ + FIC C+RC+D
Sbjct: 361 ASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQRQRELRVNWGFICSCQRCKD 417
>gi|223634690|sp|A5DQN2.2|SET5_PICGU RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|190348931|gb|EDK41485.2| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 11/117 (9%)
Query: 62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL--- 118
+FL F +++ EE + + G +N + IF S + H+C+PN
Sbjct: 309 KFLKVFPSAHDQVPFEEFMMMMGTYNINNLDS--------CIFLTQSHLNHSCHPNTDVQ 360
Query: 119 YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
+ + +G + + A + I G+ L+ Y +P R+ L+ + FIC C+RC+D
Sbjct: 361 ASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQRQRELRVNWGFICSCQRCKD 417
>gi|356519848|ref|XP_003528581.1| PREDICTED: uncharacterized protein LOC100812233 [Glycine max]
Length = 527
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/115 (26%), Positives = 57/115 (49%), Gaps = 19/115 (16%)
Query: 78 EILKICGIIQVNAHEMPLTEPSYIA-IFDRA------------SFIEHNCYPNLYKSFTD 124
+++K+ GI+ VN+ LTE + A + R SF+ H+C PN +
Sbjct: 283 DMVKLVGILDVNS----LTEDAVSANVLTRGNDCYGVGLWLLPSFVNHSCVPNARRLHVG 338
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
+++ A K + G+ ++ Y DPL G +++R + + C+C+RCR E+
Sbjct: 339 D-YLVVHASKDLKAGEEVTFAYFDPLCG-LSKRKEMSVNWGIHCKCKRCRFEGEV 391
>gi|397612716|gb|EJK61854.1| hypothetical protein THAOC_17577 [Thalassiosira oceanica]
Length = 750
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 5/84 (5%)
Query: 95 LTEPSYIAIFDRASFIEHNCYPNL---YKSFTDSG--QVLLRAMKPIAPGDHLSICYTDP 149
L P A++ A+ + H+C PN YK +S Q+ + A K I+ G+ L Y D
Sbjct: 472 LYAPIGEAVYPHAALLNHSCSPNCILRYKIGLESSPPQLEIVACKDISSGEELVHSYVDL 531
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
+ T RR+ L+ + FIC+C+RC
Sbjct: 532 VLPTETRRNRLRDTHGFICECKRC 555
>gi|365986138|ref|XP_003669901.1| hypothetical protein NDAI_0D03440 [Naumovozyma dairenensis CBS 421]
gi|343768670|emb|CCD24658.1| hypothetical protein NDAI_0D03440 [Naumovozyma dairenensis CBS 421]
Length = 566
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 33/68 (48%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
SF+ HNC PN+ + + + KPI G+ L Y PL G RR L+ +
Sbjct: 392 SFLNHNCEPNVRFEIDSKLCLKIFSRKPINKGEQLFTTYVSPLHGVNLRRRELRVNYGIF 451
Query: 168 CQCERCRD 175
C+C RC +
Sbjct: 452 CKCNRCEN 459
>gi|195389466|ref|XP_002053397.1| GJ23360 [Drosophila virilis]
gi|194151483|gb|EDW66917.1| GJ23360 [Drosophila virilis]
Length = 393
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI GD + I Y D L + + RH
Sbjct: 294 SKINHSCVPNAQSTFPYSNDIVVLKALAPIQKGDEICISYLDDCQLERSRHSRHKTLREN 353
Query: 165 Y-FICQCERCR 174
Y F+CQC +CR
Sbjct: 354 YIFVCQCPKCR 364
>gi|156543612|ref|XP_001604392.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 629
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 7/100 (7%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIA---IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
+ + C I+ N+ + EP A ++ S + H C PN ++ F D ++ RA++P
Sbjct: 443 VFRNCVIVSSNSFSVQ-QEPGVKAGSGLYVAGSLMNHACSPNTFRHF-DGLTMITRALEP 500
Query: 136 IAPGDHLSICYTDP--LWGTINRRHHLQTSKYFICQCERC 173
I GD + CY R+ + +F CQC+ C
Sbjct: 501 IKAGDQIFTCYGGGYQYMSRGERKKKMMDEYFFDCQCQSC 540
>gi|440796609|gb|ELR17718.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 419
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 66/152 (43%), Gaps = 8/152 (5%)
Query: 28 SHQPQIWDKLMQLEAHVEEYKNSP-KYENDRRNV--VQFLLNFFKLNEEFTQEEILKICG 84
S +P+ D + L A+V SP +Y+ D V V+ +L L ++T +E+++
Sbjct: 226 SGKPRFAD-FLCLVANVPSLAISPDRYKGDMEVVRYVEKMLASCSLAHKYTSQELMEYIS 284
Query: 85 IIQVNAHEM---PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDH 141
I+ N M L + ++ S+ H+C PN + V A I GD
Sbjct: 285 KIESNTFGMWSDKLVSMGMV-LYAEGSYFNHSCAPNCGTRTGEGQAVQFVATHDIPAGDE 343
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ I Y D T +RR L + +F C C C
Sbjct: 344 VCIRYIDVDKPTTSRRSELLSHYHFTCMCPLC 375
>gi|367009358|ref|XP_003679180.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
gi|359746837|emb|CCE89969.1| hypothetical protein TDEL_0A06370 [Torulaspora delbrueckii]
Length = 369
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 2/73 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT-INRRHHL 160
A++ RAS+ H+C PNL KS G ++ A K + G+ L I Y+ L T + RR L
Sbjct: 290 AVYPRASYFNHSCDPNLTKSRI-KGTMVFTANKDVPKGEPLCIDYSGLLSHTMLKRRQLL 348
Query: 161 QTSKYFICQCERC 173
+ + +F C C RC
Sbjct: 349 KENWFFECCCTRC 361
>gi|260796111|ref|XP_002593048.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
gi|229278272|gb|EEN49059.1| hypothetical protein BRAFLDRAFT_74375 [Branchiostoma floridae]
Length = 390
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
Query: 91 HEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDP 149
H + L E + ++ +A+ I H+C N +F G L +RA+ I PG+ + YT+
Sbjct: 117 HNLDLREIG-VGLYPQAAMINHSCKSNCVSTF--RGPTLQIRALVDIQPGEEVCYSYTEK 173
Query: 150 LWGTINRRHHLQTSKYFI-CQCERCRDPTELNTFYDGVKCP 189
T RR L+ KYF CQC C D T+ + VKCP
Sbjct: 174 GNVTHERRDELR--KYFFECQCPHCLD-TDRDAIMKSVKCP 211
>gi|171679715|ref|XP_001904804.1| hypothetical protein [Podospora anserina S mat+]
gi|170939483|emb|CAP64711.1| unnamed protein product [Podospora anserina S mat+]
Length = 401
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 29/130 (22%), Positives = 58/130 (44%), Gaps = 3/130 (2%)
Query: 46 EYKNSPKYE-NDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIF 104
E+ N +Y ++R ++ + E + +I +++ N + + ++A+F
Sbjct: 155 EFINGDRYSAKNKRRMLDLAVGNLPEETRREVEGLARIVDVLRTNGFGIEIAGEGHLALF 214
Query: 105 DRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTS 163
S + HNC PN + + S + + A++ I G+ ++ Y PL T +R +
Sbjct: 215 LEGSRVNHNCRPNAFWRWVPSKMAMEVVALRGIGKGEEVAHSYA-PLGYTHEQRKAVLQP 273
Query: 164 KYFICQCERC 173
F CQC C
Sbjct: 274 WGFQCQCALC 283
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 32/67 (47%), Gaps = 1/67 (1%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYF 166
AS H C PN F D +RA+ I P ++I Y DP +R+ L+ +F
Sbjct: 218 ASLFNHTCRPNAVIVF-DGALAEIRAIDTIEPDTEITISYIDPAHARAHRKRALREKYFF 276
Query: 167 ICQCERC 173
C+C RC
Sbjct: 277 DCRCVRC 283
>gi|167525114|ref|XP_001746892.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774672|gb|EDQ88299.1| predicted protein [Monosiga brevicollis MX1]
Length = 500
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 33/69 (47%)
Query: 108 SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
+ + H+C+PN G++ +R + + G L + Y D R+ HL SK F
Sbjct: 244 ALVNHSCHPNCAVVSRPGGELEVRTLGTVRAGAELFVSYVDLTLPRAERQAHLLASKEFT 303
Query: 168 CQCERCRDP 176
C C RC+ P
Sbjct: 304 CTCYRCQHP 312
>gi|297739785|emb|CBI29967.3| unnamed protein product [Vitis vinifera]
Length = 660
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 24/72 (33%), Positives = 36/72 (50%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN++ + D+ L+A++ I G+ L ICY D R+ L
Sbjct: 585 AVYMLPSFYNHDCDPNVHIIWIDNVNARLKALREIEAGEELRICYIDASMDHDARQTILF 644
Query: 162 TSKYFICQCERC 173
F C C RC
Sbjct: 645 QGFGFRCSCLRC 656
>gi|340960170|gb|EGS21351.1| hypothetical protein CTHT_0032060 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 420
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 29/45 (64%)
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+R++K I+ G+ ++ICY DP G +R+H L+ + C C RC+
Sbjct: 150 VRSLKRISAGEEITICYVDPTLGVASRQHLLEQEYHIKCSCRRCK 194
>gi|320162626|gb|EFW39525.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 779
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 106 RASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
RAS + H+C N S + +V++RA IA G L + Y D L + + ++ +
Sbjct: 430 RASLLNHSCLSNCTVSLR-ANRVVVRAHSRIAEGQELLVSYRDLLQARVQWQAQVERGYH 488
Query: 166 FICQCERCRDPTEL 179
F C C RC P +L
Sbjct: 489 FACSCLRCTAPLDL 502
>gi|145347218|ref|XP_001418071.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578299|gb|ABO96364.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 654
Score = 47.0 bits (110), Expect = 0.004, Method: Composition-based stats.
Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 4/78 (5%)
Query: 101 IAIFDRASFIEHNCYPNLYKS-FTDSGQVLLRAMKPIAPGDHLSICYTDP--LWGTIN-R 156
+ +FD S H+C PN S +D G+V L +++PI G+ ++I Y P W R
Sbjct: 235 MVMFDAISACNHSCDPNAEVSHVSDEGEVSLYSLRPIERGEGITIAYGKPSLRWLPARCR 294
Query: 157 RHHLQTSKYFICQCERCR 174
+ L+ YF C C +C+
Sbjct: 295 KKALRRDWYFDCACAQCK 312
>gi|428170891|gb|EKX39812.1| hypothetical protein GUITHDRAFT_143201 [Guillardia theta CCMP2712]
Length = 496
Score = 47.0 bits (110), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
Query: 101 IAIFDRASFIEHNCYPN-LYKSFTDSGQVLLRAMKPIAPGDHLSICY--TDPLWGTINRR 157
+A+F R + + H+C PN ++ S T+ G++ L AM+ I G+ ++ Y D + R
Sbjct: 162 LALFPRCAKVNHSCRPNVMFSSQTEDGKLRLIAMRRIERGEEVTFSYLGEDGDVMSREER 221
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
K F+C C RC ++ G++CP
Sbjct: 222 RERMRGKDFLCSCARCEGVDDVR----GIRCP 249
>gi|47204414|emb|CAF87026.1| unnamed protein product [Tetraodon nigroviridis]
Length = 386
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 15/114 (13%)
Query: 75 TQEEILKICGIIQVNAHEMP---LTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQ---- 127
++E+I + GII+ N + + + +F + H+C+PN Q
Sbjct: 116 SREDIAHVLGIIKTNGFTLSDQRGLQAVGVGLFPNLCLVNHDCWPNCSVVLNHGNQSATN 175
Query: 128 --------VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
V LRA+ + G+ L++ Y D L + R+ L +F C C+RC
Sbjct: 176 PALHSQRRVELRALGKVCQGEELTVSYVDFLDTSAERQRKLSELFFFHCTCQRC 229
>gi|221054468|ref|XP_002258373.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193808442|emb|CAQ39145.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 1074
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 1/78 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI-N 155
+ + I +F AS + H+C+PN S+ D ++ +M+ I + ++I D L+ +I +
Sbjct: 559 DKTKIMLFQYASLLNHSCFPNASYSYIDINRICFISMRTINRYEEITISLIDELYASIHH 618
Query: 156 RRHHLQTSKYFICQCERC 173
RR L K C C RC
Sbjct: 619 RRSKLNEVKNGTCFCNRC 636
>gi|407405659|gb|EKF30538.1| hypothetical protein MOQ_005654 [Trypanosoma cruzi marinkellei]
Length = 585
Score = 46.6 bits (109), Expect = 0.004, Method: Composition-based stats.
Identities = 38/135 (28%), Positives = 61/135 (45%), Gaps = 18/135 (13%)
Query: 52 KYENDRRNVVQFLLNFFKLNEEFTQEE----ILKICGIIQVNA------HEMPLTEPSYI 101
+ E+ V Q L +L ++ EE I +I G+++ NA +P+ + ++
Sbjct: 288 RLESHEVEVSQGLTPLLRLVQDLFNEEDPTFITRILGVVRCNALAVCDASGLPVGQALHV 347
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHL 160
A S+ H+C PN + V RA++P G+ L+I Y L W RR L
Sbjct: 348 A--SVTSYFNHSCLPNC--AIEADAIVTTRAIRP---GEELTISYLPQLYWPAWLRREEL 400
Query: 161 QTSKYFICQCERCRD 175
+F C+C RC D
Sbjct: 401 AERYFFDCRCMRCGD 415
>gi|302686744|ref|XP_003033052.1| hypothetical protein SCHCODRAFT_107538 [Schizophyllum commune H4-8]
gi|300106746|gb|EFI98149.1| hypothetical protein SCHCODRAFT_107538, partial [Schizophyllum
commune H4-8]
Length = 404
Score = 46.6 bits (109), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 64/158 (40%), Gaps = 28/158 (17%)
Query: 34 WDKLMQLEAHVEEYK--NSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAH 91
W++ ++L+ E++ + P R N L + K GI +
Sbjct: 171 WERFLELKTSHSEHRAGSGPLLSRARTNAWHALGEY-------------KFAGIDAQDGQ 217
Query: 92 EMPLTEPSYIAIFDRASFIEHNCYPNL---YKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
Y+AIFD S + H+C PN + S T SG LRA++ I PG+ +++ Y
Sbjct: 218 --------YLAIFDELSRMNHSCRPNALYHWDSSTFSGS--LRAVRDIEPGEEITVSYCG 267
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
+ + R L F C C C + + Y+ +
Sbjct: 268 EVDRPYSDRRALLAPYGFGCDCRACAEGDAADIRYEQI 305
>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Metaseiulus occidentalis]
Length = 775
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN---RRH 158
AI+ AS + H+C PN+ F +++++++PIA G+ + CY P + + R+
Sbjct: 494 AIYPSASLMNHSCNPNIISGFRSGSTLVVKSVRPIASGEEVFNCY-GPHFRRMTFQERQT 552
Query: 159 HLQTSKYFICQCERCR 174
LQ +F C C C+
Sbjct: 553 ALQEQYFFRCDCTACQ 568
>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
Length = 430
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP---IAPGDHLSICYTDPLWGTINRRH 158
I+ +AS H+C PN F + +R + P + G L+I Y D + T NR+
Sbjct: 201 GIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEGTPLTINYVDLMDLTANRQK 260
Query: 159 HLQTSKYFICQCERCRDPTE 178
L+ +F C C RC + E
Sbjct: 261 KLKEQYHFTCTCPRCLNSNE 280
>gi|261327654|emb|CBH10631.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 673
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTD-----SGQVLLRAMKPIAPGDHLSICYTD-- 148
T P A++ A+ H+C PN + SF S Q+ +RAM+ + G+ L+I Y +
Sbjct: 422 TIPVGKAVYVTAARFRHSCQPNCFASFVGNPLGCSLQLCIRAMRSVQAGEELTIAYHNMT 481
Query: 149 --PLWGTINRRHHLQTSKYFICQCERCRD 175
RR L F+C+C CRD
Sbjct: 482 KYKAVSAHTRRRSLVERCGFLCECIACRD 510
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 5/87 (5%)
Query: 104 FDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
F + I H+C PN +F + ++ RAM+PI G+ ++ Y D R LQ
Sbjct: 191 FPLGAMINHSCDPNCAITFVPKTLEMEFRAMRPIKAGEEITQTYVDVALPRRERHERLQR 250
Query: 163 SKYFICQCERCRDPTE----LNTFYDG 185
+F C C RC P + L+ F D
Sbjct: 251 KYHFNCACSRCSVPLQESGSLDAFLDA 277
>gi|320592601|gb|EFX05031.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 386
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 82 ICGIIQVNAHE-MPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAPG 139
+ G + N E PL+ ++ A I H+C PN++ S+ D + + ++PI G
Sbjct: 127 LVGTFRSNMLECRPLS--GLFGVYATAGRINHSCRPNVHHSWDDEKMRQVFCVLQPIRAG 184
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
D ++I Y T ++R +L S+ F C+CERC P
Sbjct: 185 DEITITYGRGGI-TSDQRVYLSGSRGFACRCERCLLP 220
>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
Length = 338
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 1/71 (1%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
++ S H+C PN + + VL R+M+ I G+ + I Y DP RR L
Sbjct: 133 VYAELSRANHSCQPNAAVVYNGAAAVL-RSMRDIPEGEEVCISYVDPTLARDVRRRELVQ 191
Query: 163 SKYFICQCERC 173
S F C C RC
Sbjct: 192 SYGFACDCARC 202
>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
Length = 383
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 1/73 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
A + AS + H+C P+L SF + V A + IA G+ L++ Y D G RR L
Sbjct: 302 AAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQAL 361
Query: 161 QTSKYFICQCERC 173
+ F C+C RC
Sbjct: 362 AWAYGFTCRCPRC 374
>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 422
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 62/143 (43%), Gaps = 3/143 (2%)
Query: 33 IWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHE 92
I D++ L + N + RR F +N E + +IL++ + N
Sbjct: 183 IKDRMSVLVGGFDNIVNKDDKKLIRRMTGIVKKAFSNVNYEVEESDILELLTMSTRNCFG 242
Query: 93 M--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
+ E +AI+ AS H+CYPN + + + A++ I P + + I Y +
Sbjct: 243 LWKNYEECFGLAIYLEASLFNHSCYPNAAR-VQRGRSIDIIAIRDIEPNEEICISYLNIT 301
Query: 151 WGTINRRHHLQTSKYFICQCERC 173
G+ R+ HL+ + F C C RC
Sbjct: 302 NGSHERKDHLKNNYLFDCVCIRC 324
>gi|353239178|emb|CCA71099.1| hypothetical protein PIIN_05034 [Piriformospora indica DSM 11827]
Length = 386
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/80 (31%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
Query: 98 PSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
P + +F S H+C PN + F + + LRA + I G+ + I Y D L + R
Sbjct: 165 PPHAGVFATLSRCNHSCGPNAFAHFNHQTWAIELRASRAIKAGEEIVISYVDVLDPCLKR 224
Query: 157 RHHLQTSKYFICQCERCRDP 176
R L + F C+C+ C P
Sbjct: 225 RAQLAANYEFTCKCKWCTFP 244
>gi|156386886|ref|XP_001634142.1| predicted protein [Nematostella vectensis]
gi|156221221|gb|EDO42079.1| predicted protein [Nematostella vectensis]
Length = 73
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
IF A + H+C PN F + + ++A++ I G+ L+I Y L R+ LQ
Sbjct: 3 GIFPNAVCLNHSCAPNSVAVFNGT-NIYIKALEEIPVGEELTISYIQQLHPRETRQEELQ 61
Query: 162 TSKYFICQCERC 173
T F CQC RC
Sbjct: 62 TQFCFYCQCHRC 73
>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
NZE10]
Length = 549
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN F Q+L RA+KPI G+ +S+ YTD R+ L+ +F C C
Sbjct: 236 HSCDPNAAALFNQPRQIL-RALKPIKKGEEISMKYTDVTNPLCVRQAELKGYYFFTCHCA 294
Query: 172 RCR 174
RC+
Sbjct: 295 RCK 297
>gi|328776415|ref|XP_003249162.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 714
Score = 46.6 bits (109), Expect = 0.005, Method: Composition-based stats.
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 7/90 (7%)
Query: 91 HEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
H ++P YI AI+ + H+CYP + + F +++RA++ + PGD ++ Y
Sbjct: 504 HRFRGSKPIYIGVAIYPTVARFNHDCYPAVTRYFLGRC-IVIRAIRSLRPGDVVAENY-G 561
Query: 149 PLWGTIN---RRHHLQTSKYFICQCERCRD 175
P++ N RR +L +F C+C CR+
Sbjct: 562 PIFTKRNLEERRRNLAGRYWFFCECNACRE 591
>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 679
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWG---TINRRH 158
A++ S + H C PN+ SF G + +RA K I GD + CY P G T +R+
Sbjct: 417 AVYPTVSMMNHACIPNIIPSFR-KGILCVRATKKIMHGDEIQHCY-GPQVGHMTTSDRQQ 474
Query: 159 HLQTSKYFICQCERC-RDPTELNTFYDGVKCPE 190
L F C+C C RDP + +KCP+
Sbjct: 475 ALLNQYCFTCRCRACTRDPKSRDNLC--MKCPK 505
>gi|346324090|gb|EGX93687.1| SET domain protein [Cordyceps militaris CM01]
Length = 404
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 45/113 (39%), Gaps = 1/113 (0%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQV-LLRA 132
F Q G I NA + + + + AIF + + H+C PN F ++ + A
Sbjct: 183 FGQPSYGPAQGRIYTNAFSIEIKDIEHYAIFPEIARLNHDCRPNAAYFFDEATLTHYVHA 242
Query: 133 MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
+ I PG L+I Y DP R L + F C C C L+ D
Sbjct: 243 LTDIHPGTELTITYIDPQMSRKERMETLADTWGFTCSCNSCSMAARLSLASDA 295
>gi|294880152|ref|XP_002768910.1| hypothetical protein Pmar_PMAR002623 [Perkinsus marinus ATCC 50983]
gi|239871908|gb|EER01628.1| hypothetical protein Pmar_PMAR002623 [Perkinsus marinus ATCC 50983]
Length = 196
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 35/75 (46%), Gaps = 3/75 (4%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK---PIAPGDHLSICYTDPLWGTINRRH 158
A+F S + H+C PN+ F+ G L+ + IA G+ L I Y D ++ R
Sbjct: 115 AVFPLLSCVNHSCRPNMEIEFSGDGATLVANVSESGSIAAGEELMISYCDIEEDSVGERQ 174
Query: 159 HLQTSKYFICQCERC 173
FIC CERC
Sbjct: 175 KQLDPYGFICNCERC 189
>gi|414871461|tpg|DAA50018.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 173
Score = 46.6 bits (109), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 43/106 (40%), Gaps = 18/106 (16%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRA------------------SFIEHNCYPNLYKSFTDSGQ 127
I++NA + L SY + A SF H+C PN + + +
Sbjct: 64 IRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQNAD 123
Query: 128 VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
L+A++ I G+ L ICY D R+ L F C+C RC
Sbjct: 124 AKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 169
>gi|400595566|gb|EJP63361.1| TPR domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 662
Score = 46.6 bits (109), Expect = 0.006, Method: Composition-based stats.
Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHH-L 160
++ RAS++ H+C PN K F +++RA++ I GD ++ Y D G + R L
Sbjct: 516 GLWARASYLNHSCMPNSVKDFAGD-LIVVRALQHIRTGDEITHAYQDN--GDYDARQALL 572
Query: 161 QTSKYFICQCERC 173
QT+ F C+C+ C
Sbjct: 573 QTTWGFTCRCKLC 585
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 14/148 (9%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ-----EEILKICGIIQVNAH 91
+ L +H+ E NS + + +R ++ K +E++ E I + I+VN+
Sbjct: 145 FLSLRSHLNEITNSNQEQYER------IMLSAKAEKEYSGSDLDIETIAEYLARIEVNSF 198
Query: 92 EM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+P + I A ++ H+C PN F D G + ++A+ I P + + I Y D
Sbjct: 199 TFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGF-DEGLITVKALCKIKPDEQVFISYIDN 257
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPT 177
+ R+ L +F C+C +C T
Sbjct: 258 TYPFEVRQKQLAERYFFTCKCSKCAQGT 285
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 14/148 (9%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ-----EEILKICGIIQVNAH 91
+ L +H+ E NS + + +R ++ K +E++ E I + I+VN+
Sbjct: 145 FLSLRSHLNEITNSNQEQYER------IMLSAKAEKEYSGSDLDIETIAEYLARIEVNSF 198
Query: 92 EM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+P + I A ++ H+C PN F D G + ++A+ I P + + I Y D
Sbjct: 199 TFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGF-DEGLITVKALCKIKPDEQVFISYIDN 257
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPT 177
+ R+ L +F C+C +C T
Sbjct: 258 TYPFEVRQKQLAERYFFTCKCSKCAQGT 285
>gi|195038187|ref|XP_001990541.1| GH18189 [Drosophila grimshawi]
gi|193894737|gb|EDV93603.1| GH18189 [Drosophila grimshawi]
Length = 393
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI GD + I Y D L + + RH
Sbjct: 294 SKINHSCVPNAQSTFPYSNDIVVLKAVAPIEKGDEICISYLDECQLERSRHSRHKTLREN 353
Query: 165 Y-FICQCERCR 174
Y F+CQC +CR
Sbjct: 354 YIFVCQCPKCR 364
>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 718
Score = 46.2 bits (108), Expect = 0.006, Method: Composition-based stats.
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 7/92 (7%)
Query: 89 NAHEMPLTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
N H + P YI AI+ + H+CYP + + F +++RA++ + PGD ++ Y
Sbjct: 502 NEHRFRGSRPLYIGVAIYPTVARFNHDCYPAVTRYFVGR-SIVIRAIRSLRPGDTVAENY 560
Query: 147 TDPLW---GTINRRHHLQTSKYFICQCERCRD 175
P++ +RR L +F C+C CR+
Sbjct: 561 -GPIFTKRSLEDRRRTLAARYWFRCECTACRE 591
>gi|170096156|ref|XP_001879298.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645666|gb|EDR09913.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 367
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 62/156 (39%), Gaps = 18/156 (11%)
Query: 33 IWDKLMQLEAHVEEY---KNSPKYENDRRNVVQFLLNFFKLNEE---FTQEEILKICGII 86
+WD L+ L+ + EY ++ P R + F+ F +L T E + I
Sbjct: 201 VWDDLLSLQDN--EYALIRSKPYILASRIRIYGFIKRFVQLARSELVSTPETVRAILARD 258
Query: 87 QVNAHEMPLTEPS-------YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPG 139
N + T P ++ S+ H+C PNL K G + +
Sbjct: 259 HGNVFGIWDTAPDDEESEMLGWGLYVSGSYFNHDCSPNLKKRRHKRG-INFYTTHDVNAD 317
Query: 140 DHLSICYTD--PLWGTINRRHHLQTSKYFICQCERC 173
+ L I Y D P INR+ L+ +F+CQC RC
Sbjct: 318 EELCISYVDVCPGASVINRKTALENDWFFVCQCARC 353
>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
Length = 442
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 64/136 (47%), Gaps = 11/136 (8%)
Query: 51 PKYENDRRNVVQF------LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIA-- 102
P + D++ + F L F + E++ + G + +N+ + E I
Sbjct: 121 PNIKGDQQRMEHFTSLCAVLFEFLGDDSLPNSAELMGMYGRMCINSFNIIDQELQCIGTG 180
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD--HLSICYTDPLWGTINRRHHL 160
++ AS I+H+C PN F D + +RA++ D + I Y D L T +R+ L
Sbjct: 181 MYLGASVIDHSCSPNAVAIF-DGPILSIRALQTFQYLDWSQIKISYIDILNTTKDRQSEL 239
Query: 161 QTSKYFICQCERCRDP 176
+ + YF+C+C +C +P
Sbjct: 240 EAAYYFLCKCPKCLEP 255
>gi|393212472|gb|EJC97972.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 444
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 41/85 (48%), Gaps = 4/85 (4%)
Query: 93 MPLTEPSYIAIFDRASFIEHNCYPNL----YKSFTDSGQVLLRAMKPIAPGDHLSICYTD 148
M L ++ ++ S + H+C PN+ + T ++ +RA + I PG+ L++ Y D
Sbjct: 313 MGLNLEAHGGLYILHSHMNHSCEPNISVRHFDQRTALSRITMRARRDIQPGEELTVAYVD 372
Query: 149 PLWGTINRRHHLQTSKYFICQCERC 173
P RR L + C C+RC
Sbjct: 373 PALLLSARRRALIPWAFGTCMCDRC 397
>gi|302686770|ref|XP_003033065.1| hypothetical protein SCHCODRAFT_107558 [Schizophyllum commune H4-8]
gi|300106759|gb|EFI98162.1| hypothetical protein SCHCODRAFT_107558, partial [Schizophyllum
commune H4-8]
Length = 422
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 61/123 (49%), Gaps = 9/123 (7%)
Query: 56 DRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPS---YIAIFDRASFIEH 112
DRR V+ L ++ + ++++ G+ VN + +P TE + Y+A+++ S H
Sbjct: 203 DRREVL-LRLTHAPAGDKLSAIDLIRANGLT-VNGYTLPETEGADGRYVAVYETLSRANH 260
Query: 113 NCYPNLYKSF-TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI-CQC 170
+C PN + +F S V LRA++ I G+ + I Y P R+ L + Y + C C
Sbjct: 261 SCRPNAHFAFHKPSFSVRLRALRDIKAGEEILISYVPPEAPYAQRQEEL--AHYGLSCAC 318
Query: 171 ERC 173
C
Sbjct: 319 GVC 321
>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
Length = 392
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI GD + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNSCSTFPYSNDIVVLKALTPIQEGDEICISYLDECQLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y F+CQC +CR
Sbjct: 354 YIFVCQCPKCR 364
>gi|358379728|gb|EHK17407.1| hypothetical protein TRIVIDRAFT_42740 [Trichoderma virens Gv29-8]
Length = 353
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 39/89 (43%), Gaps = 1/89 (1%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQV-LLRAMKPIAPGDHLSI 144
I NA E+ L E +Y A+F + + H+C PN F + A+ I PG ++I
Sbjct: 151 IDTNAFELELDEVTYYAVFPEIARLNHDCRPNAAYFFDRQSLTHYVHAITDITPGTEITI 210
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERC 173
Y DP R L + F C C C
Sbjct: 211 TYIDPHMPRQKRLKKLSSLWGFNCSCSLC 239
>gi|341038420|gb|EGS23412.1| hypothetical protein CTHT_0001010 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 362
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 85 IIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAPGDHLS 143
+++ N+ + + Y+A+F + + I H C P+ +F + + A I PGD ++
Sbjct: 170 VVKTNSFSIEIGGRGYMALFPKIARINHACKPSALTNFNSTDLSNKVTAFHDILPGDEIT 229
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
I YT T+ RR L + F C C C P
Sbjct: 230 ISYTPFALPTLTRRALLLQTWGFTCICPLCTLP 262
>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 704
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 46/94 (48%), Gaps = 7/94 (7%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWG---TINRRH 158
A++ S + H C PN+ SF G + +RA++ I GD + CY P G T +R+
Sbjct: 439 AVYPTVSLMNHACQPNVIASFR-KGIISVRAIEKIMRGDEIQHCY-GPQVGHMTTSDRQQ 496
Query: 159 HLQTSKYFICQCERC-RDPTELNTFYD-GVKCPE 190
L F C+C C R P + D +KCP+
Sbjct: 497 ALLNQYCFTCRCRACTRKPRTFDKEEDLCIKCPQ 530
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 46.2 bits (108), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 60/139 (43%), Gaps = 12/139 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + +N + E K+ NAH + +E P ++ S I H+C P
Sbjct: 163 NLVNLILRWPDVNIKEIAENFSKLA----CNAHTICDSELRPLGTGLYPVVSIINHSCLP 218
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC--- 173
N F D ++ ++ I G + I Y + T+ R+ L+ +F C C RC
Sbjct: 219 NAVLVF-DGRLAVVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRCIKM 277
Query: 174 --RDPTELNTFYDGVKCPE 190
D + + +G +C +
Sbjct: 278 GLLDDIQESAILEGYRCKD 296
>gi|156097208|ref|XP_001614637.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803511|gb|EDL44910.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 1072
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTI-NRRHH 159
I +F AS + H+C+PN S+ D ++ +M+ I+ + ++I D L+ +I +RR
Sbjct: 566 IMLFQYASLLNHSCFPNASYSYIDINKICFISMRTISRYEEITISLIDELYASIHHRRGK 625
Query: 160 LQTSKYFICQCERC 173
L K C C RC
Sbjct: 626 LSEVKNDTCFCNRC 639
>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
Length = 750
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 8/94 (8%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR--RHH 159
AI+ AS + H C P++ SF D G ++ RA IAPG ++ CY P + R R
Sbjct: 475 AIYPTASLLNHACDPDVLVSFVD-GVLVARATHNIAPGSGITHCY-GPHVNHMPREERQK 532
Query: 160 LQTSKY-FICQCERCRDPTELNTF---YDGVKCP 189
L +Y F CQC C E+ + CP
Sbjct: 533 LLYKQYFFTCQCSACTSDEEMENTRLCFSAFACP 566
>gi|440801197|gb|ELR22218.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 343
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 64/163 (39%), Gaps = 27/163 (16%)
Query: 19 IALRCLYKKSHQPQIW----DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEF 74
+ L+ L +P++W D + H E+ + YE R + F +N + +
Sbjct: 188 VPLQVLGFAKQKPELWTQDYDVFKRCFIHNEDQADFFSYEWYCRMMQIFYINSVSVTMDS 247
Query: 75 TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
TQ+ + ++ +SF H+C PN S+ ++ + A K
Sbjct: 248 TQQNV--------------------GAGLYILSSFYNHSCVPNTRSSYPENNTWHVYASK 287
Query: 135 PIAPGDHLSICYTDPLWGTI---NRRHHLQTSKYFICQCERCR 174
PI G + I Y D + RR HL F C+C RCR
Sbjct: 288 PIEAGQEIFISYVDHMKSNKTVETRRQHLYNHYGFWCECPRCR 330
>gi|72388114|ref|XP_844481.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359369|gb|AAX79807.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801014|gb|AAZ10922.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 673
Score = 46.2 bits (108), Expect = 0.007, Method: Composition-based stats.
Identities = 27/89 (30%), Positives = 42/89 (47%), Gaps = 9/89 (10%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTD-----SGQVLLRAMKPIAPGDHLSICYTD-- 148
T P A++ A+ H+C PN + SF S Q+ +RA++ + G+ L+I Y +
Sbjct: 422 TIPVGKAVYVTAARFRHSCQPNCFASFVGNPLGCSLQLCIRAIRSVQAGEELTIAYHNMT 481
Query: 149 --PLWGTINRRHHLQTSKYFICQCERCRD 175
RR L F+C+C CRD
Sbjct: 482 KYKAVSAHTRRRSLVERCGFLCECSACRD 510
>gi|402222955|gb|EJU03020.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 445
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 48/113 (42%), Gaps = 11/113 (9%)
Query: 79 ILKICGIIQVNAHEM--PLTEPSYIAIFDRASFIEHNCYPNLY-----KSFTDSGQVLLR 131
+++ I++ N + P S A+FD S H+C PN Y K F+ LLR
Sbjct: 175 MIEAISIMRTNGFGIDYPFDTESQRAVFDVISRANHSCVPNAYFEWDFKVFSG----LLR 230
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ PI G+ ++I YT L R+ L F C C C P L D
Sbjct: 231 PLVPIEKGEEITISYTSLLVPASERKADLLRKYGFECTCPACSLPPRLQKESD 283
>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 666
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 70/148 (47%), Gaps = 32/148 (21%)
Query: 64 LLNFFK----LNEEFTQEEI------LKICGIIQVNAHEMPLTEPSYIAIFDRA------ 107
L+N+F L E T++EI L+ I+Q NAHE+ E I D +
Sbjct: 403 LVNYFNKTDNLLETLTEDEIFIGGLILRHLQILQFNAHEISELEMENKNILDDSKSLFVG 462
Query: 108 -------SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWG---TINRR 157
+ H+C P + + F + QV++RA KPI G+ ++ Y P++ + R+
Sbjct: 463 GGLYPTLALFNHSCDPGIVRYFIGT-QVIVRAAKPIKKGEIVAENY-GPIYSQMKKLERQ 520
Query: 158 HHLQTSKYFICQCERCRD--PT--ELNT 181
L++ +F C C C + PT EL+T
Sbjct: 521 EKLKSQYWFTCTCTPCLELWPTFDELDT 548
>gi|168011005|ref|XP_001758194.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690650|gb|EDQ77016.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 532
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 26/172 (15%)
Query: 31 PQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ------------EE 78
P I D ++ L+ H E K + ++ R LL+ + L+ E T EE
Sbjct: 217 PTIQD-VLNLQTH--EDKATTAWKESVRKGCNLLLSSWVLDGEVTDIASNNNSQLSNLEE 273
Query: 79 ILKICGIIQVNAHEMPLTEP----SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+ K ++ NAH M L + + +F S + H+C+PN F G+V+ AM
Sbjct: 274 LEKFALLVNTNAHGMGLQDTHNADVALGMFPFVSMLNHSCWPNC--CFASEGRVMTAAMC 331
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQ-CERCRDPTELNTFYDG 185
P A G + + L G + Q++ + C C R P +T +DG
Sbjct: 332 P-ARGCGGVLVKSVSLAGRFEKED--QSASSWTCDVCSRVASPVS-STLFDG 379
>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
Length = 392
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S + H+C PN +F S V+L+A+ PI G+ + I Y D L + + RH +
Sbjct: 294 SKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKGEEICISYLDECQLERSRHSRHKILREN 353
Query: 165 Y-FICQCERCR 174
Y F+CQC +CR
Sbjct: 354 YIFVCQCSKCR 364
>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
Length = 481
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 39/143 (27%), Positives = 67/143 (46%), Gaps = 18/143 (12%)
Query: 40 LEAH-VEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQ----VNAHEMP 94
LE+H ++ K+ P+Y Q L + E++ E L + II+ ++ +
Sbjct: 130 LESHQIQTLKSKPEY-------FQKLALSARAVREYSGTE-LSLHKIIEYFCKLDINAFT 181
Query: 95 LTEPSY----IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL 150
LT P Y AI A+ H+C PN F D G++ +RA++ I G+ + + Y +
Sbjct: 182 LTTPFYDHVGAAIEPLAALCNHSCSPNAATDF-DKGKIWVRALRDIGKGEQVFVSYIETT 240
Query: 151 WGTINRRHHLQTSKYFICQCERC 173
+R+ L YF C+C +C
Sbjct: 241 DPYAHRQSELLKRYYFNCKCNKC 263
>gi|166240153|ref|XP_001733058.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|165988456|gb|EDR41012.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 594
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 69/169 (40%), Gaps = 30/169 (17%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRN--VVQFLLNFFKLNEEFTQEEILKICGIIQVNA- 90
D+ ++ H E KN R++ + Q + F N E TQE ++ II++NA
Sbjct: 414 MDQFLKRLVHTEPSKNHTTSYLSRKDMKIYQSIKGIFS-NREITQEIYHRVKSIIRLNAV 472
Query: 91 -----------HEMPLTEPSY------------IAIFDRASFIEHNCYPNLYKS--FTDS 125
+ P+ E Y +I + SF H+C PN++ + +
Sbjct: 473 AFPTSRIKILSEKNPMDELGYSFDFQEIPSQQLASILMQGSFFNHSCEPNVFIATPVVND 532
Query: 126 GQVLLRAMKPIAPGDHLSICYTDP-LWGTINRRHHLQTSKYFICQCERC 173
+ +PI G+ L I Y D T RR L+ + FIC C+ C
Sbjct: 533 KSIRFCTRRPIKKGEELFISYLDGEKLTTEKRRTTLKETYSFICNCQAC 581
>gi|393233731|gb|EJD41300.1| hypothetical protein AURDEDRAFT_186645 [Auricularia delicata
TFB-10046 SS5]
Length = 293
Score = 46.2 bits (108), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 7/98 (7%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA----MKPIAPGDHLS 143
+++H PL Y A + + H+C N F + R ++ +APGD ++
Sbjct: 88 LHSHLTPLAAAVYPAA---SRSLNHSCASNAVPLFVFAPATPPRMEVVLVRDVAPGDEIT 144
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
I Y DP RR L+ S F C C RC P+ L
Sbjct: 145 IPYIDPALAPSARRERLRASYGFECACARCITPSGLEA 182
>gi|390595808|gb|EIN05212.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 441
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 38/89 (42%), Gaps = 4/89 (4%)
Query: 89 NAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR----AMKPIAPGDHLSI 144
N M L S+ ++ S + H+C PN+ D L R A + I PG L+I
Sbjct: 314 NLGRMSLNLESHGGLYTLHSHLNHSCAPNVSVRHLDQRSALARITVIARQTILPGQELTI 373
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERC 173
Y +P RR L + C CERC
Sbjct: 374 SYVNPELPLPARRRRLGDWGFGKCTCERC 402
>gi|168039813|ref|XP_001772391.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676378|gb|EDQ62862.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 311
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 51/120 (42%), Gaps = 12/120 (10%)
Query: 60 VVQFLLNFFKLN--EEFTQEEILKICGIIQVNAHEMPLTEPSYI--AIFDRASFIEHNCY 115
+ + +N F++ EE E+IL A + S + A++ S HNC
Sbjct: 198 LARLHINAFRIERIEEADSEDILA--------AAAASIMGESIVGSAVYILPSMYNHNCD 249
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
PN+ + + L A + I G+ L I Y D RR L+ F C+C RCRD
Sbjct: 250 PNVDILWPSNATANLVARRSIKSGEELHITYIDSSMSLDERRSFLEQHYGFTCRCARCRD 309
>gi|67900474|ref|XP_680493.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
gi|40742081|gb|EAA61271.1| hypothetical protein AN7224.2 [Aspergillus nidulans FGSC A4]
Length = 678
Score = 45.8 bits (107), Expect = 0.007, Method: Composition-based stats.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 2/74 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD--PLWGTINRRHH 159
A++ RAS H+C PN+ G+++ A + IA G+ I Y D +RR H
Sbjct: 582 AVYPRASIANHSCCPNIIHKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRH 641
Query: 160 LQTSKYFICQCERC 173
LQ F C C RC
Sbjct: 642 LQGLFRFKCGCPRC 655
>gi|345496800|ref|XP_001601864.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 607
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 57/121 (47%), Gaps = 19/121 (15%)
Query: 79 ILKICGIIQVNAH--EMPLTEPSYIA------------IFDR-ASFIEHNCYPNLYKSFT 123
+ KI I+ VN H + L + +Y A IF S I H+C+PN+ + F
Sbjct: 401 LAKITNILAVNIHTFQGSLCDCNYFANCNDTCTGKRGQIFASCTSLINHSCHPNISRMFM 460
Query: 124 DSGQVLLRAMKPIAPGDHLSICYTDPLWGT--INRRHHLQTSKYFICQCERCRD--PTEL 179
+V++ P+ G+ L Y + T I R+ +LQ + F C+C+ CR+ EL
Sbjct: 461 PQRKVVVFTTCPVKKGEQLCDTYGPTVRYTNKIQRQQYLQNNYNFTCRCQACRENWSIEL 520
Query: 180 N 180
N
Sbjct: 521 N 521
>gi|157875626|ref|XP_001686198.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129272|emb|CAJ07812.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 700
Score = 45.8 bits (107), Expect = 0.008, Method: Composition-based stats.
Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 9/86 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ-----VLLRAMKPIAPGDHLSICYTD-PLWGTIN 155
A++ + H C PN Y SF + Q +++RA++PI G+ L++ Y + +
Sbjct: 448 AVYAIGALFNHACDPNCYVSFEGNPQGSCARLIVRAIRPIMEGEELTVSYGGISCFSFHS 507
Query: 156 RRHHLQT--SKY-FICQCERCRDPTE 178
RH LQT +Y F C C CR+ +
Sbjct: 508 MRHRLQTLRDRYGFFCGCRSCRNQVD 533
>gi|402075935|gb|EJT71358.1| hypothetical protein GGTG_10617 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 337
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 3/94 (3%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHL 142
GII+ NA IA F S I H+C PN ++++ + Q + A +PI G+ L
Sbjct: 98 GIIRTNAQSCG-ARSQMIATFATVSLINHSCEPNTHQAWNKEKEQETIHATRPIKAGEEL 156
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
+I Y T R+ L+ F C C C P
Sbjct: 157 TIAYV-MFETTAVRQATLKAIYGFTCTCPVCSKP 189
>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 749
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 68/147 (46%), Gaps = 12/147 (8%)
Query: 54 ENDRRNVVQFLLNFFKLNEEFTQEEILKICGII---QVNAHEMPLTEPSYIA--IFDRAS 108
E+D R V LL L I KI ++ + N +++ + + IA I+ AS
Sbjct: 410 EDDERLYVSSLLLRHVLQLISNGHAITKIKAVVNNTRYNKNKVFIQQEDRIATAIYPSAS 469
Query: 109 FIEHNCYPNLYKSFTDSGQVLL-RAMKPIAPGDHLSICYTDPLWGTI--NRRHHLQTSKY 165
+ H+C PN+ SF GQ L+ +A + IA G+ + CY + +R+ +++
Sbjct: 470 MMNHSCDPNIINSFL--GQFLITKATRDIAAGEEVFNCYGADFRRMLRKDRQEKMESQYC 527
Query: 166 FICQCERCRDP--TELNTFYDGVKCPE 190
F C C C P ++ + KCPE
Sbjct: 528 FKCNCAACSAPEYEDILKKFTAKKCPE 554
>gi|392563243|gb|EIW56422.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 430
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 38/85 (44%), Gaps = 4/85 (4%)
Query: 93 MPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR----AMKPIAPGDHLSICYTD 148
M L ++ ++ S I H+C PN+ D L R A + IAPG+ L I Y +
Sbjct: 310 MSLNLEAHGGLYVLHSHINHSCAPNVSVRHLDQRTALSRITLLARRDIAPGEELLITYVN 369
Query: 149 PLWGTINRRHHLQTSKYFICQCERC 173
P RR L + C CERC
Sbjct: 370 PELPLEQRRRQLMEWGFGKCMCERC 394
>gi|353242193|emb|CCA73857.1| hypothetical protein PIIN_07811 [Piriformospora indica DSM 11827]
Length = 365
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 41/88 (46%), Gaps = 3/88 (3%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRR 157
+ IF + I H C +++ D+ + ++ A KPI G+ + I YTD NRR
Sbjct: 160 VGIFPTIARINHGCSSAFNAAYSWREDTKETVVYAFKPIFQGEEILISYTDTKKSRDNRR 219
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDG 185
+L+ S F C C C EL+ D
Sbjct: 220 AYLKQSYNFDCTCSVCSLSPELSKASDA 247
>gi|402087542|gb|EJT82440.1| hypothetical protein GGTG_02413 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 371
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 57/114 (50%), Gaps = 9/114 (7%)
Query: 67 FFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA-----SFIEHNCYPNLYKS 121
FF L++ KI GI N+ + + +PS ++ A S + H+C PN
Sbjct: 106 FFSLSDSSHSHGSGKIRGIWVTNSLPVMIPDPSNFSLPREAVALTMSRVNHSCQPNSLLR 165
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN--RRHHLQTSKYFICQCERC 173
+ D G L+A++PI G+ ++I Y D + T + RR L+ +Y C+CE C
Sbjct: 166 Y-DEGWYQLKALRPIRAGEQVTITYKDYIHQTTSQWRRRGLE-ERYVECKCEVC 217
>gi|395328146|gb|EJF60540.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 300
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
AIF S +C PN+ + D ++ A IA G+ L ICY D +RR L
Sbjct: 103 AIFVIGSRFNSSCQPNVNNYWNEDLQKIAFWATSDIAEGEELCICYGDLWKARDDRRRRL 162
Query: 161 QTSKYFICQCERC 173
++S F+CQC C
Sbjct: 163 ESSFRFVCQCVAC 175
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 45.8 bits (107), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 34/144 (23%), Positives = 68/144 (47%), Gaps = 14/144 (9%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQ-----EEILKICGIIQVNAH 91
+ L +H++E +S + + +R ++ K +E++ E I + I+VN+
Sbjct: 144 FLGLRSHLKEIASSNQEQYER------IMLCAKAEKEYSGSGLDLETIAEYLARIEVNSF 197
Query: 92 EMPLT--EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+ +P + I A ++ H+C PN F D G + ++A++ I P + + I Y D
Sbjct: 198 TFTTSFGDPLGLCIQPFACYMNHSCDPNAVVGF-DEGLITVKALRKIKPDEQVFISYIDN 256
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
+ R+ L +F C+C +C
Sbjct: 257 TYPFEVRQKQLAERYFFTCKCSKC 280
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Query: 79 ILKICGIIQVNAHEMPLTEPSYIAIFD----RASFIEHNCYPNLYKSFTDSGQVLLRAMK 134
+LK+ G N+ + + IAI D RAS + H+C N F D ++ LR +K
Sbjct: 172 LLKMYGATNCNS--FGIFDNDLIAISDGIYLRASMVNHSCDYNCIVVF-DERKLQLRTVK 228
Query: 135 PIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTF 182
+ G+ +I Y D + RR L+ +F C+C +C + E+N
Sbjct: 229 DVKEGEECTIGYVDVIHPAKERRAELEEKYHFTCKCVKCNE--EINAL 274
Score = 37.7 bits (86), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/68 (26%), Positives = 35/68 (51%), Gaps = 1/68 (1%)
Query: 106 RASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
+AS + H+C PN +D ++ + +K + G+ +I Y D + R+ L+ +
Sbjct: 362 KASMLNHSCDPNC-AWVSDGRKIQIMTVKDVKEGEECTITYVDVMDPAKVRQADLKERYH 420
Query: 166 FICQCERC 173
F C+C +C
Sbjct: 421 FTCKCVKC 428
>gi|358342712|dbj|GAA50167.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
Length = 761
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLW----GTINRR 157
A+F S + H C P++ SF + ++LR KPI P D + CY P + + +RR
Sbjct: 504 ALFPCVSMLNHACEPSVSNSFENQF-IVLRCAKPIHPSDEVYNCY-GPHYLHDTSSSSRR 561
Query: 158 HHLQTSKYFICQCERCRDP 176
LQ +F C C+ C +P
Sbjct: 562 AQLQKQYFFNCSCQHCTNP 580
>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
Length = 393
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI G+ + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIVVLKALAPIQQGEEICISYLDECMLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YVFICQCPKCR 364
>gi|407425398|gb|EKF39397.1| hypothetical protein MOQ_000377 [Trypanosoma cruzi marinkellei]
Length = 623
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDP---LWGTINRR 157
F + H+C+ N + + V L A++PI+ GD +++ Y L G RR
Sbjct: 250 GFFRACALANHSCHANAAMKYNPEDETVTLIAVRPISAGDFVNVKYLSDAHFLMGVGKRR 309
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+L+ S F C+C+RC +E + + +KCP
Sbjct: 310 EYLR-SWLFWCKCDRCSMDSENSATQEQIKCP 340
>gi|393233728|gb|EJD41297.1| hypothetical protein AURDEDRAFT_106306 [Auricularia delicata
TFB-10046 SS5]
Length = 373
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 43/98 (43%), Gaps = 7/98 (7%)
Query: 88 VNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRA----MKPIAPGDHLS 143
+++H PL Y A + + H+C N F + R ++ +APGD ++
Sbjct: 168 LHSHLTPLAAAVYPAA---SRALNHSCATNAVPLFVFAPATPPRMEVVLVRDVAPGDEIT 224
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNT 181
I Y DP RR L+ S F C C RC P+ L
Sbjct: 225 IPYIDPALAPSARRERLRASYGFECACARCITPSGLEA 262
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 65 LNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
++FF++ + T C ++ EM + + ++ S + H+C PN F +
Sbjct: 169 IDFFQIFTKVT-------CNCFTISNGEM---QDVGVGLYPSMSLLNHSCDPNCVIIF-E 217
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
Q+LLR+++ I G+ L+I Y + L T R+ L+ F C C C+D
Sbjct: 218 GYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDCCLCQD 268
>gi|303322581|ref|XP_003071282.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
gi|240110984|gb|EER29137.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
Length = 367
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ GI + NA +P + + IF RA+ I H+C N ++ + ++ + A K I G
Sbjct: 116 VIGITKTNA--IPFGSGGADGGIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEG 173
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++I Y D R+ L+ + F+CQCE C E N D
Sbjct: 174 EEITISYVDGAETFNTRQLCLEEAFGFVCQCELCSLSAEENKKRD 218
>gi|342883051|gb|EGU83613.1| hypothetical protein FOXB_05861 [Fusarium oxysporum Fo5176]
Length = 374
Score = 45.8 bits (107), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 44/95 (46%), Gaps = 2/95 (2%)
Query: 81 KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
+ G VN + P E +A++ RA+ H+C PN+ +++ A + I+ G+
Sbjct: 242 RATGSQPVNDNAAPRGESYGMALYPRAAQFNHSCLPNVTHQPDGQARMVYTAARDISKGE 301
Query: 141 HLSICYTDPLW--GTINRRHHLQTSKYFICQCERC 173
I Y D +R++H Q F C CERC
Sbjct: 302 ECMITYFDLTTHKDLTSRQNHTQEQFQFKCTCERC 336
>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
Length = 391
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI G+ + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIVVLKALGPIQQGEEICISYLDECMLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YVFICQCPKCR 364
>gi|242020146|ref|XP_002430517.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212515674|gb|EEB17779.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 378
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 4/89 (4%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMK--PIAPGDHLSICYTDPLWGTINRRHH 159
++ +S I+H+C PN F + + +R ++ PI + I Y + L R+
Sbjct: 126 GLYLASSIIDHSCSPNAVAVFKGT-TIFIRTLEDIPIMDWSKVFISYIELLNLPEIRQQE 184
Query: 160 LQTSKYFICQCERCRDPTELNTFYDGVKC 188
L +S YF+CQC +C D LN F + C
Sbjct: 185 LLSSYYFLCQCSKCTDSDNLN-FMKSIYC 212
>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
Length = 392
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 39/71 (54%), Gaps = 4/71 (5%)
Query: 108 SFIEHNCYPNLYKSFTDSGQ-VLLRAMKPIAPGDHLSICYTDP--LWGTINRRHHLQTSK 164
S I H+C PN +F S V+L+A+ PI G+ + I Y D L + + RH +
Sbjct: 294 SKINHSCVPNACSTFPYSNDIVVLKALGPIQQGEEICISYLDECMLERSRHSRHKVLREN 353
Query: 165 Y-FICQCERCR 174
Y FICQC +CR
Sbjct: 354 YVFICQCPKCR 364
>gi|320032992|gb|EFW14942.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 358
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ GI + NA +P + + IF RA+ I H+C N ++ + ++ + A K I G
Sbjct: 116 VIGITKTNA--IPFGSGGADGGIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEG 173
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++I Y D R+ L+ + F+CQCE C E N D
Sbjct: 174 EEITISYVDGAETFNTRQLCLEEAFGFVCQCELCSLSAEENKKRD 218
>gi|119190077|ref|XP_001245645.1| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
gi|392868546|gb|EAS34345.2| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
Length = 367
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 51/105 (48%), Gaps = 4/105 (3%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ GI + NA +P + + IF RA+ I H+C N ++ + ++ + A K I G
Sbjct: 116 VIGITKTNA--IPFGSGGADGGIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEG 173
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYD 184
+ ++I Y D R+ L+ + F+CQCE C E N D
Sbjct: 174 EEITISYVDGAETFNTRQLCLEEAFGFVCQCELCSLSAEENKKRD 218
>gi|356518519|ref|XP_003527926.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
ASHR1-like [Glycine max]
Length = 401
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 61/137 (44%), Gaps = 12/137 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + ++N + E K NAH + +E P ++ S I H+C P
Sbjct: 87 NLVNSILQWPEINIKEIAENFSKFA----CNAHTICDSELRPVGTGLYPVISIINHSCLP 142
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N F + L+ ++ I G + I Y + T+ +++ L++S F C C RC
Sbjct: 143 NSVLVF-EGRSALVPTVQHIPTGTEVPISYIETAGSTMTQQNALKSSTXFTCTCPRCSKV 201
Query: 175 ---DPTELNTFYDGVKC 188
D + + +G KC
Sbjct: 202 GEYDDIQESAILEGYKC 218
>gi|401428423|ref|XP_003878694.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494943|emb|CBZ30246.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 700
Score = 45.4 bits (106), Expect = 0.010, Method: Composition-based stats.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 9/86 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ-----VLLRAMKPIAPGDHLSICYTD-PLWGTIN 155
A++ + H C PN Y SF + Q +++RA++P+ G+ L++ Y + +
Sbjct: 448 AVYAVGALFNHACDPNCYVSFEGNPQGSCARLIVRAIRPVMEGEELTVSYGGISCFSFHS 507
Query: 156 RRHHLQT--SKY-FICQCERCRDPTE 178
RH LQT +Y F C C CR+ +
Sbjct: 508 MRHRLQTLRDRYGFFCGCRSCRNQVD 533
>gi|162458873|ref|NP_001105199.1| SET domain protein 123 [Zea mays]
gi|27466907|gb|AAO12860.1| SET domain protein 123 [Zea mays]
gi|194705040|gb|ACF86604.1| unknown [Zea mays]
gi|195624178|gb|ACG33919.1| SET domain protein 123 [Zea mays]
gi|414871456|tpg|DAA50013.1| TPA: SET domain protein 123 [Zea mays]
Length = 303
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 30/118 (25%), Positives = 47/118 (39%), Gaps = 18/118 (15%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRA------------------SFIEHNCY 115
T + + + I++NA + L SY + A SF H+C
Sbjct: 182 LTIDWYINVLARIRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCD 241
Query: 116 PNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PN + + + L+A++ I G+ L ICY D R+ L F C+C RC
Sbjct: 242 PNAHIVWLQNADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRC 299
>gi|407860332|gb|EKG07351.1| hypothetical protein TCSYLVIO_001535 [Trypanosoma cruzi]
Length = 623
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDP---LWGTINRR 157
F + H+C+ N + + V L A++PI+ GD +++ Y L G RR
Sbjct: 250 GFFRACALANHSCHANAAMKYNPEDETVTLIAVRPISAGDFVNVKYLSDAHFLMGVGKRR 309
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+L+ S F C+C+RC +E + + +KCP
Sbjct: 310 EYLR-SWLFWCKCDRCSMDSENSATQEQIKCP 340
>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 438
Score = 45.4 bits (106), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 97 EPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
E ++ R SF H+C PN+ Y ++ +V M+ + G+ L I Y D +
Sbjct: 326 ESRGCGVYVRNSFFNHSCNPNVNYWVVENTLEVECSLMRAVRKGEELCISYIDTAASLRD 385
Query: 156 RRHHLQTSKYFICQCERC 173
RR L F C+CE+C
Sbjct: 386 RREKLSEGYLFHCRCEKC 403
>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
antarctica T-34]
Length = 882
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 45/191 (23%), Positives = 73/191 (38%), Gaps = 44/191 (23%)
Query: 23 CLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQF---------LLNFFKLNEE 73
C +++ + P+ + QL H+ +Y + R F L + E
Sbjct: 395 CSHRERYAPEELQQQAQLALHLAQYLTA-----HARAAATFGPVSRGEANLDEMMQPEAE 449
Query: 74 FTQEEILKICGI----------IQVNAHEMPLTE----PSYIAIFDRASFIEHNCYPNLY 119
F+Q + L+ GI Q +++ LT+ P + + + + H+C PN
Sbjct: 450 FSQGDALRALGIESATELLDLVCQFSSNSFTLTDSDLNPLGVCMHASMAMLNHSCAPNAA 509
Query: 120 KSF-----TDSGQ-----------VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTS 163
F SGQ + L A++ I PG+ L I Y D RR L+
Sbjct: 510 VVFPFGGAAKSGQQNWSDGEDAKVMQLVALRAIEPGEELLISYVDVADAYEQRRSALKKR 569
Query: 164 KYFICQCERCR 174
F C+CE CR
Sbjct: 570 YCFECRCELCR 580
>gi|393236328|gb|EJD43877.1| hypothetical protein AURDEDRAFT_114449 [Auricularia delicata
TFB-10046 SS5]
Length = 451
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 54/120 (45%), Gaps = 8/120 (6%)
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAH---EMPLTEP-SYIAIFDRASFIEHNCYPNLY 119
L N + EF + GI N++ +P + Y +F S + H+C PN
Sbjct: 203 LHNTWAGRTEFDHRGNGVLTGIYNTNSYFLDGLPSDDKIEYSGVFPTFSRVSHSCCPNAI 262
Query: 120 KSFTDSGQVLL--RAMKPIAPGDHLSICYTDP-LWGTINRRHHLQTSKYFICQCERCRDP 176
+ DSG + L RA + I G ++I Y P L T R+ L+ +F C C CRDP
Sbjct: 263 PRW-DSGTMTLELRATRHIQQGAEVTISYVPPPLKPTEARQEFLRDGYHFECTCPACRDP 321
>gi|71412230|ref|XP_808310.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70872488|gb|EAN86459.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 567
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDP---LWGTINRR 157
F + H+C+ N + + V L A++PI+ GD +++ Y L G RR
Sbjct: 292 GFFRACALANHSCHANAAMKYNPEDETVTLIAVRPISAGDFVNVKYLSDAHFLMGVGKRR 351
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+L+ S F C+C+RC +E + + +KCP
Sbjct: 352 EYLR-SWLFWCKCDRCSMDSENSATQEQIKCP 382
>gi|451851263|gb|EMD64564.1| hypothetical protein COCSADRAFT_37132 [Cochliobolus sativus ND90Pr]
Length = 463
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSI 144
+ N E+ L+ + +F + + H+C PN F D+ + A++PI PG+ ++I
Sbjct: 241 LTTNYFELTLSGETLSGLFPEIAMMNHDCRPNAAYFFDEDTMTHYVHAIRPIYPGEEITI 300
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERC 173
Y + + R L+T+ F C C C
Sbjct: 301 TYINNEVTRVRRMGRLRTNWGFTCACSAC 329
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 65 LNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTD 124
++FF++ + T C ++ EM + + ++ S + H+C PN F +
Sbjct: 168 IDFFQIFTKVT-------CNCFTISNGEM---QDVGVGLYPSMSLLNHSCDPNCVIIF-E 216
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRD 175
Q+LLR+++ I G+ L++ Y + L T R+ L+ F C C C+D
Sbjct: 217 GYQLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDCCLCQD 267
>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
[Brachypodium distachyon]
gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
[Brachypodium distachyon]
Length = 389
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 102 AIFDRASFIEHNCYPNL----YKSFTDSG--QVLLRAMKPIAPGDHLSICYTDPLWGTIN 155
A++ RAS + H+C PN Y G ++LRA+ I G + I Y W +
Sbjct: 196 AVYHRASLLNHDCLPNACHFDYPDRPGPGNTDIVLRALHGITAGMEVRISYFAANWRYAD 255
Query: 156 RRHHLQTSKYFICQCERCR 174
R+ L F C+CERC+
Sbjct: 256 RQRRLLEDYGFRCECERCQ 274
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
I+ A+ + H C N ++ S + +RA++ I G+ ++ +TD T+ R+ HL
Sbjct: 272 GIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEITHAFTDAASPTVVRKAHL 331
Query: 161 QTSKYFICQCERCRD 175
++ F C C RC D
Sbjct: 332 KSLYAFDCNCSRCND 346
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 45.4 bits (106), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 35 DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
D+ +QL H+++ ++ + +R ++ + + + EE++ G +++ +
Sbjct: 131 DQFLQLGTHIKDIRDQSASQWERISLSSKAIRAYSGTD--MSEEVISAMGA-KLDLNSFN 187
Query: 95 LTEPSYIAIFDR--------ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
LT A++DR A+ H+C N SF D + ++A++PI + + I Y
Sbjct: 188 LTN----AVYDRLGVYLHPYAAIFNHSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITY 242
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERC 173
D R+ +LQ+ YF C C +C
Sbjct: 243 IDVTDPYPIRQANLQSRYYFTCHCSKC 269
>gi|146098807|ref|XP_001468476.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134072844|emb|CAM71560.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 700
Score = 45.4 bits (106), Expect = 0.011, Method: Composition-based stats.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 9/86 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ-----VLLRAMKPIAPGDHLSICYTD-PLWGTIN 155
A++ + H C PN Y SF + Q +++RA++P+ G+ L++ Y + +
Sbjct: 448 AVYAIGALFNHACDPNCYVSFEGNPQGSCARLIVRAIRPVMEGEELTVSYGGISCFSFHS 507
Query: 156 RRHHLQT--SKY-FICQCERCRDPTE 178
RH LQT +Y F C C CR+ +
Sbjct: 508 MRHRLQTLRDRYGFFCGCRSCRNQVD 533
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/174 (26%), Positives = 78/174 (44%), Gaps = 32/174 (18%)
Query: 29 HQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQ-FLLNFFKLNEEFTQEEILKICGIIQ 87
++PQ ++ ++QL H K+ P + R V+ +LL + F EE +++ I
Sbjct: 311 YRPQDYNTIIQLVTHA---KDRPVQDLFHRTVMAVYLLKLLQQTSYFNGEEDVEMQAYIA 367
Query: 88 -----------VNAHEMP----------LTEPSYIA--IFDRASFIEHNCYPNLYKSFTD 124
NAHE+P L+ P+ + I+ S H+C P + ++F
Sbjct: 368 GLFLSHLQSFPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNHSCDPGVNRNFY- 426
Query: 125 SGQVLLRAMKPIAPGDHLSICYTDPLWGT--INRRH-HLQTSKYFICQCERCRD 175
++RA+K I G +S Y L+ T + RH LQ +F C+CE C +
Sbjct: 427 GDTCVVRAIKTIRKGHQVSDNY-GALYATNTLKERHDKLQPQYFFSCRCEPCSN 479
>gi|440790098|gb|ELR11386.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 307
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 43/94 (45%), Gaps = 7/94 (7%)
Query: 87 QVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSF-------TDSGQVLLRAMKPIAPG 139
Q +A ++ + E + S + H+C PN+ + DS +V L A + + P
Sbjct: 201 QADAEQIKMMEFDGAVVCPILSSLNHSCQPNVKSTLRSQAQARGDSCRVSLIAKQDVEPN 260
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ L I Y W R+ L + +F CQCERC
Sbjct: 261 EELCISYVPLSWTRATRQTTLLKNYFFHCQCERC 294
>gi|224101385|ref|XP_002312257.1| SET domain protein [Populus trichocarpa]
gi|222852077|gb|EEE89624.1| SET domain protein [Populus trichocarpa]
Length = 542
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 58/117 (49%), Gaps = 8/117 (6%)
Query: 69 KLNEEFTQEEILKICGI-----IQVNAHEMPLTEPSY-IAIFDRASFIEHNCYPNLYKSF 122
+LNE+ + +IL + + V+A + Y + ++ ASFI H+C PN +
Sbjct: 290 ELNEKLDKVKILNVLDVNSLVEDSVSAKVLGRNSDYYGVGLWVLASFINHSCNPNARRLH 349
Query: 123 TDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
VL+ A + + G+ ++ Y D L +++R+ + + F C C+RC+ E+
Sbjct: 350 VGD-HVLVHASRDVKAGEEITFAYFDVL-SPLSKRNEMSKTWGFHCSCKRCKFEEEM 404
>gi|429849921|gb|ELA25247.1| hypothetical protein CGGC5_13535 [Colletotrichum gloeosporioides
Nara gc5]
Length = 380
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 45/102 (44%), Gaps = 3/102 (2%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSG--QVLLRAMKPIAPGDHLS 143
+ N+ + L P+ +A+ R + I H+C PN Y G ++ + IA G+ ++
Sbjct: 191 LNANSLTIKLRMPN-MAVLPRIARINHDCRPNAYMRLPTDGLSGTVVAGEQGIAQGEEIT 249
Query: 144 ICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
I Y RR +LQ F C C+ C P E+ D
Sbjct: 250 ISYLPVELARERRRRNLQKDWGFECGCKLCTAPEEVVAASDA 291
>gi|398022224|ref|XP_003864274.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502509|emb|CBZ37592.1| hypothetical protein, conserved [Leishmania donovani]
Length = 700
Score = 45.4 bits (106), Expect = 0.012, Method: Composition-based stats.
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 9/86 (10%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQ-----VLLRAMKPIAPGDHLSICYTD-PLWGTIN 155
A++ + H C PN Y SF + Q +++RA++P+ G+ L++ Y + +
Sbjct: 448 AVYAIGALFNHACDPNCYVSFEGNPQGSCARLIVRAIRPVMEGEELTVSYGGISCFSFHS 507
Query: 156 RRHHLQT--SKY-FICQCERCRDPTE 178
RH LQT +Y F C C CR+ +
Sbjct: 508 MRHRLQTLRDRYGFFCGCRSCRNQVD 533
>gi|224029645|gb|ACN33898.1| unknown [Zea mays]
Length = 369
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 14/124 (11%)
Query: 70 LNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL 129
+N F E K+C + QV + A++ S H+C PN++ F S +
Sbjct: 110 INRGFYVAEGAKMCSVEQVRVAQ---------ALYVSGSLFNHSCQPNVHAYFL-SRAFV 159
Query: 130 LRAMKPIAPGDHLSICYTDPLWGTIN---RRHHLQTSKYFICQCERCRDPTELNTFYDGV 186
LR + I G + + Y P G ++ R+ LQ + YF CQC C + + +
Sbjct: 160 LRTTEFIKSGSPVELSY-GPQVGEMHISERQKSLQENYYFSCQCSSCSELNLSDLVMNSF 218
Query: 187 KCPE 190
CP+
Sbjct: 219 CCPQ 222
>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
Length = 256
Score = 45.4 bits (106), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 4/76 (5%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPL---WGTINRR 157
A++ S I H+C PN F S VL LRA++ +A G+ + I Y D +R+
Sbjct: 151 ALYAMQSKINHSCTPNAEIVFPKSNHVLALRALRDLAAGEEICISYLDECNLQRSRHSRQ 210
Query: 158 HHLQTSKYFICQCERC 173
+L+ F CQCERC
Sbjct: 211 KNLREYYLFECQCERC 226
>gi|345560264|gb|EGX43389.1| hypothetical protein AOL_s00215g125 [Arthrobotrys oligospora ATCC
24927]
Length = 484
Score = 45.1 bits (105), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 6/103 (5%)
Query: 82 ICGIIQVNAHEMPLTEPSYIA-IFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPG 139
+ GI + NA +P S IA I+ I H+C PN Y ++ ++ Q + A++PI G
Sbjct: 239 LSGIFKTNA--IPCGYNSSIAGIYLTICLINHSCIPNTYHNWNENLEQETIHAIRPINAG 296
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERC-RDPTELNT 181
+ ++I Y RR LQ F CQC+ C DP +L +
Sbjct: 297 EEITISYISESMSN-PRRKRLQECFGFDCQCQLCTSDPKQLES 338
>gi|330792060|ref|XP_003284108.1| hypothetical protein DICPUDRAFT_93608 [Dictyostelium purpureum]
gi|325085922|gb|EGC39320.1| hypothetical protein DICPUDRAFT_93608 [Dictyostelium purpureum]
Length = 731
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/141 (24%), Positives = 57/141 (40%), Gaps = 28/141 (19%)
Query: 60 VVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLT----------------------- 96
+ Q + N F +N E T + ++ I+Q+NA P +
Sbjct: 579 MFQLMKNIF-MNREITTDIFQRVKSIVQLNALIFPTSTIRVLSERNPMDELGWNFDFEEV 637
Query: 97 -EPSYIAIFDRASFIEHNCYPNLYKS--FTDSGQVLLRAMKPIAPGDHLSICYTDPL-WG 152
+ +I +ASF H+C PN++ + + + +PI G+ L I Y D
Sbjct: 638 QSRTVFSILQQASFFNHSCEPNIFIATPVVNDKSIRFCTRRPIKKGEELFITYLDGFDLD 697
Query: 153 TINRRHHLQTSKYFICQCERC 173
T R+ L T+ F C+C C
Sbjct: 698 TETRKTILNTTHMFTCKCPSC 718
>gi|312380816|gb|EFR26711.1| hypothetical protein AND_07035 [Anopheles darlingi]
Length = 368
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 48/89 (53%), Gaps = 15/89 (16%)
Query: 15 YECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF-----K 69
Y+ ++ L+CL + P +D++M LE+H EE K + Y + VV +LL F K
Sbjct: 101 YDALLVLKCLAIQIRSPTQFDQMMNLESHYEERKKTHYYRDTDDRVVSYLLRNFLDPLQK 160
Query: 70 LNEEFTQEEILKIC---------GIIQVN 89
L E+ +++LK+C GI++VN
Sbjct: 161 L-EKKQDKQVLKVCDRKALHKIAGILEVN 188
>gi|407847654|gb|EKG03293.1| hypothetical protein TCSYLVIO_005668 [Trypanosoma cruzi]
Length = 729
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 101 IAIFDRASFIEHNCYPNLYKSF----TDSGQVL--LRAMKPIAPGDHLSICYTDPLWGTI 154
I +F S IEH+C+PN F T V LRA +PI G+ ++I Y
Sbjct: 373 IGVFGGISLIEHSCHPNALVVFRHGCTPESIVFAELRATRPIGIGERITISYVPTFIPKE 432
Query: 155 NRRHHLQTSKYFICQCERC 173
R+ L+ +F C C C
Sbjct: 433 ERQKRLRAKFFFSCACVHC 451
>gi|259483433|tpe|CBF78818.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 341
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 36/74 (48%), Gaps = 2/74 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD--PLWGTINRRHH 159
A++ RAS H+C PN+ G+++ A + IA G+ I Y D +RR H
Sbjct: 245 AVYPRASIANHSCCPNIIHKPDKVGRMVFTAGRDIAAGEECCISYFDMTQYVSLQDRRRH 304
Query: 160 LQTSKYFICQCERC 173
LQ F C C RC
Sbjct: 305 LQGLFRFKCGCPRC 318
>gi|403164683|ref|XP_003324753.2| hypothetical protein PGTG_06290 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165269|gb|EFP80334.2| hypothetical protein PGTG_06290 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 463
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/66 (34%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 109 FIEHNCYPNLYKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI 167
++ H+C PN+ F + + QV + A++ IAPG+ LSI Y + RR L + F
Sbjct: 261 YLNHDCRPNVGYRFNNVTIQVEMHALREIAPGEELSISYITLVQSREKRRKSLHGTYGFH 320
Query: 168 CQCERC 173
C C +C
Sbjct: 321 CGCSQC 326
>gi|297806681|ref|XP_002871224.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
gi|297317061|gb|EFH47483.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
Length = 325
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 35/73 (47%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + + L ++ + G+ L ICY D G R+ L
Sbjct: 250 AVYMLPSFYNHDCDPNAHIIWLQNADARLNTLRDVEEGEELRICYIDASMGYEARQTILS 309
Query: 162 TSKYFICQCERCR 174
F+C C RC+
Sbjct: 310 QGFGFLCNCLRCQ 322
>gi|258569357|ref|XP_002543482.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903752|gb|EEP78153.1| predicted protein [Uncinocarpus reesii 1704]
Length = 369
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 4/94 (4%)
Query: 82 ICGIIQVNAHEMPL-TEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPG 139
+ GI + NA +P + S IF RA+ I H+C N S+ + G++ + A + I G
Sbjct: 126 VIGITRTNA--IPFGSRGSAGGIFPRAARINHSCKQNAQNSWNYNIGKLTIHAFRDIEEG 183
Query: 140 DHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ ++I Y D R++ L+ + F C+CE C
Sbjct: 184 EEITIAYIDGSEYFETRQNTLEEAFGFKCRCEIC 217
>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
Length = 415
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 35 DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
D+ +QL H+++ ++ + +R ++ + + + EE++ G +++ +
Sbjct: 99 DQFLQLGTHIKDIRDQSASQWERISLSSKAIKAYSGTD--MSEEVISAMGA-KLDLNSFN 155
Query: 95 LTEPSYIAIFDR--------ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
LT A++DR A+ H+C N SF D + ++A++PI + + I Y
Sbjct: 156 LTN----AVYDRLGVYLHPYAAIFNHSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITY 210
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERC 173
D R+ +LQ+ YF C C +C
Sbjct: 211 IDVTDPYPIRQANLQSRYYFTCHCSKC 237
>gi|440487605|gb|ELQ67385.1| hypothetical protein OOW_P131scaffold00319g4 [Magnaporthe oryzae
P131]
Length = 312
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
E ++ ++ +A+ I H+C P++ D Q A++ I PG+ LS+ Y D + R
Sbjct: 104 ESQHLGVYAQAAAINHDCRPSINYRLNDITQTTT-AVREIQPGEELSVSYVDLMLPHKQR 162
Query: 157 RHHLQTSKYFICQCERC 173
R L+ F C+C +C
Sbjct: 163 RQRLRDWG-FDCKCSKC 178
>gi|429859827|gb|ELA34590.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
Nara gc5]
Length = 648
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 84 GIIQVNAHEMPLTEPSYIA-IFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDH 141
GI + NA +PL + IF AS I H+C N ++ D GQ+ + A++ I G+
Sbjct: 412 GIAKTNA--LPLGSGAREGGIFLEASRINHSCRHNAQNTWNADRGQLTVHALRDIDDGEE 469
Query: 142 LSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
++I Y +R+ L+ S F C CE C P
Sbjct: 470 ITISYLGASENYSSRQSRLKESFGFTCACELCTLP 504
>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 386
Score = 45.1 bits (105), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 36/73 (49%), Gaps = 1/73 (1%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+A+ +S+ H+C PN D + +++ PI GD L+I Y + +R+ L
Sbjct: 249 VAVSPSSSYFNHSCIPNC-TDVRDGSNMTFKSLYPIKKGDQLTISYIELDQPIQDRKDEL 307
Query: 161 QTSKYFICQCERC 173
+ YF C C RC
Sbjct: 308 KYGYYFDCICPRC 320
>gi|71663779|ref|XP_818878.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884153|gb|EAN97027.1| hypothetical protein Tc00.1047053510729.180 [Trypanosoma cruzi]
Length = 729
Score = 45.1 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 101 IAIFDRASFIEHNCYPNLYKSF----TDSGQVL--LRAMKPIAPGDHLSICYTDPLWGTI 154
I +F S IEH+C+PN F T V LRA +PI G+ ++I Y
Sbjct: 373 IGVFGGISLIEHSCHPNAIVVFRHGCTPESIVFAELRATRPIGIGERITIAYVPTFIPKE 432
Query: 155 NRRHHLQTSKYFICQCERC 173
R+ L+ +F C C C
Sbjct: 433 ERQKRLRAKFFFSCACVHC 451
>gi|71659612|ref|XP_821527.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70886909|gb|EAN99676.1| hypothetical protein Tc00.1047053506885.250 [Trypanosoma cruzi]
Length = 729
Score = 45.1 bits (105), Expect = 0.014, Method: Composition-based stats.
Identities = 26/79 (32%), Positives = 36/79 (45%), Gaps = 6/79 (7%)
Query: 101 IAIFDRASFIEHNCYPNLYKSF----TDSGQVL--LRAMKPIAPGDHLSICYTDPLWGTI 154
I +F S IEH+C+PN F T V LRA +PI G+ ++I Y
Sbjct: 373 IGVFGGISLIEHSCHPNALVVFRHGCTPESIVFAELRATRPIGIGERITIAYVPTFIPKE 432
Query: 155 NRRHHLQTSKYFICQCERC 173
R+ L+ +F C C C
Sbjct: 433 ERQKRLRAKFFFSCACVHC 451
>gi|402086338|gb|EJT81236.1| hypothetical protein GGTG_01220 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 470
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 49/114 (42%)
Query: 72 EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR 131
+ T+E K+ ++ N+ ++A + S + H+C PNL + S
Sbjct: 232 QSATEERGRKVKDVMDTNSFNAAAGGHPHVASYLDVSRLNHDCRPNLVFHVSPSLSHTTY 291
Query: 132 AMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDG 185
A++ IA G+ L+I Y DP RR + S F C C +C L DG
Sbjct: 292 AVRDIAAGEELTISYLDPFRAREVRRARAERSWGFACACSQCALAPALAAESDG 345
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 34/148 (22%), Positives = 66/148 (44%), Gaps = 14/148 (9%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEE-----ILKICGIIQVNAH 91
+ L +H++E S + + +R ++ K +E++ E I + I+VN+
Sbjct: 144 FLSLRSHLDEITKSNQEQYER------IMLSAKAEKEYSGSELDVETIAEYLARIEVNSF 197
Query: 92 EM--PLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDP 149
+P + I A ++ H+C PN F D G + ++A++ I + + I Y D
Sbjct: 198 TFTTAFGDPLGLCIQPFACYMNHSCEPNAVVGF-DGGLITVKALREIKSDEQVFISYIDN 256
Query: 150 LWGTINRRHHLQTSKYFICQCERCRDPT 177
+ R+ L +F C+C +C T
Sbjct: 257 TYPFEVRQKQLAERYFFTCKCSKCAQGT 284
>gi|336266212|ref|XP_003347875.1| hypothetical protein SMAC_06707 [Sordaria macrospora k-hell]
gi|380091808|emb|CCC10536.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 518
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 2/68 (2%)
Query: 108 SFIEHNCYPNL-YKSFTDSGQV-LLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY 165
S + H+C PN Y+ + G V ++ A++ I PG+ ++I Y +PL R+ HL+T
Sbjct: 286 SRLNHDCRPNADYRFDWEKGLVQVITAVRDIFPGEEITISYINPLQSRHARQKHLRTVWG 345
Query: 166 FICQCERC 173
F C CE C
Sbjct: 346 FDCSCELC 353
>gi|310789329|gb|EFQ24862.1| hypothetical protein GLRG_00006 [Glomerella graminicola M1.001]
Length = 460
Score = 45.1 bits (105), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 3/76 (3%)
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRR 157
+ A+F S + H+C PN+ F D + R A++ I PG+ L++ Y DP+ RR
Sbjct: 249 FYAVFVNTSRMNHDCRPNVDYYF-DPRTLTQRTVALRDIIPGEELTLSYIDPMQSREARR 307
Query: 158 HHLQTSKYFICQCERC 173
L ++ F C C C
Sbjct: 308 ERLHSTWGFHCSCHHC 323
>gi|429857737|gb|ELA32586.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
Nara gc5]
Length = 274
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 39/83 (46%), Gaps = 3/83 (3%)
Query: 96 TEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTI 154
T P A+F + + + H+C PN +F + A + I P + ++I Y D G
Sbjct: 83 TSPGRTALFPQVARLNHSCVPNAQGNFNKKLDAFTVHATRDIKPEEEITISYLDEHLGLR 142
Query: 155 N-RRHHLQTSKYFICQCERCRDP 176
R+ HLQ F+C C C DP
Sbjct: 143 QARQDHLQDGYGFLCDCSAC-DP 164
>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 477
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 68/152 (44%), Gaps = 15/152 (9%)
Query: 36 KLMQLEAHVEEYK-NSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
K +L H E+ K + KYE+ + + ++ F+ +E +++ I +N+H +
Sbjct: 119 KWSELLGHEEDIKADQAKYEHFEKIFNK--MSIFRKDEMIDKDKFYIIFCKTSINSHSIH 176
Query: 95 LTEPSYI--AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLS------ICY 146
+ + + A+ S +H+C PN F D Q LR P+ P + S I Y
Sbjct: 177 TSAGTEVGMALDLGVSAYDHSCRPNCSLVF-DGYQACLR---PLTPQTNASDPRTSFISY 232
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
D RR L+ YF C CERC DP +
Sbjct: 233 IDIGRSRYQRRKELKAKWYFDCACERCCDPAD 264
>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
Length = 481
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 69/147 (46%), Gaps = 16/147 (10%)
Query: 35 DKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMP 94
D+ +QL H+++ ++ + +R ++ + + + EE++ G +++ +
Sbjct: 131 DQFLQLGTHIKDIRDQSASQWERISLSSKAIKAYSGTD--MSEEVISAMGA-KLDLNSFN 187
Query: 95 LTEPSYIAIFDR--------ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY 146
LT A++DR A+ H+C N SF D + ++A++PI + + I Y
Sbjct: 188 LTN----AVYDRLGVYLHPYAAIFNHSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITY 242
Query: 147 TDPLWGTINRRHHLQTSKYFICQCERC 173
D R+ +LQ+ YF C C +C
Sbjct: 243 IDVTDPYPIRQANLQSRYYFTCHCSKC 269
>gi|389626775|ref|XP_003711041.1| hypothetical protein MGG_08614 [Magnaporthe oryzae 70-15]
gi|351650570|gb|EHA58429.1| hypothetical protein MGG_08614 [Magnaporthe oryzae 70-15]
gi|440466565|gb|ELQ35825.1| hypothetical protein OOU_Y34scaffold00685g3 [Magnaporthe oryzae
Y34]
Length = 399
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINR 156
E ++ ++ +A+ I H+C P++ D Q A++ I PG+ LS+ Y D + R
Sbjct: 191 ESQHLGVYAQAAAINHDCRPSINYRLNDITQTTT-AVREIQPGEELSVSYVDLMLPHKQR 249
Query: 157 RHHLQTSKYFICQCERC 173
R L+ F C+C +C
Sbjct: 250 RQRLRDWG-FDCKCSKC 265
>gi|331241839|ref|XP_003333567.1| hypothetical protein PGTG_14989 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312557|gb|EFP89148.1| hypothetical protein PGTG_14989 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 425
Score = 45.1 bits (105), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 31/125 (24%), Positives = 55/125 (44%), Gaps = 19/125 (15%)
Query: 61 VQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYK 120
+ L++ EF+++E L+ ++ EP+ + H C PN+
Sbjct: 227 ISSLVDMNVFGSEFSEDESLEFGALV---------LEPARL---------NHACRPNVVY 268
Query: 121 SFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTEL 179
+ Q+L +RA++P+A G+ L+I Y + RR L+ + F C C C+ EL
Sbjct: 269 YLDEGTQILYMRALEPVAKGEELTINYREYELPRQERRDILEEAYGFNCTCSHCQMSEEL 328
Query: 180 NTFYD 184
D
Sbjct: 329 GKLSD 333
>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
Length = 331
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNL----YKSFTDSG--QVLLRAMKPIAPGDHLSICYT 147
PL + A++ RAS + H+C PN Y G +++RA+ I G + + Y
Sbjct: 132 PLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYF 191
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCR 174
W +R+ L F C+CERC+
Sbjct: 192 AANWQYKDRQQRLLEDYGFRCECERCQ 218
>gi|406867181|gb|EKD20220.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 709
Score = 45.1 bits (105), Expect = 0.016, Method: Composition-based stats.
Identities = 20/72 (27%), Positives = 36/72 (50%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A+++ A+ H+C PN+ + FT + + I G+ L+I Y + ++ RR L
Sbjct: 495 AVYEVAARFNHDCRPNVRRGFTKENCIAFVTAREITRGEELTISYLNMAGMSVARRKELT 554
Query: 162 TSKYFICQCERC 173
F C+C+ C
Sbjct: 555 RWWGFECKCDAC 566
>gi|225441557|ref|XP_002276611.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Vitis
vinifera]
Length = 327
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 36/72 (50%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN++ + D+ L+A++ I G+ L ICY D R+ L
Sbjct: 252 AVYMLPSFYNHDCDPNVHIIWIDNVNARLKALREIEAGEELRICYIDASMDHDARQTILF 311
Query: 162 TSKYFICQCERC 173
F C C RC
Sbjct: 312 QGFGFRCSCLRC 323
>gi|67587953|ref|XP_665287.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54655890|gb|EAL35057.1| hypothetical protein Chro.70122, partial [Cryptosporidium hominis]
Length = 1007
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 1/69 (1%)
Query: 122 FTDSGQVLLRAMKPIAPGDHLSICYTDPLWG-TINRRHHLQTSKYFICQCERCRDPTELN 180
D ++ +R+ I +L+I Y L+ T R+ + + K F C C RC D +E N
Sbjct: 1 MNDEHKLCIRSAYNIPENANLTISYILDLYQPTFKRKSIVNSPKVFACMCSRCLDDSENN 60
Query: 181 TFYDGVKCP 189
F +G++CP
Sbjct: 61 LFLEGIRCP 69
>gi|219125459|ref|XP_002182999.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405793|gb|EEC45735.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 528
Score = 45.1 bits (105), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 43/92 (46%), Gaps = 7/92 (7%)
Query: 103 IFDRASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPLW--GTINRR 157
++D S I H+C PN T+ + + A PIA GD ++I Y L R+
Sbjct: 186 VYDSISRINHSCDPNAVVQLGLGTEQDRQSIVACAPIANGDEITISYLGLLLYADRPTRQ 245
Query: 158 HHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
L +K+F C C+RC+ T L + CP
Sbjct: 246 ASLLGTKHFTCACDRCK--TSLPDNASAIPCP 275
>gi|451996075|gb|EMD88542.1| hypothetical protein COCHEDRAFT_1226723 [Cochliobolus
heterostrophus C5]
Length = 468
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/89 (25%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFT-DSGQVLLRAMKPIAPGDHLSI 144
+ N E+ ++ + +F + + H+C PN F D+ + A++PI PG+ ++I
Sbjct: 240 LTTNYFELTVSGTTLTGLFPEIAMMNHDCRPNAAYFFDQDTMTHYVHAIRPIYPGEEITI 299
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERC 173
Y + + R L+T+ F C C C
Sbjct: 300 TYINNEVTRVQRMGRLRTNWGFTCACSAC 328
>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 389
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 107 ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL--WGTINRRHHLQTSK 164
AS+ H+C PN Y F D + L+A++ IA + + I Y D WG R+ LQ
Sbjct: 124 ASYANHSCEPNAYIGF-DGPVIYLKALQDIALDEQIFISYIDNTEPWG--KRQSELQKRY 180
Query: 165 YFICQCERC 173
+F C+C +C
Sbjct: 181 FFTCKCPKC 189
>gi|429852402|gb|ELA27540.1| lysine methyltransferase [Colletotrichum gloeosporioides Nara gc5]
Length = 444
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 39/79 (49%), Gaps = 3/79 (3%)
Query: 97 EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTI 154
+ + A+F S + H+C PN+ F D + R A++ I PG+ L++ Y DP+
Sbjct: 234 DDDFYAVFVNTSRMNHDCRPNVDYWF-DPRTLTQRTTAIRDIMPGEELTLSYIDPMQSRK 292
Query: 155 NRRHHLQTSKYFICQCERC 173
RR L + F C C C
Sbjct: 293 ARRERLHGTWGFHCSCHHC 311
>gi|412985398|emb|CCO18844.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 3/75 (4%)
Query: 102 AIFDRASFIEHNCYPNL--YKSFTD-SGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRH 158
A+F S H+C PN +K TD +G +L A KPI G+ L+I Y + +RR
Sbjct: 439 ALFALQSNCNHDCDPNCHPFKDETDINGSCVLVARKPIRKGEELTISYLEDDQLDWSRRQ 498
Query: 159 HLQTSKYFICQCERC 173
+ F+C+C RC
Sbjct: 499 DALSDYGFVCRCARC 513
>gi|336275947|ref|XP_003352727.1| hypothetical protein SMAC_12588 [Sordaria macrospora k-hell]
gi|380094617|emb|CCC07997.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 398
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRH 158
++ +F S + H+C PNL + A++PI PG+ L+I Y D L +R+
Sbjct: 176 GHLGVFPEVSRLNHDCRPNLNSRIENITHTTT-AVRPILPGEELTISYIDGLLPLHSRQE 234
Query: 159 HLQTSKYFICQCERC-RDPTEL 179
LQ + F C C C P+E+
Sbjct: 235 RLQDWR-FNCTCSLCSSSPSEI 255
>gi|119189249|ref|XP_001245231.1| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
gi|303323233|ref|XP_003071608.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|240111310|gb|EER29463.1| hypothetical protein CPC735_071450 [Coccidioides posadasii C735
delta SOWgp]
gi|320031288|gb|EFW13261.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
gi|392868134|gb|EAS33877.2| hypothetical protein CIMG_04672 [Coccidioides immitis RS]
Length = 338
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 2/94 (2%)
Query: 84 GIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDS-GQVLLRAMKPIAPGDHL 142
GI NA + Y IF + S H C PN +S+ Q ++ +K ++ G+ +
Sbjct: 102 GIAATNAMLIDFETTEY-GIFPKVSRFNHGCRPNSMRSYHPVLDQAVVHVVKDVSEGEEI 160
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERCRDP 176
++ Y +P R+ L+ FIC C C P
Sbjct: 161 TVSYVEPGLAFYLRQEQLKEKFGFICGCNLCLMP 194
>gi|358400070|gb|EHK49401.1| hypothetical protein TRIATDRAFT_183670, partial [Trichoderma
atroviride IMI 206040]
Length = 350
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 5/91 (5%)
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVL---LRAMKPIAPGDHL 142
I NA + L E +Y A+F + + H+C PN F Q L + A+ I PG +
Sbjct: 150 IDTNAFVLELNEVTYYAVFPETARLNHDCRPNAAYFF--DKQTLTHYVHAITDITPGTEI 207
Query: 143 SICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+I Y DP R L + F C C C
Sbjct: 208 TITYVDPHMPRQKRLKKLSSLWGFDCSCSLC 238
>gi|303279855|ref|XP_003059220.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459056|gb|EEH56352.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 591
Score = 44.7 bits (104), Expect = 0.017, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY-TDPLWGTIN-RRH 158
+ +F AS I H C PN + A++ I PG+ +++ Y + L+ + RR
Sbjct: 240 VTLFANASAINHACAPNAAVAVAGEDVATAYALRAIEPGEEITVSYASSSLYLPLTMRRM 299
Query: 159 HLQTSKYFICQCERC-RD 175
L F+C+CERC RD
Sbjct: 300 ALSRRWGFVCECERCARD 317
>gi|50294041|ref|XP_449432.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528746|emb|CAG62408.1| unnamed protein product [Candida glabrata]
Length = 374
Score = 44.7 bits (104), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 44/78 (56%), Gaps = 2/78 (2%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHH-LQ 161
+F +AS+ H+C PN+ K+ D +++ + +A G+ L+I Y+ L ++RR L
Sbjct: 296 VFPKASYFNHSCDPNITKTRIDR-KMVFTLNRDVACGEELNIDYSGVLDLPVDRRQKFLF 354
Query: 162 TSKYFICQCERCRDPTEL 179
+ +F+C C RC +L
Sbjct: 355 ENWFFVCGCNRCNSDLKL 372
>gi|351710111|gb|EHB13030.1| SET and MYND domain-containing protein 2 [Heterocephalus glaber]
Length = 527
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQT 162
+F + + H+C PN+ ++ + +RA++ I PG+ + Y D L+ T +R L+
Sbjct: 266 VFLGVALMNHSCCPNVIVTYRGT-LAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRD 324
Query: 163 SKYFICQCERC 173
S +F C+C C
Sbjct: 325 SYFFTCECREC 335
>gi|340960913|gb|EGS22094.1| hypothetical protein CTHT_0039800 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 247
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 8/94 (8%)
Query: 81 KICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD 140
++ I+ NAH + L + ++ +F + + I H C P L + +VL A + I PG+
Sbjct: 49 QVDAILGPNAHTVMLADEVHVGLFTQLARINHACRPKL------TLEVL--AYRTIQPGE 100
Query: 141 HLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
++I Y R+ +L+ F C+CE CR
Sbjct: 101 EITISYVPLEMPVDERKKYLKDHWGFDCRCELCR 134
>gi|151942634|gb|EDN60980.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 373
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 2/74 (2%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQ 161
+F AS+ H+C PN+ K + +L + I + + I Y+ L T+ RR L
Sbjct: 295 VFPEASYFNHSCNPNITK-YRKGNSMLFTVNRDIKKDEQICIDYSGVLDLPTVKRRAFLA 353
Query: 162 TSKYFICQCERCRD 175
+ +F+C CERC+
Sbjct: 354 DNWFFVCACERCKS 367
>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 521
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 61/148 (41%), Gaps = 8/148 (5%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDR---RNVVQFLLNFFKLNEEFTQEEIL--KICGI--I 86
++ +L ++VE Y S K + VV+ K+ +EF IL C I
Sbjct: 285 YNDYAELVSNVENYNESLKESLSYWICKYVVKLSAKLGKIEDEFDLLNILLRNRCNAFYI 344
Query: 87 QVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHLSIC 145
Q + E ++ R SF H+C PN+ Y ++ +V +K + GD L+I
Sbjct: 345 QGRPRDGSSGESRGCGVYVRNSFFNHSCDPNVNYWVVNNTLEVECTLLKNVKEGDELTIS 404
Query: 146 YTDPLWGTINRRHHLQTSKYFICQCERC 173
Y D RR L F C C +C
Sbjct: 405 YIDTTSPLNKRREKLLEGYLFNCLCTKC 432
>gi|331224545|ref|XP_003324944.1| hypothetical protein PGTG_06481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309303934|gb|EFP80525.1| hypothetical protein PGTG_06481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 444
Score = 44.7 bits (104), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 7/98 (7%)
Query: 82 ICGIIQVNAHEMPLTEPS---YIAIFDRASFIEHNCYPN-LYKSFTDSGQVLLRAMKPIA 137
I I +NA E+ L S + A+F S + H+C PN + ++S ++ + A++ I
Sbjct: 231 ISSAIDMNAFEINLGNDSDVPFFAVFLTPSRLNHDCRPNTAFHVNSESLEIHMHALRTIN 290
Query: 138 PGDHLSICYTDPLWGTINRRHHLQTSKY-FICQCERCR 174
PG+ ++I Y D I + S Y F C C CR
Sbjct: 291 PGEEMTISYRD--MAQIREQRQEDISNYGFQCTCAHCR 326
>gi|429860343|gb|ELA35083.1| het domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 642
Score = 44.7 bits (104), Expect = 0.019, Method: Composition-based stats.
Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 81 KICGIIQVNAHEMPLT-----EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKP 135
K+ I+Q N+ +M L + + + S H+C PN+ G+ ++P
Sbjct: 430 KVVDIMQTNSFQMDLGGGAQGDGHHYGNYPEVSRYNHDCRPNVAFHIGADGRHRTTVVRP 489
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
+ G+ L+I Y D L R+H + + F C C +C
Sbjct: 490 VKSGEELTISYLDQLGVRSERQHRAKLAWGFECGCSQC 527
>gi|12834773|dbj|BAB23038.1| unnamed protein product [Mus musculus]
Length = 229
Score = 44.7 bits (104), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 21/64 (32%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 110 IEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQ 169
+ H+C PN+ ++ + +RA++ I PGD + Y D L+ T +R L+ S +F C+
Sbjct: 1 MNHSCCPNVIVTYKGT-LAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCE 59
Query: 170 CERC 173
C C
Sbjct: 60 CREC 63
>gi|428176697|gb|EKX45580.1| hypothetical protein GUITHDRAFT_108453 [Guillardia theta CCMP2712]
Length = 586
Score = 44.7 bits (104), Expect = 0.020, Method: Composition-based stats.
Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 12/154 (7%)
Query: 29 HQPQIWDKLMQLEAHVEEYKNSPKYENDRRNV-VQFLLNFFKLNEEF--TQEEILKICGI 85
+Q W L H + S Y D R+ ++ N + +EF +Q+ +L +
Sbjct: 205 NQSDEWLLTYLLLQHGSDEAWSASYVTDARSAEIESNENVAWIAKEFGCSQKTVLSVFRA 264
Query: 86 IQVNAHEM--PLTEPSYIAIFDRA-SFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHL 142
+ NA + L Y A F RA S+ H+C+PN + S G + + +P G+ L
Sbjct: 265 VANNAFSLDTALLRVKYGAAFYRAASYFNHSCFPNCF-SRRMGGNMAMFTNRPCKQGEEL 323
Query: 143 SICY--TDPLWGTINRRHHLQTSKYFICQCERCR 174
+ Y + L I R + +F+C+CERCR
Sbjct: 324 THSYLPVELLAAPIEVR---AANLHFVCECERCR 354
>gi|401623295|gb|EJS41399.1| set6p [Saccharomyces arboricola H-6]
Length = 373
Score = 44.7 bits (104), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
Query: 103 IFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPL-WGTINRRHHLQ 161
+F AS+ H+C PN+ K +L + I G+ + I Y+ L T+ RR+ L
Sbjct: 295 VFPEASYFNHSCNPNVTKH-RKGNSMLFKVNTDIKKGEQICIDYSGVLSLPTLERRNFLA 353
Query: 162 TSKYFICQCERC 173
S +F C CERC
Sbjct: 354 DSWFFDCACERC 365
>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
Length = 392
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNL----YKSFTDSG--QVLLRAMKPIAPGDHLSICYT 147
PL + A++ RAS + H+C PN Y G +++RA+ I G + + Y
Sbjct: 193 PLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYF 252
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCR 174
W +R+ L F C+CERC+
Sbjct: 253 AANWQYKDRQQRLLEDYGFRCECERCQ 279
>gi|116194252|ref|XP_001222938.1| hypothetical protein CHGG_03724 [Chaetomium globosum CBS 148.51]
gi|88179637|gb|EAQ87105.1| hypothetical protein CHGG_03724 [Chaetomium globosum CBS 148.51]
Length = 262
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 28/97 (28%), Positives = 49/97 (50%), Gaps = 4/97 (4%)
Query: 81 KICGIIQVNAHEMPLTEPS-YIAIFDRASFIEHNCYPNLYKSFTDSGQVL-LRAMKPIAP 138
++ I+ N++ + + E ++ +F A+ I H C PN Y F++ + + A + I P
Sbjct: 52 EVDAIMGSNSNTITVGEDEVHVGLFTEAARINHACRPNAYYRFSERRLTMEVVAFRAIQP 111
Query: 139 GDHLSICYTDPLWGTIN-RRHHLQTSKYFICQCERCR 174
G+ + + Y PL + RR +LQ F C C CR
Sbjct: 112 GEEIFMSYV-PLETPVEERRKYLQDHWGFNCACSLCR 147
>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 501
Score = 44.7 bits (104), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 9/84 (10%)
Query: 98 PSYIAIFDRASFIEHNCYPNL---YKSFTDSGQVLLRAMKPIAPGDHLSICY-----TDP 149
P +A+F + I H+C PN + F D+ ++ + A++ IA G+ ++I Y T
Sbjct: 415 PEIVAVFPLTARINHSCVPNAQVQSQEFVDA-RIDVVALRDIAAGEEITISYIGCGRTSG 473
Query: 150 LWGTINRRHHLQTSKYFICQCERC 173
T RR L F C+C RC
Sbjct: 474 SKSTSRRRRELLAKYLFTCECPRC 497
>gi|325187786|emb|CCA22331.1| SET and MYND domain containing 2 putative [Albugo laibachii Nc14]
Length = 507
Score = 44.3 bits (103), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 6/121 (4%)
Query: 64 LLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSY--IAIFDRASFI-EHNCYPNLYK 120
LL KL+ + + + ++ NAH + L + +F A+ + H+C PN+
Sbjct: 258 LLRGSKLSHFVSPTQAHDVLLKVRSNAHPLVLNGSVTCGLGLFPEAAMVFNHSCSPNIIL 317
Query: 121 SFTDSGQVLLRA--MKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE 178
+F G +LRA ++PI P L Y D L RR L + F C C RC + +
Sbjct: 318 AF-QPGTRMLRAHSIRPIQPRQALEYAYIDLLPSKSRRRQLLNDAFAFDCSCLRCYEEGK 376
Query: 179 L 179
L
Sbjct: 377 L 377
>gi|440471791|gb|ELQ40736.1| ankyrin repeat protein [Magnaporthe oryzae Y34]
gi|440479502|gb|ELQ60265.1| ankyrin repeat protein [Magnaporthe oryzae P131]
Length = 2018
Score = 44.3 bits (103), Expect = 0.022, Method: Composition-based stats.
Identities = 28/97 (28%), Positives = 46/97 (47%), Gaps = 6/97 (6%)
Query: 81 KICGIIQVNAHEMPLTEP-SYIAIFDRASFIEHNCYPNLYKSF---TDSGQVLLRAMKPI 136
K+ I NA+ + + + F S + H C PN F T S ++ A++ I
Sbjct: 1660 KVHKIYSTNAYRTSIRRGVDFHSTFVEVSRMNHECSPNCAAYFDPMTMSQRIY--AIRDI 1717
Query: 137 APGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
PG+ L++ Y DP+ R++ L+ F C C+RC
Sbjct: 1718 MPGEELTVSYIDPVQTREARQNRLRKDWGFGCSCQRC 1754
>gi|79507571|ref|NP_196280.2| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
gi|229488103|sp|Q9FG08.2|ATXR4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR4; AltName:
Full=Protein SET DOMAIN GROUP 38; AltName:
Full=Trithorax-related protein 4; Short=TRX-related
protein 4; Flags: Precursor
gi|332003660|gb|AED91043.1| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
Length = 325
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 35/73 (47%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + + L ++ + G+ L ICY D G R+ L
Sbjct: 250 AVYMLPSFYNHDCDPNAHIIWLHNADARLNTLRDVEEGEELRICYIDASMGYEARQTILS 309
Query: 162 TSKYFICQCERCR 174
F+C C RC+
Sbjct: 310 QGFGFLCNCLRCQ 322
>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
Length = 392
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 41/87 (47%), Gaps = 6/87 (6%)
Query: 94 PLTEPSYIAIFDRASFIEHNCYPNL----YKSFTDSG--QVLLRAMKPIAPGDHLSICYT 147
PL + A++ RAS + H+C PN Y G +++RA+ I G + + Y
Sbjct: 193 PLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYF 252
Query: 148 DPLWGTINRRHHLQTSKYFICQCERCR 174
W +R+ L F C+CERC+
Sbjct: 253 AANWQYKDRQQRLLEDYGFRCECERCQ 279
>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
Length = 545
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ ASFI H+C PN + V++ A + I G+ +S Y D L + +R +
Sbjct: 331 VGLWTLASFINHSCIPNARRLHVGD-YVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 388
Query: 161 QTSKYFICQCERCR 174
S F C C RC+
Sbjct: 389 AESWGFCCGCSRCK 402
>gi|302689349|ref|XP_003034354.1| hypothetical protein SCHCODRAFT_256449 [Schizophyllum commune H4-8]
gi|300108049|gb|EFI99451.1| hypothetical protein SCHCODRAFT_256449 [Schizophyllum commune H4-8]
Length = 344
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 34/130 (26%), Positives = 55/130 (42%), Gaps = 12/130 (9%)
Query: 63 FLLNFFKLNEEF--TQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNC---YPN 117
F L++ L ++F + I Q NA + + IF R + + H C + +
Sbjct: 100 FDLSYVGLPDDFDPASDPDAHALAIFQTNA----VAAGDMVGIFPRMARLNHGCVRAFNS 155
Query: 118 LYKSFTDSGQVLLRAMKPIAPGDH---LSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+Y G +++ A+KPIA G+ L Y D +RR HL+ F C C C
Sbjct: 156 VYSWREKEGALVVHALKPIAKGERSQELLTTYFDTKRARDDRRAHLKERYGFECNCSVCA 215
Query: 175 DPTELNTFYD 184
P ++ D
Sbjct: 216 LPEAKSSALD 225
>gi|402219261|gb|EJT99335.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 445
Score = 44.3 bits (103), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 56/125 (44%), Gaps = 18/125 (14%)
Query: 58 RNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPN 117
R +V+ L+ E F+ L+ G + +N S+ +F S + H+C PN
Sbjct: 301 RKIVKKTLSEDVRTELFSYRGFLRGLGRVSLNLD-------SHGGLFLLHSHMNHSCIPN 353
Query: 118 LYKSFTDSGQ--------VLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFI-C 168
+ + Q + L A + I PG+ L++ Y +P W R+ L+ ++ + C
Sbjct: 354 MAVKHPPAAQAQKYPSSRIALVANRDIQPGEELTVTYVNPTWPYRRRKEELK--EWGVEC 411
Query: 169 QCERC 173
CERC
Sbjct: 412 DCERC 416
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 58/137 (42%), Gaps = 12/137 (8%)
Query: 59 NVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLTE--PSYIAIFDRASFIEHNCYP 116
N+V +L + ++N +EI NAH + +E P ++ S I H+C P
Sbjct: 196 NLVHLILQWPEIN----IKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSIINHSCLP 251
Query: 117 NLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR-- 174
N F D + +RA++ I G + I Y + T+ R+ L+ F C C C
Sbjct: 252 NSVLVF-DGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYLFQCVCPLCSKV 310
Query: 175 ---DPTELNTFYDGVKC 188
+ N +G +C
Sbjct: 311 GQYEDVRENAILEGYRC 327
>gi|330846003|ref|XP_003294847.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
gi|325074603|gb|EGC28624.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
Length = 428
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 68/150 (45%), Gaps = 10/150 (6%)
Query: 34 WDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFF-KLNEEFTQEEILKI-----CGI-- 85
++ L ++VE Y S K E+ + ++++ K+++E ++E+L I C
Sbjct: 228 YNDYADLVSNVENYNESLK-ESLSYWICKYIVKLAAKIDKEEDEDELLNILLRNRCNAFY 286
Query: 86 IQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNL-YKSFTDSGQVLLRAMKPIAPGDHLSI 144
IQ + E ++ R SF H+C PN+ Y ++ +V +K + G+ L+I
Sbjct: 287 IQGRPRDGTSGESRGCGVYVRNSFFNHSCDPNVNYWVVNNTLEVECTLLKNVKEGEELTI 346
Query: 145 CYTDPLWGTINRRHHLQTSKYFICQCERCR 174
Y D RR L F C C +C+
Sbjct: 347 SYIDTSAPLNKRREKLLEGYLFTCLCTKCK 376
>gi|242016107|ref|XP_002428677.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212513348|gb|EEB15939.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 697
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 37/137 (27%), Positives = 64/137 (46%), Gaps = 29/137 (21%)
Query: 64 LLNFF-----KLNEEFTQEEILKICGII------QVNAHE--------MPLTEPS----- 99
L N+F K ++F ++EIL I+ Q NAHE + TE S
Sbjct: 438 LTNYFESPENKTTDDFNEDEILTASIILHHLELLQFNAHEIYETRVEELYKTEDSKTVFI 497
Query: 100 YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGT---INR 156
I ++ + H C P++ + F+ ++++A++P+A D LS Y P +G + R
Sbjct: 498 AIGVYPTVALFNHECSPSVTRYFS-GKNIIIKAVRPLATNDILSDNY-GPHYGKKTLLER 555
Query: 157 RHHLQTSKYFICQCERC 173
L + +F C+C+ C
Sbjct: 556 TRELTSRYWFRCRCQAC 572
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 74/186 (39%), Gaps = 22/186 (11%)
Query: 11 PHPSYECIIALRCLYKKSHQPQIWDKLMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKL 70
P P+ I AL L+ K I D L LE HV E + D + F L
Sbjct: 105 PTPTRALIQAL--LWGK-----IRDGLADLEGHVLEKRAEGDEWRDIEMMAVAGCAFSGL 157
Query: 71 N--EEFTQEEILKICGIIQVNAHEMPLTEPSYIAIF--DRASFIEHNCYPNLYKSFTDSG 126
++ + +C I Q N+ + +E ++ + H+C PN F
Sbjct: 158 GRGDQDVRMAAEMLCKI-QNNSFQRFDSELGVAGLYLEPTLAMANHSCIPNASVQFIGRN 216
Query: 127 QVLLRAMKPIAPGDHLSICYT---DPLWGTINRRHHLQTSKYFICQCERCRDPTELNTFY 183
+L+ A PI GD + I YT DPL RR L K F CQC RCRD LN +
Sbjct: 217 ALLI-AENPIRAGDEIEIAYTFYTDPL---PKRREALAHYK-FTCQCLRCRD--NLNVYQ 269
Query: 184 DGVKCP 189
P
Sbjct: 270 VAAISP 275
>gi|10178118|dbj|BAB11411.1| unnamed protein product [Arabidopsis thaliana]
Length = 258
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 35/73 (47%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
A++ SF H+C PN + + + L ++ + G+ L ICY D G R+ L
Sbjct: 183 AVYMLPSFYNHDCDPNAHIIWLHNADARLNTLRDVEEGEELRICYIDASMGYEARQTILS 242
Query: 162 TSKYFICQCERCR 174
F+C C RC+
Sbjct: 243 QGFGFLCNCLRCQ 255
>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 359
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 2/74 (2%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLR--AMKPIAPGDHLSICYTDPLWGTINRRHH 159
++ H+C PNL +++SG L A++ I PG+ L I Y D R+
Sbjct: 259 GVYTVGCLFNHSCEPNLQVVYSESGDETLSVVALRDIEPGEELCISYIDESLSYPERQQE 318
Query: 160 LQTSKYFICQCERC 173
L F CQC +C
Sbjct: 319 LYEHYLFFCQCPKC 332
>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
AI+ RAS H+C N F D ++ LR +K + G+ +I Y D + RR L+
Sbjct: 262 AIYLRASMANHSCDYNCIVVF-DERKLQLRTVKDVQEGEECTIGYVDVIHPAKERRAELE 320
Query: 162 TSKYFICQCERC 173
+F C+C +C
Sbjct: 321 EKYHFTCKCVKC 332
>gi|340054943|emb|CCC49251.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 434
Score = 44.3 bits (103), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 6/105 (5%)
Query: 71 NEEFTQEEILKICGIIQVNAHEMPLTEPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLL 130
E EEIL + IIQ N +P A++ SF+ H+C PN + + + L
Sbjct: 330 QEHLYMEEILSLFAIIQTNEFLLP----GGTALYHGYSFLNHSCEPNCAIAGSSAINRRL 385
Query: 131 RAMKPIAPGDHLSICYTDPLWGTIN--RRHHLQTSKYFICQCERC 173
A++ I G+ L I Y L ++ R L ++F C C RC
Sbjct: 386 VALRNIREGEQLMINYNARLTTCVSYEDRRALCMQRHFECFCPRC 430
>gi|67524199|ref|XP_660161.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
gi|40745506|gb|EAA64662.1| hypothetical protein AN2557.2 [Aspergillus nidulans FGSC A4]
gi|259487990|tpe|CBF87094.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 638
Score = 44.3 bits (103), Expect = 0.023, Method: Composition-based stats.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 98 PSY-IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD-PLWGTIN 155
P Y +A + RA+ H+C PNL + +++L A + IA G+ I Y D + +N
Sbjct: 553 PQYGLACYPRATLCNHSCVPNLKHGPDEQSRMVLTATRDIAAGEECCISYFDLTVHVDLN 612
Query: 156 -RRHHLQTSKYFICQCERC 173
RR + F C CERC
Sbjct: 613 ARRKRTRELFTFSCTCERC 631
>gi|451852752|gb|EMD66047.1| hypothetical protein COCSADRAFT_86951 [Cochliobolus sativus ND90Pr]
Length = 735
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 102 AIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQ 161
AI+ +AS+I H+C N+++SF +++RA K + G L+ Y T N L+
Sbjct: 500 AIWTKASYINHSCIGNVFRSFIGD-MMIIRAAKDLEAGTELTFSYATSDEAT-NIEQKLK 557
Query: 162 TSKYFICQCERCRD 175
F C C RC D
Sbjct: 558 NWG-FACSCARCED 570
>gi|357474929|ref|XP_003607750.1| SET domain protein [Medicago truncatula]
gi|355508805|gb|AES89947.1| SET domain protein [Medicago truncatula]
Length = 540
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 27/115 (23%), Positives = 56/115 (48%), Gaps = 19/115 (16%)
Query: 73 EFTQEEILKICGIIQVNAHEMPLTEPSY-------------IAIFDRASFIEHNCYPNLY 119
E + +++K+ I+ VN+ LTE + + ++ SF+ H+C PN
Sbjct: 285 EDVEIDMVKLLAILDVNS----LTEDAVSANVLRKNNDCYGVGLWLLPSFVNHSCCPNAR 340
Query: 120 KSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCR 174
+ +++ A + + G+ ++ Y DPL +N+R L + +C+C+RC+
Sbjct: 341 RLHVGD-YLIVHASRDLKAGEEITFAYLDPL-SPLNKRKELSVTWGILCKCKRCK 393
>gi|312085936|ref|XP_003144878.1| MYND finger family protein [Loa loa]
Length = 535
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 37/161 (22%), Positives = 65/161 (40%), Gaps = 13/161 (8%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLT 96
+M + +H + K P + LL F+ ++E+ ++ +N H ++
Sbjct: 173 IMDIWSHTDLIKQDPGAMKKFNGIYADLLAFYGSKAMVGKDEVFELHCRDYINRH--AIS 230
Query: 97 EPSYIAIFDRASFIE-----HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC---YTD 148
+ YI + +++ H+C PN + D LR + +LS Y D
Sbjct: 231 DCGYIEEIGKGLYLDLCAYDHSCRPNTIYT-CDGFVATLRGLTASVDLRNLSSAHYSYID 289
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T RR L+ + YF C C RC DP + + CP
Sbjct: 290 LINTTQQRRKLLKDTWYFECHCTRCSDPDD--ALLSSILCP 328
>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 25/106 (23%), Positives = 50/106 (47%), Gaps = 8/106 (7%)
Query: 74 FTQEEILKICGIIQVNAHEMPLTE-----PSYIAIFDRASFIEHNCYPNLYKSFTDSGQV 128
+ + E+ ++ + NAH + + P IA++ + S H+C P+ F + G
Sbjct: 158 YGEAELTQLLCSVNCNAHTLYANDRSSLVPVGIAVYLQGSAFNHSCVPS--AEFCNVGTS 215
Query: 129 L-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
L +R+++ + G+ +++ Y RR L+ F+C C RC
Sbjct: 216 LTVRSLRRVQAGEEITVSYVPTTMNLKERRRCLEGQYKFVCACARC 261
>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
B]
Length = 410
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 46/108 (42%), Gaps = 10/108 (9%)
Query: 74 FTQEEILKICGIIQVNAH---EMPLTEPS---YIAIFDRASFIEHNCYPNLYKSFTDSGQ 127
T E + G+ N+ E+P+ E S A++ R SF H+C PN K D
Sbjct: 290 ITIENVRTALGVDPGNSFGIWEVPIMEESECLGFAVYPRLSFFNHDCAPNARKE-RDGRA 348
Query: 128 VLLRAMKPIAPGDHLSICY--TDPLWGTINRRHHLQTSKYFICQCERC 173
+ + +A GD L I Y D + RR L +F C C RC
Sbjct: 349 LRFVTTREVAEGDELCISYGHVDGM-AWRERRKELSDGWFFDCACGRC 395
>gi|159471331|ref|XP_001693810.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283313|gb|EDP09064.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 484
Score = 44.3 bits (103), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 99 SYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTD-PLWGTINRR 157
+I ++ + + H+C PN ++ G +++RA+ PI+ G+ ++ICY P N+R
Sbjct: 204 GHIGLWPEFALLNHSCAPNTV-NYVVGGSMVVRAVAPISQGEEVTICYLGRPQLLPFNKR 262
Query: 158 HHLQTSKY-FICQCERC 173
+ Y F C C RC
Sbjct: 263 IAILADDYGFECSCPRC 279
>gi|393905719|gb|EFO19192.2| MYND finger family protein [Loa loa]
Length = 545
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/161 (22%), Positives = 65/161 (40%), Gaps = 13/161 (8%)
Query: 37 LMQLEAHVEEYKNSPKYENDRRNVVQFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPLT 96
+M + +H + K P + LL F+ ++E+ ++ +N H ++
Sbjct: 173 IMDIWSHTDLIKQDPGAMKKFNGIYADLLAFYGSKAMVGKDEVFELHCRDYINRH--AIS 230
Query: 97 EPSYIAIFDRASFIE-----HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSIC---YTD 148
+ YI + +++ H+C PN + D LR + +LS Y D
Sbjct: 231 DCGYIEEIGKGLYLDLCAYDHSCRPNTIYT-CDGFVATLRGLTASVDLRNLSSAHYSYID 289
Query: 149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP 189
+ T RR L+ + YF C C RC DP + + CP
Sbjct: 290 LINTTQQRRKLLKDTWYFECHCTRCSDPDD--ALLSSILCP 328
>gi|294887209|ref|XP_002772010.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239875918|gb|EER03826.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 74
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 112 HNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCE 171
H+C PN SF S + ++++KP+ G+ + Y D + R+ L+ + F C+C
Sbjct: 3 HSCSPNAEASFRGSRCLRVKSLKPVHSGEEVFQSYIDENLPLVERQSKLRQAYGFACRCG 62
Query: 172 RCRDP--TELN 180
RCR ELN
Sbjct: 63 RCRTEALAELN 73
>gi|407394360|gb|EKF26913.1| hypothetical protein MOQ_009381 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/144 (25%), Positives = 60/144 (41%), Gaps = 27/144 (18%)
Query: 51 PKYEND-----------RRNVVQFLLNFFKLNEEFTQEEILKICG-IIQVNAHEMPLTEP 98
P+ END R + ++ L LN + I G IIQ + E + P
Sbjct: 239 PRSENDAWALEIVDALLRHDTLETFLGQMSLNSQ-------AINGFIIQCPSGEKAFSAP 291
Query: 99 SYI-------AIFDRASFIEHNCYPNLYKSFTDSG-QVLLRAMKPIAPGDHLSICYTDPL 150
+ A++ S H+C PN+ S D + LR ++P+ G+ L+I Y
Sbjct: 292 PLVEWVLKGGAVYTLQSVFNHSCDPNVTVSNVDGTHDITLRTLRPVKCGEELTITYIPLE 351
Query: 151 WGTINRRHHLQTSKYFICQCERCR 174
T +R+ +F C+C RC+
Sbjct: 352 NTTPEQRNEKLKGYFFTCRCLRCQ 375
>gi|402218653|gb|EJT98729.1| hypothetical protein DACRYDRAFT_17900 [Dacryopinax sp. DJM-731 SS1]
Length = 379
Score = 44.3 bits (103), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 4/66 (6%)
Query: 113 NCYPN---LYKSFTDSGQVL-LRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFIC 168
N PN +Y+ G V+ LRA++PIA G+ ++I Y DP RR L+ + F+C
Sbjct: 119 NSRPNCVVVYEIVEQKGLVMELRAVRPIAKGEEITISYVDPAIWLGARRSLLKMNYDFLC 178
Query: 169 QCERCR 174
C +C+
Sbjct: 179 DCPKCQ 184
>gi|302838107|ref|XP_002950612.1| SET domain-containing protein [Volvox carteri f. nagariensis]
gi|300264161|gb|EFJ48358.1| SET domain-containing protein [Volvox carteri f. nagariensis]
Length = 838
Score = 44.3 bits (103), Expect = 0.025, Method: Composition-based stats.
Identities = 33/99 (33%), Positives = 46/99 (46%), Gaps = 12/99 (12%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHH- 159
+A++ RA+ + H+C PNL F V R+ + + PG L+I Y P G + R
Sbjct: 615 LALYPRAALLNHSCVPNLGLRFLGLRLV-ARSCREVLPGQPLTISY-GPQQGKMPRASRV 672
Query: 160 --LQTSKYFICQCERC-RDP------TELNTFYDGVKCP 189
LQ F C C+ C DP EL G+KCP
Sbjct: 673 AALQAQYAFSCGCDACATDPRVTGHTAELEAALWGLKCP 711
>gi|302408415|ref|XP_003002042.1| Mcg1p [Verticillium albo-atrum VaMs.102]
gi|261358963|gb|EEY21391.1| Mcg1p [Verticillium albo-atrum VaMs.102]
Length = 103
Score = 44.3 bits (103), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 45/98 (45%), Gaps = 9/98 (9%)
Query: 84 GIIQVNAHEMPLTEPS------YIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAM--KP 135
GI+ NA ++ + + + +F S + H+C PN ++T ++ RA K
Sbjct: 7 GIVATNAFQVDVGGDAAGGHHHHFGVFPAISRLNHDCGPNA-AAWTGKTDLVHRAFASKD 65
Query: 136 IAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERC 173
IA G+ +SI Y D L RR + S F C C R
Sbjct: 66 IAAGEEISISYVDALAPRAERRARMTGSWGFECACRRA 103
>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
Length = 969
Score = 44.3 bits (103), Expect = 0.026, Method: Composition-based stats.
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 2/74 (2%)
Query: 101 IAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHL 160
+ ++ ASFI H+C PN + V++ A + I G+ +S Y D L + +R +
Sbjct: 755 VGLWTLASFINHSCIPNA-RRLHVGDYVIVHASRDIKTGEEISFAYFDVL-SPLEKRKEM 812
Query: 161 QTSKYFICQCERCR 174
S F C C RC+
Sbjct: 813 AESWGFCCGCSRCK 826
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.138 0.443
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,127,486,982
Number of Sequences: 23463169
Number of extensions: 123042963
Number of successful extensions: 248950
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 814
Number of HSP's successfully gapped in prelim test: 1248
Number of HSP's that attempted gapping in prelim test: 246786
Number of HSP's gapped (non-prelim): 2144
length of query: 190
length of database: 8,064,228,071
effective HSP length: 134
effective length of query: 56
effective length of database: 9,215,130,721
effective search space: 516047320376
effective search space used: 516047320376
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 72 (32.3 bits)