BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10008
(214 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 770
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/172 (37%), Positives = 93/172 (54%), Gaps = 20/172 (11%)
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
+ +P +A G T+ S + M S KGR ++A D+ GD +FVE+P A V+LP +
Sbjct: 225 SQVPDVAYGAHGTVPSCSMAVDMLYSTEKGRFLVANRDLQPGDAIFVERPYASVLLPGHT 284
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
+NC HC + +PC +C YCS +C +++W YHRWEC G L LL +VGIAHLA+
Sbjct: 285 KTNCQHCHKRLLNAVPCAQCNQVRYCSFACAKDSWNSYHRWEC-GNLNLLYSVGIAHLAV 343
Query: 181 KLILVSSHS-------------------DRYKEVYHLETHLQDMRPEDLYQY 213
+++LV+ S Y V+ L TH + M EDL QY
Sbjct: 344 RVLLVTGLSGLADFCRHLAEGKVDVDKNGGYGSVHELVTHSEKMHVEDLLQY 395
>gi|332021392|gb|EGI61760.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 749
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 101/173 (58%), Gaps = 6/173 (3%)
Query: 47 MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
M+ ++ S++ ++ SLP +A GE+ AS ++++ + KGR+V+A D+ +G LF
Sbjct: 198 MEENADSTTDTAEFPSLPDVAFGENPNFRFASAAVEIKCTPKKGRYVVANRDIKRGQILF 257
Query: 107 VEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
VEK AFV L NC +CC +S + P+PC EC+ ++YC+ +C EA YHRWEC G
Sbjct: 258 VEKAFAFVPLCHIKSDNCYNCCRSSGNTPVPCTECVDSIYCNITCWDEAHSSYHRWECLG 317
Query: 166 -ALRLLEAVGIAHLALKLILV----SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
+ L +GIA+LA++++ + +R EV L T+ + P D+ Y
Sbjct: 318 NQMGLWAEIGIAYLAVRMLFKCTTDVTDDNRLDEVQKLVTNFSKLPPGDVISY 370
>gi|345493155|ref|XP_001601354.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 737
Score = 119 bits (298), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 92/157 (58%), Gaps = 6/157 (3%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP-PCSM 121
LP A GE+ AS + ++ S KGRHV+A DV KG+ LFVEKP AFV+L S
Sbjct: 198 LPDPAFGENPDFPNASAALDLKFSVDKGRHVVANRDVQKGEVLFVEKPFAFVLLDNEYSD 257
Query: 122 SNCNHCCT-SISAPIPCNECILAVYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAHLA 179
+ C +C P+PC C VYC+E CR++AW YH+WEC G + + + +GIAHL
Sbjct: 258 AVCANCLKFRGDVPVPCKFCASTVYCTEQCRKKAWSTYHQWECFGNQIGIWDQIGIAHLT 317
Query: 180 LKLILVSSHSD---RYKEVYHLETHLQDMRPEDLYQY 213
++ L ++D ++ E+ L T++ + +D++ Y
Sbjct: 318 VRTFLNCCYTDDTKKFNEIQRLVTNIDKIATQDMFVY 354
>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Metaseiulus occidentalis]
Length = 775
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 82/150 (54%), Gaps = 14/150 (9%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS+ + S KGR ++A DV GD LFVE P A V+LP + ++C+HC I A P
Sbjct: 244 ASSALDCLYSTEKGRFIVANRDVRPGDNLFVESPYASVLLPAFTKTHCHHCYRRIKAAFP 303
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD------ 190
C +C YCS SC E+W YH EC G L LL +VGIAHLA ++ILV+
Sbjct: 304 CRQCAQVRYCSMSCSGESWNLYHSQEC-GNLDLLISVGIAHLAERVILVTGLGLIKDFMK 362
Query: 191 -------RYKEVYHLETHLQDMRPEDLYQY 213
Y VY L TH +DM DL+QY
Sbjct: 363 SCNTLECSYLPVYQLVTHEEDMHIADLFQY 392
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 98/165 (59%), Gaps = 6/165 (3%)
Query: 55 SASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV 114
+ S S L L GE+ AS I+++ + KGR+V+A ++ KG+TLFVE FV
Sbjct: 28 TESPSQPKLKRLKSGENPNFRSASASIEVKHAPGKGRYVVANRNIKKGETLFVEDAFTFV 87
Query: 115 ILPPC-SMSNCNHCCTS-ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA-LRLLE 171
++ ++C++CC S + P+PC EC+ ++C+ C EA L YHRW C + + LL
Sbjct: 88 LVNNNKDNTHCHNCCKSYLDVPVPCTECVDTLFCNMKCWDEACLSYHRWLCLASQMGLLR 147
Query: 172 AVGIAHLALKLILV---SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
+GI HLALKL+ V +++++++ +V L T+ + PED + Y
Sbjct: 148 QIGITHLALKLLCVCATTTNNEKFNDVQQLVTNFNKLTPEDAFSY 192
>gi|328698846|ref|XP_001948827.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 742
Score = 113 bits (282), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 85/143 (59%), Gaps = 2/143 (1%)
Query: 73 TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
TL S K+++ S AKGR+ +A + G+ L E P AFV+LP S C +CC +
Sbjct: 240 TLPFRSPKMELCYSEAKGRYFVASTKIEYGELLIFENPFAFVLLPEYYNSFCYNCCVPLK 299
Query: 133 A-PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD- 190
IPC+ C ++C + C +EA YHRWEC + + +GIAHLAL+L + +S ++
Sbjct: 300 YYSIPCDNCCTILFCGDKCLQEARNSYHRWECKQGTSIFKCIGIAHLALRLTIETSQANS 359
Query: 191 RYKEVYHLETHLQDMRPEDLYQY 213
++Y+L TH+ D++ +LYQY
Sbjct: 360 NNDQIYNLLTHIDDLKSLELYQY 382
>gi|340720588|ref|XP_003398716.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 735
Score = 110 bits (275), Expect = 4e-22, Method: Composition-based stats.
Identities = 77/219 (35%), Positives = 111/219 (50%), Gaps = 11/219 (5%)
Query: 3 KQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQ--VSSTSSSASSSV 60
+Q +ETLS + D K+ +IEK N A S Q ++ S + +
Sbjct: 152 QQLAKETLSKIQTMIHDSDYIAPSMKD---SIEKKINNITVAESFQQNIAQNSVQDTDDL 208
Query: 61 ASLPSLACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
L S E +T AS I S GRHVIA + + KGD LF+EKPV+FV+L
Sbjct: 209 LKLKSHIMFEENTNFPNASVAIDRIYSQELGRHVIANKSIKKGDILFLEKPVSFVLLSHD 268
Query: 120 SMSNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAH 177
+ + C HC C++ P+PC C+ YC+E C +AW YH WEC G + L + +GI H
Sbjct: 269 TYNLCPHCNCSNTDIPVPCTTCLNTFYCNEYCLNKAWSLYHCWECPGNQMELWKEIGIGH 328
Query: 178 LALKLILVSS---HSDRYKEVYHLETHLQDMRPEDLYQY 213
LALK++L S ++ E+ +L T+ + DL Y
Sbjct: 329 LALKVLLTCSTMTDETKFNEIQNLVTNFDKLSMGDLTVY 367
>gi|291242073|ref|XP_002740933.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 669
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 98/199 (49%), Gaps = 32/199 (16%)
Query: 47 MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
M S+ S + P L+ G + + C S+ + +++S KGR+ +A DV GD L
Sbjct: 210 MMESTESVGLQKKLKIPPKLSYGSNLHVPCMSSAVTIESSPDKGRYAVATRDVKVGDVLI 269
Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA 166
VE P + V L PC++S+C+HC + A IPC +C VYCS+ CR +W YH ECH
Sbjct: 270 VENPYSSVGLQPCNVSHCHHCYIRVLASIPCLQCAGIVYCSKECRNASWEMYHNLECH-H 328
Query: 167 LRLLE--AVGIAHLALKLILVSS-----------------------------HSDRYKEV 195
L L++ +G+ HLAL+ I+ + + Y V
Sbjct: 329 LDLIQELGLGMGHLALRTIIRTGLAFLLKFREQSANVNIPDESFHGCTVDGEYESNYYSV 388
Query: 196 YHLETHLQDMRPEDLYQYV 214
Y+L H +D +P DL++ V
Sbjct: 389 YNLVGHSEDRKPGDLFKRV 407
>gi|350401158|ref|XP_003486068.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 662
Score = 108 bits (271), Expect = 1e-21, Method: Composition-based stats.
Identities = 55/129 (42%), Positives = 82/129 (63%), Gaps = 5/129 (3%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
GRHVIA + + KGD LF+EKPV+FV+L + + C HC C + P+PC C+ YC+E
Sbjct: 159 GRHVIANKSIKKGDILFLEKPVSFVVLSHDTYNRCPHCNCQNADIPVPCTTCLNTFYCNE 218
Query: 149 SCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV-SSHSD--RYKEVYHLETHLQD 204
C +AW YH WEC G ++L E +GI HLALK++L S+ +D ++ E+ +L T+
Sbjct: 219 YCSTKAWSLYHCWECPGNQMKLWEEIGIGHLALKVLLTCSTMTDEIKFNEIQNLVTNFDK 278
Query: 205 MRPEDLYQY 213
+ ++L Y
Sbjct: 279 LSMDELTVY 287
>gi|449672885|ref|XP_002160254.2| PREDICTED: SET and MYND domain-containing protein 4-like [Hydra
magnipapillata]
Length = 729
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 107/233 (45%), Gaps = 43/233 (18%)
Query: 23 SHLDFKE--DVLTIEKSEE-NSVEAMSMQVSSTS---SSASSSVASLPSL--ACGESHTL 74
S L FK+ VL E SEE N VE + V + A + V L S G L
Sbjct: 157 SQLSFKKCLTVLPTEFSEEKNQVEMLLSSVCKFDLAHTVAVNEVVELDSFIPVNGFDKIL 216
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
S+ I++ + +GR++ A ED+ G + EKP A V+LP ++C C + +
Sbjct: 217 TRGSSSIEVGYDSKQGRYIFAKEDIPNGSIIISEKPYAAVLLPHWYKTHCQLCFDKVVSL 276
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD---- 190
PC EC V+CS SC +AW YHR+EC L L+E VGIAHL+L+++LVS D
Sbjct: 277 FPCYECAEVVFCSLSCYNDAWATYHRFECK-KLSLMEKVGIAHLSLRIVLVSDAKDLLRF 335
Query: 191 ------------------------------RYKEVYHLETHLQDMRPEDLYQY 213
Y+ VY L TH + EDL+QY
Sbjct: 336 LGSDLNKFTDSPTLPSSKIEGCNDQGIYQGDYESVYFLSTHSDRLPIEDLFQY 388
>gi|321476199|gb|EFX87160.1| hypothetical protein DAPPUDRAFT_312722 [Daphnia pulex]
Length = 732
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 93/171 (54%), Gaps = 27/171 (15%)
Query: 68 CGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
CGE+ + AS+ + + S KGR+V+A D+ G+TLFVE+P A V+LP S C+HC
Sbjct: 219 CGENSSFKGASSALNISHSKEKGRYVVANRDIKAGETLFVEQPNALVVLPDFQTSRCHHC 278
Query: 128 CTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL-- 184
SA PC C +CS+SCR+E+ YH +EC G +L +VGIAHL +++L
Sbjct: 279 TRHSSAKRYPCLACGKIWFCSDSCRQESSC-YHNFEC-GLEAVLNSVGIAHLGARIVLSH 336
Query: 185 -------VSSHSDRYKEV---------------YHLETHLQDMRPEDLYQY 213
+D+ K+V +HL +H + M PE+LYQY
Sbjct: 337 GLDSVLAFLKDTDKVKKVPGIDGPYDTKSYQVMFHLVSHTERMAPEELYQY 387
>gi|322800492|gb|EFZ21496.1| hypothetical protein SINV_13685 [Solenopsis invicta]
Length = 677
Score = 107 bits (268), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 96/172 (55%), Gaps = 5/172 (2%)
Query: 47 MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
M+ + +++ ++ S P +A GE+ AS I+++ + KGR+V+A D+ +G L
Sbjct: 243 MEDDADNTTDTTEFPSQPEVAYGENPNFRFASAAIEVKYTAEKGRYVVANRDIKRGQVLI 302
Query: 107 VEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
VEK AFV L + C +CC +S P+PC EC+ + YC+ C EA YHRWEC G
Sbjct: 303 VEKAFAFVPLSHVKVDICYNCCRSSGDTPVPCTECVDSTYCNIDCWDEARSSYHRWECLG 362
Query: 166 -ALRLLEAVGIAHLALKLIL---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
+ L +GI++LA++++ ++ +R EV L T+ + P D+ Y
Sbjct: 363 NQMGLWPQIGISYLAIRMLFKCTTATDDNRLNEVQELVTNFSKLPPCDVISY 414
>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 628
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 94/175 (53%), Gaps = 9/175 (5%)
Query: 48 QVSSTSSSASSSVAS---LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDT 104
Q+ TS + S LP + GE+ L AS + ++ + KGRH++A D+ +
Sbjct: 79 QMVDTSDDVTGVTESHRILPEILFGENPNLPFASTALDIKYTPQKGRHMVANRDIKRSQI 138
Query: 105 LFVEKPVAFVILPPCSMSN-CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWE 162
LFVEKP +FV L S+ C +CC S + PC+ CI V+C +C EA+ +HRWE
Sbjct: 139 LFVEKPFSFVTLECDGASDLCENCCRSCGKNLTPCSGCIDTVFCDVNCWNEAYSSHHRWE 198
Query: 163 CHG-ALRLLEAVGIAHLALKLIL---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
C G +RL + +G+AHL LK +L +++ + + V L T ++ DL Y
Sbjct: 199 CVGNQMRLWQQIGVAHLGLKTLLKCTMTTDNSMFNRVQQLVTGFDNLSANDLIVY 253
>gi|383849380|ref|XP_003700323.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 691
Score = 103 bits (257), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 88/147 (59%), Gaps = 7/147 (4%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHC- 127
E+ AS I ++ + GRHV+A + ++KG+ LFVEKP++ V++P S+S+ C++C
Sbjct: 174 ENKNFPNASASIDIKYNEEVGRHVVANKFINKGEILFVEKPISIVLIPNESISDRCHNCN 233
Query: 128 CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA-LRLLEAVGIAHLALKLIL-V 185
C PIPC C L YC E C EAW YH WEC G+ +RL VGI HLA K++
Sbjct: 234 CFIGDIPIPCKTC-LYTYCGEKCLNEAWSLYHCWECPGSQMRLWGQVGITHLASKVLFNC 292
Query: 186 SSHSD--RYKEVYHLETHLQDMRPEDL 210
S+ SD R+ ++ +L +H + DL
Sbjct: 293 STMSDKIRFNQLQNLVSHFDKIPDADL 319
>gi|328785590|ref|XP_003250620.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Apis mellifera]
Length = 444
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 81/129 (62%), Gaps = 5/129 (3%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSE 148
GRHVIA + + +GD LF+E+P++FV+L + S C +C + P+PC C+ YC+E
Sbjct: 155 GRHVIANKFIKEGDILFLEEPISFVLLNHDTYSYCQYCNNLNTDIPVPCRTCLNTFYCNE 214
Query: 149 SCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV-SSHSD--RYKEVYHLETHLQD 204
+C +AW YH WEC G + L + +GI HLALK++L S+ +D ++ E+ +L T+
Sbjct: 215 NCLTKAWSSYHCWECPGNQMNLWKEIGIGHLALKVLLTCSTITDKIKFNEMQNLITNFDK 274
Query: 205 MRPEDLYQY 213
+ +DL Y
Sbjct: 275 LSIDDLTIY 283
>gi|156356358|ref|XP_001623892.1| predicted protein [Nematostella vectensis]
gi|156210632|gb|EDO31792.1| predicted protein [Nematostella vectensis]
Length = 634
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 106/219 (48%), Gaps = 20/219 (9%)
Query: 11 SSSLK-LCRLQDTSHLDFKEDVLTIE------KSEENSVEAMSMQVSSTSSSASS----- 58
+ SLK LCR++++ + F+E +++ + K E+ V+ + Q+S S+ +
Sbjct: 118 ARSLKALCRIEESKKV-FEEAIVSSKTSKLKRKDRESWVKQVEKQISECSNQTENLMKSV 176
Query: 59 -------SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
S S P G + S + ++ + KGRH IA D++ GD L VEKP
Sbjct: 177 THLEEARSGFSTPVQNYGLHEKFVSISKALDIKYTEEKGRHTIAARDINIGDVLLVEKPF 236
Query: 112 AFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
A V+L S S+C+ C I AP+PC+ C YCSE C +E+W YH EC +
Sbjct: 237 ASVLLQEQSKSHCHQCFVHILAPLPCSYCTTVRYCSEKCAKESWDAYHYAECMNLEHVYV 296
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHLETHLQDMRPEDL 210
A HLAL++++ + V E+ + P +L
Sbjct: 297 AGKYGHLALRVVVKAGFQYLKASVKQFESEEKKCDPAEL 335
>gi|270004498|gb|EFA00946.1| hypothetical protein TcasGA2_TC003856 [Tribolium castaneum]
Length = 713
Score = 97.4 bits (241), Expect = 3e-18, Method: Composition-based stats.
Identities = 52/139 (37%), Positives = 75/139 (53%), Gaps = 9/139 (6%)
Query: 53 SSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA 112
+ S + +P GE+ L AS K+KM + KGRH+IA ++ +GD LF+EK A
Sbjct: 186 TDSGQECLHEIPVFPDGENKCLAYASTKVKMSYNKDKGRHIIASTNIQRGDLLFIEK--A 243
Query: 113 FVILPPCSMSN------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG- 165
FV P + C +C + I + +PCN C + VYC E CR + W + HRWEC G
Sbjct: 244 FVFAPVFKENKEFYSFKCYNCLSDIVSGVPCNTCTVCVYCDEKCRDQNWTKCHRWECDGM 303
Query: 166 ALRLLEAVGIAHLALKLIL 184
+ +GIA A K++L
Sbjct: 304 QANIWFDLGIALPAFKVML 322
>gi|345483949|ref|XP_003424915.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4-like [Nasonia vitripennis]
Length = 633
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 88/184 (47%), Gaps = 37/184 (20%)
Query: 62 SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
+LP + E + AS+ I ++ +N GRH++A D+ G+ L +EKP A V+ P
Sbjct: 204 TLPEIIRNE--IIPSASDAISIKYTNVFGRHLVASRDIDVGEILVIEKPYASVLSSPNIY 261
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLEAVGIAHLA 179
++C+ C IPC++CI A+YCSE CR EAW +YH EC G L L ++ L+
Sbjct: 262 THCSQCFKRTWDSIPCDKCIYAMYCSEKCRSEAWQQYHDIECSIKGYLIGLRMNDLSGLS 321
Query: 180 LKLILV---------------------------------SSHSDRYKEVYHLETHLQDMR 206
LKL ++ + HSD Y+ Y L TH Q+ +
Sbjct: 322 LKLAILAVRESGSIRKLRKKIRKIDDCEDYLMKSISEDGTYHSDEYRPFYSLVTHEQNRK 381
Query: 207 PEDL 210
DL
Sbjct: 382 RYDL 385
>gi|242007370|ref|XP_002424514.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
gi|212507932|gb|EEB11776.1| set and mynd domain containing protein, putative [Pediculus humanus
corporis]
Length = 646
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 57/176 (32%), Positives = 90/176 (51%), Gaps = 30/176 (17%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LP G S+T AS+ I + + GRHV A+ D+ GD LFVEKP A+++
Sbjct: 143 LPVPVGGVSNTFPSASSYINVFYNETLGRHVKALHDIEVGDVLFVEKPFAYILFHEV--- 199
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
C++C +PIPC++C+ +YCS C+++ H +EC G + +++GIAHL+L+L
Sbjct: 200 -CHNCAKQTISPIPCDKCLDVIYCSVECKKKFEKSVHSFEC-GKGKFFKSIGIAHLSLRL 257
Query: 183 ILVSS-------------------------HSDRYKEVYHLETHLQDMRPEDLYQY 213
IL +S ++ Y VY L T+L+ + L+QY
Sbjct: 258 ILAASSNYKELKKLISSYTETKYFDGIVPCENNSYHNVYGLITNLEKYEGKHLFQY 313
>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
magnipapillata]
Length = 1044
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 66/108 (61%), Gaps = 1/108 (0%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
S +K+ ++N+KGRH IA D+ G+ + +EKP A + LP C ++C HC T P
Sbjct: 226 TSEALKIVSTNSKGRHAIASRDIKAGEVIIIEKPFASLCLPECYNTHCYHCLTRFKINYP 285
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLI 183
C C YCS SC +E+W ++H +EC + + + + VG+AHLA K+I
Sbjct: 286 CRLCSTVNYCSISCEKESWEKFHCFECEYLGVLINDDVGLAHLAFKII 333
>gi|357609801|gb|EHJ66685.1| hypothetical protein KGM_08794 [Danaus plexippus]
Length = 776
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 74/128 (57%), Gaps = 10/128 (7%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LP G + + AS I+++ ++GRHV+AVE +GD LF E+P A+V LP +
Sbjct: 223 LPDCYLGPNPDFVSASKSIELRYKESRGRHVVAVEPAGRGDVLFSEEPYAWVALPS-DDA 281
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL-RLLEAVGIAHLALK 181
C CC + P+P VYC C A + +HRWEC GA L +GIAHLAL+
Sbjct: 282 ICEMCCDTDINPVP-------VYCGCECASRA-ISFHRWECVGAQCSLFPTIGIAHLALR 333
Query: 182 LILVSSHS 189
++L+S+++
Sbjct: 334 VLLISTNN 341
>gi|156367132|ref|XP_001627273.1| predicted protein [Nematostella vectensis]
gi|156214178|gb|EDO35173.1| predicted protein [Nematostella vectensis]
Length = 750
Score = 90.9 bits (224), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 87/165 (52%), Gaps = 10/165 (6%)
Query: 27 FKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTS 86
++ +L ++KS S + +S SA S P +A G S+ +K+
Sbjct: 115 IRDKLLKLQKSCLRKQARHSQNKNGSSGSACLSGGRHPKMANG--------SSLLKINYD 166
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
+GR + A ++ GDTL E+P + V+LP + ++C C S+ AP+PCN C +YC
Sbjct: 167 QNQGRFLQASSEIRAGDTLIAEEPYSAVLLPENAKTHCECCYKSLVAPVPCNHCSSVLYC 226
Query: 147 SESCRREAWLRYHRWECHGALRLLEAVG-IAHLALKLILVSSHSD 190
S +CR +AW +YH EC +LE V HL+L+++L +S D
Sbjct: 227 SAACRNKAWSQYHHVECE-IFPVLEIVDTFTHLSLRILLTTSAKD 270
>gi|270016446|gb|EFA12892.1| hypothetical protein TcasGA2_TC004406 [Tribolium castaneum]
Length = 1108
Score = 90.5 bits (223), Expect = 4e-16, Method: Composition-based stats.
Identities = 52/176 (29%), Positives = 95/176 (53%), Gaps = 21/176 (11%)
Query: 54 SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
++A V + P+ ++ ++ CA N I++ + GRHV+A ++ G+ L +EKP A
Sbjct: 140 NNAYDPVPTFPAFE--KNASIQCARNCIEIMQNERYGRHVVATRNIEMGEILAIEKPYAS 197
Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLE 171
++ S+ C+ C IPC++C A+YCS++C+ +A+ YH++EC H +L L
Sbjct: 198 IVTDSVSVY-CHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKYECPIHLSLDPLL 256
Query: 172 AVGIAHLALKLILVSSH----------------SDRYKEVYHLETHLQDMRPEDLY 211
LAL++ L+S + SDR+KEV++L+ +++ DL+
Sbjct: 257 IDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDLF 312
>gi|157169545|ref|XP_001657892.1| hypothetical protein AaeL_AAEL001050 [Aedes aegypti]
gi|108883672|gb|EAT47897.1| AAEL001050-PA [Aedes aegypti]
Length = 662
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 86/172 (50%), Gaps = 7/172 (4%)
Query: 30 DVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTS--- 86
D +++ E+ + +++ + LP L+ G + LM AS KI + S
Sbjct: 210 DDYRLQRLREDLEDLRTLKAEDAAMKPKEPEPVLPELSGGVNEELMGASKKISLSGSLED 269
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
+ +GR+++A ED+ G+T+ VE + P ++CN C T + A +PC +C +C
Sbjct: 270 SNRGRYIVAAEDLKSGETVLVEPAYGACLYPKYFGTHCNQCFTRLVASVPCEQCCGVAFC 329
Query: 147 SESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRYKEV 195
S CR A YHR+EC L L+ G++ HLA +L+ + ++ E+
Sbjct: 330 SPECRDRACAGYHRFECQ-YLDLMIGSGMSILCHLAFRLVTQAGSPEKAAEI 380
>gi|321479127|gb|EFX90083.1| hypothetical protein DAPPUDRAFT_309882 [Daphnia pulex]
Length = 732
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 76/140 (54%), Gaps = 4/140 (2%)
Query: 54 SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
S+A+++ +LP L G + + AS + M+ + GR+ +A +D+ G TL EKP A
Sbjct: 264 STANANTNALPELVSGANSNIPNASKSLAMEVDDNSGRYYVAADDIKPGQTLVCEKPYAA 323
Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
+LP S+C+HC + AP+ C C YCS CR EA YH++EC G + + A
Sbjct: 324 CLLPGKFTSHCHHCFVRLIAPLGCLTCRGVFYCSVECRDEAASTYHQYEC-GIIDYMIAS 382
Query: 174 G---IAHLALKLILVSSHSD 190
G ++ +AL+++ D
Sbjct: 383 GSSILSWIALRILTKGKMED 402
>gi|189242482|ref|XP_001810915.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
Length = 538
Score = 87.4 bits (215), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/176 (29%), Positives = 95/176 (53%), Gaps = 21/176 (11%)
Query: 54 SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
++A V + P+ ++ ++ CA N I++ + GRHV+A ++ G+ L +EKP A
Sbjct: 63 NNAYDPVPTFPAFE--KNASIQCARNCIEIMQNERYGRHVVATRNIEMGEILAIEKPYAS 120
Query: 114 VILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLE 171
++ S+ C+ C IPC++C A+YCS++C+ +A+ YH++EC H +L L
Sbjct: 121 IVTDSVSVY-CHECLKLCYNMIPCDKCTKALYCSDNCKDKAYESYHKYECPIHLSLDPLL 179
Query: 172 AVGIAHLALKLILVSSH----------------SDRYKEVYHLETHLQDMRPEDLY 211
LAL++ L+S + SDR+KEV++L+ +++ DL+
Sbjct: 180 IDSSKRLALRIALISRNEWAGSLLNESPDEMYCSDRFKEVFNLDQNVRQRFTHDLF 235
>gi|345483845|ref|XP_003424896.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
[Nasonia vitripennis]
gi|345483847|ref|XP_001604625.2| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
[Nasonia vitripennis]
Length = 632
Score = 87.4 bits (215), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 79/155 (50%), Gaps = 3/155 (1%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESH-TLMCASNKIKMQTSNAKGRHV 93
+K++E + + T ++S SL + H + CAS+ I ++ A GRH
Sbjct: 171 DKNKEQLEQKIQTATQITELPKVEDISSKKSLLKFKRHERISCASDAIDLKYDEANGRHT 230
Query: 94 IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRRE 153
+A D++ G+ L EKP A ++ P ++C+HC IPC CI A+YCS CR +
Sbjct: 231 VANRDINVGEILVFEKPYALLLKPERIYTHCSHCFIRAWDSIPCPNCIHAMYCSTKCRDK 290
Query: 154 AWLRYHRWEC--HGALRLLEAVGIAHLALKLILVS 186
AW +YH EC G L +A +LKL +++
Sbjct: 291 AWEQYHDIECPIKGYFLSLTMNDLAPFSLKLAVLA 325
>gi|195376269|ref|XP_002046919.1| GJ12224 [Drosophila virilis]
gi|194154077|gb|EDW69261.1| GJ12224 [Drosophila virilis]
Length = 661
Score = 87.0 bits (214), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 8/155 (5%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P+LA GE+ L A+N +K+ + KGR V+A E + GD + E PVA +LP +N
Sbjct: 240 PALAHGENPALSGAANVVKLVETKEKGRFVVANEGLKTGDVVLCENPVAACLLPNFFGTN 299
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL---AL 180
C+HC + P+PC C +CS C EA YHR+EC + L G++ L AL
Sbjct: 300 CHHCFKRLHIPVPCLHCSGIAFCSAQCMGEACETYHRFECQ-FMDLFIGSGMSILCFVAL 358
Query: 181 KLIL-VSSHSDRYKEVYHLETHL---QDMRPEDLY 211
++ +S D L HL QD+R D Y
Sbjct: 359 RVFTQAASLEDGLTTANLLFEHLCSHQDVREADDY 393
>gi|195012821|ref|XP_001983754.1| GH16068 [Drosophila grimshawi]
gi|193897236|gb|EDV96102.1| GH16068 [Drosophila grimshawi]
Length = 662
Score = 85.5 bits (210), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 82/157 (52%), Gaps = 9/157 (5%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P+LA GE+ L A+N +K+ + KGR V+A E + GD + E PVA +LP +N
Sbjct: 241 PALAHGENPALSGAANVVKLVETKDKGRFVVANEGLKTGDVVLSENPVAACLLPNFFGTN 300
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
C+HC + P+ C C +CS C EA YHR+EC + L G++ ++AL
Sbjct: 301 CHHCFKRLHTPVACLHCSGIAFCSAQCMGEACETYHRFECQ-FMDLFIGSGMSILCYIAL 359
Query: 181 KLIL-VSSHSDRYKEVYHLETHL---QDMR-PEDLYQ 212
++ +S D + L HL QD+R P+D Q
Sbjct: 360 RIFTQAASLEDGLQTSNLLFEHLVSHQDVRQPDDYLQ 396
>gi|291227705|ref|XP_002733823.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 644
Score = 84.0 bits (206), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 65/121 (53%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
ES L AS +K+ KGR+++A E + +G+ + EKP V+LP + C HC
Sbjct: 55 ESKLLSNASKAVKLTNHGEKGRYILATETICRGEIIIKEKPYGCVLLPSHYNTRCYHCVR 114
Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
APIPC C YCS C++E+W YH EC L +A + L+L+++L + S
Sbjct: 115 KTVAPIPCCTCTHVRYCSVECQQESWKSYHYIECPLWPFLSQAGNFSQLSLRILLKAGWS 174
Query: 190 D 190
+
Sbjct: 175 N 175
>gi|117935365|gb|ABK56990.1| hypothetical protein GIP_L1_00040 [Glyptapanteles indiensis]
Length = 561
Score = 83.6 bits (205), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 71/124 (57%), Gaps = 3/124 (2%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LPS+ ++ + A++ + ++ S G+HV A D+ G+ L V + A V++ + +
Sbjct: 213 LPSIIV-DNEEICGATSALAVEFSEQYGKHVRATRDIKVGEVLSVNEGYATVLMLDKTYT 271
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLAL 180
+C HC + IPCN CI AV+C E CRREAW YH EC G + +E +A +AL
Sbjct: 272 HCAHCLKQTWSAIPCNFCIYAVFCGEDCRREAWKGYHEVECRVTGPMVAMEMNHMALMAL 331
Query: 181 KLIL 184
+L++
Sbjct: 332 RLLV 335
>gi|190702163|gb|ACE75061.1| conserved hypothetical protein [Glyptapanteles flavicoxis]
Length = 565
Score = 82.8 bits (203), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 85/162 (52%), Gaps = 5/162 (3%)
Query: 25 LDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ 84
+D KE + + + NS+ MS S + LPS+ ++ + A++ + ++
Sbjct: 177 MDGKESGVKLVEKTLNSLHLMSQ--SEAAFVKLDDKRHLPSIIV-DNEEICGATSALAVE 233
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
S G+HV A D+ G+ L V + A V++ + ++C HC + IPCN CI A+
Sbjct: 234 FSEQYGKHVRATRDIKVGEVLSVNEGYATVLMLDKTYTHCAHCLKQTWSAIPCNFCIYAL 293
Query: 145 YCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLIL 184
+C E CRREAW YH EC G + +E +A +AL+L++
Sbjct: 294 FCGEDCRREAWEGYHEVECRVTGPMVAMEMNHMALMALRLLV 335
>gi|194869827|ref|XP_001972529.1| GG13835 [Drosophila erecta]
gi|190654312|gb|EDV51555.1| GG13835 [Drosophila erecta]
Length = 665
Score = 82.8 bits (203), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 9/159 (5%)
Query: 59 SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
S +P LA GES L+ AS +++ + KGR V+A E + GD L E+PVA + P
Sbjct: 236 SAEQVPQLAHGESAELIGASKVVRLVETKDKGRFVVANEGLRAGDVLLCEEPVAACLEPT 295
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
++C+HC +S P+ C C +CS C EA YHR+EC + L+ G++ L
Sbjct: 296 YFGTHCHHCFKRLSTPVSCLHCSGVAFCSAQCMGEASSSYHRFECE-YMDLMIGSGMSIL 354
Query: 179 ALKLILVSSHSDRYKE--------VYHLETHLQDMRPED 209
+ V + + ++ HL +H ++ +P+D
Sbjct: 355 CFIALRVFTQAPSLEQGLATANLLFEHLCSHEEERQPDD 393
>gi|194883957|ref|XP_001976062.1| GG20202 [Drosophila erecta]
gi|190659249|gb|EDV56462.1| GG20202 [Drosophila erecta]
Length = 660
Score = 82.4 bits (202), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 79/160 (49%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV GD L VE+P V+L + ++C +C P+ C C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGDELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR+EA +YH++EC G + ++ G + H+AL++I
Sbjct: 301 ADVLYCSEQCRQEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 360 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399
>gi|443689223|gb|ELT91670.1| hypothetical protein CAPTEDRAFT_90747 [Capitella teleta]
Length = 673
Score = 82.0 bits (201), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 71/156 (45%), Gaps = 31/156 (19%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGRH +AV+++ GDT+ E P +V+ P C HC ++ P+ C +C+ YCSE
Sbjct: 247 KGRHAMAVQNIKVGDTVVKEDPYVWVLDPSQYGYRCYHCFKVLTYPVGCTQCMRVRYCSE 306
Query: 149 SCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSD--RYKE---------- 194
+CR AW YH+ EC A+ + + L L+ ILV+ S RYK
Sbjct: 307 TCRTSAWESYHKTECPYLPAMLMSRTGNLTSLVLRTILVTGMSKIIRYKNNPKSDQHDAQ 366
Query: 195 -----------------VYHLETHLQDMRPEDLYQY 213
+Y L TH + P +L QY
Sbjct: 367 FSLFTDSKGVYMGGFVGLYGLLTHTEHRSPSELLQY 402
>gi|332022516|gb|EGI62819.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 637
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 94/210 (44%), Gaps = 39/210 (18%)
Query: 42 VEAMSMQVSSTSSSASSSVAS--LPSLACGESHT--LMCASNKIKMQTSNAKGRHVIAVE 97
++ + +++ T + ++ V + P+L +S+ + CAS+ + ++ + GRH++A
Sbjct: 187 LDVLHNKITQTKVNKNTKVKTEETPALPIIKSYNNEVPCASDAVAIKYNERYGRHIVATR 246
Query: 98 DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLR 157
+H G+ + VEKP A +++ ++C++C A IPCN C A+YCSE CR W +
Sbjct: 247 KIHPGEIIAVEKPYALILVQENMQTHCSNCLRVCWANIPCNYCTYAMYCSEECRNIEWEK 306
Query: 158 YHRWEC--------------------HGALRLLEAVGIAHLALKLILVSSHSD------- 190
H EC L L EA GI L + V H D
Sbjct: 307 CHNIECVVFPAVLEYSIYNLDLCSIRLAVLALREAGGIEKLKTMVKEVDEHDDPRTKGFS 366
Query: 191 --------RYKEVYHLETHLQDMRPEDLYQ 212
RY VY L T+ + DL++
Sbjct: 367 QDGKLHSNRYIGVYSLVTNTEKRSVSDLFK 396
>gi|321477118|gb|EFX88077.1| hypothetical protein DAPPUDRAFT_305694 [Daphnia pulex]
Length = 651
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 80/154 (51%), Gaps = 12/154 (7%)
Query: 10 LSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACG 69
+S + K+ +L D ++ + + +S E++ ++ QV +A P L
Sbjct: 190 ISINSKIVKLGDDAN-----PTIGVSESNESAAASIERQVIKKRRTA-------PDLNRD 237
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
+ L AS I++ S+ +GR ++A ED+ G T+ VEK +A ++L S+C+HC
Sbjct: 238 SNALLPAASTSIELTDSDDRGRCLVATEDIQIGTTVIVEKALASILLEEFKESHCHHCLH 297
Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
P+PC++C +CS CR EAW YH+ EC
Sbjct: 298 WTPGPVPCHQCSQVGFCSTLCRDEAWASYHQSEC 331
>gi|195493827|ref|XP_002094580.1| GE20125 [Drosophila yakuba]
gi|194180681|gb|EDW94292.1| GE20125 [Drosophila yakuba]
Length = 667
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P LA GES L+ AS +++ + KGR V+A E + GD L E+PVA + P +
Sbjct: 242 VPQLAHGESAELIGASKVVRLVETKEKGRFVVANEGLRTGDVLLCEEPVAACLEPTYFGT 301
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
+C+HC + P+ C C +CS C EA YHR+EC + L+ G++ L
Sbjct: 302 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 360
Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
+ V + + ++ HL +H ++ +PED
Sbjct: 361 LRVFTQASSLEQGLATANLLFEHLCSHEEERQPED 395
>gi|340708900|ref|XP_003393055.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 673
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)
Query: 61 ASLPSLACGESHTLMCA-SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
AS+ + G+ HT++ + S+ +++Q +GRH+IA ++ G L VE P AF
Sbjct: 204 ASVIPIVDGKQHTILKSCSDAVELQFDEKRGRHLIATRNIKAGSVLIVETPFAFSTNKEA 263
Query: 120 SMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
NC HC ++ S IPC+ C +CSE CRREAW YH++EC
Sbjct: 264 LDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQMYHQYEC 311
>gi|21355139|ref|NP_648574.1| SET and MYND domain protein 4 [Drosophila melanogaster]
gi|7294583|gb|AAF49923.1| SET and MYND domain protein 4 [Drosophila melanogaster]
gi|17945748|gb|AAL48922.1| RE32936p [Drosophila melanogaster]
Length = 663
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P LA GES L+ AS +++ + KGR V+A E + GD L E+PVA + P +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLFEEPVAACLEPSYFGT 297
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
+C+HC + P+ C C +CS C EA YHR+EC + L+ G++ L
Sbjct: 298 HCHHCFKRLHTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356
Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
+ + + + ++ HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391
>gi|327283812|ref|XP_003226634.1| PREDICTED: SET and MYND domain-containing protein 4-like [Anolis
carolinensis]
Length = 683
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 62/121 (51%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
E+ + CAS +++ S KGRH IA D+ G+ L E+ V+ P +C+HC +
Sbjct: 174 ENGLISCASPSMRLAFSKDKGRHCIAQTDIVPGEILLKEEAFVAVLCPGEEDIHCHHCLS 233
Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
+ A +PC C A YC +C R AW YH+ EC LL H+AL+ I V+
Sbjct: 234 PLVASVPCRGCSYAKYCGSACARAAWQSYHQRECPFGGLLLAMGVFCHVALRTIFVAGFE 293
Query: 190 D 190
+
Sbjct: 294 E 294
>gi|242016107|ref|XP_002428677.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212513348|gb|EEB15939.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 697
Score = 81.6 bits (200), Expect = 2e-13, Method: Composition-based stats.
Identities = 51/159 (32%), Positives = 78/159 (49%), Gaps = 5/159 (3%)
Query: 29 EDVLTIEKSEENSVEAMS-MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSN 87
E ++ S N E M QV + + + LP ++ + ASN +K++
Sbjct: 212 EKIIDNLTSSNNDDETMKKQQVENLNFNVPYEGEPLPKVSGEPNEMYPNASNLLKIRQGT 271
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
GR A + GDT+ VE+P V+LP + S+C+HC T + APIPC C +CS
Sbjct: 272 DCGRFAAAANKIEIGDTIVVEEPQTSVLLPKMNSSHCHHCFTRLFAPIPCKTCSGVAFCS 331
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLI 183
CR A YHR+EC L G I++L+L+++
Sbjct: 332 VQCRDSASNTYHRYECQ-LLGFFMGSGMSLISYLSLRMV 369
>gi|350419310|ref|XP_003492139.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 673
Score = 81.6 bits (200), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 5/108 (4%)
Query: 61 ASLPSLACGESHTLMCA-SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
AS+ + G+ HT++ + S+ +++Q +GRH+IA ++ G L VE P AF
Sbjct: 204 ASVIPIVNGKQHTVLKSCSDAVELQFDEKRGRHLIATRNIKAGSVLIVETPFAFSTNKEA 263
Query: 120 SMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
NC HC ++ S IPC+ C +CSE CRREAW YH++EC
Sbjct: 264 LDRNCLHCHITLKSSESVKIPCHFCQTVSFCSEKCRREAWQMYHQYEC 311
>gi|383849683|ref|XP_003700474.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 686
Score = 81.3 bits (199), Expect = 2e-13, Method: Composition-based stats.
Identities = 49/160 (30%), Positives = 87/160 (54%), Gaps = 4/160 (2%)
Query: 33 TIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRH 92
++ ++E + + A ++ +S S LP LA GE+ L AS+ ++++ ++ G+
Sbjct: 182 SMRENENSLISAKELERRRGWTSVESK--PLPRLAAGENPRLPGASSLLEIEETDDAGKR 239
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
IA +++ GD L VE P+ V+LP ++C+HC + AP+ C +C +C CR
Sbjct: 240 AIAAKEIVPGDALVVEAPLGAVLLPDFFGTHCHHCFSRFVAPVGCPDCSGVAFCGIRCRD 299
Query: 153 EAWLRYHRWECHGALRLLEAVGIAHLA-LKLILVSSHSDR 191
A YH++EC L LL G++ L+ L L +V+ + R
Sbjct: 300 AAISSYHKYECK-ILALLIGSGMSVLSMLALRMVTQNGPR 338
>gi|195485733|ref|XP_002091210.1| GE12361 [Drosophila yakuba]
gi|194177311|gb|EDW90922.1| GE12361 [Drosophila yakuba]
Length = 660
Score = 81.3 bits (199), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 79/160 (49%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV G+ L VE+P V+L + ++C +C P+ C +C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPKC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA +YH++EC G + ++ G + H+AL++I
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D Y+ V LE H + +P + +Q+V
Sbjct: 360 IDEELTPEKLISLPKDDYRRVAQLERHQGERQPSNFFQHV 399
>gi|195589810|ref|XP_002084642.1| GD12724 [Drosophila simulans]
gi|194196651|gb|EDX10227.1| GD12724 [Drosophila simulans]
Length = 663
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P LA GES L+ AS +++ + KGR V+A E + GD L E+PVA + P +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVAACLEPSYFGT 297
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
+C+HC + P+ C C +CS C EA YHR+EC + L+ G++ L
Sbjct: 298 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356
Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
+ + + + ++ HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391
>gi|195327115|ref|XP_002030267.1| GM24661 [Drosophila sechellia]
gi|194119210|gb|EDW41253.1| GM24661 [Drosophila sechellia]
Length = 660
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P LA GES L+ AS +++ + KGR V+A E + GD L E+PVA + P +
Sbjct: 238 VPQLAHGESAELVGASKVVRLVETKDKGRFVVANEGLRTGDVLLCEEPVAACLEPTYFGT 297
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
+C+HC + P+ C C +CS C EA YHR+EC + L+ G++ L
Sbjct: 298 HCHHCFKRLRTPVSCLHCSGIAFCSAQCMGEACSSYHRFECE-YMDLMIGSGMSILCFIA 356
Query: 183 ILVSSHSDRYKE--------VYHLETHLQDMRPED 209
+ + + + ++ HL +H +D +P+D
Sbjct: 357 LRIFTQAPSLEQGLATANLLFEHLCSHEEDRQPDD 391
>gi|194747113|ref|XP_001955997.1| GF24983 [Drosophila ananassae]
gi|190623279|gb|EDV38803.1| GF24983 [Drosophila ananassae]
Length = 663
Score = 80.9 bits (198), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 68/132 (51%), Gaps = 1/132 (0%)
Query: 59 SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
S +P L GE+ L AS +K+ + KGR V+A E V GD + E+PVA ++P
Sbjct: 237 STEQVPKLTHGENPELSGASKVVKLVETKDKGRFVVASEGVKTGDVVLCEEPVAACLIPS 296
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
+NC+HC + P+ C C +CS C EA YHR+EC + LL G++ L
Sbjct: 297 FFGTNCHHCFKRLHTPVSCLHCSGVAFCSAQCMGEACSSYHRFECE-FMDLLIGSGMSIL 355
Query: 179 ALKLILVSSHSD 190
+ + + ++
Sbjct: 356 CFIALRIFTQAN 367
>gi|383854636|ref|XP_003702826.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 631
Score = 80.5 bits (197), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 72/137 (52%), Gaps = 5/137 (3%)
Query: 32 LTIEKSEENSVEAMSM---QVSSTSSSASSSVASLPSLACGESHT--LMCASNKIKMQTS 86
+TI+ + + M + +VS + S S+ SL ++H + CAS+ + ++ +
Sbjct: 173 MTIDNNTRKKMNDMFISAGRVSVSDKSNKQSITKQSSLPELKTHNPEIPCASDAVTLKYN 232
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
GRH +A D+H G+ + VEK + ++ P ++C++C IPCN C A+YC
Sbjct: 233 EHYGRHFVAARDIHPGEIIAVEKAYSVMLDPDNLQTHCSNCLKVSWTNIPCNYCTYAIYC 292
Query: 147 SESCRREAWLRYHRWEC 163
SE C+ W +YH EC
Sbjct: 293 SEECKLSDWKKYHDIEC 309
>gi|195126158|ref|XP_002007541.1| GI12333 [Drosophila mojavensis]
gi|193919150|gb|EDW18017.1| GI12333 [Drosophila mojavensis]
Length = 661
Score = 80.5 bits (197), Expect = 4e-13, Method: Composition-based stats.
Identities = 38/102 (37%), Positives = 56/102 (54%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P+L GE+ L A+ +K+ + KGR V+A + + GD + E PVA +LP +
Sbjct: 239 VPALTHGENPELSGAAKVVKLVETKDKGRFVVADQGLKTGDVVLAEDPVAACLLPNFFGT 298
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
NC+HC + P+PC C +CS C EA YHR+EC
Sbjct: 299 NCHHCFKRLHTPVPCLHCSGIAFCSAQCMGEACETYHRFECQ 340
>gi|307195197|gb|EFN77181.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 693
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 65/117 (55%), Gaps = 6/117 (5%)
Query: 52 TSSSASSSVASLPSLACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
T S V ++P + G+ H+ L+ + + ++ A+GRH++A D+ G L V++P
Sbjct: 210 TPSFGKYCVTTVPVVD-GKRHSCLVSCPDSVTLRFDVARGRHLVATRDIRPGAVLIVDRP 268
Query: 111 VAFVILPPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++ + NC HC ++ S IPC C +C+E+CR+EAW RYHR+EC
Sbjct: 269 FSYSTDASALIRNCLHCHATLKLENSVRIPCRNCQTVSFCTETCRKEAWQRYHRYEC 325
>gi|91089699|ref|XP_974834.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
gi|270011321|gb|EFA07769.1| hypothetical protein TcasGA2_TC005323 [Tribolium castaneum]
Length = 543
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 73/135 (54%), Gaps = 13/135 (9%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
GR V+A D+ G+ L VEKP+ +++ S ++C+ C + IPC C A+YCSES
Sbjct: 196 GRFVVATRDIKVGEVLAVEKPLVTLVVNELS-NHCHECVSLCYNLIPCKTCTQAMYCSES 254
Query: 150 CRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSS----------HSDRYKEVYH 197
CR A+ YH++EC LR L+ + ALK+ L + SDRY+E+++
Sbjct: 255 CRDYAFDMYHKYECSILATLRFLQFDKLKLFALKISLQYNDLGETSDLLYRSDRYREIHN 314
Query: 198 LETHLQDMRPEDLYQ 212
L T+ DL++
Sbjct: 315 LVTNTTKRSVPDLFE 329
>gi|24652742|ref|NP_725048.1| CG7759, isoform A [Drosophila melanogaster]
gi|21627489|gb|AAF58645.2| CG7759, isoform A [Drosophila melanogaster]
Length = 660
Score = 80.1 bits (196), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV G+ L VE+P V+L + ++C +C P+ C C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA +YH++EC G + ++ G + H+AL++I
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 360 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399
>gi|194752924|ref|XP_001958769.1| GF12554 [Drosophila ananassae]
gi|190620067|gb|EDV35591.1| GF12554 [Drosophila ananassae]
Length = 660
Score = 80.1 bits (196), Expect = 5e-13, Method: Composition-based stats.
Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ + +GR A DV G+ L VE+P V+L + ++C +C P+ C+ C
Sbjct: 241 VRIDRNRQEGRFARAAADVKAGEELLVERPFVAVLLEKFAKTHCENCFVRSGVPVACSRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA ++H++EC G + +L G + H+AL++I
Sbjct: 301 ADVIYCSEQCREEAANKFHKYEC-GIVPILWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 360 IDEELSPEKLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399
>gi|195123911|ref|XP_002006445.1| GI18563 [Drosophila mojavensis]
gi|193911513|gb|EDW10380.1| GI18563 [Drosophila mojavensis]
Length = 666
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 108/240 (45%), Gaps = 32/240 (13%)
Query: 1 ALKQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSV 60
ALK+ + T S+L R + HLD + +E + +A Q + S++ ++
Sbjct: 172 ALKKCITATDDSTLPADR-RSKLHLDAMTMIKMLENDPRTAKQAAKQQKLKEAKSSTPTL 230
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
+L + S+ +++ + +GR A DV G L VE P V+L +
Sbjct: 231 EQAQTLPYEKEFV----SDLVRIDQNPQEGRFARAASDVQVGQELLVEHPYVAVLLEKYA 286
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---H 177
++C C P+ C C +YCSE C+++A +YH++EC G L ++ G + H
Sbjct: 287 QTHCEFCFMRTVVPVSCPGCSDVIYCSEQCQQKAAAKYHKFEC-GLLPVIWRSGASINNH 345
Query: 178 LALKLI-----------------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
+AL++I L+S D ++ V HLE H ++ P + +QYV
Sbjct: 346 MALRIIASKPLDYFMQLRASLDEELSLEQLLSLPKDDFRRVAHLERHQKERPPSNFFQYV 405
>gi|432896582|ref|XP_004076331.1| PREDICTED: SET and MYND domain-containing protein 4-like [Oryzias
latipes]
Length = 770
Score = 80.1 bits (196), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 15/120 (12%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP---------------PCSMSNC 124
K+ ++ + KGRH++AVE + GD + ++P +FV++P C
Sbjct: 185 KVAIEYNLEKGRHLVAVERIAPGDVILTDRPYSFVLIPEMEEMREKDARADMFGTQYLRC 244
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
+ C +PC C + YCS C+REAW +H WEC L+ +AHLAL++ L
Sbjct: 245 HQCLAKTLCSVPCEGCSYSRYCSARCQREAWEEHHSWECSLGADLMAMGVMAHLALRITL 304
>gi|195333441|ref|XP_002033400.1| GM21288 [Drosophila sechellia]
gi|194125370|gb|EDW47413.1| GM21288 [Drosophila sechellia]
Length = 660
Score = 79.7 bits (195), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV G+ L VE+P V+L + ++C +C P+ C C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA +YH++EC G + ++ G + H+AL++I
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 360 IDEQLTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 399
>gi|443693726|gb|ELT95020.1| hypothetical protein CAPTEDRAFT_183922 [Capitella teleta]
Length = 666
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/182 (26%), Positives = 85/182 (46%), Gaps = 34/182 (18%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P+LA + T AS +++ ++ KGR ++A +D+ G+ L V++P+ V+ S
Sbjct: 223 PALAHLGNETHPNASRALRVGSTPEKGRQILADQDICLGEVLMVDQPITSVLFQDHHASR 282
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA---VGIAHLAL 180
C HC AP PC E ++CS CR +AW Y ++E A +L+A I LAL
Sbjct: 283 CYHCFQRCLAPFPCFETSSVIFCSRECRDKAWNTYQKYEYQYAA-VLDANFCGKIGQLAL 341
Query: 181 KLIL------------------------------VSSHSDRYKEVYHLETHLQDMRPEDL 210
+++L + Y+ +YH++TH + +P+DL
Sbjct: 342 RMLLECNLEGVLEFVRGLDKTGDVDDMEKVGLDKEGKYCSDYRSLYHMKTHADERKPDDL 401
Query: 211 YQ 212
++
Sbjct: 402 FE 403
>gi|19922036|ref|NP_610684.1| CG7759, isoform B [Drosophila melanogaster]
gi|16768704|gb|AAL28571.1| HL04910p [Drosophila melanogaster]
gi|21627490|gb|AAM68738.1| CG7759, isoform B [Drosophila melanogaster]
Length = 567
Score = 79.7 bits (195), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV G+ L VE+P V+L + ++C +C P+ C C
Sbjct: 148 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKTHCENCFMRTVVPVACPRC 207
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA +YH++EC G + ++ G + H+AL++I
Sbjct: 208 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 266
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 267 IDEELTPEQLISLPKDDFRRVAQLERHQGERQPSNFFQHV 306
>gi|195582454|ref|XP_002081043.1| GD10796 [Drosophila simulans]
gi|194193052|gb|EDX06628.1| GD10796 [Drosophila simulans]
Length = 646
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 77/159 (48%), Gaps = 27/159 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ +GR A DV G+ L VE+P V+L + S+C +C P+ C C
Sbjct: 241 VRIDSNRQEGRFARASADVKPGEELLVERPFVSVLLEKFAKSHCENCFMRTVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-------------- 183
+YCSE CR EA +YH++EC G + ++ G + H+AL++I
Sbjct: 301 ADVLYCSEQCREEASKKYHKYEC-GIVPIIWRSGASINNHIALRIIASKPLDYFLKLKPT 359
Query: 184 ---------LVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
L+S D ++ V LE H + +P + +Q+
Sbjct: 360 IDEQLTPEQLISLPKDDFRRVAQLERHQGERQPPNFFQH 398
>gi|340716457|ref|XP_003396714.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 635
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 78/152 (51%), Gaps = 9/152 (5%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
LA G S AS+ I + S GRH++A ++ GD + +E P A+VI ++C+
Sbjct: 229 LAYGPSKEAPAASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYYTHCH 288
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLLEAVGIAHLA- 179
HC + IPC+ C +A YCSE CR+ AW H+ EC G L ++ I L
Sbjct: 289 HCLSRSYNLIPCSYCPVAQYCSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTK 348
Query: 180 --LKLILVSSHSDRYKEVYHLETHLQDMRPED 209
LI+V+S ++ E+ ++ L + P++
Sbjct: 349 IIRFLIVVTSKGKKFDEL-RVDMELAESNPDN 379
>gi|383864965|ref|XP_003707948.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 674
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 63/119 (52%), Gaps = 4/119 (3%)
Query: 49 VSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVE 108
VS ++S+++ V ++PS+ + L S+ + +Q +GRH++A ++ G L VE
Sbjct: 193 VSLENASSANRVTAIPSIDGKQHAMLESCSDAVTLQFDEKRGRHLVATRNIKAGYILIVE 252
Query: 109 KPVAFVILPPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
P AF NC +C ++ S IPC C +CSE CR EAW YHR+EC
Sbjct: 253 SPFAFSTNNEAFDRNCLNCHVTLKLNDSVKIPCYFCRTVSFCSEKCRNEAWKLYHRYEC 311
>gi|350404370|ref|XP_003487084.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 635
Score = 79.0 bits (193), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 78/152 (51%), Gaps = 9/152 (5%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
LA G S AS+ I + S GRH++A ++ GD + +E P A+VI ++C+
Sbjct: 229 LAYGPSKEAPAASDGISISFSEKYGRHLVATKEFKPGDVVTIENPYAYVIYTQRYYTHCH 288
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLLEAVGIAHLA- 179
HC + IPC+ C +A YCSE CR+ AW H+ EC G L ++ I L
Sbjct: 289 HCLSRSYNLIPCSHCPVAQYCSEKCRKLAWEMAHQIECPIMALVGNLLNVDKDKIRMLTK 348
Query: 180 --LKLILVSSHSDRYKEVYHLETHLQDMRPED 209
LI+++S ++ E+ ++ L + P++
Sbjct: 349 IIRFLIVITSKGKKFDEL-RVDMELAESNPDN 379
>gi|195160613|ref|XP_002021169.1| GL24960 [Drosophila persimilis]
gi|194118282|gb|EDW40325.1| GL24960 [Drosophila persimilis]
Length = 661
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 4/135 (2%)
Query: 59 SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
S +P LA GE+ L A+N +K+ + KGR V+A E + GD + E PVA ++P
Sbjct: 235 STEKVPQLAHGENPELSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPS 294
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
++C+HC + P+ C C +CS C EA YHR+EC + LL G++
Sbjct: 295 FFGTHCHHCFKRLHTPVACLHCSGIAFCSAQCMGEACESYHRFECE-FMDLLIGSGMSIL 353
Query: 177 -HLALKLILVSSHSD 190
+AL++ ++ D
Sbjct: 354 CFIALRIFTQAASVD 368
>gi|125978717|ref|XP_001353391.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
gi|54642149|gb|EAL30898.1| GA12774 [Drosophila pseudoobscura pseudoobscura]
Length = 661
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 71/135 (52%), Gaps = 4/135 (2%)
Query: 59 SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
S +P LA GE+ L A+N +K+ + KGR V+A E + GD + E PVA ++P
Sbjct: 235 STEKVPQLAHGENPELSGAANVVKLVETKEKGRFVVASECLKTGDVVLCENPVAACLIPS 294
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
++C+HC + P+ C C +CS C EA YHR+EC + LL G++
Sbjct: 295 FFGTHCHHCFKRLHTPVACLHCSGIAFCSAQCMGEACESYHRFECE-FMDLLIGSGMSIL 353
Query: 177 -HLALKLILVSSHSD 190
+AL++ ++ D
Sbjct: 354 CFIALRIFTQAASVD 368
>gi|156543612|ref|XP_001604392.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 629
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 74/144 (51%), Gaps = 2/144 (1%)
Query: 20 QDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASN 79
+D +L F ++++ E + + +E +S + ++ + + LA G S S+
Sbjct: 169 KDEKNLTFIKELIDKEDAATDEMELISKKPANVPRYLADE--NDLKLANGPSDEAPSISD 226
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
IK+ S GRH+IA + GD L +EKP A VI ++C++C IPC
Sbjct: 227 GIKIAYSEKYGRHLIATKPFEPGDILLLEKPYANVIYREKYYTHCHYCLARSYNLIPCPH 286
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C L++YCSE+CR AW + H EC
Sbjct: 287 CPLSLYCSENCRTLAWSKGHEIEC 310
>gi|322794112|gb|EFZ17321.1| hypothetical protein SINV_03681 [Solenopsis invicta]
Length = 464
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 106/249 (42%), Gaps = 49/249 (19%)
Query: 4 QDVQETLSSS---LKLCRLQDTSH--LDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASS 58
Q V +TL L L DTS L K DVL ++ ++ +T +
Sbjct: 157 QSVDDTLKKVQHWLDKMSLNDTSREKLKTKLDVLRNRNTQ--------TKLCATDTRRKV 208
Query: 59 SVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
+ LP++A + + CAS+ + ++ +N GRH++A D+ G+ + VEKP + ++
Sbjct: 209 LESPLPTIA-SHNDEVPCASDAVVIKYNNHYGRHIVATRDICPGEVIAVEKPYSLILKQE 267
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-------------- 164
++C++C A IPCN C A+YCSE C+ W + H EC
Sbjct: 268 NMQTHCSNCLRVCWANIPCNYCTYAMYCSEECKNIEWKKCHDVECAIFPALIEYSFYNLD 327
Query: 165 ------GALRLLEAVGIAHLALKLILVSS---------------HSDRYKEVYHLETHLQ 203
L L +A G+ L L V HSDRY VY L T+ +
Sbjct: 328 WCSIRLAVLALRDAGGMKELRTMLKEVDEHADPRTRGFSQDGKLHSDRYISVYSLVTNSE 387
Query: 204 DMRPEDLYQ 212
DL++
Sbjct: 388 KRSVSDLFK 396
>gi|345326540|ref|XP_003431054.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4-like [Ornithorhynchus anatinus]
Length = 815
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 73/153 (47%), Gaps = 18/153 (11%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS------- 122
E+ + AS+ + ++ +GRH+IA ED+ G+ L E+ V+ P +
Sbjct: 219 ENDQIPQASSCVLLRADIPRGRHLIAKEDILPGELLVREEAFVSVLTPGEDLGLQEGLET 278
Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
+C+HC +S+ IPCN C A YCS+ C ++AW RYH+ EC LL
Sbjct: 279 KWNPGITHGDLHCHHCLKPVSSSIPCNGCSYAKYCSQECLQQAWERYHKIECPLGGILLT 338
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
H+AL+ +LV+ D + V L +
Sbjct: 339 LGVFCHVALRTVLVAGFEDVSRSVERFPEQLTN 371
>gi|307183763|gb|EFN70437.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 646
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 38/101 (37%), Positives = 60/101 (59%), Gaps = 1/101 (0%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LP++A + + CAS+ + ++ + GRHVIA +++ G+ + VEKP A +++ +
Sbjct: 219 LPTIA-SYNDEVPCASDAVAIKYNTRCGRHVIATRNINPGEVIAVEKPYALLLVQQNMQT 277
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+C++C A IPCN C A+YCSE CR W R H EC
Sbjct: 278 HCSNCLKVCWANIPCNYCTYAMYCSEECRYAEWKRCHDVEC 318
>gi|195435832|ref|XP_002065883.1| GK20590 [Drosophila willistoni]
gi|194161968|gb|EDW76869.1| GK20590 [Drosophila willistoni]
Length = 661
Score = 77.8 bits (190), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 75/156 (48%), Gaps = 8/156 (5%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LP L G + L A+ +K+ + KGR V+A + GD + E PVA +LP +
Sbjct: 239 LPELTHGPNPELSGAAQGVKLVETKEKGRFVVANGGLKTGDVVLCENPVAACLLPNFFGT 298
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLA 179
NC+HC + P+ C C +CS C EA YHR+EC + LL G++ +A
Sbjct: 299 NCHHCFKRLRNPVACLHCSGVAFCSAQCMGEASESYHRFECE-FMDLLIGSGMSILCFIA 357
Query: 180 LKLIL-VSSHSDRYKEVYHLETHL---QDMRPEDLY 211
L++ SS D L HL +D+R D Y
Sbjct: 358 LRVFTQASSLEDGLNTANLLFEHLCSHEDVRQADDY 393
>gi|328785040|ref|XP_001122116.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 629
Score = 77.4 bits (189), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 61/105 (58%), Gaps = 5/105 (4%)
Query: 61 ASLPSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
A LP + ++H + CAS+ I ++ ++ GRH++A ++ G+ + +EKP + ++ P
Sbjct: 206 APLPKI---KTHNIEVPCASDAITIKYNDKYGRHIVATRKINPGEVIAIEKPYSLILTPD 262
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++C++C A IPC C A+YCSE C+ W +YH EC
Sbjct: 263 NIYTHCSNCLEVSWANIPCEYCTYAMYCSEECKAMEWKKYHDIEC 307
>gi|260788099|ref|XP_002589088.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
gi|229274262|gb|EEN45099.1| hypothetical protein BRAFLDRAFT_75068 [Branchiostoma floridae]
Length = 909
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 72/154 (46%), Gaps = 13/154 (8%)
Query: 20 QDTSHLDFKEDVLTIEKSE--ENSVEAM-----------SMQVSSTSSSASSSVASLPSL 66
+D + + K + IEK++ E+ EAM S Q S+ +P L
Sbjct: 168 KDDAKKEVKGTLEAIEKADIPEDRKEAMRKELKLAGRNISKQEEGKEGEESAEEVKVPKL 227
Query: 67 ACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNH 126
GES + AS ++M+ KGR ++A + G L VE+P A V+L ++C+
Sbjct: 228 QYGESQQVAHASGGVEMREEEDKGRMLVAQKAFEPGSVLIVEQPYAAVLLQKHHSTHCHT 287
Query: 127 CCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
C T + P PC C YCS +C +AW YHR
Sbjct: 288 CVTPVLVPHPCRGCQYVQYCSGTCEEQAWREYHR 321
>gi|195151412|ref|XP_002016641.1| GL11690 [Drosophila persimilis]
gi|194110488|gb|EDW32531.1| GL11690 [Drosophila persimilis]
Length = 660
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 73/159 (45%), Gaps = 25/159 (15%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S +GR A DV G+ L VE+P V+L + S+C +C P+ C C
Sbjct: 241 VRIDRSPQEGRFARAAADVKAGEELLVERPFVSVLLEKFAKSHCENCFVRSVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWEC---------------HGALRLLEAVGIAH-LALKLI- 183
+YCSE CR EA +YH++EC H ALR+ + + + L LK
Sbjct: 301 ADVIYCSEQCRDEAAKKYHKYECGIVPTIWRSGASINNHIALRIFASKPLEYFLQLKPTI 360
Query: 184 --------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H P + +Q+V
Sbjct: 361 DKELSPEELLSLPKDDFRRVAQLERHQNKRPPSNFFQHV 399
>gi|47221832|emb|CAG08886.1| unnamed protein product [Tetraodon nigroviridis]
Length = 743
Score = 77.4 bits (189), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 45/173 (26%), Positives = 68/173 (39%), Gaps = 50/173 (28%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------------------CSMSNCN 125
S KGRH++A E + GD + ++P + V++P C+
Sbjct: 195 SPEKGRHLVATERIAAGDVILSDRPYSCVLIPGMKEVKGKGAKQGTDGGELFGIEQRRCH 254
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL- 184
C P+PC+ C + YCS SC++EAW +HRWEC L+ G+ LAL++ L
Sbjct: 255 RCLAETLCPLPCDGCSYSRYCSASCQQEAWEEHHRWECPLGAELMVMGGMLQLALRVALK 314
Query: 185 -----------------------------VSSHSDRYKEVYHLETHLQDMRPE 208
H D Y V+HL H+ P+
Sbjct: 315 AGQENIQTARKLIRDERASSKCSSQSSFQTPGHGDSYLSVFHLLHHMNQHSPQ 367
>gi|307179275|gb|EFN67661.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 631
Score = 77.0 bits (188), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 60/111 (54%), Gaps = 4/111 (3%)
Query: 57 SSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVIL 116
+A++P + L S+ ++++ A+GRH++A D+ G L V++P +F
Sbjct: 157 GKRIAAMPVVDGKPHDCLASCSDSVELRLDEARGRHLVAARDIRPGAVLIVDQPFSFSTD 216
Query: 117 PPCSMSNCNHCCTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
P NC +C ++ S IPC C +C+E+CR++AW YH++EC
Sbjct: 217 GPALSRNCLYCHATLKLEDSVKIPCRNCQTVSFCTETCRKKAWEAYHQYEC 267
>gi|332023172|gb|EGI63428.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 624
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 35/100 (35%), Positives = 56/100 (56%), Gaps = 5/100 (5%)
Query: 69 GESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
G+ H+ L+ + +++Q A+GRH++A D+ G L V++P +F NC HC
Sbjct: 167 GKPHSCLVSCPDSVELQVDEARGRHLVATRDIRPGTVLIVDRPFSFSTDASALDRNCLHC 226
Query: 128 CTSI----SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++ S IPC C +C+E CR+EAW YH++EC
Sbjct: 227 HATLKLEDSVRIPCRNCQTVAFCTEVCRKEAWETYHQYEC 266
>gi|125810418|ref|XP_001361485.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
gi|54636660|gb|EAL26063.1| GA20567 [Drosophila pseudoobscura pseudoobscura]
Length = 660
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 73/159 (45%), Gaps = 25/159 (15%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S +GR A DV G+ L VE+P V+L + ++C +C P+ C C
Sbjct: 241 VRIDRSPQEGRFARAAADVKAGEELLVERPFVSVLLEKFAKTHCENCFVRSVVPVACPRC 300
Query: 141 ILAVYCSESCRREAWLRYHRWEC---------------HGALRLLEAVGIAH-LALKLI- 183
+YCSE CR EA +YH++EC H ALR+ + + + L LK
Sbjct: 301 ADVIYCSEQCRDEAAKKYHKYECGIVPTIWRSGASINNHIALRIFASKPLEYFLQLKPTI 360
Query: 184 --------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H P + +Q+V
Sbjct: 361 DKELSPEELLSLPKDDFRRVAQLERHQNKRPPSNFFQHV 399
>gi|345497540|ref|XP_001600173.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 76.6 bits (187), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 61/107 (57%), Gaps = 3/107 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+ +Q + GRH++A D++ G+ +F+E+P + + C HC T IPC C
Sbjct: 202 VDVQYNEKYGRHLVAKRDINPGEIIFIEEPYMHCLDLVRGYTYCFHCLTPCLITIPCEHC 261
Query: 141 ILAVYCSESCRREAWLRYHRWEC---HGALRLLEAVGIAHLALKLIL 184
A++CSE C+++AW++YH EC A ++ G+ H+A+K ++
Sbjct: 262 GWAMFCSEGCKQQAWVKYHDLECAVYDFAKENVDGDGVKHMAVKSLI 308
>gi|197304712|ref|NP_001127869.1| N-lysine methyltransferase SMYD2-like [Nasonia vitripennis]
Length = 704
Score = 76.3 bits (186), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 91/219 (41%), Gaps = 40/219 (18%)
Query: 34 IEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCA-SNKIKMQTSNAKGRH 92
+EK + ++ + +S S A+++ +P++ + + + + S+ + + + AKGR
Sbjct: 230 LEKKQSLALSQRLLDAASRSRRATANQIRIPTVHGAKPNDFLTSCSDALALGSDEAKGRK 289
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----IPCNECILAVYCSE 148
+ A D+ G L V++P A NC C S+ P +PC C +CSE
Sbjct: 290 LYATRDIKPGTVLIVDRPFASTTDVAALCRNCLCCHASLKLPESIRVPCEHCQAVQFCSE 349
Query: 149 SCRREAWLRYHRWECH------------------------GALRLLEAVGIAHLALKL-- 182
CRR AW ++H++EC ALR + + LKL
Sbjct: 350 DCRRRAWNKFHKYECRIFDYFYAKQAQKTNCLLAYRAVVAAALRNEKEFSLDQEFLKLHE 409
Query: 183 ---------ILVSSHSDRYKEVYHLETHLQDMRPEDLYQ 212
L S S YK V+ LE+H D P D Q
Sbjct: 410 NDKDSLDIASLASYESRDYKTVFSLESHCADADPADNLQ 448
>gi|195382848|ref|XP_002050140.1| GJ20356 [Drosophila virilis]
gi|194144937|gb|EDW61333.1| GJ20356 [Drosophila virilis]
Length = 662
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 76/163 (46%), Gaps = 27/163 (16%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
S+ +++ + +GR A DV G L VE P V+L + ++C +C P+ C
Sbjct: 240 SSLVRIDQNAQEGRFARAAADVQVGQELLVEHPYVAVLLEKFAQTHCEYCFVRTVVPVAC 299
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------L 184
C +YCSE C+ A +YH++EC G L ++ G + H+AL++I L
Sbjct: 300 PGCSDVIYCSEQCQERASAKYHKYEC-GILPVIWRSGASINNHMALRIIASKPLDYFLQL 358
Query: 185 VSSH-------------SDRYKEVYHLETHLQDMRPEDLYQYV 214
SS D ++ V HLE H +P + +QYV
Sbjct: 359 KSSLDEELSLEQLLSLPKDDFRRVAHLERHEGQRQPSNFFQYV 401
>gi|340724466|ref|XP_003400603.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 631
Score = 76.3 bits (186), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 70/135 (51%), Gaps = 11/135 (8%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
CAS+ + ++ + GRH++A + G+ + VEKP + ++ P ++C++C A I
Sbjct: 222 CASDAVTIKYNEEYGRHIVATHKIRPGEVIAVEKPYSLILTPDNIHTHCSNCLEVSWANI 281
Query: 136 PCNECILAVYCSESCRREAWLRYHRWEC--HGALRLLEAVGIAHLALKLILVSSHSDRYK 193
PC C A+YCSE C+ W +YH EC ++ + V + +L+L + +
Sbjct: 282 PCEHCTYAMYCSEECKIMEWKKYHDVECTVFPSMLKMNFVKLDLFSLRLAIQAVRE---- 337
Query: 194 EVYHLETHLQDMRPE 208
TH+Q++R E
Sbjct: 338 -----ATHIQELRKE 347
>gi|328791769|ref|XP_001121272.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 668
Score = 75.9 bits (185), Expect = 9e-12, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 63 LPSLACGESHTLM-CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
+P+L G+ HT++ S+ + +Q +GRH+IA +++ G L VE P AF
Sbjct: 203 IPTLN-GKQHTILKSCSDAVTLQFDEKRGRHLIATKNIKAGSVLIVETPFAFSTNKEALG 261
Query: 122 SNCNHC----CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
NC HC +S S IPC C +CSE CR +AW YH++EC
Sbjct: 262 RNCLHCHITLMSSNSVKIPCYYCQTVSFCSEKCRSKAWQIYHQYEC 307
>gi|350418674|ref|XP_003491932.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 681
Score = 75.5 bits (184), Expect = 1e-11, Method: Composition-based stats.
Identities = 41/124 (33%), Positives = 64/124 (51%), Gaps = 1/124 (0%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
LA E+ L AS + ++ + G+ IA E + GD L VE P A +LP ++C
Sbjct: 191 LASDENPRLPGASTLLDIEETENAGKRAIAAEPIEPGDRLVVEAPYAATLLPEFFGTHCQ 250
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV 185
HC + API C +C +C CR A YH++EC L LL G++ L++ + +
Sbjct: 251 HCFSRFVAPIGCPDCSSVAFCGRECRDAAMASYHKYECK-VLALLIGSGMSVLSMLALRM 309
Query: 186 SSHS 189
++ S
Sbjct: 310 TTQS 313
>gi|380021898|ref|XP_003694793.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 629
Score = 75.5 bits (184), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 60/105 (57%), Gaps = 5/105 (4%)
Query: 61 ASLPSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
A LP + ++H + CAS+ I ++ ++ GRH++A + G+ + +EKP + ++ P
Sbjct: 206 APLPKI---KTHNIEVPCASDAITIKYNDKYGRHIVATRKISPGEVVAIEKPYSLILTPD 262
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++C++C A IPC C A+YCSE C+ W +YH EC
Sbjct: 263 NIHTHCSNCLEVSWANIPCEYCTYAMYCSEECKAMEWKKYHDIEC 307
>gi|350424957|ref|XP_003493967.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 631
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 86/162 (53%), Gaps = 14/162 (8%)
Query: 12 SSLKLCRLQDTSHLDFKEDVLTIEK--SEENSVEAMSMQVSSTSSSASS------SVASL 63
++LK R++D KE +EK S++ + + +S ++++T + +S ++
Sbjct: 152 NALKHPRMEDA----IKEAQYWLEKMSSDDINRKRLSEKLNNTKHNLASQKLKKQNIIKQ 207
Query: 64 PSLACGESHTLM--CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
P L ++ + CAS+ + ++ + GRH++A + G+ + VEKP + ++ P
Sbjct: 208 PPLPKIKTRNIEVPCASDAVTIKYNEEYGRHIVATRKIRPGEVIAVEKPYSLILTPDNIH 267
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++C++C A IPC C A+YCSE C+ W +YH EC
Sbjct: 268 THCSNCLEVSWANIPCEHCTYAMYCSEECKAMEWKKYHDVEC 309
>gi|383866011|ref|XP_003708465.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 636
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 53/98 (54%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
L G S AS+ + + S GRH++A ++ GD + +E P AF I ++C+
Sbjct: 230 LTYGPSKEAPAASDGVSIAFSEKYGRHLVATKEFKPGDIVTIEDPFAFTIYMQSYFTHCH 289
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
HC IPC +C +A YCSE+CR+ AW H+ EC
Sbjct: 290 HCLARSFNLIPCLKCPVAQYCSETCRKLAWEMAHQIEC 327
>gi|328776415|ref|XP_003249162.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 714
Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 46/153 (30%), Positives = 77/153 (50%), Gaps = 2/153 (1%)
Query: 28 KEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSN 87
KE ++ E S+ ++ QVS + A + L GE+ L AS + ++ +
Sbjct: 178 KETCEKFLRANEKSLVSIE-QVSDRRNGALVATEEKYVLKGGENPRLPGASTFLDVEETV 236
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
G+ VIA +++ G+ L +E P A ++LP ++C HC + API C +C +C
Sbjct: 237 NAGKRVIAAKNIEPGNRLIIESPHAAILLPEFFGTHCQHCFSRFKAPIGCPDCSSVAFCG 296
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
CR A YH++EC L LL G++ L++
Sbjct: 297 RKCRDTALASYHKYECK-ILVLLIGSGMSVLSM 328
>gi|158299452|ref|XP_319583.4| AGAP008839-PA [Anopheles gambiae str. PEST]
gi|157013526|gb|EAA14823.4| AGAP008839-PA [Anopheles gambiae str. PEST]
Length = 650
Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 48/152 (31%), Positives = 68/152 (44%), Gaps = 26/152 (17%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S +GR D+ L +E+P V+L S+ +C HC +S PI C C V+
Sbjct: 236 SPDEGRFARTNTDLKPNTILLLERPHVSVLLEDYSLDHCTHCFKRVSVPIACPLCADVVF 295
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHS------------- 189
CS+ C +A YHR+EC G L +L G + H+AL++I S
Sbjct: 296 CSDECETKANATYHRYEC-GFLPILWGSGASITCHMALRMITQKSEEYFLKLKPELAGLT 354
Query: 190 ---------DRYKEVYHLETHLQDMRPEDLYQ 212
D Y++VY L TH PED +Q
Sbjct: 355 NEQIDKLPVDDYRKVYKLVTHESTRSPEDFFQ 386
>gi|357610542|gb|EHJ67027.1| hypothetical protein KGM_11275 [Danaus plexippus]
Length = 661
Score = 75.1 bits (183), Expect = 2e-11, Method: Composition-based stats.
Identities = 47/169 (27%), Positives = 77/169 (45%), Gaps = 23/169 (13%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P L+ + AS +++ +GR A D+ G+ L +E+P V+L S S+
Sbjct: 228 PKLSGKHNKLFPAASETVEIAFDETRGRFAKADRDIQAGEILLIEEPHGGVLLSEFSKSH 287
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
C +C P+PC +C ++CSE C A YH +ECH L LL G + H+AL
Sbjct: 288 CQNCFNKCLIPLPCPKCPNVIFCSEKCLDIALKSYHGYECH-ILPLLWKSGCSVTCHIAL 346
Query: 181 KLILVSS-------------------HSDRYKEVYHLETHLQDMRPEDL 210
++I +S ++ Y+ +YHL +H +D+
Sbjct: 347 RMITQNSKDYFMKIMQDLKEKPTGPYKTEDYRNIYHLVSHENKRTKQDI 395
>gi|158287355|ref|XP_309411.4| AGAP011232-PA [Anopheles gambiae str. PEST]
gi|157019610|gb|EAA05175.4| AGAP011232-PA [Anopheles gambiae str. PEST]
Length = 434
Score = 74.7 bits (182), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 53/98 (54%), Gaps = 2/98 (2%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
GRHV+A + GD + VEKP A V+ C C IPC EC +A+YCS+
Sbjct: 162 GRHVVATRQLRVGDVVMVEKPYATVLSDHMKRVRCAFCHAEEPFLLIPCEECTIAMYCSQ 221
Query: 149 SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
C R AW +YHR+EC L + +G +LAL + V+
Sbjct: 222 KCLRAAWQQYHRYEC-PILNDMRTIGTEYLALAVRTVA 258
>gi|242009727|ref|XP_002425634.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509527|gb|EEB12896.1| set and mynd domain-containing protein, putative [Pediculus humanus
corporis]
Length = 666
Score = 74.3 bits (181), Expect = 3e-11, Method: Composition-based stats.
Identities = 40/110 (36%), Positives = 59/110 (53%), Gaps = 6/110 (5%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS KI M+ + GR A E++ G+T+ VEKP V+L + NC HC P P
Sbjct: 233 ASVKIFMKNDSTVGRFASASENIFPGETILVEKPYCSVLLEDYHLINCQHCFKRTQGPYP 292
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI 183
C C ++CS +CR A YH++EC L+ L G++ +AL++I
Sbjct: 293 CENCSQVIFCSLNCRESA--NYHKYEC-SILQTLHKSGVSITVLMALRMI 339
>gi|345482384|ref|XP_001608081.2| PREDICTED: hypothetical protein LOC100124188 isoform 1 [Nasonia
vitripennis]
gi|345482386|ref|XP_003424586.1| PREDICTED: hypothetical protein LOC100124188 isoform 2 [Nasonia
vitripennis]
Length = 654
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 54/101 (53%), Gaps = 2/101 (1%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
LPS + + CAS+ + + GRH+IA D+ G+ + VE+ + P +
Sbjct: 222 LPSFKPNKEAS--CASDAVAIAHHEIWGRHIIAERDIEPGEIIVVEENYLSFLDPTKMYA 279
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C+ C IPCN CI VYCSE C+ EAW +YH++EC
Sbjct: 280 FCSTCMKPSLCLIPCNNCIYDVYCSEECKSEAWKKYHQFEC 320
>gi|195119678|ref|XP_002004356.1| GI19663 [Drosophila mojavensis]
gi|193909424|gb|EDW08291.1| GI19663 [Drosophila mojavensis]
Length = 762
Score = 74.3 bits (181), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 78/158 (49%), Gaps = 15/158 (9%)
Query: 31 VLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKG 90
V +E+ +E + QVS +S A PSL + A N + KG
Sbjct: 187 VNQLEQLQEQLTKLQEQQVSEPKEELTSEQA--PSLQ-------ILARNHKIWNSPGVKG 237
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYCSE 148
RH++A + + GD +F E+ FV L P + C C S+ APIPC +C VYCS
Sbjct: 238 RHMVATQSIQPGDIIFKEQTACFVPLEPGLI--CQQCAASVLYAPIPCPQCHQQVVYCSR 295
Query: 149 SCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
CR+ + H +EC G + L +G++HLAL+++L
Sbjct: 296 HCRQLN-EQIHSYECAGYRKNLFTMLGVSHLALRMVLF 332
>gi|312373131|gb|EFR20943.1| hypothetical protein AND_18265 [Anopheles darlingi]
Length = 675
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 72/152 (47%), Gaps = 27/152 (17%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S+A+GR D+ L +E+P +L S+ +C++C +S PI C C V+
Sbjct: 250 SDAEGRFAKTSADLKPNTILLLERPHVSALLEDYSLDHCSNCFKRVSVPIACPLCSDVVF 309
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI------------------L 184
CS+ C R A +HR+EC G L +L G + H+AL++I L
Sbjct: 310 CSDECERRANASFHRYEC-GFLPILWGSGASITCHMALRMITQKSSEYFAKLQPELAAGL 368
Query: 185 VSSHSDR-----YKEVYHLETHLQDMRPEDLY 211
+ DR Y++VY L TH PEDL+
Sbjct: 369 TNEQIDRLPVNDYRKVYQLVTHEATRTPEDLF 400
>gi|195027211|ref|XP_001986477.1| GH21386 [Drosophila grimshawi]
gi|193902477|gb|EDW01344.1| GH21386 [Drosophila grimshawi]
Length = 664
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 77/163 (47%), Gaps = 27/163 (16%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
S+ +++ + +GR A DV G L VE P V+L + ++C +C P+ C
Sbjct: 242 SSLVRIDQNPQEGRFARAAADVQVGQELLVEHPYVAVLLEKYAHTHCEYCFVRTVVPVAC 301
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------- 183
C +YCSE C++ + +YH++EC G L ++ G + H+AL++I
Sbjct: 302 PGCSDVIYCSEQCQQRSADKYHKYEC-GILPIIWRSGASINNHMALRIIASKPLDYFLQL 360
Query: 184 ------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V HLE H ++ + +QYV
Sbjct: 361 RPSLDEDLSLEQLLSLPKDDFRRVAHLERHQKERAASNFFQYV 403
>gi|410915300|ref|XP_003971125.1| PREDICTED: SET and MYND domain-containing protein 4-like [Takifugu
rubripes]
Length = 723
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 63/129 (48%), Gaps = 20/129 (15%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------CSMSN------------CNHCC 128
KGRH++A E V GD + ++P + V++P SN C+ C
Sbjct: 170 KGRHLVATEKVAAGDVILRDRPYSCVLIPGRKEVQGKEIKHSNDGDILFGMEHKLCHRCL 229
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
T P+PC+ C + YCS SC++EAW +HRWEC L+ ++ LAL++ L +
Sbjct: 230 TETLCPLPCDGCSYSRYCSASCQQEAWEEHHRWECPLGAELMVMGVLSQLALRVALKAGK 289
Query: 189 SDRYKEVYH 197
+ + H
Sbjct: 290 KNIQRGRVH 298
>gi|345486427|ref|XP_001607097.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 596
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/90 (37%), Positives = 54/90 (60%), Gaps = 4/90 (4%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA--FVILPPCSMSNCNHCCTSISAP 134
+S+ ++++ S GRH++A +D+ G+ +F+EKP + I P C HC +
Sbjct: 163 SSSLVEVRHSEKYGRHLVATQDIKPGEIIFIEKPYISCYNIKKP--YLYCCHCLSIAWTG 220
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECH 164
IPC+ C V+CSE C++EAW +YH ECH
Sbjct: 221 IPCDNCGWFVFCSEKCKKEAWTQYHDIECH 250
>gi|158287323|ref|XP_309376.4| AGAP011270-PA [Anopheles gambiae str. PEST]
gi|157019595|gb|EAA05063.4| AGAP011270-PA [Anopheles gambiae str. PEST]
Length = 341
Score = 73.9 bits (180), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 7/152 (4%)
Query: 29 EDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNA 88
E ++ EK +N++ A + +++SS AS V P L+ +N ++++ +
Sbjct: 76 EKLIQREKEVKNAL-AKARNKNASSSKASPDVVEEPELSYAAKENAPQVANCLELRKNEE 134
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
GRHV+ + GD + +E+P V+ C+ C IPC C +A+YCS
Sbjct: 135 YGRHVVTTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHEERPFTLIPCEGCTMAMYCS 194
Query: 148 ESCRREAWLRYHRWECHGALR----LLEAVGI 175
E C +A+ YHR+EC G LR + EAV +
Sbjct: 195 EECLSKAYNNYHRYEC-GLLRDLWEVFEAVPL 225
>gi|321476942|gb|EFX87901.1| SET and MYND domain-containing protein 4A [Daphnia pulex]
Length = 648
Score = 73.6 bits (179), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 36/102 (35%), Positives = 57/102 (55%)
Query: 62 SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
S P L + L AS +++ + +GR ++A ED+ G+T+ VEK A ++
Sbjct: 230 SAPELHRARNPLLPSASASVELINTPDRGRCLVATEDIKIGETVIVEKAHASILQYEFKE 289
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
S+C+HC P+PC++C +CS CR EAW YH++EC
Sbjct: 290 SHCHHCLHWTPGPVPCHKCSQVGFCSTQCRDEAWDSYHQFEC 331
>gi|332025618|gb|EGI65780.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 630
Score = 73.6 bits (179), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 74/154 (48%), Gaps = 16/154 (10%)
Query: 62 SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM 121
+ L G S + S+ + + S GRH IA + GD + +E P A VI
Sbjct: 219 NFSKLMNGPSREIPAVSDGVMIFFSERYGRHYIATREFKPGDIISIEDPYAHVIYEERYY 278
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV-------- 173
++C++C IPC+EC +A YCSE+CR+ AW H EC L+LL +
Sbjct: 279 THCHYCLARSYNLIPCSECPIAQYCSENCRKLAWEACHETEC-PILKLLTNLLNVDKDKI 337
Query: 174 -GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
I+ + LI+V+ + R KE+ QDM+
Sbjct: 338 RMISKIIRLLIVVTENGTRIKEL------QQDMK 365
>gi|357620108|gb|EHJ72415.1| hypothetical protein KGM_11009 [Danaus plexippus]
Length = 477
Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 39/104 (37%), Positives = 59/104 (56%)
Query: 60 VASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
V SL ++ T +CAS+K+++ S+ GRHV A ED++ G+ L +E P ++L
Sbjct: 19 VESLVTMKGPCDPTYVCASSKLEVVYSDEMGRHVCAKEDINVGEVLVIEDPYFTLLLKSQ 78
Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C++C S +PCN C A+YCSE C+ A YH EC
Sbjct: 79 YLLCCSYCLLSNLNLLPCNNCCFAMYCSEECKVRALKEYHSVEC 122
>gi|170042501|ref|XP_001848962.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167866038|gb|EDS29421.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 667
Score = 73.2 bits (178), Expect = 6e-11, Method: Composition-based stats.
Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 13/161 (8%)
Query: 63 LPSLACGESHTLMCASNKIKM---QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
LP ++ G + L AS+K+ + + KGR+++A +D+ + + VE V +
Sbjct: 243 LPEVS-GANADLPYASHKLALGGAREDTTKGRYIVAAQDLKPAEPIVVEPAVGACLYTKF 301
Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA--- 176
S+CN C + AP+ C EC +CS CR A YHR+ECH L LL G++
Sbjct: 302 FGSHCNACFAKLVAPVACPECAGVAFCSPECRDRACSGYHRFECH-YLDLLIGSGMSVLC 360
Query: 177 HLALKLILVSSHSDRYKEV-----YHLETHLQDMRPEDLYQ 212
HLAL+L+ + ++ E+ L H P D +Q
Sbjct: 361 HLALRLVTQAGTPEKAIELGAELKQTLCAHEDRREPSDYFQ 401
>gi|115495645|ref|NP_001070062.1| SET and MYND domain-containing protein 4 [Danio rerio]
gi|115313830|gb|AAI24342.1| SET and MYND domain containing 4 [Danio rerio]
Length = 753
Score = 73.2 bits (178), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 74/140 (52%), Gaps = 15/140 (10%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS--------------- 122
S+ + + S+ KGRH++ +E+ G+ + ++ V++P +
Sbjct: 185 SDGVSVYFSSDKGRHMLVMENKPAGEVVLEDEAYCSVLIPANIFNTGTNKAVETFGTEDR 244
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL 182
+C+HC + + +PC +C A YC ESC+++AW ++H+WEC LL + HLAL++
Sbjct: 245 HCHHCLSQSLSFVPCPKCSYARYCGESCQKDAWDQWHQWECPVGADLLAIGVLGHLALRV 304
Query: 183 ILVSSHSDRYKEVYHLETHL 202
+L + ++ + + + H+
Sbjct: 305 VLKAGQTEVQMGIKNTKDHV 324
>gi|195430720|ref|XP_002063397.1| GK21416 [Drosophila willistoni]
gi|194159482|gb|EDW74383.1| GK21416 [Drosophila willistoni]
Length = 661
Score = 72.8 bits (177), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/163 (27%), Positives = 75/163 (46%), Gaps = 27/163 (16%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
S+ +++ + +GR A DV G L VE+P V+L + ++C C P+ C
Sbjct: 238 SDLVRIDQNKQEGRFARAAADVKIGQELLVERPYVAVLLEKYAKTHCEACFMRTVVPVAC 297
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI----------- 183
C VYCSE C++ A +YH++EC G L + G + H+AL++
Sbjct: 298 PRCADVVYCSEKCQQLAANKYHKYEC-GILPSIWRSGASINNHIALRIFASKPWSYFKDL 356
Query: 184 ------------LVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+S D ++ V LE H + +P + +Q+V
Sbjct: 357 KPLIDVELPVEKLISLPKDDFRRVAQLERHQTERQPSNFFQHV 399
>gi|118781832|ref|XP_311885.3| AGAP002999-PA [Anopheles gambiae str. PEST]
gi|116129279|gb|EAA07926.3| AGAP002999-PA [Anopheles gambiae str. PEST]
Length = 664
Score = 72.8 bits (177), Expect = 1e-10, Method: Composition-based stats.
Identities = 48/162 (29%), Positives = 86/162 (53%), Gaps = 12/162 (7%)
Query: 62 SLPSLAC-GESHTLMCASNKIKMQTS--NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP 118
SLP LA GE+ ++ AS+KIK+ S + +GR+V+A D+ G+ + E A +
Sbjct: 237 SLPKLAGEGENGEILGASSKIKLAGSKEDPRGRYVVAAADLGPGEVILTEPAYAACLHAK 296
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA-- 176
++C+ C + + AP+ C +C +CS +CR +A YHR+EC L L+ G++
Sbjct: 297 YYGTHCSACFSRLIAPVACPDCCGVAFCSVACRDKACATYHRFECQ-YLDLMIGSGMSIL 355
Query: 177 -HLALKLILVSSHSDRYKEVYHLE-----THLQDMRPEDLYQ 212
H+AL+++ + ++ E + H + PED ++
Sbjct: 356 CHVALRMVTQAGTPEKVLEEGKMLRDTFCAHTEHRDPEDHFK 397
>gi|328722737|ref|XP_001943013.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 661
Score = 72.4 bits (176), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 63/123 (51%), Gaps = 4/123 (3%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P + G S SN + ++ + +GRH +A DV G L E+ A V+L S S+
Sbjct: 227 PEVHGGRSEVYPAFSNAVSVEYNETEGRHAVAGSDVPVGKVLLTERAYASVLLREYSHSH 286
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLAL 180
C +C + AP+PC +C +C E C+R A YH EC L L + G++ +AL
Sbjct: 287 CTNCFVRLVAPLPCPDCERVAFCGEFCKRSALESYHAVEC-SMLPALFSAGVSITCLIAL 345
Query: 181 KLI 183
++I
Sbjct: 346 RII 348
>gi|357603437|gb|EHJ63766.1| putative set and mynd domain-containing protein [Danaus plexippus]
Length = 734
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 66/140 (47%), Gaps = 9/140 (6%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
++ S S S P L G + L S +K+ N KGR+ +A V GD + VE P
Sbjct: 221 STQSESPSKLKEPQLTSGATLNLPALSKLLKITEDNEKGRYAVANAPVKTGDIVLVESPY 280
Query: 112 AFVILPPCSMSNCNHCCTSI-----SAPIPCNECILAVYCSESCRREAWLRYHRWECHGA 166
A +L C S+C HC + SAPI C C +CS CR A YH +EC
Sbjct: 281 AACLLADCHGSHCLHCFVRLEDFEDSAPIWCPNCSGVAFCSIQCRDAAISTYHLYEC-PF 339
Query: 167 LRLLEAVG---IAHLALKLI 183
L G ++H+AL+++
Sbjct: 340 FNLFIGSGMSVLSHIALRMV 359
>gi|307207016|gb|EFN84839.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 638
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 75/150 (50%), Gaps = 16/150 (10%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
L G S + S+ + + S GRH IA D GD + +E P A VI ++C+
Sbjct: 231 LTYGSSKEVPATSDGVTVSFSKRYGRHYIATRDFKPGDIVSIEDPYAHVIYEERYYTHCH 290
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV---------GIA 176
+C + IPC++C +A YCSE CR+ AW H EC L+LL ++ I+
Sbjct: 291 YCLSRSYNLIPCSKCPIAQYCSEKCRKLAWEVCHMTEC-PILKLLTSLLNVDKDKIRMIS 349
Query: 177 HLALKLILVSSHSDRYKEVYHLETHLQDMR 206
+ LI+V+++ + +E+ QDM+
Sbjct: 350 KIIRLLIIVTANGSKIEEL------QQDMK 373
>gi|194858456|ref|XP_001969182.1| GG25277 [Drosophila erecta]
gi|190661049|gb|EDV58241.1| GG25277 [Drosophila erecta]
Length = 750
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 69/119 (57%), Gaps = 14/119 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
+GR+++A E++ KG+ +F E+ FV P + C C S+ SAPIPC +C VYC
Sbjct: 221 RGRYMVAKENISKGNVIFSERASCFV--PLQQLLICQQCAASLMSAPIPCPKCHQRVVYC 278
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
S C REA H++EC + LL+ +G++HLAL+L+L + H+ HLQ+
Sbjct: 279 SRKC-REAHSAIHKYECAAYRKDLLKLLGVSHLALRLVLAY--------IPHIMPHLQE 328
>gi|270011320|gb|EFA07768.1| hypothetical protein TcasGA2_TC005322 [Tribolium castaneum]
Length = 1086
Score = 71.6 bits (174), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 70/125 (56%), Gaps = 5/125 (4%)
Query: 71 SHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS 130
++++ CA+N ++++ + V+A +D+ G + VE P ++ + +C+ C
Sbjct: 195 NNSIPCATNSVEIKVDEKLRKRVVATKDIQIGQVIAVETPCVAALIN-VVLFHCHDCYIL 253
Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI-AHL--ALKL-ILVS 186
PIPC C VYCSE+CR A+ +YH+ EC L + + VGI H ALK+ +LV
Sbjct: 254 CYNPIPCKTCTEVVYCSEACRENAFAKYHQKECPIYLSMRKLVGIDTHFQWALKMTLLVQ 313
Query: 187 SHSDR 191
+ +D+
Sbjct: 314 TQADK 318
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 63/129 (48%), Gaps = 4/129 (3%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS+ I+++ R V A + G+ + VEKP F L + +C+ C PIP
Sbjct: 722 ASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFT-LAAADLYHCHECYQLCYNPIP 780
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSDRYK 193
C C +YC E CR +A +YH++EC + L VG LA+K+ + S +
Sbjct: 781 CEICSQTLYCGEECRDKAREKYHQYECPILISLKNIVGKHKAFLLAIKMSFMISDENDVP 840
Query: 194 EVYHLETHL 202
EVY L +L
Sbjct: 841 EVYALVENL 849
>gi|340723180|ref|XP_003399973.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 718
Score = 71.6 bits (174), Expect = 2e-10, Method: Composition-based stats.
Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 1/120 (0%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
E+ L AS + ++ + G+ IA + + GD L VE P A +LP ++C HC +
Sbjct: 232 ENPRLPGASTLLDIEETENAGKRAIAAKPIEPGDRLVVEAPYAATLLPEFFGTHCQHCFS 291
Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
API C +C +C CR A YH++EC L LL G++ L++ + +++ S
Sbjct: 292 RFVAPIGCPDCSSVAFCGRKCRDAAMASYHKYECK-ILALLIGSGMSVLSMLALRMATQS 350
>gi|158287868|ref|XP_309762.4| AGAP010931-PA [Anopheles gambiae str. PEST]
gi|157019395|gb|EAA05512.4| AGAP010931-PA [Anopheles gambiae str. PEST]
Length = 383
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 4/113 (3%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPI 135
A +K + Q ++ GRHV+ + GD + +EKP V C+ C + I
Sbjct: 100 ALSKARNQNASKYGRHVVTKRKLKVGDVVMIEKPFVTVAKETFQYIRCDFCQAKRLFTLI 159
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAH-LALKLILVS 186
PC C +A+YCSE C +A+ +YHR+EC G LR L +GI+ +AL++I ++
Sbjct: 160 PCEGCTVAMYCSEECISKAYGKYHRYEC-GVLRDLWTVLGISGVIALRMIAIA 211
>gi|157136831|ref|XP_001656929.1| hypothetical protein AaeL_AAEL003516 [Aedes aegypti]
gi|108880958|gb|EAT45183.1| AAEL003516-PA [Aedes aegypti]
Length = 649
Score = 71.2 bits (173), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 26/152 (17%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S +GR D+ + +EKP V+L S+S+C+ C ++ P+ C +C ++
Sbjct: 235 SQDEGRFAKTNMDLKPNTIILLEKPHVSVLLEEYSLSHCSTCFKRVTIPVCCPKCSDVIF 294
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSS--------------- 187
CSE+C ++A YHR+EC G L + G + H+AL++I +
Sbjct: 295 CSEACEKKANSSYHRYEC-GFLPIFWKSGASITCHMALRMITQKTEEYFVGLRHELDGLT 353
Query: 188 -------HSDRYKEVYHLETHLQDMRPEDLYQ 212
++D Y++VY L TH PED +Q
Sbjct: 354 SEVTDKLNNDDYRKVYKLVTHEDKRSPEDYFQ 385
>gi|158284652|ref|XP_307655.3| Anopheles gambiae str. PEST AGAP012638-PA [Anopheles gambiae str.
PEST]
gi|157020944|gb|EAA03452.3| AGAP012638-PA [Anopheles gambiae str. PEST]
Length = 452
Score = 70.9 bits (172), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 79/161 (49%), Gaps = 7/161 (4%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
EK +N++ A + +++SS S V P L+ +N ++++ + GRHV+
Sbjct: 95 EKEVKNAL-AKARNKNASSSKVSPDVVEEPELSYPVKKNAPQVANCLELRKNEEYGRHVV 153
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRRE 153
+ GD + +E+P V+ C+ C IPC C +A+YCSE C +
Sbjct: 154 TTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHGERPFTLIPCEGCTMAMYCSEECLSK 213
Query: 154 AWLRYHRWECHGALR----LLEAVGIAHLALKLILVSSHSD 190
A+ +YHR+EC G LR + E V + + + I +++ +
Sbjct: 214 AYNKYHRYEC-GLLRDMWEVFEEVSLIDIRMIAIAITTFDN 253
>gi|194752637|ref|XP_001958627.1| GF12496 [Drosophila ananassae]
gi|190619925|gb|EDV35449.1| GF12496 [Drosophila ananassae]
Length = 751
Score = 70.5 bits (171), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/100 (40%), Positives = 61/100 (61%), Gaps = 6/100 (6%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
+GR+++A E + +G +F E+ FV L + C C +S+ SAPIPC +C VYC
Sbjct: 224 RGRYMVATEAIPQGTAIFSERASCFVPLEQRLI--CQQCASSLMSAPIPCPQCHQKVVYC 281
Query: 147 SESCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV 185
S C RE HR+EC G + L + +G++HLAL+++LV
Sbjct: 282 SRKC-REGHANIHRYECAGYRMDLFKLLGVSHLALRMVLV 320
>gi|307171148|gb|EFN63141.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 641
Score = 70.1 bits (170), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 54/105 (51%), Gaps = 1/105 (0%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
L G S + S+ + + S GRH IA + GD + +E P A VI ++C+
Sbjct: 234 LTYGPSQEVPAISDGVTISFSERYGRHYIATREFQPGDIISIEDPYAHVIYEERYYTHCH 293
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
+C + IPC +C +A YCSE CR+ W H+ EC L+LL
Sbjct: 294 YCLSKSYNLIPCAKCPIAQYCSEKCRKLGWEACHKTEC-PILKLL 337
>gi|449265983|gb|EMC77110.1| SET and MYND domain-containing protein 4, partial [Columba livia]
Length = 706
Score = 69.7 bits (169), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 71/140 (50%), Gaps = 20/140 (14%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-------- 121
E+ ++ AS+ +++ +GRH++A ED+ G +L E+ V+ P S
Sbjct: 132 ENDSISGASSSLRLNFDMDRGRHLVASEDILPGQSLLEEEAFVSVLCPGDSFLLQDSAET 191
Query: 122 ------SN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
+N C+HC + A +PC C A YCS+ C AW +YHR EC GAL L
Sbjct: 192 VWDTQVTNADLYCHHCLRQLLASVPCQGCSYAKYCSQICADVAWQQYHRTECSLGALLLT 251
Query: 171 EAVGIAHLALKLILVSSHSD 190
V H+AL+ +L++ ++
Sbjct: 252 LGV-FCHVALRTVLLAGFAE 270
>gi|345482388|ref|XP_001608083.2| PREDICTED: hypothetical protein LOC100124189 [Nasonia vitripennis]
Length = 493
Score = 69.7 bits (169), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 47/88 (53%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
CAS + ++ + GRH++A + G+ + +EK I C +C A I
Sbjct: 220 CASTAVSIRYNKKWGRHIVADRRIEPGEVIAIEKSYLTSICLDGMYLFCANCVQQTWASI 279
Query: 136 PCNECILAVYCSESCRREAWLRYHRWEC 163
PC CI VYCS+ C+ EAW +YH++EC
Sbjct: 280 PCESCIYNVYCSQKCKSEAWKKYHQYEC 307
>gi|344290601|ref|XP_003417026.1| PREDICTED: SET and MYND domain-containing protein 4-like [Loxodonta
africana]
Length = 800
Score = 69.3 bits (168), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 69/149 (46%), Gaps = 22/149 (14%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSNCNH-- 126
ES + AS+ + + T KGRH+IA +D+ G+ L K AFV +L P M +H
Sbjct: 224 ESEQIPSASSSVSLCTDPLKGRHLIATKDILPGELLV--KEAAFVSVLNPGEMPQLHHGL 281
Query: 127 -----------------CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C A +PC++C A YCS+ C ++AW YH EC L
Sbjct: 282 ESKWNTRVTNGDLHCHCCLKHTLATVPCDQCSYAKYCSQECMQQAWELYHTVECPQGGLL 341
Query: 170 LEAVGIAHLALKLILVSSHSDRYKEVYHL 198
L H+AL++ L++ D K V L
Sbjct: 342 LTLGIFCHIALRVTLLARFEDVGKVVRKL 370
>gi|118403760|ref|NP_001072288.1| SET and MYND domain containing 4 [Xenopus (Silurana) tropicalis]
gi|111305557|gb|AAI21315.1| hypothetical protein MGC145614 [Xenopus (Silurana) tropicalis]
Length = 739
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 67/145 (46%), Gaps = 15/145 (10%)
Query: 58 SSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP 117
+S L +A + + AS + + KGRH++A +++ +G+ L E+ A VI+P
Sbjct: 180 TSTERLKEMALDRNPHITNASCALHLNFDTRKGRHLLASQNIEQGEVLIWEEAFASVIIP 239
Query: 118 P-----------CSMSNCNH----CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWE 162
++ C+H C + A +PC C A YCS+ C +AW YH E
Sbjct: 240 ERKQWRKEIKWDTRITACDHYCHYCLNRVIASLPCQYCSFARYCSQECMDKAWRSYHYIE 299
Query: 163 CHGALRLLEAVGIAHLALKLILVSS 187
C LL H AL+ +LV+
Sbjct: 300 CSMGDLLLALGMFCHTALRAVLVAG 324
>gi|340728154|ref|XP_003402393.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Bombus terrestris]
Length = 350
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 61/104 (58%), Gaps = 2/104 (1%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSM 121
+PSL+ E+ + S+ +++ + G + A + + GD + +EKP +LPP ++
Sbjct: 82 IPSLSHDENKDIPDMSDAVRLVHNKKYGVNFEATKPIGTGDVIPIEKPQVTSVLPPDVAI 141
Query: 122 SN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
+N C++C SA +PC C A+YCS+ CR +A+ YHR+ C+
Sbjct: 142 ANVCHYCLKRYSALLPCERCNSALYCSKQCRAQAFEEYHRFHCN 185
>gi|24652121|ref|NP_724802.1| CG1868, isoform B [Drosophila melanogaster]
gi|21645544|gb|AAM71072.1| CG1868, isoform B [Drosophila melanogaster]
Length = 751
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
+GR+++A E + KG+ +F E+ FV P + C C T +SAPIPC C VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 282
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
S C REA H++EC + +L +GI+HLAL+L+L + ++ HLQ+M
Sbjct: 283 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 333
>gi|54650928|gb|AAV37042.1| AT13626p [Drosophila melanogaster]
Length = 751
Score = 68.9 bits (167), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
+GR+++A E + KG+ +F E+ FV P + C C T +SAPIPC C VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 282
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
S C REA H++EC + +L +GI+HLAL+L+L + ++ HLQ+M
Sbjct: 283 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 333
>gi|19921918|ref|NP_610500.1| CG1868, isoform A [Drosophila melanogaster]
gi|7303878|gb|AAF58923.1| CG1868, isoform A [Drosophila melanogaster]
gi|15291779|gb|AAK93158.1| LD26240p [Drosophila melanogaster]
gi|220945802|gb|ACL85444.1| CG1868-PA [synthetic construct]
gi|220955560|gb|ACL90323.1| CG1868-PA [synthetic construct]
Length = 718
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 67/120 (55%), Gaps = 14/120 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNEC-ILAVYC 146
+GR+++A E + KG+ +F E+ FV P + C C T +SAPIPC C VYC
Sbjct: 192 RGRYMVAKEAISKGNVIFSERASCFV--PLEQLLICQQCAATLMSAPIPCPNCHQRVVYC 249
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
S C REA H++EC + +L +GI+HLAL+L+L + ++ HLQ+M
Sbjct: 250 SRKC-REAHSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIRPHLQEM 300
>gi|348540237|ref|XP_003457594.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Oreochromis niloticus]
Length = 725
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/170 (24%), Positives = 75/170 (44%), Gaps = 30/170 (17%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
EK E+ + S + ++ S SV L C ++ + ++ KGRH++
Sbjct: 117 EKGREDDQKPASNDCKGSDTANSPSVGPLTVGIC----------PQVVVDSTVEKGRHLV 166
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSM--------------------SNCNHCCTSISAP 134
A + G+ + ++P + V++P C+ C T +
Sbjct: 167 AANRIAAGEVILTDRPYSCVLIPGMEEVKGKVGRYSRDSEVLFGVEDRRCHRCLTETLSL 226
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
+PC+ C + YCS +C++EAW +HRWEC L+ ++ LAL++ L
Sbjct: 227 VPCDGCSYSRYCSTACQQEAWEEHHRWECALGAHLMAMGVMSQLALRVAL 276
>gi|158287327|ref|XP_309378.2| AGAP011267-PA [Anopheles gambiae str. PEST]
gi|157019597|gb|EAA05164.2| AGAP011267-PA [Anopheles gambiae str. PEST]
Length = 540
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 59/120 (49%), Gaps = 2/120 (1%)
Query: 50 SSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEK 109
+++SS S V P L+ +N ++++ + GRHV+ + GD + +E+
Sbjct: 109 NASSSKISPDVVEEPELSYPSKENAPQVANCLELRKNEEYGRHVVTTRKLKVGDVVMIER 168
Query: 110 PVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
P V+ C+ C IPC C A+YCSE C +A+ +YHR+EC G LR
Sbjct: 169 PFVTVLRDSLRYVRCDFCHEERPFTLIPCEGCTAAMYCSEECLSKAYNKYHRYEC-GLLR 227
>gi|345486792|ref|XP_003425557.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 719
Score = 68.2 bits (165), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 54/101 (53%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC 128
G T C S+ I + S G+ ++A D+ G+ +FV++ + + + C HC
Sbjct: 251 GTMVTETCTSSGINVSYSPDSGQQLVASHDLQPGEIIFVQQSYVTSVNTNKACAYCCHCM 310
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
T IPC+ C L +YCS+ C+ EAW +YH EC+ A+ +
Sbjct: 311 TPTWCTIPCDHCSLNMYCSKQCKDEAWNKYHDIECNMAMHM 351
>gi|340720355|ref|XP_003398606.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 485
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 63/114 (55%), Gaps = 2/114 (1%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
T + + +PSL+ E+ + S+ +++ + G + A + + GD + +EKP
Sbjct: 58 TKQNNRNHFEDIPSLSHDENKDIPGMSDAVRLVHNKKYGVNFEATKPIGTGDVILIEKPQ 117
Query: 112 AFVILPP-CSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+LPP +++N C++C SA +PC C A+YCS+ CR +A+ +HR +C
Sbjct: 118 VTSVLPPDVAIANVCHYCLKRYSALLPCERCNSALYCSKECRAKAYEEHHRIQC 171
>gi|350405733|ref|XP_003487532.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 676
Score = 67.8 bits (164), Expect = 3e-09, Method: Composition-based stats.
Identities = 47/153 (30%), Positives = 66/153 (43%), Gaps = 31/153 (20%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH +A +D+ G+TL +EKP V+L +SNC+HC I PIP C+ C YCS
Sbjct: 251 GRHAVATKDIEPGETLVIEKPHCAVLLAEYRLSNCHHCFAKIFVPIPTSCDTCNFVAYCS 310
Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVS------------------- 186
CR + H+ EC +L E LALK I+
Sbjct: 311 IPCRNKD-AEIHKNECMILPSLWFSETSVNCFLALKAIVQKPFEELLALKDKLKATKGRF 369
Query: 187 -------SHSDRYKEVYHLETHLQDMRPEDLYQ 212
D ++ +Y L TH ++ EDL+
Sbjct: 370 ETSTQRPRRHDDFEAIYGLITHEEERTSEDLFH 402
>gi|195332853|ref|XP_002033107.1| GM20595 [Drosophila sechellia]
gi|194125077|gb|EDW47120.1| GM20595 [Drosophila sechellia]
Length = 751
Score = 67.8 bits (164), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 67/120 (55%), Gaps = 14/120 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
+GR+++A E + KG+ +F E+ FV L + C HC S+ SAPIPC C VYC
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFVPLEQQLI--CQHCAASLMSAPIPCPNCHQRVVYC 282
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
S CR + H++EC + +L +GI+HLAL+L+L + ++ HLQ+M
Sbjct: 283 SRKCRDPQSV-IHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIMPHLQEM 333
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 69/152 (45%), Gaps = 25/152 (16%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
++ +GR + AV ++ G+ + +EKP A V+L ++C+HC IPC C A
Sbjct: 184 STENEGRFITAVRNIAPGEIVLIEKPFASVLLRANYSNHCHHCLKHTLEGIPCRTCPDAR 243
Query: 145 YCSESCRREAWLRYHRWEC-------------HGALRLLEAVGIAHLALK------LILV 185
+CSE+CR A YH++EC G L ++ +LK L L
Sbjct: 244 FCSEACRDTAMQTYHQYECSVLNTLHHSQINKFGCLAFRAITKQSYQSLKDIRAQDLPLN 303
Query: 186 SSHSD------RYKEVYHLETHLQDMRPEDLY 211
HSD Y + L TH +D +DL+
Sbjct: 304 GCHSDGLYRPQDYNTIIQLVTHAKDRPVQDLF 335
>gi|395536340|ref|XP_003770178.1| PREDICTED: SET and MYND domain-containing protein 4 [Sarcophilus
harrisii]
Length = 802
Score = 67.4 bits (163), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 62/127 (48%), Gaps = 22/127 (17%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN---------------- 123
+ +++ ++GR++IA EDV G+ L E+ AFV +L P +
Sbjct: 235 VSLRSDPSRGRYLIATEDVFPGELLVKEE--AFVSVLNPGETAGLRRGLEAQRDGRATPG 292
Query: 124 ---CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
C+ C I A +PC C A YCS+ C + AW RYHR EC LL H+AL
Sbjct: 293 DCHCHRCLKPIVATVPCEGCSYAKYCSQQCLQVAWERYHRVECPLGGTLLTLGVFCHVAL 352
Query: 181 KLILVSS 187
+ +L++
Sbjct: 353 RTVLLAG 359
>gi|328777896|ref|XP_001120776.2| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 633
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 49/98 (50%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCN 125
L G S S+ I + S GRH++ ++ GD + +E P A+VI ++C+
Sbjct: 227 LIYGPSKEAPAISDGISISFSEKYGRHLVVTKEFKPGDIITIEDPYAYVIYTQRYYTHCH 286
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
HC + IPC C +A YCSE CR AW H EC
Sbjct: 287 HCLSRSYNLIPCLHCPVAQYCSEKCRILAWEMAHDIEC 324
>gi|402898218|ref|XP_003912123.1| PREDICTED: SET and MYND domain-containing protein 4 [Papio anubis]
Length = 802
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 69/148 (46%), Gaps = 18/148 (12%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
GE+ L AS+ + + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 223 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 282
Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 283 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 342
Query: 171 EAVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L LV D K V L
Sbjct: 343 TLGVFCHIALRLTLVVGVEDVRKIVKKL 370
>gi|328710001|ref|XP_003244133.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 1
[Acyrthosiphon pisum]
gi|328710003|ref|XP_003244134.1| PREDICTED: SET and MYND domain-containing protein 4-like isoform 2
[Acyrthosiphon pisum]
Length = 645
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 80/169 (47%), Gaps = 7/169 (4%)
Query: 23 SHLDFKEDVLTIEKSE--ENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNK 80
SHL E +++ EK + E ++ + + ++ ++ L G + + S
Sbjct: 181 SHLS--EAIISKEKKKGIEMQIKNFLRETNKKETNNKQDKLNVIKLFGGPNKNIPALSKF 238
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+KM+ S + GR ++ D++ G+ L +EKP V+ NC +C IPC +C
Sbjct: 239 VKMKYSESMGRCLVVSSDINPGEVLAIEKPYVGVLRRESYEYNCRNCFKRCLNGIPCLKC 298
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVS 186
L +YC E+CR +++ H++EC G + HL+L + L S
Sbjct: 299 TLVIYCDETCRIKSYESGHKYECSLFSTFNNWPGMDHMEHLSLNIFLKS 347
>gi|158287331|ref|XP_309383.4| AGAP011257-PA [Anopheles gambiae str. PEST]
gi|157019599|gb|EAA05326.4| AGAP011257-PA [Anopheles gambiae str. PEST]
Length = 540
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 83/182 (45%), Gaps = 8/182 (4%)
Query: 14 LKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHT 73
LK RL S ++ + L + E + A + +++SS S V P L+
Sbjct: 75 LKNIRLARES--NYSGEKLNQREKEVKNALAKARNKNASSSKISPDVVEEPELSYPAKEN 132
Query: 74 LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSIS 132
+N ++++ + GRHV+ + GD + +E+P V+ C+ C
Sbjct: 133 APQVANCLELRKNEEYGRHVVTTRKLKVGDVVMIERPFVTVLKDSFRYVRCDFCHGERPF 192
Query: 133 APIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL----EAVGIAHLALKLILVSSH 188
IPC C A+YCSE C +A+ YHR++C G LR L E V + + + I +++
Sbjct: 193 TLIPCEGCTAAMYCSEECLSKAYNNYHRYDC-GILRDLYEDFEEVSLIDIRMIAIAITTF 251
Query: 189 SD 190
+
Sbjct: 252 DN 253
>gi|329664548|ref|NP_001192662.1| SET and MYND domain-containing protein 4 [Bos taurus]
Length = 802
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 20/143 (13%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
E+ + AS+ + + KGR+++A +D+ G+ L E V+ P P + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
C+ C AP+PC+ C A YCS+ C ++AW RYH EC GAL L
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343
Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
V H+AL+ L++ D K
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGK 365
>gi|296476893|tpg|DAA19008.1| TPA: CG8378-like [Bos taurus]
Length = 802
Score = 67.4 bits (163), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 68/143 (47%), Gaps = 20/143 (13%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
E+ + AS+ + + KGR+++A +D+ G+ L E V+ P P + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
C+ C AP+PC+ C A YCS+ C ++AW RYH EC GAL L
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343
Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
V H+AL+ L++ D K
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGK 365
>gi|426237300|ref|XP_004012599.1| PREDICTED: SET and MYND domain-containing protein 4 [Ovis aries]
Length = 802
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
E+ + AS+ + + KGR ++A +D+ G+ L E V+ P P S+ +
Sbjct: 224 ENEQIPGASSSVTLHIDPLKGRFLVATKDILPGELLVKEDAFVSVLNPGEVPLRPHSLES 283
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
C+ C AP+PC C A YCS+ C ++AW RYH EC GAL L
Sbjct: 284 KWEIQATSGDLHCHRCLRHTLAPVPCEGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343
Query: 171 EAVGIAHLALKLILVSSHSDRYKEVYHL 198
V H+AL+ L++ D K + L
Sbjct: 344 LGV-FCHVALRSTLLARFEDVGKVIKKL 370
>gi|195475072|ref|XP_002089809.1| GE22225 [Drosophila yakuba]
gi|194175910|gb|EDW89521.1| GE22225 [Drosophila yakuba]
Length = 753
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 62/105 (59%), Gaps = 6/105 (5%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC- 140
+ S +GR+++A + + KG +F E+ FV L + C C S+ SAPIPC +C
Sbjct: 219 LTDSGQRGRYMVAKDTISKGKVIFSERASCFVPLEQHLI--CQQCAASLMSAPIPCPKCH 276
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
VYCS C REA H++EC + LL+ +G++HLAL+L+L
Sbjct: 277 QRVVYCSRKC-REAHSAIHKFECAAYRKNLLKLLGVSHLALRLVL 320
>gi|345496800|ref|XP_001601864.2| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 607
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 51/95 (53%), Gaps = 4/95 (4%)
Query: 71 SHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCC 128
S + CA+ + + + + GRH+IA D+ G+ + E+ + + P C+HC
Sbjct: 178 SQKIPCAAESVTIAYNESIGRHLIATRDIKPGEVIIAEE--GYAVFPKIKKMYLFCSHCL 235
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
T IPC+ C LA+YCSE C+++A YH EC
Sbjct: 236 TFAWNGIPCDNCALALYCSEECKKKALEEYHDVEC 270
>gi|189233943|ref|XP_974086.2| PREDICTED: similar to AGAP008839-PA [Tribolium castaneum]
Length = 666
Score = 67.0 bits (162), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 49/97 (50%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
GR A+ED G + E P VI ++ NC CC S P+ C C AV+CS +
Sbjct: 236 GRFARALEDFDTGVIIVEETPHCAVISQENALMNCQFCCISTQQPVACRNCGHAVFCSLN 295
Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
C R+A L +H++EC L A + A+ L ++S
Sbjct: 296 CERQANLTFHKYECKAQPVLFHAGASINCAMALRMIS 332
>gi|72169832|ref|XP_781331.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 704
Score = 67.0 bits (162), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 81/168 (48%), Gaps = 12/168 (7%)
Query: 30 DVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMC-ASNKIKMQTSNA 88
D +T E + ++ + + S+ + + + +P E +++ AS + +
Sbjct: 125 DSITGEDKKRQEIKTLLQESSNPTYVSFADHFDIPDEEIKEKSSMLANASAAVTCKHQEG 184
Query: 89 KGRHVIAVEDVHKGDTLFV--------EKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
KGR + A D+ GD L EKP A +IL S+C+ C S PIPC+ C
Sbjct: 185 KGRFLEATRDIAAGDRLLKAVSELLIKEKPYAAIILKEEESSHCHQCFEQCS-PIPCSNC 243
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI-AHLALKLILVSS 187
I A YCS CR + +YH EC G LL+ V + + L+L++++ +
Sbjct: 244 IHARYCSSRCRSDCLSQYHSIEC-GTEGLLQQVSVFSRLSLRILITAG 290
>gi|170048026|ref|XP_001851501.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870252|gb|EDS33635.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 521
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 42/76 (55%), Gaps = 1/76 (1%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSE 148
GR + A D G+ + E P VI P + + CNHC IPC C A+YCSE
Sbjct: 172 GRGLTAKMDFLVGEIVLHENPSLVVIEPEVTFTRCNHCGRRNEYDLIPCKTCSSAMYCSE 231
Query: 149 SCRREAWLRYHRWECH 164
CR+EA+ +YHR+EC
Sbjct: 232 QCRKEAFTKYHRFECE 247
>gi|345486839|ref|XP_003425566.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 48/83 (57%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+K++ GRH++A D+ G+ ++VE+P + + C+HC T+ + +PC+ C
Sbjct: 202 VKIEHDEKWGRHLVATRDIKPGEVIYVEEPYTKCLTIKHLRAYCSHCLTTTWSNVPCDHC 261
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
++CSE+C+ AW YH EC
Sbjct: 262 SWTMFCSEACKDLAWKNYHSSEC 284
>gi|444516396|gb|ELV11145.1| SET and MYND domain-containing protein 4 [Tupaia chinensis]
Length = 536
Score = 66.6 bits (161), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 23/175 (13%)
Query: 39 ENSVEAMSMQVSSTSSSASSSVASLPSLACG-----ESHTLMCASNKIKMQTSNAKGRHV 93
+ ++ + M+V +S A+L + G E+ + AS + ++T KGRH+
Sbjct: 103 QENLHRLKMKVQEKKILTKTSPAALAKASNGHDLREENEQISSASPSVGLRTHPLKGRHL 162
Query: 94 IAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN------------CNHCCTSISAPI 135
IA +D+ G+ L E V+ +PP S+ N C+ C A +
Sbjct: 163 IATKDILPGELLVKEDAFVSVLNPGDMPPLRHSLENKWDTRVTNGDLYCHRCLKHTLATV 222
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
PC+ C A YCS +C ++AW YH EC LL H+AL+ L++ D
Sbjct: 223 PCDGCSYAKYCSHACTQQAWELYHSTECSLGGLLLALGVFCHVALRATLIARFED 277
>gi|91089701|ref|XP_974846.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 561
Score = 66.6 bits (161), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 63/129 (48%), Gaps = 4/129 (3%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS+ I+++ R V A + G+ + VEKP F L + +C+ C PIP
Sbjct: 197 ASSCIEIKPDEKSRRRVFAARKIEIGEIIAVEKPFVFT-LAAADLYHCHECYQLCYNPIP 255
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSDRYK 193
C C +YC E CR +A +YH++EC + L VG LA+K+ + S +
Sbjct: 256 CEICSQTLYCGEECRDKAREKYHQYECPILISLKNIVGKHKAFLLAIKMSFMISDENDVP 315
Query: 194 EVYHLETHL 202
EVY L +L
Sbjct: 316 EVYALVENL 324
>gi|71897069|ref|NP_001025886.1| SET and MYND domain-containing protein 4 [Gallus gallus]
gi|82125404|sp|Q5F3V0.1|SMYD4_CHICK RecName: Full=SET and MYND domain-containing protein 4
gi|60098707|emb|CAH65184.1| hypothetical protein RCJMB04_6f24 [Gallus gallus]
Length = 742
Score = 66.2 bits (160), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 62/126 (49%), Gaps = 22/126 (17%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------------CSMSN----CN 125
S +GRH++A +D+ G L EK V+ P ++N C+
Sbjct: 237 STERGRHLVASQDILPGQNLLKEKAFVSVLCPGEGDSLLLQDSSETVWDTRVTNADLYCH 296
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALKLIL 184
HC + A IPC C A YCS++C AW +YHR EC GAL L V H+AL+ +L
Sbjct: 297 HCLKQLLASIPCCGCSYAKYCSQNCADVAWEQYHRTECPLGALLLTLGV-FFHVALRTVL 355
Query: 185 VSSHSD 190
++ S+
Sbjct: 356 LAGFSE 361
>gi|355568065|gb|EHH24346.1| hypothetical protein EGK_07991 [Macaca mulatta]
Length = 806
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
GE+ L AS+ + + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 227 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 286
Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 287 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 346
Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
H+AL+L LV D K
Sbjct: 347 TLGVFCHIALRLTLVVGVEDVRK 369
>gi|109112713|ref|XP_001117365.1| PREDICTED: SET and MYND domain-containing protein 4-like [Macaca
mulatta]
Length = 802
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
GE+ L AS+ + + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 223 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 282
Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 283 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 342
Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
H+AL+L LV D K
Sbjct: 343 TLGVFCHIALRLTLVVGVEDVRK 365
>gi|355753603|gb|EHH57568.1| hypothetical protein EGM_07234 [Macaca fascicularis]
Length = 806
Score = 66.2 bits (160), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 67/143 (46%), Gaps = 18/143 (12%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------ 118
GE+ L AS+ + + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 227 GENERLSNASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLD 286
Query: 119 ----CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 287 SQWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLL 346
Query: 171 EAVGIAHLALKLILVSSHSDRYK 193
H+AL+L LV D K
Sbjct: 347 TLGVFCHIALRLTLVVGVEDVRK 369
>gi|156548936|ref|XP_001606913.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 612
Score = 66.2 bits (160), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 70/156 (44%), Gaps = 35/156 (22%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+ +Q + GRH+IA + G+ + V KP + + C HC + A IPC+ C
Sbjct: 193 VSIQHNEKFGRHLIANRYIKPGEIIMVIKPYIKCLNLKNMHAFCGHCLKTSWATIPCDYC 252
Query: 141 ILAVYCSESCRREAWLRYHRWEC----------------HGALR-----LLEAVGIAHLA 179
++CSE C++EAW +YH EC ALR + EA GI +L
Sbjct: 253 NWCMFCSEDCKQEAWQQYHDIECPVIPYIMFDQLGDYWKQLALRSTVMAIREAGGIQNLK 312
Query: 180 LKLILVSS--------------HSDRYKEVYHLETH 201
+L LV + ++DR++ +Y L H
Sbjct: 313 EELDLVDNCNSKLHKGFLNGKFYNDRFQSIYSLSDH 348
>gi|170060287|ref|XP_001865735.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878799|gb|EDS42182.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 650
Score = 66.2 bits (160), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 26/152 (17%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S+ +GR D+ + VEKP V+L S ++C+ C +S P+ C +C V+
Sbjct: 236 SSEEGRFAKTNVDLKPNTIVLVEKPHVSVLLEEYSKTHCSTCFKRVSVPVCCPKCSDVVF 295
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI-----------------LV 185
CSE C A YH++EC G L + G + H+AL++I L
Sbjct: 296 CSEDCESSANSGYHKYEC-GFLPIFWKSGASITCHMALRIITQQSEEYFLQLRPELDGLT 354
Query: 186 SSHSDR-----YKEVYHLETHLQDMRPEDLYQ 212
S +D+ Y+++Y L TH + ED +Q
Sbjct: 355 SEQTDKLKHDDYRKIYKLVTHEETRSAEDFFQ 386
>gi|157136833|ref|XP_001656930.1| hypothetical protein AaeL_AAEL003527 [Aedes aegypti]
gi|108880959|gb|EAT45184.1| AAEL003527-PA [Aedes aegypti]
Length = 650
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 26/154 (16%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
+ ++GR + + L E P A V++ CS S+C+ CC+ + C C+ VY
Sbjct: 235 TESEGRFARTRNHLKPNNVLLKELPHASVVMSECSGSHCDQCCSRVEVLFSCPNCVDVVY 294
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVSSHS------------- 189
CSE C+++A +H +EC G L L G ++ LAL+++ S
Sbjct: 295 CSEECQKQAISGHHAFEC-GFLSFLRNSGANVVSMLALRIVSQKSEKYFYELQDELDNLQ 353
Query: 190 ---------DRYKEVYHLETHLQDMRPEDLYQYV 214
D Y++VY+ TH Q ED ++
Sbjct: 354 NDFVDSLFFDDYRKVYNFVTHGQQRNAEDYLKWT 387
>gi|390360826|ref|XP_001181205.2| PREDICTED: SET and MYND domain-containing protein 4-like
[Strongylocentrotus purpuratus]
Length = 679
Score = 65.9 bits (159), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 56/97 (57%), Gaps = 3/97 (3%)
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREA 154
A D+ G+ L EKP A +IL S+C+HC S PIPC CI A YCS CR ++
Sbjct: 178 ATRDIAAGELLIKEKPYAAIILKEEESSHCHHCFEQCS-PIPCPNCIHARYCSSRCRSDS 236
Query: 155 WLRYHRWECHGALRLLEAVGI-AHLALKLILVSSHSD 190
+YH EC G +LL+ V + + L+L++++ + +
Sbjct: 237 LTQYHSIEC-GTEQLLQQVSVFSRLSLRILITAGREE 272
>gi|195581916|ref|XP_002080775.1| GD10068 [Drosophila simulans]
gi|194192784|gb|EDX06360.1| GD10068 [Drosophila simulans]
Length = 751
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 67/120 (55%), Gaps = 14/120 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILAVYC 146
+GR+++A E + KG+ +F E+ FV L + C C S+ SAPIPC C V+C
Sbjct: 225 RGRYMVAKEAISKGNVIFSERASCFVPLEQQLI--CQQCAASLMSAPIPCPNCHQRVVFC 282
Query: 147 SESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
S C R+A H++EC + +L +GI+HLAL+L+L + ++ HLQ+M
Sbjct: 283 SRKC-RDAQSAIHKFECAAYRKDILRLLGISHLALRLLLTY--------IPYIMPHLQEM 333
>gi|126314231|ref|XP_001371610.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Monodelphis domestica]
Length = 845
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 63/129 (48%), Gaps = 18/129 (13%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP---------PCSMSNC--- 124
AS+ + + + ++GR+++A ED+ G+ L E+ V+ P P + NC
Sbjct: 229 ASSSVSLCSDPSRGRYLVATEDILPGELLVKEEAFVSVLNPGETSWLRSDPGAKWNCRVT 288
Query: 125 ------NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
+ C + A IPC C A YCS+ C ++AW YHR EC LL H+
Sbjct: 289 TGDLHCHRCLKPVWATIPCQGCSYARYCSQQCMQQAWECYHRTECSLGGTLLALGVFCHV 348
Query: 179 ALKLILVSS 187
AL+ +L++
Sbjct: 349 ALRTVLLAG 357
>gi|189240077|ref|XP_971167.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 637
Score = 65.5 bits (158), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 57/105 (54%), Gaps = 3/105 (2%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCCTSISAPI 135
AS+KI ++TS GR+V++ D+ G+ L E P A L P ++C HC + AP+
Sbjct: 228 ASSKITLKTSPDVGRYVVSNVDIATGEILVAE-PAAVACLNPEKFGTHCQHCFARLLAPV 286
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
C C +CS CR +A YH++EC LL G++ L+L
Sbjct: 287 GCPHCSSVAFCSPKCRDDAITTYHKYECK-FFDLLLGSGMSVLSL 330
>gi|426383436|ref|XP_004058287.1| PREDICTED: SET and MYND domain-containing protein 4 [Gorilla
gorilla gorilla]
Length = 804
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLPSASSSIGLCVDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|440912312|gb|ELR61896.1| SET and MYND domain-containing protein 4 [Bos grunniens mutus]
Length = 802
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 64/139 (46%), Gaps = 18/139 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSN 123
E+ + AS+ + + KGR+++A +D+ G+ L E V+ P P + +
Sbjct: 224 ENEQIPGASSSVSLHIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGEVPLRPRGLES 283
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C AP+PC+ C A YCS+ C ++AW RYH EC LL
Sbjct: 284 KWDIQVTNGDLHCHRCLRHTLAPVPCDGCSYAKYCSQECMQQAWDRYHSVECSLGALLLT 343
Query: 172 AVGIAHLALKLILVSSHSD 190
H+AL+ L++ D
Sbjct: 344 LGLFCHVALRSTLLARFED 362
>gi|410980233|ref|XP_003996482.1| PREDICTED: SET and MYND domain-containing protein 4 [Felis catus]
Length = 800
Score = 65.1 bits (157), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 69/147 (46%), Gaps = 17/147 (11%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP-CSMSN 123
GE+ + AS+ + ++T +GR+++A +D+ G+ L E V+ +PP C + +
Sbjct: 223 GENGQVPSASSSVSLRTDPLRGRYLVATKDILPGELLVKEDAFVSVLHPGEMPPLCHLES 282
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C A +PC+ C A YCS C ++AW YH EC LL
Sbjct: 283 KWDTRVTNGDLYCHRCLDHTLATVPCDGCSYAKYCSHDCMQQAWDLYHSVECSLGGLLLT 342
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+ LV+ D K + L
Sbjct: 343 LGVFCHIALRSTLVARFEDASKVIRKL 369
>gi|395853348|ref|XP_003799177.1| PREDICTED: SET and MYND domain-containing protein 4 [Otolemur
garnettii]
Length = 749
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 72/158 (45%), Gaps = 24/158 (15%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
E+ + AS+ + + T KGR++IA D+ G+ L E AFV +L P M
Sbjct: 226 ENEQISSASSSVTLCTDPLKGRYLIATRDILPGELLVKED--AFVSVLNPGEMQPLHHGL 283
Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C+ C A +PC+ C YCS+ C ++AW YH EC L
Sbjct: 284 ESKWRTRVTNGDLYCHRCLRHTLATVPCDGCSYVKYCSQECLQQAWELYHSIECSLGGLL 343
Query: 170 LEAVGIAHLALKLILVSSHSDRYKEVYHL--ETHLQDM 205
L H+AL+L L++ D K + L ET L+D+
Sbjct: 344 LTLGVFCHIALRLTLLARFEDVGKIIRKLRDETSLKDI 381
>gi|350410051|ref|XP_003488929.1| PREDICTED: hypothetical protein LOC100742343 [Bombus impatiens]
Length = 788
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
T + + +PSL+ E+ + S+ +++ S G + A + + GD + +EKP
Sbjct: 140 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 199
Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
V +I ++ C +C A +PC C A+YCS+ CR +A+ YHR++C+
Sbjct: 200 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 254
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
T + + +PSL+ E+ + S+ +++ S G + A + + GD + +EKP
Sbjct: 380 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 439
Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
V +I ++ C +C A +PC C A+YCS+ CR +A+ YHR++C+
Sbjct: 440 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 494
>gi|270014900|gb|EFA11348.1| hypothetical protein TcasGA2_TC010888 [Tribolium castaneum]
Length = 1112
Score = 64.3 bits (155), Expect = 3e-08, Method: Composition-based stats.
Identities = 34/97 (35%), Positives = 49/97 (50%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
GR A+ED G + E P VI ++ NC CC S P+ C C AV+CS +
Sbjct: 236 GRFARALEDFDTGVIIVEETPHCAVISQENALMNCQFCCISTQQPVACRNCGHAVFCSLN 295
Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
C R+A L +H++EC L A + A+ L ++S
Sbjct: 296 CERQANLTFHKYECKAQPVLFHAGASINCAMALRMIS 332
>gi|328696607|ref|XP_003240079.1| PREDICTED: n-lysine methyltransferase SMYD2-B-like [Acyrthosiphon
pisum]
Length = 407
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 43/74 (58%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
GR + A +++ G+ L +EKP+A V M NCN+C + IPC++C VYC E+
Sbjct: 2 GRGLFASKNIKPGEILIIEKPIAGVFKNSMWMFNCNYCFQRCLSAIPCSKCSQVVYCDET 61
Query: 150 CRREAWLRYHRWEC 163
C R+A YH EC
Sbjct: 62 CLRKAHTCYHGIEC 75
>gi|170040660|ref|XP_001848110.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864249|gb|EDS27632.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 2/106 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
+ M+ A GR ++A + GD + EKPV V+ P NC+HC S + IPC
Sbjct: 177 VGMKQYPASGRGLVAERNFKAGDVILDEKPVVGVVAWPYKYLNCSHCGISNQHSLIPCPN 236
Query: 140 CILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLIL 184
C++ +YCSE C E HR+EC GA ++ ALKL
Sbjct: 237 CVMYMYCSEECLAEDQRLTHRFECGFGAQAENTTFNSSNNALKLFF 282
>gi|340711193|ref|XP_003394163.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 676
Score = 64.3 bits (155), Expect = 3e-08, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH +A +D+ G+TL +E+P V+L ++NC+HC I PIP C+ C YCS
Sbjct: 251 GRHAVATKDIEPGETLVIERPHCAVLLAEYRLTNCHHCFKKIFVPIPTSCDTCNFVAYCS 310
Query: 148 ESCRREAWLRYHRWEC 163
CR + H+ EC
Sbjct: 311 ILCRNKD-AEIHKNEC 325
>gi|194217425|ref|XP_001504392.2| PREDICTED: SET and MYND domain-containing protein 4 [Equus
caballus]
Length = 802
Score = 64.3 bits (155), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/140 (31%), Positives = 65/140 (46%), Gaps = 20/140 (14%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP----PCSMS--- 122
E+ + AS + + T KGR++IA +D+ G+ L E V+ P P +
Sbjct: 224 ENKQISGASASVSLCTDPFKGRYLIATKDILPGELLVKEDAFVSVLNPAEVLPLHLGLES 283
Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLL 170
C+ C AP+PC+ C A YCS+ C ++AW YH EC GAL L
Sbjct: 284 KWDTRVTNGDLYCHRCLKRSLAPVPCDGCSYAKYCSQECMQQAWDLYHNIECSLGALLLT 343
Query: 171 EAVGIAHLALKLILVSSHSD 190
V H+AL+ L++ D
Sbjct: 344 LGV-FCHVALRSTLLARFED 362
>gi|350410049|ref|XP_003488928.1| PREDICTED: hypothetical protein LOC100742217 [Bombus impatiens]
Length = 987
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
T + + +PSL+ E+ + S+ +++ S G + A + + GD + +EKP
Sbjct: 140 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 199
Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
V +I ++ C +C A +PC C A+YCS+ CR +A+ YHR++C+
Sbjct: 200 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 254
Score = 64.3 bits (155), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 2/115 (1%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP- 110
T + + +PSL+ E+ + S+ +++ S G + A + + GD + +EKP
Sbjct: 386 TKQNNRNHFEDIPSLSHNENKDIPGMSDAVRLVHSTKYGVNFEATKPIGTGDVILIEKPQ 445
Query: 111 VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
V +I ++ C +C A +PC C A+YCS+ CR +A+ YHR++C+
Sbjct: 446 VTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYHRFQCN 500
Score = 52.8 bits (125), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 102 GDTLFVEKP-VAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYH 159
GD + +EKP V +I ++ C +C A +PC C A+YCS+ CR +A+ YH
Sbjct: 682 GDVILIEKPQVTSIIQTDVDVARMCYYCLRDYRALLPCERCNSALYCSKECRAKAYEEYH 741
Query: 160 RWECH 164
R++C+
Sbjct: 742 RFQCN 746
>gi|197102342|ref|NP_001127594.1| SET and MYND domain-containing protein 4 [Pongo abelii]
gi|75054780|sp|Q5R5X9.1|SMYD4_PONAB RecName: Full=SET and MYND domain-containing protein 4
gi|55732267|emb|CAH92837.1| hypothetical protein [Pongo abelii]
Length = 804
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 66/142 (46%), Gaps = 18/142 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ + + KGR+++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSSASSSVGLCIDPLKGRYLVATKDILPGELLVKEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYK 193
H+AL+L L+ D K
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRK 365
>gi|380014448|ref|XP_003691244.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 661
Score = 63.9 bits (154), Expect = 4e-08, Method: Composition-based stats.
Identities = 45/159 (28%), Positives = 68/159 (42%), Gaps = 40/159 (25%)
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
ASLP L GE+ L AS + ++ + G+ VIA +++ GD L +E P A ++LP
Sbjct: 117 ASLPVLKGGENPRLPGASTLLDVEETANAGKRVIAAKNIEPGDRLIIESPHAAILLPEFF 176
Query: 121 MSNCNHC-----------------------------------CTSI----SAPIPCNECI 141
++C HC C +I API C +C
Sbjct: 177 GTHCQHCFSRKESDLQFSLICNTAYAFTTYETISCVKTNSFVCLTIIFRFKAPIGCPDCS 236
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
+C CR A YH++EC L LL G++ L++
Sbjct: 237 SVAFCGRKCRDAALAGYHKYECK-ILALLIGSGMSVLSM 274
>gi|198459501|ref|XP_002138697.1| GA24934 [Drosophila pseudoobscura pseudoobscura]
gi|198136711|gb|EDY69255.1| GA24934 [Drosophila pseudoobscura pseudoobscura]
Length = 760
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILA 143
S ++GR++IA E + +G +F E+ FV L + C C S+ APIPC +C
Sbjct: 227 SGSRGRYMIAKEAIPQGKIIFREQATCFVPLEQRLI--CQQCAASLLCAPIPCPQCRQRV 284
Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
VYCS C RE H++EC + LL+ VG++HLAL+++L
Sbjct: 285 VYCSRQC-RERHKPIHKFECAAYRKDLLKMVGVSHLALRMVL 325
>gi|195172794|ref|XP_002027181.1| GL20110 [Drosophila persimilis]
gi|194112994|gb|EDW35037.1| GL20110 [Drosophila persimilis]
Length = 760
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 60/102 (58%), Gaps = 6/102 (5%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIPCNEC-ILA 143
S ++GR++IA E + +G +F E+ FV L + C C S+ APIPC +C
Sbjct: 227 SGSRGRYMIAKEAIPQGKIIFREQATCFVPLEQRLI--CQQCAASLLCAPIPCPQCRQRV 284
Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
VYCS C RE H++EC + LL+ VG++HLAL+++L
Sbjct: 285 VYCSRQC-RERHKPIHKFECAAYRKDLLKMVGVSHLALRMVL 325
>gi|428171150|gb|EKX40069.1| hypothetical protein GUITHDRAFT_164767 [Guillardia theta CCMP2712]
Length = 500
Score = 63.9 bits (154), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 46/89 (51%), Gaps = 3/89 (3%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP- 136
S + ++++ KG H++A++ GDT+ +P AF I P CN C + I
Sbjct: 43 SFSVGVRSTENKGSHLVALKAFRPGDTVLRSRPYAFEIFPELREERCNECFRRPAEGISL 102
Query: 137 --CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C + YC + C+ AW R H++EC
Sbjct: 103 LRCSSCKITRYCGKECQARAWKRSHKYEC 131
>gi|157786648|ref|NP_001099280.1| SET and MYND domain-containing protein 4 [Rattus norvegicus]
gi|149053389|gb|EDM05206.1| SET and MYND domain containing 4 (predicted) [Rattus norvegicus]
Length = 801
Score = 63.9 bits (154), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 81/177 (45%), Gaps = 27/177 (15%)
Query: 39 ENSVEAMSMQVSSTSSSASSSVASLPS----LACGESHT-LMCASNKIKMQTSNAKGRHV 93
+ +V+ + M++ + S A+L +A GE + + AS + + T KGRH+
Sbjct: 188 QRNVQQLKMKIQEKETLPESFPAALTKALEDMALGEENKQISGASLSVSLCTDPLKGRHL 247
Query: 94 IAVEDVHKGDTLFVEKPVAFV-ILPPCSMSNCNHCC-----TSIS--------------A 133
+A +D+ G+ L E AFV +L P M HC T ++ A
Sbjct: 248 VATKDILPGELLVKED--AFVSVLNPGEMPPLRHCLENKWDTRVTSADLYCHRCLRHTLA 305
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
+PC C A YCS+ C ++AW YH EC LL H+AL++ L++ D
Sbjct: 306 TVPCGGCSYAKYCSQECVQQAWDHYHSTECSLGGLLLTLGVFCHVALRMTLLARFED 362
>gi|449480134|ref|XP_004177074.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Taeniopygia guttata]
Length = 797
Score = 63.5 bits (153), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 60/120 (50%), Gaps = 19/120 (15%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS-----------------NCNHCCTSI 131
+GRH++A +D+ G +L E+ V+ P S+ +C+ C +
Sbjct: 242 RGRHLVASQDIVPGQSLVKEEAFVSVLCPGESLPLQDGGTAWDTRATNADLHCHRCLRQL 301
Query: 132 SAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALKLILVSSHSD 190
A +PC C A YCS+ C AW +YHR EC GAL L V H+AL+ +L++ ++
Sbjct: 302 LASVPCQGCSYAKYCSQDCADLAWQQYHRTECSLGALLLTLGV-FCHVALRTVLLAGFAE 360
>gi|345496836|ref|XP_003427832.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 520
Score = 63.5 bits (153), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
+GRH+ DV G+ L+ P + +NC+HC + A IPC+ C+ A+YCSE
Sbjct: 215 QGRHLTVTRDVGPGEALYASSPYVVCPIETRIYNNCSHCLSFAWACIPCDSCVYAMYCSE 274
Query: 149 SCRREAWLRYHRWEC 163
C+ EA YH EC
Sbjct: 275 RCKNEAKDLYHDIEC 289
>gi|312381827|gb|EFR27477.1| hypothetical protein AND_05788 [Anopheles darlingi]
Length = 479
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+N +++ S GR+VI D+ GD + E+P +++ C++C + P
Sbjct: 195 ANCLELAESKQFGRYVITNRDLKAGDIVIHEQPTHSLLIDTYRHMRCDYCLQKHLYTLRP 254
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C +A+YCSE CR++A L YHR+EC
Sbjct: 255 CEGCTVAMYCSEECRKQAQLTYHRYEC 281
>gi|158292886|ref|XP_001688544.1| AGAP005253-PA [Anopheles gambiae str. PEST]
gi|157017203|gb|EDO64127.1| AGAP005253-PA [Anopheles gambiae str. PEST]
Length = 638
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
++++ ++ GRH++ + + GD L +EKP A ++ C C + IPC
Sbjct: 209 VELRQNSEFGRHLVTTQHLKAGDVLLIEKPYASMLNDKERYKRCAFCHNEDTFTLIPCEG 268
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
C L +YCS+ C +A+ +YHR+EC G LR
Sbjct: 269 CTLTMYCSDECMDKAYKQYHRYEC-GVLR 296
>gi|348567599|ref|XP_003469586.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cavia
porcellus]
Length = 800
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 66/141 (46%), Gaps = 22/141 (15%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
E+ + AS+ +++ T KGR++IA +D+ G+ L E AFV +L P M
Sbjct: 224 ENEHISSASSSVRLCTDPLKGRYLIATKDIIPGELLVKED--AFVSVLNPGEMPPPHQGL 281
Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C+ C A +PC+ C A YCS+ C ++AW YH EC L
Sbjct: 282 ESRWDTRVTLGDLYCHRCLKHTLAMVPCHGCSYAKYCSQECMQQAWELYHSVECSLGELL 341
Query: 170 LEAVGIAHLALKLILVSSHSD 190
L H+AL+L L++ D
Sbjct: 342 LTLGVFCHVALRLTLLARFED 362
>gi|312377245|gb|EFR24125.1| hypothetical protein AND_11518 [Anopheles darlingi]
Length = 479
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 49/87 (56%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+N +++ S GR+VI D+ GD + E+P +++ C++C + P
Sbjct: 195 ANCLELAESKQFGRYVITNRDLKAGDIVIHEQPTHSLLIDTYRHMRCDYCLQKHLYTLRP 254
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C +A+YCSE CR++A L YHR+EC
Sbjct: 255 CEGCTVAMYCSEECRKQAQLTYHRYEC 281
>gi|158292888|ref|XP_314169.4| AGAP005253-PB [Anopheles gambiae str. PEST]
gi|157017204|gb|EAA09348.4| AGAP005253-PB [Anopheles gambiae str. PEST]
Length = 613
Score = 63.2 bits (152), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 50/89 (56%), Gaps = 2/89 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
++++ ++ GRH++ + + GD L +EKP A ++ C C + IPC
Sbjct: 209 VELRQNSEFGRHLVTTQHLKAGDVLLIEKPYASMLNDKERYKRCAFCHNEDTFTLIPCEG 268
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
C L +YCS+ C +A+ +YHR+EC G LR
Sbjct: 269 CTLTMYCSDECMDKAYKQYHRYEC-GVLR 296
>gi|380013978|ref|XP_003691021.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
florea]
Length = 678
Score = 63.2 bits (152), Expect = 8e-08, Method: Composition-based stats.
Identities = 42/119 (35%), Positives = 61/119 (51%), Gaps = 6/119 (5%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH IA +D+ G+ L +EKP + +L + NC +C T I P+P C C YCS
Sbjct: 253 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPVPAVCQTCSCVAYCS 312
Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
SCR + + H EC L L + LAL++I+ S YK + L+T+ +D
Sbjct: 313 ISCRNKD-AKIHENECPILPTLWLSKTSINCFLALRIIVQQSFDKLYK-LKDLKTNSKD 369
>gi|57970742|ref|XP_564258.1| AGAP011234-PA [Anopheles gambiae str. PEST]
gi|55244729|gb|EAL41557.1| AGAP011234-PA [Anopheles gambiae str. PEST]
Length = 646
Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
++++ + GRH++ + + GD L +EKP A ++ C C + IPC
Sbjct: 207 VELRQDSEFGRHLVTTQHLKAGDVLMIEKPYASLLCERDQYKRCAFCHNEDTFTLIPCEG 266
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C +A+YCSE CR +A +YHR+EC
Sbjct: 267 CTVAMYCSEECRDKAHKQYHRYEC 290
>gi|26354773|dbj|BAC41013.1| unnamed protein product [Mus musculus]
Length = 799
Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 219 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 276
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 277 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 336
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 337 LGGLLLTLGVFCHVALRMTLLARFED 362
>gi|156616310|ref|NP_001096081.1| SET and MYND domain-containing protein 4 [Mus musculus]
gi|90101757|sp|Q8BTK5.2|SMYD4_MOUSE RecName: Full=SET and MYND domain-containing protein 4
gi|148680856|gb|EDL12803.1| SET and MYND domain containing 4, isoform CRA_c [Mus musculus]
Length = 799
Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 219 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 276
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 277 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 336
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 337 LGGLLLTLGVFCHVALRMTLLARFED 362
>gi|351704191|gb|EHB07110.1| SET and MYND domain-containing protein 4, partial [Heterocephalus
glaber]
Length = 793
Score = 62.8 bits (151), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 72/144 (50%), Gaps = 22/144 (15%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ + AS+ +++ T KGR+++A++D+ G+ L E V+ +PP
Sbjct: 224 ENEQISNASSSVRLCTDPLKGRYLVAIKDILPGEILVKEDAFVSVLNPGEMPPLHQGLES 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YH EC LL
Sbjct: 284 RWDTRVTNGDLYCHKCLKHTLAMVPCHGCSYANYCSQECMQQAWELYHSIECSLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEV 195
H+AL+L L++ R+K+V
Sbjct: 344 LGMFCHVALRLTLLA----RFKDV 363
>gi|26326379|dbj|BAC26933.1| unnamed protein product [Mus musculus]
Length = 666
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 23/154 (14%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 86 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 143
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 144 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 203
Query: 165 GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
LL H+AL++ L++ D + V L
Sbjct: 204 LGGLLLTLGVFCHVALRMTLLARFEDDDRVVRML 237
>gi|21754614|dbj|BAC04538.1| unnamed protein product [Homo sapiens]
Length = 804
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|168275572|dbj|BAG10506.1| SET and MYND domain-containing protein 4 [synthetic construct]
Length = 804
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|119610982|gb|EAW90576.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|158287350|ref|XP_309407.4| AGAP011238-PA [Anopheles gambiae str. PEST]
gi|157019608|gb|EAA45344.4| AGAP011238-PA [Anopheles gambiae str. PEST]
Length = 497
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
++++ + GRH++ + + GD L +EKP A +++ C C IPC
Sbjct: 210 VELRQDSEFGRHLVTTQHLKAGDVLLIEKPYANLLIDVERHVRCAFCQNEDRFTLIPCEG 269
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
C + +YCSE CR +A +YHR+EC G LR
Sbjct: 270 CTVTMYCSEECRDKAHKQYHRYEC-GVLR 297
>gi|156616308|ref|NP_443160.2| SET and MYND domain-containing protein 4 [Homo sapiens]
gi|296452956|sp|Q8IYR2.3|SMYD4_HUMAN RecName: Full=SET and MYND domain-containing protein 4
Length = 804
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|148680855|gb|EDL12802.1| SET and MYND domain containing 4, isoform CRA_b [Mus musculus]
Length = 828
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 248 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 305
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 306 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 365
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 366 LGGLLLTLGVFCHVALRMTLLARFED 391
>gi|328789942|ref|XP_392262.3| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 677
Score = 62.4 bits (150), Expect = 1e-07, Method: Composition-based stats.
Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 7/109 (6%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH IA +D+ G+ L +EKP + +L + NC +C T I PIP C C YCS
Sbjct: 252 GRHAIATKDIEPGEILAIEKPYSAFLLAEYRLINCFYCFTKIFVPIPAVCQTCSCVAYCS 311
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRYK 193
SCR + + H EC L L A + LAL++I+ S YK
Sbjct: 312 ISCRDKD-AKIHENEC-SILPTLWASKTSINCFLALRIIVQQSFEKLYK 358
>gi|74181871|dbj|BAE32637.1| unnamed protein product [Mus musculus]
gi|122936404|gb|AAI30221.1| Smyd4 protein [Mus musculus]
Length = 666
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 86 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 143
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 144 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 203
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 204 LGGLLLTLGVFCHVALRMTLLARFED 229
>gi|50511197|dbj|BAD32584.1| mKIAA1936 protein [Mus musculus]
Length = 716
Score = 62.4 bits (150), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 136 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 193
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 194 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 253
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 254 LGGLLLTLGVFCHVALRMTLLARFED 279
>gi|148680854|gb|EDL12801.1| SET and MYND domain containing 4, isoform CRA_a [Mus musculus]
Length = 743
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 23/146 (15%)
Query: 66 LACGESHT-LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN 123
+A GE +T + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 163 IALGEENTQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPR 220
Query: 124 -------------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
C+ C A +PC C A YCS+ C ++AW YH EC
Sbjct: 221 PHHCLENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECS 280
Query: 165 GALRLLEAVGIAHLALKLILVSSHSD 190
LL H+AL++ L++ D
Sbjct: 281 LGGLLLTLGVFCHVALRMTLLARFED 306
>gi|195430416|ref|XP_002063251.1| GK21490 [Drosophila willistoni]
gi|194159336|gb|EDW74237.1| GK21490 [Drosophila willistoni]
Length = 745
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 59/103 (57%), Gaps = 6/103 (5%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNEC-ILA 143
SN R++IA ++ + +F E AFV P C+ C TS APIPC C
Sbjct: 223 SNCDNRYMIAKNNIAAHEVIFQEMASAFV--PIEDNIICHQCATSFMCAPIPCPHCHHRV 280
Query: 144 VYCSESCRREAWLRYHRWECHG-ALRLLEAVGIAHLALKLILV 185
+YCS +C R++ L H++EC L LL+ +G++HLA++L+L
Sbjct: 281 IYCSRAC-RQSHLYIHQYECAAYRLNLLQMLGVSHLAMRLVLT 322
>gi|410209806|gb|JAA02122.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410209808|gb|JAA02123.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246942|gb|JAA11438.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410246944|gb|JAA11439.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289944|gb|JAA23572.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289946|gb|JAA23573.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410289948|gb|JAA23574.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330765|gb|JAA34329.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330767|gb|JAA34330.1| SET and MYND domain containing 4 [Pan troglodytes]
gi|410330769|gb|JAA34331.1| SET and MYND domain containing 4 [Pan troglodytes]
Length = 835
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIIPKL 370
>gi|114665632|ref|XP_511253.2| PREDICTED: SET and MYND domain-containing protein 4 isoform 3 [Pan
troglodytes]
Length = 835
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIIPKL 370
>gi|326931360|ref|XP_003211799.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Meleagris gallopavo]
Length = 706
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 58/122 (47%), Gaps = 20/122 (16%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------------CSMSN----CNHCC 128
+GRH++A +++ G L EK V+ P ++N C+HC
Sbjct: 147 RGRHLVASQNILPGQNLLKEKAFVSVLCPGEGDSLLLQDSSETVWDTQVTNADLYCHHCL 206
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
+ A +PC C A YCS++C AW +YHR EC LL H+AL+ +L++
Sbjct: 207 KQLLASVPCCGCSYAKYCSQNCADVAWEQYHRTECPLGALLLVLGVFCHVALRTVLLAGF 266
Query: 189 SD 190
S+
Sbjct: 267 SE 268
>gi|15620931|dbj|BAB67829.1| KIAA1936 protein [Homo sapiens]
Length = 558
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 28 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 87
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 88 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 147
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 148 LGVFCHIALRLTLLVGFEDVRKIITKL 174
>gi|397491961|ref|XP_003816904.1| PREDICTED: SET and MYND domain-containing protein 4 [Pan paniscus]
Length = 835
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 67/147 (45%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGIFCHIALRLTLLVGFEDVRKIIPKL 370
>gi|170060861|ref|XP_001865989.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879226|gb|EDS42609.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 588
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 77/158 (48%), Gaps = 16/158 (10%)
Query: 48 QVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFV 107
+V +T S A++ + +P++ G I+++ S+ GRH++A + GDTL +
Sbjct: 180 EVEATLSHAANPI--IPNVVAG-----------IELEVSDVYGRHLVARQYFDFGDTLLI 226
Query: 108 EKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECH-- 164
+ P V C+HC + +PC +C+ ++CS C + A RYHR+EC
Sbjct: 227 DDPYVVVADLGQQYRQCHHCLKFCTLKLVPCPDCVEVMFCSPECAKLAQQRYHRFECPIL 286
Query: 165 GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHL 202
L L+ +G + + I+ ++ + ++ L H+
Sbjct: 287 RGLYTLDKIGRMAILVARIVFTAVTGFNDDLDALREHI 324
>gi|296201020|ref|XP_002747857.1| PREDICTED: SET and MYND domain-containing protein 4 [Callithrix
jacchus]
Length = 798
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 18/142 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPCSMS--- 122
E+ L AS+ + + KGR+++A ++ G+ L E V+ LPP S
Sbjct: 224 ENKQLSNASSSVGLCMDPLKGRYLVATREILPGEPLVKEDAFVSVLNPGELPPLHHSLES 283
Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQAWKLYHRIECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYK 193
H+AL+L L+ D K
Sbjct: 344 LGVFCHIALRLTLLVGFQDVSK 365
>gi|403275299|ref|XP_003929391.1| PREDICTED: SET and MYND domain-containing protein 4 [Saimiri
boliviensis boliviensis]
Length = 800
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 63/142 (44%), Gaps = 18/142 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPCSMS--- 122
E+ L AS+ + + KGR+++A ++ G+ L E V+ LPP S
Sbjct: 224 ENKQLSNASSSVSLCMDPVKGRYLVATREILPGEPLVKEDAFVSVLNPGELPPLHHSLGS 283
Query: 123 -----------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C A +PC+ C A YCS+ C ++AW YHR EC LL
Sbjct: 284 KWNTRVTNADLYCHRCLRHTLATVPCDGCSYAKYCSQECLQQAWGLYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYK 193
H+AL+L L+ D K
Sbjct: 344 LGVFCHIALRLTLLVGFEDVSK 365
>gi|432096123|gb|ELK26991.1| SET and MYND domain-containing protein 4 [Myotis davidii]
Length = 777
Score = 61.6 bits (148), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/139 (30%), Positives = 65/139 (46%), Gaps = 18/139 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ + CAS+ + + T KGR +IA +D+ G+ L E+ V+ +PP
Sbjct: 199 ENEQISCASSSVTLCTDPFKGRSLIATKDILPGELLVREEAFVSVLNPGEMPPRRDGLES 258
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++AW YH EC LL
Sbjct: 259 KWDTRVTNGDLYCHRCLKHTVATVPCDGCSYAKYCSQECMQQAWDLYHSAECSLGGLLLT 318
Query: 172 AVGIAHLALKLILVSSHSD 190
H AL+ L++ D
Sbjct: 319 LGVFCHTALRATLLARCED 337
>gi|170029107|ref|XP_001842435.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880642|gb|EDS44025.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 61.2 bits (147), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/94 (34%), Positives = 52/94 (55%), Gaps = 2/94 (2%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIP 136
+ IKM++ GR ++A +D GD + EKP+A I +C++C ++ IP
Sbjct: 189 AKGIKMESLPEFGRSMMAEKDFRAGDVILSEKPLAAAIDSSLRYVDCSYCTIVHFNSLIP 248
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C+ +YC E CR+ W + HR+EC A +L+
Sbjct: 249 CPGCVSFMYCDEECRQADW-KTHRFECGVAEKLM 281
>gi|23272879|gb|AAH35077.1| SET and MYND domain containing 4 [Homo sapiens]
Length = 804
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 66/147 (44%), Gaps = 18/147 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP------- 118
E+ L AS+ I + KGR ++A +D+ G+ L E V+ LPP
Sbjct: 224 ENEQLSNASSSIGLCVDPLKGRCLVATKDILPGELLVQEDAFVSVLNPGELPPPHHGLDS 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS+ C ++ W YHR EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLATVPCDGCSYAKYCSQECLQQVWELYHRTECPLGGLLLT 343
Query: 172 AVGIAHLALKLILVSSHSDRYKEVYHL 198
H+AL+L L+ D K + L
Sbjct: 344 LGVFCHIALRLTLLVGFEDVRKIITKL 370
>gi|63146229|gb|AAH95952.1| Smyd4 protein [Mus musculus]
Length = 799
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 63/141 (44%), Gaps = 22/141 (15%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN----- 123
E+ + AS + + T KGRH++A +D+ G+ L E AFV +L P M
Sbjct: 224 ENKQISGASLSVSLCTHPLKGRHLVATKDILPGELLVKED--AFVSVLIPGEMPRPHHCL 281
Query: 124 --------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C+ C A +PC C A YCS+ C ++AW YH EC L
Sbjct: 282 ENKWDTRVTSGDLYCHRCLKHTLATVPCGSCSYAKYCSQECMQQAWDLYHSTECSLGGLL 341
Query: 170 LEAVGIAHLALKLILVSSHSD 190
L H+AL++ L++ D
Sbjct: 342 LTLGVFCHVALRMTLLARFED 362
>gi|332257691|ref|XP_003277939.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 4 [Nomascus leucogenys]
Length = 703
Score = 60.8 bits (146), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 71/158 (44%), Gaps = 22/158 (13%)
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--- 117
A+L L E+ L AS+ I + KGR+++A +D+ G+ L E AFV +
Sbjct: 114 AALFHLGQYENEQLSSASSSIGLCVDPLKGRYLVATKDILPGELLVKED--AFVSVLNPE 171
Query: 118 -------------PCSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
++N C+ C A +PC+ C A YCS+ C ++AW YHR
Sbjct: 172 ELPPPPHGLDSKWDTRVTNGDLYCHRCLKHTLATVPCDRCSYAKYCSQECLQQAWELYHR 231
Query: 161 WECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
EC LL H+AL+L L+ D K + L
Sbjct: 232 TECPLGGLLLTLGVFCHIALRLTLLVGFEDVLKIITKL 269
>gi|195029279|ref|XP_001987502.1| GH21957 [Drosophila grimshawi]
gi|193903502|gb|EDW02369.1| GH21957 [Drosophila grimshawi]
Length = 758
Score = 60.8 bits (146), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 65/119 (54%), Gaps = 7/119 (5%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT 129
E L ++KI M +S +GR++IA + + +GD +F E+ FV P C C
Sbjct: 211 ELQKLTGDTHKISMDSST-RGRYMIATKRLSEGDIIFTEQADCFV--PIEQRLICQKCAA 267
Query: 130 S-ISAPIPCNEC-ILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
S + APIPC +C VYCS C R+ H +EC + LL +G++HLA++L+L
Sbjct: 268 SLLCAPIPCPQCHQRVVYCSRRC-RDLHKYIHSYECAAYRKDLLIMLGVSHLAMRLLLT 325
>gi|291405379|ref|XP_002719091.1| PREDICTED: SET and MYND domain containing 4 [Oryctolagus cuniculus]
Length = 801
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 59/134 (44%), Gaps = 18/134 (13%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP--------------CSMSN----C 124
+ T +KGR ++A +DV G+ L E V++P +SN C
Sbjct: 237 LGTDPSKGRCLVATKDVLPGELLVKEDAFVSVLIPGEVPPLPHGLGSRWDTRISNGDLYC 296
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
+ C A +PC+ C A YCS C R AW YH EC LL H+AL+L L
Sbjct: 297 HRCLKHTLATVPCDGCSYAKYCSLECLRRAWELYHSAECALGGLLLTLGIFCHVALRLTL 356
Query: 185 VSSHSDRYKEVYHL 198
V+ D K + L
Sbjct: 357 VARFQDFGKVIRKL 370
>gi|195485531|ref|XP_002091128.1| GE12406 [Drosophila yakuba]
gi|194177229|gb|EDW90840.1| GE12406 [Drosophila yakuba]
Length = 573
Score = 60.5 bits (145), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 48/84 (57%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ S +GR V+ D+ GD + VE+P +LPP C C + IPC+
Sbjct: 187 LELRESADEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A++CSE C+++A +HR+EC
Sbjct: 247 CCSAMFCSEECKQQAMSTFHRFEC 270
>gi|158287352|ref|XP_309409.4| AGAP011237-PA [Anopheles gambiae str. PEST]
gi|157019609|gb|EAA05217.4| AGAP011237-PA [Anopheles gambiae str. PEST]
Length = 481
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 49/89 (55%), Gaps = 2/89 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
++++ ++ GRH++ + + GD L +EKP A ++ C+ C IPC
Sbjct: 197 VELRQNSEVGRHLVTTQKLKVGDVLLIEKPYASMLNDQERYKRCDFCQNEDRFTLIPCEG 256
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALR 168
C + +YCS+ C +A +YHR+EC G LR
Sbjct: 257 CTVTMYCSKECMDKAHKQYHRYEC-GVLR 284
>gi|321449488|gb|EFX61914.1| hypothetical protein DAPPUDRAFT_68494 [Daphnia pulex]
Length = 680
Score = 59.7 bits (143), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 65/135 (48%), Gaps = 17/135 (12%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAPI 135
S KI ++ GR+ +A + GD + V+ P A V+ P ++C+HC + +
Sbjct: 242 SKKILVEEDAVSGRYGVAASPIRVGDVIAVDAPYASVMNPEKFSTHCHHCYQILELGEVL 301
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLILVSSHSDRY 192
PC+ C L +CS +CR A YH EC L L A GI+ +L+L++I +
Sbjct: 302 PCSHCDLVSFCSVNCRSRAMEIYHAIEC-PILSCLYAAGISIICYLSLRMIAI------- 353
Query: 193 KEVYHLETHLQDMRP 207
H + D+RP
Sbjct: 354 ----HPPSFFMDVRP 364
>gi|158299652|ref|XP_319721.4| AGAP008973-PA [Anopheles gambiae str. PEST]
gi|157013620|gb|EAA14846.4| AGAP008973-PA [Anopheles gambiae str. PEST]
Length = 668
Score = 59.7 bits (143), Expect = 8e-07, Method: Composition-based stats.
Identities = 40/102 (39%), Positives = 51/102 (50%), Gaps = 8/102 (7%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMS-----NCNHCCTSISAPIPCNECILA 143
GR+V+A E + DT+ E V+FV + P S S +C C P PC C A
Sbjct: 165 GRYVVAAEAIKANDTVARETAVSFVPVYDPESSSTLPSFDCQKCAKVNVVPFPCPTCGRA 224
Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLIL 184
YCS C R A HR+EC G + L +GIAHL L+ L
Sbjct: 225 CYCSTRC-RVAHRPVHRFECFGYQKHLWYQIGIAHLGLRCFL 265
>gi|170036325|ref|XP_001846015.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878892|gb|EDS42275.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 559
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++A D + GD + +P+ I NC+HC S + IPC C+ +YC E
Sbjct: 183 RILVAERDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGCVFFMYCGEE 242
Query: 150 CRREAWLRYHRWECHGALRL 169
CR+++W +HR+EC A +L
Sbjct: 243 CRQKSWKLWHRFECPVATKL 262
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 59.7 bits (143), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 8/140 (5%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
++ +++ + KGR + D+ G+ +F E A V+L S+ C+ C P C
Sbjct: 6 TDPVEVFGAGLKGRGLRGTRDLSAGEVVFAEASFAAVVLDSLSLQVCHSCFRRQVNPHRC 65
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSD----- 190
+C A YC +C+R AW HR EC + +A + +A L + H+
Sbjct: 66 AQCKFAHYCDRTCQRAAW-DEHRKECSAIRNIGKAPNENVRLVARILWRIQKHTGLVSDS 124
Query: 191 RYKEVYHLETHLQDMRPEDL 210
+ + LE HL M PEDL
Sbjct: 125 QLTTLDMLEDHLSRMTPEDL 144
>gi|383852386|ref|XP_003701709.1| PREDICTED: SET and MYND domain-containing protein 4-like [Megachile
rotundata]
Length = 675
Score = 59.7 bits (143), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 54/126 (42%), Gaps = 3/126 (2%)
Query: 40 NSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDV 99
N + S +++ S S P + G L C+ GRH +A ++
Sbjct: 200 NQIAQKSNKMNLKKDKIDGSKRSCPKIEDGNPLFLSCSKAVTIRNDGGDIGRHAVATRNI 259
Query: 100 HKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCSESCRREAWLR 157
G+ L VEKP IL +++C+ C I PIP C C YCS SC R+A +
Sbjct: 260 EPGEILVVEKPHCSFILAEYRLTHCHLCMNKIFVPIPSKCQVCSYLAYCSTSC-RDADAK 318
Query: 158 YHRWEC 163
H EC
Sbjct: 319 IHESEC 324
>gi|290976641|ref|XP_002671048.1| SET domain-containing protein [Naegleria gruberi]
gi|284084613|gb|EFC38304.1| SET domain-containing protein [Naegleria gruberi]
Length = 914
Score = 59.3 bits (142), Expect = 9e-07, Method: Composition-based stats.
Identities = 32/94 (34%), Positives = 46/94 (48%), Gaps = 3/94 (3%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
C+ + T+ GR + A ++ G + EKP V L + + CN+C + I
Sbjct: 274 CSPLHAHVITTEQYGRGMYASSELKSGQVVLSEKPFLCVTLD--TENTCNNCLNKLKKKI 331
Query: 136 PC-NECILAVYCSESCRREAWLRYHRWECHGALR 168
PC N C + YCS CR EAW +YHR C +R
Sbjct: 332 PCKNNCGMEYYCSMECRSEAWDKYHRKLCGFDMR 365
>gi|170036323|ref|XP_001846014.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878891|gb|EDS42274.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 576
Score = 59.3 bits (142), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP--VAFVILPPCSMSNCNHCCTSI-SAP 134
+ I M+ GR +IA++D GD + EKP A S C HC + ++
Sbjct: 183 AKGIGMKQYPGLGRGLIALKDFQPGDVILDEKPELCAVSFNNNFSYHYCFHCGSEFQTSL 242
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
IPC +C +YCSESC W + HR+EC A++L
Sbjct: 243 IPCPKCTNHMYCSESCLETDWKQAHRFECAVAMKLF 278
>gi|57974582|ref|XP_566179.1| AGAP000216-PA [Anopheles gambiae str. PEST]
gi|55243600|gb|EAL41256.1| AGAP000216-PA [Anopheles gambiae str. PEST]
Length = 526
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 69/138 (50%), Gaps = 8/138 (5%)
Query: 32 LTIEKSEENSVEAMSMQVSST--SSSASSSVASLP-SLACGESHTLMCASNKIKMQTSNA 88
L K +E+ A SM +S+T ++ SS + P L+ +N ++M+ S
Sbjct: 97 LLRRKLDESERCARSMLLSATDDATPGSSRHGAAPLKLSFPAYENAPRVANCLEMRRSPQ 156
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP---IPCNECILAVY 145
GR++ + + GD + +++P V+ P + C+HC AP IPC C A+Y
Sbjct: 157 YGRYLQTNKALKVGDVVMIDEPYVSVLEPEFCYARCDHC--QRPAPFTLIPCERCTKAMY 214
Query: 146 CSESCRREAWLRYHRWEC 163
CS++C R A YH +EC
Sbjct: 215 CSKNCLRRARTEYHEFEC 232
>gi|170038782|ref|XP_001847227.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167882473|gb|EDS45856.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 560
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 55/107 (51%), Gaps = 1/107 (0%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++A +D++ GD + +P+ I NC+HC S + IPC +C+ +YC E
Sbjct: 231 RILVAEKDINPGDVIMDAEPLLTAIDFNLCYKNCSHCGVKFSNSLIPCPDCVFFMYCEEE 290
Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
CR+++W +H++EC A L L+ + + +R++ Y
Sbjct: 291 CRQKSWKLWHQFECPVATELRNFSNFNLLSTPRLSTGAELNRFELDY 337
>gi|170036321|ref|XP_001846013.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878890|gb|EDS42273.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 559
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/87 (34%), Positives = 45/87 (51%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIP 136
++ I M GR +IA +D + GD + EK + S NCNHC + + + IP
Sbjct: 175 ADGIGMNYEPKFGRGLIAEKDFNPGDIILEEKSELCGVDFNLSYRNCNHCSSRFNYSLIP 234
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C +YCS+ C + W +HR+EC
Sbjct: 235 CPTCPFMMYCSQECLEQNWNLFHRFEC 261
>gi|170058970|ref|XP_001865157.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877852|gb|EDS41235.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 556
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 46/93 (49%), Gaps = 1/93 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
+N I M GR ++AV D + GD + EK S SNC HC S+ IP
Sbjct: 168 ANGIDMTQYEDMGRGLVAVHDFNAGDLILNEKIELCFGSFERSYSNCAHCGAEFRSSLIP 227
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C C+ +YC + CR E + HR+EC A +L
Sbjct: 228 CPGCVAFMYCGKKCRNEDFRTVHRFECAVATKL 260
>gi|170072158|ref|XP_001870107.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167868273|gb|EDS31656.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 554
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
+ I M GR ++A +D + GD + E FV CN C S+ IP
Sbjct: 172 AKGIAMGYDPKFGRGLVAEKDFNAGDLILDETSELFVHEFSLCFQRCNQCSASLYQILIP 231
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C+L +YCSE CR W +HR+EC
Sbjct: 232 CRTCVLFMYCSEKCRELHWKLFHRFEC 258
>gi|157136250|ref|XP_001656795.1| hypothetical protein AaeL_AAEL003447 [Aedes aegypti]
gi|108881063|gb|EAT45288.1| AAEL003447-PA [Aedes aegypti]
Length = 537
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 97 EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCSESCRREAW 155
+++ G+ VE+P+ V+ P S CN+C + IPC +C+ +YCSE CR EA+
Sbjct: 190 KELRVGEVAVVERPLLVVVEPEAVQSRCNYCGSKNELDLIPCRKCVSVMYCSEKCRDEAY 249
Query: 156 LRYHRWEC------HGALRLLEAVGIAHLALKLILVS 186
YH++EC R + + L LKL ++
Sbjct: 250 SCYHKFECAVIKDLKNLFRGPKPTRMFQLTLKLFWMA 286
>gi|354508096|ref|XP_003516089.1| PREDICTED: SET and MYND domain-containing protein 4-like, partial
[Cricetulus griseus]
Length = 391
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN 123
E+ + AS + + T KGRH++A +D+ G+ L E V+ PP + N
Sbjct: 101 ENKQISGASLSVSLHTDPLKGRHLVAAKDILPGELLVKEDAFVSVLNPGETPPLHYGLEN 160
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C A +PC C A YCS+ C ++AW YH EC LL
Sbjct: 161 KWDTRVTNGDLYCHRCLKHTLATVPCGGCSYARYCSQECMQKAWECYHSTECPLGGLLLT 220
Query: 172 AVGIAHLALKLILVSSHSD 190
H+AL++ L++ D
Sbjct: 221 LGVFCHVALRITLLARFED 239
>gi|195402463|ref|XP_002059824.1| GJ15028 [Drosophila virilis]
gi|194140690|gb|EDW57161.1| GJ15028 [Drosophila virilis]
Length = 723
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 54/103 (52%), Gaps = 6/103 (5%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIPCNEC-ILA 143
+ +GR++IA + + +G+ +F E+ FV P C C S + APIPC C
Sbjct: 195 AGGRGRYMIASQRIKEGEIVFTEQAECFV--PTQGRLICQQCAASLLCAPIPCTRCHQRV 252
Query: 144 VYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILV 185
VYCS C R+ H +EC R LL +G HLAL+L+L
Sbjct: 253 VYCSRRC-RQLHATIHGYECEAYRRDLLLPIGPFHLALRLVLT 294
>gi|195429543|ref|XP_002062817.1| GK19653 [Drosophila willistoni]
gi|194158902|gb|EDW73803.1| GK19653 [Drosophila willistoni]
Length = 574
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
++++ S +GR ++ +D+ GD + +E+P +LPP C C + IPC+
Sbjct: 187 LELRESPDEGRFIVTKQDLVVGDLVAIEQPFCSTLLPPMRYIRCATCKMENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ +A +HR+EC
Sbjct: 247 CCSVMFCSEECKTKAMTTFHRYEC 270
>gi|344235250|gb|EGV91353.1| SET and MYND domain-containing protein 4 [Cricetulus griseus]
Length = 370
Score = 58.9 bits (141), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 62/139 (44%), Gaps = 18/139 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN 123
E+ + AS + + T KGRH++A +D+ G+ L E V+ PP + N
Sbjct: 80 ENKQISGASLSVSLHTDPLKGRHLVAAKDILPGELLVKEDAFVSVLNPGETPPLHYGLEN 139
Query: 124 ------------CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C A +PC C A YCS+ C ++AW YH EC LL
Sbjct: 140 KWDTRVTNGDLYCHRCLKHTLATVPCGGCSYARYCSQECMQKAWECYHSTECPLGGLLLT 199
Query: 172 AVGIAHLALKLILVSSHSD 190
H+AL++ L++ D
Sbjct: 200 LGVFCHVALRITLLARFED 218
>gi|170047663|ref|XP_001851333.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870014|gb|EDS33397.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 561
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/110 (30%), Positives = 53/110 (48%), Gaps = 11/110 (10%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSMSNCNHCCTS-ISAPIPCN 138
I M GR ++A +D + G+ + ++P+ L P +S+CN C + IPC
Sbjct: 176 IGMSYDAKFGRGLVAEKDFNPGNVI-CDEPIELCALEPNLLLSSCNQCSAELVEMLIPCL 234
Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSH 188
C + ++CSE CR W YHR+EC AV H + V++H
Sbjct: 235 TCPMVMFCSEKCRELNWQLYHRFEC--------AVASKHFISSFVTVATH 276
>gi|170070169|ref|XP_001869489.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167866058|gb|EDS29441.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 541
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 1/80 (1%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++A +D++ GD + +P+ I NC+HC S + IPC +C+ +YC E
Sbjct: 231 RILVAEKDINPGDVIMDAEPLLTAIDFNLCYKNCSHCGVKFSNSLIPCPDCVFFMYCGEE 290
Query: 150 CRREAWLRYHRWECHGALRL 169
CR+++W +H++EC A L
Sbjct: 291 CRQKSWKLWHQFECPVATEL 310
>gi|170047661|ref|XP_001851332.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870013|gb|EDS33396.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 465
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 59/128 (46%), Gaps = 13/128 (10%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP-CSMSNCNHCCTS-ISAPIPCN 138
I M GR ++A +D + G+ + ++P+ L P +S+CN C + IPC
Sbjct: 80 IGMSYDAKFGRGLVAEKDFNPGNVI-CDEPIELCALEPNLLLSSCNQCSAELVEMLIPCL 138
Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHL 198
C + ++CSE CR W YHR+EC AV H + V++H + Y L
Sbjct: 139 TCPMVMFCSEKCRELNWQLYHRFEC--------AVASKHFISSFVTVATHPRLF--FYGL 188
Query: 199 ETHLQDMR 206
D++
Sbjct: 189 SQFGDDLQ 196
>gi|312373132|gb|EFR20944.1| hypothetical protein AND_18266 [Anopheles darlingi]
Length = 648
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 39/157 (24%), Positives = 70/157 (44%), Gaps = 26/157 (16%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ ++ ++ GR+ A +++ + EKP ++ S ++C+ C + C
Sbjct: 223 DDAVRYDSNETDGRYATAQKNLKPNQHILKEKPHGATLVQEYSSTHCSTCFERVEILFCC 282
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLILVSSH------ 188
C+ V+CS C + A YHR+EC G LR L + G ++ LAL+++ +
Sbjct: 283 PNCVDVVFCSGRCEKTANQSYHRYEC-GFLRSLWSSGATIVSLLALRIVTQKPYSYFEAI 341
Query: 189 ----------------SDRYKEVYHLETHLQDMRPED 209
SD Y++V++L TH PED
Sbjct: 342 RDELPNLTANFTDKLSSDDYRKVFNLVTHSDKRDPED 378
>gi|158284775|ref|XP_307865.2| AGAP009448-PA [Anopheles gambiae str. PEST]
gi|157020893|gb|EAA03631.2| AGAP009448-PA [Anopheles gambiae str. PEST]
Length = 561
Score = 58.2 bits (139), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 64/142 (45%), Gaps = 4/142 (2%)
Query: 34 IEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHV 93
++K EE + E ++ + + LP L+ + +N +++Q + GRHV
Sbjct: 112 LDKREEQAKEELA-KSNEKQKKKDDPKPELP-LSFPGHENMPHVANCLQLQQNEQYGRHV 169
Query: 94 IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRR 152
+ + GD + ++ P + P C+ C IPC C +YCS C
Sbjct: 170 VTTRRLKVGDVVMLDTPFVKTLHDPLRHVRCDFCHAERPFTLIPCEGCTWVMYCSAECLG 229
Query: 153 EAWLRYHRWECHGALRLLEAVG 174
+A+ +YHR+EC G +R L V
Sbjct: 230 KAYSQYHRYEC-GVMRDLWRVA 250
>gi|170041125|ref|XP_001848325.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864690|gb|EDS28073.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 592
Score = 57.8 bits (138), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 56/101 (55%), Gaps = 1/101 (0%)
Query: 64 PSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN 123
P L+ + ++ S+ ++++ + GR+++ ++ GD + VEKP + ++ N
Sbjct: 171 PKLSYKPNPSIPHVSDCLELKEDSQFGRYLVTNRNLSAGDVVIVEKPFSRLLAAQLRYLN 230
Query: 124 CNHCCTSIS-APIPCNECILAVYCSESCRREAWLRYHRWEC 163
C++C + + IPC C ++CSE+CR++A YH EC
Sbjct: 231 CDYCHQDVFLSLIPCESCSFTMFCSENCRKKANEGYHAVEC 271
>gi|73967367|ref|XP_548315.2| PREDICTED: SET and MYND domain-containing protein 4 [Canis lupus
familiaris]
Length = 794
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 62/135 (45%), Gaps = 19/135 (14%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP-----------CSMSN----CNHCCT 129
KGR++IA +D+ G+ L E V+ +PP ++N C+ C
Sbjct: 243 KGRYLIATKDILPGELLVKEDAFVSVLNPGEMPPGLHHGLENKWDTRITNGDLYCHRCLK 302
Query: 130 SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
A +PC+ C A YCS C ++AW YH+ EC LL H+AL+ L++
Sbjct: 303 HTLATVPCDGCSYAKYCSNECMQQAWDLYHQIECSLGGLLLTLGIFCHIALRSTLLARFE 362
Query: 190 DRYKEVYHLETHLQD 204
D K + L + + +
Sbjct: 363 DASKVIRKLYSEISN 377
>gi|170040654|ref|XP_001848107.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864246|gb|EDS27629.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 561
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 52/108 (48%), Gaps = 2/108 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
+ M+ GR ++A + GD + EK V V+ P C+HC S + IPC +
Sbjct: 178 VGMKQYPGSGRGLVAERNFKTGDVILDEKTVISVVGSPFKFLYCSHCGISNQHSLIPCPD 237
Query: 140 CILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLILVS 186
C++ +YCSE C E HR+EC GA +++A KL L
Sbjct: 238 CVMYMYCSEECLAEDKRLTHRFECGFGAQSENITFNGSNIAPKLFLYG 285
>gi|301765320|ref|XP_002918077.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Ailuropoda melanoleuca]
gi|281342357|gb|EFB17941.1| hypothetical protein PANDA_006471 [Ailuropoda melanoleuca]
Length = 793
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 63/142 (44%), Gaps = 18/142 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP----------- 118
E+ + AS+ + ++ +GR++IA +D+ G+ L E V+ P
Sbjct: 224 ENEQIASASSSVGLRVDPLRGRYLIATKDILPGELLVKEDAFVSVLNPGERPPLHHGLES 283
Query: 119 ---CSMSN----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
++N C+ C A +PC+ C A YCS C ++AW YH EC LL
Sbjct: 284 KWDTRVTNGDLYCHRCLKHTLAMVPCDGCSYAKYCSHECMQQAWDLYHNIECPLGGLLLA 343
Query: 172 AVGIAHLALKLILVSSHSDRYK 193
H+AL+ L++ D K
Sbjct: 344 LGVFCHVALRSTLLARFEDAGK 365
>gi|157130709|ref|XP_001661975.1| hypothetical protein AaeL_AAEL011838 [Aedes aegypti]
gi|108871817|gb|EAT36042.1| AAEL011838-PA [Aedes aegypti]
Length = 625
Score = 57.4 bits (137), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 42/76 (55%), Gaps = 3/76 (3%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSE 148
GR V+A + GD + EK + I P NCNHC + + IPC +C+ ++CS+
Sbjct: 249 GRSVVAERNFKPGDVILNEKAMLTAISPEVKYKNCNHCSMENFHSLIPCPKCVSVMFCSK 308
Query: 149 SCRREAWLRY-HRWEC 163
C E LRY HR+EC
Sbjct: 309 EC-LEKGLRYSHRFEC 323
>gi|198437054|ref|XP_002123001.1| PREDICTED: similar to SET and MYND domain containing 4 [Ciona
intestinalis]
Length = 773
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 8/149 (5%)
Query: 16 LCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLM 75
L + +D + ++ + + +++E E V S++ + E+ ++
Sbjct: 128 LTKWKDGIGHELQDKLKKVNRNKETKFEWREYIVRRKEKKKGSNLNN-------ENKSVR 180
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAP 134
S K+KM + KGRH + + L E V+ P ++C++C T S+
Sbjct: 181 NLSAKVKMNITPQKGRHYFTTFNTETNECLLEEVAYLGVLNPEFFSTHCSYCLTPCKSSG 240
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
IPC C +YC E CR AW YH EC
Sbjct: 241 IPCLGCSCTIYCDEQCRISAWKIYHWMEC 269
>gi|335309609|ref|XP_003131876.2| PREDICTED: SET and MYND domain-containing protein 4-like [Sus
scrofa]
Length = 788
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 61/132 (46%), Gaps = 18/132 (13%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPP----------CSMS 122
AS+ + ++ KGR ++A +D+ G+ L E V+ +PP ++
Sbjct: 231 ASSSVSLRVDPLKGRSLVATKDILPGELLVKEDAFVSVLNPGEMPPRHEGLESKWDTRVT 290
Query: 123 N----CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHL 178
N C+ C A +PC+ C A YCS+ C ++AW YH EC LL H+
Sbjct: 291 NGDLYCHRCLKPTLATVPCDGCSYAKYCSQECLQQAWDLYHSVECSLGGLLLTLGVFCHV 350
Query: 179 ALKLILVSSHSD 190
AL+ L++ D
Sbjct: 351 ALRSTLLARFED 362
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 57.0 bits (136), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 67/126 (53%), Gaps = 5/126 (3%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ S KG + A + +G+ ++V +P+A+ + S + C+ C T + C++
Sbjct: 4 KLERFVSPGKGNGLRATVRIEEGELVYVTEPLAYCVSQKQSRNVCHQCFTRHETLLRCSQ 63
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI--AHLALKLI--LVSSHSDRYKEV 195
C +A YCS +C+R AW H+ EC LL + LA +LI ++SS S +E+
Sbjct: 64 CKMARYCSATCQRRAW-SDHKRECKCLQSLLPRIPTDSVRLAARLIFAMLSSCSSSSEEL 122
Query: 196 YHLETH 201
Y LE H
Sbjct: 123 YTLEEH 128
>gi|170060284|ref|XP_001865734.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878798|gb|EDS42181.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 647
Score = 56.6 bits (135), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 61/141 (43%), Gaps = 26/141 (18%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
+ +GR D+ L + P A V+ S+C+HCC + C C+ +Y
Sbjct: 234 TETEGRFARTRIDLRPNHVLLRQLPHAAVVTGEFCESHCDHCCRRVEILFSCPRCMDVIY 293
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG---IAHLALKLI-----------------LV 185
CS C++ A YHR+EC G L L+ G +A LAL+++ L
Sbjct: 294 CSSECQKTAQESYHRFEC-GFLSYLKNSGANVVAMLALRIVTQKSLDYFVEIKDELGSLG 352
Query: 186 SSHSDR-----YKEVYHLETH 201
S DR Y+ +Y+ TH
Sbjct: 353 SEEVDRLPVDDYRRIYNFVTH 373
>gi|194883800|ref|XP_001975985.1| GG20247 [Drosophila erecta]
gi|190659172|gb|EDV56385.1| GG20247 [Drosophila erecta]
Length = 573
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ S +GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 187 LELRESADEGRFVVTNRDLVVGDLVAVEQPFCSTLLAPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+++A YHR+EC
Sbjct: 247 CCSVMFCSEECKQQAMSTYHRFEC 270
>gi|242007004|ref|XP_002424332.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507732|gb|EEB11594.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 572
Score = 56.6 bits (135), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 63/129 (48%), Gaps = 5/129 (3%)
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
+P L +S SN +++ R +IA +D++ G+ L VEKP + +++P
Sbjct: 162 GEIPPLDREKSSFFDSTSNVVELSEDIYSNRKLIANDDINVGEVLIVEKPYSSIVIPKHF 221
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHL 178
++C++C IPC+ C ++C E C A H EC G L L A L
Sbjct: 222 YTHCSNCFRRSLNLIPCSNCCTGMFCREECLDIA---NHVIECSYLGILSELNADRKEML 278
Query: 179 ALKLILVSS 187
ALK+++ +S
Sbjct: 279 ALKILIDAS 287
>gi|170028158|ref|XP_001841963.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167871788|gb|EDS35171.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 289
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 3/101 (2%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++A +D++ GD + +P+ I NC+HC S + IPC +C+ ++C E
Sbjct: 62 RILVAEKDINPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPDCVFFMFCGEE 121
Query: 150 CRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
CR+++W +H++EC E + AL L V+S S+
Sbjct: 122 CRQKSWKLWHQFECPHLPG--ETSQVEGRALPLAAVTSRSN 160
>gi|158300865|ref|XP_320681.4| AGAP011835-PA [Anopheles gambiae str. PEST]
gi|157013367|gb|EAA00753.4| AGAP011835-PA [Anopheles gambiae str. PEST]
Length = 575
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
+++ S GR+V ++ GD + +E+P + ++ C+ C SI +PC
Sbjct: 174 LELSESEQFGRYVATNRNLEAGDVVIIEQPFSRLLRDIYRHVRCDFCHRESIFTLLPCEN 233
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C +A+YCS SC +A +YHR+EC
Sbjct: 234 CTVAMYCSGSCASQAARQYHRYEC 257
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
+ +KGR + A + + GDT+ E+PV +++ C+ C +S C+ C A Y
Sbjct: 13 TESKGRGLRAAKPLKSGDTILSEQPVVYMLSNMLRGQRCDFCLEKLSDLQRCSRCKFARY 72
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAV 173
C SC+R AW R H+ EC R+ V
Sbjct: 73 CGASCQRAAW-RIHKSECERLKRVFPRV 99
>gi|170058966|ref|XP_001865155.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877850|gb|EDS41233.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 559
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 1/90 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNE 139
I M+ S+ ++A + + GD + EK +L S S C HC +S IPC
Sbjct: 174 IAMRQSDGNEDTLVAEREFNTGDVILNEKIELCAVLFKKSNSYCAHCSSSFKQNLIPCPG 233
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRL 169
C+ +YC E CR+E + HR+EC A +L
Sbjct: 234 CVTVMYCGEECRKEDFRTVHRFECSIATKL 263
>gi|125810725|ref|XP_001361596.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
gi|195154214|ref|XP_002018017.1| GL16991 [Drosophila persimilis]
gi|54636772|gb|EAL26175.1| GA21031 [Drosophila pseudoobscura pseudoobscura]
gi|194113813|gb|EDW35856.1| GL16991 [Drosophila persimilis]
Length = 574
Score = 56.2 bits (134), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ S +GR V+ D+ GD + VE+P +LPP C C + IPC+
Sbjct: 187 LELRESPDEGRFVVTNRDLVVGDLVAVEQPFCSTLLPPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A +HR+EC
Sbjct: 247 CCSVMFCSEECKLRATSSFHRFEC 270
>gi|170029113|ref|XP_001842438.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880645|gb|EDS44028.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 567
Score = 56.2 bits (134), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 44/89 (49%), Gaps = 1/89 (1%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-AP 134
C ++ I+M+ + GR + A D G + EK + S+C C ++ +
Sbjct: 177 CVADGIRMKVYDEFGRGMFAERDFQAGAVILDEKAALAITTLETRYSHCGRCVKQLTYSL 236
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
IPC+ C+ +YCSE C RE HR+EC
Sbjct: 237 IPCSGCVSTLYCSEECLREDERFAHRFEC 265
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 49/96 (51%), Gaps = 2/96 (2%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
+ +++ S KGR + A +++ GD LF E P A V+ + C+ C C
Sbjct: 2 DNVEVFDSPGKGRGLRATKELLSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDKLQRCG 61
Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
+C A YC ++C+ W + H+ EC GA++ A+G
Sbjct: 62 QCKFAQYCDKTCQSAGW-KEHKLEC-GAIKAFGAIG 95
>gi|157115764|ref|XP_001652685.1| hypothetical protein AaeL_AAEL007348 [Aedes aegypti]
gi|108876753|gb|EAT40978.1| AAEL007348-PA [Aedes aegypti]
Length = 543
Score = 55.8 bits (133), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 70/155 (45%), Gaps = 19/155 (12%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCS 120
ES + + +++ ++ GR ++ ++ + + EK V+FV +PP
Sbjct: 220 ESDDGLLSFPELEEKSDTVHGRFIVTKNEIKADEPIVQEKAVSFVPVYDPRGHSEIPPF- 278
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLA 179
+C C P PC C A YCS +C +E HR+EC G + L +GIAHL
Sbjct: 279 --DCQFCGEVNIIPFPCPSCGRACYCSVACCKEH-SELHRYECPGYEKHLWYLIGIAHLG 335
Query: 180 LKLILVSSHSDRYKEVYHLETHLQDMRPEDLYQYV 214
++ L D + H L++ E ++QYV
Sbjct: 336 MRSFL-----DGFASSTHNLEELKECNAEKMFQYV 365
>gi|194753329|ref|XP_001958966.1| GF12292 [Drosophila ananassae]
gi|190620264|gb|EDV35788.1| GF12292 [Drosophila ananassae]
Length = 574
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ S +GR ++ D+ GD + VE+P +L P C C + IPC+
Sbjct: 187 LELRESGDEGRFIVTNRDLVVGDLVAVEQPFCSTLLSPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+++A +HR+EC
Sbjct: 247 CCSVMFCSEECKQQAMSTFHRFEC 270
>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
Length = 554
Score = 55.5 bits (132), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 43/74 (58%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A+ + ++ + KGRH +A E++ +G T+ +E AFV+ NC+ C ++ +
Sbjct: 57 ANYPVHLKNTKTKGRHAVASEELPEGATVSLETATAFVVRSEFIDQNCHFCLADLTQKVI 116
Query: 137 CNECILAVYCSESC 150
C+ C L+ YCSE+C
Sbjct: 117 CSNCSLSYYCSETC 130
>gi|19922072|ref|NP_610730.1| CG8378 [Drosophila melanogaster]
gi|5052674|gb|AAD38667.1|AF145692_1 BcDNA.LD29892 [Drosophila melanogaster]
gi|7303523|gb|AAF58578.1| CG8378 [Drosophila melanogaster]
gi|220953632|gb|ACL89359.1| CG8378-PA [synthetic construct]
Length = 573
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 187 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 247 CCSTMFCSEECKSIAMQTYHRYEC 270
>gi|429892168|gb|AGA18626.1| CG8378, partial [Drosophila melanogaster]
Length = 556
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253
>gi|429892184|gb|AGA18634.1| CG8378, partial [Drosophila melanogaster]
Length = 556
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253
>gi|429892182|gb|AGA18633.1| CG8378, partial [Drosophila melanogaster]
Length = 555
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 170 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 229
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 230 CCSTMFCSEECKSIAMQTYHRYEC 253
>gi|429892186|gb|AGA18635.1| CG8378, partial [Drosophila melanogaster]
Length = 554
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251
>gi|429892176|gb|AGA18630.1| CG8378, partial [Drosophila melanogaster]
Length = 553
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251
>gi|256070417|ref|XP_002571539.1| hypothetical protein [Schistosoma mansoni]
gi|350645320|emb|CCD59943.1| hypothetical protein Smp_000700 [Schistosoma mansoni]
Length = 782
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ G H++A +++ G+ L ++KP + + + NC C PIPC C
Sbjct: 270 VRLNYNDEFGWHIVATKNIQPGELLILDKPYSCRLHYSRLLLNCYRCSKRCINPIPCRSC 329
Query: 141 ILAVYCSESCRREAW 155
+CSESC +EAW
Sbjct: 330 TQVGFCSESCEQEAW 344
>gi|429892154|gb|AGA18619.1| CG8378, partial [Drosophila melanogaster]
Length = 556
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 169 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 228
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 229 CCSTMFCSEECKSIAMQTYHRYEC 252
>gi|429892172|gb|AGA18628.1| CG8378, partial [Drosophila melanogaster]
Length = 555
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 169 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 228
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 229 CCSTMFCSEECKSIAMQTYHRYEC 252
>gi|429892164|gb|AGA18624.1| CG8378, partial [Drosophila melanogaster]
Length = 553
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 168 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 227
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 228 CCSTMFCSEECKSIAMQTYHRYEC 251
>gi|170041129|ref|XP_001848327.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864692|gb|EDS28075.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 592
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 51/87 (58%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
+N ++++ + GR ++ D+ GD + +EKP + ++ +C++C +I + IP
Sbjct: 188 ANCLELKHEDQYGRCIVTNRDLKVGDVVIIEKPHSTLLDEELRYLHCDYCNQEAILSLIP 247
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C +C +A++CS +C + A YHR EC
Sbjct: 248 CKQCSIAMFCSNACYQSALDSYHRLEC 274
>gi|328721096|ref|XP_003247209.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 611
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++++S GR V D++ GD + +++P + L S C +C S PIPC +C
Sbjct: 203 LELKSSKNMGRGVYTTRDINPGDVVAIDEPYIYGGLQ-LHNSCCRNCLKSPVVPIPCTKC 261
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLI 183
L YCSE+C ++ H EC + GI LALK +
Sbjct: 262 SLVYYCSENCLHKSMKDGHNLECPIIFFIRSLPGITDLNQLALKWL 307
>gi|195333608|ref|XP_002033482.1| GM21333 [Drosophila sechellia]
gi|194125452|gb|EDW47495.1| GM21333 [Drosophila sechellia]
Length = 573
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/87 (33%), Positives = 47/87 (54%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
+N ++++ + A+GR V+ D+ GD + VE+P +L P C C + IP
Sbjct: 184 ANCLELRETAAEGRFVVTNRDLVVGDLVCVEEPFCSTLLTPMRYIRCATCKRENYLTLIP 243
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C ++CSE C+ A YHR+EC
Sbjct: 244 CDSCCSTMFCSEECKSIAMQTYHRFEC 270
>gi|429892162|gb|AGA18623.1| CG8378, partial [Drosophila melanogaster]
gi|429892174|gb|AGA18629.1| CG8378, partial [Drosophila melanogaster]
Length = 559
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 174 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 233
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 234 CCSTMFCSEECKSIAMQTYHRYEC 257
>gi|429892160|gb|AGA18622.1| CG8378, partial [Drosophila melanogaster]
Length = 559
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 174 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 233
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 234 CCSTMFCSEECKSIAMQTYHRYEC 257
>gi|429892156|gb|AGA18620.1| CG8378, partial [Drosophila melanogaster]
gi|429892180|gb|AGA18632.1| CG8378, partial [Drosophila melanogaster]
Length = 558
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 172 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 231
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 232 CCSTMFCSEECKSIAMQTYHRYEC 255
>gi|429892178|gb|AGA18631.1| CG8378, partial [Drosophila melanogaster]
Length = 556
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254
>gi|429892170|gb|AGA18627.1| CG8378, partial [Drosophila melanogaster]
Length = 556
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254
>gi|429892158|gb|AGA18621.1| CG8378, partial [Drosophila melanogaster]
gi|429892166|gb|AGA18625.1| CG8378, partial [Drosophila melanogaster]
Length = 557
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ + A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 171 LELRETAAEGRFVVTNRDLAVGDLVSVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 230
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 231 CCSTMFCSEECKSIAMQTYHRYEC 254
>gi|355720917|gb|AES07092.1| SET and MYND domain containing 4 [Mustela putorius furo]
Length = 576
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 66/151 (43%), Gaps = 22/151 (14%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI----LPPC--SMSN------ 123
CAS+ + + +GR++IA ED+ G+ L E V+ PP ++N
Sbjct: 18 CASSSVSLCMDPLRGRYLIATEDILPGELLLKEDAFVSVLNPGERPPLHQGLANKWDTRV 77
Query: 124 ------CNHCCTSISAPIPCNEC----ILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
C+ C T A +PC+ C A YCS C ++AW YH EC LL
Sbjct: 78 TNGDLYCHRCLTHTLAMVPCDGCSYAKCYAKYCSHECMQQAWGLYHNRECPLGGLLLTLG 137
Query: 174 GIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
H+AL+ L++ K + L + + +
Sbjct: 138 IFCHVALRATLLARFEGASKVIRKLSSEISN 168
>gi|431891028|gb|ELK01907.1| SET and MYND domain-containing protein 4 [Pteropus alecto]
Length = 776
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 50/106 (47%), Gaps = 24/106 (22%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFV-ILPPCSMSN-------------------CNHCC 128
KGR++IA +D+ G+ L E AFV +L P MS C+ C
Sbjct: 237 KGRYLIATKDILPGELLVRED--AFVSVLNPGEMSPLCYGLDGKWNTRVTNGDLYCHRCL 294
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
A +PC+ C A YCS+ C ++AW YH EC +L E VG
Sbjct: 295 KHTLATVPCDGCSYAKYCSQECMQQAWDLYHSIEC--SLARFEDVG 338
>gi|170028853|ref|XP_001842309.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877994|gb|EDS41377.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 566
Score = 55.1 bits (131), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 48/94 (51%), Gaps = 2/94 (2%)
Query: 78 SNKIKMQTSNAK-GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-I 135
++ I M+ N + GR +IA +D G + EK V + SNC HC ++ I
Sbjct: 176 ADGIAMKVYNDEHGRGLIAEKDFQAGAVIMNEKAVLTLTTLDAQYSNCGHCSVKLTYNMI 235
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
PC C+ +YCSE C +E HR+EC A +L
Sbjct: 236 PCPGCVSMLYCSEECLQENERFTHRFECAIADKL 269
>gi|195582627|ref|XP_002081128.1| GD10841 [Drosophila simulans]
gi|194193137|gb|EDX06713.1| GD10841 [Drosophila simulans]
Length = 694
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNE 139
++++ ++A+GR V+ D+ GD + VE+P +L P C C + IPC+
Sbjct: 187 LELRETSAEGRFVVTNRDLVVGDLVCVEEPFCSTLLTPMRYIRCATCKRENYLTLIPCDS 246
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C ++CSE C+ A YHR+EC
Sbjct: 247 CCSTMFCSEECKSIAMQTYHRFEC 270
>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
P131]
Length = 582
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 11/92 (11%)
Query: 129 TSISAPIP-----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
TSI P P C+ C + YCS++C+R AW +H+ EC L A+G LA+ +
Sbjct: 103 TSIMPPKPTAMLLCSGCKVGRYCSQACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRL 162
Query: 184 LVSSHSDRYKE-----VYHLETHLQDMRPEDL 210
L+ R+ E + LE+H ++ EDL
Sbjct: 163 LLLKKDGRFTERQWKAIMQLESHFEEQN-EDL 193
>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
Y34]
Length = 581
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 11/92 (11%)
Query: 129 TSISAPIP-----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
TSI P P C+ C + YCS++C+R AW +H+ EC L A+G LA+ +
Sbjct: 103 TSIMPPKPTAMLLCSGCKVGRYCSQACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRL 162
Query: 184 LVSSHSDRYKE-----VYHLETHLQDMRPEDL 210
L+ R+ E + LE+H ++ EDL
Sbjct: 163 LLLKKDGRFTERQWKAIMQLESHFEEQN-EDL 193
>gi|326421768|gb|ADZ73912.1| SET and MYND domain containing protein 4 [Grampus griseus]
Length = 212
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 40/75 (53%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C SAP+PC+ C A YCS+ C ++AW RYH EC LL H+AL+
Sbjct: 19 CHRCLKHTSAPVPCDGCSYAKYCSQECLQQAWDRYHSIECSLGGLLLTLGAFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|170040662|ref|XP_001848111.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864250|gb|EDS27633.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 567
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSES 149
R + A D GD + EK V ++ P NC+HC + + IPC +C+ A+YCSE
Sbjct: 191 RGLEAERDFRTGDVILDEKAVLSIVDVPLKYYNCSHCSAENQHSLIPCPDCVFAMYCSEE 250
Query: 150 CRREAWLRYHRWECHGALRL 169
C HR+EC A +L
Sbjct: 251 CLERDKRLVHRFECGLATKL 270
>gi|195123617|ref|XP_002006300.1| GI20970 [Drosophila mojavensis]
gi|193911368|gb|EDW10235.1| GI20970 [Drosophila mojavensis]
Length = 574
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 48/87 (55%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+N ++M+ + +GR ++ +D+ GD + E+P +L P C++C IP
Sbjct: 184 ANCLEMRETPDEGRFIVTTQDLVVGDIVATEEPFCSSLLAPMRYIRCSNCKEERYLTLIP 243
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C ++CSE C++ A +H++EC
Sbjct: 244 CDNCCSVMFCSEECKKRANSTFHKYEC 270
>gi|170049075|ref|XP_001870867.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167871002|gb|EDS34385.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 560
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+ I M GR ++A +D + GD + EK I + CN C + IP
Sbjct: 178 AKGIAMGYDPRFGRGLVAEKDFNAGDIILDEKLELCGIDYNVTFQTCNQCSSRRYHVLIP 237
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C +C ++CSE CR W +HR+EC A +L
Sbjct: 238 CPKCPFFMFCSEECRELNWKLFHRFECDVATKL 270
>gi|157104022|ref|XP_001648222.1| hypothetical protein AaeL_AAEL003992 [Aedes aegypti]
gi|108880438|gb|EAT44663.1| AAEL003992-PA [Aedes aegypti]
Length = 594
Score = 54.3 bits (129), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNE 139
I+++ + GR++I +++ GD + EKP + +++ NC++C IPC
Sbjct: 193 IELKEDDQFGRYLITNRNLNAGDIVIEEKPFSSLLVNDHRYMNCDYCHDDKFLTLIPCKC 252
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C + ++CS C+++A YHR EC
Sbjct: 253 CTVTMFCSTKCQQKAMDNYHRIEC 276
>gi|170060884|ref|XP_001865999.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879236|gb|EDS42619.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 378
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 51/87 (58%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT-SISAPIP 136
+N ++++ + GR ++ D+ GD + +EKP + ++ +C++C +I + IP
Sbjct: 182 ANCLELKHEDQYGRCIVTNRDLKVGDVVIIEKPHSTLLDEELRYLHCDYCNQEAILSLIP 241
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C +C +A++CS +C + A YHR EC
Sbjct: 242 CKQCSIAMFCSNACYQSALDSYHRLEC 268
>gi|170072162|ref|XP_001870109.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167868275|gb|EDS31658.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 481
Score = 53.9 bits (128), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 1/93 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+ I M GR ++A +D + GD + EK I + CN C + IP
Sbjct: 97 AKGIAMGYDPRFGRGLVAEKDFNAGDIILDEKLELCGIDYNVTFQTCNQCSSRRYHVLIP 156
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C +C ++CSE CR W +HR+EC A +L
Sbjct: 157 CPKCPFFMFCSEECRELNWKLFHRFECDVATKL 189
>gi|170049077|ref|XP_001870868.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167871003|gb|EDS34386.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 554
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
+ I M GR ++A +D + GD + E F +C+ C ++ IP
Sbjct: 172 AKGIAMGYDPRFGRGLVAEKDFNAGDLILDETSELFAHDFNLCFKHCSQCSANVCQILIP 231
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C+L +YCSE CR W +HR+EC
Sbjct: 232 CRTCVLFMYCSEECRELHWKLFHRFEC 258
>gi|326421786|gb|ADZ73921.1| SET and MYND domain containing protein 4 [Moschus moschiferus]
Length = 212
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/85 (36%), Positives = 45/85 (52%), Gaps = 2/85 (2%)
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH-GALRLLEAVGIAHLALK 181
+C+ C AP+PC+ C A YCS C ++AW RYH EC GAL L V H+AL+
Sbjct: 18 HCHRCLRHTLAPVPCDGCSYAKYCSRECMQQAWDRYHSVECSLGALLLTLGV-FCHVALR 76
Query: 182 LILVSSHSDRYKEVYHLETHLQDMR 206
L++ D + + L + + R
Sbjct: 77 STLLARFEDVGEVIKKLCGEMNNQR 101
>gi|195024336|ref|XP_001985854.1| GH20862 [Drosophila grimshawi]
gi|193901854|gb|EDW00721.1| GH20862 [Drosophila grimshawi]
Length = 574
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+N ++M+ + +GR ++ D+ GD + +E+P +L P C C IP
Sbjct: 184 ANCLEMRETPDEGRFIVTNRDLVVGDVIAIEQPFCSSLLAPMRYIRCATCKEERYLTLIP 243
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C A++CSE C+ A +H++EC
Sbjct: 244 CDSCCSAMFCSEECKERATSTFHKYEC 270
>gi|358342712|dbj|GAA50167.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
Length = 761
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 76/198 (38%), Gaps = 34/198 (17%)
Query: 51 STSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
ST+S + LP G S+ + + S KGR+V+A E+ GD L VE
Sbjct: 233 STASDIDAYTHELPEPRYGFQEAYPGLSSGLLVTESEQKGRYVVATEEFQPGDVLAVEPA 292
Query: 111 VAF----------------VILPPCSMSNCNHCCTSISA-PIPCNECILAVYCSES---- 149
+ ++LP MS C C T +SA C C YC+
Sbjct: 293 SGWPANGPQTTVNASGDCLLLLPSQRMSKCTACLTPLSAIGYLCPHCCDTAYCAPPSKCF 352
Query: 150 ----CRREAWLRYHRWECHGA----LRLLEAVGIAHLALKLILVSSH-----SDRYKEVY 196
R + + Y + H A + LL + G+ HL +L + S + +
Sbjct: 353 FRHLTRSDEMISYSQPPWHTAECRFMFLLNSTGLGHLCFRLAWLRRKNHQQISSQQLTID 412
Query: 197 HLETHLQDMRPEDLYQYV 214
L +H + EDL+QY
Sbjct: 413 RLVSHFDEFEVEDLFQYA 430
>gi|270012868|gb|EFA09316.1| hypothetical protein TcasGA2_TC030778 [Tribolium castaneum]
Length = 568
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 13/104 (12%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS+KI ++TS GR+V++ D+ G+ L E P A L P AP+
Sbjct: 205 ASSKITLKTSPDVGRYVVSNVDIATGEILVAE-PAAVACLNPEKF-----------APVG 252
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
C C +CS CR +A YH++EC LL G++ L+L
Sbjct: 253 CPHCSSVAFCSPKCRDDAITTYHKYECK-FFDLLLGSGMSVLSL 295
>gi|170050386|ref|XP_001861286.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872021|gb|EDS35404.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 456
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 45/94 (47%), Gaps = 1/94 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
+ M+ GR +IA D GD + EKP +NC +C T + + +PC
Sbjct: 46 VGMKLYKQFGRGLIAERDFQVGDIILDEKPTLVTNALGYRYTNCANCSTGFNHSLLPCPG 105
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
C+ +YCSE C + HR+EC A +L A+
Sbjct: 106 CVSYMYCSEECLQRDKRFVHRFECGMAEKLRNAM 139
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 53.5 bits (127), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 5/120 (4%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KG + A + KG+ + +P+A + + C+HC T + + C++C +A Y
Sbjct: 11 SPGKGNGLRAAGRIRKGELVISAEPLACCVSNRLAKEVCHHCFTRHESLLRCSQCKMARY 70
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGI--AHLALKLI--LVSSHSDRYKEVYHLETH 201
C+ +C+++AW H+ EC LL + LA KLI L+S +E+Y LE H
Sbjct: 71 CNTTCQKQAW-SGHKRECKCLQSLLPRLPTDSVRLAAKLIFALLSPSKSSSEELYTLEEH 129
>gi|170045012|ref|XP_001850118.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167868070|gb|EDS31453.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 558
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 54/117 (46%), Gaps = 7/117 (5%)
Query: 73 TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEK-PVAFVILPPCSMS--NCNHCCT 129
T+ + I M+ GR ++A D GD + EK + FV C + NC HC +
Sbjct: 158 TIPFLAKGITMKQYGEMGRGLVAERDFTTGDVILDEKVDLCFV---SCFRNYLNCGHCGS 214
Query: 130 SI-SAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV 185
+ IPC C++ +YCSE CR + HR+EC A +L + + + +
Sbjct: 215 EFCRSLIPCLGCVMFMYCSEKCREQDLRITHRFECSVASKLFSVTDLTAMMMARLFF 271
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 53.1 bits (126), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A ++ GD LF E P A V+ + S C+ C C +C
Sbjct: 4 VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC ++C+R W H+ EC
Sbjct: 64 RFAQYCDKTCQRAGW-EEHKLEC 85
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A ++ GD LF E P A V+ + S C+ C C +C
Sbjct: 4 VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC ++C+R W H+ EC
Sbjct: 64 RFAQYCDKTCQRAGW-EEHKLEC 85
>gi|170052730|ref|XP_001862354.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873576|gb|EDS36959.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 569
Score = 53.1 bits (126), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 12/89 (13%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNH---CCTSI--SAPIPCNECILAV 144
GR ++A D GD + EKP+ +S+ H CC + + IPC C+ A+
Sbjct: 197 GRGLVAERDFKPGDIILDEKPMLM------HLSSWRHYCSCCAAFFRGSLIPCPGCVNAM 250
Query: 145 YCSESCRREAWLRYHRWECHGALRLLEAV 173
YCSE CR++ + +HR++C G + LE V
Sbjct: 251 YCSEECRQQDYRYWHRFQC-GFVEQLEHV 278
>gi|157115760|ref|XP_001652683.1| hypothetical protein AaeL_AAEL007340 [Aedes aegypti]
gi|108876751|gb|EAT40976.1| AAEL007340-PA [Aedes aegypti]
Length = 363
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/145 (27%), Positives = 65/145 (44%), Gaps = 19/145 (13%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCSMSNCNHCCTS 130
+++ ++ GR ++ ++ + + EK V+FV +PP +C C
Sbjct: 210 ELEEKSDTVHGRFIVTKNEIKADEPIVQEKAVSFVPVYDPRGHSEIPPF---DCQFCGEV 266
Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVGIAHLALKLILVSSHS 189
P PC C A YCS +C +E HR+EC G + L +GIAHL ++ L
Sbjct: 267 NIIPFPCPSCGRACYCSVACCKEH-SELHRYECPGYEKHLWYLIGIAHLGMRSFL----- 320
Query: 190 DRYKEVYHLETHLQDMRPEDLYQYV 214
D H L++ E ++QYV
Sbjct: 321 DGLASSTHNLEELKECNAEKMFQYV 345
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K+++ TS KGR + A ++ D +F E+ + V+ + C+ C C +
Sbjct: 8 KVEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQ 67
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A YC +C+++AWL H+ EC
Sbjct: 68 CKFAHYCDRTCQKDAWLN-HKHEC 90
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 52.8 bits (125), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 49/92 (53%), Gaps = 9/92 (9%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS----MSNCNHCCTSISA 133
+ K+ + T KGR ++A D + G+ + ++P +V +P S C+ C T+I+
Sbjct: 10 NRKLSLSTLPEKGRSLLATRDFYPGEVIISQEP--YVCVPNNSSVSPQKRCDGCFTTINN 67
Query: 134 PI--PCNECILAVYCSESCRREAWLRYHRWEC 163
+ C+ C LA YC +C+R W + HR EC
Sbjct: 68 NVLSRCSRCQLAFYCGTACQRSEW-KLHRLEC 98
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 66/152 (43%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AGRGLTVASIPGKGRGLIATFTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
C+ C + YCS +C++E W + H+ EC L E + LALK
Sbjct: 72 CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTENRKKMLTPTIRLMVRLALKRKLQNE 130
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
++ SS D Y V LE+H+ + L Y
Sbjct: 131 KVIPSSSIDNYNLVDALESHISKVDENQLVLY 162
>gi|260826490|ref|XP_002608198.1| hypothetical protein BRAFLDRAFT_90370 [Branchiostoma floridae]
gi|229293549|gb|EEN64208.1| hypothetical protein BRAFLDRAFT_90370 [Branchiostoma floridae]
Length = 484
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 32/55 (58%), Gaps = 2/55 (3%)
Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLI 183
+SAP+PC +C A YCS C AW +YH EC G + G+ HLAL+L+
Sbjct: 140 VSAPLPCQKCSTARYCSHECADLAWTQYHGVECQYLGLIHSSGVGGMGHLALRLV 194
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 4/73 (5%)
Query: 57 SSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVIL 116
+ +P+ E L+ AS S +GRH +A +D+ GD L VEKP V+L
Sbjct: 284 NGGFFRVPAEEVAEEDQLLVAS----YLQSEQRGRHAVAAQDIRVGDVLIVEKPYGSVVL 339
Query: 117 PPCSMSNCNHCCT 129
P ++C++CC+
Sbjct: 340 PEQGDTHCDYCCS 352
>gi|413933332|gb|AFW67883.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 190
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A+ + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
C+ C + YCS +C++E W + H+ EC L E + LALK
Sbjct: 72 CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
++ SS D Y V LE+H+ + L Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+R+AWL H+ EC
Sbjct: 64 KFAYYCDRTCQRDAWLN-HKNEC 85
>gi|170051200|ref|XP_001861657.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167872534|gb|EDS35917.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 558
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIP 136
++ I M+ GR + A +D G+ + E P+ + L + CN+C S IP
Sbjct: 175 ADGIVMKRYPGFGRGLAAEKDFQPGNIILEESPM--LTLVGSRYNYCNNCAGSFQGCLIP 232
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
C C A+YCSE C + + +HR+EC G L+ V +A+
Sbjct: 233 CPGCTSAMYCSEECLGKDYRFWHRFEC-GLAEKLQNVMFGTMAM 275
>gi|170029111|ref|XP_001842437.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880644|gb|EDS44027.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 560
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 62/141 (43%), Gaps = 20/141 (14%)
Query: 65 SLACGESHTLM-CASNKIKMQTSN-----AKG----------RHVIAVEDVHKGDTLFVE 108
LA GES L+ C I ++T+ AKG R ++A D GD + E
Sbjct: 144 KLAAGESKGLIPCPLMYINVETNPKIPFLAKGIHMEVLPDFDRSMVADRDFRTGDVILDE 203
Query: 109 KPVAFVILPPCSMSNCNHCCT-SISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
K + V+ C C T + IPC C+ +YCSE C W HR+EC +
Sbjct: 204 KCILAVVDAERRYHRCGRCSTEGFYSLIPCPHCVSVMYCSEKCLEWDWKYVHRFECAVSD 263
Query: 168 RLLEAVGIAHLAL--KLILVS 186
+ L+ + H+A+ KL L
Sbjct: 264 K-LQYISYGHVAMGSKLFLYG 283
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 9 VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+R+AWL H+ EC
Sbjct: 69 KFAYYCDRTCQRDAWLN-HKNEC 90
>gi|170065531|ref|XP_001867978.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167862497|gb|EDS25880.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 562
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 51/117 (43%), Gaps = 3/117 (2%)
Query: 56 ASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI 115
S+ ++P G M A +I+M+ GR A D GD L EK + VI
Sbjct: 157 GGRSLGAVPCPRMGIKIPFMAA--QIRMEIFPELGRGFCAERDFRPGDVLLDEKSMMAVI 214
Query: 116 LPPCSMSNCNHCCTS-ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
NC+ C + + IPC EC+ +YC E CR HR+EC +L E
Sbjct: 215 DSGLRYFNCSWCSNANFHSLIPCLECVSYMYCCEECRDRDRETCHRFECGIGGKLRE 271
>gi|170044019|ref|XP_001849660.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867271|gb|EDS30654.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 589
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 49/95 (51%), Gaps = 5/95 (5%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM--SNCNHCCTSIS-AP 134
++ I M GR +IA +D + GD L EK V L C + +C+ C +
Sbjct: 201 ASGIGMNYEAKFGRGLIAQKDFNPGDILLDEK-VELCGLE-CDLIYQSCSQCSGEFGYSL 258
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
IPC C LA++CS+ C+ W YHR+EC A +L
Sbjct: 259 IPCPGCPLAMFCSKECQEMNWKLYHRFECGVASKL 293
>gi|195381641|ref|XP_002049556.1| GJ20689 [Drosophila virilis]
gi|194144353|gb|EDW60749.1| GJ20689 [Drosophila virilis]
Length = 574
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 46/87 (52%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIP 136
+N ++M+ + +GR ++ +D+ GD + E+P +L P C C IP
Sbjct: 184 ANCLEMRETPDEGRFIVTTQDLVVGDVVAAEEPFCSSLLAPMRYIRCATCKEERYLTLIP 243
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C ++CSE C++ A +H++EC
Sbjct: 244 CDSCCSVMFCSEECKQRANSTFHKYEC 270
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 52.4 bits (124), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+R+AWL H+ EC
Sbjct: 64 KFAYYCDRTCQRDAWLN-HKNEC 85
>gi|170036329|ref|XP_001846017.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878894|gb|EDS42277.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 12/113 (10%)
Query: 61 ASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS 120
A LP +A G I M+ GR ++A D + GD + EKP+ +
Sbjct: 188 AKLPFVAKG-----------IAMKYYPEFGRGLVAERDFNTGDVILDEKPMLMAVEHGLR 236
Query: 121 MSNCNHCCTSI-SAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
S C++C S + IPC C+ +YCSE C HR+EC RL A
Sbjct: 237 YSVCSYCTASFRKSLIPCPNCVSCMYCSEKCLEMDHRFTHRFECGIMERLRNA 289
>gi|393904729|gb|EJD73778.1| hypothetical protein LOAG_18824 [Loa loa]
Length = 420
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/129 (24%), Positives = 66/129 (51%), Gaps = 14/129 (10%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI--PCNECILAVYCSESCRREAWL 156
+++ + ++P A++ + + C++C I C++C A YC++ C+R AW
Sbjct: 11 IYQQRIRYFDEPFAYICINNQVNNYCSYCLRKPDKSILYKCSKCEFAKYCNKECQRLAWK 70
Query: 157 RYHRWEC------HGALRLLEAVGIAHLALKLILVSSHSDRYK-----EVYHLETHLQDM 205
++HR EC + L L E + ++ + K++ + + D+YK + L H D+
Sbjct: 71 KHHRMECQRLVMVYPNLPLTEVLFLSRIIDKVLFIEQNGDKYKWEKDRKWNSLIGHEDDI 130
Query: 206 RPEDLYQYV 214
R ++L +YV
Sbjct: 131 RNDEL-KYV 138
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 52.0 bits (123), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+R+AWL H+ EC
Sbjct: 64 KFAYYCDRTCQRDAWLN-HKNEC 85
>gi|157123231|ref|XP_001660071.1| hypothetical protein AaeL_AAEL009454 [Aedes aegypti]
gi|108874450|gb|EAT38675.1| AAEL009454-PA [Aedes aegypti]
Length = 586
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSE 148
GR ++A GD + EK I P NCN C + + IPC C+ +YCSE
Sbjct: 211 GRSLVAERPFKAGDVILHEKAWMAAISPELKYQNCNRCSIANFHSLIPCPNCVSIMYCSE 270
Query: 149 SCRREAWLRYHRWEC 163
C E HR+EC
Sbjct: 271 KCLEEDLKYSHRFEC 285
>gi|357464329|ref|XP_003602446.1| SET and MYND domain-containing protein [Medicago truncatula]
gi|355491494|gb|AES72697.1| SET and MYND domain-containing protein [Medicago truncatula]
Length = 785
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 3/78 (3%)
Query: 80 KIKMQTSNA--KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
+IK+Q+ + KGR ++ DV G + E+P A VIL C ++C++C + A +P
Sbjct: 214 QIKLQSVSVPDKGRGMVPPSDVSPGSLVHAEEPYAMVILKQCRETHCHYCLNDLPADRVP 273
Query: 137 CNECILAVYCSESCRREA 154
C C + +YCS C+ +A
Sbjct: 274 CISCSIPLYCSHQCQIKA 291
>gi|270011318|gb|EFA07766.1| hypothetical protein TcasGA2_TC005320 [Tribolium castaneum]
Length = 563
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 24/144 (16%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
++A D+ G+ L +EKP A +L +++C+ C IPC +C +A++CSE C+
Sbjct: 192 IVASRDIEIGEILAIEKPWA-AVLTTDRLTHCHECLCLCYNLIPCPKCTVALFCSELCKD 250
Query: 153 EAWLRYHRWECHGAL----RLLEAVGIAH---LALKLILVSS---------------HSD 190
+A YH++EC L R L I LA K+ +++ SD
Sbjct: 251 KA-RSYHKYECPILLTICYRALCEFAIDEAMLLAFKVAILAKDDYSNLDYMRLDGTYRSD 309
Query: 191 RYKEVYHLETHLQDMRPEDLYQYV 214
RY EV+ L+ + R E +Q +
Sbjct: 310 RYNEVHFLKPDGEFSRFEIFHQAI 333
>gi|170041205|ref|XP_001848363.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864728|gb|EDS28111.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 563
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 65/158 (41%), Gaps = 56/158 (35%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
GR++I D+ GD + +EKP + PC C +A++CS+
Sbjct: 199 GRYLITNRDLKAGDVVALEKPFS----------------------KPCKGCTIAMFCSDE 236
Query: 150 CRREAWLRYHRWEC---HGALRLLEAVGIAHL---------------ALKLILVSSHSDR 191
C+++A YHR+EC RL+E V + L AL+L++ S +
Sbjct: 237 CQQKAMEEYHRFECPILQDIHRLVEHVLVPALRLTIKTLTTFNFDLEALRLLMESDGKNT 296
Query: 192 ----------------YKEVYHLETHLQDMRPEDLYQY 213
Y+ VY+LE+H P+ L QY
Sbjct: 297 PNALDMDWTSIDPKETYEIVYNLESHRDKRSPDHLGQY 334
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A+ + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
C+ C + YCS +C++E W + H+ EC L E + LALK
Sbjct: 72 CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
++ SS D Y V LE+H+ + L Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 69/148 (46%), Gaps = 17/148 (11%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ KGR ++ +D + G+ + ++P V + S C+ C +S S C+ C
Sbjct: 13 LRVANVTEKGRSLVTTKDFNPGEVIISQEPYVCVPNNSATESRCDRCFSS-SNVKKCSAC 71
Query: 141 ILAVYCSESCRREAWLRYHRWECHG--------------ALRLLEAVGIAH-LALKLILV 185
+A YC SC++ W + HR EC ++RL+ + I L + I+
Sbjct: 72 QVAWYCGSSCQKLEW-KLHRIECEALSKLDKDRRKYVTPSIRLMVRLLIRRKLQSEKIIS 130
Query: 186 SSHSDRYKEVYHLETHLQDMRPEDLYQY 213
S+ +D Y V L H++D+ + L Y
Sbjct: 131 STATDNYDLVEALVAHMKDIDEKQLVLY 158
>gi|440801098|gb|ELR22123.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 397
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/149 (25%), Positives = 62/149 (41%), Gaps = 11/149 (7%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
E++E+ + + + S ++ + A + G + A + + GR +
Sbjct: 23 ERNEDGMLREAGIGIKSVVITSGGTAAGAQEV--GAASDASGADVDAGAEQAETMGRGMF 80
Query: 95 AVEDVHKGDTLFVEKPVAFVILPP----CSMSNCNHCCTSI---SAPIPCNECILAVYCS 147
A+ D GD + E P+A I P M +C+ C + +A + C C L YCS
Sbjct: 81 ALRDFKAGDVILEESPLA--IAPRDCVVAQMLHCSQCLRKLAASTAAVACPHCRLDPYCS 138
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIA 176
+ C EAW +HR C A E G A
Sbjct: 139 DRCAAEAWESHHRLLCEAAQADPETTGQA 167
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 18/138 (13%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+HC + C++
Sbjct: 5 KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C +A YCS C+++AW + H+ EC ++RLL + +KL+ S
Sbjct: 65 CRVAKYCSAKCQKKAW-QDHKQECKCLKSCKPRYPPDSVRLL-----GRVVIKLMEKTPS 118
Query: 188 HSDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 119 ESEKLYSFYDLESNINKL 136
>gi|170058972|ref|XP_001865158.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877853|gb|EDS41236.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 545
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/95 (33%), Positives = 47/95 (49%), Gaps = 9/95 (9%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKP----VAFVILPPCSMSNCNHCCTS-ISAPI 135
I M+ GR ++A D GD + EK ++FV S C HC ++ +++ I
Sbjct: 167 IHMKEFGDMGRGLVAERDFKTGDVILDEKSDLCSLSFVR----SFVECAHCGSAFLNSLI 222
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
PC+ C +YC E CR E HR+EC +LL
Sbjct: 223 PCSLCFAVMYCGEKCREEDLRITHRFECSVVTKLL 257
>gi|328785875|ref|XP_003250668.1| PREDICTED: SET and MYND domain-containing protein 4-like [Apis
mellifera]
Length = 554
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 62/128 (48%), Gaps = 16/128 (12%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVI-------LPPCSMSNCNHCCTSISAPIPCNECIL 142
GR+ +AV+ + D + ++K + P +M C++C A IPC+ C
Sbjct: 183 GRYFVAVKPIKMKDVILIDKSQITHLHKDDWDDDPTSNM--CHYCFKYCRALIPCDYCYH 240
Query: 143 AVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLIL-VSSHSDRYKEVYHLET 200
A+YCS+ CR +A+ YH+ C +G L + LKL+L ++ + R KE
Sbjct: 241 ALYCSKECRGKAYQAYHQIYCRYGNLD-----NKSSFVLKLLLKITDNGARLKEALEYHK 295
Query: 201 HLQDMRPE 208
L++M E
Sbjct: 296 ELENMSEE 303
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 61/143 (42%), Gaps = 19/143 (13%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI---------SAPIP 136
S KGR + A D G+ + +P AFVI + C+HC + + +
Sbjct: 108 SEEKGRGIKAARDFKVGNLILKAEPFAFVIFDHMAEHVCHHCFNMVVRDRQGQPTTQLLR 167
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE-- 194
C+ C A YCS C+++AW H+ EC R+ + + ++ ++R ++
Sbjct: 168 CSSCKFARYCSRECQKKAW-SMHKKECMAIKRIAPRTASDEVRMVSQILWKQAERGEKRA 226
Query: 195 -------VYHLETHLQDMRPEDL 210
V L HL DM ED+
Sbjct: 227 KSEELCRVEELCDHLNDMSFEDV 249
>gi|189240591|ref|XP_001815228.1| PREDICTED: similar to CG8378 CG8378-PA [Tribolium castaneum]
Length = 439
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 69/144 (47%), Gaps = 24/144 (16%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
++A D+ G+ L +EKP A +L +++C+ C IPC +C +A++CSE C+
Sbjct: 68 IVASRDIEIGEILAIEKPWA-AVLTTDRLTHCHECLCLCYNLIPCPKCTVALFCSELCKD 126
Query: 153 EAWLRYHRWECHGAL----RLLEAVGIAH---LALKLILVSS---------------HSD 190
+A YH++EC L R L I LA K+ +++ SD
Sbjct: 127 KA-RSYHKYECPILLTICYRALCEFAIDEAMLLAFKVAILAKDDYSNLDYMRLDGTYRSD 185
Query: 191 RYKEVYHLETHLQDMRPEDLYQYV 214
RY EV+ L+ + R E +Q +
Sbjct: 186 RYNEVHFLKPDGEFSRFEIFHQAI 209
>gi|326669800|ref|XP_003199084.1| PREDICTED: SET and MYND domain-containing protein 1-like [Danio
rerio]
Length = 231
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A ++ GD LF E P A V+ + S C+ C C +C
Sbjct: 4 VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC ++C+R W H+ EC
Sbjct: 64 RFAQYCDKTCQRAGW-EEHKLEC 85
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 67/152 (44%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A+ + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---------IAHLALKL----- 182
C+ C + YCS +C++E W + H+ EC L E + LALK
Sbjct: 72 CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLALKRKLQNE 130
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
++ SS D Y V LE+H+ + L Y
Sbjct: 131 KVIPSSSIDNYNLVDGLESHISKVDENQLVLY 162
>gi|384485526|gb|EIE77706.1| hypothetical protein RO3G_02410 [Rhizopus delemar RA 99-880]
Length = 510
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
+ + IA + + G LF+E P+ V LP CN+C C+ C A +CS
Sbjct: 16 ENKRFIASKQLEPGTALFIEPPLVSVPLPSKRHQRCNYCLCKAQLQC-CSRCRSAYFCSN 74
Query: 149 SCRREAWLRYHRWECHGALR 168
+C R AWL++HR C R
Sbjct: 75 ACFRNAWLQFHRILCEPQER 94
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 51.6 bits (122), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K + S KG + A ++H G+ L+ +P+A+ + C+ C+ C + + C++
Sbjct: 4 KFERFASAEKGNGLRATHEIHAGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLLRCSQ 63
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C +A YC +C+++AW H+ EC
Sbjct: 64 CKVARYCDVTCQKQAW-SDHKREC 86
>gi|449273578|gb|EMC83062.1| SET and MYND domain-containing protein 1 [Columba livia]
Length = 491
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 9 VEVFTTEGKGRGLKAQKEFLPGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+R AWL H+ EC R
Sbjct: 69 KFAYYCDRTCQRAAWLN-HKNECSAIKR 95
>gi|281201842|gb|EFA76050.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 965
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 40/85 (47%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S++ GR + + LF EK F + S C++C I API C +C LA Y
Sbjct: 361 SDSVGRLGETTDLIASNTILFQEKAYVFCLDRYSHSSYCHNCLKEILAPIYCKKCNLAQY 420
Query: 146 CSESCRREAWLRYHRWECHGALRLL 170
C + C +E + +H EC LL
Sbjct: 421 CGDECLKEDYNNFHEKECGKGFLLL 445
>gi|157130707|ref|XP_001661974.1| hypothetical protein AaeL_AAEL011836 [Aedes aegypti]
gi|108871816|gb|EAT36041.1| AAEL011836-PA [Aedes aegypti]
Length = 571
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 2/84 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNE 139
I+ + ++G ++A D GD + EKP+ + + CN+C T++S IPC
Sbjct: 196 IQGRQHPSQGPIMVAKRDFKAGDVILREKPMIACVSWENVFTRCNYCISTNLSHLIPCPN 255
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A++C E C R+A HR+EC
Sbjct: 256 CATAMFCDEECMRKAQ-NVHRFEC 278
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A+ + GD +F E + V+ S S C+ C + C +C
Sbjct: 4 VEIFESEGKGRGLRAIRESWAGDIIFAEPAYSAVVFDNLSHSVCHSCFKRQEKLLRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C++E+W H+ EC
Sbjct: 64 KFAHYCDRTCQKESWAN-HKNEC 85
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AGRGLAVASIPGKGRGLIATRTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C + YCS +C++E W + H+ EC L E L ++L+L
Sbjct: 72 CSMCQVTWYCSTNCQKEEW-KLHQLECRAMAALTEDRKKMLTPTIRLMVRLVLKRKLQNE 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS D Y V LE+H+ + L Y
Sbjct: 131 KVIPSSSIDNYNLVDALESHIWKVDENHLVLY 162
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|42562487|ref|NP_174606.2| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
gi|332193468|gb|AEE31589.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
Length = 798
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
K++ ++ KGR +++ D+ + + VE+P + VI C ++C+ C + A +PC
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEPFSVVISKSCRETHCHFCLNELPADTVPCP 278
Query: 139 ECILAVYCSESCR 151
C + VYCSESC+
Sbjct: 279 SCSIPVYCSESCQ 291
>gi|312381116|gb|EFR26935.1| hypothetical protein AND_06647 [Anopheles darlingi]
Length = 545
Score = 50.8 bits (120), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 7/96 (7%)
Query: 98 DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECILAVYCSESCRREAWL 156
D+ G+ + +E+P V+ + + C++C + PC C +YCSE C+ +A
Sbjct: 189 DLSVGEKVLLERPFVLVLEAEVAYARCDYCGACNEHNLRPCKHCTAVMYCSEECQEQALQ 248
Query: 157 RYHRWECH--GALRLL----EAVGIAHLALKLILVS 186
RYH++EC L+LL +A + +AL+L +
Sbjct: 249 RYHQFECEVIDDLQLLYRGPKATRVLQVALRLFWLG 284
>gi|108710652|gb|ABF98447.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 282
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 65/152 (42%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 13 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 72 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 130
Query: 185 ---VSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
SS +D Y V LE+H+ ++ L Y
Sbjct: 131 KAIPSSGTDNYNLVDALESHISEVDKNQLVLY 162
>gi|157130705|ref|XP_001661973.1| hypothetical protein AaeL_AAEL011840 [Aedes aegypti]
gi|108871815|gb|EAT36040.1| AAEL011840-PA [Aedes aegypti]
Length = 415
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 2/84 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC-TSISAPIPCNE 139
I+ + ++G ++A D GD + EKP+ + + CN+C T++S IPC
Sbjct: 40 IQGRQHPSQGPIMVAKRDFKAGDVILREKPMIACVSWENVFTRCNYCISTNLSHLIPCPN 99
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A++C E C R+A HR+EC
Sbjct: 100 CATAMFCDEECMRKAQ-NVHRFEC 122
>gi|170038058|ref|XP_001846870.1| U5 snRNP-specific protein [Culex quinquefasciatus]
gi|167881490|gb|EDS44873.1| U5 snRNP-specific protein [Culex quinquefasciatus]
Length = 789
Score = 50.8 bits (120), Expect = 4e-04, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSESCRRE 153
+ D + GD + +P+ I NC+HC S + IPC C+ ++YC E CR++
Sbjct: 149 GMRDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGCVFSMYCGEKCRQK 208
Query: 154 AWLRYHRW 161
+W +H++
Sbjct: 209 SWKLWHQF 216
>gi|170040667|ref|XP_001848113.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864252|gb|EDS27635.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 560
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 51/113 (45%), Gaps = 2/113 (1%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAP 134
C + I M+ + GR ++A + GD + EK + V+ C+ C + +
Sbjct: 174 CLAKGIAMKQYSGSGRGLVAERNFKAGDVILNEKAILSVVAVAFKYLYCSRCGIPNQHSL 233
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKLILVS 186
IPC C+ +YCSE C E HR+EC GA A ++LA KL
Sbjct: 234 IPCPNCVHCMYCSEECLAEDSRMAHRYECSFGAQMANIAYDGSNLATKLFFYG 286
>gi|26450409|dbj|BAC42319.1| unknown protein [Arabidopsis thaliana]
Length = 604
Score = 50.4 bits (119), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 43/73 (58%), Gaps = 1/73 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
K++ ++ KGR +++ D+ + + VE+P + VI C ++C+ C + A +PC
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEPFSVVISKSCRETHCHFCLNELPADTVPCP 278
Query: 139 ECILAVYCSESCR 151
C + VYCSESC+
Sbjct: 279 SCSIPVYCSESCQ 291
>gi|255570887|ref|XP_002526395.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
gi|223534257|gb|EEF35971.1| heat shock protein 70 (HSP70)-interacting protein, putative
[Ricinus communis]
Length = 781
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 68/125 (54%), Gaps = 10/125 (8%)
Query: 35 EKSEENSVEAMSMQVSSTSSSA----SSSVASLPSLACGESHTLMCASNKIKMQTSNAKG 90
+K EN ++ ++ Q + ++S + +S+ S +L+C +H+ K+ T+ KG
Sbjct: 176 KKQIENELKIIAGQCAGSASRSFNHIDNSLGSF-NLSCQMTHS----EYKLLRVTTPDKG 230
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILAVYCSES 149
R + + D+ + + VEKP A ++L C ++C++C + A +PC C + +YCS+
Sbjct: 231 RGMASGCDIPQASLVHVEKPYALIVLKNCRDTHCHYCLKELPADTVPCLSCSVPLYCSQL 290
Query: 150 CRREA 154
C+ A
Sbjct: 291 CQVHA 295
>gi|156539569|ref|XP_001601623.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 697
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GR +A +D+ G+ L VEKP ++L +++C+ C I AP P C C YCS
Sbjct: 271 GRFAVATKDIQPGELLVVEKPHCSMLLGEYRLTHCHRCSIRIVAPYPASCYLCSSVAYCS 330
Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYK 193
+C R+ R H EC G+L +A +AL+ I+ + + K
Sbjct: 331 PNC-RDLDERVHSIECGLLGSLWCSKASVTCMMALRAIIQKPYEEFIK 377
>gi|307104881|gb|EFN53133.1| hypothetical protein CHLNCDRAFT_137504 [Chlorella variabilis]
Length = 623
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 61/136 (44%), Gaps = 14/136 (10%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
M A+ ++AKGR VIA GD + + A+ ++ + C+ C + P
Sbjct: 1 MAAAGWHIQDGADAKGRRVIADRAYRAGDVVLEDDAAAWCLISEQLGAWCDCCLAPAAQP 60
Query: 135 IPCNECILAVYCSESCRREAW-LRYHRWECHGALRLLEAVGIAH---LALKLILVSSHSD 190
+ C+ C LA Y S +R AW HR EC ALR + H A++L L
Sbjct: 61 LRCSACKLAWYASREHQRRAWQAGAHRLEC-AALRSC----VPHTPPTAVRLAL-----R 110
Query: 191 RYKEVYHLETHLQDMR 206
RY EV L H D++
Sbjct: 111 RYGEVLGLRHHWGDLQ 126
>gi|312383019|gb|EFR28260.1| hypothetical protein AND_04036 [Anopheles darlingi]
Length = 455
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 60/122 (49%), Gaps = 17/122 (13%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
+GD + E+P A+V+LP C+ C + + C+ C+ YC+ +C++EAW H+
Sbjct: 8 RGDLILQEQPFAYVLLPQFRHERCDRC-FKLGKVLKCSGCLYVRYCNRACQQEAWPD-HQ 65
Query: 161 WECHGALRLLEA---VGIAHLALKLIL----------VSSHSDR-YKEVYHLETHLQDMR 206
EC L+LL A V A L L I+ ++D+ Y+ L H +D+R
Sbjct: 66 AECE-KLKLLPATLVVPSAALMLARIIRRLQKGGDFCKGYYTDKLYRRFNDLMPHEEDIR 124
Query: 207 PE 208
+
Sbjct: 125 KD 126
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 48/88 (54%), Gaps = 6/88 (6%)
Query: 82 KMQTSNAKGRHVIAVEDVH--KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--IPC 137
++++S+ GR + A E H GD++ +P+ + + P + + C H C + AP C
Sbjct: 10 RIESSSVGGRGLFAGEGSHYIPGDSIATYRPIVYCVTPTMAKACC-HWCLNSDAPKYYQC 68
Query: 138 NECILAVYCSESCRREAW-LRYHRWECH 164
+ C AVYCS+ C + A+ L HR EC
Sbjct: 69 SGCRYAVYCSKECAKAAYKLGTHRRECQ 96
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K+++ TS +KGR + A ++ D +F E+ + V+ + C+ C C +
Sbjct: 8 KVEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQ 67
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A YC +C+++AWL H+ EC
Sbjct: 68 CKFAHYCDRTCQKDAWLN-HKNEC 90
>gi|170058968|ref|XP_001865156.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167877851|gb|EDS41234.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 557
Score = 50.4 bits (119), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 41/87 (47%), Gaps = 1/87 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI-SAPIP 136
+ I+M+ GR ++A D GD + EK V+ S +C HC ++ + IP
Sbjct: 169 AKGIRMKEFGDMGRGLVAERDFKTGDVILDEKSDLCVVSFERSFISCAHCGSTFWKSLIP 228
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C +YC E CR HR+EC
Sbjct: 229 CLGCAAVMYCGEKCREADLQAIHRFEC 255
>gi|170047760|ref|XP_001851378.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167870065|gb|EDS33448.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 574
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 2/109 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIP 136
++ IKM GR +IA + GD + EK + I NCN C + IP
Sbjct: 188 ASGIKMARLEDFGRSMIATKRFKTGDVILHEKSILASIDLKLQFQNCNLCSAEEGLSLIP 247
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE-AVGIAHLALKLIL 184
C C +YCS+ C + HR+EC A ++ + G +L +L L
Sbjct: 248 CPNCATVMYCSQECLERDFRLVHRFECPIAEKVQHLSYGFVNLGSRLFL 296
>gi|440793165|gb|ELR14357.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 387
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 84 QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--PCSMSNCNHCCTSI---SAPIPCN 138
+ + GR + A+ D GD + E P+AF M +C+ C + +A + C
Sbjct: 61 EQAETMGRGMFALRDFKAGDVILEESPLAFAPRDCVVAQMLHCSQCLRKLAASTAAVTCP 120
Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
C L YCS+ C EAW +HR C A E G A
Sbjct: 121 HCRLDPYCSDRCAAEAWESHHRLLCEAAQADPETTGQA 158
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+R AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQRAAWLS-HKNECSAIKR 95
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 44/84 (52%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K+++ TS +KGR + A ++ D +F E+ + V+ + C+ C C +
Sbjct: 8 KVEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQ 67
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A YC +C+++AWL H+ EC
Sbjct: 68 CKFAHYCDRTCQKDAWLN-HKNEC 90
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+R AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQRAAWLS-HKNECSAIKR 95
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ GD +F E A V+ + C+ C C +C
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIFAEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+R AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQRAAWLS-HKNECSAIKR 95
>gi|327241846|gb|AEA40704.1| SET and MYND domain containing protein 4, partial [Ceratotherium
simum]
Length = 176
Score = 50.1 bits (118), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 39/75 (52%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 16 CHRCLKHTLAPVPCDGCSYAKYCSQECMQQAWDLYHSVECSLGGLLLTLGVFCHIALRST 75
Query: 184 LVSSHSDRYKEVYHL 198
L++ D K + +L
Sbjct: 76 LLARFEDAGKVIRNL 90
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 50.1 bits (118), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ KGR + A ++ GD +F E+ A V+ + C+ C + C +C
Sbjct: 4 VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AW+ H+ EC
Sbjct: 64 KFAHYCDRTCQKDAWVN-HKNEC 85
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW HR EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|158300855|ref|XP_552390.3| AGAP011843-PA [Anopheles gambiae str. PEST]
gi|157013362|gb|EAL38849.3| AGAP011843-PA [Anopheles gambiae str. PEST]
Length = 425
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 11/108 (10%)
Query: 84 QTSNAKG-RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECI 141
+T + +G R ++ED G + VEKP V+ + C+ C + + PC C
Sbjct: 156 ETGDGEGIRTNRSLED---GAKVLVEKPFVLVLEAELAYQRCDFCGATNEHNLRPCTGCT 212
Query: 142 LAVYCSESCRREAWLRYHRWECH--GALRLL----EAVGIAHLALKLI 183
+YCSE C+ +++ RYH++EC L+LL + + H+ L+L
Sbjct: 213 GVMYCSEECQEQSYQRYHQFECEIVDDLQLLFRGPKPTRMFHVVLRLF 260
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 37/80 (46%), Gaps = 2/80 (2%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI--PCNECILAVYC 146
+GR++ + + KG + +P+ V LP CN+C + P C+ C +C
Sbjct: 20 RGRYIKSTATIPKGTVIITSQPLGTVALPQTINEYCNYCFRKQTRPPLQRCSRCKSVYFC 79
Query: 147 SESCRREAWLRYHRWECHGA 166
C AWL YH++ C A
Sbjct: 80 DMGCFSNAWLSYHQFVCDPA 99
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 49.7 bits (117), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW HR EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|303271753|ref|XP_003055238.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463212|gb|EEH60490.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 838
Score = 49.7 bits (117), Expect = 7e-04, Method: Composition-based stats.
Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 3/87 (3%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS---APIP 136
+++M+ A+GRH++A D+ GD V +P A + + C+H A +
Sbjct: 22 RVEMRVDPARGRHLVAAVDMRAGDEALVARPYACALHDSQRAARCDHTFKRADDGGALLR 81
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C A Y + AW R HR EC
Sbjct: 82 CARCKHARYLGRDAQAAAWKRGHREEC 108
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 49.7 bits (117), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW HR EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 49.7 bits (117), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ ++ KGR + A ++ GD +F E+ A V+ + C+ C + C +C
Sbjct: 4 VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AW+ H+ EC
Sbjct: 64 KFAHYCDRTCQKDAWVN-HKNEC 85
>gi|384485695|gb|EIE77875.1| hypothetical protein RO3G_02579 [Rhizopus delemar RA 99-880]
Length = 394
Score = 49.7 bits (117), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 1/85 (1%)
Query: 80 KIKMQTSNAKGRH-VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
K+K + K H +IA +D + D L+ E P+ + P CN C + + C
Sbjct: 169 KLKELAQDDKVTHCLIAEQDFEENDVLYTETPLISALSPQLEGKYCNFCMNELKDVVECK 228
Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
C +CSE+C A +YH++ C
Sbjct: 229 NCDQVAFCSEACESAATKQYHQFLC 253
>gi|344297528|ref|XP_003420449.1| PREDICTED: SET and MYND domain-containing protein 1 [Loxodonta
africana]
Length = 478
Score = 49.7 bits (117), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ S C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLINSVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|170034229|ref|XP_001844977.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875489|gb|EDS38872.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 582
Score = 49.7 bits (117), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 1/84 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS-ISAPIPCNE 139
++M A GR ++A GD L EK + P +C+ C S + IPC++
Sbjct: 194 LRMTEVPAYGRAMVAERSFRPGDVLMKEKVFMTMSDPQYQYLHCHFCGESKLFNLIPCSD 253
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C +YCS+ CR+E HR+EC
Sbjct: 254 CASVMYCSDKCRQEDKRTLHRFEC 277
>gi|405974951|gb|EKC39558.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 670
Score = 49.7 bits (117), Expect = 0.001, Method: Composition-based stats.
Identities = 26/107 (24%), Positives = 43/107 (40%), Gaps = 33/107 (30%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAF-----VILPPCSMSNCNHCCTSI------------- 131
GR + A ED+ +G+T+F + P+ + ++ +C+HC S+
Sbjct: 255 GRAIYATEDIPEGETVFEDSPIVLGKVFDTVHYKTTIQSCDHCAQSLLSPQEYFMETFST 314
Query: 132 ---------------SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ PI C C YCS CR ++W YH+ C
Sbjct: 315 MESDLRDLVLKYWPNTKPIYCKNCKFVKYCSNDCRTQSWNTYHKIIC 361
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 44 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 102
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 103 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 156
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + R E L Q V
Sbjct: 157 EKLYSFYDLESNINKLTEDRKEGLRQLV 184
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 61/141 (43%), Gaps = 17/141 (12%)
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
KGR ++ G+ + +P A SNC+HC TS S C+ C +A YC
Sbjct: 24 GKGRGLVTTRRFFPGEVILCGEPYASTPNKILVGSNCDHCFTS-SNLRKCSVCRVAWYCG 82
Query: 148 ESCRREAWLRYHRWECHGALRLLE-----AVGIAHLALKLIL----------VSSHSDRY 192
C++E W + H+ EC L + L ++LIL S+ +D Y
Sbjct: 83 SVCQKEEW-KLHQLECRAISALTDDRKKMLTPTIRLMVRLILRRKLQNEKVIPSTGTDNY 141
Query: 193 KEVYHLETHLQDMRPEDLYQY 213
V LE+H+ ++ + L Y
Sbjct: 142 GLVDALESHISEVDDKQLVLY 162
>gi|170036319|ref|XP_001846012.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878889|gb|EDS42272.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 565
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 42/96 (43%), Gaps = 13/96 (13%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS------NCNHCCTSIS-A 133
I M GR + A + + GD + EK L C + +CN C S +
Sbjct: 173 IVMTYDPKFGRGLYAEREFNPGDIIINEK------LELCGLDYGYVYRHCNQCGADFSYS 226
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
IPC C +YCSE C W YHR+EC A +L
Sbjct: 227 LIPCPACTFCMYCSEECLEMNWKFYHRFECGVATKL 262
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K+++ TS KGR + +++ D +F E + V+ + C+ C C +
Sbjct: 8 KVEVFTSEGKGRGLKTTKELWAADVVFAEPAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQ 67
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C A YC C+++AWL H+ EC
Sbjct: 68 CKFARYCGRPCQKDAWLN-HKNEC 90
>gi|327241840|gb|AEA40701.1| SET and MYND domain containing protein 4, partial [Hippopotamus
amphibius]
Length = 176
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 37/75 (49%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS C +EAW YH EC LL H+AL+
Sbjct: 16 CHRCLKHTLAPVPCDGCSYAKYCSHECMQEAWHLYHSVECSLGGLLLTLGVFCHIALRSA 75
Query: 184 LVSSHSDRYKEVYHL 198
L++ D K V L
Sbjct: 76 LLARFEDVSKVVNKL 90
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 60/125 (48%), Gaps = 8/125 (6%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + ++ GD +F E +A V+ + C+ C C++C A Y
Sbjct: 9 SPGKGRGLKTTKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
C +C+R W H+ EC GA++ A G A + I V SH + V LE H+ DM
Sbjct: 69 CDRTCQRAGWAE-HKQEC-GAIK---AYGKA--PNENIRVVSHM-QLITVEELEDHVADM 120
Query: 206 RPEDL 210
+ +++
Sbjct: 121 QEDEI 125
>gi|302772491|ref|XP_002969663.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
gi|300162174|gb|EFJ28787.1| hypothetical protein SELMODRAFT_231366 [Selaginella moellendorffii]
Length = 484
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 38/145 (26%), Positives = 57/145 (39%), Gaps = 29/145 (20%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGR ++A D + G T+ + P V+ + C+ C A C+ C YCS
Sbjct: 24 KGRCLVADRDFYAGQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALRRCSACKFVFYCSP 83
Query: 149 SCRREAWLRYHRWECHGALRL--------------------LEAVGIAHLALKLILVSSH 188
+C+R W + H+ EC +RL L+A G +L +
Sbjct: 84 TCQRSQW-KIHQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATG--------VLPVTA 134
Query: 189 SDRYKEVYHLETHLQDMRPEDLYQY 213
D Y+ V L TH + E L Y
Sbjct: 135 LDNYEIVRALPTHFSETGDERLVMY 159
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A + GD +F E + V+ S C+ C + C +C
Sbjct: 4 VEIFDSEGKGRGLRATRESWAGDVIFAEPAYSAVVFDNLSHCVCHSCFKRQEKLLRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C++E+W+ H+ EC
Sbjct: 64 KFAHYCDRTCQKESWVN-HKHEC 85
>gi|432880356|ref|XP_004073657.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 478
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 75 MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
M A NK +++ + AKGR + A ++ G+ +F E + V+ + C HCC
Sbjct: 1 MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59
Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
A + C +C A YC+ +C+ W H+ EC
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/137 (25%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMHCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW HR EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HRRECSCLKSCKPRYPPDSVRLLGRVIVKLMDEK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|170040673|ref|XP_001848116.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864255|gb|EDS27638.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 586
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 54/127 (42%), Gaps = 11/127 (8%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--- 134
+N ++M GR +IA GD + EK + V NCN C S P
Sbjct: 191 ANGLRMAELPTYGRAMIAERSFEPGDVILHEKSLLTVGDARYLHLNCNLC--SEWKPYNL 248
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL------LEAVGIAHLALKLILVSSH 188
IPC C+ +YCSE CR++ HR+EC +L + +G L L +
Sbjct: 249 IPCPNCVWVMYCSEECRQQDLRTVHRFECGITAKLKHLNSWMVDMGPKLFFYGLTLFGDN 308
Query: 189 SDRYKEV 195
D+ K+
Sbjct: 309 VDKIKQF 315
>gi|327241844|gb|AEA40703.1| SET and MYND domain containing protein 4, partial [Camelus
bactrianus]
Length = 175
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC-HGALRLLEAVGIAHLALKL 182
C+ C A +PC+ C A YCS+ C ++AW YH EC GAL L V H+AL+
Sbjct: 16 CHRCLKHTLATVPCDGCSYAKYCSQECMQQAWDLYHSVECSQGALLLTLGV-FCHIALRS 74
Query: 183 ILVSSHSDRYK 193
L++ D K
Sbjct: 75 TLLARFEDAGK 85
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKHEC 90
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 9 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 67
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 68 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 121
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + R E L Q V
Sbjct: 122 EKLYSFYDLESNINKLTEDRKEGLRQLV 149
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + R E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDRKEGLRQLV 148
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 75 MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
M A NK +++ + AKGR + A ++ G+ +F E + V+ + C HCC
Sbjct: 1 MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59
Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
A + C +C A YC+ +C+ W H+ EC
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + R E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDRKEGLRQLV 148
>gi|432880354|ref|XP_004073656.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 489
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 46/92 (50%), Gaps = 5/92 (5%)
Query: 75 MCASNK--IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
M A NK +++ + AKGR + A ++ G+ +F E + V+ + C HCC
Sbjct: 1 MTAGNKENVRLFDAGAKGRGLKASKEFSTGEVVFAEPSYSAVVFDSLAFQVC-HCCFRRQ 59
Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWEC 163
A + C +C A YC+ +C+ W H+ EC
Sbjct: 60 AQLHRCGQCRFAHYCNRTCQTACW-EEHKQEC 90
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKHECSAIKR 95
>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
Length = 716
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 53/108 (49%), Gaps = 22/108 (20%)
Query: 80 KIKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
+IK+Q ++ KGR + ++ ++ + + E+P A +IL C ++C+ C + A +P
Sbjct: 219 QIKLQCVSTPTKGRGMASLSEISQSYLVHTEEPYAAIILKHCRDTHCHFCFNELPADSVP 278
Query: 137 CNECILAVYCSESCRREA-------------------WLRYHRWECHG 165
C C + +YCS+ C+ +A W+ H+ EC G
Sbjct: 279 CTSCSIPLYCSQHCQMQAGGQELRNNSKNHDSDSNIEWIAEHKHECKG 326
>gi|26352708|dbj|BAC39984.1| unnamed protein product [Mus musculus]
Length = 114
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 1/90 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRL 169
C +A YCS C+++AW HR EC RL
Sbjct: 65 CRIAKYCSAKCQKKAWP-DHRRECSCLKRL 93
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|426334416|ref|XP_004028748.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
Length = 182
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE++++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNIKKLTEDKKEGLRQLV 148
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 41/83 (49%), Gaps = 2/83 (2%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A ++ GD +F E +A V+ + C+ C C++C A Y
Sbjct: 9 SPGKGRGLKATKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWECHGALR 168
C +C+R W H+ EC GA++
Sbjct: 69 CDRTCQRAGWAE-HKQEC-GAIK 89
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 48.9 bits (115), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/94 (28%), Positives = 45/94 (47%), Gaps = 1/94 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
S+ + + S AKGR +IA H+G+ + ++P + C+ C S C
Sbjct: 16 SHNLSVSVSPAKGRCLIANRAFHRGEVILQQEPYVCALDDSSQALRCDRCYRQSSNLKRC 75
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
+ C YC +C+R W H++EC +RLL+
Sbjct: 76 SACKTVFYCCANCQRSGW-GLHKFECAILVRLLK 108
>gi|328724471|ref|XP_003248160.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 627
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 4/123 (3%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-MSNC 124
L G + + S ++++ S GR V A D++ GD + +++P V P S + N
Sbjct: 217 LVGGRNENIPALSAFVELKMSENMGRGVYATRDINPGDVVAIDEPYICVPFPDESEVCNY 276
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGA--LRLLEAVG-IAHLALK 181
N C A C +C++A YC++ C + + H C ++L + I LALK
Sbjct: 277 NGCLKFDIALFRCPKCVMAYYCNKDCMNKDYKDGHHLVCPIICFIKLTPGISKINELALK 336
Query: 182 LIL 184
L
Sbjct: 337 WFL 339
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW H+ EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ T+ +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW H+ EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNISKL 136
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 64 KFAHYCDRTCQKDAWLN-HKNECSAIKR 90
>gi|384498991|gb|EIE89482.1| hypothetical protein RO3G_14193 [Rhizopus delemar RA 99-880]
Length = 511
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 1/77 (1%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
+A+ + A + + G TLF+E + + LP CN+C + C+ C A +C
Sbjct: 14 SAENKQFTAAKHLEPGITLFIEPSLVSIPLPSKRHQRCNYCLSKAQLQC-CSRCRSAYFC 72
Query: 147 SESCRREAWLRYHRWEC 163
+C R AWL +HR C
Sbjct: 73 GNACFRNAWLHFHRVLC 89
>gi|348505212|ref|XP_003440155.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 467
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 10/135 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
+ +KGR + A ++++ G+ +F E A V+ + C+ C + C +C A Y
Sbjct: 14 AGSKGRGLRAAKELNTGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL-ILVSSHSD-------RYKEVYH 197
C +C+ W H+ EC GA++ L + ++ L +L H D + V
Sbjct: 74 CDRTCQTACW-NEHKQEC-GAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLISVDQ 131
Query: 198 LETHLQDMRPEDLYQ 212
L+ H+ D+ ED Q
Sbjct: 132 LQEHVADLAAEDFKQ 146
>gi|312384185|gb|EFR28972.1| hypothetical protein AND_02431 [Anopheles darlingi]
Length = 641
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 73/165 (44%), Gaps = 45/165 (27%)
Query: 27 FKEDV--LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ 84
KED+ L + K E S E + + T +A+S ++ AS+KIK+
Sbjct: 215 LKEDLEDLRVLKRETPSNEEEKVLPTFTGDAANSELSG--------------ASSKIKLV 260
Query: 85 TS--NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
S +++GR++ A ED+ G+ + E +A + P
Sbjct: 261 GSKDDSRGRYIAAAEDIAPGEPILAEPAIAACLYPK-----------------------F 297
Query: 143 AVYCSESCRREAWLRYHRWECHGALRLLEAVGIA---HLALKLIL 184
+CS +CR A YHR+EC L L+ G++ HLAL+++L
Sbjct: 298 FAFCSVACRDRACASYHRFECQ-YLDLMIGSGMSILCHLALRIVL 341
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 64 KFAHYCDRTCQKDAWLN-HKNEC 85
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 64 KFAHYCDRTCQKDAWLN-HKNEC 85
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 10/133 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A ++ GD +F E +A V+ + C+ C C +C A Y
Sbjct: 9 SPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV-------SSHSD-RYKEVYH 197
C +C+R W H+ EC A++ V ++ L ++ S+ SD + +
Sbjct: 69 CDRTCQRAGWAE-HKLEC-SAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDE 126
Query: 198 LETHLQDMRPEDL 210
LE H+ DM +DL
Sbjct: 127 LEDHIADMPEDDL 139
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|224140669|ref|XP_002323703.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
gi|222868333|gb|EEF05464.1| tetratricopeptide repeat-containing protein [Populus trichocarpa]
Length = 584
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCN 138
K+ ++ KGR + + D+ + + E+P A +IL C ++C++C + A +PC
Sbjct: 16 KLSQVSTPDKGRGMASQCDIPQASLVHKEEPYALIILKSCRETHCHYCLNQLRADTVPCT 75
Query: 139 ECILAVYCSESCRREA 154
C + +YCSE C+ +A
Sbjct: 76 SCTIPLYCSEHCQIQA 91
>gi|307212779|gb|EFN88450.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 687
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 63/134 (47%), Gaps = 19/134 (14%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH +A +V G+ + +E+P +L +++C+ C I PIP C C YCS
Sbjct: 262 GRHAVATREVSPGEIVIMERPYCASLLAENRLTHCHLCFARIFVPIPAACRTCSCVAYCS 321
Query: 148 ESCR-REAWLRYHRWECHGALRLL---EAVGIAHLALKLILVSSHSDRYKEVYHLETHLQ 203
CR R+A + H EC L +L +A LALK I ++EV L+ L
Sbjct: 322 RRCRDRDAQV--HLREC-SLLPVLWHSKASVTCFLALKAITQRP----FEEVMKLKERLG 374
Query: 204 D------MRPEDLY 211
D + PE+ Y
Sbjct: 375 DAAGALRISPENPY 388
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|449459406|ref|XP_004147437.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
gi|449500639|ref|XP_004161155.1| PREDICTED: SET and MYND domain-containing protein 4-like [Cucumis
sativus]
Length = 775
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILA 143
T++ KGR +++ ++ + VE+P A VIL C ++C++C + +PC C +
Sbjct: 219 TTSIKGRGMVSPTEIPPSSLVHVEEPYAVVILKHCRETHCHYCLNELPVDKVPCPSCSIP 278
Query: 144 VYCSESCRREAWLR 157
+YCS+ C+ +A R
Sbjct: 279 LYCSQHCQIQAGGR 292
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/133 (27%), Positives = 60/133 (45%), Gaps = 10/133 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A ++ GD +F E +A V+ + C+ C C +C A Y
Sbjct: 9 SPGKGRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV-------SSHSD-RYKEVYH 197
C +C+R W H+ EC A++ V ++ L ++ S+ SD + +
Sbjct: 69 CDRTCQRAGWAE-HKLEC-SAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDE 126
Query: 198 LETHLQDMRPEDL 210
LE H+ DM +DL
Sbjct: 127 LEDHIADMPEDDL 139
>gi|307202234|gb|EFN81718.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 343
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 26/43 (60%)
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
MS+C++C A IPC C A+YCSE CR W +YH EC
Sbjct: 1 MSHCSNCLRVCLATIPCKYCTYAMYCSEQCRDIEWEKYHDVEC 43
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/84 (29%), Positives = 41/84 (48%), Gaps = 1/84 (1%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
+++ N GR +IA +D+ G +EKP V+ C+ C + C+ C
Sbjct: 9 QVENRNGYGRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQTCSGCFRLAAHMQQCSSCK 68
Query: 142 LAVYCSESCRREAWLRYHRWECHG 165
+ YCS++C+R W H+ EC G
Sbjct: 69 VVQYCSQTCQRSDW-SIHKPECEG 91
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 62/135 (45%), Gaps = 10/135 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
+ +KGR + A ++++ G+ +F E A V+ + C+ C + C +C A Y
Sbjct: 14 AGSKGRGLRAAKELNTGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCKFAHY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKL-ILVSSHSD-------RYKEVYH 197
C +C+ W H+ EC GA++ L + ++ L +L H D + V
Sbjct: 74 CDRTCQTACW-NEHKQEC-GAIKKLGSAPSENIRLAARVLWRMHKDTGIASDSQLISVDQ 131
Query: 198 LETHLQDMRPEDLYQ 212
L+ H+ D+ ED Q
Sbjct: 132 LQEHVADLAAEDFKQ 146
>gi|115497232|ref|NP_001069874.1| SET and MYND domain-containing protein 3 [Bos taurus]
gi|112362116|gb|AAI20242.1| SET and MYND domain containing 3 [Bos taurus]
gi|296479288|tpg|DAA21403.1| TPA: SET and MYND domain containing 3 [Bos taurus]
Length = 391
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
+K++ ++ KG + A+ + G+ LF P+A+ + C+ C + C+
Sbjct: 4 SKVEKFSTTDKGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63
Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
+C +A YCS C+++AW + H+ EC ++RLL + KL+
Sbjct: 64 QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117
Query: 187 SHSDRYKEVYHLETHLQDM 205
S S++ Y LE+++ +
Sbjct: 118 SESEKLYSFYDLESNINKL 136
>gi|78070776|gb|AAI07726.1| SMYD3 protein [Homo sapiens]
Length = 182
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 48.1 bits (113), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|322801942|gb|EFZ22489.1| hypothetical protein SINV_02250 [Solenopsis invicta]
Length = 675
Score = 48.1 bits (113), Expect = 0.002, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH +A ++ G+ + VE+ +L +++CN C I P P C C +YCS
Sbjct: 249 GRHAVATREIVPGEIVIVERAHCAFLLEENRLTHCNLCFVRIFVPTPAACRTCNCVMYCS 308
Query: 148 ESCRREAWLRYHRWEC 163
C R+A H EC
Sbjct: 309 RRC-RDADAEVHSREC 323
>gi|326421762|gb|ADZ73909.1| SET and MYND domain containing protein 4 [Neophocaena phocaenoides]
Length = 212
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECMQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|326421780|gb|ADZ73918.1| SET and MYND domain containing protein 4 [Kogia sima]
Length = 212
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 40/81 (49%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHLETHLQD 204
L++ + K + L + D
Sbjct: 79 LLARFEEAGKVIDKLCGEITD 99
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFISDGKGRSLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAYYCDRTCQKDAWLN-HKNEC 90
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|197692952|gb|ACH71264.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 194
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 6 VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 65
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 66 KFAHYCDRTCQKDAWLN-HKNEC 87
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/83 (25%), Positives = 42/83 (50%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS GR + A +++ G T+ +P A+++ C+ C + C+ C
Sbjct: 5 VELFTSEKSGRGLKAKRNLNPGSTVLSSEPYAYLLSKKQKGVYCDFCFKKQDGLLQCSGC 64
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
YC+ +C++ AW +H+ EC
Sbjct: 65 KYMKYCNRNCQKMAWNEHHKAEC 87
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|297722479|ref|NP_001173603.1| Os03g0704200 [Oryza sativa Japonica Group]
gi|255674817|dbj|BAH92331.1| Os03g0704200 [Oryza sativa Japonica Group]
Length = 174
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/140 (27%), Positives = 60/140 (42%), Gaps = 17/140 (12%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR + A G+ + ++P A SNC++C S +
Sbjct: 34 AGRDLAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLR-K 92
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA-----VGIAHLALKLIL------- 184
C+ C +A YC +C+RE W + H+ EC L E L ++L+L
Sbjct: 93 CSVCRVAWYCGSACQREEW-KLHQLECRAIAALTEDRKKMLTPTIRLMVRLVLRRKLQDD 151
Query: 185 ---VSSHSDRYKEVYHLETH 201
SS +D Y V LE+H
Sbjct: 152 KAIPSSGTDNYNLVDALESH 171
>gi|326421778|gb|ADZ73917.1| SET and MYND domain containing protein 4 [Sousa chinensis]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|209878147|ref|XP_002140515.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209556121|gb|EEA06166.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 573
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 6/90 (6%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP---- 134
N + ++ S KGR +IA D+ G+ +++E P +++ S C+ C I
Sbjct: 9 NIVHLKVSEEKGRCIIANRDIKAGEVIWLEYPFIYIVFEEFLDSVCDTCLKYIGTEKVVN 68
Query: 135 -IPCNECILAVYCSESCRREAWLRYHRWEC 163
I CN C + YCS C+ + H++EC
Sbjct: 69 IIVCNGCSVVKYCSNICKSNS-EGIHKFEC 97
>gi|195059707|ref|XP_001995687.1| GH17627 [Drosophila grimshawi]
gi|193896473|gb|EDV95339.1| GH17627 [Drosophila grimshawi]
Length = 457
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 3/103 (2%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP A+V+ + C++C ++ C+ C YC+ +C+++AW
Sbjct: 19 IKRGTRILTEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAW-PL 77
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC R+ + A + +LIL H + Y+ E
Sbjct: 78 HKHECPFLKRVQPRIVPDAARMLCRLILRLEHGGDLERGYYTE 120
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 10 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 68
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 69 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 122
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 123 EKLYSFYDLESNINKLTEDKKEGLRQLV 150
>gi|326322048|gb|ADZ54024.1| SET and MYND domain containing 4 [Mesoplodon densirostris]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 43/82 (52%), Gaps = 14/82 (17%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHCCTSISAPIPCNECI 141
KGR V A++D+ +G + E+P+A C ++N C++C + + C++C
Sbjct: 15 KGRGVRALQDIKRGVEILKEEPLA------CILTNSKYRGIRCDYCYSEPEKLLKCSKCK 68
Query: 142 LAVYCSESCRREAWLRYHRWEC 163
YC + C+ W + H++EC
Sbjct: 69 FIAYCGKVCQASDW-KMHKYEC 89
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+ + T + KGR + D H GD + ++P +V +P + C+ C ++ + C+ C
Sbjct: 13 LTVSTISEKGRSLFTTRDFHPGDVIISQEP--YVCVP--TQKRCDGCFSTTNLS-KCSRC 67
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRL 169
+ YC C++ W + HR EC RL
Sbjct: 68 QVVWYCGTPCQKSEW-KLHRLECQALSRL 95
>gi|326421776|gb|ADZ73916.1| SET and MYND domain containing protein 4 [Sousa chinensis]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|326421772|gb|ADZ73914.1| SET and MYND domain containing protein 4 [Tursiops aduncus]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/152 (24%), Positives = 69/152 (45%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
++N + + T+ KGR +++++D G+ + ++P V S C C S S
Sbjct: 9 SANGLTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKK 67
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL-----------LEAVGIAHLALKL--- 182
C+ C + YC +C++ W + HR EC+ RL + + ++ KL
Sbjct: 68 CSACHVVWYCGSTCQKSDW-KLHRLECNALSRLEKERQKSLTPSIRLMVKLYMRRKLQSE 126
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
I+ ++ D Y V L +H+ D+ + L Y
Sbjct: 127 KIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/141 (24%), Positives = 66/141 (46%), Gaps = 18/141 (12%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ S +G + AV + G+ LF +P+A+ + C C + C++
Sbjct: 5 KLEKFRSPGRGNGLRAVAPLRPGELLFRSEPLAYTVCKGSRGVVCERCLCRKEKLLRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSS- 187
C +A YC C+++AW + H+ EC ++RLL V KL+ SS
Sbjct: 65 CKVARYCGSECQKQAW-QDHKRECKCLKSSEPKFPPDSVRLLGKV-----VFKLLRESSC 118
Query: 188 HSDRYKEVYHLETHLQDMRPE 208
S++ + LE++++++ E
Sbjct: 119 ASEKLYSFFDLESNIKNLSEE 139
>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
Length = 492
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 66/150 (44%), Gaps = 20/150 (13%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC---TSISA 133
A +K+ + S G+ + A +D+ GD +F K +L S+ +C C T A
Sbjct: 2 ADSKVLKKKSAISGKGLFANQDIAAGDLIFSLKRPLIAVLDNVSLDSCCANCFASTGFGA 61
Query: 134 P------IPCNECILAVYCSESCRREAWLRYHRWECH-----GALRLL----EAVGIAHL 178
C C YC SC+ ++W R+H+ EC GA R L AV I L
Sbjct: 62 TNNDLSLRACTGCSTLKYCGRSCQSQSWKRHHKHECPVLKTLGAERQLPNAVRAV-IQTL 120
Query: 179 ALKLILVSSHSDRYKEVYHLETHLQDMRPE 208
++ + S D +K + L HL ++R E
Sbjct: 121 VMRKSGLISDED-WKNLQELPAHLDELRQE 149
>gi|388518229|gb|AFK47176.1| unknown [Medicago truncatula]
Length = 477
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 6/89 (6%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+ + T + KGR + D H GD + ++P +V +P + C+ C ++ + C+ C
Sbjct: 13 LTVSTISEKGRSLFTTRDFHPGDVIISQEP--YVCVP--TQKRCDGCFSTTNLS-KCSRC 67
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRL 169
+ YC C++ W + HR EC RL
Sbjct: 68 QVVWYCGTPCQKSEW-KLHRLECQALSRL 95
>gi|328874283|gb|EGG22649.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 1240
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
GR A +D+ K +T+ +++ L S S+ C++C +PI C +C + YCSE
Sbjct: 598 GRIAEATDDI-KANTILIKENAYVTCLDRASYSSHCHNCFKETLSPIFCKKCNHSQYCSE 656
Query: 149 SCRREAWLRYHRWECHGALRLL 170
C + +++YH EC LL
Sbjct: 657 KCLNQDFIKYHEMECSIGFLLL 678
>gi|356518688|ref|XP_003528010.1| PREDICTED: uncharacterized protein LOC100783548 [Glycine max]
Length = 786
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 81 IKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPC 137
IK+Q ++ KGR +++ + G + VE+P A +IL C ++C++C + A +PC
Sbjct: 215 IKLQCVSTPDKGRGMVSSCVISPGSLVHVEEPYAMIILKQCRETHCHYCLNDLPADRVPC 274
Query: 138 NECILAVYCSESCRREA 154
C + +YCS C+ A
Sbjct: 275 ISCSIPLYCSHQCQTRA 291
>gi|307196209|gb|EFN77865.1| SET and MYND domain-containing protein 3 [Harpegnathos saltator]
Length = 354
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 52/110 (47%), Gaps = 4/110 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
V +G T+F KP A+V+ C+ C++C S C+ C YC +C++E+W
Sbjct: 8 VKRGITIFTSKPFAYVLTSKCNAICCDNCLKSGKLS-KCSGCQYVYYCDRNCQKESW-PI 65
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
H+ EC R+ + A L ++I+ + + Y+ ET + +
Sbjct: 66 HKAECANLKRISPKIIPDAARLMARIIIKLNQGGANEVGYYCETKFRKFK 115
>gi|193676233|ref|XP_001943351.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 636
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 56/126 (44%), Gaps = 10/126 (7%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS--MSN 123
L G + + S ++++ S GR V A D++ GD + +++P I P S
Sbjct: 214 LVGGRNENIPALSAFVELKMSKNMGRGVYATRDINPGDVVAIDEPY---ICGPISDHTGV 270
Query: 124 CNH--CCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH---L 178
C + C A IPC +C+L YC++ C +A H EC + GI L
Sbjct: 271 CQYSGCLNLDLALIPCPKCLLVYYCNKDCMNKANEDGHYLECPIMYFIKSTPGITRMNEL 330
Query: 179 ALKLIL 184
A+K L
Sbjct: 331 AMKWFL 336
>gi|326421788|gb|ADZ73922.1| SET and MYND domain containing protein 4 [Delphinapterus leucas]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/72 (34%), Positives = 37/72 (51%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEV 195
L++ + K +
Sbjct: 79 LLARFEEAGKVI 90
>gi|326421784|gb|ADZ73920.1| SET and MYND domain containing protein 4 [Elaphurus davidianus]
Length = 212
Score = 47.8 bits (112), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|195133772|ref|XP_002011313.1| GI16066 [Drosophila mojavensis]
gi|193907288|gb|EDW06155.1| GI16066 [Drosophila mojavensis]
Length = 448
Score = 47.4 bits (111), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 49/103 (47%), Gaps = 4/103 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP A+V+ + C++C + + C+ C YC+ SC+++AW
Sbjct: 9 IKRGTRILTEKPFAYVLKSKYRLERCDNCLEATKV-LKCSNCRYVSYCNRSCQQQAW-SI 66
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC R L + A + +LIL H + Y+ E
Sbjct: 67 HKHECLFLKRALPRIVPDAARMLCRLILRLEHGGDLERGYYTE 109
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|326421766|gb|ADZ73911.1| SET and MYND domain containing protein 4 [Delphinus capensis]
gi|326421770|gb|ADZ73913.1| SET and MYND domain containing protein 4 [Stenella attenuata]
gi|326421774|gb|ADZ73915.1| SET and MYND domain containing protein 4 [Stenella coeruleoalba]
Length = 212
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|326421782|gb|ADZ73919.1| SET and MYND domain containing protein 4 [Platanista gangetica]
Length = 212
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 79 LLARFEEAGKVINKL 93
>gi|328710706|ref|XP_003244339.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 648
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 1/99 (1%)
Query: 66 LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-MSNC 124
L G + + S+ ++++ S + R V A D++ GD + +++P P C+ + N
Sbjct: 228 LVGGRNENIPALSSFVELKMSESMERGVYATRDINPGDVVAIDEPYIGWPFPECAGVCNF 287
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
N C SA I C +C L YC++ C + H EC
Sbjct: 288 NGCMKINSALIQCPKCKLVSYCNKDCMNKDDKDGHNLEC 326
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/152 (24%), Positives = 69/152 (45%), Gaps = 17/152 (11%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
++N + + T+ KGR +++++D G+ + ++P V S C C S S
Sbjct: 9 SANGLTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKK 67
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRL-----------LEAVGIAHLALKL--- 182
C+ C + YC +C++ W + HR EC+ RL + + ++ KL
Sbjct: 68 CSACHVVWYCGSTCQKSDW-KLHRLECNALSRLEKERQKSLTPSIRLMVKLYMRRKLQSE 126
Query: 183 -ILVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
I+ ++ D Y V L +H+ D+ + L Y
Sbjct: 127 KIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + A+ + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFSTAN-RGNGLRALAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEEKKEGLRQLV 148
>gi|158299622|ref|XP_319707.4| AGAP008954-PA [Anopheles gambiae str. PEST]
gi|157013605|gb|EAA14812.4| AGAP008954-PA [Anopheles gambiae str. PEST]
Length = 453
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 33/63 (52%), Gaps = 2/63 (3%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
+GD + EKP A V+ P S C+ C + C+ C+ YC SC++EAW H+
Sbjct: 8 RGDVILQEKPFACVLDPRYRDSRCDRCFKETKV-MKCSNCLYVRYCGRSCQKEAWSD-HK 65
Query: 161 WEC 163
EC
Sbjct: 66 EEC 68
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
Length = 410
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 42/85 (49%), Gaps = 14/85 (16%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPC---SMSNCNHCCTSISAPI--------PCN 138
G+ + A D+ + + LF E+P +V PP S++ H C PI C
Sbjct: 105 GKGLFASRDIKQDEILFTEEP--YVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHCK 162
Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
C + YCS+SCR AW ++H+ EC
Sbjct: 163 HCDM-YYCSKSCRATAWDQFHQLEC 186
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIRR 95
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/148 (26%), Positives = 67/148 (45%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVTALRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGTPSES 120
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + + E L Q V
Sbjct: 121 EKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 37/78 (47%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S +GR V + GD +A V+LP S+C+ C S + C C A Y
Sbjct: 14 SGDRGRRVETETALAAGDLALRAPALATVLLPALWDSHCHKCFASGTRLSRCGRCNTAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
CS++C++ W HR EC
Sbjct: 74 CSKACQQADWKPDHRKEC 91
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 5/81 (6%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
K +H+ AV+ GD + E+P A+V+ S C+ C + C+ C YC+
Sbjct: 3 KLKHINAVK---TGDLILSEEPFAYVLSSKEKGSRCDFCLEK-GKVLKCSGCQFVHYCNR 58
Query: 149 SCRREAWLRYHRWECHGALRL 169
SC+++AW H+WEC R+
Sbjct: 59 SCQKDAW-EDHKWECANLKRI 78
>gi|327241838|gb|AEA40700.1| SET and MYND domain containing protein 4, partial [Balaenoptera
acutorostrata]
Length = 176
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 16 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 75
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 76 LLARFEEAGKVINKL 90
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A ++ GD +F E +A V+ + C+ C C +C A Y
Sbjct: 9 SPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWEC 163
C +C+R W H+ EC
Sbjct: 69 CDRTCQRAGWAE-HKQEC 85
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A ++ GD +F E +A V+ + C+ C C +C A Y
Sbjct: 9 SPGKGRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHY 68
Query: 146 CSESCRREAWLRYHRWEC 163
C +C+R W H+ EC
Sbjct: 69 CDRTCQRAGWAE-HKQEC 85
>gi|299750613|ref|XP_001829706.2| hypothetical protein CC1G_09595 [Coprinopsis cinerea okayama7#130]
gi|298408988|gb|EAU92074.2| hypothetical protein CC1G_09595 [Coprinopsis cinerea okayama7#130]
Length = 951
Score = 47.4 bits (111), Expect = 0.004, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 8/70 (11%)
Query: 100 HKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP------CNECILAVYCSESCRRE 153
H+ + E P V++ C S+ H C + ++ P C C YCS C++E
Sbjct: 747 HEANESRYEPPERHVVI--CDNSSAPHSCPNPTSLDPTKAGKTCGGCHWVNYCSAKCQQE 804
Query: 154 AWLRYHRWEC 163
W R HRWEC
Sbjct: 805 DWTRRHRWEC 814
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTSEGKGRGLKATKEFWASDVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AW+ H+ EC R
Sbjct: 69 KFARYCDRTCQKDAWVD-HKKECLAVKR 95
>gi|327241836|gb|AEA40699.1| SET and MYND domain containing protein 4, partial [Tursiops
truncatus]
Length = 176
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 38/75 (50%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 16 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 75
Query: 184 LVSSHSDRYKEVYHL 198
L++ + K + L
Sbjct: 76 LLARFEEAGKVINKL 90
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 33/151 (21%), Positives = 65/151 (43%), Gaps = 20/151 (13%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVH-KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA 133
+ ++ + + +G + AV+D+ + LFV +P+ + +NC +C
Sbjct: 29 LVEDERVYTKRTADRGNGLFAVDDIAAESQLLFVARPLLVALENAKLPTNCYYCFREPGG 88
Query: 134 P-------------IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL 180
C+ C +A +C + C+ EAW +YHR EC RL V + +
Sbjct: 89 QGLWQMTERKEDSLKTCSRCKVAKFCDQKCQTEAWSQYHRLECKLFSRLYPRVLPSTVRA 148
Query: 181 KLILVSSHS------DRYKEVYHLETHLQDM 205
+ L+ H ++++ L++H QD+
Sbjct: 149 VIRLLKQHKAGILPPGEWEQLLALQSHQQDL 179
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 3 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 62
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 63 KFAHYCDRTCQKDAWLN-HKNECSAIKR 89
>gi|322779440|gb|EFZ09632.1| hypothetical protein SINV_01274 [Solenopsis invicta]
Length = 425
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/124 (24%), Positives = 58/124 (46%), Gaps = 15/124 (12%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS--ISAPIPCNECILAVYCSESCRREAWL 156
V KG T+ +P FV+ + C++C S + + C+ C YC+ +C++++W
Sbjct: 1 VKKGTTILTGRPFVFVLRSKYRTTRCDNCFKSEQLVKLLKCSSCQYVYYCNRNCQKQSW- 59
Query: 157 RYHRWECHGALRLLEAV------GIAHLALKLILVSSH------SDRYKEVYHLETHLQD 204
R H+ EC ++L + +A + +KL + + Y++ L +H D
Sbjct: 60 RIHKRECVCLKKILPVILPDSVRLVARIIIKLKQGGADEMDYYTKENYRKFTDLMSHYSD 119
Query: 205 MRPE 208
MR +
Sbjct: 120 MRAD 123
>gi|380030528|ref|XP_003698898.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3-like [Apis florea]
Length = 414
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ KG TLF KP A+V+ C++C S C+ C YC++SC++ +W
Sbjct: 8 IKKGTTLFTAKPFAYVLYSKYRNERCDYCFKS-GKLFRCSVCKCIYYCNQSCQQMSW-TI 65
Query: 159 HRWECHGALRLLEAV--GIAHLALKLIL---------VSSHSD-RYKEVYHLETHLQDMR 206
H EC R V +A L ++I+ + +S +Y++ L +H D++
Sbjct: 66 HSKECASLKRFSSKVIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHYSDIK 125
Query: 207 PED 209
++
Sbjct: 126 KDE 128
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 47.4 bits (111), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|383865502|ref|XP_003708212.1| PREDICTED: SET and MYND domain-containing protein 3-like [Megachile
rotundata]
Length = 438
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 51/108 (47%), Gaps = 4/108 (3%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
KG T+F KP A+V+ + C++C TS C+ C YC+ SC++ +W H
Sbjct: 10 KGTTIFTAKPFAYVLSSKYRTNRCDYCLTS-GKLFRCSACQYVYYCNRSCQQGSWPT-HN 67
Query: 161 WECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
EC R+ V +A L ++I+ S + Y+ +T + +
Sbjct: 68 TECVNLKRVSPKVVPDMARLMARIIIKLSQGGDDEVEYYTKTKFRRFK 115
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
+K++ ++ +G + A+ + G+ LF P+A+ + C+ C + C+
Sbjct: 4 SKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63
Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
+C +A YCS C+++AW + H+ EC ++RLL + KL+
Sbjct: 64 QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117
Query: 187 SHSDRYKEVYHLETHLQDM 205
S S++ Y LE+++ +
Sbjct: 118 SESEKLYSFYDLESNINKL 136
>gi|218184846|gb|EEC67273.1| hypothetical protein OsI_34249 [Oryza sativa Indica Group]
Length = 816
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILAVYCS 147
KGR + + D+ + E P+A +I+ PC ++C++C + A + C C + +YCS
Sbjct: 250 KGRGMTSPNDISPASLIHAEDPLAVIIMKPCRDTHCHYCFSEAPADVVFCPSCTIPIYCS 309
Query: 148 ESCRREA 154
C+ +A
Sbjct: 310 NRCQEKA 316
>gi|195398853|ref|XP_002058035.1| GJ15711 [Drosophila virilis]
gi|194150459|gb|EDW66143.1| GJ15711 [Drosophila virilis]
Length = 448
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 4/102 (3%)
Query: 98 DVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLR 157
++ +G + EKP A+V+ + C++C ++ + C+ C YC+ +C+++AW
Sbjct: 13 EIKRGTRILTEKPFAYVLKSKYRLERCDNCLEAVKV-LKCSNCRYVSYCNRACQQQAW-P 70
Query: 158 YHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYH 197
H+ EC R L + A + +LIL H + Y+
Sbjct: 71 LHKHECPFLKRALPRIVPDAARMLCRLILRLEHGGDLERGYY 112
>gi|170034243|ref|XP_001844984.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167875496|gb|EDS38879.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 556
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 57/119 (47%), Gaps = 5/119 (4%)
Query: 48 QVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHK-GDTLF 106
Q++ S A++ A LP + + + + + M+ GR + +++ + GD L
Sbjct: 172 QIAEGKSFANAPFARLP-MRLEPNPRIPFMAKGLAMKNDPDYGRCLFPDDEIFRPGDVLM 230
Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWEC 163
E+ + + P + NCN+C T + A +PC C +YCSE C + HR+EC
Sbjct: 231 EEEALVAMNDPKFRLVNCNYC-TEMKAFQLVPCPGCCSVMYCSEECLEKDGQGGHRFEC 288
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|406866444|gb|EKD19484.1| MYND domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 754
Score = 47.0 bits (110), Expect = 0.005, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 7/78 (8%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVI-LPPCSMSNCNHCCTSIS------APIPCNECILAVY 145
+ A + KG+ + ++K + V +P S +C+ C S++ PIPC+ C A+Y
Sbjct: 407 IFATRLIEKGELVILDKSITGVSNVPSSSFRHCDACHGSLANQYLRQPPIPCSSCRKAIY 466
Query: 146 CSESCRREAWLRYHRWEC 163
CS C + A YH+ C
Sbjct: 467 CSPECYKTATEGYHKVLC 484
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|302850329|ref|XP_002956692.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
gi|300258053|gb|EFJ42294.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
Length = 265
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 55/128 (42%), Gaps = 4/128 (3%)
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
KGRH++A + G + + P V+ + C+ C P+ C LA YC
Sbjct: 28 GKGRHLVATRWFNPGAVVLQQDPYVSVLSDERTPGFCDFCFRPCERPLRCTRSKLARYCC 87
Query: 148 ESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
+ +R AW+ ++ EC +R V LA +L+ + ++ LE H ++
Sbjct: 88 KEHQRLAWVAGYKMECEALVRCAPRVPPPTVRLAARLLWRRARCGGINGLWRLEHHWDEL 147
Query: 206 --RPEDLY 211
R + LY
Sbjct: 148 DDRRKQLY 155
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|196000040|ref|XP_002109888.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
gi|190588012|gb|EDV28054.1| hypothetical protein TRIADDRAFT_53251 [Trichoplax adhaerens]
Length = 448
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 7/122 (5%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
S+ +++ + KGR + + + G ++ E P V+ S C+ C S C
Sbjct: 8 SDNVEIFETYEKGRGIRCKKQLAIGTSVGKENPFCHVVSQDMLSSYCHSCLLMQSELYKC 67
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSHSDR 191
+ C + +YC +SC++E W ++H++EC RL V + +A ++ +D+
Sbjct: 68 SRCKIIMYCCKSCQKEDW-QWHKYECKSITRLGPKVPPDSIRLLGRVAYTILQGQDRADQ 126
Query: 192 YK 193
+K
Sbjct: 127 FK 128
>gi|426239581|ref|XP_004013698.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 3 [Ovis
aries]
Length = 391
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 18/139 (12%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
+K++ ++ +G + A+ + G+ LF P+A+ + C+ C + C+
Sbjct: 4 SKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCS 63
Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLIL-VS 186
+C +A YCS C+++AW + H+ EC ++RLL + KL+
Sbjct: 64 QCRIAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMQETP 117
Query: 187 SHSDRYKEVYHLETHLQDM 205
S S++ Y LE+++ +
Sbjct: 118 SESEKLYSFYDLESNINKL 136
>gi|66540564|ref|XP_625013.1| PREDICTED: SET and MYND domain-containing protein 3 [Apis
mellifera]
Length = 440
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 57/123 (46%), Gaps = 14/123 (11%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ KG TLF KP A+V+ C++C S C+ C YC++SC++ +W
Sbjct: 8 IKKGTTLFTAKPFAYVLYSKYRNERCDYCFKS-GKLFRCSVCKCIYYCNQSCQQMSW-TI 65
Query: 159 HRWECHGALRLLEAV--GIAHLALKLIL---------VSSHSD-RYKEVYHLETHLQDMR 206
H EC R V +A L ++I+ + +S +Y++ L +H D++
Sbjct: 66 HSKECASLKRFSSKVIPDVARLMARIIIKLNQGGGEEIGYYSKTKYRKFKDLMSHYSDIK 125
Query: 207 PED 209
++
Sbjct: 126 KDE 128
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 22/148 (14%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C C++C
Sbjct: 9 KFATAN-RGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLXRCSQCR 67
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW H+ EC ++RLL + KL S S
Sbjct: 68 VAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLXDGAPSES 121
Query: 190 DRYKEVYHLETHLQDM---RPEDLYQYV 214
++ Y LE+++ + R E L Q V
Sbjct: 122 EKLYSFYDLESNINKLTEDRKEGLRQLV 149
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KG + + V G+ L+ +P A+V+ C C C++C +A Y
Sbjct: 11 SPGKGSGLRSRRQVRPGELLYRAEPFAYVVTKEQLGGVCEQCLQRNEHLHRCSQCKVAKY 70
Query: 146 CSESCRREAWLRYHRWEC 163
C +SC++EAWL H+ EC
Sbjct: 71 CGKSCQKEAWLD-HKREC 87
>gi|195059722|ref|XP_001995690.1| GH17625 [Drosophila grimshawi]
gi|193896476|gb|EDV95342.1| GH17625 [Drosophila grimshawi]
Length = 457
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 3/103 (2%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP A+V+ + C++C ++ C+ C YC+ +C+++AW
Sbjct: 19 IKRGTRILSEKPFAYVLKSQYRLERCDNCLEAVQKVRKCSNCRYVSYCNRTCQQQAW-PL 77
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC R+ + A + +LIL H + Y+ E
Sbjct: 78 HKHECPFLKRVQPRIVPDAARMLCRLILRLEHGGDLERGYYTE 120
>gi|384485789|gb|EIE77969.1| hypothetical protein RO3G_02673 [Rhizopus delemar RA 99-880]
Length = 459
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++ D+ + ++VEKP+ + P S+CN C ++ I C +C + +CS+
Sbjct: 189 RGLVVNRDIAADEIIYVEKPIVSGLFPGLEGSHCNLCLKRLNDVKIECPDCDVVAFCSDE 248
Query: 150 CRREAWLRYHRWEC 163
C A YH++ C
Sbjct: 249 CLNHAKEEYHQYIC 262
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 68/150 (45%), Gaps = 21/150 (14%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ + +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFATADRGNGLRAVTPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVS-S 187
C +A YCS C+++AW H+ EC ++RLL + KL+ S S
Sbjct: 65 CRVAKYCSSKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEGSPS 118
Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
S++ Y LE+++ + + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|301113434|ref|XP_002998487.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262111788|gb|EEY69840.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 40/86 (46%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S +GR V D++ GD A V+LP S+C+ C + C C A Y
Sbjct: 16 SGDRGRRVETQIDLNAGDLALRASASAAVLLPELWGSHCHKCFATGVRLSRCGRCHTAFY 75
Query: 146 CSESCRREAWLRYHRWECHGALRLLE 171
CS++C++ W HR EC +L +
Sbjct: 76 CSKACQQADWAPDHRRECKSMAQLAQ 101
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 67/150 (44%), Gaps = 21/150 (14%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ + +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFATAKRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C +A YCS C+++AW H+ EC ++RLL + KL+ S
Sbjct: 65 CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMDGAPS 118
Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
S++ Y LE+++ + + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/136 (25%), Positives = 64/136 (47%), Gaps = 19/136 (13%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K T+N +G + AV + G+ LF P+A+ + C+ C + C++C
Sbjct: 8 KFATAN-RGNGLRAVVALRPGELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLMRCSQCQ 66
Query: 142 LAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSSHS 189
+A YCS C+++AW + H+ EC ++RLL + KL+ S S
Sbjct: 67 VAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVIFKLMEETPSES 120
Query: 190 DRYKEVYHLETHLQDM 205
++ Y LE++++ +
Sbjct: 121 EKLYSFYDLESNIKKL 136
>gi|301603799|ref|XP_002931531.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 176
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAV 144
S KG V A++D+ G T+ + +P + + C + C+HC + C++C +
Sbjct: 11 SPGKGNGVRALKDMSHGLTVMIAEPYVYTV---CRIKTACDHCLHRKEKLLRCSQCKVTR 67
Query: 145 YCSESCRREAWLRYHRWEC 163
YC+ C+R+AW + H+ EC
Sbjct: 68 YCNSHCQRKAW-QGHKREC 85
>gi|119113125|ref|XP_564259.2| AGAP011233-PA [Anopheles gambiae str. PEST]
gi|116131639|gb|EAL41558.2| AGAP011233-PA [Anopheles gambiae str. PEST]
Length = 393
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 35/56 (62%), Gaps = 2/56 (3%)
Query: 122 SNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
+ CN+C + IPC C +A++CS +C EA+ +YHR+EC G +R L +G A
Sbjct: 230 ARCNYCHSRRLFTFIPCEGCTVAMFCSVACLTEAYRKYHRYEC-GLVRDLWKIGGA 284
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 42/91 (46%), Gaps = 1/91 (1%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP 134
M +S+ ++++ +KGR + A + G +F P+ + C S C+HC P
Sbjct: 1 MSSSSPVEIRPHPSKGRALHATKSFPPGAVIFSFTPLLLLPTVACLTSVCSHCLRP-GEP 59
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHG 165
C+ C A YC +C+ AW H EC
Sbjct: 60 RLCSRCHAAAYCDATCQAAAWKAVHSRECKA 90
>gi|326421764|gb|ADZ73910.1| SET and MYND domain containing protein 4 [Lipotes vexillifer]
Length = 212
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 4/72 (5%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C AP+PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 19 CHRCLKHTLAPVPCDGCSYAKYCSQECLQQAWDLYHSIECSLGGLLLTLGVFCHIALRST 78
Query: 184 LVSSHSDRYKEV 195
L++ R++E
Sbjct: 79 LLA----RFEEA 86
>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
Length = 548
Score = 47.0 bits (110), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 40/95 (42%), Gaps = 9/95 (9%)
Query: 84 QTSNAKGRHVIAVEDVHKGDTLF-VEKPVAFVILPPC--SMSNCNHCCTSISAPIP---- 136
+ + GR + A GD + E+P V+LPP ++ CNHC P
Sbjct: 13 KGGDKGGRSIHATRRFKPGDVIACFEEPA--VVLPPGHRALEYCNHCLKKQQPGGPKLRA 70
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C C YC +C+R W H+ EC RL E
Sbjct: 71 CTGCKTVAYCGPACQRANWSLVHKLECKAIQRLHE 105
>gi|350582164|ref|XP_003481215.1| PREDICTED: SET and MYND domain-containing protein 1-like [Sus
scrofa]
Length = 220
Score = 47.0 bits (110), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ TS KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 47.0 bits (110), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
A YC +C+++AWL H+ EC
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNEC 90
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A+ + + + KGR +IA GD + ++P A S+C+HC TS +
Sbjct: 13 AARGLTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C + YCS +C++E W + H+ EC L E
Sbjct: 72 CSMCRVTWYCSSNCQKEEW-KLHQLECRAMAALTE 105
>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
Length = 515
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 70/150 (46%), Gaps = 16/150 (10%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS-NCNHCCTS-ISAPI 135
S K K++ S GR+++A +D+ G+ + ++P FV+ P S C +C I+ +
Sbjct: 2 SVKYKVEKSEKLGRYLVAAKDIKAGERILSDQP--FVLGPSSDTSLVCFNCYLPLINKFL 59
Query: 136 PCNECILAVYCSESCRREAWLRYH-RWECH-----------GALRLLEAVGIAHLALKLI 183
C C +A C + + +YH R EC + +++ VG + L+
Sbjct: 60 VCKNCAVAPLCPGDGCPDEFTKYHNRQECDVFRNLKLTKGISPMTMVQNVGSLSVLRALL 119
Query: 184 LVSSHSDRYKEVYHLETHLQDMRPEDLYQY 213
++ +K LETHL+ R +++QY
Sbjct: 120 KKETNLLEWKLFMELETHLERRRESNVWQY 149
>gi|359486519|ref|XP_002270770.2| PREDICTED: uncharacterized protein LOC100263830 [Vitis vinifera]
Length = 799
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 46/78 (58%), Gaps = 3/78 (3%)
Query: 80 KIKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIP 136
+IK+Q ++ KGR + ++ ++ + + E+P A +IL C ++C+ C + A +P
Sbjct: 224 QIKLQCVSTPTKGRGMASLSEISQSYLVHTEEPYAAIILKHCRDTHCHFCFNELPADSVP 283
Query: 137 CNECILAVYCSESCRREA 154
C C + +YCS+ C+ +A
Sbjct: 284 CTSCSIPLYCSQHCQMQA 301
>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
Length = 383
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 32/111 (28%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC----------------T 129
+G+ + A +++ KG+T+FVE+PV F+ + C+HC +
Sbjct: 1 QGKGLFATKNIRKGETVFVERPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLLGNS 60
Query: 130 SISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
S+ P P C C + +YCS CR+ AW +YHR C G R
Sbjct: 61 SLVLPHPEQCSIRKDLHQQCPRCQV-MYCSAECRQAAWEQYHRVLCLGPSR 110
>gi|303282641|ref|XP_003060612.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458083|gb|EEH55381.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 302
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI--SAPIPCN 138
+K+ S+ +GRHV+A D+ +G+ L E+ + C+ C + P+ C
Sbjct: 7 LKIAQSDDRGRHVVATTDIARGELLLSEECTVAMTYRDLESPYCSQCVKPMVDETPVECE 66
Query: 139 ECILAVYCSESC 150
C L YCS+SC
Sbjct: 67 GCFLERYCSKSC 78
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 46.6 bits (109), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 70/151 (46%), Gaps = 21/151 (13%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
+K++ ++ +G + A+ + G+ LF P+A+ + C+ C + C+
Sbjct: 4 SKVEKFSTADRGNGLRALARLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCS 63
Query: 139 ECILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVS 186
+C +A YCS C+++AW + H+ EC ++RLL + KL+
Sbjct: 64 QCRVAKYCSAKCQKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVIFKLMEETP 117
Query: 187 SHSDRYKEVYHLETHLQDM---RPEDLYQYV 214
S S++ Y LE+++ + + E L Q V
Sbjct: 118 SESEKLYSFYDLESNMNKLTEEKKEGLRQLV 148
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 66/156 (42%), Gaps = 28/156 (17%)
Query: 75 MCASNKIKMQTSNAK---GRHVIAVEDVHKGDTLF-VEKPVAFVILPPC--SMSNCNHCC 128
M A++ I+++ + GR + A GD + + P V+LPP ++ CNHC
Sbjct: 1 MLANDGIEVRGTKGGAKGGRSIYATRRFKPGDVIARFDNPA--VVLPPGHRALEYCNHCV 58
Query: 129 TSISAPI-----PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH------ 177
P C C YC +C+R W H+ EC RL EA AH
Sbjct: 59 KK-QRPAGVKLRACTGCKTVAYCGPACQRANWSLVHKLECKAIQRLHEA-KPAHQPDWVP 116
Query: 178 ----LALKLILVSSHSDRYKEVYHLETHLQDMRPED 209
A +++L R++E LE H++ R +D
Sbjct: 117 TPIRAAAQVMLRPQVLARFEE---LEGHVEQWRKKD 149
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 46.6 bits (109), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 41/88 (46%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ S KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFNSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|356507528|ref|XP_003522516.1| PREDICTED: uncharacterized protein LOC100794609 [Glycine max]
Length = 784
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 81 IKMQ--TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPC 137
IK+Q ++ KGR +++ + G + E+P A +IL C ++C++C + A +PC
Sbjct: 217 IKLQRVSTPDKGRGIVSSCVISPGFLVHAEEPYAMIILKQCRETHCHYCLNDLPADRVPC 276
Query: 138 NECILAVYCSESCRREA 154
C + +YCS CR A
Sbjct: 277 ILCSIPLYCSHQCRIRA 293
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 46.2 bits (108), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
Length = 404
Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/87 (31%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
++ T+ A + +++ +GR ++A V +G+ LF E P V+L P ++S+ C+
Sbjct: 6 QTPTMAAAGDVLRVADLPGRGRGLVAARIVREGEVLFSEPP---VLLYPSTLSSLRSYCS 62
Query: 126 HCCTSIS--APIPCNECILAVYCSESC 150
C S++ A IPC C A +CS +C
Sbjct: 63 ACFRSLTPAATIPCVSCRAAAFCSPAC 89
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T+ KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|355720907|gb|AES07089.1| SET and MYND domain containing 2 [Mustela putorius furo]
Length = 432
Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ H GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALQPFHVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
Length = 380
Score = 46.2 bits (108), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 63/132 (47%), Gaps = 12/132 (9%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KG + A+ ++ G+ ++ +P AF + + C C + C++C A Y
Sbjct: 11 SEGKGNGLRALREIKPGEVIYSCEPFAFCVARDFLKTACQSCLKRGESLSRCSQCKTARY 70
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSHSDRYKEVYHL- 198
C+ C+++AW H+ EC L + +A + KL L S SD+ +E+Y +
Sbjct: 71 CNVQCQKQAWPD-HKRECKCLKHLQPRIPTDSVRLVARIIFKL-LSQSESDQ-EELYSIA 127
Query: 199 --ETHLQDMRPE 208
++HL DM E
Sbjct: 128 EHQSHLADMSEE 139
>gi|327241842|gb|AEA40702.1| SET and MYND domain containing protein 4, partial [Sus scrofa]
Length = 176
Score = 46.2 bits (108), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 35/67 (52%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C + A +PC+ C A YCS+ C ++AW YH EC LL H+AL+
Sbjct: 16 CHRCLKTTLATVPCDGCSYAKYCSQECLQQAWDLYHSVECSLGGLLLTLGVFCHVALRST 75
Query: 184 LVSSHSD 190
L++ D
Sbjct: 76 LLARFED 82
>gi|170047543|ref|XP_001851277.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167869950|gb|EDS33333.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 697
Score = 46.2 bits (108), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 36/70 (51%), Gaps = 6/70 (8%)
Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR-LLEAVG 174
LPP +C +C P PC C A YCS +CR+ HR+EC G + L +G
Sbjct: 213 LPP---FDCQYCGEINVIPFPCITCGRACYCSVACRKNH--SSHRFECAGYGKHLWYLIG 267
Query: 175 IAHLALKLIL 184
IAHL ++ L
Sbjct: 268 IAHLGIRSFL 277
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 46.2 bits (108), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K +T++ +G + A + GD LF P+A+ + + C+ C + C++C
Sbjct: 8 KFRTAD-RGNGLRASAALRPGDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLLRCSQCR 66
Query: 142 LAVYCSESCRREAWLRYHRWEC 163
+A YCS C+++AW H+ EC
Sbjct: 67 VAKYCSAKCQKKAWPD-HKREC 87
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 18/138 (13%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ +S ++G + A + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFSSASRGNGLRASAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C +A YCS C+++AW H+ EC ++RLL + KL+ S
Sbjct: 65 CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPS 118
Query: 188 HSDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 119 ESEKLYSFYDLESNINKL 136
>gi|357146941|ref|XP_003574165.1| PREDICTED: uncharacterized protein LOC100821025 [Brachypodium
distachyon]
Length = 791
Score = 45.8 bits (107), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 32/102 (31%), Positives = 52/102 (50%), Gaps = 8/102 (7%)
Query: 54 SSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF 113
SS+ A LP A L CAS T N KGR ++++ + + VE P+A
Sbjct: 199 SSSDCQDAGLPHTAEPHKVVLECAS------TPN-KGRGMMSLNVISPASLIHVEDPLAA 251
Query: 114 VILPPCSMSNCNHCCTSISAPIP-CNECILAVYCSESCRREA 154
+I+ C ++C++C + A + C C + VYCS+ C+ +A
Sbjct: 252 IIIKSCRETHCHYCFSEAPADVVFCPLCTIPVYCSKECQEQA 293
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 61/146 (41%), Gaps = 24/146 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---- 136
+ ++TS+ GR + A E + G TL KP V+ S N C++ ++P P
Sbjct: 25 LDIRTSSTDGRGIWAREALSAGATLMALKPHVHVL----STRNLELYCSACTSPAPVTGL 80
Query: 137 --CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG-------------IAHLALK 181
C C + YCS +C+ W H+ EC R + L +
Sbjct: 81 KRCTRCRVIWYCSTACQNNDWT-MHKPECAAIQRWAANAPSPDVSIPAEPIRCLGRLLWR 139
Query: 182 LILVSSHSDRYKEVYHLETHLQDMRP 207
+ S S KEV L++H +D++P
Sbjct: 140 KQKLRSSSTWAKEVDGLQSHRKDLQP 165
>gi|194742499|ref|XP_001953740.1| GF17913 [Drosophila ananassae]
gi|190626777|gb|EDV42301.1| GF17913 [Drosophila ananassae]
Length = 675
Score = 45.8 bits (107), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 7/98 (7%)
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCS 147
+K V A D+ KG+ + E P + P +NC+ C C C YCS
Sbjct: 223 SKNAGVFATSDIDKGEIVLAENPTYYQFGAP--FNNCDLCGVPQELIYTCAGCRYKTYCS 280
Query: 148 ESCRREAWLRYHRWECH----GALRLLEAVGIAHLALK 181
ESC ++ H++EC G + ++E+ + L LK
Sbjct: 281 ESCMKDDE-EIHQFECSGFKVGIIPMMESTTLFRLFLK 317
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 45.8 bits (107), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 63/138 (45%), Gaps = 18/138 (13%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ + +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFATAHRGNGLRAVVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C +A YCS C+++AW H+ EC ++RLL + KL+ + S
Sbjct: 65 CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEEIPS 118
Query: 188 HSDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 119 ESEKLYTFYDLESNINKL 136
>gi|307179276|gb|EFN67662.1| hypothetical protein EAG_07125 [Camponotus floridanus]
Length = 275
Score = 45.8 bits (107), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/72 (27%), Positives = 37/72 (51%)
Query: 60 VASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPC 119
+A++P + L S+ ++++ A+GRH++A D+ G L V++P +F P
Sbjct: 203 IAAMPVVDGKPHDCLASCSDSVELRLDEARGRHLVAARDIRPGAVLIVDQPFSFSTDGPA 262
Query: 120 SMSNCNHCCTSI 131
NC + C I
Sbjct: 263 LSRNCLYTCVYI 274
>gi|194900528|ref|XP_001979809.1| GG16796 [Drosophila erecta]
gi|190651512|gb|EDV48767.1| GG16796 [Drosophila erecta]
Length = 690
Score = 45.8 bits (107), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
V A D+ KG+ + VE PV F P NC+ C C+ C YCS+SC
Sbjct: 241 VFASCDIPKGEIVLVENPVHFQFSAP--FLNCDLCGVHQEQLYTCDNCRFRTYCSKSC-M 297
Query: 153 EAWLRYHRWECHG 165
E+ H++EC+G
Sbjct: 298 ESDAEVHQYECYG 310
>gi|340780606|pdb|3RIB|A Chain A, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
gi|340780607|pdb|3RIB|B Chain B, Human Lysine Methyltransferase Smyd2 In Complex With
Adohcy
Length = 441
Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
Length = 452
Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 50/113 (44%), Gaps = 7/113 (6%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
K+ + K R+ E + KG T+ P + + C+ C + S C C+
Sbjct: 41 KLLRGSLKMRNTGKHEPIPKGTTILESVPFVYCLKSSFRRELCDFCLKANSNLRKCLGCM 100
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVG------IAHLALKLILVSSH 188
+ YC C+RE W + H+ EC +R+ V IA L LKL +++ +
Sbjct: 101 VVSYCGRVCQREGW-KDHKGECKNFVRVKPNVPTDSVRLIARLILKLQVINGY 152
>gi|188035871|ref|NP_064582.2| N-lysine methyltransferase SMYD2 [Homo sapiens]
gi|90185234|sp|Q9NRG4.2|SMYD2_HUMAN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=HSKM-B; AltName: Full=Histone methyltransferase
SMYD2; AltName: Full=Lysine N-methyltransferase 3C;
AltName: Full=SET and MYND domain-containing protein 2
gi|345111051|pdb|3TG4|A Chain A, Structure Of Smyd2 In Complex With Sam
gi|345111052|pdb|3TG5|A Chain A, Structure Of Smyd2 In Complex With P53 And Sah
gi|67514269|gb|AAH98276.1| SET and MYND domain containing 2 [Homo sapiens]
Length = 433
Score = 45.4 bits (106), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|343197592|pdb|3S7B|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197593|pdb|3S7D|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197595|pdb|3S7F|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|343197597|pdb|3S7J|A Chain A, Structural Basis Of Substrate Methylation And Inhibition
Of Smyd2
gi|9295345|gb|AAF86953.1|AF226053_1 HSKM-B [Homo sapiens]
gi|66990077|gb|AAH98133.1| SET and MYND domain containing 2 [Homo sapiens]
gi|68226703|gb|AAH98335.1| SET and MYND domain containing 2 [Homo sapiens]
gi|189054089|dbj|BAG36596.1| unnamed protein product [Homo sapiens]
gi|307686281|dbj|BAJ21071.1| SET and MYND domain containing 2 [synthetic construct]
Length = 433
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|327241848|gb|AEA40705.1| SET and MYND domain containing protein 4, partial [Manis
pentadactyla]
Length = 176
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 36/75 (48%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
C+ C A +PC+ C A YCS C+++AW YH EC LL H AL+
Sbjct: 16 CHRCLKHTLATVPCDGCSHAKYCSPECKQQAWNLYHNIECSLGGLLLTLGVFCHTALRTT 75
Query: 184 LVSSHSDRYKEVYHL 198
L++ D K + L
Sbjct: 76 LLARFEDVGKVIRKL 90
>gi|170040664|ref|XP_001848112.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167864251|gb|EDS27634.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 558
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 2/84 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNE 139
I M+ GR ++A + GD + EK + V+ NC+HC S + IPC
Sbjct: 178 IGMKQYPGSGRGLVAERNFKTGDVILDEKTILSVVSVANRYLNCSHCGISNQHSLIPCPN 237
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C+ +YCS C E HR+EC
Sbjct: 238 CVHCMYCSGECLAEDK-PLHRFEC 260
>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
Length = 404
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 9/87 (10%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
++ T+ A + +++ +GR ++A +V +G+ L E P V+L P ++S+ C+
Sbjct: 6 QTPTMAAAGDVLRVADLPGRGRGLVAARNVREGEVLLSEPP---VLLYPSTLSSLRSYCS 62
Query: 126 HCCTSI--SAPIPCNECILAVYCSESC 150
C S+ +A IPC C A +CS +C
Sbjct: 63 ACFRSLPPAATIPCVSCRAAAFCSPAC 89
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 45.4 bits (106), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ + KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFITEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KG + + V G+ L+ +P A+V+ C C C++C +A Y
Sbjct: 11 SPGKGSGLRSRRRVRPGELLYRAEPFAYVVTKEQLGGVCEQCFQRNEHLHRCSQCKVAKY 70
Query: 146 CSESCRREAWLRYHRWEC 163
C +SC++E WL H+ EC
Sbjct: 71 CGKSCQKEGWLD-HKREC 87
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 45.4 bits (106), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 33/141 (23%), Positives = 65/141 (46%), Gaps = 18/141 (12%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ ++ +G + A+ + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C +A YCS C+++AW H+ EC ++RLL + +KL+ S
Sbjct: 65 CRVAKYCSAKCQKKAWPD-HKRECKCLKSCKPRYPPDSVRLL-----GRVVVKLMEETPS 118
Query: 188 HSDRYKEVYHLETHLQDMRPE 208
S++ Y LE+++ + E
Sbjct: 119 ESEKLYSFYDLESNISKLTEE 139
>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
Length = 433
Score = 45.1 bits (105), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 40/139 (28%), Positives = 56/139 (40%), Gaps = 12/139 (8%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
+S KGR + A+ GD LF A+V+ ++C HC C C A
Sbjct: 13 SSPGKGRGLRALRPFQVGDLLFSCPAYAYVLTVNERGNHCEHCFARKEGLSKCGRCKQAF 72
Query: 145 YCSESCRREAWLRYHRWECHGALRLLE----AVGIAHLALKLILVSSHSDRYKE-----V 195
YC C++E W H+ EC + E + + A L +SH DR V
Sbjct: 73 YCDVECQKEDW-PMHKLECSPMVVFGENWNPSETVRLTARILAKQTSHPDRTPSEKLLAV 131
Query: 196 YHLETHLQ--DMRPEDLYQ 212
E+HL D +DL Q
Sbjct: 132 KEFESHLDKLDNEKKDLIQ 150
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 1/88 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ + KGR + A ++ D +F E+ + V+ C+ C C +C
Sbjct: 9 VEVFIAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALR 168
A YC +C+++AWL H+ EC R
Sbjct: 69 KFAHYCDRTCQKDAWLN-HKNECSAIKR 95
>gi|71052093|gb|AAH49367.2| SMYD2 protein [Homo sapiens]
Length = 371
Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 45.1 bits (105), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 33/137 (24%), Positives = 62/137 (45%), Gaps = 16/137 (11%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ + +G + AV + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFATADRGNGLRAVVPLRPGELLFRCDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLILVSSH 188
C +A YCS C+++AW H+ EC ++RLL V + + K S
Sbjct: 65 CRIAKYCSAKCQKKAWPD-HKQECKCLKSCKPRYPPDSVRLLARVIVKLMDEK----PSE 119
Query: 189 SDRYKEVYHLETHLQDM 205
S++ Y LE+++ +
Sbjct: 120 SEKLYSFYDLESNINKL 136
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 45.1 bits (105), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 6/83 (7%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS--MSNCNHCCTSISAPIPCNECILAVYC 146
KGR + + G+ + ++P +V +P S S C+ C SI+ C+ C +A YC
Sbjct: 21 KGRSLFTTRNFRPGEEIISQEP--YVCVPNNSPVESRCDRCFASINLK-KCSACKVAWYC 77
Query: 147 SESCRREAWLRYHRWECHGALRL 169
S C++ W + HR EC RL
Sbjct: 78 SSVCQKSEW-KLHRLECDALARL 99
>gi|332027349|gb|EGI67433.1| SET and MYND domain-containing protein 4 [Acromyrmex echinatior]
Length = 619
Score = 45.1 bits (105), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 53/119 (44%), Gaps = 9/119 (7%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP--CNECILAVYCS 147
GRH +A + G+ + VE+P +L +++C+ C I P P C YCS
Sbjct: 193 GRHAVATRKIIPGEIVIVERPHCAFLLAETRLTHCHLCFVKIFVPTPAACRTYSCVAYCS 252
Query: 148 ESCRREAWLRYHRWECH--GALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
C R+A + H EC AL +A LAL+ I ++EV L+ +D
Sbjct: 253 RRC-RDADAQVHSQECKLLPALWYSKASVTCFLALRAITQRP----FEEVMRLKEQFRD 306
>gi|66822113|ref|XP_644411.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822969|ref|XP_644839.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|122057743|sp|Q557F7.1|Y3589_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273589
gi|60472534|gb|EAL70485.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472895|gb|EAL70844.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 386
Score = 45.1 bits (105), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-- 136
N +++++S +GR++IA D+ G+ L K V ++C +C + + I
Sbjct: 6 NGLELKSSENEGRYLIATRDIQIGEDLLKCKSYFAVTSETLKTTSCFNCIKQLPSVIKLS 65
Query: 137 --CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
CN+C YC+E C+ E ++ +EC +L
Sbjct: 66 LKCNQCNEIWYCNEQCKNENINKHQHYECKFYKKL 100
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 45.1 bits (105), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A V+ S+C HC C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFACPAYASVLTVGERGSHCEHCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|194768855|ref|XP_001966527.1| GF21943 [Drosophila ananassae]
gi|190617291|gb|EDV32815.1| GF21943 [Drosophila ananassae]
Length = 459
Score = 45.1 bits (105), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 22/83 (26%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
I +K + I + +G + EKP AFV+ + C++C + + C+ C
Sbjct: 8 IAKANGASKSKSNIPNPQIKRGQRILTEKPFAFVLKSKYRLERCDNCLEATKV-LKCSNC 66
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
YC+ +C+ +AW + H+ EC
Sbjct: 67 RYVSYCNRACQSQAWAQ-HKHEC 88
>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
Length = 404
Score = 44.7 bits (104), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 47/87 (54%), Gaps = 9/87 (10%)
Query: 70 ESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CN 125
++ T A + +++ +GR ++A +V +G+ L E P V+L P ++S+ C+
Sbjct: 6 QTPTTAAAGDALRVADLPGRGRGLVAARNVREGEVLLSEPP---VLLYPSTLSSLASYCS 62
Query: 126 HCCTSI--SAPIPCNECILAVYCSESC 150
C S+ +A +PC C A +CS +C
Sbjct: 63 ACFRSLPAAATVPCASCRAAAFCSPAC 89
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 44.7 bits (104), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF + A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCRAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 44.7 bits (104), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 9/140 (6%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + +Q KGR + + V GD + V P+ + + C+ C + P P
Sbjct: 5 APDGVAIQPHKTKGRALHTTKTVAAGDVIAVFTPLLLLPSLSHLTTVCSFCLRA-GTPRP 63
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA------VGIAHLALKLILVS--SH 188
C+ C A YC C+ AW H EC +R +++ + AL +L+S
Sbjct: 64 CSRCRAAYYCDARCQAAAWSGGHSLECAALVRAVKSSKKRREIPTPVRALVKVLLSCGQP 123
Query: 189 SDRYKEVYHLETHLQDMRPE 208
D K + LE H+ + R E
Sbjct: 124 EDLSKNMDGLEGHVAERRRE 143
>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
Length = 508
Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 18/85 (21%)
Query: 84 QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPP---CSMSNCNHCCT----------- 129
Q N KGR V A++ KGD +F EK + VI+PP + + CCT
Sbjct: 119 QDKNGKGRGVYALKHFKKGDMIFEEK-LPIVIIPPMEKLELMSKGKCCTLCGKSLYELST 177
Query: 130 --SISAPIPCNECILAVYCSESCRR 152
+ + CN+C AV+CS++C++
Sbjct: 178 HYIMMNGLDCNDCT-AVWCSKNCKK 201
>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
Length = 429
Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 60/135 (44%), Gaps = 7/135 (5%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ KG + A + V G+ L+ +P+A + C C+ C + + C++
Sbjct: 4 KLERFVCPGKGNGLRANQRVLAGELLYTAEPLACCVSSQCRNVVCHSCFSRHETLLRCSQ 63
Query: 140 CILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI------AHLALKLILVSSHSDRYK 193
C +A YC +C++ AW H+ EC L L + A + +L+ S + +
Sbjct: 64 CKVARYCDVTCQKRAW-SIHKRECKCLLSLHPRIPTDSVRLAARIIFRLLCPSQMTPQLY 122
Query: 194 EVYHLETHLQDMRPE 208
E+HL DM E
Sbjct: 123 SFEEHESHLCDMGEE 137
>gi|414870818|tpg|DAA49375.1| TPA: hypothetical protein ZEAMMB73_691595 [Zea mays]
Length = 437
Score = 44.7 bits (104), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/87 (27%), Positives = 47/87 (54%), Gaps = 6/87 (6%)
Query: 69 GESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC 128
G+ H ++ S + T N KGR +I+ +D+ + VE P+A +I+ ++C+ C
Sbjct: 212 GQPHKIVIES----ISTPN-KGRGMISTDDIPPASLIHVEDPLAAIIMKSSRETHCHFCF 266
Query: 129 TSISAPIP-CNECILAVYCSESCRREA 154
+ A + C C + +YCS+ C+ ++
Sbjct: 267 SETPADVVFCPSCTIPIYCSKRCQEQS 293
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 44.7 bits (104), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ + KG + A+ + G+ LF P+A+ + C+ C + C++
Sbjct: 5 KVEKFITADKGNGLRALVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQ 64
Query: 140 CILAVYCSESCRREAWLRYHRWEC 163
C +A YCS C+++AW + H+ EC
Sbjct: 65 CRVAKYCSAKCQKKAW-QDHKREC 87
>gi|67514234|gb|AAH98305.1| SMYD2 protein [Homo sapiens]
Length = 272
Score = 44.7 bits (104), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C T C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|403277520|ref|XP_003930406.1| PREDICTED: N-lysine methyltransferase SMYD2 [Saimiri boliviensis
boliviensis]
Length = 433
Score = 44.7 bits (104), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+E GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALEPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|296230124|ref|XP_002760570.1| PREDICTED: N-lysine methyltransferase SMYD2 [Callithrix jacchus]
Length = 433
Score = 44.3 bits (103), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+E GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALEPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
Length = 559
Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 44/99 (44%), Gaps = 10/99 (10%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP--PCSMSNCNHCCTSIS----API--- 135
+S GR ++A +D G +L + LP P + CN+C S +P+
Sbjct: 40 SSGPHGRSLVATQDFTPG-SLIATFSSPTLALPDGPSMRTTCNYCLRVGSNEGFSPVSLK 98
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC +C+R W H+ EC R+ E G
Sbjct: 99 ACTGCKAAVYCGPTCQRAHWKSIHKAECKMFARVRETTG 137
>gi|91076642|ref|XP_970424.1| PREDICTED: similar to Buzidau CG13761-PB [Tribolium castaneum]
gi|270002373|gb|EEZ98820.1| hypothetical protein TcasGA2_TC004426 [Tribolium castaneum]
Length = 442
Score = 44.3 bits (103), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 46/104 (44%), Gaps = 4/104 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
VH+G T++ EKP +V+ C+ C + C+ C YC + C+++ W
Sbjct: 7 VHEGCTIYKEKPFVYVLSSKLRTEYCDFCLKK-GQFMKCSGCHYVYYCGKVCQKDGW-SV 64
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLET 200
H+ EC G R+ + A +LI + + Y+LE
Sbjct: 65 HKSECRGLKRVAPRILPDAARFIARLIHILRKGGDLVKSYYLEN 108
>gi|146091661|ref|XP_001470086.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134084880|emb|CAM69278.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 712
Score = 44.3 bits (103), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
P+PC+ A+YCSESC+ +AW+ +H EC R + V + H L + +
Sbjct: 426 PVPCDSNCSALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTT 478
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 44.3 bits (103), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGR + D G+ + +KP V S S C+ C + + C+ C + YC
Sbjct: 21 KGRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCFKTNNLK-KCSACQVVWYCGS 79
Query: 149 SCRREAWLRYHRWECHGALRL 169
SC++ W + HR EC RL
Sbjct: 80 SCQKSEW-KLHRDECKALTRL 99
>gi|154340255|ref|XP_001566084.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063403|emb|CAM39582.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 713
Score = 44.3 bits (103), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 27/44 (61%)
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH 177
P+PC+ A+YCSESC+ +AW+ +H EC R + V + H
Sbjct: 427 PVPCDSSCNALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVH 470
>gi|398017931|ref|XP_003862152.1| hypothetical protein, conserved [Leishmania donovani]
gi|322500381|emb|CBZ35458.1| hypothetical protein, conserved [Leishmania donovani]
Length = 712
Score = 44.3 bits (103), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/53 (35%), Positives = 30/53 (56%)
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
P+PC+ A+YCSESC+ +AW+ +H EC R + V + H L + +
Sbjct: 426 PVPCDSNCSALYCSESCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTT 478
>gi|440794181|gb|ELR15350.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 823
Score = 44.3 bits (103), Expect = 0.033, Method: Composition-based stats.
Identities = 21/60 (35%), Positives = 36/60 (60%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
AS+ + ++ S GR V+A ED+ G+ + E+ A V+L P +++C+HC T + IP
Sbjct: 256 ASSAVAIRRSPEAGRFVVAAEDIKLGEPVTREEAYAAVLLRPYELTHCHHCFTQEAVLIP 315
>gi|299115343|emb|CBN74163.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 579
Score = 44.3 bits (103), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 44/101 (43%), Gaps = 12/101 (11%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC------TSIS 132
+ K+ + GR VIA D+ G+ + E P V+ C+ C +CC I
Sbjct: 72 DTFKVTEDSVSGRCVIASRDLKAGELVLREPPFVKVVRRDCASRQCAYCCQQVTERGKIE 131
Query: 133 APIPCNECILAVYCSESCR-REAWLRYHRWECHGALRLLEA 172
A +P AVYCS +C+ RE LR G L + A
Sbjct: 132 ADVP-----FAVYCSRACQAREDALRAAEASALGKLAGISA 167
>gi|410903311|ref|XP_003965137.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 471
Score = 44.3 bits (103), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 54/131 (41%), Gaps = 9/131 (6%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
KGR + A +++ G+ +F E + V+ + C+ C + C +C A Y
Sbjct: 14 GGIKGRGLRATKEISTGEVVFAEPSYSAVVFDSFASQVCHSCFRHQAQLHRCGQCKFAHY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD-------RYKEVYHL 198
C+ +C+ W H+ EC +L G ++ H D + V L
Sbjct: 74 CNRTCQTACW-EEHKQECQAIRKLGTVPGDKVRLAARVMWRIHKDTGVASDSQLLSVEEL 132
Query: 199 ETHLQDMRPED 209
E H+ D+ PED
Sbjct: 133 EDHVADL-PED 142
>gi|358059760|dbj|GAA94529.1| hypothetical protein E5Q_01181 [Mixia osmundae IAM 14324]
Length = 554
Score = 44.3 bits (103), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 42/90 (46%), Gaps = 4/90 (4%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP- 136
S+K +M+ K R V A + GDTLF +P V+ ++ C HC S
Sbjct: 15 SDKFEMRMHPIKKRSVHARQAFAAGDTLFETRPPFAVVGNSDILTRCFHCLKPTSPEREL 74
Query: 137 --CNECILAVYCSESCRREAWL-RYHRWEC 163
C+ C LA +C +C + W H++EC
Sbjct: 75 KRCSSCQLARFCGPACVKAGWTPEGHKYEC 104
>gi|347830753|emb|CCD46450.1| hypothetical protein [Botryotinia fuckeliana]
Length = 240
Score = 44.3 bits (103), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 21/111 (18%)
Query: 87 NAKGRHVIAVEDVHKGDTLF-VEKPV-AFVILPPCSMSNCNHCCT--------------- 129
+ K + + V GD L VEKP+ A V P + C++C
Sbjct: 51 DPKNTGLFVDKAVSAGDILLKVEKPLFAIVEETPMRETTCDNCFAYQGMEELLGTVSTAK 110
Query: 130 -SISAPIP---CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIA 176
I IP C C + YC++ CR EAW +H++EC ++L V +A
Sbjct: 111 KDIETAIPFNKCGSCRVFYYCNKKCREEAWEHHHKYECLILAKILATVNLA 161
>gi|194913303|ref|XP_001982670.1| GG12606 [Drosophila erecta]
gi|190648346|gb|EDV45639.1| GG12606 [Drosophila erecta]
Length = 468
Score = 44.3 bits (103), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/103 (26%), Positives = 48/103 (46%), Gaps = 4/103 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP AFV+ + C++C + + C+ C YC+ +C+ +AW +
Sbjct: 35 IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCNRACQMQAWAQ- 92
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC ++ V A + +LIL H Y+ E
Sbjct: 93 HKHECPFLKKVHPRVVPDAARMLCRLILRLEHGGDLIRGYYTE 135
>gi|66822115|ref|XP_644412.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|66822967|ref|XP_644838.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74857697|sp|Q557F6.1|Y3591_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0273591
gi|60472535|gb|EAL70486.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60472965|gb|EAL70914.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 413
Score = 44.3 bits (103), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 5/88 (5%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---- 136
+K+ S +GR++IA D+ G+++ K V ++C +C I +P P
Sbjct: 8 LKLSNSELEGRYIIANRDIDIGESILKCKSYFAVTCEDFKKNSCYNCIKLIKSPSPQQVP 67
Query: 137 -CNECILAVYCSESCRREAWLRYHRWEC 163
C C YCSE C+++ ++ +EC
Sbjct: 68 RCFGCNEVWYCSEKCKQDNQAKHQHYEC 95
>gi|195163295|ref|XP_002022487.1| GL12938 [Drosophila persimilis]
gi|194104479|gb|EDW26522.1| GL12938 [Drosophila persimilis]
Length = 420
Score = 44.3 bits (103), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
V +G + EKP AFV+ + C++C + C+ C YC+ SC+ +AW +
Sbjct: 6 VKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKVR-KCSNCRYVSYCNRSCQTQAWAQ- 63
Query: 159 HRWEC 163
H+ EC
Sbjct: 64 HKHEC 68
>gi|28317148|gb|AAO39589.1| LD11102p [Drosophila melanogaster]
Length = 687
Score = 43.9 bits (102), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 3/77 (3%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
K V A DV KG+ + VE PV F P NC C C+ C YCS+
Sbjct: 238 KSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYRSYCSK 295
Query: 149 SCRREAWLRYHRWECHG 165
SC + + H++EC+G
Sbjct: 296 SCMK-SDAEVHQYECYG 311
>gi|195477573|ref|XP_002100247.1| GE16937 [Drosophila yakuba]
gi|194187771|gb|EDX01355.1| GE16937 [Drosophila yakuba]
Length = 469
Score = 43.9 bits (102), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 4/103 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP AFV+ + C++C + + C+ C YC SC+ +AW +
Sbjct: 35 IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWAQ- 92
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC ++ V A + +LIL H Y+ E
Sbjct: 93 HKHECPFLKKVHPRVVPDAARMLCRLILRLEHGGDLIRGYYTE 135
>gi|157871558|ref|XP_001684328.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68127397|emb|CAJ05004.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 712
Score = 43.9 bits (102), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 19/61 (31%), Positives = 32/61 (52%)
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYK 193
P+PC+ A+YCSE C+ +AW+ +H EC R + V + H L + + + +
Sbjct: 426 PVPCDSNCSALYCSEGCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTTRSARSFV 485
Query: 194 E 194
E
Sbjct: 486 E 486
>gi|429860577|gb|ELA35307.1| mynd finger family protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 551
Score = 43.9 bits (102), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 47/107 (43%), Gaps = 12/107 (11%)
Query: 80 KIKMQTSNAKG-RHVIAVEDVHKGDTL-FVEKPVAFVILPPC--SMSNCNHCC-----TS 130
+++ AKG R + A GD + ++P V+LPP ++ CNHC
Sbjct: 9 EVRGSKGGAKGGRSIHATRRFKPGDVIALFDEPA--VVLPPGHRALEYCNHCLKRQGHAG 66
Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAH 177
+ C C YC +C++ W H+ EC A++ L V AH
Sbjct: 67 VGKLRACTGCKTVAYCGAACQKANWTLIHKVECK-AIQFLHEVKPAH 112
>gi|328708541|ref|XP_003243720.1| PREDICTED: hypothetical protein LOC100572177 [Acyrthosiphon pisum]
Length = 125
Score = 43.9 bits (102), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 26/44 (59%), Gaps = 5/44 (11%)
Query: 125 NHC----CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECH 164
NHC C + A I CN+C +A YCSE CR+ W H +ECH
Sbjct: 77 NHCHRENCNNFGATI-CNKCKMARYCSEICRKRHWWSKHSFECH 119
>gi|195347982|ref|XP_002040530.1| GM18871 [Drosophila sechellia]
gi|194121958|gb|EDW44001.1| GM18871 [Drosophila sechellia]
Length = 466
Score = 43.9 bits (102), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP AFV+ + C++C + + C+ C YC SC+ +AW +
Sbjct: 33 IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWAQ- 90
Query: 159 HRWEC 163
H+ EC
Sbjct: 91 HKHEC 95
>gi|201065683|gb|ACH92251.1| FI04131p [Drosophila melanogaster]
Length = 682
Score = 43.9 bits (102), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 36/77 (46%), Gaps = 3/77 (3%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
K V A DV KG+ + VE PV F P NC C C+ C YCS+
Sbjct: 238 KSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYRSYCSK 295
Query: 149 SCRREAWLRYHRWECHG 165
SC + + H++EC+G
Sbjct: 296 SCMK-SDAEVHQYECYG 311
>gi|386765940|ref|NP_650589.2| CG18213 [Drosophila melanogaster]
gi|383292760|gb|AAF55379.4| CG18213 [Drosophila melanogaster]
Length = 879
Score = 43.9 bits (102), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 3/82 (3%)
Query: 84 QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILA 143
+ S K V A DV KG+ + VE PV F P NC C C+ C
Sbjct: 233 EKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYR 290
Query: 144 VYCSESCRREAWLRYHRWECHG 165
YCS+SC + + H++EC+G
Sbjct: 291 SYCSKSCMK-SDAEVHQYECYG 311
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 43.9 bits (102), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVSERGNHCEFCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C+RE W H+ EC
Sbjct: 74 CNVECQREDW-PMHKLEC 90
>gi|125981921|ref|XP_001354964.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
gi|54643276|gb|EAL32020.1| GA12511 [Drosophila pseudoobscura pseudoobscura]
Length = 439
Score = 43.9 bits (102), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
V +G + EKP AFV+ + C++C + C+ C YC+ SC+ +AW +
Sbjct: 6 VKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKVR-KCSNCRYVSYCNRSCQTQAWAQ- 63
Query: 159 HRWEC 163
H+ EC
Sbjct: 64 HKHEC 68
>gi|352962201|gb|AEQ62998.1| RE05389p1 [Drosophila melanogaster]
Length = 879
Score = 43.9 bits (102), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 38/82 (46%), Gaps = 3/82 (3%)
Query: 84 QTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILA 143
+ S K V A DV KG+ + VE PV F P NC C C+ C
Sbjct: 233 EKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRYR 290
Query: 144 VYCSESCRREAWLRYHRWECHG 165
YCS+SC + + H++EC+G
Sbjct: 291 SYCSKSCMK-SDAEVHQYECYG 311
>gi|345480324|ref|XP_003424127.1| PREDICTED: SET and MYND domain-containing protein 3-like [Nasonia
vitripennis]
Length = 391
Score = 43.9 bits (102), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ KGD L KP A+V+ ++C++C S + C+ C YC SC++E+W
Sbjct: 3 IKKGDCLISSKPFAYVLSSKHKDNHCDYCFKS-GKLLKCSGCQYVYYCDRSCQKESW-SV 60
Query: 159 HRWECHGALRL 169
H+ EC R+
Sbjct: 61 HKSECINLKRI 71
>gi|170073283|ref|XP_001870345.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167869783|gb|EDS33166.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 266
Score = 43.9 bits (102), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 125 NHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
+HC S + PC C+ +YC E CR+++W +H++EC A +L
Sbjct: 106 DHCGVKFSNSLKPCPGCVFFMYCGEECRQKSWKLWHQFECPVATKL 151
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 43.9 bits (102), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C++C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCDYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C C++E W H+ EC
Sbjct: 74 CDVECQKEDW-PMHKLEC 90
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 43.9 bits (102), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 46/109 (42%), Gaps = 2/109 (1%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A +++ S GR ++ G L E P A+V+ C+ C S +
Sbjct: 7 AMESVRVVKSATCGRSLVFTSKFAAGKCLLEELPYAYVLCNKKRGLFCDFCLKQCSTLLR 66
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLI 183
C+ C YCS C++ W R H+ +C R+ V +A L ++I
Sbjct: 67 CSSCKYVRYCSRPCQKGDWKRCHKQDCKTLKRIHPRVPPDLAQLLAQII 115
>gi|340054943|emb|CCC49251.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 434
Score = 43.9 bits (102), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 13/90 (14%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS------- 132
++++ + GR + A+ + G + V++P + P S C HC + IS
Sbjct: 154 ELRLPHESVYGRGIYALTAISAGTVVMVDEPFVVQRMVP---SACAHCLSDISTTSKEGG 210
Query: 133 ---APIPCNECILAVYCSESCRREAWLRYH 159
+ C C + YCS CR AW YH
Sbjct: 211 AAVGVVLCTHCGVESYCSTGCREAAWREYH 240
>gi|170074355|ref|XP_001870561.1| U5 snRNP-specific protein [Culex quinquefasciatus]
gi|167871141|gb|EDS34524.1| U5 snRNP-specific protein [Culex quinquefasciatus]
Length = 951
Score = 43.9 bits (102), Expect = 0.049, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
IPC C+ +YC E CR+++W +H++EC A +L
Sbjct: 226 IPCPGCVFFMYCGEECRQKSWNLWHQFECPVAPKL 260
>gi|432852547|ref|XP_004067302.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 1
[Oryzias latipes]
Length = 435
Score = 43.5 bits (101), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+ D G+ +F ++V+ ++C HC T C +C A Y
Sbjct: 14 SPDKGRGLRALRDFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C+R W H+ EC
Sbjct: 74 CNVDCQRGDW-PMHKLEC 90
>gi|432852549|ref|XP_004067303.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like isoform 2
[Oryzias latipes]
Length = 424
Score = 43.5 bits (101), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+ D G+ +F ++V+ ++C HC T C +C A Y
Sbjct: 14 SPDKGRGLRALRDFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C+R W H+ EC
Sbjct: 74 CNVDCQRGDW-PMHKLEC 90
>gi|340711607|ref|XP_003394366.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
terrestris]
Length = 439
Score = 43.5 bits (101), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 52/111 (46%), Gaps = 6/111 (5%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ KG T+ KP A+V+ C++C S C+ C YC++SC++ +W
Sbjct: 8 IKKGTTILTAKPFAYVLCSKYKNVRCDYCFKS-GKLFRCSACQYVYYCNQSCQQMSW-PM 65
Query: 159 HRWECHGALRLLEAVGI---AHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
H EC L+ GI A L ++I+ + ++ Y+ ET+ + +
Sbjct: 66 HSKEC-ARLKKFSPWGISNVARLMARIIIKLNQGGDEEKGYYNETNYRKFK 115
>gi|24639370|ref|NP_524768.2| buzidau [Drosophila melanogaster]
gi|6018870|emb|CAB58065.1| EG:BACR7C10.4 [Drosophila melanogaster]
gi|22831587|gb|AAF45795.2| buzidau [Drosophila melanogaster]
gi|223718730|gb|ACN22202.1| MIP05435p [Drosophila melanogaster]
Length = 468
Score = 43.5 bits (101), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 2/65 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP AFV+ + C++C + + C+ C YC SC+ +AW +
Sbjct: 35 IKRGQRILTEKPFAFVLKSQYRLERCDNCLEATKV-LKCSNCRYVSYCHRSCQMQAWGQ- 92
Query: 159 HRWEC 163
H+ EC
Sbjct: 93 HKHEC 97
>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
[Brachypodium distachyon]
gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
[Brachypodium distachyon]
Length = 389
Score = 43.5 bits (101), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 12/119 (10%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP------PCSMSNCNHCCTS 130
A + +++ +GR ++A D+ +G+ L E P+ ++ P P S C C
Sbjct: 2 AGDALRVADLPGRGRGLVAARDILEGEVLLSEPPI--LLYPSSLASLPSYCSACFRCLPQ 59
Query: 131 ISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHS 189
PC C A +CS +C A + R C AL L A +H L L+S++S
Sbjct: 60 APHAAPCPSCRAAAFCSPAC---AAASHPRLLC-AALSRLAAAPESHQEQLLFLLSAYS 114
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 43.5 bits (101), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 36/77 (46%), Gaps = 5/77 (6%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIPCNECILAVYC 146
KGR +IA GDT+ P +++P S ++ C+HC P C+ C YC
Sbjct: 14 KGRCLIASRSFTPGDTILTFTPT--ILIPSLSHASTVCSHCFKP-GEPRSCSRCHAVAYC 70
Query: 147 SESCRREAWLRYHRWEC 163
SC+ AW H EC
Sbjct: 71 DASCQSAAWKAIHSKEC 87
>gi|170033581|ref|XP_001844655.1| U5 snRNP-specific protein [Culex quinquefasciatus]
gi|167874623|gb|EDS38006.1| U5 snRNP-specific protein [Culex quinquefasciatus]
Length = 964
Score = 43.5 bits (101), Expect = 0.055, Method: Composition-based stats.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
IPC C+ +YC E CR+++W +H++EC A +L
Sbjct: 132 IPCPGCVFFMYCGEECRQKSWNLWHQFECPVAPKL 166
>gi|402857226|ref|XP_003893168.1| PREDICTED: N-lysine methyltransferase SMYD2 [Papio anubis]
Length = 433
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|297661992|ref|XP_002809506.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pongo abelii]
Length = 433
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|440800364|gb|ELR21403.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 648
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 50/116 (43%), Gaps = 34/116 (29%)
Query: 124 CNHCCTSISAP--------------IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C HC ++ P +P E VYCSESCR+EAW +H C G +L
Sbjct: 315 CPHCAGDLAKPPPVLWQSVFRAYPSLPPAESYQEVYCSESCRKEAWESHHHSLCPG--QL 372
Query: 170 LEAVGIAHLALKLILVSSHSDRYKEV-YHL-----------ETHLQDMRPEDLYQY 213
E G H AL +L+ Y+EV YH H Q+ +P+ ++ +
Sbjct: 373 AENQG-KHFALLNLLIM-----YREVGYHTIAMMLRILAQKAKHQQEAKPDAVFPF 422
>gi|297280801|ref|XP_001106482.2| PREDICTED: SET and MYND domain-containing protein 2 [Macaca
mulatta]
Length = 433
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C +C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|170060243|ref|XP_001865716.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167878780|gb|EDS42163.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 510
Score = 43.5 bits (101), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 24/35 (68%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
IPC C+ +YC E CR+++W +H++EC A +L
Sbjct: 226 IPCPGCVFFMYCGEECRQKSWNLWHQFECQVAPKL 260
>gi|310656796|gb|ADP02224.1| zf-MYND domain-containing protein [Triticum aestivum]
Length = 520
Score = 43.5 bits (101), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 26/95 (27%), Positives = 44/95 (46%), Gaps = 2/95 (2%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A + + + KGR ++A G+ + ++P A S+C+HC S +
Sbjct: 13 AGRGLAVASVPGKGRGLVAARSFFPGEVIICQEPYASTPNRISVGSSCDHCFASGNLR-K 71
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C +A YC C++E W + H+ EC L E
Sbjct: 72 CSVCRVAWYCGSVCQKEEW-KLHQLECQAISALTE 105
>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 558
Score = 43.5 bits (101), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 19/108 (17%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGD-TLFVEKPVAFVILPPCSMSNCNHC------ 127
M A+ ++K + GR + A +D GD L + +P + C C
Sbjct: 1 MAANVQVKQSGIESAGRGLFAKKDFQPGDIVLALNRPFVAELDQDRLRDTCAWCLQRGAS 60
Query: 128 ------CTSISAPIP------CNECILAVYCSESCRREAWLRYHRWEC 163
TS+ PI C C YCS++C+ AW R H++EC
Sbjct: 61 AEDRAKATSLGLPIAGIETKACTGCKRVRYCSKTCQTRAWKREHKYEC 108
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 43.5 bits (101), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 8/96 (8%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC---TSISAPIP- 136
+++ ++AKGR + A G + ++P A+ ++ C++C AP+
Sbjct: 6 VELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGAPVED 65
Query: 137 ---CNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C C A YC++ C+++AW H+ EC R+
Sbjct: 66 LHRCTGCKFAQYCTKECQKKAWPE-HKQECAAIKRI 100
>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Taeniopygia guttata]
Length = 404
Score = 43.1 bits (100), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 50/114 (43%), Gaps = 32/114 (28%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC-------------- 128
S+AKG+ + A ++ KG+T+FVE+PV F+ + C+HC
Sbjct: 19 SSAKGKGLFATRNIRKGETVFVERPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLL 78
Query: 129 --TSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
+S+ P P C C + YCS CR+ A +YH+ C G R
Sbjct: 79 GRSSLVLPHPEQCSIRKDLHQQCPRCQV-TYCSAECRQAALEQYHQVLCLGPSR 131
>gi|195441983|ref|XP_002068740.1| GK17864 [Drosophila willistoni]
gi|194164825|gb|EDW79726.1| GK17864 [Drosophila willistoni]
Length = 504
Score = 43.1 bits (100), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 36/144 (25%), Positives = 71/144 (49%), Gaps = 16/144 (11%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ ++ GR V+AV+++ G+TL VE+P+ VILP C C + C
Sbjct: 9 AKKCQVKHNDTLGRFVVAVDNIRAGETLLVEEPI--VILPHLGERRCARCLNLTTN--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
++C L C E C+R A L+ + L+ + + + +K +L+ H++
Sbjct: 65 SKCQLLPLCEECKDHEERDCQRLADLQLSEEQVQ-QLQDSQRTEVQSM-VKCLLLREHTE 122
Query: 191 R---YKEVYHLETHLQDMRPEDLY 211
Y+E+Y ++ L R +++
Sbjct: 123 TRSLYEEMYQMDAQLDARRGTEVW 146
>gi|410929759|ref|XP_003978267.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Takifugu rubripes]
Length = 790
Score = 43.1 bits (100), Expect = 0.069, Method: Composition-based stats.
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 5/63 (7%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECH----GALRLLEAVGIAHL 178
C +C S+S + PC C +YCS CR +AW H+ EC G + AVG++
Sbjct: 712 CYYCGRSVSVTLTPCYRCYKVLYCSRPCRLKAWDAIHKKECFRVKAGTRDCVAAVGLSQN 771
Query: 179 ALK 181
L+
Sbjct: 772 GLE 774
>gi|401424796|ref|XP_003876883.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322493127|emb|CBZ28412.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 712
Score = 43.1 bits (100), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 18/57 (31%), Positives = 31/57 (54%)
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
P+PC+ A YCSE+C+ +AW+ +H EC R + V + H L + + ++
Sbjct: 426 PVPCDSNCSAFYCSENCKSKAWVEHHWVECTQRGRWRDGVPVVHRLLDQFVTARSAN 482
>gi|328724231|ref|XP_003248075.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 624
Score = 43.1 bits (100), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 62/140 (44%), Gaps = 10/140 (7%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
T S+ + + L G++ + S ++++ S GR V A D++ GD + +++P
Sbjct: 205 TKSTTTMETLCVEQLVGGKNENIPALSAFVELKLSENMGRGVYATCDINPGDVVAIDEPY 264
Query: 112 AFVILPPCS----MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
I P S + + N C S+ I C +C+L YC++ C + + H C
Sbjct: 265 ---ICGPLSEFREVCHYNGCMKIHSSLIHCPKCLLVFYCNKDCMNKDYKDGHNLLCPIMY 321
Query: 168 RLLEAVGIA---HLALKLIL 184
+ GI LA+K L
Sbjct: 322 HIKSRPGITKINELAIKWFL 341
>gi|170044639|ref|XP_001849948.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867702|gb|EDS31085.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 282
Score = 43.1 bits (100), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 91 RHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNECILAVYCSES 149
R ++A +D + GD + + + I NC+HC S + IP C+ ++YC E
Sbjct: 165 RILVAEKDFNPGDVIMDAEQLLTAIDFNLCYENCSHCGVKFSNSLIPFPGCVFSMYCGEE 224
Query: 150 CRREAWLRYHRW 161
CR+ +W +H++
Sbjct: 225 CRQNSWKLWHQF 236
>gi|299740414|ref|XP_001838814.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
gi|298404233|gb|EAU83041.2| hypothetical protein CC1G_08978 [Coprinopsis cinerea okayama7#130]
Length = 1320
Score = 43.1 bits (100), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 44/104 (42%), Gaps = 12/104 (11%)
Query: 72 HTLMCASNKIKM--QTSNAKGRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMS-----N 123
+ L A N+IK+ T KGR V DV D + V F PP + N
Sbjct: 381 YRLQMAFNEIKIGSDTEYWKGRRYEDVSDVTVYLDLMNVLYGFGFGRAPPLTQDPRFCYN 440
Query: 124 CNHCCTS----ISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
HC T ++ C+ C YCS C+++ W R HR EC
Sbjct: 441 IAHCATGYHPYMAGTKWCSGCKSVAYCSRECQKQDWKRVHREEC 484
>gi|307111681|gb|EFN59915.1| hypothetical protein CHLNCDRAFT_132958 [Chlorella variabilis]
Length = 853
Score = 43.1 bits (100), Expect = 0.076, Method: Composition-based stats.
Identities = 40/145 (27%), Positives = 65/145 (44%), Gaps = 11/145 (7%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS---A 133
A + +Q + GR + A E++ G + E+P AF + S C+ C ++ A
Sbjct: 327 AGPALVVQQTEDAGRGLSAAEELPAGRDVLAEQPFAFALTKLGRRSVCSTCLAPLADAPA 386
Query: 134 PIPCNECILAVYCSESCRREAWLRYHR---WECHGALRLL---EAVGIAHLALKLILVSS 187
P C C + YC+ +CR A +H+ EC +L EAV LA +L +
Sbjct: 387 PYYCRRCPMPAYCTPACR--AADPFHQPGGPECGRPWTVLLPAEAVAALRLARRLRFGGA 444
Query: 188 HSDRYKEVYHLETHLQDMRPEDLYQ 212
S + V L TH ++ P ++ Q
Sbjct: 445 GSPAARHVASLGTHFAELDPTEVVQ 469
>gi|168011005|ref|XP_001758194.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162690650|gb|EDQ77016.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 532
Score = 43.1 bits (100), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 4/76 (5%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI---SAPIPCNECILAVY 145
+GR IA D+ G+ + E+ VAFV + C+ CC + + P+ C C A+Y
Sbjct: 93 RGRCAIASRDIKAGEVIVAEEAVAFVPRSQDRTAVCHECCRDLHQNARPVECPGCKHAIY 152
Query: 146 CSESCRREAWLRYHRW 161
C + C A + +W
Sbjct: 153 C-QDCEASAMASHRKW 167
>gi|219122841|ref|XP_002181746.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407022|gb|EEC46960.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 596
Score = 43.1 bits (100), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 44/93 (47%), Gaps = 2/93 (2%)
Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLA 179
S NC C S + C C A YCS C++E W HR +C +A +
Sbjct: 6 SRGNCQGCRKSTGKLLRCQACKQAWYCSIECQKEDWKTNHRKQCRDVAASNKAQATWN-E 64
Query: 180 LKLILVSSHSDRYKEVYHLETH-LQDMRPEDLY 211
L+ ++ S+ D+ +E +H + ++ +R +D Y
Sbjct: 65 LRTLMASASLDQAQEGFHKASDAVKQLRSDDAY 97
>gi|389631537|ref|XP_003713421.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|351645754|gb|EHA53614.1| hypothetical protein MGG_15522 [Magnaporthe oryzae 70-15]
gi|440463734|gb|ELQ33288.1| hypothetical protein OOU_Y34scaffold00979g72 [Magnaporthe oryzae
Y34]
gi|440483676|gb|ELQ64025.1| hypothetical protein OOW_P131scaffold00888g14 [Magnaporthe oryzae
P131]
Length = 542
Score = 43.1 bits (100), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 49/116 (42%), Gaps = 2/116 (1%)
Query: 90 GRHVIAVEDVHKGDTLF-VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV-YCS 147
GR + A D GD + + P+A + + C+HC + C C AV YCS
Sbjct: 28 GRSLRATRDFQPGDLIAEFDNPLAAFPDAARASTTCHHCLDQNAKVFGCMGCEKAVKYCS 87
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQ 203
C+R W H EC ++ AVG L + + R+ EV L + L+
Sbjct: 88 SECQRANWKLVHSKECKVFRKVQTAVGKDWLPTPVRTLVQLLVRWAEVQQLVSQLE 143
>gi|330801330|ref|XP_003288681.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
gi|325081244|gb|EGC34766.1| hypothetical protein DICPUDRAFT_34471 [Dictyostelium purpureum]
Length = 378
Score = 43.1 bits (100), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 18/99 (18%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTL-FVEKPVAFVI---LPPCSMSNCNHCCTSISAP---- 134
++ S +G +V+A +D+ GD + F EKP + I + C++C T +
Sbjct: 13 VKNSKDQGNYVVASKDIKAGDFIVFREKPYSCCINFKIEENEKKICHYCFTINNNNNLNN 72
Query: 135 ----------IPCNECILAVYCSESCRREAWLRYHRWEC 163
I CN C L YCSE C+ E + H++EC
Sbjct: 73 NIHNMHTDNFIICNNCKLNWYCSEKCKNEDYQTSHQYEC 111
>gi|452820074|gb|EME27122.1| SET and MYND domain-containing protein [Galdieria sulphuraria]
Length = 464
Score = 42.7 bits (99), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 29/104 (27%), Positives = 48/104 (46%), Gaps = 6/104 (5%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRR 152
+ A + G+++ VE+ ++V + C+ C + C++C A YC C++
Sbjct: 3 LFATKPYEVGNSILVEEAYSWVTFRDNGPNICHTCFKFSKSLKKCSQCKFARYCGLDCQK 62
Query: 153 EAWLRY-HRWECHGALRLLEA-----VGIAHLALKLILVSSHSD 190
+AW HRWEC +EA V +A L L L S+ D
Sbjct: 63 KAWKENSHRWECKAIASSVEAKIPSIVRLAALFLFRALNSNDKD 106
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 42.7 bits (99), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 2/81 (2%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGR ++ ++ + G+ + ++P V ++S C+ C S S C+ C + YC
Sbjct: 21 KGRCLLTTKNFNPGEVILRQEPYVCVPNNSSTVSRCDGCFASESLK-KCSACQVVWYCGS 79
Query: 149 SCRREAWLRYHRWECHGALRL 169
+C++ W + HR EC+ RL
Sbjct: 80 TCQKSEW-KLHRLECNALSRL 99
>gi|302835433|ref|XP_002949278.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
nagariensis]
gi|300265580|gb|EFJ49771.1| hypothetical protein VOLCADRAFT_104220 [Volvox carteri f.
nagariensis]
Length = 467
Score = 42.7 bits (99), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP--IPCN 138
++++T N KGR + V G+T+ E P+ ++ P + S C C ++ I C+
Sbjct: 15 LEVRTHNRKGRSLYTTRLVKGGETVLDEAPLLLLVAPETAQSTCIACLRAVRPQECIECS 74
Query: 139 ECILAVYCSESCRREA----WL 156
C A +CS +C++ A WL
Sbjct: 75 VCRQACFCSATCQQTAQGTPWL 96
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 42.7 bits (99), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 5/87 (5%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIP 136
+ + ++ + +GR + A + V G L V P+ +++P S C+ C ++ P P
Sbjct: 7 DGVVIKPHSTRGRALHATKTVAAGGVLAVFSPL--ILVPSLSHLTIVCSFCLRGVT-PRP 63
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C A YC C+ AW H EC
Sbjct: 64 CSRCRAAYYCDAQCQAAAWSSGHAAEC 90
>gi|66804135|ref|XP_635865.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
gi|60464204|gb|EAL62363.1| hypothetical protein DDB_G0290183 [Dictyostelium discoideum AX4]
Length = 447
Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 55/121 (45%), Gaps = 18/121 (14%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTL-FVEKP--VAFVILPPC 119
+PSL E L+ S ++ S KGR+V+A D+ G+++ F E+P V P
Sbjct: 56 IPSLLSSE---LISNSKNSIIKQSKEKGRYVVAKRDIKAGESIVFREEPYVVCMNYKDPY 112
Query: 120 SMSNCNHCC--------TSISAP---IPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
C+HC + I+ P + C C + YCS C+ L YH+ EC R
Sbjct: 113 ERKICHHCFGLSSTTSKSMITTPDFTLHCETCNIVWYCSNYCQSND-LTYHKHECFTYKR 171
Query: 169 L 169
+
Sbjct: 172 M 172
>gi|66363372|ref|XP_628652.1| SET domain protein with MYND insert (Skm/BOP family)
[Cryptosporidium parvum Iowa II]
gi|46229836|gb|EAK90654.1| SET domain protein with MYND insert (Skm/BOP family)
[Cryptosporidium parvum Iowa II]
Length = 587
Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 9/97 (9%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----- 134
KI+++ KGR +IA E++ G+++ +E+P ++ C+ C + +
Sbjct: 20 KIEIKEDERKGRSIIAKEEIQIGESILMEEPYCRILFSDNIEEICDTCFNYLRSEGSYSE 79
Query: 135 --IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
+ C EC +CS+ C E+ H +EC G L+L
Sbjct: 80 CILECQECKKIKFCSKKCMEES-KTIHHFEC-GILKL 114
>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
Length = 456
Score = 42.7 bits (99), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 6/81 (7%)
Query: 86 SNAKGRHVIAV-EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCC--TSISAPIPCNECIL 142
S A+ R V+A + V G + +A +LP C+ CC T++ A C+ C
Sbjct: 12 SQARSRAVVAPGQTVKPGQVILRSSSLATSLLPKSRGQRCDECCRQTTVKA---CSRCKE 68
Query: 143 AVYCSESCRREAWLRYHRWEC 163
A YC C+ AW +HR C
Sbjct: 69 AFYCDTRCQSAAWKSHHRTTC 89
>gi|115482840|ref|NP_001065013.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|62733614|gb|AAX95731.1| tetratricopeptide repeat, putative [Oryza sativa Japonica Group]
gi|110289368|gb|ABB47859.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113639622|dbj|BAF26927.1| Os10g0506800 [Oryza sativa Japonica Group]
gi|215695497|dbj|BAG90688.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 793
Score = 42.7 bits (99), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
KGR + + D+ + E P+A +I+ C ++C++C + A + C C + +YCS
Sbjct: 227 KGRGMSSPNDISPASLIHAEDPLAVIIMKSCRDTHCHYCFSEAPADVVVCPSCTIPIYCS 286
Query: 148 ESCRREA 154
C+ +A
Sbjct: 287 NRCQEKA 293
>gi|145517400|ref|XP_001444583.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412005|emb|CAK77186.1| unnamed protein product [Paramecium tetraurelia]
Length = 354
Score = 42.7 bits (99), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 2/62 (3%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE 194
+ C+ C A YCS CR W YH C+G L+ V ++ L ++ L+ S S + +
Sbjct: 23 VTCDSCSNAFYCSTKCRDFDWEGYHYLVCNGLFELVPKVSLSQLNDEMDLIGSGS--FGQ 80
Query: 195 VY 196
VY
Sbjct: 81 VY 82
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 45/95 (47%), Gaps = 13/95 (13%)
Query: 73 TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
T+MC + + AKG VIA +D+ G T+ E +A+ L MS C+HC T +
Sbjct: 57 TIMC-------EKTPAKGTVVIAKKDIPAGTTILAEPAIAWQPLEDRIMSVCHHCMTEVP 109
Query: 133 A-PIPCNE----CILAVYCSESCRREAWLRYHRWE 162
+ C E C YCS C REA HR E
Sbjct: 110 RWAVGCGEGAGGCSGLGYCSPKC-REASEALHRVE 143
>gi|190348269|gb|EDK40693.2| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
6260]
Length = 637
Score = 42.4 bits (98), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 6/88 (6%)
Query: 124 CNHCCTSISA---PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAH 177
C HC + A CN C L YC++ C +W + H++EC +L + G
Sbjct: 72 CQHCTRRLPAHKSQYTCNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQLKSSFGYQEFVR 131
Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
LAL+L + S K + L TH +++
Sbjct: 132 LALRLCTLYSIPSYRKNLERLTTHREEV 159
>gi|312384571|gb|EFR29269.1| hypothetical protein AND_01935 [Anopheles darlingi]
Length = 750
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 5/92 (5%)
Query: 93 VIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCR 151
+ + + GD + E+ + I P + C++C ++ A +PC +C YCSE+C
Sbjct: 236 FVTTKRLKPGDVVLTEEACLWKISPEGQLELCDYCGADAVLALVPCVDCNRVFYCSEACH 295
Query: 152 REAWLRYHRWECHGALRLLEAVGIAHLALKLI 183
+ YH EC ++L G++ A +L+
Sbjct: 296 KAGVSEYHWLEC----KVLRNQGLSLSATQLL 323
>gi|110289369|gb|AAP54526.2| TPR Domain containing protein, expressed [Oryza sativa Japonica
Group]
Length = 687
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 1/67 (1%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IPCNECILAVYCS 147
KGR + + D+ + E P+A +I+ C ++C++C + A + C C + +YCS
Sbjct: 121 KGRGMSSPNDISPASLIHAEDPLAVIIMKSCRDTHCHYCFSEAPADVVVCPSCTIPIYCS 180
Query: 148 ESCRREA 154
C+ +A
Sbjct: 181 NRCQEKA 187
>gi|146413731|ref|XP_001482836.1| hypothetical protein PGUG_04791 [Meyerozyma guilliermondii ATCC
6260]
Length = 637
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 6/88 (6%)
Query: 124 CNHCCTSISA---PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG---IAH 177
C HC + A CN C L YC++ C +W + H++EC +L + G
Sbjct: 72 CQHCTRRLPAHKSQYTCNGCKLYTYCNQRCYDLSWEKVHQYECPVFEQLKSSFGYQEFVR 131
Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
LAL+L + S K + L TH +++
Sbjct: 132 LALRLCTLYSIPSYRKNLERLTTHREEV 159
>gi|67596952|ref|XP_666111.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54657034|gb|EAL35879.1| hypothetical protein Chro.70568 [Cryptosporidium hominis]
Length = 575
Score = 42.4 bits (98), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 48/97 (49%), Gaps = 9/97 (9%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP----- 134
KI+++ KGR +IA E++ G+++ +E+P ++ C+ C + +
Sbjct: 7 KIEIKEDKRKGRSIIAKEEIQIGESILMEEPYCRILFSDNIEEICDTCFNYLRSAGSYSE 66
Query: 135 --IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
+ C EC +CS+ C E+ H +EC G L+L
Sbjct: 67 CILECQECKKIKFCSKKCMEES-KTIHHFEC-GILKL 101
>gi|113206687|gb|ABI34494.1| SET and MYND domain containing 2a [Danio rerio]
Length = 156
Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 51/132 (38%), Gaps = 10/132 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD +F A+V+ C C T C +C A Y
Sbjct: 14 SPGKGRGLKAIKHFKVGDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAYY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLE---AVGIAHLALKLILVSSH------SDRYKEVY 196
C+ C+R W H+ EC E L ++IL H S+R +
Sbjct: 74 CNVECQRGDWP-MHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTLR 132
Query: 197 HLETHLQDMRPE 208
LE HL + E
Sbjct: 133 ELEAHLDKLDNE 144
>gi|195438687|ref|XP_002067264.1| GK16327 [Drosophila willistoni]
gi|194163349|gb|EDW78250.1| GK16327 [Drosophila willistoni]
Length = 461
Score = 42.4 bits (98), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 4/103 (3%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ +G + EKP AFV+ C++C C+ C YC+ +C+ +AWL
Sbjct: 22 IKRGQRILTEKPFAFVLKSKYRQQRCDNCLEEGKVR-KCSNCRYVYYCNRACQTQAWL-M 79
Query: 159 HRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKEVYHLE 199
H+ EC R + A + +LIL H Y+ E
Sbjct: 80 HKHECPFLKRSYPRIVPDAARMLCRLILRLEHGGDLIRGYYTE 122
>gi|229892824|ref|NP_001153563.1| N-lysine methyltransferase SMYD2 [Sus scrofa]
gi|325530260|sp|C3RZA1.1|SMYD2_PIG RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|197692954|gb|ACH71265.1| SET and MYND domain-containing 2 [Sus scrofa]
Length = 433
Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD LF A+V+ ++C C C C A Y
Sbjct: 14 SPGKGRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEFCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 74 CNVECQKEDW-PMHKLEC 90
>gi|358253876|dbj|GAA53887.1| SET and MYND domain-containing protein 4 [Clonorchis sinensis]
Length = 869
Score = 42.4 bits (98), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 19/67 (28%), Positives = 32/67 (47%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSES 149
G H+ A ++ G+ + +EKP A I + +C C +PC +C +CS
Sbjct: 324 GWHLQASRNIEAGEVILIEKPYARRICRRFLLKHCYQCFKRCLNLLPCRKCSEVGFCSTE 383
Query: 150 CRREAWL 156
C + +WL
Sbjct: 384 CEKTSWL 390
>gi|296422187|ref|XP_002840643.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636864|emb|CAZ84834.1| unnamed protein product [Tuber melanosporum]
Length = 502
Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 34/132 (25%), Positives = 61/132 (46%), Gaps = 14/132 (10%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
++ + ++E ++ ++ S+ A V LP L L A + + S+ K V
Sbjct: 169 DRFSDKTLEVLNAAMAKCSAKAELKVVDLPVL----QRDLSFAGDGDR---SSDKHLGVY 221
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILA------VYCS 147
A ED+ G+T F E V+ P + S C +C ++ C++C ++CS
Sbjct: 222 AKEDIFLGETFFSEASPLTVVADPENSSLCEYCGGNLPDQYEVCDDCWEGDEDSGVMWCS 281
Query: 148 ESCRREAWLRYH 159
E CR++A+ YH
Sbjct: 282 EECRQKAFKSYH 293
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 42.0 bits (97), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 35/151 (23%), Positives = 60/151 (39%), Gaps = 17/151 (11%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
C + ++ S GR + + +G +L +P V+ P S+C+ C S
Sbjct: 29 CIPPTLDLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPER 88
Query: 136 P----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILV------ 185
P C +C + YCS+ C+ W + H+ EC R A LA+ V
Sbjct: 89 PQLKRCAKCHVVHYCSQKCQNADW-KAHKPECDALQRWATAAPSPDLAVPEEAVRCLGRM 147
Query: 186 ------SSHSDRYKEVYHLETHLQDMRPEDL 210
+ S +E+ +++H +RPE
Sbjct: 148 LWQKQRNPSSIWSREIDSMQSHRSSLRPESF 178
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/132 (27%), Positives = 51/132 (38%), Gaps = 10/132 (7%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A++ GD +F A+V+ C C T C +C A Y
Sbjct: 14 SPGKGRGLKAIKHFKVGDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAYY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLE---AVGIAHLALKLILVSSH------SDRYKEVY 196
C+ C+R W H+ EC E L ++IL H S+R +
Sbjct: 74 CNVECQRGDW-PMHKLECSAMCAYGENWCPSETVRLVARIILKQKHQTERTPSERVLTLR 132
Query: 197 HLETHLQDMRPE 208
LE HL + E
Sbjct: 133 ELEAHLDKLDNE 144
>gi|299744169|ref|XP_001840927.2| hypothetical protein CC1G_03156 [Coprinopsis cinerea okayama7#130]
gi|298405996|gb|EAU80980.2| hypothetical protein CC1G_03156 [Coprinopsis cinerea okayama7#130]
Length = 486
Score = 42.0 bits (97), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 29/54 (53%), Gaps = 7/54 (12%)
Query: 117 PP---CSMSNCNHCCTSISAPIP----CNECILAVYCSESCRREAWLRYHRWEC 163
PP C++ + + AP+P C+ C+ VYCS C+ E WL HR+EC
Sbjct: 268 PPVMCCNLRHLDKTHGRTPAPLPEPKACSRCLCVVYCSTECQTEDWLAIHRFEC 321
>gi|328715975|ref|XP_003245797.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 632
Score = 42.0 bits (97), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 5/115 (4%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
T + S+ L G + + S ++++ S GR V A D++ GD + +++P
Sbjct: 199 TEQKRTMETPSVEHLVGGRNENIPALSAFVELKMSEDMGRGVYATRDIYPGDVVAIDEP- 257
Query: 112 AFVILP---PCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++ P + + N C A C +C L YC+E C + + H C
Sbjct: 258 -YICGPFRDDIEVCHYNGCLKLGFALFRCPKCQLVYYCNEDCMNKDYKDGHNLAC 311
>gi|195488849|ref|XP_002092487.1| GE11640 [Drosophila yakuba]
gi|194178588|gb|EDW92199.1| GE11640 [Drosophila yakuba]
Length = 497
Score = 42.0 bits (97), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 68/145 (46%), Gaps = 16/145 (11%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ ++ GR V+A+ ++ G+TL +E P+ V+LP C+ C + C
Sbjct: 9 ARKCEIKQNDTLGRFVVALCNLRAGETLLLENPI--VVLPQIGDRRCSKCFKLTQS--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
+C L C + CRR A + + L+ E I ALK +L+ H +
Sbjct: 65 RKCRLLALCEDCSDHDGRDCRRLAEMNLSNDQVE-LLQRKEHTEIQS-ALKCLLLREHEE 122
Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
Y+E+ +E+ L R D+++
Sbjct: 123 TLPLYEEMSQMESQLASRRGTDIWK 147
>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
carolinensis]
Length = 334
Score = 42.0 bits (97), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S +G + + V G L+ E+P A+ + + C C + C++C +A Y
Sbjct: 41 SPGRGNGLRLSKAVKAGALLYREEPFAYAVTKKRLGAACERCLRRKERLLRCSQCKVARY 100
Query: 146 CSESCRREAWLRYHRWEC 163
C C++EAW H+ EC
Sbjct: 101 CDARCQKEAW-EDHKREC 117
>gi|358398595|gb|EHK47946.1| hypothetical protein TRIATDRAFT_290442 [Trichoderma atroviride IMI
206040]
Length = 561
Score = 41.6 bits (96), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 70/166 (42%), Gaps = 24/166 (14%)
Query: 3 KQDVQETLSSSLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVAS 62
+QD + L R + D + D + E S+++++ + + + VA+
Sbjct: 179 RQDFDASDLPDFGLVRREVYPWNDHEPDRFSPE-----SLQSLNDGLKEMAPKCAVQVAT 233
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM- 121
LP L GES T N + T G V A ED+ G+ + E + +L +
Sbjct: 234 LPILLEGESET----DNYEIIPTCKQLG--VFAQEDIEAGEVVLRE----YTLLTANNRH 283
Query: 122 --SNCNHCCTSISA------PIPCNECILAVYCSESCRREAWLRYH 159
S C+ C + ++ PI C EC V+CS+ C A RYH
Sbjct: 284 KDSVCDACSSDLAPLGSDNQPIRCEECFDTVFCSQECHDLAQDRYH 329
>gi|328699920|ref|XP_001945553.2| PREDICTED: hypothetical protein LOC100161821 [Acyrthosiphon pisum]
Length = 719
Score = 41.6 bits (96), Expect = 0.20, Method: Composition-based stats.
Identities = 20/57 (35%), Positives = 27/57 (47%)
Query: 107 VEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ KP FV P M+ C + SA I C+ C YCS +C+R W H +C
Sbjct: 656 INKPKTFVKPPQIIMTKCYARNCTKSATIMCSSCTSVKYCSHNCQRHDWYDSHINDC 712
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 64/150 (42%), Gaps = 21/150 (14%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ +S +G + A+ + G+ LF P+A+ + C C + C+
Sbjct: 5 KVEKFSSANRGNGLRALVQLRPGELLFRSDPLAYTVCKGSRGVVCVRCIIGRKYLLYCSR 64
Query: 140 CILAVYCSESCRREAWLRYHRWECH-----------GALRLLEAVGIAHLALKLI-LVSS 187
C L C CR++AW + H+ EC ++RLL + KL+ S
Sbjct: 65 CRLLAQCGAKCRKKAW-QDHKRECKCLKSCKPRYPPDSVRLL-----GRVVFKLMEETPS 118
Query: 188 HSDRYKEVYHLETHLQDM---RPEDLYQYV 214
S++ Y LE+++ + + E L Q V
Sbjct: 119 ESEKLYSFYDLESNINKLTEDKKEGLRQLV 148
>gi|299754623|ref|XP_001841076.2| hypothetical protein CC1G_04920 [Coprinopsis cinerea okayama7#130]
gi|298410845|gb|EAU80810.2| hypothetical protein CC1G_04920 [Coprinopsis cinerea okayama7#130]
Length = 600
Score = 41.6 bits (96), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 25/42 (59%), Gaps = 2/42 (4%)
Query: 134 PIP--CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV 173
P+P C C++ YCS C+RE W HR+EC G R E++
Sbjct: 402 PVPKACRGCLVVGYCSAECQREDWQLIHRFECDGLKRQKESM 443
>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
Length = 420
Score = 41.6 bits (96), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 48/114 (42%), Gaps = 32/114 (28%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC-------------- 128
S+AKG+ + A + KG+ +FVEKPV F+ + C+HC
Sbjct: 35 SSAKGKGLFATRSIRKGEAVFVEKPVVSSQFLWNALYNYRACDHCLRALETAEENAQRLL 94
Query: 129 --TSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
+S+ P P C C + YCS CR+ A +YH+ C G R
Sbjct: 95 GRSSLVLPHPEQCSIRKDLHQQCPRCQV-TYCSAECRQAALEQYHQVLCLGPSR 147
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 41.6 bits (96), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 36/81 (44%), Gaps = 2/81 (2%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGR + D+ G+ + +KP V S C+ C + + C+ C + YC
Sbjct: 21 KGRSLFTTRDIRPGEVILSQKPYICVPNNTSLESRCDGCFKTNNLK-KCSGCQVVWYCGS 79
Query: 149 SCRREAWLRYHRWECHGALRL 169
C++ W + HR EC RL
Sbjct: 80 FCQKSEW-KLHRHECKALTRL 99
>gi|384490543|gb|EIE81765.1| hypothetical protein RO3G_06470 [Rhizopus delemar RA 99-880]
Length = 400
Score = 41.6 bits (96), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 33/128 (25%), Positives = 55/128 (42%), Gaps = 6/128 (4%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCT---SISAPIPCNECILAV-Y 145
G V+A + +G T+ E P + ++C +C T I +C V Y
Sbjct: 43 GERVVANAKLKQGRTILEEDPFIRQLNDANRTTHCTYCFTEFKKIKKLCRNKDCKWEVLY 102
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQDM 205
CS +C+++ WL H+W C L AL+ + S Y + L +++
Sbjct: 103 CSLNCQQQHWLTEHKWFCRFP-NLANTDKNVLFALQGYIASRSKSEY-TLPGLVSNIDSY 160
Query: 206 RPEDLYQY 213
+PEDL +Y
Sbjct: 161 KPEDLEEY 168
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 41.6 bits (96), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 36/78 (46%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + AV G+ +F ++V+ ++C HC T C +C A Y
Sbjct: 14 SPDKGRGLRAVRHFAVGELVFACPAYSYVLTVNERGAHCEHCFTRREDLFKCGKCKQAYY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C+R W H+ EC
Sbjct: 74 CNVDCQRGDW-PMHKLEC 90
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 43/98 (43%), Gaps = 7/98 (7%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC---CTSISA 133
A + + S KGR +IA +G+ + ++ P V+ C+ C CT++
Sbjct: 22 AEKGLAISISPEKGRCLIAKRSFTRGEIVLLQDPYVSVLDSASVNKRCDVCFRLCTNLKR 81
Query: 134 PIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C+ C YC +C+R W R H+ EC L E
Sbjct: 82 ---CSVCKTTWYCGGTCQRNGW-RLHQHECKAITSLKE 115
>gi|195570243|ref|XP_002103118.1| GD20257 [Drosophila simulans]
gi|194199045|gb|EDX12621.1| GD20257 [Drosophila simulans]
Length = 408
Score = 41.2 bits (95), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 3/83 (3%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
+ S K V A DV KG+ + VE PV F P NC C C+ C
Sbjct: 232 FEKSIEKSGGVFASCDVPKGEIVLVENPVYFQFSAP--FLNCELCGVHQQQLYTCDNCRY 289
Query: 143 AVYCSESCRREAWLRYHRWECHG 165
YCS SC + + H++EC G
Sbjct: 290 RSYCSMSCMK-SDAEVHQYECTG 311
>gi|409042906|gb|EKM52389.1| hypothetical protein PHACADRAFT_148986 [Phanerochaete carnosa
HHB-10118-sp]
Length = 436
Score = 41.2 bits (95), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 16/111 (14%)
Query: 49 VSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQ-TSNAKGRHVIAVEDVHKGDTLFV 107
+S+ S + + A P+ A E + ++KI+++ KG+ ++A E + +G+ ++
Sbjct: 64 AASSRSKSGTGNAPYPTSAVIEGLDVSKWTSKIQVKYFGKHKGKGLVATERIAEGEVVWK 123
Query: 108 EKPVAFVILPPCSMSN-------CNHCCTSISAPIPCNECILAVYCSESCR 151
E P F++ P S+ + C HC T P N LAV CS S +
Sbjct: 124 EDP--FILAPEWSLYDLQTASVACTHCST------PLNNSPLAVPCSGSAK 166
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/87 (24%), Positives = 39/87 (44%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A +++ S GR ++ +G + E P A+ + C+ C S
Sbjct: 5 AMESVRVVKSATCGRSLVFTSKFARGQCILEELPYAYTLHDNKRGLFCDFCLKKCSTLKK 64
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C YC++SC+++ W R H+ +C
Sbjct: 65 CSSCNYVSYCNKSCQKKDWARCHKQDC 91
>gi|350593085|ref|XP_001927076.4| PREDICTED: tudor domain-containing protein 1-like [Sus scrofa]
Length = 1373
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 59/148 (39%), Gaps = 12/148 (8%)
Query: 18 RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSST----SSSASSSVASLPSLACGESHT 73
RL+ + K + L E++++ V VSS S A+ S L G +
Sbjct: 245 RLKSPETGNKKNNFLLCEQTKQYLVSQEDSSVSSNPNGVSGEAAGSRGDRKMLPTGNPVS 304
Query: 74 LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF-----VEKPVAFVIL-PPCSMSNCNHC 127
+ N + N+K + + KGD L + P F L PP + C+ C
Sbjct: 305 PLNIGNNSPPKEINSKPSNNVPPAKSKKGDKLVENSLSINNPALFSSLGPPLRSTTCHRC 364
Query: 128 CTSISAPIPCNECILAVYCSESCRREAW 155
+ + C++C YCS +C+R W
Sbjct: 365 --GLFGSLRCSQCKQTYYCSTACQRRDW 390
>gi|145518734|ref|XP_001445239.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124412683|emb|CAK77842.1| unnamed protein product [Paramecium tetraurelia]
Length = 356
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE 194
I C+ C YCS CR W YH C+G L+ + ++ L ++ L+ S S + +
Sbjct: 23 ITCDSCSSTFYCSSKCRDFDWEAYHYLVCNGQFELVPKLSLSQLNDEMNLIGSGS--FGQ 80
Query: 195 VY 196
VY
Sbjct: 81 VY 82
>gi|328768420|gb|EGF78466.1| hypothetical protein BATDEDRAFT_90401 [Batrachochytrium
dendrobatidis JAM81]
Length = 578
Score = 41.2 bits (95), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 29/51 (56%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHC 127
+S I ++ + +KGRH +A + G T+ VE P AFV+ + C++C
Sbjct: 13 SSYPITLKATKSKGRHFVAKSQLQPGTTVLVESPCAFVLFKDNARDYCSYC 63
>gi|405974952|gb|EKC39559.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 766
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 31/105 (29%), Positives = 48/105 (45%), Gaps = 31/105 (29%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPV--AFVILPPCSMS-NCNHCCTSI--------------- 131
G+ V+A + + +G+ LF EKP+ V+ + + +C++C SI
Sbjct: 338 GKCVVAKKAIKEGEELFTEKPLIMGHVMDKDNNFALSCDNCAASILTAEDYFGSTLETME 397
Query: 132 ---------SAP-IP---CNECILAVYCSESCRREAWLRYHRWEC 163
S P IP C++C YCSE CRR+AW+ H C
Sbjct: 398 LDLKELIKESWPDIPTVTCDKCQKVKYCSEDCRRQAWVSQHELIC 442
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 50/111 (45%), Gaps = 6/111 (5%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ KG T+ KP A+V+ C++C S C+ C YC++SC++ +W
Sbjct: 8 IKKGTTILTAKPFAYVLCSKYKNVRCDYCFKS-GKLFRCSACQYVYYCNQSCQQMSW-PM 65
Query: 159 HRWECHGALRLLEAVGI---AHLALKLILVSSHSDRYKEVYHLETHLQDMR 206
H EC L+ GI A L ++I+ + + Y+ ET + +
Sbjct: 66 HSKEC-ARLKKCSPWGISNVARLMARIIIKLNQGGDEERGYYNETSYRKFK 115
>gi|363740353|ref|XP_003642314.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Gallus gallus]
Length = 708
Score = 40.8 bits (94), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGAL 167
C C S+ + PC C CSE+CRR++W H EC G L
Sbjct: 375 CYQCGRSVGVQLSPCKRCYEVFTCSEACRRKSWNERHSLECSGLL 419
>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
Length = 331
Score = 40.8 bits (94), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 8/88 (9%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
A + ++ +GR ++A + +G+ + E+P+ +L P S+++ C+ C S+S
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLASLPSFCSACFRSLS 58
Query: 133 API-PCNECILAVYCSESCRREAWLRYH 159
A PC C A +CS SC + R H
Sbjct: 59 AAASPCPSCRAAGFCSPSCAAASHPRLH 86
>gi|358387028|gb|EHK24623.1| hypothetical protein TRIVIDRAFT_30213 [Trichoderma virens Gv29-8]
Length = 561
Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 19/128 (14%)
Query: 41 SVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVH 100
S+++++ + + + VA+LP L GES T N + T G V A ED+
Sbjct: 212 SLQSLNDGLREMAPKCAVQVATLPVLLEGESET----DNYEIIPTCKQLG--VFAQEDIE 265
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAP-------IPCNECILAVYCSESCR 151
G+ + E + +L + + C C+S AP I C EC V+CS+ C
Sbjct: 266 AGEVVLRE----YTLLTANNRHKDSVCDACSSELAPLGSENRSIQCEECFDTVFCSQECH 321
Query: 152 REAWLRYH 159
EA RYH
Sbjct: 322 DEAQSRYH 329
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 40.8 bits (94), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 44/101 (43%), Gaps = 10/101 (9%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI---------LPPCSMSNCNHCCTSI 131
+ + + A GRH+ A D G+ + KP A V P C + +
Sbjct: 26 LALAYTPALGRHITAKRDFRAGELVLASKPYAAVADTDGPAAGKAPECFQAQDEDADVAA 85
Query: 132 SAPIP-CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
+A + C +C A YCS C+R AW H+ EC +R L+
Sbjct: 86 AAEMKRCAQCRRAQYCSVECQRAAWHGGHKAECAAWVRGLQ 126
>gi|440802445|gb|ELR23374.1| SET and MYND domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 521
Score = 40.8 bits (94), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 50/124 (40%), Gaps = 46/124 (37%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-----CNHCCTSI------------ 131
KG+ ++A+ D +GD +F E+P+ V PP + C++C +
Sbjct: 170 KGKGLVALRDFKQGDEIFHEEPL--VCAPPFFPAQEKPLCCSNCMRYLDTREAMVDRMVA 227
Query: 132 ---------SAPIP----------------CNECILAVYCSESCRREAWLRYHRWEC--H 164
+A +P C C + VYC+ C+ +AW RYH C +
Sbjct: 228 HVKQSDKAKAAALPRFDELCPPFSETEGQMCQGCGVEVYCTTQCKDDAWRRYHEAACPAY 287
Query: 165 GALR 168
ALR
Sbjct: 288 AALR 291
>gi|328876145|gb|EGG24508.1| hypothetical protein DFA_02751 [Dictyostelium fasciculatum]
Length = 915
Score = 40.8 bits (94), Expect = 0.38, Method: Composition-based stats.
Identities = 15/41 (36%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLR-YHRWEC 163
C+ C ++ P+ C+ C VYCS++C++E+W HR +C
Sbjct: 847 CHQCKQTLMKPLLCSGCKAVVYCSKACQKESWFHDGHRDKC 887
>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
Length = 392
Score = 40.8 bits (94), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
A + ++ +GR ++A + +G+ + E+P+ +L P S+S+ C+ C S+S
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLSSLPSFCSACFRSLS 58
Query: 133 API-PCNECILAVYCSESC 150
A PC C A +CS SC
Sbjct: 59 AAASPCPSCRAAGFCSPSC 77
>gi|255085856|ref|XP_002505359.1| predicted protein [Micromonas sp. RCC299]
gi|226520628|gb|ACO66617.1| predicted protein [Micromonas sp. RCC299]
Length = 435
Score = 40.8 bits (94), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 4/79 (5%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKP-VAFVILPPCSMSNCNHCCTSISAPIP---CNECILAV 144
+GR+ + ++D D LF+ P + SM C C P C++C LA
Sbjct: 264 RGRYAMGMDDPTAADRLFIAAPDLGLEQAKIDSMLKCYMCGECALPHEPHEFCSKCHLAC 323
Query: 145 YCSESCRREAWLRYHRWEC 163
YCS C+++ W H+ EC
Sbjct: 324 YCSRDCQKQHWKWRHKREC 342
>gi|328705723|ref|XP_003242888.1| PREDICTED: hypothetical protein LOC100576056 [Acyrthosiphon pisum]
Length = 669
Score = 40.8 bits (94), Expect = 0.40, Method: Composition-based stats.
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 7/75 (9%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
V G L ++P VI C M+ C + SA I C+ C + YCS SC+++ W+
Sbjct: 601 VDVGTHLAGKQPTTDVI--KCYMNKCEN-----SAKIMCSNCKIVKYCSLSCQQQHWIIK 653
Query: 159 HRWECHGALRLLEAV 173
H EC RL +A+
Sbjct: 654 HIDECDNLRRLNQAL 668
>gi|195349219|ref|XP_002041144.1| GM15391 [Drosophila sechellia]
gi|194122749|gb|EDW44792.1| GM15391 [Drosophila sechellia]
Length = 686
Score = 40.4 bits (93), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 35/83 (42%), Gaps = 3/83 (3%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
+ S K V A DV KG + VE PV F P NC C C+ C
Sbjct: 232 FEKSIEKSGGVFASCDVPKGKIVLVENPVYFQFSAP--FLNCELCGVHQQQLYSCDNCRY 289
Query: 143 AVYCSESCRREAWLRYHRWECHG 165
YCS SC + + H++EC G
Sbjct: 290 RSYCSMSCMK-SDAEVHQYECTG 311
>gi|407424679|gb|EKF39087.1| hypothetical protein MOQ_000686 [Trypanosoma cruzi marinkellei]
Length = 429
Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 9/76 (11%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
GR + A+ ++ G + +++P ++ C HC +I + + C C
Sbjct: 163 GRGIYALTRINSGTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219
Query: 144 VYCSESCRREAWLRYH 159
VYCS +CR AW YH
Sbjct: 220 VYCSVACRDAAWREYH 235
>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 522
Score = 40.4 bits (93), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 4/88 (4%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI---PC 137
+ ++ A+GR+++A + G+T+FV P A+ + C C C C
Sbjct: 25 VAVREDAARGRYMVATRAIAAGETVFVALPYAYAVFTNCRKRVCAFCLLYDEQGRLFEHC 84
Query: 138 NECILAVYCSESCR-REAWLRYHRWECH 164
C YCS CR R+ + H+ CH
Sbjct: 85 AMCDQVYYCSAQCRQRDMDIGLHQRLCH 112
>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
Length = 544
Score = 40.4 bits (93), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 32/142 (22%), Positives = 62/142 (43%), Gaps = 14/142 (9%)
Query: 86 SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
S G + A D+ G T + +++P V+ C+ HC ++ + C
Sbjct: 65 SPGAGTGIFAATDIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
C + YC+++C+ + W H +EC L+ V A L+++L S KE
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFAHSFECRVYKELMPRVLPSNARAILRMVLRSERGKYPKEE 184
Query: 195 ---VYHLETHLQDMRPEDLYQY 213
LE+H+ +++ ++ Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206
>gi|195500128|ref|XP_002097242.1| GE26113 [Drosophila yakuba]
gi|194183343|gb|EDW96954.1| GE26113 [Drosophila yakuba]
Length = 690
Score = 40.4 bits (93), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 12/105 (11%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYC 146
N+ G V A ++ KG+ + VE PV F P NC+ C C++C YC
Sbjct: 238 NSGG--VFASCEMPKGEIVLVENPVHFQFSAP--FLNCDLCGVHQEQLYTCDDCRYRTYC 293
Query: 147 SESCRREAWLRYHRWECH----GALRLLEAVGIAHL---ALKLIL 184
S C E+ H++EC+ G + +LEA + L AL+ IL
Sbjct: 294 SRLC-MESDAEIHQYECYGYRIGLIPMLEASMLFRLFYEALEYIL 337
>gi|268570006|ref|XP_002640670.1| Hypothetical protein CBG19732 [Caenorhabditis briggsae]
Length = 445
Score = 40.4 bits (93), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 6/66 (9%)
Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILAVYCSESCRREAWLR 157
T+ EKP+A V+ P + C C + I S + C++C YCS C+R+ W
Sbjct: 3 TVAREKPLAAVLSPEFQDTYCATCFSEIDPSHLDSEILTCDDCTQVSYCSLKCQRKDWKT 62
Query: 158 YHRWEC 163
H+ EC
Sbjct: 63 VHQLEC 68
>gi|326477204|gb|EGE01214.1| hypothetical protein TEQG_00267 [Trichophyton equinum CBS 127.97]
Length = 1184
Score = 40.4 bits (93), Expect = 0.53, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 97 EDVHKGDTLFVEKPVAFVILPPCSMSNCN-HCCTSISAPIPCNECILAVYCSESCRREAW 155
+D +G + +E + + L S SNC S + + C+ C +A YCS++C+RE W
Sbjct: 1110 KDTKEGVSQKIESLLQLMDLKKGSCSNCGGKKPGSGTKLLRCSGCKVAEYCSKNCQREDW 1169
Query: 156 LRYHRWEC 163
+ H++ C
Sbjct: 1170 MARHKFAC 1177
>gi|194882335|ref|XP_001975267.1| GG20655 [Drosophila erecta]
gi|190658454|gb|EDV55667.1| GG20655 [Drosophila erecta]
Length = 498
Score = 40.4 bits (93), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 69/145 (47%), Gaps = 16/145 (11%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ ++ GR V+A+ ++ G+TL +E P+ V+LP C+ C + C
Sbjct: 9 AKKCEIKQNDTLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCSKCFKLTQS--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
+C L C + CRR A + + + L+ E I + LK +L+ H +
Sbjct: 65 RKCHLLALCEDCSNHDERDCRRLAEMHFSADQVE-LLQRKEHTEIQPV-LKCLLLREHEE 122
Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
Y+E+ +E+ L R D+++
Sbjct: 123 TLPLYEEMSQMESQLATRRGTDIWK 147
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 40.4 bits (93), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 50/123 (40%), Gaps = 12/123 (9%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSE 148
KGR +A + GD + + V P S+C C C+ C A YCS
Sbjct: 12 KGRCAVASTSLRAGDRVLITSAVCAASSP----SSCGWCFAPGDVFSRCSACRKARYCSR 67
Query: 149 SCRREAWLRYHRWECHGALRLLE------AVGIAHLALKLILVSSHSDRYKE-VYHLETH 201
+C++ W + HR EC + E + +A LA KL L S K V L H
Sbjct: 68 ACQQRDWPQ-HRHECAAWRSIPERNPSPTVLLVARLAAKLFLGSQVDQEEKNGVLKLRDH 126
Query: 202 LQD 204
L D
Sbjct: 127 LAD 129
>gi|326471930|gb|EGD95939.1| hypothetical protein TESG_03400 [Trichophyton tonsurans CBS 112818]
Length = 1217
Score = 40.4 bits (93), Expect = 0.54, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 97 EDVHKGDTLFVEKPVAFVILPPCSMSNCN-HCCTSISAPIPCNECILAVYCSESCRREAW 155
+D +G + +E + + L S SNC S + + C+ C +A YCS++C+RE W
Sbjct: 1143 KDTKEGVSQKIESLLQLMDLKKGSCSNCGGKKPGSGTKLLRCSGCKVAEYCSKNCQREDW 1202
Query: 156 LRYHRWEC 163
+ H++ C
Sbjct: 1203 MARHKFAC 1210
>gi|410914273|ref|XP_003970612.1| PREDICTED: SET and MYND domain-containing protein 5-like [Takifugu
rubripes]
Length = 417
Score = 40.0 bits (92), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 45/111 (40%), Gaps = 32/111 (28%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
N KG+ + A KGDT+F+E+P+ F+ C +C
Sbjct: 29 NIKGKGLFAKRSFRKGDTIFIERPLVSAQFLWNSLYKYRACEYCLRALETAEENARRLSG 88
Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHGA 166
+S P P C +C + +YCS CR+ A +YHR C GA
Sbjct: 89 VPGLSLPHPELCCVRPELHQCCPQCQV-MYCSSECRQAAADQYHRVLCLGA 138
>gi|390603364|gb|EIN12756.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 450
Score = 40.0 bits (92), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 30/69 (43%)
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREA 154
A GD + V++ + ILP + CNHC S C C YCS C+ +
Sbjct: 24 ATRSFAPGDVILVDEALTTAILPSDAGRRCNHCLRLTSDLKRCAGCAAYHYCSTQCQSQQ 83
Query: 155 WLRYHRWEC 163
W +H+ C
Sbjct: 84 WSIHHKRIC 92
>gi|169864358|ref|XP_001838788.1| hypothetical protein CC1G_08952 [Coprinopsis cinerea okayama7#130]
gi|116500120|gb|EAU83015.1| hypothetical protein CC1G_08952 [Coprinopsis cinerea okayama7#130]
Length = 617
Score = 40.0 bits (92), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C++C VYCS+ C+RE W R HR+EC
Sbjct: 456 CSQCHSVVYCSDECQREDWKRVHRFEC 482
>gi|392567641|gb|EIW60816.1| hypothetical protein TRAVEDRAFT_71094 [Trametes versicolor
FP-101664 SS1]
Length = 316
Score = 40.0 bits (92), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 42/91 (46%), Gaps = 10/91 (10%)
Query: 116 LPPCSMSNCNHCCTSISAPIP-------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
LPP ++ C HC +P C C A+YCS +C+R AW R H+ +C
Sbjct: 16 LPPPCLTRCAHC-NKRERQLPAGAKLKRCKGCTGAMYCSRACQRAAWPR-HKNKCGSVGS 73
Query: 169 LLEAVGIAHLALKLILVSSHSDRYKEVYHLE 199
+A G A A LV + D ++ HL+
Sbjct: 74 CYQAEGCAGYASSAALVEAMHDWVDDI-HLQ 103
>gi|194764587|ref|XP_001964410.1| GF23162 [Drosophila ananassae]
gi|190614682|gb|EDV30206.1| GF23162 [Drosophila ananassae]
Length = 392
Score = 40.0 bits (92), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 34/124 (27%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSI---- 131
N +++ KGR +IA ++ + +F E+P F + C+HC +
Sbjct: 2 NHFEIREIPGKGRAMIATKNFAANEVIFEEEPFVSSQFSWNTAYGYAACDHCMRPLETLL 61
Query: 132 -------SAP---IP----------------CNECILAVYCSESCRREAWLRYHRWECHG 165
S P +P C C + YCSE C EA RYHR C G
Sbjct: 62 ENVRRLASDPKVEVPLLQHDPTAAWVAQFTQCPRCKVR-YCSEDCLMEAQKRYHRVACMG 120
Query: 166 ALRL 169
A RL
Sbjct: 121 AFRL 124
>gi|170067052|ref|XP_001868327.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863295|gb|EDS26678.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 288
Score = 40.0 bits (92), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 7/74 (9%)
Query: 88 AKGRHV------IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-APIPCNEC 140
AKG+ V +A +D + GD + +P+ I NC+HC S + IPC
Sbjct: 92 AKGKKVDHRRILVAEKDFNPGDVIMDAEPLLTAIDFNLCYENCSHCGVKFSNSLIPCPGS 151
Query: 141 ILAVYCSESCRREA 154
+ +YC E CR+++
Sbjct: 152 VFFMYCGEECRQKS 165
>gi|302682538|ref|XP_003030950.1| expressed protein [Schizophyllum commune H4-8]
gi|300104642|gb|EFI96047.1| expressed protein, partial [Schizophyllum commune H4-8]
Length = 423
Score = 40.0 bits (92), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 4/86 (4%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP---C 137
I++Q V+A+ D +G +F + +A +L C+ CT +++ P C
Sbjct: 1 IQLQPHPRARDRVVAITDFIEGSVIFGQYALATALLSTEKGRRCD-ACTRLASDKPLRRC 59
Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
+ C YC E C+R W R H+ C
Sbjct: 60 SGCAEYYYCDEHCQRYHWQRNHKRIC 85
>gi|255732233|ref|XP_002551040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240131326|gb|EER30886.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 474
Score = 40.0 bits (92), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 44/82 (53%), Gaps = 18/82 (21%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS----MSN---CNHC--------CTS 130
S+ KG+ + A +D+ KGD ++ E+P+ FV PP + M N C +C ++
Sbjct: 114 SSKKGKGLYAKKDIKKGDLMWSEEPLFFV--PPLANVHLMKNASACTYCGKLLQQSEASA 171
Query: 131 ISAPIPCNECILAVYCSESCRR 152
+ + CN C V+CS+ C++
Sbjct: 172 VLKGLDCNVCA-EVWCSKKCKQ 192
>gi|118777349|ref|XP_001230586.1| AGAP009449-PA [Anopheles gambiae str. PEST]
gi|116132908|gb|EAU77762.1| AGAP009449-PA [Anopheles gambiae str. PEST]
Length = 257
Score = 40.0 bits (92), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 107 VEKPVAFVILPPCSMSNCNHC-CTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
++ P + C C+ C IPC C +YCS+ C +A+ +YHR+EC G
Sbjct: 2 LDTPFVKTLQEDCRYVRCDFCHAERPFTLIPCEGCTWVMYCSQECLSKAFDQYHRYEC-G 60
Query: 166 ALR 168
+R
Sbjct: 61 VMR 63
>gi|326923979|ref|XP_003208210.1| PREDICTED: tudor domain-containing protein 1-like [Meleagris
gallopavo]
Length = 946
Score = 40.0 bits (92), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 40/104 (38%), Gaps = 12/104 (11%)
Query: 62 SLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVI------ 115
+LP G S K+ + +N G V+ + V K L F I
Sbjct: 4 ALPVKDAGSSMESTATRRKLPLTDTNDNGEMVLHSDSVEKPKKLQYASKNVFSIDYDKSL 63
Query: 116 ----LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAW 155
PP M C+HC +S + C++C YCS C++ W
Sbjct: 64 FNTLTPPPQMRTCHHC--GLSGSLRCSQCKQIYYCSVDCQKRDW 105
>gi|326930121|ref|XP_003211200.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Meleagris gallopavo]
Length = 1010
Score = 40.0 bits (92), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 23/45 (51%), Gaps = 1/45 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGAL 167
C C S+ + PC C CSE+CRR++W H EC G L
Sbjct: 797 CYQCGRSVGVQLSPCKHCYEVFTCSEACRRKSWNERHSRECSGLL 841
>gi|451849565|gb|EMD62868.1| hypothetical protein COCSADRAFT_144768 [Cochliobolus sativus
ND90Pr]
Length = 529
Score = 39.7 bits (91), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 28/85 (32%), Positives = 44/85 (51%), Gaps = 8/85 (9%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI--AHLA-LKLILVSSHS---D 190
C C YCS++C++EAW R HR EC L+ + G+ A LA ++L+ H D
Sbjct: 94 CAACQRFRYCSKACQKEAWNRGHRHECK-ILKPMAGRGLPKAFLASMELLTRRKHGLIPD 152
Query: 191 RYKEVY-HLETHLQDMRPEDLYQYV 214
+ E+ L +H+ D + YQ +
Sbjct: 153 KEWELLCQLPSHIDDFKRNGTYQNI 177
>gi|299754705|ref|XP_001828134.2| hypothetical protein CC1G_12282 [Coprinopsis cinerea okayama7#130]
gi|298410885|gb|EAU93684.2| hypothetical protein CC1G_12282 [Coprinopsis cinerea okayama7#130]
Length = 580
Score = 39.7 bits (91), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 26/62 (41%), Positives = 32/62 (51%), Gaps = 12/62 (19%)
Query: 107 VEKPVAFVILPPCSMSNCNHCCTSIS---APIP-CNECILAVYCSESCRREAWLRYHRWE 162
VEKPV C N NH T +S PI C+ C VYCS +C +E W R H+ E
Sbjct: 411 VEKPVL------CD--NSNHLSTRVSRKRGPIKSCSRCRTVVYCSVACHKEDWGRRHQNE 462
Query: 163 CH 164
C+
Sbjct: 463 CN 464
>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
Length = 392
Score = 39.7 bits (91), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN----CNHCCTSIS 132
A + ++ +GR ++A + +G+ + E+P+ +L P S+++ C+ C S+S
Sbjct: 2 AGDALRAADLPGRGRGLLAARSIREGEVILTEQPL---LLYPASLASLPSFCSACFRSLS 58
Query: 133 API-PCNECILAVYCSESC 150
A PC C A +CS SC
Sbjct: 59 AAASPCPSCRAAGFCSPSC 77
>gi|402220572|gb|EJU00643.1| hypothetical protein DACRYDRAFT_108715 [Dacryopinax sp. DJM-731
SS1]
Length = 400
Score = 39.7 bits (91), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 32/121 (26%), Positives = 47/121 (38%), Gaps = 17/121 (14%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILP----------PCSMSNC 124
M + + + S KGR + + G + P A +LP PCS+
Sbjct: 1 MAPTPLLNIVHSPTKGRCIYPTTPLSSGAAVLHALPYALGVLPSHRKRLCAYLPCSLPTT 60
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
++ TS C+ C YCS C+R H W C G RL H+A ++L
Sbjct: 61 SYYTTS------CSGCGAVFYCSIDCKRGN-EEKHSWICAGLRRLKGWAKDRHIAGMMVL 113
Query: 185 V 185
V
Sbjct: 114 V 114
>gi|166240380|ref|XP_001733010.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|165988569|gb|EDR41059.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 513
Score = 39.7 bits (91), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 33/79 (41%), Gaps = 22/79 (27%)
Query: 97 EDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWL 156
E + K LF+EKP +F C+ C VYCSE+C+ EAW
Sbjct: 225 EKMKKQKELFIEKP-SFT---------CDKNCK------------FEVYCSENCKNEAWK 262
Query: 157 RYHRWECHGALRLLEAVGI 175
+YH C +L+ G
Sbjct: 263 QYHNLLCPNYDLILKLYGF 281
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 39.7 bits (91), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 42/100 (42%), Gaps = 16/100 (16%)
Query: 90 GRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------- 135
GR V + E G+T+ +E P+ + + CN+C +S I
Sbjct: 23 GRTVFSTERFGAGETIAILENPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVKAGPRTC 81
Query: 136 -PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC+ C+R+ W H+ EC R+ E G
Sbjct: 82 KACTGCKAAVYCNAECQRQHWKLVHKAECKMFKRIRERTG 121
>gi|66809987|ref|XP_638717.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854407|sp|Q54Q80.1|Y4059_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0284059
gi|60467339|gb|EAL65370.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 1280
Score = 39.7 bits (91), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 38/88 (43%)
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECIL 142
+ S+ GR A + + L+ E+P + C +C I +PI C EC
Sbjct: 492 LYQSDLVGRISEASDFIPSNTVLYQEEPYVSCLDRNYHSQYCYNCFKEILSPIYCKECSN 551
Query: 143 AVYCSESCRREAWLRYHRWECHGALRLL 170
+ YCS C E +++ H EC ++
Sbjct: 552 SQYCSNKCLNEDYVKQHGRECGKGFLII 579
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 39.7 bits (91), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 25/106 (23%), Positives = 47/106 (44%), Gaps = 2/106 (1%)
Query: 79 NKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCN 138
N + +Q G + D+ +G+ L ++ +V+ C++CC+ C
Sbjct: 3 NLVSVQEEPGAGLGLGVTADIERGECLLEKRLDLYVLNNELLNEACSYCCSQTKPTKRCA 62
Query: 139 ECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
C YCS+ C+++ W H+ EC ALR G+ +L++
Sbjct: 63 ACKQVHYCSKICQKQDW-SMHKLECK-ALRNASTNGLLPTVCRLLI 106
>gi|410912130|ref|XP_003969543.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Takifugu
rubripes]
Length = 434
Score = 39.7 bits (91), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 1/79 (1%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
S KGR + AV G+ +F ++V+ + C C T C +C A
Sbjct: 13 VSTDKGRGLRAVRHFSVGELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAY 72
Query: 145 YCSESCRREAWLRYHRWEC 163
YC+ C+R W H+ EC
Sbjct: 73 YCNVDCQRGDW-SMHKMEC 90
>gi|32140151|tpg|DAA01312.1| TPA_exp: SET and MYND domain protein 2 [Takifugu rubripes]
Length = 433
Score = 39.7 bits (91), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 1/79 (1%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
S KGR + AV G+ +F ++V+ + C C T C +C A
Sbjct: 13 VSTDKGRGLRAVRHFSVGELVFACPAYSYVLTVNERGAYCEQCFTRREDLFKCGKCKQAY 72
Query: 145 YCSESCRREAWLRYHRWEC 163
YC+ C+R W H+ EC
Sbjct: 73 YCNVDCQRGDW-SMHKMEC 90
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 39.7 bits (91), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 22/86 (25%), Positives = 39/86 (45%), Gaps = 1/86 (1%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
++ +++ +S KGR + A + G+ + P A+V+ C+ C C
Sbjct: 2 ADGVEIFSSAEKGRGLCATKVFKPGNLVRAADPYAYVLCNSERGKRCDFCFARKDDMSRC 61
Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
+ C A YC C++ AW H+ EC
Sbjct: 62 SGCKFARYCDGKCQKAAWTE-HKSEC 86
>gi|170056789|ref|XP_001864189.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876476|gb|EDS39859.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 508
Score = 39.7 bits (91), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 33/127 (25%), Positives = 57/127 (44%), Gaps = 4/127 (3%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S+ GR+++A D+ +G+ + VE+P V+ P C C S I C C A
Sbjct: 13 SDRLGRYLVAKRDLARGELILVERPT--VVGPHWDSDVCCLNCFRNSCTI-CRLCRRAPL 69
Query: 146 CSE-SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETHLQD 204
C + + EA R++R L + L+L + DR+ E+ +E HL+
Sbjct: 70 CYDCTGHDEAECRFYRESNLDINFLFNHFNVVTSVRCLLLYRTDRDRFWEMRRMEAHLEK 129
Query: 205 MRPEDLY 211
R +++
Sbjct: 130 RRGSEIW 136
>gi|299744348|ref|XP_001840718.2| hypothetical protein CC1G_09769 [Coprinopsis cinerea okayama7#130]
gi|298406082|gb|EAU81127.2| hypothetical protein CC1G_09769 [Coprinopsis cinerea okayama7#130]
Length = 723
Score = 39.7 bits (91), Expect = 0.87, Method: Composition-based stats.
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 132 SAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ + C C VYCSE CR E W +HR EC
Sbjct: 556 TGAVICATCEAQVYCSEECRTEDWDEFHRNEC 587
>gi|291226945|ref|XP_002733450.1| PREDICTED: MORN repeat containing 1-like [Saccoglossus kowalevskii]
Length = 1129
Score = 39.7 bits (91), Expect = 0.87, Method: Composition-based stats.
Identities = 25/91 (27%), Positives = 38/91 (41%), Gaps = 21/91 (23%)
Query: 85 TSNAKGRHVI---AVEDVH--------KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA 133
TS G+ V+ +D H + D ++V KP+ C C SI
Sbjct: 904 TSGEDGKQVVFHSIAKDSHGQEYPLLDRNDDIYVRKPL---------FKYCYECGRSIGV 954
Query: 134 PIP-CNECILAVYCSESCRREAWLRYHRWEC 163
+ C C YCS++C+ +AW H+ EC
Sbjct: 955 RLSACTRCKEVYYCSKACKLKAWNARHKEEC 985
>gi|195146441|ref|XP_002014193.1| GL22989 [Drosophila persimilis]
gi|194103136|gb|EDW25179.1| GL22989 [Drosophila persimilis]
Length = 709
Score = 39.7 bits (91), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 27/104 (25%), Positives = 43/104 (41%), Gaps = 4/104 (3%)
Query: 63 LPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMS 122
+P G S + AS+ + + + V A ++ KG + V+ PV F P
Sbjct: 211 VPFNGFGGSKKIPFASSACEYKRTMNGPAGVFARVNIPKGKIILVDTPVYFQFSAP--FL 268
Query: 123 NCNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHG 165
NC C C+ C YC+++C W H+ EC+G
Sbjct: 269 NCEMCGVHQELVFHTCSRCRYKTYCTQTCMELDW-EIHQTECYG 311
>gi|440802660|gb|ELR23589.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 669
Score = 39.7 bits (91), Expect = 0.90, Method: Composition-based stats.
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C +C ++ P+ C+ C VYCS +C+R+ W + H+ C
Sbjct: 620 CWNCKRTLLKPLQCSRCQSVVYCSRTCQRDDWAK-HKVAC 658
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 39.7 bits (91), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 19/83 (22%), Positives = 36/83 (43%), Gaps = 1/83 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
++++ + GR + + ++ G + P + C CC + C C
Sbjct: 5 VEIRNTKEAGRGLYSTKNFETGMNILNSTPYVHTLALKYHGVRCAQCCVFKPKLLRCAGC 64
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
L YCS+ C++ AW + H+ EC
Sbjct: 65 KLQYYCSKDCQKSAW-KIHKEEC 86
>gi|302799034|ref|XP_002981276.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
gi|300150816|gb|EFJ17464.1| hypothetical protein SELMODRAFT_114558 [Selaginella moellendorffii]
Length = 453
Score = 39.7 bits (91), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 33/135 (24%), Positives = 50/135 (37%), Gaps = 29/135 (21%)
Query: 99 VHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRY 158
+ G T+ + P V+ + C+ C A C+ C YCS +C+R W +
Sbjct: 1 FYPGQTVLEQDPYVAVLDGDSRGNRCDACFKQSPALKRCSACKFVFYCSPTCQRSQW-KI 59
Query: 159 HRWECHGALRL--------------------LEAVGIAHLALKLILVSSHSDRYKEVYHL 198
H+ EC +RL L+A G +L + D Y+ V L
Sbjct: 60 HQEECKVLVRLSTEQKRPTLLLMLRLLVKRELQATG--------VLPVTALDNYEIVRAL 111
Query: 199 ETHLQDMRPEDLYQY 213
TH + E L Y
Sbjct: 112 PTHFSETGDERLVMY 126
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 39.3 bits (90), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 21/137 (15%)
Query: 86 SNAKGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTS---------ISAPI 135
S G + A +D+ G+T+ ++KP V+ S C+ C + + A
Sbjct: 66 SPGSGTGLFASKDIQPGETVVLIQKPFVAVLDTAQLESKCSGCLGAHANRQEAVELKA-- 123
Query: 136 PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYK 193
C C + YC+++C+ + W +H EC L V A L+++++S RY
Sbjct: 124 -CTGCRVVKYCNKTCQAKDWKLFHSLECRIFQNLKPRVLPNNARAILRMVMLSERG-RYP 181
Query: 194 ----EVYH-LETHLQDM 205
E++ LE+H++D+
Sbjct: 182 AGEMELFKGLESHMKDV 198
>gi|167521854|ref|XP_001745265.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776223|gb|EDQ89843.1| predicted protein [Monosiga brevicollis MX1]
Length = 630
Score = 39.3 bits (90), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 4/69 (5%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISAPIPCNECILAVY 145
KGR + + GD L E+ A ++ P + N C+ C AP+PC C V+
Sbjct: 152 KGRALFTSTPLAAGDVLLRERAFASILAPTTTQENYLRCHECLDESWAPLPCPNCPR-VF 210
Query: 146 CSESCRREA 154
CS C +A
Sbjct: 211 CSTECHAKA 219
>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
harrisii]
Length = 609
Score = 39.3 bits (90), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 42/171 (24%), Positives = 65/171 (38%), Gaps = 39/171 (22%)
Query: 32 LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLAC--GESHTLMCASNKIKMQTSNA- 88
L++ + + A+ + S S +LP AC G + A K + S
Sbjct: 162 LSLSRPPDGGASALRIPRSPQPPWESPRRRALPQ-ACVQGGPGCSLGAPAKPRESGSPGP 220
Query: 89 ---KGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA--------- 133
+G+ + A V KG+T+FVE+P+ F+ C+HC ++
Sbjct: 221 LALQGKGLFATRPVQKGETIFVERPLVASQFLWNALYRYRACDHCLRALETAEENAQRLS 280
Query: 134 -------PIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
P P C C +A YCS CR+ A +YHR C G
Sbjct: 281 GRTCGALPHPEQCSIRKELHQLCPHCQVA-YCSAECRQAALEQYHRALCLG 330
>gi|194756634|ref|XP_001960581.1| GF11433 [Drosophila ananassae]
gi|190621879|gb|EDV37403.1| GF11433 [Drosophila ananassae]
Length = 497
Score = 39.3 bits (90), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 20/147 (13%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ + GR VIA+ ++ G+TL +E+P+ ++LP C+ C C
Sbjct: 9 AKKCQVKQNETLGRFVIALGNIRAGETLLLEEPI--LVLPHAGDRRCSKCFKLTEE--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
+C L C + C+R + L+ + + LL+ + + ALK IL+ H
Sbjct: 65 RKCHLLALCGDCSEHDERDCKRLSDLKLSEEQ----VLLLQEKKHSEVQSALKCILLREH 120
Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
+ Y+E+ +E L R D+++
Sbjct: 121 LETQPLYEEISQMENQLTARRGTDIWK 147
>gi|47226174|emb|CAG08321.1| unnamed protein product [Tetraodon nigroviridis]
Length = 417
Score = 39.3 bits (90), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/110 (27%), Positives = 43/110 (39%), Gaps = 32/110 (29%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
N KG+ + A KGDT+F E+P+ F+ C +C
Sbjct: 29 NVKGKGLFAKRSFRKGDTIFTERPLVSAQFLWNSLYKYKACEYCLHALETAEENARRLSG 88
Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
+S P P C +C + +YCS CR+ A +YHR C G
Sbjct: 89 VPGLSLPYPELCRVRPELHQACPQCQV-MYCSSECRQAAADQYHRVLCLG 137
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 1/95 (1%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A ++ S KGR + A+ H GD LF A V+ +C C
Sbjct: 5 ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C C A YC C++E W H+ EC + L E
Sbjct: 65 CGRCKQAFYCDVECQKEDW-PLHKLECSSMVVLGE 98
>gi|290562039|gb|ADD38416.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 311
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 45/97 (46%), Gaps = 3/97 (3%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-CNECILAVYCSESCRREAWLRYH 159
KGD + P ++ C+ C ++ A I C C + YC + C+R +W H
Sbjct: 5 KGDEILKCYPFSYAFREEHQHKVCDFCFSTGQAIISRCARCCVIYYCGKQCQRSSWKETH 64
Query: 160 RWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
+ EC L++L+ + H+ L LI+ + RY Y
Sbjct: 65 KHECK-YLKMLD-IKEPHMDLLLIIRTLCKLRYDGGY 99
>gi|157818831|ref|NP_001099672.1| tudor domain-containing protein 1 [Rattus norvegicus]
gi|149040467|gb|EDL94505.1| tudor domain containing 1 (predicted) [Rattus norvegicus]
Length = 1173
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 64/155 (41%), Gaps = 26/155 (16%)
Query: 18 RLQDTSHLDFKEDVLTIEKS-------EENSVEAMSMQVSSTSSSASSSVASLPSLACGE 70
RL+ + + K + L E++ E+NSV +S + + S +L G
Sbjct: 46 RLKSPENGNKKNNFLLCEQNKQYLASQEDNSV--VSSNPAGINGEVGGSKGDRKTLPTGN 103
Query: 71 SHTLMCASN----KIKMQTSNAKGRHVIAVEDVHKGDTLF-----VEKPVAFVIL-PPCS 120
+L +N ++K + SNA + E K LF V P F L PP
Sbjct: 104 PVSLSLGNNSPPNQVKTKPSNA-----VTPEKSKKSHKLFENALSVNNPALFNSLGPPLR 158
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAW 155
+ C+ C + + C++C YCS +C+R W
Sbjct: 159 STTCHRC--GLFGSLRCSQCKQTYYCSTACQRRDW 191
>gi|71985487|ref|NP_493620.2| Protein SET-10 [Caenorhabditis elegans]
gi|31873104|emb|CAB04267.2| Protein SET-10 [Caenorhabditis elegans]
Length = 430
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 58/120 (48%), Gaps = 16/120 (13%)
Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
T+ E+P+A V+ P S C C I + + C++C+ YC+ C+R+ W
Sbjct: 3 TVAREQPLAAVLSPQFSELYCATCFLEIDSSQETEILTCDDCLAVSYCTLKCQRKDWKSC 62
Query: 159 HRWECHGALRLLEAVGIAHLALKL---ILVSSHSDRYKE-----VYHLETHLQDMR--PE 208
H++EC LR + L +KL +L++S S + + LET+ ++ R PE
Sbjct: 63 HQFECE-ILRCQGSSTPMTLTMKLCIRVLLASRSTQTPSFNGAVLEDLETNYKEFRSSPE 121
>gi|400600152|gb|EJP67843.1| Wd tetratricopeptide repeat domain-containing protein [Beauveria
bassiana ARSEF 2860]
Length = 564
Score = 39.3 bits (90), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 18/134 (13%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
++ ++ S+++++ + + + VA+LP L T +++ +++ K V
Sbjct: 201 DRFDQESLDSLNADLKKMAPKCAIKVATLPVLL----ETASSSTDSLEI-IPTCKQLGVF 255
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSISA------PIPCNECILAVY 145
A ED+ G+ + E + +L + S C+ C + P+ C+EC V+
Sbjct: 256 AEEDIEPGEAVLRE----YTLLTANNRLKDSACDACSCDLPPIGSEQEPVQCDECYDTVF 311
Query: 146 CSESCRREAWLRYH 159
CS+ C EA RYH
Sbjct: 312 CSQYCHDEAQERYH 325
>gi|443920735|gb|ELU40589.1| SET domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 380
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 4/86 (4%)
Query: 79 NKIKMQTSNAKGR-HVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
N + S+++ R +A + G+ + VE P+A I P C+ C + C
Sbjct: 8 NGVFQLKSDSRARSRAVAASPLEAGNIVLVETPLAVAIHPTYKGRRCDGCLREFDSLQKC 67
Query: 138 NECILAVYCSESCRREAWLRYHRWEC 163
+ C + YC S +W R+HR C
Sbjct: 68 SGCGVYWYCGVSA---SWKRHHRRLC 90
>gi|260834511|ref|XP_002612254.1| hypothetical protein BRAFLDRAFT_100073 [Branchiostoma floridae]
gi|229297629|gb|EEN68263.1| hypothetical protein BRAFLDRAFT_100073 [Branchiostoma floridae]
Length = 143
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/74 (28%), Positives = 34/74 (45%), Gaps = 3/74 (4%)
Query: 81 IKMQTSNAKGRHVIA---VEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+++ T KG+ V A + +G + E P F + S C+ C + P+ C
Sbjct: 6 VELFTEPGKGKGVRARTVGRGIRQGSLIIAEDPYIFSLEVEESKRRCHSCLQKKTVPLVC 65
Query: 138 NECILAVYCSESCR 151
C A YCSE+C+
Sbjct: 66 YGCKYARYCSETCQ 79
>gi|321473475|gb|EFX84442.1| hypothetical protein DAPPUDRAFT_194440 [Daphnia pulex]
Length = 524
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 61/131 (46%), Gaps = 14/131 (10%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
GRH+IA D+ GD + EKP+ V+ P + C C + + C++C + +
Sbjct: 53 GRHLIACRDLKAGDVILQEKPI--VMGPKHTAGQICLGCYSGVDGRTRCSQCGWPMCGRD 110
Query: 149 SCRREAWLRYHRWECH----GALRLLEAVGI----AHLALK-LILVSSHSDRYKEVYHLE 199
C A H EC G ++ ++ + + + L+ L L + +R+ E+ LE
Sbjct: 111 DC--HAHESDHAAECGVMASGGRPIVGSLPVQAYQSVMVLRCLALRDQNPERWDELMQLE 168
Query: 200 THLQDMRPEDL 210
H+Q+ R + +
Sbjct: 169 AHVQERRQKGM 179
>gi|170042649|ref|XP_001849030.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167866157|gb|EDS29540.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 822
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 123 NCNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
NC+HC ++ IPC C+ +YC E CR+++W
Sbjct: 201 NCSHCGVKFRNSLIPCPGCVFFMYCGEECRQKSW 234
>gi|323455804|gb|EGB11672.1| hypothetical protein AURANDRAFT_61652 [Aureococcus anophagefferens]
Length = 859
Score = 39.3 bits (90), Expect = 1.2, Method: Composition-based stats.
Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 110 PVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
P A ++ P + +C+ C C+ C YCS C+R AW R HR C +L
Sbjct: 143 PAASLLSLP-RVQHCDGCGLRQPGMARCSRCRHVAYCSNGCQRNAWARRHRKTCAELAKL 201
Query: 170 LE 171
E
Sbjct: 202 TE 203
>gi|401415234|ref|XP_003872113.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488335|emb|CBZ23582.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 494
Score = 39.3 bits (90), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 36/91 (39%), Gaps = 20/91 (21%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
GR + A + +++ E P+ ++ C + C HC +SA
Sbjct: 186 GRGLYATRHIQPRNSILCESPL---LVQRCDDTKCAHCLAPLSARSGSSITNTGVSDPLA 242
Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C C YCSE CR AW +YH C
Sbjct: 243 LGVACPHCEQETYCSEDCRDAAWEQYHVCSC 273
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 38/95 (40%), Gaps = 1/95 (1%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A ++ S KGR + A+ H GD LF A V+ +C C
Sbjct: 5 ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALRLLE 171
C C A YC C++E W H+ EC + L E
Sbjct: 65 CGRCKQAFYCDVECQKEDW-PLHKLECSSMVVLGE 98
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 38.9 bits (89), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 35/86 (40%), Gaps = 1/86 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+ H GD LF A V+ +C C C C A Y
Sbjct: 14 SAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWECHGALRLLE 171
C C++E W H+ EC + L E
Sbjct: 74 CDVECQKEDW-PLHKLECSSMVVLGE 98
>gi|344299040|ref|XP_003421196.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1
[Loxodonta africana]
Length = 868
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C+ C SI + PC C + CS+ C+ AW +H+ +C AL ++
Sbjct: 815 CHQCGRSIGVRLSPCTRCYGILTCSKFCKTRAWNEFHKRDCGAALTMM 862
>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
Length = 427
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 6/68 (8%)
Query: 148 ESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKE-----VYHLETHL 202
++C+R AW +H+ EC L A+G LA+ +L+ R+ E + LE+H
Sbjct: 15 KACQRNAWKTFHKLECKAVLSEAPAMGPTTLAMYRLLLLKKDGRFTERQWKAIMQLESHF 74
Query: 203 QDMRPEDL 210
++ EDL
Sbjct: 75 EEQN-EDL 81
>gi|281306785|ref|NP_996733.1| N-lysine methyltransferase SMYD2 [Rattus norvegicus]
gi|47116972|sp|Q7M6Z3.1|SMYD2_RAT RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|32140157|tpg|DAA01315.1| TPA_exp: SET and MYND domain protein 2 [Rattus norvegicus]
Length = 433
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 32/78 (41%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A+ H GD LF A V+ +C C C C A Y
Sbjct: 14 SAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSKCGRCKQAFY 73
Query: 146 CSESCRREAWLRYHRWEC 163
C C++E W H+ EC
Sbjct: 74 CDVECQKEDW-PLHKLEC 90
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 2/86 (2%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
++ +GR + A D+ +G+++ E+ V+L + C C + I C++C
Sbjct: 9 RVDRDAVRGRMLRATRDLQEGESVLSERAYGNVVLSQHRATLCAVCLCAADPDICCDDCS 68
Query: 142 LAVYCSESCRREAWLRYHRWECHGAL 167
+CS++C +E H EC GAL
Sbjct: 69 KVFFCSDAC-QEKLQDVHEKEC-GAL 92
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/142 (22%), Positives = 62/142 (43%), Gaps = 14/142 (9%)
Query: 86 SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
S G + A ++ G T + +++P V+ C+ HC ++ + C
Sbjct: 65 SPGAGTGIFAATNIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
C + YC+++C+ + W H +EC L+ V A L++IL S KE
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFVHSFECRVYKELMPRVLPSNARAILRMILRSERGKYPKEE 184
Query: 195 ---VYHLETHLQDMRPEDLYQY 213
LE+H+ +++ ++ Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206
>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
B]
Length = 604
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 36/151 (23%), Positives = 58/151 (38%), Gaps = 24/151 (15%)
Query: 76 CASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI 135
C + ++++ GR +I E G TL KP V+ S C+ CC P
Sbjct: 74 CLPSYLEVRQVRDAGRGIITKERQKAGSTLIATKPHVTVLSTTYLDSYCSACC----GPA 129
Query: 136 P------CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLAL--------- 180
P C C + YC C+ W H+ EC R + +A+
Sbjct: 130 PETGLKRCTRCRVVWYCDSCCQNNDW-PIHKDECIAIQRWASSAPSPDVAIPSDAIRCLG 188
Query: 181 KLILVSSH----SDRYKEVYHLETHLQDMRP 207
+++L H S KE+ +++H M+P
Sbjct: 189 RILLTQQHRGLDSIWSKEINAMQSHRSSMQP 219
>gi|449686608|ref|XP_002163562.2| PREDICTED: SET and MYND domain-containing protein 5-like [Hydra
magnipapillata]
Length = 395
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 52/122 (42%), Gaps = 33/122 (27%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSI 131
M AS+ + ++ + +KG+ + A + KGDT+ EKP+ N C++C S+
Sbjct: 6 MLASDNVYVKETLSKGKGLFAASAIKKGDTILTEKPLVLCQFSWNRQYNYVACDYCMRSL 65
Query: 132 SAP--------------IPCNE-CILAV---------------YCSESCRREAWLRYHRW 161
+P +E C L+V +CS+ C EA+ +YH+
Sbjct: 66 ETAQNMARRLAADYTLELPYHEQCSLSVQRVNSIYKCPNCCIPFCSKECYSEAYEKYHKS 125
Query: 162 EC 163
C
Sbjct: 126 LC 127
>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
[Strongylocentrotus purpuratus]
Length = 396
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 41/105 (39%), Gaps = 30/105 (28%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPV--AFVILPPC-----------SMSNCNHCCTSISAP- 134
KGR + A D+ G LF EKP+ A + C SM N +SA
Sbjct: 16 KGRGLFASRDIAAGTVLFEEKPLVCAQFLWNECYHYEACEYCMRSMENAEEMARRLSANP 75
Query: 135 ---IPCNECILA-------------VYCSESCRREAWLRYHRWEC 163
+PC EC A YC CR +AW +YH+ C
Sbjct: 76 SLVLPCMECCKAEKQNFVHCPHCRISYCCSECREQAWNQYHQVLC 120
>gi|170097029|ref|XP_001879734.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645137|gb|EDR09385.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 435
Score = 38.9 bits (89), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 57/126 (45%), Gaps = 18/126 (14%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQT-SNAKGRHV 93
EK ++ + +SSTS S+ V PS + +NK+++ KG+ +
Sbjct: 44 EKRTRKILQNEGLVLSSTSIKRSTIV--YPSSRVIPQLDVTQWTNKVEVTFYDQQKGKGL 101
Query: 94 IAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHCCTSI------SAPIPCNEC 140
IA D+ +G+ ++ E P F+I P + + C +C T + +PIPC
Sbjct: 102 IATSDIEEGEVIWKEDP--FIIAPEWDLFDLQLKSTACAYCTTPLVDSPIAGSPIPCPAS 159
Query: 141 ILAVYC 146
I + +C
Sbjct: 160 ISSAFC 165
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 26/103 (25%), Positives = 48/103 (46%), Gaps = 12/103 (11%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-------APIPCNECI 141
KGR + AV + G + ++P +++ C HC ++S C C
Sbjct: 14 KGRGLRAVSTIQAGTCVLEDEPYVYILNEEQLQLRCGHCLRALSLGNTSEVESKNCKSCK 73
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLIL 184
VYC+++C + A + HR+EC +LL+ + L+L++
Sbjct: 74 RVVYCNKAC-KVAGRKEHRYEC----KLLQGKEKDDVTLRLLM 111
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 38.9 bits (89), Expect = 1.4, Method: Composition-based stats.
Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 16/101 (15%)
Query: 89 KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
+GR + + E G+T+ ++ P+ + + CN+C +S I
Sbjct: 22 RGRSLFSTERFGAGETIAIIDNPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGPRT 80
Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC+ C+R W H+ EC R+ E G
Sbjct: 81 CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIKERTG 121
>gi|195335121|ref|XP_002034224.1| GM21751 [Drosophila sechellia]
gi|194126194|gb|EDW48237.1| GM21751 [Drosophila sechellia]
Length = 498
Score = 38.9 bits (89), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 68/147 (46%), Gaps = 20/147 (13%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ ++ GR V+A+ ++ G+TL +E P+ V+LP C+ C + C
Sbjct: 9 ARKCEIKHNDMLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCSKCFNLTES--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
+C L C + C+R A + + + LL+ + LK +L+ H
Sbjct: 65 RKCRLLALCEDCSYHDERDCKRLAEMNFS----DDQVELLQKKEHTEIQPVLKCLLLREH 120
Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
+ Y+E+ +E+ L R D+++
Sbjct: 121 EETLPLYEEMSQMESQLMTRRGTDIWK 147
>gi|299741263|ref|XP_001834348.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
gi|298404633|gb|EAU87325.2| hypothetical protein CC1G_02084 [Coprinopsis cinerea okayama7#130]
Length = 434
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 17/107 (15%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHK----GDTLFVEKP-----------VAFVILPPCSMS 122
K + +T N R ED H G L++ KP V +IL C+
Sbjct: 290 GKKTRPETVNHNPRAPNGPEDAHDYLWYGQDLWM-KPDVWNWVNSREDVKEIILRTCNHV 348
Query: 123 NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C+ T ++ C+ C L VYC + C++E W R H+ EC L+L
Sbjct: 349 GCSARETEVAQFKRCSACRLVVYCGQECQKEDWKR-HKPECKQHLKL 394
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/97 (27%), Positives = 35/97 (36%), Gaps = 2/97 (2%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+++ TS GR + D G L +P + NC CTS C
Sbjct: 45 GTNLRLGTSTLGGRGLFTQTDAKPGTRLLSVRPHIHAVSARFLEDNCT-LCTSEENVRRC 103
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C YCS C+ W H+ EC R EA G
Sbjct: 104 TRCKKVAYCSTECQTADW-GIHKQECQSLRRWAEASG 139
>gi|340522060|gb|EGR52293.1| predicted protein [Trichoderma reesei QM6a]
Length = 562
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 56/129 (43%), Gaps = 20/129 (15%)
Query: 41 SVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVH 100
S++A++ + + + VA+LP L G+S T S K V A ED+
Sbjct: 212 SLQALNDGLKEMAPKCAVQVATLPVLLEGDSDTDSLDS------IPTCKQLGVFAQEDIE 265
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHC--CTSISAP--------IPCNECILAVYCSESC 150
G+ + E + +L + + C C+S AP I C EC V+CS+ C
Sbjct: 266 PGEVVLRE----YTLLTANNRHKDSVCDACSSELAPLGSSENQSIQCEECFDTVFCSQEC 321
Query: 151 RREAWLRYH 159
+A RYH
Sbjct: 322 YDQAQSRYH 330
>gi|346318901|gb|EGX88503.1| MYND domain protein [Cordyceps militaris CM01]
Length = 565
Score = 38.9 bits (89), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 18/134 (13%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
++ + S++A++ + + + VA+LP L T +++ +++ + V
Sbjct: 201 DRFDPASLDALNADLKKMAPKCAIKVATLPVLL----ETASSSTDSLEI-IPTCQQLGVF 255
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSISA------PIPCNECILAVY 145
A ED+ G+ + E + +L + S C+ C + + P+ C+EC V+
Sbjct: 256 AQEDIEPGEAVLRE----YTLLTANNRLKDSVCDACSSDLPPIGSEQEPVKCDECYDTVF 311
Query: 146 CSESCRREAWLRYH 159
CS+ C EA RYH
Sbjct: 312 CSQFCHNEAQERYH 325
>gi|403366172|gb|EJY82881.1| Serine/threonine protein kinase [Oxytricha trifallax]
Length = 1151
Score = 38.9 bits (89), Expect = 1.5, Method: Composition-based stats.
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
+PC C + YCSE C ++ W R H+++C
Sbjct: 31 LPCPYCAVQFYCSERCVQQDWFRSHQFKC 59
>gi|169848656|ref|XP_001831033.1| hypothetical protein CC1G_09298 [Coprinopsis cinerea okayama7#130]
gi|116507926|gb|EAU90821.1| hypothetical protein CC1G_09298 [Coprinopsis cinerea okayama7#130]
Length = 661
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWECH---GALRLLEAVGIAHLA----LKLILVSSHS 189
C+ C+ YC + C+ + W R HR+ECH R L +GIA L L+++
Sbjct: 455 CSSCLSVTYCGQPCQSDDWDRVHRYECHILASDHRALTKLGIALPPRLKHLHLVIIQRQM 514
Query: 190 DRYKEVY 196
D + Y
Sbjct: 515 DDMGDAY 521
>gi|308468789|ref|XP_003096635.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
gi|308241582|gb|EFO85534.1| hypothetical protein CRE_29105 [Caenorhabditis remanei]
Length = 428
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 5/65 (7%)
Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
TL EKP+A V+ P C C + + C++C YCS C+R+ W
Sbjct: 3 TLAREKPLAAVLCPQFQDFYCATCFAELDVNGETEILMCDDCSEVSYCSPKCQRQDWRSV 62
Query: 159 HRWEC 163
H++EC
Sbjct: 63 HQFEC 67
>gi|167533183|ref|XP_001748272.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773392|gb|EDQ87033.1| predicted protein [Monosiga brevicollis MX1]
Length = 456
Score = 38.9 bits (89), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 4/80 (5%)
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAP---IPCNECILA 143
+GR V A + + +G T+ + P+AFV+ ++ C C S C+ C A
Sbjct: 9 GRGRAVRATKALARGQTVLLNPPLAFVLRHEERVARRCEDCFVSEKPEHRLANCSLCHTA 68
Query: 144 VYCSESCRREAWLRYHRWEC 163
YCS+ C+ W R H+ C
Sbjct: 69 AYCSKPCQTRNWKRAHKHVC 88
>gi|339896842|ref|XP_003392212.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|321398900|emb|CBZ08344.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 2122
Score = 38.5 bits (88), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 42/101 (41%), Gaps = 26/101 (25%)
Query: 91 RHV--IAVEDVHKGDTLFVEKPVAFV----ILP---------------PCSMSNCNHCCT 129
RH+ A + K + +FVE+P + +LP P S + CC
Sbjct: 1436 RHIRLSATRRIQKDEVIFVERPAILMPFWPLLPSRLASASTGAGVSDRPVSTTPEPSCCA 1495
Query: 130 S-----ISAPIPCNECILAVYCSESCRREAWLRYHRWECHG 165
++ P+ C +VYCSE CR+ A YH EC G
Sbjct: 1496 QCGRQRLAKPVNCPGGCHSVYCSEQCRQRALRLYHVVECGG 1536
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 34/78 (43%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A G+ LF +V+ ++C+ C T C +C A Y
Sbjct: 12 SPGKGRGLKATRSFALGELLFSCPAYTYVLTDNERGNHCDFCFTRKEGLSKCGKCKQAFY 71
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++ W H+ EC
Sbjct: 72 CNVDCQKGDW-PMHKLEC 88
>gi|38566954|emb|CAE76256.1| conserved hypothetical protein [Neurospora crassa]
Length = 533
Score = 38.5 bits (88), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 16/101 (15%)
Query: 89 KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
+GR + + E G+T+ +E P+ + + CN+C +S I
Sbjct: 22 RGRSLFSTERFGAGETIAIIENPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGART 80
Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC+ C+R W H+ EC R+ E G
Sbjct: 81 CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIQERTG 121
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 38.5 bits (88), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 45/96 (46%), Gaps = 16/96 (16%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISAP--- 134
++++ S +GR++IA D+ G+ L K A+ +P C + C +C I P
Sbjct: 8 LELKKSETEGRYIIATRDIDVGENLL--KCKAYFSMP-CEIYKKITCGNCVKIIKRPQQE 64
Query: 135 -------IPCNECILAVYCSESCRREAWLRYHRWEC 163
CN C YCSE C++E+ + +EC
Sbjct: 65 KKKKVKEYKCNVCNEIWYCSEFCKKESQPIHAHYEC 100
>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
Length = 561
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 44/113 (38%), Gaps = 31/113 (27%)
Query: 80 KIKMQ-TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN------CNHC----- 127
KI+++ T + + + A DV KGD +F + + I P N C+HC
Sbjct: 192 KIELRYTDEIRRKSLFARCDVKKGDVVFTDTDLYANIKSPSLDINTYQRLFCDHCMGQLP 251
Query: 128 -----------------CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
CT I I N YCSE CR +AW YH EC
Sbjct: 252 KVSETETAEKVVLCKYGCTRIM--IGSNTVDTTKYCSEKCRDDAWDEYHCQEC 302
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 35/87 (40%), Gaps = 1/87 (1%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP 136
A ++ S KGR + A+ H GD LF A V+ +C C
Sbjct: 5 ARGGLERFCSAGKGRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK 64
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C A YC C++E W H+ EC
Sbjct: 65 CGRCKQAFYCDVECQKEDWP-LHKLEC 90
>gi|124513818|ref|XP_001350265.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
gi|23615682|emb|CAD52674.1| histone-lysine N-methyltransferase, putative [Plasmodium falciparum
3D7]
Length = 509
Score = 38.5 bits (88), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 22/102 (21%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCS-------------------M 121
K++ +G+ V+AV + G L P F+ P C +
Sbjct: 2 FKIEYKEDRGKCVVAVNQIRAGYCLVESHPEIFI--PLCVKYMAPRIIDQNNKKNNFKII 59
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C +C + I C C VYCS+ C AW + HR EC
Sbjct: 60 NTCFYCLEKFNKCICCPNCKYVVYCSDMCLERAW-KSHREEC 100
>gi|327267201|ref|XP_003218391.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Anolis carolinensis]
Length = 1012
Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C HC S+ +PC C CS+SC+ ++W H+ EC A R+
Sbjct: 945 CYHCGRSVGVNLVPCARCFETYSCSKSCKLKSWNERHKRECLQAGRM 991
>gi|219114284|ref|XP_002176313.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402716|gb|EEC42705.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 705
Score = 38.5 bits (88), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 45/121 (37%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN------CNHCCTSISAPI---- 135
S +GRH++A + +G L++E+P+ + LP S+ N C C + P+
Sbjct: 19 SPTRGRHLVATKAQPRGRLLWIERPL--LALP--SLENRPDAWTCAACHAFLGGPVRALQ 74
Query: 136 -------------------------------PCNECILAVYCSESCRREAWLRYHRWECH 164
PC VYCS C R+ W +HR+ C
Sbjct: 75 HRWSRSPTPSSASSTLDDDDDDDHSEEYRTVPCRRNCGFVYCSADCERDVWAAHHRYLCT 134
Query: 165 G 165
G
Sbjct: 135 G 135
>gi|50554579|ref|XP_504698.1| YALI0E32725p [Yarrowia lipolytica]
gi|49650567|emb|CAG80302.1| YALI0E32725p [Yarrowia lipolytica CLIB122]
Length = 655
Score = 38.1 bits (87), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC----HGALRLLEAVGIAHLALKLILVSSHSDRY 192
C C LA+YC C++ W +H++EC + LL V +A + ++ +D
Sbjct: 89 CGGCGLAMYCRAECQKSDWTEHHKYECKIFSNWTTPLLPGVSLALIVRTAFAMAQDTDLR 148
Query: 193 KEVYHLET 200
++ + T
Sbjct: 149 TRIFQMHT 156
>gi|85117964|ref|XP_965349.1| hypothetical protein NCU02962 [Neurospora crassa OR74A]
gi|28927157|gb|EAA36113.1| predicted protein [Neurospora crassa OR74A]
Length = 610
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 16/101 (15%)
Query: 89 KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
+GR + + E G+T+ +E P+ + + CN+C +S I
Sbjct: 22 RGRSLFSTERFGAGETIAIIENPLLALPDDANMRTTCNYCL-YVSGTIEFEGDVEAGART 80
Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC+ C+R W H+ EC R+ E G
Sbjct: 81 CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIQERTG 121
>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
Length = 583
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 42/111 (37%), Gaps = 20/111 (18%)
Query: 73 TLMCASNKIKMQTSNAK--GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTS 130
T+ A + +Q S G + DV +G+ +F P+ M C+ C
Sbjct: 78 TIKAAEGSLSIQESTIPNAGSGMFTTRDVEEGELIFTSVPLVTCAEVGPGMEACDFCFQQ 137
Query: 131 ---ISAPIP---------------CNECILAVYCSESCRREAWLRYHRWEC 163
+ P+ CN C L YCS+SC + AW H +EC
Sbjct: 138 RRRVFHPVEDRFLQPGEVLPPLHICNGCRLYQYCSQSCSQRAWDTGHLYEC 188
>gi|330791333|ref|XP_003283748.1| hypothetical protein DICPUDRAFT_147450 [Dictyostelium purpureum]
gi|325086371|gb|EGC39762.1| hypothetical protein DICPUDRAFT_147450 [Dictyostelium purpureum]
Length = 1240
Score = 38.1 bits (87), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 48/112 (42%), Gaps = 5/112 (4%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S+ GR A + + LF E+P + C +C + P+ C C A Y
Sbjct: 611 SDKVGRITEANSPIPENSLLFQERPYVSCLDRNYHSLYCYNCFKELLVPVYCKGCNNAQY 670
Query: 146 CSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLI-LVSSHSDRYKEVY 196
C +SC +E + H EC L +H +L +I ++ + Y+E+Y
Sbjct: 671 CGDSCLKEDYKNQHSVECDKGFLL----TCSHESLLVIRILIKKAKEYRELY 718
>gi|195381151|ref|XP_002049318.1| GJ20818 [Drosophila virilis]
gi|194144115|gb|EDW60511.1| GJ20818 [Drosophila virilis]
Length = 495
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 36/145 (24%), Positives = 69/145 (47%), Gaps = 18/145 (12%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ + GR+V+A ++ G+T+ +E+PV ++L C +C S+ C
Sbjct: 9 AKKCEVKQNETLGRYVVAATNIKAGETVLLEQPV--LLLANNGDRRCCNCHQLTSS--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSD 190
+C L C++ CRR A L+ + E ALK +L+ H++
Sbjct: 65 GKCRLMPLCADCVEHHGPDCRRLAELQLQEQQVEQLQAHTEVCS----ALKYLLLREHAE 120
Query: 191 R---YKEVYHLETHLQDMRPEDLYQ 212
+ Y+++ LE HL R D+++
Sbjct: 121 KRSHYEQLLQLEAHLARRRDTDIWR 145
>gi|146071727|ref|XP_001463183.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134067266|emb|CAM65535.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 501
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 20/91 (21%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
GR + A + ++ E P+ ++ C + C HC +SA
Sbjct: 193 GRGLYATRHIQPRSSILCESPL---LVQRCDDTKCAHCLAPLSARRGSSITDTGASDPLA 249
Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C C YCSE CR AW +YH C
Sbjct: 250 SGVACPHCQHETYCSEDCRDAAWEQYHVCSC 280
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 43/109 (39%), Gaps = 14/109 (12%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCC-TSISAPIP- 136
+I S+ K R + A G T+ + +M + CN+C T+ +P P
Sbjct: 10 RIHNDPSSPKRRSLRATRAFSPGSTIATFSNPLLALPDGATMRTTCNYCLRTTAPSPSPS 69
Query: 137 -----------CNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC +C+R W H+ EC R+ E G
Sbjct: 70 QSAATPPAFKACTACKAAVYCGPACQRAHWKAAHKAECKMFARVREQAG 118
>gi|242007166|ref|XP_002424413.1| protein msta, isoform A, putative [Pediculus humanus corporis]
gi|212507813|gb|EEB11675.1| protein msta, isoform A, putative [Pediculus humanus corporis]
Length = 540
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 37/157 (23%), Positives = 70/157 (44%), Gaps = 34/157 (21%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAF----VILPPCSMSNCNHCCTSISAPIPC 137
K++ ++ GR++IA E++ KGD +F E P+ V +P C C++ C ++ C
Sbjct: 47 KLEQNDRLGRYLIASENISKGDVIFYESPLIVGPKTVTVPVC--LGCHNVC-NMDNYHEC 103
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALR---------------------LLEAVGIA 176
+ C + CS C+ H EC+ ++ E
Sbjct: 104 DGCGFPL-CSPQCQVSD---NHVDECYQMMKSNYRIKMDSKKKKNNDDNNGGGKEIFYFP 159
Query: 177 HLALKLILVSS--HSDRYKEVYHLETHLQDMRPEDLY 211
+ L+ IL+ +S++YK++ +LE+H D + L+
Sbjct: 160 IVPLRCILLKGKKNSEKYKKIMNLESHYDDRINKPLF 196
>gi|398010425|ref|XP_003858410.1| hypothetical protein, conserved [Leishmania donovani]
gi|322496617|emb|CBZ31687.1| hypothetical protein, conserved [Leishmania donovani]
Length = 501
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 35/91 (38%), Gaps = 20/91 (21%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA---------------- 133
GR + A + ++ E P+ ++ C + C HC +SA
Sbjct: 193 GRGLYATRHIQPRSSILCESPL---LVQRCDDTKCAHCLAPLSARRGSSITDTGASDPLA 249
Query: 134 -PIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C C YCSE CR AW +YH C
Sbjct: 250 SGVACPHCQHETYCSEDCRDAAWEQYHVCSC 280
>gi|307198437|gb|EFN79379.1| Protein msta, isoform A [Harpegnathos saltator]
Length = 517
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 68/143 (47%), Gaps = 16/143 (11%)
Query: 75 MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA- 133
M K K+ S GR+++A +++ G+ + E+P+A + C C S+S
Sbjct: 1 MAPIPKYKVTHSGRLGRYLVAAKNIAAGEVIIREEPIAVGPMTYRKDRLCFACLRSLSKI 60
Query: 134 --PIPCNECILAVYCSESCRREAWLRYHRW-EC--HGALRLL------EAVGIAHLALKL 182
C+ C LA CS +C E ++H EC G R L +A GI LAL+L
Sbjct: 61 GRQYVCSRCNLAPLCSVAC--ETRTKHHTSDECLILGENRDLLAKNGIDATGIL-LALRL 117
Query: 183 ILVSSHSDR-YKEVYHLETHLQD 204
L+ + +++V H+E HL +
Sbjct: 118 WLIKCRNPAVWEQVDHMEDHLHE 140
>gi|157137747|ref|XP_001657162.1| hypothetical protein AaeL_AAEL003674 [Aedes aegypti]
gi|108880819|gb|EAT45044.1| AAEL003674-PA [Aedes aegypti]
Length = 446
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR 160
KG + EKP A V+ C+ C + C+ C+ YC+ C++EAW H+
Sbjct: 8 KGSLILKEKPFACVLHSRYRAERCDKCFKP-GKVLKCSNCLYVRYCNRFCQKEAWPD-HQ 65
Query: 161 WECHGALRLLEAVGIAHLAL---KLILVSSHSDRYKEVYH 197
EC G L+ + + AL ++I Y++ Y+
Sbjct: 66 DEC-GKLKAIGTRTVPDAALMISRIIRKLQKGGDYQKGYY 104
>gi|12322390|gb|AAG51222.1|AC051630_19 unknown protein; 33246-28649 [Arabidopsis thaliana]
Length = 781
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 16/72 (22%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNE 139
K++ ++ KGR +++ D+ + + VE+P F +LP + +PC
Sbjct: 219 KLRCVSTKEKGRGMVSECDIEEASVIHVEEP--FSVLP--------------ADTVPCPS 262
Query: 140 CILAVYCSESCR 151
C + VYCSESC+
Sbjct: 263 CSIPVYCSESCQ 274
>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 367
Score = 38.1 bits (87), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 43/101 (42%), Gaps = 19/101 (18%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKP-VAFVILPPCSMSNCNHCCTSISAP----- 134
IK + +GR + DV G ++ EKP V++ L + + CNHC S+
Sbjct: 60 IKDSGDSLRGRGLFTKLDVQSGQVIYTEKPFVSYQSLEIDNKTICNHCLKSLDKDKIEKS 119
Query: 135 ------------IPCNECILAVYCSESCRREAWLRYHRWEC 163
C +C YCSE C+ E+ ++YH C
Sbjct: 120 KQKKDDDFDDLFSKCVKCS-TRYCSEQCKSESDIQYHLASC 159
>gi|430813591|emb|CCJ29069.1| unnamed protein product [Pneumocystis jirovecii]
Length = 463
Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 8/83 (9%)
Query: 89 KGRHVIAVEDVHKGDTLFVE-KPVAFVILPPCSMSNCNHCCTS-------ISAPIPCNEC 140
KG + + + +G+ L+V KP+ V+ + C++C S I I C C
Sbjct: 20 KGYGLYSKKRFAEGERLYVFVKPLVAVLDEKHIKNTCSNCFISEEDKDDDIFHLIQCKGC 79
Query: 141 ILAVYCSESCRREAWLRYHRWEC 163
+ YC + C++E W +H++EC
Sbjct: 80 KILQYCCQKCQKEDWEIFHQYEC 102
>gi|328697017|ref|XP_003240210.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Acyrthosiphon pisum]
Length = 635
Score = 38.1 bits (87), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 23/115 (20%), Positives = 49/115 (42%), Gaps = 5/115 (4%)
Query: 52 TSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV 111
T + + + L G + + S ++++ S R V A D++ GD + +++P
Sbjct: 203 TEQERAMKIPIIEHLVGGRNDNIPALSAFVELKMSEDMRRGVYATRDINPGDVVAIDEP- 261
Query: 112 AFVILPPCSMSNC---NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
++ P + + N C A C +C + YC++ C + + H + C
Sbjct: 262 -YICGPSGNDTEVCHYNGCLKMDFALFRCPKCFMVYYCTKDCMDKDYKEGHNFAC 315
>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
Length = 544
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 14/142 (9%)
Query: 86 SNAKGRHVIAVEDVHKGDT-LFVEKPVAFVILPPCSMSNCN------HCCTSISAPIP-C 137
S G + A ++ G T + +++P V+ C+ HC ++ + C
Sbjct: 65 SPGAGTGIFAATNIPVGGTVMLIQRPFVAVLDTERLQDTCSGCLGQHHCNRDVNVELKAC 124
Query: 138 NECILAVYCSESCRREAWLRYHRWECHGALRLLEAV--GIAHLALKLILVSSHSDRYKE- 194
C + YC+++C+ + W H +EC L+ V A L+++L S KE
Sbjct: 125 TGCRVVKYCNKTCQAKDWKFAHSFECRVYKELMPRVLPSNARAILRMVLRSERGKYPKEE 184
Query: 195 ---VYHLETHLQDMRPEDLYQY 213
LE+H+ +++ ++ Q+
Sbjct: 185 LELFRKLESHMAEIQAQNWEQW 206
>gi|196012830|ref|XP_002116277.1| hypothetical protein TRIADDRAFT_60207 [Trichoplax adhaerens]
gi|190581232|gb|EDV21310.1| hypothetical protein TRIADDRAFT_60207 [Trichoplax adhaerens]
Length = 543
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 37/93 (39%), Gaps = 3/93 (3%)
Query: 73 TLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS 132
T S IK T +G ++ +G FV P S C C SI
Sbjct: 366 TQTVGSLGIKTSTVGGEGEIAENTKNSSRGSRRLTPGRHRFVAKP--SFCYCYECGRSIG 423
Query: 133 API-PCNECILAVYCSESCRREAWLRYHRWECH 164
+ C+ C+ YCS+ C+ AW H+ EC+
Sbjct: 424 VRLTACSRCLKVYYCSKQCKLRAWNARHKDECY 456
>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 38.1 bits (87), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 44/115 (38%), Gaps = 32/115 (27%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA-------- 133
S+ KG+ + A + KG+T+F EKP+ F C+HC S+
Sbjct: 27 VSSGKGKGLFATRAIRKGETIFQEKPLVSSQFQWNALYRYRACDHCLRSLETAEENAQRL 86
Query: 134 --------PIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
P P C C + YCS C + A +YHR C GA R
Sbjct: 87 SGNAHVVLPYPELCTVRNGLHQQCPRCQV-TYCSAECLKAAADQYHRALCLGASR 140
>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
morsitans]
Length = 395
Score = 38.1 bits (87), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 47/114 (41%), Gaps = 34/114 (29%)
Query: 88 AKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCC---------------- 128
+GR +++ + ++ D +F E+P F + C+HC
Sbjct: 11 GRGRAMVSTKSLNTDDIIFEEEPFVSCQFSWNAAYGYAACDHCMRPLETITENIRRLANQ 70
Query: 129 TSISAPIP--------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
+++ P+ C++C + YCSE CR EA RYHR C G+ R
Sbjct: 71 PALAVPLTEYDPTAQWLKQFTSCSKCRVR-YCSEECRIEALKRYHRIACLGSYR 123
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 38.1 bits (87), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + A G+ LF +V+ ++C+ C C +C A Y
Sbjct: 12 SPGKGRGLKATRSFALGELLFTCPAYTYVLTDTERGNHCDFCFARKEGLSKCGKCKQAFY 71
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++ W H+ EC
Sbjct: 72 CNVDCQKGDW-PMHKLEC 88
>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 420
Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 44/114 (38%), Gaps = 32/114 (28%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA--------- 133
S+ KG+ + A + KG+T+F EKP+ F C+HC S+
Sbjct: 26 SSGKGKGLFATRAIRKGETIFQEKPLVSSQFQWNALYRYRACDHCLRSLETAEENAQRLS 85
Query: 134 -------PIP------------CNECILAVYCSESCRREAWLRYHRWECHGALR 168
P P C C + YCS C + A +YHR C GA R
Sbjct: 86 GNAHVVLPYPELCTVRNGLHQQCPRCQV-TYCSAECLKAAADQYHRALCLGASR 138
>gi|322707137|gb|EFY98716.1| wd and tetratricopeptide repeat protein [Metarhizium anisopliae
ARSEF 23]
Length = 553
Score = 38.1 bits (87), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 60/139 (43%), Gaps = 29/139 (20%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
++ E+S++A++ + + + + VA+LP L G S+T + + T G V
Sbjct: 201 DRFAEDSLKALNADLKTMAPKCAVQVATLPVLLEGASNT----DHYDIIPTCKQLG--VF 254
Query: 95 AVEDVHKGDTLFVE-----------KPVAFVI---LPPCSMSNCNHCCTSISAPIPCNEC 140
AVED+ G+ + E PV LPP +N + C+EC
Sbjct: 255 AVEDIEPGEDVLREYSLLTANNRLKDPVCDACSSDLPPLGSNN---------EAVNCDEC 305
Query: 141 ILAVYCSESCRREAWLRYH 159
V+CS+ C A RYH
Sbjct: 306 YDTVFCSQFCHDVAQERYH 324
>gi|299752592|ref|XP_001831035.2| hypothetical protein CC1G_09300 [Coprinopsis cinerea okayama7#130]
gi|298409907|gb|EAU90823.2| hypothetical protein CC1G_09300 [Coprinopsis cinerea okayama7#130]
Length = 573
Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 23/44 (52%)
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
T I+ C+ C VYCS C+RE W HR EC LR EA
Sbjct: 389 TEITMGAQCSGCHSVVYCSLQCQREDWNERHRTECLAMLRSYEA 432
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 37.7 bits (86), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 11/85 (12%)
Query: 90 GRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMSNCNHC-----CTSISAPIP-----CN 138
G+ V A DV KG D L V+ P+ ++ + C+ C SI P C
Sbjct: 46 GQGVFATIDVKKGADVLVVDNPLIALVEEAQLQNICSGCYDTSKAGSIDNRRPDLVKACT 105
Query: 139 ECILAVYCSESCRREAWLRYHRWEC 163
C + YC ++C+R+ W H EC
Sbjct: 106 RCKVVYYCDKNCQRKDWKAGHSLEC 130
>gi|338725783|ref|XP_003365201.1| PREDICTED: LOW QUALITY PROTEIN: ankyrin repeat and MYND
domain-containing protein 1-like [Equus caballus]
Length = 917
Score = 37.7 bits (86), Expect = 2.8, Method: Composition-based stats.
Identities = 22/68 (32%), Positives = 32/68 (47%), Gaps = 11/68 (16%)
Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLLE---------AV 173
C C SI +PC C + CS+ C+ +AW +H+ +C AL L +
Sbjct: 815 CYQCGRSIGVRLVPCTRCYGILTCSKYCKTKAWTDFHKKDC-SALAALAGDFVQHFDMSA 873
Query: 174 GIAHLALK 181
G HL+LK
Sbjct: 874 GKEHLSLK 881
>gi|328698136|ref|XP_003240555.1| PREDICTED: hypothetical protein LOC100574406 isoform 1
[Acyrthosiphon pisum]
gi|328698140|ref|XP_003240557.1| PREDICTED: hypothetical protein LOC100574406 isoform 3
[Acyrthosiphon pisum]
Length = 766
Score = 37.7 bits (86), Expect = 2.9, Method: Composition-based stats.
Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 12/125 (9%)
Query: 55 SASSSVASLPSLACGESHTLM--CASNKIKMQTSN---AKGRHVIAVEDVHK---GDT-- 104
+AS S +LP A + ++ N I M S+ A+G+ +I +K G T
Sbjct: 639 TASLSKTALPIPATPTTFVIIKPAVPNNISMANSSLNRAEGKTIIVGNKQYKFVKGPTGE 698
Query: 105 --LFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWE 162
V K FV PP + C + I C+ C YCS C++ W H +
Sbjct: 699 LRAVVNKTKTFVKPPPTIKTKCYSRNCTKPVTIMCSSCTSVKYCSNICQQHDWYDGHIND 758
Query: 163 CHGAL 167
CH L
Sbjct: 759 CHHLL 763
>gi|326435247|gb|EGD80817.1| hypothetical protein PTSG_01403 [Salpingoeca sp. ATCC 50818]
Length = 431
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 21/45 (46%)
Query: 119 CSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C C C ++ P+ C C YCS C+R+ W H+ EC
Sbjct: 378 CRGPQCAACNCFLTGPLLCGRCRQVAYCSSECQRQHWKAAHKREC 422
>gi|237757339|ref|NP_766438.2| ankyrin repeat and MYND domain-containing protein 1 [Mus musculus]
Length = 1001
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW+ +H+ +C+ + ++
Sbjct: 940 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 987
>gi|148708043|gb|EDL39990.1| ankyrin repeat and MYND domain containing 1 [Mus musculus]
Length = 1002
Score = 37.7 bits (86), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW+ +H+ +C+ + ++
Sbjct: 941 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 988
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 32/78 (41%), Gaps = 1/78 (1%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KGR + G+ L + ++V+ +C C T + C +C A Y
Sbjct: 13 SPGKGRGLRVTRAFQVGELLLSSQAYSYVLSVKERGEHCESCFTRKKSLARCGKCKKAFY 72
Query: 146 CSESCRREAWLRYHRWEC 163
C C++ W HR EC
Sbjct: 73 CDVKCQKGDWA-MHRLEC 89
>gi|391337345|ref|XP_003743030.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Metaseiulus occidentalis]
Length = 437
Score = 37.7 bits (86), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 14/79 (17%)
Query: 101 KGDTLFVEKPVAFVILPPCSMSNCNHCCTSIS-----APIPCNECILAVYCSESCRR--- 152
KG+ + +P AF + S C+ C + API C+ C + YCS+ C R
Sbjct: 6 KGEVILFSRPYAFALYESFVDSTCSFCFRTQGDGHDFAPIECHSCKVYKYCSDECLRLDQ 65
Query: 153 -EAWLRYHRWECHGALRLL 170
+ WL EC A R L
Sbjct: 66 MDHWL-----ECQVAKRKL 79
>gi|403259468|ref|XP_003922234.1| PREDICTED: tudor domain-containing protein 1 [Saimiri boliviensis
boliviensis]
Length = 1234
Score = 37.7 bits (86), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 32/151 (21%), Positives = 69/151 (45%), Gaps = 19/151 (12%)
Query: 18 RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCA 77
RL+ + + + K + L E++++ ++ Q +++SS S S+ + G+ L
Sbjct: 54 RLKSSENANKKNNFLLCEQTKQ----YLASQEDNSASSNSGSINGEVVASQGDRKKLPAG 109
Query: 78 SNK--IKMQTSNAKGRHVIAVEDVHKG----------DTLFVEKPVAFVIL-PPCSMSNC 124
+++ ++ ++ KG + +VH ++L + P F L PP + C
Sbjct: 110 NSESSPSVENNSPKGVNTKPENNVHPAKSKKLHKLVENSLSINNPELFTSLGPPLRSTTC 169
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAW 155
+ C + + C++C YCS +C+R W
Sbjct: 170 HRC--GLFGSLRCSQCKQTYYCSTACQRRDW 198
>gi|113197611|gb|AAI20888.1| Ankmy1 protein [Mus musculus]
Length = 905
Score = 37.7 bits (86), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW+ +H+ +C+ + ++
Sbjct: 844 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 891
>gi|81898358|sp|Q8C0W1.1|ANMY1_MOUSE RecName: Full=Ankyrin repeat and MYND domain-containing protein 1
gi|26325636|dbj|BAC26572.1| unnamed protein product [Mus musculus]
gi|113197927|gb|AAI20889.1| Ankyrin repeat and MYND domain containing 1 [Mus musculus]
Length = 906
Score = 37.7 bits (86), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW+ +H+ +C+ + ++
Sbjct: 845 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 892
>gi|260798620|ref|XP_002594298.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
gi|229279531|gb|EEN50309.1| hypothetical protein BRAFLDRAFT_117670 [Branchiostoma floridae]
Length = 400
Score = 37.7 bits (86), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 24/98 (24%), Positives = 44/98 (44%), Gaps = 7/98 (7%)
Query: 74 LMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSM-SNCNHCCTSIS 132
L C + + N KG + G + E+P ++ + + + C++C +
Sbjct: 8 LFCEIGQGRGIRCNKKGS-----SGIEPGTLIVKEEPYSYTLTDGELLRTRCHYCLKRLE 62
Query: 133 APIPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
+ C+ C A YC+E C++ A +H EC G RL+
Sbjct: 63 NSVSCDACRTAKYCNEECKKAAKF-HHTPECRGYSRLM 99
>gi|320165180|gb|EFW42079.1| SMYD5 protein [Capsaspora owczarzaki ATCC 30864]
Length = 429
Score = 37.7 bits (86), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 23/86 (26%), Positives = 40/86 (46%), Gaps = 4/86 (4%)
Query: 53 SSSASSSVASLPSLACGESHTLMCASNKIKMQTSNA-KGRHVIAVEDVHKGDTLFVEKPV 111
S A+ V++ P E +CA ++++ +A KGR + A + KGD +F E P+
Sbjct: 19 SMPAAELVSARPLPLADEYFATLCAGKHVQVRVEDARKGRGLFATQAFKKGDIVFTEAPL 78
Query: 112 A---FVILPPCSMSNCNHCCTSISAP 134
F+ C+ C S+ +P
Sbjct: 79 VCAQFLWNEAYGYKACHQCMRSLESP 104
>gi|225711848|gb|ACO11770.1| SET and MYND domain-containing protein 3 [Lepeophtheirus salmonis]
Length = 426
Score = 37.4 bits (85), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/85 (24%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 102 GDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHR 160
GD + +P ++ C++C +S A I C+ C + YC C+R +W H+
Sbjct: 6 GDEVLRCRPFSYAFKEKNRDKICDYCFSSRQAKILRCSRCCIIYYCGRQCQRLSWSEIHK 65
Query: 161 WECHGALRLLEAVGIAHLALKLILV 185
EC + L+ + + L L+ +
Sbjct: 66 KEC----KYLKMLDLKEPPLDLLFI 86
>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
Length = 511
Score = 37.4 bits (85), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 120 SMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+++ C +C ++ I C C YCS+SC AW ++HR EC
Sbjct: 58 TINICFYCFEKVNKCIYCPNCKYVAYCSDSCLERAW-KFHREEC 100
>gi|441613050|ref|XP_003265162.2| PREDICTED: N-lysine methyltransferase SMYD2 [Nomascus leucogenys]
Length = 536
Score = 37.4 bits (85), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 34/138 (24%), Positives = 51/138 (36%), Gaps = 11/138 (7%)
Query: 28 KEDVLTIEKSEENSVEAMSMQVSSTSSSASS--SVASLPSLACGESHTLMCASNKIKMQT 85
+ED+ + EE+ S + T S A +V SL G ++
Sbjct: 65 REDMGGEDHGEEDGSTGFSPWILETPSKAPPLRTVRSLEDEGKG--------LGGMERFC 116
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVY 145
S KG GD LF A+V+ ++C +C C C A Y
Sbjct: 117 SPGKGPGAAGSAAFQVGDLLFSCPAYAYVLTVNERGNHCEYCFARKEGLSKCGRCKQAFY 176
Query: 146 CSESCRREAWLRYHRWEC 163
C+ C++E W H+ EC
Sbjct: 177 CNVECQKEDW-PMHKLEC 193
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 30/141 (21%)
Query: 78 SNKIKMQTSNAK-GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN---CNHCCTSISA 133
S+ I+++ + K GR + A+ KG +F E+P FV P + CNHC ++
Sbjct: 63 SSSIQIKDNGTKSGRGLFAITAFKKGSQIFQEEP--FVSYPSLDVDKSTICNHCLKQLNN 120
Query: 134 PI------------PCNECILAVYCSESCRREAWLRYHRWECHGA--------LRLLEAV 173
C +C YCSE CR A +++H C + L+E
Sbjct: 121 NGNNNNDNKSKQQQQCTKCG-ERYCSEKCRSSAEVQHHLASCPTSSNIDNITRYSLVEKR 179
Query: 174 GIAHLALKL---ILVSSHSDR 191
LA ++ IL+ +H D+
Sbjct: 180 RFPLLAARILSRILLENHFDK 200
>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
[Oreochromis niloticus]
Length = 417
Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 45/110 (40%), Gaps = 32/110 (29%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
N KG+ + A + KG+T+F+E+P+ F+ C +C
Sbjct: 29 NVKGKGLFAKRSMKKGETIFIERPLVSAQFLWNTLYKYKACEYCLRALETAEGNARRLSG 88
Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
+S P P C +C + +YCS CR+ A +YHR C G
Sbjct: 89 IPGLSLPHPELCRVRPEQHQACPQCQV-MYCSSECRQAAADQYHRALCLG 137
>gi|322698402|gb|EFY90172.1| MYND domain protein, putative [Metarhizium acridum CQMa 102]
Length = 553
Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 37/135 (27%), Positives = 58/135 (42%), Gaps = 29/135 (21%)
Query: 39 ENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVED 98
E+S++A++ + + + + VA+LP L G S+T + + T G V AVED
Sbjct: 205 EDSLKALNADLKTMAPKCAVQVATLPVLLEGASNT----DHYDIIPTCKQLG--VFAVED 258
Query: 99 VHKGDTLFVE-----------KPVAFVI---LPPCSMSNCNHCCTSISAPIPCNECILAV 144
+ G+ + E PV LPP +N + C+EC V
Sbjct: 259 IGPGEVVLREYSLLTANNRLKDPVCDACSSDLPPLGSNN---------EAVNCDECYDTV 309
Query: 145 YCSESCRREAWLRYH 159
+CS+ C A RYH
Sbjct: 310 FCSQYCHDVAQERYH 324
>gi|293349946|ref|XP_001068522.2| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Rattus norvegicus]
gi|392350877|ref|XP_003750785.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Rattus norvegicus]
gi|149037544|gb|EDL91975.1| rCG55486 [Rattus norvegicus]
Length = 1006
Score = 37.4 bits (85), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 14/48 (29%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW+ +H+ +C+ + ++
Sbjct: 946 CYQCGRSIGVRLSPCPRCYGILTCSKYCKTKAWIEFHKKDCNDIMAMI 993
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 37.4 bits (85), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 16/101 (15%)
Query: 89 KGRHVIAVEDVHKGDTL-FVEKPVAFVILPPCSMSNCNHCCTSISAPI------------ 135
+GR + + E G+T+ ++ P+ + + CN+C +S I
Sbjct: 22 RGRSLFSTESFGAGETIAIIDNPLLALPDDANMRTTCNYCLY-VSGTIEFEGDVEAGPRT 80
Query: 136 --PCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVG 174
C C AVYC+ C+R W H+ EC R+ E G
Sbjct: 81 CKACTGCKAAVYCNAECQRAHWKLVHKAECKMFKRIKERTG 121
>gi|302674593|ref|XP_003026981.1| hypothetical protein SCHCODRAFT_238272 [Schizophyllum commune H4-8]
gi|300100666|gb|EFI92078.1| hypothetical protein SCHCODRAFT_238272 [Schizophyllum commune H4-8]
Length = 634
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 13/79 (16%)
Query: 115 ILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHR---------WECHG 165
+L PC+ +N + A PC E A YCS+ C+RE W HR W G
Sbjct: 425 LLFPCNNTNAVVHEAEVRA-CPCGE---AFYCSKKCQREHWKTSHRDQCRRADDVWGTQG 480
Query: 166 ALRLLEAVGIAHLALKLIL 184
+ L +AV + AL++++
Sbjct: 481 DITLGDAVFLFESALRVVV 499
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 8/124 (6%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI 141
++ + +GR + A +V G+++ + V+L C C + A I C++C
Sbjct: 9 RVDSDATRGRLLRATRNVKMGESILRDCAYGNVVLSGNRAELCAVCLRAADADICCDDCS 68
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLETH 201
+CSE C RE H EC EA+ L V HSD V +
Sbjct: 69 KVFFCSEEC-REKLRDIHENEC-------EALEEVDLIASKTSVDLHSDEEGNVRSSYAN 120
Query: 202 LQDM 205
++D+
Sbjct: 121 VKDL 124
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 25/95 (26%), Positives = 41/95 (43%), Gaps = 10/95 (10%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV----AFVILPPCSMSNCNHCCTSI--SAP 134
+++ +GR ++A + + G + E P+ AF L C+HC + SA
Sbjct: 12 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 71
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C C L +CS +C + + W C LRL
Sbjct: 72 QKCQSCSLVSFCSPNC----FASHTPWLCESLLRL 102
>gi|156101287|ref|XP_001616337.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148805211|gb|EDL46610.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 517
Score = 37.4 bits (85), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 37/103 (35%), Gaps = 18/103 (17%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------------- 123
S K+ KG+ V+AV + G + P FV L M+
Sbjct: 6 SQMYKVDYKEDKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDTENKKTDYKT 65
Query: 124 ---CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C +C S C C VYCSE C W + HR EC
Sbjct: 66 VNLCFYCFEKFSKSFYCPNCKYVVYCSEVCLERGW-KLHRDEC 107
>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
CBS 8904]
Length = 573
Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 33/139 (23%), Positives = 57/139 (41%), Gaps = 34/139 (24%)
Query: 77 ASNKIKMQTSNAKGRHVIAVEDVH------------KGDTLFVEKPVAFVILPPCSMS-- 122
A + ++++TS +GR + A D+ G+TL KP+ V+ P
Sbjct: 55 AGSPLEVKTSPQRGRGLYAARDIKAGELVWRAPKLTPGETLLKTKPLVSVLSKPNLYEPA 114
Query: 123 -------NCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALRLLEAVGI 175
+CNHC A C+ C YC +C+++ W H+ EC R
Sbjct: 115 AGQKMELHCNHCYLLKPAQR-CSGCKGVYYCGAACQQDDWPS-HKTECKALTR------- 165
Query: 176 AHLALKLILVSSHSDRYKE 194
++ + V S+ ++ KE
Sbjct: 166 ----VRQLWVQSYPEKAKE 180
>gi|348515417|ref|XP_003445236.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 2
[Oreochromis niloticus]
Length = 373
Score = 37.4 bits (85), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 45/110 (40%), Gaps = 32/110 (29%)
Query: 87 NAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHC---------------- 127
N KG+ + A + KG+T+F+E+P+ F+ C +C
Sbjct: 29 NVKGKGLFAKRSMKKGETIFIERPLVSAQFLWNTLYKYKACEYCLRALETAEGNARRLSG 88
Query: 128 CTSISAPIP------------CNECILAVYCSESCRREAWLRYHRWECHG 165
+S P P C +C + +YCS CR+ A +YHR C G
Sbjct: 89 IPGLSLPHPELCRVRPEQHQACPQCQV-MYCSSECRQAAADQYHRALCLG 137
>gi|356546765|ref|XP_003541793.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
max]
Length = 990
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 115 ILPPCSMS-NCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWECH 164
++PP S S N +H C SAP C+ C + YCS +C+ W H+ EC
Sbjct: 62 VIPPVSASKNEHHVCARCSAPARTRCSRCKIVRYCSGNCQIIHWRLIHKQECQ 114
>gi|302794157|ref|XP_002978843.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
gi|300153652|gb|EFJ20290.1| hypothetical protein SELMODRAFT_418586 [Selaginella moellendorffii]
Length = 849
Score = 37.0 bits (84), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 2/90 (2%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
S + M S K + +V G + E+ +A ++L ++C+ C + A +P
Sbjct: 206 SAQTAMYWSADKQWGLRTTSEVQAGTLVHSEEALAGILLKKHRPTHCHGCFGVVPADSVP 265
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGA 166
C C + YC+++CR +A + H+ EC G+
Sbjct: 266 CIGCGVVSYCNDACRDDATVE-HKLECGGS 294
>gi|426253397|ref|XP_004020382.1| PREDICTED: tudor domain-containing protein 1 [Ovis aries]
Length = 1326
Score = 37.0 bits (84), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/150 (25%), Positives = 66/150 (44%), Gaps = 16/150 (10%)
Query: 18 RLQDTSHLDFKEDVLTIEK------SEENSVEAMSMQVSSTSSSASSSVASLPSLACGES 71
RL+ + + K + L E+ S+E+S ++S + + + S +L G S
Sbjct: 183 RLKSPENGNKKNNFLLCEQTKQFLASQEDS--SLSSHPNGVNGEMAGSKEDRKTLPTGNS 240
Query: 72 HTLMCASNKIKMQTSNAK-GRHVIAVE--DVHK--GDTLFVEKPVAFVIL-PPCSMSNCN 125
+ + A N + N+K HV + VHK ++L + P F L PP + C+
Sbjct: 241 VSPLNAGNNSPPKEVNSKPSNHVPPAKSKKVHKLVENSLSINNPALFNSLGPPLRSTTCH 300
Query: 126 HCCTSISAPIPCNECILAVYCSESCRREAW 155
C + + C++C YCS C+R W
Sbjct: 301 RC--GLFGSLRCSQCKQTYYCSSVCQRRDW 328
>gi|303278744|ref|XP_003058665.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459825|gb|EEH57120.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 620
Score = 37.0 bits (84), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 46/106 (43%), Gaps = 2/106 (1%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNEC 140
+++ T +GR ++A + G P+A + C C C C
Sbjct: 24 VEVVTREGRGRCLVATRSLTPGTLALSSLPLACAPFQELTRQRCGACFERPPIVRRCGAC 83
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVS 186
A++C +C+ AW+ H EC A + LE + A +A + +LVS
Sbjct: 84 DRALFCDRACQAAAWVAGHSRECEKA-KELEKLPTA-VASEAVLVS 127
>gi|17535507|ref|NP_496323.1| Protein SET-14 [Caenorhabditis elegans]
gi|3123316|sp|Q09415.2|SET14_CAEEL RecName: Full=SET domain-containing protein 14
gi|3878909|emb|CAA86783.1| Protein SET-14 [Caenorhabditis elegans]
Length = 429
Score = 37.0 bits (84), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 9/96 (9%)
Query: 108 EKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGAL 167
E AF + S CN C TS++ C+ C YCS+ C+R W + H+ EC
Sbjct: 10 ENVYAFALARDQVESYCNQCLTSMAELKKCSACRRLAYCSQECQRADW-KLHKVECKAIK 68
Query: 168 RLLEAVGIAHLALKLIL-----VSSHSDRYKEVYHL 198
E +A+ +++L++ +S + D E Y++
Sbjct: 69 THNE---VANDSIRLVMRIAGKLSRNEDGEIEAYYI 101
>gi|440901267|gb|ELR52242.1| Ankyrin repeat and MYND domain-containing protein 1, partial [Bos
grunniens mutus]
Length = 958
Score = 37.0 bits (84), Expect = 5.0, Method: Composition-based stats.
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRL 169
C C SI + PC C + CS+ C+ +AW +H+ +C+ + +
Sbjct: 912 CYQCGRSIGVRLAPCTRCYGILTCSKYCKTKAWSDFHKKDCNALMAI 958
>gi|281341007|gb|EFB16591.1| hypothetical protein PANDA_012308 [Ailuropoda melanoleuca]
Length = 951
Score = 37.0 bits (84), Expect = 5.0, Method: Composition-based stats.
Identities = 13/41 (31%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWEC 163
C+ C S+ +PC C + CS+ C+ AW +H+ +C
Sbjct: 908 CSQCGRSVGVRLVPCTRCCGVLTCSKYCKTRAWGDFHKRDC 948
>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 524
Score = 37.0 bits (84), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWEC 163
I C C + YCSESC++ AW +YH+ +C
Sbjct: 259 IVCGHCKVEYYCSESCQQLAWSQYHQIQC 287
>gi|169866977|ref|XP_001840071.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
gi|116498849|gb|EAU81744.1| hypothetical protein CC1G_12125 [Coprinopsis cinerea okayama7#130]
Length = 574
Score = 37.0 bits (84), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 123 NCNHCCTSISAP------IPCNECILAVYCSESCRREAWLRYHRWEC 163
N H AP C+ C VYCSE+C+++ W R H+ EC
Sbjct: 376 NIGHLDRGFRAPSQGEGVRQCSRCRSVVYCSEACQKQDWERLHKREC 422
>gi|425770916|gb|EKV09375.1| hypothetical protein PDIP_64570 [Penicillium digitatum Pd1]
gi|425776732|gb|EKV14940.1| hypothetical protein PDIG_30190 [Penicillium digitatum PHI26]
Length = 495
Score = 37.0 bits (84), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 34/148 (22%), Positives = 58/148 (39%), Gaps = 13/148 (8%)
Query: 71 SHTLMCASNKIKMQTS-NAKGRHVIAVEDVHKG-DTLFVEKPVAFVILPPCSMSNCNHCC 128
S TL S + Q + N G + A D++ G D L ++ P V+ P C C
Sbjct: 4 SKTLPSVSTRTVPQPAPNGMGNGLFATADINPGEDVLHIKTPFVAVLDSPRLEDTCAGCF 63
Query: 129 T-----SISAPIPCNECILAVYCSESCRREAWLRYHRWEC------HGALRLLEAVGIAH 177
+ + C C + YC +C+ + W H EC + A +
Sbjct: 64 GKRQMQNGNELKACTGCRVVKYCDRACQSKDWKFAHSVECLIFKNVKPMVLPNNARALLR 123
Query: 178 LALKLILVSSHSDRYKEVYHLETHLQDM 205
+ L+ + S+ +K LETH+ ++
Sbjct: 124 IVLRTVRNKYDSEEFKVFDGLETHMNEI 151
>gi|47213741|emb|CAF96102.1| unnamed protein product [Tetraodon nigroviridis]
Length = 787
Score = 37.0 bits (84), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWEC 163
C +C S+ + PC C YCS CR AW H+ EC
Sbjct: 747 CYYCGRSVLVRLTPCYRCYKVFYCSRPCRLRAWDELHKNEC 787
>gi|302787779|ref|XP_002975659.1| hypothetical protein SELMODRAFT_415607 [Selaginella moellendorffii]
gi|300156660|gb|EFJ23288.1| hypothetical protein SELMODRAFT_415607 [Selaginella moellendorffii]
Length = 1052
Score = 37.0 bits (84), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 2/90 (2%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAP-IP 136
S + M S K + +V G + E+ +A ++L ++C+ C + A +P
Sbjct: 176 SAQTAMYWSADKQWGLRTTSEVQAGTLVHSEEALAGILLKKHRPTHCHGCFGVVPADSVP 235
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGA 166
C C + YC+++CR +A + H+ EC G+
Sbjct: 236 CIGCGVVSYCNDACRDDATVE-HKLECGGS 264
>gi|332372756|gb|AEE61520.1| unknown [Dendroctonus ponderosae]
Length = 501
Score = 37.0 bits (84), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 5/57 (8%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
K++ S GR+ +A ED+ GD +F E P A+ P S S+C C +P+ C
Sbjct: 8 FKVEKSETVGRYAVASEDLKPGDVIFSETPFAY---GPKSDSSC--LCLGCHSPVDC 59
>gi|302506607|ref|XP_003015260.1| hypothetical protein ARB_06383 [Arthroderma benhamiae CBS 112371]
gi|291178832|gb|EFE34620.1| hypothetical protein ARB_06383 [Arthroderma benhamiae CBS 112371]
Length = 1143
Score = 37.0 bits (84), Expect = 5.4, Method: Composition-based stats.
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C +A YCS+ C+RE W+ H+ C
Sbjct: 1110 CSGCKVAEYCSKDCQREDWMARHKLAC 1136
>gi|410969783|ref|XP_003991371.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1 [Felis
catus]
Length = 1104
Score = 37.0 bits (84), Expect = 5.5, Method: Composition-based stats.
Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAP-IPCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C S+ +PC C + CS++C+ AW +H+ +C L ++
Sbjct: 1040 CYQCGRSVGVRLVPCTRCYGILTCSKACKTRAWGDFHKRDCGSLLAIV 1087
>gi|302653465|ref|XP_003018558.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
gi|291182212|gb|EFE37913.1| hypothetical protein TRV_07403 [Trichophyton verrucosum HKI 0517]
Length = 1143
Score = 37.0 bits (84), Expect = 5.6, Method: Composition-based stats.
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C+ C +A YCS+ C+RE W+ H+ C
Sbjct: 1110 CSGCKVAEYCSKDCQREDWMARHKLAC 1136
>gi|367050672|ref|XP_003655715.1| hypothetical protein THITE_60220 [Thielavia terrestris NRRL 8126]
gi|347002979|gb|AEO69379.1| hypothetical protein THITE_60220 [Thielavia terrestris NRRL 8126]
Length = 592
Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 33/134 (24%), Positives = 61/134 (45%), Gaps = 19/134 (14%)
Query: 35 EKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVI 94
++ E S+ ++ Q+S + V++LP L GES+T + ++ K +
Sbjct: 214 DRFSEQSLSYLNQQLSEVAPKCEVRVSNLPVLLDGESNT-----DGYEI-VPTCKQLGLF 267
Query: 95 AVEDVHKGDTLFVEKPVAFVILPPCSM---SNCNHCCTSI------SAPIPCNECILAVY 145
A ED+ G+T+ E + +L + S C+ C T + S + C +C ++
Sbjct: 268 AKEDIAPGETVLRE----YSLLTANNRHKESTCDACGTELPPLGSNSTAVSCPDCYDTIF 323
Query: 146 CSESCRREAWLRYH 159
C E C ++A YH
Sbjct: 324 CDEFCFKKAQELYH 337
>gi|125809191|ref|XP_001361025.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
gi|54636198|gb|EAL25601.1| GA21935 [Drosophila pseudoobscura pseudoobscura]
Length = 501
Score = 37.0 bits (84), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 70/146 (47%), Gaps = 20/146 (13%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ + GR V+A D+ G+TL +E+P+ V+LP C+ C ++A C
Sbjct: 9 AKKCELKQNETLGRFVVARCDLSAGETLLLEQPI--VVLPHIGDRRCSK-CFKLTATF-C 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
+C L C + C+R L+ + +++L+ + AL +L+ H
Sbjct: 65 GKCRLLSLCEDCAGHDERDCQRLRELQLSDEQ----VQILQEKANTEVQPALACLLLGEH 120
Query: 189 SDR---YKEVYHLETHLQDMRPEDLY 211
++E+ +ETHL+ R +++
Sbjct: 121 QHSRPLFEEMSQMETHLEARRGTEIW 146
>gi|320586941|gb|EFW99604.1| set and mynd domain protein [Grosmannia clavigera kw1407]
Length = 576
Score = 37.0 bits (84), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 18/63 (28%), Positives = 24/63 (38%)
Query: 109 KPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWECHGALR 168
+P+ V + CNHC C C YC C+R W H+ EC +
Sbjct: 54 EPLVVVPFGVAAAQTCNHCLDPRRPAKACTGCRAVAYCGSRCQRAHWTAVHKLECRVLKK 113
Query: 169 LLE 171
LE
Sbjct: 114 ALE 116
>gi|195584110|ref|XP_002081858.1| GD11242 [Drosophila simulans]
gi|194193867|gb|EDX07443.1| GD11242 [Drosophila simulans]
Length = 498
Score = 37.0 bits (84), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 66/147 (44%), Gaps = 20/147 (13%)
Query: 78 SNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
+ K +++ ++ GR V+A+ ++ G+TL +E P+ V+LP C C + C
Sbjct: 9 ARKCEIKHNDMLGRFVVALCNLRAGETLLLENPI--VVLPQMGDRRCTKCFNLTES--FC 64
Query: 138 NECILAVYCSE-------SCRREAWLRYHRWECHGALRLLEAVGIAHL--ALKLILVSSH 188
+C L C + C+R A + + LL+ + LK +L+ H
Sbjct: 65 RKCRLLALCEDCSDHDERDCKRLAEMNLS----DDQVELLQKKEHTEIQPVLKCLLLREH 120
Query: 189 SDR---YKEVYHLETHLQDMRPEDLYQ 212
+ Y+E+ +E+ L R D+++
Sbjct: 121 EETLPLYEEMSQMESQLMTRRGTDIWK 147
>gi|342321104|gb|EGU13041.1| Hypothetical Protein RTG_00754 [Rhodotorula glutinis ATCC 204091]
Length = 1150
Score = 37.0 bits (84), Expect = 5.8, Method: Composition-based stats.
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWECH 164
C C A YCS+ C+ EAW + HR CH
Sbjct: 927 CRRCRRAKYCSKMCQSEAWNQGHRHWCH 954
>gi|326431298|gb|EGD76868.1| hypothetical protein PTSG_08216 [Salpingoeca sp. ATCC 50818]
Length = 564
Score = 37.0 bits (84), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 39/150 (26%), Positives = 58/150 (38%), Gaps = 38/150 (25%)
Query: 48 QVSSTSSSASSSVASLPS--LACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTL 105
Q +S A S+ S PS L C E ++ + TS R ++A +D GD +
Sbjct: 23 QATSRHVEAHSTAHSTPSSTLGCTEQRRVVPI-----VYTSTMTERRLVADKDYIVGDVV 77
Query: 106 FVEKPVA---FVILPPCSMSNCNHCCTSISA---------------PIP----------- 136
+ P F C C+HC S+ + P P
Sbjct: 78 LKDTPFVCAQFSWNRKCGYVACSHCLRSLESAEAMATRLAGQDVKLPHPELCPNNGRERV 137
Query: 137 -CNECILAVYCSESCRREAWLRYHRWECHG 165
C +C VYC+E+CR +AW +H+ C G
Sbjct: 138 TCTQC-EEVYCNEACRDKAWQLHHKVLCPG 166
>gi|71420095|ref|XP_811365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876023|gb|EAN89514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 429
Score = 37.0 bits (84), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 9/76 (11%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
GR + A++ ++ + +++P ++ C HC +I + + C C
Sbjct: 163 GRGIYALDRINSSTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219
Query: 144 VYCSESCRREAWLRYH 159
YCS +CR AW YH
Sbjct: 220 TYCSVACRDAAWREYH 235
>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 518
Score = 37.0 bits (84), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 38/99 (38%), Gaps = 18/99 (18%)
Query: 82 KMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-----------------C 124
K+ KG+ V+AV + G + P FV L M+ C
Sbjct: 10 KVDYKENKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDAENEETNYRTVNIC 69
Query: 125 NHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+C S + C C VYCS+ C AW + HR EC
Sbjct: 70 FYCFEKFSKSVYCPNCKYVVYCSDVCLDLAW-KLHREEC 107
>gi|296221259|ref|XP_002756485.1| PREDICTED: tudor domain-containing protein 1 [Callithrix jacchus]
Length = 1205
Score = 37.0 bits (84), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 34/147 (23%), Positives = 60/147 (40%), Gaps = 11/147 (7%)
Query: 18 RLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTSSSASSSV----ASLPSLACGESHT 73
RL+ + + + K + L E++++ SS SS + V G S +
Sbjct: 80 RLKSSENGNKKNNFLLCEQTKQYLASQEDNSASSNPSSINGEVVGSQGDRKKSPAGNSES 139
Query: 74 LMCASNK--IKMQTSNAKGRHVIAVEDVHK--GDTLFVEKPVAFVIL-PPCSMSNCNHCC 128
N ++ T H + +HK G++L + P F L PP + C+ C
Sbjct: 140 SPSVENNSPKEVNTKPENNVHPAKSKKLHKLVGNSLSINNPGLFTSLRPPLRSTTCHRC- 198
Query: 129 TSISAPIPCNECILAVYCSESCRREAW 155
+ + C++C YCS +C+R W
Sbjct: 199 -GLFGSLRCSQCKQTYYCSTACQRRDW 224
>gi|315045996|ref|XP_003172373.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
gi|311342759|gb|EFR01962.1| hypothetical protein MGYG_04963 [Arthroderma gypseum CBS 118893]
Length = 1210
Score = 36.6 bits (83), Expect = 6.2, Method: Composition-based stats.
Identities = 16/42 (38%), Positives = 23/42 (54%)
Query: 122 SNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
SNC S + + C+ C +A YCS+ C+RE W H+ C
Sbjct: 1162 SNCGKKPGSGTKLLRCSGCKIAEYCSKECQREDWKVTHKSVC 1203
>gi|356556906|ref|XP_003546761.1| PREDICTED: ubiquitin carboxyl-terminal hydrolase 15-like [Glycine
max]
Length = 989
Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 28/53 (52%), Gaps = 3/53 (5%)
Query: 115 ILPPCSMS-NCNHCCTSISAP--IPCNECILAVYCSESCRREAWLRYHRWECH 164
++PP S S N +H C SAP C+ C + YCS +C+ W H+ EC
Sbjct: 62 VIPPVSASKNEHHVCARCSAPARTRCSRCKIVRYCSGNCQIIHWRLNHKQECQ 114
>gi|384497176|gb|EIE87667.1| hypothetical protein RO3G_12378 [Rhizopus delemar RA 99-880]
Length = 288
Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 23/48 (47%)
Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
L C+ +C T C C A YCS+SC+ +AW HRW C
Sbjct: 172 LRRCANMSCQRWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWC 219
>gi|351737484|gb|AEQ60519.1| hypothetical protein [Acanthamoeba castellanii mamavirus]
Length = 374
Score = 36.6 bits (83), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C +C +I P+ CN+C YCSE C+ E + YH +C
Sbjct: 328 CFYCNKNIEKPVVCNKCFRIKYCSEKCQSE-YNSYHSDDC 366
>gi|311977714|ref|YP_003986834.1| putative zinc finger MYND domain-containing protein [Acanthamoeba
polyphaga mimivirus]
gi|82000266|sp|Q5UQS2.1|YR331_MIMIV RecName: Full=Putative zinc finger MYND domain-containing protein
R331
gi|55416950|gb|AAV50600.1| unknown [Acanthamoeba polyphaga mimivirus]
gi|308204761|gb|ADO18562.1| putative zinc finger MYND domain-containing protein [Acanthamoeba
polyphaga mimivirus]
gi|339061264|gb|AEJ34568.1| hypothetical protein MIMI_R331 [Acanthamoeba polyphaga mimivirus]
Length = 374
Score = 36.6 bits (83), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 124 CNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
C +C +I P+ CN+C YCSE C+ E + YH +C
Sbjct: 328 CFYCNKNIEKPVVCNKCFRIKYCSEKCQSE-YNSYHSDDC 366
>gi|328700944|ref|XP_003241434.1| PREDICTED: hypothetical protein LOC100575277 [Acyrthosiphon pisum]
Length = 1525
Score = 36.6 bits (83), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 128 CTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
CT+ S+ CN C+ A YCS C++ W YH+ EC
Sbjct: 1488 CTNFSS-FSCNTCLAAFYCSIQCQKSDW-HYHQNEC 1521
>gi|449542915|gb|EMD33892.1| hypothetical protein CERSUDRAFT_86668 [Ceriporiopsis subvermispora
B]
Length = 245
Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 30/117 (25%), Positives = 50/117 (42%), Gaps = 24/117 (20%)
Query: 121 MSNCNHCCTSISAP---IPCNECILAVYCSESCRREAWLRYHRWEC-----------HGA 166
+S C HC +S C C + YCS C++ AW H+ +C H
Sbjct: 25 LSQCGHCYSSRGDGTTLFKCGACQVDTYCSRQCQKAAWA-VHKEKCAINRCAQEAGVHAQ 83
Query: 167 LRLLEAV------GIAHLALKLILVS---SHSDRYKEVYHLETHLQDMRPEDLYQYV 214
L+ L IA A++ + ++ + ++R+ + HL+T R + LYQ V
Sbjct: 84 LKNLRGYTDKHRPTIAEAAIRSLGLAQDVTRAERWCLIVHLQTRPNAQRTDRLYQCV 140
>gi|440463414|gb|ELQ32994.1| hypothetical protein OOU_Y34scaffold01005g18 [Magnaporthe oryzae
Y34]
gi|440481340|gb|ELQ61939.1| hypothetical protein OOW_P131scaffold01138g58 [Magnaporthe oryzae
P131]
Length = 374
Score = 36.6 bits (83), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 21/33 (63%), Gaps = 1/33 (3%)
Query: 124 CNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
CN CT++ +AP+ C C A YCS+ C+R W
Sbjct: 29 CNALCTNMHTAPLRCTGCGAATYCSKRCQRADW 61
>gi|431912257|gb|ELK14394.1| Ankyrin repeat and MYND domain-containing protein 1 [Pteropus
alecto]
Length = 1079
Score = 36.6 bits (83), Expect = 6.8, Method: Composition-based stats.
Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWEC 163
C C S+ + PC C + CS++C+ +AW +H+ +C
Sbjct: 934 CYQCGRSVGVRLAPCARCYGMLACSKNCKTKAWADFHKRDC 974
>gi|407859371|gb|EKG06994.1| hypothetical protein TCSYLVIO_001875 [Trypanosoma cruzi]
Length = 429
Score = 36.6 bits (83), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 9/76 (11%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSI------SAPIPCNECILA 143
GR + A++ ++ + +++P ++ C HC +I + + C C
Sbjct: 163 GRGIYALDRINSSTPVMLDQPF---LVQRMRDDACAHCLATIGRSGASAGGVRCAHCDRE 219
Query: 144 VYCSESCRREAWLRYH 159
YCS +CR AW YH
Sbjct: 220 TYCSVACRDAAWREYH 235
>gi|324513338|gb|ADY45484.1| SET and MYND domain-containing protein 3 [Ascaris suum]
Length = 477
Score = 36.6 bits (83), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 7/64 (10%)
Query: 135 IPCNECILAVYCSESCRREAWLRYHRWECH------GALRLLEAVGIAHLALKLILVSSH 188
+ C C A YC + C+R AW ++HR EC L L E + ++ + +++ + ++
Sbjct: 50 LKCASCEFARYCDKECQRIAW-KHHRNECRRLKAVFPNLPLTEVLFMSRIVDRVLFLEAN 108
Query: 189 SDRY 192
D+Y
Sbjct: 109 GDKY 112
>gi|66825697|ref|XP_646203.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
gi|60474828|gb|EAL72765.1| hypothetical protein DDB_G0270830 [Dictyostelium discoideum AX4]
Length = 696
Score = 36.6 bits (83), Expect = 7.1, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 33/78 (42%), Gaps = 4/78 (5%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPI----PCNECILAVY 145
GR V+A ED+ KG + P A V+ NC C I+ I C C +
Sbjct: 222 GRCVLASEDLPKGTMVLRVSPFASVLEDHKIEKNCGFCFKKINKSIRINQTCKNCKNHLL 281
Query: 146 CSESCRREAWLRYHRWEC 163
C + E L YH+ EC
Sbjct: 282 CPQCSVDEYSLNYHKDEC 299
>gi|154336305|ref|XP_001564388.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061423|emb|CAM38448.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 700
Score = 36.6 bits (83), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 68/146 (46%), Gaps = 22/146 (15%)
Query: 32 LTIEKSEENSVEAMSMQVSSTSSSASSSVASLPSLACGESHTLMCAS---------NKIK 82
L +E+ + +EA S Q SS ++++ ++A + GE+ M A+ +++
Sbjct: 172 LGLEELGQKDIEA-SKQESSDAANSPEAMAKV----FGETAGGMAAACLPPRARVNGEVR 226
Query: 83 MQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECI- 141
++ S AKGR ++AV + +G L E P A V +S C++C + +E
Sbjct: 227 IERS-AKGRGLVAVSQLTEGSVL-EETPYAIVARTETLLSVCSYCLQRSTCLYHGDEYRQ 284
Query: 142 -----LAVYCSESCRREAWLRYHRWE 162
+CS SC + AW Y + E
Sbjct: 285 HHVKSRGFFCSPSCAKAAWEHYGQQE 310
>gi|354474200|ref|XP_003499319.1| PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like
[Cricetulus griseus]
Length = 1012
Score = 36.6 bits (83), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 1/48 (2%)
Query: 124 CNHCCTSISAPI-PCNECILAVYCSESCRREAWLRYHRWECHGALRLL 170
C C SI + PC C + CS+ C+ +AW +H+ +C+ + L+
Sbjct: 948 CYQCGRSIGVRLTPCPRCYGILTCSKYCKTKAWTEFHKKDCNDIMILM 995
>gi|168002744|ref|XP_001754073.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694627|gb|EDQ80974.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 671
Score = 36.6 bits (83), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 13/91 (14%)
Query: 89 KGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA-PIPCNECILAVYC- 146
K R + A +D+ G + E+P+A +IL ++C+ C + P+ C C + +YC
Sbjct: 134 KQRGMHATKDLEAGSLVLEEEPMAALILKGHRNTHCHFCFEMLPPDPVVCFTCAIPLYCD 193
Query: 147 -----------SESCRREAWLRYHRWECHGA 166
SE + W H EC GA
Sbjct: 194 VPCMGAACDESSEELDGQKWKGEHMHECGGA 224
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 36.6 bits (83), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 5/80 (6%)
Query: 86 SNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN--CNHCCTSISAPIPCNECILA 143
S+ KGR + + + GD + P +++P S N C+HC C+ C
Sbjct: 11 SHRKGRGIFSTKSFAPGDVILPFTPT--ILIPSLSHINTICSHCFKQAEVR-ACSRCHAV 67
Query: 144 VYCSESCRREAWLRYHRWEC 163
YC +C+ W H EC
Sbjct: 68 SYCDAACQAANWTAVHSKEC 87
>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 508
Score = 36.6 bits (83), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 23/104 (22%), Positives = 45/104 (43%), Gaps = 9/104 (8%)
Query: 68 CGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF-VEKPVAFVILPPCSMSNCNH 126
CG + ++C + + + + G + A + G +F + A V+ P C++
Sbjct: 2 CG-TERIVCIDPQAEPKQTVDMGLGLFATSKIRPGANVFAITANFATVLDTPRLKDTCSN 60
Query: 127 CCTSISAPI-------PCNECILAVYCSESCRREAWLRYHRWEC 163
C ++ + C C + YC+E C+ E+W H+ EC
Sbjct: 61 CFVTVGDEVNPGLTLKACTGCRVVKYCNERCQTESWAASHKKEC 104
>gi|268570288|ref|XP_002648464.1| C. briggsae CBR-SET-18 protein [Caenorhabditis briggsae]
Length = 369
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 31/131 (23%), Positives = 57/131 (43%), Gaps = 22/131 (16%)
Query: 96 VEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP------CNECILAVYCSES 149
+ +H +F E P A +L P + C +C + P P C C ++YCS+
Sbjct: 24 ISSIHARIKIF-ETPFATQVLNPKANEFCANC---LRGPAPGEKLLRCGGCNFSMYCSKD 79
Query: 150 CRREAWLRYHRWEC------HGALRLLEAVGIAHLALKLILVSSHSDRY-----KEVYHL 198
C+ AWL H+ EC L L E + ++ + ++ + + D+ + L
Sbjct: 80 CQAIAWL-VHKPECKRLKASFPNLPLTEVLFLSKVIDRIQFLEKNGDKLGVEAERTFSSL 138
Query: 199 ETHLQDMRPED 209
H +D+R ++
Sbjct: 139 VDHKKDIREDE 149
>gi|261329776|emb|CBH12758.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 429
Score = 36.6 bits (83), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA--------PIPCNECI 141
GR + A+ V G + ++P FV+ S + C HC +SI++ + C C
Sbjct: 161 GRGIYALTRVPSGTAVLADQP--FVVQRMNS-TTCAHCLSSITSASSTSSAAGVVCPHCG 217
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRL 169
YCS SCR AW YH CH ++
Sbjct: 218 QESYCSISCRDAAWREYHSCCCHATNKM 245
>gi|255072833|ref|XP_002500091.1| predicted protein [Micromonas sp. RCC299]
gi|226515353|gb|ACO61349.1| predicted protein [Micromonas sp. RCC299]
Length = 465
Score = 36.2 bits (82), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 12/74 (16%)
Query: 94 IAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIP-------CNECILAVYC 146
+ V+D+H F + P + PP +N C + A IP C+ C +A YC
Sbjct: 167 LLVDDIH---VTFRQLPTDVALTPPTPTANI--LCVACRASIPIDTRKKFCSGCNIAAYC 221
Query: 147 SESCRREAWLRYHR 160
+C++ W R+ +
Sbjct: 222 CRACQKADWKRHKK 235
>gi|384497433|gb|EIE87924.1| hypothetical protein RO3G_12635 [Rhizopus delemar RA 99-880]
Length = 382
Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 23/48 (47%)
Query: 116 LPPCSMSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
L C+ +C T C C A YCS+SC+ +AW HRW C
Sbjct: 245 LRRCANLSCRKWETQPREFAKCRRCRKAKYCSKSCQSKAWADGHRWWC 292
>gi|299749011|ref|XP_001838436.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
gi|298408241|gb|EAU83370.2| hypothetical protein CC1G_09064 [Coprinopsis cinerea okayama7#130]
Length = 619
Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWECHGALR 168
C C VYCS +C+ E W R H+ EC G R
Sbjct: 439 CGRCHSVVYCSSACQHEDWERRHQRECKGMAR 470
>gi|389626527|ref|XP_003710917.1| hypothetical protein MGG_04563 [Magnaporthe oryzae 70-15]
gi|351650446|gb|EHA58305.1| hypothetical protein MGG_04563 [Magnaporthe oryzae 70-15]
Length = 374
Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 124 CNHCCTSI-SAPIPCNECILAVYCSESCRREAW 155
CN CT + +AP+ C C A YCS+ C+R W
Sbjct: 29 CNALCTKMHTAPLRCTGCGAATYCSKRCQRADW 61
>gi|72391782|ref|XP_846185.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62358390|gb|AAX78854.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70802721|gb|AAZ12626.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 429
Score = 36.2 bits (82), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISA--------PIPCNECI 141
GR + A+ V G + ++P FV+ S + C HC +SI++ + C C
Sbjct: 161 GRGIYALTRVPSGTAVLADQP--FVVQRMNS-TTCAHCLSSITSASSTSSAAGVVCPHCG 217
Query: 142 LAVYCSESCRREAWLRYHRWECHGALRL 169
YCS SCR AW YH CH ++
Sbjct: 218 QESYCSISCRDAAWREYHSCCCHATNKM 245
>gi|91088053|ref|XP_966915.1| PREDICTED: similar to CG9642 CG9642-PA [Tribolium castaneum]
gi|270011878|gb|EFA08326.1| hypothetical protein TcasGA2_TC005968 [Tribolium castaneum]
Length = 492
Score = 36.2 bits (82), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-CNHCCTSISAPIPCNECILAVYCSE 148
GR ++A +++ +G + E PV ++ P C +CC + C +C +AV C+
Sbjct: 17 GRFIVASKNIKQGQLILKENPV--LLCPQVGGPIICFNCCAQLKKLFFCPDCRIAVLCNP 74
Query: 149 SCRRE 153
+C+ +
Sbjct: 75 NCKNQ 79
>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
vitripennis]
Length = 386
Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 39/110 (35%), Gaps = 32/110 (29%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVA---FVILPPCSMSNCNHCCTSISA-------- 133
T KG+ + AV +GD + E+P+ F C++C +
Sbjct: 12 TDEEKGKGLFAVSSYKEGDIILEERPLVCCQFAWNADYKYLACDYCMKPLETAEENARRL 71
Query: 134 --------PIP------------CNECILAVYCSESCRREAWLRYHRWEC 163
P P C C A YCS C+ EAW RYH+ C
Sbjct: 72 SGKSDLILPFPECCETKKDSISECESC-GAKYCSTECQNEAWQRYHQILC 120
>gi|150866965|ref|XP_001386745.2| hypothetical protein PICST_63965 [Scheffersomyces stipitis CBS
6054]
gi|172044079|sp|A3M0J3.2|SET5_PICST RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|149388221|gb|ABN68716.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 478
Score = 36.2 bits (82), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 18/88 (20%)
Query: 80 KIKMQTSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSN-------CNHC----- 127
KI++ ++ +G+ + A + KG+ ++ E P+ F+ PP + N C+HC
Sbjct: 110 KIQIVMTSKRGKGLYAKHKIAKGELIWEETPLFFI--PPLANINLVKTGKACSHCGKLLQ 167
Query: 128 ---CTSISAPIPCNECILAVYCSESCRR 152
+S+ + CN C V+CS+ C++
Sbjct: 168 TTTGSSMLKGLDCNVC-PEVWCSQQCKK 194
>gi|402073679|gb|EJT69231.1| hypothetical protein GGTG_12851 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 541
Score = 36.2 bits (82), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 4/127 (3%)
Query: 81 IKMQTSNA--KGRHVIAVEDVHKGDTLFV-EKPVAFVILPPCSMSNCNHCCTSISAPIPC 137
IKM + GR + A D G + V + P+ S + C+HC + C
Sbjct: 14 IKMHGTKGFSPGRCLRAARDYKPGQLMAVFDDPLVAFPDAAGSKTTCHHCLVHNAKVFGC 73
Query: 138 NECILAV-YCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVY 196
C AV YCS+ C++ W H EC ++ AV L + + R EV+
Sbjct: 74 TGCDKAVSYCSDECQKVNWKLIHSKECKVFRKVQAAVSKNWLPTPVRALVQILLRLSEVH 133
Query: 197 HLETHLQ 203
+ T L+
Sbjct: 134 EVVTGLE 140
>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
Length = 387
Score = 36.2 bits (82), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 29/118 (24%), Positives = 47/118 (39%), Gaps = 32/118 (27%)
Query: 81 IKMQTSNAKGRHVIAVEDVHKGDTLFVEKPV---------AFVILP------PCSMSNCN 125
++++T+ KGR + A + + G LF E P+ A+ L P + CN
Sbjct: 4 VEVKTTAEKGRGLFARQPIEAGTVLFEESPLVSSQFCWNAAYGYLACEYCLRPLETAECN 63
Query: 126 --HCCTSISAPIPCNECI---------------LAVYCSESCRREAWLRYHRWECHGA 166
+ + +P EC +YCS+ C +A RYH+ C GA
Sbjct: 64 VRRLASDFAIDLPYPECCSVQEQLERHTRCADCGTLYCSQGCLNDALERYHKGLCLGA 121
>gi|299739523|ref|XP_001839567.2| hypothetical protein CC1G_12030 [Coprinopsis cinerea okayama7#130]
gi|298403816|gb|EAU82242.2| hypothetical protein CC1G_12030 [Coprinopsis cinerea okayama7#130]
Length = 628
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 27/47 (57%), Gaps = 3/47 (6%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC---HGALRLLEAVGIAHLAL 180
C+ C +YCS+ C++E W HR+EC + R+ EA G+ + L
Sbjct: 428 CSGCQTVLYCSQQCQKEDWESRHRYECMVMRHSYRVNEARGVKYSPL 474
>gi|299744221|ref|XP_001840954.2| hypothetical protein CC1G_03183 [Coprinopsis cinerea okayama7#130]
gi|298406021|gb|EAU81007.2| hypothetical protein CC1G_03183 [Coprinopsis cinerea okayama7#130]
Length = 590
Score = 36.2 bits (82), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 5/54 (9%)
Query: 129 TSISAPIPCNECILAVYCSESCRREAWLRYHRWEC----HG-ALRLLEAVGIAH 177
T + P C+ C VYCSE C++ W H+ EC HG +R VG +H
Sbjct: 375 TELVLPKACSGCHTVVYCSEKCQKADWESRHKDECNVMKHGHMIRKQNGVGYSH 428
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 36.2 bits (82), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 33/79 (41%), Gaps = 1/79 (1%)
Query: 85 TSNAKGRHVIAVEDVHKGDTLFVEKPVAFVILPPCSMSNCNHCCTSISAPIPCNECILAV 144
S KGR + A+ G+ LF V+ S+C+ C C C A
Sbjct: 16 ASPGKGRGLRALRRYAVGELLFSCPAYTAVLTVSERGSHCDGCFARKEGLSKCGRCKQAF 75
Query: 145 YCSESCRREAWLRYHRWEC 163
YC+ C++E W H+ EC
Sbjct: 76 YCNVECQKEDW-PMHKLEC 93
>gi|303282879|ref|XP_003060731.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458202|gb|EEH55500.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 375
Score = 36.2 bits (82), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 4/56 (7%)
Query: 121 MSNCNHCCTSISAPIP----CNECILAVYCSESCRREAWLRYHRWECHGALRLLEA 172
+SN C + +A + C C L YC ++C++E W + H+ EC A E
Sbjct: 255 VSNACFSCGARAAGVERYSQCGGCHLVAYCGKACQKEDWKKRHKAECRPAKSFREG 310
>gi|344282241|ref|XP_003412882.1| PREDICTED: DNA polymerase theta [Loxodonta africana]
Length = 2567
Score = 36.2 bits (82), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 54/109 (49%), Gaps = 15/109 (13%)
Query: 13 SLKLCRLQDTSHLDFKEDVLTIEKSEENSVEAMSMQVSSTS----------SSASSSVAS 62
SL LC +D+ +LD + + + + + EN+ + + S+T+ SSASSS+
Sbjct: 1285 SLALCDFEDSLYLDTQSEKIIQQMATENAKQEGAKDSSTTAGMRQTSVDGRSSASSSLKE 1344
Query: 63 LPSLACGESHTL-MCASNKIKMQTSNAKGRHVIAVEDVHKGDTLFVEKP 110
+P GE H++ +N ++++TS+ + D H D L + P
Sbjct: 1345 IPVTLPGERHSVGTTGTNHLELKTSDTPKQR----SDSHAIDILTSQSP 1389
>gi|308485700|ref|XP_003105048.1| CRE-SET-10 protein [Caenorhabditis remanei]
gi|308256993|gb|EFP00946.1| CRE-SET-10 protein [Caenorhabditis remanei]
Length = 436
Score = 36.2 bits (82), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 5/65 (7%)
Query: 104 TLFVEKPVAFVILPPCSMSNCNHCCTSISAP-----IPCNECILAVYCSESCRREAWLRY 158
TL E P+A V+ P C C + + C++C YCS C+R+ W
Sbjct: 3 TLARENPLAAVLCPQFQDFYCATCFAELDVNGEAEILMCDDCSEVSYCSLKCQRKDWRSV 62
Query: 159 HRWEC 163
H++EC
Sbjct: 63 HQFEC 67
>gi|395327793|gb|EJF60190.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 442
Score = 36.2 bits (82), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 27/110 (24%), Positives = 49/110 (44%), Gaps = 13/110 (11%)
Query: 47 MQVSSTSSSASSSVASLPSLACGESHTLMCASNKIKMQTSNAKGRHVIAVEDVHKGDTLF 106
+Q S S +AS V L G T ++K AKG+ ++A E + +G+ ++
Sbjct: 61 VQTLSASGAASGQVHPTSKLIEGLDVTKWTTKAEVKY-FGKAKGKGLVAKEAIAEGEVIW 119
Query: 107 VEKPVAFVILPPCSMSN-------CNHCCTSISAP---IPCNECILAVYC 146
E P F++ P ++ + C HC T ++ +PC+ + C
Sbjct: 120 KEDP--FILAPEWNLYDLQMSSHACGHCSTPLTTSPLIVPCSASTSSTPC 167
>gi|384494170|gb|EIE84661.1| hypothetical protein RO3G_09371 [Rhizopus delemar RA 99-880]
Length = 432
Score = 36.2 bits (82), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 137 CNECILAVYCSESCRREAWLRYHRWEC 163
C C A YCS++C+ +AW HRW C
Sbjct: 276 CRRCRKAKYCSKACQSKAWADGHRWWC 302
>gi|330790441|ref|XP_003283305.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
gi|325086730|gb|EGC40115.1| hypothetical protein DICPUDRAFT_52183 [Dictyostelium purpureum]
Length = 754
Score = 36.2 bits (82), Expect = 9.8, Method: Composition-based stats.
Identities = 13/43 (30%), Positives = 23/43 (53%), Gaps = 1/43 (2%)
Query: 121 MSNCNHCCTSISAPIPCNECILAVYCSESCRREAWLRYHRWEC 163
+ C C ++ P+ C C YCS+ C+++ WL H+ +C
Sbjct: 668 LYKCWSCKLTLIKPLICGYCKTVAYCSKECQKDHWL-VHKEQC 709
>gi|157124067|ref|XP_001660315.1| hypothetical protein AaeL_AAEL009757 [Aedes aegypti]
gi|108874116|gb|EAT38341.1| AAEL009757-PA [Aedes aegypti]
Length = 518
Score = 36.2 bits (82), Expect = 10.0, Method: Compositional matrix adjust.
Identities = 33/131 (25%), Positives = 58/131 (44%), Gaps = 20/131 (15%)
Query: 90 GRHVIAVEDVHKGDTLFVEKPVAFVILP---------PCSMSNCNHCCTSISAPIPCNEC 140
GR V+A D+ +G+ + +E+P +V P C +C+ C AP+ C +C
Sbjct: 17 GRFVVAKRDLTRGEQILLEEP--YVTGPYWDADVSCLNCFRDSCSTCRQCGVAPL-CYDC 73
Query: 141 ILAVYCSESCRREAWLRYHRWECHGALRLLEAVGIAHLALKLILVSSHSDRYKEVYHLET 200
+ S RE+ L H H + L+L S+ +RY EV +E+
Sbjct: 74 KIHDEAECSFYRESSLDRHFLYNH--------FNVVMPVRCLMLYRSNRERYDEVMTMES 125
Query: 201 HLQDMRPEDLY 211
L++ R +++
Sbjct: 126 RLEERRGTEIW 136
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.128 0.384
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,983,881,819
Number of Sequences: 23463169
Number of extensions: 104969295
Number of successful extensions: 434407
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 559
Number of HSP's successfully gapped in prelim test: 637
Number of HSP's that attempted gapping in prelim test: 433090
Number of HSP's gapped (non-prelim): 1373
length of query: 214
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 78
effective length of database: 9,168,204,383
effective search space: 715119941874
effective search space used: 715119941874
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)