BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018070
(361 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297739785|emb|CBI29967.3| unnamed protein product [Vitis vinifera]
Length = 660
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 202/297 (68%), Positives = 244/297 (82%), Gaps = 1/297 (0%)
Query: 65 TATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCY 124
T N K + P PPPI+V++TE AGRGVFATRRI + DLIHTAKP+++HP+LS+++SVCY
Sbjct: 365 TPPENEKLASPGPPPIRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHSVCY 424
Query: 125 FCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVK 184
FCLRK+ + + N RFC + C++ +K F VER+ADWSA++DYCR++GLKYPLLVK
Sbjct: 425 FCLRKLKPVTSS-EDCNVRFCSQECEEQSKVFVAVERKADWSAYDDYCRTRGLKYPLLVK 483
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
RLACM++SG SADC+DILQPASLS E+I M EGF +L+SAF KA DE M FL +QW
Sbjct: 484 RLACMVVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQW 543
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
Y NVLA+ RIN+FRIELAGG YEDL S AAAS+E+E AVGNA+YMLPSFYNHDCDPN HI
Sbjct: 544 YINVLARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNVHI 603
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+WIDN +ARL ALR++E GEELRICYIDASM DARQ IL QGFGF+C+CLRCSSGD
Sbjct: 604 IWIDNVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRCSSGD 660
>gi|225441557|ref|XP_002276611.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Vitis
vinifera]
Length = 327
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 202/297 (68%), Positives = 244/297 (82%), Gaps = 1/297 (0%)
Query: 65 TATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCY 124
T N K + P PPPI+V++TE AGRGVFATRRI + DLIHTAKP+++HP+LS+++SVCY
Sbjct: 32 TPPENEKLASPGPPPIRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHSVCY 91
Query: 125 FCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVK 184
FCLRK+ + + N RFC + C++ +K F VER+ADWSA++DYCR++GLKYPLLVK
Sbjct: 92 FCLRKLKPVTSS-EDCNVRFCSQECEEQSKVFVAVERKADWSAYDDYCRTRGLKYPLLVK 150
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
RLACM++SG SADC+DILQPASLS E+I M EGF +L+SAF KA DE M FL +QW
Sbjct: 151 RLACMVVSGVASADCLDILQPASLSSEMISEMGEGFSLLQSAFMKAKARDECMAFLTEQW 210
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
Y NVLA+ RIN+FRIELAGG YEDL S AAAS+E+E AVGNA+YMLPSFYNHDCDPN HI
Sbjct: 211 YINVLARFRINSFRIELAGGSYEDLHSLAAASVETEAAVGNAVYMLPSFYNHDCDPNVHI 270
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+WIDN +ARL ALR++E GEELRICYIDASM DARQ IL QGFGF+C+CLRCSSGD
Sbjct: 271 IWIDNVNARLKALREIEAGEELRICYIDASMDHDARQTILFQGFGFRCSCLRCSSGD 327
>gi|255588343|ref|XP_002534573.1| protein with unknown function [Ricinus communis]
gi|223524997|gb|EEF27811.1| protein with unknown function [Ricinus communis]
Length = 319
Score = 426 bits (1095), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/319 (64%), Positives = 256/319 (80%), Gaps = 5/319 (1%)
Query: 43 RYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASD 102
RYSR SR ++ + + S+ N K + SPPPI+V +TESAGRGVF+TRRI +
Sbjct: 6 RYSR-WFSRFKNQNKHQILAFSSTAENEKQTLRSPPPIRVGVTESAGRGVFSTRRISGGE 64
Query: 103 LIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERR 162
LIH AKPI+++P+ S+ N+VCYFCL+K+ S+ ++ + FC + CK NAK FYDVE +
Sbjct: 65 LIHNAKPIVSYPSRSSTNTVCYFCLKKLAST----ENRSVAFCSQECKQNAKVFYDVETK 120
Query: 163 ADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVM 222
ADWS F+DYCR+QGLKYPL+VKRLACM+ISGA + +C+DILQPA+LSPE+IL MEEG+ +
Sbjct: 121 ADWSGFDDYCRTQGLKYPLMVKRLACMVISGAATVECLDILQPANLSPEMILEMEEGYDL 180
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
LRS F KA I D+++ FL +QWY N LA+IRINAFRIELA GLYEDLLSSAAA IE+E A
Sbjct: 181 LRSCFTKANIADDRLAFLTRQWYINQLARIRINAFRIELAVGLYEDLLSSAAACIEAEAA 240
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VGN++YMLPSF+NHDCDPNAHI+WI+NADARL ALRD++ EELRICYIDASM ARQ
Sbjct: 241 VGNSVYMLPSFFNHDCDPNAHIIWIENADARLKALRDIDPDEELRICYIDASMDHGARQT 300
Query: 343 ILTQGFGFQCNCLRCSSGD 361
IL QGFGF+CNCLRC SGD
Sbjct: 301 ILLQGFGFKCNCLRCLSGD 319
>gi|224088154|ref|XP_002308346.1| SET domain protein [Populus trichocarpa]
gi|222854322|gb|EEE91869.1| SET domain protein [Populus trichocarpa]
Length = 283
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 206/285 (72%), Positives = 242/285 (84%), Gaps = 2/285 (0%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PPPI+VALTESAGRGVF+TR+I A DLIHTAKPI+ HP+LST+N+VCYFCL+K+TS+
Sbjct: 1 PPPIRVALTESAGRGVFSTRKICAGDLIHTAKPILAHPSLSTINTVCYFCLKKLTST--E 58
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
F FC + CK+NAK FY+VE +ADW AF+DYCR++GLKYPLLVKRLACM+ISGA S
Sbjct: 59 FHGKGVAFCSQECKENAKVFYEVETKADWLAFDDYCRNKGLKYPLLVKRLACMVISGAAS 118
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
A+ +DILQP++LS E++L MEEG+ +L+S F A I DEQM FL K+WY +VLA+IRINA
Sbjct: 119 AESLDILQPSNLSHEMVLEMEEGYGLLKSGFAMANISDEQMAFLTKEWYNSVLARIRINA 178
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
FRIELA G YEDLLSSAAA IE+E AVGNA+YMLPS YNHDCDPNAHI+WIDNADA+L A
Sbjct: 179 FRIELAVGSYEDLLSSAAACIEAEAAVGNAVYMLPSLYNHDCDPNAHIVWIDNADAQLKA 238
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L DVEE EELRICYIDASM DARQ+ L QGFGF+CNCLRC SGD
Sbjct: 239 LCDVEEDEELRICYIDASMDHDARQSFLLQGFGFKCNCLRCLSGD 283
>gi|79507571|ref|NP_196280.2| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
gi|229488103|sp|Q9FG08.2|ATXR4_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR4; AltName:
Full=Protein SET DOMAIN GROUP 38; AltName:
Full=Trithorax-related protein 4; Short=TRX-related
protein 4; Flags: Precursor
gi|332003660|gb|AED91043.1| histone-lysine N-methyltransferase ATXR4 [Arabidopsis thaliana]
Length = 325
Score = 416 bits (1069), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/325 (62%), Positives = 248/325 (76%), Gaps = 3/325 (0%)
Query: 38 KMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRR 97
++ L RYSRC SRL++L S A++ Q PPPI+V LTESAGR VFATR+
Sbjct: 3 RLALNRYSRCF-SRLKTLTTPLFFSSSAASNRDGDYQIGPPPIRVGLTESAGRAVFATRK 61
Query: 98 IRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFY 157
I A DLIHTAKP++ P+L L+SVCY CL+K+ S++ F+ +C + C++N+K F
Sbjct: 62 IGAGDLIHTAKPVVACPSLLKLDSVCYLCLKKLMGSAK-FEDRGVSYCSQECQENSKGFL 120
Query: 158 DVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAME 217
DVE RADWS+F+DYCR+ KYPL+VKRL CMIISGA ADC+DILQPA LS E+I +E
Sbjct: 121 DVETRADWSSFDDYCRTHNFKYPLMVKRLCCMIISGARPADCLDILQPAVLSSEMISKIE 180
Query: 218 EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG-LYEDLLSSAAAS 276
+G+ +L +AF+KA D+ + FL KQWYT +LA+IRINAFRI+L GG EDLLS AAAS
Sbjct: 181 DGYGLLWNAFRKANFKDDDVAFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAAS 240
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
+E E AVG+A+YMLPSFYNHDCDPNAHI+W+ NADARL LRDVEEGEELRICYIDASM
Sbjct: 241 VEGEGAVGHAVYMLPSFYNHDCDPNAHIIWLHNADARLNTLRDVEEGEELRICYIDASMG 300
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
+ARQ IL+QGFGF CNCLRC S D
Sbjct: 301 YEARQTILSQGFGFLCNCLRCQSTD 325
>gi|297806681|ref|XP_002871224.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
gi|297317061|gb|EFH47483.1| hypothetical protein ARALYDRAFT_487474 [Arabidopsis lyrata subsp.
lyrata]
Length = 325
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 203/325 (62%), Positives = 248/325 (76%), Gaps = 3/325 (0%)
Query: 38 KMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRR 97
+ L RYSR L SRL+ L S+A++ Q PPPI+V LTESAGR VFATRR
Sbjct: 3 RFALNRYSRYL-SRLKPLKTPPFFSSSSASNRDGDYQTGPPPIRVGLTESAGRAVFATRR 61
Query: 98 IRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFY 157
I A DLIHTAKP++ P+L L+SVCY CL+K+ S++ F+ +C + C++N+K F
Sbjct: 62 IGAGDLIHTAKPVVACPSLLKLDSVCYLCLKKLMGSAK-FEDRGVSYCSQECQENSKGFL 120
Query: 158 DVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAME 217
DVERRADWS+F DYCR+ KYPL+VKRL CMIISG ADC+DILQPA+LS E+I +E
Sbjct: 121 DVERRADWSSFVDYCRTHNFKYPLMVKRLCCMIISGVLPADCLDILQPAALSSEMISKIE 180
Query: 218 EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG-LYEDLLSSAAAS 276
+G+ +L +AF+KA + D+ + FL KQWYT +LA+IRINAFRI+L GG EDLLS AAAS
Sbjct: 181 DGYGLLWNAFRKANLKDDDVSFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAAS 240
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
+E E AVG+A+YMLPSFYNHDCDPNAHI+W+ NADARL LRDVEEGEELRICYIDASM
Sbjct: 241 VEGEGAVGHAVYMLPSFYNHDCDPNAHIIWLQNADARLNTLRDVEEGEELRICYIDASMG 300
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
+ARQ IL+QGFGF CNCLRC S D
Sbjct: 301 YEARQTILSQGFGFLCNCLRCQSAD 325
>gi|115481972|ref|NP_001064579.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|78708616|gb|ABB47591.1| SET domain protein 123, putative, expressed [Oryza sativa Japonica
Group]
gi|113639188|dbj|BAF26493.1| Os10g0410700 [Oryza sativa Japonica Group]
gi|215767244|dbj|BAG99472.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767272|dbj|BAG99500.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612812|gb|EEE50944.1| hypothetical protein OsJ_31491 [Oryza sativa Japonica Group]
Length = 298
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/306 (59%), Positives = 229/306 (74%), Gaps = 11/306 (3%)
Query: 56 HLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPT 115
HL++R LC+ A PPI+VALTES+GRGVFATR I A +++H+A+P+++HP+
Sbjct: 4 HLRRRLLCTAAA----------PPIRVALTESSGRGVFATRPISAGEVLHSAQPLVSHPS 53
Query: 116 LSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQ 175
++ VCY CLR+ S S + FC + C+++AK F+ VE++ADWS F+D+C S+
Sbjct: 54 PPLIHEVCYSCLRR-KSGSGGGSSGSCYFCSDACREHAKGFHGVEKKADWSLFDDHCSSR 112
Query: 176 GLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
GLKYP + KRLACM+ISGA SADC+DILQPA L + MEE F +L S F+KAG +E
Sbjct: 113 GLKYPYMAKRLACMVISGAVSADCLDILQPARLHQGTLTEMEEEFALLDSTFRKAGFQEE 172
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
FL K+WY NVLA+IRINAFRIEL YE+LLSSA AS+ + AVGNA+YMLPSFYN
Sbjct: 173 ITTFLTKEWYINVLARIRINAFRIELVASSYENLLSSAVASVSCDAAVGNAVYMLPSFYN 232
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
HDCDPN HI+W+ +ADARL ALR++EEGEELRICYIDASM DARQ IL +GFGF+C C
Sbjct: 233 HDCDPNTHIVWLASADARLKALRNIEEGEELRICYIDASMDVDARQRILAEGFGFECRCQ 292
Query: 356 RCSSGD 361
RC SGD
Sbjct: 293 RCLSGD 298
>gi|326510051|dbj|BAJ87242.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 293
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 169/286 (59%), Positives = 218/286 (76%), Gaps = 1/286 (0%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
PPP++VALT+SAGRGVFATR I +++H+A+P++THP+ S + VCY CL++
Sbjct: 9 GPPPVRVALTDSAGRGVFATRPIATGEVLHSAQPLVTHPSRSLFHEVCYRCLKRKAGKGD 68
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
+ FC + C ++A+ F+ +E++ADWS +D+C S+GLKYP +VKRLACM+ISG
Sbjct: 69 D-SRGDCYFCSDACWEHAEGFHGIEKKADWSLLDDHCSSRGLKYPYMVKRLACMVISGDV 127
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRIN 255
ADC+DILQPA L + I+ MEE F +L+S F+KAG +E FL KQWY NVLA+IR+N
Sbjct: 128 GADCLDILQPAQLHQDTIIEMEEEFELLKSTFRKAGFQEELTTFLTKQWYINVLARIRVN 187
Query: 256 AFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM 315
AFRIEL YEDLLSSA AS+ + +VGNA+YMLPSFYNHDCDPN HI+W++NADA+L
Sbjct: 188 AFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCDPNTHIVWLENADAKLN 247
Query: 316 ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
LRD++EGEELRICYID SM +ARQ ILT+GFGFQC C RC SGD
Sbjct: 248 TLRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRCLSGD 293
>gi|356533089|ref|XP_003535101.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Glycine
max]
Length = 313
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/282 (64%), Positives = 224/282 (79%), Gaps = 4/282 (1%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
I+VALTES GR VFATR I + DLIHTA P + HP+ S+ + CY CL ++ H Q
Sbjct: 36 IRVALTESVGRAVFATRPIASGDLIHTATPTVCHPSSSSARAACYSCL----AALPHSQS 91
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
FC + C +K +YDVE +A+W AFNDYC ++GLKYP LVKRL CM+ISG +D
Sbjct: 92 QGIPFCSQRCHQRSKGYYDVEMKANWVAFNDYCWTRGLKYPFLVKRLVCMVISGDARSDT 151
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
+DILQPA+L+PE++L MEE F++LR+AF KA I DE + FL KQWY N+LA+IRINAFRI
Sbjct: 152 LDILQPANLTPEMVLKMEEEFLLLRNAFTKALIADEHIAFLTKQWYINILARIRINAFRI 211
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD 319
ELAGGLYEDLL+SA AS+E+E AVGNA+Y+LPSFYNHDCDPNAHI+WIDNADA+L ALRD
Sbjct: 212 ELAGGLYEDLLASAVASVEAEAAVGNAVYLLPSFYNHDCDPNAHIIWIDNADAKLKALRD 271
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ EGEELRICYIDAS+ R+ARQ +L++GFGFQCNC RC GD
Sbjct: 272 IVEGEELRICYIDASLDRNARQELLSRGFGFQCNCSRCLHGD 313
>gi|449437548|ref|XP_004136554.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 341
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/287 (63%), Positives = 229/287 (79%), Gaps = 2/287 (0%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS-SS 134
PPPI+V+LT+SAGRGVFATR+I A +LIHTAKP++ HP+LS+++ VC FCL+K+ ++
Sbjct: 56 GPPPIRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHHVCNFCLQKLQRYAN 115
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A FC E C+ ++K F+DVE ADWS +++YCR +G KYPLLVKRLACM+ISGA
Sbjct: 116 VDSDARRASFCSEECEQHSKVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGA 175
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
S+D +DILQP+ LS +++L +EEG+ +LR A A I DE+M FL ++WYT VLA+IRI
Sbjct: 176 MSSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRI 235
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAFRIELAGG YEDL S AAA +E+E AVGNA+YMLPSFYNHDCDPN HI+WI+NA+A+L
Sbjct: 236 NAFRIELAGG-YEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPNTHIIWINNANAKL 294
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
ALRDV+ EELRICYIDASM DARQ +L +GFGF C C RCS GD
Sbjct: 295 KALRDVDPDEELRICYIDASMDYDARQTLLHRGFGFICKCARCSYGD 341
>gi|162458873|ref|NP_001105199.1| SET domain protein 123 [Zea mays]
gi|27466907|gb|AAO12860.1| SET domain protein 123 [Zea mays]
gi|194705040|gb|ACF86604.1| unknown [Zea mays]
gi|195624178|gb|ACG33919.1| SET domain protein 123 [Zea mays]
gi|414871456|tpg|DAA50013.1| TPA: SET domain protein 123 [Zea mays]
Length = 303
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 172/285 (60%), Positives = 218/285 (76%), Gaps = 1/285 (0%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PPPI+V+LTESAGRGVFATR + + +L+H+A+P+++HP+ S L+ VCY CLR+ +
Sbjct: 20 PPPIRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHEVCYNCLRRKPGEGRA 79
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
+ FC + C+D+AK F+D+E+ DWS F+ +C S+ LKYP + KRLACM+ISGA +
Sbjct: 80 SSGGD-YFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
ADC++ILQPA L ++ M+E F +L S F+KAG +E FL WY NVLA+IRINA
Sbjct: 139 ADCLNILQPALLHQGTLIEMKEEFELLESTFRKAGFQEEITTFLTIDWYINVLARIRINA 198
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
FRIEL YEDLLSSAAAS+ + +VGNA+YMLPSFYNHDCDPNAHI+W+ NADA+L A
Sbjct: 199 FRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQNADAKLKA 258
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
LRD+EEGEEL ICYIDASM DARQ IL GFGF+C CLRC SGD
Sbjct: 259 LRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRCLSGD 303
>gi|449524736|ref|XP_004169377.1| PREDICTED: histone-lysine N-methyltransferase ATXR4-like [Cucumis
sativus]
Length = 346
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 181/286 (63%), Positives = 228/286 (79%), Gaps = 2/286 (0%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS-SS 134
PPPI+V+LT+SAGRGVFATR+I A +LIHTAKP++ HP+LS+++ VC FCL+K+ ++
Sbjct: 56 GPPPIRVSLTDSAGRGVFATRKIGAGELIHTAKPLVAHPSLSSIHHVCNFCLQKLQRYAN 115
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A FC E C+ ++K F+DVE ADWS +++YCR +G KYPLLVKRLACM+ISGA
Sbjct: 116 VDSDARRASFCSEECEQHSKVFHDVEMEADWSDYDNYCRERGFKYPLLVKRLACMVISGA 175
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
S+D +DILQP+ LS +++L +EEG+ +LR A A I DE+M FL ++WYT VLA+IRI
Sbjct: 176 MSSDHLDILQPSRLSTDMVLELEEGYSLLRKALINANITDERMLFLTQEWYTGVLARIRI 235
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAFRIELAGG YEDL S AAA +E+E AVGNA+YMLPSFYNHDCDPN HI+WI+NA+A+L
Sbjct: 236 NAFRIELAGG-YEDLHSLAAACVEAEAAVGNAVYMLPSFYNHDCDPNTHIIWINNANAKL 294
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
ALRDV+ EELRICYID SM DARQ +L +GFGF CNC RCS G
Sbjct: 295 KALRDVDPDEELRICYIDTSMDYDARQTLLHRGFGFICNCARCSYG 340
>gi|414871458|tpg|DAA50015.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 299
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 170/285 (59%), Positives = 217/285 (76%), Gaps = 5/285 (1%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PPPI+V+LTESAGRGVFATR + + +L+H+A+P+++HP+ S L+ VCY CLR+ +
Sbjct: 20 PPPIRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHEVCYNCLRRKPGEGRA 79
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
+ FC + C+D+AK F+D+E+ DWS F+ +C S+ LKYP + KRLACM+ISGA +
Sbjct: 80 SSGGD-YFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
ADC++ILQPA L ++ M+E F +L S F+KAG +E + WY NVLA+IRINA
Sbjct: 139 ADCLNILQPALLHQGTLIEMKEEFELLESTFRKAGFQEE----ITTYWYINVLARIRINA 194
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
FRIEL YEDLLSSAAAS+ + +VGNA+YMLPSFYNHDCDPNAHI+W+ NADA+L A
Sbjct: 195 FRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQNADAKLKA 254
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
LRD+EEGEEL ICYIDASM DARQ IL GFGF+C CLRC SGD
Sbjct: 255 LRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRCLSGD 299
>gi|294463315|gb|ADE77193.1| unknown [Picea sitchensis]
Length = 288
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 165/326 (50%), Positives = 204/326 (62%), Gaps = 47/326 (14%)
Query: 36 FWKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFAT 95
FW + L RYS SR Q S+++H+ P PPP++V TE AGRGVFAT
Sbjct: 10 FW-VWLPRYS---FSRYQC------DYASSSSHSEAP----PPPVRVGFTEDAGRGVFAT 55
Query: 96 RRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKA 155
R IR DLIHTA P +THP+L L K
Sbjct: 56 REIRTGDLIHTADPFLTHPSLEAL---------------------------------GKG 82
Query: 156 FYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILA 215
F ++E DWS ++++CR +GLKYPLLVKR ACM+ISG SAD +DILQPA S +
Sbjct: 83 FCEIETTMDWSLYHEHCRVKGLKYPLLVKRFACMVISGMASADNLDILQPAIFSHGMSST 142
Query: 216 MEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAA 275
EE F +L FK I + FL K+WY+ +LA+IR N FRIEL GG + DLL++A+A
Sbjct: 143 WEEEFELLVHNFKNTDIHPGLLTFLTKEWYSGILARIRANVFRIELIGGYHTDLLAAASA 202
Query: 276 SIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASM 335
SI E VG+A+YMLPS YNHDCDPNAHI+WI+NA ARL ALR++E GEELRICYIDASM
Sbjct: 203 SISVETTVGSAVYMLPSMYNHDCDPNAHILWIENAKARLKALRNIEPGEELRICYIDASM 262
Query: 336 ARDARQAILTQGFGFQCNCLRCSSGD 361
+ARQ++L QGFGF+C CLRC D
Sbjct: 263 DYEARQSLLYQGFGFRCQCLRCKYKD 288
>gi|414871457|tpg|DAA50014.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 272
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 124/222 (55%), Positives = 164/222 (73%), Gaps = 1/222 (0%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PPPI+V+LTESAGRGVFATR + + +L+H+A+P+++HP+ S L+ VCY CLR+ +
Sbjct: 20 PPPIRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHEVCYNCLRRKPGEGRA 79
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
FC + C+D+AK F+D+E+ DWS F+ +C S+ LKYP + KRLACM+ISGA +
Sbjct: 80 -SSGGDYFCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAAN 138
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
ADC++ILQPA L ++ M+E F +L S F+KAG +E FL WY NVLA+IRINA
Sbjct: 139 ADCLNILQPALLHQGTLIEMKEEFELLESTFRKAGFQEEITTFLTIDWYINVLARIRINA 198
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDC 298
FRIEL YEDLLSSAAAS+ + +VGNA+YMLPSFYNHDC
Sbjct: 199 FRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDC 240
>gi|168039813|ref|XP_001772391.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162676378|gb|EDQ62862.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 311
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 24/321 (7%)
Query: 51 RLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPI 110
R+ SL+ +R CST PI+V T AGRGV+A R I +LIHTA P+
Sbjct: 5 RIGSLYFLRR--CSTHVQG--------LPIRVGHTLDAGRGVYAIRNIAHGELIHTADPV 54
Query: 111 ITHPTLSTLNSVCYFCLRKITSSSQHFQ----HHNARFCGEVCKDNAKA-FYDVERRADW 165
+ HP++++L+ CYFCL+ + S + + + +C C+++A+ FY ER+ DW
Sbjct: 55 VAHPSIASLDKACYFCLKGLKRSVSYGRLPASEGSPTYCDRTCEESAQMLFYAAERQVDW 114
Query: 166 SAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRS 225
S F+DYC + G+++PL+ KRLACM+ S S +DIL LS + + ++ R
Sbjct: 115 SEFHDYCSTYGVRFPLIAKRLACMVASSRTSPKSLDILSHVDLSH----GIPDNWITERK 170
Query: 226 AFKKAGIDDE----QMKFLNKQWYTNVLAQIRINAFRIE-LAGGLYEDLLSSAAASIESE 280
KA E ++FL WY VLA++ INAFRIE + ED+L++AAASI E
Sbjct: 171 LLFKAISSKEARGLNLEFLTDDWYAGVLARLHINAFRIERIEEADSEDILAAAAASIMGE 230
Query: 281 IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
VG+A+Y+LPS YNH+CDPN I+W NA A L+A R ++ GEEL I YID+SM+ D R
Sbjct: 231 SIVGSAVYILPSMYNHNCDPNVDILWPSNATANLVARRSIKSGEELHITYIDSSMSLDER 290
Query: 341 QAILTQGFGFQCNCLRCSSGD 361
++ L Q +GF C C RC GD
Sbjct: 291 RSFLEQHYGFTCRCARCRDGD 311
>gi|414871461|tpg|DAA50018.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 173
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 116/173 (67%), Positives = 136/173 (78%)
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
M+ISGA +ADC++ILQPA L ++ M+E F +L S F+KAG +E FL WY NV
Sbjct: 1 MVISGAANADCLNILQPALLHQGTLIEMKEEFELLESTFRKAGFQEEITTFLTIDWYINV 60
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
LA+IRINAFRIEL YEDLLSSAAAS+ + +VGNA+YMLPSFYNHDCDPNAHI+W+
Sbjct: 61 LARIRINAFRIELVASSYEDLLSSAAASVSCDSSVGNAVYMLPSFYNHDCDPNAHIVWLQ 120
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
NADA+L ALRD+EEGEEL ICYIDASM DARQ IL GFGF+C CLRC SGD
Sbjct: 121 NADAKLKALRDIEEGEELCICYIDASMDADARQKILADGFGFECRCLRCLSGD 173
>gi|302824578|ref|XP_002993931.1| hypothetical protein SELMODRAFT_137937 [Selaginella moellendorffii]
gi|300138203|gb|EFJ04978.1| hypothetical protein SELMODRAFT_137937 [Selaginella moellendorffii]
Length = 290
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 176/290 (60%), Gaps = 6/290 (2%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PPIQV LTE+AGRGVFA+R+I A D+I T P+I+HP+ + + VC++CL++ + ++ F
Sbjct: 1 PPIQVKLTETAGRGVFASRKIGAGDVILTEFPVISHPSPTNMGKVCWWCLKRFSLITKPF 60
Query: 138 QHHNAR---FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
+ +C + C D AK D+E W F+DYCR L++PLLVKRLACM+ +
Sbjct: 61 LQLESSWMAYCSQECTDLAKGCMDLENNRHWKEFHDYCRESNLRFPLLVKRLACMVATNM 120
Query: 195 ESADCIDIL-QPASLSPELILAMEEGFVMLRSAF--KKAGIDDEQMKFLNKQWYTNVLAQ 251
S D +DIL P +L+ I + ++ F K+ ++++ L+ WY L++
Sbjct: 121 ASFDVLDILYPPKTLTRCGIDSASTKCILFLRFFPSKRVRFSFDRLEVLSDSWYAGALSR 180
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
I +N FR+E +L ++ SI + VG+A+Y+LPS +NH CD N +I W +NA
Sbjct: 181 IHLNTFRVETIATEITNLEAALVDSITGDDVVGSAVYILPSMFNHSCDANVNIYWRENAF 240
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+L AL+ +E G+EL I YIDASM +AR+A+L +GF C C RC D
Sbjct: 241 VQLKALQPIEPGKELCITYIDASMGCEARRALLQDAYGFHCKCPRCLDND 290
>gi|302758304|ref|XP_002962575.1| hypothetical protein SELMODRAFT_78536 [Selaginella moellendorffii]
gi|300169436|gb|EFJ36038.1| hypothetical protein SELMODRAFT_78536 [Selaginella moellendorffii]
Length = 290
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 176/290 (60%), Gaps = 6/290 (2%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PPIQV LTE+AGRGVFA+R+I A D+I T P+I+ P+ + + VC++CL++ + ++ F
Sbjct: 1 PPIQVKLTETAGRGVFASRKIGAGDVILTEFPVISQPSPTNMGKVCWWCLKRFSLITKPF 60
Query: 138 QHHNAR---FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
+ +C + C D AK D+E W F+DYCR L++PLLVKRLACM+ +
Sbjct: 61 LQLESSWMAYCSQECTDLAKGCMDLENNRHWKEFHDYCRESNLRFPLLVKRLACMVATNM 120
Query: 195 ESADCIDIL-QPASLSPELILAMEEGFVMLRSAF--KKAGIDDEQMKFLNKQWYTNVLAQ 251
S D +DIL P +L+ I + ++ F K+ ++++ L+ WY L++
Sbjct: 121 ASFDVLDILYPPKTLTRCGIDSASTKCILFLRFFPSKRVRFSFDRLEVLSDSWYAGALSR 180
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
I +N FR+E +L ++ SI + VG+A+Y+LPS +NH CD N +I W +NA
Sbjct: 181 IHLNTFRVETIATEITNLEAALVDSITGDDVVGSAVYILPSMFNHSCDANVNIYWRENAF 240
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A+L AL+ +E G+EL I YIDASM +AR+A+L +GF C C RC D
Sbjct: 241 AQLKALQPIEPGKELCITYIDASMGCEARRALLQDAYGFHCKCPRCLDND 290
>gi|10178118|dbj|BAB11411.1| unnamed protein product [Arabidopsis thaliana]
Length = 258
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 100/145 (68%), Positives = 115/145 (79%), Gaps = 4/145 (2%)
Query: 218 EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG-LYEDLLSSAAAS 276
+GF+ + + + DD FL KQWYT +LA+IRINAFRI+L GG EDLLS AAAS
Sbjct: 117 KGFLDVETRADWSSFDD---YFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAAS 173
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
+E E AVG+A+YMLPSFYNHDCDPNAHI+W+ NADARL LRDVEEGEELRICYIDASM
Sbjct: 174 VEGEGAVGHAVYMLPSFYNHDCDPNAHIIWLHNADARLNTLRDVEEGEELRICYIDASMG 233
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
+ARQ IL+QGFGF CNCLRC S D
Sbjct: 234 YEARQTILSQGFGFLCNCLRCQSTD 258
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/192 (41%), Positives = 114/192 (59%), Gaps = 3/192 (1%)
Query: 38 KMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRR 97
++ L RYSRC SRL++L S A++ Q PPPI+V LTESAGR VFATR+
Sbjct: 3 RLALNRYSRCF-SRLKTLTTPLFFSSSAASNRDGDYQIGPPPIRVGLTESAGRAVFATRK 61
Query: 98 IRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFY 157
I A DLIHTAKP++ P+L L+SVCY CL+K+ S++ F+ +C + C++N+K F
Sbjct: 62 IGAGDLIHTAKPVVACPSLLKLDSVCYLCLKKLMGSAK-FEDRGVSYCSQECQENSKGFL 120
Query: 158 DVERRADWSAFNDYCRSQGLKYPLLVK-RLACMIISGAESADCIDILQPASLSPELILAM 216
DVE RADWS+F+DY ++ +L + R+ I + D+L A+ S E A+
Sbjct: 121 DVETRADWSSFDDYFLTKQWYTAILARIRINAFRIDLVGGSCGEDLLSLAAASVEGEGAV 180
Query: 217 EEGFVMLRSAFK 228
ML S +
Sbjct: 181 GHAVYMLPSFYN 192
>gi|326518712|dbj|BAJ92517.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 166
Score = 197 bits (501), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 88/122 (72%), Positives = 102/122 (83%)
Query: 240 LNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
L KQWY NVLA+IR+NAFRIEL YEDLLSSA AS+ + +VGNA+YMLPSFYNHDCD
Sbjct: 45 LTKQWYINVLARIRVNAFRIELVASSYEDLLSSAVASVTCDASVGNAVYMLPSFYNHDCD 104
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
PN HI+W++NADA+L LRD++EGEELRICYID SM +ARQ ILT+GFGFQC C RC S
Sbjct: 105 PNTHIVWLENADAKLNTLRDIDEGEELRICYIDTSMNVNARQKILTEGFGFQCRCQRCLS 164
Query: 360 GD 361
GD
Sbjct: 165 GD 166
>gi|238006634|gb|ACR34352.1| unknown [Zea mays]
gi|414871459|tpg|DAA50016.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 183
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 80/160 (50%), Positives = 116/160 (72%), Gaps = 1/160 (0%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
PPPI+V+LTESAGRGVFATR + + +L+H+A+P+++HP+ S L+ VCY CLR+ +
Sbjct: 19 GPPPIRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHEVCYNCLRRKPGEGR 78
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
+ FC + C+D+AK F+D+E+ DWS F+ +C S+ LKYP + KRLACM+ISGA
Sbjct: 79 ASSGGDY-FCSDACRDHAKGFHDIEKNVDWSLFDYHCSSRDLKYPYMAKRLACMVISGAA 137
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
+ADC++ILQPA L ++ M+E F +L S F+KAG +E
Sbjct: 138 NADCLNILQPALLHQGTLIEMKEEFELLESTFRKAGFQEE 177
>gi|255074845|ref|XP_002501097.1| predicted protein [Micromonas sp. RCC299]
gi|226516360|gb|ACO62355.1| predicted protein [Micromonas sp. RCC299]
Length = 375
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 147/333 (44%), Gaps = 55/333 (16%)
Query: 80 IQVALTESAGRGVFATRRIRASDL-IHTAKPIITHPTLSTLNSVCYFCLRKITS--SSQH 136
+ V TE +GRGV+A R ++ IH A+P++ HPTL L + CY CL+ + + H
Sbjct: 38 VIVGHTERSGRGVYAGRDFAMGEVVIHRAEPLVAHPTLRNLRTSCYHCLKPLAAVPRESH 97
Query: 137 FQHH--------NARFCGEVCKDNAK-AFYDVERRAD--WSAFNDYCRSQGLKYPLLVKR 185
H FCGE C D A+ +++D E A + +CR GLK+PL R
Sbjct: 98 VPTHGGRAGGGAAGHFCGEPCADAARTSYHDAETAAGDAVAPLVAHCRQHGLKFPLAAAR 157
Query: 186 LACMIISGAESADCIDILQ----PASLSPELILAMEE--GFVMLRSAFKKAGIDDEQMKF 239
+A I G + D L P ++P L + R + G +
Sbjct: 158 MAFAIAQGNATPRDADALVRVNFPRGVAPGAWLDEHAMISLALARGDAVRGGAFGGRSSE 217
Query: 240 LNKQWYTNVLAQIRINAFRIEL----------------------------------AGGL 265
+ +WY V++++ +NAFR+E+ A
Sbjct: 218 VTAEWYVGVVSRMHLNAFRVEIPPPAVGHRDDDDTAGDAHDCDDHVSCNHHSHRSPAASS 277
Query: 266 YEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGE 324
++ L +A AS E+ A G A Y+ PS +NH CDPN + W+ +A R DV +G
Sbjct: 278 FKASLEAALASSETGDAAGTATYLAPSLFNHSCDPNVDVDWVSGDAGMRARTRVDVAKGT 337
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
EL ICY D DAR+ L +GF C C RC
Sbjct: 338 ELTICYTDGGAPVDARRGALEHAYGFVCRCERC 370
>gi|147816655|emb|CAN70637.1| hypothetical protein VITISV_032039 [Vitis vinifera]
Length = 139
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 62/109 (56%), Positives = 84/109 (77%), Gaps = 1/109 (0%)
Query: 65 TATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCY 124
T N K + P PPPI+V++TE AGRGVFATRRI + DLIHTAKP+++HP+LS+++SVCY
Sbjct: 32 TPPENEKLASPGPPPIRVSITEMAGRGVFATRRIGSGDLIHTAKPLVSHPSLSSIHSVCY 91
Query: 125 FCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR 173
FCLRK+ + + N RFC + C++ +K F VER+ADWSA++DYCR
Sbjct: 92 FCLRKLKPVTSS-EDCNVRFCSQECEEQSKVFVAVERKADWSAYDDYCR 139
>gi|308803330|ref|XP_003078978.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
gi|116057431|emb|CAL51858.1| SET domain protein 123 (ISS) [Ostreococcus tauri]
Length = 311
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 146/292 (50%), Gaps = 25/292 (8%)
Query: 82 VALTESAGRGVFATRRIRASD--LIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
V T ++GRG +A R + ++H +P++ P+ + ++ C C+R +
Sbjct: 32 VGETATSGRGAYAARDFAVGEAVVVHE-RPLVCAPSRANGRAIACETCVR---------E 81
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
RFC + C + ++ ER ++ C ++ LK+PL+V R A MI SG+
Sbjct: 82 SGGERFCSDACARAGRVEFERERGIEFDELEARCDARALKFPLVVARAATMIASGSLQYG 141
Query: 199 CIDILQPAS-LSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
++ L A+ + E + LRSA +A E L +WY +++ +N+F
Sbjct: 142 SLEFLVSANGVEKAPPRQWVEEWETLRSALARARPALEWTPSL--EWYAKTTSRLHLNSF 199
Query: 258 RIEL------AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
R+E+ AG ++ +++ +I A G A+Y S +NH C PNAH+ W +N D
Sbjct: 200 RVEIPPAMVGAGADFKSTMTAGIDAIARGAASGTAVYFTASLFNHSCAPNAHVSW-ENGD 258
Query: 312 ARLM--ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A + LR V GEE I Y+DA+ +R+A L + +GF C C RC+SG+
Sbjct: 259 AAITIRTLRPVRAGEEFNITYVDANERSASRRARLKEWYGFDCACERCASGE 310
>gi|303280571|ref|XP_003059578.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459414|gb|EEH56710.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 394
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 146/355 (41%), Gaps = 95/355 (26%)
Query: 69 NGKPS--QPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC 126
G PS P P + T+++GRG+FA+R I +LI A P++ HP L + VCY C
Sbjct: 38 RGSPSFAIPDGAPFFLGKTDASGRGLFASRAIARGELILRAPPLVAHPALDAIADVCYAC 97
Query: 127 LRKITSSSQHFQHHN----------------ARFCGEVCKDNAKAFYDVERRADWSAFND 170
LR+I S++ + ARFCGE C +A A + R+ A
Sbjct: 98 LRRIPSATAQTRDDGSGPKKPAPVAAEVPDGARFCGERCATDASARFHAWERSVGDALAP 157
Query: 171 YCRS----QGLKYPLLVKRLACMIISGAESADCIDIL----QPASLSPE--------LIL 214
R GLK+PLL RLA ++S + +A D L P ++PE L +
Sbjct: 158 LRRRCEDHGGLKFPLLAARLAFAVLSSSVAARDADALCRVNFPPGVAPEDWLEEHAMLKV 217
Query: 215 AMEEGFVMLRSA-------------------FKKAGIDDEQMK-----FLNKQWYTNVLA 250
A+ G + R A A D E+ L +WY V+
Sbjct: 218 ALSRGLAIRRLADARGGGTGGGGGGGGGDAQLAAAASDREREARVVDGALTPEWYVGVVG 277
Query: 251 QIRINAFRIEL------------------------------------AGGLYEDLLSSAA 274
++ +NAFR+E+ + + D + +A
Sbjct: 278 RMHLNAFRVEIPAIVDGGRGGGGAGAGGCGHEHGGERGGERGGERVSSSSAFRDAMEAAL 337
Query: 275 ASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRI 328
+S+ S G+A+Y LPS NH CDP+ W +A L A RDV GEELRI
Sbjct: 338 SSVASGRGTGSALYGLPSMLNHSCDPSVDAAWTYGDATLVLTARRDVAAGEELRI 392
>gi|145346226|ref|XP_001417594.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577821|gb|ABO95887.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 253
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 129/259 (49%), Gaps = 29/259 (11%)
Query: 116 LSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAK-AFYDVERR--ADWSAFNDYC 172
+ S C +CLR + C E C D + A+ D E+ D S YC
Sbjct: 1 MDNARSRCEWCLRASAGGAT---------C-ESCADARRLAYGDFEKTHGVDLSELEAYC 50
Query: 173 -RSQGLKYPLLVKRLACMIISGAESADCIDILQPAS-LSPELILAMEEGFVMLRSAFKKA 230
++ GLK+PL+ R A MI SGA + ++ L A+ + E +LR AF A
Sbjct: 51 EKNGGLKFPLITARAASMIASGALNKSTLEWLCAANDVETNPPAQWLEECDVLRRAFGDA 110
Query: 231 GIDDEQMKFLNKQWYTNVLAQIRINAFRIEL------AGGLYEDLLSSAAASIESEIAVG 284
G D + WY + +++ +N+FR+E+ + ++D+LS+ +I A G
Sbjct: 111 GKD-----IITPAWYAGITSRLHLNSFRVEIPVDAAASTTDFKDVLSAGLDAITQGTASG 165
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQA 342
+A+Y S NH C PN H W +N D+ L ALR++ GEEL I Y+DA RDAR+A
Sbjct: 166 SAVYKYVSLLNHSCAPNCHTHW-ENGDSSLTIRALREIAPGEELTITYVDADSPRDARRA 224
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F C C RC++G+
Sbjct: 225 RLANSYAFDCACSRCAAGE 243
>gi|414871460|tpg|DAA50017.1| TPA: hypothetical protein ZEAMMB73_470058 [Zea mays]
Length = 117
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 72/98 (73%), Gaps = 1/98 (1%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PPPI+V+LTESAGRGVFATR + + +L+H+A+P+++HP+ S L+ VCY CLR+ +
Sbjct: 20 PPPIRVSLTESAGRGVFATRPVSSGELLHSAQPLVSHPSHSLLHEVCYNCLRRKPGEGRA 79
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS 174
+ FC + C+D+AK F+D+E+ DWS F+ +CRS
Sbjct: 80 SSGGDY-FCSDACRDHAKGFHDIEKNVDWSLFDYHCRS 116
>gi|330843375|ref|XP_003293631.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
gi|325076008|gb|EGC29833.1| hypothetical protein DICPUDRAFT_90323 [Dictyostelium purpureum]
Length = 343
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 148/308 (48%), Gaps = 41/308 (13%)
Query: 80 IQVALTESAGRGVFAT---------RRIRASDLIHTAKPIITHPTLSTLN--SVCYFCLR 128
+++ ++ GRG+F T + +++ +P I++P++ L+ +C CL+
Sbjct: 35 VEIKESDKKGRGLFYTQNTQLPVINKETNKPNILFKEEPFISYPSIIKLDHDKICNHCLK 94
Query: 129 KITSSSQHFQHH--------NARFCGEVCKDNAK-AFYDVERRADWSAFN---DYCRSQG 176
+ +++ + ++ N ++C CK+ A ++ V +++ S F+ + + +
Sbjct: 95 SLENNNSNNNNNNHTECEGCNVKYCSNECKEKASLQYHSVLCKSNGSGFHYLEKHSQIEK 154
Query: 177 LKYPLLVKRLACMIISG---AESAD----CIDILQPASLSPELILAMEEGFVMLRSAFKK 229
++PLL ++ ++ G +SA+ + +L A+ P + ++ + RS K
Sbjct: 155 RRFPLLAGKILARVLMGYHLEKSANKTWFPLQMLTFAN-KPAPLEWKDDYLIFSRSLLK- 212
Query: 230 AGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYM 289
G+ + +K N W+ V+ + +N I++ +S+ +SI +Y
Sbjct: 213 -GMKESSLKKFNYDWFVKVMQILYLNTIGIDIDPNQTTTKMSTPESSI--------GLYF 263
Query: 290 LPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
L SF NH CDPNA++ + ++ A + L+ + GEE+ I Y D S R++ L + +G
Sbjct: 264 LTSFINHSCDPNAYVQFPNDHTAEIRLLKPINPGEEITISYADTSKDIIDRRSQLFENYG 323
Query: 350 FQCNCLRC 357
F C C +C
Sbjct: 324 FNCECPKC 331
>gi|281206757|gb|EFA80942.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 367
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 128/298 (42%), Gaps = 43/298 (14%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQHFQHH------- 140
GRG+F +++ +I+T KP +++ +L N ++C CL+ +
Sbjct: 69 GRGLFTKLDVQSGQVIYTEKPFVSYQSLEIDNKTICNHCLKSLDKDKIEKSKQKKDDDFD 128
Query: 141 ---------NARFCGEVCKDNAKAFYDVERRADWSAFND---YCRSQGLKYPLLVKRLAC 188
+ R+C E CK + Y + S F++ Y + ++P+L ++
Sbjct: 129 DLFSKCVKCSTRYCSEQCKSESDIQYHLASCPSTSGFSEILKYTAVEKRRFPILAAKILA 188
Query: 189 MIISG------AESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE-QMKFLN 241
I+ G E + + +L A P L + F++A + E K N
Sbjct: 189 RILLGYHFQKNMEHWENLQVLSFAKRDPPLEWKDDYNL------FQRALLTKESNKKRFN 242
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
W+ V+ + IN IE+ AAS S G ++ L SF NH CDPN
Sbjct: 243 YDWFVRVMQILYINTLGIEVGA------TEPKAASTSS----GIGLFYLTSFINHSCDPN 292
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ + + A L AL+ ++ G+EL I Y D + RQ+ L +GF CNC +C S
Sbjct: 293 CYLAFPTDHTAHLTALKPLKAGDELLIAYGDPNKDYIDRQSHLFDNYGFSCNCSKCQS 350
>gi|66801407|ref|XP_629629.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851021|sp|Q54D67.1|Y2454_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292454
gi|60463010|gb|EAL61206.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 343
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 129/285 (45%), Gaps = 41/285 (14%)
Query: 97 RIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQHFQHHNA-----RFCGEVCK 150
+ +LI +P I++P++ N ++C CL++I + + ++C CK
Sbjct: 66 KTNKPNLIFKEEPFISYPSIIKSNENICNHCLKEIKKEEEEIKQECEECKVYKYCSIECK 125
Query: 151 DNAK-AFYDVERRADWSAFN---DYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPA 206
+ + ++ V ++ S FN + + ++PLL ++ +I G L+ +
Sbjct: 126 EKSSIEYHSVLCKSTGSGFNYLEKHASIEKRRFPLLAGKILARMIMGYH-------LEKS 178
Query: 207 SLSPELILAM-------------EEGFVMLRSAFKKAGIDDEQMKF-LNKQWYTNVLAQI 252
S S L L M ++ + RS K GI++E MK + W+ V+ +
Sbjct: 179 SKSTWLPLQMLSFAKKPPPLEWKDDYLIFSRSLLK--GINNESMKKKFDYDWFVRVMQIL 236
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
+N I++ +SS + I +Y+L SF NHDCDPNA I + D+
Sbjct: 237 YLNTIGIDIDPNQQSTKMSSPESGI--------GLYLLTSFINHDCDPNAFIHFPDDHTM 288
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L L+ + G+E+ I Y D + R++ L + +GF C C +C
Sbjct: 289 HLSPLKPINPGDEITISYTDTTKDLVDRRSQLFENYGFNCECKKC 333
>gi|328869415|gb|EGG17793.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 363
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 128/291 (43%), Gaps = 34/291 (11%)
Query: 88 AGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS-VCYFCLRKI-----------TSSSQ 135
+GRG+FA + I +P +++P+L S +C CL+++ + Q
Sbjct: 75 SGRGLFAITAFKKGSQIFQEEPFVSYPSLDVDKSTICNHCLKQLNNNGNNNNDNKSKQQQ 134
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIIS 192
R+C E C+ +A+ + + S ++ R + ++PLL R+ I+
Sbjct: 135 QCTKCGERYCSEKCRSSAEVQHHLASCPTSSNIDNITRYSLVEKRRFPLLAARILSRILL 194
Query: 193 GAESADCI---DILQPASLSP-ELILAMEEGFVMLRSAFKKAGIDDE-QMKFLNKQWYTN 247
+ + LQ S + + L ++ + FKK+ + E K + W+
Sbjct: 195 ENHFDKTMHNWENLQVLSFAKKDAPLEWKDDY----DVFKKSLLTRESNQKRFDFNWFVR 250
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
++ + +N I++ + S ++ L SF+NH CDPN ++ +
Sbjct: 251 IMQILYLNTLGIDVG----------STKPSISSPTSSIGLFFLSSFFNHSCDPNVYMAFP 300
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ A + ALRD+++GEEL I Y D+ RQ L +GF C+C +C+
Sbjct: 301 HDKTAVITALRDIKKGEELFISYGDSEKDMFDRQTHLFDNYGFNCDCPKCT 351
>gi|440801197|gb|ELR22218.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 343
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/309 (24%), Positives = 121/309 (39%), Gaps = 65/309 (21%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIIT------------HPTLSTLNSVCYFC 126
P+ +A E GR + A I LI KP ++ HP T C C
Sbjct: 56 PVMIADVEGRGRAIVARHNITKGQLITAEKPFVSYLSERVETHGAAHPPKPTR---CDHC 112
Query: 127 LRKITSSSQHFQHHN-------ARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKY 179
+R + ++ HN AR+ +C + K ++E +C K+
Sbjct: 113 MRGLPPAAAKAAQHNNPAGLLLARYHEAIC-ELGKEVREME---------IHCSKDKRKF 162
Query: 180 PLLVKRLACMIISGAESADCID-------ILQPASLSPELILAMEEGFVMLRSAFKKAGI 232
PLL R I+ +D +L A PEL + FK+ I
Sbjct: 163 PLLAMRTMARILLNFRDTGSLDATWVPLQVLGFAKQKPEL-------WTQDYDVFKRCFI 215
Query: 233 DDE-QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP 291
+E Q F + +WY ++ IN+ + + ++ VG +Y+L
Sbjct: 216 HNEDQADFFSYEWYCRMMQIFYINSVSVTMDS---------------TQQNVGAGLYILS 260
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID---ASMARDARQAILTQGF 348
SFYNH C PN + +N + A + +E G+E+ I Y+D ++ + R+ L +
Sbjct: 261 SFYNHSCVPNTRSSYPENNTWHVYASKPIEAGQEIFISYVDHMKSNKTVETRRQHLYNHY 320
Query: 349 GFQCNCLRC 357
GF C C RC
Sbjct: 321 GFWCECPRC 329
>gi|322697990|gb|EFY89764.1| MYND finger family protein [Metarhizium acridum CQMa 102]
Length = 513
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 128/300 (42%), Gaps = 58/300 (19%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PPI+ S R + AT+ R S IHT P+I HP+LS L SVC CLR S
Sbjct: 2 PPIETRPHPSKHRALHATQPFRPSQTIHTFHPLILHPSLSHLASVCTHCLRP-GSPRACS 60
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
+ H A +C C+ +A W+A + + L+ L R +GA+
Sbjct: 61 RCHAAYYCNTSCQ-----------QAAWTAVHSK-ECKALQQRKLGSR------TGADLP 102
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFK-----KAGIDDEQMK-----FLNKQWYTN 247
+ L L LI +E+G L + K+ D E M F +Q +N
Sbjct: 103 TPVRAL----LQTLLIKEIEDGVAALDGHTEERRKTKSWPDLEMMALAACAFAGRQGESN 158
Query: 248 V------LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
V L +I+ NAF ++ L +E +A+ N H C PN
Sbjct: 159 VRKAVELLCKIQTNAFH------RWDVDLGQVGVFLEPTLAMAN----------HSCVPN 202
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A + ++ A L A R ++ G+E+ I Y D +M AR+ L Q + F+C C RC GD
Sbjct: 203 AVVQFVGR-KAILRAERPIQAGDEIEISYTDYTMPLSARREALEQ-YSFECTCARC-KGD 259
>gi|384493687|gb|EIE84178.1| hypothetical protein RO3G_08888 [Rhizopus delemar RA 99-880]
Length = 410
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 134/296 (45%), Gaps = 30/296 (10%)
Query: 88 AGRGVFATRRIRASDLIHTAKPIITHP-----TLSTLNSVCYFCLRKITSS---SQHFQH 139
G+G+FA+R I+ +++ T +P + P +L+ VC C + I + +QH +H
Sbjct: 104 TGKGLFASRDIKQDEILFTEEPYVYFPPWEGFSLARSGHVCGLCCKPILYANRLTQHCKH 163
Query: 140 HNARFCGEVCKDNA-KAFYDVE---RRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
+ +C + C+ A F+ +E + F ++C + + P+ V R+ +I +
Sbjct: 164 CDMYYCSKSCRATAWDQFHQLECTRLNPAITPFMNFCEGERWQAPMAVARIYARLILAHQ 223
Query: 196 SADCIDILQP----ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQ 251
+ D++ A++S E A E ++ + ++ + + L ++ Y A+
Sbjct: 224 REELNDVIGRLDAFATVSQEERQAKETEWIFMEQPTRELW---NKARDLLRKAYKQPPAK 280
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIE--SEIAVGN---AIYMLPSFYNHDCDPNAHIMW 306
+I +L L E L + + + N +Y++ S NH+C PN I +
Sbjct: 281 CKITQ---DLPEALKEKLFEDEETFLNFMGKFNINNQNGGMYLVHSHINHNCYPNVSIDY 337
Query: 307 IDNADA---RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ + A+RD+++GE+L Y++ ++ RQ L + + F C C RC
Sbjct: 338 PNPKSQYKIAVRAVRDIKQGEQLFETYVNPRWNKETRQTYLDKSYLFNCKCERCEK 393
>gi|440300086|gb|ELP92579.1| set and mynd domain containing protein, putative [Entamoeba
invadens IP1]
Length = 420
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 127/301 (42%), Gaps = 33/301 (10%)
Query: 84 LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI--TSSSQHFQHHN 141
++E+ G+GVF+ + + +++I+T P+I + C +CL+ + S+ + N
Sbjct: 123 ISETLGKGVFSKKEHQVNEIIYTETPVIASAKYKNYKNCCLYCLKALFPNESNTYTSVFN 182
Query: 142 AR------------FCGEVCKDNAKAFYDVERRADWSAFN---DYCRSQGLKYPLLVKRL 186
+C + C+D DV + N +C + + +PL + ++
Sbjct: 183 TNVVKGVKCKCGYVYCSKKCQD-----LDVGHQLVCGKLNKVVSFCTERCVSHPLTITKM 237
Query: 187 ACMIISGAESADCIDILQPASLSP--ELILAMEEGFVML----RSAFKKAGIDDEQMKFL 240
++S I P + + +E+ FV + F+ G + +
Sbjct: 238 FAKVLSATYREPTIYPFVVFHAQPLVKFDMTVEKIFVEIFGNKLETFQDFGGWKYVYRIM 297
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAA---SIESEIAVGNAIYMLPSFYNHD 297
+ N + +NA ++ LA + +LS A + G +Y + NH
Sbjct: 298 SSVLKYNASTILPLNAIQM-LATDPQKKVLSKEEALNWEVSKFSVEGEGLYKYLNTLNHS 356
Query: 298 CDPNAHIM-WIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
CDPN + D+ L+AL+D++ GEEL I YID SM ++ R L + F C C +
Sbjct: 357 CDPNCVLACTTDDFKLSLIALKDIKAGEELTISYIDNSMNKETRLKTLMDQYNFDCKCKK 416
Query: 357 C 357
C
Sbjct: 417 C 417
>gi|147771941|emb|CAN75702.1| hypothetical protein VITISV_035989 [Vitis vinifera]
Length = 223
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 36/57 (63%), Positives = 46/57 (80%), Gaps = 2/57 (3%)
Query: 69 NGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYF 125
N KP+ P PPPI+V++TE AGRGVFATRR+ + DLIHTAK +ITHP+LS+ +C F
Sbjct: 40 NEKPASPGPPPIRVSITEMAGRGVFATRRMGSGDLIHTAKLLITHPSLSSF--ICPF 94
>gi|345566296|gb|EGX49239.1| hypothetical protein AOL_s00078g272 [Arthrobotrys oligospora ATCC
24927]
Length = 526
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 137/311 (44%), Gaps = 51/311 (16%)
Query: 78 PPIQVALT------ESAGRGVFATRRIRA-SDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
PP+ ++ + SAGRGVFA+ I A ++++ A P+I P + L+S C++C+ +
Sbjct: 3 PPVNLSPSAYLKDHPSAGRGVFASGTIPAGTEILSVADPLICIPDEAHLDSCCHYCMAEA 62
Query: 131 TSSSQHF-QHHNA-------------RFCGEVCKDNAKAFYDVERRADWS--AFNDYCRS 174
T + Q + A ++C + C+ DW A C
Sbjct: 63 TDEAPSVNQAYRAPVKLSYCLGCRVVKYCSKACQTT-----------DWKQKAHKYECVI 111
Query: 175 QGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSA-FKKAGID 233
+YP ++ A I+ A+ Q S +P + G + + F K G D
Sbjct: 112 YKAQYPRILPVTARAILRMAK--------QFLSETPGSNIVGGIGALKAHAEDFAKIGGD 163
Query: 234 DEQMKFLNKQW---YTNVLAQIRINAFRIELAGGLYEDLL-SSAAASIESEIAVGNAIYM 289
+M L + ++ V QI +F E +Y LL +SA+ ++ +G +
Sbjct: 164 IWEMANLTAKAAAEFSKVSKQI---SFEPEFLRDMYCKLLINSASVVTQTFDPIGLCLAY 220
Query: 290 LPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+ +NH CDPNA +M D + +L+++ + E+ I YID +R R+ L +
Sbjct: 221 QSAMFNHSCDPNA-VMMFDGRQLSIRSLKEITKDTEITISYIDNLASRKERKEELKSRYF 279
Query: 350 FQCNCLRCSSG 360
F C+C CSSG
Sbjct: 280 FDCSCSLCSSG 290
>gi|301777093|ref|XP_002923966.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Ailuropoda melanoleuca]
Length = 477
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 123/294 (41%), Gaps = 35/294 (11%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PP ++ T + G G+ A ++R +L+ + P+ + VC CL +
Sbjct: 3 PPKVEKFSTANRGNGLRALAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRC 62
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII----- 191
Q A++C C+ KA+ D +R +S +YP RL ++
Sbjct: 63 SQCRVAKYCSAKCQ--KKAWPDHKRECK------CLKSCKPRYPPDSVRLLGRVVFKLME 114
Query: 192 -SGAESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+ +ES S +L +EG VM F + I D L +
Sbjct: 115 ETPSESEKLYSFYDLESNINKLTEEKKEGLRQLVMTFQHFMREEIQDASQLPLPFDIF-E 173
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
A++ NAF I A ++ E+ VG +Y S NH CDPN I++
Sbjct: 174 AFAKVICNAFTI-------------CNAEMQ-EVGVG--LYPSMSLLNHSCDPNCSIVF- 216
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 217 NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|344278575|ref|XP_003411069.1| PREDICTED: SET and MYND domain-containing protein 3 [Loxodonta
africana]
Length = 428
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 123/293 (41%), Gaps = 35/293 (11%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P ++ T G G+ A+ +R DL+ + P+ + +VC CL +
Sbjct: 4 PKVEKFRTADRGNGLRASAALRPGDLLFRSDPLAYTVCKGSRGAVCDRCLLGKEKLLRCS 63
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP----LLVKRLACMIISG 193
Q A++C C+ KA+ D +R CR +YP L+ R+ C ++ G
Sbjct: 64 QCRVAKYCSAKCQ--KKAWPDHKRECKCLK---NCRP---RYPPDSVRLLGRVVCKLMEG 115
Query: 194 --AESADCIDILQPASLSPELILAMEEGFVMLRSAFK---KAGIDDEQMKFLNKQWYTNV 248
+ES S +L +EG L F+ + I D + +
Sbjct: 116 TPSESEKLYSFYDLESNINKLTEDKKEGLRQLAMTFQDFMREEIQDASQLPPSFDIF-EA 174
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
A++ N+F I A ++ VG +Y S NH CDPN I++ +
Sbjct: 175 FAKVICNSFTI-------------CDAEMQE---VGVGLYPSMSLLNHSCDPNCSIVF-N 217
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 218 GPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCRTQD 270
>gi|348534108|ref|XP_003454545.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Oreochromis niloticus]
Length = 439
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 123/309 (39%), Gaps = 61/309 (19%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P ++ ++ G G+ A RIR +L+ +A+P+ + VC+ C + S +
Sbjct: 4 PMLERFVSPGKGNGLRAAGRIRKGELVISAEPLACCVSNRLAKEVCHHCFTRHESLLRCS 63
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS-QGL--KYPLLVKRLACMIISGA 194
Q AR+C C+ A WS C+ Q L + P RLA +I
Sbjct: 64 QCKMARYCNTTCQKQA-----------WSGHKRECKCLQSLLPRLPTDSVRLAAKLI--- 109
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRS--AFKKAGIDD--EQMKFLNKQWYTNVL- 249
+L P+ S E + +EE L S KK G+ ++ +Q ++L
Sbjct: 110 -----FALLSPSKSSSEELYTLEEHESHLTSMPEQKKQGLSQLASMLELYLQQEAPDLLQ 164
Query: 250 -----------------AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPS 292
A++ N F I G L E +G +Y S
Sbjct: 165 EMGSALPPSCREPLSLIAKVTCNCFTIS-DGELQE---------------IGVGLYPSLS 208
Query: 293 FYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NHDC PN +M + L A+RD++ +EL I YI+ + RQ L + + F C
Sbjct: 209 LLNHDCRPNC-VMVFEGTKLELRAVRDIDPEDELTISYIETLSLTEDRQRQLEEQYHFTC 267
Query: 353 NCLRCSSGD 361
+C RC S D
Sbjct: 268 HCQRCDSRD 276
>gi|322708541|gb|EFZ00118.1| MYND finger family protein [Metarhizium anisopliae ARSEF 23]
Length = 526
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 125/294 (42%), Gaps = 46/294 (15%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PPI+ S R + AT+ + S +IHT P+I HP+LS L SVC CLR S
Sbjct: 2 PPIETRPHPSKHRALHATQPFKPSQIIHTFHPLILHPSLSQLASVCTHCLRP-GSPRACS 60
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRS---------QGLKYPLLVKRLA 187
+ H A +C C+ +A W+A + C++ G P V+ L
Sbjct: 61 RCHAAYYCNTSCQ-----------QAAWTAVHSKECKALQQRKLGSRTGADLPTPVRALL 109
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+++ E+ D I L + + + +M +A AG E N +
Sbjct: 110 QTLLN-KETEDGIAALDGHTEERRKTKSWPDLEMMAMAACAFAGRQGES----NIRRAVE 164
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+L +I+ NAF ++ L +E +A+ NH C PNA + ++
Sbjct: 165 LLCKIQTNAFH------RWDVDLGQVGVFLEPTLAMA----------NHSCVPNAVVQFV 208
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A L A R + G+E+ I Y D +M R+ L Q + F+C C RC GD
Sbjct: 209 GR-KAILRAERPIHAGDEIEISYTDYTMPLSTRREALEQ-YSFECTCARC-KGD 259
>gi|225715142|gb|ACO13417.1| SET and MYND domain-containing protein 3 [Esox lucius]
Length = 429
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 124/280 (44%), Gaps = 32/280 (11%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
G G+ A +R+ A +L++TA+P+ + N VC+ C + + + Q AR+C
Sbjct: 14 GNGLRANQRVLAGELLYTAEPLACCVSSQCRNVVCHSCFSRHETLLRCSQCKVARYCDVT 73
Query: 149 CKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIISGAESADCIDILQP 205
C+ A WS C+ S + P RLA II +L P
Sbjct: 74 CQKRA-----------WSIHKRECKCLLSLHPRIPTDSVRLAARII--------FRLLCP 114
Query: 206 ASLSPELILAME-EGFVMLRSAFKKAGIDD--EQMKFLNKQWYTNVLAQIRINAFR-IEL 261
+ ++P+L E E + K+ G+ ++ KQ +++ ++ +F I L
Sbjct: 115 SQMTPQLYSFEEHESHLCDMGEEKREGLSQLSSMLQLYLKQEQPDIIQ--KVPSFDPISL 172
Query: 262 AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE 321
+ + + + A ++ EI VG +Y S NHDC P+ +M +L A+RD++
Sbjct: 173 LAKVTCNCFTISDAELQ-EIGVG--LYPSMSLLNHDCRPSC-VMLFQGKTLQLRAIRDIQ 228
Query: 322 EGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
EE+ I YI + RQ L + + F C C CS+ +
Sbjct: 229 PTEEVTISYIGVLLPTRERQTQLMEQYHFSCQCGLCSTAE 268
>gi|440793165|gb|ELR14357.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 387
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 130/332 (39%), Gaps = 79/332 (23%)
Query: 86 ESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRKITSSSQHFQHHNAR 143
E+ GRG+FA R +A D+I P+ P + + C CLRK+ +S+ + R
Sbjct: 64 ETMGRGMFALRDFKAGDVILEESPLAFAPRDCVVAQMLHCSQCLRKLAASTAAVTCPHCR 123
Query: 144 ---FCGEVCKDNA----------KAFYDVERRAD-WSAFNDYCRSQGLKYPLLVKRLACM 189
+C + C A A D E FN CR Q YPLL+ RL
Sbjct: 124 LDPYCSDRCAAEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARLIAQ 183
Query: 190 I----------ISGAESAD------------CIDILQPASLSPELILAMEEGFVMLRSAF 227
+ +S E+ D C ++P PE F ML F
Sbjct: 184 LSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPE--------FSMLDYIF 235
Query: 228 KKAGIDDEQMK-------FLNKQ----------WYTNVLAQIRINAFRIEL--AGGLYED 268
+ + + FL+ + W +L + + +E A G+Y D
Sbjct: 236 GQTNLASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYTD 295
Query: 269 LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEEL 326
G A+++ S++NH C PN + D R+ +A D+++GE+L
Sbjct: 296 RRED-----------GVALFLHASYFNHSCTPNVD-RCNRHGDKRVAFIACADIKKGEQL 343
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRCS 358
I Y+D D R+ L Q +GF C+C +CS
Sbjct: 344 FIEYVDTRAPVDERRQELAQRYGFLCSCPKCS 375
>gi|345803330|ref|XP_537223.3| PREDICTED: SET and MYND domain-containing protein 3 [Canis lupus
familiaris]
Length = 428
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 120/298 (40%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
PP ++ + S G G+ A+ ++R +L+ + P+ + VC CL +
Sbjct: 2 EPPKVEKFSSASRGNGLRASAQLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMII- 191
Q A++C C+ A W C+ S +YP RL ++
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 192 -----SGAESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
+ +ES S +L ++G VM F + I D + +
Sbjct: 111 KLMEETPSESEKLYSFYDLESNINKLTEDKKDGLRQLVMTFQHFMREEIQDAS-QLPSSF 169
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
A++ NAF I A ++ VG +Y S NH CDPN
Sbjct: 170 DIFEAFAKVICNAFTI-------------CNAEMQE---VGVGLYPSMSLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C+RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLKDQYCFECDCVRCETQD 270
>gi|432102449|gb|ELK30026.1| SET and MYND domain-containing protein 1 [Myotis davidii]
Length = 449
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 116/287 (40%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ ASD+I + S +N VC+ C ++ Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWASDVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
AR+C C+ +A V+ + + A Y K P RLA I+ E +
Sbjct: 69 KFARYCDRTCQKDAW----VDHKKECLAVKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVENFGEEEQKQLRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 223 LRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQKG 269
>gi|296230793|ref|XP_002760890.1| PREDICTED: SET and MYND domain-containing protein 3 [Callithrix
jacchus]
Length = 428
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 118/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
PP ++ T G G+ A +R +L+ + P+ + + VC CL +
Sbjct: 2 EPPKVEKFATADRGNGLRAVTPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSSKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ G+ ES S +L +EG VM F + I D
Sbjct: 111 KLMEGSPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 IF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+++ + L A+RDVE GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 VVF-NGPHLLLRAVRDVEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|440801098|gb|ELR22123.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 397
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 130/333 (39%), Gaps = 79/333 (23%)
Query: 85 TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRKITSSSQHFQHHNA 142
E+ GRG+FA R +A D+I P+ P + + C CLRK+ +S+ +
Sbjct: 72 AETMGRGMFALRDFKAGDVILEESPLAIAPRDCVVAQMLHCSQCLRKLAASTAAVACPHC 131
Query: 143 R---FCGEVCKDNA----------KAFYDVERRAD-WSAFNDYCRSQGLKYPLLVKRLAC 188
R +C + C A A D E FN CR Q YPLL+ RL
Sbjct: 132 RLDPYCSDRCAAEAWESHHRLLCEAAQADPETTGQALRDFNALCREQARAYPLLIARLIA 191
Query: 189 MI----------ISGAESAD------------CIDILQPASLSPELILAMEEGFVMLRSA 226
+ +S E+ D C ++P PE F ML
Sbjct: 192 QLSTDVAKLGADVSLEEALDRPNGLFSHLRHFCSADIEPQGFPPE--------FSMLDYI 243
Query: 227 FKKAGIDDEQMK-------FLNKQ----------WYTNVLAQIRINAFRIEL--AGGLYE 267
F + + + FL+ + W +L + + +E A G+Y
Sbjct: 244 FGQTNLASHILSTCAATICFLDPRCVLVDEHFCFWLFTILMRNTFGLWIVEEGEAEGVYT 303
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEE 325
D G A+++ S++NH C PN + D R+ +A D+++GE+
Sbjct: 304 DRRED-----------GVALFLHASYFNHSCTPNVD-RCNRHGDKRVAFIACADIKKGEQ 351
Query: 326 LRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L I Y+D D R+ L Q +GF C+C +CS
Sbjct: 352 LFIEYVDTRAMVDERRQELAQRYGFLCSCPKCS 384
>gi|399216320|emb|CCF73008.1| unnamed protein product [Babesia microti strain RI]
Length = 380
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 121/275 (44%), Gaps = 46/275 (16%)
Query: 87 SAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCG 146
S G G+ A R I++ I T KP+I+ +ST + Y CL + S++F ++
Sbjct: 12 SIGIGLIAVRNIKSGTAIVTQKPVIS---ISTDGIIFYSCLGEKIILSENFTSRLSK--- 65
Query: 147 EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPA 206
+C D F+ V + + A N+ + P ++K L ++ + + +++ +
Sbjct: 66 -ICCDRHGKFFQVLKYVNGKAINNNAKVG----PCMIKTLFLIV-----TEENVNVWSHS 115
Query: 207 SLSPELILAMEEGFVML---RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAG 263
+ E ++ + F + R A GID L V+ I N+F IE
Sbjct: 116 NY--EKVMKNQSSFALANIAREALLNFGIDKSIKDILR------VITCISENSFSIENNH 167
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLP-SFYNHDCDPNAHIMWIDNADARLMALRDVEE 322
+Y G+A+Y P + NH C+PN I + N + ALRD+
Sbjct: 168 HVY-----------------GSALYKPPITKVNHSCNPNCTIKFSANV-ITMEALRDIRA 209
Query: 323 GEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
G++L I Y R+ RQA L + +GF+C C+ C
Sbjct: 210 GKQLFISYTYNVQPRNVRQANLLEQYGFECKCVYC 244
>gi|50740707|ref|XP_419536.1| PREDICTED: SET and MYND domain-containing protein 3 [Gallus gallus]
Length = 428
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 124/301 (41%), Gaps = 51/301 (16%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P ++ L+ G G+ + R++R +L++ A+P T L VC CL++ +
Sbjct: 4 PALERFLSPGKGSGLRSRRQVRPGELLYRAEPFAYVVTKEQLGGVCEQCLQRNEHLHRCS 63
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS-QGLK--YPLLVKRLACMII--- 191
Q A++CG+ C+ A W C+ Q +K +P RLA I+
Sbjct: 64 QCKVAKYCGKSCQKEA-----------WLDHKRECKCLQNVKPNFPPDSVRLAGRIVFKL 112
Query: 192 ---SGAESADCIDILQPASLSPELILAMEEGF--------VMLRSAFKKAGIDDEQMKFL 240
S S S + +L M+EG + LR+ + A + F
Sbjct: 113 LRQSACLSERLYSFKDLQSNAEQLSEEMKEGLGHLAHTLQLYLRAEIQDASHLPPAIDFF 172
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
+ ++ N F I + G +D VG +Y S NH CDP
Sbjct: 173 Q------IFTKVTCNCFTI--SNGEMQD--------------VGVGLYPSMSLLNHSCDP 210
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
N I++ + L ++R+++ GEEL I YI++ M RQ L + + F+C+C C
Sbjct: 211 NCVIIF-EGYQLLLRSIREIQIGEELTISYIESLMPTSERQKQLKRQYCFECDCCLCQDQ 269
Query: 361 D 361
+
Sbjct: 270 E 270
>gi|213512076|ref|NP_001133866.1| SET and MYND domain-containing protein 3 [Salmo salar]
gi|209155626|gb|ACI34045.1| SET and MYND domain-containing protein 3 [Salmo salar]
Length = 429
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 115/291 (39%), Gaps = 54/291 (18%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
G G+ AT I A +L++TA+P+ + + +C+ C + + + Q AR+C
Sbjct: 14 GNGLRATHEIHAGELLYTAEPLAYCVSNTCAKDLCHSCFSRRKTLLRCSQCKVARYCDVT 73
Query: 149 CKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAESADCIDILQP 205
C+ A WS C RS + P RLA II +L P
Sbjct: 74 CQKQA-----------WSDHKRECKCLRSLHPRIPTDSVRLAARII--------FRLLSP 114
Query: 206 ASLSPELI-LAMEEGFVMLRSAFKKAGIDD--EQMKFLNKQWYTNV------------LA 250
+ SP L L E + ++ G+ ++ +Q N+ LA
Sbjct: 115 SQTSPGLCSLEEHESHLCDMGEERREGLSQLSSMLQLYIQQEVPNITQELPAVDPLSLLA 174
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
++ N F I G L E +G +Y S NHDC P+ +M +
Sbjct: 175 KLTCNCFTIS-DGELRE---------------IGVGLYPSMSLLNHDCRPSC-VMLFEGE 217
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+RD++ EEL I YI R+ L + + F C C RC++ D
Sbjct: 218 TLHLRAVRDMQPAEELTISYIGTLAPTRDRRTQLEEQYHFTCQCQRCTTAD 268
>gi|393215651|gb|EJD01142.1| MAS20-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 555
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 143/329 (43%), Gaps = 68/329 (20%)
Query: 92 VFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKITSSSQ---HFQHHNARFCG 146
+ A++ +A D+I+ PI+ P L C CLR++ S + + +C
Sbjct: 196 LVASKSFKAGDVIYKETPIVASLDPDLVASGRHCGHCLREVESDAAIKPESDRLGSIYCS 255
Query: 147 EVCKDNAKA------FYD-----------------VERRADWSAFNDYCRSQGLKYPLLV 183
E C+ + F D ER A ++CR +G LLV
Sbjct: 256 EKCQTQSGTQSQDMLFSDKSPLPAEVAADLPQQSVAERPKTQDALAEFCRQKGTSGLLLV 315
Query: 184 KRLACMIISGAESADCIDILQPASLSPELI-LAMEEGFVML----RSAFKKAGIDDEQMK 238
RL + I AE A + Q A + +L ++ E + + R + + D++ K
Sbjct: 316 ARLVAIQIV-AELAKALP--QAAHMKEQLPEFSISEQYTVYDHIERLRYIDVTLPDDEYK 372
Query: 239 FLNK------QWYTNVLAQIRINAFRIELAG---GLY-----EDLLSSA-------AASI 277
L + + +V + R FR ++A G+Y +D +++ AA +
Sbjct: 373 TLRELLLATLPFLEDVHSDERHAMFRGKVAYNAIGIYFGEGRDDKPTTSELERTRIAAGV 432
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMA 336
+I G+ +Y++ S+ H C P+ ++ + ++ L+A RD+EEGEEL + Y+D S
Sbjct: 433 SKQI--GSGLYLVSSYIGHSCAPSVRPVFSEGTSELHLLAERDIEEGEELTMAYVDVSQR 490
Query: 337 RDA--------RQAILTQGFGFQCNCLRC 357
+ R++ LTQG+ F C C++C
Sbjct: 491 SEETPVEAFTRRRSELTQGWKFACACVKC 519
>gi|336274372|ref|XP_003351940.1| hypothetical protein SMAC_00488 [Sordaria macrospora k-hell]
gi|380096224|emb|CCC06271.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 537
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 126/308 (40%), Gaps = 48/308 (15%)
Query: 76 SPPPIQVALTESA-----GRGVFATRRIRASDLIHTAK-PIITHPTLSTLNSVCYFCLRK 129
+PPPI T + GR VF+T R A + I + P++ P + + + C +CL
Sbjct: 5 NPPPIPKTRTSGSPGLDHGRTVFSTERFGAGETIAILENPLLALPDDANMRTTCNYCL-- 62
Query: 130 ITSSSQHFQ---HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRL 186
S + F+ R C + C A Y N C+ Q K LV +
Sbjct: 63 YVSGTIEFEGDVKAGPRTC-KACTGCKAAVY----------CNAECQRQHWK---LVHKA 108
Query: 187 ACMIISGAESADCID-ILQPASLSPELILAMEEGFVMLRSAFKKAGI---DDEQMKFLNK 242
C + D + P +++L ++ G + AF G + E K +
Sbjct: 109 ECKMFKRIRERTGKDWVPTPVRAVAQVMLLLKAGDEEVTKAFGPGGTLESNVEGFKTDEE 168
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV----------GNA-IYMLP 291
W L + AG L D A + +I G A IY+ P
Sbjct: 169 LWKDFELQATGAVVY----AGLLQSDETLKQAMEVLCKIQTNAFNRFDADTGQAGIYLHP 224
Query: 292 SF--YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
S NH C PNA+I + + A L A RD+E G+E+ I YID + R ARQ L + +
Sbjct: 225 SLSMVNHSCVPNAYITF-EKRKAFLKAERDIEPGDEILISYIDHTAPRRARQESL-RLYH 282
Query: 350 FQCNCLRC 357
FQCNC+RC
Sbjct: 283 FQCNCVRC 290
>gi|291386373|ref|XP_002709685.1| PREDICTED: SET and MYND domain containing 1 isoform 1 [Oryctolagus
cuniculus]
Length = 477
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 118/287 (41%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + + F +A +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ + NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLALVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 223 LRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQKG 269
>gi|327286307|ref|XP_003227872.1| PREDICTED: SET and MYND domain-containing protein 3-like [Anolis
carolinensis]
Length = 334
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 126/306 (41%), Gaps = 47/306 (15%)
Query: 71 KPSQPSPPPIQVALTESA----GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC 126
+P + + P++ + + A G G+ ++ ++A L++ +P T L + C C
Sbjct: 23 RPFRAALGPVKTTMEKFASPGRGNGLRLSKAVKAGALLYREEPFAYAVTKKRLGAACERC 82
Query: 127 LRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLV 183
LR+ + Q AR+C C+ A W C+ S +P
Sbjct: 83 LRRKERLLRCSQCKVARYCDARCQKEA-----------WEDHKRECKCIKSVEPNFPPDS 131
Query: 184 KRLACMII------SGAESADCIDILQPASLSPELILAMEEGFVMLRSAFK---KAGIDD 234
RL II S S + S +L M+EG L + K I D
Sbjct: 132 VRLVGRIIFKLLRQSTCPSEELYSFSDLQSNVEKLSEEMKEGLGHLAKTLQLYLKVEIQD 191
Query: 235 -EQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
Q+ + T A++ N F I + G +D VG +Y S
Sbjct: 192 VSQLPPALDVFET--FAKVACNCFSI--SNGEMQD--------------VGVGLYPSMSL 233
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
N+ CDPN I++ + + L ++R+++EGEEL ICY++ M RQ L + + F+C+
Sbjct: 234 LNNSCDPNCAIVF-EGPELHLRSIREIQEGEELTICYVETMMPTPERQERLKRQYCFECD 292
Query: 354 CLRCSS 359
CL C +
Sbjct: 293 CLMCCT 298
>gi|347300358|ref|NP_001153564.2| SET and MYND domain-containing protein 3 [Sus scrofa]
Length = 428
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 119/295 (40%), Gaps = 35/295 (11%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A R+R +L+ + P+ + VC CL +
Sbjct: 2 EPSKVEKFSTADRGNGLRALARLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII---- 191
Q A++C C+ KA+ D +R +S +YP RL +I
Sbjct: 62 CSQCRVAKYCSAKCQ--KKAWQDHKRECKC------LKSCKPRYPPDSVRLLGRVIFKLM 113
Query: 192 --SGAESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQWYT 246
+ +ES S +L +EG V+ F + I D + +
Sbjct: 114 EETPSESEKLYSFYDLESNMNKLTEEKKEGLRQLVLTFQHFMREEIQDASQLPPSFDIF- 172
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
A++ N+F I A ++ VG +Y S NH CDPN I++
Sbjct: 173 EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSMSLLNHSCDPNCSIVF 216
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 217 -NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|326915443|ref|XP_003204027.1| PREDICTED: SET and MYND domain-containing protein 3-like [Meleagris
gallopavo]
Length = 427
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 124/291 (42%), Gaps = 44/291 (15%)
Query: 84 LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNAR 143
L+ G G+ + RR+R +L++ A+P T L VC C ++ + Q A+
Sbjct: 10 LSPGKGSGLRSRRRVRPGELLYRAEPFAYVVTKEQLGGVCEQCFQRNEHLHRCSQCKVAK 69
Query: 144 FCGEVCKDNAKAFYDVER-----RADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
+CG+ C+ + + D +R R F R G L+++ AC+ +E
Sbjct: 70 YCGKSCQK--EGWLDHKRECKCLRNVKPNFPXSVRLAGRIVFKLLRQSACL----SEKLY 123
Query: 199 CIDILQPASLSPELILAMEEGF--------VMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
LQ S + +L M+EG + LR+ + A + F +
Sbjct: 124 SFSDLQ--SNAEQLSEEMKEGLRHLAHTLQLYLRAEIQDASHLSPAIDFFQ------IFT 175
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
++ N F I + G +D VG +Y S NH CDPN I++ +
Sbjct: 176 KVTCNCFTI--SNGEMQD--------------VGVGLYPSMSLLNHSCDPNCVIIF-EGY 218
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L ++R+++ GEEL + YI++ M RQ L + + F+C+C C +
Sbjct: 219 QLLLRSVREIQIGEELTVSYIESLMPTSERQEQLKRQYCFECDCCLCQDQE 269
>gi|167535778|ref|XP_001749562.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771954|gb|EDQ85613.1| predicted protein [Monosiga brevicollis MX1]
Length = 338
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 50/76 (65%), Gaps = 1/76 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G +Y S NH C PNA +++ + A A L ++RD+ EGEE+ I Y+D ++ARD R+
Sbjct: 130 GAVVYAELSRANHSCQPNAAVVY-NGAAAVLRSMRDIPEGEEVCISYVDPTLARDVRRRE 188
Query: 344 LTQGFGFQCNCLRCSS 359
L Q +GF C+C RC++
Sbjct: 189 LVQSYGFACDCARCAT 204
>gi|334322119|ref|XP_003340188.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Monodelphis domestica]
Length = 441
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 125/288 (43%), Gaps = 16/288 (5%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P ++ + S G G+ A +R +L+ ++P+ +L VC CL + +
Sbjct: 4 PKLEKFWSPSRGNGLRALASLRPGELLFRSEPLAYTVCKESLGVVCERCLCRKEKLLRCS 63
Query: 138 QHHNARFCGEVCKDNAK--AFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
Q AR+CG C+ ++ +F D+ R ++ + S+ K+ + +I
Sbjct: 64 QCKVARYCGSACQADSSEVSFVDLFGRQSPTSTFPHFPSEPPKHLANSGSVCINLIVYVY 123
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRIN 255
+ + P + P I + + + S KK G+ + Q Y Q +
Sbjct: 124 IPRKYNAMLPLMIGPFPIFSPD---IKNLSEEKKKGLGHLAVTL---QLYLKEEIQ---D 174
Query: 256 AFRIELAGGLYED----LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
A ++ A ++E + + A S VG +Y S NH CDPN I++ +
Sbjct: 175 ASQLPPAFDIFESFAKVICNGFAISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVF-EGPS 233
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L A+RD+++GEEL ICY+D M RQ L + + F C+C C +
Sbjct: 234 LFLRAVRDIQQGEELTICYLDVLMPSAERQKQLKEQYCFDCDCPGCET 281
>gi|384498493|gb|EIE88984.1| hypothetical protein RO3G_13695 [Rhizopus delemar RA 99-880]
Length = 553
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 118/284 (41%), Gaps = 34/284 (11%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH--FQHHNARFCG 146
GR + +T I +I T++P+ T T+N C +C RK T + + FC
Sbjct: 21 GRYIKSTATIPKGTVIITSQPLGTVALPQTINEYCNYCFRKQTRPPLQRCSRCKSVYFCD 80
Query: 147 EVCKDNAKAFY-----DVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES-ADCI 200
C NA Y D R A N+ K L V R + E+ + +
Sbjct: 81 MGCFSNAWLSYHQFVCDPARVRHEDAENELDLEMLEKVALNVSRYRKRAKTETEAEGETV 140
Query: 201 DILQPASLSPELILAMEEGFVML------RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
++ A S A++ G V+ R A K+A + Q +++ + L +
Sbjct: 141 EVTMEAFFSLMGHDALQAGRVLASHRRLAREALKRAHV---QQTSIDEDELVHYLNVFKS 197
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
N F ++ + AVG Y + S +NH C PNA I++ D A A +
Sbjct: 198 NNFTLD----------------DQEMFAVGEGTYPVASLFNHTCRPNAVIVF-DGALAEI 240
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A+ +E E+ I YID + AR R+ L + + F C C+RC+
Sbjct: 241 RAIDTIEPDTEITISYIDPAHARAHRKRALREKYFFDCRCVRCT 284
>gi|19570495|dbj|BAB86333.1| histone methyltransferase [Homo sapiens]
gi|158260233|dbj|BAF82294.1| unnamed protein product [Homo sapiens]
gi|261858444|dbj|BAI45744.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 111 KLMDGAPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|332812383|ref|XP_514316.3| PREDICTED: SET and MYND domain-containing protein 3 [Pan
troglodytes]
gi|397473047|ref|XP_003808033.1| PREDICTED: SET and MYND domain-containing protein 3 [Pan paniscus]
gi|410208860|gb|JAA01649.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410254410|gb|JAA15172.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410289336|gb|JAA23268.1| SET and MYND domain containing 3 [Pan troglodytes]
gi|410332281|gb|JAA35087.1| SET and MYND domain containing 3 [Pan troglodytes]
Length = 428
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 111 KLMDGAPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|312208186|pdb|3PDN|A Chain A, Crystal Structure Of Smyd3 In Complex With
Methyltransferase Inhibitor Sinefungin
gi|323714509|pdb|3OXL|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Ii)
gi|21410974|gb|AAH31010.1| SMYD3 protein [Homo sapiens]
gi|123984179|gb|ABM83506.1| SET and MYND domain containing 3 [synthetic construct]
gi|123998243|gb|ABM86723.1| SET and MYND domain containing 3 [synthetic construct]
Length = 428
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 111 KLMDGAPSESEKLYSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|149036368|gb|EDL90986.1| rCG56113, isoform CRA_b [Rattus norvegicus]
Length = 477
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 117/287 (40%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 223 LRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQKG 269
>gi|267844824|ref|NP_001161212.1| SET and MYND domain-containing protein 3 isoform 1 [Homo sapiens]
gi|212276523|sp|Q9H7B4.4|SMYD3_HUMAN RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|119597548|gb|EAW77142.1| SET and MYND domain containing 3, isoform CRA_c [Homo sapiens]
Length = 428
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 116/298 (38%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATAKRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 111 KLMDGAPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|327533748|pdb|3QWP|A Chain A, Crystal Structure Of Set And Mynd Domain Containing 3;
Zinc Finger Mynd Domain-Containing Protein 1
Length = 429
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 3 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 62
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 63 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 111
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 112 KLMDGAPSESEKLYSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQLPPAFD 171
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 172 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 214
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 215 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 271
>gi|323714506|pdb|3OXF|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
gi|323714507|pdb|3OXF|B Chain B, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form I)
Length = 436
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 111 KLMDGAPSESEKLYSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|333361525|pdb|3RU0|A Chain A, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
gi|333361526|pdb|3RU0|B Chain B, Cocrystal Structure Of Human Smyd3 With Inhibitor
Sinefungin Bound
Length = 438
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 4 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 63
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 64 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 112
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 113 KLMDGAPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 172
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 173 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 215
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 216 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 272
>gi|348577081|ref|XP_003474313.1| PREDICTED: SET and MYND domain-containing protein 3-like [Cavia
porcellus]
Length = 428
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 121/293 (41%), Gaps = 31/293 (10%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A +R +L+ + P+ + + VC CL +
Sbjct: 2 EPLKVEKFTTADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDHCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIIS 192
Q A++C C+ A W C+ S +YP RL ++
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WQDHKQECKCLKSCKPRYPPDSVRLLGRVV- 109
Query: 193 GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQI 252
I +++ E + + F L S K D ++ + + + +
Sbjct: 110 -------IKLMEKTPSESEKLYS----FYDLESNINKLTEDKKEGLRQLAMTFQHFMREE 158
Query: 253 RINAFRIELAGGLYE---DLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWID 308
+A ++ + ++E ++ ++ +E+ VG +Y S NH CDPN I++ +
Sbjct: 159 IQDASQLPPSFDIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVF-N 217
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 218 GPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|412988585|emb|CCO17921.1| SET and MYND domain-containing protein 5 [Bathycoccus prasinos]
Length = 477
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 14/131 (10%)
Query: 243 QWYTNVLAQIRINAFRIEL---------AGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
+WY ++ + +N+F++E G + + +SI + + G+AIY S
Sbjct: 343 RWYADLTTRFHLNSFKVEYPVSASSSSSQGDFRAMMEQTLTSSIRTGASNGSAIYAYGSM 402
Query: 294 YNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDA----SMARDARQAILTQGF 348
+NH C PN ++ W + N +A ++++GE+L I YID S+ R+A L + +
Sbjct: 403 FNHSCAPNVNVTWPERNHLVEFVANENIKQGEQLTIAYIDLNEDWSLNVAKRRAQLEEAY 462
Query: 349 GFQCNCLRCSS 359
GF C C RC S
Sbjct: 463 GFVCECPRCVS 473
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 28/130 (21%)
Query: 86 ESAGRGVFATRRIRASDLIHTA--------------------KPIITHPTLSTLNSVCYF 125
E GRG++A+R + ++ I KPI +HPT+ + CY
Sbjct: 70 EKNGRGLYASRDVTKNESIFKKKKKKNDNNNNKAGGGTEGRQKPITSHPTIWNYDLECYL 129
Query: 126 CLRKITSSSQHFQHH-------NARFCGEVCKDNAKA-FYDVERRADWSAFNDYCRSQGL 177
CL +++ + ++ + RFC C+ +A+A FY+ E+ D YC L
Sbjct: 130 CLGELSKKKKKNNNNNNEEVSSDGRFCSVECERDARASFYETEQTMDLRRMEKYCEENEL 189
Query: 178 KYPLLVKRLA 187
K+PL+ R+A
Sbjct: 190 KFPLMALRIA 199
>gi|336465134|gb|EGO53374.1| hypothetical protein NEUTE1DRAFT_150707 [Neurospora tetrasperma
FGSC 2508]
Length = 668
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 125/310 (40%), Gaps = 44/310 (14%)
Query: 72 PSQPSP-PPIQVALTESA--GRGVFATRRIRASDLIHTA-KPIITHPTLSTLNSVCYFCL 127
PS P P P +V GR +F+T R A + I P++ P + + + C +CL
Sbjct: 3 PSLPPPIPKTRVGGIPGVDRGRSLFSTERFGAGETIAIIDNPLLALPDDANMRTTCNYCL 62
Query: 128 RKITSSSQHFQ---HHNARFCGEVCKDNAKAFYDVE-RRADWSAFNDYCRSQGLKYPLLV 183
S + F+ R C A + + E +RA W LV
Sbjct: 63 --YVSGTIEFEGDVEAGPRTCKACTGCKAAVYCNAECQRAHWK---------------LV 105
Query: 184 KRLACMIISGAESADCIDIL-QPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK 242
+ C + + D L P +++L ++ G AF G + ++
Sbjct: 106 HKAECKMFKRIKERTGKDWLPTPVRAVAQVMLLLKAGDEETVKAFGPGGTLESNVEGFKT 165
Query: 243 QWYTNVLAQIRINAF-RIELAGGLYEDLLSSAAASIESEIAV----------GNA-IYML 290
+ + A + AG L D A + ++ G A IY+
Sbjct: 166 D--EGLWGDFELQATGAVVYAGLLQSDETLKQAMEVLCKMQTNAFNRFDADTGQAGIYLH 223
Query: 291 PSF--YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGF 348
PS NH C PNA+I + + A L A RD+E G+E+ I YID +M R ARQ L + +
Sbjct: 224 PSLSMVNHSCVPNAYIAF-EKRKAFLKAERDIEPGDEILISYIDHTMPRRARQESL-RLY 281
Query: 349 GFQCNCLRCS 358
FQCNC+RC
Sbjct: 282 HFQCNCIRCK 291
>gi|323714508|pdb|3OXG|A Chain A, Human Lysine Methyltransferase Smyd3 In Complex With
Adohcy (Form Iii)
Length = 464
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 38 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 97
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 98 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 146
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ GA ES S +L +EG VM F + I D
Sbjct: 147 KLMDGAPSESEKLYSFYDLESNINKLTEDRKEGLRQLVMTFQHFMREEIQDASQLPPAFD 206
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 207 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 249
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 250 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 306
>gi|236461142|ref|NP_033892.2| SET and MYND domain-containing protein 1 isoform 2 [Mus musculus]
gi|74209105|dbj|BAE24950.1| unnamed protein product [Mus musculus]
gi|148666517|gb|EDK98933.1| SET and MYND domain containing 1, isoform CRA_a [Mus musculus]
Length = 477
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 117/287 (40%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + +A Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECAAIKKYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 223 LRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKG 269
>gi|5870834|gb|AAC53022.2| skm-BOP2 [Mus musculus]
Length = 472
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 117/287 (40%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + +A Y K P RLA I+ E +
Sbjct: 64 KFAHYCDRTCQKDAW----LNHKNECAAIKKYG-----KVPNENIRLAARIMWRVER-EG 113
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 114 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 174 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 217
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 218 LRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKG 264
>gi|196009510|ref|XP_002114620.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
gi|190582682|gb|EDV22754.1| hypothetical protein TRIADDRAFT_28394 [Trichoplax adhaerens]
Length = 398
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 89/184 (48%), Gaps = 16/184 (8%)
Query: 176 GLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
G ++ ++ L +I+ S+ + P L + G + S+F + + +
Sbjct: 204 GKQFVEQIEILRLELINTLPSSKITEWYTPGGFRSLLAMIGTNGQGIASSSFSQYARNVD 263
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
KF ++ + IN+F LY D+ + ++ E G+ +Y+L S N
Sbjct: 264 AAKFEKQE-------EDYINSF----LDQLYADMNEESGDFLDCE---GSGLYLLQSCCN 309
Query: 296 HDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMAR-DARQAILTQGFGFQCN 353
HDC PN I ++D NA + A+R++ EG+EL I YID+ + RQAIL + + F+C
Sbjct: 310 HDCSPNVEINFLDNNATLTVKAIRNISEGQELCISYIDSDIKNWKKRQAILMENYLFECT 369
Query: 354 CLRC 357
C RC
Sbjct: 370 CNRC 373
>gi|388454549|ref|NP_001252862.1| SET and MYND domain-containing protein 3 [Macaca mulatta]
gi|380787707|gb|AFE65729.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
gi|383412903|gb|AFH29665.1| SET and MYND domain-containing protein 3 isoform 1 [Macaca mulatta]
Length = 428
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 117/298 (39%), Gaps = 41/298 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVTALRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 110
Query: 189 MIISG--AESADCIDILQPASLSPELILAMEEGF---VMLRSAFKKAGIDDEQMKFLNKQ 243
++ G +ES S +L +EG VM F + I D
Sbjct: 111 KLMDGTPSESEKLYSFYDLESNINKLTEDKKEGLRQLVMTFQHFMREEIQDASQLPPAFD 170
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A++ N+F I A ++ VG +Y S NH CDPN
Sbjct: 171 LF-EAFAKVICNSFTI-------------CNAEMQE---VGVGLYPSISLLNHSCDPNCS 213
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC + D
Sbjct: 214 IVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQTQD 270
>gi|395852657|ref|XP_003798851.1| PREDICTED: SET and MYND domain-containing protein 3 [Otolemur
garnettii]
Length = 428
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 75/294 (25%), Positives = 121/294 (41%), Gaps = 33/294 (11%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFATANRGNGLRAVVALRPGELLFRSDPLAYTVCKGSRGIVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII---- 191
Q A++C C+ KA+ D +R +S +YP RL +I
Sbjct: 62 CSQCQVAKYCSAKCQ--KKAWQDHKRECKC------LKSCKPRYPPDSVRLLGRVIFKLM 113
Query: 192 --SGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ-MKFLNKQWYT-N 247
+ +ES S +L +EG L F+ ++ Q + L +
Sbjct: 114 EETPSESEKLYSFYDLESNIKKLTEDKKEGLRQLAMTFQHFMREEIQDVSQLPPSFDVFE 173
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
A++ N+F I A ++ VG +Y S NH CDPN I++
Sbjct: 174 AFAKVICNSFTI-------------CNAEMQE---VGVGLYPSMSLLNHSCDPNCSIVF- 216
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ L A+RD+E GEEL ICY+D M + R+ L + F+C+CLRC + D
Sbjct: 217 NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCLRCQTQD 270
>gi|156368233|ref|XP_001627600.1| predicted protein [Nematostella vectensis]
gi|156214514|gb|EDO35500.1| predicted protein [Nematostella vectensis]
Length = 377
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 118/290 (40%), Gaps = 46/290 (15%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS 134
P P +V TES GRG+ A + +++ D I + +P++ + C FCL K++
Sbjct: 3 PKPSVFEVFETESKGRGLRAAKPLKSGDTILSEQPVVYMLSNMLRGQRCDFCLEKLSDLQ 62
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
+ + AR+CG C+ RA W C +P + L ++
Sbjct: 63 RCSRCKFARYCGASCQ-----------RAAWRIHKSECERLKRVFPRVPTDLVLLMFRVW 111
Query: 195 ESADCIDILQPASLSPELILAMEEGFV-MLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIR 253
+ L + G+ L S +K D K+ + +VL +
Sbjct: 112 Q------------------LKSQNGWYDSLVSNVEKIDSD-------AKEDFVSVL--MV 144
Query: 254 INAF---RIELAGGL--YEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWI 307
+N + I GL + + ++ A + E+ A+G I+ NH C PN+ +
Sbjct: 145 LNEYLGSEISPPEGLELFSKISCNSFAICDGEMQAIGTGIFPNAVCLNHSCAPNS-VAVF 203
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + + AL ++ GEEL I YI R+ RQ L F F C C RC
Sbjct: 204 NGTNIYIKALEEIPVGEELTISYIQQLHPRETRQEELQTQFCFYCQCHRC 253
>gi|348668850|gb|EGZ08673.1| hypothetical protein PHYSODRAFT_352669 [Phytophthora sojae]
Length = 1354
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 79/155 (50%), Gaps = 16/155 (10%)
Query: 204 QPASLSPELILAM-EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
Q ++SP + + +E ++ S + G ++E+++ L Q R+ EL
Sbjct: 1153 QTNNMSPRVQPQLSDEDQTIIASVRRVLGWEEERLQVLQ--------TQERLERNDGEL- 1203
Query: 263 GGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEE 322
++ L+ SIE G A++ + NH CDPN +++ N D ++A+RD+ +
Sbjct: 1204 DSEFDPLMPLGYPSIE-----GTALFPIICTMNHSCDPNCTVLYTKNGDGHVVAIRDIHK 1258
Query: 323 GEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
GEEL ICYID M R+A L + + F+C C RC
Sbjct: 1259 GEELCICYIDVDMDVQMREANLRE-YKFKCFCSRC 1292
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 34/61 (55%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G A++ NH CDPN +++ + A + A++D++EGEEL I YID + R
Sbjct: 390 GTALFSGICTMNHSCDPNCTVLYTKDGAAHVFAVQDIQEGEELCISYIDVDQEVEERAEC 449
Query: 344 L 344
L
Sbjct: 450 L 450
>gi|338722720|ref|XP_001914950.2| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 3 [Equus caballus]
Length = 440
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 130/303 (42%), Gaps = 39/303 (12%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
PP ++ +T G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPPKVEKFITADKGNGLRALVPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII---- 191
Q A++C C+ KA+ D +R +S +YP RL ++
Sbjct: 62 CSQCRVAKYCSAKCQ--KKAWQDHKRECK------CLKSCKPRYPPDSVRLLGRVVFKLM 113
Query: 192 --SGAESADCIDILQPASLSPELILAMEEG-------FVMLRSAFKKAGIDDEQMKFLNK 242
+ +ES +L +++++G + + KK G+ M F
Sbjct: 114 EETPSESEKLYHVLHND-------ISVDDGPHISWWAHINKLTEDKKEGLRQLAMTF--- 163
Query: 243 QWYTNVLAQIRINAFRIELAGGLYE---DLLSSAAASIESEI-AVGNAIYMLPSFYNHDC 298
+ + + +A ++ + ++E ++ ++ +E+ VG +Y S NH C
Sbjct: 164 ---QHFMREEIQDASQLPPSFDVFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSC 220
Query: 299 DPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
DPN I++ + L A+RD+E GEEL ICY+D M + R+ L + F+C+C RC
Sbjct: 221 DPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCFRCQ 279
Query: 359 SGD 361
+ D
Sbjct: 280 TQD 282
>gi|417400767|gb|JAA47309.1| Hypothetical protein [Desmodus rotundus]
Length = 428
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 115/300 (38%), Gaps = 45/300 (15%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A +R +L+ + P+ + VC CL +
Sbjct: 2 EPLKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMII- 191
Q A++C C+ A W C+ S +YP RL ++
Sbjct: 62 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVV 110
Query: 192 -----SGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ--MKFLNKQW 244
+ +ES S +L +EG L F+ ++ Q + L
Sbjct: 111 KLMEETPSESEKLYSFYDLESNISKLTEEKKEGIRQLAMTFQHFMREEIQDVSQLLPAFD 170
Query: 245 YTNVLAQIRINAFRI---ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
A++ NAF I E+ G VG +Y S NH CDPN
Sbjct: 171 IFEAFAKVFCNAFTICNAEMQG-------------------VGVGLYPSMSLLNHSCDPN 211
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
I++ + L A+RD+E GEEL +CY+D M + R+ L + F C+C+RC + D
Sbjct: 212 CSIVF-NGPHLLLRAVRDIETGEELTVCYLDLLMTSEERRKHLRSQYCFDCDCVRCQTQD 270
>gi|350295434|gb|EGZ76411.1| hypothetical protein NEUTE2DRAFT_98210 [Neurospora tetrasperma FGSC
2509]
Length = 547
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/311 (27%), Positives = 121/311 (38%), Gaps = 68/311 (21%)
Query: 72 PSQPSPPP---IQVALTESAGRGVFATRRIRASDLIHTA-KPIITHPTLSTLNSVCYFCL 127
PS P P P I GR +F+T A + I P++ P + + + C +CL
Sbjct: 3 PSLPPPIPKTRIGGIPGVDRGRSLFSTESFGAGETIAIIDNPLLALPDDANMRTTCNYCL 62
Query: 128 RKITSSSQHFQ---HHNARFCGEVCKDNAKAFYDVE-RRADWSAFNDYCRSQGLKYPLLV 183
S + F+ R C A + + E +RA W LV
Sbjct: 63 --YVSGTIEFEGDVEAGPRTCKACTGCKAAVYCNAECQRAHWK---------------LV 105
Query: 184 KRLACMIISGAESADCIDILQPASLSPELILAME-EGF-----------VMLRSAFKKAG 231
+ C + + D L + P L EGF + A AG
Sbjct: 106 HKAECKMFKRIKERTGKDWLPTPAFGPGGTLESNVEGFKTDEGLWGDFELQATGAVVYAG 165
Query: 232 I--DDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNA-IY 288
+ DE +K VL +I+ NAF + G A IY
Sbjct: 166 LLQSDETLK-----QAMEVLCKIQTNAFN-------------------RFDADTGQAGIY 201
Query: 289 MLPSF--YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQ 346
+ PS NH C PNA+I + + A L A RD+E G+E+ I YID +M R ARQ L +
Sbjct: 202 LHPSLSMVNHSCVPNAYITF-EKRKAVLKAERDLEPGDEILISYIDHTMPRRARQESL-R 259
Query: 347 GFGFQCNCLRC 357
+ FQCNC+RC
Sbjct: 260 LYHFQCNCIRC 270
>gi|291231170|ref|XP_002735533.1| PREDICTED: SMYD family member 5-like [Saccoglossus kowalevskii]
Length = 395
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/99 (39%), Positives = 59/99 (59%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEG 323
LY D+ + + + E G+A+Y L S NH C PNA + + DN A +MAL+D++E
Sbjct: 279 LYVDIEKESGSFLNCE---GSALYSLQSCCNHSCVPNAEVTFPDNDAAVSVMALQDIQEN 335
Query: 324 EELRICYI---DASMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+ D +R +RQ IL + + F CNC++C S
Sbjct: 336 EEICISYLGECDIGRSRHSRQKILRENYLFNCNCMKCQS 374
>gi|50550485|ref|XP_502715.1| YALI0D11792p [Yarrowia lipolytica]
gi|74634629|sp|Q6C9E7.1|SET5_YARLI RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49648583|emb|CAG80903.1| YALI0D11792p [Yarrowia lipolytica CLIB122]
Length = 438
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/344 (21%), Positives = 130/344 (37%), Gaps = 66/344 (19%)
Query: 49 ISRLQSLHLQKRQLCSTAT--HNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHT 106
+ + +L L + + C + + P P + + + ++ G+G++A + I+ + +
Sbjct: 76 VMKAANLGLNQEKFCYAGSIMSHATPGLDMPEKVTLNIAKNRGKGLYAKKDIKKDEELWN 135
Query: 107 AKPIITHPTLSTLNSV-----CYFCLRKITSSSQHFQHHN--------ARFCGEVCKDNA 153
+ + P L + + C FC S+ FQ HN A++C CK
Sbjct: 136 EQAFLLVPPLEHVGIMRKGMGCAFC-------SRPFQSHNGVECPSCAAKWCDNQCKKKD 188
Query: 154 K---AFYDVER-----RADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQP 205
A + R R +W AF D+C G + L +I + D +Q
Sbjct: 189 VTHVAMWHESRHGKVTRVEWRAFEDFCVENGWMALYALGLLWLRVIRDPKK----DEVQK 244
Query: 206 ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGL 265
+ AF + G DE+ K + + +N L + + +
Sbjct: 245 QMM-----------------AFARVG-QDERHKAVEQ---SNSLFASEQSEVLWKKGYEM 283
Query: 266 YEDLLSSAAASIESEIAVG----------NAIYMLPSFYNHDCDPNAHIMWIDNADA-RL 314
+ LL S E + G +Y+ S NH C+PN + + +
Sbjct: 284 LDKLLLDTEFSYEKDFLPGLGMFNINNVDGNMYLTQSHLNHSCEPNVDVKNVGRTQGISV 343
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A RD++ GEEL Y++ D R+ L +GF CNC RC
Sbjct: 344 RAKRDIKTGEELFTTYVNPEHQLDDRRYNLRVNWGFNCNCTRCK 387
>gi|149040852|gb|EDL94809.1| similar to SET and MYND domain containing 3, isoform CRA_b [Rattus
norvegicus]
Length = 428
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 116/289 (40%), Gaps = 41/289 (14%)
Query: 85 TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARF 144
T G G+ A +R +L+ + P+ + + VC CL + Q A++
Sbjct: 11 TADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKY 70
Query: 145 CGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLACMIISG--AE 195
C C+ A W C+ S +YP L+ R+ ++ G +E
Sbjct: 71 CSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGKPSE 119
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFK---KAGIDDEQMKFLNKQWYTNVLAQI 252
S S +L +EG L F+ + I D + + A++
Sbjct: 120 SEKLYSFYDLESNISKLTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLF-EAFAKV 178
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
N+F I A ++ VG +Y S NH CDPN I++ +
Sbjct: 179 ICNSFTI-------------CNAEMQE---VGVGLYPSMSLLNHSCDPNCSIVF-NGPHL 221
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+R++E GEEL ICY+D M + R+ L + F+C+C+RC + D
Sbjct: 222 LLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCPTQD 270
>gi|71043910|ref|NP_001020933.1| SET and MYND domain-containing protein 3 [Rattus norvegicus]
gi|66910630|gb|AAH97455.1| SET and MYND domain containing 3 [Rattus norvegicus]
Length = 428
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 116/289 (40%), Gaps = 41/289 (14%)
Query: 85 TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARF 144
T G G+ A +R +L+ + P+ + + VC CL + Q A++
Sbjct: 11 TADRGNGLRAVAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMRCSQCRIAKY 70
Query: 145 CGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLACMIISG--AE 195
C C+ A W C+ S +YP L+ R+ ++ G +E
Sbjct: 71 CSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVVKLMDGKPSE 119
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFK---KAGIDDEQMKFLNKQWYTNVLAQI 252
S S +L +EG L F+ + I D + + A++
Sbjct: 120 SEKLYSFYDLESNISKLTEDKKEGLRQLAMTFQHFMREEIQDASQLPPSFDLF-EAFAKV 178
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
N+F I A ++ VG +Y S NH CDPN I++ +
Sbjct: 179 ICNSFTI-------------CNAEMQE---VGVGLYPSMSLLNHSCDPNCSIVF-NGPHL 221
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+R++E GEEL ICY+D M + R+ L + F+C+C+RC + D
Sbjct: 222 LLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTQD 270
>gi|389744421|gb|EIM85604.1| MAS20-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 549
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 141/335 (42%), Gaps = 78/335 (23%)
Query: 90 RGVFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKITSSSQHFQHHN---ARF 144
R + T+ A DLI+ +P++ L S C CLRKI + ++ + +
Sbjct: 190 RILVVTKDFNAGDLIYKEQPVVAVLDADLEGKGSYCSHCLRKIEDDIVYRIENDLYKSGY 249
Query: 145 CGEVCKDNAKAF-----------------------YDVERRADWSAFNDYCRSQGLKYPL 181
C + C+ +K+ + RR + F Y + Q P+
Sbjct: 250 CSKECQTKSKSLSQNLLFGTDSALPAELSTEMPGDFTAARREAQTEFAKYVKDQDKSVPM 309
Query: 182 LVKRLACMIISGAESADCIDILQPASLSPELI---------------LAME---EGFVML 223
LV R IS E+A + P S + E + L +E + +L
Sbjct: 310 LVARFIARQIS-EETAKMV----PGSAASEAVHSDYSLYDHLERLRYLEVEAPAQETKLL 364
Query: 224 RSAFKKA--GIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEI 281
RS F+ A G++ F+ + + + ++ NA+ + GG +D + ++E
Sbjct: 365 RSVFQTALPGLE----AFITDERHATLKGKMAYNAYGVVYNGGR-DDRPEVSERPEDAEK 419
Query: 282 A---------VGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICY 330
VG+ +Y + ++ H CDP+A I NAD+ L +A RD++ G+E+ + +
Sbjct: 420 TRTPYGTQRQVGSGLYSVSAYICHSCDPSA-IPVFKNADSELYLIATRDLKAGDEITVSF 478
Query: 331 IDASM-----ARDARQAI---LTQGFGFQCNCLRC 357
+D+S+ A DAR+ L +G+ F C C RC
Sbjct: 479 VDSSVHEGESAADARRRRRFELARGWRFPCPCERC 513
>gi|410985721|ref|XP_003999165.1| PREDICTED: SET and MYND domain-containing protein 3 [Felis catus]
Length = 428
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 252 QLRDQYCFECDCIRCQTQD 270
>gi|332236389|ref|XP_003267386.1| PREDICTED: SET and MYND domain-containing protein 3 [Nomascus
leucogenys]
Length = 369
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 134 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 192
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+CLRC + D
Sbjct: 193 QLRDQYCFECDCLRCQTQD 211
>gi|297266487|ref|XP_001093225.2| PREDICTED: SET and MYND domain-containing protein 1 [Macaca
mulatta]
Length = 399
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 113/286 (39%), Gaps = 32/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFGMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 223 LRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 268
>gi|119597486|gb|EAW77080.1| SET and MYND domain containing 1, isoform CRA_d [Homo sapiens]
Length = 497
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 114/286 (39%), Gaps = 32/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 223 LRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 268
>gi|291402060|ref|XP_002717678.1| PREDICTED: SET and MYND domain containing 3 [Oryctolagus cuniculus]
Length = 428
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 252 QLRDQYCFECDCFRCETQD 270
>gi|197692956|gb|ACH71266.1| SET and MYND domain-containing 3 [Sus scrofa]
Length = 369
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 134 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 192
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 193 QLRDQYCFECDCFRCQTQD 211
>gi|348566409|ref|XP_003468994.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Cavia porcellus]
Length = 477
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/287 (23%), Positives = 115/287 (40%), Gaps = 32/287 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKHECSAIKRYG-----KVPNENVRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNGKIE 222
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + GEEL + YID + R+ L + + F C C C G
Sbjct: 223 LRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKG 269
>gi|342184438|emb|CCC93920.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 471
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 11/112 (9%)
Query: 250 AQIRINAFRIEL-AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
RI+AF +L A G+ LS + G +Y + +NH C+PN +++
Sbjct: 345 GHARIDAFEEKLRAAGVDPQKLSCGSR--------GQGVYTIGCLFNHSCEPNLQVLYTV 396
Query: 309 NADARLM--ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ D L ALRD+E GEEL I YID ++ RQ L + + FQC C RC+
Sbjct: 397 SGDETLSIEALRDIEPGEELNISYIDDTLPYPRRQLSLLEHYLFQCACPRCT 448
>gi|294662600|pdb|3MEK|A Chain A, Crystal Structure Of Human Histone-Lysine N-
Methyltransferase Smyd3 In Complex With S-Adenosyl-L-
Methionine
Length = 429
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 115/297 (38%), Gaps = 39/297 (13%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T + G G+ A +R +L+ + P+ + VC CL +
Sbjct: 3 EPLKVEKFATANRGNGLRAVTPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLXR 62
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYP----LLVKRLAC 188
Q A++C C+ A W C+ S +YP L+ R+
Sbjct: 63 CSQCRVAKYCSAKCQKKA-----------WPDHKRECKCLKSCKPRYPPDSVRLLGRVVF 111
Query: 189 MIISGA--ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYT 246
+ GA ES S +L +EG L F+ ++ Q +
Sbjct: 112 KLXDGAPSESEKLYSFYDLESNINKLTEDRKEGLRQLVXTFQHFXREEIQDASQLPPAFD 171
Query: 247 --NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
A++ N+F I + E+ VG +Y S NH CDPN I
Sbjct: 172 LFEAFAKVICNSFTI--------------CNAEXQEVGVG--LYPSISLLNHSCDPNCSI 215
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
++ + L A+RD+E GEEL ICY+D + R+ L + F+C+C RC + D
Sbjct: 216 VF-NGPHLLLRAVRDIEVGEELTICYLDXLXTSEERRKQLRDQYCFECDCFRCQTQD 271
>gi|354475964|ref|XP_003500195.1| PREDICTED: SET and MYND domain-containing protein 3 [Cricetulus
griseus]
Length = 428
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 70/295 (23%), Positives = 121/295 (41%), Gaps = 35/295 (11%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A +R +L+ P+ + VC CL +
Sbjct: 2 EPLKVEKFATADRGNGLRAVVPLRPGELLFRCDPLAYTVCKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIIS 192
Q A++C C+ A W C+ S +YP RL +I
Sbjct: 62 CSQCRIAKYCSAKCQKKA-----------WPDHKQECKCLKSCKPRYPPDSVRLLARVI- 109
Query: 193 GAESADCIDILQPASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
+ ++ E + + E + + KK G+ + F Q +T
Sbjct: 110 -------VKLMDEKPSESEKLYSFYDLESNINKLTEDKKEGLRQLALTF---QHFTREEI 159
Query: 251 QIRINAFRIELAGGLYE---DLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMW 306
Q +A ++ + L+E ++ ++ +E+ VG +Y S NH CDPN I++
Sbjct: 160 Q---DASQLPPSFDLFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVF 216
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ L A+R++E GEEL ICY+D M + R+ L + F+C+C+RC + D
Sbjct: 217 -NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRKQLRDQYCFECDCIRCQTHD 270
>gi|403288342|ref|XP_003935365.1| PREDICTED: SET and MYND domain-containing protein 3 [Saimiri
boliviensis boliviensis]
Length = 369
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN +++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 134 VGVGLYPSISLLNHSCDPNCSVVF-NGPHLLLRAVRDIEVGEELTICYLDTLMTSEERRK 192
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 193 QLRDQYCFECDCFRCQTQD 211
>gi|355559119|gb|EHH15899.1| hypothetical protein EGK_02058, partial [Macaca mulatta]
Length = 373
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 138 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 196
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 197 QLRDQYCFECDCFRCQTQD 215
>gi|426239579|ref|XP_004013697.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 2 [Ovis
aries]
Length = 428
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 121/290 (41%), Gaps = 25/290 (8%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
P ++ T G G+ A +R +L+ + P+ + + VC CL +
Sbjct: 2 EPSKVEKFSTTDRGNGLRALAPLRPGELLFRSDPLAYTVSKGSRGVVCDRCLLGKEKLMR 61
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
Q A++C C+ KA+ D +R +S +YP RL ++
Sbjct: 62 CSQCRIAKYCSAKCQ--KKAWQDHKRECKC------LKSCKPRYPPDSVRLLGRVV---- 109
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRIN 255
++Q E + + F L S K D ++ + + + + +
Sbjct: 110 ----FKLMQETPSESEKLYS----FYDLESNINKLTEDKKEGLRQLALTFQHFMREEIQD 161
Query: 256 AFRIELAGGLYE---DLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
A ++ + ++E ++ ++ +E+ VG +Y S NH CDPN I++ +
Sbjct: 162 ASQLPPSFDIFEAFAKVICNSFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVF-NGPH 220
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L +RDVE GEEL ICY+D M + R+ L + F C+C RC + D
Sbjct: 221 LLLRTVRDVEAGEELTICYLDMLMTSEERRKQLRDQYCFDCDCFRCQTQD 270
>gi|355746250|gb|EHH50875.1| hypothetical protein EGM_01766, partial [Macaca fascicularis]
Length = 352
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 117 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 175
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 176 QLRDQYCFECDCFRCQTQD 194
>gi|12232401|ref|NP_073580.1| SET and MYND domain-containing protein 3 isoform 2 [Homo sapiens]
gi|10437096|dbj|BAB14981.1| unnamed protein product [Homo sapiens]
gi|119597549|gb|EAW77143.1| SET and MYND domain containing 3, isoform CRA_d [Homo sapiens]
Length = 369
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 134 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 192
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 193 QLRDQYCFECDCFRCQTQD 211
>gi|355720910|gb|AES07090.1| SET and MYND domain containing 3 [Mustela putorius furo]
Length = 251
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 16 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 74
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 75 QLRDQYCFECDCFRCQTQD 93
>gi|307108187|gb|EFN56428.1| hypothetical protein CHLNCDRAFT_145047 [Chlorella variabilis]
Length = 383
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 67/134 (50%), Gaps = 9/134 (6%)
Query: 233 DDEQMKFLNKQWYTNVLAQIRINAFRIELAGGL--------YEDLLSSAAASIESEIAVG 284
+ E + L+ W+ + +A++ +N+FR++ L +S A S + G
Sbjct: 241 EPELRRQLSLAWFCDAMARLHLNSFRVDTVPPLDASDPSALLRAAAASLAGSASAAHHTG 300
Query: 285 NAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
+A Y+L S NH C+P+ + + +NA A A R + GE+L + Y+DA AR+
Sbjct: 301 SAAYLLASLLNHSCEPSLGVSFPRNNAVAAFTAARHIARGEQLTVSYVDAGQGLAARRQA 360
Query: 344 LTQGFGFQCNCLRC 357
L +GF C C RC
Sbjct: 361 LAWAYGFTCRCPRC 374
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 13/125 (10%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P IQ+ AGRG+FA I ++H P++ P+ L+S C+ CLR + +
Sbjct: 25 PWIQLLPVPGAGRGLFAAADIEQGMVVHEEAPMLCAPSPHALHSTCHHCLRPLAPGAASH 84
Query: 138 QHHNAR------------FCGEVCK-DNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVK 184
++ FC C + R D+ CR G K+PL++
Sbjct: 85 HGGSSSSSPSSSSPSRMPFCSPTCSAAAQGEWAAAAARCDFGPLQQACREAGEKFPLMLA 144
Query: 185 RLACM 189
RLAC+
Sbjct: 145 RLACI 149
>gi|48257059|gb|AAH17079.2| SMYD3 protein, partial [Homo sapiens]
Length = 287
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 52 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 110
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 111 QLRDQYCFECDCFRCQTQD 129
>gi|297661482|ref|XP_002809270.1| PREDICTED: SET and MYND domain-containing protein 3-like [Pongo
abelii]
gi|426334412|ref|XP_004028746.1| PREDICTED: SET and MYND domain-containing protein 3-like [Gorilla
gorilla gorilla]
gi|28200379|gb|AAO31695.1| hypothetical protein FLJ21080 [Homo sapiens]
gi|119597546|gb|EAW77140.1| SET and MYND domain containing 3, isoform CRA_b [Homo sapiens]
Length = 258
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 23 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 81
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 82 QLRDQYCFECDCFRCQTQD 100
>gi|330846003|ref|XP_003294847.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
gi|325074603|gb|EGC28624.1| hypothetical protein DICPUDRAFT_159919 [Dictyostelium purpureum]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 61/116 (52%), Gaps = 12/116 (10%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
N+L + R NAF I+ G D S + G +Y+ SF+NH CDPN +
Sbjct: 275 NILLRNRCNAFYIQ---GRPRDGTSGESR--------GCGVYVRNSFFNHSCDPNVNYWV 323
Query: 307 IDNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
++N + L++V+EGEEL I YID S + R+ L +G+ F C C +C + +
Sbjct: 324 VNNTLEVECTLLKNVKEGEELTISYIDTSAPLNKRREKLLEGYLFTCLCTKCKADE 379
>gi|74204261|dbj|BAE39889.1| unnamed protein product [Mus musculus]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 252 QLRDQYCFECDCIRCQTQD 270
>gi|345328822|ref|XP_001510515.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Ornithorhynchus anatinus]
Length = 477
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 71/292 (24%), Positives = 118/292 (40%), Gaps = 42/292 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +ES GRG+ AT+ A+DLI + + S + VC+ C ++ + Q
Sbjct: 9 VEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDA-----------WLNHKNECSAIKKHG-KAPNENIRLAARIMWRIER 116
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE--------QMKFLNKQWYTNV 248
+ + + +S + + + F K+ +D E Q + + Q+ +++
Sbjct: 117 -EGTGLTEGCLVSIDDLQNHVDSFG--EEEQKELRLDVESFLHFWPPQSQQFSMQYISHI 173
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
I N F + GL AVG ++ NHDC PN +++ +
Sbjct: 174 FGVINCNGFTLSDQRGLQ---------------AVGVGVFPNLCLVNHDCWPNCTVIF-N 217
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
N L AL + EGEEL + YID R+ +L + + F C C C G
Sbjct: 218 NGKIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEHCEKG 269
>gi|402858454|ref|XP_003893720.1| PREDICTED: SET and MYND domain-containing protein 3-like [Papio
anubis]
Length = 258
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 23 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 81
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 82 QLRDQYCFECDCFRCQTQD 100
>gi|119597545|gb|EAW77139.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|119597547|gb|EAW77141.1| SET and MYND domain containing 3, isoform CRA_a [Homo sapiens]
gi|193785773|dbj|BAG51208.1| unnamed protein product [Homo sapiens]
Length = 239
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 63 QLRDQYCFECDCFRCQTQD 81
>gi|21312378|ref|NP_081464.1| SET and MYND domain-containing protein 3 [Mus musculus]
gi|30913566|sp|Q9CWR2.1|SMYD3_MOUSE RecName: Full=SET and MYND domain-containing protein 3; AltName:
Full=Zinc finger MYND domain-containing protein 1
gi|12845900|dbj|BAB26947.1| unnamed protein product [Mus musculus]
gi|30851412|gb|AAH52431.1| SET and MYND domain containing 3 [Mus musculus]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 252 QLRDQYCFECDCIRCQTQD 270
>gi|38494371|gb|AAH61485.1| SET and MYND domain containing 3 [Mus musculus]
gi|148681224|gb|EDL13171.1| SET and MYND domain containing 3, isoform CRA_b [Mus musculus]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 252 QLRDQYCFECDCIRCQTQD 270
>gi|74188167|dbj|BAE37175.1| unnamed protein product [Mus musculus]
Length = 428
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 193 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 251
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 252 QLRDQYCFECDCIRCQTQD 270
>gi|168054064|ref|XP_001779453.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669138|gb|EDQ55731.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 377
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 123/292 (42%), Gaps = 44/292 (15%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRKITSSSQH----FQHHNA 142
GRG+ ATR +RA ++ P++ + + S C C+R ++ + A
Sbjct: 10 GRGMVATRMLRAGAVLLRESPLLLYQEAESAESTAFCSNCMRVLSLPGREKVPCSACDGA 69
Query: 143 RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDI 202
FCG+ C ++A+ R A + K+ L + LA +I+ A +
Sbjct: 70 TFCGKSCFESARVGSHNTRVCSGIAMLQKSKD---KFSLEHRTLARFLIAAYNLA----V 122
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA---FRI 259
P + + L L +E D+E +K L+ + + ++ A +
Sbjct: 123 EDPLAFAKLLELEGQE-------------CDEEVVKSLH-DFVVEIFGKLDGGASLELSL 168
Query: 260 ELAGGLYE-------DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWID--- 308
EL G L L++ + A E ++ G A++ S +NHDC PNA ++D
Sbjct: 169 ELTGKLLARDACNTFGLMAPSCAGEERKVR-GYAMFAQASMFNHDCLPNACRFEYVDIDG 227
Query: 309 --NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
N D + AL D+EEG E+ + Y RQ L + +GF C C RC+
Sbjct: 228 DGNTDVIVRALHDMEEGTEVCLSYFPVDWPYGDRQQKLQEEYGFWCTCARCN 279
>gi|119597550|gb|EAW77144.1| SET and MYND domain containing 3, isoform CRA_e [Homo sapiens]
Length = 170
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSISLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 63 QLRDQYCFECDCFRCQTQD 81
>gi|440907991|gb|ELR58064.1| SET and MYND domain-containing protein 3, partial [Bos grunniens
mutus]
Length = 221
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RDVE GEEL ICY+D M + R+
Sbjct: 19 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDVEAGEELTICYLDMLMTSEERRK 77
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F C+C RC + D
Sbjct: 78 QLRGQYCFDCDCFRCQTQD 96
>gi|409084007|gb|EKM84364.1| hypothetical protein AGABI1DRAFT_117750 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 429
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 120/335 (35%), Gaps = 76/335 (22%)
Query: 78 PPIQVA-LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-----SVCYFCLRKIT 131
P ++V + G+G+ AT I+ ++I P IT P L+ C FC +T
Sbjct: 86 PKVEVKWFDKRKGKGLVATHEIKEGEVIWKEDPFITSPEWGILDLQMKSVACAFCTTPLT 145
Query: 132 SS--------SQHFQHHNARFCGEVCKDNAKAFYDV---------------ERRADWSAF 168
S S ARFC +C + + + R + W AF
Sbjct: 146 GSPLILSCAGSNMASPCTARFCNRLCFTRSAKLHPLLCPTKNPASIPLIQFVRESRWMAF 205
Query: 169 NDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEE---------- 218
Y L++ ++ S +E D + EL MEE
Sbjct: 206 ----------YALVLCTSRLLLESQSEDKSFNDNWEMIHALAEL--GMEERHKYSYNLSG 253
Query: 219 -----------GFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYE 267
+ + AFK+ + K + + N F+ ++ Y
Sbjct: 254 QYEPDRETWRHAYELFIQAFKEPSSTADAKKL------AKIFRKPHPNNFQADIFD--YN 305
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEG 323
+ I IY L S NH CDPN I +D A ++A RD++ G
Sbjct: 306 TGFLRGLGRMSLNIEAHGGIYTLHSHLNHSCDPNVSIRHLDQTTALSRITVIAKRDIKVG 365
Query: 324 EELRICYIDASMARDARQAILTQGFGF-QCNCLRC 357
EEL I Y+D + RQ L +G+GF +C C RC
Sbjct: 366 EELLITYVDPQLPYQTRQNEL-KGWGFGRCTCSRC 399
>gi|344244710|gb|EGW00814.1| SET and MYND domain-containing protein 3 [Cricetulus griseus]
Length = 239
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 63 QLRDQYCFECDCIRCQTHD 81
>gi|26336238|dbj|BAC31804.1| unnamed protein product [Mus musculus]
gi|26338432|dbj|BAC32887.1| unnamed protein product [Mus musculus]
Length = 239
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 63 QLRDQYCFECDCIRCQTQD 81
>gi|349604328|gb|AEP99912.1| SET and MYND domain-containing protein 3-like protein, partial
[Equus caballus]
Length = 358
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH C+PN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 123 VGVGLYPSMSLLNHSCEPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 181
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 182 QLRDQYCFECDCFRCQTQD 200
>gi|426239577|ref|XP_004013696.1| PREDICTED: SET and MYND domain-containing protein 3 isoform 1 [Ovis
aries]
Length = 369
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L +RDVE GEEL ICY+D M + R+
Sbjct: 134 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRTVRDVEAGEELTICYLDMLMTSEERRK 192
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F C+C RC + D
Sbjct: 193 QLRDQYCFDCDCFRCQTQD 211
>gi|148681223|gb|EDL13170.1| SET and MYND domain containing 3, isoform CRA_a [Mus musculus]
Length = 239
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 63 QLRDQYCFECDCIRCQTQD 81
>gi|149040851|gb|EDL94808.1| similar to SET and MYND domain containing 3, isoform CRA_a [Rattus
norvegicus]
Length = 239
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVREIEAGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C+RC + D
Sbjct: 63 QLRDQYCFECDCIRCPTQD 81
>gi|197102816|ref|NP_001125377.1| SET and MYND domain-containing protein 1 [Pongo abelii]
gi|55727869|emb|CAH90687.1| hypothetical protein [Pongo abelii]
Length = 477
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 113/289 (39%), Gaps = 38/289 (13%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDA-----------WLNHKNECLAIKRYG-KVPNENIRLAARIMWRVER 116
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLA 250
+ + + +S + + E F + +D Q + + Q+ +++
Sbjct: 117 -EGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFG 175
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
I N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 176 VINCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIF-NNG 219
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 220 KIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCACEHCQK 268
>gi|431906502|gb|ELK10625.1| SET and MYND domain-containing protein 3 [Pteropus alecto]
Length = 352
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 46/79 (58%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+
Sbjct: 4 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIEAGEELTICYLDMLMTSEERRK 62
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F+C+C RC + D
Sbjct: 63 QLRDQYCFECDCSRCQTQD 81
>gi|353241876|emb|CCA73661.1| hypothetical protein PIIN_07614 [Piriformospora indica DSM 11827]
Length = 394
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 129/337 (38%), Gaps = 72/337 (21%)
Query: 92 VFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKITSSSQ---HFQHHNARFCG 146
+ AT+ +A D I T KP++T L + C +CL+ I + S ++ +C
Sbjct: 23 LVATKDFKAGDTIFTEKPMVTALDADLMGTSQYCSYCLKAIPNGSAIQVPLDPFHSAYCS 82
Query: 147 EVCKDNAKAFYDV------------------------ERRADWSAFNDYCRSQGLKYPLL 182
C+ +A+ Y ER+ F + + G LL
Sbjct: 83 TQCQHDAEEQYQTLLFSTIPPATPSNPAPVRTKAQVEERKKAQEQFVETVKESGKTGALL 142
Query: 183 VKRLACMIIS-------GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
V R +I+ G D D L +L + + LRS +G DE
Sbjct: 143 VLRFVGRLIADEHARLVGENKTD--DGLPKVEEESKLNYSFYDYTERLRSLVLTSGPQDE 200
Query: 236 QM----------------KFLNKQWYTNVLAQIRINAFRIEL-AGGLYEDLLSSAAASIE 278
+ FL+ + + + ++ N + + G + S E
Sbjct: 201 KELAQSREVLRLAVDGMEAFLDDERFMLLKGKMAYNCYGVTYDTGRTNKPSPHGPTESFE 260
Query: 279 SEIA-------VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICY 330
VG+A+Y + S+ NH C P+A ++ A+ ++A +D++ GEE+ + Y
Sbjct: 261 RTRTPHGTMHQVGSALYRVASYINHSCAPSARPVFASGTAELSIVAAQDIKAGEEITVSY 320
Query: 331 IDASM---------ARDARQAILTQGFGFQCNCLRCS 358
+D AR R+ L +G+GF C+C RC+
Sbjct: 321 VDTKKRSKDKNLADARKHRRLELARGWGFACDCTRCA 357
>gi|366987641|ref|XP_003673587.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
gi|342299450|emb|CCC67204.1| hypothetical protein NCAS_0A06460 [Naumovozyma castellii CBS 4309]
Length = 508
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 117/301 (38%), Gaps = 33/301 (10%)
Query: 89 GRGVFATRRIRASDLIHTAK-PIITHPTLSTLN-----SVCYFCLRKITSSSQHFQHHNA 142
GRGV+A + + D+I K PI+ P + L C C + + S H+ N
Sbjct: 125 GRGVYALKHFKKGDMIFEEKLPIVIIPPMEKLELMSKGKCCTLCGKSLYELSTHYIMMNG 184
Query: 143 RFCGEV----CKDNAKAF----------YDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
C + C N K ++++R W F ++C+ + V +
Sbjct: 185 LDCNDCTAVWCSKNCKKMDSCRHSFLKHVNLKKRNHWYKFENFCKEKNFVAGYSVGIIYA 244
Query: 189 MIISGAESADCIDILQP--ASLSPELILAMEE-----GFVMLRSAFKKAGIDDEQM-KFL 240
I+ ++ D I A +S + + + G + S + DD+ + +
Sbjct: 245 TILLNKKNRDSIKTKYESLAQVSQRIRIESSDSTNIGGTLDASSGAQSTNNDDDSGPESV 304
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
+Q Y L + + L + LL +I + IY L SF NH+C+P
Sbjct: 305 WEQAYK--LFKETFPTLNADEGFSLDDFLLDIGKFNINQ---LEGQIYHLYSFINHNCEP 359
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
N R+ A + ++ GEEL Y++ R+ L +GF C C RC +G
Sbjct: 360 NIRYEIDSKLCLRVFARKPIQPGEELLTTYVNPLHGVKLRRRALKVNWGFLCQCARCENG 419
Query: 361 D 361
+
Sbjct: 420 N 420
>gi|407404730|gb|EKF30079.1| hypothetical protein MOQ_006118 [Trypanosoma cruzi marinkellei]
Length = 359
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 60/109 (55%), Gaps = 13/109 (11%)
Query: 254 INAFRIEL--AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
++AF E+ AG + LL S+ G +Y + +NH C+PN +++ ++ D
Sbjct: 234 MDAFEKEVQKAGNEVQLLLQSSR---------GQGVYTVGCLFNHSCEPNLQVVYSESGD 284
Query: 312 ARL--MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L +ALRD+E GEEL I YID S++ RQ L + + F C+C +C
Sbjct: 285 ETLSVVALRDIEPGEELCISYIDESLSYPERQQELYEHYLFVCHCPKCE 333
>gi|395531511|ref|XP_003767821.1| PREDICTED: SET and MYND domain-containing protein 3 [Sarcophilus
harrisii]
Length = 428
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 113/280 (40%), Gaps = 35/280 (12%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
G G+ A +R +L+ ++P+ + VC CL + + Q AR+CG
Sbjct: 15 GNGLRAVAPLRPGELLFRSEPLAYTVCKGSRGVVCERCLCRKEKLLRCSQCKVARYCGSE 74
Query: 149 CKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIISGAESADCIDILQP 205
C+ A W C+ S K+P RL ++ +L+
Sbjct: 75 CQKQA-----------WQDHKRECKCLKSSEPKFPPDSVRLLGKVV--------FKLLRE 115
Query: 206 ASLSPELILAM--EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAG 263
+S + E + + E + S KK G+ M Q Y Q NA ++ A
Sbjct: 116 SSCASEKLYSFFDLESNIKNLSEEKKEGLRHLAMTL---QLYLKEEIQ---NASQLPPAF 169
Query: 264 GLYED----LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD 319
++E + + S VG +Y S NH CDPN I++ + L A+R+
Sbjct: 170 DIFESFAKVICNGFTISNGEMQEVGVGLYPSMSLLNHSCDPNCVIVF-EGPSLFLRAIRN 228
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ GEEL ICY+D M RQ L + + F C+C C +
Sbjct: 229 IPLGEELTICYLDVLMPTAERQKQLKEQYCFDCDCPLCKT 268
>gi|346976798|gb|EGY20250.1| Mcg1p [Verticillium dahliae VdLs.17]
Length = 413
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 58/121 (47%), Gaps = 15/121 (12%)
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
Q +V + NAF++++ G + A V AI L NHDC PNA
Sbjct: 199 QMGGSVRGIVATNAFQVDVGGDV--------AGGHHHHFGVFPAISRL----NHDCGPNA 246
Query: 303 HIMWIDNAD--ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
W D R A +D+ GEE+ I Y+DA R R+A + +GF+C C RC++G
Sbjct: 247 -AAWTDKTGLVHRAFASKDIAAGEEISISYVDALAPRAERRARMAGSWGFECGCRRCAAG 305
Query: 361 D 361
D
Sbjct: 306 D 306
>gi|303278994|ref|XP_003058790.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459950|gb|EEH57245.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 771
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 45/75 (60%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G A++ L S NH CDPN + ++D+A ++A R ++ GEEL I Y+D R+
Sbjct: 403 GAAVFPLASSLNHSCDPNCEVAYVDDARVLVVARRTLKPGEELTIAYVDVDADVGERRDE 462
Query: 344 LTQGFGFQCNCLRCS 358
L + +GF+C C RCS
Sbjct: 463 LREVYGFECVCERCS 477
>gi|4725963|emb|CAA15694.2| EG:103B4.4 [Drosophila melanogaster]
Length = 991
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 109/283 (38%), Gaps = 27/283 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P ++ + AGRGVFATR I A +LI + ++T PT S C C + +
Sbjct: 584 PNWTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETLPQTGFL 643
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GA 194
+H E C D+ + + E W D Q P+ ++ L + + G
Sbjct: 644 CRHRCTLPVCETCSDSEEHQAECEHFRRWQP-KDVDAEQEQVNPMSLRILTAVRVFHLGK 702
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
E +D +Q + A + F+ D F+++ + ++ +
Sbjct: 703 EQRHLVDAMQANAER-----AYRREIIQAAQCFRNFPTTDR--VFMDQLF--RIVGVLNT 753
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF G +E LL ++ L + NH+C PNA + + A +
Sbjct: 754 NAFEAPCRSGGHETLL--------------RGLFPLTAIMNHECTPNASHYFENGRLAVV 799
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A RD+ +G E+ Y R L F C+C+RC
Sbjct: 800 RAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRC 842
Score = 44.3 bits (103), Expect = 0.093, Method: Compositional matrix adjust.
Identities = 62/295 (21%), Positives = 111/295 (37%), Gaps = 44/295 (14%)
Query: 74 QPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL-STLN-SVCYFCLRKIT 131
+P P +VA + +GRG+FATR I A + + ++ PT ++N C C R I
Sbjct: 38 RPEEPSWRVADSPISGRGIFATREIAAGEELFREHTLLVGPTAHRSMNLRTCTLCYRLIP 97
Query: 132 SSSQHFQHHNAR----FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
S+ A C E C+D+ + + + W PL +R+
Sbjct: 98 GSTDSAALCPAGCGLPVCSE-CRDSTRHDLECKLFRKWK-------------PLESQRIE 143
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ---- 243
+ C L AS +L+ AM+ M R ++ + + ++
Sbjct: 144 PRALRILSVVRCF-FLDEASR--KLLYAMQAN--MDRYYMQEVQRAADCFEHFPREQDML 198
Query: 244 -WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
++ + NAF + ++++ + A++ L NH C PNA
Sbjct: 199 DYFYRTICAFNTNAF--------------ESRSNVDGHEVLVRALFPLAGLLNHQCTPNA 244
Query: 303 HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + + A + G E+ + Y + AR+ L F C C+RC
Sbjct: 245 AHHFENGETIVVCATERIPAGAEITMSYAKLLWSTLARKIFLGMTKHFICKCVRC 299
>gi|426200938|gb|EKV50861.1| hypothetical protein AGABI2DRAFT_181876 [Agaricus bisporus var.
bisporus H97]
Length = 429
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 120/335 (35%), Gaps = 76/335 (22%)
Query: 78 PPIQVA-LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-----SVCYFCLRKIT 131
P ++V + G+G+ AT I+ ++I P IT P L+ C FC +T
Sbjct: 86 PKVEVKWFDKRKGKGLVATHEIKEGEVIWKEDPFITSPEWGILDLQMKSVACAFCTTPLT 145
Query: 132 SS--------SQHFQHHNARFCGEVCKDNAKAFYDV---------------ERRADWSAF 168
S S ARFC +C + + + R + W AF
Sbjct: 146 GSPLILSCAGSNMASPCTARFCNRLCFTRSAKLHPLLCPTKNPASIPLIQFVRESRWMAF 205
Query: 169 NDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEE---------- 218
Y L++ ++ S +E D + EL MEE
Sbjct: 206 ----------YALVLCTSRLLLESQSEDKSFNDNWEMVHALAEL--GMEERHKYSYNLSG 253
Query: 219 -----------GFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYE 267
+ + AFK+ + K + + + F+ E+ Y
Sbjct: 254 QYEPDRETWRHAYELFIQAFKEPSSTADAKKL------AKIFRKPHPSNFQAEIFD--YN 305
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEG 323
+ I IY L S NH CDPN I +D A ++A RD++ G
Sbjct: 306 TGFLRGLGRMSLNIEAHGGIYTLHSHLNHSCDPNVSIRHLDQTTALSRITVIAKRDIKVG 365
Query: 324 EELRICYIDASMARDARQAILTQGFGF-QCNCLRC 357
EEL I Y+D + RQ L +G+GF +C C RC
Sbjct: 366 EELLITYVDPQLPYQTRQNEL-KGWGFGRCTCSRC 399
>gi|392561697|gb|EIW54878.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 300
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/87 (41%), Positives = 48/87 (55%), Gaps = 2/87 (2%)
Query: 277 IESEIAVGN-AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDAS 334
I + A G+ A+++L S +N CDPN + W + A+RD+ EGEEL I Y +
Sbjct: 93 ITTGTAAGSGAVFVLGSRFNSSCDPNVNNYWNEGQGKITFWAVRDIAEGEELLISYGEHF 152
Query: 335 MARDARQAILTQGFGFQCNCLRCSSGD 361
RDAR+A L FGF C C CS D
Sbjct: 153 AGRDARRARLKMQFGFVCECTACSRTD 179
>gi|201066227|gb|ACH92522.1| IP12582p [Drosophila melanogaster]
Length = 495
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 109/283 (38%), Gaps = 27/283 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P ++ + AGRGVFATR I A +LI + ++T PT S C C + +
Sbjct: 88 PNWTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETLPQTGFL 147
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII--SGA 194
+H E C D+ + + E W D Q P+ ++ L + + G
Sbjct: 148 CRHRCTLPVCETCSDSEEHQAECEHFRRWQP-KDVDAEQEQVNPMSLRILTAVRVFHLGK 206
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
E +D +Q + A + F+ D F+++ + ++ +
Sbjct: 207 EQRHLVDAMQANAER-----AYRREIIQAAQCFRNFPTTDR--VFMDQLF--RIVGVLNT 257
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF G +E LL ++ L + NH+C PNA + + A +
Sbjct: 258 NAFEAPCRSGGHETLL--------------RGLFPLTAIMNHECTPNASHYFENGRLAVV 303
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A RD+ +G E+ Y R L F C+C+RC
Sbjct: 304 RAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRC 346
>gi|4753708|emb|CAB42051.1| MSTA protein [Drosophila melanogaster]
Length = 462
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 109/283 (38%), Gaps = 27/283 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P ++ + AGRGVFATR I A +LI + ++T PT S C C + +
Sbjct: 55 PNWTISSSTVAGRGVFATRDISAGELIFQERALVTGPTARKGQLSSCICCHETLPQTGFL 114
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GA 194
+H E C D+ + + E W D Q P+ ++ L + + G
Sbjct: 115 CRHRCTLPVCETCSDSEEHQAECEHFRRWQP-KDVDAEQEQVNPMSLRILTAVRVFHLGK 173
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
E +D +Q + A + F+ D F+++ + ++ +
Sbjct: 174 EQRHLVDAMQANAER-----AYRREIIQAAQCFRNFPTTDR--VFMDQLF--RIVGVLNT 224
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF G +E LL ++ L + NH+C PNA + + A +
Sbjct: 225 NAFEAPCRSGGHETLL--------------RGLFPLTAIMNHECTPNASHYFENGRLAVV 270
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A RD+ +G E+ Y R L F C+C+RC
Sbjct: 271 RAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRC 313
>gi|345329417|ref|XP_001513395.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Ornithorhynchus anatinus]
Length = 415
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+R++++GEEL ICY+D + RQ
Sbjct: 167 VGVGLYPSMSLLNHSCDPNCVIVF-EGTSLFLRAVREIQKGEELTICYLDVLLPSQERQK 225
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + + F C+C+RC + D
Sbjct: 226 QLKEQYCFACDCIRCKTQD 244
>gi|281208419|gb|EFA82595.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 621
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 137/344 (39%), Gaps = 70/344 (20%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPT--LSTLNSVCYFCL---------- 127
+Q+ L+ GRG+FATR I A +L+ + +++ + L + C+ C
Sbjct: 275 VQIKLSNIHGRGLFATRDIGAEELVFETRSVVSTSSAQLKEDDQHCHNCHIAFKPVEIEN 334
Query: 128 RKITSSSQHF------------------------QHHNARFCGEVCKD------NAKAFY 157
+ S S+ F Q + FC CK+ N+
Sbjct: 335 EQEISKSREFNRLAQTITRMTRLPVGRIIPEGCTQCNTTLFCSPECKEDGIHKHNSLCSS 394
Query: 158 DVERRADWSAF----NDYCRSQGLKYPLLVKRLACMIISG--------AESADCIDILQ- 204
++ RA F +++ + Y +L+K +A ++G E D + L
Sbjct: 395 NILTRAQLDKFERELTGLTQTERVGYTMLLKLMASQSVNGDPNQPLRQLELDDFVKRLSY 454
Query: 205 --PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
PAS + +L F L+ F ++ + K T Q+ + ++
Sbjct: 455 AMPASETVQLSRKDNRVFKTLKDIFHTLT---PEIFYRLKSIVTLNQTQMFTSKIKVHCT 511
Query: 263 GGLYEDLLSSAAASIESEIAVGNAIYMLP---SFYNHDCDPNAHIM--WIDNADARLMAL 317
++L + E E V I +P SF NH CD N I I++ R+++
Sbjct: 512 PAPMDEL----GYNFEFEDKVSKYINTIPLHLSFMNHSCDSNVFIASPVINDKQIRIVSK 567
Query: 318 RDVEEGEELRICYIDA-SMARDARQAILTQGFGFQCNCLRCSSG 360
R +++GEE+ I YID + + RQ +L + +GFQC C C S
Sbjct: 568 RPIKKGEEILISYIDGMQLTTENRQEMLGESYGFQCTCPLCRSN 611
>gi|351695044|gb|EHA97962.1| SET and MYND domain-containing protein 3, partial [Heterocephalus
glaber]
Length = 252
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 44/79 (55%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ L A+RD+E GEEL ICY+D M + R+
Sbjct: 17 VGVGLYPSMSLLNHSCDPNCSIVF-SGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRK 75
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F C+C RC + D
Sbjct: 76 QLRDQYCFDCDCFRCQTQD 94
>gi|62473423|ref|NP_001014717.1| msta, isoform A [Drosophila melanogaster]
gi|25090702|sp|O46040.3|MSTAA_DROME RecName: Full=Protein msta, isoform A
gi|22831561|gb|AAN09072.1| msta, isoform A [Drosophila melanogaster]
Length = 462
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/283 (23%), Positives = 109/283 (38%), Gaps = 27/283 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P ++ + AGRGVFATR I A +LI + ++T PT S C C + +
Sbjct: 55 PNWTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETLPQTGFL 114
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GA 194
+H E C D+ + + E W D Q P+ ++ L + + G
Sbjct: 115 CRHRCTLPVCETCSDSEEHQAECEHFRRWQP-KDVDAEQEQVNPMSLRILTAVRVFHLGK 173
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
E +D +Q + A + F+ D F+++ + ++ +
Sbjct: 174 EQRHLVDAMQANAER-----AYRREIIQAAQCFRNFPTTDR--VFMDQLF--RIVGVLNT 224
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF G +E LL ++ L + NH+C PNA + + A +
Sbjct: 225 NAFEAPCRSGGHETLL--------------RGLFPLTAIMNHECTPNASHYFENGRLAVV 270
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A RD+ +G E+ Y R L F C+C+RC
Sbjct: 271 RAARDIPKGGEITTTYTKILWGNLTRNIFLKMTKHFACDCVRC 313
>gi|336375028|gb|EGO03364.1| hypothetical protein SERLA73DRAFT_174827 [Serpula lacrymans var.
lacrymans S7.3]
gi|336387991|gb|EGO29135.1| hypothetical protein SERLADRAFT_456499 [Serpula lacrymans var.
lacrymans S7.9]
Length = 530
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 134/312 (42%), Gaps = 57/312 (18%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PP +++ + +GRG++A + A ++ + KP I+ + +L++ C C S
Sbjct: 3 PPSVEIRNSSVSGRGIWAKEKKEAGSILISVKPHISVLSNQSLHTHCSSCCGPAPPSGLK 62
Query: 137 --FQHHNARFCGEVCKDNAKAFYDVERRA--DWSAF---------NDYCRSQGLKYPLLV 183
+ +C C N + +E A WS+ +D R G +L
Sbjct: 63 RCVRCRMVWYCDVNCSSNDWTLHKLECSALKKWSSSAPSPDVAIPSDAVRCLGR---ILW 119
Query: 184 KRLACMIISGAES--ADCIDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKF 239
KR A G ES A ID +Q SL P SAF+ F
Sbjct: 120 KRRA----EGPESIWAKEIDSMQSHRGSLQP--------------SAFES---HTHLAHF 158
Query: 240 LNKQWYTNVLAQIRINAFRIELAGGLYEDLLS-----SAAASIESEIAVGNAIYMLPSFY 294
L + + ++ + ++ + + AG L D++S + + S A+G ++ L +
Sbjct: 159 LVR--FLDLSSPAELSEYGLSTAGDLV-DIISKFITNTFTLTSSSLSALGVSVSPLVALI 215
Query: 295 NHDCDPNAHIMW--------IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQ 346
NH CDPNA I++ + +++A+RD+E EE+ YID ++ R +RQ L +
Sbjct: 216 NHSCDPNAVIVYPRCSNEPSTEEPLMQVVAIRDIEVDEEILTAYIDTTLPRFSRQKFLKE 275
Query: 347 GFGFQCNCLRCS 358
+ F C C C+
Sbjct: 276 TYNFDCQCPSCT 287
>gi|291386375|ref|XP_002709686.1| PREDICTED: SET and MYND domain containing 1 isoform 2 [Oryctolagus
cuniculus]
Length = 490
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 120/299 (40%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + + F +A +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVDHFGEEEQKELRADVDTFLQYWPPQSQHFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ + NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLALVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERRRQLKKQYYFDCTCEHCQKG 282
>gi|194913080|ref|XP_001982622.1| GG12639 [Drosophila erecta]
gi|190648298|gb|EDV45591.1| GG12639 [Drosophila erecta]
Length = 462
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 109/283 (38%), Gaps = 27/283 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P ++ + AGRGVFATR I A +LI + ++T PT S C C + +
Sbjct: 55 PNWTISSSTVAGRGVFATRDIAAGELIFQERALVTGPTARKGQLSSCICCHETLPQTGFL 114
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GA 194
+H E C D+ + + E W D Q PL ++ L + + G
Sbjct: 115 CRHRCTLPVCETCADSEEHQAECEHFRRWQP-KDVDAEQEQVNPLSLRILTAVRVFHLGK 173
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
E +D +Q + A + F+ D F+++ + ++ +
Sbjct: 174 EQRHLVDAMQANAER-----AYRREIIQAAQCFRNFPTTDR--VFMDQLF--RIVGVLNT 224
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF G +E LL ++ L + NH+C PNA + + A +
Sbjct: 225 NAFEAPCRSGGHETLL--------------RGLFPLTAIMNHECTPNASHYFENGRLAVV 270
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A RD+ +G E+ Y + R L F C+C RC
Sbjct: 271 RAARDIPKGGEITTTYTKILWSNLTRNIFLKMTKHFACDCARC 313
>gi|390362089|ref|XP_784346.3| PREDICTED: SET and MYND domain-containing protein 5-like
[Strongylocentrotus purpuratus]
Length = 396
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 144/357 (40%), Gaps = 86/357 (24%)
Query: 84 LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTL---NSVCYFCLRKITSSSQHFQHH 140
+ E GRG+FA+R I A ++ KP++ L C +C+R + ++ + +
Sbjct: 12 IDELKGRGLFASRDIAAGTVLFEEKPLVCAQFLWNECYHYEACEYCMRSMENAEEMARRL 71
Query: 141 NARF-----CGEVCKDNAKAFYDVE-----------RRADWSAFNDY-CRSQ---GLKYP 180
+A C E CK + F R W+ ++ C SQ +P
Sbjct: 72 SANPSLVLPCMECCKAEKQNFVHCPHCRISYCCSECREQAWNQYHQVLCLSQHPPDPNHP 131
Query: 181 LLVKR-----------------LACMIISGAESADCIDIL-------------------- 203
L++ + +A MI S ++ +IL
Sbjct: 132 LVILQEVWKTMHYPPETTSIMLIAKMIASIKQAKSKEEILASFNQFCHVPQNEERQLSHK 191
Query: 204 --------QPASLSPELILAMEEGFVM-------LRSAFKKAGIDDEQMKFLNKQWYTNV 248
Q +L P L+ A+ E V LRS F G++ + + + Y +
Sbjct: 192 LLGKQFEGQYEALQPLLLNALHEESVAQWFTPGGLRSLFALIGMNGQGVGSSSLSVYVHN 251
Query: 249 LAQIRINAFR----IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
++ + A + LY ++ + A + E G+A+Y L S NH C PNA I
Sbjct: 252 CDELDLEAEERQNLDQFIDQLYVEMEKESGAFLNCE---GSALYKLQSCCNHSCVPNAEI 308
Query: 305 MWI-DNADARLMALRDVEEGEELRICYID---ASMARDARQAILTQGFGFQCNCLRC 357
++ +N+ L+AL D+ EG+E+ I Y+D +R +RQ L + + F C+C +C
Sbjct: 309 SFLHNNSTLSLVALTDITEGQEVLISYLDECCKERSRHSRQKELRENYLFSCDCSKC 365
>gi|390599694|gb|EIN09090.1| SET domain-containing protein [Punctularia strigosozonata HHB-11173
SS5]
Length = 562
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/308 (21%), Positives = 127/308 (41%), Gaps = 43/308 (13%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
PP + + + +GRG++ IR + + +P ++ + L+S C C +
Sbjct: 31 PPTLDLRESPKSGRGLYVKVAIRRGSSLLSTRPHVSVLSTPYLSSHCSSCAAPASPERPQ 90
Query: 137 FQH----HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS 192
+ H +C + C++ ADW A C + ++R A S
Sbjct: 91 LKRCAKCHVVHYCSQKCQN-----------ADWKAHKPECDA--------LQRWATAAPS 131
Query: 193 G-----AESADCID-ILQPASLSPELILAME-EGFVMLRSAFKKAGIDDEQMKFLNKQWY 245
E+ C+ +L +P I + E + RS+ + + Y
Sbjct: 132 PDLAVPEEAVRCLGRMLWQKQRNPSSIWSREIDSMQSHRSSLRPESFESHAYLAHALVRY 191
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLS-----SAAASIESEIAVGNAIYMLPSFYNHDCDP 300
+ A ++ +F + AG + DL+S + + S +G ++ + NH C P
Sbjct: 192 LGLDAPEKLASFGLRSAGDVV-DLISRFTTNAITLAAPSLTPLGVSVSPAVALVNHSCAP 250
Query: 301 NAHIMW------IDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NA +++ +D +++ALRD+ EE+ YID ++ R+ RQ IL + F C+
Sbjct: 251 NAVVVFPRVSKTVDQEPVMQVIALRDIHPDEEVTTAYIDTTVPREQRQKILRDTYHFTCS 310
Query: 354 CLRCSSGD 361
C C++ D
Sbjct: 311 CSLCAAED 318
>gi|66802125|ref|XP_629856.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74851143|sp|Q54DL6.1|Y2140_DICDI RecName: Full=SET and MYND domain-containing protein DDB_G0292140
gi|60463233|gb|EAL61426.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 521
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 61/112 (54%), Gaps = 12/112 (10%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
N+L + R NAF I+ G D +S ES G +Y+ SF+NH CDPN +
Sbjct: 332 NILLRNRCNAFYIQ---GRPRD-----GSSGESR---GCGVYVRNSFFNHSCDPNVNYWV 380
Query: 307 IDNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++N + L++V+EG+EL I YID + + R+ L +G+ F C C +C
Sbjct: 381 VNNTLEVECTLLKNVKEGDELTISYIDTTSPLNKRREKLLEGYLFNCLCTKC 432
>gi|281202822|gb|EFA77024.1| hypothetical protein PPL_09776 [Polysphondylium pallidum PN500]
Length = 1187
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 125/308 (40%), Gaps = 48/308 (15%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P+ + +S GR + A+R I +L+ P TL + C C R I+ +++
Sbjct: 718 PVALTWDDSMGRKIIASRDIAKGELLLRVAPYGAALVDDTLLTHCTSCFRNIS----YYK 773
Query: 139 HHNARFCG-----EVCKDNAKAFYDVERRADWSAFNDY----CRSQGLKYPLLVKRLACM 189
HH + C E C + + D F ++ ++ + V +
Sbjct: 774 HHLCQKCKQCILCEECNSDVDLVNEHNEECDILVFLKQNVPGADTRDFRFMVRVMLKSIA 833
Query: 190 IISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
I++G S + + P + + ++ + + ID +QM+
Sbjct: 834 ILNGKLSKEQSPKCWSKNGVPFIFDSYDD---LTHLTTDTSNIDRKQMESFE------TA 884
Query: 250 AQIRINAFRIELAGG--------------LYEDLLSSAAASIE----SEIAVGNAIYMLP 291
Q IN F+ LA G LY +L +A I+ SE+A G IY
Sbjct: 885 TQSIINVFK--LAKGPKSLEPLTNQQILDLYPKMLFNAHEYIDPLYHSEVARG--IYPTA 940
Query: 292 SFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
++ NH C+PN +W ++ + + ++RD++ GEE+ YID + + RQ L +
Sbjct: 941 AYLNHSCEPNT--VWHNDNNGMIAYRSIRDIKAGEEITTTYIDITKYKSTRQLNLLSQYA 998
Query: 350 FQCNCLRC 357
F C C RC
Sbjct: 999 FLCQCARC 1006
>gi|440636518|gb|ELR06437.1| hypothetical protein GMDG_07962 [Geomyces destructans 20631-21]
Length = 287
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 57/112 (50%), Gaps = 17/112 (15%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+L + +N+FRI A G EI +G L NH C PNA + +
Sbjct: 181 LLCVMNVNSFRITDASG--------------DEIGIG--FDPLLGMANHSCAPNASLKF- 223
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
D A L AL+ +EEGEE+ I YID ++ R ARQA L + + F C C C++
Sbjct: 224 DGRCAVLTALKHIEEGEEITISYIDTTLPRAARQAFLQKHYYFTCTCAACTT 275
>gi|407849671|gb|EKG04345.1| hypothetical protein TCSYLVIO_004595 [Trypanosoma cruzi]
Length = 359
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/109 (33%), Positives = 60/109 (55%), Gaps = 13/109 (11%)
Query: 254 INAFR--IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
++AF +++AG + LL S+ G +Y + +NH C+PN +++ ++ D
Sbjct: 234 MDAFEQEVQMAGKELQHLLQSSR---------GQGVYTVGCLFNHSCEPNLQVVYSESGD 284
Query: 312 ARL--MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L +ALR++E GEEL I YID S++ RQ L + + F C C +C
Sbjct: 285 ETLSVVALRNIELGEELCISYIDESLSYPERQQELYEHYLFVCQCPKCE 333
>gi|242216987|ref|XP_002474297.1| predicted protein [Postia placenta Mad-698-R]
gi|220726589|gb|EED80534.1| predicted protein [Postia placenta Mad-698-R]
Length = 1061
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 292 SFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
S NHDC PNAH + +N RL+A+ D+EEGEE+ + Y+D+ R+ RQ++L + F
Sbjct: 872 SRANHDCTPNAHFHFSENTCCGRLVAMHDIEEGEEITVRYVDSLAPREERQSLLRGRYNF 931
Query: 351 QCNCLRCS 358
C C CS
Sbjct: 932 VCTCRTCS 939
>gi|346320789|gb|EGX90389.1| SET domain protein [Cordyceps militaris CM01]
Length = 540
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 121/285 (42%), Gaps = 25/285 (8%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
P + + + GR + AT+ + A ++ P+I P+LS L VC FCLR +T
Sbjct: 6 PDGVVIKPHSTRGRALHATKTVAAGGVLAVFSPLILVPSLSHLTIVCSFCLRGVTPRPCS 65
Query: 137 FQHHNARFCGEVCKDNA-KAFYDVERRADWSAFNDYCRSQGLKYPL--LVKRLACMIISG 193
+ A +C C+ A + + E A A R + + P+ LV+ L + G
Sbjct: 66 -RCRAAYYCDAQCQAAAWSSGHAAECAALTGAIPPARRRREIPAPVRALVQLLLLLRSGG 124
Query: 194 AESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIR 253
+ D +D L+ + + +M + AG++ + +L +++
Sbjct: 125 GHTRDLLDGLEGHAAQRRRAPGWADMELMAMAGCAFAGMETSEGAVREA---AEILCKLQ 181
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
NAF ++ L +E +A+ NH C PNA + ++ A
Sbjct: 182 TNAFH------RFDADLGHVGIFLEPTLAMA----------NHSCLPNAFVQFVGRT-AV 224
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L A + ++ G+E+ I Y D + + RQA L + F+C C RC+
Sbjct: 225 LRAEQRIQSGDEIEISYTDYTSSLSKRQAALAP-YHFECRCRRCT 268
>gi|71666752|ref|XP_820332.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70885672|gb|EAN98481.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 359
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDARQ 341
G +Y + +NH C+PN +++ ++ D L +ALRD+E GEEL I YID S++ RQ
Sbjct: 257 GQGVYTVGCLFNHSCEPNLQVVYSESGDETLSVVALRDIEPGEELCISYIDESLSYPERQ 316
Query: 342 AILTQGFGFQCNCLRCS 358
L + + F C C +C
Sbjct: 317 QELYEHYLFFCQCPKCE 333
>gi|242209059|ref|XP_002470378.1| predicted protein [Postia placenta Mad-698-R]
gi|220730548|gb|EED84403.1| predicted protein [Postia placenta Mad-698-R]
Length = 1006
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 43/68 (63%), Gaps = 1/68 (1%)
Query: 292 SFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
S NHDC PNAH + +N RL+A+ D+EEGEE+ + Y+D+ R+ RQ++L + F
Sbjct: 819 SRANHDCTPNAHFHFSENTCCGRLVAMHDIEEGEEITVRYVDSLAPREERQSLLRGRYNF 878
Query: 351 QCNCLRCS 358
C C CS
Sbjct: 879 VCTCRTCS 886
>gi|444708470|gb|ELW49533.1| SET and MYND domain-containing protein 3 [Tupaia chinensis]
Length = 125
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 42/70 (60%), Gaps = 1/70 (1%)
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
S NH CDPN I++ + L A+RD+E GEEL ICY+D M + R+ L + F+
Sbjct: 17 SLLNHSCDPNCSIVF-NGPHLLLRAVRDIEVGEELTICYLDMLMTSEERRKQLRDQYCFE 75
Query: 352 CNCLRCSSGD 361
C+C RC + D
Sbjct: 76 CDCFRCQTQD 85
>gi|392562474|gb|EIW55654.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 499
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 67/148 (45%), Gaps = 14/148 (9%)
Query: 224 RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASI-----E 278
RS + ++D + Y V A + F + AG L DL+S +
Sbjct: 111 RSTVHASSVEDHTYLAHSLVRYLGVRAPAELQPFSLSSAGDLV-DLISRFTTNTFTLTSA 169
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWI--------DNADARLMALRDVEEGEELRICY 330
S +G + +F NH C PNA I++ L+ALRD+ G+E+RI Y
Sbjct: 170 SLSPIGICVAPTVAFANHSCSPNAVIVFPRAQGTPASKEPSLNLVALRDIAPGKEIRISY 229
Query: 331 IDASMARDARQAILTQGFGFQCNCLRCS 358
+D ++ +D RQ LT+ + F C C C+
Sbjct: 230 VDTTLPKDLRQKELTEVYNFTCQCKLCT 257
>gi|301102901|ref|XP_002900537.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101800|gb|EEY59852.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 425
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 46/77 (59%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +++ S +NHDCDPN ++ + + ++DV+EG+EL I Y++ + RQ
Sbjct: 196 VGIGLFLQGSMFNHDCDPNC-VVSFQGQEMNVHVIKDVKEGQELTISYVEVLQSTKKRQK 254
Query: 343 ILTQGFGFQCNCLRCSS 359
IL + F+C C RC++
Sbjct: 255 ILKDSYFFECQCSRCTT 271
>gi|407929314|gb|EKG22146.1| hypothetical protein MPH_00473 [Macrophomina phaseolina MS6]
Length = 492
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 122/296 (41%), Gaps = 57/296 (19%)
Query: 88 AGRGVFATRRIRASDLIHTAK-PIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCG 146
+G+G+FA + I A DLI + K P+I +L+S C C +S F N
Sbjct: 14 SGKGLFANQDIAAGDLIFSLKRPLIAVLDNVSLDSCCANCF-----ASTGFGATNNDLSL 68
Query: 147 EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GAESADCIDILQ 204
C + Y C+SQ K + C ++ GAE
Sbjct: 69 RACTGCSTLKY----------CGRSCQSQSWKRH---HKHECPVLKTLGAERQ------- 108
Query: 205 PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------------QWYTN-VLA- 250
P + A+ + VM +S I DE K L + W + VLA
Sbjct: 109 ----LPNAVRAVIQTLVMRKSGL----ISDEDWKNLQELPAHLDELRQEPDWDKHAVLAL 160
Query: 251 -----QIRINAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHI 304
+ + F +A G+Y + +++ I S + +G L NH CDPNA +
Sbjct: 161 GALKYSMAEDKFSSNIATGIYGRVFTNSLTLIGSAMEPLGICFDPLACSANHSCDPNAFV 220
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ +D A AL+ + + EE+ I YID ++ + RQ LT+ + F C+C +C+ G
Sbjct: 221 I-MDGAQLSFRALKPIAKDEEILISYIDGTLPFERRQTDLTRRYYFTCSCTKCAQG 275
>gi|390363795|ref|XP_788278.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 356
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARD 338
+IAVG IY+ S NH CDPN W+ D ++M ++DV+EG+E I Y+DA
Sbjct: 188 DIAVG--IYLRASMLNHSCDPNC--AWVCDGRKLQIMTVKDVKEGDECTISYVDAMKPAK 243
Query: 339 ARQAILTQGFGFQCNCLRC 357
RQA L + + F C C++C
Sbjct: 244 VRQADLKESYHFTCKCVKC 262
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH CD N I+ D +L ++DV++GEE I Y+D RQA L + + F C C
Sbjct: 3 NHSCDYNC-IVVFDERKLQLRTIKDVKDGEECTISYVDVINPAKERQAKLEEEYHFTCKC 61
Query: 355 LRC 357
++C
Sbjct: 62 VKC 64
>gi|328850107|gb|EGF99276.1| hypothetical protein MELLADRAFT_94773 [Melampsora larici-populina
98AG31]
Length = 415
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 45/75 (60%), Gaps = 1/75 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQAIL 344
A+ P+ NHDC PN+ + N + ALRD+ GEEL + YI ++++ RQ +L
Sbjct: 220 AVVFEPTRLNHDCRPNSAYHFDTNTLKLNVHALRDISPGEELTVSYIQVDVSKETRQGLL 279
Query: 345 TQGFGFQCNCLRCSS 359
++ +GF+C C CSS
Sbjct: 280 SRDYGFECGCSLCSS 294
>gi|154346778|ref|XP_001569326.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066668|emb|CAM44467.1| hypothetical protein LBRM_35_7140 [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 442
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 86/194 (44%), Gaps = 35/194 (18%)
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
M++ G +A ++ A+ E + ++ F M D E+ F++ Q ++ +
Sbjct: 249 MLVPGVRAAYARELRDKATFCAESLELLQNVFDM----------DPEERAFVHAQRWSEL 298
Query: 249 LAQIRINA----------FRIELAGGLYEDLLSSAA---ASIESEIAV----------GN 285
+ + +N EL L + + +A ++ IA G
Sbjct: 299 MGAVLLNGQERSPPSPYEMHRELVSCLPDGERAMSAFEQTVFQTRIATDVSDLLRSCRGQ 358
Query: 286 AIYMLPSFYNHDCDPNAHIMW--IDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
IY + +NH CDPN + + +++ ++ALRDV+ GEEL I YID+S+ RQ
Sbjct: 359 GIYEVGCLFNHSCDPNLSVQYSSLNDETLTVVALRDVKAGEELTISYIDSSLPFAVRQQQ 418
Query: 344 LTQGFGFQCNCLRC 357
L + F+C C RC
Sbjct: 419 LLDHYLFECRCPRC 432
>gi|410912232|ref|XP_003969594.1| PREDICTED: SET and MYND domain-containing protein 3-like [Takifugu
rubripes]
Length = 434
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/299 (25%), Positives = 125/299 (41%), Gaps = 43/299 (14%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P ++ ++ G G+ AT RI +L++ +P+ + +VC+ C + + +
Sbjct: 3 PKLERFVSPGKGNGLRATVRIEEGELVYVTEPLAYCVSQKQSRNVCHQCFTRHETLLRCS 62
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS-QGL--KYPLLVKRLAC-----M 189
Q AR+C C +RRA WS C+ Q L + P RLA M
Sbjct: 63 QCKMARYCSATC----------QRRA-WSDHKRECKCLQSLLPRIPTDSVRLAARLIFAM 111
Query: 190 IISGAESADCIDILQP-----ASLSPELILAMEEGFVMLRSAFKKAGID--DEQMKFLNK 242
+ S + S++ + L+ SLS + + + ML+ K D + +
Sbjct: 112 LSSCSSSSEELYTLEEHESHLTSLSEQRKQGLSQLATMLKLYLHKEVPDLPQDTPSLSSC 171
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
+ +++A++ N F I G L E +G +Y S NHDC PN
Sbjct: 172 RDALSLIAKVTCNCFTIS-DGELQE---------------IGVGLYPSLSLLNHDCRPNC 215
Query: 303 HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+M + L A+R + GEEL I YI+ + RQ L + F C+C C+S D
Sbjct: 216 -VMVFEGTKLLLRAVRGLSPGEELTISYIETLSLNEDRQQRLEDQYCFTCHCQCCNSPD 273
>gi|448122372|ref|XP_004204432.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
gi|358349971|emb|CCE73250.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
Length = 485
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/326 (20%), Positives = 130/326 (39%), Gaps = 46/326 (14%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-----CYFC 126
P P I+V T G+G++A + ++ + + KP+ P L+ + + C +C
Sbjct: 102 PDMELPDKIKVLQTSKKGKGLYAKKNVKKGEFLWEEKPLFYVPALANMTLIKNGKACSYC 161
Query: 127 LRKITSSSQ------HFQHHNARFCGEV-----CKDNAKAFYDVERRADWS--------- 166
+ +T ++ + + + C E+ CK+ + + + ++
Sbjct: 162 GKLLTQGTRGSSGVSYLRGLDCDVCSEIWCSKECKNTDSKLHALLKHNIYNPSKSNTRKN 221
Query: 167 -------AFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEG 219
A DYC + + R+ I+S A +S E +
Sbjct: 222 INANAYLALQDYCLKEQWNALYAITRIYADIVSDKSGVKQKQFSAMARISQETRYKALDS 281
Query: 220 FVMLRSAFKKAGI-DDEQMKFLNKQWYTNVLAQIRINAFRIELAGGL--YEDLLSSAAA- 275
F+ + EQ + L K+ ++ + N F I + G YE+ +
Sbjct: 282 SAGSFDNFQGGALFVQEQQESLWKEGFS-----LFSNVFPISVDDGKINYEEFMYMLGTY 336
Query: 276 SIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA--RLMALRDVEEGEELRICYIDA 333
+I + + + I++L S NH+C+P+ + N ++ A RD++ GEEL Y++
Sbjct: 337 NINN---LDSIIFLLQSHLNHNCEPSTTVKLSSNRTEGLKVYAARDIKAGEELTTSYVNP 393
Query: 334 SMARDARQAILTQGFGFQCNCLRCSS 359
S RQ L +GF+C C +C +
Sbjct: 394 SHTVQQRQRELRVNWGFRCACEKCKN 419
>gi|325186193|emb|CCA20695.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 673
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 47/75 (62%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
G+A+++L NH C+PN +++ N DA ++A+RDV +GEEL I Y+D + DA Q
Sbjct: 558 TGSALFLLICTLNHSCEPNVAVLYSKNGDAHVVAVRDVAKGEELCISYVDLDVDMDA-QE 616
Query: 343 ILTQGFGFQCNCLRC 357
++ + +C C RC
Sbjct: 617 VMLRDCDVECGCARC 631
>gi|432852874|ref|XP_004067428.1| PREDICTED: SET and MYND domain-containing protein 3-like [Oryzias
latipes]
Length = 495
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 63/245 (25%), Positives = 98/245 (40%), Gaps = 61/245 (24%)
Query: 142 ARFCGEVCKDNAKAFYDVERRADWSAFNDYCR---SQGLKYPLLVKRLACMIISGAESAD 198
AR+C C+ A W CR S ++P RLA +I
Sbjct: 125 ARYCNSACQKRA-----------WPGHKKECRCLCSLLPRHPTDSVRLAARLI------- 166
Query: 199 CIDILQPASLSPELILAMEEGFVMLRSA-----------------FKKAGIDDEQMKF-- 239
+L P S E + +++E L S + K I D + +
Sbjct: 167 -FTLLNPLQSSKEELYSIDEHESHLDSMPEQKKEGLSELATMLELYMKTEIPDLKQRMTS 225
Query: 240 ---LNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNH 296
L+ + +++A++ N F I G L E +G +Y S NH
Sbjct: 226 ELPLSCREALSIIAKVTCNCFTIS-DGELQE---------------IGVGLYPSLSLLNH 269
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
DC PN +M + + +L A++D+ GEELRI YI+ + RQ L + + F C+C R
Sbjct: 270 DCRPNC-VMVFNGTELQLRAVQDINPGEELRISYIETLSLTEDRQKQLEEQYHFTCHCQR 328
Query: 357 CSSGD 361
CSS +
Sbjct: 329 CSSKE 333
>gi|198420659|ref|XP_002128556.1| PREDICTED: similar to SET and MYND domain containing 3 [Ciona
intestinalis]
Length = 430
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/281 (21%), Positives = 120/281 (42%), Gaps = 18/281 (6%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ +E +GRG+ A R + + +++P + C FC +K Q
Sbjct: 5 VELFTSEKSGRGLKAKRNLNPGSTVLSSEPYAYLLSKKQKGVYCDFCFKKQDGLLQCSGC 64
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
++C C+ K ++ +A+ A + + + +L+ RL + +
Sbjct: 65 KYMKYCNRNCQ---KMAWNEHHKAECPALKNVMPKRPPDFVILLGRL----LWNMQQYSS 117
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA-QIRINAFR 258
+ + S IL +E + L K+A ++ + L+ W L Q+ N
Sbjct: 118 AKLPEKNS-----ILDLESNYNKLSQNQKEALMNF--LVILHTFWSPKPLPPQVTDNKML 170
Query: 259 IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
+EL + + + ++S+ VG +Y+ SF NH C+PN + + ++ A++
Sbjct: 171 LELCARIKNNSFAICNEELQSD--VGTGVYLNCSFINHSCEPNC-VAEFNMRTLKIRAVK 227
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ GEE+ I Y+D RQ L + FQC C C++
Sbjct: 228 NITAGEEVLISYVDLFATSFERQRELMSIYHFQCTCHSCNA 268
>gi|332813752|ref|XP_003309160.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan
troglodytes]
Length = 452
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 118/298 (39%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +TE GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFITEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|354543298|emb|CCE40016.1| hypothetical protein CPAR2_100550 [Candida parapsilosis]
Length = 477
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 123/316 (38%), Gaps = 45/316 (14%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-----SVCYFCLRKIT 131
PP +QV +T G+ ++A + I+ +LI + P+ P L+ ++ S C C + +
Sbjct: 107 PPKVQVVMTSKRGKCLYAKKPIKEGELIWSETPLFFVPPLANVHLVEHGSACANCGKLLQ 166
Query: 132 SSSQH--FQHHNARFCGEV-----CKDNAKAFYDVERRA------------DWSAF---N 169
S + + C EV CK + + D +AF
Sbjct: 167 QSEMKTVLKGLDCDVCAEVWCSKKCKQIDSQLHGATKHNVFNPNSKKPKIIDSNAFLELQ 226
Query: 170 DYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPEL-ILAMEEGFVMLRSAFK 228
+YC + + + I+ A +S ++ A++ G +
Sbjct: 227 EYCLKESWNALYAIALIYANIVLDKSGVKQKQFNAMARVSQDVRYKALDSGGGTFDNMNG 286
Query: 229 KAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIY 288
A EQ + L K+ Y L N + E L +I + + +++Y
Sbjct: 287 GALFVQEQQETLWKEGYEKFLRVFPKNTISYK------EFLYMMGTYNINN---LDSSVY 337
Query: 289 MLPSFYNHDCDPNAHIMWIDNADARLMAL-----RDVEEGEELRICYIDASMARDARQAI 343
+ S NH+CDPN + D A R+ L RD++EGEEL Y++ S RQ
Sbjct: 338 LTQSHLNHNCDPNTTV---DTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTLHQRQRE 394
Query: 344 LTQGFGFQCNCLRCSS 359
L +GF C C +C S
Sbjct: 395 LRVNWGFICACDKCKS 410
>gi|449445136|ref|XP_004140329.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
gi|449502473|ref|XP_004161650.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Cucumis
sativus]
Length = 482
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 117/284 (41%), Gaps = 28/284 (9%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ V + GR +F TR R + I + +P + P S + S C C I + +
Sbjct: 13 LTVTILPEKGRSLFTTRNFRPGEEIISQEPYVCVPNNSPVESRCDRCFASI-NLKKCSAC 71
Query: 140 HNARFCGEVCKDNAKAFYDVE----RRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
A +C VC+ + + +E R D S L L ++R
Sbjct: 72 KVAWYCSSVCQKSEWKLHRLECDALARLDKDRHKSVTPSIRLMIKLFIRRK--------- 122
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKK-AGIDDEQMKFLNKQWYTNVLAQIRI 254
LQ + P +A + + ++ +GID++Q+ L Q T V ++
Sbjct: 123 -------LQSEKVIP---MAATDNYKLVEELIDHMSGIDEKQL-VLYAQMGTLVNFILQW 171
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
++ + L +A +SE+ +G +Y + S NH C PNA ++ + A
Sbjct: 172 PGMNVKEIAVNFSKLACNAHTICDSELRPLGTGLYPVISIINHSCLPNA-VLVFEGRTAV 230
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ A++ + G E+ I YI+ + + RQ L + + F C C RC
Sbjct: 231 VRAVQHIPAGAEVSISYIETAGSTMTRQKTLKENYLFTCTCSRC 274
>gi|157819445|ref|NP_001100065.1| SET and MYND domain-containing protein 1 [Rattus norvegicus]
gi|149036367|gb|EDL90985.1| rCG56113, isoform CRA_a [Rattus norvegicus]
Length = 490
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 119/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQERLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRQQLKKQYYFDCSCEHCQKG 282
>gi|261332900|emb|CBH15895.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 457
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQ 341
G IY + NH C+PN +++ D L ALRD+E GEEL I Y+D ++ RQ
Sbjct: 366 GQGIYTIGCLLNHSCEPNLQVLYTAVGDETLSIEALRDIEPGEELNISYVDETLPYPQRQ 425
Query: 342 AILTQGFGFQCNCLRCS 358
IL + + F C C +C+
Sbjct: 426 LILYEHYFFICKCPKCT 442
>gi|19075913|ref|NP_588413.1| histone lysine methyltransferase Set5 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626858|sp|O74467.1|SET5_SCHPO RecName: Full=SET domain-containing protein 5
gi|3581881|emb|CAA20779.1| histone lysine methyltransferase Set5 (predicted)
[Schizosaccharomyces pombe]
Length = 319
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDASMARDARQAI 343
+++L S NHDC PN W D + A+RD+E GEE+ YID + RQ I
Sbjct: 94 GGMFLLGSRMNHDCSPNVKHTWNPRLDQVTVHAVRDIEAGEEILTTYIDLHKSHTERQKI 153
Query: 344 LTQGFGFQCNCLRCS 358
L + FGF+C C CS
Sbjct: 154 LLEHFGFKCYCSVCS 168
>gi|71747970|ref|XP_823040.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832708|gb|EAN78212.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 457
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 2/77 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQ 341
G IY + NH C+PN +++ D L ALRD+E GEEL I Y+D ++ RQ
Sbjct: 366 GQGIYTIGCLLNHSCEPNLQVLYTAVGDETLSIEALRDIEPGEELNISYVDETLPYPQRQ 425
Query: 342 AILTQGFGFQCNCLRCS 358
IL + + F C C +C+
Sbjct: 426 LILYEHYFFICKCPKCT 442
>gi|444515202|gb|ELV10791.1| SET and MYND domain-containing protein 1 [Tupaia chinensis]
Length = 462
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/299 (23%), Positives = 118/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEERRKQLKKQYYFDCTCEHCQKG 282
>gi|67589453|ref|XP_665414.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54656092|gb|EAL35184.1| hypothetical protein Chro.10248 [Cryptosporidium hominis]
Length = 212
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 65/124 (52%), Gaps = 8/124 (6%)
Query: 239 FLNKQWYTNVLAQIRINA-----FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
L+++ ++ Q+ +N+ + + L+G L +S +I I G I ++ S
Sbjct: 74 LLSRETIEQIVGQLTLNSQGLNIWGLHLSGKLDNGNISDNEPTIG--IIKGACICVIQSC 131
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+NH CDPN H+ ID++ + RDV +GEEL I Y+D ++ R +++ Q + F C
Sbjct: 132 FNHSCDPNCHVYTIDDSTIYVTTNRDVMKGEELTISYVDNTLPLAERTSLI-QNYHFTCT 190
Query: 354 CLRC 357
C C
Sbjct: 191 CRLC 194
>gi|409079577|gb|EKM79938.1| hypothetical protein AGABI1DRAFT_113180 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 562
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 79/352 (22%), Positives = 141/352 (40%), Gaps = 78/352 (22%)
Query: 78 PP----IQVALTESAGRGVFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKIT 131
PP +++ E+ + + AT+ I A + I+ P++ L + C C R++
Sbjct: 170 PPKSMNVEIKSNETGRKIMLATKDIAAGETIYKEFPVVAVLDADLESSGKYCSHCFRELQ 229
Query: 132 SS-------SQHFQHHNARFCGEVCKDNAKA-----FYDVE-----------------RR 162
S S+ + +C E C +N+K + +E R+
Sbjct: 230 DSDSSAIRLSEDVNSLGSSYCSEECLNNSKTQSTSLLFTLESPLPPELPTSEDIDNEGRK 289
Query: 163 ADWSAFNDYCRSQGLKYPLLVKRL----------------ACMIISGAESADCIDILQPA 206
A + F +Y + + PLLV R A M + +AD D +
Sbjct: 290 AAQAKFVEYLQKEKRAGPLLVARFISRQVAVETAKLVKKDAPMQTNDFTTADGGDYMLVD 349
Query: 207 SLSPELILAM---EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAG 263
+ L + EE ++ + +K EQ F+ K YT ++ ++ NA +
Sbjct: 350 HMERFRYLEVQPKEEELPLIANVLQKTLPGLEQ--FVTKDRYTTLVGKMLYNAIGV---- 403
Query: 264 GLYEDLLSSAA---------ASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADAR 313
G +D S A + ++ VG A Y L S+ NH C P+A + + +
Sbjct: 404 GSRDDKPESTARPEDQEKTRTPVGTQKQVGAAFYTLSSYANHSCAPSAKLTFPSGTTELH 463
Query: 314 LMALRDVEEGEELRICYIDASMAR-----DARQAI---LTQGFGFQCNCLRC 357
++A +D++ G+E+ + Y+D S A DAR+ L +G+ F C C C
Sbjct: 464 VVASKDIKMGDEVSVAYVDVSPAEGESIADARRRRRVELARGWRFACPCELC 515
>gi|238607908|ref|XP_002397092.1| hypothetical protein MPER_02547 [Moniliophthora perniciosa FA553]
gi|215470888|gb|EEB98022.1| hypothetical protein MPER_02547 [Moniliophthora perniciosa FA553]
Length = 293
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 87/209 (41%), Gaps = 51/209 (24%)
Query: 161 RRADWSAFNDYCRSQGLKYPLLVKRL--------ACMIISGAESA-----DCIDI----- 202
RR + + DY + G PLLV R +I GA A D +D
Sbjct: 97 RRKAQTQYVDYVKKNGKAGPLLVGRFIARQVALETSKLIPGASPARNAENDFVDADGGEY 156
Query: 203 -------------LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
LQP ELI + L+S G++ F+ + Y +L
Sbjct: 157 LLADHLERLRFLELQPPKEEVELITEI------LKSVL--PGLE----SFVTDERYAVLL 204
Query: 250 AQIRINAFRIELAGGLYEDLLSSAA--------ASIESEIAVGNAIYMLPSFYNHDCDPN 301
++ NAF + GG + + A + + +G AIY + S+ +H CDPN
Sbjct: 205 GKMAYNAFGVSYDGGRDDKPTTDARPEDLERTRTPVGTSRQIGTAIYTVSSYLSHSCDPN 264
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICY 330
A + D ++ RL+A +D+++G+EL I +
Sbjct: 265 ARPSFTDTSELRLIANKDLKKGDELTISF 293
>gi|340513811|gb|EGR44092.1| predicted protein [Trichoderma reesei QM6a]
Length = 549
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 119/289 (41%), Gaps = 41/289 (14%)
Query: 87 SAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCG 146
+AG G++ R + +LI T+ P++ + C FC ++ + F +RF G
Sbjct: 61 NAGSGLYTNRDVEVGELIFTSVPLVLCAEVGDGMEACDFCFQQ---RRRVFHPSESRFLG 117
Query: 147 --------EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
VC N Y ++ W D G Y C ++SGA +
Sbjct: 118 PGEMMPPLHVC--NGCHMYQYCSKSCWQRAWD----TGHLY-------ECGLLSGATTD- 163
Query: 199 CIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN--KQWYTNVLAQIRINA 256
L+ +L LIL M + ++ + ++ E F K+ + VL
Sbjct: 164 ----LETRTLYRLLIL-MRKKVLLPEQVQGLSRLESEMANFERRAKKTWPKVLELALEAK 218
Query: 257 FRIELAGGLYEDLL--------SSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
R + G+YE L+ + +G A+ + + NHDCDPN I++ +
Sbjct: 219 ERTKSELGIYEILILYGIIRCNALPVDQTYRNAPLGIALDLGGALLNHDCDPNVAIVF-N 277
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N ++ ALR ++ GEEL CY D + R +T + F+C+C RC
Sbjct: 278 NTQVQVRALRKLKAGEELLHCYRDIAYDFTFRNPRITSRYQFRCHCERC 326
>gi|297836294|ref|XP_002886029.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
gi|297331869|gb|EFH62288.1| hypothetical protein ARALYDRAFT_343234 [Arabidopsis lyrata subsp.
lyrata]
Length = 424
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 128/313 (40%), Gaps = 68/313 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL----STLNSVCYFCLRKITSSSQ 135
++VA GR + A + +RA +I P++ + S+++ C C R + SS+
Sbjct: 12 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 71
Query: 136 H---------------FQHHNARFCGEVCK--DNAKAFYD------VERRADWSAFNDYC 172
F H C + + ++ AF D V+ R SA+N
Sbjct: 72 QKCQSCSLVSFCSPNCFASHTPWLCESLLRLHQSSSAFSDQPSDRQVQARFLLSAYNLAA 131
Query: 173 RSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFV--MLRSAFKKA 230
S P + L + SG + D P+S A + GF+ +L +
Sbjct: 132 AS-----PSDFQILLSLQGSGCSNGD------PSSS------ATDSGFLHSLLSAVCPPL 174
Query: 231 GIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYML 290
+ ++ + +LA+ ++NAF GL E + S + IY
Sbjct: 175 PV------CISPELTAALLAKDKVNAF------GLMEPF----SVSNDKRSVRAYGIYPK 218
Query: 291 PSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
SF+NHDC PNA ++D N D + + DV EG E+ + Y +M +RQ L
Sbjct: 219 TSFFNHDCLPNACRFDYVDSASDGNTDIIIRTIHDVPEGREVCLSYFPVNMNYSSRQKRL 278
Query: 345 TQGFGFQCNCLRC 357
+ +GF+C+C RC
Sbjct: 279 LEDYGFKCDCDRC 291
>gi|448124691|ref|XP_004204990.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
gi|358249623|emb|CCE72689.1| Piso0_000280 [Millerozyma farinosa CBS 7064]
Length = 484
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 73/349 (20%), Positives = 137/349 (39%), Gaps = 64/349 (18%)
Query: 58 QKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLS 117
Q+ S T P P I+V T G+G++A + ++ D + KP+ P L+
Sbjct: 88 QQYTYASEITSQETPDMDLPDKIKVLQTSKKGKGLYAKKNVKKGDFLWEEKPLFYVPPLA 147
Query: 118 TLN-----SVCYFCLRKIT------SSSQHFQHHNARFCGEV-----CKDNAKAFY---- 157
+ C +C + +T S + + + C E+ CK+ +
Sbjct: 148 NMTLIKNGKACSYCGKLLTQGTRGNSGVSYLRGLDCDVCSEIWCSKECKNTDSKIHASLK 207
Query: 158 ---------DVERRADWSAF---NDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQP 205
+ + + +A+ DYC + + R+ I+S ++
Sbjct: 208 HNVYHPSKSNTRKNINANAYLALQDYCLKEQWNALYSITRIYADIVSDKSG------VKE 261
Query: 206 ASLSPELILAMEEGFVMLRSAFKKAGIDD----------EQMKFLNKQ---WYTNVLAQI 252
S ++ E + L S+ AG D EQ + L K+ ++NV
Sbjct: 262 RQFSAMARISQETRYKALNSS---AGSFDNFQGGALFVQEQQESLWKEGFSLFSNVFPN- 317
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
++ +I +Y +L + + + + I+++ S NH+C+PN + N
Sbjct: 318 SVDEGKINYDEFMY--MLGTFNIN-----NLDSVIFLIQSHLNHNCEPNTTVKLSSNRTE 370
Query: 313 --RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ A RD++ GEEL Y++ S RQ L +GF+C C +C S
Sbjct: 371 GLKVYAARDIKAGEELTTSYVNPSHTVQQRQRELRVNWGFRCACEKCKS 419
>gi|187607241|ref|NP_001120357.1| SET and MYND domain containing 1 [Xenopus (Silurana) tropicalis]
gi|170284459|gb|AAI61001.1| LOC100145429 protein [Xenopus (Silurana) tropicalis]
Length = 478
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 116/289 (40%), Gaps = 38/289 (13%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTL-NSVCYFCLRKITSSSQHFQ 138
+++ +E GRG+ A R A D+I A+P + L +SVC+ C ++ + Q
Sbjct: 4 VEIFESEGKGRGLRAIRESWAGDIIF-AEPAYSAVVFDNLSHSVCHSCFKRQEKLLRCGQ 62
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMIISGAE- 195
A +C C ++ W+ + C + + K P RLA I+ E
Sbjct: 63 CKFAHYCDRTC-----------QKESWANHKNECVAIKKAGKAPNENIRLAARILWRIER 111
Query: 196 -----SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
+ C+ + + E+G +M K Q + Q+ +++ +
Sbjct: 112 EGSGLTEGCLVSIDDLQNHIDKFDEAEKGLLM-EDVQKFLEYWPSQSQQFGMQYISHIFS 170
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
I N F + GL AVG I+ NHDC PN +++ +N
Sbjct: 171 VISCNGFTLSDQRGLQ---------------AVGVGIFPNLCLANHDCWPNCTVIF-NNG 214
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + +GEEL + Y+D + R+A L + + F C C C+
Sbjct: 215 KIELRALGKINKGEELTVSYVDFLNLTEDRKAQLKKQYYFDCTCEHCTK 263
>gi|225561626|gb|EEH09906.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 363
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + + +D+EEGEE+ I Y+D + D RQA
Sbjct: 140 GAIFPRAARINHSCKPNSQNTWNQNLERLTIHSFKDIEEGEEVTIAYVDGTELYDERQAC 199
Query: 344 LTQGFGFQCNCLRCSS 359
+ FGF+C C C++
Sbjct: 200 FEEAFGFRCQCEVCAA 215
>gi|449540918|gb|EMD31905.1| hypothetical protein CERSUDRAFT_119222 [Ceriporiopsis subvermispora
B]
Length = 604
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 126/341 (36%), Gaps = 73/341 (21%)
Query: 64 STATHNGKPSQPS----------------PPPIQVALTESAGRGVFATRRIRASDLIHTA 107
ST G P PS P ++V AGRG+ R +A +
Sbjct: 47 STGDTTGSPKSPSYDVLETVSTDGLYKCLPSYLEVRQVRDAGRGIITKERQKAGSTLIAT 106
Query: 108 KPIITHPTLSTLNSVCYFCLRKI--TSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADW 165
KP +T + + L+S C C T + + +C C++N DW
Sbjct: 107 KPHVTVLSTTYLDSYCSACCGPAPETGLKRCTRCRVVWYCDSCCQNN-----------DW 155
Query: 166 SAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFV 221
D C + ++R A SA D+ P+ L L+ G
Sbjct: 156 PIHKDEC--------IAIQRWA-------SSAPSPDVAIPSDAIRCLGRILLTQQHRGLD 200
Query: 222 ML-----------RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLL 270
+ RS+ + + ++ + Y + + F + AG L DL+
Sbjct: 201 SIWSKEINAMQSHRSSMQPSAVESHTHMAHSLVRYLGIANPAELAPFGLNSAGNLV-DLI 259
Query: 271 SSAAA---SIESEIAVGNAIYMLPS--FYNHDCDPNAHIMWIDNAD--------ARLMAL 317
S A ++ S I + PS NH C+PNA I++ ++ LMA+
Sbjct: 260 SRFATNTFTLTSFSLTPIGICICPSIALANHSCEPNAAIVFPRSSSLPQAQEPLMHLMAI 319
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
RD+ E++ Y+D ++ R+ RQ L + + F C C C+
Sbjct: 320 RDIAPNEQVVAAYVDVTLPRELRQKALQETYSFTCKCKLCT 360
>gi|5870832|gb|AAC53021.2| skm-BOP1 [Mus musculus]
Length = 485
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 119/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + +A Y K P RLA I+ E +
Sbjct: 64 KFAHYCDRTCQKDAW----LNHKNECAAIKKYG-----KVPNENIRLAARIMWRVER-EG 113
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 114 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 174 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 218
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 219 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKG 277
>gi|340057403|emb|CCC51748.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 438
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 43/76 (56%), Gaps = 2/76 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQ 341
G +Y + S +NH C+PN ++ D L+ ALRD E GEEL I YID S+ RQ
Sbjct: 350 GQGVYAVGSLFNHSCEPNLQVLNSATGDETLVVEALRDCEPGEELYISYIDESLPYAIRQ 409
Query: 342 AILTQGFGFQCNCLRC 357
L + + F+C C +C
Sbjct: 410 QQLREHYLFECRCSKC 425
>gi|236461014|ref|NP_001153599.1| SET and MYND domain-containing protein 1 isoform 1 [Mus musculus]
gi|332278221|sp|P97443.3|SMYD1_MOUSE RecName: Full=SET and MYND domain-containing protein 1; AltName:
Full=CD8b-opposite; AltName: Full=Zinc finger protein
BOP; Short=m-BOP
gi|308387879|pdb|3N71|A Chain A, Crystal Structure Of Cardiac Specific Histone
Methyltransferase Smyd1
gi|49903961|gb|AAH76601.1| Smyd1 protein [Mus musculus]
gi|74209249|dbj|BAE24995.1| unnamed protein product [Mus musculus]
gi|148666518|gb|EDK98934.1| SET and MYND domain containing 1, isoform CRA_b [Mus musculus]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 119/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLINFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + +A Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECAAIKKYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C+C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLKKQYYFDCSCEHCQKG 282
>gi|46108572|ref|XP_381344.1| hypothetical protein FG01168.1 [Gibberella zeae PH-1]
Length = 530
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 113/281 (40%), Gaps = 50/281 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+F+T+ D+I P I P+LS +N++C C ++ + + H +C
Sbjct: 15 GRGIFSTKSFAPGDVILPFTPTILIPSLSHINTICSHCFKQAEVRACS-RCHAVSYCDAA 73
Query: 149 CKDNAKAFYDVERRADWSAFNDY-CR---------SQGLKYPLLVKRLACMIISGAESAD 198
C+ A+W+A + C+ GL P+ R + E +
Sbjct: 74 CQ-----------AANWTAVHSKECKVLRKVTEQGRPGLPTPI---RAVVQALVKPEIGN 119
Query: 199 CIDILQ--PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
I+ L+ AS A E M SAF G E + Q +L +I+ NA
Sbjct: 120 AIEDLEGNAASWRKSEKWADMEMMAMGASAFAGLGTGQEDV-----QKALTILCKIQTNA 174
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
F Y+ L + +++A+ NH C PNA + +I A L A
Sbjct: 175 FH------RYDTDLGQVGIFLGTKLAMA----------NHSCIPNAMVQFIGRR-AILRA 217
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ ++ +E+ I Y D + R+ L F F C CLRC
Sbjct: 218 EKPIKIDDEIEISYTDYTFPLSNRKLALAPYF-FDCMCLRC 257
>gi|154282997|ref|XP_001542294.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150410474|gb|EDN05862.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 282
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + + +D+EEGEEL I Y+D + D RQA
Sbjct: 59 GAIFPSAARINHSCKPNSQNTWNRNLERLTIHSFKDIEEGEELTIAYVDGTELYDERQAY 118
Query: 344 LTQGFGFQCNCLRCS 358
+ FGF+C C C+
Sbjct: 119 FEEAFGFRCQCEVCA 133
>gi|426192470|gb|EKV42406.1| hypothetical protein AGABI2DRAFT_195752 [Agaricus bisporus var.
bisporus H97]
Length = 562
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/352 (22%), Positives = 141/352 (40%), Gaps = 78/352 (22%)
Query: 78 PP----IQVALTESAGRGVFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKIT 131
PP +++ E+ + + AT+ I A + I+ P++ L + C C R++
Sbjct: 170 PPKSMNVEIKSNETGRKIMLATKDIAAGETIYKEFPVVAVLDADLESSGKYCSHCFRELQ 229
Query: 132 SS-------SQHFQHHNARFCGEVCKDNAK-------------------AFYDVE---RR 162
S S+ + +C E C +N+K D++ R+
Sbjct: 230 DSDSSAIRLSEDVNSLGSSYCSEECLNNSKIQSTSLLFTLESPLPPELPTSEDIDNEGRK 289
Query: 163 ADWSAFNDYCRSQGLKYPLLVKRL----------------ACMIISGAESADCIDILQPA 206
A + F +Y + + PLLV R A M + +AD D +
Sbjct: 290 AAQAKFVEYLQKEKRAGPLLVARFISRQVAVETAKLVKKDAPMQSNDFTTADGGDYMLVD 349
Query: 207 SLSPELILAM---EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAG 263
+ L + EE ++ + +K EQ F+ K YT ++ ++ NA +
Sbjct: 350 HMERFRYLEVQPKEEELPLIANVLQKTLPGLEQ--FVTKDRYTTLVGKMLYNAIGV---- 403
Query: 264 GLYEDLLSSAA---------ASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADAR 313
G +D S A + ++ VG A Y L S+ NH C P+A + + +
Sbjct: 404 GSRDDKPESTARPEDQEKTRTPVGTQKQVGAAFYTLSSYANHSCAPSAKLTFPSGTTELH 463
Query: 314 LMALRDVEEGEELRICYIDASMAR-----DARQAI---LTQGFGFQCNCLRC 357
++A +D++ G+E+ + Y+D S A DAR+ L +G+ F C C C
Sbjct: 464 VVATKDIKMGDEVSVAYVDVSPAEGESIADARRRRRVELARGWRFACPCELC 515
>gi|367020696|ref|XP_003659633.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
gi|347006900|gb|AEO54388.1| hypothetical protein MYCTH_99686 [Myceliophthora thermophila ATCC
42464]
Length = 573
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 130/326 (39%), Gaps = 68/326 (20%)
Query: 73 SQPSPPPIQVALTESAG--RGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFCLRK 129
+Q PP +++ S+ R + ATR I T + P++ P +T+ + C +CLR
Sbjct: 2 AQALPPGVRIHNDPSSPKRRSLRATRAFSPGSTIATFSNPLLALPDGATMRTTCNYCLRT 61
Query: 130 ITSSSQHFQH-------------HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQG 176
S Q A +CG C+ +A + +A+ F G
Sbjct: 62 TAPSPSPSQSAATPPAFKACTACKAAVYCGPACQ---RAHWKAAHKAECKMFARVREQAG 118
Query: 177 LKY-PLLVKRLACMIIS--------GAESADCIDILQPASLSPELILAMEEGFVMLRSAF 227
+ P V+ +A ++++ G A+ ++ E E+G F
Sbjct: 119 KDWLPTPVRAVAQVLLTLQQGKSGGGGREAEMRRAFIGSADGEE-----EDGLEGNVEGF 173
Query: 228 KKAG--IDDEQMKFLNKQWYTN-------------VLAQIRINAF-RIELAGGLYEDLLS 271
KK G D +++ Y +L +I+ NAF R++ G+ L
Sbjct: 174 KKDGEVWKDMELQATAAVVYAGLLQGEEVLEKAREILCKIQTNAFNRLDADTGMAGIFLD 233
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI 331
A + NH C PNA I + D A L A R ++EGEE+ I YI
Sbjct: 234 VGLAMV-----------------NHSCVPNAFIGF-DKRTAILRAERPIQEGEEITISYI 275
Query: 332 DASMARDARQAILTQGFGFQCNCLRC 357
D ++ + AR L + + FQC+C+RC
Sbjct: 276 DNTLPKAARYEAL-RLYHFQCDCVRC 300
>gi|171695572|ref|XP_001912710.1| hypothetical protein [Podospora anserina S mat+]
gi|170948028|emb|CAP60192.1| unnamed protein product [Podospora anserina S mat+]
Length = 444
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 62/125 (49%), Gaps = 16/125 (12%)
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
D Q KFL+ Q+ V A+I N+FRI ++ + + V ++ S
Sbjct: 164 DAQEKFLS-QFGDGVRAKIDKNSFRI---------VIDPSGDKESGHLGV----FVDVSG 209
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+NHDC PN H I N +A+RD+ G+EL I YI R RQ L Q +GF+C
Sbjct: 210 FNHDCRPNVHYR-ITNTTHTTVAVRDIHPGDELTISYIYGIARRSTRQKEL-QDWGFECT 267
Query: 354 CLRCS 358
C +C+
Sbjct: 268 CSQCT 272
>gi|392568287|gb|EIW61461.1| MAS20-domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 609
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/357 (21%), Positives = 137/357 (38%), Gaps = 81/357 (22%)
Query: 77 PPPIQVALTESAGRG-------VFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCL 127
P + V++ AG G + A R ++I+T KP+I L S C +C
Sbjct: 212 PKRMNVSVKAGAGEGSQTMKKILVAERDFAEGEVIYTEKPLIVALDADLEGQGSHCSYCF 271
Query: 128 RKITSSSQHFQHH---NARFCGEVCK----------------------DNAKAFYDVERR 162
R I + H N+ +C C+ DN E+R
Sbjct: 272 RTIPEDAAVTPTHDKLNSVYCSRDCQQRAEIGWQNILFGLEPVLPPELDNGMGEITTEQR 331
Query: 163 -ADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSP--ELILAMEEG 219
A + +Y +++ + LL R I+ + I++L P P E + M +G
Sbjct: 332 NAAQGPYVEYMKTKAKQANLLAARFVAKQIA----HETINLL-PTKTGPIVEELQQMADG 386
Query: 220 FVML-------RSAFKKAGIDDEQMKFLN---------------KQWYTNVLAQIRINAF 257
+ R F + E++K L ++ + ++ ++ NA
Sbjct: 387 KDLYHVGDHIERLRFVEGNSSSEEVKLLGAVLGSALPGLEQSVTEERHATLVGKMGYNAI 446
Query: 258 RIELAGGL-YEDLLSSAAASIE-------SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+ + GG + S +E + G+AIY++ S+ H CDPN + +
Sbjct: 447 GVCIDGGRDGRPVASDRPEELERTRTPHGTARQFGSAIYLVSSYLVHSCDPNTRPTFPNG 506
Query: 310 ADA-RLMALRDVEEGEELRICYIDASMARDARQAI--------LTQGFGFQCNCLRC 357
+ L+A R +E+G+EL + ++D S A I L +G+ F+C C +C
Sbjct: 507 TNELHLVATRPIEKGDELTMAWVDVSTAEGEDAGIARRRRRIELARGWRFKCECAKC 563
>gi|149727214|ref|XP_001497840.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Equus caballus]
Length = 490
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 118/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFNSEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQRELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG ++ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGVFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKG 282
>gi|432113965|gb|ELK36030.1| SET and MYND domain-containing protein 3, partial [Myotis davidii]
Length = 241
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 44/79 (55%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L A+RD++ GEEL ICY+D M R+
Sbjct: 19 VGVGLYPSMSLLNHSCDPNCSIVF-NGPHLLLRAVRDIQAGEELTICYLDMLMTSAERRE 77
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F C+C RC + D
Sbjct: 78 QLRNQYCFDCDCARCLTQD 96
>gi|115475273|ref|NP_001061233.1| Os08g0205300 [Oryza sativa Japonica Group]
gi|40253402|dbj|BAD05332.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761301|dbj|BAD11544.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|113623202|dbj|BAF23147.1| Os08g0205300 [Oryza sativa Japonica Group]
Length = 331
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/288 (22%), Positives = 106/288 (36%), Gaps = 86/288 (29%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQHFQ 138
++ A GRG+ A R IR ++I T +P++ +P +L++L S C C R +++++
Sbjct: 6 LRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPASLASLPSFCSACFRSLSAAASPCP 65
Query: 139 H-HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
A FC C +P L +A ++
Sbjct: 66 SCRAAGFCSPSCA-------------------------AASHPRLHAMVAALVPPQ---- 96
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
+ P SP+L A+ L++ R N+F
Sbjct: 97 -----MLPPGFSPDLTAAL--------------------------------LSKDRTNSF 119
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP--SFYNHDCDPNA-HIMWID-----N 309
I + + + A + P S NHDC PNA H + D N
Sbjct: 120 SI----------MEPYRPEVPQPLRKARAYAVYPRASLLNHDCLPNACHFDYADRPGPGN 169
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
D + AL D+ EG E+ + Y A+ RQ L + +GF+C C RC
Sbjct: 170 TDIVVRALHDITEGREVCLSYFAANWQYKDRQQRLLEDYGFRCECERC 217
>gi|378731922|gb|EHY58381.1| SET and MYND domain-containing protein [Exophiala dermatitidis
NIH/UT8656]
Length = 550
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 122/297 (41%), Gaps = 41/297 (13%)
Query: 85 TESAGRGVFATRRIRA-SDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS--QHFQHHN 141
T G G+FA I A S L+ A+P++ + L + CY+C R+ Q +
Sbjct: 40 TADRGNGLFAVDDIAAESQLLFVARPLLVALENAKLPTNCYYCFREPGGQGLWQMTERKE 99
Query: 142 -----------ARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYPLLVKRLACM 189
A+FC + C+ A WS ++ C+ YP ++
Sbjct: 100 DSLKTCSRCKVAKFCDQKCQTEA-----------WSQYHRLECKLFSRLYPRVLPSTVRA 148
Query: 190 IISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
+I + IL P E +LA++ + AG Q + Q
Sbjct: 149 VIRLLKQHKA-GILPPGEW--EQLLALQSH----QQDLANAGGQRWQDLLIMSQGIKGYS 201
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-ID 308
+ + L+ + + + + A+ +S +G ++ P+ NH CDPNA++ + +
Sbjct: 202 GTDEDDDLILRLSCAVIVNSFTLSNATFDS---IGVILHPKPALLNHSCDPNAYVRFDVS 258
Query: 309 NADA----RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
D + ALRD+ + EE+ I YID ++ RQ L++ + F C C C++ D
Sbjct: 259 ETDTLGSISVHALRDIAKDEEITISYIDTTVPCKRRQQQLSERYFFTCQCHLCANPD 315
>gi|66362174|ref|XP_628051.1| SET domain containing protein with a cysteine cluster at the
C-terminus [Cryptosporidium parvum Iowa II]
gi|46227454|gb|EAK88389.1| SET domain containing protein with a cysteine cluster at the
C-terminus [Cryptosporidium parvum Iowa II]
Length = 385
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 65/124 (52%), Gaps = 8/124 (6%)
Query: 239 FLNKQWYTNVLAQIRINA-----FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
L+++ ++ Q+ +N+ + + L+G L +S +I I G I ++ S
Sbjct: 247 LLSRETIEQIVGQLTLNSQGLNIWGLHLSGKLDNRNISDNEPTIG--IIKGACICVIQSC 304
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+NH CDPN H+ ID++ + RD+ +GEEL I Y+D ++ R +++ Q + F C
Sbjct: 305 FNHSCDPNCHVYTIDDSTIYVTTNRDIMKGEELTISYVDNTLPLAERTSLI-QNYHFTCT 363
Query: 354 CLRC 357
C C
Sbjct: 364 CRLC 367
>gi|323304666|gb|EGA58429.1| Set5p [Saccharomyces cerevisiae FostersB]
Length = 524
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFFISNGKACARCXKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------W--EKCYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|407867902|gb|EKG08713.1| hypothetical protein TCSYLVIO_000130 [Trypanosoma cruzi]
Length = 393
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 53/105 (50%), Gaps = 5/105 (4%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DA 312
IN F ++ + G E S A + + G A+Y L S +NH CDPN ++ +D D
Sbjct: 274 INGFIVQSSSG--EKAFS--AIPLVEWVLKGGAVYALQSAFNHSCDPNVNVSNVDGTHDI 329
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L LR V+ GEEL I YI ++ +G+ F C CLRC
Sbjct: 330 TLRTLRPVKRGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRC 374
>gi|189234759|ref|XP_001814707.1| PREDICTED: similar to SMYD family member 5 [Tribolium castaneum]
gi|270001538|gb|EEZ97985.1| hypothetical protein TcasGA2_TC000380 [Tribolium castaneum]
Length = 383
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 57/97 (58%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
LYED+ S + + +E G A++ L S NH C PNA ++ N + L+A+RD++EG
Sbjct: 268 LYEDMDSHSGNFLNNE---GVALFTLQSACNHSCVPNAEPTYLHNNNKLSLVAVRDIQEG 324
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R+ L + + F CNC +C
Sbjct: 325 EEICISYLDECNLHRSRHSRRKELMENYLFACNCPKC 361
>gi|449435328|ref|XP_004135447.1| PREDICTED: uncharacterized protein LOC101202892 [Cucumis sativus]
gi|449522881|ref|XP_004168454.1| PREDICTED: uncharacterized protein LOC101228219 [Cucumis sativus]
Length = 540
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 113/280 (40%), Gaps = 74/280 (26%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
PIQ+ + ++GRG+FAT+ + + L+ K I + N
Sbjct: 185 PIQIKRSGNSGRGLFATKNVDSGTLLLVTKAIAIERGILPEN------------------ 226
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
C +NA+ W F D K K L ++ SG E +
Sbjct: 227 ----------CDENAQLVM-------WKNFIDKVTDSATK-STKTKYLIGLLSSG-EGEE 267
Query: 199 CIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFR 258
+++ + + PE K I +M +N+L+ + IN+
Sbjct: 268 DLEVPEMSVFKPET----------------KDQISPSEM--------SNILSVLDINS-- 301
Query: 259 IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
L ED +SA ++ G +++LPSF NH C PNA + I + + A R
Sbjct: 302 ------LVEDA-NSAKVLGKNRDYYGVGLWVLPSFINHSCIPNARRLHIGD-HILVHASR 353
Query: 319 DVEEGEELRICYIDA-SMARDARQAILTQGFGFQCNCLRC 357
DV+ GEE+ Y D S +D ++ +++ +GF CNC RC
Sbjct: 354 DVKAGEEITFAYFDPLSSWKDRKR--MSETWGFNCNCKRC 391
>gi|344247559|gb|EGW03663.1| SET and MYND domain-containing protein 1 [Cricetulus griseus]
Length = 467
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 120/303 (39%), Gaps = 43/303 (14%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
S ++V +E GRG+ AT+ A+D++ + S +N VC+ C ++ +
Sbjct: 5 SMENVEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHR 64
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
Q A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 65 CGQCKFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAGRIMWRVE 115
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGID------DEQMKFLNKQWYTNVL 249
+ + + +S + + E F + +D Q + + Q+ +++
Sbjct: 116 R-EGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID- 308
I N F + GL AVG I+ NHDC PN +++ +
Sbjct: 175 GVINCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNG 219
Query: 309 NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N +A L AL + EGEEL + YID + R+ L + + F C+C C
Sbjct: 220 NHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHC 279
Query: 358 SSG 360
G
Sbjct: 280 QKG 282
>gi|307102606|gb|EFN50876.1| hypothetical protein CHLNCDRAFT_141673 [Chlorella variabilis]
Length = 510
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
GNA Y L S +NH CDPNA D + A ++ALRD++ GEE+ + YID + R
Sbjct: 423 GNAFYALHSCFNHSCDPNAEAFKRDEDEDGGAVILALRDIQAGEEVTLSYIDEEAPLEER 482
Query: 341 QAILTQGFGFQCNCLRCSS 359
+ L +GF+C C +C +
Sbjct: 483 RQQLAD-YGFRCACDKCQA 500
>gi|354483543|ref|XP_003503952.1| PREDICTED: SET and MYND domain-containing protein 1 [Cricetulus
griseus]
Length = 498
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 120/303 (39%), Gaps = 43/303 (14%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
S ++V +E GRG+ AT+ A+D++ + S +N VC+ C ++ +
Sbjct: 5 SMENVEVFTSEGKGRGLKATKEFWAADVVFAERAYSAVVFDSLINVVCHTCFKRQEKLHR 64
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
Q A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 65 CGQCKFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAGRIMWRVE 115
Query: 196 SADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGID------DEQMKFLNKQWYTNVL 249
+ + + +S + + E F + +D Q + + Q+ +++
Sbjct: 116 R-EGTGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDAFLQYWPPQGQQFSMQYISHIF 174
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID- 308
I N F + GL AVG I+ NHDC PN +++ +
Sbjct: 175 GVINCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNG 219
Query: 309 NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N +A L AL + EGEEL + YID + R+ L + + F C+C C
Sbjct: 220 NHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLHLSEERRRQLRKQYYFDCSCEHC 279
Query: 358 SSG 360
G
Sbjct: 280 QKG 282
>gi|328868687|gb|EGG17065.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 333
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 57/110 (51%), Gaps = 17/110 (15%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
VLAQ N+F GL++ S+ G A+Y SF NH C PN +
Sbjct: 190 VLAQSNRNSF------GLWK----------SSDEQYGLAMYATASFLNHSCFPNCARVQ- 232
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
NA ++A+RD+EE +E+ ICYI+A AR+ IL + F C C+RC
Sbjct: 233 RNAGIDIVAIRDIEENDEITICYINARDNDTARRMILKGCYYFDCQCIRC 282
>gi|229892820|ref|NP_001153561.1| SET and MYND domain-containing protein 1 [Sus scrofa]
gi|197692962|gb|ACH71269.1| SET and MYND domain-containing 1 [Sus scrofa]
Length = 490
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 118/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVITSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRMDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NH+C PN +++ + N +A
Sbjct: 179 CNGFTLSEQRGLQ---------------AVGVGIFPNLGLVNHNCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQTRIELRALGKISEGEELTVSYIDFLNFSEERKKQLKKQYYFDCTCEHCQKG 282
>gi|397468470|ref|XP_003805903.1| PREDICTED: SET and MYND domain-containing protein 1 [Pan paniscus]
Length = 490
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 117/298 (39%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V + E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFIAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|168051542|ref|XP_001778213.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670426|gb|EDQ56995.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 490
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + S NH C PNA ++ D A L AL D +EG E+ I Y++ + + + R+
Sbjct: 208 LGTGLYPVISIVNHSCVPNA-VLHFDGNRAALRALEDTQEGTEITISYVELAASTNTRRK 266
Query: 343 ILTQGFGFQCNCLRCS 358
L + F CNC+RCS
Sbjct: 267 ALRDQYYFDCNCIRCS 282
>gi|406606076|emb|CCH42549.1| lysine methyltransferase [Wickerhamomyces ciferrii]
Length = 481
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 144/365 (39%), Gaps = 59/365 (16%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATR 96
W + R L S Q+ + T P P I++ T++ G+G+++T
Sbjct: 64 WSLSEKRLKTVLKSYNLLNSAQQYTYANEITSKETPDINLPKNIRLQFTKNRGKGLYSTN 123
Query: 97 RIRASDLIHTAK-PIITHPTLSTLNSV-----CYFCLRKITSSSQHFQHHNAR------- 143
+ DLI K P+ P L +N + C +C I+S++
Sbjct: 124 LFKEGDLIWEEKTPLFFIPPLDRVNLIANSKACTYCGSLISSNNSSRSVLKGLDCNVCRD 183
Query: 144 -FCGEVCKDNAKA--------FYDVERR---------ADWSAFNDYCRSQGLKYPLLVKR 185
+C + CK N K ++D ++ ++W + +C+ +
Sbjct: 184 VWCSQFCKKNDKVHQLLKHNVYHDDPKKQNSKKQIISSNWIKYEKFCQENKWNAAFAIGL 243
Query: 186 LACMIISG-----AESADCI-----DILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
+ II+ A+ D + DI A S + + + + F K E
Sbjct: 244 IHSYIITDKSGVLAKQFDAMAKVRQDIRYKAIDSSGIGASFDNNNNGGGALFVK-----E 298
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAA-SIESEIAVGNAIYMLPSFY 294
Q + L KQ + + IN YE+ L++ +I + + +++++ S
Sbjct: 299 QQESLWKQGFELFNSIFPINQIS-------YEEFLTNLGTYNINN---IDGSLFLIQSHL 348
Query: 295 NHDCDPNAHIMWIDNADA--RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NH+CDPN + + + ++ A RD++ EEL Y++ S + + R L +GF C
Sbjct: 349 NHNCDPNVRVKFGEKKTDGIKVYAKRDIKANEELTTSYVNPSHSLNQRLRELRVNWGFIC 408
Query: 353 NCLRC 357
NC +C
Sbjct: 409 NCKKC 413
>gi|395508794|ref|XP_003758694.1| PREDICTED: SET and MYND domain-containing protein 1 [Sarcophilus
harrisii]
Length = 490
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 121/307 (39%), Gaps = 51/307 (16%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSS 134
S ++V +E GRG+ T+ + A+D++ A+P + + V C+ C ++
Sbjct: 5 SMEKVEVFTSEGKGRGLKTTKELWAADVVF-AEPAYSAVVFDSFTQVVCHTCFKRQEKLQ 63
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMII 191
+ Q AR+CG C+ +A W + C + G K P RLA I+
Sbjct: 64 RCGQCKFARYCGRPCQKDA-----------WLNHKNECAAIKKHG-KPPSENIRLAARIM 111
Query: 192 SGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWY 245
E + + + +S + + + F + +D Q + + Q+
Sbjct: 112 WRIER-EGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRMDVDSFLNFWPPQSQQFSMQYI 170
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIM 305
+++ I NAF + GL AVG I+ NHDC PN ++
Sbjct: 171 SHIFGVINCNAFTLSDQRGLQ---------------AVGVGIFPNLCLVNHDCWPNCTVI 215
Query: 306 WID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+ + N +A L AL + EGEEL + YID R+ L + + F C
Sbjct: 216 FNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNISQERKKQLKKQYYFDCT 275
Query: 354 CLRCSSG 360
C C G
Sbjct: 276 CEHCEKG 282
>gi|134085965|ref|NP_001076953.1| SET and MYND domain-containing protein 1 [Bos taurus]
gi|126717435|gb|AAI33279.1| SMYD1 protein [Bos taurus]
Length = 490
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 117/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVEREGS 119
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 120 -GLTEGCLVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG ++ + NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGVFPNLALVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQKG 282
>gi|71412496|ref|XP_808430.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70872634|gb|EAN86579.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 395
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DA 312
IN F ++ + G E S A + + G A+Y L S +NH CDPN + +D D
Sbjct: 274 INGFIVQSSSG--EKAFS--AIPLVEWVLKGGAVYALQSAFNHSCDPNVSVSNVDGTHDI 329
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L LR V+ GEEL I YI ++ +G+ F C CLRC
Sbjct: 330 TLRTLRPVKSGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRC 374
>gi|358391269|gb|EHK40673.1| hypothetical protein TRIATDRAFT_286297 [Trichoderma atroviride IMI
206040]
Length = 347
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 53/100 (53%), Gaps = 4/100 (4%)
Query: 264 GLYEDLLSSAAASIESEIAVG---NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRD 319
GL + L A S S + G +++ + NH C PNAH W +N + A+RD
Sbjct: 90 GLDDGLFLGIARSNVSPLGPGASEGGLFLDAARINHSCRPNAHKSWNENLQRLTVHAVRD 149
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+E G+E+ I Y+ +++ RQAIL Q F F C C CS+
Sbjct: 150 IERGQEITISYLGETLSYIERQAILKQRFRFDCGCDLCSA 189
>gi|426223567|ref|XP_004005946.1| PREDICTED: SET and MYND domain-containing protein 1 [Ovis aries]
Length = 490
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 117/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVEREGS 119
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 120 -GLTEGCLVSVDDLQNHVEHFGEEEQKELRLDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG ++ + NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGVFPNLALVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNISEERKKQLKRQYYFDCTCEHCQKG 282
>gi|331224545|ref|XP_003324944.1| hypothetical protein PGTG_06481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309303934|gb|EFP80525.1| hypothetical protein PGTG_06481 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 444
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 58/108 (53%), Gaps = 9/108 (8%)
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGN-------AIYMLPSFYNHDCDPN-AHIMWIDN 309
R+ AG E+ +SSA EI +GN A+++ PS NHDC PN A + ++
Sbjct: 219 RLFGAGDSEEEWISSAIDMNAFEINLGNDSDVPFFAVFLTPSRLNHDCRPNTAFHVNSES 278
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + ALR + GEE+ I Y D + R+ RQ ++ +GFQC C C
Sbjct: 279 LEIHMHALRTINPGEEMTISYRDMAQIREQRQEDISN-YGFQCTCAHC 325
>gi|296482488|tpg|DAA24603.1| TPA: SET and MYND domain containing 1 [Bos taurus]
Length = 490
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/299 (23%), Positives = 117/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFCAADIIFAERAYSAVVFDSLVNVVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVEREGS 119
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGID------DEQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 120 -GLTEGCLVSVDDLQNHVEHFGEEEQKELRLDVDMFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG ++ + NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGVFPNLALVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + EGEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKRQYYFDCTCEHCQKG 282
>gi|448532414|ref|XP_003870424.1| Set5 protein [Candida orthopsilosis Co 90-125]
gi|380354779|emb|CCG24294.1| Set5 protein [Candida orthopsilosis]
Length = 499
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/322 (22%), Positives = 125/322 (38%), Gaps = 47/322 (14%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-----SVCYFC 126
P PP +QV +T G+ ++A + I +LI + P+ P L+ ++ S C C
Sbjct: 126 PDIELPPKVQVVMTSKRGKCLYAKKPINKGELIWSETPLFFVPPLANVHLVEHGSACANC 185
Query: 127 --LRKITSSSQHFQHHNARFCGEV-----CKDNAKAFYDVERRA------------DWSA 167
L + T + + C EV CK + + D +A
Sbjct: 186 GILLQQTEMKTVLKGLDCDVCSEVWCSKKCKQMDAQLHGATKHNVFNPNSKKPKIIDSNA 245
Query: 168 F---NDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPEL-ILAMEEGFVML 223
F +YC + + + ++ A +S ++ A++ G
Sbjct: 246 FLELQEYCLKESWNALYAITLICANMLLDKSGTKQKQFNAMARVSQDVRYKALDSGGGTF 305
Query: 224 RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAA-SIESEIA 282
+ A EQ + L K+ Y L N Y+D L +I +
Sbjct: 306 DNMNGGALFVQEQQETLWKEGYERFLRVFPKNFVS-------YKDFLYMMGTYNINN--- 355
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL-----RDVEEGEELRICYIDASMAR 337
+ +++++ S NH+CDPN + D A R+ L RD++EGEEL Y++ S
Sbjct: 356 LDSSVFLTQSHLNHNCDPNTTV---DTASDRISGLKVFAKRDIKEGEELTTTYVNPSHTL 412
Query: 338 DARQAILTQGFGFQCNCLRCSS 359
+ RQ L +GF C C +C S
Sbjct: 413 NQRQRELRVNWGFICACEKCKS 434
>gi|213407994|ref|XP_002174768.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212002815|gb|EEB08475.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 314
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
+++L S NHDC PN W D D + A+RD+++GEE+ YID + R+ +
Sbjct: 94 GGLFLLGSRMNHDCSPNVKHTWNDTLDGVTVHAVRDIQKGEEILTTYIDLQKPKGERRKL 153
Query: 344 LTQGFGFQCNCLRCS 358
L FGF C C CS
Sbjct: 154 LQAHFGFLCLCSACS 168
>gi|410077102|ref|XP_003956133.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
gi|372462716|emb|CCF56998.1| hypothetical protein KAFR_0B07000 [Kazachstania africana CBS 2517]
Length = 484
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/314 (20%), Positives = 123/314 (39%), Gaps = 69/314 (21%)
Query: 90 RGVFATRRIRASDLIHT-AKPIITHPTLSTL-----NSVCYFCLRKITSSSQHF------ 137
R + A + DLI+T + PI+ P + + C +C +++ S HF
Sbjct: 116 RKLLAKTALDVGDLIYTESSPIVVIPPMDKMALIKSGKACIYCGNSLSNISSHFIIKNGL 175
Query: 138 --QHHNARFCGEVCKDN----AKAFYDVERRA--------DWSAFNDYCRSQGLKYPLLV 183
N +C + C+ N + ++D + +W F YC+ + L V
Sbjct: 176 DCNDCNGVWCTKTCRKNDIIHSALYHDKSKNKSKITIKSHNWILFAKYCQEK-----LFV 230
Query: 184 KRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ 243
+ II +++L+ EE R+ F+ ++++ +N
Sbjct: 231 AAYSVGIIFA-----------------KILLSNEEDSTKTRAQFESLAQVSQRIR-INLS 272
Query: 244 WYTNVLAQIRIN--AFRIELAGGLYED---LLSSAAASIESEI---------------AV 283
TN+ + I A ++ ++++ L + +E +I +
Sbjct: 273 DSTNIGGTLDITSGAMAVDDPESVWQEAYSLFTKTFPMLEGDIDLERFLCYIGKFNINQI 332
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
N +Y L S NH C+PNA + ++ A R ++ GEE+ + Y++ + R+
Sbjct: 333 SNQVYQLTSLANHSCEPNARYEIDSKLELKVYARRKIKPGEEIFLTYVNPLHGVNLRRRE 392
Query: 344 LTQGFGFQCNCLRC 357
L +GF C C RC
Sbjct: 393 LRVNWGFLCKCSRC 406
>gi|147810952|emb|CAN74411.1| hypothetical protein VITISV_025097 [Vitis vinifera]
Length = 588
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 111/281 (39%), Gaps = 70/281 (24%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+Q+ +E +GRG+FAT+ + A L+ K I T C + S+
Sbjct: 236 VQIMKSEISGRGLFATKNVDAGTLVLVTKAIATER--------CILPEQNDDSA------ 281
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMI--ISGAESA 197
DN + W F D K KRL +I +S E
Sbjct: 282 -----------DNIQLVM-------WKNFIDKVVESASK----CKRLHHLISVLSSGEXE 319
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D +++ PE EE S K +D +L+ + +N+
Sbjct: 320 DVLEVPDVNLFRPE----TEE------SGLSKGKLD-----------MGKILSILDVNS- 357
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL 317
L ED S+ S+ G +++LP+F NH C+PNA + + + + A
Sbjct: 358 -------LVEDATSAKVLGKNSDY-YGVGLWILPAFINHSCNPNARRLHVGD-HVIVHAS 408
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
RDV+ GEE+ Y D ++ ++ + + +GFQCNC RC
Sbjct: 409 RDVKAGEEITFAYFDV-LSPWRKRTDMAKTWGFQCNCKRCK 448
>gi|321478979|gb|EFX89935.1| SET and MYND domain-containing 5 protein [Daphnia pulex]
Length = 387
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 138/351 (39%), Gaps = 82/351 (23%)
Query: 86 ESAGRGVFATRRIRASDLIHTAKPIIT---HPTLSTLNSVCYFCLRKITSSSQHFQHHNA 142
E+ GRG+F TR + + I +P+ + + S C FC+R + ++ ++ + A
Sbjct: 20 ETQGRGLFTTRSFKNGETIIEEQPLFSCQFSWNYAYGYSACDFCMRPLETAEENARRLTA 79
Query: 143 R---------------------------FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQ 175
+ +C CKD A Y + N +
Sbjct: 80 KADLILPHPECDGTDKSSHVICSQCAVTYCSVDCKDQAWNQYHKTICCNMFGGNSNHPLE 139
Query: 176 GLK-------YP-------LLVKRLACMI--ISGAESADCIDILQPASLSPELILA---- 215
L YP LLV+ LA I E + L +++ E +A
Sbjct: 140 KLNEAWKKMHYPPETSTIMLLVRILANFIQRTDREELKSQLMSLCHHTVNEEETIAHKLL 199
Query: 216 ---MEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRIN-------AFRI------ 259
E +LR KA E +FL Q + +++A I N AF +
Sbjct: 200 GQEFESQLELLRDLCTKALGMPETHEFLTPQGFRSLIALIGRNGQGIGTSAFSVWVRKVS 259
Query: 260 ---------ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDN 309
L +Y+++ + + + +E G+A++ + S NH C+PN N
Sbjct: 260 ENDIDPTTDALIDTIYQEMENESGDFLNNE---GSALFAIQSACNHSCEPNCISTFPFSN 316
Query: 310 ADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
L+A +D+EEGEE+ I Y+D S +R +R+ IL + + F CNC RC
Sbjct: 317 HTVALVASKDLEEGEEIFISYLDECAQSRSRHSRRKILKENYLFHCNCSRC 367
>gi|323354728|gb|EGA86563.1| Set5p [Saccharomyces cerevisiae VL3]
Length = 526
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFXISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------W--EKCYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|403303925|ref|XP_003942566.1| PREDICTED: SET and MYND domain-containing protein 1 [Saimiri
boliviensis boliviensis]
Length = 490
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK 281
>gi|190405982|gb|EDV09249.1| conserved hypothetical protein [Saccharomyces cerevisiae RM11-1a]
gi|259146968|emb|CAY80224.1| Set5p [Saccharomyces cerevisiae EC1118]
gi|323348221|gb|EGA82471.1| Set5p [Saccharomyces cerevisiae Lalvin QA23]
gi|365765161|gb|EHN06674.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 526
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------W--EKCYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|432945357|ref|XP_004083558.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oryzias latipes]
Length = 432
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/274 (23%), Positives = 116/274 (42%), Gaps = 25/274 (9%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ +R +L+ + + +++ S C C R+ ++ + A +C
Sbjct: 17 GRGLRVSRAFSVGELLFSCPAYSSVLSVTERGSCCDLCFRRKEGLARCGKCKKAFYCNTS 76
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASL 208
C+ A + +E A + ++C S+ + LV R+ D IL +
Sbjct: 77 CQKADWAMHRLECSAMVAFGENWCPSETSR---LVARILAKKKMQKGRCDSEKILLIGEM 133
Query: 209 SPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL---AQIRINAFRIELAGGL 265
L E+ R+ AG+ K L + ++L +Q+ N F IE
Sbjct: 134 QSHL--EDEDNEKKERTELDIAGLGRFFSKHLEVPSHKDLLTLFSQVACNGFTIE----- 186
Query: 266 YEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEE 325
+D LS +G A+Y + NH C P+ I+ + A++ A++D++ G+E
Sbjct: 187 -DDELSH----------MGTAVYPDVALINHSCLPSV-IVTFNGTLAQVRAVQDMKPGDE 234
Query: 326 LRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ I YID D R + L + + F CNCL C +
Sbjct: 235 VLISYIDLLYPTDDRNSRLRESYYFTCNCLECQN 268
>gi|171704581|sp|A6ZTB4.1|SET5_YEAS7 RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|151944152|gb|EDN62445.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 526
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFLISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------WEK--CYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|407394360|gb|EKF26913.1| hypothetical protein MOQ_009381 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DA 312
IN F I+ G E S A + + G A+Y L S +NH CDPN + +D D
Sbjct: 274 INGFIIQCPSG--EKAFS--APPLVEWVLKGGAVYTLQSVFNHSCDPNVTVSNVDGTHDI 329
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L LR V+ GEEL I YI ++ +G+ F C CLRC
Sbjct: 330 TLRTLRPVKCGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRC 374
>gi|71409651|ref|XP_807159.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871099|gb|EAN85308.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 397
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 51/105 (48%), Gaps = 5/105 (4%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DA 312
IN F ++ G E S A + + G A+Y L S +NH CDPN + +D D
Sbjct: 274 INGFIVQSPSG--EKAFS--AIPLVEWVLKGGAVYALQSAFNHSCDPNVSVSNVDGTHDI 329
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L LR V+ GEEL I YI ++ +G+ F C CLRC
Sbjct: 330 TLRTLRPVKSGEELTITYIPLENTTPEQRNEKLKGYFFTCRCLRC 374
>gi|395731585|ref|XP_003775930.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 1-like [Pongo abelii]
Length = 490
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWLPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|6322001|ref|NP_012077.1| Set5p [Saccharomyces cerevisiae S288c]
gi|731577|sp|P38890.1|SET5_YEAST RecName: Full=Putative protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|458924|gb|AAB69736.1| Yhr207cp [Saccharomyces cerevisiae]
gi|207344478|gb|EDZ71606.1| YHR207Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256272756|gb|EEU07728.1| Set5p [Saccharomyces cerevisiae JAY291]
gi|285810113|tpg|DAA06900.1| TPA: Set5p [Saccharomyces cerevisiae S288c]
gi|392299021|gb|EIW10116.1| Set5p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 526
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 90/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFLISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------W--EKCYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|328850110|gb|EGF99279.1| hypothetical protein MELLADRAFT_68699 [Melampsora larici-populina
98AG31]
Length = 479
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/75 (44%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAI 343
AI P+ NHDC PN ++D L ALRD+ GEEL YID S RD R+
Sbjct: 286 AIVFQPTRLNHDCRPNT-AYYVDGVTLTLHMHALRDIAPGEELTDSYIDTSTYRDERKQN 344
Query: 344 LTQGFGFQCNCLRCS 358
L + +GF C C CS
Sbjct: 345 LQKNYGFDCTCSICS 359
>gi|403413006|emb|CCL99706.1| predicted protein [Fibroporia radiculosa]
Length = 800
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 284 GNAIYMLPS-FYNHDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARD 338
G+ IY L S +NH C PNA +I ++ ++ALRD+ EGEE+ I Y+D ++
Sbjct: 551 GHGIYPLASRLFNHSCVPNAVAKYIISPFESVQMEVVALRDIAEGEEITIPYLDPALPLH 610
Query: 339 ARQAILTQGFGFQCNCLRCS 358
RQ L +GF C C CS
Sbjct: 611 TRQEALRLNYGFTCTCPLCS 630
>gi|426336314|ref|XP_004031420.1| PREDICTED: SET and MYND domain-containing protein 1 [Gorilla
gorilla gorilla]
Length = 459
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|390474255|ref|XP_002757614.2| PREDICTED: SET and MYND domain-containing protein 1 [Callithrix
jacchus]
Length = 442
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIRRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKKQLKKQYYFDCTCEHCQK 281
>gi|348517231|ref|XP_003446138.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 476
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 107/281 (38%), Gaps = 40/281 (14%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ AT+ A D+I + I S +C+ C R+ + Q A +C
Sbjct: 13 GRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD----CIDILQ 204
C+ A E + + SA Y + L+ + + + G+ +D +D L+
Sbjct: 73 CQRAGWA----EHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDELE 128
Query: 205 ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFR 258
P EL + + K+ IDD +++ I N F
Sbjct: 129 DHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDD----------ISHIFGVINCNGFS 178
Query: 259 IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
+ GL AVG ++ NHDC PN ++ +++ L AL
Sbjct: 179 VSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCTVI-LNHGKIELRALG 222
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ EGEEL + Y+D + RQ +L + F C C C +
Sbjct: 223 KIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 263
>gi|301773928|ref|XP_002922386.1| PREDICTED: SET and MYND domain-containing protein 1-like
[Ailuropoda melanoleuca]
Length = 490
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 63/308 (20%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPSESIRLAARILWRVER--- 116
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE----------------QMKFLNKQ 243
+ L+ ++A++E L++ + G +++ Q + + Q
Sbjct: 117 ----EGTGLTEGCLVAVDE----LQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQ 168
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ ++V I N F + GL AVG I+ NH+C PN
Sbjct: 169 YISHVFGVINCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHNCWPNCT 213
Query: 304 IMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
+++ + N +A L AL + EGEEL + YID + R+ L + + F
Sbjct: 214 VIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFD 273
Query: 352 CNCLRCSS 359
C C C
Sbjct: 274 CTCEHCQK 281
>gi|119597483|gb|EAW77077.1| SET and MYND domain containing 1, isoform CRA_a [Homo sapiens]
Length = 510
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|351696197|gb|EHA99115.1| SET and MYND domain-containing protein 1 [Heterocephalus glaber]
Length = 483
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 117/298 (39%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 4 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 64 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 113
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGID------DEQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 114 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDMFLQYWPPQSQQFSMQYISHIFGVIN 173
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 174 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 218
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 219 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 276
>gi|358378046|gb|EHK15729.1| hypothetical protein TRIVIDRAFT_228756 [Trichoderma virens Gv29-8]
Length = 583
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 134/322 (41%), Gaps = 48/322 (14%)
Query: 58 QKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLS 117
+ ++LCS K ++ S IQ + +AG G+F TR + +LI T+ P++T +
Sbjct: 67 KDKELCSYILSTIKAAEGSLS-IQESTIPNAGSGMFTTRDVEEGELIFTSVPLVTCAEVG 125
Query: 118 TLNSVCYFCLRKITSSSQHFQHHNARFC--GEV---------CKDNAKAFYDVERRADWS 166
C FC ++ + F RF GEV C+ +RA W
Sbjct: 126 PGMEACDFCFQQ---RRRVFHPVEDRFLQPGEVLPPLHICNGCRLYQYCSQSCSQRA-WD 181
Query: 167 AFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSA 226
+ Y C +++GA SAD ++ +L LIL M + ++ +
Sbjct: 182 TGHLY---------------ECGLLAGA-SAD----VETRTLYRLLIL-MRKKVLLPQQV 220
Query: 227 FKKAGIDDEQMKF--LNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIES----- 279
A +++E F K+ + VL R R + + E L+ S
Sbjct: 221 KALARLENEVANFEKRTKKSWPRVLNLAREAKERTKSELSIGEVLMLYGIVRCNSLPVDQ 280
Query: 280 ---EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
+G A+ M + NH CDPN I++ ++ ++ ALR +++GEEL CY D +
Sbjct: 281 TFRNAPLGIALDMGGALLNHCCDPNVVIVF-NSTQVQVRALRKIKDGEELLHCYRDIAYD 339
Query: 337 RDARQAILTQGFGFQCNCLRCS 358
R +T + F C C RC
Sbjct: 340 FTFRNPRITARYQFNCQCDRCK 361
>gi|38093643|ref|NP_938015.1| SET and MYND domain-containing protein 1 [Homo sapiens]
gi|34925329|sp|Q8NB12.1|SMYD1_HUMAN RecName: Full=SET and MYND domain-containing protein 1
gi|21750163|dbj|BAC03732.1| unnamed protein product [Homo sapiens]
gi|57997113|emb|CAI46139.1| hypothetical protein [Homo sapiens]
gi|116497127|gb|AAI26192.1| SET and MYND domain containing 1 [Homo sapiens]
gi|119597484|gb|EAW77078.1| SET and MYND domain containing 1, isoform CRA_b [Homo sapiens]
gi|190689781|gb|ACE86665.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|190691149|gb|ACE87349.1| SET and MYND domain containing 1 protein [synthetic construct]
gi|261858976|dbj|BAI46010.1| SET and MYND domain containing 1 [synthetic construct]
gi|313883064|gb|ADR83018.1| SET and MYND domain containing 1 (SMYD1) [synthetic construct]
Length = 490
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|48476975|gb|AAT44535.1| hypothetical rhabdomyosarcoma antigen MU-RMS-40.9A [Homo sapiens]
Length = 484
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 3 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 62
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 63 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 112
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 113 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 172
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 173 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 217
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 218 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 275
>gi|402891521|ref|XP_003908994.1| PREDICTED: SET and MYND domain-containing protein 1 [Papio anubis]
gi|355565873|gb|EHH22302.1| hypothetical protein EGK_05542 [Macaca mulatta]
gi|355751476|gb|EHH55731.1| hypothetical protein EGM_04994 [Macaca fascicularis]
Length = 490
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|340517499|gb|EGR47743.1| predicted protein [Trichoderma reesei QM6a]
Length = 535
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 118/296 (39%), Gaps = 51/296 (17%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
S P+++ S GR + AT+ +I + P++ PT++ L SVC CLR
Sbjct: 3 SSSPVEIRPHPSKGRALHATKSFPPGAVIFSFTPLLLLPTVACLTSVCSHCLRP-GEPRL 61
Query: 136 HFQHHNARFCGEVCKDNA-KAFYDVERRADWSAFNDYCRSQGLKYP---LLVKRLACMII 191
+ H A +C C+ A KA + E +A D R + L P L+ L I
Sbjct: 62 CSRCHAAAYCDATCQAAAWKAVHSRECKALTKTIRDEERRRMLPTPTRALIQALLWGKIR 121
Query: 192 SGAESADCIDILQPASLSPELILAMEEG--------FVMLRSAFKKAGIDDEQMKFLNKQ 243
G A L ++ EG + AF G D+ ++
Sbjct: 122 DGL-----------ADLEGHVLEKRAEGDEWRDIEMMAVAGCAFSGLGRGDQDVRM---- 166
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF--YNHDCDPN 301
+L +I+ N+F+ +SE+ V +Y+ P+ NH C PN
Sbjct: 167 -AAEMLCKIQNNSFQ-----------------RFDSELGVAG-LYLEPTLAMANHSCIPN 207
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + +I +A L+A + G+E+ I Y + R+ L + F C CLRC
Sbjct: 208 ASVQFIGR-NALLIAENPIRAGDEIEIAYTFYTDPLPKRREALAH-YKFTCQCLRC 261
>gi|260799834|ref|XP_002594889.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
gi|229280126|gb|EEN50900.1| hypothetical protein BRAFLDRAFT_124463 [Branchiostoma floridae]
Length = 463
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 114/291 (39%), Gaps = 50/291 (17%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ + GRG+ AT+ + +L+ A P S C FC + S+
Sbjct: 5 VEIFSSAEKGRGLCATKVFKPGNLVRAADPYAYVLCNSERGKRCDFCFARKDDMSRCSGC 64
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGL---KYPLLVKRLACMIISGAES 196
AR+C C+ +A W+ C+S + P RL II+ ++
Sbjct: 65 KFARYCDGKCQ-----------KAAWTEHKSECKSIKTVKPETPTDSIRLIARIINKTKT 113
Query: 197 ------ADCIDILQPASLSPELILAMEEGF----VMLRSAFKKAGIDDEQMKFLNKQWYT 246
+ ID LQ S E+ ++E F V+LR K +DD + F
Sbjct: 114 DSPGVPGNSIDELQ--SNLREMPENVKEMFAQLAVVLRMYVGKDVMDDAREIF------- 164
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
+ ++ N F I Y +G IY S +NH C+PN +
Sbjct: 165 ELFGRMTCNTFSICDPEMQY----------------IGIGIYPKMSLFNHSCEPNC-VAV 207
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + A+++++ GEEL I Y++ R+ L Q + F C C RC
Sbjct: 208 FNGLRMEVRAIQNIQPGEELLISYVEMLAMSSVRKQQLLQQYYFTCKCPRC 258
>gi|118379402|ref|XP_001022867.1| conserved hypothetical protein [Tetrahymena thermophila]
gi|89304634|gb|EAS02622.1| conserved hypothetical protein [Tetrahymena thermophila SB210]
Length = 418
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 41/76 (53%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
+G +Y ++ NH C PN ++ R++A+RD+E+GEE+ YID D R+
Sbjct: 151 GIGAGLYEEVNYMNHSCTPNVICVFNKLPQVRVIAIRDIEQGEEIMNSYIDTKKDLDFRR 210
Query: 342 AILTQGFGFQCNCLRC 357
L Q + F C C RC
Sbjct: 211 RFLKQNYFFLCECKRC 226
>gi|119597485|gb|EAW77079.1| SET and MYND domain containing 1, isoform CRA_c [Homo sapiens]
Length = 493
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 116/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|308811675|ref|XP_003083145.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116055024|emb|CAL57420.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 459
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 47/79 (59%), Gaps = 5/79 (6%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYI-DASMARD 338
+G A++ + S +NHDCDPNA M + ++A +D+ EGEE+ I Y+ D S +RD
Sbjct: 371 LGTALFSVQSGFNHDCDPNAEPMKGEQDITGACVIVARKDIAEGEEITISYLDDDSKSRD 430
Query: 339 ARQAILTQGFGFQCNCLRC 357
RQ L +GF C C RC
Sbjct: 431 ERQDALAD-YGFVCRCARC 448
>gi|349578759|dbj|GAA23924.1| K7_Set5p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 526
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/378 (23%), Positives = 146/378 (38%), Gaps = 72/378 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA +S E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--VSQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------WEK--CYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I ++ + + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF C C RC +
Sbjct: 414 RELRVNWGFLCQCDRCQN 431
>gi|348566407|ref|XP_003468993.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Cavia porcellus]
Length = 490
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 69/299 (23%), Positives = 117/299 (39%), Gaps = 43/299 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEIFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKHECSAIKRYG-----KVPNENVRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
L AL + GEEL + YID + R+ L + + F C C C G
Sbjct: 224 VKSMFHTQMRIELRALGKISVGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQKG 282
>gi|310799799|gb|EFQ34692.1| hypothetical protein GLRG_09836 [Glomerella graminicola M1.001]
Length = 430
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/100 (36%), Positives = 54/100 (54%), Gaps = 5/100 (5%)
Query: 263 GGLYEDLLSSAAASIESEIAVGNAIYMLP--SFYNHDCDPNAHIMWIDNADAR--LMALR 318
G D++ + + + +I G+ P S +NHDC PN +ID++D R A+R
Sbjct: 217 GQKVHDIMHTNSFELGLKITDGHHFGNYPEVSRFNHDCRPNV-AFYIDDSDLRHYTHAVR 275
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
D++ GEEL I Y+D+ AR RQ + +GF C C CS
Sbjct: 276 DIQAGEELTISYVDSLSARVVRQDRARRNWGFGCGCAHCS 315
>gi|30680748|ref|NP_849991.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|94707155|sp|Q9ZUM9.3|ASHR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR2; AltName:
Full=ASH1-related protein 2; AltName: Full=Protein SET
DOMAIN GROUP 39
gi|28393236|gb|AAO42047.1| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251813|gb|AEC06907.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 398
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 75/314 (23%), Positives = 127/314 (40%), Gaps = 66/314 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL----STLNSVCYFCLRKITSSSQ 135
++VA GR + A + +RA +I P++ + S+++ C C R + SS+
Sbjct: 13 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72
Query: 136 H---------------FQHHNARFCGEVCK---DNAKAFYD------VERRADWSAFNDY 171
F H C + + ++ AF D V+ R SA+N
Sbjct: 73 QKCQSCSLVSFCSPNCFASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLSAYNLA 132
Query: 172 CRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFV--MLRSAFKK 229
S P + L + SG+ + D P+ + + A GF+ +L S
Sbjct: 133 AAS-----PSDFQILLSLQGSGSSNGD------PSCSAGDSAAA---GFLHSLLSSVCPS 178
Query: 230 AGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYM 289
+ ++ +L++ ++NAF L+ + S E IY
Sbjct: 179 LPVS------ISPDLTAALLSKDKVNAF----------GLMEPCSVSNEKRSVRAYGIYP 222
Query: 290 LPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAI 343
SF+NHDC PNA ++D N D + + DV EG E+ + Y +M +RQ
Sbjct: 223 KTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKR 282
Query: 344 LTQGFGFQCNCLRC 357
L + +GF+C+C RC
Sbjct: 283 LLEDYGFKCDCDRC 296
>gi|18399103|ref|NP_565457.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
gi|14334524|gb|AAK59459.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|17979109|gb|AAL47498.1| putative SET-domain transcriptional regulator protein [Arabidopsis
thaliana]
gi|20197479|gb|AAD10162.2| putative SET-domain transcriptional regulator [Arabidopsis
thaliana]
gi|330251812|gb|AEC06906.1| histone-lysine N-methyltransferase ASHR2 [Arabidopsis thaliana]
Length = 341
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 75/314 (23%), Positives = 127/314 (40%), Gaps = 66/314 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL----STLNSVCYFCLRKITSSSQ 135
++VA GR + A + +RA +I P++ + S+++ C C R + SS+
Sbjct: 13 LRVAEIGGRGRSLVAAQSLRAGQVILRESPLLLYSAFPFLSSSVSPYCDHCFRLLASSAH 72
Query: 136 H---------------FQHHNARFCGEVCK---DNAKAFYD------VERRADWSAFNDY 171
F H C + + ++ AF D V+ R SA+N
Sbjct: 73 QKCQSCSLVSFCSPNCFASHTPWLCESLRRLHQSSSSAFSDQPSDRQVQARFLLSAYNLA 132
Query: 172 CRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFV--MLRSAFKK 229
S P + L + SG+ + D P+ + + A GF+ +L S
Sbjct: 133 AAS-----PSDFQILLSLQGSGSSNGD------PSCSAGDSAAA---GFLHSLLSSVCPS 178
Query: 230 AGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYM 289
+ ++ +L++ ++NAF L+ + S E IY
Sbjct: 179 LPVS------ISPDLTAALLSKDKVNAF----------GLMEPCSVSNEKRSVRAYGIYP 222
Query: 290 LPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAI 343
SF+NHDC PNA ++D N D + + DV EG E+ + Y +M +RQ
Sbjct: 223 KTSFFNHDCLPNACRFDYVDSASDGNTDIIIRMIHDVPEGREVCLSYFPVNMNYSSRQKR 282
Query: 344 LTQGFGFQCNCLRC 357
L + +GF+C+C RC
Sbjct: 283 LLEDYGFKCDCDRC 296
>gi|302672815|ref|XP_003026095.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
gi|300099775|gb|EFI91192.1| hypothetical protein SCHCODRAFT_238838 [Schizophyllum commune H4-8]
Length = 528
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 58/110 (52%), Gaps = 13/110 (11%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+LA+ NAF GL+ D+ + ESE+ +G A+Y+ S++NHDC PN +
Sbjct: 409 ILARDHANAF------GLF-DMQETG----ESEM-LGWAVYVAGSYFNHDCAPNVRKLRR 456
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + + RDV GEEL I Y+D + +R A Q + F C C RC
Sbjct: 457 GRA-LQFVTTRDVAPGEELCISYVDTQDTKASRAAQFAQHWNFVCGCGRC 505
>gi|297836508|ref|XP_002886136.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
gi|297331976|gb|EFH62395.1| hypothetical protein ARALYDRAFT_319740 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 59/278 (21%), Positives = 114/278 (41%), Gaps = 30/278 (10%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GR +F TR IR ++I + KP I P ++L S C C K + + +CG
Sbjct: 22 GRSLFTTRDIRPGEVILSQKPYICVPNNTSLESRCDGCF-KTNNLKKCSGCQVVWYCGSF 80
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRS----QGLKYPLLVKRLACMIISGAESADCIDILQ 204
C+ +++W C++ + K + + M+ +
Sbjct: 81 CQ-----------KSEWKLHRHECKALTRLEKEKRKFVTPTIRLMV----------KLYI 119
Query: 205 PASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
+L E +L + + + ++ + DE+ L Q V ++ + +
Sbjct: 120 KRNLQNEKVLPITSTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLREI 179
Query: 263 GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE 321
+ +A + +SE+ G ++ L S NH C PNA ++ + A + A+ ++
Sbjct: 180 AENFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNA-VLVFEEQMAVVRAMDNIS 238
Query: 322 EGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ E+ I YI+ + + RQ L + + F C C RCS+
Sbjct: 239 KDSEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSN 276
>gi|332373072|gb|AEE61677.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 73/149 (48%), Gaps = 14/149 (9%)
Query: 217 EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIE----LAGGLYEDLLSS 272
+EGF L + G F W NV ++ ++A + E L +Y+++ S
Sbjct: 219 QEGFQSLVALIGTNGQGVGSSPF--SSWIRNV-EKLNLSAEQKEQIDSLVDRIYDEMYSH 275
Query: 273 AAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYI 331
+ SE G +Y S NH CDPNA I ++ +N L+AL+D++ GEE+ I Y+
Sbjct: 276 TGDFLNSE---GVGLYARQSCANHSCDPNAEISFVHNNYRLSLIALKDIQPGEEICISYL 332
Query: 332 ---DASMARDARQAILTQGFGFQCNCLRC 357
D +R +R+ L + + F C C +C
Sbjct: 333 GDCDNERSRHSRRKTLMENYLFACECSKC 361
>gi|303279368|ref|XP_003058977.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460137|gb|EEH57432.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 322
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 41/75 (54%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + +NHDC PNA + R+ R V +GEEL I Y+D + R+ R+
Sbjct: 243 IGVGVYPSAAMFNHDCAPNAAQRFDAFGCVRVETTRRVRKGEELTIPYVDVMLGREERRG 302
Query: 343 ILTQGFGFQCNCLRC 357
L + F F+C C RC
Sbjct: 303 KLRKNFAFECACARC 317
>gi|224101385|ref|XP_002312257.1| SET domain protein [Populus trichocarpa]
gi|222852077|gb|EEE89624.1| SET domain protein [Populus trichocarpa]
Length = 542
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 70/281 (24%), Positives = 116/281 (41%), Gaps = 72/281 (25%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P+Q+ +E +GRG+FAT+ I A L+ K I T + + C
Sbjct: 188 PVQIKRSELSGRGLFATKNIDAGTLLLVTKAIATERGILSSEDSC--------------- 232
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMI--ISGAES 196
NAR W F D K +R +I +S E
Sbjct: 233 -ENARLVM------------------WKNFVDKVVDSATK----CERTHHLISTLSSGED 269
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
D ++ +PE+ L R ++ G +E+ L+K NVL +N+
Sbjct: 270 EDKLE-------APEMSL--------FRPEAEEIGELNEK---LDKVKILNVLD---VNS 308
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
L ED +S+ S+ G +++L SF NH C+PNA + + + + A
Sbjct: 309 --------LVEDSVSAKVLGRNSDY-YGVGLWVLASFINHSCNPNARRLHVGD-HVLVHA 358
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
RDV+ GEE+ Y D ++ +++ +++ +GF C+C RC
Sbjct: 359 SRDVKAGEEITFAYFDV-LSPLSKRNEMSKTWGFHCSCKRC 398
>gi|440801440|gb|ELR22460.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 545
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G A + L S NH C+PN + ++++ A + ALRD+ GEEL I YI + RQ
Sbjct: 406 GYAFFPLLSTLNHSCEPNCQVAYLEDGQALVFALRDIAAGEELSISYIYRHLPLAERQQQ 465
Query: 344 LTQGFGFQCNCLRCSS 359
L + +GF C C RC++
Sbjct: 466 L-RSYGFVCACPRCAA 480
>gi|401883282|gb|EJT47497.1| hypothetical protein A1Q1_03609 [Trichosporon asahii var. asahii
CBS 2479]
Length = 358
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 45/78 (57%), Gaps = 4/78 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICY-IDASMARDARQAI 343
A+Y NH C PNAH W +N + + ALRD+ +GEE+ I Y +D R R+A
Sbjct: 167 AVYPTLCRINHSCIPNAHHSWNENLEMETIHALRDISKGEEITIGYFLDG--CRAERRAH 224
Query: 344 LTQGFGFQCNCLRCSSGD 361
L FGF+CNC CS+ D
Sbjct: 225 LQNKFGFECNCEACSASD 242
>gi|303280161|ref|XP_003059373.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226459209|gb|EEH56505.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 740
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 33/90 (36%), Positives = 50/90 (55%), Gaps = 6/90 (6%)
Query: 274 AASIESEIAVGNAIYMLPSFYNHDCDPNA--HIMWIDNADA----RLMALRDVEEGEELR 327
++ + E VG A+Y S +NH C PN H +AD+ + A+R +++G+E
Sbjct: 323 SSGVPREERVGGAVYAKLSRFNHSCAPNCTWHFAPPGSADSPVSVTVKAIRPIKKGDEAT 382
Query: 328 ICYIDASMARDARQAILTQGFGFQCNCLRC 357
+CY+DAS RDAR+ L + F C C RC
Sbjct: 383 VCYLDASGGRDARRRRLWAEYRFACECDRC 412
>gi|407037947|gb|EKE38859.1| tetratricopeptide repeat-containing protein [Entamoeba nuttalli
P19]
Length = 425
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 74/302 (24%), Positives = 131/302 (43%), Gaps = 37/302 (12%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS---SSQHFQHHNARFC 145
G+GVF+ ++ + ++I PII + + C FCL+ + S ++++ + N
Sbjct: 126 GKGVFSKKKYQKEEIIMREIPIIQISKIEDYYNCCGFCLKSLYSKDPTNKYNKIFNQINK 185
Query: 146 GEVCKDNAKAFYDVER--RADW---------SAFNDYCRSQGLKYPLLVKRL-ACMIISG 193
EV K Y + + D+ +YC+ + +PL + ++ A + IS
Sbjct: 186 DEVVKCKCGMSYCNTKCQKLDYGHSLLCNKIKGLFNYCKKNNVSHPLCISKIFANIFISQ 245
Query: 194 --AESADCIDILQPASLSP---ELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+S + + L P E+I + F L F G + L N
Sbjct: 246 NIEQSLFPFIVFHSSPLIPFNSEIIPFFLDIFGSLLLKFNIYGGWTFIYQILYSVLKYNS 305
Query: 249 LAQIRINAFR-IELAGGLYED--LLSSA--------AASIESEIAVGNAIYMLPSFYNHD 297
+ + +N + I L L D LLSS+ A +IE+E ++ + NH
Sbjct: 306 SSILPLNPIQSILLNNSLQIDNTLLSSSLINHPDITAFTIEAE-----GLFKYLNTLNHS 360
Query: 298 CDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PN + D++ A L+A R + G+EL I YID ++ RQ++L + F C+C +
Sbjct: 361 CSPNCFLANTDDSCALSLIASRPISPGDELTISYIDNTLPYSQRQSLLYDSYHFYCHCPK 420
Query: 357 CS 358
C+
Sbjct: 421 CN 422
>gi|302144054|emb|CBI23159.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 108/280 (38%), Gaps = 70/280 (25%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+Q+ +E +GRG+FAT+ + A L+ K I T C + S+
Sbjct: 151 VQIMKSEISGRGLFATKNVDAGTLVLVTKAIATER--------CILPEQNDDSA------ 196
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMI--ISGAESA 197
DN + W F D K KRL +I +S E
Sbjct: 197 -----------DNIQLVM-------WKNFIDKVVESASK----CKRLHHLISVLSNGEDE 234
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D +++ PE E G M + K +L+ + +N+
Sbjct: 235 DVLEVPDVNLFRPE---TEESGLSMGKLDMGK------------------ILSILDVNS- 272
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL 317
L ED S+ S+ G +++LP+F NH C+PNA + + + + +
Sbjct: 273 -------LVEDATSAKVLGKNSDY-YGVGLWILPAFINHSCNPNARRLHVGD-NVIVHTS 323
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
RDV+ GEE+ Y D R+ + + +GFQCNC RC
Sbjct: 324 RDVKAGEEITFAYFDVLSPWRKRKD-MAKTWGFQCNCKRC 362
>gi|84180545|gb|ABC54714.1| histone methyltransferase SmyD1b [Danio rerio]
gi|190337458|gb|AAI63095.1| Smyd1b protein [Danio rerio]
Length = 473
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 70/296 (23%), Positives = 113/296 (38%), Gaps = 52/296 (17%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V + GRG+ AT+ A D++ P + S +S+C+ C R+ + Q
Sbjct: 4 VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A++C + C+ RA W C ++ G K P RLA I+ +
Sbjct: 64 RFAQYCDKTCQ-----------RAGWEEHKLECAAIKTYG-KPPSENVRLAARILWRMDK 111
Query: 197 ADCIDILQPASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFLNKQWYTN------- 247
Q + +S + +E E + S + FL+ W N
Sbjct: 112 -------QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLD-YWPRNSKPHTVD 163
Query: 248 ----VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+L I N F + GL AVG ++ NHDC PN
Sbjct: 164 SVSHILGVINCNGFMVSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCT 208
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ ++N L AL + GEE+ + Y+D RQ +L Q + F C C C+
Sbjct: 209 VI-LNNGKIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFFDCTCKHCTE 263
>gi|240274726|gb|EER38242.1| SET domain-containing protein 5 [Ajellomyces capsulatus H143]
gi|325091062|gb|EGC44372.1| SET domain-containing protein [Ajellomyces capsulatus H88]
Length = 363
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + + +D+ EGEE+ I Y+D + D RQA
Sbjct: 140 GAIFPRAARINHSCKPNSQNTWNQNLERLTIHSFKDIVEGEEVTIAYVDGTELYDERQAC 199
Query: 344 LTQGFGFQCNCLRCS 358
+ FGF+C C C+
Sbjct: 200 FEEAFGFRCQCEVCA 214
>gi|395853479|ref|XP_003799234.1| PREDICTED: SET and MYND domain-containing protein 1 [Otolemur
garnettii]
Length = 490
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 117/298 (39%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + +S + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVSVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQPFSIQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKAMFHTQMRIELRALGKISEGEELTVSYIDFLNVTEERRKQLKKQYYFDCACEHCQK 281
>gi|348670644|gb|EGZ10465.1| hypothetical protein PHYSODRAFT_518318 [Phytophthora sojae]
Length = 437
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 76/305 (24%), Positives = 119/305 (39%), Gaps = 57/305 (18%)
Query: 79 PIQVALTESAGRGVFATRRIRASD--LIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
P + L GR A+ +RA D LI +A + P S C +C S+
Sbjct: 3 PYREVLDAIKGRCAVASTSLRAGDRVLITSAVCAASSP------SSCGWCFAPGDVFSRC 56
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYP----LLVKRLACM 189
AR+C C+ DW C RS + P LLV RLA
Sbjct: 57 SACRKARYCSRACQQR-----------DWPQHRHECAAWRSIPERNPSPTVLLVARLAAK 105
Query: 190 IISGA----ESADCIDILQP-----ASLSPELILAMEEGFVMLRSAFK-----KAGIDDE 235
+ G+ E + + L+ L M + ++L S +K K DE
Sbjct: 106 LFLGSQVDQEEKNGVLKLRDHLADHTELKRHQFDEMTQLVLLLLSRYKAEKREKNATLDE 165
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
+ L + + ++ NAF S A +E AVG +Y + +N
Sbjct: 166 LHRDLELE-ILKLFGRVNCNAF--------------SVANEFTNE-AVGIGLYPEGALFN 209
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
HDCDPN ++ + ++ +RD+E EEL + Y++ + AR+ L + + F C C
Sbjct: 210 HDCDPNC-VVSFKGREMQVRVVRDIEVDEELTVSYVELLQSTKARRRELKESYFFDCECK 268
Query: 356 RCSSG 360
RC +
Sbjct: 269 RCKAA 273
>gi|392586861|gb|EIW76196.1| MAS20-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 674
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 23/159 (14%)
Query: 217 EEGFVMLRSAFKKA--GIDDEQMKFLNKQWYTNVLAQIRINAFRI--------ELAGGLY 266
EE +R K A G++ F+ + Y +L ++ N+F + + G
Sbjct: 452 EEKLKGVRDVLKNALPGLE----TFVTDERYATMLGKMAYNSFGVVFGEGRDGKPVGERP 507
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEEL 326
EDL + + S VG A + + ++ +H C PNAH + + L+A RDV++G+EL
Sbjct: 508 EDLERTRTPAGTSR-QVGTAFFAVSAYLSHSCAPNAHPSFNGTTELSLVAERDVKKGDEL 566
Query: 327 RICYIDASMARDARQA--------ILTQGFGFQCNCLRC 357
+ Y+D S A L +G+ F C C RC
Sbjct: 567 NVSYVDLSSHEGETPAEARRRRRFELARGWKFACPCTRC 605
>gi|225444416|ref|XP_002265832.1| PREDICTED: uncharacterized protein LOC100253788 [Vitis vinifera]
Length = 550
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 68/280 (24%), Positives = 108/280 (38%), Gaps = 70/280 (25%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+Q+ +E +GRG+FAT+ + A L+ K I T C + S+
Sbjct: 198 VQIMKSEISGRGLFATKNVDAGTLVLVTKAIATER--------CILPEQNDDSA------ 243
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMI--ISGAESA 197
DN + W F D K KRL +I +S E
Sbjct: 244 -----------DNIQLVM-------WKNFIDKVVESASK----CKRLHHLISVLSNGEDE 281
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D +++ PE E G M + K +L+ + +N+
Sbjct: 282 DVLEVPDVNLFRPE---TEESGLSMGKLDMGK------------------ILSILDVNS- 319
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL 317
L ED S+ S+ G +++LP+F NH C+PNA + + + + +
Sbjct: 320 -------LVEDATSAKVLGKNSDY-YGVGLWILPAFINHSCNPNARRLHVGD-NVIVHTS 370
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
RDV+ GEE+ Y D R+ + + +GFQCNC RC
Sbjct: 371 RDVKAGEEITFAYFDVLSPWRKRKD-MAKTWGFQCNCKRC 409
>gi|195393214|ref|XP_002055249.1| GJ18896 [Drosophila virilis]
gi|194149759|gb|EDW65450.1| GJ18896 [Drosophila virilis]
Length = 506
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 68/292 (23%), Positives = 121/292 (41%), Gaps = 32/292 (10%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPII---THPTLSTLNSVCYFCLR 128
P + P +V +++ AGRGV ATR+++ ++I P++ TLN+ C CL+
Sbjct: 17 PFKDKDPAWEVGVSKIAGRGVMATRQLKRGEIIFQDSPLLIGLAAQEEDTLNA-CSICLK 75
Query: 129 KITSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
+ + F CG +C AK + ++D F + ++ ++ RL
Sbjct: 76 MLPDT--RFMCRQG--CGLPICSLCAKK---KQHKSDCDMFKSWGPNEPDVANSVIIRLL 128
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
C+ + + D D++ L L FK D + ++ +N+
Sbjct: 129 CVARAINLTKDQRDLIY--CLQANLDNNHRTEVRNAAKCFKNFPTDKKLVEIMNR----- 181
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+A +R N F + + +++ A+Y L + NHDC PN++ +
Sbjct: 182 TVAVLRTNGFD------------KTTDRTTDNQEFSYRALYPLFAVMNHDCIPNSYYTFE 229
Query: 308 DNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + ++ A D+ EGEE+ Y AR L GF C C RCS
Sbjct: 230 EKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLYLKMKKGFTCKCPRCS 281
>gi|410955280|ref|XP_003984284.1| PREDICTED: SET and MYND domain-containing protein 1 [Felis catus]
Length = 490
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 117/298 (39%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + ++ + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVAVDDLQNHVEHFGEEEQKELRVDVDTFLQYWPPQSQQFSMQYISHIFGVIS 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|328873515|gb|EGG21882.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 438
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 18/115 (15%)
Query: 247 NVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
NVL + R NAF I+ GG E + G +Y+ SF+NH C+PN +
Sbjct: 303 NVLLRNRCNAFYIQGRPKNGGNGE--------------SRGCGVYVRNSFFNHSCNPNVN 348
Query: 304 IMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++N + +R V +GEEL I YID + + R+ L++G+ F C C +C
Sbjct: 349 YWVVENTLEVECSLMRAVRKGEELCISYIDTAASLRDRREKLSEGYLFHCRCEKC 403
>gi|330793186|ref|XP_003284666.1| hypothetical protein DICPUDRAFT_75630 [Dictyostelium purpureum]
gi|325085364|gb|EGC38772.1| hypothetical protein DICPUDRAFT_75630 [Dictyostelium purpureum]
Length = 468
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 46/78 (58%), Gaps = 4/78 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADAR--LMALRDVEEGEELRICYIDASMARDARQ 341
G I+ L + NH C+PN ++ + N+D R +MALR +E GEEL YID + D RQ
Sbjct: 391 GYGIFGLQAMVNHSCEPNINVAF-SNSDNRAHIMALRRIEAGEELYHSYIDEELPYDIRQ 449
Query: 342 AILTQGFGFQCNCLRCSS 359
L +GF+C C +C +
Sbjct: 450 EDLVT-YGFKCECRKCKA 466
>gi|327275307|ref|XP_003222415.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Anolis carolinensis]
Length = 472
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 113/299 (37%), Gaps = 56/299 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V T+ GRG+ A + A D+I + S + VC+ C ++ + Q
Sbjct: 4 VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMII----- 191
A +C C+ +A W + C + G K P RLA I+
Sbjct: 64 KFAHYCDRTCQKDA-----------WVNHKNECAAIKKHG-KAPNENIRLAARIMWRIER 111
Query: 192 -SGAESADC---IDILQ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
G + C ID LQ EL L +E S + D +Q
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLE-------SFLEFWPHDGKQFAM-- 162
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++L I N F + GL AVG I+ NHDC PN
Sbjct: 163 -QYISHILGVINCNGFTLSDQRGLQ---------------AVGVGIFPNLCLVNHDCWPN 206
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+++ +N L AL + GEEL + Y+D + R+ L + + F C C C G
Sbjct: 207 CTVIF-NNGKIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYYFDCTCEHCQKG 264
>gi|170115248|ref|XP_001888819.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164636295|gb|EDR00592.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 374
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/107 (33%), Positives = 51/107 (47%), Gaps = 8/107 (7%)
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNAD 311
R NA IEL D +A A +++ + NH C PNA I W +
Sbjct: 133 RTNALSIELKFPPEMDAYGESAKKY-------GAAFLILNRANHSCGPNAAIKWNLPTLT 185
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A + ALR + GEE+ Y+D S+ R R A L + +GF C+C C+
Sbjct: 186 ASMYALRPILAGEEITKTYVDPSLPRSQRIAHLQENYGFTCDCQWCN 232
>gi|224132628|ref|XP_002327842.1| SET domain protein [Populus trichocarpa]
gi|222837251|gb|EEE75630.1| SET domain protein [Populus trichocarpa]
Length = 398
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 67/158 (42%), Gaps = 24/158 (15%)
Query: 227 FKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG-----LYEDLLSSAAASIESEI 281
K A D+E + + Y N++ +N + +A LY D L +I
Sbjct: 238 LKAAIFDEECEPLFSLEIYGNIIGMFELNNLDLVVASPVEDYFLYIDDLPDPEKEKAEKI 297
Query: 282 A---------------VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEG 323
A G A Y L S NH C PNAH D + A ++ L+ + +G
Sbjct: 298 ARQLLDALGDDYSICCQGTAFYPLQSCMNHSCCPNAHAFKRDEDRDGQAAIITLKPIRKG 357
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
EE+ + YID + + RQA+L +GF+C C C D
Sbjct: 358 EEVTVSYIDEDLPFEDRQALLAD-YGFKCRCNACLEQD 394
>gi|328770914|gb|EGF80955.1| hypothetical protein BATDEDRAFT_88226 [Batrachochytrium
dendrobatidis JAM81]
Length = 227
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 55/113 (48%), Gaps = 15/113 (13%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
+++++I N F I G E +G A++ S++NH C PN +
Sbjct: 74 HIISRIESNCFGIWKPNG--------------KEACMGRAVFPAASYFNHSCFPNCQSIK 119
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
D+ A L+DV +GE L I YID +M AR+A L + F+C C RC S
Sbjct: 120 HDHKMA-FRTLKDVSKGEMLTISYIDTNMPVSARRARLMDDYFFECMCERCIS 171
>gi|149640598|ref|XP_001510487.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Ornithorhynchus anatinus]
Length = 490
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 72/304 (23%), Positives = 120/304 (39%), Gaps = 53/304 (17%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +ES GRG+ AT+ A+DLI + + S + VC+ C ++ + Q
Sbjct: 9 VEVFTSESKGRGLKATKEFWAADLIFSERAYSAVVFDSLTHLVCHTCFKRQEKLQRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDA-----------WLNHKNECSAIKKHG-KAPNENIRLAARIMWRIER 116
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE--------QMKFLNKQWYTNV 248
+ + + +S + + + F K+ +D E Q + + Q+ +++
Sbjct: 117 -EGTGLTEGCLVSIDDLQNHVDSFG--EEEQKELRLDVESFLHFWPPQSQQFSMQYISHI 173
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
I N F + GL AVG ++ NHDC PN +++ +
Sbjct: 174 FGVINCNGFTLSDQRGLQ---------------AVGVGVFPNLCLVNHDCWPNCTVIFNN 218
Query: 309 -NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
N +A L AL + EGEEL + YID R+ +L + + F C C
Sbjct: 219 GNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSADRKKLLKKQYYFDCTCEH 278
Query: 357 CSSG 360
C G
Sbjct: 279 CEKG 282
>gi|409080894|gb|EKM81254.1| hypothetical protein AGABI1DRAFT_56703, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 305
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/68 (42%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 295 NHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C PNA W + N ++L ALR ++ EE+ I Y D + RD R+A L + F C+
Sbjct: 111 NHSCGPNAAHKWDLTNLSSKLYALRPIQPNEEITIFYTDITQPRDTRRAELNRNHRFLCS 170
Query: 354 CLRCSSGD 361
C CS GD
Sbjct: 171 CPHCSPGD 178
>gi|169619746|ref|XP_001803285.1| hypothetical protein SNOG_13071 [Phaeosphaeria nodorum SN15]
gi|160703895|gb|EAT79398.2| hypothetical protein SNOG_13071 [Phaeosphaeria nodorum SN15]
Length = 381
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDC PNA W ++ + ALRD++ GEE+ I YID R R L + +GF C
Sbjct: 189 NHDCRPNAAYFWDEDMMTHYVHALRDIQPGEEITITYIDNEKDRKTRNTRLKKNWGFDCG 248
Query: 354 CLRCSS 359
C C++
Sbjct: 249 CSACTA 254
>gi|443719695|gb|ELU09739.1| hypothetical protein CAPTEDRAFT_159764 [Capitella teleta]
Length = 338
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 55/99 (55%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEG 323
LYED+ + I E G+ +Y+L S NH C PNA I + +N+ + AL +++ G
Sbjct: 218 LYEDIDKLSGDFINCE---GSGLYLLQSTCNHSCMPNAEITFPHNNSTLAVKALSNIKTG 274
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F CNC +C S
Sbjct: 275 EEICISYLDECALERSRHSRHKILRENYLFNCNCSKCES 313
>gi|427782607|gb|JAA56755.1| Putative set and mynd domain protein [Rhipicephalus pulchellus]
Length = 423
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 259 IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
+E+ G + +++S + A+G +YM PS +H C PNAH ++ + L A
Sbjct: 155 LEIYGKM---IINSYCICNDEHTAIGTGLYMGPSILDHSCSPNAHAVY-EGHKLHLRAAE 210
Query: 319 DVE--EGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
D+ + +R+ YID + RQ L + F+CNC++CS
Sbjct: 211 DINCSNFDGIRVSYIDVMAPKKVRQEELRSQYYFECNCMKCS 252
>gi|390365247|ref|XP_003730778.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 487
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 1/78 (1%)
Query: 281 IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
IA+ + IY+ S NH CD N I+ D +L ++DV+EGEE I Y+D R
Sbjct: 192 IAISDGIYLRASMVNHSCDYNC-IVVFDERKLQLRTVKDVKEGEECTIGYVDVIHPAKER 250
Query: 341 QAILTQGFGFQCNCLRCS 358
+A L + + F C C++C+
Sbjct: 251 RAELEEKYHFTCKCVKCN 268
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 45/89 (50%), Gaps = 3/89 (3%)
Query: 270 LSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRI 328
+ S A +E+ N S NH CDPN W+ D ++M ++DV+EGEE I
Sbjct: 342 IQSQAWKKAAEMGQLNIESYKASMLNHSCDPNC--AWVSDGRKIQIMTVKDVKEGEECTI 399
Query: 329 CYIDASMARDARQAILTQGFGFQCNCLRC 357
Y+D RQA L + + F C C++C
Sbjct: 400 TYVDVMDPAKVRQADLKERYHFTCKCVKC 428
>gi|328866132|gb|EGG14518.1| hypothetical protein DFA_12294 [Dictyostelium fasciculatum]
Length = 561
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 70/135 (51%), Gaps = 12/135 (8%)
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIE-----SEIAVGNAIY 288
+E K + ++Y +L I N+ + G E ++++ A + + + +G+A+
Sbjct: 426 EELEKLFSNKFYDEILGLINYNSISTFVYGE--ETVINTGAKNKKMQPKLRQYCLGSALI 483
Query: 289 MLPSFYNHDCDPNAHIM-----WIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
+ + NH C PN + + A A ++A +D+++G+EL YID S RQ+I
Sbjct: 484 PIFACLNHSCAPNIEMGREQRDGVTKAIAVMIAKQDIKKGQELLTSYIDESNPFKDRQSI 543
Query: 344 LTQGFGFQCNCLRCS 358
L+ +GF CNC +CS
Sbjct: 544 LSSQYGFTCNCNKCS 558
>gi|440471791|gb|ELQ40736.1| ankyrin repeat protein [Magnaporthe oryzae Y34]
gi|440479502|gb|ELQ60265.1| ankyrin repeat protein [Magnaporthe oryzae P131]
Length = 2018
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
Query: 292 SFYNHDCDPNAHIMWID--NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
S NH+C PN + D R+ A+RD+ GEEL + YID R+ARQ L + +G
Sbjct: 1688 SRMNHECSPNC-AAYFDPMTMSQRIYAIRDIMPGEELTVSYIDPVQTREARQNRLRKDWG 1746
Query: 350 FQCNCLRCSS 359
F C+C RC+S
Sbjct: 1747 FGCSCQRCTS 1756
>gi|261189795|ref|XP_002621308.1| SET domain-containing protein 5 [Ajellomyces dermatitidis SLH14081]
gi|239591544|gb|EEQ74125.1| SET domain-containing protein 5 [Ajellomyces dermatitidis SLH14081]
Length = 363
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + +D+E GEE+ I Y+D + DARQ+
Sbjct: 140 GAIFPRTARINHSCKPNSQNTWNRNLERLTIHTFKDIEAGEEITIAYVDGTELFDARQSC 199
Query: 344 LTQGFGFQCNCLRCS 358
+ FGF+C C C+
Sbjct: 200 FGEAFGFRCACEVCA 214
>gi|327352101|gb|EGE80958.1| SET domain-containing protein 5 [Ajellomyces dermatitidis ATCC
18188]
Length = 363
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + +D+E GEE+ I Y+D + DARQ+
Sbjct: 140 GAIFPRAARINHSCKPNSQNTWNRNLERLTIHTFKDIEAGEEITIAYVDGTELFDARQSC 199
Query: 344 LTQGFGFQCNCLRCS 358
+ FGF+C C C+
Sbjct: 200 FGEAFGFRCACEVCA 214
>gi|258569357|ref|XP_002543482.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903752|gb|EEP78153.1| predicted protein [Uncinocarpus reesii 1704]
Length = 369
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
I+ + NH C NA W N + A RD+EEGEE+ I YID S + RQ L
Sbjct: 145 GIFPRAARINHSCKQNAQNSWNYNIGKLTIHAFRDIEEGEEITIAYIDGSEYFETRQNTL 204
Query: 345 TQGFGFQCNCLRC 357
+ FGF+C C C
Sbjct: 205 EEAFGFKCRCEIC 217
>gi|357145110|ref|XP_003573528.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 1
[Brachypodium distachyon]
gi|357145113|ref|XP_003573529.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like isoform 2
[Brachypodium distachyon]
Length = 389
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 116/298 (38%), Gaps = 50/298 (16%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPT----LSTLNSVCYFCLRKITSSSQ 135
++VA GRG+ A R I +++ + PI+ +P+ L + S C+ CL + ++
Sbjct: 6 LRVADLPGRGRGLVAARDILEGEVLLSEPPILLYPSSLASLPSYCSACFRCLPQAPHAAP 65
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQ--------GLKYPLLVKRLA 187
A FC C + + +A + + Q L+ P L+
Sbjct: 66 CPSCRAAAFCSPACAAASHPRLLCAALSRLAAAPESHQEQLLFLLSAYSLQEPAFHALLS 125
Query: 188 -CMIISGAESADCIDI-LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWY 245
G + D + +SL+P +L AG + L +
Sbjct: 126 LSSAPQGTQQQDAASLHAMVSSLAPPHML--------------PAGFSPDLTAALLSKDR 171
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HI 304
TN + + +R ++A GL + A A+Y S NHDC PNA H
Sbjct: 172 TNSFSIM--EPYRPDVAQGLRK--------------ARAYAVYHRASLLNHDCLPNACHF 215
Query: 305 MWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ D N D L AL + G E+RI Y A+ RQ L + +GF+C C RC
Sbjct: 216 DYPDRPGPGNTDIVLRALHGITAGMEVRISYFAANWRYADRQRRLLEDYGFRCECERC 273
>gi|239612927|gb|EEQ89914.1| SET domain-containing protein 5 [Ajellomyces dermatitidis ER-3]
Length = 363
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + + +D+E GEE+ I Y+D + DARQ+
Sbjct: 140 GAIFPRAARINHSCKPNSQNTWNRNLERLTIHTFKDIEAGEEITIAYVDGTELFDARQSC 199
Query: 344 LTQGFGFQCNCLRCS 358
+ FGF+C C C+
Sbjct: 200 FGEAFGFRCACEVCA 214
>gi|156839743|ref|XP_001643559.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
gi|171770010|sp|A7TPV3.1|SET5_VANPO RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|156114175|gb|EDO15701.1| hypothetical protein Kpol_1000p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 499
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 122/308 (39%), Gaps = 59/308 (19%)
Query: 89 GRGVFATRRIRASDLI-HTAKPIITHP-----TLSTLNSVCYFCLRKITSSSQHFQHHNA 142
GRG++A + DLI + PI+ P TL + C C ++ S+HF ++
Sbjct: 125 GRGLYALKDFVKDDLIFKESYPIVVVPSMERLTLMKMGKACSLCGGLLSHLSKHFIVVHS 184
Query: 143 RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDI 202
C DN W + + C++ + + L K L I+G +S
Sbjct: 185 LDC-----DNCGGV--------WCS--NECKNIDIGHNTL-KHLHKSKINGIDSV-AWGK 227
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
+ + A G +M + K D + KF ++V IRIN
Sbjct: 228 FEKYCNDNVFVAAYSIGVIMALALIDKKNTDKIRKKF---NLLSSVSQGIRINESDSTNI 284
Query: 263 GGLYE-------------------DLLSSAAAS-----IESEIA---------VGNAIYM 289
GG ++ +LL +A + +E+ ++ + + ++
Sbjct: 285 GGTFDASSGAMSNKDPEPIWKTSYELLKNAIPTTDELDMETYLSYIGRYNINQISDQMFF 344
Query: 290 LPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
LPS NH+C+PN + N + R+ A +++ G+EL YI+ R+ L +G
Sbjct: 345 LPSLINHNCEPNVRFEVVSNKEIRVYARKNISAGQELLTNYINPLHGVKLRRRELRVNYG 404
Query: 350 FQCNCLRC 357
F C+C RC
Sbjct: 405 FLCHCDRC 412
>gi|403354753|gb|EJY76937.1| TPR repeat-containing protein [Oxytricha trifallax]
Length = 765
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELR 327
+LL + E +I NAI + SF+NHDC N I +A A ++A +D+++GEEL
Sbjct: 539 ELLKAFERKEEEKIEYINAIQPILSFFNHDCYANTSRFSIGDA-AFIVAKKDIKKGEELT 597
Query: 328 ICYIDASMARDARQAILTQGFGFQCNCLRC 357
YI ++ D R+ + + +GF+C C C
Sbjct: 598 QFYISLALPFDEREQLTQKAWGFECRCNSC 627
>gi|73980306|ref|XP_852104.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Canis lupus familiaris]
Length = 490
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 121/308 (39%), Gaps = 63/308 (20%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S +N VC+ C ++ Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER--- 116
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE----------------QMKFLNKQ 243
+ L+ ++A+++ L++ + G +++ Q + + Q
Sbjct: 117 ----EGTGLTEGCLVAVDD----LQNHVEHFGEEEQKELRLDVDTFLQYWPPQGQQFSMQ 168
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ +++ I N F + GL AVG I+ NHDC PN
Sbjct: 169 YISHIFGVISCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCT 213
Query: 304 IMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
+++ + N +A L AL + EGEEL + YID + R+ L + + F
Sbjct: 214 VIFNNGNHEAVKSMFHTQMRIELRALGKISEGEELTVSYIDFLNLSEDRKKQLKKQYYFD 273
Query: 352 CNCLRCSS 359
C C C
Sbjct: 274 CTCEHCQK 281
>gi|57997548|emb|CAI46077.1| hypothetical protein [Homo sapiens]
Length = 490
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 115/298 (38%), Gaps = 43/298 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V E GRG+ AT+ A+D+I + S +N VC+ C ++ + Q
Sbjct: 9 VEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
A +C C+ +A + + + SA Y K P RLA I+ E +
Sbjct: 69 KFAHYCDRTCQKDAW----LNHKNECSAIKRYG-----KVPNENIRLAARIMWRVER-EG 118
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLAQIR 253
+ + + + + E F + +D Q + + Q+ +++ I
Sbjct: 119 TGLTEGCLVPVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQYISHIFGVIN 178
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADA 312
N F + GL AVG I+ NHDC PN +++ + N +A
Sbjct: 179 CNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTVIFNNGNHEA 223
Query: 313 -----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + YID + R+ L + + F C C C
Sbjct: 224 VKSMFHTQMRIELRALGKISEGEELTVSYIDFLNVSEERKRQLKKQYYFDCTCEHCQK 281
>gi|302408415|ref|XP_003002042.1| Mcg1p [Verticillium albo-atrum VaMs.102]
gi|261358963|gb|EEY21391.1| Mcg1p [Verticillium albo-atrum VaMs.102]
Length = 103
Score = 60.1 bits (144), Expect = 2e-06, Method: Composition-based stats.
Identities = 38/107 (35%), Positives = 51/107 (47%), Gaps = 15/107 (14%)
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+ NAF++++ G AA V AI L NHDC PNA W D
Sbjct: 9 VATNAFQVDVGG--------DAAGGHHHHFGVFPAISRL----NHDCGPNA-AAWTGKTD 55
Query: 312 A--RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
R A +D+ GEE+ I Y+DA R R+A +T +GF+C C R
Sbjct: 56 LVHRAFASKDIAAGEEISISYVDALAPRAERRARMTGSWGFECACRR 102
>gi|194768765|ref|XP_001966482.1| GF21975 [Drosophila ananassae]
gi|190617246|gb|EDV32770.1| GF21975 [Drosophila ananassae]
Length = 445
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/291 (22%), Positives = 111/291 (38%), Gaps = 37/291 (12%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
P + ++ AGRGVFATR I A +LI + ++T PT C ++ S
Sbjct: 38 PNWTIKSSKVAGRGVFATRDIAAGELIFRERALVTGPTARKGQLSSCVCCHQMLPQSGFL 97
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
H R VC D + A RA+ F + + V L+ I++
Sbjct: 98 CRH--RCTLPVCADCSDAEI---HRAECEHFRRWQPKDADEELEQVNPLSLRILTA---- 148
Query: 198 DCIDILQPASLSPELILAME--------EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
+ + + L+ AM+ + + F+ D + F+++ + ++
Sbjct: 149 --VRVFHLSKEQRHLVDAMQANAQRGYRQEIIKAAQCFRNFPTTDRE--FMDQLF--RIV 202
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+ NAF G +E LL ++ L + NH+C PNA + +
Sbjct: 203 GVLNTNAFEAPCRSGGHEVLL--------------RGLFPLTAIMNHECTPNASHYFDNG 248
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
A + A RD+ +G E+ Y R L F C+C+RC+
Sbjct: 249 TLAVVRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKNFNCDCVRCNDN 299
>gi|389746511|gb|EIM87691.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 552
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/130 (29%), Positives = 63/130 (48%), Gaps = 14/130 (10%)
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLS-----SAAASIESEIAVGNAIYMLPSFYNHDCD 299
Y + A ++ A+ I A L DL+S S + S +G ++ L + NH CD
Sbjct: 181 YLGLDAPGQLGAYGITTAADLV-DLISRFITNSITLTAPSLTPLGVSVSPLVALINHSCD 239
Query: 300 PNAHIMWIDNAD--------ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
PNA +++ + +++A++++ +E+ YID +M R RQA LT + F
Sbjct: 240 PNAVVVYPRSTSDPSQQEPQMQVVAIKNIAPDQEIFTAYIDTTMPRGHRQAALTTTYNFT 299
Query: 352 CNCLRCSSGD 361
C C CS D
Sbjct: 300 CKCSLCSQPD 309
>gi|113205415|gb|AAU90317.2| TPR domain containing protein, putative [Solanum demissum]
Length = 522
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/295 (24%), Positives = 120/295 (40%), Gaps = 74/295 (25%)
Query: 70 GKPSQPSP--PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCL 127
GKP + + I++ + +GRG+FAT+ + +L+ K +
Sbjct: 152 GKPPELAEYIGSIEIKKSGISGRGLFATKNLDCGNLLLVTKAVAVE-------------- 197
Query: 128 RKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRAD---WSAFNDYCRSQGLKYPLLVK 184
R I S F D + +A W F D + +K + +
Sbjct: 198 RAIVPES--------------------VFQDSKEQAQLDMWKNFIDKI-LESIKKCVRTR 236
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
L C + +G E+ D +++ P++ L EG S F ID E++
Sbjct: 237 DLICKLSNG-ENEDDLEV-------PDIDLFRPEG--EDSSTFHDKKIDKEKL------- 279
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
L + +N+ L E+L+S+ S++ G +++L SF NH CDPN
Sbjct: 280 ----LNILDVNS--------LVEELISAKVLGKNSDVH-GIGLWILSSFINHSCDPNVRR 326
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDA-SMARDARQAILTQGFGFQCNCLRCS 358
+ + + A RD++ GEEL Y D S RD + T+ +GF C C RC+
Sbjct: 327 SHVGDY-VMIHACRDIKAGEELTFAYFDVFSPFRDREEK--TKSWGFVCKCKRCN 378
>gi|392862338|gb|EAS37026.2| hypothetical protein CIMG_02225 [Coccidioides immitis RS]
Length = 497
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G A+ L S NH CDPNA + + D L AL+ +EEGE++ + Y++ + A RQA
Sbjct: 206 IGAAVEPLASLCNHSCDPNAAVDF-DKGKTWLRALQHIEEGEQIFVSYVEPTDACLHRQA 264
Query: 343 ILTQGFGFQCNCLRC 357
L++ + F+C C RC
Sbjct: 265 ELSKRYYFECECPRC 279
>gi|303321560|ref|XP_003070774.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240110471|gb|EER28629.1| MYND finger family protein [Coccidioides posadasii C735 delta
SOWgp]
gi|320040248|gb|EFW22181.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 497
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G A+ L S NH CDPNA + + D L AL+ +EEGE++ + Y++ + A RQA
Sbjct: 206 IGAAVEPLASLCNHSCDPNAAVDF-DKGKTWLRALQHIEEGEQIFVSYVEPTDACLHRQA 264
Query: 343 ILTQGFGFQCNCLRC 357
L++ + F+C C RC
Sbjct: 265 ELSKRYYFECECPRC 279
>gi|390363607|ref|XP_001199204.2| PREDICTED: N-lysine methyltransferase SMYD2-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 281 IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
I + +AIY+ S NH CD N I+ D +L ++DV+EGEE I Y+D R
Sbjct: 257 IVISDAIYLRASMANHSCDYNC-IVVFDERKLQLRTVKDVQEGEECTIGYVDVIHPAKER 315
Query: 341 QAILTQGFGFQCNCLRC 357
+A L + + F C C++C
Sbjct: 316 RAELEEKYHFTCKCVKC 332
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 35/63 (55%), Gaps = 1/63 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH CD N ++ D +L ++DV+EGEE I Y+D RQA L + + F C C
Sbjct: 3 NHSCDYNCAGVF-DGMKLQLRTIKDVKEGEECTISYVDVINPAKERQAKLEEEYHFTCKC 61
Query: 355 LRC 357
++C
Sbjct: 62 VKC 64
>gi|328702715|ref|XP_001948576.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Acyrthosiphon pisum]
Length = 387
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 15/124 (12%)
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV------GNAIYMLPSFYNH 296
QW N + + L+E + SA ++E + V G+A++ S NH
Sbjct: 248 QWANNCKKHLSPDQME------LFEQFIDSAYDAMEQVVGVDFLDNEGSALFQNHSSINH 301
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYI---DASMARDARQAILTQGFGFQCN 353
C PNA ++ N RL+A+R +E G+E+ I YI + +R RQ L + + F C
Sbjct: 302 SCFPNAASVFDGNHVLRLVAIRMIEPGDEINISYIAPCELDHSRHTRQKYLQENYVFTCR 361
Query: 354 CLRC 357
C++C
Sbjct: 362 CIKC 365
>gi|449665374|ref|XP_002163555.2| PREDICTED: uncharacterized protein LOC100200645 [Hydra
magnipapillata]
Length = 1037
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 55/113 (48%), Gaps = 1/113 (0%)
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
L + IN I++ G + + +S A + ++G I+ S +NH CDPN + +
Sbjct: 152 LQDVNININDIDIYGLMCKASCNSFAITNAELNSLGTGIFSSASLFNHSCDPNC-VATFN 210
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
D + A++ + EGEEL + YI D RQ L + + F C C CS +
Sbjct: 211 GRDISIRAIKPIAEGEELMLSYISILATSDVRQLELRESYMFTCKCTVCSRKE 263
>gi|296806220|ref|XP_002843920.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
gi|238845222|gb|EEQ34884.1| histone methyltransferase SmyD1b [Arthroderma otae CBS 113480]
Length = 495
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 115/305 (37%), Gaps = 70/305 (22%)
Query: 88 AGRGVFATRRIRAS----DLIHTAKPIITHPTLSTLNSVCY----FCLRKITSSSQHFQH 139
AG G+FAT ++ DL T +++ L+ S C+ F + +
Sbjct: 16 AGTGLFATSTVKTGQAVFDLPATFSTVLSTERLADACSNCFANAPFAANVMADVGIKLRF 75
Query: 140 ----HNARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYP--LLVKRLACMIIS 192
++CG+ C+ +W+A + + C + YP L V A + I
Sbjct: 76 CTGCRVVKYCGKGCQTE-----------NWAAIHKHECATYKRLYPKVLPVNSRAVLRIL 124
Query: 193 GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKK-AGIDDEQMKF------------ 239
S+ D+ Q + F LRS FK+ A + EQ +
Sbjct: 125 KLRSSKQPDVQQDLGM-----------FRSLRSHFKEIAETNKEQYERIMLCAKAEKEYS 173
Query: 240 ---LNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNH 296
L+ + LA+I +N F G +G I F NH
Sbjct: 174 HSDLDIETIAEYLAKIEVNGFTFTTPFG----------------DPLGLCIQPFACFVNH 217
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
CDPNA ++ D + ALR +E E++ I YID + + RQ L + + F C C +
Sbjct: 218 SCDPNA-VVGFDEGRITVKALRTIEPDEQVFISYIDNTNPFEIRQKELAERYFFTCRCSK 276
Query: 357 CSSGD 361
C GD
Sbjct: 277 CLQGD 281
>gi|440476470|gb|ELQ45066.1| hypothetical protein OOU_Y34scaffold00021g6 [Magnaporthe oryzae
Y34]
Length = 581
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
E++++ G ++ ++ S NHDC PNAH+ + + + R+ +L+ + G E+ + Y D +
Sbjct: 235 EADVSFGTSLDLVVSMINHDCSPNAHV-FFEGSQVRVRSLKAIAAGGEITVSYCDPRLDV 293
Query: 338 DARQAILTQGFGFQCNCLRCSS 359
RQ IL Q F C C C+S
Sbjct: 294 LLRQEILRQTQFFHCECTTCNS 315
>gi|328769214|gb|EGF79258.1| hypothetical protein BATDEDRAFT_35411 [Batrachochytrium
dendrobatidis JAM81]
Length = 503
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG Y L S +NHDC PN I D + + +RD+ +G+EL I YID + D+R+
Sbjct: 256 VGEGTYPLASLFNHDCWPNC-IAIFDGSRVVIQTIRDIAKGDELCISYIDPILDHDSRRM 314
Query: 343 ILTQGFGFQCNCLRCSS 359
L + F C C C S
Sbjct: 315 SLETKYCFNCQCSVCMS 331
>gi|85086843|ref|XP_957767.1| hypothetical protein NCU00296 [Neurospora crassa OR74A]
gi|28918862|gb|EAA28531.1| predicted protein [Neurospora crassa OR74A]
Length = 510
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 43/74 (58%), Gaps = 7/74 (9%)
Query: 292 SFYNHDCDPNAHIMWIDNADAR-------LMALRDVEEGEELRICYIDASMARDARQAIL 344
S NHDC PNA W N++ + A++D+ GEE+ I YI+ +R ARQ +L
Sbjct: 284 SRLNHDCRPNADYRWDWNSNKGGPGLVQVITAVKDILPGEEITISYINPLGSRKARQKLL 343
Query: 345 TQGFGFQCNCLRCS 358
+ +GF+C+C CS
Sbjct: 344 STAWGFECSCELCS 357
>gi|426197815|gb|EKV47742.1| hypothetical protein AGABI2DRAFT_205157, partial [Agaricus bisporus
var. bisporus H97]
Length = 392
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 295 NHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C PNA W + N ++L ALR ++ EE+ I Y D + RD RQ+ L + F C+
Sbjct: 184 NHSCGPNASHKWDLTNLSSKLYALRPIQPNEEITIFYTDITQPRDTRQSELNRNHRFLCS 243
Query: 354 CLRCSSGD 361
C CS D
Sbjct: 244 CPHCSLDD 251
>gi|440490836|gb|ELQ70343.1| hypothetical protein OOW_P131scaffold00045g35 [Magnaporthe oryzae
P131]
Length = 582
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 1/82 (1%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
E++++ G ++ ++ S NHDC PNAH+ + + + R+ +L+ + G E+ + Y D +
Sbjct: 235 EADVSFGTSLDLVVSMINHDCSPNAHV-FFEGSQVRVRSLKAIAAGGEITVSYCDPRLDV 293
Query: 338 DARQAILTQGFGFQCNCLRCSS 359
RQ IL Q F C C C+S
Sbjct: 294 LLRQEILRQTQFFHCECTTCNS 315
>gi|242006418|ref|XP_002424047.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507353|gb|EEB11309.1| SET and MYND domain-containing protein, putative [Pediculus humanus
corporis]
Length = 236
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/226 (22%), Positives = 90/226 (39%), Gaps = 37/226 (16%)
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
Q+ N FC E N + + G+K+ V+ L +I + +
Sbjct: 24 QYENKEFCHETSNKNENITHKL---------------LGIKFSDQVETLRQLIKASVPTE 68
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKK--AGIDDEQMKFLNKQWYTNVLAQIRIN 255
+ + P S L+L G + S+F + + ++ KQ N +
Sbjct: 69 HAEEWITPEGFSQLLVLVGMNGQGIGTSSFSEWVKNVSKAELPLKEKQLIDNYI------ 122
Query: 256 AFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARL 314
+YE+ + +E G+ +Y L S NH C PNA + + N +
Sbjct: 123 -------DAVYENFEKGVGDFLNNE---GSGLYELQSTINHSCSPNAEVTFPHSNYQLAV 172
Query: 315 MALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
+A ++ G+E+ I Y+D S +R +RQ IL + + F C C +C
Sbjct: 173 VATDNINPGDEICISYLDMCSLSRSRHSRQKILQENYLFTCKCHKC 218
>gi|384248030|gb|EIE21515.1| hypothetical protein COCSUDRAFT_56730 [Coccomyxa subellipsoidea
C-169]
Length = 494
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/95 (38%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 262 AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE 321
A G DL A A E +G +Y+ S NH C PN ++ + ALRD+E
Sbjct: 404 ANGTSHDLRDRATALEE---LIGRELYVRASLLNHSCRPNC-VVVRSMTSGSVRALRDIE 459
Query: 322 EGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
EGEEL I YID + AR L + F F+C C R
Sbjct: 460 EGEELTISYIDLGLPPSARGDELRKNFFFECTCDR 494
>gi|389636919|ref|XP_003716103.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
gi|351641922|gb|EHA49784.1| hypothetical protein MGG_14002 [Magnaporthe oryzae 70-15]
Length = 449
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 45/77 (58%), Gaps = 3/77 (3%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWID--NADARLMALRDVEEGEELRICYIDASMARDARQA 342
++ ++ S NH+C PN + D R+ A+RD+ GEEL + YID R+ARQ
Sbjct: 247 HSTFVEVSRMNHECSPNC-AAYFDPMTMSQRIYAIRDIMPGEELTVSYIDPVQTREARQN 305
Query: 343 ILTQGFGFQCNCLRCSS 359
L + +GF C+C RC+S
Sbjct: 306 RLRKDWGFGCSCQRCTS 322
>gi|357461631|ref|XP_003601097.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
gi|355490145|gb|AES71348.1| Histone-lysine N-methyltransferase ASHR1 [Medicago truncatula]
Length = 511
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 125/303 (41%), Gaps = 31/303 (10%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ V+ GR +F TR D+I + +P + PT C C T+ S+ +
Sbjct: 13 LTVSTISEKGRSLFTTRDFHPGDVIISQEPYVCVPT----QKRCDGCF-STTNLSKCSRC 67
Query: 140 HNARFCGEVCKDNAKAFYDVE----RRADWSAFNDYCRSQGLKYPLLVKR-LACMIISGA 194
+CG C+ + + +E R D + S L L ++R L I +
Sbjct: 68 QVVWYCGTPCQKSEWKLHRLECQALSRLDSNKRKSVTPSIRLMLKLYLRRKLQDQKIIPS 127
Query: 195 ESAD--------------CIDILQPASLSPE--LILAMEEGFVMLRSAFKK--AGIDDEQ 236
+ D I +L ++ E L+ M V+ + F + + I +EQ
Sbjct: 128 TAMDNYKLVEALVARILFIIVVLSSRAMIVEFSLLTNMSVVIVLFNTYFHEDMSDIKEEQ 187
Query: 237 MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYN 295
+ L Q V ++ I+ + +A +SE+ +G +Y + S N
Sbjct: 188 L-VLYAQMANLVHLILQWPEINIKEIANFFSKFACNAHTVCDSELRPLGTGLYPVVSIIN 246
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H C PN+ ++ D +A + AL+ + +G E+ I YI+ + + RQ L + + FQC C
Sbjct: 247 HSCLPNS-VLVFDGREASVRALQHIPKGTEVLISYIETAGSTVTRQKALREQYLFQCVCP 305
Query: 356 RCS 358
CS
Sbjct: 306 LCS 308
>gi|429859827|gb|ELA34590.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
Nara gc5]
Length = 648
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
I++ S NH C NA W D + ALRD+++GEE+ I Y+ AS +RQ+
Sbjct: 428 GGIFLEASRINHSCRHNAQNTWNADRGQLTVHALRDIDDGEEITISYLGASENYSSRQSR 487
Query: 344 LTQGFGFQCNCLRCSSGD 361
L + FGF C C C+ D
Sbjct: 488 LKESFGFTCACELCTLPD 505
>gi|403222774|dbj|BAM40905.1| uncharacterized protein TOT_030000166 [Theileria orientalis strain
Shintoku]
Length = 150
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 41/76 (53%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
+G ++ S NH C+PN I + N A + L DV GEE I YID + + RQ
Sbjct: 66 TMGLGLFNYLSKMNHSCEPNLQIEYTKNNIAHIAPLVDVPRGEEATISYIDEKDSVENRQ 125
Query: 342 AILTQGFGFQCNCLRC 357
L + +GF+C+C +C
Sbjct: 126 EKLYKNYGFKCDCNKC 141
>gi|389645180|ref|XP_003720222.1| hypothetical protein MGG_09389 [Magnaporthe oryzae 70-15]
gi|351639991|gb|EHA47855.1| hypothetical protein MGG_09389 [Magnaporthe oryzae 70-15]
gi|440476572|gb|ELQ45150.1| hypothetical protein OOU_Y34scaffold00003g3 [Magnaporthe oryzae
Y34]
gi|440479296|gb|ELQ60071.1| hypothetical protein OOW_P131scaffold01314g3 [Magnaporthe oryzae
P131]
Length = 428
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 68/150 (45%), Gaps = 19/150 (12%)
Query: 210 PELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDL 269
P L LA+ + R +F G Q F Q ++ ++ N+F++ L+G
Sbjct: 186 PSLELAVSKMTAATRKSFMAQGT---QGDF---QGAMGIMDRLFTNSFQMALSG------ 233
Query: 270 LSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRIC 329
A +++ GN + S NHDC PN +N R A+RD++ GEEL I
Sbjct: 234 ---VGADVKAGFHYGN--FPNVSKLNHDCRPNMVFHIDNNLVHRTHAVRDIKPGEELTIS 288
Query: 330 YIDA-SMARDARQAILTQGFGFQCNCLRCS 358
Y+D ARD RQA GF C C CS
Sbjct: 289 YVDQMDPARD-RQARTRSSLGFVCGCAHCS 317
>gi|401887066|gb|EJT51072.1| hypothetical protein A1Q1_07762 [Trichosporon asahii var. asahii
CBS 2479]
Length = 621
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 44/77 (57%), Gaps = 4/77 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDA---SMARDAR 340
+AI+ S NH C PN+H W + A R LMALRD++EGEE+R Y+D +R R
Sbjct: 419 SAIFEYISRTNHSCCPNSHWYWDNEAQERYLMALRDIKEGEEIRASYVDEFDLLNSRAKR 478
Query: 341 QAILTQGFGFQCNCLRC 357
+ IL + F C C C
Sbjct: 479 RQILRELHEFHCLCPGC 495
>gi|326919603|ref|XP_003206069.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Meleagris gallopavo]
Length = 478
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 114/298 (38%), Gaps = 56/298 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
++V TE GRG+ A + D+I A+P +L V C+ C ++ + Q
Sbjct: 9 VEVFTTEGKGRGLKAQKEFLPGDVIF-AEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQ 67
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAE 195
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 68 CKFAYYCDRTCQRDA-----------WLNHKNECSAIKKHG-KAPTENIRLAARILWRIE 115
Query: 196 SADCIDILQPASLSPELILAME------EGFVMLRSAFKKAGIDDE--------QMKFLN 241
+ + LS +++++ E F K ID E Q +
Sbjct: 116 R-------EGSGLSENCLVSIDDLQNHVENFD--EEEKKDLRIDVESFLEFWPAQSQQFG 166
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++ I NAF + GL AVG I+ NHDC PN
Sbjct: 167 MQYISHIFGVINCNAFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPN 211
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+++ +N L AL + G+EL + Y+D + RQ L + + F C C C
Sbjct: 212 CTVIF-NNGKIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYYFDCTCEHCKK 268
>gi|328875121|gb|EGG23486.1| hypothetical protein DFA_05619 [Dictyostelium fasciculatum]
Length = 391
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 45/80 (56%), Gaps = 1/80 (1%)
Query: 282 AVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
++G+ +Y S++NH C PN + + ++ALRD+E+GEEL YI SM+ +AR
Sbjct: 269 SIGSGLYDQCSYFNHSCQPNIFKVNQTPGGELVMVALRDIEQGEELFYNYIQISMSGEAR 328
Query: 341 QAILTQGFGFQCNCLRCSSG 360
L + + F C C C +
Sbjct: 329 IKKLKESYFFNCQCPGCKNA 348
>gi|432873524|ref|XP_004072259.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oryzias latipes]
Length = 476
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 105/282 (37%), Gaps = 42/282 (14%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + A D+I + + S VC+ C R+ + Q A +C
Sbjct: 13 GRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA-----------ESA 197
C+ A E + + SA Y + L+ + L + G+ E
Sbjct: 73 CQRAGWA----EHKQECSAIKAYGKVPNENIRLVARILWRLDKDGSTVSDMQLTTLDELE 128
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D I +Q L EL + + K+ +DD +++ I N F
Sbjct: 129 DHITDMQEDELK-ELKVDIHNFLDFWPRTSKQHTVDD----------ISHIFGVINCNGF 177
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL 317
+ GL AVG ++ NHDC PN ++ +++ L +L
Sbjct: 178 SVSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCTVI-LNHGKIELRSL 221
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ EGEEL + Y+D + RQ +L + F C C C +
Sbjct: 222 GKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKN 263
>gi|392579066|gb|EIW72193.1| hypothetical protein TREMEDRAFT_58351 [Tremella mesenterica DSM
1558]
Length = 660
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 54/90 (60%), Gaps = 1/90 (1%)
Query: 269 LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELR 327
++++ +AS S VG A+ + NH C PNA +++ + AD+ R++A++ +E GEE+
Sbjct: 367 MINTFSASSPSLDLVGAALNPAMAMSNHSCSPNAVVVFPEGADSMRIVAIKAIEAGEEVL 426
Query: 328 ICYIDASMARDARQAILTQGFGFQCNCLRC 357
Y+D ++ RQA L + + F+C C C
Sbjct: 427 THYVDLALPYAQRQAELRRTYHFECKCPTC 456
>gi|431907306|gb|ELK11286.1| SET and MYND domain-containing protein 1 [Pteropus alecto]
Length = 476
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/308 (22%), Positives = 119/308 (38%), Gaps = 61/308 (19%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +++ GR + AT+ A+D+I + S +N VC+ C ++ Q
Sbjct: 9 VEVFISDGKGRSLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHHCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 69 KFAYYCDRTCQKDA-----------WLNHKNECLAIKRYG-KVPNENIRLAARIMWRVER 116
Query: 197 ADCIDILQPASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFL----------NKQW 244
+ L+ +++++ + VM ++ + + FL + Q+
Sbjct: 117 -------EGTGLTEGCLVSVDDLQNHVMHFGEEEQKQLQVDVDTFLQYWPPKSQQFSMQY 169
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+N+ I N F + GL AVG I+ NHDC PN +
Sbjct: 170 ISNIFGVINCNGFTLSDQRGLQ---------------AVGVGIFPNLGLVNHDCWPNCTV 214
Query: 305 MWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
++ + N +A L AL + EGEEL + YID + R+ L + + F C
Sbjct: 215 IFNNGNHEAVKSMFHTQMRIELRALGKIAEGEELTVSYIDFLNVSEERKKQLKKQYYFDC 274
Query: 353 NCLRCSSG 360
C C G
Sbjct: 275 TCEHCQKG 282
>gi|296413480|ref|XP_002836440.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295630261|emb|CAZ80631.1| unnamed protein product [Tuber melanosporum]
Length = 763
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 119/294 (40%), Gaps = 81/294 (27%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPI-ITHPTLSTLNSVCYFCLR----KITSS 133
P++V GRG+FATR ++ +L+ +K + H + F L+ + +
Sbjct: 362 PVRVGKAGGKGRGLFATRDVKFGELLLCSKAFKVCHKEAANARLEILFNLKARKGQAGTH 421
Query: 134 SQHFQ------HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
SQ Q +HN + AK FYD+ DY R +G
Sbjct: 422 SQLVQELIQELYHNPK--------KAKRFYDLHT-------GDYKRVRG----------- 455
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAF---KKAGIDDEQMKFLNKQW 244
E+ D + I+ L++ +MLR+AF + + +DD ++
Sbjct: 456 -------ETVDGLPIV-----DSHLVMR-----IMLRNAFECSRLSSVDDPLENLFREK- 497
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
T V E +++ A+ + +E G+ ++++PS+ NH C N+
Sbjct: 498 -TEV------------------EKMIAEASPTGTAE---GSGLWIMPSYINHSCWQNSTR 535
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
++ + + A RD+ EGEE+ I Y++ RQ +GF+C C C
Sbjct: 536 SFLGDL-LIVRAARDLSEGEEITINYMENESGVQKRQKAFLSEWGFECKCTMCE 588
>gi|440792261|gb|ELR13489.1| tetratricopeptide repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 764
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDAR 340
+G +Y + NH C PNA+ W + + RL A+R +E GEE+ + Y+D ++ AR
Sbjct: 359 IGQGLYASAALLNHSCLPNAN--WSVDGEGRLCVRAVRPIEAGEEVTVAYVDPTLPYHAR 416
Query: 341 QAILTQGFGFQCNCLRC 357
Q L F F C CL+C
Sbjct: 417 QQALQDHFFFACRCLQC 433
>gi|358388868|gb|EHK26461.1| hypothetical protein TRIVIDRAFT_35412 [Trichoderma virens Gv29-8]
Length = 535
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 118/285 (41%), Gaps = 29/285 (10%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
SP P+++ S GRG++AT+ +I P++ PT+S L+SVC +CLR +
Sbjct: 3 SPSPLEIRPHPSKGRGLYATKSFSPGAVIFPFTPLLLLPTVSCLSSVCSYCLRPGNPRAC 62
Query: 136 HFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPL--LVKRLACMIIS 192
+ H A +C KA + E +A D R + L P L++ L C I
Sbjct: 63 S-RCHAASYCDAACQAAAWKAVHSRECKALRQGIKDEGRRRQLPTPTRALMQALLCGEIG 121
Query: 193 GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQI 252
+L+ + E E M AF G +E ++ +L +I
Sbjct: 122 DGLKDLEGHVLEKKAEGDEWRDI--EMMAMAACAFSGKGTAEELVR-----RAAEMLCKI 174
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
+ N+F+ ++ L +E +A+ NH C PNA + +I +
Sbjct: 175 QNNSFQ------RFDPDLGVVGLFLEPTLAMA----------NHSCIPNAAVQFIGR-NT 217
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
L+A + G+E+ + Y + R+ L + F C CLRC
Sbjct: 218 LLVAESPIRAGDEIELAYTFYTDPLLKRREALAH-YKFVCQCLRC 261
>gi|310796287|gb|EFQ31748.1| hypothetical protein GLRG_06723 [Glomerella graminicola M1.001]
Length = 432
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 55/123 (44%), Gaps = 16/123 (13%)
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
QM ++++A N+F++++ GG +D GN Y S YN
Sbjct: 212 QMGHFGGHQVSDIMA---TNSFQMDVGGGAQDDGHH-----------YGN--YPEVSRYN 255
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
HDC PN + R +R V+ GEEL I Y+D R RQ + +GF+C C
Sbjct: 256 HDCRPNVAFRIGADLRHRTTVVRPVKPGEELTIAYLDPLATRSVRQHRAKRAWGFECGCS 315
Query: 356 RCS 358
+C
Sbjct: 316 QCG 318
>gi|16930389|gb|AAL31881.1|AF410782_1 cardiac and skeletal muscle-specific BOP2 [Gallus gallus]
Length = 473
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 113/298 (37%), Gaps = 56/298 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
++V TE GRG+ A + D+I A+P +L V C+ C ++ + Q
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIF-AEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQ 62
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMII---- 191
A +C C+ +A W + C + G K P RLA I+
Sbjct: 63 CKFAYYCDRTCQRDA-----------WLNHKNECSAIKKHG-KAPTENIRLAARILWRIE 110
Query: 192 --SGAESADC---IDILQP--ASLSPELILAME---EGFVMLRSAFKKAGIDDEQMKFLN 241
G S +C ID LQ S E + E F+ A Q +
Sbjct: 111 REGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPA---------QSQQFG 161
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++ I NAF + GL AVG I+ NHDC PN
Sbjct: 162 MQYISHIFGVINCNAFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPN 206
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+++ +N L AL + G+EL + Y+D + R+ L + + F C C C
Sbjct: 207 CTVIF-NNGKIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYYFDCTCEHCKK 263
>gi|169846736|ref|XP_001830082.1| hypothetical protein CC1G_04515 [Coprinopsis cinerea okayama7#130]
gi|116508852|gb|EAU91747.1| hypothetical protein CC1G_04515 [Coprinopsis cinerea okayama7#130]
Length = 396
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 72/174 (41%), Gaps = 14/174 (8%)
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD-EQMKFL-----NKQWYTNV 248
E I + P + P + L G + ++ + E+M L N++ T V
Sbjct: 76 EEGSLIIVEHPVFVVPAMPLPQNAGIELYEGVAQRMPRPEYEEMTTLANCRSNEECPTIV 135
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-I 307
R NA ++ + D ++ E G A + + NH C PNA W +
Sbjct: 136 EGVARTNALNLQFRFPEFAD------STFEGGRHYGGAFLKI-NRCNHSCGPNAAYRWDV 188
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A L ALRD+ EGEE+ I Y D +R AR L + F C+C C+ D
Sbjct: 189 SKLAATLYALRDISEGEEITITYADPLQSRAARLKKLEPDYRFTCDCHWCTFKD 242
>gi|449468333|ref|XP_004151876.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Cucumis
sativus]
Length = 493
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 28/186 (15%)
Query: 198 DCI---DILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
DCI D ++P++ + + E F+ L+ K+A D + + Y +++ +
Sbjct: 305 DCIALPDDVEPSNEAAFRMQIREMAFMSLQ-LLKEAIFDVGCEPLFSLEIYGHIIGMFEL 363
Query: 255 NAFRIELAGG-----LYEDLLSSAAASIESEIA---------------VGNAIYMLPSFY 294
N + +A LY D LSS EI G A + L S
Sbjct: 364 NNLDLVVASPVEDYFLYIDELSSPYKERAEEITRPLLDALGDSYSICCQGTAFFPLQSCM 423
Query: 295 NHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
NH C PNA + + A ++ALR + GEE+ I YID + D R+A+L +GF+
Sbjct: 424 NHSCYPNAKAFKREEDRDGQATIIALRPIHPGEEVTISYIDEDLPFDQRRALLAD-YGFE 482
Query: 352 CNCLRC 357
C C +C
Sbjct: 483 CRCPKC 488
>gi|384485695|gb|EIE77875.1| hypothetical protein RO3G_02579 [Rhizopus delemar RA 99-880]
Length = 394
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 106/243 (43%), Gaps = 32/243 (13%)
Query: 48 LISRLQSLHLQKRQLCSTATHNGKPSQPSPPPI-QVALTESAGRGVFATRRIRASDLIHT 106
++ + ++ QKRQ T + PS+ + ++A + + A + +D+++T
Sbjct: 141 IVVNIMAVEQQKRQ---TEFYEQFPSEDVKLKLKELAQDDKVTHCLIAEQDFEENDVLYT 197
Query: 107 AKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFY--------D 158
P+I+ + C FC+ ++ + FC E C+ A Y
Sbjct: 198 ETPLISALSPQLEGKYCNFCMNELKDVVECKNCDQVAFCSEACESAATKQYHQFLCTNNK 257
Query: 159 VERRADWSA---FNDYCRSQGLKYPLLVKRLACMIISGA---------ESADCIDILQPA 206
+E A+ + F DY + Q LKYP ++ +L C++++ S D I+ ++ A
Sbjct: 258 LEPNAEQNKALDFVDYAKKQNLKYPYMIAKLFCVMVAEEVDKKNKPKYNSWDHIERMKGA 317
Query: 207 SLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI----ELA 262
L P A E + + K GI++ FL + Y +L+++ NA+ + EL+
Sbjct: 318 DLQPIDKYANESQMIKQLLSSKVPGIEE----FLTDEIYLTLLSKLNANAYEVTSTDELS 373
Query: 263 GGL 265
G+
Sbjct: 374 VGV 376
>gi|312371974|gb|EFR20029.1| hypothetical protein AND_20776 [Anopheles darlingi]
Length = 256
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/104 (36%), Positives = 54/104 (51%), Gaps = 7/104 (6%)
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALR 318
EL LY + + +E G+A+Y + S NH C PNA I++ N L ALR
Sbjct: 128 ELIDELYNKMDEVVGTFLNNE---GSALYAMQSKINHSCTPNAEIVFPKSNHVLALRALR 184
Query: 319 DVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
D+ GEE+ I Y+D +R +RQ L + + F+C C RC S
Sbjct: 185 DLAAGEEICISYLDECNLQRSRHSRQKNLREYYLFECQCERCES 228
>gi|449519020|ref|XP_004166533.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Cucumis
sativus]
Length = 493
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/186 (27%), Positives = 82/186 (44%), Gaps = 28/186 (15%)
Query: 198 DCI---DILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
DCI D ++P++ + + E F+ L+ K+A D + + Y +++ +
Sbjct: 305 DCIALPDDVEPSNEAAFRMQIREMAFMSLQ-LLKEAIFDVGCEPLFSLEIYGHIIGMFEL 363
Query: 255 NAFRIELAGG-----LYEDLLSSAAASIESEIA---------------VGNAIYMLPSFY 294
N + +A LY D LSS EI G A + L S
Sbjct: 364 NNLDLVVASPVEDYFLYIDELSSPYKERAEEITRPLLDALGDSYSICCQGTAFFPLQSCM 423
Query: 295 NHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
NH C PNA + + A ++ALR + GEE+ I YID + D R+A+L +GF+
Sbjct: 424 NHSCYPNAKAFKREEDRDGQATIIALRPIHPGEEVTISYIDEDLPFDQRRALLAD-YGFE 482
Query: 352 CNCLRC 357
C C +C
Sbjct: 483 CRCPKC 488
>gi|148225404|ref|NP_001085463.1| SET and MYND domain containing 1 [Xenopus laevis]
gi|49118725|gb|AAH72803.1| MGC80131 protein [Xenopus laevis]
Length = 478
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 113/292 (38%), Gaps = 44/292 (15%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS-VCYFCLRKITSSSQHFQ 138
+++ +E GRG+ ATR A D+I A+P + L+ VC+ C ++ + Q
Sbjct: 4 VEIFDSEGKGRGLRATRESWAGDVIF-AEPAYSAVVFDNLSHCVCHSCFKRQEKLLRCGQ 62
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMII----- 191
A +C C+ + W C + + K P RLA I+
Sbjct: 63 CKFAHYCDRTCQ-----------KESWVNHKHECLAIKKAGKAPNENIRLAARILWRIER 111
Query: 192 -SGAESADC---IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
G + C ID LQ E ++ K Q + Q+ ++
Sbjct: 112 EGGGLTEGCLVSIDDLQNHLDK----FGDGEKSSLMEDVQKFMDFWPSQSQQFGMQYISH 167
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+ + I N F + GL AVG AI+ NHDC PN +++
Sbjct: 168 IFSVISCNGFTLSDQRGLQ---------------AVGVAIFPNLCLTNHDCWPNCTVIF- 211
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+N L AL + +G+EL + Y+D + R L + + F C C C++
Sbjct: 212 NNGKIELRALGKINKGDELTVSYVDFLNLTEDRMEQLKKQYYFDCTCEHCTN 263
>gi|118343990|ref|NP_001071820.1| SET and MYND domain containing protein [Ciona intestinalis]
gi|70571203|dbj|BAE06699.1| SET and MYND domain containing protein [Ciona intestinalis]
Length = 474
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 122/310 (39%), Gaps = 49/310 (15%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
S P +++ T + GRG+ ATR+ + +P S ++ VC++CL
Sbjct: 2 SYPGVELFQTSAKGRGLKATRKFETGQAVLKQEPYAYAVMSSHIDVVCHYCLCAPGQPGA 61
Query: 136 HFQHHN-------ARFCGEVCKDNAKAFYDVE----RRADWSAFNDYCRSQGLKYPLLVK 184
+ + A++C + C+ A + E +R D R G +L +
Sbjct: 62 PVEDLHRCTGCKFAQYCTKECQKKAWPEHKQECAAIKRITPGKPVDQTRLVGR---ILWR 118
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAME-EGFVMLRSAFKKAGIDDEQMKFLNKQ 243
R ++G + D + EL+ E E + R+A +K ID++ F +
Sbjct: 119 RKREENLNGEKK----DGKENDEKKVELVKIEELEDHLSKRNAEEKEAIDEKVYSFGDYF 174
Query: 244 WY----------TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
Y ++ A I NA + G+ +G IY S
Sbjct: 175 TYDEMPDSDEEMAHLFAIIDCNAIGLNDHRGVQ---------------TIGVGIYPGISM 219
Query: 294 YNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
NHDC PN M N RL ALR ++ GEEL I YID+ + R+ L + F
Sbjct: 220 LNHDCSPNCVAM---NNGPRLEVRALRVIQPGEELCISYIDSLETTEKRREKLKLQYYFD 276
Query: 352 CNCLRCSSGD 361
C C C+ G+
Sbjct: 277 CECDTCTKGE 286
>gi|302922324|ref|XP_003053442.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
gi|256734383|gb|EEU47729.1| hypothetical protein NECHADRAFT_74600 [Nectria haematococca mpVI
77-13-4]
Length = 530
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 112/283 (39%), Gaps = 33/283 (11%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PP QV GR + A+R D I T P I P+LS ++VC C + S
Sbjct: 5 PPFQVK-RHLKGRCLIASRSFTPGDTILTFTPTILIPSLSHASTVCSHCFKPGEPRSCS- 62
Query: 138 QHHNARFCGEVCKDNA-KAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
+ H +C C+ A KA + E + + R GL P+ R + AE
Sbjct: 63 RCHAVAYCDASCQSAAWKAIHSKECKV-LQKVSAQGRP-GLPTPV---RAVVQALVKAEV 117
Query: 197 ADCIDILQPASLS--PELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
++ L+ LS A E M SAF G E +K ++L +I+
Sbjct: 118 GAALEDLEGNDLSWRGSDKWADMEMMAMGASAFAGLGTTQEAVKK-----ALSLLCKIQT 172
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
NAF Y+ + +E +A+ NH C PNA + ++ A L
Sbjct: 173 NAFH------RYDADIGQVGIFLEPRLAMA----------NHSCIPNATVQFVGR-RAIL 215
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A R ++ EE+ I Y D + R+ L F F C C RC
Sbjct: 216 RAERPIKADEEIEISYTDYNYPLSKRKEALAPYF-FTCECTRC 257
>gi|242077278|ref|XP_002448575.1| hypothetical protein SORBIDRAFT_06g029440 [Sorghum bicolor]
gi|241939758|gb|EES12903.1| hypothetical protein SORBIDRAFT_06g029440 [Sorghum bicolor]
Length = 472
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++ALR + +GEE+ I YID + + R
Sbjct: 391 GTAFFPLQSCMNHSCCPNAKAFKRDEDKDGHAVIIALRPISKGEEITISYIDEDLPYEER 450
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C CL+C
Sbjct: 451 QAQLAD-YGFTCTCLKC 466
>gi|255085478|ref|XP_002505170.1| predicted protein [Micromonas sp. RCC299]
gi|226520439|gb|ACO66428.1| predicted protein [Micromonas sp. RCC299]
Length = 625
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 82/188 (43%), Gaps = 48/188 (25%)
Query: 190 IISGAESADCIDILQPASLSPE---LILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYT 246
+ +GAE+A ++ AS +P+ +ILA+ G N
Sbjct: 220 LTAGAEAALMGGLVGAASRNPQDLSVILALPSG--------------------ANDDADA 259
Query: 247 NVLAQIRINAFRIELAGG-------LYEDLLSSAAASIESEIAVGNA----IYMLPSFYN 295
+ I + FR L+ G +++ + AA++ AV NA +Y PSF N
Sbjct: 260 GKVPPIDVAKFRRHLSKGDEVLPELPPQEVFARLAANVVKTSAVRNANSVGVYPFPSFLN 319
Query: 296 HDCDPNA------HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
H C PNA H M+I A RD+ GEE+ + Y D +M + R A+ + +G
Sbjct: 320 HSCAPNACKLMVGHTMFI-------RAARDLVAGEEVFMKYFDVTMPKPERSAVAKR-WG 371
Query: 350 FQCNCLRC 357
F+C C RC
Sbjct: 372 FECACPRC 379
>gi|303283498|ref|XP_003061040.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226457391|gb|EEH54690.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 448
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 32/78 (41%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID-NADAR--LMALRDVEEGEELRICYIDASMARDAR 340
G + L S N+DCDPN + D + D + L+A R + GEEL +CY+D M AR
Sbjct: 356 GTGFFALQSCMNNDCDPNVTPLKDDGDVDGKCVLVAKRAIAAGEELTMCYVDEGMDVRAR 415
Query: 341 QAILTQGFGFQCNCLRCS 358
+A L +GF+C C RC
Sbjct: 416 RAELMD-YGFECGCARCE 432
>gi|302774507|ref|XP_002970670.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
gi|300161381|gb|EFJ27996.1| hypothetical protein SELMODRAFT_451410 [Selaginella moellendorffii]
Length = 292
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 64/129 (49%), Gaps = 17/129 (13%)
Query: 235 EQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFY 294
E K L+ + + N+LA+ +INAF L++ ++ + + ++ A Y S
Sbjct: 74 ELAKGLSLEMFLNLLAKDKINAF----------GLMAPSSGTAQRKVRA-YARYAQASML 122
Query: 295 NHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGF 348
NHDC PNA ++D N D + L DV +G E+ I Y + R+ L + +
Sbjct: 123 NHDCLPNACRFEYLDKPGASNTDIYIRLLHDVPQGSEICISYFPVNWNYKERRERLVEDY 182
Query: 349 GFQCNCLRC 357
GF+CNC RC
Sbjct: 183 GFECNCERC 191
>gi|67465651|ref|XP_648998.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56465330|gb|EAL43612.1| hypothetical protein, conserved domain containing [Entamoeba
histolytica HM-1:IMSS]
gi|449704953|gb|EMD45105.1| set and mynd domain containing protein [Entamoeba histolytica KU27]
Length = 426
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 132/303 (43%), Gaps = 38/303 (12%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS---SSQHFQHHNARFC 145
G+GVF+ ++ + ++I PII + + C FCL+ + S ++++ + N
Sbjct: 126 GKGVFSKKKYQKEEIIMKEIPIIQISKIEDYYNCCGFCLKTLYSKDPTNKYNKVFNQINK 185
Query: 146 GEVCKDNAKAFYDVER--RADWS---------AFNDYCRSQGLKYPLLVKRL-ACMIISG 193
EV K Y + + D+S +YC+ + +PL + ++ A ++IS
Sbjct: 186 DEVVKCKCGMSYCNTKCQKLDYSHSLLCNKIKGLFNYCKKNNVSHPLCISKIFANILISQ 245
Query: 194 --AESADCIDILQPASLSP---ELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+S + + L P E+I + F L F G + L N
Sbjct: 246 NIEQSLFPFIVFHSSPLIPFNSEIIPFFLDIFGSLLLKFNIYGGWTFIYQILYSVLKYNS 305
Query: 249 LAQIRINAFRIELAGG--LYED--LLSSA--------AASIESEIAVGNAIYMLPSFYNH 296
+ + +N + L L+ D LLSS+ A +IE+E ++ + NH
Sbjct: 306 SSILPLNPIQSVLLNNNSLHIDNTLLSSSLINHPDITAFTIEAE-----GLFKYLNTLNH 360
Query: 297 DCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
C PN + D++ A L+A + G+EL I YID ++ RQ++L + F C+C
Sbjct: 361 SCSPNCFLASTDDSCALSLIASCPISPGDELTISYIDNTLPYSQRQSLLYDSYHFYCHCP 420
Query: 356 RCS 358
+C+
Sbjct: 421 KCN 423
>gi|429852402|gb|ELA27540.1| lysine methyltransferase [Colletotrichum gloeosporioides Nara gc5]
Length = 444
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWID--NADARLMALRDVEEGEELRICYIDASMARDARQAI 343
A+++ S NHDC PN W D R A+RD+ GEEL + YID +R AR+
Sbjct: 239 AVFVNTSRMNHDCRPNVD-YWFDPRTLTQRTTAIRDIMPGEELTLSYIDPMQSRKARRER 297
Query: 344 LTQGFGFQCNCLRCS 358
L +GF C+C C+
Sbjct: 298 LHGTWGFHCSCHHCT 312
>gi|403419721|emb|CCM06421.1| predicted protein [Fibroporia radiculosa]
Length = 428
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 63/154 (40%), Gaps = 28/154 (18%)
Query: 206 ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGL 265
+ LSP+ + E F+ L +AF G D LN N+F L G
Sbjct: 175 SGLSPQAL----ESFLALHNAFPNEGPIDPMSGILN------------TNSFDTSLKIGD 218
Query: 266 YEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGE 324
+D L A Y S NHDC P+A+ + + A RD+ EGE
Sbjct: 219 GDDNLEKLAG-----------CYPTLSRANHDCSPSANFHFSQKTFCGQFFATRDILEGE 267
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E+ + Y D M+ R +L + +GF C C CS
Sbjct: 268 EITVTYTDYLMSHKERNRVLKKTYGFICTCRTCS 301
>gi|189199084|ref|XP_001935879.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187982978|gb|EDU48466.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 276
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 2/74 (2%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAIL 344
A+Y S NHDC PN+ W + + + + AL D++ GEE+ +CY +A ++ RQ L
Sbjct: 57 AVYATVSLINHDCMPNSSPNWNRHLNMQTIHALCDIKAGEEITVCYTNAVTTQE-RQVYL 115
Query: 345 TQGFGFQCNCLRCS 358
Q FGF C C CS
Sbjct: 116 KQHFGFDCACSICS 129
>gi|389645452|ref|XP_003720358.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
gi|351640127|gb|EHA47991.1| hypothetical protein MGG_09876 [Magnaporthe oryzae 70-15]
Length = 427
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 48/83 (57%), Gaps = 2/83 (2%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
E++++ G ++ ++ S NHDC PNAH+ + + + R+ +L+ + G E+ + Y D +
Sbjct: 123 EADVSFGTSLDLVVSMINHDCSPNAHV-FFEGSQVRVRSLKAIAAGGEITVSYCDPRLDV 181
Query: 338 DARQAILTQGFGFQCNCLRCSSG 360
RQ IL Q F C C C+SG
Sbjct: 182 LLRQEILRQTQFFHCEC-NCTSG 203
>gi|429860343|gb|ELA35083.1| het domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 642
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 51/112 (45%), Gaps = 13/112 (11%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
V+ ++ N+F+++L GG D GN Y S YNHDC PN
Sbjct: 430 KVVDIMQTNSFQMDLGGGAQGDGHH-----------YGN--YPEVSRYNHDCRPNVAFHI 476
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ R +R V+ GEEL I Y+D R RQ +GF+C C +CS
Sbjct: 477 GADGRHRTTVVRPVKSGEELTISYLDQLGVRSERQHRAKLAWGFECGCSQCS 528
>gi|61097985|ref|NP_001012912.1| SET and MYND domain-containing protein 5 [Gallus gallus]
gi|75571258|sp|Q5ZIZ2.1|SMYD5_CHICK RecName: Full=SET and MYND domain-containing protein 5
gi|53134094|emb|CAG32301.1| hypothetical protein RCJMB04_22j23 [Gallus gallus]
Length = 420
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/97 (37%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + + E G+ +YML S NH C PNA + DN L AL D+E G
Sbjct: 294 LYKDIEKESGEFLNCE---GSGLYMLQSCCNHSCIPNAETSFPDNNFLLYLTALEDIEAG 350
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 351 EEICISYLDCCQRERSRHSRNKILRENYLFTCSCPKC 387
>gi|320032992|gb|EFW14942.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 358
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
I+ + NH C NA W N + + A +D+EEGEE+ I Y+D + + RQ L
Sbjct: 135 GIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNTRQLCL 194
Query: 345 TQGFGFQCNCLRCS 358
+ FGF C C CS
Sbjct: 195 EEAFGFVCQCELCS 208
>gi|302850072|ref|XP_002956564.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
nagariensis]
gi|300258091|gb|EFJ42331.1| hypothetical protein VOLCADRAFT_97594 [Volvox carteri f.
nagariensis]
Length = 1124
Score = 58.5 bits (140), Expect = 4e-06, Method: Composition-based stats.
Identities = 39/159 (24%), Positives = 74/159 (46%), Gaps = 17/159 (10%)
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
+D+ A L P+ +L G V + + +N +W+ V+A++ +N F++
Sbjct: 416 MDMRMEAELDPDSVLDSLSGVV-----------NKAVTERVNVEWFVGVMARLHLNVFQV 464
Query: 260 E-----LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADAR 313
++ + + G+A+Y++ S +NH C+P + + D +A A
Sbjct: 465 HNPLAGADPADLAAAAAALVSGSAGGASSGSAVYLVASLFNHSCEPTLELSFPDLDAAAC 524
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
+A RD+ GEEL + Y+D + + RQ L +GF C
Sbjct: 525 FVAARDIAPGEELCVSYLDVGLPYNVRQRHLKWSYGFVC 563
Score = 48.9 bits (115), Expect = 0.004, Method: Composition-based stats.
Identities = 35/135 (25%), Positives = 55/135 (40%), Gaps = 29/135 (21%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS------QHFQHHNA 142
GRG+ A I ++H P++ P L + VCY CL +TSS+ +H +
Sbjct: 207 GRGLVAVGSIPRGQVVHREVPLVAFPELGATHGVCYHCLAPLTSSTTTTSPVRHSGPNRR 266
Query: 143 RFCGEVCKDNAKAFY-----------------------DVERRADWSAFNDYCRSQGLKY 179
RFC + C + A+ Y + + + CR G ++
Sbjct: 267 RFCCDSCMNTARTQYLDVEEAVAAAAAAAATQGSGGKGGGGQGGPLAQLYEQCRQHGERF 326
Query: 180 PLLVKRLACMIISGA 194
PL+ RLA M +S A
Sbjct: 327 PLMAARLAFMEVSEA 341
>gi|303322581|ref|XP_003071282.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
gi|240110984|gb|EER29137.1| hypothetical protein CPC735_068190 [Coccidioides posadasii C735
delta SOWgp]
Length = 367
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
I+ + NH C NA W N + + A +D+EEGEE+ I Y+D + + RQ L
Sbjct: 135 GIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNTRQLCL 194
Query: 345 TQGFGFQCNCLRCS 358
+ FGF C C CS
Sbjct: 195 EEAFGFVCQCELCS 208
>gi|126305322|ref|XP_001379366.1| PREDICTED: SET and MYND domain-containing protein 1 [Monodelphis
domestica]
Length = 490
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 69/302 (22%), Positives = 114/302 (37%), Gaps = 49/302 (16%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V +E GRG+ AT+ A+D+I + S VC+ C ++ + Q
Sbjct: 9 VEVFTSEGKGRGLKATKEFWAADVIFAERAYSAVVFDSFTQVVCHTCFKRQEKLQRCGQC 68
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A +C C+ +A W C + G K P RLA I+ E
Sbjct: 69 KFAHYCDRTCQKDA-----------WLNHKHECAAIKKHG-KPPSENIRLAARIMWRIER 116
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD------EQMKFLNKQWYTNVLA 250
+ + + +S + + + F + +D Q + + Q+ +++
Sbjct: 117 -EGTGLTEGCLVSVDDLQNHVDNFGEEEQKDLRVDVDSFLNFWPAQSQQFSMQYISHIFG 175
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-N 309
I NAF + GL AVG I+ NHDC PN +++ + N
Sbjct: 176 VINCNAFTLSDQRGLQ---------------AVGVGIFPNLCLVNHDCWPNCTVIFNNGN 220
Query: 310 ADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+A L AL + EGEEL + Y+D R+ L + + F C C C
Sbjct: 221 HEAVKSMFHTQMRIELRALGKISEGEELTVSYVDFLNVSQERKKQLKKQYYFDCTCEHCE 280
Query: 359 SG 360
G
Sbjct: 281 KG 282
>gi|119190077|ref|XP_001245645.1| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
gi|392868546|gb|EAS34345.2| hypothetical protein CIMG_05086 [Coccidioides immitis RS]
Length = 367
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
I+ + NH C NA W N + + A +D+EEGEE+ I Y+D + + RQ L
Sbjct: 135 GIFPRAARINHSCKQNAQNTWNHNLSKLTIHAFKDIEEGEEITISYVDGAETFNTRQLCL 194
Query: 345 TQGFGFQCNCLRCS 358
+ FGF C C CS
Sbjct: 195 EEAFGFVCQCELCS 208
>gi|296423702|ref|XP_002841392.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637630|emb|CAZ85583.1| unnamed protein product [Tuber melanosporum]
Length = 489
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 115/298 (38%), Gaps = 66/298 (22%)
Query: 88 AGRGVFATRRIRA-SDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNA---- 142
G G+F T I A + ++ KP++T P + L S C CL + +++
Sbjct: 12 GGHGLFTTTAIPAGTQILSIPKPLVTIPDDAHLQSTCSNCLHHAPDTPSVLTTYSSPRKL 71
Query: 143 ---------RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISG 193
++CG+ C+ DW + + K P +
Sbjct: 72 SACLGCKIVKYCGKECQTE-----------DWKRVHKHECKTFAKLPRAL---------- 110
Query: 194 AESADCIDILQPASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFL----NKQWY-T 246
P S+ + + M+ EG V+ A +D KF K+W
Sbjct: 111 -----------PGSVRVTMRVLMQNAEGGVL-------AQCEDHVEKFRTEAGGKRWEGV 152
Query: 247 NVLAQIRINAFRIELAGG----LY-EDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
V+ + R G LY L++S E+ +G A L + NHDC PN
Sbjct: 153 FVMGKGAHGYSRTRRGEGEVRRLYCAVLVNSMTLVTETFDPIGIAFDPLTASINHDCIPN 212
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
A +M + ++ AL D+ E+ I YID ++ R+ RQ L++ + F C C RC+
Sbjct: 213 A-VMLFNGRILQIRALEDIPANAEVLISYIDNTVPRERRQLELSEKYFFTCTCSRCTG 269
>gi|344231392|gb|EGV63274.1| hypothetical protein CANTEDRAFT_105840 [Candida tenuis ATCC 10573]
Length = 478
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/341 (21%), Positives = 121/341 (35%), Gaps = 85/341 (24%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-----SVCYFC 126
P+ P ++V +T G+G++A I LI KP P L+ L C +C
Sbjct: 101 PNLQLPEKVKVIMTTKRGKGLYAKSNIAKGTLIWEEKPFFFIPPLANLKLIKSGKACTYC 160
Query: 127 LRKITSSSQHFQHHNAR----------FCGEVCKD--------------------NAKAF 156
+ +TS+ R +C CK N +AF
Sbjct: 161 GKLLTSARSSSGVSMLRGLDCTGCPELWCSMSCKKMNGTLHASLKHLSKGRKSALNPEAF 220
Query: 157 YDVER---RADWSA-------FNDYCRSQG---LKYPLLVKRL--ACMIISGAESADCID 201
D+E + W+A F D + LKY + R+ A + SA D
Sbjct: 221 LDLEEYCIKEQWNALFAITLMFADIVEDKSGEKLKYVKSMARVSQATRYKALESSAGSFD 280
Query: 202 ILQPASL--SPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
L +L + +EGF F KA D ++ Y +L IN
Sbjct: 281 SLTGGALFVKEQQENLWKEGFSKFSKVFPKATSDGT----VDYNEYLTLLGSYNINN--- 333
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA---DARLMA 316
+ + ++ + S NH C+PN + N+ ++ A
Sbjct: 334 -----------------------LDSCVFRIQSHLNHTCNPNVDVETSPNSRYEGIKVFA 370
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+D++ GEEL Y++ + RQ L +GF C+C +C
Sbjct: 371 AKDIKAGEELSTTYVNPNHTVLQRQRELRANWGFTCSCNKC 411
>gi|449501420|ref|XP_002187966.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 1
[Taeniopygia guttata]
Length = 478
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 68/295 (23%), Positives = 113/295 (38%), Gaps = 50/295 (16%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
+++ E GRG+ A + A D+I A+P +L V C+ C ++ + Q
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIF-AEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQ 67
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMIISGAES 196
A +C C+ RA W + + C + + K P RLA I+ E
Sbjct: 68 CKFAHYCDRTCQ-----------RAAWLSHKNECSAIKRHGKAPTENIRLAARILWKMER 116
Query: 197 ADCIDILQPASLSPELILAMEE------GFVMLRSAFKKAGIDD------EQMKFLNKQW 244
+ + LS ++A+EE F +A ++ Q + Q
Sbjct: 117 -------EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQL 169
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+++ I N F + GL AVG I+ NHDC PN +
Sbjct: 170 ISHIFGVISCNGFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPNCTV 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ +N L AL + G+EL + Y+D + R+ L + + F C C C
Sbjct: 215 VF-NNGKIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDCTCEHCKK 268
>gi|353241217|emb|CCA73046.1| hypothetical protein PIIN_07001 [Piriformospora indica DSM 11827]
Length = 565
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 71/308 (23%), Positives = 122/308 (39%), Gaps = 65/308 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ + GRG+F + + + +P I + L C C
Sbjct: 48 LRLGTSTLGGRGLFTQTDAKPGTRLLSVRPHIHAVSARFLEDNCTLCT----------SE 97
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
N R C CK A + + ADW C+S ++R A S + AD
Sbjct: 98 ENVRRCTR-CKKVAYCSTECQT-ADWGIHKQECQS--------LRRWAEASGSDSTPADS 147
Query: 200 I----------DILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
I +I S+ I AM+ L S+ +++ YT+ L
Sbjct: 148 IRAISRLLWMRNIKGADSIWWRQIAAMQSNREHLSSSTQES--------------YTH-L 192
Query: 250 AQIRINAFRIELAGGLYE----------DLLS---SAAASIESEI--AVGNAIYMLPSFY 294
AQ + +IE GL E DL+S + + ++ S A+G A +P+
Sbjct: 193 AQSLVLYMKIESPEGLREYGIESGKDLVDLMSKFTTNSFTLTSTFLNAIGVATAPIPALI 252
Query: 295 NHDCDPNAHIMWIDNADA-----RLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
NH C PNA +++ ++A++ + GEE+ Y+D ++ R+ RQ L + +
Sbjct: 253 NHSCQPNAVVVFPATRKGAPPTLDVIAIQPIRRGEEVLAAYVDITLPREIRQKSLKETYA 312
Query: 350 FQCNCLRC 357
F+C+C C
Sbjct: 313 FECSCTLC 320
>gi|226294200|gb|EEH49620.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 365
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
AI+ + NH C PN W N + A +D+++GEE+ I Y+D + D RQ
Sbjct: 143 AIFPQAARINHSCQPNTQNTWNRNLGKLTIQAFKDIDKGEEITIAYVDCTELYDTRQECF 202
Query: 345 TQGFGFQCNCLRCS 358
FGF+C C C+
Sbjct: 203 ENAFGFRCRCEVCA 216
>gi|295664524|ref|XP_002792814.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278928|gb|EEH34494.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 365
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAI 343
AI+ + NH C PN+ W N + A +D+++GEE+ I YID + + RQ
Sbjct: 142 GAIFPQAARINHSCQPNSQNTWNRNLGKLTIQAFKDIDKGEEITIAYIDGTELYETRQEC 201
Query: 344 LTQGFGFQCNCLRCS 358
FGF+C C C+
Sbjct: 202 FENAFGFRCRCEVCA 216
>gi|225684542|gb|EEH22826.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 365
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
AI+ + NH C PN W N + A +D+++GEE+ I Y+D + D RQ
Sbjct: 143 AIFPQAARINHSCQPNTQNTWNRNLGKLTIQAFKDIDKGEEITIAYVDCTELYDTRQECF 202
Query: 345 TQGFGFQCNCLRCS 358
FGF+C C C+
Sbjct: 203 ENAFGFRCRCEVCA 216
>gi|408392130|gb|EKJ71490.1| hypothetical protein FPSE_08303 [Fusarium pseudograminearum CS3096]
Length = 543
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
AI + S NH C PNA W + A+ D+E+G+E+ I Y+ + RQ L
Sbjct: 101 AILLATSRINHSCKPNAQTRWNQGLGKITVHAVEDIEQGQEITITYLGNPEVYEERQKKL 160
Query: 345 TQGFGFQCNCLRCS 358
T FGF C+C CS
Sbjct: 161 TNAFGFDCHCRLCS 174
>gi|310789329|gb|EFQ24862.1| hypothetical protein GLRG_00006 [Glomerella graminicola M1.001]
Length = 460
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQAIL 344
A+++ S NHDC PN + R +ALRD+ GEEL + YID +R+AR+ L
Sbjct: 251 AVFVNTSRMNHDCRPNVDYYFDPRTLTQRTVALRDIIPGEELTLSYIDPMQSREARRERL 310
Query: 345 TQGFGFQCNCLRCS 358
+GF C+C C+
Sbjct: 311 HSTWGFHCSCHHCT 324
>gi|66815955|ref|XP_641994.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
gi|60470035|gb|EAL68016.1| hypothetical protein DDB_G0278829 [Dictyostelium discoideum AX4]
Length = 563
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/309 (21%), Positives = 116/309 (37%), Gaps = 53/309 (17%)
Query: 82 VALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS-VCYFCLRK----------- 129
V + G V AT I+ +L+ + + P +N C+ C ++
Sbjct: 9 VNNNDERGNFVVATNEIKVGELLFPEESFASIPLEFHMNKDYCFLCCKQSEVEQVQQPSQ 68
Query: 130 --ITSSSQHFQHH---NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQ-----GLKY 179
++++ Q N FC E+C + D+ + + S N S +
Sbjct: 69 QPQQTNNKSIQCKFGCNMWFCSEICSN------DMTHQLECSFINKLIESSIKNECDVST 122
Query: 180 PLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFV--------MLRSAFKKAG 231
LL R+ MI + ES + + S E + + + F+ + ++
Sbjct: 123 CLLALRI--MIRNKIESDKYQETVGKLSNQLEKFIEINKSFIEKYDINFQQFINQLQQEN 180
Query: 232 IDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP 291
D + NK + +++ I IN+F GL D + + N + P
Sbjct: 181 SDPSLLAIFNKDEFLQIISSIYINSF-----SGLSNDF---------NRKPISNGYFYKP 226
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
+ NH C+PN I + + + A + +E+ EE+ Y+D + RQ IL F
Sbjct: 227 ALLNHSCEPNI-FFTIKDKNLEMRACKKIEKDEEIVDSYVDLLLPTIERQKILYNSKNFL 285
Query: 352 CNCLRCSSG 360
C C RCS
Sbjct: 286 CKCSRCSDS 294
>gi|393219982|gb|EJD05468.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 635
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 126/309 (40%), Gaps = 49/309 (15%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC--------LR 128
PP + V ++ GRG++ + + A ++ + KP + + S L+ C C L+
Sbjct: 82 PPILSVKRSKERGRGMWTSGMVPAGTVLCSVKPHVHALSRSQLHLYCTSCCAPQPSVGLK 141
Query: 129 KITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRA--DWSAFNDYCRSQGLKYPLLVKRL 186
K T +CG C++ F+ +E A WS + S KY + + +
Sbjct: 142 KCTKCKA------IHYCGTTCQNADWQFHKLECEALQRWS-VSAPPESDDAKYAVPPEAV 194
Query: 187 ACMIIS-------GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKF 239
C+ + G+ S +I + S + +M + V L A +
Sbjct: 195 RCLARTIWRRKKLGSGSIWWREINEMQSKREGVAQSMIDAHVHLAHATVR---------- 244
Query: 240 LNKQWYTNVLAQIRINAFRI----ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
Y + Q + A + EL + + L+S + S +G ++ L N
Sbjct: 245 -----YMGISGQEELKAHGVHSISELVDLISKFTLNSYTLTTPSLSPIGVSVSPLAGLLN 299
Query: 296 HDCDPNAHIMW--IDNADAR----LMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
H CDPN +++ I +AD ++A++D+ EL Y+D ++ RQ L + +
Sbjct: 300 HSCDPNVSVVFPRILSADKEPALHIVAIQDIPADSELLTSYVDVTLPVLQRQKDLKETYS 359
Query: 350 FQCNCLRCS 358
F C+C CS
Sbjct: 360 FNCSCHSCS 368
>gi|336386840|gb|EGO27986.1| hypothetical protein SERLADRAFT_414176 [Serpula lacrymans var.
lacrymans S7.9]
Length = 731
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 130/331 (39%), Gaps = 68/331 (20%)
Query: 92 VFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKI---TSSSQHFQHHNARFCG 146
+ A + I A ++I+ ++T L + C CLR+I T+ N+ +C
Sbjct: 207 LVANKDIAAGEVIYKEGALVTALDLDLQGKGTHCSHCLREIGDGTALRPDSDRLNSVYCS 266
Query: 147 EVCKDNAKAFY-----------------DV------ERRADWSAFNDYCRSQGLKYPLLV 183
C+ AK Y DV +R A AF Y P LV
Sbjct: 267 NTCQLEAKLEYQSLLFTLEPPLPVGLTPDVSDSAQEDRDAAQKAFVAYLNHSAKASPELV 326
Query: 184 KRLACMIISGAESADCIDILQPASLSPELILAMEEGFVM------LRSAFKKAGIDDEQM 237
+ +S E++ + QP P LA + + LR K G ++ ++
Sbjct: 327 AKFIARQVS-IETSKMVPGGQPV---PTPALADGGDYTLYDHLERLRFLEVKPGKNEMKL 382
Query: 238 -------------KFLNKQWYTNVLAQIRINAFRIELAGGLYE--------DLLSSAAAS 276
+F+ ++ + +L ++ N++ + G + + +
Sbjct: 383 LGEVLQTALPGLEQFVTEERHGTLLGKMAYNSYGVFFGNGRDDKPEPKERPEDVEKTRTP 442
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASM 335
+ + VG A Y + S+ +H C P+A + D NA+ L+A RD+++G+E+ + YID +
Sbjct: 443 LGTARQVGAAFYAVSSYISHSCAPSARPSFDDGNAELHLIATRDLKKGDEITVSYIDVAQ 502
Query: 336 ARDARQAI--------LTQGFGFQCNCLRCS 358
D L +G+ F C C RC+
Sbjct: 503 HEDETTVDARRRRRMELARGWRFACPCTRCA 533
>gi|395328146|gb|EJF60540.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 300
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL---RDVEEGEELRICYIDASMARD 338
A AI+++ S +N C PN + W N D + +A D+ EGEEL ICY D ARD
Sbjct: 99 AASAAIFVIGSRFNSSCQPNVNNYW--NEDLQKIAFWATSDIAEGEELCICYGDLWKARD 156
Query: 339 ARQAILTQGFGFQCNCLRCS 358
R+ L F F C C+ CS
Sbjct: 157 DRRRRLESSFRFVCQCVACS 176
>gi|409048092|gb|EKM57570.1| hypothetical protein PHACADRAFT_192717 [Phanerochaete carnosa
HHB-10118-sp]
Length = 309
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 54/104 (51%), Gaps = 4/104 (3%)
Query: 258 RIELAGGLYE-DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM- 315
R A G++E ++L + +A I++ + +NH C PN + W D A +L+
Sbjct: 76 RYAQALGIFETNVLPCGSNDAHGHVAQQGGIFLAGARFNHSCVPNVNNHW-DAARGQLVF 134
Query: 316 -ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
ALRD+E GEEL + Y R R+A L+ FGF C C CS
Sbjct: 135 RALRDIEAGEELCLGYGRLLAKRGERRAELSAKFGFDCACEACS 178
>gi|281205858|gb|EFA80047.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 496
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARD 338
S + +G +Y +F NH CDPN + I L A+R +++GEE I YID +
Sbjct: 236 SGVLLGTGLYPGLTFINHSCDPNLQVTQIGKI-LTLKAVRPIKKGEEFTISYIDRTEGAI 294
Query: 339 ARQAILTQGFGFQCNCLRCSS 359
R L + F F+C C +C+S
Sbjct: 295 QRNEELMETFFFECQCHKCAS 315
>gi|328853238|gb|EGG02378.1| hypothetical protein MELLADRAFT_91461 [Melampsora larici-populina
98AG31]
Length = 479
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 45/82 (54%), Gaps = 4/82 (4%)
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMA 336
SE + A+ PS +NH+C PN+ + D R+ A+RD+ GEE+ I Y D +
Sbjct: 275 SEEVLHYAVVPEPSVFNHECRPNS-AFFFDKKTLRVYINAVRDIAMGEEITIAYRDMKAS 333
Query: 337 RDARQAILTQGFGFQCNCLRCS 358
R RQA + +GFQC C CS
Sbjct: 334 RAERQASIAH-YGFQCKCSHCS 354
>gi|148235066|ref|NP_001085635.1| SET and MYND domain-containing protein 5 [Xenopus laevis]
gi|82184468|sp|Q6GPQ4.1|SMYD5_XENLA RecName: Full=SET and MYND domain-containing protein 5
gi|49115717|gb|AAH73058.1| MGC82689 protein [Xenopus laevis]
Length = 421
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRD 319
L LY+D+ + E G+ +Y+L S NH C PNA + DN L AL D
Sbjct: 283 LIDQLYKDIEKVTGEFLNCE---GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALED 339
Query: 320 VEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
++ GEE+ I Y+D +R +RQ IL + + F C+C +C
Sbjct: 340 IQPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKC 380
>gi|428672307|gb|EKX73221.1| conserved hypothetical protein [Babesia equi]
Length = 402
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 42/75 (56%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G I+ S NH C PN I ++D+ AR+ L ++ G+E I YI+ + RQ
Sbjct: 316 IGLGIFDNISKMNHSCRPNVEIDYVDDNTARINLLCNISAGQEATISYINEEDVFEIRQK 375
Query: 343 ILTQGFGFQCNCLRC 357
L+ +GFQC+C +C
Sbjct: 376 KLSTNYGFQCDCNKC 390
>gi|260943444|ref|XP_002616020.1| hypothetical protein CLUG_03261 [Clavispora lusitaniae ATCC 42720]
gi|238849669|gb|EEQ39133.1| hypothetical protein CLUG_03261 [Clavispora lusitaniae ATCC 42720]
Length = 481
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 41/79 (51%), Gaps = 2/79 (2%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDAR 340
V + I++L S NH+CDPN ++ +L A RD+ GEEL Y++ S R
Sbjct: 339 VDSCIFLLQSHLNHNCDPNTKVVLSAKKYEKLKVFAARDIRTGEELTTTYVNPSHTVQQR 398
Query: 341 QAILTQGFGFQCNCLRCSS 359
Q L +GF C C +C S
Sbjct: 399 QRELRVNWGFICKCQKCKS 417
>gi|195130539|ref|XP_002009709.1| GI15088 [Drosophila mojavensis]
gi|193908159|gb|EDW07026.1| GI15088 [Drosophila mojavensis]
Length = 492
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 119/297 (40%), Gaps = 44/297 (14%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPII----THPTLSTLNSVCYFCL 127
P + P +V +++ AGRGV ATR ++ ++I P++ H TLN+ C C
Sbjct: 17 PFKDKDPAWEVGVSKIAGRGVMATRNLKRGEIIFQDSPLLIGLAAHEE-DTLNA-CSVCF 74
Query: 128 RKITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPL 181
+ + + RF CG VC AK + + D F + ++
Sbjct: 75 KMLPDT---------RFMCRQGCGLPVCSLCAKK---KQHKTDCDMFKAWGSNEPDVANS 122
Query: 182 LVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
++ RL C+ + S D D++ L L FK D + ++ +N
Sbjct: 123 VIIRLLCVARAINLSKDQRDLIY--CLQANLDNNHRTEVRNAAKCFKNFPTDKKIVEIMN 180
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
+ +A +R N F + + +++ A+Y L + NHDC PN
Sbjct: 181 R-----TVAVLRTNGFD------------KTTDRTTDNQEFNYRALYPLFAVMNHDCIPN 223
Query: 302 AHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++ + + + ++ A D+ EGEE+ Y AR L GF C C RC
Sbjct: 224 SYYTFEEKTNNMIVRAAVDIAEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRC 280
>gi|348524789|ref|XP_003449905.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Oreochromis
niloticus]
Length = 440
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 68/276 (24%), Positives = 112/276 (40%), Gaps = 29/276 (10%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCY--FCLRKITSSSQHFQHHNARFCG 146
GRG+ TR + +L+ + P +H LS CY FC + ++ + A +C
Sbjct: 25 GRGLRVTRPFKVGELLFSC-PAFSH-VLSVKERGCYCEFCFTRTQHLARCGKCKKAFYCN 82
Query: 147 EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPA 206
C+ A + +E A + ++C S+ LV R+ E C +L
Sbjct: 83 VKCQKGDWAMHKLECSAMVAFGENWCPSE---LSRLVARILAKKKMQKERCICEKMLLIG 139
Query: 207 SLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL---AQIRINAFRIELAG 263
+ E M + AG+ K L+ + +L +Q+ N F IE
Sbjct: 140 EMQSHTEDEDNEKREMTEADI--AGLHRFYSKHLDVPDHKELLTLFSQVACNGFTIE--- 194
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
+D LS +G A+Y + NH C P+ I+ A + A++D++ G
Sbjct: 195 ---DDELSH----------LGTAVYPDMALINHSCLPSV-IVTFKGTSAEVRAVQDMKPG 240
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+E+ I YID D R L + + F C+C C S
Sbjct: 241 DEVLISYIDLLYPTDDRNNRLRESYYFICDCQECKS 276
>gi|289739577|gb|ADD18536.1| putative histone tail methylase [Glossina morsitans morsitans]
Length = 429
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 72/291 (24%), Positives = 114/291 (39%), Gaps = 42/291 (14%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL---STLNSVCYFCLRKITSSS 134
P ++A+++ AGRGVF TR+++ D+I PI+ TLNS C C + + +
Sbjct: 23 PAWEIAVSKIAGRGVFTTRKLKRGDIILRDIPILIGLAARQEDTLNS-CSICFKVLPDTK 81
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFN-DYCRSQGLKYPLLVKRLACM---I 190
+ A VC + D W D S ++ RL C+ I
Sbjct: 82 FMCRQGCALPVCSVCAKKKQHKDDCALFKSWEPIEPDVANS-------VIIRLICIARAI 134
Query: 191 ISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
E D I LQ A+L + F K D + ++ +N+ +A
Sbjct: 135 NLSKEQRDLIYCLQ-ANLDNNHRTEVRNA----AKCFNKFPTDKKLIEIMNR-----TVA 184
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
+R N F +A + +++ A++ L + NHDC PN++ + +
Sbjct: 185 VLRTNGFD------------ETADRTNDNQEFFYRALFPLFALVNHDCVPNSYYTFEEKT 232
Query: 311 DARLMALR---DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ M LR D+ EG E+ Y AR L F C C RCS
Sbjct: 233 N--YMVLRASVDLPEGTEITTTYTKLFTGNIARHLFLKMKKNFTCKCSRCS 281
>gi|302696595|ref|XP_003037976.1| hypothetical protein SCHCODRAFT_102704 [Schizophyllum commune H4-8]
gi|300111673|gb|EFJ03074.1| hypothetical protein SCHCODRAFT_102704, partial [Schizophyllum
commune H4-8]
Length = 415
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 59/136 (43%), Gaps = 21/136 (15%)
Query: 239 FLNKQWYTNVLAQIRINAFRIELAGGLY---EDLLSSAAASIESEIAVGNAIYMLPSFYN 295
F + Y L ++ N+F + GG E + VG+A Y + S+ +
Sbjct: 235 FDTPERYATFLGKVAYNSFGVCYNGGRLDRPESYTLKTRTPFGTSHQVGSAFYTVSSYLS 294
Query: 296 HDCDPNA------------------HIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
H C PNA + +A+ +L+A RD+ +GEEL + +ID +
Sbjct: 295 HSCTPNARPSFDGESAAEANPDGTTELATDGSAELKLVATRDIAKGEELSVAFIDVAQPL 354
Query: 338 DARQAILTQGFGFQCN 353
DAR+A L G+ F C
Sbjct: 355 DARRAELQAGWKFVCT 370
>gi|50287869|ref|XP_446364.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637715|sp|Q6FTT0.1|SET5_CANGA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49525671|emb|CAG59288.1| unnamed protein product [Candida glabrata]
Length = 515
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 121/314 (38%), Gaps = 43/314 (13%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLI-HTAKPIITHPTLSTLNSV-----CYFCLR 128
PS I+ GRG++ATR I+ +L+ H PI P + L + C C
Sbjct: 113 PSKVEIRECEELRKGRGLYATRDIQQGELLFHEKVPIAMVPPMDKLKLIRSGKSCSMCGV 172
Query: 129 KITSSSQHFQHH-------NARFCGEVCKDNA------KAFYDVE---RRADWSAFNDYC 172
+++SS H N+ +C CK K F R DW+ F +C
Sbjct: 173 SLSNSSHFTMLHGLDCNGCNSIWCSTNCKQKDITHPYLKHFGSKNKNIRPMDWNMFERHC 232
Query: 173 RSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPEL-ILAMEEGFVMLRSAFKK-- 229
+ +A I +A ID Q ++ + LA V S+
Sbjct: 233 QENIF--------VAAYSIGVIHAASLIDKAQSDEINQQFEALAKISQRVRYESSDSNNI 284
Query: 230 AGIDDEQMKFLNKQ-----WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVG 284
G D ++ L ++ W IR E+ ++ + L + +
Sbjct: 285 GGTFDAEIGSLKEENPEPLWKKAFDLFIRTFPDTAEMGYEVFLEYLGRFHIN-----QLS 339
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
+Y L SF NH+C+PN + + ++ A + +++ EEL Y++ R+ L
Sbjct: 340 GQLYFLYSFLNHNCEPNVRYDINNKLELKVYARKFIKKDEELVTTYVNPLHGVSLRRREL 399
Query: 345 TQGFGFQCNCLRCS 358
+GF CNC RC+
Sbjct: 400 RVNWGFICNCDRCA 413
>gi|209881460|ref|XP_002142168.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557774|gb|EEA07819.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 394
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 40/166 (24%), Positives = 82/166 (49%), Gaps = 12/166 (7%)
Query: 202 ILQPASLSPELILAMEEGFV--MLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
I P + PE+ L + ++ ++ + K+ ++ L + +L Q+ +N+ +
Sbjct: 206 IAPPVTFFPEINLELVTEYINIVMFNRMKEVFPNEVIHALLGRSTVEQILGQLVLNSQGL 265
Query: 260 ELAG------GLYEDLLSSAAASIESEIAVGNA--IYMLPSFYNHDCDPNAHIMWIDNAD 311
+ G +Y +S+ EI + A I + S +NH C+PN ++ ID+A
Sbjct: 266 NIWGRRSLNIPIYNSQISNNIQC-NDEIGIIKAACICTIQSCFNHSCEPNCYVHAIDDAT 324
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ +++ GEEL I YID ++ ++R+++L + + F+C C C
Sbjct: 325 VVVTTNKEILLGEELTISYIDNTLDYESRKSLL-KNYHFECTCNLC 369
>gi|367042716|ref|XP_003651738.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
gi|346999000|gb|AEO65402.1| hypothetical protein THITE_68287 [Thielavia terrestris NRRL 8126]
Length = 562
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 86/334 (25%), Positives = 124/334 (37%), Gaps = 100/334 (29%)
Query: 68 HNGKPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFC 126
H+ + PSP S R + ATR I T + P++ P +T+ + C +C
Sbjct: 12 HHDHTTSPSP--------GSKRRSLIATRAFAPGTPIATFSNPLLALPDGATMRTTCNYC 63
Query: 127 LRKITSSSQHFQHHN---------------------------ARFCGEVCKDNAKAFYDV 159
LR SSS N A +CG C+
Sbjct: 64 LRVGASSSSSSTTDNNKNNSSSGDGSSSSNGTPLRACTACKAAVYCGAACQ--------- 114
Query: 160 ERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA-ESADCIDILQPASLSPELILAME- 217
RA W A V R C + S ESA + PA A+E
Sbjct: 115 --RAHWRA---------------VHRAECKMFSRVRESAGKEWLPTPAFGGGGGAWALEG 157
Query: 218 --EGFVMLRSAFKK-----------AGIDDEQMKFLNKQWYTNVLAQIRINAF-RIELAG 263
EGF + ++ AG+ + + + VL +I+ NAF R++
Sbjct: 158 NVEGFRADEALWRDFELQAAAAAVYAGLLESEETLEKAR---EVLCKIQTNAFNRLDADT 214
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
GL L A + NH C PNA I + D A L A R ++EG
Sbjct: 215 GLSGIFLDVGLAMV-----------------NHSCVPNAFIGF-DRRTAVLRAERPIQEG 256
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
EE+ I YID ++ + ARQ L + + F+C+C RC
Sbjct: 257 EEITISYIDNALPKSARQEAL-RLYHFRCDCPRC 289
>gi|440638927|gb|ELR08846.1| hypothetical protein GMDG_03520 [Geomyces destructans 20631-21]
Length = 539
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDA-SMARDARQ 341
G A+ L S +NH C PN D N+ R+ ALRDV EGEEL I YI +M RQ
Sbjct: 455 GTAVNPLYSMFNHSCAPNVDWEHEDSNSTLRMFALRDVNEGEELFISYIRPLTMDYAERQ 514
Query: 342 AILTQGFGFQCNCLRCSS 359
L G +C C RC +
Sbjct: 515 ESLLPWLGMECECERCKA 532
>gi|169604975|ref|XP_001795908.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
gi|111065447|gb|EAT86567.1| hypothetical protein SNOG_05503 [Phaeosphaeria nodorum SN15]
Length = 367
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH CD NA W DN D + A+RD+ +GEE+ I Y+ +S ARQ L F F C
Sbjct: 144 NHACDNNAQNFWNDNIDKLTIHAVRDIGQGEEITISYLSSSQNPQARQQELQHNFKFSCA 203
Query: 354 CLRCS 358
C CS
Sbjct: 204 CRLCS 208
>gi|380477969|emb|CCF43859.1| hypothetical protein CH063_13445 [Colletotrichum higginsianum]
Length = 463
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWID--NADARLMALRDVEEGEELRICYIDASMARDARQAI 343
A+++ S NHDC PN W D R +A+RD+ GEEL + YID +R AR+
Sbjct: 252 AVFVNTSRMNHDCRPNVDY-WFDPRTLTQRTVAIRDIIPGEELTLSYIDPMQSRAARRER 310
Query: 344 LTQGFGFQCNCLRC 357
L +GF C+C C
Sbjct: 311 LHTTWGFHCSCNHC 324
>gi|409040050|gb|EKM49538.1| hypothetical protein PHACADRAFT_214100 [Phanerochaete carnosa
HHB-10118-sp]
Length = 461
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 4/70 (5%)
Query: 293 FYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDARQAILTQGF 348
+NH C PN ++ + ++ALRD+E G+EL I Y+D ++ D RQ L + +
Sbjct: 211 LFNHSCVPNCASKYVITSTEMMGMEIVALRDIEFGDELTIPYLDPALPFDIRQNTLQESY 270
Query: 349 GFQCNCLRCS 358
GF CNC C+
Sbjct: 271 GFTCNCSLCN 280
>gi|440799473|gb|ELR20518.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 404
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 42/82 (51%), Gaps = 3/82 (3%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD---VEEGEELRICYIDASMARDA 339
G A+Y S +NH C PN + R+ A++ V GEEL I YID + R+
Sbjct: 259 TGVALYPALSLFNHSCMPNCAAVDDGTGSKRVCAIKTLVAVPPGEELTISYIDLDLTREL 318
Query: 340 RQAILTQGFGFQCNCLRCSSGD 361
RQ L + + F+C C RC + D
Sbjct: 319 RQDKLEESYAFRCTCARCRAPD 340
>gi|401406610|ref|XP_003882754.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325117170|emb|CBZ52722.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 535
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIM-W-IDNADARLM--ALRDVEEGEELRICYIDASMARDA 339
G +Y L + NH C+PN + W + D+ L ++ VE GEEL I Y+D ++
Sbjct: 439 GGGVYTLHACVNHSCEPNCAVSSWGPEGGDSTLTVTTVKAVEAGEELTISYVDEALPVRR 498
Query: 340 RQAILTQGFGFQCNCLRC 357
R+ +L FGF C C RC
Sbjct: 499 RRQLLETTFGFACTCPRC 516
>gi|56788824|gb|AAH88504.1| smyd5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 420
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRD 319
L LY+D+ + E G+ +Y+L S NH C PNA + DN L AL D
Sbjct: 283 LIDQLYKDIEKVTGEFLNCE---GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALED 339
Query: 320 VEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
++ GEE+ I Y+D +R +RQ IL + + F C+C +C
Sbjct: 340 IQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKC 380
>gi|356519848|ref|XP_003528581.1| PREDICTED: uncharacterized protein LOC100812233 [Glycine max]
Length = 527
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 125/348 (35%), Gaps = 96/348 (27%)
Query: 39 MCLGRYSRCL------------------ISRLQSLHLQKRQLC---STATHNG-KPSQPS 76
+CL RYS L + R +SL L + C S NG K P
Sbjct: 105 LCLNRYSSALECFRSAAIDGNAENASGYLERCKSLELLSKTGCLDLSDWVSNGFKGKVPE 164
Query: 77 PPP----IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS 132
+++ +E +GRG+FAT+ I A LI K I ++ I
Sbjct: 165 LAEHIGAVEIRKSEISGRGLFATKNIDAGALILVTKAIAMERSI-------------IMG 211
Query: 133 SSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACM 189
Q ++C+D A W F D + K L+ RL+C
Sbjct: 212 GVQ-----------DLCEDAQLAM--------WKNFIDKVFEVVGKCYKTKGLITRLSC- 251
Query: 190 IISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
E+ D +++L PE + + V+ K GI D N L
Sbjct: 252 ----GENEDELEVLDVGLFRPESVEKDDGDAVVGVDMVKLVGILD-----------VNSL 296
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+ ++A + Y G +++LPSF NH C PNA + + +
Sbjct: 297 TEDAVSANVLTRGNDCY-----------------GVGLWLLPSFVNHSCVPNARRLHVGD 339
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ A +D++ GEE+ Y D R+ ++ +G C C RC
Sbjct: 340 YLV-VHASKDLKAGEEVTFAYFDPLCGLSKRKE-MSVNWGIHCKCKRC 385
>gi|429850952|gb|ELA26177.1| hypothetical protein CGGC5_1748 [Colletotrichum gloeosporioides
Nara gc5]
Length = 423
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 52/101 (51%), Gaps = 6/101 (5%)
Query: 262 AGGLYEDLLSSAAASIESEIAVGNAIYMLP--SFYNHDCDPNAHIMWIDNADAR--LMAL 317
G D++ + + + I G+ P S YNHDC PN +ID +D R A+
Sbjct: 210 GGHKVHDIIHTNSFELRLGIRDGHHFGNYPEVSRYNHDCRPNV-AFYID-SDLRHYTHAV 267
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
RD++ GEEL I Y+D+ +R RQ + +GF C C CS
Sbjct: 268 RDIKPGEELTISYVDSLSSRQVRQDRAKRNWGFGCTCNHCS 308
>gi|166796007|ref|NP_001107677.1| SMYD family member 5 [Xenopus (Silurana) tropicalis]
gi|163916066|gb|AAI57309.1| smyd5 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 53/101 (52%), Gaps = 7/101 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRD 319
L LY+D+ + E G+ +Y+L S NH C PNA + DN L AL D
Sbjct: 285 LIDQLYKDIEKVTGEFLNCE---GSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALED 341
Query: 320 VEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
++ GEE+ I Y+D +R +RQ IL + + F C+C +C
Sbjct: 342 IQPGEEICISYLDCCQRDRSRHSRQKILRENYLFVCSCPKC 382
>gi|297835172|ref|XP_002885468.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297331308|gb|EFH61727.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 74/185 (40%), Gaps = 26/185 (14%)
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSA--FKKAGIDDEQMKFLNKQWYTNVLAQIRIN 255
DCI + LS E M+ + S K A D E + + Y N++ +N
Sbjct: 281 DCIALPDDVDLSDEGAFRMQIKNLACTSLELLKTAIFDKECEALFSLEIYGNIIGMFELN 340
Query: 256 AFRIELAGG-----LYEDLLSSAAASIESEIA---------------VGNAIYMLPSFYN 295
+ +A LY D L A EI G A + L S N
Sbjct: 341 NLDLVVASPVEDYFLYIDDLPDAEKEEAEEITRPFLDALGDEYSDCCQGTAFFPLQSCMN 400
Query: 296 HDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
H C PNA + + A ++ALR + + EE+ I YID + RQA+L +GF C
Sbjct: 401 HSCCPNAKAFKREEDKDGQAVIIALRRISKNEEVTISYIDEELPYKERQALLAD-YGFSC 459
Query: 353 NCLRC 357
C +C
Sbjct: 460 KCSKC 464
>gi|18568265|gb|AAL75997.1|AF466646_5 putative SET-domain transcriptional regulator [Zea mays]
gi|413933333|gb|AFW67884.1| putative SET-domain transcriptional regulator [Zea mays]
Length = 410
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 110/282 (39%), Gaps = 52/282 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ VA GRG+ AT D+I +P + P + S C C F
Sbjct: 17 LTVASIPGKGRGLIATCTFFPGDVILNQEPYASTPNKILVGSSCDHC----------FTS 66
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
N R C +C R W YC S K + +L C ++ A + D
Sbjct: 67 GNLRKCS-MC------------RVTW-----YCSSNCQKEEWKLHQLECRAMA-ALTEDR 107
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
+L P +M+R A K+ + +E+M L + +L I ++ I
Sbjct: 108 KKMLTPTIR------------LMVRLALKRK-LQNEKMANL----VSLILPFIELDLKEI 150
Query: 260 ELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
+ +A + E+ +G +Y + S NH C PNA ++ D A + AL+
Sbjct: 151 ---AQTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNA-VLIFDGRTAYVRALQ 206
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ + EE+ I YI+ + R L Q F F C C RC G
Sbjct: 207 PINKDEEVSISYIETATVTKKRNNDLKQYF-FTCTCPRCVKG 247
>gi|356525020|ref|XP_003531125.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Glycine
max]
Length = 419
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 123/320 (38%), Gaps = 58/320 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS-------------VCYFC 126
++V + GRG+ A++ ++A ++ PI+ + L + C C
Sbjct: 10 LKVEEIQGRGRGMVASQPLKAGQIVLRDSPILLYSALPLVRQSLSSSSSSASTSCFCDHC 69
Query: 127 LRKITSSSQ-----------HFQHHNARFCGEVCKDNAKAFYDVERRADWSA-------F 168
R ++ S Q + +HH FC C NA + W
Sbjct: 70 FRILSPSLQGDSSSSTVLCPNCRHH--CFCNSNCLSNALN----SSHSSWVCQALSHLRA 123
Query: 169 NDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPE-LILAMEEGFVMLRSAF 227
N Q L++ + V L S I+ SP+ +A + L S+
Sbjct: 124 NSLLLEQPLEHQVQVNFLVAAYNLANISPSDFQIMLSLQGSPDDSTIAAAQFLHPLISSL 183
Query: 228 KKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAI 287
+ Q F + + + +LA+ ++NAF I ++D S A I
Sbjct: 184 CSLALIGPQNGF-SLELTSAILAKDKLNAFGIMQPFSEHDDQRSVRAY----------GI 232
Query: 288 YMLPSFYNHDCDPNA-HIMWID--------NADARLMALRDVEEGEELRICYIDASMARD 338
Y SF+NHDC PNA ++D N D + + DV +G E+ + Y +
Sbjct: 233 YPYASFFNHDCLPNACRFDYVDANPSDDSHNTDFIIRMIHDVPQGREICLSYFPVNEKYS 292
Query: 339 ARQAILTQGFGFQCNCLRCS 358
+RQ L + +GF CNC RC+
Sbjct: 293 SRQKRLIEDYGFTCNCDRCN 312
>gi|30680137|ref|NP_849969.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
gi|94707144|sp|Q7XJS0.2|ASHR1_ARATH RecName: Full=Histone-lysine N-methyltransferase ASHR1; AltName:
Full=ASH1-related protein 1; AltName: Full=Protein SET
DOMAIN GROUP 37
gi|145651792|gb|ABP88121.1| At2g17900 [Arabidopsis thaliana]
gi|330251606|gb|AEC06700.1| histone-lysine N-methyltransferase ASHR1 [Arabidopsis thaliana]
Length = 480
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 57/276 (20%), Positives = 111/276 (40%), Gaps = 26/276 (9%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GR +F R R ++I + KP I P ++ S C C K + + +CG
Sbjct: 22 GRSLFTARDFRPGEVILSQKPYICVPNNTSSESRCDGCF-KTNNLKKCSACQVVWYCGSS 80
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRS----QGLKYPLLVKRLACMIISGAESADCIDILQ 204
C+ +++W D C++ + K + + M+ + LQ
Sbjct: 81 CQ-----------KSEWKLHRDECKALTRLEKEKRKFVTPTIRLMVRLYIKRN-----LQ 124
Query: 205 PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG 264
+ P + + + ++ + DE+ L Q V ++ + +
Sbjct: 125 NEKVLP---ITTTDNYSLVEALVSHMSEIDEKQMLLYAQMANLVNLILQFPSVDLREIAE 181
Query: 265 LYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
+ +A + +SE+ G ++ L S NH C PNA ++ + A + A+ ++ +
Sbjct: 182 NFSKFSCNAHSICDSELRPQGIGLFPLVSIINHSCSPNA-VLVFEEQMAVVRAMDNISKD 240
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
E+ I YI+ + + RQ L + + F C C RCS+
Sbjct: 241 SEITISYIETAGSTLTRQKSLKEQYLFHCQCARCSN 276
>gi|301121168|ref|XP_002908311.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262103342|gb|EEY61394.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 42/80 (52%), Gaps = 6/80 (7%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADA--RLMALRDVEEGEELRICYIDASMARDA 339
AVG ++ + NH C PN W + D+ + ALRD+EEGEE+ + YID R
Sbjct: 162 AVG--LFSICGLINHSCQPNC--TWSNAGDSVMEVRALRDIEEGEEITLSYIDIDKERSE 217
Query: 340 RQAILTQGFGFQCNCLRCSS 359
RQ L F C C RCS+
Sbjct: 218 RQKELRDTKHFDCQCERCST 237
>gi|409042906|gb|EKM52389.1| hypothetical protein PHACADRAFT_148986 [Phanerochaete carnosa
HHB-10118-sp]
Length = 436
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 116/318 (36%), Gaps = 68/318 (21%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPT-----LSTLNSVCYFCLRKITSS--------SQ 135
G+G+ AT RI +++ P I P L T + C C + +S S
Sbjct: 106 GKGLVATERIAEGEVVWKEDPFILAPEWSLYDLQTASVACTHCSTPLNNSPLAVPCSGSA 165
Query: 136 HFQHHNARFCGEVCKDNAK---------------AFYDVERRADWSAFNDY--CRSQGLK 178
+ARFC +C A + RR +W A + C ++ L
Sbjct: 166 KVPSCSARFCNRLCLSRAARTHPLLCPAQNPAALPLLEFARRHEWMALHALAQCTARVL- 224
Query: 179 YPLLVKRLACMIISGAESADCIDILQPASLSPELIL-------------AMEEGFVMLRS 225
LA A SAD + A L E ++ + +
Sbjct: 225 -------LAEQQDEAAFSADWAVLRGLAQLGMEERAKGGWLGSAEPDRDTWKKAYELFVR 277
Query: 226 AFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYE-DLLSSAAASIESEIAVG 284
AFK G + E+ + LA++ R E+ ++ D + +
Sbjct: 278 AFKDPGTESEKKR----------LARLLRKPTRKEVDDEIFGYDAFLRGLGRMSLNLEAH 327
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDA- 339
+Y+L S NH C PN +D A ++ RD+E GEEL I Y++ + +
Sbjct: 328 GGLYVLHSHVNHSCTPNLSARHLDQRSALSRITVVVRRDIEVGEELFISYVNPDLPLEGR 387
Query: 340 RQAILTQGFGFQCNCLRC 357
R+ +L GFG C C RC
Sbjct: 388 RRQLLEWGFG-TCQCPRC 404
>gi|345484303|ref|XP_001603615.2| PREDICTED: SET and MYND domain-containing protein 5-like [Nasonia
vitripennis]
Length = 386
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 12/123 (9%)
Query: 243 QWYTNVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A N RI+ L +Y+++ + +E G+ +Y+L S NH C
Sbjct: 247 RWVKNVSALDLPNDQRIQVDKLIDRIYDEMDEVVGTFLNNE---GSGLYVLQSAINHSCA 303
Query: 300 PNAHIMWIDNADARLM--ALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNC 354
PNA + + ++D+RL+ ALRD++ EE+ I Y+D +R +RQ L+ + F C C
Sbjct: 304 PNASVEF-PHSDSRLVVRALRDIKPDEEICIAYLDECHLERSRHSRQKALSSLYLFVCKC 362
Query: 355 LRC 357
+C
Sbjct: 363 DKC 365
>gi|195132929|ref|XP_002010892.1| GI21459 [Drosophila mojavensis]
gi|193907680|gb|EDW06547.1| GI21459 [Drosophila mojavensis]
Length = 460
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 116/297 (39%), Gaps = 42/297 (14%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL---STLNSVCYFCLRKITSSS 134
P +A + AGRGVFATR I A +LI + ++ PT S LN+ C ++ +
Sbjct: 42 PNWTLAPSPVAGRGVFATRDIAAGELIFRERALVVGPTARKGSQLNTC--VCCHRLLGAK 99
Query: 135 QHFQHHNARF--CGEVCKDNAKAFYDVERRADWSA--FNDYCRS-QGLKYPL-LVKRLAC 188
Q H CGE C D+ + W ++ C+ P+ +V L+
Sbjct: 100 QFLCPHRCTLPVCGE-CADSEAHHNECAHFQRWQPKDLSEECKELSPTGEPVDVVNPLSL 158
Query: 189 MIISGA-------ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
I++ E +D +Q + + + F+K D+ F++
Sbjct: 159 RILTAVRVFYLSKEQRALVDAMQANAER-----GYRQEIIKAAQCFRKFPTTDK--PFMD 211
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
+ + V+ + NAF +E LL ++ L + NH+C PN
Sbjct: 212 QLF--RVVGVLNTNAFEAPCRVDGHESLL--------------RGLFPLTAIMNHECTPN 255
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A + + A + A RD+ +G E+ Y + R L F CNC RC+
Sbjct: 256 ASHYFENGRLAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKYFVCNCDRCN 312
>gi|170041827|ref|XP_001848651.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
gi|167865410|gb|EDS28793.1| SET and MYND domain-containing protein 5 [Culex quinquefasciatus]
Length = 387
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRD 319
L GLY L + + +E G+A+Y L S NH C PNA + N L ++D
Sbjct: 265 LIDGLYTKLDDVVGSFLNNE---GSALYSLQSKVNHSCAPNAECQFPHSNNVLALTTVQD 321
Query: 320 VEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
+ GEE+ I Y+D +R +RQ +L + + FQC C +C S
Sbjct: 322 IAAGEEICISYLDECALERSRHSRQKMLRENYLFQCQCEKCES 364
>gi|45387527|ref|NP_991103.1| SET and MYND domain containing 1 [Danio rerio]
gi|41223368|gb|AAH65475.1| SET and MYND domain containing 1a [Danio rerio]
Length = 485
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/302 (21%), Positives = 125/302 (41%), Gaps = 47/302 (15%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQHF 137
P++V GRG+ TR + A +++ A+ L +L+ VC+ C R+ + +
Sbjct: 8 PVEVFGAGLKGRGLRGTRDLSAGEVVF-AEASFAAVVLDSLSLQVCHSCFRRQVNPHRCA 66
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
Q A +C C+ +A +D E R + SA + ++ L+ + L + +
Sbjct: 67 QCKFAHYCDRTCQ---RAAWD-EHRKECSAIRNIGKAPNENVRLVARILWRIQKHTGLVS 122
Query: 198 D----CIDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQ 251
D +D+L+ + ++PE + ++ + + K + K + + + +++
Sbjct: 123 DSQLTTLDMLEDHLSRMTPEDLKELKADVKTFYTYWPK------KSKAVGEDYVSHLFGV 176
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
I N F + GL +VG ++ NHDC PN ++ +++ D
Sbjct: 177 ISCNGFTLSDQRGLQ---------------SVGIGLFPNLCLVNHDCWPNCTVI-LNHGD 220
Query: 312 A-------------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L AL + G+EL + Y+D RQ +L Q + F C C C
Sbjct: 221 QSALDASFHSSRRIELRALEPISAGQELTVSYVDFLSVSTDRQRLLQQQYYFDCKCEHCV 280
Query: 359 SG 360
+G
Sbjct: 281 NG 282
>gi|340054943|emb|CCC49251.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 434
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 120/288 (41%), Gaps = 40/288 (13%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG++A I A ++ +P + + S C CL I+++S+ A +
Sbjct: 164 GRGIYALTAISAGTVVMVDEPFVVQ---RMVPSACAHCLSDISTTSKE---GGAAVGVVL 217
Query: 149 CKDNAKAFYDVE--RRADWSAFNDYC----RSQGLKYPLLVKRLACMIISGAESADCIDI 202
C Y R A W ++ C Q + L ++ L + ES +
Sbjct: 218 CTHCGVESYCSTGCREAAWREYHSCCCPARNPQYAAWELAMRELLSSDTT-EESRAALCC 276
Query: 203 LQPASLSPELILAMEEGFVM--LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIE 260
L A L + + + +RS +A D T L+QI A +
Sbjct: 277 LAVAKLCAISTIQQKHPLSLPCIRSLRGRADYDA-----------TTALSQI--GAMAVS 323
Query: 261 LAGGLY------EDLLSSAAASIESEIAV--GNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
LA L+ E++LS A +E + G A+Y SF NH C+PN I +
Sbjct: 324 LAEALHQEHLYMEEILSLFAIIQTNEFLLPGGTALYHGYSFLNHSCEPNCAIAGSSAINR 383
Query: 313 RLMALRDVEEGEELRICY---IDASMARDARQAILTQGFGFQCNCLRC 357
RL+ALR++ EGE+L I Y + ++ + R+A+ Q F+C C RC
Sbjct: 384 RLVALRNIREGEQLMINYNARLTTCVSYEDRRALCMQRH-FECFCPRC 430
>gi|336363664|gb|EGN92041.1| hypothetical protein SERLA73DRAFT_117876 [Serpula lacrymans var.
lacrymans S7.3]
Length = 556
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 72/331 (21%), Positives = 130/331 (39%), Gaps = 68/331 (20%)
Query: 92 VFATRRIRASDLIHTAKPIIT--HPTLSTLNSVCYFCLRKI---TSSSQHFQHHNARFCG 146
+ A + I A ++I+ ++T L + C CLR+I T+ N+ +C
Sbjct: 194 LVANKDIAAGEVIYKEGALVTALDLDLQGKGTHCSHCLREIGDGTALRPDSDRLNSVYCS 253
Query: 147 EVCKDNAKAFY-----------------DV------ERRADWSAFNDYCRSQGLKYPLLV 183
C+ AK Y DV +R A AF Y P LV
Sbjct: 254 NTCQLEAKLEYQSLLFTLEPPLPVGLTPDVSDSAQEDRDAAQKAFVAYLNHSAKASPELV 313
Query: 184 KRLACMIISGAESADCIDILQPASLSPELILAMEEGFVM------LRSAFKKAGIDDEQM 237
+ +S E++ + QP P LA + + LR K G ++ ++
Sbjct: 314 AKFIARQVS-IETSKMVPGGQPV---PTPALADGGDYTLYDHLERLRFLEVKPGKNEMKL 369
Query: 238 -------------KFLNKQWYTNVLAQIRINAFRIELAGGLYE--------DLLSSAAAS 276
+F+ ++ + +L ++ N++ + G + + +
Sbjct: 370 LGEVLQTALPGLEQFVTEERHGTLLGKMAYNSYGVFFGNGRDDKPEPKERPEDVEKTRTP 429
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASM 335
+ + VG A Y + S+ +H C P+A + D NA+ L+A RD+++G+E+ + YID +
Sbjct: 430 LGTARQVGAAFYAVSSYISHSCAPSARPSFDDGNAELHLIATRDLKKGDEITVSYIDVAQ 489
Query: 336 ARDARQAI--------LTQGFGFQCNCLRCS 358
D L +G+ F C C RC+
Sbjct: 490 HEDETTVDARRRRRMELARGWRFACPCTRCA 520
>gi|255078756|ref|XP_002502958.1| predicted protein [Micromonas sp. RCC299]
gi|226518224|gb|ACO64216.1| predicted protein [Micromonas sp. RCC299]
Length = 847
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G AI+ L S NH CDPN + ++D++ ++A R + EE I Y+D + + R+
Sbjct: 399 GAAIFPLASSLNHSCDPNCEVAYVDDSRVHILAKRTIGRDEECTIAYVDPDLDGEERREE 458
Query: 344 LTQGFGFQCNCLRCSSG 360
L + +GF C C C +G
Sbjct: 459 LRETYGFDCECDVCIAG 475
>gi|407424679|gb|EKF39087.1| hypothetical protein MOQ_000686 [Trypanosoma cruzi marinkellei]
Length = 429
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 112/291 (38%), Gaps = 50/291 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS------QHFQHHNA 142
GRG++A RI + + +P + + C CL I S Q
Sbjct: 163 GRGIYALTRINSGTPVMLDQPFLVQ---RMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219
Query: 143 RFCGEVCKDNAKAFYDV----ERRADWSAFNDYCRSQGLKYPLLVKR--LACMIISGAES 196
+C C+D A Y V R ++ + R + L + R LAC+ ++
Sbjct: 220 VYCSVACRDAAWREYHVCACVSRNEMYAFWEGAMRERLLSDKMEESRAALACLSVAKLCV 279
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+ + P +L +RS +A Y A + A
Sbjct: 280 LSTVQQMHPLALP------------RIRSLRGRAD-------------YDAATALSEVGA 314
Query: 257 FRIELAGGL------YEDLLSSAAASIESEIAV--GNAIYMLPSFYNHDCDPNAHIMWID 308
+ LA L E+LLS A +E + G A+Y SF NH C+PN ++
Sbjct: 315 LAVTLATALRQTHLYMEELLSLFAIVQTNEFLLPSGMALYHGYSFLNHSCEPNCALLGSG 374
Query: 309 NADARLMALRDVEEGEELRICYIDASMARD--ARQAILTQGFGFQCNCLRC 357
+ RL+ LRDV EGE+L I Y + AR A + L Q F+C C +C
Sbjct: 375 ATNRRLVTLRDVREGEQLFIDYNASLTARVSYADRRALCQQRHFECFCPKC 425
>gi|213514028|ref|NP_001135157.1| SET and MYND domain-containing protein 1 [Salmo salar]
gi|197631975|gb|ACH70711.1| SET and MYND domain containing 1b [Salmo salar]
Length = 301
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/295 (22%), Positives = 114/295 (38%), Gaps = 43/295 (14%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V + GRG+ AT+ + + D++ + P S + +C+ C R+ + Q
Sbjct: 4 VEVFDSPGKGRGLRATKELLSGDVLFSEPPFAAVVFDSLADKICHSCFRRQDKLQRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII-----SGA 194
A++C + C+ + +E A AF + G K P RLA I+ G
Sbjct: 64 KFAQYCDKTCQSAGWKEHKLECGA-IKAFGAIGDNFG-KAPNENIRLAARIMWRLDKEGG 121
Query: 195 ESAD----CIDILQ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
+D +D L+ P EL + + K+ +DD
Sbjct: 122 VMSDMQMTSLDDLENHISDMPEDDLKELKVDIHNFLDYWPRNSKQHRVDD---------- 171
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+++ I N F + GL AVG ++ NHDC PN +
Sbjct: 172 ISHIFGVINCNGFTVSDQRGLN---------------AVGVGLFPNLCLVNHDCWPNCTV 216
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ +++ L AL +E GEE+ + Y+D + RQ L + F C C C
Sbjct: 217 I-LNHGKIELRALGKIEPGEEVTVSYVDFLNVTEDRQRQLKMHYFFDCTCEHCKG 270
>gi|452839795|gb|EME41734.1| hypothetical protein DOTSEDRAFT_81948 [Dothistroma septosporum
NZE10]
Length = 549
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/307 (20%), Positives = 121/307 (39%), Gaps = 38/307 (12%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
++ + + AGRG+FA + R DLI + +P + + L C +C ++ + Q
Sbjct: 5 VKTSDIKGAGRGLFAKKDFRPGDLILSLDRPYVAELDIERLKDTCAWCFQRGATDDFERQ 64
Query: 139 HHNA--------------------RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLK 178
A R+C C+ A + E + + R
Sbjct: 65 RRAALGNFANRIVDTKACTGCKRVRYCSRSCQSKA---WKREHKYECKVLAPTDRPDLPH 121
Query: 179 YPLLVKRLACMIISGAESAD--CIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ 236
V +L + + E D +D+LQ A++E + ++ G+
Sbjct: 122 GVRAVVKLLGRMKNDPEGKDEALLDVLQFWPAGSAERTALDEFKAQNQQRYEDFGM---- 177
Query: 237 MKFLNKQWYTNVLAQIRINAFRIE-LAGGLYEDLLSSAA--ASIESEIAVGNAIYMLPSF 293
W + ++ E +A GL+ +++S+ ++ + ++G +
Sbjct: 178 --LAYGAW--KYAGEPKMGGTESEAIANGLFFNVMSNTVQLSNPLDDTSLGMGFDPIMCS 233
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH CDPNA ++ + L AL+ +++GEE+ + Y D + RQA L + F C+
Sbjct: 234 ANHSCDPNAAALF-NQPRQILRALKPIKKGEEISMKYTDVTNPLCVRQAELKGYYFFTCH 292
Query: 354 CLRCSSG 360
C RC G
Sbjct: 293 CARCKQG 299
>gi|255545512|ref|XP_002513816.1| protein with unknown function [Ricinus communis]
gi|223546902|gb|EEF48399.1| protein with unknown function [Ricinus communis]
Length = 482
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/288 (23%), Positives = 119/288 (41%), Gaps = 36/288 (12%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++VA GR + T+ ++I + +P + P S S C C SS + +
Sbjct: 13 LRVANVTEKGRSLVTTKDFNPGEVIISQEPYVCVPNNSATESRCDRCF-----SSSNVKK 67
Query: 140 ----HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCR----SQGLKYPLLVKR-LACMI 190
A +CG C+ + +E A D + S L LL++R L
Sbjct: 68 CSACQVAWYCGSSCQKLEWKLHRIECEALSKLDKDRRKYVTPSIRLMVRLLIRRKLQSEK 127
Query: 191 ISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
I + + D D+++ L+ M++ ID++Q+ L Q V
Sbjct: 128 IISSTATDNYDLVEA------LVAHMKD-------------IDEKQL-VLYAQMANLVNL 167
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+R I+ + L +A +SE+ +G +Y + S NH C PNA ++ D
Sbjct: 168 ILRWPDVNIKEIAENFSKLACNAHTICDSELRPLGTGLYPVVSIINHSCLPNA-VLVFDG 226
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + ++ + +G E+ I YI+ + + RQ L Q + F C C RC
Sbjct: 227 RLAVVHTVQHIPKGSEVLISYIETAGSTMTRQKALKQQYFFTCTCPRC 274
>gi|440790098|gb|ELR11386.1| MYND finger domain containing protein [Acanthamoeba castellanii
str. Neff]
Length = 307
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/230 (23%), Positives = 89/230 (38%), Gaps = 25/230 (10%)
Query: 146 GEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS----GAESADCID 201
EV ++ +A + + W F R + PLLV +L +++ G +D
Sbjct: 76 AEVSEEQQEALHQL-----WKEF----RGVFGELPLLVGKLIANVLTQLDDGKSPEQAVD 126
Query: 202 ILQPASLSPELILAMEE-----GFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+ + P + E+ V + +A D + + Y + + +N+
Sbjct: 127 SFEHFVVYPREAIKAEDPVAYDQLVRMAHITNRALFDPRIPQLFEESVYLRLAGIVLLNS 186
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR--- 313
I+ + + + A I+ G + + S NH C PN A AR
Sbjct: 187 LGIDFSCPVLNTPTQADAEQIKMMEFDGAVVCPILSSLNHSCQPNVKSTLRSQAQARGDS 246
Query: 314 ----LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L+A +DVE EEL I Y+ S R RQ L + + F C C RC S
Sbjct: 247 CRVSLIAKQDVEPNEELCISYVPLSWTRATRQTTLLKNYFFHCQCERCVS 296
>gi|342182557|emb|CCC92036.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 387
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 44/88 (50%), Gaps = 2/88 (2%)
Query: 271 SSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRIC 329
+S+ +SIE + G IY L S +NH C PN + D L LRDV+ GEEL I
Sbjct: 289 TSSHSSIEW-VLKGGGIYSLQSNFNHSCQPNVAVFTECGTHDITLRTLRDVQAGEELTIS 347
Query: 330 YIDASMARDARQAILTQGFGFQCNCLRC 357
YI A + + +G+ F C C C
Sbjct: 348 YIPVENTNRAERHKMLEGYFFTCQCALC 375
>gi|302411726|ref|XP_003003696.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357601|gb|EEY20029.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 564
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 56/113 (49%), Gaps = 23/113 (20%)
Query: 246 TNVLAQIRINAF-RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+L +I+ NAF R E GL+ D + M+ NH C PNA I
Sbjct: 197 VELLCKIKTNAFTRSEAGEGLFLD----------------TTLAMV----NHSCVPNA-I 235
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ A L ALRD++EGEE+ I YID + + + R+ L F FQC C+RC
Sbjct: 236 VAFSGRRAFLRALRDIKEGEEIEISYIDCTQSLEHRRKALELYF-FQCVCMRC 287
>gi|22902087|gb|AAC60295.2| skmBOP [Takifugu rubripes]
Length = 433
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/276 (21%), Positives = 107/276 (38%), Gaps = 42/276 (15%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ T+ A D+I + + S +C+ C R+ + Q A +C
Sbjct: 13 GRGLKTTKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACM-IISGAESADCIDILQ 204
C+ RA W+ C ++ G ++ ++ M +I+ E D + +Q
Sbjct: 73 CQ-----------RAGWAEHKQECGAIKAYGKAPNENIRVVSHMQLITVEELEDHVADMQ 121
Query: 205 PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG 264
+ EL + + K+ IDD +++ I N F + G
Sbjct: 122 EDEIK-ELKVDIHNFLDYWPRNSKQHTIDD----------ISHIFGVINCNGFTVSDQRG 170
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L AVG ++ NH+C PN ++ +++ L +L + EGE
Sbjct: 171 LQ---------------AVGVGLFPNLCMVNHNCWPNCTVI-LNHGKIELRSLGKIAEGE 214
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
EL + Y+D + R+ +L + F C C C +G
Sbjct: 215 ELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCKNG 250
>gi|168035501|ref|XP_001770248.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678465|gb|EDQ64923.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 691
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
++MLPSF NH C P+ + I NA ++A RD++ G+EL + Y D R+A +
Sbjct: 423 GLWMLPSFMNHSCTPSVATVVIGNAMI-IVAARDLKCGDELTVAYFDIFRPLQERRASML 481
Query: 346 QGFGFQCNCLRCS 358
+ F C+C RC+
Sbjct: 482 HSWNFMCSCPRCA 494
>gi|307204816|gb|EFN83374.1| SET and MYND domain-containing protein 4 [Harpegnathos saltator]
Length = 628
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 45/83 (54%), Gaps = 3/83 (3%)
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDA--S 334
+E + + AIY S NH CDPN ++D + A++D++ GEE+ CY
Sbjct: 338 VEQQDRIATAIYPSASMMNHSCDPNIITSFVDQY-LIVRAMKDIQAGEEVSNCYGPNFRR 396
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
M+R+ RQ IL +GF+C C C
Sbjct: 397 MSREHRQEILKNQYGFECRCDPC 419
>gi|195334290|ref|XP_002033816.1| GM21522 [Drosophila sechellia]
gi|194125786|gb|EDW47829.1| GM21522 [Drosophila sechellia]
Length = 514
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 71/296 (23%), Positives = 110/296 (37%), Gaps = 37/296 (12%)
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
K +PS P ++A E GR + ATR I+ +++ P++ P + VC CL I
Sbjct: 34 KAHKPSCHPFKIAHNEQLGRHLVATRTIKPYEIVLKEAPLVRGPAQISA-PVCLGCLNGI 92
Query: 131 TSSSQ-HFQHHNARFCGEVCK--DNAKA----FYDVERRADWSAFNDYCRSQGLKYPLL- 182
+ CG CK D KA D ++R + F G +PL
Sbjct: 93 EPDDHIECEQCGWPLCGPECKSLDEHKAECRLTKDRDQRVNVQEF-------GGPHPLYT 145
Query: 183 -VKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
+ + C++I G S + Q E + + G ++ I KF
Sbjct: 146 CLSTVRCLLI-GETSPEKASKFQDL----ESLESTRRGSNQWKADL--VSIGQFIPKFFK 198
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q +T + A +I + + A++ SF + C PN
Sbjct: 199 TQKFTEEQIMKAVGALQIN-------------GHEVPTTDPSHVAVFYTASFTENSCLPN 245
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ N L A R++++ L ICY DA RQ L Q F+C C RC
Sbjct: 246 LAKSFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERC 301
>gi|342876017|gb|EGU77681.1| hypothetical protein FOXB_11800 [Fusarium oxysporum Fo5176]
Length = 560
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/98 (35%), Positives = 48/98 (48%), Gaps = 5/98 (5%)
Query: 262 AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDV 320
A Y + + A IES G+ I++ NH CD NA W + + ALRD+
Sbjct: 83 ADSQYLGIFRTNALPIES----GSGIFLAACRINHACDNNAQKAWNEGIKRHTVHALRDI 138
Query: 321 EEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
++GEE+ I YI R RQ L + F F C+C CS
Sbjct: 139 DKGEEITITYISILNNRRTRQEALRKKFKFTCSCNLCS 176
>gi|226508200|ref|NP_001149542.1| MYND finger family protein [Zea mays]
gi|223942703|gb|ACN25435.1| unknown [Zea mays]
gi|414585283|tpg|DAA35854.1| TPA: MYND finger family protein [Zea mays]
Length = 469
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++ALR + + EE+ I YID + + R
Sbjct: 388 GTAFFPLQSCMNHSCRPNAKAFKRDEDKDGHAVIIALRPISKDEEITIAYIDEDLPYEER 447
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C CL+C
Sbjct: 448 QAQLAD-YGFTCTCLKC 463
>gi|195627902|gb|ACG35781.1| MYND finger family protein [Zea mays]
Length = 469
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++ALR + + EE+ I YID + + R
Sbjct: 388 GTAFFPLQSCMNHSCRPNAKAFKRDEDKDGHAVIIALRPISKDEEITIAYIDEDLPYEER 447
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C CL+C
Sbjct: 448 QAQLAD-YGFTCTCLKC 463
>gi|159462910|ref|XP_001689685.1| histone methyltransferase [Chlamydomonas reinhardtii]
gi|158283673|gb|EDP09423.1| histone methyltransferase [Chlamydomonas reinhardtii]
Length = 569
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 53/102 (51%), Gaps = 9/102 (8%)
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD--ARLMAL 317
+L G L + L + A E+ A L S NH CDPN D+ D A L+A
Sbjct: 444 KLTGPLLDALDAEYATPCEA-----TAFLALQSCINHSCDPNC-TAACDSGDRTATLLAQ 497
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
RD+ GEE+ + YID S+ RQA L + +GF C C RC++
Sbjct: 498 RDIAAGEEITLSYIDVSLPYKRRQAEL-RDYGFVCKCERCTA 538
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 2/62 (3%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS--VCYFCLRKITSSSQHF 137
+++A + G+GVFAT+ I + + T P+++ + S VCY CLR + + Q
Sbjct: 18 VRLATEDGKGKGVFATKDIPEGETVFTEVPLVSLQHIENRGSALVCYRCLRFVGTVEQQI 77
Query: 138 QH 139
H
Sbjct: 78 GH 79
>gi|347831782|emb|CCD47479.1| hypothetical protein [Botryotinia fuckeliana]
Length = 612
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/242 (23%), Positives = 105/242 (43%), Gaps = 22/242 (9%)
Query: 133 SSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFN-------DYC--RSQGLKYPLLV 183
S + F+H + + + + A+ + R+ + N DYC + Q L + L
Sbjct: 99 SRKPFEHSRTQISAQAVTNTSCAYSNC--RSKKATVNCIGCKAVDYCSKKCQDLDHSLHE 156
Query: 184 KRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ 243
K+ C I + ++ + P ++ + L + S F +++++ + +
Sbjct: 157 KQ--CKAIQSQSLGNSLNKVLPTAVRGIIQLVSNPAALAKDSPFMDLASHEDELRRVQSK 214
Query: 244 WYTNVL---AQIRINAFRIELAGGLYEDL--LSSAAASIESEIA---VGNAIYMLPSFYN 295
W L A + + + E E L LS+ +ES + +G + L + N
Sbjct: 215 WDDISLQAHAVVNLLQRKPEFVKYALESLCRLSTNGFRVESNVGNGPIGLCLDPLLARAN 274
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H C PNA I + D A L AL + +GE++ I YID + ++ R+ L + + FQC C
Sbjct: 275 HSCRPNAAITF-DGKRATLRALSPIAKGEQIFISYIDETQRQEVRREALAKTWFFQCLCS 333
Query: 356 RC 357
RC
Sbjct: 334 RC 335
>gi|226499648|ref|NP_001143943.1| uncharacterized protein LOC100276756 [Zea mays]
gi|195629926|gb|ACG36604.1| hypothetical protein [Zea mays]
Length = 404
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 60/126 (47%), Gaps = 18/126 (14%)
Query: 240 LNKQWYTNVLAQIRINAFRI--ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
L+ +LA+ R N+F I G+ +LL + A A+Y S +NHD
Sbjct: 176 LSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARA----------YAVYPRASLFNHD 225
Query: 298 CDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
C PNA H + D N D + AL D+ EG E+ I Y A+ RQ L + +GF+
Sbjct: 226 CLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFR 285
Query: 352 CNCLRC 357
C C RC
Sbjct: 286 CECDRC 291
Score = 42.0 bits (97), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQ--H 136
++VA GRG+ A R +R +++ + P++ +P TLS+L S C C R +T ++
Sbjct: 17 LRVADLPGRGRGLVAARIVREGEVLFSEPPVLLYPSTLSSLRSYCSACFRSLTPAATIPC 76
Query: 137 FQHHNARFCGEVC 149
A FC C
Sbjct: 77 VSCRAAAFCSPAC 89
>gi|219114284|ref|XP_002176313.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217402716|gb|EEC42705.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 705
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 43/81 (53%), Gaps = 8/81 (9%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARL-------MALRDVEEGEELRICYIDASMA 336
G A+Y NH CDPN +++ L +ALRD++ GEEL I YIDA++
Sbjct: 396 GTAMYATTCKMNHSCDPNVIVLYKRTGWGALHPLVAYCVALRDIQAGEELTISYIDANVP 455
Query: 337 RDARQAILTQGFGFQCNCLRC 357
RQ L + +GF C C +C
Sbjct: 456 WAERQEAL-ENYGFVCECPKC 475
>gi|389739036|gb|EIM80231.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 445
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 116/304 (38%), Gaps = 38/304 (12%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPT-----LSTLNSVCYFCLRKITSSSQHFQ----- 138
G+G+ A I+ D++ P + P L ++ C FC +T++
Sbjct: 106 GKGLVAKEAIKEGDVLWKEDPFVLAPEWDIYDLQQKSAACAFCSTPLTATPLSVSCPSST 165
Query: 139 ---HHNARFCGEVC-----KDNAKAFYDVERRADWS--AFNDYCRSQGLKYPLLVKRLAC 188
H ARFC +C K + A AF R GL LV+ +A
Sbjct: 166 SSIHCPARFCSRLCLSRGTKHTHPLLCPAQNPASVPLLAFVKKNRWMGLHA--LVQVIAR 223
Query: 189 MII---SGAESA-----DCIDILQPASLSPELILAMEEGFVMLRSAFKKA-GIDDEQMKF 239
++I SG E D + L SL +G R + KA + + +
Sbjct: 224 LLIAAQSGEEGQMEKDWDVVSALAELSLEDRAKSVWYKGAEPDRPTWTKAHQLCVQAFQT 283
Query: 240 LNKQWYTNVLAQIRINAFRIELAGGLYE-DLLSSAAASIESEIAVGNAIYMLPSFYNHDC 298
+ LA+I +A L+E + + +Y L S NH C
Sbjct: 284 PSSPQDQKKLAKILRKPLPEHIAKELFEYGGFLRGLGKMNLNLEAHGGLYTLHSHLNHSC 343
Query: 299 DPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ-CN 353
DPN I +D + ++A RD++ GEEL + Y+D + R+ L G+GF C
Sbjct: 344 DPNLSIRHLDPRTSLSRITVIAKRDIDAGEELTVTYVDPGLRLKRRREELG-GWGFGVCK 402
Query: 354 CLRC 357
C RC
Sbjct: 403 CARC 406
>gi|302506763|ref|XP_003015338.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
gi|291178910|gb|EFE34698.1| SET and MYND domain protein, putative [Arthroderma benhamiae CBS
112371]
Length = 498
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 53/112 (47%), Gaps = 17/112 (15%)
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
LA+I +N+F A G +G I + NH C+PNA ++ D
Sbjct: 189 LARIEVNSFTFTTAFG----------------DPLGLCIQPFACYMNHSCEPNA-VVGFD 231
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ ALR+++ E++ I YID + + RQ LT+ + F C C +C+ G
Sbjct: 232 GGLITVKALREIKPDEQVFISYIDNTYPLEVRQKQLTERYFFTCKCSKCAQG 283
>gi|326511411|dbj|BAJ87719.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 29/289 (10%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPT-LSTLNSVCYFCLRKITSSSQHFQ 138
++VA GR + A R I +++ + PI+ +P+ L++L+S C C R +
Sbjct: 6 LRVAELPGRGRALLAARDILEGEVLLSESPILLYPSSLASLSSYCSACFRSLPPPPHTPC 65
Query: 139 H--HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
A FC C A + + + PLL A + + S
Sbjct: 66 PSCRAAAFCSPACA----AASHPRLLCAALSSGLAAAPEAHQEPLLFLLSAYSLQEPSLS 121
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
A P SP + ++ L +A + +L++ R N+
Sbjct: 122 AILSLSSAPQGPSP----SQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALLSKDRTNS 177
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGN--AIYMLPSFYNHDCDPNA-HIMWID----- 308
F I L + E+ A+Y S NHDC PNA H + D
Sbjct: 178 FSI----------LEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPG 227
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N D + AL + EG E+ I Y A+ RQ L + +GF+C C RC
Sbjct: 228 NTDMVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRC 276
>gi|166240153|ref|XP_001733058.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|165988456|gb|EDR41012.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 594
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 286 AIYMLPSFYNHDCDPNAHIMW--IDNADARLMALRDVEEGEELRICYIDAS-MARDARQA 342
+I M SF+NH C+PN I +++ R R +++GEEL I Y+D + + R+
Sbjct: 507 SILMQGSFFNHSCEPNVFIATPVVNDKSIRFCTRRPIKKGEELFISYLDGEKLTTEKRRT 566
Query: 343 ILTQGFGFQCNCLRCSS 359
L + + F CNC C+S
Sbjct: 567 TLKETYSFICNCQACTS 583
>gi|47226174|emb|CAG08321.1| unnamed protein product [Tetraodon nigroviridis]
Length = 417
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 55/99 (55%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + E G+ +++L S NH C PNA + + +N +L AL D+ G
Sbjct: 287 LYKDIEKETGDFLNCE---GSGLFLLQSSCNHSCIPNAEVSFPNNNFLLQLSALSDISPG 343
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F C+C++C+S
Sbjct: 344 EEICISYLDCCQRERSRHSRHKILRENYLFACSCVKCTS 382
>gi|328853243|gb|EGG02383.1| hypothetical protein MELLADRAFT_66419 [Melampsora larici-populina
98AG31]
Length = 546
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 42/75 (56%), Gaps = 4/75 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADAR--LMALRDVEEGEELRICYIDASMARDARQAI 343
A+ PS +NH+C PN+ + DN R + A+RD+ GEE+ I Y D ++ RQ
Sbjct: 352 AVVPEPSIFNHECRPNS-AFYFDNKTMRVYISAVRDIALGEEITIAYRDMKASKAERQTA 410
Query: 344 LTQGFGFQCNCLRCS 358
+ +GF+C C CS
Sbjct: 411 IAH-YGFKCTCTHCS 424
>gi|413917046|gb|AFW56978.1| hypothetical protein ZEAMMB73_771542 [Zea mays]
Length = 404
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 60/126 (47%), Gaps = 18/126 (14%)
Query: 240 LNKQWYTNVLAQIRINAFRI--ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
L+ +LA+ R N+F I G+ +LL + A A+Y S +NHD
Sbjct: 176 LSPDLTAALLAKDRGNSFAIMEPYRHGMSLELLKARA----------YAVYPRASLFNHD 225
Query: 298 CDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
C PNA H + D N D + AL D+ EG E+ I Y A+ RQ L + +GF+
Sbjct: 226 CLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFR 285
Query: 352 CNCLRC 357
C C RC
Sbjct: 286 CECDRC 291
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 3/73 (4%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQH-- 136
++VA GRG+ A R +R +++ + P++ +P TLS+L S C C R + ++
Sbjct: 17 LRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPSTLSSLRSYCSACFRSLPPAATIPC 76
Query: 137 FQHHNARFCGEVC 149
A FC C
Sbjct: 77 VSCRAAAFCSPAC 89
>gi|393212658|gb|EJC98158.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 459
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%), Gaps = 4/80 (5%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWIDNADARLM---ALRDVEEGEELRICYIDASMAR 337
AVG+ I+ L S +NH C PNA + + ++ M AL +++GEE+ I Y D ++
Sbjct: 209 AVGHGIFPLASRLFNHSCLPNAIVTYSFTSEGIQMVVKALTPIKQGEEITIPYFDPALPY 268
Query: 338 DARQAILTQGFGFQCNCLRC 357
RQAI +GF+C C C
Sbjct: 269 HQRQAICRYSYGFECTCSVC 288
>gi|340924027|gb|EGS18930.1| hypothetical protein CTHT_0055430 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 278
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 2/106 (1%)
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
+F ++ YE ++ ++ + ++A A+Y S +NHDC PN H +
Sbjct: 39 ESFMRQMGESFYEKVVKNSFRAFVDDVA-HLAVYPKVSRFNHDCRPNVHYR-LSGLIHTT 96
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+A+RD+ G EL I YI +R RQ L++ + F C C +CS
Sbjct: 97 VAVRDIPAGTELTISYIYVRKSRAERQKELSESWNFTCTCEQCSKS 142
>gi|406695250|gb|EKC98561.1| hypothetical protein A1Q2_07157 [Trichosporon asahii var. asahii
CBS 8904]
Length = 621
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/77 (41%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDA---SMARDAR 340
+AI+ S NH C PN+H W + A R LMALRD++EGEE+ Y+D +R R
Sbjct: 419 SAIFEYISRTNHSCCPNSHWYWDNEAQERYLMALRDIKEGEEITASYVDEFDLLNSRAKR 478
Query: 341 QAILTQGFGFQCNCLRC 357
+ IL + F C C C
Sbjct: 479 RQILRELHEFHCLCPGC 495
>gi|326511387|dbj|BAJ87707.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/289 (24%), Positives = 109/289 (37%), Gaps = 29/289 (10%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPT-LSTLNSVCYFCLRKITSSSQHFQ 138
++VA GR + A R I +++ + PI+ +P+ L++L+S C C R +
Sbjct: 6 LRVAELPGRGRALLAARDILEGEVLLSESPILLYPSSLASLSSYCSACFRSLPPPPHTPC 65
Query: 139 H--HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
A FC C A + + + PLL A + + S
Sbjct: 66 PSCRAAAFCSPACA----AASHPRLLCAALSSGLAAAPEAHQEPLLFLLSAYSLQEPSLS 121
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
A P SP + ++ L +A + +L++ R N+
Sbjct: 122 AILSLSSAPQGPSP----SQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALLSKDRTNS 177
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGN--AIYMLPSFYNHDCDPNA-HIMWID----- 308
F I L + E+ A+Y S NHDC PNA H + D
Sbjct: 178 FSI----------LEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPG 227
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N D + AL + EG E+ I Y A+ RQ L + +GF+C C RC
Sbjct: 228 NTDMVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRC 276
>gi|336272855|ref|XP_003351183.1| hypothetical protein SMAC_08199 [Sordaria macrospora k-hell]
gi|380087872|emb|CCC14032.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 320
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQ 341
+G +++ S NH C PN W + + + A+RD+++GEE+ I Y+ + D RQ
Sbjct: 104 IGGGLFIEASRINHACKPNTQNSWNERISRETIHAVRDIKKGEEITISYMGHFASYDERQ 163
Query: 342 AILTQGFGFQCNCLRCS 358
A L F F C C CS
Sbjct: 164 AFLKDKFKFDCACEVCS 180
>gi|390361260|ref|XP_799878.2| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 315
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 3/77 (3%)
Query: 281 IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
IAVG IY+ S NH C PN +++ D +L +RDV+EG+ I Y+D R
Sbjct: 198 IAVG--IYLRASMLNHSCYPNCVVVF-DERKLQLRTVRDVKEGDVCTISYVDVINPAKER 254
Query: 341 QAILTQGFGFQCNCLRC 357
Q L + + F CNC++C
Sbjct: 255 QTELEERYHFSCNCVKC 271
>gi|221503737|gb|EEE29421.1| SET and MYND domain containing protein, putative [Toxoplasma gondii
VEG]
Length = 402
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMW--IDNADARLMALRDVEEGEELRICYIDASMARDAR 340
+G ++ S NH C PNA + + + NA A++ ALRD+ E EE+ + YID S+ R
Sbjct: 263 LGVGLFRAVSMTNHSCWPNAEVDYPFLTNA-AQVTALRDIAEKEEILLSYIDESLPLAER 321
Query: 341 QAILTQGFGFQCNCLRC 357
Q +L + F C C RC
Sbjct: 322 QRLLKSHYKFTCGCQRC 338
>gi|345570035|gb|EGX52860.1| hypothetical protein AOL_s00007g196 [Arthrobotrys oligospora ATCC
24927]
Length = 557
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 65/124 (52%), Gaps = 18/124 (14%)
Query: 247 NVLAQIRINAFRIELAG-------GLYEDLLSSAAA-----SIESEIAVGNAIYMLPSFY 294
++L ++ ++ F++ L ++ +L +A+A S + E+A A+++ S
Sbjct: 431 HILTRLEVDPFKVSLHTFDIWHTLTIHSKILGTASARHNPTSGQPEVA---ALHLCYSLV 487
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASM-ARDARQAILTQGFGFQCN 353
NHDC+P +MW ++ + LR+V+EGEE+R CY D + + R+ L G C
Sbjct: 488 NHDCEPC--LMWECQSEMKFRTLREVKEGEEVRDCYTDPRIESWKKRREWLMGSLGGGCR 545
Query: 354 CLRC 357
C RC
Sbjct: 546 CGRC 549
>gi|255076907|ref|XP_002502117.1| predicted protein [Micromonas sp. RCC299]
gi|226517382|gb|ACO63375.1| predicted protein [Micromonas sp. RCC299]
Length = 750
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIM---WIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
G A++ L NH C+P+ W A RL ALRD+E GEEL + YID S + R
Sbjct: 660 GQALFSLLCLVNHSCEPSTAARFSSWKGRAMVRLEALRDIECGEELTMSYIDESETLEER 719
Query: 341 QAILTQGFGFQCNCLRC 357
+ L +GF C C +C
Sbjct: 720 TSALAS-YGFACRCNKC 735
>gi|426192306|gb|EKV42243.1| hypothetical protein AGABI2DRAFT_181453 [Agaricus bisporus var.
bisporus H97]
Length = 454
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 47/105 (44%), Gaps = 10/105 (9%)
Query: 264 GLYEDLLSSAAASI---------ESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADAR 313
YED++ + + S+ E++ A++ + S NH C PNA W
Sbjct: 204 SFYEDIMQTNSISVNLHIPSDIPHPELSTHRALFFMLSRCNHSCSPNAQWSWDASTLTLT 263
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ ALR + GEE+ I YI RQ IL +GF C C C+
Sbjct: 264 VTALRPIATGEEITISYIPLYSDPTLRQQILKDAYGFDCVCDECT 308
>gi|242078489|ref|XP_002444013.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
gi|241940363|gb|EES13508.1| hypothetical protein SORBIDRAFT_07g005870 [Sorghum bicolor]
Length = 404
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%), Gaps = 6/88 (6%)
Query: 276 SIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRIC 329
S+E A A+Y S +NHDC PNA H + D N D + AL D+ EG E+ I
Sbjct: 204 SLELLKARAYAVYPRASLFNHDCLPNACHFDYPDRPGPGNTDIVVRALHDIPEGREVCIS 263
Query: 330 YIDASMARDARQAILTQGFGFQCNCLRC 357
Y A+ RQ L + +GF+C C RC
Sbjct: 264 YFAANWRYADRQRRLLEDYGFRCECDRC 291
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 22/73 (30%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQH-- 136
++VA GRG+ A R +R +++ + P++ +P TLS+L S C C R + +++
Sbjct: 17 LRVADLPGRGRGLVAARNVREGEVLLSEPPVLLYPSTLSSLASYCSACFRSLPAAATVPC 76
Query: 137 FQHHNARFCGEVC 149
A FC C
Sbjct: 77 ASCRAAAFCSPAC 89
>gi|169622705|ref|XP_001804761.1| hypothetical protein SNOG_14579 [Phaeosphaeria nodorum SN15]
gi|111056999|gb|EAT78119.1| hypothetical protein SNOG_14579 [Phaeosphaeria nodorum SN15]
Length = 382
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 38/75 (50%), Gaps = 2/75 (2%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAI 343
A+Y F NH C PN H W A + A+R + GEE+ I Y D+ D R+A
Sbjct: 242 GAVYPTICFINHSCIPNVHHSWDTKAKHETIYAIRPIAAGEEITIMY-DSGGPFDMRRAF 300
Query: 344 LTQGFGFQCNCLRCS 358
L + F F CNC CS
Sbjct: 301 LRESFSFDCNCRACS 315
>gi|392566265|gb|EIW59441.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 452
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 5/81 (6%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMA 336
A + ++ L S +NH C PNA +I A +++ALRD+ EGEE+ I Y+D ++
Sbjct: 196 AYAHGVFPLASRLFNHSCIPNAACKYIIRASEPVAMQVVALRDIAEGEEITIPYLDPALP 255
Query: 337 RDARQAILTQGFGFQCNCLRC 357
RQ L + F C+C C
Sbjct: 256 YQTRQEALEVNYSFNCDCRLC 276
>gi|452977535|gb|EME77301.1| hypothetical protein MYCFIDRAFT_179882 [Pseudocercospora fijiensis
CIRAD86]
Length = 616
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/318 (22%), Positives = 129/318 (40%), Gaps = 66/318 (20%)
Query: 86 ESAGRGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNA-- 142
+ AGRG+FA + + D + T +P + + ++ C +C ++ S++ + A
Sbjct: 66 DGAGRGLFAKKDFQPGDAVLTLDRPYVAELDIDRMHDSCAWCFQRGAVSAEERKKAAALG 125
Query: 143 -----------------RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKR 185
R+C + C+ + + E + + + R V +
Sbjct: 126 LPSAVIDTKACAGCKKIRYCSKTCQARS---WKREHKYECNVLKHPNRPDLPHGVRAVIK 182
Query: 186 LACMIISGAESAD--CIDILQ--PASLSPELILAME------EGFVML-RSAFKKAG--- 231
L + + E D + ILQ PA+ S L + E F ML A+K AG
Sbjct: 183 LLGRLKADPEGKDELLLSILQFKPAADSKALDFIKQHDPQRFEDFNMLGYGAWKYAGEPE 242
Query: 232 ---IDDEQMKFLN------KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K N K ++ NV++ +L+ + ++++
Sbjct: 243 FSHPNSPGVKLSNSGEATAKAFFFNVIS-----------------NLMQLSNPIDDTKLG 285
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G + S NH CDPNA +++ + L ALR + +GEE+ + YID S RQA
Sbjct: 286 IGFDPILNSS--NHSCDPNAAVVF-NQPRLVLRALRSISKGEEIFMKYIDVSNPFSVRQA 342
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + + F C C +C G
Sbjct: 343 ELKESYFFSCRCSKCKKG 360
>gi|414585280|tpg|DAA35851.1| TPA: hypothetical protein ZEAMMB73_416669 [Zea mays]
Length = 198
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++ALR + + EE+ I YID + + R
Sbjct: 117 GTAFFPLQSCMNHSCRPNAKAFKRDEDKDGHAVIIALRPISKDEEITIAYIDEDLPYEER 176
Query: 341 QAILTQGFGFQCNCLRCSS 359
QA L +GF C CL+C
Sbjct: 177 QAQLAD-YGFTCTCLKCQE 194
>gi|241712248|ref|XP_002413449.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
gi|215507263|gb|EEC16757.1| SET and MYND domain-containing protein, putative [Ixodes
scapularis]
Length = 374
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE--EGEELRICYIDASMA 336
E +G +Y+ PS +H C PNA ++ D RL A D+E + +R+ YID
Sbjct: 128 EECPIGTGLYIGPSILDHSCTPNAQAVF-DGRTLRLRAAADIECDSVDGIRVSYIDLKEL 186
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
R R L + + F CNC RCS+ +
Sbjct: 187 RRTRIEELRERYYFSCNCPRCSADE 211
>gi|395331801|gb|EJF64181.1| hypothetical protein DICSQDRAFT_178674 [Dichomitus squalens
LYAD-421 SS1]
Length = 488
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 44/85 (51%), Gaps = 3/85 (3%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDA-SMA 336
ESE +G A+Y + SF+NH C PN + R + R VEEGEEL I Y M
Sbjct: 400 ESEC-LGFAVYPIASFFNHHCSPNVRKE-REGRTLRFVTTRTVEEGEELCISYGHVEGMD 457
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
RQ L +G+ F C C RC S D
Sbjct: 458 WATRQQELLEGWYFSCRCSRCKSED 482
>gi|71665295|ref|XP_819619.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884928|gb|EAN97768.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 429
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 114/292 (39%), Gaps = 52/292 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ----HHNAR- 143
GRG++A RI +S + +P + + C CL I S H R
Sbjct: 163 GRGIYALARINSSTPVMLDQPFLVQ---RMRDDACAHCLATIGRSGASAGGVRCAHCDRE 219
Query: 144 -FCGEVCKDNAKAFYDV----ERRADWSAFNDYCRSQGLKYPLLVKR--LACMIISGAES 196
+C C+D A Y V R ++ + R + L + R LAC+ ++
Sbjct: 220 TYCSVACRDAAWREYHVCACVSRNEMYAFWEGAMRERLLSDKMEESRAALACLAVAKLCV 279
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+ + P +L P + + Y A + A
Sbjct: 280 LSTVQQMHPLAL-PRIC------------------------SLRGRADYDASTALSEVGA 314
Query: 257 FRIELAGGL------YEDLLSSAAASIESEIAV--GNAIYMLPSFYNHDCDPNAHIMWID 308
+ LA L E+LLS A +E + G A+Y SF NH C+PN ++
Sbjct: 315 LAVTLATALRQTHLYMEELLSLFAIVQTNEFLLPSGMALYHGYSFLNHSCEPNCALLGSG 374
Query: 309 NADARLMALRDVEEGEELRICYIDASMARD---ARQAILTQGFGFQCNCLRC 357
A+ RL+ LRDV EGE+L I Y +AS+ A + L Q F+C C +C
Sbjct: 375 AANRRLVTLRDVREGEQLFINY-NASLTTRVSYADRRALCQQRHFECFCPKC 425
>gi|407859371|gb|EKG06994.1| hypothetical protein TCSYLVIO_001875 [Trypanosoma cruzi]
Length = 429
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 114/292 (39%), Gaps = 52/292 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH----HNAR- 143
GRG++A RI +S + +P + + C CL I S H R
Sbjct: 163 GRGIYALDRINSSTPVMLDQPFLVQ---RMRDDACAHCLATIGRSGASAGGVRCAHCDRE 219
Query: 144 -FCGEVCKDNAKAFYDV----ERRADWSAFNDYCRSQGLKYPLLVKR--LACMIISGAES 196
+C C+D A Y V R ++ + R + L + R LAC+ ++
Sbjct: 220 TYCSVACRDAAWREYHVCACVSRNEMYAFWEGAMRERLLSDKMEESRAALACLAVAKLCV 279
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+ + P +L P + + Y A + A
Sbjct: 280 LSTVQQMHPLAL-PRIC------------------------SLRGRADYDASTALSEVGA 314
Query: 257 FRIELAGGL------YEDLLSSAAASIESEIAV--GNAIYMLPSFYNHDCDPNAHIMWID 308
+ LA L E+LLS A +E + G A+Y SF NH C+PN ++
Sbjct: 315 LAVTLATALRQTHLYMEELLSLFAIVQTNEFLLPSGMALYHGYSFLNHSCEPNCALLGSG 374
Query: 309 NADARLMALRDVEEGEELRICYIDASMARD---ARQAILTQGFGFQCNCLRC 357
A+ RL+ LRDV EGE+L I Y +AS+ A + L Q F+C C +C
Sbjct: 375 AANRRLVTLRDVREGEQLFINY-NASLTTRVSYADRRALCQQRHFECFCPKC 425
>gi|408398088|gb|EKJ77223.1| hypothetical protein FPSE_02597 [Fusarium pseudograminearum CS3096]
Length = 520
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 107/281 (38%), Gaps = 60/281 (21%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+F+T+ D+I P I P+LS +N++C C ++ + + H +C
Sbjct: 15 GRGIFSTKSFVPGDVILPFTPRILIPSLSHINNICSHCFKQAEVRACS-RCHAVSYCDAA 73
Query: 149 CKDNAKAFYDVERRADWSAFNDY-CR---------SQGLKYPLLVKRLACMIISGAESAD 198
C+ A+W+A + C+ GL P+ R + E +
Sbjct: 74 CQ-----------AANWTAVHSKECKVLRKVTEQGRPGLPTPI---RAVVQALVKPEIGN 119
Query: 199 CIDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
I+ L+ S A E M SAF G E I+ NA
Sbjct: 120 AIEDLEGNGTSWRKSEKWADMEMMAMGASAFAGLGTGQED---------------IQTNA 164
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
F Y+ L +E ++A+ N H C PNA + +I A L A
Sbjct: 165 FH------RYDTDLGQVGIFLEPKLAMAN----------HSCIPNAMVQFIGR-KAILRA 207
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ ++ +E+ I Y D + R+ L F F C CLRC
Sbjct: 208 EKPIKIDDEIEISYTDYTFPLSKRKLALAPYF-FDCMCLRC 247
>gi|313227557|emb|CBY22704.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/305 (21%), Positives = 112/305 (36%), Gaps = 57/305 (18%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHH 140
++ ++E GRG+ A R + +LI A+P VC+ C + Q
Sbjct: 104 EIFVSEEKGRGIKAARDFKVGNLILKAEPFAFVIFDHMAEHVCHHCFNMVVRDRQGQPTT 163
Query: 141 N---------ARFCGEVCKDNAKAFYDVERRADWSAFNDYC--------RSQGLKYPLLV 183
AR+C C+ A WS C R+ + ++
Sbjct: 164 QLLRCSSCKFARYCSRECQKKA-----------WSMHKKECMAIKRIAPRTASDEVRMVS 212
Query: 184 KRLACMIISGAESADCIDILQPASL-------SPELILAMEEGFVMLRSAFKKAGIDDEQ 236
+ L G + A ++ + L S E + +EE + F + D
Sbjct: 213 QILWKQAERGEKRAKSEELCRVEELCDHLNDMSFEDVNKLEEQSKEIGDYFGYENLPD-- 270
Query: 237 MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNH 296
+ ++ ++ + N I GL +G AI+ + NH
Sbjct: 271 ----SDEYIDHLFGIVSCNGMSITDMRGLQ---------------YLGVAIHPTLNLINH 311
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
DC+PN + + + A++ ++EG+EL I YID S + R+ IL + F C C
Sbjct: 312 DCNPNV-VAVSCGPNIFVRAIKPIKEGDELFISYIDTSATSETRKNILKDQYYFDCTCKM 370
Query: 357 CSSGD 361
C SG+
Sbjct: 371 CESGE 375
>gi|380472212|emb|CCF46894.1| hypothetical protein CH063_15492 [Colletotrichum higginsianum]
Length = 429
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 34/67 (50%)
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
S YNHDC PN + R +R V+ GEEL I Y+D R RQ + +GF+
Sbjct: 249 SRYNHDCRPNVAFRIGPDLRHRTTVVRPVKPGEELTISYLDPLGTRSVRQXRAKRAWGFE 308
Query: 352 CNCLRCS 358
C C +C
Sbjct: 309 CGCSQCG 315
>gi|198433887|ref|XP_002127168.1| PREDICTED: similar to SMYD family member 5 [Ciona intestinalis]
Length = 372
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/103 (36%), Positives = 57/103 (55%), Gaps = 9/103 (8%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALR 318
L GLYE+L + A + + E G +Y + S NH C+PNA + + +N + RL A R
Sbjct: 261 LIDGLYEELENVAGSFLNCE---GAGLYRIQSKCNHSCEPNAEVCFPNN-NHRLAVKACR 316
Query: 319 DVEEGEELRICYI-DASMARD--ARQAILTQGFGFQCNCLRCS 358
D+ GEE+ I Y+ +AR +RQ L + + F C C +CS
Sbjct: 317 DIAAGEEITISYLSQCQIARGCRSRQQYLKENYLFHCCCSKCS 359
>gi|171678699|ref|XP_001904299.1| hypothetical protein [Podospora anserina S mat+]
gi|170937419|emb|CAP62077.1| unnamed protein product [Podospora anserina S mat+]
Length = 328
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 3/76 (3%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQA 342
+++ S NH C PN H W +N RL ALRD+++GEEL + Y+ + R RQ
Sbjct: 105 GGLFLEMSCMNHSCRPNTHYNW-NNKTERLTIHALRDIQDGEELTVSYMTQTGPRVHRQK 163
Query: 343 ILTQGFGFQCNCLRCS 358
L F F C C C
Sbjct: 164 FLEDCFFFHCECELCG 179
>gi|449274577|gb|EMC83667.1| SET and MYND domain-containing protein 5, partial [Columba livia]
Length = 383
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + + E G+ +Y+L S NH C PNA + +N L AL D+E G
Sbjct: 257 LYKDMEKESGEFLNCE---GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAG 313
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 314 EEICISYLDCCQRERSRHSRNKILRENYLFTCSCPKC 350
>gi|402074108|gb|EJT69637.1| hypothetical protein GGTG_12521 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 284
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 48/92 (52%), Gaps = 2/92 (2%)
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEE 325
ED++++ I ++ ++ S NH C PNA DN ++M A RD+E GEE
Sbjct: 60 EDIITTNGFGIHADTGSPGGLFPNASRINHSCIPNADQASFDNDGYKVMRANRDIETGEE 119
Query: 326 LRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ YI + R+ RQ +L + +GF C C C
Sbjct: 120 ITTSYIAHFLPRELRQLLLGR-WGFACQCPAC 150
>gi|380016398|ref|XP_003692172.1| PREDICTED: SET and MYND domain-containing protein 5-like [Apis
florea]
Length = 387
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 63/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI+ L +Y+D+ + A+ + +E G+ +Y+L S NH C
Sbjct: 246 RWVKNVSALELPREERIQVDKLIDRIYDDMEEAVASFLNNE---GSGLYILQSSVNHSCV 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD+ EE+ I Y+D +R +RQ L + FQC C
Sbjct: 303 PNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHSRQKALNSLYLFQCYCN 362
Query: 356 RCSS 359
+C S
Sbjct: 363 KCLS 366
>gi|212541368|ref|XP_002150839.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
gi|210068138|gb|EEA22230.1| SET and MYND domain protein, putative [Talaromyces marneffei ATCC
18224]
Length = 527
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/308 (24%), Positives = 129/308 (41%), Gaps = 46/308 (14%)
Query: 72 PSQPSPPPIQVALTE---------SAGRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNS 121
PS+PS ALTE S G GVFAT I+ +D++ P++ L +
Sbjct: 3 PSEPSA--ANSALTEVRPAQQNGVSIGLGVFATANIKKGADVLVIDTPMVALVEEKQLKN 60
Query: 122 VCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVE-RRADWSAFND-----YCRSQ 175
+C C TS + + + D +R DW A + Y +
Sbjct: 61 ICSGCFD--TSKAASIDSRRPGLVKACVRCKVVYYCDKNCQRKDWKAGHSVECAIYSKLY 118
Query: 176 GLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE 235
PL V+ + +I+S + +SP + E F+ L ++ K DD+
Sbjct: 119 PRILPLPVRAILRVILSQDKD----------KVSP----YIHEEFLDL--SYPKEIYDDD 162
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLY-EDL---LSSAAASIESEIAVGNAIYMLP 291
+ + N + ++A+ + IE Y E L LS+ + ++ S +Y+ P
Sbjct: 163 ENEEANDHF---LMAEAVLKYGDIEYLDEDYVEQLFGKLSANSFTLTSAFGKRRGVYLHP 219
Query: 292 SF--YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+ +NH C+PNA + D + A + + +GE++ I YID + + R+ L + +
Sbjct: 220 AAARFNHSCNPNASYSF-DKGKCYIRATKPIAKGEQIFIPYIDTTYSVGTRRHELKERYK 278
Query: 350 FQCNCLRC 357
F C C RC
Sbjct: 279 FDCQCPRC 286
>gi|198461003|ref|XP_002138938.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
gi|198137203|gb|EDY69496.1| GA25085 [Drosophila pseudoobscura pseudoobscura]
Length = 515
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 115/299 (38%), Gaps = 57/299 (19%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P ++A + GR + ATR I+ ++I P++ P T VC CL I
Sbjct: 42 PFKIAHNDLLGRHLVATRTIKPYEIILKEAPLVRGPAQIT-PPVCMGCLNSIEPED---- 96
Query: 139 HHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLK-------YPLL--VKRLAC 188
H + CG +C K+ E RA+ + R Q + +PL V + C
Sbjct: 97 HIDCDLCGWPLCGPECKSLG--EHRAECQLTQE--RGQKVNVHEFNGSHPLYTCVSTVRC 152
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
++I SPE +E L S E + + QW ++
Sbjct: 153 LLI--------------GETSPEKASKFQE----LESL--------ESTRRGSNQWKADL 186
Query: 249 --LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLPSFYNHDCD 299
+ Q F+ E E+ + A +++ E+ + A++ SF H C
Sbjct: 187 ASIGQFIPKFFKTE---KFSEEEVMRAVGALQINGHEVPTSDPPHVAVFYTASFTEHSCM 243
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
PN + N L A R++++ L ICY DA RQ L Q F+C C RC+
Sbjct: 244 PNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCT 302
>gi|449501577|ref|XP_002187845.2| PREDICTED: SET and MYND domain-containing protein 5-like
[Taeniopygia guttata]
Length = 404
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + + E G+ +Y+L S NH C PNA + +N L AL D+E G
Sbjct: 278 LYKDIEKESGEFLNCE---GSGLYVLQSCCNHSCIPNAETSFPENNFLLHLTALEDIEAG 334
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 335 EEICISYLDCCQRERSRHSRNKILRENYLFTCSCPKC 371
>gi|224047131|ref|XP_002191464.1| PREDICTED: N-lysine methyltransferase SMYD2 [Taeniopygia guttata]
Length = 436
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 121/297 (40%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A RR +L+ + + T+S S C C + S+ CG
Sbjct: 21 GRGLRARRRFAVGELLLSCPAYVAVLTVSERGSHCDGCFARKEGLSK---------CGR- 70
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY +VE ++ DW C S G + P RL I++ ++
Sbjct: 71 CK---QAFYCNVECQKEDWPMHKLECASMCTFGQNWHPSETVRLTARILAKQKT------ 121
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTN--------- 247
P E +LA++E S K +D+E+ + + +Y+
Sbjct: 122 -HPERTQSEKLLAVKE----FESHLDK--LDNEKRELIQNDIAALHHFYSKHLDYPDNAA 174
Query: 248 ---VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 175 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 217
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A+R++E GEE+ YID + R L + F C+C C++ +
Sbjct: 218 VTYKGTLAEVRAVREIEPGEEIFSSYIDLLYPTEDRNDRLRDSYFFSCDCRECTTKE 274
>gi|242823869|ref|XP_002488146.1| SET domain containing protein [Talaromyces stipitatus ATCC 10500]
gi|218713067|gb|EED12492.1| SET domain containing protein [Talaromyces stipitatus ATCC 10500]
Length = 251
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAIL 344
+IY+L S +NH C PN + + D ++D+ GEEL I YID S +R+
Sbjct: 113 SIYLLGSRFNHSCVPNTTHSYHPSLDKETFHTIQDITAGEELLITYIDGSNWDRSRRQKY 172
Query: 345 TQGFGFQCNCLRC 357
Q +GFQCNC C
Sbjct: 173 LQKWGFQCNCPVC 185
>gi|66525107|ref|XP_394075.2| PREDICTED: SET and MYND domain-containing protein 5 [Apis
mellifera]
Length = 387
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 63/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI+ L +Y+D+ + A+ + +E G+ +Y+L S NH C
Sbjct: 246 RWVKNVSALELPREERIQVDKLIDRIYDDMEEAVASFLNNE---GSGLYILQSSVNHSCV 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD+ EE+ I Y+D +R +RQ L + FQC C
Sbjct: 303 PNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHSRQKALNSLYLFQCYCN 362
Query: 356 RCSS 359
+C S
Sbjct: 363 KCLS 366
>gi|195170308|ref|XP_002025955.1| GL10122 [Drosophila persimilis]
gi|194110819|gb|EDW32862.1| GL10122 [Drosophila persimilis]
Length = 515
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 115/299 (38%), Gaps = 57/299 (19%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P ++A + GR + ATR I+ ++I P++ P T VC CL I
Sbjct: 42 PFKIAHNDLLGRHLVATRTIKPYEIILKEAPLVRGPAQIT-PPVCMGCLNSIEPED---- 96
Query: 139 HHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLK-------YPLL--VKRLAC 188
H + CG +C K+ E RA+ + R Q + +PL V + C
Sbjct: 97 HIDCDLCGWPLCGPECKSLG--EHRAECQLTQE--RGQKVNVHEFNGSHPLYTCVSTVRC 152
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
++I SPE +E L S E + + QW ++
Sbjct: 153 LLI--------------GETSPEKASKFQE----LESL--------ESTRRGSNQWKADL 186
Query: 249 --LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLPSFYNHDCD 299
+ Q F+ E E+ + A +++ E+ + A++ SF H C
Sbjct: 187 ASIGQFIPKFFKTE---KFSEEEVMRAVGALQINGHEVPTSDPPHVAVFYTASFTEHSCM 243
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
PN + N L A R++++ L ICY DA RQ L Q F+C C RC+
Sbjct: 244 PNLAKSFNKNGHCILWAPREIKKNSHLSICYSDAMWGTADRQRHLVQTKLFKCACDRCT 302
>gi|302676884|ref|XP_003028125.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
gi|300101813|gb|EFI93222.1| hypothetical protein SCHCODRAFT_17369 [Schizophyllum commune H4-8]
Length = 427
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 5/77 (6%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDN----ADARLMALRDVEEGEELRICYIDASMARDAR 340
+Y+L S NH C PN + D A ++A +D+ GEEL I Y+D AR
Sbjct: 315 GGLYVLHSHMNHSCVPNISVRHFDKSTNWARITMVAKKDLAPGEELMISYVDPEAPYKAR 374
Query: 341 QAILTQGFGFQCNCLRC 357
QA L Q +GF+C C RC
Sbjct: 375 QAELEQ-WGFKCVCPRC 390
>gi|157134709|ref|XP_001656403.1| hypothetical protein AaeL_AAEL000436 [Aedes aegypti]
gi|108884280|gb|EAT48505.1| AAEL000436-PA [Aedes aegypti]
Length = 392
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/174 (27%), Positives = 74/174 (42%), Gaps = 35/174 (20%)
Query: 216 MEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIE--------------- 260
++E + ++ AF DE++K+L + ++LA I N I
Sbjct: 199 IDELYELISRAFDPTA--DERLKWLTLSGFRSLLALIGTNGQGIGTSSFADWVRNVSELD 256
Query: 261 -----------LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-D 308
L LY L + +E G+A+Y S NH C PN +
Sbjct: 257 LPDQQRESVDILIDNLYNKLDEVVGTFLNNE---GSALYSYQSKVNHSCSPNVECRFPHS 313
Query: 309 NADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
N L A RD++ GEE+ I Y+D +R +RQ +L++ + FQC C +C S
Sbjct: 314 NNVLALTATRDIKVGEEICISYLDECALERSRHSRQKMLSENYLFQCQCEKCDS 367
>gi|330791636|ref|XP_003283898.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
gi|325086169|gb|EGC39563.1| hypothetical protein DICPUDRAFT_86015 [Dictyostelium purpureum]
Length = 555
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 32/126 (25%), Positives = 56/126 (44%), Gaps = 15/126 (11%)
Query: 233 DDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPS 292
+ E NK+ Y V+ I +N+F GL + + + N + +
Sbjct: 180 NSELSSIFNKEEYLEVVCSIFVNSF-----AGLSNNF---------NRTPISNGYFYKAA 225
Query: 293 FYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NH C+PN I N+ + ++D+ +GE++ Y+D + RQ IL + F C
Sbjct: 226 LLNHSCEPNT-FFSIQNSTLEMRVVKDINKGEQIFDSYVDLLLPTIERQKILLKSKNFVC 284
Query: 353 NCLRCS 358
+C RC+
Sbjct: 285 DCSRCT 290
>gi|19113306|ref|NP_596514.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74626997|sp|O94256.1|SET6_SCHPO RecName: Full=SET domain and MYND-type zinc finger protein 6
gi|3810831|emb|CAA21792.1| histone lysine methyltransferase Set6 (predicted)
[Schizosaccharomyces pombe]
Length = 483
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 2/104 (1%)
Query: 257 FRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM 315
++ EL L+ L +A + S ++G + + NH CDPN I++ D A +L+
Sbjct: 154 YQAELFQKLFCRLAVNAMNLVTSSFDSLGMCLDTILCRLNHSCDPNCQIIF-DGAIVQLV 212
Query: 316 ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ RD+++ E+L I YID + + RQ L + + F C C RC +
Sbjct: 213 SKRDIKKDEQLFISYIDIRLPKSIRQKQLLKKYFFSCYCPRCEN 256
>gi|391328903|ref|XP_003738922.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Metaseiulus occidentalis]
Length = 297
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 44/81 (54%), Gaps = 3/81 (3%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD--VEEGEELRICYIDASMARDAR 340
+G A+Y+ PS ++H C P+A I + + + A+RD V + E+ ICY D D R
Sbjct: 49 IGEALYLAPSIHDHSCLPDA-IYTFNGCELTIRAMRDLAVSDPREIHICYTDILQPLDDR 107
Query: 341 QAILTQGFGFQCNCLRCSSGD 361
L + + F+C C+ CS D
Sbjct: 108 IGFLEEHYFFRCQCVLCSQAD 128
>gi|326508652|dbj|BAJ95848.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 70/288 (24%), Positives = 108/288 (37%), Gaps = 29/288 (10%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPT-LSTLNSVCYFCLRKITSSSQHFQH 139
+VA GR + A R I +++ + PI+ +P+ L++L+S C C R +
Sbjct: 1 RVAELPGRGRALLAARDILEGEVLLSESPILLYPSSLASLSSYCSACFRSLPPPPHTPCP 60
Query: 140 --HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
A FC C A + + + PLL A + + SA
Sbjct: 61 SCRAAAFCSPACA----AASHPRLLCAALSSGLAAAPEAHQEPLLFLLSAYSLQEPSLSA 116
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
P SP + ++ L +A + +L++ R N+F
Sbjct: 117 ILSLSSAPQGPSP----SQQQEAASLHAAVASVAPPHMLPAGFSLDLTAALLSKDRTNSF 172
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGN--AIYMLPSFYNHDCDPNA-HIMWID-----N 309
I L + E+ A+Y S NHDC PNA H + D N
Sbjct: 173 SI----------LEPYRPDVPLELRKARCCAVYPRASLLNHDCLPNACHFDYADRPGPGN 222
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
D + AL + EG E+ I Y A+ RQ L + +GF+C C RC
Sbjct: 223 TDMVVRALHGITEGNEVCISYFAANWRYADRQRRLLEDYGFRCECDRC 270
>gi|340725361|ref|XP_003401039.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
terrestris]
Length = 386
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI+ L +Y+DL + A + +E G+ +Y+ S NH C
Sbjct: 246 RWVKNVSALELPRDERIQVDKLIDRIYDDLEEAVAFFLNNE---GSGLYIFQSSINHSCV 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD+ EE+ I Y+D +R +RQ +L+ + F C C
Sbjct: 303 PNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHSRQKVLSSLYLFHCYCN 362
Query: 356 RCSS 359
RC S
Sbjct: 363 RCLS 366
>gi|170097029|ref|XP_001879734.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164645137|gb|EDR09385.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 435
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 6/78 (7%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDAR 340
+Y L S NH C PNA + +D A ++A R +++GEEL I Y++ + + R
Sbjct: 326 GGLYTLHSHLNHSCRPNASVRHLDQRTALSRITIVAKRPIKKGEELLISYVNPELRYETR 385
Query: 341 QAILTQGFGF-QCNCLRC 357
Q+ L QG+GF C C RC
Sbjct: 386 QSEL-QGWGFGSCRCQRC 402
>gi|449496642|ref|XP_002188283.2| PREDICTED: SET and MYND domain-containing protein 3 [Taeniopygia
guttata]
Length = 289
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 45/79 (56%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG +Y S NH CDPN I++ + L ++RD++ GEEL I Y+++ M RQ
Sbjct: 54 VGVGLYPSMSLLNHSCDPNCVIVF-EGYQLLLHSVRDIQIGEELTISYVESLMPTRERQK 112
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + + F+C+C C + +
Sbjct: 113 QLMRQYCFECDCPLCQNQE 131
>gi|302694307|ref|XP_003036832.1| hypothetical protein SCHCODRAFT_80474 [Schizophyllum commune H4-8]
gi|300110529|gb|EFJ01930.1| hypothetical protein SCHCODRAFT_80474 [Schizophyllum commune H4-8]
Length = 348
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 292 SFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
S NH C+PNA+ W +D L A R +++GEE+ I Y++ S + RQ L + F
Sbjct: 279 SRLNHSCEPNAYTDWDVDTMTFGLYAERPIKKGEEIFIAYVEPSHPKQHRQYALWGMYSF 338
Query: 351 QCNCLRCS 358
C C +CS
Sbjct: 339 NCRCTKCS 346
>gi|389744758|gb|EIM85940.1| SET domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 480
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 46/84 (54%), Gaps = 6/84 (7%)
Query: 283 VGNAIYMLPS-FYNHDCDPNAHIMWI---DNADARLM--ALRDVEEGEELRICYIDASMA 336
+ + I+ + S +NH C PNA +I D R++ ALR++ EE+ I Y+D ++
Sbjct: 209 IAHGIFPVASRLFNHSCTPNAAAKYIFGRDKPGVRMVVVALRNISHNEEITIPYLDPALP 268
Query: 337 RDARQAILTQGFGFQCNCLRCSSG 360
RQ L Q +GF+C C CS G
Sbjct: 269 FLERQLSLEQNYGFRCTCHVCSFG 292
>gi|194698994|gb|ACF83581.1| unknown [Zea mays]
Length = 268
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 58/118 (49%), Gaps = 18/118 (15%)
Query: 248 VLAQIRINAFRI--ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HI 304
+LA+ R N+F I G+ +LL + A A+Y S +NHDC PNA H
Sbjct: 48 LLAKDRGNSFAIMEPYRHGMSLELLKARAY----------AVYPRASLFNHDCLPNACHF 97
Query: 305 MWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ D N D + AL D+ EG E+ I Y A+ RQ L + +GF+C C RC
Sbjct: 98 DYPDRPGPGNTDIVVRALHDIPEGREVCISYFAANWRYADRQRRLLEDYGFRCECDRC 155
>gi|432876719|ref|XP_004073079.1| PREDICTED: SET and MYND domain-containing protein 5-like [Oryzias
latipes]
Length = 418
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 14/152 (9%)
Query: 216 MEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIE----LAGGLYEDLLS 271
+ EGF RS F G + + + + + + + + A + E LY+D+
Sbjct: 237 LPEGF---RSLFALVGTNGQGIGTSSLSQWVHACDALELPAQQREHLDSFIDQLYKDIEK 293
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICY 330
+ E G+ +Y+L S NH C PNA + +N L AL D+ GEE+ I Y
Sbjct: 294 ETGDFLNCE---GSGLYLLQSSCNHSCTPNAEASFPENNFLLHLCALSDINPGEEICISY 350
Query: 331 IDA---SMARDARQAILTQGFGFQCNCLRCSS 359
+D +R +R IL + + F C C++C S
Sbjct: 351 LDCCQRDRSRHSRHKILRENYLFACACVKCES 382
>gi|22450579|gb|AAM97151.1| unknown protein,3'-partial [Oryza sativa Japonica Group]
Length = 425
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 119/286 (41%), Gaps = 33/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+FA R +++ + +P + P ++ S C C LRK +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG C+ + +E RA + D + + P + RL ++
Sbjct: 76 -----RVAWYCGSACQREEWKLHQLECRAIAALTED---RKKMLTPTI--RLMVRLVLRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ D I + + L+ A+E + +D Q+ + L Q+ +
Sbjct: 126 KLQDDKAIPSSGTDNYNLVDALESHI---------SEVDKNQLVLYAQMAN---LVQLIL 173
Query: 255 NAFRIELA--GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+F ++L + +A + E+ +G +Y + S NH C PNA ++ +
Sbjct: 174 PSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNA-VLIFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + AL+ + + EE+ I YI+ + RQ L + + F C C RC
Sbjct: 233 AYVRALQPISKNEEVSISYIETAATTMKRQDDL-KHYYFTCTCPRC 277
>gi|156050587|ref|XP_001591255.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980]
gi|154692281|gb|EDN92019.1| hypothetical protein SS1G_07881 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 593
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 53/93 (56%), Gaps = 4/93 (4%)
Query: 270 LSSAAASIESEIA---VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEEL 326
LS+ A +ES + +G + L + NH C PNA I + D A L AL ++ GE++
Sbjct: 220 LSTNAFRVESNVGNGPIGLCLDPLLARANHCCYPNAAITF-DGKRATLRALFPIKNGEQI 278
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
I YID + ++ RQA L + + F+C C RC++
Sbjct: 279 FISYIDETQRQEVRQAALEETWFFKCRCSRCTN 311
>gi|296085566|emb|CBI29298.3| unnamed protein product [Vitis vinifera]
Length = 327
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 17/119 (14%)
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV-GNAIYMLPSFYNHDCDPNA-H 303
T +LA+ ++NAF L+ A + E +V IY SF+NHDC PNA
Sbjct: 117 TALLAKDKLNAF----------GLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACR 166
Query: 304 IMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++D N D + + DV EG E+ + Y + RQ L + +GF C C RC
Sbjct: 167 FDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRC 225
>gi|340992675|gb|EGS23230.1| hypothetical protein CTHT_0008940 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 459
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 285 NAIYMLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
IY+ P + NH C P+A + D +A L A RD++EG+E+ ICY+D ++ A +
Sbjct: 117 TGIYLDPVLAMANHSCMPSAFVS-FDQRNAVLRAWRDIKEGDEITICYVDVTLPNKAARQ 175
Query: 343 ILTQGFGFQCNCLRCS 358
+ + F+C C RC
Sbjct: 176 EALKLYHFECRCPRCK 191
>gi|346978403|gb|EGY21855.1| hypothetical protein VDAG_03295 [Verticillium dahliae VdLs.17]
Length = 558
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 55/113 (48%), Gaps = 23/113 (20%)
Query: 246 TNVLAQIRINAF-RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+L +I+ NAF R E GL+ D + M+ NH C PNA I
Sbjct: 191 VELLCKIKTNAFTRSEAGEGLFLD----------------TTLAMV----NHSCVPNA-I 229
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ A L ALRD++EGEE+ I YID + + + R+ L F FQC C RC
Sbjct: 230 VAFSGRRAFLRALRDIKEGEEIEISYIDCTQSLEHRRKALELYF-FQCVCTRC 281
>gi|336467862|gb|EGO56025.1| hypothetical protein NEUTE1DRAFT_146820 [Neurospora tetrasperma
FGSC 2508]
gi|350289902|gb|EGZ71127.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 320
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQ 341
VG +++ S NH C+PN W + + A+RD+++GEE+ I YI A RQ
Sbjct: 104 VGGGLFIEASRINHACNPNTQNSWNSRISRETIHAVRDIKKGEEITISYIGHFAAYVERQ 163
Query: 342 AILTQGFGFQCNCLRCS 358
+IL F F C C CS
Sbjct: 164 SILKIKFNFDCACELCS 180
>gi|440465269|gb|ELQ34604.1| hypothetical protein OOU_Y34scaffold00760g4 [Magnaporthe oryzae
Y34]
Length = 416
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 59/113 (52%), Gaps = 7/113 (6%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+LA R F ++ ED+L + ++ A +++ S NH+C PN + +
Sbjct: 199 ILALARTGGFESQV-----EDVLRTNIFGVDVGGAFHMGLFVEGSRVNHNCRPNVYWEYD 253
Query: 308 DNADAR-LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
A+ ++ALRD+E+GEEL Y+ +R R+ L + +GF+C C CS+
Sbjct: 254 TKTMAQEVVALRDIEQGEELTHSYVTLGGSRSQRREEL-EAWGFECKCALCSA 305
>gi|289740749|gb|ADD19122.1| SET and MYND domain containing protein 5 [Glossina morsitans
morsitans]
Length = 395
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 44/78 (56%), Gaps = 4/78 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASM---ARDA 339
G+ +Y+L S NH C PNA + + N L AL+ ++ GEE+ I Y+D +R +
Sbjct: 286 GSGLYLLQSKINHSCLPNAQVTFPYSNDIVVLKALQPIQVGEEICISYLDEGQLERSRHS 345
Query: 340 RQAILTQGFGFQCNCLRC 357
RQ IL + + F C C +C
Sbjct: 346 RQKILKENYIFVCECFKC 363
>gi|195426423|ref|XP_002061334.1| GK20779 [Drosophila willistoni]
gi|194157419|gb|EDW72320.1| GK20779 [Drosophila willistoni]
Length = 515
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 112/290 (38%), Gaps = 25/290 (8%)
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
K +P+ P ++A E GR + ATR I+ ++I P++ P + VC CL I
Sbjct: 34 KSHKPNCHPYKIAHNEVLGRHLVATRNIKPYEIILKEAPLVRGPAQISA-PVCMGCLNSI 92
Query: 131 TSSSQ-HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLL--VKRLA 187
+ + CG CK + + + D + QG +PL V +
Sbjct: 93 EPTDHIECEQCGWPLCGPECKSLDEHRAECKLTKDRGQKVNVQEFQG-PHPLYTCVSTVR 151
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
C+++ +A+ SL + G ++ G +F+ K + T
Sbjct: 152 CLLLGETNAANAKKFSDLESLE-----STRRGSSQWKADLTSIG------QFIPKFFKTQ 200
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
++ I + G L + + + A++ + SF + C PN +
Sbjct: 201 KFSEEEI----MRTVGALQIN-----GHEVPTSDPPHVAVFYVASFTENSCLPNLAKSFN 251
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ L A + +++ L ICY DA RQ L Q F+C CLRC
Sbjct: 252 KHGHCILWAPQAIKKNSHLSICYSDAIWGTSDRQRHLMQTKLFKCACLRC 301
>gi|328850125|gb|EGF99294.1| hypothetical protein MELLADRAFT_68704 [Melampsora larici-populina
98AG31]
Length = 467
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 3/75 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDARQAI 343
A+++ S NHDC PN + D A +L A+R++E GEEL Y++ + R+A
Sbjct: 274 ALFLDASRANHDCRPNV-VYHTDGATLKLNMYAVREIEVGEELVAAYLNIQQPTETRRAY 332
Query: 344 LTQGFGFQCNCLRCS 358
L Q +GF+C C CS
Sbjct: 333 LRQHYGFECQCSVCS 347
>gi|195121780|ref|XP_002005397.1| GI19093 [Drosophila mojavensis]
gi|193910465|gb|EDW09332.1| GI19093 [Drosophila mojavensis]
Length = 515
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/298 (23%), Positives = 116/298 (38%), Gaps = 57/298 (19%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P +VA E GR + ATR I+ +++ P++ P ++ VC CL I + H
Sbjct: 42 PFRVAHNELLGRHLVATRNIKPYEIVLKEAPLMRGPAQISV-PVCMGCLNAI-EPNDHIT 99
Query: 139 HHNARF--CGEVCK--DNAKAFYDVE----RRADWSAFNDYCRSQGLKYPLL--VKRLAC 188
+ CG C+ D KA + ++ + FN +PL V + C
Sbjct: 100 CDKCGWPLCGPECQALDEHKAECQLTEARGQKVNVQEFNG-------PHPLYTCVSTVRC 152
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+++S A+ E F L S EQ + + QW ++
Sbjct: 153 LLLSETNPAN------------------AEKFQQLESL--------EQTRRGSNQWKADL 186
Query: 249 --LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLPSFYNHDCD 299
+ Q FR + E+ + A +++ E+ + A++ SF + C
Sbjct: 187 ASIGQFIPKFFRTQ---KFSEEEIMRAVGALQINGHEVPTSDPPHVAVFYTASFTENSCV 243
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
PN + N L A + +++G L ICY DA RQ L Q F+C C RC
Sbjct: 244 PNLAKSFTKNGHCILWAPKAIKKGANLSICYSDAVWGTADRQRHLMQTKLFKCTCERC 301
>gi|164656030|ref|XP_001729143.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
gi|159103033|gb|EDP41929.1| hypothetical protein MGL_3610 [Malassezia globosa CBS 7966]
Length = 390
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 42/74 (56%), Gaps = 7/74 (9%)
Query: 291 PSF--YNHDCDPNAHIMWIDN-----ADARLMALRDVEEGEELRICYIDASMARDARQAI 343
P+F NH CDPNA I++ D + L+A+R + GEE+RI Y+D + + RQA
Sbjct: 85 PTFALINHSCDPNAVIVFPDRVRGMPSKMHLVAIRAISAGEEVRISYVDVASCQAERQAT 144
Query: 344 LTQGFGFQCNCLRC 357
L + + F C C C
Sbjct: 145 LLERYCFACECRLC 158
>gi|409048091|gb|EKM57569.1| hypothetical protein PHACADRAFT_251260 [Phanerochaete carnosa
HHB-10118-sp]
Length = 314
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAIL 344
I++L S N C PN W + + + LRDV++GEEL Y D A+D R+ L
Sbjct: 118 GIFLLASRINSSCTPNVSKSWDEIRNVMVFRTLRDVQDGEELCFNYCDVLSAQDERKRTL 177
Query: 345 TQGFGFQCNCLRCSSG 360
+ FGF C C C G
Sbjct: 178 MEEFGFDCTCDACRLG 193
>gi|238588335|ref|XP_002391696.1| hypothetical protein MPER_08835 [Moniliophthora perniciosa FA553]
gi|215456713|gb|EEB92626.1| hypothetical protein MPER_08835 [Moniliophthora perniciosa FA553]
Length = 422
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 40/67 (59%), Gaps = 2/67 (2%)
Query: 292 SFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
S NH C PN H ++ ++ ++ ALRD+++GEEL I YID RQ L + +GF
Sbjct: 261 SRINHSCRPNVHTKFVMESFSVQVRALRDIKKGEELFISYIDIQTITQNRQKEL-EPYGF 319
Query: 351 QCNCLRC 357
+C CL C
Sbjct: 320 RCACLAC 326
>gi|307170062|gb|EFN62504.1| SET and MYND domain-containing protein 4 [Camponotus floridanus]
Length = 333
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 46/84 (54%), Gaps = 3/84 (3%)
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICY-ID-AS 334
IE +I + AIY S NH CDPN ++D + A++D++EGEE+ CY +D
Sbjct: 218 IEEQIRIATAIYPSASMMNHSCDPNIINSFLDQT-LIVKAIQDIKEGEEILHCYGVDFRR 276
Query: 335 MARDARQAILTQGFGFQCNCLRCS 358
M + RQ L + F CNC C+
Sbjct: 277 MPKAERQESLESQYYFTCNCKACT 300
>gi|260828157|ref|XP_002609030.1| hypothetical protein BRAFLDRAFT_84846 [Branchiostoma floridae]
gi|229294384|gb|EEN65040.1| hypothetical protein BRAFLDRAFT_84846 [Branchiostoma floridae]
Length = 302
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/150 (26%), Positives = 71/150 (47%), Gaps = 8/150 (5%)
Query: 217 EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAG-GLYE---DLLSS 272
++ +LR +A ++ ++ + ++ A I N I + +Y D L
Sbjct: 128 QDQLELLRGLLTEALYEESLDQWFTPDGFRSIFAMIGRNGQGIGTSSLSVYVHNCDALEL 187
Query: 273 AAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYI 331
+ E A + +Y+ NH C+P A + + N + ALRD+ EGEEL ICY+
Sbjct: 188 PSQDREKLDAFIDQLYVDMEHGNHSCEPTAEPSFDESNYVLSMRALRDITEGEELFICYL 247
Query: 332 DA---SMARDARQAILTQGFGFQCNCLRCS 358
D + +R +RQ +L + + F C C +C+
Sbjct: 248 DECERTRSRHSRQKLLRENYLFSCTCEKCT 277
>gi|254584925|ref|XP_002498030.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
gi|238940924|emb|CAR29097.1| ZYRO0G00462p [Zygosaccharomyces rouxii]
Length = 485
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/322 (20%), Positives = 109/322 (33%), Gaps = 90/322 (27%)
Query: 90 RGVFATRRIRASDLIHTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQHFQHH---- 140
RG+FAT ++A D+I P++ P + L C C SS H QH
Sbjct: 120 RGLFATNALKAGDMIFEDLPLVVVPPMDKLVLMQSGKACPLC----GSSVSHSQHTIIMN 175
Query: 141 -------NARFCGEVCKDNAKAFYDVERRAD---------WSAFNDYCR--------SQG 176
A +C + CK N + W + +C+ S G
Sbjct: 176 GLDCGECGAVWCSKNCKKNDLTHTSLRHTRSRCRQINLNGWKKYESFCKQYVFVAAYSIG 235
Query: 177 LKYPLLV--------------------KRLACMIISGAESADCIDILQPASLSPELILAM 216
+ Y ++ +R+ +D A S +
Sbjct: 236 IIYASMILDKNGKNGVHPKFQLLADVSQRIRKDSSDSTNLGGTLDASSGAFTSDDPEPMW 295
Query: 217 EEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAAS 276
+ + + AF + +DD M + Y N + + IN
Sbjct: 296 TQALDLFKEAFPE--LDDMDM-----EAYLNCIGKFNINQ-------------------- 328
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
V I+ + SF NHDC+PN D +L A + +++GEEL Y++
Sbjct: 329 ------VSGQIFPVYSFINHDCEPNVRYEIDDKLRLKLYARKHIKKGEELLTTYVNPLHG 382
Query: 337 RDARQAILTQGFGFQCNCLRCS 358
R+ L +GF C+C RC+
Sbjct: 383 VKLRRRELRVNWGFLCHCDRCN 404
>gi|401625526|gb|EJS43529.1| set5p [Saccharomyces arboricola H-6]
Length = 526
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 86/374 (22%), Positives = 145/374 (38%), Gaps = 62/374 (16%)
Query: 36 FWKMCLGRYSRCLISR-LQSLHLQKRQLCSTATHNGKPSQPSPPPIQVA-LTESAGRGVF 93
WK+ R+ + L L L+ L H K + S I+V + + GRG+F
Sbjct: 70 MWKLSASRFRKVLNQHHLYDTDLETVSLYKENIHFPK-ALDSNDKIKVKFIDDECGRGLF 128
Query: 94 ATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH----- 140
A + +I KPI+ P L L C C + + +QH H+
Sbjct: 129 AEKDFTKGQIILRENKPIVFVPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDCEV 188
Query: 141 -NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGLKY 179
A +C E CK + D+ +W F +YC+ S GL Y
Sbjct: 189 CKAIWCSEKCKKAHAPLHELLYHSWRSHRIDILHAGNWKRFVNYCQKYCFTAAFSIGLIY 248
Query: 180 PLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEG----FVMLR--SAFKKAGID 233
+ ++ + E + D L S + L G F ++ + + G D
Sbjct: 249 GSM------LLDTTGEVREQWDELASVSQKVRIKLRDASGIGSTFNLMNGTTVHTEDGSD 302
Query: 234 DEQMKFLNKQWYTNVL----AQIRINAF---RIELAGGLYEDLLSSAAASIESEIAVGNA 286
+ K K ++ ++ NAF E+ + ++ + + +E
Sbjct: 303 NSSNKGDEKNIDDEIVWEKCYELFCNAFPKASEEIDMDKFLTMIGTFNINQYNE-----Q 357
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILT 345
IY SF NHDC+PNA+I + D +L A R +++GE++ + Y++ R+ L
Sbjct: 358 IYHWHSFMNHDCEPNAYIEQAEEHDELKLHARRPIKKGEQICVTYVNPLHGVRLRRRELR 417
Query: 346 QGFGFQCNCLRCSS 359
+GF C C RC +
Sbjct: 418 VNWGFLCQCDRCQN 431
>gi|50740296|ref|XP_419420.1| PREDICTED: N-lysine methyltransferase SMYD2 [Gallus gallus]
Length = 436
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 122/308 (39%), Gaps = 67/308 (21%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS 134
P P ++ + GRG+ A RR +L+ + T+S S C C + S
Sbjct: 7 PQPGGLERFASPGKGRGLRALRRYAVGELLFSCPAYTAVLTVSERGSHCDGCFARKEGLS 66
Query: 135 QHFQHHNARFCGEVCKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLAC 188
+ CG CK +AFY +VE ++ DW C + G + P RL
Sbjct: 67 K---------CGR-CK---QAFYCNVECQKEDWPMHKLECAAMCAFGQNWNPSETVRLTA 113
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------ 236
I++ + + P E +LA++E S K +D+E+
Sbjct: 114 RILAKQK-------IHPERTQSEKLLAVKE----FESHLDK--LDNEKRELIQNDIAALH 160
Query: 237 ------MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYML 290
M++ + + AQ+ N F IE ++ LS +G+AI+
Sbjct: 161 HFYSKHMEYPDNAALVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPD 204
Query: 291 PSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+ NH C PN I+ A + A++++E GEE+ YID + R L + F
Sbjct: 205 VALMNHSCCPNV-IVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 263
Query: 351 QCNCLRCS 358
C+C C+
Sbjct: 264 TCDCRECT 271
>gi|72391782|ref|XP_846185.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62358390|gb|AAX78854.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70802721|gb|AAZ12626.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 429
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 12/124 (9%)
Query: 245 YTNVLAQIRINAFRIELAGGLY------EDLLSSAAASIESE--IAVGNAIYMLPSFYNH 296
Y A + A + LA L+ E++LS A +E ++ G A+Y SF NH
Sbjct: 303 YDAATALSEVGALAVTLAQSLHQQHLYMEEILSLFALVQTNEFLLSGGMALYHGYSFLNH 362
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICY---IDASMARDARQAILTQGFGFQCN 353
C+PN + + + RL+ LRD+ EGE+L I Y + ++ + R+A+ Q + F+C
Sbjct: 363 SCEPNCGFVGTNAMNRRLVVLRDIREGEQLLINYNADLTTCVSYEDRRALCKQRY-FECF 421
Query: 354 CLRC 357
C +C
Sbjct: 422 CPKC 425
>gi|325530259|sp|E1C5V0.1|SMYD2_CHICK RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
Length = 436
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 74/308 (24%), Positives = 122/308 (39%), Gaps = 67/308 (21%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS 134
P P ++ + GRG+ A RR +L+ + T+S S C C + S
Sbjct: 7 PQPGGLERFASPGKGRGLRALRRYAVGELLFSCPAYTAVLTVSERGSHCDGCFARKEGLS 66
Query: 135 QHFQHHNARFCGEVCKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLAC 188
+ CG CK +AFY +VE ++ DW C + G + P RL
Sbjct: 67 K---------CGR-CK---QAFYCNVECQKEDWPMHKLECAAMCAFGQNWNPSETVRLTA 113
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------ 236
I++ + + P E +LA++E S K +D+E+
Sbjct: 114 RILAKQK-------IHPERTQSEKLLAVKE----FESHLDK--LDNEKRELIQNDIAALH 160
Query: 237 ------MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYML 290
M++ + + AQ+ N F IE ++ LS +G+AI+
Sbjct: 161 HFYSKHMEYPDNAALVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPD 204
Query: 291 PSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+ NH C PN I+ A + A++++E GEE+ YID + R L + F
Sbjct: 205 VALMNHSCCPNV-IVTYKGTLAEVRAVKEIEPGEEVFTSYIDLLYPTEDRNDRLRDSYFF 263
Query: 351 QCNCLRCS 358
C+C C+
Sbjct: 264 TCDCRECT 271
>gi|294935956|ref|XP_002781574.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239892411|gb|EER13369.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 181
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 42/78 (53%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
+G+ +Y + NH C PNA + + R+ +LR V GEE+ YID ++ RQ
Sbjct: 90 VLGSGLYRGVAVTNHSCSPNAEASFGGSRCLRVKSLRPVHAGEEVFQSYIDENLPLVERQ 149
Query: 342 AILTQGFGFQCNCLRCSS 359
+ L Q +GF C C +C +
Sbjct: 150 SKLRQAYGFACRCGKCRT 167
>gi|347826724|emb|CCD42421.1| similar to SET domain protein, partial sequence [Botryotinia
fuckeliana]
Length = 563
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH CD NA WI+++ + AL+D+ + EE+ I Y+ + R AR+A L + FGF C+
Sbjct: 124 NHACDRNAGTHWIESSQKIAVTALKDISKDEEITINYLGLNKPRRARRASLQKHFGFDCS 183
Query: 354 CLRCS 358
C CS
Sbjct: 184 CGLCS 188
>gi|345560264|gb|EGX43389.1| hypothetical protein AOL_s00215g125 [Arthrobotrys oligospora ATCC
24927]
Length = 484
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAIL 344
IY+ NH C PN + W +N + + A+R + GEE+ I YI SM+ + R+ L
Sbjct: 258 GIYLTICLINHSCIPNTYHNWNENLEQETIHAIRPINAGEEITISYISESMS-NPRRKRL 316
Query: 345 TQGFGFQCNCLRCSS 359
+ FGF C C C+S
Sbjct: 317 QECFGFDCQCQLCTS 331
>gi|354543914|emb|CCE40636.1| hypothetical protein CPAR2_106710 [Candida parapsilosis]
Length = 370
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 47/91 (51%), Gaps = 1/91 (1%)
Query: 269 LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRI 328
+ S+ A E + +G ++Y SF+NH CDPN + + N D R LRD+ GEEL I
Sbjct: 276 IWSNTTAKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRVRN-DMRFETLRDIAIGEELCI 334
Query: 329 CYIDASMARDARQAILTQGFGFQCNCLRCSS 359
Y + ++ + Q + F C C +C S
Sbjct: 335 NYGNFQNEDVEKRQLELQEWFFDCGCTKCQS 365
>gi|428182841|gb|EKX51700.1| hypothetical protein GUITHDRAFT_55948, partial [Guillardia theta
CCMP2712]
Length = 98
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G+ +Y L SF NH C+PNA I+ D + AL ++ G E+ I Y++ D R+
Sbjct: 26 GHGLYPLASFINHSCEPNA-IISFDGNKLVVRALENIPRGTEITIAYVELYAPLDVRRDA 84
Query: 344 LTQGFGFQCNCLRC 357
L GF C C RC
Sbjct: 85 LLSRKGFLCRCSRC 98
>gi|218193604|gb|EEC76031.1| hypothetical protein OsI_13199 [Oryza sativa Indica Group]
Length = 481
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 120/286 (41%), Gaps = 33/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+FA R +++ + +P + P ++ S C C LRK +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG C+ + +E RA + D + + P + RL ++
Sbjct: 76 -----RVAWYCGSACQREEWKLHQLECRAIAALTED---RKKMLTPTI--RLMVRLVLRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ D I + + L+ A+E + +D Q+ + L Q+ +
Sbjct: 126 KLQDDKAIPSSGTDNYNLVDALESHI---------SEVDKNQLVLYAQM---ANLVQLIL 173
Query: 255 NAFRIELA--GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+F ++L + +A + E+ ++G +Y + S NH C PNA ++ +
Sbjct: 174 PSFELDLKEITHTFSKFACNAHTICDPELRSLGTGLYPVLSIINHSCVPNA-VLIFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + AL+ + + EE+ I YI+ + RQ L + + F C C RC
Sbjct: 233 AYVRALQPISKNEEVSISYIETAATTMKRQDDL-KHYYFTCTCPRC 277
>gi|261329776|emb|CBH12758.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 429
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 12/124 (9%)
Query: 245 YTNVLAQIRINAFRIELAGGLY------EDLLSSAAASIESE--IAVGNAIYMLPSFYNH 296
Y A + A + LA L+ E++LS A +E ++ G A+Y SF NH
Sbjct: 303 YDAATALSEVGALAVTLAQSLHQQHLYMEEILSLFALMQTNEFLLSGGMALYHGYSFLNH 362
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICY---IDASMARDARQAILTQGFGFQCN 353
C+PN + + + RL+ LRD+ EGE+L I Y + ++ + R+A+ Q + F+C
Sbjct: 363 SCEPNCGFVGTNAMNRRLVVLRDIREGEQLLINYNADLTTCVSYEDRRALCKQRY-FECF 421
Query: 354 CLRC 357
C +C
Sbjct: 422 CPKC 425
>gi|195485910|ref|XP_002091283.1| GE13567 [Drosophila yakuba]
gi|194177384|gb|EDW90995.1| GE13567 [Drosophila yakuba]
Length = 514
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 74/306 (24%), Positives = 119/306 (38%), Gaps = 57/306 (18%)
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
K +PS P ++A E GR + ATR I+ +++ P++ P + VC CL I
Sbjct: 34 KTHKPSCHPFKIAHNEQLGRHLVATRTIKPYEIVLKEAPLVRGPAQISA-PVCLGCLNGI 92
Query: 131 TSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQ-------GLKYPLL 182
+ H CG +C K+ E +A+ S D R Q G +PL
Sbjct: 93 EAED----HIECEQCGWPLCGPECKSLD--EHKAECSLTKD--RGQKVNVQEFGGPHPLY 144
Query: 183 --VKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFL 240
+ + C++I SPE +E L S E +
Sbjct: 145 TCLSTVRCLLI--------------GETSPEKASKFQE----LESL--------ESTRRG 178
Query: 241 NKQWYTNV--LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLP 291
+ QW ++ + Q F+ + E+ + A +++ E+ + A++
Sbjct: 179 SNQWKADLVSIGQFIPKFFKTQ---KFSEEEIMKAVGALQINGHEVPTTDPSHVAVFYTA 235
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
SF + C PN + N L A R++++ L ICY DA RQ L Q F+
Sbjct: 236 SFTENSCLPNLAKSFNKNGHCILWAPREIKKNAHLSICYSDAVWGTADRQRHLMQTKLFR 295
Query: 352 CNCLRC 357
C C RC
Sbjct: 296 CACERC 301
>gi|402082242|gb|EJT77387.1| hypothetical protein GGTG_07299 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 403
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 30/76 (39%), Positives = 42/76 (55%), Gaps = 2/76 (2%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
A+Y + NHDC P+ + N A+RD++ GEE+ + YID R RQA L
Sbjct: 197 AVYPQAALINHDCRPSTTYR-LSNLTHITTAVRDIQPGEEISLSYIDLMQPRAERQARL- 254
Query: 346 QGFGFQCNCLRCSSGD 361
+G+GF C C +CS D
Sbjct: 255 RGWGFDCRCRQCSLPD 270
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
Query: 69 NGKP-SQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPII 111
+G P + P PP +VA GRG+ ATR IR +++ + +P++
Sbjct: 91 DGPPLTSPEDPPFRVAEVPGKGRGLVATRLIRTGEVVLSVRPVL 134
>gi|453088056|gb|EMF16097.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 558
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 68/309 (22%), Positives = 124/309 (40%), Gaps = 58/309 (18%)
Query: 86 ESAGRGVFATRRIRASDLI-HTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH----- 139
ESAGRG+FA + + D++ +P + L C +CL++ S+ +
Sbjct: 13 ESAGRGLFAKKDFQPGDIVLALNRPFVAELDQDRLRDTCAWCLQRGASAEDRAKATSLGL 72
Query: 140 -------------HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRL 186
R+C + C+ A + E + + R V +L
Sbjct: 73 PIAGIETKACTGCKRVRYCSKTCQTRA---WKREHKYECKVLAPAERPDLPHGVRAVVKL 129
Query: 187 ACMIISGAESAD--CIDILQ--PASLSPELILAME-------EGFVMLR-SAFKKAG--- 231
+ + E D +DI+Q PAS P+ + ++ E F ML A+K AG
Sbjct: 130 LGRLKADPEGKDQLLLDIVQFRPAS-DPKALEDIQRQDAQRFEDFSMLAYGAWKYAGEPM 188
Query: 232 IDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP 291
+ D ++K ++ N+++ ++L+ L + +G A +
Sbjct: 189 LGDMDSNAISKAFFFNIMSNT------LQLSSAL-------------DDTKLGIAFDPIL 229
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
NH C+PN +++ + L A +++ EE+ + Y+D + RQA L Q + F
Sbjct: 230 CSANHSCEPNTAVIF-NQPQLILRAQTKIKKDEEIFMKYVDITNPFSVRQAELRQSYFFG 288
Query: 352 CNCLRCSSG 360
C C +C G
Sbjct: 289 CRCTKCRKG 297
>gi|348517233|ref|XP_003446139.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oreochromis niloticus]
Length = 489
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 105/293 (35%), Gaps = 51/293 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ AT+ A D+I + I S +C+ C R+ + Q A +C
Sbjct: 13 GRGLKATKEFWAGDVIFSEASIAAVVFDSLAERICHSCFRRQEKLQRCGQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD----CIDILQ 204
C+ A E + + SA Y + L+ + + + G+ +D +D L+
Sbjct: 73 CQRAGWA----EHKLECSAIKAYGKVPNENIRLVARIMWRLDKEGSTVSDMQLTTLDELE 128
Query: 205 ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFR 258
P EL + + K+ IDD +++ I N F
Sbjct: 129 DHIADMPEDDLKELKVDIHNFLDYWPHNSKQHTIDD----------ISHIFGVINCNGFS 178
Query: 259 IELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN------------AHIMW 306
+ GL AVG ++ NHDC PN + M+
Sbjct: 179 VSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTMF 223
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
L AL + EGEEL + Y+D + RQ +L + F C C C +
Sbjct: 224 HSQRRIELRALGKIAEGEELTVAYVDFLNLSEERQRLLKTQYFFDCTCEHCKN 276
>gi|294656447|ref|XP_458717.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
gi|218511783|sp|Q6BSV3.2|SET5_DEBHA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|199431481|emb|CAG86859.2| DEHA2D05764p [Debaryomyces hansenii CBS767]
Length = 493
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 287 IYMLPSFYNHDCDPNAHI---MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
IY+ S NH+CDPN ++ + ++ A RD+ GEEL Y++ + RQ
Sbjct: 353 IYLTQSHLNHNCDPNTNVETSTTVRTNGLKVFAARDIRAGEELTTTYVNPAYTVQQRQRE 412
Query: 344 LTQGFGFQCNCLRC 357
L +GF C C +C
Sbjct: 413 LRVNWGFMCGCQKC 426
>gi|396487206|ref|XP_003842584.1| hypothetical protein LEMA_P083440.1 [Leptosphaeria maculans JN3]
gi|312219161|emb|CBX99105.1| hypothetical protein LEMA_P083440.1 [Leptosphaeria maculans JN3]
Length = 346
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 41/79 (51%), Gaps = 4/79 (5%)
Query: 283 VGNAIYMLPSF--YNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDA 339
G I + P NH C PN W + + R++ A R +++GEE + YI + RD
Sbjct: 140 TGTGIGLFPKIARINHSCRPNTSYYWSEKLNKRIVFASRKIKKGEEFFVSYIPLLLTRDE 199
Query: 340 RQAILTQGFGFQCNCLRCS 358
RQ L Q +GF+C C C+
Sbjct: 200 RQRRLRQ-YGFECTCDVCA 217
>gi|358340510|dbj|GAA40653.2| SET and MYND domain-containing protein 5 [Clonorchis sinensis]
Length = 575
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 55/102 (53%), Gaps = 7/102 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRD 319
L +YE + + + +++E G +Y S NH C+PNA + + N ++
Sbjct: 301 LLADIYESMDNHVGSFLDNE---GVGLYEYQSLINHSCEPNAAVHFSGANNCLSVITSEP 357
Query: 320 VEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCLRCS 358
+ EGEE+ ICY+D S +R +R+ IL+ + F C+C +C+
Sbjct: 358 IREGEEVTICYLDECQQSRSRHSRRKILSSNYLFWCDCAKCN 399
>gi|315040696|ref|XP_003169725.1| hypothetical protein MGYG_07892 [Arthroderma gypseum CBS 118893]
gi|311345687|gb|EFR04890.1| hypothetical protein MGYG_07892 [Arthroderma gypseum CBS 118893]
Length = 538
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 4/105 (3%)
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-R 313
NAF E G+ ++ + A + S G +++ + NH C+PNA W +N
Sbjct: 76 NAFEDE--DGVELGIVRTNALPLGSNATEG-GMFLDSARINHSCNPNAQNTWNENLQKLT 132
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ A+RD+ EE+ ICY+ R ARQ L F F C+C C+
Sbjct: 133 IHAIRDIPRDEEITICYLSTIQNRSARQCALQSCFRFVCSCSLCA 177
>gi|449547900|gb|EMD38867.1| hypothetical protein CERSUDRAFT_47730 [Ceriporiopsis subvermispora
B]
Length = 410
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICY--IDAS 334
+E +G A+Y SF+NHDC PNA D R + R+V EG+EL I Y +D
Sbjct: 315 MEESECLGFAVYPRLSFFNHDCAPNARKE-RDGRALRFVTTREVAEGDELCISYGHVDG- 372
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
MA R+ L+ G+ F C C RC
Sbjct: 373 MAWRERRKELSDGWFFDCACGRC 395
>gi|49533764|gb|AAT66763.1| Putative TPR domain containing protein, identical [Solanum
demissum]
Length = 438
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 118/302 (39%), Gaps = 88/302 (29%)
Query: 70 GKPSQPSP--PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCL 127
GKP + + I + ++ +GRG+FAT+ + +L+ K +
Sbjct: 129 GKPPELAEYIGSIDIKKSDISGRGLFATKNLDCGNLLLVTKAVAVE-------------- 174
Query: 128 RKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRAD---WSAFNDYCRSQGLKYPLLVK 184
R I S F D + +A W F D + +K + +
Sbjct: 175 RAIVPES--------------------VFQDSKEQAQLDMWKNFIDKIL-ESIKKCVRTR 213
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
L C + +G E+ D +++ P++ L +G S F ID E++
Sbjct: 214 DLICKLSNG-ENEDQLEV-------PDIDLFRPDG--EDSSTFHDKKIDKEKL------- 256
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-- 302
N+L L E+L+S+ S++ G +++L SF NH CDPN
Sbjct: 257 -LNILD-----------VNSLVEELISAKVLGKNSDVH-GIGLWILSSFINHSCDPNVRR 303
Query: 303 -----HIMWIDNADARLMALRDVEEGEELRICYIDA-SMARDARQAILTQGFGFQCNCLR 356
H+M + A RD++ G+EL Y D + RD + + +GF C C R
Sbjct: 304 SHVGDHVM--------IHACRDIKAGKELTFAYFDVFTPFRDREEK--AKNWGFVCKCKR 353
Query: 357 CS 358
C+
Sbjct: 354 CN 355
>gi|449539909|gb|EMD30911.1| hypothetical protein CERSUDRAFT_163648 [Ceriporiopsis subvermispora
B]
Length = 433
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 6/78 (7%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMA-RDA 339
+Y+L S NH C PNA I +D A ++A RD+ GEEL I Y++ ++
Sbjct: 325 GGLYVLHSHLNHSCAPNASIRHLDQRTALSRITVIAKRDIAPGEELLITYVNPALPLPQR 384
Query: 340 RQAILTQGFGFQCNCLRC 357
R+ ++ GFG +CNC RC
Sbjct: 385 RREVMEWGFG-KCNCERC 401
>gi|393234411|gb|EJD41974.1| hypothetical protein AURDEDRAFT_105699 [Auricularia delicata
TFB-10046 SS5]
Length = 531
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 57/117 (48%), Gaps = 13/117 (11%)
Query: 257 FRIELAGGLYEDLLS-----SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA- 310
F I AG L DL+S S S S +G + + NH CDPNA +++ A
Sbjct: 184 FGINSAGDLV-DLVSRFTTNSFTLSSPSLSPIGVCVSPTVALANHSCDPNAVVVFPSAAA 242
Query: 311 ---DARLM---ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
D LM A++D+ EE+ Y+D S+ R R+ L + + F CNC CSS D
Sbjct: 243 PLGDETLMQIIAIKDIMPEEEVLTSYVDISLPRHLRRRDLKETYNFTCNCSACSSLD 299
>gi|380494289|emb|CCF33265.1| MYND finger [Colletotrichum higginsianum]
Length = 549
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 66/287 (22%), Positives = 106/287 (36%), Gaps = 48/287 (16%)
Query: 87 SAGRGVFATRRIRASDLI-HTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH----HN 141
GR ++ATRR + D+I P + P C C++K + +
Sbjct: 17 KGGRSIYATRRFKPGDVIARFDNPAVVLPPGHRALEYCNHCVKKQRPAGVKLRACTGCKT 76
Query: 142 ARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII------SGAE 195
+CG C+ RA+WS LV +L C I A
Sbjct: 77 VAYCGPACQ-----------RANWS---------------LVHKLECKAIQRLHEAKPAH 110
Query: 196 SADCIDILQPAS----LSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQ 251
D + A+ L P+++ EE + KK G D + L L
Sbjct: 111 QPDWVPTPIRAAAQVMLRPQVLARFEELEGHVEQWRKKDGTDLQ----LQSHGVVKCLGL 166
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNA-IYMLPSFYNHDCDPNAHIMWIDNA 310
+ R+E + + L ++A + E G + + NH C PNA ++
Sbjct: 167 DTVTFERLETSFQVLCKLQTNAFSRTEEYYETGGVFLDTTLAMINHSCVPNA-LVQFGGR 225
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A L A ++ G+E+ I YID + R R L + F+C+C +C
Sbjct: 226 TATLRATSFLDPGDEIEISYIDQTQPRGKRHGELDL-YHFECSCYKC 271
>gi|297739311|emb|CBI28962.3| unnamed protein product [Vitis vinifera]
Length = 464
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA + + A ++ALR + + EE+ I YID + D R
Sbjct: 384 GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIALRPIFKEEEVTISYIDEDLPFDER 443
Query: 341 QAILTQGFGFQCNCLRC 357
QA+L +GF+C C +C
Sbjct: 444 QALLAD-YGFRCKCPKC 459
>gi|389642221|ref|XP_003718743.1| hypothetical protein MGG_00339 [Magnaporthe oryzae 70-15]
gi|351641296|gb|EHA49159.1| hypothetical protein MGG_00339 [Magnaporthe oryzae 70-15]
Length = 456
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 37/73 (50%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
++ S NHDC PN + A RD+ GEEL I Y+D+ +R+ RQ L
Sbjct: 259 GTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERLK 318
Query: 346 QGFGFQCNCLRCS 358
+ +GF C C CS
Sbjct: 319 RNWGFTCTCPHCS 331
>gi|146263138|gb|ABQ15327.1| Mcg1p [Magnaporthe grisea]
Length = 456
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 37/73 (50%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
++ S NHDC PN + A RD+ GEEL I Y+D+ +R+ RQ L
Sbjct: 259 GTFLGVSRLNHDCRPNLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVRQERLK 318
Query: 346 QGFGFQCNCLRCS 358
+ +GF C C CS
Sbjct: 319 RNWGFTCTCPHCS 331
>gi|326427056|gb|EGD72626.1| hypothetical protein PTSG_04361 [Salpingoeca sp. ATCC 50818]
Length = 464
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 103/282 (36%), Gaps = 40/282 (14%)
Query: 86 ESAGRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH----H 140
E +G R+R +D+I +P + + +VC C + + Q
Sbjct: 29 EKSGEEALDGERLRKGTDVIKGKRPFAHVVEKAKMETVCQTCFVQCEQLRRPLQRCAGCK 88
Query: 141 NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLL----VKRLACMIISGAES 196
R+C C+ +ADW C + P++ V LA ++ +
Sbjct: 89 ALRYCSAACQ-----------KADWKDHKPECAALKRISPVVPATFVMFLARILRKMERN 137
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+D+LQ + G + + D E K L K Y VL N+
Sbjct: 138 TGEMDVLQLHMPGEPSDPQQQRGLFAILEHLRHFLPDAE--KHLLKSAYP-VLRITSANS 194
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMA 316
F I G + + VG +Y S+ NH C PN I + AR+ +
Sbjct: 195 FGISGVEG--------------NNLGVG--LYDTVSYINHSCAPNCSITF-SGVYARVRS 237
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ D+ +EL I YID R R+A L F F C C RC
Sbjct: 238 VHDLPPNQELTIAYIDPCDPRAKRRAHLKSQFMFDCECSRCE 279
>gi|171684729|ref|XP_001907306.1| hypothetical protein [Podospora anserina S mat+]
gi|170942325|emb|CAP67977.1| unnamed protein product [Podospora anserina S mat+]
Length = 559
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 114/287 (39%), Gaps = 62/287 (21%)
Query: 89 GRGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH-------- 139
GR + AT+ LI T + P + P ++ + C +CLR S++ F
Sbjct: 45 GRSLVATQDFTPGSLIATFSSPTLALPDGPSMRTTCNYCLR--VGSNEGFSPVSLKACTG 102
Query: 140 -HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
A +CG C+ +A + +A+ F + G ++ + D
Sbjct: 103 CKAAVYCGPTCQ---RAHWKSIHKAECKMFARVRETTGKEWHGI--------------RD 145
Query: 199 CIDILQPASLSPELILAMEEGF----VMLRSAFKKAGIDDEQMKFLNKQWY---TNVLAQ 251
D P + E A E+ + + +A +G+ F+N+ L Q
Sbjct: 146 PFDGSDPLESNLEGFKADEQVWGDFELQAMAAMTYSGV------FMNEDGLRVAMRFLCQ 199
Query: 252 IRINAF-RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
I+ NAF R++ G+ L A A + NH C PNA I + D
Sbjct: 200 IQTNAFNRLDADTGMSGIFLDPALARV-----------------NHSCVPNAFIGF-DKR 241
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A L A R ++EGEE+ I YI R R+ L + + F+C+C RC
Sbjct: 242 TATLRAERPIKEGEEITISYIANDKPRSIRREGL-RLYYFECDCPRC 287
>gi|171679715|ref|XP_001904804.1| hypothetical protein [Podospora anserina S mat+]
gi|170939483|emb|CAP64711.1| unnamed protein product [Podospora anserina S mat+]
Length = 401
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 53/107 (49%), Gaps = 20/107 (18%)
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+R N F IE+AG E +A +++ S NH+C PNA W+ +
Sbjct: 196 LRTNGFGIEIAG--------------EGHLA----LFLEGSRVNHNCRPNAFWRWVPSKM 237
Query: 312 A-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A ++ALR + +GEE+ Y + R+A+L Q +GFQC C C
Sbjct: 238 AMEVVALRGIGKGEEVAHSYAPLGYTHEQRKAVL-QPWGFQCQCALC 283
>gi|290976641|ref|XP_002671048.1| SET domain-containing protein [Naegleria gruberi]
gi|284084613|gb|EFC38304.1| SET domain-containing protein [Naegleria gruberi]
Length = 914
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 125/303 (41%), Gaps = 60/303 (19%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
SP V TE GRG++A+ +++ ++ + KP + TL T N+ C CL K+
Sbjct: 275 SPLHAHVITTEQYGRGMYASSELKSGQVVLSEKPFLC-VTLDTENT-CNNCLNKLKKKIP 332
Query: 136 HFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQ-GLKYPLLVKRLACMIISGA 194
CK+N Y A++ Y R G ++ + MI+SG+
Sbjct: 333 -------------CKNNCGMEYYCSMECRSEAWDKYHRKLCGFD----MRNINKMIVSGS 375
Query: 195 ESAD------------CIDILQPASLSPE-------LILAMEEGFVMLRSAFKKAGIDDE 235
++ + I+ P + I +EE + I ++
Sbjct: 376 TNSAKNHLYAIKFLGMLLQIVYPQGELKQQEIRYVQFIEKLEENNSSFKCLIAWPNILND 435
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
+ K Y L +I N+F+ Y D +S+ +A+ + + + N
Sbjct: 436 ---LIGKLSYHVSLDKIGNNSFK-------YGDAMSNG-------VALSSTLPKTIVYTN 478
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR-QAILTQGFGFQCNC 354
H C PN + +N L+ALR +++GE+L I Y++ S++ R ++IL + F C C
Sbjct: 479 HSCVPNVEYRY-ENGKCSLVALRKIKKGEQLFISYVNPSLSAQKRAESILP--YIFVCEC 535
Query: 355 LRC 357
+C
Sbjct: 536 EKC 538
>gi|357117918|ref|XP_003560708.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like
[Brachypodium distachyon]
Length = 482
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 69/289 (23%), Positives = 117/289 (40%), Gaps = 33/289 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ +A GRG+ TRR ++I +P + P + S C C LRK +
Sbjct: 17 LAIASIPGKGRGLVTTRRFFPGEVILCGEPYASTPNKILVGSNCDHCFTSSNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG VC+ + +E RA SA D + + P + RL +I
Sbjct: 76 -----RVAWYCGSVCQKEEWKLHQLECRA-ISALTD--DRKKMLTPTI--RLMVRLILRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ + I + + L+ A+E + +DD+Q+ + L Q+ +
Sbjct: 126 KLQNEKVIPSTGTDNYGLVDALESHI---------SEVDDKQLVLYAQMAN---LVQLIL 173
Query: 255 NAFRIEL--AGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
A ++L ++ +A + E+ VG ++ S NH C PNA ++ +
Sbjct: 174 PAIELDLKETAHIFSKFSCNAHTICDPELRPVGTGLFPAISTINHSCVPNA-VLLFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
A + AL+ + E+ I YI+ + R L F F C C RC G
Sbjct: 233 AYVRALQPLSNNTEVSISYIETAATTLKRHNDLKHYF-FTCTCPRCIKG 280
>gi|147844297|emb|CAN82112.1| hypothetical protein VITISV_031337 [Vitis vinifera]
Length = 405
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 54/118 (45%), Gaps = 15/118 (12%)
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HI 304
T +LA+ ++NAF GL E A A IY SF+NHDC PNA
Sbjct: 186 TALLAKDKLNAF------GLME---PPALAPGGERSVRAYGIYPKASFFNHDCLPNACRF 236
Query: 305 MWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++D N D + + DV EG E+ + Y + RQ L + +GF C C RC
Sbjct: 237 DYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRC 294
>gi|350290486|gb|EGZ71700.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 488
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 7/71 (9%)
Query: 295 NHDCDPNAHIMWIDNADAR-------LMALRDVEEGEELRICYIDASMARDARQAILTQG 347
NHDC PNA + N+ + A++D+ GEE+ I YI+ +R ARQ +L+
Sbjct: 229 NHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLLSTA 288
Query: 348 FGFQCNCLRCS 358
+GF+C+C CS
Sbjct: 289 WGFECSCELCS 299
>gi|340966658|gb|EGS22165.1| hypothetical protein CTHT_0016820 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 271
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 63/140 (45%), Gaps = 26/140 (18%)
Query: 222 MLRSAFKK--AGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIES 279
MLR A K+ G D M VL IR N F IE+ G E
Sbjct: 40 MLRKAVKQLPPGRRDAFMALARSTGGEPVLDVIRTNGFGIEIQG--------------EQ 85
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA 339
+A I + S NH+C PN M ++ ++ LRD++ GEE+ Y+ +
Sbjct: 86 HLA----IVINGSRINHNCRPNPSNMAME-----VVTLRDIQAGEEVTYSYVPLGYTSEE 136
Query: 340 RQAILTQGFGFQCNCLRCSS 359
RQA+L Q +GF+C C CS+
Sbjct: 137 RQAVL-QAWGFRCRCDLCSA 155
>gi|125560527|gb|EAZ05975.1| hypothetical protein OsI_28216 [Oryza sativa Indica Group]
Length = 392
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 286 AIYMLPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDA 339
A+Y S NHDC PNA H + D N D + AL D+ EG E+ + Y A+
Sbjct: 201 AVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYFAANWQYKD 260
Query: 340 RQAILTQGFGFQCNCLRC 357
RQ L + +GF+C C RC
Sbjct: 261 RQQRLLEDYGFRCECERC 278
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 2/72 (2%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQHFQ 138
++ A GRG+ A R IR ++I T +P++ +P +LS+L S C C R +++++
Sbjct: 6 LRAADLPGRGRGLLAARSIREGEVILTEQPLLLYPASLSSLPSFCSACFRSLSAAASPCP 65
Query: 139 H-HNARFCGEVC 149
A FC C
Sbjct: 66 SCRAAGFCSPSC 77
>gi|225425346|ref|XP_002275154.1| PREDICTED: histone-lysine N-methyltransferase ASHR2-like [Vitis
vinifera]
Length = 405
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 56/119 (47%), Gaps = 17/119 (14%)
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV-GNAIYMLPSFYNHDCDPNA-H 303
T +LA+ ++NAF L+ A + E +V IY SF+NHDC PNA
Sbjct: 186 TALLAKDKLNAF----------GLMEPPALAPGGERSVRAYGIYPKASFFNHDCLPNACR 235
Query: 304 IMWID-----NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++D N D + + DV EG E+ + Y + RQ L + +GF C C RC
Sbjct: 236 FDYVDTASHHNTDITIRLIHDVPEGSEICLSYFPVNETYADRQKRLLEDYGFTCYCDRC 294
>gi|82658264|ref|NP_001032477.1| SET and MYND domain-containing protein 3 [Danio rerio]
gi|81097726|gb|AAI09413.1| SET and MYND domain containing 3 [Danio rerio]
Length = 380
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 103/260 (39%), Gaps = 39/260 (15%)
Query: 84 LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNAR 143
++E G G+ A R I+ ++I++ +P L + C CL++ S S+ Q AR
Sbjct: 10 ISEGKGNGLRALREIKPGEVIYSCEPFAFCVARDFLKTACQSCLKRGESLSRCSQCKTAR 69
Query: 144 FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP-------LLVKRLACMIISGAES 196
+C C+ A W C+ P LV R+ ++S +ES
Sbjct: 70 YCNVQCQKQA-----------WPDHKRECKCLKHLQPRIPTDSVRLVARIIFKLLSQSES 118
Query: 197 --ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD--EQMKFLNKQWYTNVLAQI 252
+ I + S ++ EG L + + ++ + + + ++LA++
Sbjct: 119 DQEELYSIAEHQSHLADMSEEKTEGLKHLCTTLQVYLAEENCDLSRLPSGLDPVSLLARV 178
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
N F I + G +D VG +Y S NHDC PN IM +
Sbjct: 179 TCNCFSI--SDGELQD--------------VGVGLYPSMSLLNHDCQPNC-IMMFEGKRL 221
Query: 313 RLMALRDVEEGEELRICYID 332
L A+R + EEL I Y D
Sbjct: 222 TLRAVRVIRSAEELTISYTD 241
>gi|405959013|gb|EKC25088.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 396
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 55/99 (55%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEG 323
LYE+L + + ++ E G+ +Y L S NH C PNA I + +N ++A + G
Sbjct: 280 LYEELEKVSGSFLDCE---GSGLYSLQSTCNHSCLPNAEITFPYNNNVMAVVAKEKIPSG 336
Query: 324 EELRICYI---DASMARDARQAILTQGFGFQCNCLRCSS 359
+E+ I Y+ D S +R +RQ IL + + F C+C +C S
Sbjct: 337 QEICISYLSECDLSRSRHSRQNILKENYLFTCDCPKCLS 375
>gi|393212472|gb|EJC97972.1| SET domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 444
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 67/314 (21%), Positives = 123/314 (39%), Gaps = 51/314 (16%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPT-----LSTLNSVCYFCLRKITSSSQHF------ 137
G+G+ AT+ I+ + + P + P+ L + C+ C SS+
Sbjct: 90 GKGLEATQEIKEGEDVWKEDPFVISPSWDIYDLQIASHACFHCTTPFNSSTLPHSLPCPS 149
Query: 138 -------QHHN--ARFCGEVCKDNAKA-----FYDVERRADWSAFNDYCRSQGLKYPLLV 183
Q+H ARFC +C A + A N + RS+G V
Sbjct: 150 STNPALSQNHPCPARFCNRLCITRAMSRQHPLLCPATNPASVPLLN-FARSRGWLAAHAV 208
Query: 184 KRLACMIISGAESADCIDILQPASLSPELI-LAMEEGFVMLRSAFKKAGIDDEQMKFLNK 242
+ ++ E D+ + L ++M+E + ++ ++ G++ + +
Sbjct: 209 AQCIARVLMAFEQGRKQDLEEDLRFIRSLAQMSMDERWKIIETS----GMEPDHQTWKTA 264
Query: 243 QWYT-------------NVLAQIRINAFRIELAGGLYE-DLLSSAAASIESEIAVGNAIY 288
T +L++I +L L++ D A + + + +Y
Sbjct: 265 HTLTLQAFHEPSNPRDKKLLSKIIRKPLPEDLVKTLFDYDAFLHALSRMGLNLEAHGGLY 324
Query: 289 MLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDA-RQAI 343
+L S NH C+PN + D A + A RD++ GEEL + Y+D ++ A R+A+
Sbjct: 325 ILHSHMNHSCEPNISVRHFDQRTALSRITMRARRDIQPGEELTVAYVDPALLLSARRRAL 384
Query: 344 LTQGFGFQCNCLRC 357
+ FG C C RC
Sbjct: 385 IPWAFG-TCMCDRC 397
>gi|336469834|gb|EGO57996.1| hypothetical protein NEUTE1DRAFT_110161 [Neurospora tetrasperma
FGSC 2508]
Length = 449
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 41/71 (57%), Gaps = 7/71 (9%)
Query: 295 NHDCDPNAHIMWIDNADAR-------LMALRDVEEGEELRICYIDASMARDARQAILTQG 347
NHDC PNA + N+ + A++D+ GEE+ I YI+ +R ARQ +L+
Sbjct: 229 NHDCRPNADYRFDWNSSKGGPGLVQVITAVKDILPGEEITISYINPLRSRKARQKLLSTA 288
Query: 348 FGFQCNCLRCS 358
+GF+C+C CS
Sbjct: 289 WGFECSCELCS 299
>gi|384485789|gb|EIE77969.1| hypothetical protein RO3G_02673 [Rhizopus delemar RA 99-880]
Length = 459
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 114/271 (42%), Gaps = 37/271 (13%)
Query: 90 RGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHN-ARFCGEV 148
RG+ R I A ++I+ KPI++ S C CL+++ + FC +
Sbjct: 189 RGLVVNRDIAADEIIYVEKPIVSGLFPGLEGSHCNLCLKRLNDVKIECPDCDVVAFCSDE 248
Query: 149 CKDNAKAFY--------DVERRADWSAFNDYCRSQGLKYPLLVKR-LACMIISGAESA-- 197
C ++AK Y E+ D F++ + KYP ++ R L+ M++ +
Sbjct: 249 CLNHAKEEYHQYICPQNKKEKNVDALEFHEALKKNNTKYPSMIARFLSAMVVEELNKSKE 308
Query: 198 ------------DCIDILQPASLSPELILAMEEGFVMLRSAF--KKAGIDDEQMKFLNKQ 243
D +D + +S A E ML+ K GI++ FL+K+
Sbjct: 309 QQKVGEASFGAWDHVDRFRYLEISENEETAAE--IEMLKKVLGPKVQGINE----FLSKE 362
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSA-AASIESEIA--VGNAIYMLPSFYNHDCD- 299
Y + ++ NA+ I E S A +++ A +G A+Y + ++ D
Sbjct: 363 IYLMLKGKLLFNAYAIPTNSEDIEVEESKEHARKLDNSEAKHIGAALYKISTYIGQSEDG 422
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICY 330
PN I + +N + ALR+++E EEL Y
Sbjct: 423 PNVKIEF-ENDAISVRALREIKEDEELVAAY 452
>gi|146097546|ref|XP_001468133.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134072500|emb|CAM71213.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 1004
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-----------DNADARLMALRDV 320
+A AS+ + A++ NH C PNA ++ D A L+ALR +
Sbjct: 611 AACASLSANDVSVKALFPFLRHLNHACVPNAILVLDRTPTHLCRSGDDGVVASLVALRTI 670
Query: 321 EEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E GEE+ + Y+ A+ A Q L++ GFQC CL C+
Sbjct: 671 ESGEEITVSYVPATTALTVSQKELSETLGFQCRCLLCT 708
>gi|40253403|dbj|BAD05333.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|42761302|dbj|BAD11545.1| SET-domain transcriptional regulator-like protein [Oryza sativa
Japonica Group]
gi|125602534|gb|EAZ41859.1| hypothetical protein OsJ_26404 [Oryza sativa Japonica Group]
Length = 392
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 286 AIYMLPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDASMARDA 339
A+Y S NHDC PNA H + D N D + AL D+ EG E+ + Y A+
Sbjct: 201 AVYPRASLLNHDCLPNACHFDYADRPGPGNTDIVVRALHDITEGREVCLSYFAANWQYKD 260
Query: 340 RQAILTQGFGFQCNCLRC 357
RQ L + +GF+C C RC
Sbjct: 261 RQQRLLEDYGFRCECERC 278
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 2/63 (3%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHP-TLSTLNSVCYFCLRKITSSSQHFQH-HNARFCG 146
GRG+ A R IR ++I T +P++ +P +L++L S C C R +++++ A FC
Sbjct: 15 GRGLLAARSIREGEVILTEQPLLLYPASLASLPSFCSACFRSLSAAASPCPSCRAAGFCS 74
Query: 147 EVC 149
C
Sbjct: 75 PSC 77
>gi|42565094|ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
gi|75251251|sp|Q5PP37.1|ATXR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR2; AltName:
Full=Protein SET DOMAIN GROUP 36; AltName:
Full=Trithorax-related protein 2; Short=TRX-related
protein 2
gi|56236050|gb|AAV84481.1| At3g21820 [Arabidopsis thaliana]
gi|59958344|gb|AAX12882.1| At3g21820 [Arabidopsis thaliana]
gi|62320769|dbj|BAD95436.1| hypothetical protein [Arabidopsis thaliana]
gi|332643034|gb|AEE76555.1| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
Length = 473
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 63/154 (40%), Gaps = 24/154 (15%)
Query: 227 FKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG-----LYEDLLSSAAASIESEI 281
K A D E + + Y N++ +N + +A LY D L A EI
Sbjct: 314 LKIAIFDKECEALFSLEIYGNIIGMFELNNLDLVVASPVEDYFLYIDDLPDAEKEETEEI 373
Query: 282 A---------------VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEG 323
G A + L S NH C PNA + + A ++ALR + +
Sbjct: 374 TRPFLDALGDEYSDCCQGTAFFPLQSCMNHSCCPNAKAFKREEDRDGQAVIIALRRISKN 433
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
EE+ I YID + RQA+L +GF C C +C
Sbjct: 434 EEVTISYIDEELPYKERQALLAD-YGFSCKCSKC 466
>gi|412985398|emb|CCO18844.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 3/80 (3%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDA 339
+G A++ L S NHDCDPN H + N L+A + + +GEEL I Y++ +
Sbjct: 436 LGTALFALQSNCNHDCDPNCHPFKDETDINGSCVLVARKPIRKGEELTISYLEDDQLDWS 495
Query: 340 RQAILTQGFGFQCNCLRCSS 359
R+ +GF C C RC S
Sbjct: 496 RRQDALSDYGFVCRCARCES 515
>gi|71420095|ref|XP_811365.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876023|gb|EAN89514.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 429
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 112/292 (38%), Gaps = 52/292 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS------QHFQHHNA 142
GRG++A RI +S + +P + + C CL I S Q
Sbjct: 163 GRGIYALDRINSSTPVMLDQPFLVQ---RMRDDACAHCLATIGRSGASAGGVQCAHCDRE 219
Query: 143 RFCGEVCKDNAKAFYDV----ERRADWSAFNDYCRSQGLKYPLLVKR--LACMIISGAES 196
+C C+D A Y V R ++ + R + L + R LAC+ ++
Sbjct: 220 TYCSVACRDAAWREYHVCACVSRNEMYAFWEGSMRERLLSDKMEESRAALACLAVAKLCV 279
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINA 256
+ + P +L P + + Y A + A
Sbjct: 280 LSTVQQMHPLAL-PRIC------------------------SLRGRADYDASTALSGVGA 314
Query: 257 FRIELAGGL------YEDLLSSAAASIESEIAV--GNAIYMLPSFYNHDCDPNAHIMWID 308
+ LA L E+LLS A +E + G A+Y SF NH C+PN ++
Sbjct: 315 LAVTLATALRQTHLYMEELLSLFAIVQTNEFLLPSGMALYHGYSFLNHSCEPNCALLGSG 374
Query: 309 NADARLMALRDVEEGEELRICYIDASMARD---ARQAILTQGFGFQCNCLRC 357
+ RL+ LRDV EGE+L I Y +AS+ A + L Q F+C C +C
Sbjct: 375 ATNRRLVTLRDVREGEQLFIDY-NASLTTRVSYADRRALCQQRHFECFCPKC 425
>gi|255085128|ref|XP_002504995.1| predicted protein [Micromonas sp. RCC299]
gi|226520264|gb|ACO66253.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 102/270 (37%), Gaps = 46/270 (17%)
Query: 123 CYFCLRKITSSSQHFQHHNAR---FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKY 179
C FCLR + F+ + +C CKD A+ Y + +AF ++ + +
Sbjct: 11 CSFCLRFLEKDPPEFRCDGCKLECYCSTACKDEARRLYH-SSDGECAAFEAGIEAEAIDF 69
Query: 180 PL--LVKRLACMIISGA-------------ESADCIDILQP-------------ASLSPE 211
+ +R ++S A D + +LQ A +S
Sbjct: 70 EFGDVPERFLLRVLSQAGGWRTPGGPMVVGAGVDRVRMLQAHVPAPDTEERGRLAGISRN 129
Query: 212 LILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLS 271
+ M+E +G + + T +L + NA LY + S
Sbjct: 130 TLRLMDESVEDGVRVVDPSGDEPATTSRYGEAELTQLLCSVNCNAHT------LYANDRS 183
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI 331
S + VG A+Y+ S +NH C P+A + + + +LR V+ GEE+ + Y+
Sbjct: 184 SL-------VPVGIAVYLQGSAFNHSCVPSAEFCNVGTS-LTVRSLRRVQAGEEITVSYV 235
Query: 332 DASMARDARQAILTQGFGFQCNCLRCSSGD 361
+M R+ L + F C C RC + D
Sbjct: 236 PTTMNLKERRRCLEGQYKFVCACARCVAED 265
>gi|198470120|ref|XP_002133373.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
gi|198145302|gb|EDY72001.1| GA22855 [Drosophila pseudoobscura pseudoobscura]
Length = 844
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/355 (21%), Positives = 125/355 (35%), Gaps = 35/355 (9%)
Query: 11 RLAQYDFIGHNFVILFIPFLPQVCYFWKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNG 70
R +Q D +GH+ I Q W + + L QL +
Sbjct: 370 RPSQSDIVGHHRASSGIAEQRQRTTTWTNRMNTTATMLAHMAPCRDTTPEQLAQLIDMHL 429
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRK 129
+ P +A + AGRGVFATR I A +LI + ++ PT S C C R
Sbjct: 430 GELREKKPNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRL 489
Query: 130 ITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACM 189
+ + + C + + E W + + + PL ++ L +
Sbjct: 490 LPTQGFLCKQRCTLPVCAACSGSVTHRAECEHFRRWQPKDADAEEEQVN-PLSLRILTAV 548
Query: 190 II--SGAESADCIDILQPASLSPELILAMEEGF----VMLRSAFKKAGIDDEQMKFLNKQ 243
+ G E +D +Q + E GF + F+ D+ F+++
Sbjct: 549 RVFHLGKEQRHLVDAMQANA---------ERGFRQEIIKAAQCFRNFPTTDK--PFMDQL 597
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ ++ + NAF E LL ++ L + NH+C PNA
Sbjct: 598 F--RIVGVLNTNAFEAPCRTDGRETLL--------------RGLFPLTAIMNHECTPNAS 641
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + A + A RD+ +G E+ Y R L F C+C RC+
Sbjct: 642 HYFDNGRLAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCN 696
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 56/290 (19%), Positives = 99/290 (34%), Gaps = 34/290 (11%)
Query: 74 QPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL--STLNSVCYFCLRKIT 131
+P P +V+ + +GRG+FATR I + + I+ PT S+ C C R +
Sbjct: 47 RPKEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVP 106
Query: 132 SSSQH----FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
+ CGE C + + + + W P + +R+
Sbjct: 107 GQTDAEALCLAGCGLPVCGE-CGKSPRHKTECDLFLKWK-------------PKVTERID 152
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+ C + + + A + + M A +D + ++
Sbjct: 153 PRSLRILSVVRCFFLDEQQRRLLYAMQANADRYYMKEVERAAACFEDFPREQEMLDFFYR 212
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+ NAF + + +E + A++ L NH C PNA +
Sbjct: 213 TICAFNTNAF--------------ESRSCVEGHEVLIRALFPLAGLLNHQCTPNAGHHFQ 258
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
D + A + G E+ + Y + AR+ L GF C C RC
Sbjct: 259 DGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRC 308
>gi|108710651|gb|ABF98446.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 352
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 118/286 (41%), Gaps = 33/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+FA R +++ + +P + P ++ S C C LRK +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG C+ + +E RA + D + + P + RL ++
Sbjct: 76 -----RVAWYCGSACQREEWKLHQLECRAIAALTED---RKKMLTPTI--RLMVRLVLRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ D I + + L+ A+E + +D Q+ + L Q+ +
Sbjct: 126 KLQDDKAIPSSGTDNYNLVDALESHI---------SEVDKNQLVLYAQMAN---LVQLIL 173
Query: 255 NAFRIELA--GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+F ++L + +A + E+ +G +Y + S NH C PNA ++ +
Sbjct: 174 PSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNA-VLIFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + AL+ + + EE+ I YI+ + RQ L + F C C RC
Sbjct: 233 AYVRALQPISKNEEVSISYIETAATTMKRQDDLKH-YYFTCTCPRC 277
>gi|398021427|ref|XP_003863876.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502110|emb|CBZ37193.1| hypothetical protein, conserved [Leishmania donovani]
Length = 1004
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 48/98 (48%), Gaps = 11/98 (11%)
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-----------DNADARLMALRDV 320
+A AS+ + A++ NH C PNA ++ D A L+ALR +
Sbjct: 611 AACASLSANDVSVKALFPFLRHLNHACVPNAILVLDRTPTHLCRSGDDGVVASLVALRTI 670
Query: 321 EEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E GEE+ + Y+ A+ A Q L++ GFQC CL C+
Sbjct: 671 ESGEEITVSYVPATTALTVSQKELSETLGFQCRCLLCT 708
>gi|195058496|ref|XP_001995453.1| GH17754 [Drosophila grimshawi]
gi|193896239|gb|EDV95105.1| GH17754 [Drosophila grimshawi]
Length = 499
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/291 (21%), Positives = 115/291 (39%), Gaps = 30/291 (10%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKVKDPAWEIGISKIAGRGVMATRNLKRGEIIFQDMPLLIGLAAQEEDSLNACSICLKL 76
Query: 130 ITSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
+ + F CG +C K + + D F + ++ ++ RL C
Sbjct: 77 LPDT--RFMCRQG--CGLPICSLCGKKR---QHKTDCDMFKSWGPNEPDVANSVIIRLLC 129
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+ + + D D++ L L FK D + ++ +N+
Sbjct: 130 VARAINLTKDQRDLIY--CLQANLDNNHRTEVRNAAKCFKNFPTDKKLVEIMNR-----T 182
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
+A +R N F + +++ A+Y L NHDC PN++ + +
Sbjct: 183 VAVLRTNGFD------------KTTDRCTDNQEFSYRALYPLFGVMNHDCIPNSYYTFEE 230
Query: 309 NADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ ++ A D+ EGEE+ Y AR L GF C C RCS
Sbjct: 231 KTNNMIVRAAVDILEGEEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCS 281
>gi|194769376|ref|XP_001966780.1| GF19100 [Drosophila ananassae]
gi|190618301|gb|EDV33825.1| GF19100 [Drosophila ananassae]
Length = 499
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 117/291 (40%), Gaps = 30/291 (10%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +++ C CL++
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVMATRSLKRGEIIFKDSPLLIGLAAHEEDALNACTVCLKE 76
Query: 130 ITSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
+ + + CG VC AK + + D F + ++ ++ RL C
Sbjct: 77 LPDTKFMCRQG----CGLPVCPLCAKK---KQHKTDCDMFKSWGPNEPEVANSVIIRLLC 129
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+ + + D D++ L L FK D + ++ +N+
Sbjct: 130 IARAINLTKDQRDLIY--CLQANLDNNHRTEVRNAAKCFKNFPTDKKLIEIMNR-----T 182
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
+A +R N F + + +++ A+Y L NHDC PN++ + +
Sbjct: 183 VAVMRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVMNHDCIPNSYYTFDE 230
Query: 309 NADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ ++ A D+ EG E+ Y AR L GF C C RCS
Sbjct: 231 KTNHMVVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKGFTCKCPRCS 281
>gi|409076546|gb|EKM76917.1| hypothetical protein AGABI1DRAFT_102088 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 643
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Query: 283 VGNAIYMLPSFYNHDCDPNA--------HIMWIDNADARLMALRDVEEGEELRICYIDAS 334
+G +I + + +NH C PNA H + A ++ LRD+ EE+ I YID +
Sbjct: 266 IGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTT 325
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
+ + RQ L + + F CNC C
Sbjct: 326 LTKRERQKALKETYYFTCNCSLC 348
>gi|327305077|ref|XP_003237230.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
gi|326460228|gb|EGD85681.1| hypothetical protein TERG_01951 [Trichophyton rubrum CBS 118892]
Length = 498
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 68/298 (22%), Positives = 114/298 (38%), Gaps = 58/298 (19%)
Query: 88 AGRGVFATRRIRASDLIHTAKPIITHPTLST--LNSVCYFCLRKITSSSQHFQHHN---- 141
AGRG+FAT I+ + + P LST L C C + + +
Sbjct: 19 AGRGLFATSTIKMGKDVFSM-PATFSTVLSTERLKDACSNCFANLPFGVNIMANVDMKLW 77
Query: 142 -------ARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYPLLV---KRLACMI 190
++C C+ A+W + + C YP ++ R I
Sbjct: 78 ACSGCKVVKYCDRKCQS-----------ANWKLIHKHECAIYKKLYPKILPVNSRAVLRI 126
Query: 191 ISGAES------ADCIDILQPASLSPELILAMEEGF--VMLRSAFKKAGIDDEQMKFLNK 242
+ S +D I L S E+ + +E + +ML + +K E L+
Sbjct: 127 VKLRNSNEDHVKSDLIMFLSLRSHLDEITKSNQEQYERIMLSAKAEKEYSGSE----LDV 182
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
+ LA+I +N+F A G +G I + NH C+PNA
Sbjct: 183 ETIAEYLARIEVNSFTFTTAFG----------------DPLGLCIQPFACYMNHSCEPNA 226
Query: 303 HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
++ D + ALR+++ E++ I YID + + RQ L + + F C C +C+ G
Sbjct: 227 -VVGFDGGLITVKALREIKSDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCAQG 283
>gi|195168896|ref|XP_002025266.1| GL13396 [Drosophila persimilis]
gi|194108722|gb|EDW30765.1| GL13396 [Drosophila persimilis]
Length = 787
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 64/288 (22%), Positives = 107/288 (37%), Gaps = 35/288 (12%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLN-SVCYFCLRKITSSSQH 136
P +A + AGRGVFATR I A +LI + ++ PT S C C R + +
Sbjct: 380 PNWTLAASTVAGRGVFATRDIVAGELIFRERALVVGPTARKGQLSTCVCCHRLLPTQGFL 439
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS--GA 194
+ C + + E W + + + PL ++ L + + G
Sbjct: 440 CKQRCTLPVCAACSGSVTHRAECEHFRRWQPKDADAEEEQVN-PLSLRILTAVRVFHLGK 498
Query: 195 ESADCIDILQPASLSPELILAMEEGF----VMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
E +D +Q + E GF + F+ D+ F+++ + ++
Sbjct: 499 EQRHLVDAMQANA---------ERGFRQEIIKAAQCFRNFPTTDK--PFMDQLF--RIVG 545
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
+ NAF E LL ++ L + NH+C PNA + +
Sbjct: 546 VLNTNAFEAPCRTDGRETLL--------------RGLFPLTAIMNHECTPNASHYFDNGR 591
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A + A RD+ +G E+ Y R L F C+C RC+
Sbjct: 592 LAIVRAARDIPKGGEITTTYTKILWGNLTRGIFLKMTKHFMCDCTRCN 639
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 56/290 (19%), Positives = 99/290 (34%), Gaps = 34/290 (11%)
Query: 74 QPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL--STLNSVCYFCLRKIT 131
+P P +V+ + +GRG+FATR I + + I+ PT S+ C C R +
Sbjct: 47 RPKEPAWRVSDSPISGRGIFATRDIAKGEELFREHTILVGPTAHRSSNLRTCTLCYRLVP 106
Query: 132 SSSQH----FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
+ CGE C + + + + W P + +R+
Sbjct: 107 GQTDAEALCLAGCGLPVCGE-CGKSPRHKTECDLFLKWK-------------PKVTERID 152
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+ C + + + A + + M A +D + ++
Sbjct: 153 PRSLRILSVVRCFFLDEQQRRLLYAMQANADRYYMKEVERAAACFEDFPREQEMLDFFYR 212
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+ NAF + + +E + A++ L NH C PNA +
Sbjct: 213 TICAFNTNAF--------------ESRSCVEGHEVLIRALFPLAGLLNHQCTPNAGHHFQ 258
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
D + A + G E+ + Y + AR+ L GF C C RC
Sbjct: 259 DGETIIVCATERIACGAEITVSYAKMLWSTLARKIFLGMTKGFMCQCPRC 308
>gi|19922236|ref|NP_610944.1| CG8503 [Drosophila melanogaster]
gi|7303209|gb|AAF58272.1| CG8503 [Drosophila melanogaster]
gi|15291193|gb|AAK92865.1| GH11294p [Drosophila melanogaster]
gi|220945042|gb|ACL85064.1| CG8503-PA [synthetic construct]
gi|220954946|gb|ACL90016.1| CG8503-PA [synthetic construct]
Length = 513
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 110/293 (37%), Gaps = 31/293 (10%)
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
K +PS P ++A E GR + ATR I+ +++ P++ P + VC CL I
Sbjct: 34 KVHKPSCHPFKIAHNEQLGRHLVATRTIKPYEIVLKEAPLVRGPAQISA-PVCLGCLNGI 92
Query: 131 TSSSQ-HFQHHNARFCGEVCK--DNAKAFYDVER-RADWSAFNDYCRSQGLKYPLL--VK 184
+ + CG CK D KA + + R ++ G +PL +
Sbjct: 93 EAEDHIECEQCGWPLCGPECKSLDEHKAECGLTKDRGQKVNVQEF----GGPHPLYTCLS 148
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
+ C++I G S + Q E + + G ++ I KF Q
Sbjct: 149 TVRCLLI-GETSTEKASKFQDL----ESLESTRRGSNQWKADL--VSIGQFIPKFFKTQK 201
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+T + A +I + + A++ SF + C PN
Sbjct: 202 FTEEEIMKAVGALQIN-------------GHEVPTTDPSHVAVFYTASFTENSCLPNLAK 248
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ N L A R++++ L ICY DA RQ L Q F+C C RC
Sbjct: 249 SFNKNGHCILWAPREIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACERC 301
>gi|328853242|gb|EGG02382.1| hypothetical protein MELLADRAFT_110239 [Melampsora larici-populina
98AG31]
Length = 561
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 42/78 (53%), Gaps = 4/78 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMA--LRDVEEGEELRICYIDASMARDARQAI 343
A+ PS +NH+C PN+ + D+ R+ A +RD+ GEE+ I Y +R RQA
Sbjct: 360 AVVPEPSVFNHECRPNS-AFYFDDKTLRVYAHAVRDIALGEEITIAYRAMKASRAGRQAA 418
Query: 344 LTQGFGFQCNCLRCSSGD 361
+ +GF C C CS D
Sbjct: 419 IAH-YGFSCTCSHCSMSD 435
>gi|440800514|gb|ELR21550.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 333
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G ++ + + NH C+PN + + N +A ++AL D+ EGEEL YI+ RQA
Sbjct: 254 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEGEELTHSYIENDRPLAERQAD 313
Query: 344 LTQGFGFQCNCLRC 357
L + + F C C+RC
Sbjct: 314 LLE-YNFVCQCVRC 326
>gi|225447338|ref|XP_002274324.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Vitis
vinifera]
Length = 495
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA + + A ++ALR + + EE+ I YID + D R
Sbjct: 415 GTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIALRPIFKEEEVTISYIDEDLPFDER 474
Query: 341 QAILTQGFGFQCNCLRC 357
QA+L +GF+C C +C
Sbjct: 475 QALLAD-YGFRCKCPKC 490
>gi|449548889|gb|EMD39855.1| hypothetical protein CERSUDRAFT_122028 [Ceriporiopsis subvermispora
B]
Length = 454
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 4/68 (5%)
Query: 294 YNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+NH C PNA + +I +++ALR+++EGEE+ I Y+D +++ AR+ L +G
Sbjct: 208 FNHSCVPNAVVKYIIRPSEPVCMQVVALREIQEGEEIVIPYLDPALSYAARRDALQTNYG 267
Query: 350 FQCNCLRC 357
F C+C C
Sbjct: 268 FICSCALC 275
>gi|303275318|ref|XP_003056955.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461307|gb|EEH58600.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 795
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNA---HIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
G ++ + NH C PNA + W A R+ A RDV GEE+ + Y+D + DAR
Sbjct: 697 GTVLFPMACLCNHSCAPNATPRYRSWKGAAAVRVQATRDVLAGEEITMSYVDETAGVDAR 756
Query: 341 QAILTQGFGFQCNCLRC 357
L +GF C C +C
Sbjct: 757 ADALAS-YGFTCECEKC 772
>gi|403351926|gb|EJY75465.1| SET and MYND domain containing 3 [Oxytricha trifallax]
Length = 517
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 281 IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMA-RDA 339
+ +G +Y + NH CDPN +++ ++ R +E EE+ ICYID ++ R
Sbjct: 254 LRIGTGLYYPSNLLNHSCDPNCMVLFRGQTQF-IVTCRPIEADEEITICYIDNGISERII 312
Query: 340 RQAILTQGFGFQCNCLRC 357
RQ L + + F C C RC
Sbjct: 313 RQQYLQEQYHFNCMCARC 330
>gi|343473725|emb|CCD14464.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 512
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYID--ASMARD 338
+ G A+Y + NH C P+ + A A +MALRDV GEE+R Y+D A +
Sbjct: 431 STGVALYDAATKINHSCVPSVRFVPSHGAVKAIVMALRDVSPGEEVRTSYLDVGAYPTNE 490
Query: 339 ARQAILTQGFGFQCNCLRCSSG 360
AR+ L +GF C+C C+ G
Sbjct: 491 ARRNFLLHNYGFSCDCPLCNGG 512
>gi|41469441|gb|AAS07242.1| putative MYND finger protein [Oryza sativa Japonica Group]
gi|108710650|gb|ABF98445.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 480
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 119/286 (41%), Gaps = 33/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+FA R +++ + +P + P ++ S C C LRK +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG C+ + +E RA + D + + P + RL ++
Sbjct: 76 -----RVAWYCGSACQREEWKLHQLECRAIAALTED---RKKMLTPTI--RLMVRLVLRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ D I + + L+ A+E + +D Q+ + L Q+ +
Sbjct: 126 KLQDDKAIPSSGTDNYNLVDALESHI---------SEVDKNQLVLYAQM---ANLVQLIL 173
Query: 255 NAFRIELA--GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+F ++L + +A + E+ +G +Y + S NH C PNA ++ +
Sbjct: 174 PSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNA-VLIFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + AL+ + + EE+ I YI+ + RQ L + + F C C RC
Sbjct: 233 AYVRALQPISKNEEVSISYIETAATTMKRQDDL-KHYYFTCTCPRC 277
>gi|346324090|gb|EGX93687.1| SET domain protein [Cordyceps militaris CM01]
Length = 404
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/87 (40%), Positives = 44/87 (50%), Gaps = 3/87 (3%)
Query: 274 AASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR--LMALRDVEEGEELRICYI 331
A SIE + AI+ + NHDC PNA + D A + AL D+ G EL I YI
Sbjct: 199 AFSIEIKDIEHYAIFPEIARLNHDCRPNA-AYFFDEATLTHYVHALTDIHPGTELTITYI 257
Query: 332 DASMARDARQAILTQGFGFQCNCLRCS 358
D M+R R L +GF C+C CS
Sbjct: 258 DPQMSRKERMETLADTWGFTCSCNSCS 284
>gi|108710649|gb|ABF98444.1| MYND finger family protein, expressed [Oryza sativa Japonica Group]
Length = 481
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 65/286 (22%), Positives = 119/286 (41%), Gaps = 33/286 (11%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+FA R +++ + +P + P ++ S C C LRK +
Sbjct: 17 LAVASVPGKGRGLFAARSFFPGEVVISQEPYASTPNKISVGSNCDNCFASRNLRKCSVC- 75
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
A +CG C+ + +E RA + D + + P + RL ++
Sbjct: 76 -----RVAWYCGSACQREEWKLHQLECRAIAALTED---RKKMLTPTI--RLMVRLVLRR 125
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ D I + + L+ A+E + +D Q+ + L Q+ +
Sbjct: 126 KLQDDKAIPSSGTDNYNLVDALESHI---------SEVDKNQLVLYAQM---ANLVQLIL 173
Query: 255 NAFRIELA--GGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD 311
+F ++L + +A + E+ +G +Y + S NH C PNA ++ +
Sbjct: 174 PSFELDLKEITHTFSKFACNAHTICDPELRPLGTGLYPVLSIINHSCVPNA-VLIFEGRT 232
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A + AL+ + + EE+ I YI+ + RQ L + + F C C RC
Sbjct: 233 AYVRALQPISKNEEVSISYIETAATTMKRQDDL-KHYYFTCTCPRC 277
>gi|156047862|ref|XP_001589898.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980]
gi|154693059|gb|EDN92797.1| hypothetical protein SS1G_08661 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 545
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 5/77 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWI---DNADARLMALRDVEEGEELRICYIDASMARDAR 340
G AI L S +NH CDPN + W +N+ + A R++++GEE+ I YI + R
Sbjct: 457 GTAINPLYSMFNHSCDPN--VDWRHDHENSTVTMFAERNIKKGEEMFISYIGKGDSLKDR 514
Query: 341 QAILTQGFGFQCNCLRC 357
Q L FG +C C +C
Sbjct: 515 QRKLMPWFGMECACHKC 531
>gi|348670159|gb|EGZ09981.1| hypothetical protein PHYSODRAFT_523060 [Phytophthora sojae]
Length = 421
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 120/289 (41%), Gaps = 63/289 (21%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GR V + A DL A + T + +S C+ C T S+ + + A +C +
Sbjct: 18 GRRVETETALAAGDLALRAPALATVLLPALWDSHCHKCFASGTRLSRCGRCNTAFYCSKA 77
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCI-------- 200
C+ +ADW + K ++ +LA + + ++AD +
Sbjct: 78 CQ-----------QADW-------KPDHRKECKVLAQLAQLGLRNDQTADVLLLGRVLRR 119
Query: 201 ---DILQPASLSPELIL---AMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN----VLA 250
+ LQP EL+ ME+ +ML +A ++++ ++ + + +L+
Sbjct: 120 EDAEGLQP----KELVWYEEDMEDQELMLLAALA------QKLELVDGSYSMDEMLRMLS 169
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
+ R N F I ++LL + G + L + NH CDPN I ++
Sbjct: 170 RFRNNNFSI------CDELL----------LEQGAGCFPLGAMINHSCDPNCAITFVPKT 213
Query: 311 -DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ A+R ++ GEE+ Y+D ++ R R L + + F C C RCS
Sbjct: 214 LEMEFRAMRPIKAGEEITQTYVDVALPRRERHERLQRKYHFNCACSRCS 262
>gi|253741927|gb|EES98785.1| Hypothetical protein GL50581_3974 [Giardia intestinalis ATCC 50581]
Length = 409
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 34/79 (43%), Positives = 45/79 (56%), Gaps = 5/79 (6%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADAR---LMALRDVEEGEELRICYI-DASMARD 338
+G A+Y L S NH C PNA +++ D+ DAR L+ LR +GEEL I YI D +
Sbjct: 325 IGAALYSLISCCNHSCAPNAQVIFEDSEDAREATLVLLRPCVQGEELYISYITDLGRSVS 384
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L Q + F C C RC
Sbjct: 385 ERRRELAQ-WCFTCQCTRC 402
>gi|390353457|ref|XP_003728116.1| PREDICTED: SET and MYND domain-containing protein 3-like
[Strongylocentrotus purpuratus]
Length = 319
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 61/292 (20%), Positives = 118/292 (40%), Gaps = 54/292 (18%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKIT-SSSQHFQHHNARFCGE 147
GRG+ A I+A + +P + L C CLR ++ ++ + N
Sbjct: 15 GRGLRAVSTIQAGTCVLEDEPYVYILNEEQLQLRCGHCLRALSLGNTSEVESKN------ 68
Query: 148 VCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPAS 207
CK + Y N C+ G K R C ++ G E D
Sbjct: 69 -CKSCKRVVY----------CNKACKVAGRKE----HRYECKLLQGKEKDD--------- 104
Query: 208 LSPELILAM---EEGFVMLRS---------------AFKKAGIDDEQMKFLNKQWYTNVL 249
++ L++ M + + + +S +F+ + E + L + +
Sbjct: 105 VTLRLLMKMILKSKNYELKKSNGTLCEFPSSYRDLCSFQLSTKAQEHLLSLFRCYEKGWR 164
Query: 250 AQIR--INAFRIELAGGLYEDLLSSAAASIE--SEIAVGNAIYMLPSFYNHDCDPNAHIM 305
+++R ++ FR + Y+ L ++ + + + VG A+Y+ S +NH C+PN +
Sbjct: 165 SKLRSHLSPFRSQEVLDTYQKLTINSFSMYDEMTRTIVGEALYIRASMFNHSCEPNCTFV 224
Query: 306 WIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + + + A++ +E GEE I Y+ + + R+ L +GF C C RC
Sbjct: 225 F-EGSRLSVRAIKRIEIGEECCISYMSSLLPSPLRKEKLRSIYGFTCQCPRC 275
>gi|195359283|ref|XP_002045335.1| GM11659 [Drosophila sechellia]
gi|194129184|gb|EDW51227.1| GM11659 [Drosophila sechellia]
Length = 496
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 118/294 (40%), Gaps = 36/294 (12%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
+ + F CG VC AK + ++D F + ++ ++ RL C
Sbjct: 77 LPDT--RFMCRQG--CGLPVCSLCAKK---KQHKSDCDLFKSWGPNEPDVANSVIIRLLC 129
Query: 189 M---IISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWY 245
+ I E D I LQ A+L + FK D + ++ +N+
Sbjct: 130 VARAINLSKEQRDLIYCLQ-ANLDNNHRTEVRNAA----KCFKNFPTDKKLIEIMNR--- 181
Query: 246 TNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIM 305
+A +R N F + + +++ A+Y L NHDC PNA+
Sbjct: 182 --TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVVNHDCIPNAYYT 227
Query: 306 WIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + + ++ A D+ EG E+ Y AR L F C C RCS
Sbjct: 228 FEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSRCS 281
>gi|358368449|dbj|GAA85066.1| SET and MYND domain protein [Aspergillus kawachii IFO 4308]
Length = 484
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 69/304 (22%), Positives = 119/304 (39%), Gaps = 54/304 (17%)
Query: 73 SQPSPPPIQVALTESAGRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNSVCYFCLRKIT 131
+ P P P GRG+FA+ I D++H P + L VC C +
Sbjct: 5 TTPQPAP------GGLGRGLFASTDIAVGEDVLHIPVPFVAVLDTEHLGEVCSGCFGQRQ 58
Query: 132 SSSQHFQHHNARFCGEV--CKDNAKAFYDVERRADWSAFNDY--CRSQGLK---YPLLVK 184
+ R CG V C +A DW + + Q LK P+ +
Sbjct: 59 LEEEGIALKGCRGCGVVKYCDKTCQA-------KDWKLGHSFECTIYQKLKPRILPINAR 111
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
+ M++ + L + F+ L + K I DE W
Sbjct: 112 AVLRMVLRSERQKYSDEEL--------------DQFLQLETHIKD--IRDESAS----HW 151
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIE-SEIAVGNAIY------MLP--SFYN 295
L+ I A+ + E+++S+ A ++ + + NA+Y + P + +N
Sbjct: 152 ERISLSSKAIKAYS---GTEMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFN 208
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H CD NA + + D + + A+R +++ E++ I YID + RQ L + F C+C
Sbjct: 209 HSCDHNAAVSF-DGPNLHIKAIRPIQKDEQIFITYIDVTDPYPIRQHNLQSRYYFTCHCS 267
Query: 356 RCSS 359
+C+S
Sbjct: 268 KCTS 271
>gi|371943475|gb|AEX61304.1| putative SET domain-containing protein [Megavirus courdo7]
Length = 157
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPS--FYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
+D+L A + + G++I +L + +NH C PN + + N + + +RD+ +GE
Sbjct: 41 DDILLFCAKYMSNAFEYGDSIAILFNGRIFNHSCLPNI-VFYRCNDEMCFITVRDIYKGE 99
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
EL Y++ + + RQ+ L + F C+C RCSS +
Sbjct: 100 ELLDSYVNITHDKKTRQSRLWNQYRFHCDCQRCSSNN 136
>gi|407843445|gb|EKG01401.1| hypothetical protein TCSYLVIO_007600 [Trypanosoma cruzi]
Length = 504
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYID--ASMARD 338
A G A+Y + NH C P+ + A ++ALRD+E+GEE+R YID A +R
Sbjct: 423 ATGVALYDAATKINHSCVPSVRFVPTHGRVSAVVVALRDIEKGEEIRSSYIDLVAYTSRV 482
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L +GF+C+C C
Sbjct: 483 ERRGYLLSHYGFECDCSLC 501
>gi|299748156|ref|XP_001837499.2| hypothetical protein CC1G_01411 [Coprinopsis cinerea okayama7#130]
gi|298407841|gb|EAU84415.2| hypothetical protein CC1G_01411 [Coprinopsis cinerea okayama7#130]
Length = 673
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 17/139 (12%)
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGG--------LYEDLLSSAAASIESEIAVGNAIYM 289
+F ++ Y + ++ NAF + GG + + + + +G+A+Y
Sbjct: 480 QFTTEERYAILSGKMAYNAFGVCFNGGRDDRPAPEVRPEEVEKTRTPYGTSRQIGSAVYT 539
Query: 290 LPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDA--------R 340
+ S+ H CDP+A + + + L+A RD+++G+ + I ++D + D R
Sbjct: 540 VSSYLTHSCDPSARVSFSSGTTELHLVANRDLKKGDVVTIAFVDVNQHPDESVAECKRRR 599
Query: 341 QAILTQGFGFQCNCLRCSS 359
+ L +G+ F C C RC +
Sbjct: 600 RVELARGWKFACGCDRCEA 618
>gi|403161566|ref|XP_003890393.1| hypothetical protein PGTG_20945 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171834|gb|EHS64479.1| hypothetical protein PGTG_20945 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 379
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 283 VGNAIYMLPSF-YNHDCDPN-AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDAR 340
V A+ + P+ +NHDC PN + + D + A R + GEEL I Y + + R R
Sbjct: 217 VYTALVLEPAVRFNHDCRPNVGYYIDHDTQSIHMTAFRQILAGEELTISYRASELTRKMR 276
Query: 341 QAILTQGFGFQCNCLRC 357
Q L+ +GFQC+C C
Sbjct: 277 QDSLSANYGFQCSCSHC 293
>gi|225679205|gb|EEH17489.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 377
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 44/74 (59%), Gaps = 3/74 (4%)
Query: 289 MLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQ 346
MLP S+ NH C+PNA+I + D L AL+D+ E++ I YID + + RQ+ L +
Sbjct: 109 MLPFASYANHSCEPNAYIGF-DGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELQK 167
Query: 347 GFGFQCNCLRCSSG 360
+ F C C +C+ G
Sbjct: 168 RYFFTCKCPKCAQG 181
>gi|440484183|gb|ELQ64299.1| hypothetical protein OOW_P131scaffold00655g1, partial [Magnaporthe
oryzae P131]
Length = 668
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 63/127 (49%), Gaps = 7/127 (5%)
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
+ +K L +LA R F ++ ED+L + ++ A +++ S
Sbjct: 465 NRALKQLPAGEKGRILALARTGGFESQV-----EDVLRTNIFGVDVGGAFHMGLFVEGSR 519
Query: 294 YNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NH+C PN + + A+ ++ALRD+E+GEEL Y+ +R R+ L + +GF+C
Sbjct: 520 VNHNCRPNVYWEYDTKTMAQEVVALRDIEQGEELTHSYVTLGGSRSQRREEL-EAWGFEC 578
Query: 353 NCLRCSS 359
C CS+
Sbjct: 579 KCALCSA 585
>gi|113205454|gb|AAW28574.2| TPR domain containing protein, putative [Solanum demissum]
Length = 569
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 5/95 (5%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L E+L+S+ S++ G +++L SF NH CDPN I + + A RD++ GE
Sbjct: 302 LVEELISAKVLGKNSDVH-GIGLWILSSFINHSCDPNVRRSHIGDH-VTIHACRDIKAGE 359
Query: 325 ELRICYIDA-SMARDARQAILTQGFGFQCNCLRCS 358
EL Y D + RD + + +GF C C RC+
Sbjct: 360 ELTFAYFDVFTPFRDREEK--AKNWGFVCKCKRCN 392
>gi|67539656|ref|XP_663602.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|40738557|gb|EAA57747.1| hypothetical protein AN5998.2 [Aspergillus nidulans FGSC A4]
gi|259479819|tpe|CBF70391.1| TPA: SET and MYND domain protein, putative (AFU_orthologue;
AFUA_2G10080) [Aspergillus nidulans FGSC A4]
Length = 497
Score = 54.7 bits (130), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 119/307 (38%), Gaps = 59/307 (19%)
Query: 73 SQPSPPPIQVALTESAGRGVFATRRIRA-SDLIHTAKPIITHPTLSTLNSVCYFCLRKIT 131
++ +P P L GRG+FA IR D++H P + L C C K
Sbjct: 12 TKATPGPASNGL----GRGLFAYTDIRTCDDILHIQDPFVAVLKTERLQDTCSGCFGK-- 65
Query: 132 SSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII 191
+HF + G+ A V + D S C+S+ K L C+I
Sbjct: 66 ---RHFDS----YSGQEVSLKACTGCHVVKYCDKS-----CQSKDWK---LTHSRECVIF 110
Query: 192 SGAESADCIDILQPASL--SPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
L+P L + +L M +LR+ +K +E++ L + T++
Sbjct: 111 RN---------LKPKVLPVNARALLRM-----VLRTEARKNAYTEEEL-VLFQTLETHID 155
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGN-----------------AIYMLP- 291
+ NA + E + + +E E V IY+ P
Sbjct: 156 DILNRNAPQAERIALTSRAVKEYSKTDMEEEKIVAYHARLDLNSFNLTNDDDIGIYLHPY 215
Query: 292 -SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+ NH CD NA ++ D ++ + A+R + GE++ I YID + RQ L + + F
Sbjct: 216 AALINHSCDYNA-VVGFDGSEIFVKAIRPIATGEQIFISYIDTTYPTRIRQKELQERYFF 274
Query: 351 QCNCLRC 357
CNC +C
Sbjct: 275 TCNCAKC 281
>gi|410916601|ref|XP_003971775.1| PREDICTED: N-lysine methyltransferase SMYD2-B-like [Takifugu
rubripes]
Length = 432
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/285 (25%), Positives = 129/285 (45%), Gaps = 47/285 (16%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ TR + +L+ +++ ++ C C + S ++ CG+
Sbjct: 17 GRGLRVTRAFQVGELLLSSQAYSYVLSVKERGEHCESCFTRKKSLAR---------CGK- 66
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK KAFY DV+ ++ DW+ C + G K+ P + RL I++ +
Sbjct: 67 CK---KAFYCDVKCQKGDWAMHRLECSAMNAFGEKWCPSEITRLVARILTKKK------- 116
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
+Q + E +L + E ++S + +D+E+ + + +V + + +E+
Sbjct: 117 MQKDRCASEKLLLLGE----MQSHTED--MDNERRETME----ADVAGLHQFFSKHLEIP 166
Query: 263 GGLYEDLLS--SAAA----SIESE--IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
G ++DLL+ S A +IE E +G A+Y + NH C P+ I+ A +
Sbjct: 167 G--HKDLLTLFSQVACNGFTIEDEELSHLGTAVYPDVALINHSCRPSV-IVTYSGTSAHV 223
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
ALRD++ G+E+ I YID + R L + + F C C C S
Sbjct: 224 RALRDMKPGDEVLISYIDVLYPTEDRNNRLRESYYFTCQCEECES 268
>gi|357484667|ref|XP_003612621.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
gi|355513956|gb|AES95579.1| Histone-lysine N-methyltransferase ASHR2 [Medicago truncatula]
Length = 384
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 122/295 (41%), Gaps = 46/295 (15%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITH---PTLS-TLNSVCYFCLRKITSSSQHFQH---HN 141
GRG+ AT+ ++A +I T P++ + P S T + C+ C R + SQ FQ N
Sbjct: 46 GRGLIATQDLKAGQIILTESPLLLYSATPLFSHTPSPYCHNCFRTL-PPSQTFQCPSCSN 104
Query: 142 ARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVK----RLACMIISGAESA 197
FC + C A + W+ Q PL K ++ +I A +
Sbjct: 105 YLFCSQKCLSIALN----SSHSSWTC-QTLSHLQNPTSPLSEKPCELQVQARLIVAAYN- 158
Query: 198 DCIDILQPASLSPELIL---AMEEGFVMLRSAFKKAGID---DEQMKFLNKQWYTNVLAQ 251
+ I P+ L L L ++ ++ + F + I M F + + ++A+
Sbjct: 159 --LAIHTPSKLQTLLSLHGNPNDQDSIVDNAKFLHSLISPFCSPHMNF-SAELAAKIIAK 215
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWID-- 308
R+N+F L E S S A G IY +F+NH C PNA +++
Sbjct: 216 ERLNSF------CLMEPY--SQKGPQRSIKAYG--IYQKATFFNHGCIPNACRFDYVESG 265
Query: 309 ------NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N D + ++DV G E+ I Y + R+ IL + +GF C C RC
Sbjct: 266 EPGDEHNTDIVIRLIKDVGVGSEICISYFRINKDYSTRKKILMEDYGFTCECDRC 320
>gi|240981045|ref|XP_002403605.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491387|gb|EEC01028.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 770
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/335 (21%), Positives = 136/335 (40%), Gaps = 64/335 (19%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ + + GR + A R ++ D I +P + + C C +++ ++ Q
Sbjct: 245 VDMLYSTEKGRFLVANRDLQPGDAIFVERPYASVLLPGHTKTNCQHCHKRLLNAVPCAQC 304
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYPLLVKRLAC--MIISG--- 193
+ R+C C ++ W++++ + C + L Y + + LA ++++G
Sbjct: 305 NQVRYCSFACAKDS-----------WNSYHRWECGNLNLLYSVGIAHLAVRVLLVTGLSG 353
Query: 194 --------AESADCIDILQPASLSPELILAMEEGFV--MLRSAFKKA------------G 231
AE +D EL+ E+ V +L+ + A
Sbjct: 354 LADFCRHLAEGKVDVDKNGGYGSVHELVTHSEKMHVEDLLQYSLTAALLSMLLDHVLFFN 413
Query: 232 IDD-----EQMKFLNKQWYTNVLAQIRINAFRIELAGGLY---EDLLSSAAA--SIES-- 279
+D+ ++ F ++ + LA A + + G L + L+ +A A S+ES
Sbjct: 414 VDEASWKMSELSFGSRNGSSYRLASTDAAALKTVVGGLLLRHIQQLVCNAHAITSLESKT 473
Query: 280 -----------EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRI 328
++ + AIY S NH CDPN + + + A+R ++EGEE+
Sbjct: 474 SQEDDVVMTTEQVRIATAIYPSASLMNHSCDPNIFSSFRCGSTLVVRAIRRIQEGEEVLN 533
Query: 329 CY--IDASMARDARQAILTQGFGFQCNCLRCSSGD 361
CY M+ RQ +L + + F C+C CSSG+
Sbjct: 534 CYGPHHRRMSFAERQQLLQEQYFFVCSCTACSSGE 568
>gi|393238103|gb|EJD45641.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 409
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 46/108 (42%), Gaps = 15/108 (13%)
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNA 310
+R N +++ A L + S A+ + S NH C PN W
Sbjct: 314 LRTNGWKVTFAENLGDRTFS--------------AVMQIMSRANHSCAPNTSFNWEWQKY 359
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A ALRD+ GEE+ + YID + R+ L + + F+C C RC
Sbjct: 360 QATYTALRDIAAGEEITVSYIDDKKPKSERRKELKEKYFFKCTCERCG 407
>gi|340966897|gb|EGS22404.1| mannose-6-phosphate isomerase-like protein [Chaetomium thermophilum
var. thermophilum DSM 1495]
Length = 859
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Query: 264 GLYEDLLSSAAASIESEIAVGN----AIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALR 318
G ED++ + + + AVG A++ + NH+C PNA I + D A L + R
Sbjct: 597 GFVEDVMLTNSFGV----AVGGREVMALFTNLARINHNCRPNAFIHFSDTTLAMTLWSAR 652
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+++ GEE+ I Y A RQ +L +GF+C+C C+S
Sbjct: 653 EIQPGEEITITYSPAGRTSKERQRMLKSTWGFECHCELCTS 693
>gi|85109942|ref|XP_963161.1| hypothetical protein NCU09495 [Neurospora crassa OR74A]
gi|28924825|gb|EAA33925.1| predicted protein [Neurospora crassa OR74A]
gi|38524259|emb|CAE75723.1| conserved hypothetical protein [Neurospora crassa]
Length = 320
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQ 341
+G +++ S NH C+PN W + + A+RD+++GEE+ I YI RQ
Sbjct: 104 IGGGLFIEASRINHACNPNTQNSWNSRINRETIHAVRDIKKGEEITISYIGHFAPYVERQ 163
Query: 342 AILTQGFGFQCNCLRCS 358
+IL F F C C CS
Sbjct: 164 SILKIKFNFDCTCELCS 180
>gi|46122751|ref|XP_385929.1| hypothetical protein FG05753.1 [Gibberella zeae PH-1]
Length = 508
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
A+++ S NH C PNA W + A+ D+E+G+E+ I Y+ + RQ L
Sbjct: 90 AVFLATSRINHSCKPNAQNRWNQGLGKITVHAVEDIEQGQEITITYLGNPEVYEERQKKL 149
Query: 345 TQGFGFQCNCLRCS 358
FGF C C CS
Sbjct: 150 KNAFGFDCCCRLCS 163
>gi|443896871|dbj|GAC74214.1| predicted histone tail methylase containing SET domain [Pseudozyma
antarctica T-34]
Length = 882
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 43/82 (52%), Gaps = 16/82 (19%)
Query: 292 SFYNHDCDPNAHIM-------------WIDNADARLM---ALRDVEEGEELRICYIDASM 335
+ NH C PNA ++ W D DA++M ALR +E GEEL I Y+D +
Sbjct: 498 AMLNHSCAPNAAVVFPFGGAAKSGQQNWSDGEDAKVMQLVALRAIEPGEELLISYVDVAD 557
Query: 336 ARDARQAILTQGFGFQCNCLRC 357
A + R++ L + + F+C C C
Sbjct: 558 AYEQRRSALKKRYCFECRCELC 579
>gi|154305307|ref|XP_001553056.1| hypothetical protein BC1G_08948 [Botryotinia fuckeliana B05.10]
Length = 507
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 39/65 (60%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH CD NA WI+++ + AL+D+ + EE+ I Y+ + R AR+A L FGF C+
Sbjct: 124 NHACDRNAGTHWIESSQKIAVTALKDISKDEEITINYLGLNKPRRARRASLQTHFGFDCS 183
Query: 354 CLRCS 358
C CS
Sbjct: 184 CGLCS 188
>gi|147905510|ref|NP_001085986.1| N-lysine methyltransferase SMYD2-B [Xenopus laevis]
gi|82184154|sp|Q6GN68.1|SMY2B_XENLA RecName: Full=N-lysine methyltransferase SMYD2-B; AltName:
Full=Histone methyltransferase SMYD2-B; AltName:
Full=SET and MYND domain-containing protein 2B
gi|49115919|gb|AAH73650.1| MGC82991 protein [Xenopus laevis]
Length = 430
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 63/289 (21%), Positives = 116/289 (40%), Gaps = 31/289 (10%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
P ++ + GRG+ ATR +L+ + T + + C FC + S+
Sbjct: 4 PEGLERFDSPGKGRGLKATRSFALGELLFSCPAYTYVLTDNERGNHCDFCFTRKEGLSKC 63
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP--LLVKRLACMIISGA 194
+ A +C C+ + +E + S+ ++C S+ ++ +L K+ + +
Sbjct: 64 GKCKQAFYCNVDCQKGDWPMHKLECSSMCSSGQNWCPSETVRLTARILAKQKTQTERTAS 123
Query: 195 ESADCIDILQP--ASLSPELILAMEEGFVMLRSAFKKA--GIDDEQMKFLNKQWYTNVLA 250
E + + + L E +E L + K D+ ++FL A
Sbjct: 124 ERFMSVKEFESHLSKLDNEKKELIENDISALHRFYSKNVHNCDNAALEFL--------FA 175
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
Q+ N F IE ++ LS +G+AI+ + NH C PN I+
Sbjct: 176 QVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-IVTYKGT 218
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
A + A++++ GEE+ YID + R L + F C+C CS+
Sbjct: 219 VAEVRAVQEIHAGEEVFTSYIDLLYPTEDRNDRLKDSYFFSCDCRECST 267
>gi|195156964|ref|XP_002019366.1| GL12277 [Drosophila persimilis]
gi|194115957|gb|EDW38000.1| GL12277 [Drosophila persimilis]
Length = 188
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 58/124 (46%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRIEL---AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
QW TNV A + L LY + A + +E G+ +Y+L S NH C
Sbjct: 40 QWVTNVSDLPLPEADKTALDTVIDELYSKVGEFAGEFLNNE---GSGLYLLQSKINHSCV 96
Query: 300 PNA-HIMWIDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA N L AL ++EGEE+ I Y+D +R +R +L + + F C CL
Sbjct: 97 PNACSTFPYSNDIVVLKALAPIQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQCL 156
Query: 356 RCSS 359
+C +
Sbjct: 157 KCQA 160
>gi|194756896|ref|XP_001960706.1| GF13486 [Drosophila ananassae]
gi|190622004|gb|EDV37528.1| GF13486 [Drosophila ananassae]
Length = 517
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 65/289 (22%), Positives = 104/289 (35%), Gaps = 37/289 (12%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ-HF 137
P ++A +E GR + ATR I+ ++I P++ P + VC CL I +
Sbjct: 44 PFKIAHSEQLGRHLVATRTIKPYEIILREAPLVRGPAQISA-PVCMGCLNSIEAEDHIDC 102
Query: 138 QHHNARFCGEVCKD------NAKAFYDVERRADWSAFNDYCRSQGLKYPLL--VKRLACM 189
+ CG CK K D ++ + + FN +PL V + C+
Sbjct: 103 EQCGWPLCGPECKSLEEHQAECKLTKDRGQKVNVNEFNG-------PHPLYTCVSTVRCL 155
Query: 190 IISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
+I + SL + G ++ A I KF Q +T
Sbjct: 156 LIGETSPEKAAKFQELESLE-----STRRGSNQWKADL--ASIGQFIPKFFKTQKFTEEQ 208
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+ A +I I + ++ SF + C PN + N
Sbjct: 209 IMRAVGALQIN-------------GHEIPTSDPPHVGVFYTASFTENSCLPNLAKSFNKN 255
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L A +++++ L ICY DA RQ L Q F+C C RC+
Sbjct: 256 GHCILWAPQEIKKNAHLSICYSDAMWGTADRQRHLMQTKLFKCACARCA 304
>gi|224082230|ref|XP_002306611.1| SET domain protein [Populus trichocarpa]
gi|222856060|gb|EEE93607.1| SET domain protein [Populus trichocarpa]
Length = 458
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/274 (21%), Positives = 108/274 (39%), Gaps = 47/274 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNA----RF 144
GR + T+ ++I +P + P S+ S C C +S+ + +A +
Sbjct: 22 GRCLLTTKNFNPGEVILRQEPYVCVPNNSSTVSRCDGCF-----ASESLKKCSACQVVWY 76
Query: 145 CGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQ 204
CG C+ +++W + RL C +S E +
Sbjct: 77 CGSTCQ-----------KSEWK----------------LHRLECNALSRLEKE------K 103
Query: 205 PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG 264
+++P + L + + LR + DE+ L Q V ++ ++
Sbjct: 104 RKAVTPSIRLMVR---LYLRRKLQNEMDLDEKQLVLYAQMANLVHFILQWPEINLKEIAE 160
Query: 265 LYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
+ L +A + E+ +G +Y + S NH C PNA ++ + + + A+ + EG
Sbjct: 161 NFSKLACNAHTICDCELRPLGTGLYPVVSIINHSCMPNA-VLTFEGKSSVVRAVEHIPEG 219
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ I YID + + RQ L + + F C C RC
Sbjct: 220 AEVSIAYIDTAGSTMTRQKALKEQYFFTCTCPRC 253
>gi|148906849|gb|ABR16570.1| unknown [Picea sitchensis]
Length = 441
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + S NH C PNA ++ + A + A+ + EG EL + YI+ + + +R+
Sbjct: 157 MGTGLYPVISIINHSCFPNA-VLLFEGRQAVVRAVEPIREGSELTVSYIEIAASTASRKK 215
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + + F C CLRC D
Sbjct: 216 SLKEQYFFDCKCLRCLKVD 234
>gi|242798932|ref|XP_002483270.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218716615|gb|EED16036.1| SET and MYND domain protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 547
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/278 (20%), Positives = 113/278 (40%), Gaps = 28/278 (10%)
Query: 89 GRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGE 147
G+GVFAT ++ +D++ P+I + L ++C C TS + +
Sbjct: 46 GQGVFATIDVKKGADVLVVDNPLIALVEEAQLQNICSGCYD--TSKAGSIDNRRPDLVKA 103
Query: 148 VCKDNAKAFYDVE-RRADWSAFNDY-CRSQGLKYPLLVK---RLACMIISGAESADCIDI 202
+ + D +R DW A + C++ YP ++ R I+ + I
Sbjct: 104 CTRCKVVYYCDKNCQRKDWKAGHSLECKTYAELYPKILPLPVRAVLRILMLRRADKIIPE 163
Query: 203 LQPASLS---PELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
+ SL+ P++ +E KA + + L+ +W ++ ++ N+F +
Sbjct: 164 VYSESLALTYPKIYDCVESQETKDHLLMAKALREYSNLTDLDNKWVASLFGRLNANSFCL 223
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD 319
A G G + P+ +NH CDPNA + + A+R
Sbjct: 224 TSAFGRRR----------------GVYFHPGPARFNHSCDPNASYSFA-KGKCYIRAIRP 266
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + E++ I Y+D + + R+ L + + F+C C +C
Sbjct: 267 IAKDEQIFISYVDTTYSVGTRRHELQERYRFECQCPKC 304
>gi|157823305|ref|NP_001101340.1| SET and MYND domain-containing protein 5 [Rattus norvegicus]
gi|149036567|gb|EDL91185.1| SET and MYND domain containing 5 (predicted) [Rattus norvegicus]
Length = 417
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D+E G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIEPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|71421326|ref|XP_811771.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876474|gb|EAN89920.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA-RQ 341
+G A+Y S++NH C PN + A ALR++ +GE L ICY+D A R+
Sbjct: 567 IGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERR 626
Query: 342 AILTQGFGFQCNCLRCSSGD 361
L + F C C RCS +
Sbjct: 627 RTLFSSYRFFCECARCSGAN 646
>gi|73980996|ref|XP_533007.2| PREDICTED: SET and MYND domain-containing protein 5 [Canis lupus
familiaris]
Length = 419
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +Y+L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLYVLQSCCNHSCVPNAETTFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|157115758|ref|XP_001652682.1| hypothetical protein AaeL_AAEL007346 [Aedes aegypti]
gi|108876750|gb|EAT40975.1| AAEL007346-PA [Aedes aegypti]
Length = 354
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 43/79 (54%), Gaps = 3/79 (3%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID--ASMARDAR 340
V I+ S +NH CDPN + + + A RDVE G E+ CY M+R+ R
Sbjct: 99 VFTGIFPQISMFNHSCDPNIRNCF-SKSTLTVYATRDVEAGGEIFNCYGPNFKLMSREDR 157
Query: 341 QAILTQGFGFQCNCLRCSS 359
Q+ L Q + F C+C+RCSS
Sbjct: 158 QSALKQQYCFDCDCIRCSS 176
>gi|363540350|ref|YP_004894267.1| mg216 gene product [Megavirus chiliensis]
gi|350611515|gb|AEQ32959.1| putative set domain-containing protein [Megavirus chiliensis]
Length = 227
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 3/97 (3%)
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPS--FYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
+D+L A + + G++I +L + +NH C PN + + N + + +RD+ +GE
Sbjct: 111 DDILLFCAKYMSNAFEYGDSIAILFNGRIFNHSCLPNI-VFYRCNDEMCFITVRDIYKGE 169
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
EL Y++ + + RQ+ L + F C+C RCSS +
Sbjct: 170 ELLDSYVNITHDKKTRQSRLWNQYRFHCDCQRCSSNN 206
>gi|295665212|ref|XP_002793157.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278071|gb|EEH33637.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 508
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 23/124 (18%)
Query: 259 IELAGGLYEDLLSSAAASIE-SEIAVG-------NA------------IYMLP--SFYNH 296
+ELAG E +L+SA E S VG NA + MLP S+ NH
Sbjct: 156 MELAGEQIERVLTSAEGLKEISNTDVGLPRLLETNAFTLTNRYFDRIGLCMLPFASYANH 215
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C+PNA+I + D L AL+D+ E++ I YID + + RQ+ L + + F C C +
Sbjct: 216 SCEPNAYIGF-DGPVIYLKALQDIALDEQIFISYIDNTEPWEKRQSELKKRYFFTCKCPK 274
Query: 357 CSSG 360
C G
Sbjct: 275 CVQG 278
>gi|302141868|emb|CBI19071.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 58/297 (19%), Positives = 117/297 (39%), Gaps = 54/297 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ V+ T GR + + + ++I + +P ++ P S ++S C C R ++ +
Sbjct: 13 LTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKKCSAC 71
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
H +CG C+ ++DW + RL C +S E
Sbjct: 72 HVVWYCGSTCQ-----------KSDWK----------------LHRLECNALSRLEKE-- 102
Query: 200 IDILQPASLSPELILAME------------------EGFVMLRSAFKKAGIDDEQMKFLN 241
+ SL+P + L ++ + + ++ + DE+ L
Sbjct: 103 ----RQKSLTPSIRLMVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDP 300
Q V ++ ++ + L +A + E+ +G +Y + S NH C P
Sbjct: 159 AQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLP 218
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N+ ++ + A + A++ + +G E+ I YI+ + + RQ L + + F C C RC
Sbjct: 219 NS-VLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRC 274
>gi|356507670|ref|XP_003522587.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Glycine
max]
Length = 484
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 62/275 (22%), Positives = 117/275 (42%), Gaps = 21/275 (7%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLN--SVCYFCLRKITSS--SQHFQHHNARF 144
GR + ATR ++I + +P + P S+++ C C I ++ S+ + A +
Sbjct: 22 GRSLLATRDFYPGEVIISQEPYVCVPNNSSVSPQKRCDGCFTTINNNVLSRCSRCQLAFY 81
Query: 145 CGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQ 204
CG C+ + + +E S+ + Y R L + M+ D +
Sbjct: 82 CGTACQRSEWKLHRLECEV-LSSLHKYKRKS------LTPSIRLMLRLYLRRKLQNDKII 134
Query: 205 PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGG 264
P++ AM+ ++ + I +EQ+ L Q V + + I+
Sbjct: 135 PST-------AMDNYNLVEALVAHMSDITEEQL-VLYAQMANLVNSILEWPGINIKEIAE 186
Query: 265 LYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
+ +A +SE+ VG +Y + S NH C PN+ ++ + + A + A++ + G
Sbjct: 187 NFSKFACNAHTICDSELRPVGTGLYPVISIINHSCLPNS-VLVFEGSSALVRAVQHIPSG 245
Query: 324 EELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E+ I YI+ + + RQ L + + F C C RCS
Sbjct: 246 TEVLISYIETAESTMTRQKALKEQYLFTCTCPRCS 280
>gi|392561695|gb|EIW54876.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 342
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAI 343
+++ + NH C PN W D A ++ ALRDV EGEEL + Y+D R R
Sbjct: 146 GVFLTAARLNHSCRPNLARSW-DEASQQMTFRALRDVAEGEELCLNYVDVIGTRAQRTEE 204
Query: 344 LTQGFGFQCNCLRCSSG 360
L +GF+C C C+ G
Sbjct: 205 LQSAYGFECVCEACALG 221
>gi|327288923|ref|XP_003229174.1| PREDICTED: SET and MYND domain-containing protein 5-like, partial
[Anolis carolinensis]
Length = 163
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEG 323
LY+D+ + E G+ +Y+L S NH C PNA + DN L AL D+ G
Sbjct: 29 LYKDIEKETGEFLNCE---GSGLYVLQSCCNHSCIPNAETSFPDNNFLLHLTALEDIRPG 85
Query: 324 EELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R +L + + F C+C +C
Sbjct: 86 EEICISYLDCCQRERSRHSRHKVLRENYLFVCSCPKC 122
>gi|237832155|ref|XP_002365375.1| hypothetical protein TGME49_062750 [Toxoplasma gondii ME49]
gi|211963039|gb|EEA98234.1| hypothetical protein TGME49_062750 [Toxoplasma gondii ME49]
Length = 521
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDAR 340
+Y L + NH CDPN + + + LR V GEEL + Y+D S+ AR
Sbjct: 427 GGVYTLHACANHSCDPNCGVSSSGSEGGGSTLSVATLRAVAPGEELTVSYVDISLPLKAR 486
Query: 341 QAILTQGFGFQCNCLRC 357
+ +L FGF C C +C
Sbjct: 487 REMLLSSFGFLCRCSKC 503
>gi|71409653|ref|XP_807160.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871100|gb|EAN85309.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 697
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA-RQ 341
+G A+Y S++NH C PN + A ALR++ +GE L ICY+D A R+
Sbjct: 567 IGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERR 626
Query: 342 AILTQGFGFQCNCLRCSSGD 361
L + F C C RCS +
Sbjct: 627 RTLFSSYRFFCECARCSGAN 646
>gi|367012652|ref|XP_003680826.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
gi|359748486|emb|CCE91615.1| hypothetical protein TDEL_0D00310 [Torulaspora delbrueckii]
Length = 486
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 74/317 (23%), Positives = 117/317 (36%), Gaps = 54/317 (17%)
Query: 79 PIQVALTESA---GRGVFATRRIRASDLI-HTAKPIITHPTLSTLN-----SVCYFCLRK 129
P V L E+ GRG+FA + IR +LI + P+ P + L+ VC C
Sbjct: 108 PKNVELIETGDEKGRGLFAKKEIRQGELILGESGPLAAIPPIDKLSLIQAGKVCSLCGNS 167
Query: 130 ITSSSQHFQHHN--------ARFCGEVCKD---------NAKAFYDVERRADWSAFNDYC 172
+ S HF N A +C CK + K + + W+ F C
Sbjct: 168 L-GHSDHFVMMNGLDCNGCGAVWCNRNCKKTDVTHASLKHVKGKGSLTNASGWAKFEKSC 226
Query: 173 RSQGLKYPLLVKRLACMII---SGAESADCIDILQP----ASLS----PELILAMEEGFV 221
+ V + +I S AD +LQ A +S ++ + G
Sbjct: 227 MEN-----VFVAAYSVGVIYATSAVSKADENHVLQHFRSLAQVSQRARKQVSDSTNVGGT 281
Query: 222 MLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEI 281
S+ D E M +++ + F E L+ ++I
Sbjct: 282 FDASSGAVTSEDPEPMWQQAYEYFKEAFPKSEDIDF---------ESFLTYVGKFNLNQI 332
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
+ IY + SF NHDC+PN D ++ A + ++ GEEL Y++ R+
Sbjct: 333 S--GQIYPIYSFINHDCEPNVRYEIDDKLRLKVFARKPIKAGEELLTTYVNPLHGVKLRR 390
Query: 342 AILTQGFGFQCNCLRCS 358
L +GF C C RC+
Sbjct: 391 RELRVNWGFLCQCERCT 407
>gi|154689581|ref|NP_659167.2| SET and MYND domain-containing protein 5 [Mus musculus]
gi|90101759|sp|Q3TYX3.2|SMYD5_MOUSE RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|148666711|gb|EDK99127.1| SET and MYND domain containing 5 [Mus musculus]
Length = 416
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 287 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPG 343
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 344 EEICISYLDCCQRERSRHSRHKILRENYLFNCSCPKC 380
>gi|74196676|dbj|BAE34438.1| unnamed protein product [Mus musculus]
Length = 416
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 287 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPG 343
Query: 324 EELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 344 EEICISYLDCRQRERSRHSRHKILRENYLFNCSCPKC 380
>gi|321462734|gb|EFX73755.1| SET and MYND domain-containing protein 1 [Daphnia pulex]
Length = 452
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 52/101 (51%), Gaps = 4/101 (3%)
Query: 264 GLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE- 321
G+Y + ++ + E+ A+G IY+ PS +H C PNA + D R+ +++
Sbjct: 211 GIYGRMCINSFNILNGEMQAIGTGIYLAPSILDHSCSPNA-VATFDGFKLRIQLTQELPK 269
Query: 322 -EGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
E + +RI YID ++ R+ L + F C+C RC + D
Sbjct: 270 LEWDSIRISYIDLMNSKSHRKKELKDRYYFDCDCPRCKNDD 310
>gi|167377998|ref|XP_001734627.1| set and mynd domain containing protein [Entamoeba dispar SAW760]
gi|165903780|gb|EDR29210.1| set and mynd domain containing protein, putative [Entamoeba dispar
SAW760]
Length = 426
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 126/301 (41%), Gaps = 34/301 (11%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSS------SQHFQHHNA 142
G+GVF+ +R + ++I PII + + C FCL+ + S +Q F+ N
Sbjct: 126 GKGVFSKKRYQKEEIIMKEIPIIQISKIEDYYNCCGFCLKTLCSKEPKNKYNQIFKEINK 185
Query: 143 RFCGEV--CKDNAKAFYDVERRADWS---------AFNDYCRSQGLKYPLLVKRL-ACMI 190
E+ CK K ++ D++ +YC+ + +PL + ++ A ++
Sbjct: 186 E---EIIKCKCGMKYCNFKCQQLDYTHSLLCNKLKGLINYCKINHISHPLCIAKIFATIL 242
Query: 191 ISG--AESADCIDILQPASLSP---ELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWY 245
IS +S + + L P E++ F L F G + L
Sbjct: 243 ISQNIEKSLFPFVVFHSSPLIPFNSEIVPFFLNIFGSLLLKFNIYGGWTFIYQILYSVLK 302
Query: 246 TNVLAQIRINAFRIELAGG----LYEDLLSSAAAS---IESEIAVGNAIYMLPSFYNHDC 298
N + + +N + L + LLSS+ + I + ++ + NH C
Sbjct: 303 YNSSSILPLNPIQSLLLNNNSLLIDNTLLSSSLINHPDITAFTVEAEGLFKYLNTLNHSC 362
Query: 299 DPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
PN + D++ A L+A + G+EL I YID ++ RQ++L + F C+C +C
Sbjct: 363 SPNCFLANTDDSFALSLIASCPISPGDELTISYIDNTLPYSQRQSLLYDSYHFYCHCPKC 422
Query: 358 S 358
+
Sbjct: 423 N 423
>gi|74187979|dbj|BAE37118.1| unnamed protein product [Mus musculus]
Length = 400
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 271 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPG 327
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 328 EEICISYLDCCQRERSRHSRHKILRENYLFNCSCPKC 364
>gi|403413538|emb|CCM00238.1| predicted protein [Fibroporia radiculosa]
Length = 275
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C PN +W + ALR + GEEL I Y+D RD R+A L FGF+C
Sbjct: 149 NHSCTPNVSRVWDSEEQVLKFYALRHIGPGEELCISYLDVLETRDERRAELWSHFGFECA 208
Query: 354 CLRCS 358
C C+
Sbjct: 209 CSVCT 213
>gi|240979812|ref|XP_002403255.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
gi|215491338|gb|EEC00979.1| SET and MYND domain-containing protein (SMYD), putative [Ixodes
scapularis]
Length = 413
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
+Y+D+ + + +E G +Y L S H C PNA ++ N L+ALRD+ G
Sbjct: 293 IYDDIERESGVFLNNE---GCGLYPLQSLCAHSCCPNAEARFLHNDHTLSLVALRDIRPG 349
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ + YID S +R +R +L + F C C RC
Sbjct: 350 EEVTVSYIDECSLSRSRHSRIKMLRESHLFTCTCPRC 386
>gi|308809097|ref|XP_003081858.1| FOG: TPR repeat (ISS) [Ostreococcus tauri]
gi|116060325|emb|CAL55661.1| FOG: TPR repeat (ISS) [Ostreococcus tauri]
Length = 575
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
++ +PSF NH C PNAH + + + A +D+ G E+ I Y D + + R A T
Sbjct: 304 GVWSMPSFMNHSCIPNAHRINVGKV-MLVFASKDLPVGAEVTIKYYDTLIPKRDRDAFAT 362
Query: 346 QGFGFQCNCLRC---SSGD 361
+ G++CNC+RC S GD
Sbjct: 363 R-RGYECNCVRCAFESEGD 380
>gi|46139505|ref|XP_391443.1| hypothetical protein FG11267.1 [Gibberella zeae PH-1]
Length = 593
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQA 342
G+ +++ S NH CD NA + + + ALRD+EEGEE+ I Y+ R RQ
Sbjct: 102 GSGVFLQASRINHACDNNAQKDYNEGIKRHTVHALRDIEEGEEITITYLGILKNRRTRQQ 161
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L F F C C CS D
Sbjct: 162 ALRTKFMFTCTCNLCSLPD 180
>gi|328767279|gb|EGF77329.1| hypothetical protein BATDEDRAFT_27650 [Batrachochytrium
dendrobatidis JAM81]
Length = 445
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 111/297 (37%), Gaps = 63/297 (21%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHH 140
QV GR + AT+ +R KP + ++LN C C R Q
Sbjct: 9 QVENRNGYGRCMIATKDLRVGLEFMMEKPYVAVVDDASLNQTCSGCFRLAAHMQQCSSCK 68
Query: 141 NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKY------PLLVKRLACMIISGA 194
++C + C+ R+DWS C +G K P V+ L M+ A
Sbjct: 69 VVQYCSQTCQ-----------RSDWSIHKPEC--EGFKAVQPRIPPSPVRLLGRMMFKRA 115
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
+ DC + E ++ E K+A D E +
Sbjct: 116 K--DCNEF--------ERVVGQLESH-----RDKRASKDIEHI----------------- 143
Query: 255 NAFRIELAGGLYED--LLSSAAASIE--SEIAV-------GNAIYMLPSFYNHDCDPNAH 303
A +++A G LLS+ A I +I V G AIY S NH C PNA
Sbjct: 144 -AAMLQMASGFIPPALLLSTTADMIALCCKIQVNTMTTERGVAIYDRLSTVNHSCVPNAC 202
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ + ARL + + G+++ I Y+D + + RQ L + + F C C C++
Sbjct: 203 LTFGIGGIARLSPMTAIASGDQINISYVDVFQSCETRQRQLKEQYYFDCTCRLCTAN 259
>gi|225459467|ref|XP_002284386.1| PREDICTED: histone-lysine N-methyltransferase ASHR1-like [Vitis
vinifera]
Length = 477
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 58/297 (19%), Positives = 117/297 (39%), Gaps = 54/297 (18%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ V+ T GR + + + ++I + +P ++ P S ++S C C R ++ +
Sbjct: 13 LTVSTTPEKGRCLLSIKDFSPGEVIISQEPYVSVPNNSAVHSRCEGCFRS-SNLKKCSAC 71
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
H +CG C+ ++DW + RL C +S E
Sbjct: 72 HVVWYCGSTCQ-----------KSDWK----------------LHRLECNALSRLEKE-- 102
Query: 200 IDILQPASLSPELILAME------------------EGFVMLRSAFKKAGIDDEQMKFLN 241
+ SL+P + L ++ + + ++ + DE+ L
Sbjct: 103 ----RQKSLTPSIRLMVKLYMRRKLQSEKIMPTTARDNYNLVEALVSHITDIDEKQLVLY 158
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDP 300
Q V ++ ++ + L +A + E+ +G +Y + S NH C P
Sbjct: 159 AQMANLVNLILQWPDINVKEIAENFSKLACNAHTICDGELRPLGTGLYPVISIINHSCLP 218
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
N+ ++ + A + A++ + +G E+ I YI+ + + RQ L + + F C C RC
Sbjct: 219 NS-VLVFEERLAVVRAVQHIPKGTEVLISYIETAGSTITRQKALKEQYLFTCTCPRC 274
>gi|384485892|gb|EIE78072.1| hypothetical protein RO3G_02776 [Rhizopus delemar RA 99-880]
Length = 554
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 35/67 (52%)
Query: 293 FYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
F+NH C+PN + + N +R V++ EEL + YID RD R+ L F C
Sbjct: 264 FFNHGCNPNTAFVGLPNGQLAFRTIRSVQKDEELVVSYIDIYSDRDERRQELLTTKHFWC 323
Query: 353 NCLRCSS 359
C RC+S
Sbjct: 324 KCKRCAS 330
>gi|17946274|gb|AAL49177.1| RE62495p [Drosophila melanogaster]
Length = 500
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 122/302 (40%), Gaps = 52/302 (17%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLV 183
+ + RF CG VC AK + ++D F + ++ +V
Sbjct: 77 LP---------DTRFMCRQGCGLPVCSLCAKK---KQHKSDCDLFKSWGPNEPDVANSVV 124
Query: 184 KRLACM---IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQM 237
RL C+ I E D I LQ A+L +R+A FK D + +
Sbjct: 125 IRLLCVARAINLSKEQRDLIYCLQ-ANLD-------NNHRTEVRNAAKCFKNFPTDKKLI 176
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+ +N+ +A +R N F + + +++ A+Y L NHD
Sbjct: 177 EIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVVNHD 219
Query: 298 CDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PNA+ + + + ++ A D+ EG E+ Y AR L F C C R
Sbjct: 220 CIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSR 279
Query: 357 CS 358
CS
Sbjct: 280 CS 281
>gi|406698011|gb|EKD01258.1| hypothetical protein A1Q2_04415 [Trichosporon asahii var. asahii
CBS 8904]
Length = 354
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAI 343
A++ S NH C PN+HI W DN + A+R + GEE+ I Y ++ + RQA
Sbjct: 149 GAVFEFISRINHSCRPNSHICWNDNLGMETVHAIRTISSGEEITINYGHTGVSEE-RQAW 207
Query: 344 LTQGFGFQCNCLRCSSG 360
L + F F C C C G
Sbjct: 208 LKKRFKFDCACEVCGLG 224
>gi|393233922|gb|EJD41489.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 378
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQAI 343
+ + + S NH C PNA W A RD+ GEE+ + YID SM + R+A
Sbjct: 302 SGVMQIMSRANHSCAPNAAYQWDRTKFHGVFTAARDIVAGEEVTLSYIDQSMPKKDRRAE 361
Query: 344 LTQGFGFQCNCLRCS 358
L + + F+C C RC
Sbjct: 362 LRKKYLFKCTCERCG 376
>gi|255080882|ref|XP_002504007.1| predicted protein [Micromonas sp. RCC299]
gi|226519274|gb|ACO65265.1| predicted protein [Micromonas sp. RCC299]
Length = 385
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNAD-----ARLMALRDVEEGEELRICYIDASMARD 338
G A++ L NH C+PNA + + D A + ALR++ GEELR Y+D +
Sbjct: 301 GVALFPLTCLMNHSCEPNAEVRFEDAGPGAGVVAAVHALREIRVGEELRHSYVDETRPVF 360
Query: 339 ARQAILTQGFGFQCNCLRCS 358
R A L FGF+C+C RC+
Sbjct: 361 LRAADLAA-FGFRCDCGRCA 379
>gi|154308769|ref|XP_001553720.1| hypothetical protein BC1G_07807 [Botryotinia fuckeliana B05.10]
Length = 409
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 49/91 (53%), Gaps = 4/91 (4%)
Query: 270 LSSAAASIESEIA---VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEEL 326
LS+ +ES + +G + L + NH C PNA I + D A L AL + +GE++
Sbjct: 83 LSTNGFRVESNVGNGPIGLCLDPLLARANHSCRPNAAITF-DGKRATLRALSPIAKGEQI 141
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRC 357
I YID + ++ R+ L + + FQC C RC
Sbjct: 142 FISYIDETQRQEVRREALAKTWFFQCLCSRC 172
>gi|451848012|gb|EMD61318.1| hypothetical protein COCSADRAFT_96412 [Cochliobolus sativus ND90Pr]
Length = 417
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 286 AIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
A++ S NH C PNA +++ D+ + R + EGEE+ I Y + RQ L
Sbjct: 234 AVWPETSRLNHACAPNAQYVIDTDHLSHTVRVTRPIAEGEEITISYTSPLEPTETRQHHL 293
Query: 345 TQGFGFQCNCLRCSS 359
QGF F C C RC+S
Sbjct: 294 AQGFHFTCTCPRCTS 308
>gi|407425989|gb|EKF39557.1| hypothetical protein MOQ_000211 [Trypanosoma cruzi marinkellei]
Length = 504
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYID--ASMARD 338
A G A+Y + NH C P+ + A ++ALRD+E+GEE+R YID A +R
Sbjct: 423 ATGVALYDAATKINHSCVPSVRFIPTHGRVSAVVVALRDIEKGEEIRSSYIDLPAHTSRV 482
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L +GF+C+C C
Sbjct: 483 ERREYLLSHYGFECDCSLC 501
>gi|302844915|ref|XP_002953997.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
gi|300260809|gb|EFJ45026.1| SET and zf-MYND domain-containing protein [Volvox carteri f.
nagariensis]
Length = 487
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNAD--ARLMALRDVEEGEELRICYIDASMARDARQA 342
A L S NH CDPN + D D ++A RD+ GEE+ + YID ++ RQA
Sbjct: 390 TAFLALQSCINHSCDPNCTAV-CDTGDRTVTVLAQRDIAAGEEITLSYIDVTLPYKERQA 448
Query: 343 ILTQGFGFQCNCLRC 357
L + +GF C C RC
Sbjct: 449 AL-RDYGFVCRCTRC 462
>gi|361129398|gb|EHL01305.1| putative SET domain-containing protein 5 [Glarea lozoyensis 74030]
Length = 286
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 42/78 (53%), Gaps = 5/78 (6%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARDARQ 341
++ S NH C PNA W + + + A++D++EGEE+ Y+ +++ R+
Sbjct: 76 GVFENISRINHGCLPNAVQNWNGLLGEEGEETVYAIKDIKEGEEITTSYLSGGTSKE-RR 134
Query: 342 AILTQGFGFQCNCLRCSS 359
A+L Q FGF C C C S
Sbjct: 135 AVLKQSFGFDCTCKLCDS 152
>gi|221506467|gb|EEE32084.1| hypothetical protein TGVEG_071970 [Toxoplasma gondii VEG]
Length = 521
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADAR----LMALRDVEEGEELRICYIDASMARDAR 340
+Y L + NH CDPN + + + LR V GEEL + Y+D S+ AR
Sbjct: 427 GGVYTLHACANHSCDPNCGVSSSGSEGGGSTLAVATLRAVAPGEELTVSYVDISLPLKAR 486
Query: 341 QAILTQGFGFQCNCLRC 357
+ +L FGF C C +C
Sbjct: 487 REMLLSSFGFLCRCSKC 503
>gi|198454620|ref|XP_001359652.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
gi|198132877|gb|EAL28802.2| GA17400 [Drosophila pseudoobscura pseudoobscura]
Length = 392
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 12/125 (9%)
Query: 243 QWYTNV----LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDC 298
QW TNV L + A + LY + A + +E G+ +Y+L S NH C
Sbjct: 245 QWVTNVSDLPLPEADKTALDT-VIDELYSKVGEFAGEFLNNE---GSGLYLLQSKINHSC 300
Query: 299 DPNA-HIMWIDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNC 354
PNA N L AL ++EGEE+ I Y+D +R +R +L + + F C C
Sbjct: 301 VPNACSTFPYSNDIVVLKALAPIQEGEEICISYLDECQLERSRHSRHKVLRENYIFVCQC 360
Query: 355 LRCSS 359
L+C +
Sbjct: 361 LKCQA 365
>gi|429849789|gb|ELA25132.1| putative protein lysine methyltransferase [Colletotrichum
gloeosporioides Nara gc5]
Length = 413
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI---DASMARDAR 340
G + S++NH C PN + ++ A +D+ GEE+ I Y+ + M R+AR
Sbjct: 335 GYGCWPSASYFNHSCGPNI-VKRREDRVWEFRAAKDIASGEEMNITYLGGEEKDMPRNAR 393
Query: 341 QAILTQGFGFQCNCLRC 357
AIL + +GF C C RC
Sbjct: 394 MAILRKNWGFDCGCKRC 410
>gi|408398201|gb|EKJ77334.1| hypothetical protein FPSE_02412 [Fusarium pseudograminearum CS3096]
Length = 543
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQA 342
G+ +++ S NH CD NA + + + ALRD+EEGEE+ I Y+ R RQ
Sbjct: 102 GSGVFLQASRINHACDNNAQKDYNEGIKRHTVHALRDIEEGEEITITYLGILKNRRTRQQ 161
Query: 343 ILTQGFGFQCNCLRCS 358
L F F C C CS
Sbjct: 162 ALRTKFMFTCTCNLCS 177
>gi|116788911|gb|ABK25047.1| unknown [Picea sitchensis]
Length = 491
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 44/77 (57%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G+A + L S NH C+PN + + A L+A+R + +GE++ I YI+ M R
Sbjct: 408 GSAFFPLQSCINHSCEPNCKAFKREQDRDGQAILIAIRPIMKGEQIFISYIEEDMPWKER 467
Query: 341 QAILTQGFGFQCNCLRC 357
QA+L+ +GF C C RC
Sbjct: 468 QALLSD-YGFACKCCRC 483
>gi|116181330|ref|XP_001220514.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
gi|88185590|gb|EAQ93058.1| hypothetical protein CHGG_01293 [Chaetomium globosum CBS 148.51]
Length = 348
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 61/123 (49%), Gaps = 17/123 (13%)
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
+ +FL +Q + + ++ N+FRI + G D SA + E+ S +N
Sbjct: 138 RARFL-RQVGSTIYDKVERNSFRIFVDG----DRKHSAHLGVFPEV----------SKFN 182
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
HDC PN H ++ +A+RD+ G+EL I YI + RQ+ L + +GF C C
Sbjct: 183 HDCRPNVHYR-LNGLKHTTIAVRDIPAGDELTISYIYGRASHATRQSQLRE-WGFTCTCP 240
Query: 356 RCS 358
+C+
Sbjct: 241 QCT 243
>gi|310795375|gb|EFQ30836.1| hypothetical protein GLRG_05980 [Glomerella graminicola M1.001]
Length = 411
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 47/90 (52%), Gaps = 6/90 (6%)
Query: 276 SIESEIA--VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI-- 331
S+E E + G+ + S++NH C PN A A RDV +GE+L I Y+
Sbjct: 316 SLEDEGSELFGHGCWPAASYFNHSCGPNVDKRREGRA-WEFRASRDVGKGEQLCITYLGG 374
Query: 332 -DASMARDARQAILTQGFGFQCNCLRCSSG 360
+ +M R+ R + L + +GF+C C RC G
Sbjct: 375 EEKAMPRETRMSTLRRNWGFECGCKRCEEG 404
>gi|255089819|ref|XP_002506831.1| set domain protein [Micromonas sp. RCC299]
gi|226522104|gb|ACO68089.1| set domain protein [Micromonas sp. RCC299]
Length = 468
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G + L S N DCDPN + D + L+A R V++GEEL +CY+D R
Sbjct: 376 GTGFFALQSQLNSDCDPNVTPLKDDGDVDGSCVLVAKRAVKKGEELTMCYVDEDADVRER 435
Query: 341 QAILTQGFGFQCNCLRC 357
+A L +GF+C C RC
Sbjct: 436 RAELAD-YGFECACERC 451
>gi|336266212|ref|XP_003347875.1| hypothetical protein SMAC_06707 [Sordaria macrospora k-hell]
gi|380091808|emb|CCC10536.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 518
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 295 NHDCDPNA--HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NHDC PNA W + A+RD+ GEE+ I YI+ +R ARQ L +GF C
Sbjct: 289 NHDCRPNADYRFDWEKGLVQVITAVRDIFPGEEITISYINPLQSRHARQKHLRTVWGFDC 348
Query: 353 NCLRCS 358
+C CS
Sbjct: 349 SCELCS 354
>gi|255644430|gb|ACU22720.1| unknown [Glycine max]
Length = 381
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/301 (22%), Positives = 120/301 (39%), Gaps = 48/301 (15%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITH---PTLSTLNS---VCYFCLR--KITSSSQHF--- 137
GRG+ A++ ++A ++ T P+I + P L+ +S C C R +T +S
Sbjct: 8 GRGLVASQPLKAGQIVLTESPLILYSASPLLTPSSSPYTYCDHCFRILPLTHNSTTVTCP 67
Query: 138 QHHNARFCGEVCKDNA----------KAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
N FC + C A KA +++ + + + + + ++ L+V
Sbjct: 68 SCSNHSFCSQKCFSLALKSSHSTWVCKALMSLQQHPNSTLLQQHPQERQVQARLIVASHK 127
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+ + S +D +P+ + F L S Q L+
Sbjct: 128 LFLHNHTPSE--LDTFLSLHGTPDDAILDAANF--LHSLISPLFPPQAQ---LSVDLIAQ 180
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMW 306
+LA+ R+N+F L+ + AIY +F+NHDC PNA +
Sbjct: 181 LLAKDRLNSF----------GLMDPYSPDGPQRSIKAYAIYPKATFFNHDCVPNACRFDY 230
Query: 307 IDNA---------DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+D+ D + + DV+EG+E+ I Y R+ IL + +GF C C RC
Sbjct: 231 VDSTNDDYERNSTDIVIRLIEDVDEGKEVCISYFRIGRDYCTRKRILMEDYGFTCGCDRC 290
Query: 358 S 358
+
Sbjct: 291 T 291
>gi|221486765|gb|EEE25011.1| conserved hypothetical protein [Toxoplasma gondii GT1]
Length = 521
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADAR----LMALRDVEEGEELRICYIDASMARDAR 340
+Y L + NH CDPN + + + LR V GEEL + Y+D S+ AR
Sbjct: 427 GGVYTLHACANHSCDPNCGVSSSGSEGGGSTLAVATLRAVAPGEELTVSYVDISLPLKAR 486
Query: 341 QAILTQGFGFQCNCLRC 357
+ +L FGF C C +C
Sbjct: 487 REMLLSSFGFLCRCSKC 503
>gi|167383128|ref|XP_001736416.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901228|gb|EDR27341.1| hypothetical protein EDI_341180 [Entamoeba dispar SAW760]
Length = 426
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/306 (23%), Positives = 128/306 (41%), Gaps = 44/306 (14%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSS------SQHFQHHNA 142
G+GVF+ +R + ++I PII + + C FCL+ + S +Q F+ N
Sbjct: 126 GKGVFSKKRYQKEEIIMKEIPIIQISKIEDYYNCCGFCLKTLCSKEPKNKYNQIFKEINK 185
Query: 143 RFCGEV--CKDNAKAFYDVERRADWS---------AFNDYCRSQGLKYPLLVKRL-ACMI 190
E+ CK K ++ D++ +YC + +PL + ++ A ++
Sbjct: 186 E---EIIKCKCGMKYCNFKCQQLDYTHSLLCNKLKGLINYCNINHISHPLCIAKIFATIL 242
Query: 191 ISG--AESADCIDILQPASLSP---ELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWY 245
IS +S + + L P E++ F L F G + L
Sbjct: 243 ISQNIEKSLFPFVVFHSSPLIPFNSEIVPFFLNIFGSLLLKFNIYGGWTFIYQILYSVLK 302
Query: 246 TNVLAQIRINAFRIELAGG----LYEDLLSSA--------AASIESEIAVGNAIYMLPSF 293
N + + +N + L + LLSS+ A +IE+E ++ +
Sbjct: 303 YNSSSILPLNPIQSLLLNNNSLLIDNTLLSSSLINHPDITAFTIEAE-----GLFKYLNT 357
Query: 294 YNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NH C PN + D++ A L+A + G+EL I YID ++ RQ++L + F C
Sbjct: 358 LNHSCSPNCFLANTDDSFALSLIASCPISHGDELTISYIDNTLPYSQRQSLLYDSYHFYC 417
Query: 353 NCLRCS 358
+C +C+
Sbjct: 418 HCPKCN 423
>gi|158288115|ref|XP_309979.4| AGAP011530-PA [Anopheles gambiae str. PEST]
gi|157019319|gb|EAA05706.4| AGAP011530-PA [Anopheles gambiae str. PEST]
Length = 522
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/290 (21%), Positives = 114/290 (39%), Gaps = 38/290 (13%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ-HF 137
P ++ E GR + ATR I+A +++ P++ P T VC CL+ +
Sbjct: 40 PFKIFSNEQYGRFLVATRDIKAGEIVLKESPLVHGPAQIT-GPVCVGCLQGLEEKKYLDC 98
Query: 138 QHHNARFCGEVCKDN----AKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISG 193
+ C C+D+ A+ + + R + S + Y + + V+ C++++
Sbjct: 99 ERCGWPVCKRSCQDSPSHQAECKFTIARGSKISIQHFYVPHPTYQCLMPVR---CLLLAE 155
Query: 194 AESADCIDILQPASLSPELILAME-----EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
++ A +L+ S E + + EG L F K ++ + L V
Sbjct: 156 SDPARWETLLKLESHEEERRGSEQWRNDREGVAKLIPRFFKCENKWDEDEILR------V 209
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
+ I++N + + + +S+ AIY + S H C PN + +
Sbjct: 210 VGIIQVNGHEVPM----------TEPSSV--------AIYNMASMLEHSCRPNLAKSFTN 251
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + A + G+ L ICY D R L Q F+C C RCS
Sbjct: 252 RGEVVMWAPNPIRRGDRLSICYTDVLWTTGNRLEHLQQTKMFRCECERCS 301
>gi|410914273|ref|XP_003970612.1| PREDICTED: SET and MYND domain-containing protein 5-like [Takifugu
rubripes]
Length = 417
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/99 (32%), Positives = 53/99 (53%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + E G+ +++L S NH C PNA + + +N +L L D+ G
Sbjct: 287 LYKDIEKETGDFLNCE---GSGLFLLQSSCNHSCIPNAEVSFPNNNFLLQLSTLSDISPG 343
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F C+C++C S
Sbjct: 344 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCIKCVS 382
>gi|403164683|ref|XP_003324753.2| hypothetical protein PGTG_06290 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165269|gb|EFP80334.2| hypothetical protein PGTG_06290 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 463
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 40/76 (52%), Gaps = 3/76 (3%)
Query: 287 IYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
+YM + NHDC PN + + + ALR++ GEEL I YI +R+ R+ L
Sbjct: 257 LYM--QYLNHDCRPNVGYRFNNVTIQVEMHALREIAPGEELSISYITLVQSREKRRKSLH 314
Query: 346 QGFGFQCNCLRCSSGD 361
+GF C C +CS D
Sbjct: 315 GTYGFHCGCSQCSLSD 330
>gi|212276262|ref|NP_001130593.1| uncharacterized protein LOC100191692 [Zea mays]
gi|194689580|gb|ACF78874.1| unknown [Zea mays]
gi|414865032|tpg|DAA43589.1| TPA: hypothetical protein ZEAMMB73_306282 [Zea mays]
Length = 544
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED ++ + G +++LPSF NH C PNA I + A + A RD++ GE
Sbjct: 302 LTEDAPAADVLGNNGVVNCGVGLWVLPSFINHSCHPNARRTHIGD-HAIVHASRDIKAGE 360
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + R+ ++ +GF+C C RC
Sbjct: 361 EITFPYFDVLVPVSKRREA-SRAWGFECKCDRC 392
>gi|16741406|gb|AAH16525.1| SET and MYND domain containing 5 [Mus musculus]
Length = 300
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 171 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFVLHVTALEDIKPG 227
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 228 EEICISYLDCCQRERSRHSRHKILRENYLFNCSCPKC 264
>gi|449544639|gb|EMD35612.1| hypothetical protein CERSUDRAFT_124865 [Ceriporiopsis subvermispora
B]
Length = 406
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 49/99 (49%), Gaps = 10/99 (10%)
Query: 269 LLSSAAASIESEIAVGN-----AIYMLPSFYNHDCDPNAHIMWIDNADARLMA---LRDV 320
+ + A I + A GN +++L S +NH C PN W +A A+ MA LR V
Sbjct: 187 IFDTNALPIPTTDAKGNPQTDLGLFLLGSRFNHSCTPNVARTW--DAAAKAMAFRVLRAV 244
Query: 321 EEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
E GEEL Y+D R R A L + + F C C C++
Sbjct: 245 EPGEELCTNYVDVLGTRAERAAELEEKYAFACLCSVCAA 283
>gi|389630896|ref|XP_003713101.1| hypothetical protein MGG_16952 [Magnaporthe oryzae 70-15]
gi|351645433|gb|EHA53294.1| hypothetical protein MGG_16952 [Magnaporthe oryzae 70-15]
Length = 225
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 59/113 (52%), Gaps = 7/113 (6%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+LA R F ++ ED+L + ++ A +++ S NH+C PN + +
Sbjct: 8 ILALARTGGFESQV-----EDVLRTNIFGVDVGGAFHMGLFVEGSRVNHNCRPNVYWEYD 62
Query: 308 DNADAR-LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
A+ ++ALRD+E+GEEL Y+ +R R+ L + +GF+C C CS+
Sbjct: 63 TKTMAQEVVALRDIEQGEELTHSYVTLGGSRSQRREEL-EAWGFECKCALCSA 114
>gi|409050375|gb|EKM59852.1| hypothetical protein PHACADRAFT_250621 [Phanerochaete carnosa
HHB-10118-sp]
Length = 462
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/344 (22%), Positives = 132/344 (38%), Gaps = 71/344 (20%)
Query: 82 VALTESAGRGV-FATRRIRASDLIHTAKPIITHPTLSTL--NSVCYFCLRKITSSSQHFQ 138
V + + G+ V FA + D+++ PII+ L S C +CL+ + S++
Sbjct: 80 VRVQDKDGKKVLFADKDFEKGDVVYKEYPIISALDWDLLEQGSHCVYCLKLVHKVSEYRP 139
Query: 139 HH---NARFCGEVCKDNAKAFY-----------------------DVERRADWSAFNDYC 172
+ N +C + C AK Y ER +A +Y
Sbjct: 140 ENDRLNTVYCSKDCYTKAKQQYHGILFTLDPLLPPEMAAESGAASPEEREKAQAALVEYL 199
Query: 173 RSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELI------LAMEEGFVMLRSA 226
+S LV + + I+ S + P P+ I A+ + LR
Sbjct: 200 KSNQKLANFLVAKWVALQITIETSKVVPGAVHPDKDLPKYIEDDAPEYAIGDHLERLR-- 257
Query: 227 FKKAGIDDEQMKFLNKQW---------------YTNVLAQIRINAFRIELAGGLYEDLLS 271
+ A + E+ + L K + + L ++ NA I +GG +D
Sbjct: 258 YLDASVTPEETEVLRKVFGAALPGLEGSLTEDRHVTTLGKMLYNAIGICYSGG-RDDKPQ 316
Query: 272 SAAASIESEIA---------VGNAIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVE 321
A + E +G+ ++++ S+ H CDP+ + + LMA R ++
Sbjct: 317 LAERPEDQERTRTPYGTSRQIGSGLFLVSSYIQHSCDPSVRPSFPGGTNKLCLMATRAIK 376
Query: 322 EGEELRICYIDASM--------ARDARQAILTQGFGFQCNCLRC 357
EGEEL + Y+D + AR R+ L +G+ F+C C RC
Sbjct: 377 EGEELTMAYVDVTQHEGESAEEARRRRRHELARGWRFKCECTRC 420
>gi|392587935|gb|EIW77268.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 656
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/320 (20%), Positives = 126/320 (39%), Gaps = 71/320 (22%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC--------LR 128
P ++V T GRG++A + IRA ++ + +P + ++ L++ C +C L+
Sbjct: 64 PYALEVRSTPQCGRGLWAKQGIRAGTVLISVRPHVHVLSIPHLDTACSYCANAAPKNPLK 123
Query: 129 KITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
+ T R+C C+ ++DW C +
Sbjct: 124 RCTGC------QRVRYCDAECQ-----------KSDWRLHKHECGA-------------- 152
Query: 189 MIISGAESADCIDILQPASL--------------SPELILAME-EGFVMLRSAFKKAGID 233
+ + A++A D+ P+ PE E + RS+ + +
Sbjct: 153 -LQNWAKAAPSADLAVPSDAVRCMGRALRDRQKKGPESAWVKEMDAMQSHRSSLHPSSFE 211
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLS-----SAAASIESEIAVGNAIY 288
+ Y + + + AF I A L DL S S + + +G ++
Sbjct: 212 SHTHLAHSIVRYLGLSSPGDLTAFGITSARDLV-DLTSRFTSNSFTLATPALAPIGVSVS 270
Query: 289 MLPSFYNHDCDPNAHIMWIDNADA----------RLMALRDVEEGEELRICYIDASMARD 338
L + NH CDPNA +++ ++ A +++A++D+E E+ YID ++ +
Sbjct: 271 PLIALINHSCDPNAVVVFPRSSTASTPAKLEPQMQVVAIKDIEPDTEILTSYIDTTLPQP 330
Query: 339 ARQAILTQGFGFQCNCLRCS 358
RQ+ L + + F C C C+
Sbjct: 331 IRQSALKETYCFTCACTLCA 350
>gi|398401723|ref|XP_003853182.1| hypothetical protein MYCGRDRAFT_85907 [Zymoseptoria tritici IPO323]
gi|339473064|gb|EGP88158.1| hypothetical protein MYCGRDRAFT_85907 [Zymoseptoria tritici IPO323]
Length = 320
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYIDAS---MARDAR 340
++ S +NH C NA W + + R+ A RD+ GEE+ + Y++ + R R
Sbjct: 97 GGVFPTCSRFNHSCTANAAYSWNEGQKEERVYATRDISAGEEIVVSYLNEEIWRLPRQER 156
Query: 341 QAILTQGFGFQCNCLRCSSGD 361
+ + + FGF C C+RC S +
Sbjct: 157 KRQILETFGFDCQCVRCGSSE 177
>gi|407867908|gb|EKG08717.1| hypothetical protein TCSYLVIO_000127 [Trypanosoma cruzi]
Length = 697
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA-RQ 341
+G A+Y S++NH C PN + A ALR++ +GE L ICY+D A R+
Sbjct: 567 IGVALYPEASYFNHSCCPNICRVTYRGILAAFHALREIRKGEPLTICYVDVQETSTAERR 626
Query: 342 AILTQGFGFQCNCLRCSSG 360
L + F C C RCS
Sbjct: 627 RTLFSSYRFFCECARCSGA 645
>gi|302654157|ref|XP_003018889.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
gi|291182573|gb|EFE38244.1| SET and MYND domain protein, putative [Trichophyton verrucosum HKI
0517]
Length = 498
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 52/112 (46%), Gaps = 17/112 (15%)
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
LA+I +N+F A G +G I + NH C+PNA ++ D
Sbjct: 189 LARIEVNSFTFTTAFG----------------DPLGLCIQPFACYMNHSCEPNA-VVGFD 231
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ ALR+++ E++ I YID + + RQ L + + F C C +C+ G
Sbjct: 232 GGLIIVKALREIKPDEQVFISYIDNTYPLEVRQKQLAERYFFTCKCSKCAQG 283
>gi|357453499|ref|XP_003597027.1| Histone-lysine N-methyltransferase ATXR2 [Medicago truncatula]
gi|355486075|gb|AES67278.1| Histone-lysine N-methyltransferase ATXR2 [Medicago truncatula]
Length = 659
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDA 339
+ A + L S NH C PNA D + A ++ALR + +GEE+ I Y+D + +
Sbjct: 578 IRTAFFPLQSCLNHSCCPNAKAFKRDEDRDGQAIIIALRSISKGEEITISYVDEDLPFEE 637
Query: 340 RQAILTQGFGFQCNCLRC 357
RQA L +GF+C C +C
Sbjct: 638 RQASLAD-YGFRCRCPKC 654
>gi|71420461|ref|XP_811494.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70876164|gb|EAN89643.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 663
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 69/291 (23%), Positives = 113/291 (38%), Gaps = 42/291 (14%)
Query: 90 RGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI------TSSSQHFQHHNAR 143
+GV A R + ++I T P++ C +CLR ++ + +
Sbjct: 232 KGVEAKRHLEPGEII-TEAPVVHALYDDHWGIRCCYCLRVTQALYPGSAYRERGKLSRGL 290
Query: 144 FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACM--IISGAES--ADC 199
FCGEVC + Y A+ F C L + R+ C +SG S D
Sbjct: 291 FCGEVCARLSWERYGQHETAN--PFFQLCPIDAL----IASRMMCSDCAVSGCHSLRGDF 344
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDD-EQMKFLNKQWYTNVLAQIRINAFR 258
L PA++ I E + ++D ++++ +Q + + +F
Sbjct: 345 TGELHPAAV----IGGYETAVSLCALVLDAVSLEDVDRLRLAQRQ--------VMLCSFE 392
Query: 259 IELAGGLYEDLLSSAAASIESE---IAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA--- 312
I+ G + S + E I VG A+Y+ S Y H CDPN ++ N
Sbjct: 393 IKFWTGSQVTINSETREAFIDESRPIPVGKALYVSASQYRHSCDPNCFASFVGNPLGCSL 452
Query: 313 --RLMALRDVEEGEELRICYIDASMARDA----RQAILTQGFGFQCNCLRC 357
+ A+R + GEEL I Y + + + R+ L + GF CNC C
Sbjct: 453 YLAIRAIRPIPAGEELTISYHNMTTYKAVSAQFRRRALAERCGFICNCRAC 503
>gi|402217590|gb|EJT97670.1| hypothetical protein DACRYDRAFT_111714 [Dacryopinax sp. DJM-731
SS1]
Length = 531
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 62/127 (48%), Gaps = 20/127 (15%)
Query: 235 EQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFY 294
+++ F + + TN+L++ + N+F + DL + VG AI L +
Sbjct: 176 KELGFSSIRDLTNLLSRFQTNSFSLTTP-----DLTN-----------VGVAISPLAALI 219
Query: 295 NHDCDPNAHIMWIDNADAR----LMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+H C PNA +++ R ++ALRD++ GEE+ Y+D ++ R R L + F
Sbjct: 220 SHSCMPNAVVVFPTGLGRRGGLEVIALRDLQPGEEVLTSYVDIALPRSLRWKELKDRYLF 279
Query: 351 QCNCLRC 357
C C+ C
Sbjct: 280 DCECVLC 286
>gi|242036801|ref|XP_002465795.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
gi|241919649|gb|EER92793.1| hypothetical protein SORBIDRAFT_01g045950 [Sorghum bicolor]
Length = 543
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 2/93 (2%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED ++ + G +++LPSF NH C PNA I + A + A RD++ GE
Sbjct: 301 LTEDAPAADVLGNNGVVNCGVGLWVLPSFINHSCHPNARRTHIGD-HAIVHASRDIKAGE 359
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + R+ ++ +GF+C C RC
Sbjct: 360 EITFPYFDVLVPVSKRREA-SRAWGFECKCDRC 391
>gi|154290093|ref|XP_001545647.1| hypothetical protein BC1G_15826 [Botryotinia fuckeliana B05.10]
Length = 545
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMW---IDNADARLMALRDVEEGEELRICYIDASMARDAR 340
G + L S +NH CDPN I W +N+ + A RD++ GEE+ I YI + R
Sbjct: 457 GTGVNPLYSMFNHSCDPN--IDWRHDDENSTVTMFAERDIKNGEEMFISYIGKGKGLEER 514
Query: 341 QAILTQGFGFQCNCLRC 357
+ L FG C C +C
Sbjct: 515 RRKLMPWFGMDCACHKC 531
>gi|412988885|emb|CCO15476.1| Predicted histone tail methylase containing SET domain (ISS)
[Bathycoccus prasinos]
Length = 589
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICY---IDASMARDARQA 342
+ + + +NH C PN + + + LRD+ GEEL I Y I + D R+
Sbjct: 250 STHTVSQHFNHSCHPNCAVSNTVDEACSIYTLRDIRAGEELTINYAGPILTWVPVDIRRG 309
Query: 343 ILTQGFGFQCNCLRC 357
IL + FGF+C+CLRC
Sbjct: 310 ILRKQFGFECHCLRC 324
>gi|380022731|ref|XP_003695192.1| PREDICTED: protein msta, isoform A-like [Apis florea]
Length = 554
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/291 (20%), Positives = 112/291 (38%), Gaps = 37/291 (12%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH- 139
++A +E GR + A++ +RA ++I P+ P N +C+ CL + Q F +
Sbjct: 18 KIAYSEKLGRYLQASKDLRAGEVILREDPVAVGPMSCMKNPICFECLSLLPDIEQDFHYI 77
Query: 140 ----HNARFCGEVCKDNA--KAFYD---VERRADWSAFNDYCRSQGLKYPLLVKRLACMI 190
+ CG C++ A + Y+ ++ + S N + L L V RL +
Sbjct: 78 CSGCNVVTLCGFSCEERAYYHSMYECEIIKNNMELSIEN----TDTLAAVLFVLRLWLLK 133
Query: 191 ISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
E + L+ + E+ + + + K E N + ++
Sbjct: 134 QKDPELWKRVLSLESHLNKRRNTIVWEDREINIVNVLKSLNFVPE-----NDPSVSEIIQ 188
Query: 251 QI----RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
Q+ +N+F + GG+ LL +Y+ S HDC N H+
Sbjct: 189 QLCGILDVNSFELRSPGGMDGLLL--------------RGLYLEASLMAHDCRGNVHVTA 234
Query: 307 IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
DN + A ++EG+ + Y + + R+ L G F+C C C
Sbjct: 235 DDNFHLTVYASIPIKEGDVIFFNYTSSLLGTTGRREYLRTGKYFECECDLC 285
>gi|409047429|gb|EKM56908.1| hypothetical protein PHACADRAFT_208075 [Phanerochaete carnosa
HHB-10118-sp]
Length = 606
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 118/300 (39%), Gaps = 36/300 (12%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI--TSSS 134
PP I+ ++ GRG++A + ++ + +P I+ + S L C C T
Sbjct: 73 PPAIEARKSKDKGRGLYAKQAFAPGTILLSVRPCISVLSSSQLGFYCSHCAGPAPETGLK 132
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP-----LLVKRLACM 189
+ R+C +C+++ A + +E +A + D S+ + P L + L M
Sbjct: 133 RCTGCRTTRYCNAMCQNSDWAVHKMECQA-LQRWADAAPSEDVAVPSDAVRCLGRMLWSM 191
Query: 190 IISGAES--ADCIDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW- 244
G +S + +LQ SL P + + +G D LN
Sbjct: 192 QRKGFDSQWTKEMSVLQSHRGSLPPSQFESHTHLAHSVVRYIGASGPSDLASFGLNSAGD 251
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
++++ N F L+S S VG I + NH CDPN I
Sbjct: 252 LVGLISRFTTNTF-----------TLTSYTLS-----PVGICISPAMALTNHSCDPNVVI 295
Query: 305 MW-------IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++ +L+ALR + EE+ Y+D ++ ++ RQ+ L + + F C C C
Sbjct: 296 VFPRSSKKVTKEPQMQLIALRSILHEEEIMTAYVDVTLPKELRQSALKEAYNFTCMCSLC 355
>gi|342182166|emb|CCC91645.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 177
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 64/124 (51%), Gaps = 12/124 (9%)
Query: 245 YTNVLAQIRINAFRIELAGGLY------EDLLSSAAASIESE--IAVGNAIYMLPSFYNH 296
Y + A + A + LA L+ E++LS A +E ++ G A+Y SF NH
Sbjct: 51 YDSQTALSEVGALAVTLADALHQEHLYMEEILSLFALVQTNEFLLSDGMALYYGYSFLNH 110
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICY---IDASMARDARQAILTQGFGFQCN 353
C+PN ++ + RL+ LRD+ EGE+L I Y + ++ + R+A L Q F+C
Sbjct: 111 SCEPNCGLVGSNAVSKRLVTLRDIREGEQLFINYNANLTTCVSYEDRRA-LCQQRHFECF 169
Query: 354 CLRC 357
C +C
Sbjct: 170 CPKC 173
>gi|406698010|gb|EKD01257.1| hypothetical protein A1Q2_04414 [Trichosporon asahii var. asahii
CBS 8904]
Length = 359
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAIL 344
AIY + NH C PNA+ W N + ALRD+++GEE+ I YI R R+ +
Sbjct: 168 AIYPTVCYINHSCIPNANHCWNKNLKMETIHALRDIKKGEEITIAYI-LEGTRAERRVHV 226
Query: 345 TQGFGFQCNCLRCSSGD 361
+ GF+C C C + D
Sbjct: 227 HERLGFECQCEACGAPD 243
>gi|302817324|ref|XP_002990338.1| hypothetical protein SELMODRAFT_447985 [Selaginella moellendorffii]
gi|300141900|gb|EFJ08607.1| hypothetical protein SELMODRAFT_447985 [Selaginella moellendorffii]
Length = 733
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 4/73 (5%)
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID--ASMARDARQAIL 344
++ LP+F NH C PN + + A A RD+ G+EL I Y+D A + + R+A L
Sbjct: 480 VWGLPAFVNHSCLPNVSKLEVGKA-TMFHAARDLHAGDELLISYVDPYAPWSTERRKA-L 537
Query: 345 TQGFGFQCNCLRC 357
+ +GF+C+C RC
Sbjct: 538 VEDWGFECHCRRC 550
>gi|308812131|ref|XP_003083373.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
gi|116055253|emb|CAL57649.1| Predicted histone tail methylase containing SET domain (ISS)
[Ostreococcus tauri]
Length = 190
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 38/66 (57%)
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
S NH C+P+ + +I +A A ++A RD+ +GEE+ I Y+ S R+ L +GF
Sbjct: 103 SNINHSCEPSCEVAFIHDARAHVIATRDITKGEEITISYVPGSWPLRRRRKELLDRYGFA 162
Query: 352 CNCLRC 357
C+C C
Sbjct: 163 CDCALC 168
>gi|71664694|ref|XP_819325.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70884621|gb|EAN97474.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 504
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYID--ASMARD 338
A G A+Y + NH C P+ + A ++ALRD+E+GEE+R YID A +R
Sbjct: 423 ATGVALYDAATKINHSCVPSVRFVPTHGRVGAVVVALRDIEKGEEIRSSYIDLVAYNSRV 482
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L +GF+C+C C
Sbjct: 483 ERRGYLLSHYGFECDCSLC 501
>gi|383858904|ref|XP_003704939.1| PREDICTED: protein msta, isoform A-like [Megachile rotundata]
Length = 530
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 115/297 (38%), Gaps = 47/297 (15%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLR---KITSSSQHF 137
+VA +E GR + A + ++A ++I KPI P S+ + +C+ CLR K+ +Q+
Sbjct: 16 KVACSEKFGRYLQAAKNLQAGEVILREKPIAVGPITSSNDYLCFACLRLLPKLKKGTQYV 75
Query: 138 --QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAE 195
+ + A CG C++ K E C I E
Sbjct: 76 CSKCNIAPLCGTACQEQPKHHTSYE---------------------------CEIFKTNE 108
Query: 196 --SADCID----ILQPASL------SPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ 243
SAD I+ IL P L PEL +E + + D + +N
Sbjct: 109 NLSADNIETLIGILLPLRLWLVKQTDPELSKRIESMEAHMEERRNTSVWRDRETNVINVM 168
Query: 244 WYTNVLAQ--IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
+++ + I + +L G L + + + + +Y+ SF HDC N
Sbjct: 169 KALHLIPEDDTSIPEYLQQLCGILDVNTFE-LRSPGGLDGLLLRGLYVEASFMAHDCRGN 227
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
H+ D+ + A ++EGE + Y + + R+ L G F+C C CS
Sbjct: 228 THLTVDDHFQLTVYASIPIKEGEAILFNYTSSLLGTLGRREHLRGGKYFECECSLCS 284
>gi|403170123|ref|XP_003329506.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168575|gb|EFP85087.2| hypothetical protein PGTG_11256 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 647
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 12/86 (13%)
Query: 286 AIYMLPS--FYNHDCDPNAHIMWIDNADA----------RLMALRDVEEGEELRICYIDA 333
+ +PS F NH C+PNA +++ + + +++A++ +E GEE+ YID+
Sbjct: 295 GVVFVPSAAFINHSCNPNAVVVFPEGGEGAGSTAGKEWVKVIAIKPIEPGEEIVTSYIDS 354
Query: 334 SMARDARQAILTQGFGFQCNCLRCSS 359
+ R R+ L + + F C+C C S
Sbjct: 355 AGTRQERRNELVKRYKFVCDCQACQS 380
>gi|347831206|emb|CCD46903.1| similar to MYND domain protein [Botryotinia fuckeliana]
Length = 545
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMW---IDNADARLMALRDVEEGEELRICYIDASMARDAR 340
G + L S +NH CDPN I W +N+ + A RD++ GEE+ I YI + R
Sbjct: 457 GTGVNPLYSMFNHSCDPN--IDWRHDDENSTVTMFAERDIKNGEEMFISYIGKGKGLEER 514
Query: 341 QAILTQGFGFQCNCLRC 357
+ L FG C C +C
Sbjct: 515 RRKLMPWFGMDCACHKC 531
>gi|302795201|ref|XP_002979364.1| hypothetical protein SELMODRAFT_419030 [Selaginella moellendorffii]
gi|300153132|gb|EFJ19772.1| hypothetical protein SELMODRAFT_419030 [Selaginella moellendorffii]
Length = 733
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 43/73 (58%), Gaps = 4/73 (5%)
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID--ASMARDARQAIL 344
++ LP+F NH C PN + + A A RD+ G+EL I Y+D A + + R+A L
Sbjct: 480 VWGLPAFVNHSCLPNVSKLEVGKA-TMFHAARDLHAGDELLISYVDPYAPWSTERRKA-L 537
Query: 345 TQGFGFQCNCLRC 357
+ +GF+C+C RC
Sbjct: 538 VEDWGFECHCRRC 550
>gi|302691170|ref|XP_003035264.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
gi|300108960|gb|EFJ00362.1| hypothetical protein SCHCODRAFT_14438 [Schizophyllum commune H4-8]
Length = 597
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/299 (21%), Positives = 116/299 (38%), Gaps = 45/299 (15%)
Query: 85 TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHH--NA 142
+ S GRG++A + I+ +I KP ++ T L S C C + + H
Sbjct: 76 SPSEGRGLWAKQPIKRGTVIIAVKPHVSVLTTQNLASYCSNCFEEAPEAGLKRCAHCRVV 135
Query: 143 RFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMIISGAESADCI 200
+C C++ DW+ C + + K+ A + +++ C+
Sbjct: 136 HYCNSECQNK-----------DWATHKRECTALQEWAKH-----APAAEVSVPSDAVRCL 179
Query: 201 DIL----QPASLSP---ELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIR 253
L Q L + I AM+ R + + +G + + Y + +
Sbjct: 180 GRLLWKRQKKGLDSTWAQQIAAMQSS----RKSLQPSGYEFHTHLSHSLVRYLGLSSPDG 235
Query: 254 INAFRIELAGGLYEDLLS-----SAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-- 306
+ F I G L DL S S + + +G + L S NH C PNA ++
Sbjct: 236 LQDFGISSPGDLL-DLTSRFATNSYSVTTPDLTPIGACVSPLVSLVNHSCSPNAATVFPR 294
Query: 307 ------IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
D ++A+RD++ E++ YID ++ R R+ L +G+ F C C C +
Sbjct: 295 ASKTPSTDEPLISVVAIRDIQPDEQIFTSYIDTTLPRALRRRELQEGYNFLCKCSLCKT 353
>gi|440802077|gb|ELR23016.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 481
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 58/123 (47%), Gaps = 2/123 (1%)
Query: 237 MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNH 296
M+ L+ + ++VL + ++ +E G L + + + + + G+A+Y + +NH
Sbjct: 290 MEALSGRATSDVLQGLDVDEI-VEYIGRLESNTFARYSTTKRKTVNYGSALYNCAAMFNH 348
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C P+ + D L ALR + G+EL + YI RQ L Q + F C C R
Sbjct: 349 SCFPSV-VRQFDGQHLTLRALRPLNPGDELTMTYIPLRDDTPERQDELQQHYHFACRCQR 407
Query: 357 CSS 359
C +
Sbjct: 408 CEA 410
>gi|242033255|ref|XP_002464022.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
gi|241917876|gb|EER91020.1| hypothetical protein SORBIDRAFT_01g010740 [Sorghum bicolor]
Length = 482
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 67/287 (23%), Positives = 109/287 (37%), Gaps = 29/287 (10%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFC-----LRKITSSS 134
+ VA GRG+ ATR D+I +P + P + S C C LRK +
Sbjct: 17 LAVASIPGKGRGLIATRTFFPGDVILNQEPYASTPNKILVGSSCDHCFTSGNLRKCSMCQ 76
Query: 135 QHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
+ +C C+ + +E RA + D + L+V+ +++
Sbjct: 77 VTW------YCSTNCQKEEWKLHQLECRAMAALTEDRKKMLTPTIRLMVR----LVLKRK 126
Query: 195 ESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRI 254
LQ + P + L S K DE L Q V + +
Sbjct: 127 --------LQNEKVIPSSSIDNYNLVDALESHIWKV---DENHLVLYAQMANLVSLILPL 175
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR 313
++ + +A + E+ +G +Y + S NH C PNA ++ D A
Sbjct: 176 IELDLKEIAHTFSKFACNAHTICDPELRPLGTGLYPVISIINHSCVPNA-VLIFDGRTAY 234
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ AL+ + + EE+ I YI+ + R L Q F F C+C RC G
Sbjct: 235 VRALQPIGKNEEVSISYIETAAVTKKRHNDLKQYF-FTCSCPRCVKG 280
>gi|395331361|gb|EJF63742.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 652
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/127 (28%), Positives = 59/127 (46%), Gaps = 16/127 (12%)
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESE-----IAVGNAIYMLPSFYNHDCD 299
Y V + + + ++ AG L DL+S + + +G + + NH CD
Sbjct: 275 YLGVSSPQELEPYGLKSAGDLV-DLISRFTTNTFTLTTPALTPIGICVSPTVALANHSCD 333
Query: 300 PNAHIMWIDNADA---------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
PNA +++ AD L+ALR+V G+E+RI Y+D ++ RQ L + + F
Sbjct: 334 PNAALVF-PRADGGSRVKEPLLSLVALRNVAPGKEIRISYVDTTLPNRLRQNELKEVYSF 392
Query: 351 QCNCLRC 357
C C C
Sbjct: 393 SCQCKLC 399
>gi|367028494|ref|XP_003663531.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
gi|347010800|gb|AEO58286.1| hypothetical protein MYCTH_33039, partial [Myceliophthora
thermophila ATCC 42464]
Length = 419
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARD 338
S+ +G A+ M+ + NH C PNA + ++ R+ +L+ + GEE+ + Y D ++
Sbjct: 110 SKDVIGYALDMVTAMINHSCAPNAFVT-LEGCQLRVRSLKPIAAGEEITVSYADPTLPVF 168
Query: 339 ARQAILTQGFGFQCNCLRCSSGD 361
RQ L + + F C C C + D
Sbjct: 169 NRQKFLKETYFFDCRCKGCENDD 191
>gi|323337383|gb|EGA78636.1| Set5p [Saccharomyces cerevisiae Vin13]
Length = 426
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 141/369 (38%), Gaps = 72/369 (19%)
Query: 37 WKMCLGRYSRCLISRLQSLHLQKRQLCSTATHNGKPSQP----SPPPIQVA-LTESAGRG 91
WK+ R+ + L HL L + + + K P S ++V + + GRG
Sbjct: 71 WKLSASRFRKIL----NEHHLYDTDLETVSLYKDKIHFPKALDSDAKVEVKFIDDEHGRG 126
Query: 92 VFATRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH--- 140
+FA R +I KPI+ P L L C C + + +QH H+
Sbjct: 127 LFAKRDFSKGQIILKENKPIVYIPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDC 186
Query: 141 ---NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGL 177
A +C E CK + + D+ +W F +YC S GL
Sbjct: 187 EVCKAIWCSEKCKKAHASLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSVGL 246
Query: 178 KYPLLV-----------KRLACMIISGAESADCIDILQPAS----LSPELILAMEEGFVM 222
Y ++ ++LA IS E D S L+ + EE
Sbjct: 247 IYGSMLLDTTGEVKEQWQKLAS--ISQRERIKLRDASGIGSTFSLLNGTTVHTEEESDNG 304
Query: 223 LRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA 282
+ +K IDDE + W ++ AF +E L+ ++
Sbjct: 305 TKKGVEK-NIDDETV------W--EKCYELFCGAFPKASEEIDFEKFLTMIGTFNINQ-- 353
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARDARQ 341
+Y SF NHDC+PNA+I + ++ + RL A + +++GE++RI Y++ R+
Sbjct: 354 YNGQVYHWISFINHDCEPNAYIEQVEEHEELRLHARKPIKKGEQIRITYVNPLHGVRLRR 413
Query: 342 AILTQGFGF 350
L +GF
Sbjct: 414 RELRVNWGF 422
>gi|226290918|gb|EEH46346.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
Length = 389
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 43/74 (58%), Gaps = 3/74 (4%)
Query: 289 MLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQ 346
MLP S+ NH C+PNA+I + D L AL+D+ E++ I YID + RQ+ L +
Sbjct: 120 MLPFASYANHSCEPNAYIGF-DGPVIYLKALQDIALDEQIFISYIDNTEPWGKRQSELQK 178
Query: 347 GFGFQCNCLRCSSG 360
+ F C C +C+ G
Sbjct: 179 RYFFTCKCPKCAQG 192
>gi|148228090|ref|NP_001080251.1| N-lysine methyltransferase SMYD2-A [Xenopus laevis]
gi|82176673|sp|Q7ZXV5.1|SMY2A_XENLA RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|28279861|gb|AAH44103.1| Smyd2-prov protein [Xenopus laevis]
Length = 430
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/287 (21%), Positives = 114/287 (39%), Gaps = 27/287 (9%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH 136
P ++ + GRG+ ATR +L+ T T + + C FC + S+
Sbjct: 4 PEGLERFDSPGKGRGLKATRSFALGELLFTCPAYTYVLTDTERGNHCDFCFARKEGLSKC 63
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP--LLVKRLACMIISGA 194
+ A +C C+ + +E A S ++C S+ ++ +L K+ + +
Sbjct: 64 GKCKQAFYCNVDCQKGDWPMHKLECSAMCSYGQNWCPSETVRLTARILAKQKTQTERTPS 123
Query: 195 ESADCIDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQI 252
E+ + + + L E +E L + K + + + + AQ+
Sbjct: 124 ETFLSVKEFESHLSKLDNEKKELIESDIAALHRFYSK------NLHYTDNAALVFLFAQV 177
Query: 253 RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA 312
N F IE ++ LS +G+AI+ + NH C PN I+ A
Sbjct: 178 NCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNI-IVTFKGTVA 220
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ A++++ G+E+ YID + R L + F C+C CS+
Sbjct: 221 EIRAVQEIHAGDEVFTSYIDLLYPTEDRNDRLMDSYFFTCDCRECST 267
>gi|11994649|dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]
Length = 565
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 58/138 (42%), Gaps = 24/138 (17%)
Query: 243 QWYTNVLAQIRINAFRIELAGG-----LYEDLLSSAAASIESEIA--------------- 282
+ Y N++ +N + +A LY D L A EI
Sbjct: 422 EIYGNIIGMFELNNLDLVVASPVEDYFLYIDDLPDAEKEETEEITRPFLDALGDEYSDCC 481
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDA 339
G A + L S NH C PNA + + A ++ALR + + EE+ I YID +
Sbjct: 482 QGTAFFPLQSCMNHSCCPNAKAFKREEDRDGQAVIIALRRISKNEEVTISYIDEELPYKE 541
Query: 340 RQAILTQGFGFQCNCLRC 357
RQA+L +GF C C +C
Sbjct: 542 RQALLAD-YGFSCKCSKC 558
>gi|71649958|ref|XP_813687.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70878595|gb|EAN91836.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 501
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDN-ADARLMALRDVEEGEELRICYID--ASMARD 338
A G A+Y + NH C P+ + A ++ALRD+E+GEE+R YID A +R
Sbjct: 420 ATGVALYDAATKINHSCLPSVRFVPTHGRVSAVVVALRDIEKGEEIRSSYIDLVAYTSRV 479
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L +GF+C+C C
Sbjct: 480 ERREYLLSHYGFECDCSLC 498
>gi|453083761|gb|EMF11806.1| hypothetical protein SEPMUDRAFT_133791 [Mycosphaerella populorum
SO2202]
Length = 383
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDA---SMARDAR 340
I++ S +NH C NA W N R+ A+R++ EE+ + Y+ ++ R
Sbjct: 156 GGIFLQCSRFNHSCTANAAYHWDANQKRERVFAIREINPEEEIVVSYLSEDVWALPTTER 215
Query: 341 QAILTQGFGFQCNCLRCSSGD 361
+A + Q FGFQC C RC + D
Sbjct: 216 KARILQEFGFQCRCARCENND 236
>gi|326499620|dbj|BAJ86121.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++AL + +GEE+ I YID + + R
Sbjct: 419 GTAFFPLQSCMNHSCCPNAKAFKRDEDNDGHAVIIALGPISKGEEITISYIDEDLPYEER 478
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C +C
Sbjct: 479 QAELAD-YGFTCTCSKC 494
>gi|209878147|ref|XP_002140515.1| SET domain-containing protein [Cryptosporidium muris RN66]
gi|209556121|gb|EEA06166.1| SET domain-containing protein [Cryptosporidium muris RN66]
Length = 573
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/340 (20%), Positives = 128/340 (37%), Gaps = 60/340 (17%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI-TSSSQ 135
P + + ++E GR + A R I+A ++I P I L+SVC CL+ I T
Sbjct: 8 PNIVHLKVSEEKGRCIIANRDIKAGEVIWLEYPFIYIVFEEFLDSVCDTCLKYIGTEKVV 67
Query: 136 HFQHHNA----RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMII 191
+ N ++C +CK N++ + E + + F D G+ + V+ L+ II
Sbjct: 68 NIIVCNGCSVVKYCSNICKSNSEGIHKFECKLMKNKFKDISEKCGVTFD-RVRILSRFII 126
Query: 192 SGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMK----------FLN 241
S I +L+ + E L S + + + + F N
Sbjct: 127 VNLFSKSNFGIFSNIKHENKLLYSSYEHVKTLVSHISEFCQERKSIYLSLASELLNIFEN 186
Query: 242 KQWY----------------------TNVLAQIRINAFRIELAG-----GLYEDLLSSAA 274
Q++ +L I N+F I + Y+++L ++
Sbjct: 187 SQYHKTFGSDHHEIGASSISLSIMDLVEILCIIDSNSFGIPIFASSYYSYYYDNILKNSN 246
Query: 275 ASIESEIAVGN-------------AIYMLPSFYNHDCDPNAHIMWIDNAD----ARLMAL 317
+S E+ ++ S NH C+PN + D + + +
Sbjct: 247 SSFENPSNSLQLSSILLSPNITAWGLFPYASLINHSCEPNCSYLGNDESTRYPYIEIRST 306
Query: 318 RDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++ +G+E+ I YI+ +R R + L + F C C RC
Sbjct: 307 TNIPKGDEITISYIELYESRKQRISQLYKTKYFICRCTRC 346
>gi|115449849|ref|XP_001218712.1| predicted protein [Aspergillus terreus NIH2624]
gi|114187661|gb|EAU29361.1| predicted protein [Aspergillus terreus NIH2624]
Length = 306
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G + + SF NH C+P A + + + R+ AL + GEE+ Y+D S + +RQA
Sbjct: 9 LGTTLDLAASFMNHSCNPGAFV-FFEGRQMRVRALLSLPAGEEITQAYVDLSGSVFSRQA 67
Query: 343 ILTQGFGFQCNCLRC 357
+ FQC+C+RC
Sbjct: 68 TTEAEYFFQCHCVRC 82
>gi|348515415|ref|XP_003445235.1| PREDICTED: SET and MYND domain-containing protein 5-like isoform 1
[Oreochromis niloticus]
Length = 417
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 52/99 (52%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + E G+ +++L S NH C PNA + DN L AL D+ +G
Sbjct: 287 LYKDIEKETGDFLNCE---GSGLFLLQSSCNHSCIPNAEASFPDNNFLLHLSALCDINQG 343
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F C+C +C S
Sbjct: 344 EEICISYLDCCQRDRSRHSRHKILRENYLFVCSCPKCVS 382
>gi|321248842|ref|XP_003191260.1| hypothetical protein CGB_A2330W [Cryptococcus gattii WM276]
gi|317457727|gb|ADV19473.1| conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 449
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 7/122 (5%)
Query: 242 KQWYT--NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGN--AIYMLPSFYNHD 297
K+W++ + L + + E +GGLY L + S E I V N Y P+
Sbjct: 316 KRWFSFESFLELLGLVGLNQEDSGGLYA-LHAHLNHSCEPNIQVRNLPKSYTPPTQETLP 374
Query: 298 CDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
D I D +L +A ++ GEEL + Y++ M RD R+ L +G+GF C C
Sbjct: 375 VDLPPPIQAGDKVSNKLTILARHGIQPGEELTVSYVNIKMPRDERRQALREGYGFWCACD 434
Query: 356 RC 357
RC
Sbjct: 435 RC 436
>gi|326508882|dbj|BAJ86834.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 493
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/77 (38%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + A ++AL + +GEE+ I YID + + R
Sbjct: 412 GTAFFPLQSCMNHSCCPNAKAFKRDEDNDGHAVIIALGPISKGEEITISYIDEDLPYEER 471
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C +C
Sbjct: 472 QAELAD-YGFTCTCSKC 487
>gi|297736653|emb|CBI25524.3| unnamed protein product [Vitis vinifera]
Length = 716
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/320 (22%), Positives = 135/320 (42%), Gaps = 48/320 (15%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQH--- 136
+Q T + GRG+ + I S L+HT +P ++ C+FC ++ + S
Sbjct: 222 LQCVSTPTKGRGMASLSEISQSYLVHTEEPYAAIILKHCRDTHCHFCFNELPADSVPCTS 281
Query: 137 -------FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP-------LL 182
QH + G+ ++N+K +D + +W A + + +G+ +P +L
Sbjct: 282 CSIPLYCSQHCQMQAGGQELRNNSKN-HDSDSNIEWIAEHKH-ECKGVNWPAVLPPEIVL 339
Query: 183 VKRLACMIISGAESA-------DCIDILQP-ASLSPELILAMEEGFVMLRSAFKKAGIDD 234
R+ + + + D +D+ +++PE L + V+L + + +
Sbjct: 340 AGRVMVKSVEQKKHSCNASNLMDTLDLSHSYKTMAPESKLDLHIYSVVLLYCLQHSYGFE 399
Query: 235 EQMKFLNKQWYTNVLAQIRINAF---RIELAGGLYEDL-----LSSAAASIES---EIAV 283
+ ++ +++QI++N+ R++ G Y L S A + S ++ V
Sbjct: 400 LPLNGISISQLIILISQIKVNSIAIVRMKFMDG-YSPLDQSVNFSPAGGAFTSNMEQVRV 458
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEE---GEELRICYIDASMARDA- 339
G AIY + S +NH C PN H ++ +R + LR E G L + Y D
Sbjct: 459 GQAIYSVASLFNHSCQPNIHAYFL----SRTLFLRATEHVAVGCPLELSYGPQVGQWDCK 514
Query: 340 -RQAILTQGFGFQCNCLRCS 358
RQ L + F+C C CS
Sbjct: 515 DRQKFLKDEYSFRCECSGCS 534
>gi|383861061|ref|XP_003706005.1| PREDICTED: SET and MYND domain-containing protein 5-like [Megachile
rotundata]
Length = 388
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 64/126 (50%), Gaps = 14/126 (11%)
Query: 243 QWYTNVLA-----QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+W NV A + RI F+ L +Y+D+ + + + +E G+ +Y+L S NH
Sbjct: 246 RWVKNVSALELPEEERIYVFK--LIDRIYDDMEEAVGSFLNNE---GSGLYILQSAVNHS 300
Query: 298 CDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCN 353
C PNA + + N L A+RD+ EE+ I Y+D +R +RQ L+ + F C
Sbjct: 301 CIPNAIVEFPYSNNVLVLKAIRDIHPEEEICISYLDECDLERSRHSRQKALSSLYLFICC 360
Query: 354 CLRCSS 359
C +C S
Sbjct: 361 CSKCLS 366
>gi|401405154|ref|XP_003882027.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325116441|emb|CBZ51994.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 576
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQ 341
+G ++ S NH C PNA + + A++ ALRD+ + E++ + YID ++ +RQ
Sbjct: 419 MGIGLFRAVSMTNHSCWPNAEVDYPSLTKTAQVTALRDIAQDEQVLLSYIDETLPLASRQ 478
Query: 342 AILTQGFGFQCNCLRC 357
+L + + F C C+RC
Sbjct: 479 RLLKRHYKFTCACVRC 494
>gi|195569233|ref|XP_002102615.1| GD19999 [Drosophila simulans]
gi|194198542|gb|EDX12118.1| GD19999 [Drosophila simulans]
Length = 391
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEE 322
GLY + A + +E G+ +Y+L S NH C PNA N L AL +++
Sbjct: 269 GLYAKVAEFAGEFLNNE---GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQ 325
Query: 323 GEELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
GEE+ I Y+D M +R +R +L + + F C C +C
Sbjct: 326 GEEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|125542564|gb|EAY88703.1| hypothetical protein OsI_10178 [Oryza sativa Indica Group]
Length = 536
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 10/78 (12%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID----ASMARDA 339
G +++LP+F NH C PNA + + A + A RD++ GEE+ Y D AS R+A
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGD-HAIVHASRDIKAGEEITFAYFDVLTPASKRREA 369
Query: 340 RQAILTQGFGFQCNCLRC 357
+A +GF+C C RC
Sbjct: 370 ARA-----WGFECQCDRC 382
>gi|406863955|gb|EKD17001.1| lysine methyltransferase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 448
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDC PNA + + R+ A R + G+E+ + YID ARQA++ +GFQC
Sbjct: 253 NHDCRPNARFAFDRTSLTHRVHATRAIHPGDEITVSYIDGKQDFAARQAVIHAHWGFQCR 312
Query: 354 CLRCS 358
C CS
Sbjct: 313 CSLCS 317
>gi|24640767|ref|NP_572539.2| CG12119 [Drosophila melanogaster]
gi|7291024|gb|AAF46462.1| CG12119 [Drosophila melanogaster]
Length = 500
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 122/302 (40%), Gaps = 52/302 (17%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLV 183
+ + RF CG VC AK + ++D F + ++ ++
Sbjct: 77 LP---------DTRFMCRQGCGLPVCSLCAKK---KQHKSDCDLFKSWGPNEPDVANSVI 124
Query: 184 KRLACM---IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQM 237
RL C+ I E D I LQ A+L +R+A FK D + +
Sbjct: 125 IRLLCVARAINLSKEQRDLIYCLQ-ANLD-------NNHRTEVRNAAKCFKNFPTDKKLI 176
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+ +N+ +A +R N F + + +++ A+Y L NHD
Sbjct: 177 EIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVVNHD 219
Query: 298 CDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PNA+ + + + ++ A D+ EG E+ Y AR L F C C R
Sbjct: 220 CIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSR 279
Query: 357 CS 358
CS
Sbjct: 280 CS 281
>gi|350424957|ref|XP_003493967.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
impatiens]
Length = 631
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/318 (22%), Positives = 128/318 (40%), Gaps = 48/318 (15%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ + E GR + ATR+IR ++I KP T +++ C CL +++ ++ +H
Sbjct: 227 VTIKYNEEYGRHIVATRKIRPGEVIAVEKPYSLILTPDNIHTHCSNCL-EVSWANIPCEH 285
Query: 140 -HNARFCGEVCKD-NAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
A +C E CK K ++DVE R F + +K L RLA I + E+
Sbjct: 286 CTYAMYCSEECKAMEWKKYHDVECR----VFPSMLKMNFVKLDLFSLRLA--IQAVREAT 339
Query: 198 DCIDILQPASLSPELILAMEEGF--------------VMLRSAFKKAGIDDEQMKFLNKQ 243
+ ++ + + + +GF + L + +K + D + L+
Sbjct: 340 NIQELRKELKEVDSCVASRTKGFSKDGTFPSDKYRSLLGLVTNTEKRSVQDLFRRSLDAS 399
Query: 244 WY-------TNVLAQ-------IRINAFRIELAGGL---YEDLLSSAAASIESE-----I 281
+ TN+ + I + G L ++ ++ S S E +
Sbjct: 400 FILYFVATCTNIFGNPLSKDLSVLIKNADVTFVGSLILRHQQMIPSNIHSFSEECGLEAV 459
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID--ASMARDA 339
G A S NH C+PN + + + A+ +EEGE+L Y A +
Sbjct: 460 ERGAAAMPFSSLINHSCNPNI-LRHSRSKYVIIYAIYPIEEGEQLYDIYTQHYAITPKAV 518
Query: 340 RQAILTQGFGFQCNCLRC 357
RQ L + + F+CNCL C
Sbjct: 519 RQKKLLKQYYFKCNCLPC 536
>gi|195355540|ref|XP_002044249.1| GM15092 [Drosophila sechellia]
gi|194129550|gb|EDW51593.1| GM15092 [Drosophila sechellia]
Length = 392
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEE 322
GLY + A + +E G+ +Y+L S NH C PNA N L AL +++
Sbjct: 269 GLYAKVAEFAGEFLNNE---GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALGPIQQ 325
Query: 323 GEELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
GEE+ I Y+D M +R +R +L + + F C C +C
Sbjct: 326 GEEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|328874806|gb|EGG23171.1| hypothetical protein DFA_05303 [Dictyostelium fasciculatum]
Length = 473
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 46/77 (59%), Gaps = 2/77 (2%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDASMARDARQA 342
G ++ L + NH C+PN +++ + ++ A + ALRD++ GEEL YID + + R+
Sbjct: 385 GFGMFGLQAMINHSCEPNCLVVFDNGSNFAHIKALRDIQAGEELYHSYIDENTPFEEREQ 444
Query: 343 ILTQGFGFQCNCLRCSS 359
L +GF+C C +C S
Sbjct: 445 ELIT-YGFKCICRKCVS 460
>gi|125983736|ref|XP_001355633.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
gi|54643949|gb|EAL32692.1| GA11413 [Drosophila pseudoobscura pseudoobscura]
Length = 496
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 119/302 (39%), Gaps = 52/302 (17%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P +V +++ AGRG+ ATR I+ ++I P++ + +S+ C CLR+
Sbjct: 17 PFKDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRE 76
Query: 130 ITSSSQHFQHHNARF-----CG----EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP 180
+ + RF CG +C + D + W + +
Sbjct: 77 LPDT---------RFMCRIGCGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRL 127
Query: 181 LLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQM 237
L V R + + D I LQ A+L V +R+A FK D +
Sbjct: 128 LCVARALNL---NKDQRDLIYCLQ-ANLD-------NNHRVEVRNAAKCFKNFPTDKKIF 176
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+ +N+ +A +R N F + + +++ A+Y L + NHD
Sbjct: 177 EIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFAVMNHD 219
Query: 298 CDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PN++ + + + ++ A D+ EG E+ Y AR L GF C C R
Sbjct: 220 CIPNSYYTFEEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSR 279
Query: 357 CS 358
CS
Sbjct: 280 CS 281
>gi|146098807|ref|XP_001468476.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134072844|emb|CAM71560.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 700
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 122/310 (39%), Gaps = 38/310 (12%)
Query: 81 QVALTESA-GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRK---ITSSSQH 136
+V + SA GRG+ A R+ ++ A I T + L SVC +CL+ + ++
Sbjct: 224 EVRIERSAKGRGLVAASRLTEGPVLEEAPYAIVARTEALL-SVCSYCLQHSACLYHGDEY 282
Query: 137 FQHH---NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGL-KYPLLVKRLACMIIS 192
QHH FC C A Y + F C + L Y +++ A ++
Sbjct: 283 RQHHLKSRGFFCSPACAKAAWEHYG--QHESHHPFFLCCPNDALLAYRMILGMRAYPDLA 340
Query: 193 GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA-- 250
++ +D + I +E F R A + + +Y L
Sbjct: 341 ELSASPELDPVTENDFGANHIRTLEGSFS--RELQSNAAVGGCESIVATIGFYVGALTEP 398
Query: 251 ---QIRINAFRIELAGGLYEDLLSSAAASIESEIA-----------VGNAIYMLPSFYNH 296
Q+R +I L ++ + AA E+ + +G A+Y + + +NH
Sbjct: 399 EAEQLRKAQRQILLNAVDVTCMMRTTAAPSEANSSTLLQTNTVVARLGKAVYAIGALFNH 458
Query: 297 DCDPNAHIMWIDNAD---ARLM--ALRDVEEGEELRICYIDAS----MARDARQAILTQG 347
CDPN ++ + N ARL+ A+R V EGEEL + Y S + R L
Sbjct: 459 ACDPNCYVSFEGNPQGSCARLIVRAIRPVMEGEELTVSYGGISCFSFHSMRHRLQTLRDR 518
Query: 348 FGFQCNCLRC 357
+GF C C C
Sbjct: 519 YGFFCGCRSC 528
>gi|389623721|ref|XP_003709514.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
gi|351649043|gb|EHA56902.1| hypothetical protein MGG_06798 [Magnaporthe oryzae 70-15]
Length = 440
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 3/97 (3%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA--DARLMALRDVEE 322
+ ED++ + S+E + ++ S NH C PN+ + + D A + A D+E
Sbjct: 231 VVEDVMRTNTFSLEIDGETYMGLFPTISRINHACGPNSQVKF-DPATLSQKAFASHDIEA 289
Query: 323 GEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
GEE+ I Y + M +RQ+ L +GF+C C CS+
Sbjct: 290 GEEITISYAEFGMTYQSRQSTLLHRWGFKCTCALCSA 326
>gi|401883283|gb|EJT47498.1| hypothetical protein A1Q1_03610 [Trichosporon asahii var. asahii
CBS 2479]
Length = 354
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 40/77 (51%), Gaps = 2/77 (2%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAI 343
A++ S NH C PN+HI W DN + A+R + GEE+ I Y ++ + RQA
Sbjct: 149 GAVFEYISRINHSCRPNSHICWNDNLGMETVHAIRTISSGEEITINYGHTGVSEE-RQAW 207
Query: 344 LTQGFGFQCNCLRCSSG 360
L + F F C C C G
Sbjct: 208 LKKRFKFDCACEVCGLG 224
>gi|350633710|gb|EHA22075.1| hypothetical protein ASPNIDRAFT_183370 [Aspergillus niger ATCC
1015]
Length = 386
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 120/303 (39%), Gaps = 54/303 (17%)
Query: 73 SQPSPPPIQVALTESAGRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNSVCYFCLRKIT 131
+ P P P GRG+FA+ I +++H P + L VC C +
Sbjct: 5 TTPQPAP------GGLGRGLFASTDIAVGEEVLHIPVPFVAVLDTEHLGEVCSGCFGQRQ 58
Query: 132 SSSQHFQHHNARFCGEV--CKDNAKAFYDVERRADWSAFNDYCRS--QGLK---YPLLVK 184
+ R CG V C +A DW + S Q LK P+ +
Sbjct: 59 LEEEGITLKGCRGCGIVKYCDKTCQA-------KDWKLGHSLECSIYQNLKPRVLPINAR 111
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
+ M++ S EL + F+ L + K I D+ QW
Sbjct: 112 AVLRMVLRSERQKYS---------SEEL-----DQFLQLGTHIKD--IRDQSAS----QW 151
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIE-SEIAVGNAIY------MLP--SFYN 295
L+ I A+ + E+++S+ A ++ + + NA+Y + P + +N
Sbjct: 152 ERISLSSKAIRAYS---GTDMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFN 208
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H CD NA + + D + + ALR + + E++ I YID + RQA L + F C+C
Sbjct: 209 HSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCS 267
Query: 356 RCS 358
+C+
Sbjct: 268 KCT 270
>gi|407394357|gb|EKF26910.1| hypothetical protein MOQ_009378 [Trypanosoma cruzi marinkellei]
Length = 697
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA-RQ 341
+G A+Y S++NH C PN + A ALR++ +GE L ICY+D A R+
Sbjct: 567 IGVALYPEASYFNHSCCPNICRVTYRGLFAAFHALREIPKGEPLTICYVDVQETSTAERR 626
Query: 342 AILTQGFGFQCNCLRCSSGD 361
L + F C C RCS +
Sbjct: 627 RTLFSSYRFFCECARCSGTN 646
>gi|195168440|ref|XP_002025039.1| GL26796 [Drosophila persimilis]
gi|194108484|gb|EDW30527.1| GL26796 [Drosophila persimilis]
Length = 498
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/293 (22%), Positives = 116/293 (39%), Gaps = 34/293 (11%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P +V +++ AGRG+ ATR I+ ++I P++ + +S+ C CLR+
Sbjct: 17 PFKDKDPAWEVGVSKIAGRGLMATRSIKRGEIIFKDMPLLISLSAHEEDSLNACSVCLRE 76
Query: 130 ITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACM 189
+ + + +C + D + W + + L V R +
Sbjct: 77 LPDTRFMCRMGCGLPVCSLCSKKKQHKTDCDLLKSWGPNEPEVANSVIIRLLCVARALNL 136
Query: 190 IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQMKFLNKQWYT 246
+ D I LQ A+L V +R+A FK D + + +N+
Sbjct: 137 ---NKDQRDLIYCLQ-ANLD-------NNHRVEVRNAAKCFKNFPTDKKIFEIMNR---- 181
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
+A +R N F + + +++ A+Y L + NHDC PN++ +
Sbjct: 182 -TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFAVMNHDCIPNSYYTF 228
Query: 307 IDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + ++ A D+ EG E+ Y AR L GF C C RCS
Sbjct: 229 EEKTNIMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKIKRGFTCKCSRCS 281
>gi|357606576|gb|EHJ65119.1| hypothetical protein KGM_22198 [Danaus plexippus]
Length = 515
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 68/291 (23%), Positives = 121/291 (41%), Gaps = 35/291 (12%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH- 139
+V +E GR + A + I+A + I + +P + P+ T + VC+ C + + ++
Sbjct: 6 KVEKSEKLGRYLVAAKDIKAGERILSDQPFVLGPSSDT-SLVCFNCYLPLINKFLVCKNC 64
Query: 140 HNARFC-GEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
A C G+ C D +++ R + F + ++G+ +V+ + + + A
Sbjct: 65 AVAPLCPGDGCPDEFTKYHN---RQECDVFRNLKLTKGISPMTMVQNVGSLSVLRA---- 117
Query: 199 CIDILQPASLSPELILAME-EGFVMLRSAFKKAGIDDEQMKFL----------NKQWYTN 247
+L+ + E L ME E + R D +KF+ N+
Sbjct: 118 ---LLKKETNLLEWKLFMELETHLERRRESNVWQYYDNTVKFIQSLGLLENGQNQDLVQK 174
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
+ A I +N+F E+ G A +EI G +Y+ + +HDC N H+
Sbjct: 175 ICAAIDVNSF--EVRG-------PPIPAIGCAEILRG--VYLQAALLSHDCIANTHMSID 223
Query: 308 DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
DN A D+++GE + Y D RQ L G F+C C RC+
Sbjct: 224 DNNMLVCHASVDIKKGESINYNYTDPLKGTIPRQQHLIVGKYFKCTCTRCT 274
>gi|406700189|gb|EKD03370.1| hypothetical protein A1Q2_02350 [Trichosporon asahii var. asahii
CBS 8904]
Length = 573
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/332 (20%), Positives = 119/332 (35%), Gaps = 66/332 (19%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPII-------THPTLSTLNSVCYFCLRKIT 131
P++V + GRG++A R I+A +L+ A + T P +S L+ +
Sbjct: 58 PLEVKTSPQRGRGLYAARDIKAGELVWRAPKLTPGETLLKTKPLVSVLSKPNLYEPAAGQ 117
Query: 132 SSSQHFQH-------------HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLK 178
H H +CG C+ + DW + C++
Sbjct: 118 KMELHCNHCYLLKPAQRCSGCKGVYYCGAACQQD-----------DWPSHKTECKALTRV 166
Query: 179 YPLLVKRLACMIISG-------AESADCIDILQPASLS------------PELILAMEEG 219
L V+ G AE A + +L A + P++ EG
Sbjct: 167 RQLWVQSYPEKAKEGLNNSWIQAEGARALGLLTWARKAYRDQHGRDPDYWPKVEKMYAEG 226
Query: 220 FVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIR----INAFRIELAGGLYEDLLSSAAA 275
M GI ++ +Y + + +E G E L+S +
Sbjct: 227 PDMGTGVH---GIPAADQMAIHLSYYVGAAEPPKDPNNLELIDMEPYGFQDEQLMSFVRS 283
Query: 276 SIESEIA--------VGNAIYMLPSFYNHDCDPNAHIMWIDNADARL-MALRDVEEGEEL 326
+ + +G A+ L + +NH C PNA I++ + +A D+ GEE+
Sbjct: 284 FVLNAFTLSSFDLKPIGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEV 343
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRCS 358
Y+D S ++ RQ L +GF+C C C+
Sbjct: 344 LTTYVDISDDKETRQGDLQSRYGFECECPACT 375
>gi|255714340|ref|XP_002553452.1| KLTH0D17160p [Lachancea thermotolerans]
gi|238934832|emb|CAR23014.1| KLTH0D17160p [Lachancea thermotolerans CBS 6340]
Length = 485
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/312 (22%), Positives = 124/312 (39%), Gaps = 49/312 (15%)
Query: 81 QVALTES-AGR-GVFATRRIRASDLI-HTAKPI-----ITHPTLSTLNSVCYFCLRKITS 132
+V L +S GR VFA + +R +L+ + +P+ + +L+ + C C + +T
Sbjct: 111 KVELKQSCKGRFSVFAKKDLRRDELVFYEHEPLTSILPLEKTSLAAVGRACSLCGQSLTQ 170
Query: 133 SSQHFQHHN--------ARFCGEVCK--DNAKAF--YDVERR-----ADWSAFNDYCRSQ 175
S+Q F N A +C + CK D +F + + R + WS+F +C+
Sbjct: 171 STQ-FTMKNGLDCRMCSAVWCSKHCKKLDTTHSFLKHPMSRNKQVSASGWSSFEQFCKDN 229
Query: 176 GLKYPLLVKRL-ACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKA--GI 232
+ V + A ++ E D + +++ + F A +
Sbjct: 230 VMHAAYSVGTIHARALLDKNEGPRIKDQFEALRQISQVLRIRAADSTNIGGTFDAATGAV 289
Query: 233 DDEQMKFLNKQW------YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNA 286
E NK + + V +I N + I L G +
Sbjct: 290 SLEDQSIWNKSFESFCKAFPGVEDEIDFNHYMI-LIGKFNLNQREGQ------------- 335
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQ 346
I+ L S+ NH+C+PN +L A +D+++GEEL Y++ R+ L
Sbjct: 336 IFTLYSYLNHNCEPNVRYELDGKTGLKLFARKDIKKGEELLTTYVNPLHGVTLRRRELLV 395
Query: 347 GFGFQCNCLRCS 358
+GF C+C RC+
Sbjct: 396 NWGFLCDCERCN 407
>gi|195565893|ref|XP_002106530.1| GD16076 [Drosophila simulans]
gi|194203908|gb|EDX17484.1| GD16076 [Drosophila simulans]
Length = 497
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 122/302 (40%), Gaps = 52/302 (17%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLV 183
+ + RF CG VC AK + ++D F + ++ ++
Sbjct: 77 LP---------DTRFMCRQGCGLPVCSLCAKK---KQHKSDCDLFKSWGPNEPDVANSVI 124
Query: 184 KRLACM---IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQM 237
RL C+ I E D I LQ A+L +R+A FK D + +
Sbjct: 125 IRLLCVARAINLSKEQRDLIYCLQ-ANLD-------NNHRTEVRNAAKCFKNFPTDKKLI 176
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+ +N+ +A +R N F + + +++ A+Y L NHD
Sbjct: 177 EIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVVNHD 219
Query: 298 CDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PNA+ + + + ++ A D+ EG E+ Y AR L F C C R
Sbjct: 220 CIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSR 279
Query: 357 CS 358
CS
Sbjct: 280 CS 281
>gi|402077091|gb|EJT72440.1| hypothetical protein GGTG_09306 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 455
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/85 (37%), Positives = 43/85 (50%), Gaps = 2/85 (2%)
Query: 274 AASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYID 332
A I S A G I++ S NH C NA W + + A+RD+ +GEE+ I YI+
Sbjct: 94 ALPIGSNAATG-GIFLEASRINHACLQNAQNTWNEGLQQLTIHAIRDINQGEEITIMYIE 152
Query: 333 ASMARDARQAILTQGFGFQCNCLRC 357
R ARQ L + F F C+C C
Sbjct: 153 DRANRAARQRTLQRDFRFTCSCQLC 177
>gi|398022224|ref|XP_003864274.1| hypothetical protein, conserved [Leishmania donovani]
gi|322502509|emb|CBZ37592.1| hypothetical protein, conserved [Leishmania donovani]
Length = 700
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 122/310 (39%), Gaps = 38/310 (12%)
Query: 81 QVALTESA-GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRK---ITSSSQH 136
+V + SA GRG+ A R+ ++ A I T + L SVC +CL+ + ++
Sbjct: 224 EVRIERSAKGRGLVAASRLTEGPVLEEAPYAIVARTEALL-SVCSYCLQHSACLYHGDEY 282
Query: 137 FQHH---NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGL-KYPLLVKRLACMIIS 192
QHH FC C A Y + F C + L Y +++ A ++
Sbjct: 283 RQHHLKSRGFFCSPACAKAAWEHYG--QHESHHPFFLCCPNDALLAYRMILGMRAYPDLA 340
Query: 193 GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA-- 250
++ +D + I +E F R A + + +Y L
Sbjct: 341 ELSASPELDPVTENDFGANHIRTLEGSFS--RELQSNAAVGGCESIVATIGFYVGALTEP 398
Query: 251 ---QIRINAFRIELAGGLYEDLLSSAAASIESEIA-----------VGNAIYMLPSFYNH 296
Q+R +I L ++ + AA E+ + +G A+Y + + +NH
Sbjct: 399 EAEQLRKAQRQILLNAVDVTCMMRTTAAPSEANSSTLLQTNTVVARLGKAVYAIGALFNH 458
Query: 297 DCDPNAHIMWIDNAD---ARLM--ALRDVEEGEELRICYIDAS----MARDARQAILTQG 347
CDPN ++ + N ARL+ A+R V EGEEL + Y S + R L
Sbjct: 459 ACDPNCYVSFEGNPQGSCARLIVRAIRPVMEGEELTVSYGGISCFSFHSMRHRLQTLRDR 518
Query: 348 FGFQCNCLRC 357
+GF C C C
Sbjct: 519 YGFFCGCRSC 528
>gi|443683649|gb|ELT87827.1| hypothetical protein CAPTEDRAFT_118237 [Capitella teleta]
Length = 596
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/305 (21%), Positives = 115/305 (37%), Gaps = 34/305 (11%)
Query: 82 VALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHN 141
VA TE+ GR + A R I +++ KP + + ++ C+ CL+ +
Sbjct: 182 VASTENEGRFITAVRNIAPGEIVLIEKPFASVLLRANYSNHCHHCLKHTLEGIPCRTCPD 241
Query: 142 ARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC------------- 188
ARFC E C+D A Y + + S N SQ K+ L R
Sbjct: 242 ARFCSEACRDTAMQTY---HQYECSVLNTLHHSQINKFGCLAFRAITKQSYQSLKDIRAQ 298
Query: 189 -MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN 247
+ ++G S + +P + + L ++ F + + +K L + Y N
Sbjct: 299 DLPLNGCHSD---GLYRPQDYNTIIQLVTHAKDRPVQDLFHRTVMAVYLLKLLQQTSYFN 355
Query: 248 VLAQIRINAFRIEL-----------AGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNH 296
+ + A+ L A + E L A + +G IY S +NH
Sbjct: 356 GEEDVEMQAYIAGLFLSHLQSFPCNAHEVPELYLDPNAIDLSMPNELGAGIYSTLSLFNH 415
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICY--IDASMARDARQAILTQGFGFQCNC 354
CDP + + + + A++ + +G ++ Y + A+ R L + F C C
Sbjct: 416 SCDPGVNRNFYGDT-CVVRAIKTIRKGHQVSDNYGALYATNTLKERHDKLQPQYFFSCRC 474
Query: 355 LRCSS 359
CS+
Sbjct: 475 EPCSN 479
>gi|294933189|ref|XP_002780642.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
gi|239890576|gb|EER12437.1| SET and MYND domain-containing protein, putative [Perkinsus marinus
ATCC 50983]
Length = 409
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 116/282 (41%), Gaps = 28/282 (9%)
Query: 88 AGRGVFATRRIR--ASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFC 145
GRG+FA D I T +PI+ T + + C++CL + + +++Q R+
Sbjct: 17 GGRGLFAGEGSHYIPGDSIATYRPIVYCVTPTMAKACCHWCLN--SDAPKYYQCSGCRYA 74
Query: 146 GEVCKDNAKAFYDV-ERRADWSAFN----DYCRSQGLKYPLLVKRLACMIISGAESADCI 200
K+ AKA Y + R + ++ S L L +L ++ +
Sbjct: 75 VYCSKECAKAAYKLGTHRRECQLIKKLPAEHASSAPLTTFLAAAKLHWLVQEDEAVRHSV 134
Query: 201 DILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIE 260
D + + S + + A +L S + +D Q + +L +R+ +
Sbjct: 135 DEMCRHADSSDTLEADSGSTAILLSRY----LDGTQADLIYSPPVPVMLDLLRVLRYN-- 188
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDV 320
A + D L A + +E+ S NH C PN ++ ++ L +R V
Sbjct: 189 -AVTITNDSLQDVALGLYTEV----------SAMNHSCAPNV-VLIFSGSEVTLRTIRAV 236
Query: 321 EEGEELRICYIDASMARDARQAI-LTQGFGFQCNCLRCSSGD 361
E+G EL I Y+D ++ A++ L + F C+C RC+ D
Sbjct: 237 EDGAELFISYVDVCISPKAKRCQRLRDQYKFDCSCERCTRED 278
>gi|348505210|ref|XP_003440154.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Oreochromis niloticus]
Length = 454
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 64/284 (22%), Positives = 109/284 (38%), Gaps = 44/284 (15%)
Query: 87 SAGRGVFATRRIRASDLIHTAKPIITHPTLSTL------NSVCYFCLRKITSSSQHFQHH 140
S GRG +RA+ ++T + I + P+ + + VC+ C R + + Q
Sbjct: 16 SKGRG------LRAAKELNTGEVIFSEPSFAAVVFDSLATQVCHSCFRHQANLHRCAQCK 69
Query: 141 NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCI 200
A +C C+ A ++ E + + A + L + L M ++D
Sbjct: 70 FAHYCDRTCQ---TACWN-EHKQECGAIKKLGSAPSENIRLAARVLWRMHKDTGIASDS- 124
Query: 201 DILQPASLSPELILAMEEGFVMLRSAFKK------AGIDDEQMKFLNKQWYTNVLAQIRI 254
++ L + E F LR+ K G + ++ +++ I+
Sbjct: 125 QLISVDQLQEHVADLAAEDFKQLRADVHKFLQYWSYGTRQHSVDYI-----SHIFGIIKC 179
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARL 314
N F + GL AVG ++ NHDC PN ++ +++ L
Sbjct: 180 NGFTLSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCTVI-LNHGKIEL 223
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
AL + EGEEL + Y+D RQ L + F F C C CS
Sbjct: 224 RALGKIPEGEELTVSYVDFLNLSADRQKKLKERFHFDCTCEHCS 267
>gi|315046180|ref|XP_003172465.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
gi|311342851|gb|EFR02054.1| SET and MYND domain-containing protein 2 [Arthroderma gypseum CBS
118893]
Length = 498
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/121 (27%), Positives = 53/121 (43%), Gaps = 17/121 (14%)
Query: 240 LNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
L+ + LA+I +N+F + G +G I + NH CD
Sbjct: 180 LDLETIAEYLARIEVNSFTFTTSFG----------------DPLGLCIQPFACYMNHSCD 223
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
PNA ++ D + ALR ++ E++ I YID + + RQ L + + F C C +C
Sbjct: 224 PNA-VVGFDEGLITVKALRKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCVE 282
Query: 360 G 360
G
Sbjct: 283 G 283
>gi|401886254|gb|EJT50303.1| hypothetical protein A1Q1_00408 [Trichosporon asahii var. asahii
CBS 2479]
Length = 582
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARL-MALRDVEEGEELRICYIDASMARDARQ 341
+G A+ L + +NH C PNA I++ + +A D+ GEE+ Y+D S ++ RQ
Sbjct: 299 IGVAMSPLLALFNHSCAPNAAIVFPRGGKEMVAVANADIAAGEEVLTTYVDISDDKETRQ 358
Query: 342 AILTQGFGFQCNCLRCS 358
L +GF+C C C+
Sbjct: 359 GDLQSRYGFECECPACT 375
>gi|406863606|gb|EKD16653.1| TPR domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 333
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/115 (29%), Positives = 53/115 (46%), Gaps = 15/115 (13%)
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ VL R NAFR+ L G ++ + +A++ S NH C PNA
Sbjct: 115 HVAVLNIFRSNAFRLGLGQG-------------QTAEGISHAVFAGISRINHSCVPNAQA 161
Query: 305 MWIDNADA-RLMALRDVEEGEELRICYI-DASMARDARQAILTQGFGFQCNCLRC 357
W + + A RD+E EE+ + Y+ + R+ R L+ G+GF+C C C
Sbjct: 162 NWHGILERFNVHATRDIEVAEEVSLAYLPENGDLRETRMKRLSDGYGFECGCPAC 216
>gi|150866965|ref|XP_001386745.2| hypothetical protein PICST_63965 [Scheffersomyces stipitis CBS
6054]
gi|172044079|sp|A3M0J3.2|SET5_PICST RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|149388221|gb|ABN68716.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 478
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 2/73 (2%)
Query: 287 IYMLPSFYNHDCDPNAHIMW--IDNADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
I++ S NH+CDPN ++ + ++ A RD+ GEEL Y++ + RQ L
Sbjct: 341 IFLTQSHLNHNCDPNTNVDTSPVRTEGLKVYAARDIRAGEELTTTYVNPAHTVQQRQREL 400
Query: 345 TQGFGFQCNCLRC 357
+GF C C +C
Sbjct: 401 RVNWGFICGCQKC 413
>gi|331218423|ref|XP_003321889.1| hypothetical protein PGTG_03426 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 615
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 38/66 (57%), Gaps = 3/66 (4%)
Query: 294 YNHDCDPNAHIMWIDN--ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
+NHDC PN ++D+ R+ A R + GEEL I Y A R+ RQ L++ +GF+
Sbjct: 390 FNHDCRPNVE-YYMDHFTQSIRMTASRQILAGEELTISYRAAEFTRERRQNSLSETYGFR 448
Query: 352 CNCLRC 357
C+C C
Sbjct: 449 CSCSHC 454
>gi|213406173|ref|XP_002173858.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212001905|gb|EEB07565.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 449
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
A+G + + NH C PN M ++ A ARL+AL+ +++G+ + I Y+D ++ + R
Sbjct: 179 AIGTCMDSTLARINHSCQPNCVFM-VEGATARLVALQTLQKGDFVHISYVDTTLPYNVRT 237
Query: 342 AILTQGFGFQCNCLRC 357
L Q + F C C +C
Sbjct: 238 NELQQKYFFTCTCQKC 253
>gi|403161564|ref|XP_003890392.1| hypothetical protein PGTG_20944 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171833|gb|EHS64478.1| hypothetical protein PGTG_20944 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 334
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 43/75 (57%), Gaps = 4/75 (5%)
Query: 286 AIYMLPSF-YNHDCDPNAHIMWIDN--ADARLMALRDVEEGEELRICYIDASMARDARQA 342
A+ + P+ +NHDC PN ++D+ R+ A R + GEEL I Y A R+ RQ
Sbjct: 100 ALVLEPAVRFNHDCRPNVE-YYMDHFTQSIRMTASRQILAGEELTISYRAAEFTRERRQN 158
Query: 343 ILTQGFGFQCNCLRC 357
L++ +GF+C+C C
Sbjct: 159 SLSETYGFRCSCSHC 173
>gi|302906242|ref|XP_003049436.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256730371|gb|EEU43723.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 516
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/106 (35%), Positives = 49/106 (46%), Gaps = 13/106 (12%)
Query: 266 YEDLLSS-AAASIESEIAVGNA-----------IYMLPSFYNHDCDPNAHIMWID-NADA 312
Y DL +S S E IA+ NA I++ + NH C PNA W + +
Sbjct: 73 YRDLWNSYRGDSPEVGIALTNAYPLGPNASTTGIFLHAARLNHSCHPNAQDTWNEARLEL 132
Query: 313 RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
L A D+EEGEE+ I + R ARQ +L GF C C CS
Sbjct: 133 TLHACCDIEEGEEITITLLSTRGCRKARQRVLKLNCGFTCLCCLCS 178
>gi|398410778|ref|XP_003856737.1| hypothetical protein MYCGRDRAFT_90139 [Zymoseptoria tritici IPO323]
gi|339476622|gb|EGP91713.1| hypothetical protein MYCGRDRAFT_90139 [Zymoseptoria tritici IPO323]
Length = 358
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 63/135 (46%), Gaps = 29/135 (21%)
Query: 236 QMKFLNKQWYTNVLAQI-------RINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIY 288
Q K L + + + LA++ + NAF ++LA GL + A++
Sbjct: 105 QDKLLQRGFSPDELAEMITVASIMQTNAFNVDLADGLG---------------STYRAMF 149
Query: 289 MLPSFYNHDCDPNAHIMWI------DNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+ NH C PNAH+ + + + +LR +E+GEE+ I Y + M RD R
Sbjct: 150 AKVARINHACAPNAHVCYYPPDPEYERGRMVVHSLRPLEKGEEVLISYFNILMPRDDR-T 208
Query: 343 ILTQGFGFQCNCLRC 357
+ T+ +GF+C C C
Sbjct: 209 LRTRKWGFECACPVC 223
>gi|24648668|ref|NP_650955.1| CG3353 [Drosophila melanogaster]
gi|23171865|gb|AAF55866.2| CG3353 [Drosophila melanogaster]
gi|33636489|gb|AAQ23542.1| RE75113p [Drosophila melanogaster]
Length = 393
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEE 322
GLY + A + +E G+ +Y+L S NH C PNA N L AL +++
Sbjct: 269 GLYAKVGEFAGEFLNNE---GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQQ 325
Query: 323 GEELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
GEE+ I Y+D M +R +R +L + + F C C +C
Sbjct: 326 GEEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|72392607|ref|XP_847104.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175618|gb|AAX69751.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803134|gb|AAZ13038.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 713
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA 339
E +G ++ S++NH C PN + D A ALR++ +GE L ICY+D A
Sbjct: 569 EDCIGVSVIPEASYFNHSCLPNLCRVMCDGGIAAFYALREIRKGEPLTICYVDVQEVSTA 628
Query: 340 -RQAILTQGFGFQCNCLRCSS 359
R+ L + F C C RC+
Sbjct: 629 ERRRTLLTSYRFFCQCKRCNG 649
>gi|307177472|gb|EFN66599.1| SET and MYND domain-containing protein 5 [Camponotus floridanus]
Length = 386
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRI---ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI +L +Y+D+ + + +E G+ +Y+L S NH C
Sbjct: 246 RWVKNVSALDLPKDQRIYIDKLINRIYDDMEEVVGSFLNNE---GSGLYILQSSVNHSCT 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + N L A+RD++ GEE+ Y+D +R +RQ L+ + F C+C
Sbjct: 303 PNAIAEFPYSNNTLVLKAIRDIQVGEEICTSYLDECELERSRYSRQKALSSLYLFVCHCD 362
Query: 356 RCSS 359
+C +
Sbjct: 363 KCQT 366
>gi|327275309|ref|XP_003222416.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Anolis carolinensis]
Length = 485
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 115/311 (36%), Gaps = 67/311 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V T+ GRG+ A + A D+I + S + VC+ C ++ + Q
Sbjct: 4 VEVFDTDGKGRGLKAAKEFWAGDVIFAERAYAAVVFDSLTHLVCHTCFKRHAKLHRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMII----- 191
A +C C+ +A W + C + G K P RLA I+
Sbjct: 64 KFAHYCDRTCQKDA-----------WVNHKNECAAIKKHG-KAPNENIRLAARIMWRIER 111
Query: 192 -SGAESADC---IDILQ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
G + C ID LQ EL L +E S + D +Q
Sbjct: 112 EGGGLTEGCFVSIDSLQNHVEHFGEEEKKELRLDLE-------SFLEFWPHDGKQFAM-- 162
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++L I N F + GL AVG I+ NHDC PN
Sbjct: 163 -QYISHILGVINCNGFTLSDQRGLQ---------------AVGVGIFPNLCLVNHDCWPN 206
Query: 302 AHIMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+++ + N +A L AL + GEEL + Y+D + R+ L + +
Sbjct: 207 CTVIFNNGNHEAVRSMFHTQMRIELRALGKISIGEELTVSYVDFLNVCEDRRQQLKKQYY 266
Query: 350 FQCNCLRCSSG 360
F C C C G
Sbjct: 267 FDCTCEHCQKG 277
>gi|345486839|ref|XP_003425566.1| PREDICTED: SET and MYND domain-containing protein 4-like [Nasonia
vitripennis]
Length = 621
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 125/330 (37%), Gaps = 59/330 (17%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ E GR + ATR I+ ++I+ +P T+ L + C CL S+
Sbjct: 202 VKIEHDEKWGRHLVATRDIKPGEVIYVEEPYTKCLTIKHLRAYCSHCLTTTWSNVPCDHC 261
Query: 140 HNARFCGEVCKDNA-KAFYDVE---RRADWSAFNDYCR-------SQGLKYPLLVKRLAC 188
FC E CKD A K ++ E S +D C+ + G+K V+ L
Sbjct: 262 SWTMFCSEACKDLAWKNYHSSECPVYVCSKSDTDDTCKQLAIRCIALGIKEAGSVENLKA 321
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
I S E D + L E I +M GF+ + + K +++ Q N
Sbjct: 322 QIKSFDECQD----VTKGFLKNEKIQSM--GFMSIYALSKNISLENLQTHLENTAIVLRA 375
Query: 249 LAQ----------------IRINAFRIELAGGLYEDLLSSAAASIESEI----------- 281
LA+ ++ N I LA LS A + E+
Sbjct: 376 LAEHTTWLEEKCDFNDCKRLKKNENAIFLAALFL--TLSKIAHVNKHEMWNSSFCRNNLK 433
Query: 282 --------AVGNAIYMLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI 331
+Y+ P + NH CDPNA + + + A + +++G ++ CY
Sbjct: 434 RLECWKNECCSRGVYLAPITALLNHSCDPNARRCYSLDHKVIVYATKPIKKGSQIFDCYQ 493
Query: 332 DASMAR---DARQAILTQGFGFQCNCLRCS 358
+ R + R +L+ + F C+C C+
Sbjct: 494 EEFYERCKAEERCNMLSSTYNFDCDCKACT 523
>gi|294887209|ref|XP_002772010.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
gi|239875918|gb|EER03826.1| hypothetical protein, conserved [Perkinsus marinus ATCC 50983]
Length = 74
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 36/65 (55%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH C PNA + + R+ +L+ V GEE+ YID ++ RQ+ L Q +GF C C
Sbjct: 2 NHSCSPNAEASFRGSRCLRVKSLKPVHSGEEVFQSYIDENLPLVERQSKLRQAYGFACRC 61
Query: 355 LRCSS 359
RC +
Sbjct: 62 GRCRT 66
>gi|432873526|ref|XP_004072260.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 2
[Oryzias latipes]
Length = 489
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 65/294 (22%), Positives = 103/294 (35%), Gaps = 53/294 (18%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + A D+I + + S VC+ C R+ + Q A +C
Sbjct: 13 GRGLKAAKEFWAGDVIFSEASLAAVVFDSLAERVCHSCFRRQEKLQRCGQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA-----------ESA 197
C+ A E + + SA Y + L+ + L + G+ E
Sbjct: 73 CQRAGWA----EHKQECSAIKAYGKVPNENIRLVARILWRLDKDGSTVSDMQLTTLDELE 128
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D I +Q L EL + + K+ +DD +++ I N F
Sbjct: 129 DHITDMQEDELK-ELKVDIHNFLDFWPRTSKQHTVDD----------ISHIFGVINCNGF 177
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN------------AHIM 305
+ GL AVG ++ NHDC PN + M
Sbjct: 178 SVSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCTVILNHGNQSAVNTM 222
Query: 306 WIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ L +L + EGEEL + Y+D + RQ +L + F C C C +
Sbjct: 223 FHSQRRIELRSLGKIAEGEELTVAYVDYMNLSEERQRLLKTQYFFDCTCEHCKN 276
>gi|1245372|gb|AAB38131.1| NN8-4AG, partial [Homo sapiens]
Length = 412
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 282 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 338
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 339 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 375
>gi|109103368|ref|XP_001104101.1| PREDICTED: SET and MYND domain-containing protein 5 [Macaca
mulatta]
gi|402891240|ref|XP_003908860.1| PREDICTED: SET and MYND domain-containing protein 5 [Papio anubis]
gi|355565788|gb|EHH22217.1| hypothetical protein EGK_05442 [Macaca mulatta]
Length = 418
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|256062338|ref|XP_002570318.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230727|emb|CCD77144.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 423
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYID---ASMARD 338
G +Y S NH C PNA I+ ++RL +AL ++EGEE+ I Y+D S R
Sbjct: 314 GVGLYYYQSRINHSCSPNA-IIRFSGVNSRLSVVALTSIQEGEEITISYLDHCLQSRGRH 372
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L+ + F CNC +C
Sbjct: 373 TRRKHLSSNYLFWCNCPKC 391
>gi|332813419|ref|XP_001150449.2| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
troglodytes]
Length = 415
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|261330294|emb|CBH13278.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 713
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA 339
E +G ++ S++NH C PN + D A ALR++ +GE L ICY+D A
Sbjct: 569 EDCIGVSVIPEASYFNHSCLPNLCRVMCDGGIAAFYALREIRKGEPLTICYVDVQEVSTA 628
Query: 340 -RQAILTQGFGFQCNCLRCSS 359
R+ L + F C C RC+
Sbjct: 629 ERRRTLLTSYRFFCQCKRCNG 649
>gi|224057872|ref|XP_002299366.1| SET domain protein [Populus trichocarpa]
gi|222846624|gb|EEE84171.1| SET domain protein [Populus trichocarpa]
Length = 391
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 121/323 (37%), Gaps = 81/323 (25%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNS------VCYFCLRKITSS 133
++V + GRG+ +T+ +R ++ PI+ + L C C + I S+
Sbjct: 11 VRVEEIQGRGRGLVSTQPLRGGQIVLIDSPILLYSALPLTKQQHSTFLYCDKCFKTIQSA 70
Query: 134 SQHFQH-HNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS 192
S + RFC C A A W C+S + RL
Sbjct: 71 SVSCPTCSHQRFCSPTCLSAALA----SSHTPW-----VCQS--------LSRLR----- 108
Query: 193 GAESADCIDILQPASLSPE-----LILAMEEGFV------MLRSAFKKAGIDD------- 234
DC D LQ S+ + L+ A FV +L S +A +D
Sbjct: 109 -----DCQDFLQHHSVERQIQAQFLVAAYNLAFVSPSDFQILLSLQGRAEDEDPAIVQSL 163
Query: 235 -------------EQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDL-LSSAAASIESE 280
E F + + ++A+ R NAF GL E L L+ S
Sbjct: 164 HSVISSLCPPPPIEGFSF-SLELIAALVAKDRFNAF------GLMEPLNLNEENGGQRSV 216
Query: 281 IAVGNAIYMLPSFYNHDCDPNA-HIMWID-----NADARLMALRDVEEGEELRICYIDAS 334
A G IY S +NHDC PNA ++D N D + + DV +G E+ + Y +
Sbjct: 217 RAYG--IYPKASLFNHDCLPNACRFDYVDTNNSGNTDIVVRMIHDVPQGREICLSYFPVN 274
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
R+ L + +GF C+C RC
Sbjct: 275 SNYSTRRKRLLEDYGFTCDCDRC 297
>gi|332226837|ref|XP_003262596.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Nomascus leucogenys]
Length = 418
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|299743688|ref|XP_002910692.1| hypothetical protein CC1G_15023 [Coprinopsis cinerea okayama7#130]
gi|298405772|gb|EFI27198.1| hypothetical protein CC1G_15023 [Coprinopsis cinerea okayama7#130]
Length = 449
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 46/81 (56%), Gaps = 6/81 (7%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMA 336
++ + I+ L S +NH C PNA + +A ++AL+D+ +GEE+ I Y+D ++
Sbjct: 196 SIAHGIFPLASRSFNHSCSPNAAPKYTFSAHQPVIMEVVALKDIHQGEEICIPYLDPALT 255
Query: 337 RDARQAILTQGFGFQCNCLRC 357
+ RQ I +GF CNC C
Sbjct: 256 QTKRQ-IFQFTYGFNCNCPAC 275
>gi|194377790|dbj|BAG63258.1| unnamed protein product [Homo sapiens]
Length = 418
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|194899548|ref|XP_001979321.1| GG14794 [Drosophila erecta]
gi|190651024|gb|EDV48279.1| GG14794 [Drosophila erecta]
Length = 392
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/98 (33%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEE 322
GLY + A + +E G+ +Y+L S NH C PNA N L AL ++E
Sbjct: 269 GLYAKVGEFAGEFLNNE---GSGLYLLQSKINHSCVPNACSTFPYSNDIVVLKALAPIQE 325
Query: 323 GEELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
G+E+ I Y+D M +R +R +L + + F C C +C
Sbjct: 326 GDEICISYLDECMLERSRHSRHKVLRENYVFICQCSKC 363
>gi|154689858|ref|NP_006053.2| SET and MYND domain-containing protein 5 [Homo sapiens]
gi|90101758|sp|Q6GMV2.2|SMYD5_HUMAN RecName: Full=SET and MYND domain-containing protein 5; AltName:
Full=Protein NN8-4AG; AltName: Full=Retinoic
acid-induced protein 15
gi|119620150|gb|EAW99744.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|119620151|gb|EAW99745.1| SMYD family member 5, isoform CRA_a [Homo sapiens]
gi|261859134|dbj|BAI46089.1| SMYD family member 5 [synthetic construct]
Length = 418
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|384943470|gb|AFI35340.1| SET and MYND domain-containing protein 5 [Macaca mulatta]
Length = 418
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|356543740|ref|XP_003540318.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Glycine
max]
Length = 484
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 76/183 (41%), Gaps = 28/183 (15%)
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
D +D AS ++ + E +L++A D E + + Y N++ +N
Sbjct: 302 DDVDSSDEASFRLQIKMLAFESLQLLKTAI----FDKECEPLFSLEIYGNIIGMFELNNL 357
Query: 258 RIELAGG-----LYEDLLSSAAASIESEIA---------------VGNAIYMLPSFYNHD 297
+ +A LY D L+ +I G A + L S NH
Sbjct: 358 DLVVASPVEDYFLYIDDLTYPNKEEAEKITQPVLDALGEEYSIYCEGTAFFPLQSCLNHS 417
Query: 298 CDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
C PNA + + A ++A R + +GEE+ I Y+D + + RQA L +GF+C C
Sbjct: 418 CCPNAKAFKREEDKDGQATIIAQRSICKGEEITISYVDEDLTFEERQASLAD-YGFRCRC 476
Query: 355 LRC 357
+C
Sbjct: 477 SKC 479
>gi|425701087|gb|AFX92249.1| putative set domain-containing protein [Megavirus courdo11]
Length = 157
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 52/97 (53%), Gaps = 3/97 (3%)
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPS--FYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
+D+L A + + G++I +L + +NH C PN + + N + +RD+ +GE
Sbjct: 41 DDILLFCAKYMSNAFEYGDSIAILFNGRIFNHSCLPNI-VFYRCNDVMCFITVRDIYKGE 99
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
EL Y++ + + RQ+ L + F C+C RCSS +
Sbjct: 100 ELLDSYVNITHDKKTRQSRLWNQYRFHCDCQRCSSNN 136
>gi|417400530|gb|JAA47200.1| Putative histone tail methylase [Desmodus rotundus]
Length = 415
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|365760270|gb|EHN02002.1| Set5p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 524
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 83/372 (22%), Positives = 137/372 (36%), Gaps = 58/372 (15%)
Query: 36 FWKMCLGRYSRCLISR-LQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFA 94
WK+ R+ L L L+ L H K + L + GRG+FA
Sbjct: 68 MWKLSASRFREILNQHHLYDTDLETVSLYKENIHFPKILDSNAKVKVNFLDDERGRGLFA 127
Query: 95 TRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH------ 140
+ +I KPI+ P L L C C + + +QH H+
Sbjct: 128 EKDFAKGQIILKENKPIVYVPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDCEVC 187
Query: 141 NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGLKYP 180
A +C E CK + D+ +W F +YC S GL Y
Sbjct: 188 KAIWCSEKCKKAHAPLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSIGLIYG 247
Query: 181 LL-------VKRLACMIISGAESA-----DCIDILQPASLSPELILAMEEGFVMLRSAFK 228
+ V+ + S ++ D I +L E+GF
Sbjct: 248 SMLLDTTGEVREQWNKLASVSQRVRIKLRDASGIGSTFNLMNGTTAHTEDGFDNNSKMSA 307
Query: 229 KAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIY 288
+ IDDE + ++ Y + + I+L E L+ ++ +Y
Sbjct: 308 EENIDDETVW---EKCYESFCGAFPKASEEIDL-----EKFLTMIGTFNINQ--YNGQMY 357
Query: 289 MLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQAILTQG 347
+ SF NHDC+PNA+I + + + +L A + +++GE++ I Y++ R+ L
Sbjct: 358 LWISFINHDCEPNAYIEQTEEHEELKLHARKPIKKGEQICITYVNPLHGVRLRRRELRVN 417
Query: 348 FGFQCNCLRCSS 359
+GF C C RC +
Sbjct: 418 WGFLCQCDRCQN 429
>gi|256062334|ref|XP_002570316.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230729|emb|CCD77146.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 433
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYID---ASMARD 338
G +Y S NH C PNA I+ ++RL +AL ++EGEE+ I Y+D S R
Sbjct: 324 GVGLYYYQSRINHSCSPNA-IIRFSGVNSRLSVVALTSIQEGEEITISYLDHCLQSRGRH 382
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L+ + F CNC +C
Sbjct: 383 TRRKHLSSNYLFWCNCPKC 401
>gi|449681118|ref|XP_002159729.2| PREDICTED: uncharacterized protein LOC100210152, partial [Hydra
magnipapillata]
Length = 1044
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/284 (21%), Positives = 115/284 (40%), Gaps = 30/284 (10%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+++ T S GR A+R I+A ++I KP + N+ CY CL + +
Sbjct: 230 LKIVSTNSKGRHAIASRDIKAGEVIIIEKPFASLCLPECYNTHCYHCLTRFKINYPCRLC 289
Query: 140 HNARFCGEVC-KDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESAD 198
+C C K++ + F+ E ND L + ++ M++S E+
Sbjct: 290 STVNYCSISCEKESWEKFHCFECEYLGVLINDDVGLAHLAFKIITNVGISMLLSFKENNS 349
Query: 199 CIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFR 258
D+ +S I ++ E ++ + K + + NA
Sbjct: 350 FDDLKPYSSTDYNSIFSLIENLKIVCAHLLKH------------------IQMLPCNAHE 391
Query: 259 IELAGGLYEDLLSSAAASIESEIA-VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMAL 317
+ +L A+ +SE+ +G+A+Y S NH CDP+ + L A+
Sbjct: 392 VS-------ELQLKASNYKDSELKEIGSAVYATLSLLNHSCDPSV-VRHCYGDTCVLRAI 443
Query: 318 RDVEEGEEL--RICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ ++EG E+ ++ A ++ RQ+ L + + F C C CS+
Sbjct: 444 KHIKEGSEIVDNYGFLYAVESKVIRQSHLMEQYYFACQCEACSN 487
>gi|296223556|ref|XP_002807574.1| PREDICTED: LOW QUALITY PROTEIN: SET and MYND domain-containing
protein 5 [Callithrix jacchus]
Length = 431
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 304 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 360
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 361 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 397
>gi|256062336|ref|XP_002570317.1| set and mynd domain containing [Schistosoma mansoni]
gi|353230728|emb|CCD77145.1| putative set and mynd domain containing [Schistosoma mansoni]
Length = 429
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYID---ASMARD 338
G +Y S NH C PNA I+ ++RL +AL ++EGEE+ I Y+D S R
Sbjct: 320 GVGLYYYQSRINHSCSPNA-IIRFSGVNSRLSVVALTSIQEGEEITISYLDHCLQSRGRH 378
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L+ + F CNC +C
Sbjct: 379 TRRKHLSSNYLFWCNCPKC 397
>gi|302682538|ref|XP_003030950.1| expressed protein [Schizophyllum commune H4-8]
gi|300104642|gb|EFI96047.1| expressed protein, partial [Schizophyllum commune H4-8]
Length = 423
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 284 GNAIYMLPS-FYNHDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARD 338
G+ I+ L S +NH C PNA +I + ++ALR + GEE+ I Y+D ++ +
Sbjct: 181 GHGIFPLASRLFNHSCVPNAAARYILRPREPVKMEVVALRPITAGEEICIPYVDPALL-E 239
Query: 339 ARQAILTQGFGFQCNCLRC 357
RQ I +GF+C C C
Sbjct: 240 TRQQIFKLSYGFECRCTSC 258
>gi|380797359|gb|AFE70555.1| SET and MYND domain-containing protein 5, partial [Macaca mulatta]
Length = 407
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 277 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 333
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 334 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 370
>gi|149727812|ref|XP_001488565.1| PREDICTED: SET and MYND domain-containing protein 5 [Equus
caballus]
Length = 417
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|429851794|gb|ELA26957.1| lysine methyltransferase [Colletotrichum gloeosporioides Nara gc5]
Length = 430
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 54/118 (45%), Gaps = 10/118 (8%)
Query: 245 YTNVLAQ-IRINAFRIELAGGLYEDLLSSAAASIESEIAVGN-AIYMLPSFYNHDCDPNA 302
+T L Q I N + + +GG ED +E E+ +G A++ S NH C NA
Sbjct: 202 FTKGLKQKIWTNGYGV--SGGPAEDW-----PGLEDELDLGMIAVHANISKINHSCRSNA 254
Query: 303 HIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
W D RL A+RD+ GEE+ I Y D RQ + GF+C C C +
Sbjct: 255 ASQWDWDLLAHRLWAVRDIAAGEEITISYFDPIQTLRERQRYAKETLGFECACSHCQA 312
>gi|303282313|ref|XP_003060448.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457919|gb|EEH55217.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 516
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 275 ASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDAS 334
++I +E +VG +Y L SF NH C PNA + I + A RD+ GEE+ + Y D +
Sbjct: 192 SAIRNEKSVG--VYPLMSFANHSCAPNACKLLIGHT-MFTRAARDLVAGEEVCVKYFDVT 248
Query: 335 MARDARQAILTQGFGFQCNCLRCS 358
+ R A+ + +GF+C C RC
Sbjct: 249 APKSERNAVAKR-WGFECACARCG 271
>gi|431920345|gb|ELK18377.1| SET and MYND domain-containing protein 5 [Pteropus alecto]
Length = 418
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|403165497|ref|XP_003325490.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375165747|gb|EFP81071.2| hypothetical protein PGTG_07323 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 691
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 46/82 (56%), Gaps = 9/82 (10%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA---------RLMALRDVEEGEELRICYIDASM 335
+ +Y+L S +NH CDPNA ++ ++ +MA +D+ + EE+ + YI+ +
Sbjct: 539 HGLYLLHSRFNHSCDPNAVVLKHPGSEDIHKSAPSTIYVMARKDIAKDEEITLSYINPDL 598
Query: 336 ARDARQAILTQGFGFQCNCLRC 357
+ DAR+ L + + F C C RC
Sbjct: 599 SLDARRRKLFEDYLFSCFCSRC 620
>gi|344246823|gb|EGW02927.1| SET and MYND domain-containing protein 5 [Cricetulus griseus]
Length = 633
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 504 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 560
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 561 EEICISYLDCCQRERSRHSRHKILRENYLFICSCPKC 597
>gi|195636164|gb|ACG37550.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 2/78 (2%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + S NH C PNA ++ D A + AL+ +++ EE+ I YI+ + R
Sbjct: 205 LGTGLYPVISIINHSCVPNA-VLIFDGRTAYVRALQPIDKDEEVSISYIETAAVTKKRNN 263
Query: 343 ILTQGFGFQCNCLRCSSG 360
L Q F F C C RC G
Sbjct: 264 DLKQYF-FTCTCPRCVKG 280
>gi|397473487|ref|XP_003808242.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1 [Pan
paniscus]
Length = 420
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|134083355|emb|CAK42922.1| unnamed protein product [Aspergillus niger]
Length = 415
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/125 (28%), Positives = 63/125 (50%), Gaps = 13/125 (10%)
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIE-SEIAVGNAIY------MLP--SF 293
QW L+ I A+ + E+++S+ A ++ + + NA+Y + P +
Sbjct: 118 QWERISLSSKAIKAYS---GTDMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAI 174
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+NH CD NA + + D + + ALR + + E++ I YID + RQA L + F C+
Sbjct: 175 FNHSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCH 233
Query: 354 CLRCS 358
C +CS
Sbjct: 234 CSKCS 238
>gi|50294041|ref|XP_449432.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528746|emb|CAG62408.1| unnamed protein product [Candida glabrata]
Length = 374
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 77/180 (42%), Gaps = 37/180 (20%)
Query: 193 GAESADCIDILQPASLS-----PEL----ILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ 243
G+++ + LQ LS P L +L + FV+L K + L Q
Sbjct: 213 GSDTLEAFHNLQSNELSKIQKFPVLLNFQVLVFKTLFVLLPDNLKNS---------LTVQ 263
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ ++L NAF G+++D +++S G ++ S++NH CDPN
Sbjct: 264 AFRHILGSEYGNAF------GIWQD-----TEAVDSREYFGYWVFPKASYFNHSCDPNIT 312
Query: 304 IMWIDNADARLMAL---RDVEEGEELRICYIDA-SMARDARQAILTQGFGFQCNCLRCSS 359
ID R M RDV GEEL I Y + D RQ L + + F C C RC+S
Sbjct: 313 KTRID----RKMVFTLNRDVACGEELNIDYSGVLDLPVDRRQKFLFENWFFVCGCNRCNS 368
>gi|451856473|gb|EMD69764.1| hypothetical protein COCSADRAFT_195529 [Cochliobolus sativus
ND90Pr]
Length = 325
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 52/113 (46%), Gaps = 9/113 (7%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNA----IYMLPSF--YNHDCDPNAHIMWI 307
+ A L G ED++ + S + GN I + PS NH C PNA ++
Sbjct: 104 VAALDQSLGGSPIEDVMKTN--SFACRLDDGNVDKAYICLFPSVSRINHACQPNALARFV 161
Query: 308 DNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ A RD+ GEE+ I Y + RD RQ + G+ F+C C C++
Sbjct: 162 SKTLSMQIKAKRDIAAGEEITISYGRVDLTRDERQELYKDGWNFECTCSLCTA 214
>gi|254572041|ref|XP_002493130.1| Zinc-finger protein of unknown function [Komagataella pastoris
GS115]
gi|238032928|emb|CAY70951.1| Zinc-finger protein of unknown function [Komagataella pastoris
GS115]
gi|328352852|emb|CCA39250.1| Potential protein lysine methyltransferase SET5 [Komagataella
pastoris CBS 7435]
Length = 464
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 42/77 (54%), Gaps = 2/77 (2%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHI-MWIDNADA-RLMALRDVEEGEELRICYIDASMARDAR 340
V N ++++ + NH CD N I + AD ++ ALRD+ GE+L Y+ +M+ R
Sbjct: 323 VDNCLFLVYAHLNHSCDRNVSIDLQPKRADGLQVTALRDIAAGEQLTTTYVSPTMSLQDR 382
Query: 341 QAILTQGFGFQCNCLRC 357
Q L +GF C C RC
Sbjct: 383 QRELRLNWGFICECSRC 399
>gi|410955043|ref|XP_003984168.1| PREDICTED: SET and MYND domain-containing protein 5 [Felis catus]
Length = 416
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|297667345|ref|XP_002811937.1| PREDICTED: SET and MYND domain-containing protein 5 [Pongo abelii]
Length = 419
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|440799648|gb|ELR20692.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 548
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/290 (23%), Positives = 109/290 (37%), Gaps = 42/290 (14%)
Query: 85 TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNAR- 143
T + G V A + I A I I P + SVC+ C+ ++ +
Sbjct: 63 TPAKGTVVIAKKDIPAGTTILAEPAIAWQPLEDRIMSVCHHCMTEVPRWAVGCGEGAGGC 122
Query: 144 ----FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADC 199
+C C++ ++A + VE + A + R+ K + + +LA II+
Sbjct: 123 SGLGYCSPKCREASEALHRVEHKVFLQARDIANRT---KADITLIKLATRIIA------- 172
Query: 200 IDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
SLS + E G + + G +DE +QW ++ ++ +
Sbjct: 173 -----LRSLSEGHRVQFERGVMAM------MGHEDE----CPQQWVDSITETAKLVMPLL 217
Query: 260 ELAGGLYEDLLSSAAASIES-----------EIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
L A I + + +G +Y L S NH C PN +
Sbjct: 218 PKPARLSAREFVKVCARINTNSHRQHHMFVPQRILGVGLYPLASLINHSCQPNCGF-YNR 276
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ L DV+EGEEL YID +R R+A L + F C C RCS
Sbjct: 277 GPTLYIRTLCDVKEGEELCYSYIDLYQSRSKRKAELLETKHFDCLCNRCS 326
>gi|355751414|gb|EHH55669.1| hypothetical protein EGM_04917 [Macaca fascicularis]
Length = 419
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|451927719|gb|AGF85597.1| hypothetical protein glt_00792 [Moumouvirus goulette]
Length = 236
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 294 YNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
+NH C PN +DN M + D+++G+EL Y+D + RQ L +GF C+
Sbjct: 143 FNHSCLPNIIFYRVDNM-MYFMTIIDIKKGQELVDNYVDIVQNKKIRQDRLLNQYGFICS 201
Query: 354 CLRCSSGD 361
C+RCS+ +
Sbjct: 202 CVRCSNNN 209
>gi|426335968|ref|XP_004029476.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Gorilla gorilla gorilla]
Length = 419
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|320591240|gb|EFX03679.1| oxysterol-binding protein [Grosmannia clavigera kw1407]
Length = 850
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAIL 344
AIY + NHDC PN + D A R +MA RD+ GEEL Y+ RQ L
Sbjct: 208 AIYPKAALVNHDCRPN--VFNTDVAGRRVIMATRDIAVGEELLTTYVPLLADTTTRQRRL 265
Query: 345 TQGFGFQCNCLRCSSGD 361
Q +GF C+C+ C +G+
Sbjct: 266 IQ-YGFHCSCVACLAGE 281
>gi|58264108|ref|XP_569210.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|338819407|sp|P0CR42.1|SET5_CRYNJ RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|57223860|gb|AAW41903.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 449
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 25/98 (25%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWI-----------------------DNADARL--MALRD 319
+Y L + NH C+PN + + D +L +A +
Sbjct: 339 GGLYALHAHMNHSCEPNIQVRNLPKSYTPPTQDTLPVNLPPPIQAGDRVSNKLTILARHE 398
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++ GEEL I Y++ M+RD R+ L +G+GF C C RC
Sbjct: 399 IQPGEELTISYVNMKMSRDERRQALREGYGFWCACDRC 436
>gi|405118616|gb|AFR93390.1| hypothetical protein CNAG_03890 [Cryptococcus neoformans var.
grubii H99]
Length = 449
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 7/122 (5%)
Query: 242 KQWYT--NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGN--AIYMLPSFYNHD 297
++W++ + L + + E +GGLY L + S E I V N Y P+
Sbjct: 316 RRWFSFESFLELLGLVGLNQEDSGGLYA-LHAHLNHSCEPNIQVRNLPKSYTPPTPDTLP 374
Query: 298 CDPNAHIMWIDNADARL--MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
D I D +L +A ++ GEEL I Y++ M RD R+ L +G+GF C C
Sbjct: 375 VDLPPPIRAGDKVSNKLTILARHGIQPGEELTISYVNMKMPRDERRQALREGYGFWCACG 434
Query: 356 RC 357
RC
Sbjct: 435 RC 436
>gi|395841248|ref|XP_003793458.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 1
[Otolemur garnettii]
Length = 416
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFICSCPKC 381
>gi|311252397|ref|XP_003125060.1| PREDICTED: SET and MYND domain-containing protein 5 [Sus scrofa]
Length = 417
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|440901422|gb|ELR52368.1| SET and MYND domain-containing protein 5 [Bos grunniens mutus]
Length = 418
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|428178002|gb|EKX46879.1| hypothetical protein GUITHDRAFT_137874 [Guillardia theta CCMP2712]
Length = 978
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICY-IDASMARDAR 340
A ++ L +F++H C PN + +N + + ++RDV GEEL I Y + + R
Sbjct: 237 AFETGLHHLSAFFDHSCSPNCEVRGEENIE--VYSIRDVAAGEELTIAYSLSLDYPVEMR 294
Query: 341 QAILTQGFGFQCNCLRC 357
+ L FGF+C C RC
Sbjct: 295 RIFLQTNFGFRCECPRC 311
>gi|426194765|gb|EKV44696.1| hypothetical protein AGABI2DRAFT_120820 [Agaricus bisporus var.
bisporus H97]
Length = 674
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Query: 283 VGNAIYMLPSFYNHDCDPNA--------HIMWIDNADARLMALRDVEEGEELRICYIDAS 334
+G +I + + +NH C PNA H + A ++ LRD+ EE+ I YID +
Sbjct: 299 IGVSISPIIALFNHSCAPNAVPVFPRAPHNAKANEPMASVITLRDIPANEEVVISYIDTT 358
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
+ + RQ L + + F C+C C
Sbjct: 359 LTKRERQKALKETYYFTCSCSLC 381
>gi|403260393|ref|XP_003922658.1| PREDICTED: SET and MYND domain-containing protein 5 [Saimiri
boliviensis boliviensis]
Length = 419
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|291386545|ref|XP_002709793.1| PREDICTED: SMYD family member 5 [Oryctolagus cuniculus]
Length = 421
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|407919127|gb|EKG12382.1| hypothetical protein MPH_10499 [Macrophomina phaseolina MS6]
Length = 675
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 61/280 (21%), Positives = 102/280 (36%), Gaps = 29/280 (10%)
Query: 91 GVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCK 150
GVFA R I + I + T S+ + C C + I + +A FC E C+
Sbjct: 261 GVFADREIDPGECILVDR---TATGASSSVAGCVNCFKNIKDAGVEAACCSAIFCSEDCR 317
Query: 151 DNAKAFYDVER-RADW----SAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQP 205
A + Y R D+ + S PLL+ R + +D
Sbjct: 318 KLAVSTYHAAVCRKDFMWLQATVEGLSESAAAMRPLLMFRYLAACAQAGATEHPLDHPLV 377
Query: 206 ASLSPEL------ILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
A L P++ + ++E + ++ G+D + +I+ + I
Sbjct: 378 ARLQPQMSGRHLDVFTLKESVIRPFQILQQLGMD--------------IFENSKIDTWVI 423
Query: 260 ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD 319
+ E+ ++I L +NH C+PN D R A R
Sbjct: 424 HTIWTRLANNKQGFPRKPENGGGWIDSISPLMPLFNHSCEPNVECWHDDTTTVRFFARRR 483
Query: 320 VEEGEELRICYIDASMAR-DARQAILTQGFGFQCNCLRCS 358
+++GEEL Y++ + + RQ IL F C C +CS
Sbjct: 484 IQKGEELFDSYVNVENVQLEKRQQILWPWFEGPCLCSKCS 523
>gi|348681060|gb|EGZ20876.1| hypothetical protein PHYSODRAFT_313346 [Phytophthora sojae]
Length = 494
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD--ARLMALRDVEEGEELRICYIDASM 335
E + ++ + NH C PN W + D A + ALRD++EG+E+ + YID
Sbjct: 202 EKHLVAAVGLFPICGLINHSCQPNC--TWSNAGDGIAEVRALRDIKEGDEITLSYIDIDK 259
Query: 336 ARDARQAILTQGFGFQCNCLRCSS 359
R R+ L + F C C RC++
Sbjct: 260 ERAERRKELRETKHFDCQCGRCAA 283
>gi|255950300|ref|XP_002565917.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592934|emb|CAP99305.1| Pc22g20170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 495
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 42/77 (54%), Gaps = 3/77 (3%)
Query: 286 AIYMLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
+YM P NH CD N+ + + D + + A+R +++GE++ I YID + D R+
Sbjct: 204 GLYMHPYAGLINHSCDYNSTVGF-DGEELYVKAMRPIKKGEQIFISYIDTTTPYDIRRNE 262
Query: 344 LTQGFGFQCNCLRCSSG 360
L + + F C C +C G
Sbjct: 263 LKERYFFDCQCTKCKMG 279
>gi|134108020|ref|XP_777392.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819406|sp|P0CR43.1|SET5_CRYNB RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|50260082|gb|EAL22745.1| hypothetical protein CNBB1930 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 449
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 44/98 (44%), Gaps = 25/98 (25%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWI-----------------------DNADARL--MALRD 319
+Y L + NH C+PN + + D +L +A +
Sbjct: 339 GGLYALHAHMNHSCEPNIQVRNLPKSYTPPTQDTLPVNLPPPIQAGDRVSNKLTILARHE 398
Query: 320 VEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++ GEEL I Y++ M+RD R+ L +G+GF C C RC
Sbjct: 399 IQPGEELTISYVNMKMSRDERRQALREGYGFWCACDRC 436
>gi|122692553|ref|NP_001073717.1| SET and MYND domain-containing protein 5 [Bos taurus]
gi|120419446|gb|ABM21542.1| SMYD family member 5 [Bos taurus]
gi|151553913|gb|AAI49126.1| SMYD5 protein [Bos taurus]
gi|296482713|tpg|DAA24828.1| TPA: SMYD family member 5 [Bos taurus]
Length = 418
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|50303021|ref|XP_451448.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74637076|sp|Q6CX91.1|SET5_KLULA RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|49640579|emb|CAH03036.1| KLLA0A10241p [Kluyveromyces lactis]
Length = 492
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 69/331 (20%), Positives = 113/331 (34%), Gaps = 88/331 (26%)
Query: 80 IQVALTESAGRGVFATRRIRASDLI-HTAKPIITHPTLS-----TLNSVCYFCLRKITSS 133
+ +A TE G+G+ AT IR L+ H P+ P L + C C T S
Sbjct: 115 VTIADTE-FGKGLVATDDIRKGGLVFHEQLPLTVIPQLDKQPLVSRGKCCGLCSAFTTLS 173
Query: 134 SQHFQHHN-------ARFCGEVCKD---------NAKAFYDVERRADWSAFNDYCRSQGL 177
SQ +N +C + CK +A + + ++W F ++C + L
Sbjct: 174 SQLIIKNNLDCNMCNTIWCSKQCKKLDTTHSVLKHATSRNSLCNSSNWIKFENFCLQESL 233
Query: 178 KYPLLVKRL------------------ACMI-------------ISGAESADCIDILQPA 206
V + C + + G A C L
Sbjct: 234 VAAYAVGVIYARHVMQPTSNLWKLFTSLCSVSQRVRWQASDSINVGGTFDASCNGKLYAD 293
Query: 207 SLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLY 266
+P + + F M +F K +D Y
Sbjct: 294 VSAPNMETQWSDAFEMFAKSFPKCDMD--------------------------------Y 321
Query: 267 EDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEEL 326
ED L+ ++I +Y L + NH C+PN +L A +D+++GE+L
Sbjct: 322 EDFLNLTGTFNLNQIM--GQVYPLVAHINHSCEPNVRYELEPKHGIKLYARKDIKKGEQL 379
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRC 357
R+ Y++ R+ L +GF C+C RC
Sbjct: 380 RLTYVNPLHGVTLRRRELRVNYGFLCHCPRC 410
>gi|301758142|ref|XP_002914921.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Ailuropoda melanoleuca]
gi|281346678|gb|EFB22262.1| hypothetical protein PANDA_002863 [Ailuropoda melanoleuca]
Length = 417
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|348688194|gb|EGZ28008.1| putative SET-domain transcriptional regulator protein, Zn-binding
site [Phytophthora sojae]
Length = 457
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 6/86 (6%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADAR-----LMALRDVEEGEELRICYI-DASM 335
A + +++ + +NHDC+PN ++A A + R + +GE+L I Y+
Sbjct: 201 AFDSGVFLYNAMFNHDCNPNCVKFTPEDAGAEGGVSEVRVARPIAKGEQLTISYLYPREQ 260
Query: 336 ARDARQAILTQGFGFQCNCLRCSSGD 361
+R+ RQ L + FGF+C+C C GD
Sbjct: 261 SRENRQKNLWEQFGFECSCELCKRGD 286
>gi|344283937|ref|XP_003413727.1| PREDICTED: SET and MYND domain-containing protein 5 [Loxodonta
africana]
Length = 417
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|88900471|ref|NP_001034725.1| SET and MYND domain-containing protein 1 [Danio rerio]
gi|84180543|gb|ABC54713.1| histone methyltransferase SmyD1a [Danio rerio]
Length = 486
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 70/309 (22%), Positives = 114/309 (36%), Gaps = 65/309 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
++V + GRG+ AT+ A D++ P + S +S+C+ C R+ + Q
Sbjct: 4 VEVFDSPGKGRGLRATKEAWAGDVLFAEPPFASVVFDSQASSICHSCFRRQEKLQRCGQC 63
Query: 140 HNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAES 196
A++C + C+ RA W C ++ G K P RLA I+ +
Sbjct: 64 RFAQYCDKTCQ-----------RAGWEEHKLECAAIKTYG-KPPSENVRLAARILWRMDK 111
Query: 197 ADCIDILQPASLSPELILAME--EGFVMLRSAFKKAGIDDEQMKFLNKQWYTN------- 247
Q + +S + +E E + S + FL+ W N
Sbjct: 112 -------QGSVVSDNQLTTLEDLEDHICDISEDDLKDFKVDIHNFLD-YWPRNSKPHTVD 163
Query: 248 ----VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+L I N F + GL AVG ++ NHDC PN
Sbjct: 164 SVSHILGVINCNGFMVSDQRGLQ---------------AVGVGLFPNLCLVNHDCWPNCT 208
Query: 304 IMWIDNADA-------------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
++ ++N + L AL + GEE+ + Y+D RQ +L Q + F
Sbjct: 209 VI-LNNGNQSAIDTVFHSQKRIELRALGKISAGEEVTVAYVDYLNVSADRQRLLKQQYFF 267
Query: 351 QCNCLRCSS 359
C C C+
Sbjct: 268 DCTCKHCTE 276
>gi|115496736|ref|NP_001069832.1| N-lysine methyltransferase SMYD2 [Bos taurus]
gi|122144240|sp|Q0P585.1|SMYD2_BOVIN RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|112362245|gb|AAI20365.1| SET and MYND domain containing 2 [Bos taurus]
gi|296478866|tpg|DAA20981.1| TPA: SET and MYND domain containing 2 [Bos taurus]
Length = 433
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 117/297 (39%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + + DL+ + T+S + C FC + S+ CG
Sbjct: 18 GRGLRALQPFQVGDLLFSCPAYAYVLTVSERGNHCEFCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY +VE +R DW C G + P RL I++ +
Sbjct: 68 CK---QAFYCNVECQREDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------------MKFLNKQW 244
+ P E +LA++E S K +D+E+ ++F +
Sbjct: 118 IHPERTPSEKLLAVKE----FESHLDK--LDNEKRDLIQSDIAALHHFYSKHLEFPDNDS 171
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++++ GEE+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIHPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD 271
>gi|242216833|ref|XP_002474221.1| predicted protein [Postia placenta Mad-698-R]
gi|220726639|gb|EED80582.1| predicted protein [Postia placenta Mad-698-R]
Length = 152
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 8/95 (8%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
G++E L+ ESE +G A+Y PSF+NH C PN + A + R V G
Sbjct: 61 GIWEVPLTE-----ESE-CLGFAVYPRPSFFNHHCSPNVNKERYGRGLA-FVTTRSVRAG 113
Query: 324 EELRICYIDA-SMARDARQAILTQGFGFQCNCLRC 357
EEL I Y A M RQ L +G+ F+C+C +C
Sbjct: 114 EELCISYGHAEGMGWRQRQKELREGWFFECSCGKC 148
>gi|357607357|gb|EHJ65468.1| hypothetical protein KGM_05644 [Danaus plexippus]
Length = 370
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 4/97 (4%)
Query: 264 GLYEDLLSSAAASIESEI-AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE- 321
GLY ++ ++ ++ E+ ++G IY+ S +H C+PNA + D + AL+D+
Sbjct: 161 GLYGRMVINSFTILDIEMNSIGTGIYLASSVIDHSCNPNA-VAVFDGKTINIRALKDMNC 219
Query: 322 -EGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ +++RI YID RQ L Q + F C C RC
Sbjct: 220 LDWKKIRISYIDLMKTPYERQMELRQSYYFLCQCDRC 256
>gi|317036659|ref|XP_001397803.2| SET and MYND domain protein [Aspergillus niger CBS 513.88]
Length = 481
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 120/303 (39%), Gaps = 54/303 (17%)
Query: 73 SQPSPPPIQVALTESAGRGVFATRRIR-ASDLIHTAKPIITHPTLSTLNSVCYFCLRKIT 131
+ P P P GRG+FA+ I +++H P + L VC C +
Sbjct: 5 TTPQPAP------GGLGRGLFASTDIAVGEEVLHIPVPFVAVLDTEHLGEVCSGCFGQRQ 58
Query: 132 SSSQHFQHHNARFCGEV--CKDNAKAFYDVERRADWSAFNDYCRS--QGLK---YPLLVK 184
+ R CG V C +A +W + S Q LK P+ +
Sbjct: 59 LEEEGIALKGCRGCGIVKYCDKTCQA-------KNWKLGHSLECSIYQNLKPRVLPINAR 111
Query: 185 RLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW 244
+ M++ S EL + F+ L + K I D+ QW
Sbjct: 112 AVLRMVLRSERQKYS---------SEEL-----DQFLQLGTHIKD--IRDQSAS----QW 151
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIE-SEIAVGNAIY------MLP--SFYN 295
L+ I A+ + E+++S+ A ++ + + NA+Y + P + +N
Sbjct: 152 ERISLSSKAIKAYS---GTDMSEEVISAMGAKLDLNSFNLTNAVYDRLGVYLHPYAAIFN 208
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H CD NA + + D + + ALR + + E++ I YID + RQA L + F C+C
Sbjct: 209 HSCDHNAAVSF-DGPNLHIKALRPIRKDEQIFITYIDVTDPYPIRQANLQSRYYFTCHCS 267
Query: 356 RCS 358
+CS
Sbjct: 268 KCS 270
>gi|255732233|ref|XP_002551040.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240131326|gb|EER30886.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 474
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 2/73 (2%)
Query: 287 IYMLPSFYNHDCDPNAHI-MWIDNADA-RLMALRDVEEGEELRICYIDASMARDARQAIL 344
++++ S NH+C PN + + D ++ A RD++ GEEL Y++ S RQ L
Sbjct: 335 VFLIQSHLNHNCSPNTSVETEMQRTDGLKVFAARDIKSGEELTTTYVNPSHTVHQRQREL 394
Query: 345 TQGFGFQCNCLRC 357
+GF C C +C
Sbjct: 395 RVNWGFVCGCAKC 407
>gi|398398565|ref|XP_003852740.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
IPO323]
gi|339472621|gb|EGP87716.1| hypothetical protein MYCGRDRAFT_109375 [Zymoseptoria tritici
IPO323]
Length = 383
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 4/83 (4%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYI---DAS 334
+SE +G A+Y S++NH C PN + + RDV+ GEE I Y+ +
Sbjct: 296 DSEEYMGWAVYPSASYFNHSCSPNLAKRRV-GREWEFTTARDVDAGEECCITYLGGDEKG 354
Query: 335 MARDARQAILTQGFGFQCNCLRC 357
M R RQ L + +GF C C RC
Sbjct: 355 MDRAERQRRLKEVWGFDCGCERC 377
>gi|403161562|ref|XP_003321888.2| hypothetical protein PGTG_03425 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171832|gb|EFP77469.2| hypothetical protein PGTG_03425 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 430
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 2/75 (2%)
Query: 285 NAIYMLPSF-YNHDCDPN-AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+A+ + P+ +NHDC PN + M + A R + GEEL I Y + R RQ
Sbjct: 237 SALVLEPAVRFNHDCRPNVGYYMDHVTQSMHMTAFRPISAGEELTISYRALELPRQVRQD 296
Query: 343 ILTQGFGFQCNCLRC 357
L++ +GFQC+C C
Sbjct: 297 SLSETYGFQCSCSLC 311
>gi|354492175|ref|XP_003508226.1| PREDICTED: SET and MYND domain-containing protein 5-like
[Cricetulus griseus]
Length = 435
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 306 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 362
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 363 EEICISYLDCCQRERSRHSRHKILRENYLFICSCPKC 399
>gi|195481611|ref|XP_002101711.1| GE15465 [Drosophila yakuba]
gi|194189235|gb|EDX02819.1| GE15465 [Drosophila yakuba]
Length = 497
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/302 (23%), Positives = 122/302 (40%), Gaps = 52/302 (17%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLMGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLV 183
+ + RF CG VC AK + ++D F + ++ ++
Sbjct: 77 LPDT---------RFMCRQGCGLPVCSLCAKK---KQHKSDCDLFKSWGPNEPDVANSVI 124
Query: 184 KRLACM---IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGIDDEQM 237
RL C+ I E D I LQ A+L +R+A FK D + +
Sbjct: 125 IRLLCVARAINLTKEQRDLIYCLQ-ANLD-------NNHRTEVRNAAKCFKNFPTDKKLI 176
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHD 297
+ +N+ +A +R N F + + +++ A+Y L NHD
Sbjct: 177 EIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVVNHD 219
Query: 298 CDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C PNA+ + + + ++ A D+ EG E+ Y AR L F C C R
Sbjct: 220 CIPNAYYTFEEKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTCKCSR 279
Query: 357 CS 358
CS
Sbjct: 280 CS 281
>gi|392590224|gb|EIW79553.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 410
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 49/119 (41%), Gaps = 9/119 (7%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA--------VGNAIYMLPSFYNHDC 298
+LA+ NAF I L E +S + E+E G +++ S +NH C
Sbjct: 288 QLLARETGNAFGIWEGDQLPEGRISIGDSEKEAEDTRDMREREMFGWGVWVSASLFNHSC 347
Query: 299 DPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
PN LRDV+EGEELRI Y R+ L + F C C RC
Sbjct: 348 APNVRKK-RSGRSLEFFTLRDVKEGEELRISYASTDKPVGERREALRTSWFFDCMCSRC 405
>gi|406867181|gb|EKD20220.1| TPR domain protein [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 709
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 40/74 (54%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G A+Y + + +NHDC PN + + R++ GEEL I Y++ + AR+
Sbjct: 493 GFAVYEVAARFNHDCRPNVRRGFTKENCIAFVTAREITRGEELTISYLNMAGMSVARRKE 552
Query: 344 LTQGFGFQCNCLRC 357
LT+ +GF+C C C
Sbjct: 553 LTRWWGFECKCDAC 566
>gi|332813421|ref|XP_515547.3| PREDICTED: SET and MYND domain-containing protein 5 isoform 4 [Pan
troglodytes]
Length = 299
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 172 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 229 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 265
>gi|302813409|ref|XP_002988390.1| hypothetical protein SELMODRAFT_447338 [Selaginella moellendorffii]
gi|300143792|gb|EFJ10480.1| hypothetical protein SELMODRAFT_447338 [Selaginella moellendorffii]
Length = 652
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 57/134 (42%), Gaps = 1/134 (0%)
Query: 224 RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV 283
+S F+ G+D E + TNVL + + A L SS A IE +
Sbjct: 354 KSLFQFLGLDSELAMEMGLFKPTNVLIHDQQQDYIWFDAVPAIAKLCSSYAFQIEDKPES 413
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
++ LP+ NH C PN + A + A R++ EEL + Y+D D R
Sbjct: 414 PMGLWTLPALINHSCVPNVSQQMVGTA-LFVRAARNIRAEEELTLPYMDVKKPLDERMYW 472
Query: 344 LTQGFGFQCNCLRC 357
+GF C CLRC
Sbjct: 473 TDSRWGFTCKCLRC 486
>gi|321472149|gb|EFX83120.1| hypothetical protein DAPPUDRAFT_2393 [Daphnia pulex]
Length = 458
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 121/284 (42%), Gaps = 27/284 (9%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHF 137
PP ++ +E+ GR + A+R I+A ++I P++ P L+TL +C C + IT + +
Sbjct: 39 PPFKIKESEAEGRYLVASRLIKAGEVILQELPLVVGPKLNTL-PLCLGCYKSITDTYR-C 96
Query: 138 QHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
N C C+++A R D + ++ SQG ++
Sbjct: 97 SRCNWPLCSAACEESALHKNGECRMIDPTLMTNHL-SQG--------------AINSQVF 141
Query: 198 DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV-LAQIRINA 256
CI L+ +L ++E L ++ G+D ++ N + L + ++
Sbjct: 142 QCITPLRYLTLPDSDRERLDELVSHLE---QRRGMDIYRLVEQNISSFLRYRLLLTQYDS 198
Query: 257 FRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD--ARL 314
I+ G+ E + I+ ++V +Y S NHDC N ++ D AD R+
Sbjct: 199 ESIQRVCGILET--NCFEIRIQGRVSV-RGLYPTASLMNHDCVANTRHVF-DPADFRIRI 254
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+A +D+ G+++ Y + R+ L F C C RC+
Sbjct: 255 LATKDIPAGDKISATYTRSLWNTLDRRLHLKSTKHFWCQCSRCA 298
>gi|395330420|gb|EJF62803.1| MAS20-domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 582
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/86 (31%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASM--ARDA 339
VG+ +Y++ S+ H CDP+ + + + +L+A R +++G+EL I Y+D
Sbjct: 458 VGSGLYLVSSYLAHSCDPSTRPSFTVGSTQLQLIANRPIKKGDELTIAYVDVGQHPGEST 517
Query: 340 RQAI------LTQGFGFQCNCLRCSS 359
QA L +G+ F+C C RC S
Sbjct: 518 EQARRRRRVELARGWKFKCECERCVS 543
>gi|351716047|gb|EHB18966.1| SET and MYND domain-containing protein 5 [Heterocephalus glaber]
Length = 420
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D+ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETAFPENNFLLHVTALEDINPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|302795965|ref|XP_002979745.1| hypothetical protein SELMODRAFT_419510 [Selaginella moellendorffii]
gi|300152505|gb|EFJ19147.1| hypothetical protein SELMODRAFT_419510 [Selaginella moellendorffii]
Length = 652
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/134 (29%), Positives = 57/134 (42%), Gaps = 1/134 (0%)
Query: 224 RSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAV 283
+S F+ G+D E + TNVL + + A L SS A IE +
Sbjct: 354 KSLFQFLGLDSELAMEMGLFKPTNVLIHDQQQDYIWFDAVPAIAKLCSSYAFQIEDKPES 413
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
++ LP+ NH C PN + A + A R++ EEL + Y+D D R
Sbjct: 414 PMGLWTLPALINHSCVPNVSQQMVGTA-LFVRAARNIRAEEELTLPYMDVKKPLDERMYW 472
Query: 344 LTQGFGFQCNCLRC 357
+GF C CLRC
Sbjct: 473 TDSRWGFTCKCLRC 486
>gi|350403870|ref|XP_003486931.1| PREDICTED: SET and MYND domain-containing protein 5-like [Bombus
impatiens]
Length = 386
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRIE---LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI+ L +Y+D+ + A + +E G+ +Y+ S NH C
Sbjct: 246 RWVKNVSALELPRDERIQVDKLIDRIYDDMEEAVAFFLNNE---GSGLYIFQSSINHSCV 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD+ EE+ I Y+D +R +RQ L+ + F C C
Sbjct: 303 PNAVVEFPYSNNVLVLKAIRDIHPEEEICISYLDECCLERSRHSRQKALSSLYLFHCYCN 362
Query: 356 RCSS 359
+C S
Sbjct: 363 KCLS 366
>gi|330931232|ref|XP_003303322.1| hypothetical protein PTT_15485 [Pyrenophora teres f. teres 0-1]
gi|311320765|gb|EFQ88594.1| hypothetical protein PTT_15485 [Pyrenophora teres f. teres 0-1]
Length = 345
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 42/79 (53%), Gaps = 4/79 (5%)
Query: 283 VGNAIYMLPSF--YNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDA 339
G+ + + P NH C PN W + + R++ A R ++ GEE + YI +M ++
Sbjct: 141 TGDKVGLFPKIARINHSCRPNTGYTWSERLNKRVVFATRKIKAGEEFFVSYISLAMPQED 200
Query: 340 RQAILTQGFGFQCNCLRCS 358
RQ L + +GF+C C C+
Sbjct: 201 RQKHLDK-YGFKCQCDACA 218
>gi|49257476|gb|AAH73806.1| SMYD family member 5 [Homo sapiens]
gi|167773713|gb|ABZ92291.1| SMYD family member 5 [synthetic construct]
gi|194374657|dbj|BAG62443.1| unnamed protein product [Homo sapiens]
Length = 302
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 172 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 229 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 265
>gi|302679838|ref|XP_003029601.1| hypothetical protein SCHCODRAFT_58188 [Schizophyllum commune H4-8]
gi|300103291|gb|EFI94698.1| hypothetical protein SCHCODRAFT_58188 [Schizophyllum commune H4-8]
Length = 361
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 54/108 (50%), Gaps = 10/108 (9%)
Query: 264 GLYEDLLSSAAASIE---------SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-R 313
G E ++++ A +IE E++ AI+M S NH C PNA W A
Sbjct: 126 GHAEGVMATNALAIELDVPTVGPNPELSTHRAIFMQASRINHSCSPNAKWEWDRKTLALT 185
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
L A+R + GEE+ I Y+D ++ R R+A L + F C+C C+ D
Sbjct: 186 LRAVRPIRAGEEITINYVDVALPRAERRARLRATYHFDCHCPACARDD 233
>gi|347832950|emb|CCD48647.1| hypothetical protein [Botryotinia fuckeliana]
Length = 403
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 33/72 (45%), Gaps = 2/72 (2%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
+Y NH C PNAH W D + A+R +E G E+ I Y D RQA
Sbjct: 231 GGVYPTSCLINHSCMPNAHNNWNSDKEHETIHAIRSIERGAEITISY-DHGGTSSERQAF 289
Query: 344 LTQGFGFQCNCL 355
L + FGF C L
Sbjct: 290 LKESFGFSCTFL 301
>gi|440800515|gb|ELR21551.1| Histone-lysine N-methyltransferase [Acanthamoeba castellanii str.
Neff]
Length = 501
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 1/74 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G ++ + + NH C+PN + + N +A ++AL D+ E EEL YI+ RQA
Sbjct: 422 GFGLFPIAAMMNHSCEPNTQVKFGRNREAVVVALCDIAEDEELTHSYIENDRPLAERQAD 481
Query: 344 LTQGFGFQCNCLRC 357
L + + F C+C+RC
Sbjct: 482 LLE-YNFVCHCVRC 494
>gi|395331093|gb|EJF63475.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 445
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 38/68 (55%), Gaps = 4/68 (5%)
Query: 295 NHDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
NH C PNA ++ + ++ALRD+ EG+E+ I Y+D ++ RQ L +GF
Sbjct: 203 NHSCVPNAACKYMLAPSEPVRMEVVALRDINEGDEVTIPYLDPALPFQTRQDALHVNYGF 262
Query: 351 QCNCLRCS 358
+C C C+
Sbjct: 263 ECGCRLCT 270
>gi|340055680|emb|CCC50001.1| conserved hypothetical protein, fragment [Trypanosoma vivax Y486]
Length = 492
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 3/78 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID--ASMARDA 339
A G A+Y S NH C P+ M + + +RD+ +GEE+R YI+ A +R+A
Sbjct: 413 ATGVALYDAASKINHSCAPSVRFM-PTHGGVQARVVRDIADGEEVRTSYIEVGAHDSREA 471
Query: 340 RQAILTQGFGFQCNCLRC 357
R+ L + +GF C+C C
Sbjct: 472 RRGYLLRHYGFNCDCPLC 489
>gi|195446904|ref|XP_002070974.1| GK25543 [Drosophila willistoni]
gi|194167059|gb|EDW81960.1| GK25543 [Drosophila willistoni]
Length = 503
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 66/291 (22%), Positives = 114/291 (39%), Gaps = 30/291 (10%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C C+
Sbjct: 17 PFKDKDPAWEIGISKIAGRGVVATRHLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCM-- 74
Query: 130 ITSSSQHFQHHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLAC 188
I F CG VC AK + + D F + ++ ++ RL C
Sbjct: 75 ILLPDTRFMCRQG--CGLPVCSLCAKK---KQHKTDCDLFRSWGPNEPEVANSVIIRLLC 129
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
+ + + D D++ L L FKK D + ++ +N+
Sbjct: 130 VARAINLNKDQRDLI--YCLQANLDNNHRTEVRNAAKCFKKFPTDKKVIEIMNR-----T 182
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWID 308
+A IR N F + + +++ A+Y L NHDC PN++ + +
Sbjct: 183 VAVIRTNGFD------------KTTDRTNDNQEFNYRALYPLFGVMNHDCIPNSYYTFEE 230
Query: 309 NADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ ++ A D+ EG E+ Y AR L F C C RCS
Sbjct: 231 KTNNMIVRAAVDIPEGFEITTTYTKLFTGNIARHLFLKMKKNFTCKCPRCS 281
>gi|119480927|ref|XP_001260492.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
gi|119408646|gb|EAW18595.1| SET and MYND domain protein, putative [Neosartorya fischeri NRRL
181]
Length = 544
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 44/77 (57%), Gaps = 3/77 (3%)
Query: 286 AIYMLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
Y+ P + NH CD N+ ++ D + ALR +++GEE+ + YIDA+ RQ
Sbjct: 252 GFYLHPYAALINHSCDYNS-VVGSDGDALYVKALRPIQKGEEILVSYIDATNPCKTRQTE 310
Query: 344 LTQGFGFQCNCLRCSSG 360
L++ + F+C C +C+ G
Sbjct: 311 LSERYYFKCRCPKCAKG 327
>gi|281208760|gb|EFA82935.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 422
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDA 339
E G AIY+ S +NH C PNA + + ++A+RD+E EE+ I Y++ +
Sbjct: 248 EECFGLAIYLEASLFNHSCYPNAARVQRGRS-IDIIAIRDIEPNEEICISYLNITNGSHE 306
Query: 340 RQAILTQGFGFQCNCLRCSSGD 361
R+ L + F C C+RC+ +
Sbjct: 307 RKDHLKNNYLFDCVCIRCTQTN 328
>gi|426335970|ref|XP_004029477.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Gorilla gorilla gorilla]
Length = 303
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 172 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 229 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 265
>gi|397473489|ref|XP_003808243.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2 [Pan
paniscus]
Length = 304
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 172 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 229 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 265
>gi|260788416|ref|XP_002589246.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
gi|229274421|gb|EEN45257.1| hypothetical protein BRAFLDRAFT_74594 [Branchiostoma floridae]
Length = 778
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 285 NAIYMLPSFYNHDCDPNAHIM--WIDNADARLM-ALRDVEEGEELRICYIDASMARDARQ 341
+Y L S NH CD N +M +D ++ A + +++G EL YID SM R R+
Sbjct: 645 GGLYALQSSLNHSCDKNVDVMDAVVDGKPGVVIRAKQPIKKGGELYTTYIDTSMQRPQRR 704
Query: 342 AILTQGFGFQCNCLRCS-SGD 361
A L + + F C C RC GD
Sbjct: 705 AWLYRAYHFWCECQRCKYEGD 725
>gi|313234749|emb|CBY24692.1| unnamed protein product [Oikopleura dioica]
Length = 475
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 59/126 (46%), Gaps = 5/126 (3%)
Query: 237 MKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIA-VGNAIYMLPSFYN 295
MKF N W + +R + ++L + + A ES I +G+A+Y+ S N
Sbjct: 153 MKF-NLYWKKPIPGSLRSSRKLLDLVAAIKNNQF--AICDEESSICDIGSALYLNHSLIN 209
Query: 296 HDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCL 355
H C PNA ++ + + + AL + GEE++I Y D R+ L + F C C
Sbjct: 210 HSCKPNAFPVF-NGTNLVIKALEKIAPGEEIKIAYTDTKAVIQDRRDYLNDIYRFVCQCQ 268
Query: 356 RCSSGD 361
C++ D
Sbjct: 269 GCTNDD 274
>gi|281208900|gb|EFA83075.1| hypothetical protein PPL_03863 [Polysphondylium pallidum PN500]
Length = 470
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/126 (26%), Positives = 51/126 (40%), Gaps = 18/126 (14%)
Query: 232 IDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP 291
ID + F KQ N +N+F AS ++ I V N +
Sbjct: 211 IDPTLLSFFTKQEIINTACATMVNSF-----------------ASTKNSIIVSNGFFYEA 253
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
+ NH C PN I N ++ ++D+E+G ++ Y+D RQ +L Q F
Sbjct: 254 ALLNHSCQPNT-FYSIQNNQLQMRCIQDIEQGGDIFDSYVDLLEPTFERQQVLLQSKHFY 312
Query: 352 CNCLRC 357
C C RC
Sbjct: 313 CRCERC 318
>gi|392563243|gb|EIW56422.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 430
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 43/79 (54%), Gaps = 6/79 (7%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDAR 340
+Y+L S NH C PN + +D A L+A RD+ GEEL I Y++ + + R
Sbjct: 318 GGLYVLHSHINHSCAPNVSVRHLDQRTALSRITLLARRDIAPGEELLITYVNPELPLEQR 377
Query: 341 QAILTQ-GFGFQCNCLRCS 358
+ L + GFG +C C RC+
Sbjct: 378 RRQLMEWGFG-KCMCERCT 395
>gi|212722234|ref|NP_001131420.1| uncharacterized protein LOC100192749 [Zea mays]
gi|194691468|gb|ACF79818.1| unknown [Zea mays]
gi|413933330|gb|AFW67881.1| histone-lysine N-methyltransferase ASHR1 [Zea mays]
Length = 482
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + S NH C PNA ++ D A + AL+ + + EE+ I YI+ + R
Sbjct: 205 LGTGLYPVISIINHSCVPNA-VLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNN 263
Query: 343 ILTQGFGFQCNCLRCSSG 360
L Q F F C C RC G
Sbjct: 264 DLKQYF-FTCTCPRCVKG 280
>gi|90077650|dbj|BAE88505.1| unnamed protein product [Macaca fascicularis]
Length = 418
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ ++L S NH C PNA + +N + AL D++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGFFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|363755786|ref|XP_003648109.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
gi|356892145|gb|AET41292.1| hypothetical protein Ecym_7475 [Eremothecium cymbalariae
DBVPG#7215]
Length = 487
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+ IY L S NH C+PN + + D + A +D+ GEEL Y++ ++R+
Sbjct: 335 INGQIYPLFSHINHSCEPNTYFEF-DKHGIKAFARKDIAAGEELLTTYVNPLHDVNSRRR 393
Query: 343 ILTQGFGFQCNCLRC 357
L +GF CNC RC
Sbjct: 394 ELCVNWGFLCNCRRC 408
>gi|302686770|ref|XP_003033065.1| hypothetical protein SCHCODRAFT_107558 [Schizophyllum commune H4-8]
gi|300106759|gb|EFI98162.1| hypothetical protein SCHCODRAFT_107558, partial [Schizophyllum
commune H4-8]
Length = 422
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 42/86 (48%), Gaps = 4/86 (4%)
Query: 278 ESEIAVGN--AIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDAS 334
E+E A G A+Y S NH C PNAH + + RL ALRD++ GEE+ I Y+
Sbjct: 240 ETEGADGRYVAVYETLSRANHSCRPNAHFAFHKPSFSVRLRALRDIKAGEEILISYVPPE 299
Query: 335 MARDARQAILTQGFGFQCNCLRCSSG 360
RQ L +G C C C G
Sbjct: 300 APYAQRQEELAH-YGLSCACGVCDEG 324
>gi|297845524|ref|XP_002890643.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336485|gb|EFH66902.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 976
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 3/93 (3%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED +S+ E G ++ L SF NH C PNA + + + + A RD++ GE
Sbjct: 743 LVEDAVSAKVMGKNKEY-YGVGLWTLASFINHSCIPNARRLHVGDY-VIVHASRDIKAGE 800
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + R+ + + +GF+C C RC
Sbjct: 801 EITFAYFDVLSPLEKRKE-MAESWGFRCGCSRC 832
>gi|401427626|ref|XP_003878296.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494544|emb|CBZ29846.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 889
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 46/98 (46%), Gaps = 11/98 (11%)
Query: 272 SAAASIESEIAVGNAIYMLPSFYNHDCDPNA-----------HIMWIDNADARLMALRDV 320
+A AS+ + A++ NH C PNA H D A L+ALR +
Sbjct: 496 AACASLSANDTSVKALFSFLRHLNHACVPNAILVLDRTPAHPHRSEDDGVVASLVALRAI 555
Query: 321 EEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E GEE+ + Y+ A+ A Q L++ GF+C C C+
Sbjct: 556 ESGEEITVSYVPATTALTVSQTELSETLGFRCRCHLCT 593
>gi|413933331|gb|AFW67882.1| hypothetical protein ZEAMMB73_219813 [Zea mays]
Length = 443
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G +Y + S NH C PNA ++ D A + AL+ + + EE+ I YI+ + R
Sbjct: 205 LGTGLYPVISIINHSCVPNA-VLIFDGRTAYVRALQPINKDEEVSISYIETATVTKKRNN 263
Query: 343 ILTQGFGFQCNCLRCSSG 360
L Q F F C C RC G
Sbjct: 264 DLKQYF-FTCTCPRCVKG 280
>gi|395841250|ref|XP_003793459.1| PREDICTED: SET and MYND domain-containing protein 5 isoform 2
[Otolemur garnettii]
Length = 300
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 172 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFFLHVTALEDIKPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 229 EEICISYLDCCQRERSRHSRHKILRENYLFICSCPKC 265
>gi|82594042|ref|XP_725259.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23480195|gb|EAA16824.1| Homo sapiens HSKM-B [Plasmodium yoelii yoelii]
Length = 511
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 119/302 (39%), Gaps = 51/302 (16%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHP-------------TLSTLN----SVC 123
++ TE G+ + A+ +IR+ I + P I P TL N ++C
Sbjct: 3 KIEYTEDKGKCIVASTQIRSGYCIVESHPEIAIPLCVKFMAPRIVDSTLKKNNYKTINIC 62
Query: 124 YFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWS-AFNDYCRSQGLKYPLL 182
++C K+ +C + C + A F+ E S F+ YC + ++ L+
Sbjct: 63 FYCFEKVNKCIYCPNCKYVAYCSDSCLERAWKFHREECDIYKSNIFDRYCPTITMR--LV 120
Query: 183 VKRLACMIISGAESADCIDILQPASLS---PELILAMEEGFVMLRSAFKKAGIDDEQMKF 239
+ S D+ + + P I+A V L S KK + ++
Sbjct: 121 IHSYLTHFNFYDYSGTITDLTKEKYENLKYPAYIVA-----VALMSKKKKIFSNFDE--- 172
Query: 240 LNKQWYTNVLAQ-IRINAFRIELAGGLYEDLLSSAAASIESEIA-VGNAIYMLP-SFYNH 296
NK NV+ + I+++ +++ I++E+ G Y P ++NH
Sbjct: 173 -NKNILKNVIEKFIKVSKNTLQI---------------IDNELEPCGLGFYKKPVPYFNH 216
Query: 297 DCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLR 356
C N I N + L D+ GEEL I Y+D + R+ R AI T + F C C
Sbjct: 217 SCLSNC-ITIFKNQKLYIRTLMDIYPGEELTISYLDIAFDRNTRLAICTDQYFFTCTCKL 275
Query: 357 CS 358
C
Sbjct: 276 CK 277
>gi|348566513|ref|XP_003469046.1| PREDICTED: SET and MYND domain-containing protein 5-like [Cavia
porcellus]
Length = 416
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D+ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDISPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|426226456|ref|XP_004007359.1| PREDICTED: SET and MYND domain-containing protein 5, partial [Ovis
aries]
Length = 391
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 52/97 (53%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 262 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 318
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
E L I Y+D +R +R IL + + F C+C +C
Sbjct: 319 ERLGISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 355
>gi|90399291|emb|CAJ86213.1| H0323C08.3 [Oryza sativa Indica Group]
gi|116312063|emb|CAJ86427.1| H0303G06.16 [Oryza sativa Indica Group]
Length = 531
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 450 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 509
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 510 QAELAD-YGFICTCPRC 525
>gi|51859341|gb|AAH81479.1| SMYD family member 5 [Danio rerio]
Length = 300
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + E G+ +++L S NH C PNA + DN L AL D+ G
Sbjct: 172 LYKDIDKETGDFLNCE---GSGLFLLQSSCNHSCVPNAEASFPDNNFLLHLTALGDIGPG 228
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F C+C +C S
Sbjct: 229 EEICISYLDCCQRDRSRHSRHKILRENYLFICSCQKCLS 267
>gi|322704705|gb|EFY96297.1| SET domain-containing protein 5 [Metarhizium anisopliae ARSEF 23]
Length = 451
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 48/102 (47%), Gaps = 5/102 (4%)
Query: 262 AGGLYEDLLSSAAASIESEIAVGN----AIYMLPSFYNHDCDPNAHIMWIDNADARLM-A 316
A +E+++ +E+ N I++ NH CD NA W N + A
Sbjct: 70 AARTFENVVQKYLEIVETNALPANDGEYGIFLQACRINHACDYNALNNWNKNIKQHTVHA 129
Query: 317 LRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
LRD+E+GEE+ I YI R R L + FGF C+C CS
Sbjct: 130 LRDIEKGEEITIYYIGRYWNRVTRIRTLQERFGFICSCGLCS 171
>gi|355720921|gb|AES07094.1| SMYD family member 5 [Mustela putorius furo]
Length = 159
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 30 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEDIKPG 86
Query: 324 EELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 87 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 123
>gi|326919601|ref|XP_003206068.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 1
[Meleagris gallopavo]
Length = 486
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 116/310 (37%), Gaps = 67/310 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
++V TE GRG+ A + D+I A+P +L V C+ C ++ + Q
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIF-AEPAYAAVVFDSLTHVVCHTCFKRQEKLHRCGQ 62
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMIISGAE 195
A +C C+ +A W + C + G K P RLA I+ E
Sbjct: 63 CKFAYYCDRTCQRDA-----------WLNHKNECSAIKKHG-KAPTENIRLAARILWRIE 110
Query: 196 SADCIDILQPASLSPELILAME------EGFVMLRSAFKKAGIDDE--------QMKFLN 241
+ + LS +++++ E F K ID E Q +
Sbjct: 111 R-------EGSGLSENCLVSIDDLQNHVENFD--EEEKKDLRIDVESFLEFWPAQSQQFG 161
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++ I NAF + GL AVG I+ NHDC PN
Sbjct: 162 MQYISHIFGVINCNAFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPN 206
Query: 302 AHIMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+++ + N +A L AL + G+EL + Y+D + RQ L + +
Sbjct: 207 CTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERQKQLKKQYY 266
Query: 350 FQCNCLRCSS 359
F C C C
Sbjct: 267 FDCTCEHCKK 276
>gi|361126286|gb|EHK98295.1| putative N-lysine methyltransferase SMYD2 [Glarea lozoyensis 74030]
Length = 351
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 63/139 (45%), Gaps = 5/139 (3%)
Query: 226 AFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGN 285
AF++ GI D F N W ++ + G + + +++ A + E E
Sbjct: 125 AFQEMGIHDNINDFSNPPWELPEESKKILLRTAAHNPGDMIMERINTNAFAGEFEGMNHF 184
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM---ALRDVEEGEELRICYIDASMARDARQA 342
+Y + NHDC PNA M+ N + A R + GEE+ I Y++ + + RQ
Sbjct: 185 FLYPETALMNHDCRPNA--MYYHNISTLVHSAHASRTINIGEEITITYLNLLQSNNERQE 242
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L +GF C+C CS+ +
Sbjct: 243 TLKMIWGFDCDCKLCSASE 261
>gi|115460730|ref|NP_001053965.1| Os04g0629100 [Oryza sativa Japonica Group]
gi|113565536|dbj|BAF15879.1| Os04g0629100 [Oryza sativa Japonica Group]
gi|215686761|dbj|BAG89611.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 517
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 436 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 495
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 496 QAELAD-YGFICTCPRC 511
>gi|353234811|emb|CCA66832.1| hypothetical protein PIIN_00594 [Piriformospora indica DSM 11827]
Length = 456
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 37/67 (55%), Gaps = 3/67 (4%)
Query: 294 YNHDCDPNAHIMWI---DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+NH C PNA ++ A + AL ++E EE+ I Y+D +++ RQA L +GF
Sbjct: 219 FNHSCSPNAWPAFVLEQRQAWLEIRALISIKESEEITIPYLDPALSLPERQARLKATYGF 278
Query: 351 QCNCLRC 357
C C RC
Sbjct: 279 DCTCSRC 285
>gi|449544027|gb|EMD35001.1| hypothetical protein CERSUDRAFT_54156, partial [Ceriporiopsis
subvermispora B]
Length = 310
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 41/83 (49%), Gaps = 5/83 (6%)
Query: 281 IAVGNAIYMLPSF--YNHDCDP--NAHIMWIDNADARLM-ALRDVEEGEELRICYIDASM 335
++ G + + P NH C N+ W D ++ AL+ V+EGEEL Y D
Sbjct: 95 VSAGQHVGIFPQMARLNHGCSSAFNSVYTWRDREGMIVVHALKPVKEGEELLTTYTDTKR 154
Query: 336 ARDARQAILTQGFGFQCNCLRCS 358
RDAR+ L +GF+C C CS
Sbjct: 155 PRDARRQFLEAHYGFRCGCSVCS 177
>gi|409076853|gb|EKM77222.1| hypothetical protein AGABI1DRAFT_108338 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 486
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 47/105 (44%), Gaps = 10/105 (9%)
Query: 264 GLYEDLLSSAAASI---------ESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADAR 313
YED++ + + S+ E++ A++ + S NH C PNA W
Sbjct: 204 SFYEDIMQTNSISVNLHIPSDIPHPELSTHRALFFMLSRCNHSCSPNAQWSWDALTLTLT 263
Query: 314 LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ ALR + GEE+ I YI RQ IL +GF C C C+
Sbjct: 264 VTALRPIATGEEITISYIPLYSDPTLRQQILKDAYGFDCVCDECT 308
>gi|310793975|gb|EFQ29436.1| TPR domain-containing protein [Glomerella graminicola M1.001]
Length = 293
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 2/66 (3%)
Query: 295 NHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMA-RDARQAILTQGFGFQC 352
NH C PN+ + +A + A RD+ + EE+ I Y+D + RD+RQ+ L G+GF C
Sbjct: 119 NHSCVPNSQGNFNQTLNAFTIHATRDIGKDEEITISYLDEHLGLRDSRQSALQDGYGFVC 178
Query: 353 NCLRCS 358
+C C+
Sbjct: 179 DCSACN 184
>gi|397623632|gb|EJK67077.1| hypothetical protein THAOC_11935 [Thalassiosira oceanica]
Length = 340
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 84/216 (38%), Gaps = 33/216 (15%)
Query: 152 NAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPE 211
+AF+ VE ++W Y LL + G E + +++ + +L +
Sbjct: 2 EGQAFFRVEYFSEWEGHGCVATRALANYTLLHTEEP--FLRGPEISRALELHEDGTLKSQ 59
Query: 212 LILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQW---YTNVLAQIRINAFRIELAGGLYED 268
L + +R KA D +++ L+ Q+ + IR N+F E
Sbjct: 60 L-----DDTAFMREECGKAQEDIDELWILHDQYADREQRLYGIIRSNSFENE-------- 106
Query: 269 LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRI 328
E+A +Y+ S +NH C PN + D RL LR + +G+ L I
Sbjct: 107 -----------EMAYQKRLYLKTSRFNHSCTPNVGYDF-DGWKIRLYTLRGISKGDTLCI 154
Query: 329 CYIDASM---ARDARQAILTQGFGFQCNCLRCSSGD 361
Y+D + RD R+ L F C C C+ D
Sbjct: 155 SYVDEVLYYFPRDRRRLYLRAALNFDCACPTCAMED 190
>gi|325189974|emb|CCA24457.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 465
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPN-AHIMWIDNADARLMALRDVEEGEELRICYIDASMA 336
E+ VG Y L + NH CDPN I +A L A++ + + EE+ I Y+D +
Sbjct: 192 ENFFEVGAGCYPLGAMVNHSCDPNCVTIFARGSAQLELWAMKSIGKDEEVTISYVDPANC 251
Query: 337 RDARQAILTQGFGFQCNCLRC 357
+ R+ L + + F C C RC
Sbjct: 252 MNKRRKYLQKRYHFDCRCQRC 272
>gi|440635519|gb|ELR05438.1| hypothetical protein GMDG_01733 [Geomyces destructans 20631-21]
Length = 505
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 1/93 (1%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L+ +L++ E + G + ++ SF NH CDPNA + + + + R+ +L+ + G+
Sbjct: 196 LFCKVLTNVMCINHQEGSPGITLDLVGSFVNHSCDPNAFV-FFEGSQLRMRSLKPINAGD 254
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ + Y + R+ L +G+ F C C RC
Sbjct: 255 EITLTYTELREGVLMRKRKLHRGYSFFCRCNRC 287
>gi|452001299|gb|EMD93759.1| hypothetical protein COCHEDRAFT_1095098 [Cochliobolus
heterostrophus C5]
Length = 345
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 4/79 (5%)
Query: 283 VGNAIYMLPSF--YNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDA 339
G+ + P NH C PN W + + ++ A R ++ GEE + YI +A++
Sbjct: 141 TGDKFSLFPRIARINHSCRPNTSYYWSEKLNQHIVFASRKIKAGEEFSVSYISLLLAQED 200
Query: 340 RQAILTQGFGFQCNCLRCS 358
RQ +L Q +GF+C C C+
Sbjct: 201 RQKLLDQ-YGFKCQCEACA 218
>gi|342182559|emb|CCC92038.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 712
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 43/80 (53%), Gaps = 3/80 (3%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPN-AHIMWIDNADARLMALRDVEEGEELRICYIDASMARD 338
E +G +I+ S++NH C PN +M+ N A AL+ + +GE L ICY+D A
Sbjct: 569 ESCIGVSIFPEASYFNHSCLPNLCRVMYRGNI-AAFYALQSIRKGEPLTICYVDVQEAST 627
Query: 339 A-RQAILTQGFGFQCNCLRC 357
A R+ L + F C C RC
Sbjct: 628 AERRRTLLTSYRFFCECRRC 647
>gi|255084167|ref|XP_002508658.1| hypothetical protein MICPUN_61723 [Micromonas sp. RCC299]
gi|226523935|gb|ACO69916.1| hypothetical protein MICPUN_61723 [Micromonas sp. RCC299]
Length = 621
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 279 SEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICY--IDASMA 336
+E AV +A+++ S NH C+PN + + R+ V+ G +L I Y +
Sbjct: 367 AECAVASALFLDASMLNHSCEPNCFASF-PGREMRIRNTEKVQSGGQLFISYGPVAGGAP 425
Query: 337 RDARQAILTQGFGFQCNCLRCSSGD 361
RD R+++L+ FGF+C C+ C D
Sbjct: 426 RDVRRSLLSDAFGFECKCVACVGRD 450
>gi|261330291|emb|CBH13275.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 399
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQA 342
G IY L S +NH C PN ++ D L LR ++ GEEL I YI A +
Sbjct: 321 GGGIYSLQSAFNHSCVPNVAVLAEGGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQ 380
Query: 343 ILTQGFGFQCNCLRC 357
+ +G+ F C C C
Sbjct: 381 MKLEGYFFTCRCPLC 395
>gi|313217681|emb|CBY38723.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG+A+Y+ + NH C PNA+ + + + ALR +E GEE+ Y D R+
Sbjct: 50 VGSAVYIDHALVNHSCRPNAYPV-FNKTNMIFKALRKIEPGEEITHAYTDTISPIQERRE 108
Query: 343 ILTQGFGFQCNCLRCSSGD 361
L + F CNC C+ +
Sbjct: 109 YLNDVWRFMCNCPGCTKSN 127
>gi|39546242|emb|CAE04251.3| OSJNBa0089N06.12 [Oryza sativa Japonica Group]
Length = 463
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 382 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 441
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 442 QAELAD-YGFICTCPRC 457
>gi|449541748|gb|EMD32730.1| hypothetical protein CERSUDRAFT_118469 [Ceriporiopsis subvermispora
B]
Length = 523
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 55/113 (48%), Gaps = 8/113 (7%)
Query: 248 VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
++ ++R N F AG +++ S A E ++ A+ S NH C PN H+ +
Sbjct: 241 LVGRMRTNGFN---AG---KEMKSIAPTMPEGDMGRYTAVGSTLSRLNHSCSPNGHVDFD 294
Query: 308 DNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+L A+RD+ GE++ I Y D +++ RQ L + F F C C C +
Sbjct: 295 PGTLSLQLTAMRDIAAGEQITISYCDVFLSQAERQKSLKR-FDFTCTCRSCKN 346
>gi|328866939|gb|EGG15322.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 524
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 1/77 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIM-WIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
G+ +Y+ + NH C PNA I N A + ++ D+ G ++ I YI+ RQ+
Sbjct: 444 GSGLYVYANSCNHSCSPNAAITNESTNFSATIRSITDIPNGNQIEISYIEEDQPSQTRQS 503
Query: 343 ILTQGFGFQCNCLRCSS 359
L + F+C+C +C++
Sbjct: 504 ELIDKYKFKCHCQKCTN 520
>gi|451849242|gb|EMD62546.1| hypothetical protein COCSADRAFT_342827 [Cochliobolus sativus
ND90Pr]
Length = 345
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 4/79 (5%)
Query: 283 VGNAIYMLPSF--YNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDA 339
G+ + P NH C PN W + + ++ A R ++ GEE + YI +A++
Sbjct: 141 TGDKFSLFPRIARINHSCRPNTSYYWSERLNKHIVFASRKIKAGEEFSVSYISLLLAQED 200
Query: 340 RQAILTQGFGFQCNCLRCS 358
RQ +L Q +GF+C C C+
Sbjct: 201 RQKLLDQ-YGFKCQCEACA 218
>gi|290998093|ref|XP_002681615.1| SET domain-containing protein [Naegleria gruberi]
gi|284095240|gb|EFC48871.1| SET domain-containing protein [Naegleria gruberi]
Length = 430
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRD---VEEGEELRICYIDASMARD 338
A+G+ IY+ S +NH C PN I++ + + + L +EEG L I Y+D
Sbjct: 197 AIGSGIYLKASMFNHSCVPNCAILFDSDKNLYVRILNPSSLLEEGTPLTINYVDLMDLTA 256
Query: 339 ARQAILTQGFGFQCNCLRC 357
RQ L + + F C C RC
Sbjct: 257 NRQKKLKEQYHFTCTCPRC 275
>gi|45383804|ref|NP_989486.1| SET and MYND domain-containing protein 1 [Gallus gallus]
gi|16930387|gb|AAL31880.1|AF410781_1 cardiac and skeletal muscle-specific BOP1 [Gallus gallus]
Length = 486
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 115/310 (37%), Gaps = 67/310 (21%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
++V TE GRG+ A + D+I A+P +L V C+ C ++ + Q
Sbjct: 4 VEVFTTEGKGRGLKAQKEFLPGDVIF-AEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQ 62
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYC---RSQGLKYPLLVKRLACMII---- 191
A +C C+ +A W + C + G K P RLA I+
Sbjct: 63 CKFAYYCDRTCQRDA-----------WLNHKNECSAIKKHG-KAPTENIRLAARILWRIE 110
Query: 192 --SGAESADC---IDILQP--ASLSPELILAME---EGFVMLRSAFKKAGIDDEQMKFLN 241
G S +C ID LQ S E + E F+ A Q +
Sbjct: 111 REGGGLSENCLVSIDDLQNHVESFDEEEKKDLRVDVESFLEFWPA---------QSQQFG 161
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q+ +++ I NAF + GL AVG I+ NHDC PN
Sbjct: 162 MQYISHIFGVINCNAFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPN 206
Query: 302 AHIMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
+++ + N +A L AL + G+EL + Y+D + R+ L + +
Sbjct: 207 CTVIFNNGNHEAVRSMFHTQMRIELRALSKISPGDELTVSYVDFLNVSEERRKQLKKQYY 266
Query: 350 FQCNCLRCSS 359
F C C C
Sbjct: 267 FDCTCEHCKK 276
>gi|71001320|ref|XP_755341.1| SET and MYND domain protein [Aspergillus fumigatus Af293]
gi|66852979|gb|EAL93303.1| SET and MYND domain protein, putative [Aspergillus fumigatus Af293]
Length = 544
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
Query: 264 GLYEDLLSSAAASIE-------SEIAVGNAIYMLP--SFYNHDCDPNAHIMWIDNADARL 314
L E++L A +E S +A Y+ P + NH CD N+ ++ D +
Sbjct: 223 ALDEEMLCHYGAKLELNSFNFHSPLADRLGFYLHPYAALINHSCDYNS-VVGSDGDALYV 281
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
ALR + +GEE+ + YIDA+ RQ L + + F+C C +C G
Sbjct: 282 KALRPIRKGEEILVSYIDATNPYKIRQTELLERYYFKCRCPKCGKG 327
>gi|398404390|ref|XP_003853661.1| hypothetical protein MYCGRDRAFT_31225, partial [Zymoseptoria
tritici IPO323]
gi|339473544|gb|EGP88637.1| hypothetical protein MYCGRDRAFT_31225 [Zymoseptoria tritici IPO323]
Length = 63
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH CDPN ++ + L ALR +++G+E+ + Y+D + RQA L++ + F C C
Sbjct: 2 NHSCDPNTAAIF-NQPRQLLRALRPIKKGDEITMKYVDITNPFSVRQAELSEAYFFGCKC 60
Query: 355 LRC 357
+RC
Sbjct: 61 IRC 63
>gi|320168148|gb|EFW45047.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 522
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/72 (37%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
A++ SF+NH CDP+ + D + R + GEEL I YID + RQ L
Sbjct: 349 ALFPSASFFNHSCDPSCD-RYQDKFLLSIATRRPIAAGEELSISYIDVNAPCRTRQHELL 407
Query: 346 QGFGFQCNCLRC 357
+ FQC+C RC
Sbjct: 408 DSYHFQCSCTRC 419
>gi|330917311|ref|XP_003297760.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
gi|311329388|gb|EFQ94158.1| hypothetical protein PTT_08278 [Pyrenophora teres f. teres 0-1]
Length = 532
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 6/106 (5%)
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSF--YNHDCDPNAHIMWIDNAD 311
N F + +Y ++S+A I + +G I + P+ NH CDPNA IM +D
Sbjct: 192 NMFDRDFIAAMYARVMSNALTIITPTLDPLG--IILDPTLCSLNHSCDPNAFIM-MDGPS 248
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ LR + + +E+ I YID + RQ L + F C C +C
Sbjct: 249 VSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRCAKC 294
>gi|218195644|gb|EEC78071.1| hypothetical protein OsI_17537 [Oryza sativa Indica Group]
Length = 472
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 391 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 450
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 451 QAELAD-YGFICTCPRC 466
>gi|222629611|gb|EEE61743.1| hypothetical protein OsJ_16269 [Oryza sativa Japonica Group]
Length = 472
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 391 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 450
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 451 QAELAD-YGFICTCPRC 466
>gi|72392603|ref|XP_847102.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175616|gb|AAX69749.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803132|gb|AAZ13036.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 399
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASMARDARQA 342
G IY L S +NH C PN ++ D L LR ++ GEEL I YI A +
Sbjct: 321 GGGIYSLQSAFNHSCVPNVAVLAEGGTHDITLRTLRAIKNGEELTITYIPVENTTRAERQ 380
Query: 343 ILTQGFGFQCNCLRC 357
+ +G+ F C C C
Sbjct: 381 MKLEGYFFTCRCPLC 395
>gi|401837625|gb|EJT41530.1| SET5-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 524
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 82/372 (22%), Positives = 136/372 (36%), Gaps = 58/372 (15%)
Query: 36 FWKMCLGRYSRCLISR-LQSLHLQKRQLCSTATHNGKPSQPSPPPIQVALTESAGRGVFA 94
WK+ R+ L L L+ L H K + L + GRG+FA
Sbjct: 68 MWKLSASRFREILNQHHLYDTDLETVSLYKENIHFPKILDSNAKVKVNFLDDERGRGLFA 127
Query: 95 TRRIRASDLI-HTAKPIITHPTLSTL-----NSVCYFCLRKITSSSQH--FQHH------ 140
+ +I KPI+ P L L C C + + +QH H+
Sbjct: 128 EKDFAKGQIILKENKPIVYVPPLDKLFFISNGKACARCGKALYDLTQHKIMVHYLDCEVC 187
Query: 141 NARFCGEVCKDNAKAFY------------DVERRADWSAFNDYCR--------SQGLKYP 180
A +C E CK + D+ +W F +YC S GL Y
Sbjct: 188 KAIWCSEKCKKAHAPLHELLYHSWRSNRIDILHAGNWKRFVNYCEKYCFTAAFSIGLIYG 247
Query: 181 LL-------VKRLACMIISGAESA-----DCIDILQPASLSPELILAMEEGFVMLRSAFK 228
+ V+ + S ++ D I +L E+GF
Sbjct: 248 SMLLDTTGEVREQWNKLASVSQRVRIKLRDASGIGSTFNLMNGTTAHTEDGFDNNSKMSA 307
Query: 229 KAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIY 288
+ IDDE + ++ Y + + I+L E L+ ++ +Y
Sbjct: 308 EENIDDETVW---EKCYESFCGAFPKASEEIDL-----EKFLTMIGTFNINQ--YNGQMY 357
Query: 289 MLPSFYNHDCDPNAHIMWID-NADARLMALRDVEEGEELRICYIDASMARDARQAILTQG 347
+ SF NHDC+PN +I + + + +L A + +++GE++ I Y++ R+ L
Sbjct: 358 LWISFINHDCEPNVYIEQTEEHEELKLHARKPIKKGEQICITYVNPLHGVRLRRRELRVN 417
Query: 348 FGFQCNCLRCSS 359
+GF C C RC +
Sbjct: 418 WGFLCQCDRCQN 429
>gi|321449930|gb|EFX62153.1| hypothetical protein DAPPUDRAFT_120473 [Daphnia pulex]
Length = 510
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 110/295 (37%), Gaps = 64/295 (21%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRKITSSSQH 136
P ++ E GR +FA++ ++ ++I +IT P CY L ++ +S
Sbjct: 48 PYKIVHDEKFGRCIFASKNLKPGEIIFGETAVITGPKQGCTPCCLKCYASLDRVQEASL- 106
Query: 137 FQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAES 196
F+ N F +C+ Q K P C+I+S A+S
Sbjct: 107 FRCPNCNF-------------------------PFCQEQCAKSP--EHEAECLILSRAKS 139
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAF-------------KKAGIDDEQMKFLNKQ 243
CI I + PE + + +S K G+ DE F ++
Sbjct: 140 --CIVINDVHRIHPEYKMLENHNDLRRQSDMWRIYQVNVVQFLRKICGLADE---FSEEE 194
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ + I +NAF I LAG Y+ +L ++ L S +H+C N
Sbjct: 195 IHASC-GVIDVNAFEIRLAGNQYQQVL---------------GVFPLASMMSHNCVANTQ 238
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ N + A + +GE++ Y R+ +L F+C+C RCS
Sbjct: 239 HVIDANYTMTVRASVPIMKGEQIFTSYTLPLEGTKERRDVLRHSKLFECDCSRCS 293
>gi|194890613|ref|XP_001977353.1| GG18992 [Drosophila erecta]
gi|190649002|gb|EDV46280.1| GG18992 [Drosophila erecta]
Length = 503
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 123/306 (40%), Gaps = 60/306 (19%)
Query: 72 PSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV--CYFCLRK 129
P + P ++ +++ AGRGV ATR ++ ++I P++ +S+ C CL+
Sbjct: 17 PFKDKDPAWEIGVSKIAGRGVVATRSLKRGEIIFRDSPLLIGLAAHEEDSLNACSVCLKM 76
Query: 130 ITSSSQHFQHHNARF-----CG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYP--- 180
+ + RF CG VC AK + ++D D +S G P
Sbjct: 77 LPDT---------RFMCRQGCGLPVCSLCAKK---KQHKSDC----DLLKSWGPNEPDVA 120
Query: 181 -LLVKRLACMIIS---GAESADCIDILQPASLSPELILAMEEGFVMLRSA---FKKAGID 233
++ RL C+ + E D I LQ A+L +R+A FK D
Sbjct: 121 NSVIIRLLCVARALNLSKEQRDLIYCLQ-ANLD-------NNHRTEVRNAAKCFKNFPTD 172
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSF 293
+ ++ +N+ +A +R N F + + +++ A+Y L
Sbjct: 173 KKIIEIMNR-----TVAVLRTNGFD------------KTTDRTNDNQEFNYRALYPLFGV 215
Query: 294 YNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
NHDC PNA+ + D + ++ A D+ EG E+ Y AR L F C
Sbjct: 216 VNHDCIPNAYYTFEDKTNNMIVRAAVDIPEGFEVTTTYTKLFTGNIARHLFLKMKKSFTC 275
Query: 353 NCLRCS 358
C RCS
Sbjct: 276 KCSRCS 281
>gi|189197821|ref|XP_001935248.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187981196|gb|EDU47822.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 345
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 36/65 (55%), Gaps = 2/65 (3%)
Query: 295 NHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C PN W + R++ A R ++ GEE + YI +M ++ RQ L + +GF+C
Sbjct: 155 NHSCRPNTGYTWSKRLNKRVVFATRKIKAGEEFFVSYISLAMPQEDRQKHLNK-YGFKCQ 213
Query: 354 CLRCS 358
C CS
Sbjct: 214 CDACS 218
>gi|403218380|emb|CCK72870.1| hypothetical protein KNAG_0M00170 [Kazachstania naganishii CBS
8797]
Length = 491
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 23/76 (30%), Positives = 39/76 (51%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQ 341
A+ +Y +PSF NH+C+PN A +D+++GE++ + Y + + R+
Sbjct: 335 AIAEQLYYIPSFINHNCEPNVRFEKDSRLHINFYARKDIKKGEQIFMTYCNPLHEVNLRR 394
Query: 342 AILTQGFGFQCNCLRC 357
L +GF C C RC
Sbjct: 395 RELRVNYGFLCFCDRC 410
>gi|432880358|ref|XP_004073658.1| PREDICTED: SET and MYND domain-containing protein 1-like isoform 3
[Oryzias latipes]
Length = 476
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 52/116 (44%), Gaps = 16/116 (13%)
Query: 243 QWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
++ +++ I+ N F + GL AVG ++ + NHDC PN
Sbjct: 168 EYISHIFGIIKCNGFTLTDQRGLQ---------------AVGVGLFPNLALVNHDCWPNC 212
Query: 303 HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ +++ + L + EGEEL I Y+D RQ L + F F+C+C CS
Sbjct: 213 TAI-LNHGKIEVRTLGKISEGEELTISYVDFLQLSADRQKQLKEQFHFECSCKHCS 267
>gi|400593481|gb|EJP61426.1| SET domain-containing protein 5 [Beauveria bassiana ARSEF 2860]
Length = 633
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C NA W +N + ALRD+++GEE+ I Y+ +R RQA L F F C+
Sbjct: 108 NHACSNNAQKSWNENLKRHTVHALRDIQQGEEITIFYLRTLESRSRRQANLKAKFLFTCS 167
Query: 354 CLRCSSG 360
C C+ G
Sbjct: 168 CSLCALG 174
>gi|405121568|gb|AFR96336.1| hypothetical protein CNAG_03112 [Cryptococcus neoformans var.
grubii H99]
Length = 447
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 54/116 (46%), Gaps = 17/116 (14%)
Query: 247 NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW 306
N+ + I++NAF L S + A I G +I L + NH C+PNA ++
Sbjct: 116 NLCSAIQVNAF----------TLTSPSLAPI------GMSISPLLALANHSCEPNAIAVF 159
Query: 307 IDNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
D L+AL D+ GEE+ YID S RQ+ L + F C+C C +
Sbjct: 160 PKGGRDIFLVALNDIPPGEEILTSYIDTSTPYHHRQSELLSRYRFVCHCSLCQKSE 215
>gi|171686072|ref|XP_001907977.1| hypothetical protein [Podospora anserina S mat+]
gi|170942997|emb|CAP68650.1| unnamed protein product [Podospora anserina S mat+]
Length = 483
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDC PNA + + + A+RD+ GEEL + YI+ M + AR L + +GFQC
Sbjct: 283 NHDCRPNADYYFDHETLTQYIHAIRDISPGEELTLSYINPIMKKRARNKKLNRIWGFQCA 342
Query: 354 CLRCS 358
C C+
Sbjct: 343 CPLCT 347
>gi|219112705|ref|XP_002178104.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217410989|gb|EEC50918.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 501
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 44/83 (53%), Gaps = 7/83 (8%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARL--MALRDVEEGEELRICYI-----DASMARD 338
A++ L + NH C PNA + + DAR+ +ALRD+ GEE+ I YI S +
Sbjct: 419 AVFPLTARINHSCVPNAQVQSQEFVDARIDVVALRDIAAGEEITISYIGCGRTSGSKSTS 478
Query: 339 ARQAILTQGFGFQCNCLRCSSGD 361
R+ L + F C C RC++ +
Sbjct: 479 RRRRELLAKYLFTCECPRCTAKE 501
>gi|310793444|gb|EFQ28905.1| MYND finger [Glomerella graminicola M1.001]
Length = 548
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 67/289 (23%), Positives = 109/289 (37%), Gaps = 50/289 (17%)
Query: 86 ESAGRGVFATRRIRASDLIHT-AKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH----H 140
+ GR + ATRR + D+I +P + P C CL+K +
Sbjct: 16 DKGGRSIHATRRFKPGDVIACFEEPAVVLPPGHRALEYCNHCLKKQQPGGPKLRACTGCK 75
Query: 141 NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCI 200
+CG C+ RA+WS LV +L C I
Sbjct: 76 TVAYCGPACQ-----------RANWS---------------LVHKLECKAIQRLHEIKPA 109
Query: 201 DILQPASLSPELILAMEEGFVMLRS-AFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRI 259
D +PA + + A + VMLR + G + ++ K+ T++ Q +
Sbjct: 110 D--EPAWVPTPIRAAAQ---VMLRPHVLAQFGELEGHVEQWRKKSETDLQLQSHGVVRCL 164
Query: 260 ELAGGLYEDL---------LSSAAASIESEIAVGNAIYMLPSF--YNHDCDPNAHIMWID 308
L G +E L L + A S E +++ + NH C PNA ++
Sbjct: 165 GLDMGTFEALEAAFQVLCKLQTNAFSRSEEYYETGGVFLDTTLAMINHSCVPNA-LVQFG 223
Query: 309 NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
A L + + G E+ I YID + + RQ L+ + F+C+C +C
Sbjct: 224 GRTATLRSASFIHPGSEIEISYIDQTQPKSRRQHELSL-YHFECHCSKC 271
>gi|159129417|gb|EDP54531.1| SET and MYND domain protein, putative [Aspergillus fumigatus A1163]
Length = 544
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 10/106 (9%)
Query: 264 GLYEDLLSSAAASIE-------SEIAVGNAIYMLP--SFYNHDCDPNAHIMWIDNADARL 314
L E++L A +E S +A Y+ P + NH CD N+ ++ D +
Sbjct: 223 ALDEEMLCHYGAKLELNSYNFHSPLADRLGFYLHPYAALINHSCDYNS-VVGSDGDALYV 281
Query: 315 MALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
ALR + +GEE+ + YIDA+ RQ L + + F+C C +C G
Sbjct: 282 KALRPIRKGEEILVSYIDATNPYKIRQTELLERYYFKCRCPKCGKG 327
>gi|242206607|ref|XP_002469159.1| predicted protein [Postia placenta Mad-698-R]
gi|220731830|gb|EED85671.1| predicted protein [Postia placenta Mad-698-R]
Length = 447
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 8/95 (8%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEG 323
G++E L+ ESE +G A+Y PSF+NH C PN + A + R V G
Sbjct: 349 GIWEVPLTE-----ESE-CLGFAVYPRPSFFNHHCSPNVNKERYGRGLA-FVTTRSVRAG 401
Query: 324 EELRICYIDA-SMARDARQAILTQGFGFQCNCLRC 357
EEL I Y A M RQ L +G+ F+C+C +C
Sbjct: 402 EELCISYGHAEGMGWRQRQKELREGWFFECSCGKC 436
>gi|115450985|ref|NP_001049093.1| Os03g0168700 [Oryza sativa Japonica Group]
gi|19071649|gb|AAL84316.1|AC073556_33 hypothetical protein [Oryza sativa Japonica Group]
gi|108706390|gb|ABF94185.1| TPR Domain containing protein [Oryza sativa Japonica Group]
gi|113547564|dbj|BAF11007.1| Os03g0168700 [Oryza sativa Japonica Group]
Length = 536
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 10/78 (12%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID----ASMARDA 339
G +++LP+F NH C PNA + + A + A RD++ GEE+ Y D AS R+A
Sbjct: 311 GVGLWILPAFINHSCHPNARRTHVGD-HAIVHASRDIKAGEEITFAYFDVLTPASKRREA 369
Query: 340 RQAILTQGFGFQCNCLRC 357
+A +G +C C RC
Sbjct: 370 ARA-----WGLECQCDRC 382
>gi|451897716|emb|CCT61066.1| hypothetical protein [Leptosphaeria maculans JN3]
Length = 726
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 8/91 (8%)
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELR 327
DLLS A SI V N +++ S+ NH C PN+ +I + L A RD+E+ EE+
Sbjct: 479 DLLS--APSI-----VRNGVWIHSSYMNHSCLPNSVRSFIGDM-LFLRAARDIEQDEEIT 530
Query: 328 ICYIDASMARDARQAILTQGFGFQCNCLRCS 358
Y+ + + RQ +GF+CNC C+
Sbjct: 531 TQYLAPELEYETRQQKFETTWGFRCNCTLCT 561
>gi|403169846|ref|XP_003329264.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375168447|gb|EFP84845.2| hypothetical protein PGTG_10316 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 416
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEE---GEELRICYIDASMARDARQA 342
A+ + S +NH C PN + +ID+ + +LM LR E GEEL I Y M R++R+
Sbjct: 249 ALVLQGSRFNHACRPNV-VYFIDH-ETQLMNLRAFESISSGEELTISYRPLEMDRESRRK 306
Query: 343 ILTQGFGFQCNCLRC 357
L + +GF+C C C
Sbjct: 307 ELQETYGFRCTCPHC 321
>gi|340724466|ref|XP_003400603.1| PREDICTED: SET and MYND domain-containing protein 4-like [Bombus
terrestris]
Length = 631
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 73/319 (22%), Positives = 132/319 (41%), Gaps = 50/319 (15%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQH 139
+ + E GR + AT +IR ++I KP T +++ C CL +++ ++ +H
Sbjct: 227 VTIKYNEEYGRHIVATHKIRPGEVIAVEKPYSLILTPDNIHTHCSNCL-EVSWANIPCEH 285
Query: 140 -HNARFCGEVCKD-NAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESA 197
A +C E CK K ++DVE + F + +K L RLA I + E+
Sbjct: 286 CTYAMYCSEECKIMEWKKYHDVE----CTVFPSMLKMNFVKLDLFSLRLA--IQAVREAT 339
Query: 198 DCIDILQPASLSPELILAMEEGF--------------VMLRSAFKKAGIDDEQMKFLNKQ 243
++ + + + +GF + L + +K + D + L+
Sbjct: 340 HIQELRKELKEVDSCVASRTKGFSKDGTFPSDKYRSLLGLVTNTEKRSVQDLFRRSLDAS 399
Query: 244 W-----------YTNVLAQ---IRINAFRIELAGGL---YEDLLSSAAASIESE-----I 281
+ + N L++ + I + GGL ++ L+ + S E +
Sbjct: 400 FILYFVATCTNMFGNPLSKDLSVLIKNADVTFVGGLILRHQQLIPNNIHSFSEECGLEAV 459
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYID--ASMARD 338
G A S NH C+PN I+ ++ ++ A+ +EEGE+L Y A +
Sbjct: 460 ERGAAAMPFSSLINHSCNPN--ILRHSRSEYVIIYAIYPIEEGEQLYDIYTQHYAITPKA 517
Query: 339 ARQAILTQGFGFQCNCLRC 357
RQ L + + F+CNCL C
Sbjct: 518 VRQEKLLKQYYFKCNCLPC 536
>gi|331241839|ref|XP_003333567.1| hypothetical protein PGTG_14989 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312557|gb|EFP89148.1| hypothetical protein PGTG_14989 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 425
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAI 343
A+ + P+ NH C PN + ++D L AL V +GEEL I Y + + R R+ I
Sbjct: 250 ALVLEPARLNHACRPNV-VYYLDEGTQILYMRALEPVAKGEELTINYREYELPRQERRDI 308
Query: 344 LTQGFGFQCNCLRC 357
L + +GF C C C
Sbjct: 309 LEEAYGFNCTCSHC 322
>gi|303278744|ref|XP_003058665.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459825|gb|EEH57120.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 620
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 35/69 (50%), Gaps = 4/69 (5%)
Query: 296 HDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
H D NA++ W+ ++ V GEEL Y+DAS ARQ ILT +GF+
Sbjct: 341 HAADRNANVDWMIGGGGVWAQEFRVVKHVAAGEELTHAYVDASDPIAARQKILTTRYGFK 400
Query: 352 CNCLRCSSG 360
C C RC G
Sbjct: 401 CLCARCGEG 409
>gi|357605609|gb|EHJ64694.1| hypothetical protein KGM_16822 [Danaus plexippus]
Length = 382
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 4/80 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDA 339
G+ +Y L S NH C PNA + N +L A+R+++ G+E+ I Y+D +R +
Sbjct: 283 GSGLYQLQSACNHSCAPNAESSFPYGNHRIQLKAIRNIKPGDEIHISYLDDCTLQRSRHS 342
Query: 340 RQAILTQGFGFQCNCLRCSS 359
RQ L++ + F C C RC S
Sbjct: 343 RQRELSENYLFVCCCERCMS 362
>gi|281206015|gb|EFA80204.1| SET domain-containing protein [Polysphondylium pallidum PN500]
Length = 490
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 1/75 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEGEELRICYIDASMARDARQA 342
G+A++ + + NH C+PNA + + N + L +LR + EE+ I YID +++ R+
Sbjct: 412 GSALFPIVNSCNHHCNPNAVVSYTTNCNRVTLRSLRSIPIHEEVEISYIDETVSCSQRRK 471
Query: 343 ILTQGFGFQCNCLRC 357
L + F C C RC
Sbjct: 472 ELQHKYLFNCKCTRC 486
>gi|350416087|ref|XP_003490836.1| PREDICTED: SET and MYND domain-containing protein 3-like [Bombus
impatiens]
Length = 439
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 49/96 (51%), Gaps = 3/96 (3%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVE-- 321
G+Y + ++ + S++ +G IY+ PS +H C PNA + + + A+ D+
Sbjct: 158 GIYGRITINSFSIFNSDMNIGVGIYLGPSILDHSCKPNA-VATFEGTTITVKAIEDLPSL 216
Query: 322 EGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ ++RI YID R+A L + F C+C RC
Sbjct: 217 DLSQIRIPYIDVIKTTRDRRAELQSSYYFWCDCERC 252
>gi|312373795|gb|EFR21480.1| hypothetical protein AND_16998 [Anopheles darlingi]
Length = 486
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 57/141 (40%), Gaps = 24/141 (17%)
Query: 218 EGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASI 277
EG L F K G ++ + L ++ I++N I L S+
Sbjct: 149 EGVAKLIPRFFKTGDRWDEDEILR------IVGIIQVNGHEIPL--------------SV 188
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
S +A+ N ML H C PN + + + L A ++ GE L ICY DA A
Sbjct: 189 PSSVAIYNQASML----EHSCRPNLSKSFTNRGEIVLWAPNTIKRGERLSICYTDAMWAT 244
Query: 338 DARQAILTQGFGFQCNCLRCS 358
R L Q F+C C RCS
Sbjct: 245 GNRLEHLQQTKMFRCECERCS 265
>gi|403366287|gb|EJY82943.1| hypothetical protein OXYTRI_19440 [Oxytricha trifallax]
Length = 764
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 45/72 (62%), Gaps = 1/72 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
A++ + SF NH+C+ N I +A ++ALR++ +GEE+ Y+ + A + R+ ++
Sbjct: 556 ALWPIFSFINHECNANTTRFSIGDA-LFIVALRNINQGEEITQLYMPLASAFEERENLMQ 614
Query: 346 QGFGFQCNCLRC 357
+ +GF+C C+ C
Sbjct: 615 KSWGFKCTCISC 626
>gi|255562926|ref|XP_002522468.1| protein with unknown function [Ricinus communis]
gi|223538353|gb|EEF39960.1| protein with unknown function [Ricinus communis]
Length = 538
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 49/94 (52%), Gaps = 3/94 (3%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED +S+ + G +++L SF NH C+PNA + + + + A RD++ GE
Sbjct: 308 LVEDSISAKVLGKNGDY-YGFGLWVLASFINHSCNPNARRLHVGD-HVLVHASRDIKAGE 365
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
E+ Y D R A +++ +GF+C C RC
Sbjct: 366 EITFPYFDVLSPLYKRMA-MSKAWGFRCQCKRCK 398
>gi|367019314|ref|XP_003658942.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
42464]
gi|347006209|gb|AEO53697.1| hypothetical protein MYCTH_99098 [Myceliophthora thermophila ATCC
42464]
Length = 414
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 57/129 (44%), Gaps = 33/129 (25%)
Query: 238 KFLNKQWYTNVLAQIRINAFRIELAG--------GLYEDLLSSAAASIESEIAVGNAIYM 289
+FL +Q V ++ N+FR+ L G GL+ D+
Sbjct: 148 RFL-RQTGDTVYDKVEKNSFRVFLDGDRKHSIHLGLFPDV-------------------- 186
Query: 290 LPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFG 349
S +NHDC PN H I + +A+RD+ GEEL I YI R R L++ +G
Sbjct: 187 --SKFNHDCRPNVHYR-ISDLTHTTVAVRDIPAGEELTISYIYGLKPRAERLEQLSE-WG 242
Query: 350 FQCNCLRCS 358
F C C +C+
Sbjct: 243 FMCTCPQCA 251
>gi|125585065|gb|EAZ25729.1| hypothetical protein OsJ_09564 [Oryza sativa Japonica Group]
Length = 509
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 10/78 (12%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYID----ASMARDA 339
G +++LP+F NH C PNA + + A + A RD++ GEE+ Y D AS R+A
Sbjct: 284 GVGLWILPAFINHSCHPNARRTHVGD-HAIVHASRDIKAGEEITFAYFDVLTPASKRREA 342
Query: 340 RQAILTQGFGFQCNCLRC 357
+A +G +C C RC
Sbjct: 343 ARA-----WGLECQCDRC 355
>gi|299746263|ref|XP_002911027.1| hypothetical protein CC1G_15570 [Coprinopsis cinerea okayama7#130]
gi|298406975|gb|EFI27533.1| hypothetical protein CC1G_15570 [Coprinopsis cinerea okayama7#130]
Length = 633
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARL-MALRDVEEGEELRICYIDASMARDARQAIL 344
A+ + S +NH C PN + + + L A RD+ G +L + Y D++M++ R L
Sbjct: 199 AVTRIGSRFNHSCIPNVNQEFHPKTFSMLATASRDIPAGSQLYVSYTDSTMSKMERATFL 258
Query: 345 TQGFGFQCNCLRC 357
Q FGF+C+C C
Sbjct: 259 LQEFGFRCSCKAC 271
>gi|258575593|ref|XP_002541978.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902244|gb|EEP76645.1| predicted protein [Uncinocarpus reesii 1704]
Length = 481
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
VG AI L + NH C PNA + D + ALRD+ +GE++ + YI+ + RQ+
Sbjct: 190 VGAAIEPLAALCNHSCSPNAATDF-DKGKIWVRALRDIGKGEQVFVSYIETTDPYAHRQS 248
Query: 343 ILTQGFGFQCNCLRC 357
L + + F C C +C
Sbjct: 249 ELLKRYYFNCKCNKC 263
>gi|402087074|gb|EJT81972.1| hypothetical protein GGTG_01946 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 444
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/93 (34%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
Query: 269 LLSSAAASIESEIAVGNA---IYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGE 324
L+S+ A I+ + G+A I+ S NHDC PNA + + + A+R + GE
Sbjct: 221 LMSTNAFDIDDHDSEGDAFCAIFPEISRMNHDCRPNADYYYDPHTLTQHVHAVRRINIGE 280
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ + YID ++ AR A L + +GF C C C
Sbjct: 281 EITVSYIDPLQSQAARVARLNRTWGFPCGCSLC 313
>gi|452982383|gb|EME82142.1| hypothetical protein MYCFIDRAFT_83606 [Pseudocercospora fijiensis
CIRAD86]
Length = 277
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 44/77 (57%), Gaps = 9/77 (11%)
Query: 285 NAIYMLPSFYNHDCDPNAHIM-WIDNADARLMALRDVEEGEELRICYID---ASMARDAR 340
+++Y+ S NH C PNA + W ++AL+D+ EGEEL I Y++ + R
Sbjct: 95 SSLYLKISRINHSCIPNAVVTEW-----HAVIALKDIAEGEELEISYLNDRISGWTARQR 149
Query: 341 QAILTQGFGFQCNCLRC 357
+A+L +GF+C C+ C
Sbjct: 150 KALLEMHYGFECRCIAC 166
>gi|451992674|gb|EMD85153.1| hypothetical protein COCHEDRAFT_1149213 [Cochliobolus
heterostrophus C5]
Length = 508
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 6/106 (5%)
Query: 255 NAFRIELAGGLYEDLLSSAAASIESEI-AVGNAIYMLPSF--YNHDCDPNAHIMWIDNAD 311
N F + +Y ++S+A I + +G I + P+ NH CDPNA+IM +D
Sbjct: 171 NVFDKDFVAAMYGRVMSNALTLITPTLDPLG--IIVDPTLCQMNHSCDPNAYIM-MDGPL 227
Query: 312 ARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ LR + + +E+ I YID + RQ L + F C C +C
Sbjct: 228 VSIRTLRPIRKDKEIFISYIDTTHPYQKRQDELQARWFFTCRCAKC 273
>gi|440468072|gb|ELQ37255.1| hypothetical protein OOU_Y34scaffold00608g22 [Magnaporthe oryzae
Y34]
gi|440489032|gb|ELQ68713.1| hypothetical protein OOW_P131scaffold00220g51 [Magnaporthe oryzae
P131]
Length = 461
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 39/78 (50%), Gaps = 5/78 (6%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADAR-----LMALRDVEEGEELRICYIDASMARDAR 340
++ S NHDC PN + A+ A RD+ GEEL I Y+D+ +R+ R
Sbjct: 259 GTFLGVSRLNHDCRPNNLPTSLAFHVAKSFVHTTHATRDIAAGEELTISYVDSYSSRNVR 318
Query: 341 QAILTQGFGFQCNCLRCS 358
Q L + +GF C C CS
Sbjct: 319 QERLKRNWGFTCTCPHCS 336
>gi|409078830|gb|EKM79192.1| hypothetical protein AGABI1DRAFT_120631 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 443
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWIDN-ADARLM---ALRDVEEGEELRICYIDASMA 336
++G+ ++ L S +NH C PNA ++ A LM ALRD + GEE+ I Y+D ++
Sbjct: 193 SIGHGVFPLASRLFNHSCIPNAAPRYVQGPAQPVLMEVVALRDTDVGEEICIPYLDPALT 252
Query: 337 RDARQAILTQGFGFQCNCLRC 357
+ +R I +GF C C C
Sbjct: 253 Q-SRSQIFQYTYGFTCQCSSC 272
>gi|157874751|ref|XP_001685790.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128863|emb|CAJ06023.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 887
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 51/100 (51%), Gaps = 12/100 (12%)
Query: 270 LSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-----------DNADARLMALR 318
+++ A+ S+++V A++ NH C PNA ++ D A L+ALR
Sbjct: 493 VAACASPSASDVSV-KALFPFLRHLNHACVPNAILVLDRTPTHLRRSGDDGVVASLVALR 551
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+E GEE+ + Y+ A+ A Q L + GF+C CL C+
Sbjct: 552 AIESGEEVTVSYVPATTALTVSQTELLETLGFRCRCLFCT 591
>gi|395328144|gb|EJF60538.1| SET domain-containing protein [Dichomitus squalens LYAD-421 SS1]
Length = 384
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASM--ARDARQ 341
I++ + NH C PN W + + ALRDV GEEL + Y+D + R+ R
Sbjct: 168 EGIFLQAARLNHSCRPNVARTWDVGAQEMAFRALRDVAPGEELCMSYVDVDILGTREERG 227
Query: 342 AILTQGFGFQCNCLRC 357
A + FGF C C C
Sbjct: 228 AEIEGAFGFVCACEAC 243
>gi|427798841|gb|JAA64872.1| Putative histone tail methylase, partial [Rhipicephalus pulchellus]
Length = 382
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 7/101 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRD 319
L +YED+ + A + +E G ++ L + +H C PNA + +N L+ALRD
Sbjct: 273 LIDQVYEDIEKESGAFLNNE---GVGLFSLQNLCSHSCVPNAQSSFPHNNYILSLVALRD 329
Query: 320 VEEGEELRICYID---ASMARDARQAILTQGFGFQCNCLRC 357
++ GEE+ + Y+D + +R +R +L + + C C RC
Sbjct: 330 IQPGEEITVSYLDDCSLNRSRHSRTKLLRENYLLTCWCKRC 370
>gi|406698302|gb|EKD01540.1| hypothetical protein A1Q2_04101 [Trichosporon asahii var. asahii
CBS 8904]
Length = 485
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 48/148 (32%)
Query: 233 DDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPS 292
+DE+ ++ + + ++L + IN +E +GGLY L
Sbjct: 308 EDEEARWFSYDAFLDLLGLVNIN---LEDSGGLY----------------------ALHC 342
Query: 293 FYNHDCDPNAHIM-----W---------IDNADAR---------LMALRDVEEGEELRIC 329
NH C+PN + W + +A+ R ++A R + G+EL +
Sbjct: 343 HMNHSCEPNVQVRNLPRHWTPPTSLPAPLPDANPRGVRGTHKLTMLARRTIHPGDELTVS 402
Query: 330 YIDASMARDARQAILTQGFGFQCNCLRC 357
Y+D ++ R AR+ L G+GF C C +C
Sbjct: 403 YVDFNLPRRARRQALRDGYGFWCVCAKC 430
>gi|391335768|ref|XP_003742261.1| PREDICTED: SET and MYND domain-containing protein 4-like
[Metaseiulus occidentalis]
Length = 775
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 70/341 (20%), Positives = 121/341 (35%), Gaps = 56/341 (16%)
Query: 74 QPSP----PPIQVAL----TESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYF 125
QPSP P AL + GR + A R +R D + P + + + C+
Sbjct: 234 QPSPHSSIPNASSALDCLYSTEKGRFIVANRDVRPGDNLFVESPYASVLLPAFTKTHCHH 293
Query: 126 CLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKR 185
C R+I ++ Q R+C C + Y + + + + +LV
Sbjct: 294 CYRRIKAAFPCRQCAQVRYCSMSCSGESWNLYHSQECGNLDLLISVGIAHLAERVILVTG 353
Query: 186 LACM-------------------IISGAESADCIDILQPASLSPELILAMEEGFVMLRSA 226
L + +++ E D+ Q + + L+ +E +
Sbjct: 354 LGLIKDFMKSCNTLECSYLPVYQLVTHEEDMHIADLFQYSFTATLLLKYLERQTTFFTES 413
Query: 227 FKKAGIDDEQMKFLNKQWYTNVLAQIRINAFR-IEL-AGGLY----EDLLSSAAASI--- 277
D ++ + QIRI R ++L GGL + L+ +A A
Sbjct: 414 HLSERFQDLNLRHAER---AQPKGQIRIECSRPLKLFVGGLLLRHIQQLICNAHAITTIQ 470
Query: 278 ---------------ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEE 322
+ ++ V AIY S NH C+PN + + + ++R +
Sbjct: 471 QPGDHVIEEDGIILEQEQVRVATAIYPSASLMNHSCNPNIISGFRSGSTLVVKSVRPIAS 530
Query: 323 GEELRICYID--ASMARDARQAILTQGFGFQCNCLRCSSGD 361
GEE+ CY M RQ L + + F+C+C C GD
Sbjct: 531 GEEVFNCYGPHFRRMTFQERQTALQEQYFFRCDCTACQKGD 571
>gi|336369557|gb|EGN97898.1| hypothetical protein SERLA73DRAFT_109167 [Serpula lacrymans var.
lacrymans S7.3]
Length = 452
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/307 (22%), Positives = 117/307 (38%), Gaps = 59/307 (19%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS 134
P P I++ A A I + ++ + P+ T S C +C R+ S
Sbjct: 5 PLPTSIRLEPHPIARDQTVAACHISSGSIVISLPPLTTVLLPSEKGCRCDYCFRR-DSPD 63
Query: 135 QHFQHHNA----RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKY--------PLL 182
H + +CG C+D A + + AFN YC S LL
Sbjct: 64 VHLRRCTGCASYWYCGAQCQDMHWAVH----KNLCKAFNRYCASPDFVKLAEHEKLDALL 119
Query: 183 VKRLACMIISGAESADCIDILQPASLSPELILAMEEGF-------VMLRSAFKKAGIDDE 235
+ LA I S + PAS S L + G V+ K++G+ +
Sbjct: 120 LTHLAAQITSNSSPRS-----TPAS-SLSTFLNLRPGPFSKGSRPVVCSMTVKRSGLSSD 173
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
+++ + ++ N A I++ A G++ + +N
Sbjct: 174 ELEAVYSRFGNNNFA---IHSHLTTYAHGVFP---------------------LASRLFN 209
Query: 296 HDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
H C PNA ++ + ++A+RD+ GEE+ I Y+D ++ + +RQ I +GF
Sbjct: 210 HSCLPNAAAKFVLSPSEPIHMEIVAIRDISPGEEICITYLDPALLQ-SRQQIFELTYGFT 268
Query: 352 CNCLRCS 358
C C C+
Sbjct: 269 CLCPSCA 275
>gi|157125402|ref|XP_001654323.1| hypothetical protein AaeL_AAEL001936 [Aedes aegypti]
gi|108882687|gb|EAT46912.1| AAEL001936-PA [Aedes aegypti]
Length = 546
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 67/293 (22%), Positives = 115/293 (39%), Gaps = 46/293 (15%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P ++ E GR + AT+ I+A D++ P++ P T N VC CL+ +T + Q +
Sbjct: 64 PFKICSDERYGRYMMATKDIKAGDVVLKESPLVYGPAQIT-NPVCVGCLQGLTEN-QFLE 121
Query: 139 HHNARF--CGEVCKDN----AKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIIS 192
+ C C+D+ A+ + + R + + Y + Y L+ L C++++
Sbjct: 122 CERCGWPVCKRECQDHPSHKAECKFTIARGSKMQLRHFY--NPHPVYQCLIP-LRCLLLA 178
Query: 193 GAESADCIDILQPAS-----LSPELILAMEEGFVMLRSAFKKAG---IDDEQMKFLNKQW 244
+ A +++ S E EG L F K +DE +K
Sbjct: 179 ERDPAKWQALIKLESHEEQRRGSEQWRNDREGVAKLIPRFFKCENKWSEDEILK------ 232
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
++ +++N + L + S+ AIY S H C PN
Sbjct: 233 ---IVGILQVNGHEVPL----------TEPPSV--------AIYNNASMLEHSCRPNLSK 271
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + A +++GE L I Y D RQ L Q F+C C+RC
Sbjct: 272 SFTSKKEIVFWAPNPIKQGERLSISYSDVLWGTANRQDHLQQTKLFRCTCVRC 324
>gi|444723373|gb|ELW64030.1| SET and MYND domain-containing protein 5 [Tupaia chinensis]
Length = 415
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL +++ G
Sbjct: 288 LYKDIEAATGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLHVTALEEIKPG 344
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 345 EEICISYLDCCQRERSRHSRHKILRENYLFVCSCPKC 381
>gi|312383723|gb|EFR28693.1| hypothetical protein AND_03014 [Anopheles darlingi]
Length = 423
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 111/311 (35%), Gaps = 68/311 (21%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHH 140
QV GR V ATR IR ++I P + P ++++ VC C R + +++ Q
Sbjct: 13 QVVENACLGRYVVATRNIRQGEVIFRDAPAVVGPKMASV-PVCLGCNRDLMATATVRQQG 71
Query: 141 NARFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA------ 194
+ F C R W C + R C I++G+
Sbjct: 72 DPNFRFHQCS-----------RCGWPVCGAECEAADQH------RSECSIMAGSSYRPKI 114
Query: 195 ----------ESADCI----DILQPASLSPELILAME--EGFVM--LRSAFKKAGIDDEQ 236
ESA C+ +L S SP+ ++ E V L+S + +
Sbjct: 115 CPAREEQARRESAYCVIVPLRVLLLQSHSPDTYAQLQRLESHVAERLKSPLYEV-VRSNL 173
Query: 237 MKF----LNKQWYT-----NVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAI 287
+ F L Q Y+ V A + N F I L E V A+
Sbjct: 174 VPFIRSVLGLQQYSAETILEVCAMLDTNCFEIRLP---------------ERRTKV-RAL 217
Query: 288 YMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQG 347
Y L + +HDC PN + D L+A D+E G + Y + R+ L Q
Sbjct: 218 YPLGAMLSHDCRPNTKHYFDDALRMVLVATVDIERGATISTSYTQPLLGTLHRRLALKQS 277
Query: 348 FGFQCNCLRCS 358
F C C RC+
Sbjct: 278 KHFDCQCERCT 288
>gi|195498437|ref|XP_002096523.1| GE25007 [Drosophila yakuba]
gi|194182624|gb|EDW96235.1| GE25007 [Drosophila yakuba]
Length = 393
Score = 50.8 bits (120), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 50/98 (51%), Gaps = 7/98 (7%)
Query: 264 GLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIMWIDNADARLMALRDVEE 322
GLY + A + +E G+ +Y+L S NH C PNA N L AL +++
Sbjct: 269 GLYAKVGEFAGEFLNNE---GSGLYLLQSKINHSCVPNACSTFPYSNDIIVLKALAPIQQ 325
Query: 323 GEELRICYIDASM---ARDARQAILTQGFGFQCNCLRC 357
G+E+ I Y+D M +R +R +L + + F C C +C
Sbjct: 326 GDEICISYLDECMLERSRHSRHKVLRENYVFICQCPKC 363
>gi|353242193|emb|CCA73857.1| hypothetical protein PIIN_07811 [Piriformospora indica DSM 11827]
Length = 365
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 5/80 (6%)
Query: 284 GNAIYMLPSF--YNHDCDP--NAHIMWI-DNADARLMALRDVEEGEELRICYIDASMARD 338
G ++ + P+ NH C NA W D + + A + + +GEE+ I Y D +RD
Sbjct: 157 GESVGIFPTIARINHGCSSAFNAAYSWREDTKETVVYAFKPIFQGEEILISYTDTKKSRD 216
Query: 339 ARQAILTQGFGFQCNCLRCS 358
R+A L Q + F C C CS
Sbjct: 217 NRRAYLKQSYNFDCTCSVCS 236
>gi|255543292|ref|XP_002512709.1| protein with unknown function [Ricinus communis]
gi|223548670|gb|EEF50161.1| protein with unknown function [Ricinus communis]
Length = 379
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 15/114 (13%)
Query: 248 VLAQIRINAFRIELAGGLYEDL-LSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA-HIM 305
+LA+ ++NAF GL E ++ S+ + IY + +NHDC PNA
Sbjct: 181 LLAKDKLNAF------GLMEPFDINDGKRSVRAY-----GIYPKAALFNHDCLPNACRFD 229
Query: 306 WIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
++D D L+ + DV +G E+ + Y + RQ L + +GF C+C RC
Sbjct: 230 YVDTQDTDLIIRMIHDVPQGREICLSYFPVNYDYSTRQKRLREDYGFICDCDRC 283
>gi|449295794|gb|EMC91815.1| hypothetical protein BAUCODRAFT_305787 [Baudoinia compniacensis
UAMH 10762]
Length = 503
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEELRICYIDASMA---RDARQ 341
A+Y S NH C PN H + + A RD+ GEEL Y+ R RQ
Sbjct: 259 AVYEEASRLNHSCIPNVHHSYNPTLKKLTVHAARDIYAGEELLTNYVGGRFTYWPRKRRQ 318
Query: 342 AILTQGFGFQCNCLRC 357
IL Q +GFQC C+ C
Sbjct: 319 GILRQQYGFQCACVAC 334
>gi|302766445|ref|XP_002966643.1| hypothetical protein SELMODRAFT_168317 [Selaginella moellendorffii]
gi|300166063|gb|EFJ32670.1| hypothetical protein SELMODRAFT_168317 [Selaginella moellendorffii]
Length = 436
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 5/78 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDAS-MARDA 339
G+ + L S NH C+PNA D N +A ++A R + +GE++ YI+ ++
Sbjct: 353 GSGFFALHSCINHSCEPNAKAFKRDEDINGNAVIIATRKIMKGEQIFTSYIEEEDLSYTE 412
Query: 340 RQAILTQGFGFQCNCLRC 357
RQA+L+ +GF C C +C
Sbjct: 413 RQAMLSD-YGFACECTKC 429
>gi|402075935|gb|EJT71358.1| hypothetical protein GGTG_10617 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 337
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 35/74 (47%), Gaps = 2/74 (2%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAIL 344
A + S NH C+PN H W + + A R ++ GEEL I Y+ RQA L
Sbjct: 115 ATFATVSLINHSCEPNTHQAWNKEKEQETIHATRPIKAGEELTIAYVMFETTA-VRQATL 173
Query: 345 TQGFGFQCNCLRCS 358
+GF C C CS
Sbjct: 174 KAIYGFTCTCPVCS 187
>gi|448518203|ref|XP_003867935.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis Co 90-125]
gi|380352274|emb|CCG22499.1| hypothetical protein CORT_0B07940 [Candida orthopsilosis]
Length = 372
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Query: 269 LLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRI 328
+ S+ E + +G ++Y SF+NH CDPN + + N D LRD+ EGEEL I
Sbjct: 278 IWSNTTNKDEDKEFLGFSVYPSASFFNHSCDPNIIKIRMRN-DMCFETLRDIAEGEELCI 336
Query: 329 CYIDASMARDARQAILTQGFGFQCNCLRC 357
Y + ++ + Q + F C C +C
Sbjct: 337 NYGNFQNEDVNKRQLELQEWFFDCGCTKC 365
>gi|261198320|ref|XP_002625562.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595525|gb|EEQ78106.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|327355791|gb|EGE84648.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ATCC 18188]
Length = 519
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G S+ NH C PNA+I + D L AL+D+ E++ I YID + RQ+
Sbjct: 209 IGLCFLPFASYINHSCQPNAYIGF-DGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQS 267
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + F+C C +C G
Sbjct: 268 ELQLQYFFECKCPKCLEG 285
>gi|384247293|gb|EIE20780.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 453
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIM---WID-NADARLMALRDVEEGEELRICYIDASMARDA 339
G+A Y L S NH C PNAH ID + A ++A + + GEE+ + YID
Sbjct: 365 GSAFYALQSCCNHSCAPNAHAFKRAHIDTDGSATILARKPIAPGEEVCLSYIDEDAPYHD 424
Query: 340 RQAILTQGFGFQCNCLRCSS 359
R+A L +GF C C +C S
Sbjct: 425 RRAALAD-YGFTCECDKCVS 443
>gi|427791137|gb|JAA61020.1| Hypothetical protein, partial [Rhipicephalus pulchellus]
Length = 345
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 7/101 (6%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI-DNADARLMALRD 319
L +YED+ + A + +E G ++ L + +H C PNA + +N L+ALRD
Sbjct: 236 LIDQVYEDIEKESGAFLNNE---GVGLFSLQNLCSHSCVPNAQSSFPHNNYILSLVALRD 292
Query: 320 VEEGEELRICYID---ASMARDARQAILTQGFGFQCNCLRC 357
++ GEE+ + Y+D + +R +R +L + + C C RC
Sbjct: 293 IQPGEEITVSYLDDCSLNRSRHSRTKLLRENYLLTCWCKRC 333
>gi|239607850|gb|EEQ84837.1| SET and MYND domain-containing protein [Ajellomyces dermatitidis
ER-3]
Length = 519
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G S+ NH C PNA+I + D L AL+D+ E++ I YID + RQ+
Sbjct: 209 IGLCFLPFASYINHSCQPNAYIGF-DGPVMYLKALQDIAPDEQIYISYIDNTEPFQTRQS 267
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + F+C C +C G
Sbjct: 268 ELQLQYFFECKCPKCLEG 285
>gi|336382331|gb|EGO23481.1| hypothetical protein SERLADRAFT_449865 [Serpula lacrymans var.
lacrymans S7.9]
Length = 435
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/307 (22%), Positives = 117/307 (38%), Gaps = 59/307 (19%)
Query: 75 PSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSS 134
P P I++ A A I + ++ + P+ T S C +C R+ S
Sbjct: 5 PLPTSIRLEPHPIARDQTVAACHISSGSIVISLPPLTTVLLPSEKGCRCDYCFRR-DSPD 63
Query: 135 QHFQHHNA----RFCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKY--------PLL 182
H + +CG C+D A + + AFN YC S LL
Sbjct: 64 VHLRRCTGCASYWYCGAQCQDMHWAVH----KNLCKAFNRYCASPDFVKLAEHEKLDALL 119
Query: 183 VKRLACMIISGAESADCIDILQPASLSPELILAMEEGF-------VMLRSAFKKAGIDDE 235
+ LA I S + PAS S L + G V+ K++G+ +
Sbjct: 120 LTHLAAQITSNSSPRS-----TPAS-SLSTFLNLRPGPFSKGSRPVVCSMTVKRSGLSSD 173
Query: 236 QMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYN 295
+++ + ++ N A I++ A G++ + +N
Sbjct: 174 ELEAVYSRFGNNNFA---IHSHLTTYAHGVFP---------------------LASRLFN 209
Query: 296 HDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
H C PNA ++ + ++A+RD+ GEE+ I Y+D ++ + +RQ I +GF
Sbjct: 210 HSCLPNAAAKFVLSPSEPIHMEIVAIRDISPGEEICITYLDPALLQ-SRQQIFELTYGFT 268
Query: 352 CNCLRCS 358
C C C+
Sbjct: 269 CLCPSCA 275
>gi|328873018|gb|EGG21385.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 740
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/307 (21%), Positives = 123/307 (40%), Gaps = 36/307 (11%)
Query: 73 SQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITS 132
Q S P+++ ++ GR + ATR I+ +LI P + ++ C C RKI
Sbjct: 256 KQYSQFPVELVWNDNMGRCIMATRDIKQGELILRVAPFGCALSDDQIDLNCGSCFRKI-- 313
Query: 133 SSQHFQHHNARFCG-----EVCKDNAKAFYDVERRADWSAF--NDYCRSQGLKYPLLVKR 185
++++ C E+C + + +F Y +Q + +++
Sbjct: 314 --KYYKSEKCNSCKTNLLCEICIKDPTIVSMHQEECSLISFLQKHYPNAQTRDFRFMIR- 370
Query: 186 LACMIISGAESADCIDILQPASLSPEL------ILAMEEGFVMLRSAFKKAGIDDEQMKF 239
+++SG A+ + + +P+L I E L + K Q
Sbjct: 371 ---VLLSG--RANKLGRITKEK-TPQLWNQQPFIYDSYEDLASLTTDVSKIESKQLQSFT 424
Query: 240 LNKQWYTNVLAQIR-----INAFRIELAGGLYEDLLSSAAASIE--SEIAVGNAIYMLPS 292
+ TN A ++ + F + LY +L +A I+ + +G IY S
Sbjct: 425 VATNSITNFFAAVKGPKYLSDPFTKQEIFDLYPKILFNAHEYIDPLTHYEIGRGIYPTAS 484
Query: 293 FYNHDCDPNAHIMWIDNADARLM--ALRDVEEGEELRICYIDASMARDARQAILTQGFGF 350
+ NH C PN W ++ ++ + RD+ +GEE+ Y+D + R+ L Q + F
Sbjct: 485 YINHTCLPNT--TWYNDDHGLILYRSSRDILKGEEITTSYLDILKPKLQRRKDLKQ-YSF 541
Query: 351 QCNCLRC 357
C C RC
Sbjct: 542 VCQCERC 548
>gi|322785185|gb|EFZ11898.1| hypothetical protein SINV_10758 [Solenopsis invicta]
Length = 389
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRI---ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI +L +Y+ + + + +E G+ +Y+L S NH C
Sbjct: 253 RWVKNVSALELPKDERIYIDKLIEKIYDAMDEAVGPFLNNE---GSGLYVLQSSVNHSCV 309
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD++ GEE+ Y+D +R +RQ L+ + F C+C
Sbjct: 310 PNAVVEFPYSNNTLVLKAIRDIKVGEEICTSYLDECQLERSRHSRQQALSSLYLFVCHCD 369
Query: 356 RCSS 359
+C +
Sbjct: 370 KCQA 373
>gi|218195645|gb|EEC78072.1| hypothetical protein OsI_17540 [Oryza sativa Indica Group]
Length = 140
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + +A ++AL +++ +E+ I YID ++ + R
Sbjct: 59 GTAFFPLQSCMNHSCCPNAKAYKRDEDTDGNAVIIALEPIKKDDEITISYIDEDVSYEER 118
Query: 341 QAILTQGFGFQCNCLRC 357
QA L +GF C C RC
Sbjct: 119 QAELAD-YGFICTCPRC 134
>gi|302685552|ref|XP_003032456.1| hypothetical protein SCHCODRAFT_233854 [Schizophyllum commune H4-8]
gi|300106150|gb|EFI97553.1| hypothetical protein SCHCODRAFT_233854 [Schizophyllum commune H4-8]
Length = 681
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 46/97 (47%), Gaps = 8/97 (8%)
Query: 272 SAAASIESEIAVGNAIYMLPSF-----YNHDCDPNAHIMWIDNA-DARLMALRDVEEGEE 325
+A A ++ E A G L +F NH C PNA W L A RD+ GEE
Sbjct: 201 TAFADLQFEGASGEDCKYLATFDMLSRMNHSCRPNALFAWDTRTFSGTLRAARDIAPGEE 260
Query: 326 LRICYI-DASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ + Y D + R+A L +GF+C C CS+GD
Sbjct: 261 ITVAYFGDVHIPLVQRRAFLAP-YGFECACPACSAGD 296
>gi|395508886|ref|XP_003758739.1| PREDICTED: SET and MYND domain-containing protein 5 [Sarcophilus
harrisii]
Length = 609
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 53/97 (54%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEG 323
LY+D+ ++ + E G+ +++L S NH C PNA + +N + AL D++ G
Sbjct: 480 LYKDIERASGEFLNCE---GSGLFVLQSCCNHSCVPNAETSFPENNFLLPVTALEDIKPG 536
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C+C +C
Sbjct: 537 EEICISYLDCCQRERSRYSRHKILRENYLFVCSCPKC 573
>gi|440583731|emb|CCH47233.1| similar to histone-lysine N-methyltransferase ASHR2 [Lupinus
angustifolius]
Length = 357
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 286 AIYMLPSFYNHDCDPNA-HIMWID---------NADARLMALRDVEEGEELRICYIDASM 335
AIY S +NHDC PNA ++D N D + ++D+ EG E+ I Y S
Sbjct: 212 AIYPKASMFNHDCIPNACRFDYVDSADLDDEHNNTDMVIRMIQDLPEGREVCISYFRISR 271
Query: 336 ARDARQAILTQGFGFQCNCLRC 357
R+ IL +GF C C RC
Sbjct: 272 DYCTRKRILMDDYGFSCECDRC 293
>gi|74178258|dbj|BAE29912.1| unnamed protein product [Mus musculus]
Length = 420
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 114/297 (38%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A R DL+ + T+ C C + S+ CG
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C S G + P RL I++ +
Sbjct: 68 CK---QAFYCDVECQKEDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTNVL------- 249
+ P E +LA+ E S K +D+E+ + Q+Y+ L
Sbjct: 118 IHPERTPSEKLLAVRE----FESHLDK--LDNEKKDLIQSDIAALHQFYSKYLEFPDHSS 171
Query: 250 -----AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++++ G+E+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD 271
>gi|302792667|ref|XP_002978099.1| hypothetical protein SELMODRAFT_108504 [Selaginella moellendorffii]
gi|300154120|gb|EFJ20756.1| hypothetical protein SELMODRAFT_108504 [Selaginella moellendorffii]
Length = 433
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 5/78 (6%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDAS-MARDA 339
G+ + L S NH C+PNA D N +A ++A R + +GE++ YI+ ++
Sbjct: 350 GSGFFALHSCINHSCEPNAKAFKRDEDINGNAVIIATRKIMKGEQIFTSYIEEEDLSYTE 409
Query: 340 RQAILTQGFGFQCNCLRC 357
RQA+L+ +GF C C +C
Sbjct: 410 RQAMLSD-YGFACECTKC 426
>gi|414585279|tpg|DAA35850.1| TPA: hypothetical protein ZEAMMB73_416669 [Zea mays]
Length = 72
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 4/68 (5%)
Query: 295 NHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
NH C PNA D + A ++ALR + + EE+ I YID + + RQA L +GF
Sbjct: 2 NHSCRPNAKAFKRDEDKDGHAVIIALRPISKDEEITIAYIDEDLPYEERQAQLA-DYGFT 60
Query: 352 CNCLRCSS 359
C CL+C
Sbjct: 61 CTCLKCQE 68
>gi|402226072|gb|EJU06132.1| hypothetical protein DACRYDRAFT_103080 [Dacryopinax sp. DJM-731
SS1]
Length = 380
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 58/245 (23%), Positives = 105/245 (42%), Gaps = 47/245 (19%)
Query: 158 DVERRADWSAFNDYCRSQGLKYPLLV------------KRLACMIISGAESA-------- 197
+ ERRA AF D+ ++ G LLV K+LA + S E +
Sbjct: 120 EAERRAYQQAFVDFLKACGWTQVLLVGRFCARHVSEEVKKLAPLPASLVERSALPEPITP 179
Query: 198 -------DCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVLA 250
D ++ L+ + P ++ +E +L+ KKA E++ +NK Y +
Sbjct: 180 SPEYTFYDHVERLKFLEIQPTEMVMLEHN--LLQDVLKKAVAGLEEVVEINK--YLAMKG 235
Query: 251 QIRINAFRIELAGGLYE--------DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNA 302
+I N+ + GG + + + + +E +G+A+Y + S+ H C P+
Sbjct: 236 RIAYNSIGVAFDGGRDDRADPKGRVETWEWSRTPVGTERQIGSALYRVSSYLTHSCSPSV 295
Query: 303 HIMWIDNADARLMALRDVEEGEELRICY--IDA------SMARDARQAILTQGFGFQCNC 354
+ + + L+A R + +GE L + Y I+A +AR R+ L +G+ F C C
Sbjct: 296 RLCFKGTNELHLIANRAISKGEHLTMSYTSIEAKNGEAPEVARRQRRIRLARGWRFACEC 355
Query: 355 LRCSS 359
C+
Sbjct: 356 EACTG 360
>gi|440634670|gb|ELR04589.1| hypothetical protein GMDG_06871 [Geomyces destructans 20631-21]
Length = 390
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDCD NA W +N + + A RD+E GEE+ I Y+ R RQ L F F
Sbjct: 53 NHDCDNNAQKSWNENINRHTVHAKRDIENGEEITIFYLGVLNNRKTRQEALRSKFRFTIL 112
Query: 354 CLRCS 358
C CS
Sbjct: 113 CRLCS 117
>gi|195383960|ref|XP_002050693.1| GJ20068 [Drosophila virilis]
gi|194145490|gb|EDW61886.1| GJ20068 [Drosophila virilis]
Length = 515
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/296 (21%), Positives = 114/296 (38%), Gaps = 53/296 (17%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P ++A E GR + ATR I+ +++ P++ P+ + VC CL I +
Sbjct: 42 PFKIAQNELLGRHLVATRNIKPYEIVIKEAPLMRGPSQISA-PVCMGCLNCIEPND---- 96
Query: 139 HHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLK-----YPLL--VKRLACMI 190
H CG +C A E RA+ + + ++ +PL V + C++
Sbjct: 97 HITCDKCGWPLCGPECHAL--DEHRAECQLTQERGQKVNVQEFNGPHPLYTCVSTVRCLL 154
Query: 191 ISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV-- 248
I A+ Q +L EQ + + QW ++
Sbjct: 155 IGETNPANAAKFKQLETL--------------------------EQTRRGSNQWKADLAS 188
Query: 249 LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLPSFYNHDCDPN 301
+ Q FR + E+ + A +++ E+ + A++ SF + C PN
Sbjct: 189 IGQFIPKFFRTQ---KFSEEEIMRAVGALQINGHEVPTSDPPHVAVFYTASFTENSCVPN 245
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ + L A +++++ L ICY DA RQ L Q F+C+C RC
Sbjct: 246 LAKSFNKHGHCMLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCSCERC 301
>gi|357120626|ref|XP_003562026.1| PREDICTED: uncharacterized protein LOC100843629 [Brachypodium
distachyon]
Length = 542
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 2/93 (2%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED S+ + G +++LPSF NH C PNA + + A + A R+++ GE
Sbjct: 299 LTEDAPSANLLGNNGVVNCGVGLWVLPSFINHSCHPNACRTHVGD-HAIVHASREIKAGE 357
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + RQ + +GF+C C RC
Sbjct: 358 EITFPYFDVLVPVGKRQEA-ARAWGFECRCDRC 389
>gi|301607796|ref|XP_002933488.1| PREDICTED: SET and MYND domain-containing protein 5-like [Xenopus
(Silurana) tropicalis]
Length = 141
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 38/67 (56%), Gaps = 5/67 (7%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARD----ARQAILTQGFGF 350
NH C PNA + DN L +L D++ GEE+ I Y+D RD +RQ IL + + F
Sbjct: 38 NHSCVPNAEASFPDNFILHLTSLEDIQPGEEICISYLDCC-QRDRSWHSRQKILRENYLF 96
Query: 351 QCNCLRC 357
C+C +C
Sbjct: 97 MCSCPKC 103
>gi|238588421|ref|XP_002391720.1| hypothetical protein MPER_08809 [Moniliophthora perniciosa FA553]
gi|215456775|gb|EEB92650.1| hypothetical protein MPER_08809 [Moniliophthora perniciosa FA553]
Length = 210
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C PN I W A A ++ A RD++ G+EL I YI ++ A + + +GFQC
Sbjct: 2 NHSCQPNVDIGWDSPAFAMILRASRDIKAGDELCISYISNLLSPTASRQRDLRSYGFQCT 61
Query: 354 CLRCSS 359
C RCS+
Sbjct: 62 CGRCSN 67
>gi|440468695|gb|ELQ37841.1| hypothetical protein OOU_Y34scaffold00571g4 [Magnaporthe oryzae
Y34]
Length = 457
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 50/110 (45%), Gaps = 21/110 (19%)
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA- 310
+R N F +E+ G Y L + I + NH C PN+ + + D A
Sbjct: 253 MRTNTFSLEIDGETYMGLFPT--------------ILRI----NHACGPNSQVKF-DPAT 293
Query: 311 -DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
+ A D+E GEE+ I Y + M +RQ+ L +GF+C C CS+
Sbjct: 294 LSQKAFASHDIEAGEEITISYAEFGMTYQSRQSTLLHRWGFKCTCALCSA 343
>gi|400601369|gb|EJP69012.1| MYND finger protein [Beauveria bassiana ARSEF 2860]
Length = 542
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 125/289 (43%), Gaps = 32/289 (11%)
Query: 76 SPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQ 135
+P + + ++ GR + T+ + A D+I P++ P+LS L +VC FCLR T
Sbjct: 5 APDGVAIQPHKTKGRALHTTKTVAAGDVIAVFTPLLLLPSLSHLTTVCSFCLRAGTPRPC 64
Query: 136 HFQHHNARFCGEVCKDNA-KAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGA 194
+ A +C C+ A + +E A A + + + P V+ L +++S
Sbjct: 65 S-RCRAAYYCDARCQAAAWSGGHSLECAALVRAVKSSKKRREIPTP--VRALVKVLLSCG 121
Query: 195 ESADC---IDILQP--ASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNVL 249
+ D +D L+ A E A E M AF +E ++ +L
Sbjct: 122 QPEDLSKNMDGLEGHVAERRREPGWADMEMMAMGGCAFAGRETSEESVR-----QAVEIL 176
Query: 250 AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDN 309
+++ NAF ++ L +E +A+ N H C PNA + ++
Sbjct: 177 CKLQTNAFH------RFDADLGHVGIFLEPTLAMAN----------HSCLPNAFVQFVGR 220
Query: 310 ADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A A L A +++G+E+ I Y D + + R+A L + F+C C RC+
Sbjct: 221 A-AVLRAESRIQKGDEIEISYTDYTSSLGKRKAALAP-YNFECRCRRCT 267
>gi|31208665|ref|XP_313299.1| AGAP003552-PA [Anopheles gambiae str. PEST]
gi|21296724|gb|EAA08869.1| AGAP003552-PA [Anopheles gambiae str. PEST]
Length = 391
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDA 339
G+A+Y S NH C PNA ++ N L A RD++ GEE+ I Y+D +R +
Sbjct: 288 GSALYARQSKINHSCAPNAETVFPKSNHMLALRATRDIQPGEEICISYLDECNLQRSRHS 347
Query: 340 RQAILTQGFGFQCNCLRCSS 359
RQ L + F C C +C S
Sbjct: 348 RQKTLKDYYLFICQCEKCES 367
>gi|212546271|ref|XP_002153289.1| TPR domain protein [Talaromyces marneffei ATCC 18224]
gi|210064809|gb|EEA18904.1| TPR domain protein [Talaromyces marneffei ATCC 18224]
Length = 328
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/74 (39%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLM-ALRDVEEGEELRICYIDAS-MARDARQAI 343
+IY+L S +NH C PN + D ++D+ GEEL I Y+D S R RQ
Sbjct: 116 SIYLLGSRFNHSCLPNTTHSYNPQLDMETFHTIQDISAGEELLITYMDGSNWVRSKRQEY 175
Query: 344 LTQGFGFQCNCLRC 357
L Q +GF+C+C C
Sbjct: 176 L-QKWGFECSCPVC 188
>gi|392592096|gb|EIW81423.1| SET domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 450
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/84 (34%), Positives = 47/84 (55%), Gaps = 6/84 (7%)
Query: 283 VGNAIYMLPS-FYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMAR 337
V + ++ L S +NH C PNA +I + DA ++AL+D+ GEE+ + Y+D ++ +
Sbjct: 192 VAHGVFPLASRLFNHSCLPNAAARYILSEDAAPRMEIVALQDIGAGEEICVPYLDPALLQ 251
Query: 338 DARQAILTQGFGFQCNCLRCSSGD 361
+RQ +GF C C CS D
Sbjct: 252 -SRQQGFQLTYGFTCTCPSCSVLD 274
>gi|389739918|gb|EIM81110.1| hypothetical protein STEHIDRAFT_162106 [Stereum hirsutum FP-91666
SS1]
Length = 421
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 47/95 (49%), Gaps = 5/95 (5%)
Query: 268 DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEEL 326
D + A + E+ ++++ S +NH C NA W + + R+ ALRD+ GEE+
Sbjct: 203 DRYCTNAFATETRGVALESVFLAASRFNHGCMRNARYGWNEGSGQLRIYALRDIAVGEEI 262
Query: 327 RICYIDA----SMARDARQAILTQGFGFQCNCLRC 357
+ ++ R R+AIL + +GF C C C
Sbjct: 263 EVSFLGPPERHRAVRSERRAILEKSWGFVCACTVC 297
>gi|410219524|gb|JAA06981.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410248566|gb|JAA12250.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410291460|gb|JAA24330.1| SET and MYND domain containing 2 [Pan troglodytes]
gi|410336589|gb|JAA37241.1| SET and MYND domain containing 2 [Pan troglodytes]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 118/297 (39%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + + DL+ + T++ + C +C + S+ CG
Sbjct: 18 GRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCEYCFTRKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY +VE ++ DW C G + P RL I++ +
Sbjct: 68 CK---QAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------------MKFLNKQW 244
+ P E +LA++E S K +D+E+ ++F +
Sbjct: 118 IHPERTPSEKLLAVKE----FESHLDK--LDNEKKDLIQSDIAALHHFYSKHLEFPDNDS 171
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
V AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LIVVFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A+++++ GEE+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD 271
>gi|224050278|ref|XP_002188029.1| PREDICTED: SET and MYND domain-containing protein 1 isoform 3
[Taeniopygia guttata]
Length = 490
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/307 (22%), Positives = 115/307 (37%), Gaps = 61/307 (19%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
+++ E GRG+ A + A D+I A+P +L V C+ C ++ + Q
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIF-AEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQ 67
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMIISGAES 196
A +C C+ RA W + + C + + K P RLA I+ E
Sbjct: 68 CKFAHYCDRTCQ-----------RAAWLSHKNECSAIKRHGKAPTENIRLAARILWKMER 116
Query: 197 ADCIDILQPASLSPELILAMEE------GFVMLRSAFKKAGIDD------EQMKFLNKQW 244
+ + LS ++A+EE F +A ++ Q + Q
Sbjct: 117 -------EGSGLSEGCLVAIEELQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQFGMQL 169
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+++ I N F + GL AVG I+ NHDC PN +
Sbjct: 170 ISHIFGVISCNGFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWPNCTV 214
Query: 305 MWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
++ + N +A L AL + G+EL + Y+D + R+ L + + F C
Sbjct: 215 VFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQYYFDC 274
Query: 353 NCLRCSS 359
C C
Sbjct: 275 TCEHCKK 281
>gi|348041405|ref|NP_001004614.2| SET and MYND domain-containing protein 5 [Danio rerio]
Length = 415
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/99 (33%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-ARLMALRDVEEG 323
LY+D+ + E G+ +++L S NH C PNA + +N L AL D+ G
Sbjct: 287 LYKDIDKETGDFLNCE---GSGLFLLQSSCNHSCVPNAEASFPENNFLLHLTALGDIGPG 343
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRCSS 359
EE+ I Y+D +R +R IL + + F C+C +C S
Sbjct: 344 EEICISYLDCCQRDRSRHSRHKILRENYLFICSCQKCLS 382
>gi|405974952|gb|EKC39559.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 766
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDARQA 342
++ L S NH C N + + +++A D+++G+EL YID SM R R+A
Sbjct: 615 GLFQLQSCLNHSCTNNVQVSDAEVEGYGGVKVVAKADIKKGDELFTTYIDTSMPRRLRRA 674
Query: 343 ILTQGFGFQCNCLRCS-SGD 361
L + F F C+C RC GD
Sbjct: 675 WLFRSFNFWCHCHRCEFEGD 694
>gi|429857737|gb|ELA32586.1| set domain-containing protein 5 [Colletotrichum gloeosporioides
Nara gc5]
Length = 274
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 3/92 (3%)
Query: 269 LLSSAAASIESEIAVG-NAIYMLPSFYNHDCDPNAHIMWIDNADA-RLMALRDVEEGEEL 326
+LS A + +S+ + G A++ + NH C PNA + DA + A RD++ EE+
Sbjct: 71 VLSWAESLWQSKTSPGRTALFPQVARLNHSCVPNAQGNFNKKLDAFTVHATRDIKPEEEI 130
Query: 327 RICYIDASMA-RDARQAILTQGFGFQCNCLRC 357
I Y+D + R ARQ L G+GF C+C C
Sbjct: 131 TISYLDEHLGLRQARQDHLQDGYGFLCDCSAC 162
>gi|195456692|ref|XP_002075245.1| GK16970 [Drosophila willistoni]
gi|194171330|gb|EDW86231.1| GK16970 [Drosophila willistoni]
Length = 766
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 67/295 (22%), Positives = 114/295 (38%), Gaps = 42/295 (14%)
Query: 78 PPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTL---STLNS-VCYFCLRKITSS 133
P +A + AGRGVFATR I +LI + ++ PT + LN+ VC CL
Sbjct: 352 PNWTLAASVVAGRGVFATRDIARGELIFRERALVVGPTARKGAQLNTCVCCHCLL----P 407
Query: 134 SQHF---QHHNARFCGEVCKDNAKAFYDVERRADW---SAFNDYCRSQGLKY--PLLVKR 185
+Q F CG C D+ + + W A + G + PL ++
Sbjct: 408 AQGFLCAHRCTLPVCGP-CSDSETHREECQHFQRWQPIDAVPEGAEPSGAEQINPLSLRI 466
Query: 186 LACMIIS--GAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQ 243
L + + G + +D +Q + + + F+ D+ F+++
Sbjct: 467 LTAVRVFHLGQQQRSLVDAMQANAER-----GYRQEIIKAAQCFRNFPTTDK--PFMDQL 519
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ ++ + NAF +E LL ++ L + NH+C PNA
Sbjct: 520 F--RIVGVLNTNAFEAPCRVDGHETLL--------------RGLFPLTAIMNHECTPNAS 563
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + A + A RD+ +G E+ Y + R L F C+C RC+
Sbjct: 564 HYFDNGRMAVVRAARDIPKGGEITTTYTKILWSNLTRGIFLKMTKNFMCDCPRCN 618
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 39/99 (39%), Gaps = 6/99 (6%)
Query: 265 LYEDLLSSAAASIESEIAVGN------AIYMLPSFYNHDCDPNAHIMWIDNADARLMALR 318
Y + + + ES VG A++ L NH C PNA + D + A
Sbjct: 195 FYRTICAFNTNAFESRSRVGGNEILVRALFPLAGLLNHHCTPNAAHHFEDGETIVVSATE 254
Query: 319 DVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ +G E+ + Y + AR+ L F C C RC
Sbjct: 255 RIPKGTEITMTYAKLLWSTLARKLFLAMTKHFICQCPRC 293
>gi|417400865|gb|JAA47349.1| Putative histone tail methylase [Desmodus rotundus]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 115/297 (38%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A R + DL+ + T++ + C C + S+ CG
Sbjct: 18 GRGLRALRPFQVGDLLFSCPAYAYVLTVNERGNHCEHCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C G + P RL I++ S
Sbjct: 68 CK---QAFYCDVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQTS------ 118
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------------MKFLNKQW 244
P E +LA++E S K +D+E+ ++F + +
Sbjct: 119 -HPDRTPSEKLLAVKE----FESHLDK--LDNEKKDLIQSDISALHRFYSKHLEFPDNES 171
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVTLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++ + GEE+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQAISPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD 271
>gi|348534090|ref|XP_003454536.1| PREDICTED: N-lysine methyltransferase SMYD2-A-like [Oreochromis
niloticus]
Length = 435
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 55/119 (46%), Gaps = 17/119 (14%)
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
++Q T + AQ+ N F IE ++ LS +G+A++ + NH C P
Sbjct: 169 DEQALTELFAQVNCNGFTIE------DEELSH----------LGSAVFPDVALMNHSCSP 212
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
N I+ A + A++++ GEE+ YID + R+ L + F C C C+S
Sbjct: 213 NV-IVTYKGTVAEVRAVKEINPGEEIFNSYIDLLYPTEDRKERLLDSYFFTCQCTECTS 270
>gi|301089299|ref|XP_002894963.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104481|gb|EEY62533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 457
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/86 (30%), Positives = 45/86 (52%), Gaps = 6/86 (6%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNAD-----ARLMALRDVEEGEELRICYI-DASM 335
A + +++ + +NHDC+PN ++A + + R + +GE+L I Y+
Sbjct: 201 AFDSGVFLYNAMFNHDCNPNCVKFTPEDAGPEGGVSEVRVARPIAKGEQLTISYLYPREQ 260
Query: 336 ARDARQAILTQGFGFQCNCLRCSSGD 361
+R RQ L++ FGF+C C C GD
Sbjct: 261 SRGNRQKNLSEQFGFECCCELCKRGD 286
>gi|221057668|ref|XP_002261342.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247347|emb|CAQ40747.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 518
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 115/305 (37%), Gaps = 45/305 (14%)
Query: 75 PSPPP--IQVALTESAGRGVFATRRIRASDLIHTAKPIITHP------------------ 114
P+ P +V E+ G+ V A ++RA I A P I P
Sbjct: 2 PASPNHMYKVDYKENKGKCVVAVSQMRAGYCIVEAHPEIFVPLSVKYMAPRVIDAENEET 61
Query: 115 TLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEVCKDNAKAFYDVERRADWS-AFNDYCR 173
T+N +C++C K + S +C +VC D A + E S F+ +C
Sbjct: 62 NYRTVN-ICFYCFEKFSKSVYCPNCKYVVYCSDVCLDLAWKLHREECEVFKSNIFDKFC- 119
Query: 174 SQGLKYPLLVKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGID 233
P ++ R M+I+ + + + EL E F A +
Sbjct: 120 ------PSIIMR---MVINSY--LNHFNFYEYCGSVEELPKEKYEYFKYPAYIVAVALMS 168
Query: 234 DEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLP-S 292
++ F N + ++L I IE + ++ L +E G A Y P
Sbjct: 169 KKKKLFSNFEDNESILKNI------IEKFVKISKNSLQIIDNDLE---PAGLAFYKKPVP 219
Query: 293 FYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQC 352
++NH C N + N + L DV GEEL I Y+D + R+ R AI + F C
Sbjct: 220 YFNHSCLSNC-VTIFKNQKLFIRTLVDVYPGEELTISYVDIAFDRNTRLAICMDQYFFTC 278
Query: 353 NCLRC 357
+C C
Sbjct: 279 SCKLC 283
>gi|169611204|ref|XP_001799020.1| hypothetical protein SNOG_08710 [Phaeosphaeria nodorum SN15]
gi|160702245|gb|EAT83878.2| hypothetical protein SNOG_08710 [Phaeosphaeria nodorum SN15]
Length = 321
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 50/103 (48%), Gaps = 5/103 (4%)
Query: 261 LAGGLYEDLLSSAAASIESEIAVGNAIYM--LPSF--YNHDCDPNAHIMWIDNADA-RLM 315
L G ED++ + A + + + YM PS NH C PNAH +I + +
Sbjct: 106 LGGSDIEDVMKTNAFACQLHDGGQDDSYMCLFPSVARINHACRPNAHARFIPTSLVMEIK 165
Query: 316 ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
AL+D++ GEE+ I Y + RQ + +G+ F C C C+
Sbjct: 166 ALQDIKPGEEIGISYGRVDLPHAERQKLYKEGWNFGCTCDMCT 208
>gi|320593393|gb|EFX05802.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 573
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/65 (36%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 295 NHDCDPN-AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDC PN + ++ + + A RD+EEGEEL I Y++ + D R ++ + + FQC
Sbjct: 353 NHDCRPNLSADIFGETMTMVVWANRDIEEGEELTISYLNDLLPSDRRSKVIKRRWDFQCQ 412
Query: 354 CLRCS 358
C C+
Sbjct: 413 CDLCT 417
>gi|380814842|gb|AFE79295.1| N-lysine methyltransferase SMYD2 [Macaca mulatta]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 119/297 (40%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + + DL+ + + T++ + C +C + S+ CG
Sbjct: 18 GRGLRALQPFQVGDLLFSCRAYAYVLTVNERGNHCEYCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY +VE ++ DW C G + P RL I++ +
Sbjct: 68 CK---QAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQ------------------MKFLNKQW 244
+ P E +LA++E S K +D+E+ ++F +
Sbjct: 118 IHPERTPSEKLLAVKE----FESHLDK--LDNEKKDLIQSDIAALHHFYSKHLEFPDNDS 171
Query: 245 YTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A+++++ GEE+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD 271
>gi|449501413|ref|XP_002187998.2| PREDICTED: SET and MYND domain-containing protein 1 isoform 2
[Taeniopygia guttata]
Length = 491
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/311 (22%), Positives = 117/311 (37%), Gaps = 69/311 (22%)
Query: 80 IQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-CYFCLRKITSSSQHFQ 138
+++ E GRG+ A + A D+I A+P +L V C+ C ++ + Q
Sbjct: 9 VELFTAEGKGRGLKAQKEFLAGDVIF-AEPAYAAVVFDSLTHVICHTCFKRQERLHRCGQ 67
Query: 139 HHNARFCGEVCKDNAKAFYDVERRADWSAFNDYCRS--QGLKYPLLVKRLACMIISGAES 196
A +C C+ RA W + + C + + K P RLA I+ E
Sbjct: 68 CKFAHYCDRTCQ-----------RAAWLSHKNECSAIKRHGKAPTENIRLAARILWKMER 116
Query: 197 ADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDE----------------QMKFL 240
+ + LS ++A+EE L++ G +++ Q +
Sbjct: 117 -------EGSGLSEGCLVAIEE----LQNHVDSFGEEEKKDLRADVESFLEFWPPQCQQF 165
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
Q +++ I N F + GL AVG I+ NHDC P
Sbjct: 166 GMQLISHIFGVISCNGFTLSDQRGLQ---------------AVGVGIFPNLCQANHDCWP 210
Query: 301 NAHIMWID-NADA-----------RLMALRDVEEGEELRICYIDASMARDARQAILTQGF 348
N +++ + N +A L AL + G+EL + Y+D + R+ L + +
Sbjct: 211 NCTVVFNNGNHEAVRSMFHTQMRIELRALNKISPGDELTVSYVDFLSLSEERRRQLKKQY 270
Query: 349 GFQCNCLRCSS 359
F C C C
Sbjct: 271 YFDCTCEHCKK 281
>gi|121715400|ref|XP_001275309.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
gi|119403466|gb|EAW13883.1| SET and MYND domain protein, putative [Aspergillus clavatus NRRL 1]
Length = 555
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 3/77 (3%)
Query: 286 AIYMLP--SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
IYM P + NH CD NA + + D+ + AL +++GE++ I Y+D + RQ
Sbjct: 256 GIYMHPYAALMNHSCDYNATVAF-DDDRLHVKALHPIKKGEQIFITYVDTTNPYKIRQKE 314
Query: 344 LTQGFGFQCNCLRCSSG 360
L+ + F C C +C G
Sbjct: 315 LSDRYYFTCRCSKCQQG 331
>gi|392565527|gb|EIW58704.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 402
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 43/84 (51%), Gaps = 2/84 (2%)
Query: 277 IESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDA-SM 335
+E +G A+Y +PSF+NH C N D R + R V GEEL I Y +M
Sbjct: 314 MEESECLGFALYPVPSFFNHHCAANVRKE-RDGRALRFVTTRAVVAGEELCISYGHVEAM 372
Query: 336 ARDARQAILTQGFGFQCNCLRCSS 359
RQ L +G+ F+C C RC++
Sbjct: 373 DWATRQKELLEGWYFECRCGRCTA 396
>gi|325188967|emb|CCA23495.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 497
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 1/74 (1%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
G A++ L NH C PN +++ N +A + A++ ++ +EL I YID + D RQ
Sbjct: 415 GTALFSLICTMNHSCLPNCIVVYGHNGEAHVHAIQAIQPKDELCIEYIDTDRSYDERQYE 474
Query: 344 LTQGFGFQCNCLRC 357
L + + F+C C++C
Sbjct: 475 LRE-YHFRCQCMKC 487
>gi|320169071|gb|EFW45970.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 509
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 42/81 (51%), Gaps = 1/81 (1%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARDAR 340
A G+ IY + + NH C N + + + + + A+RD+ GEE+ + DA+ R
Sbjct: 268 ARGSGIYPVAALVNHACVANCVLTYDLKSKRQFIRAIRDIRAGEEITHAFTDAASPTVVR 327
Query: 341 QAILTQGFGFQCNCLRCSSGD 361
+A L + F CNC RC+ D
Sbjct: 328 KAHLKSLYAFDCNCSRCNDSD 348
>gi|397486375|ref|XP_003814305.1| PREDICTED: N-lysine methyltransferase SMYD2 [Pan paniscus]
Length = 384
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 58/127 (45%), Gaps = 17/127 (13%)
Query: 235 EQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFY 294
+ ++FL+ V AQ+ N F IE ++ LS +G+AI+ +
Sbjct: 113 KHLEFLDNDSLVVVFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALM 156
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH C PN I+ A + A+++++ GEE+ YID + R L + F C C
Sbjct: 157 NHSCCPNV-IVTYKGTLAEVRAVQEIKPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCEC 215
Query: 355 LRCSSGD 361
C++ D
Sbjct: 216 QECTTKD 222
>gi|301603797|ref|XP_002931556.1| PREDICTED: SET and MYND domain-containing protein 3-like [Xenopus
(Silurana) tropicalis]
Length = 226
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
S NH CDPN I++ + L ++++ +GEEL I YID M R+ L + + F
Sbjct: 2 SLLNHSCDPNCVIVF-EGTCLLLRTVKEIPKGEELTISYIDVKMPTQGRRDQLQRQYCFL 60
Query: 352 CNCLRC 357
C+C RC
Sbjct: 61 CDCQRC 66
>gi|336374668|gb|EGO03005.1| hypothetical protein SERLA73DRAFT_103070 [Serpula lacrymans var.
lacrymans S7.3]
gi|336387539|gb|EGO28684.1| hypothetical protein SERLADRAFT_359922 [Serpula lacrymans var.
lacrymans S7.9]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 59/119 (49%), Gaps = 7/119 (5%)
Query: 249 LAQIRINAFRIELAGGLYE-DLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWI 307
L++I + EL L+E D + + +Y L + NH C P+ + +
Sbjct: 289 LSRILKKSTTPELDKELFEYDAFLRGLGRMSLNLEAHGGLYTLHAHLNHSCTPSISVRHL 348
Query: 308 DNADA----RLMALRDVEEGEELRICYID-ASMARDARQAILTQGFGFQCNCLRCSSGD 361
D +A ++A +D++ GEEL I Y++ A+ ++ R+ + GFG QC C RC S +
Sbjct: 349 DQHNALSRITIIARKDIDAGEELFISYVNPAARLKERRRNLAEWGFG-QCQCERCLSEE 406
>gi|389632029|ref|XP_003713667.1| hypothetical protein MGG_04672 [Magnaporthe oryzae 70-15]
gi|351646000|gb|EHA53860.1| hypothetical protein MGG_04672 [Magnaporthe oryzae 70-15]
gi|440485724|gb|ELQ65652.1| hypothetical protein OOW_P131scaffold00467g7 [Magnaporthe oryzae
P131]
Length = 447
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 13/111 (11%)
Query: 264 GLYED-------LLSSAAASIESEIAVGN-----AIYMLPSFYNHDCDPNAHIMW-IDNA 310
GLYE+ +LS+ A I+++ N ++ S NHDC PNA + +
Sbjct: 213 GLYENYQQKVDKILSTNAFDIDADPEGDNDDQFYVVFPEISRCNHDCRPNADYYYDPETL 272
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A+R + GEE+ + Y+D ++ AR A L++ +GF C C C+ D
Sbjct: 273 TQHVHAVRQINIGEEISLSYLDPLQSQAARLARLSRTWGFPCGCSMCTQRD 323
>gi|61806679|ref|NP_001013568.1| N-lysine methyltransferase SMYD2-A [Danio rerio]
gi|82178565|sp|Q5BJI7.1|SMY2A_DANRE RecName: Full=N-lysine methyltransferase SMYD2-A; AltName:
Full=Histone methyltransferase SMYD2-A; AltName:
Full=SET and MYND domain-containing protein 2A
gi|60552471|gb|AAH91465.1| SET and MYND domain containing 2a [Danio rerio]
Length = 435
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 105/277 (37%), Gaps = 21/277 (7%)
Query: 84 LTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNAR 143
L+ GRG+ A + + DL+ T++ C C + S+ + A
Sbjct: 13 LSPGKGRGLKAIKHFKVGDLVFACPAYAYVLTVNERGGRCECCFTRKEGLSKCGKCKQAY 72
Query: 144 FCGEVCKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDIL 203
+C C+ + +E A + ++C S+ ++ LV R+ E +L
Sbjct: 73 YCNVECQRGDWPMHKLECSAMCAYGENWCPSETVR---LVARIILKQKHQTERTPSERVL 129
Query: 204 QPASLSPELILAMEEGFVMLRSAFKK-AGIDDEQMKFLNKQWYTNVLAQIRINAFRIELA 262
L L E M + + F + T ++AQ+ N F IE
Sbjct: 130 TLRELEAHLDKLDNEKNEMNDTDIAALHHFYSRHLDFPDNAALTELIAQVNCNGFTIE-- 187
Query: 263 GGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEE 322
++ LS +G+A++ + NH C PN I+ A + A++++
Sbjct: 188 ----DEELSH----------LGSALFPDVALMNHSCSPNV-IVTYKGTVAEVRAVQEINP 232
Query: 323 GEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
EE+ YID + R L + F C+C C+S
Sbjct: 233 EEEIFNSYIDLLYPTEDRIERLKDSYFFNCDCKECTS 269
>gi|395859538|ref|XP_003802095.1| PREDICTED: N-lysine methyltransferase SMYD2 [Otolemur garnettii]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/291 (24%), Positives = 115/291 (39%), Gaps = 55/291 (18%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + + DL+ + T++ + C +C + S+ CG
Sbjct: 18 GRGLRALQPFQVGDLLFSCPAYAYVLTVNERGNHCDYCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C G + P RL I++ +
Sbjct: 68 CK---QAFYCDVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRS--AFKKAGIDDE----------QMKFLNKQWYTNVLA 250
+ P E +LA++E L KK I + ++F + + A
Sbjct: 118 IHPERTPSEKLLAVKEFESHLDKLDNEKKELIQSDIAALHHFYSKHLEFPDNDSLVVLFA 177
Query: 251 QIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA 310
Q+ N F IE ++ LS +G+AI+ + NH C PN I+
Sbjct: 178 QVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-IVTYKGT 220
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
A + A++++ GEE+ YID + R L + F C C C++ D
Sbjct: 221 LAEVRAVQEINPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTKD 271
>gi|426195740|gb|EKV45669.1| hypothetical protein AGABI2DRAFT_223832 [Agaricus bisporus var.
bisporus H97]
Length = 443
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/81 (32%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMA 336
++G+ ++ L S +NH C PNA ++ + ++ALRD + GEE+ I Y+D ++
Sbjct: 193 SIGHGVFPLASRLFNHSCIPNAAPRYVQGSAQPVLMEVVALRDTDIGEEICIPYLDPALT 252
Query: 337 RDARQAILTQGFGFQCNCLRC 357
+ +R I +GF C C C
Sbjct: 253 Q-SRSQIFQYTYGFTCQCGPC 272
>gi|146413184|ref|XP_001482563.1| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADA---RLMALRDVEEGEELRICYIDASMARDARQAI 343
I++ S NH C PN + + ++ A RD++ GEEL Y++ S RQ
Sbjct: 342 IFLTQSHLNHSCHPNTDVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQRQRE 401
Query: 344 LTQGFGFQCNCLRC 357
L +GF C+C RC
Sbjct: 402 LRVNWGFICSCQRC 415
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-----CYFCLRKIT 131
PP ++V +T G+G++A +IR DLI +P+ P L+ + V C +C + +T
Sbjct: 109 PPKVKVTMTAKRGKGLYAKTQIRKGDLIWEERPLFFVPALANVKLVRTGRACAYCAKLLT 168
Query: 132 SSS 134
S
Sbjct: 169 QRS 171
>gi|30840984|ref|NP_081072.1| N-lysine methyltransferase SMYD2 [Mus musculus]
gi|47117248|sp|Q8R5A0.1|SMYD2_MOUSE RecName: Full=N-lysine methyltransferase SMYD2; AltName:
Full=Histone methyltransferase SMYD2; AltName: Full=SET
and MYND domain-containing protein 2
gi|339717596|pdb|3QWV|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Cofactor Product Adohcy
gi|339717597|pdb|3QWW|A Chain A, Crystal Structure Of Histone Lysine Methyltransferase
Smyd2 In Complex With The Methyltransferase Inhibitor
Sinefungin
gi|18606376|gb|AAH23119.1| SET and MYND domain containing 2 [Mus musculus]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 114/297 (38%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A R DL+ + T+ C C + S+ CG
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C S G + P RL I++ +
Sbjct: 68 CK---QAFYCDVECQKEDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTNVL------- 249
+ P E +LA+ E S K +D+E+ + Q+Y+ L
Sbjct: 118 IHPERTPSEKLLAVRE----FESHLDK--LDNEKKDLIQSDIAALHQFYSKYLEFPDHSS 171
Query: 250 -----AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++++ G+E+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD 271
>gi|223634690|sp|A5DQN2.2|SET5_PICGU RecName: Full=Potential protein lysine methyltransferase SET5;
AltName: Full=SET domain-containing protein 5
gi|190348931|gb|EDK41485.2| hypothetical protein PGUG_05583 [Meyerozyma guilliermondii ATCC
6260]
Length = 483
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 287 IYMLPSFYNHDCDPNAHIMWIDNADA---RLMALRDVEEGEELRICYIDASMARDARQAI 343
I++ S NH C PN + + ++ A RD++ GEEL Y++ S RQ
Sbjct: 342 IFLTQSHLNHSCHPNTDVQASTASRTGPLKVFAARDIKAGEELTTSYVNPSHTLHQRQRE 401
Query: 344 LTQGFGFQCNCLRC 357
L +GF C+C RC
Sbjct: 402 LRVNWGFICSCQRC 415
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 20/63 (31%), Positives = 34/63 (53%), Gaps = 5/63 (7%)
Query: 77 PPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSV-----CYFCLRKIT 131
PP ++V +T G+G++A +IR DLI +P+ P L+ + V C +C + +T
Sbjct: 109 PPKVKVTMTAKRGKGLYAKTQIRKGDLIWEERPLFFVPALANVKLVRTGRACAYCAKLLT 168
Query: 132 SSS 134
S
Sbjct: 169 QRS 171
>gi|66810740|ref|XP_639077.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|74854618|sp|Q54R14.1|Y3443_DICDI RecName: Full=SET domain-containing protein DDB_G0283443
gi|60467687|gb|EAL65706.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 393
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 40/67 (59%), Gaps = 1/67 (1%)
Query: 292 SFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQ 351
S++NH C PN + +N + +L +++G+EL I YID M+++ R L + + F+
Sbjct: 274 SYFNHSCFPNC-VRVQENQSISIYSLIPIKKGDELSISYIDIRMSKNDRLLHLKEIYYFE 332
Query: 352 CNCLRCS 358
C C RC+
Sbjct: 333 CKCKRCT 339
>gi|240275453|gb|EER38967.1| MYND finger family protein [Ajellomyces capsulatus H143]
Length = 518
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G + ++ NH C+PNA+I + D L AL+D+ EE+ I Y D + RQ
Sbjct: 208 IGLCLLPFAAYINHSCEPNAYIGF-DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQT 266
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + F+C C +C G
Sbjct: 267 ELQLRYFFECKCPKCLKG 284
>gi|195440652|ref|XP_002068154.1| GK12548 [Drosophila willistoni]
gi|194164239|gb|EDW79140.1| GK12548 [Drosophila willistoni]
Length = 534
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 61/295 (20%), Positives = 116/295 (39%), Gaps = 40/295 (13%)
Query: 81 QVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHH 140
+VA + GR + ATR I+ + I P++ P +++ +C C R I + + ++
Sbjct: 49 EVASNDMLGRHLRATRDIKIGEQILREAPLVVGPKVASA-PICLGCHRNIGAPGKSMNYY 107
Query: 141 NAR-----FCGEVCKDNAK--------AFYDVERRADWSAFNDYCRSQGLKYPLLVKRLA 187
CG+ C+ + A + + + ++ D + + ++ L
Sbjct: 108 KCSSCTWPLCGKECEQSPHHRGECQLMAQSNFQSKINYHPDQDQNQERKESAYCVIMLLR 167
Query: 188 CMIISGAESADCIDILQPASLSPELILAMEEG-FVMLRS---AFKKAGIDDEQMKFLNKQ 243
CM++ ++ D + + SL L +E + +LR+ F K + ++ ++
Sbjct: 168 CMLLKTSQPEDFVRLF---SLEDHLKERLETPLYQVLRANLITFIKTVLG---LREWSEM 221
Query: 244 WYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAH 303
+ A + N F + E A+Y + +HDC PN
Sbjct: 222 EILRIAAILDTNTFEVRQP----------------KERRKVRALYPGAAMISHDCVPNMR 265
Query: 304 IMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ DN + +A R + +GE L I Y + R+ L Q F C C RCS
Sbjct: 266 HRFDDNMNIIFLAKRAISKGEILNISYTQPLRSTIQRRLHLRQAKCFDCACQRCS 320
>gi|225561846|gb|EEH10126.1| MYND finger family protein [Ajellomyces capsulatus G186AR]
Length = 518
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G + ++ NH C+PNA+I + D L AL+D+ EE+ I Y D + RQ
Sbjct: 208 IGLCLLPFAAYINHSCEPNAYIGF-DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQT 266
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + F+C C +C G
Sbjct: 267 ELQLRYFFECKCPKCLKG 284
>gi|170090095|ref|XP_001876270.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649530|gb|EDR13772.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 451
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 282 AVGNAIYMLPS-FYNHDCDPNAHIMWI----DNADARLMALRDVEEGEELRICYIDASMA 336
+G+ ++ L S +NH C PNA + ++A+RD+ GE++ I Y+D +M
Sbjct: 192 TIGHGVFPLASRLFNHSCVPNAAARYSLSPSHGVGMEVVAIRDIPSGEQICIPYLDPAMT 251
Query: 337 RDARQAILTQGFGFQCNCLRC 357
+ +R I +GF+C+C C
Sbjct: 252 Q-SRHQIFELTYGFRCDCSSC 271
>gi|452003037|gb|EMD95494.1| hypothetical protein COCHEDRAFT_1087695 [Cochliobolus
heterostrophus C5]
Length = 330
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 52/113 (46%), Gaps = 9/113 (7%)
Query: 254 INAFRIELAGGLYEDLLSSAAASIESEIAVGNA----IYMLPSF--YNHDCDPNAHIMWI 307
+ A L G ED++ + S + GN I + PS NH C PNA ++
Sbjct: 109 VAALDQSLGGSPIEDVMKTN--SFACRLDDGNVDKAYICLFPSVSRINHACQPNALARFV 166
Query: 308 DNA-DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSS 359
++ A RD+ GEE+ I Y + R+ RQ + G+ F+C C C++
Sbjct: 167 PKTLSMQVKAKRDIAAGEEINISYGRVDLTREERQELYKDGWNFECTCSLCTA 219
>gi|449540254|gb|EMD31248.1| hypothetical protein CERSUDRAFT_60339 [Ceriporiopsis subvermispora
B]
Length = 467
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI-MWIDNA 310
+R N F + G L + + I A+G + S NH C PNAH+ M +
Sbjct: 161 LRTNGFSV--TGVLRKVVPDLPKGEIHELTAIGEVL----SRVNHSCRPNAHVRMDTHSL 214
Query: 311 DARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+L+ALR + GE++ + Y D R+ L +GF C+CL C+
Sbjct: 215 SLQLVALRPIASGEQVTVAYTDILAPYTTRKRKLAP-YGFTCSCLSCA 261
>gi|330792060|ref|XP_003284108.1| hypothetical protein DICPUDRAFT_93608 [Dictyostelium purpureum]
gi|325085922|gb|EGC39320.1| hypothetical protein DICPUDRAFT_93608 [Dictyostelium purpureum]
Length = 731
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 3/78 (3%)
Query: 286 AIYMLPSFYNHDCDPNAHIMW--IDNADARLMALRDVEEGEELRICYIDA-SMARDARQA 342
+I SF+NH C+PN I +++ R R +++GEEL I Y+D + + R+
Sbjct: 644 SILQQASFFNHSCEPNIFIATPVVNDKSIRFCTRRPIKKGEELFITYLDGFDLDTETRKT 703
Query: 343 ILTQGFGFQCNCLRCSSG 360
IL F C C C++
Sbjct: 704 ILNTTHMFTCKCPSCTNN 721
>gi|328852756|gb|EGG01899.1| hypothetical protein MELLADRAFT_110526 [Melampsora larici-populina
98AG31]
Length = 544
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 8/103 (7%)
Query: 263 GGLYEDLLSSAAASIESEIAVGN------AIYMLPSFYNHDCDPNAHIMWIDNA-DARLM 315
G E + SA + E ++G A+ P+ NHDC NA + ++ +
Sbjct: 327 GETEEIWIESAISRNGFEFSIGEDEVPFFAVIPEPARLNHDCRANAAFHFDESTLQVYMY 386
Query: 316 ALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
A+RD+ GEE+ I Y D ++R+ R+ I + +GF+C C CS
Sbjct: 387 AIRDIAPGEEVTISYRDMKLSREERR-IELKHYGFECQCSLCS 428
>gi|326472015|gb|EGD96024.1| hypothetical protein TESG_03485 [Trichophyton tonsurans CBS 112818]
Length = 499
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 115/300 (38%), Gaps = 62/300 (20%)
Query: 88 AGRGVFATRRIRASDLIHTAKPIITHPTLST--LNSVCYFCLRKITSSSQHFQHHN---- 141
AGRG+FAT I + + P LST L+ VC C + +
Sbjct: 20 AGRGLFATSTINMGKEVFSM-PATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLR 78
Query: 142 -------ARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYPLL-------VKRL 186
++C E C+ A+W + + C YP + V R+
Sbjct: 79 ACSGCKVVKYCDEKCQS-----------ANWKLIHKHECAIYKKLYPKILPVNSRAVLRI 127
Query: 187 ACMIISGAE--SADCIDILQPASLSPELILAMEEGF--VMLRSAFKK--AGIDDEQMKFL 240
+ S E +D L S E+ + +E + +ML + +K +G D L
Sbjct: 128 VKLRNSNEEHVKSDLTMFLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGSD------L 181
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
+ + LA+I +N+F A G +G I + NH C+P
Sbjct: 182 DIETIAEYLARIEVNSFTFTTAFG----------------DPLGLCIQPFACYMNHSCEP 225
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
NA ++ D + AL ++ E++ I YID + + RQ L + + F C C +C+ G
Sbjct: 226 NA-VVGFDEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCAQG 284
>gi|170089169|ref|XP_001875807.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649067|gb|EDR13309.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 213
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 5/83 (6%)
Query: 281 IAVGNAIYMLPSF--YNHDCDPNAHIM--WIDNADARLM-ALRDVEEGEELRICYIDASM 335
+A GN + + P NH C +++ W + ++ AL+ + +GEEL Y +
Sbjct: 59 VAAGNGVGIFPRMARLNHGCSSAFNVVYSWREKEGVLVVYALKSIRKGEELLTTYTELRR 118
Query: 336 ARDARQAILTQGFGFQCNCLRCS 358
R+ R+A LT+ +GF C C CS
Sbjct: 119 PREQRRAYLTEHYGFYCTCSACS 141
>gi|348577033|ref|XP_003474289.1| PREDICTED: N-lysine methyltransferase SMYD2-like [Cavia porcellus]
Length = 433
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 117/297 (39%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A + + DL+ + T+ S C C + S+ CG
Sbjct: 18 GRGLRALQPFQVGDLLFACPAYASVLTVGERGSHCEHCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY +VE ++ DW C G + P RL I++ +
Sbjct: 68 CK---QAFYCNVECQKEDWPMHKLECSPMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTN--------- 247
+ P E++LA++E S K +D+E+ + Q+Y+
Sbjct: 118 IHPERTPSEMLLAVKE----FESHLDK--LDNEKKDLIQSDIAMLHQFYSKHLEFPDHDS 171
Query: 248 ---VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A+++V GEE+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEVLPGEEVFTSYIDLLYPTEDRNDRLRDSYFFTCECQECTTRD 271
>gi|171688102|ref|XP_001908991.1| hypothetical protein [Podospora anserina S mat+]
gi|170944013|emb|CAP70123.1| unnamed protein product [Podospora anserina S mat+]
Length = 289
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 37/59 (62%), Gaps = 1/59 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NH C+PNA +++ + R+ +LR + GEE+ ICYID ++ +R+ IL F+C+
Sbjct: 3 NHSCEPNA-FAFLEKGEIRVRSLRKIAAGEEITICYIDPTIDVKSRREILMDEHFFECD 60
>gi|328855353|gb|EGG04480.1| hypothetical protein MELLADRAFT_108480 [Melampsora larici-populina
98AG31]
Length = 398
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 66/152 (43%), Gaps = 22/152 (14%)
Query: 213 ILAMEEGFVMLRSAFKKA-GIDDEQMKFLNKQWYTNVLAQIRINAFRIELAGGLYEDLLS 271
I ++E+ L S KK G D + L+ W L+ I N RI L
Sbjct: 139 IRSIEDYETNLNSILKKLDGEDRNRFLSLSNPWQNQSLSNIS-NYIRI----------LQ 187
Query: 272 SAAASIESEIAVGNAIYMLPSF--YNHDCD--PNAHIMWIDNADARLM-ALRDVEEGEEL 326
+ A SI S N + + PS NH C N+ W + ++ A + +E GEE+
Sbjct: 188 TNAISIGS-----NQLAIFPSLSRINHGCAGASNSVYNWREKEGVEVVHATKLIEVGEEI 242
Query: 327 RICYIDASMARDARQAILTQGFGFQCNCLRCS 358
I Y D+ +R RQ L +GFQC C CS
Sbjct: 243 LISYWDSKRSRSDRQDYLKSNYGFQCTCQTCS 274
>gi|134057769|emb|CAK38164.1| unnamed protein product [Aspergillus niger]
Length = 358
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 117/301 (38%), Gaps = 50/301 (16%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTL---NSVC--YFCLRKITSSSQHFQHH--- 140
GR + IRA +L+ P PT++ +S+C C R++ S+H +
Sbjct: 63 GRTLSTNGPIRAGELLMADLPYAVLPTVAVGGGDDSICSNLGCSRRV---SRHAANSVTC 119
Query: 141 ------NARFCGEVCKDNAKAFYDVERRADW-SAFNDYCRSQ---------GLKYPLLVK 184
+ +C E CKD +A +DVE W ++ R + + LL
Sbjct: 120 ETGCSLDIWWCNESCKDEDQARHDVE--CAWLKKYSPVLRQELGDHDYYMLWIIIRLLAA 177
Query: 185 RLACMIISGAESADCIDILQ------PASLSPELILAMEEGFVMLRSAFKKAGIDDEQMK 238
R + I+ + + +L+ P S E+ +L+S+ +D + +
Sbjct: 178 RNSHSKIASLAAGKGVQLLRSNRDSWPTSQLQHWSTLAEK--YLLQSSLLPETLDLDTLV 235
Query: 239 FLNKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDC 298
L TNV EL G E ++ + +E G A+++ + NH C
Sbjct: 236 DLICAEETNVF----------ELCPGPTE-IIPHQSPGVERGTQYGLAVFLRITLANHSC 284
Query: 299 DPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARD--ARQAILTQGFGFQCNCLR 356
PN D + ALRD+ GEE Y D S D AR+ + F F C C R
Sbjct: 285 APNVTHQADDRGRMMVTALRDIAPGEECCTSYFDLSEYVDLQARRKKTQELFTFTCTCPR 344
Query: 357 C 357
C
Sbjct: 345 C 345
>gi|326477120|gb|EGE01130.1| SET and MYND domain-containing protein [Trichophyton equinum CBS
127.97]
Length = 499
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 115/300 (38%), Gaps = 62/300 (20%)
Query: 88 AGRGVFATRRIRASDLIHTAKPIITHPTLST--LNSVCYFCLRKITSSSQHFQHHN---- 141
AGRG+FAT I + + P LST L+ VC C + +
Sbjct: 20 AGRGLFATSTINMGKEVFSM-PATFSTVLSTERLSDVCSNCFANLPFGVNVMASVDMKLR 78
Query: 142 -------ARFCGEVCKDNAKAFYDVERRADWSAFNDY-CRSQGLKYPLL-------VKRL 186
++C E C+ A+W + + C YP + V R+
Sbjct: 79 ACSGCKVVKYCDEKCQS-----------ANWKLIHKHECAIYKKLYPKILPVNSRAVLRI 127
Query: 187 ACMIISGAE--SADCIDILQPASLSPELILAMEEGF--VMLRSAFKK--AGIDDEQMKFL 240
+ S E +D L S E+ + +E + +ML + +K +G D L
Sbjct: 128 VKLRNSNEEHVKSDLTMFLSLRSHLNEITNSNQEQYERIMLSAKAEKEYSGSD------L 181
Query: 241 NKQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDP 300
+ + LA+I +N+F A G +G I + NH C+P
Sbjct: 182 DIETIAEYLARIEVNSFTFTTAFG----------------DPLGLCIQPFACYMNHSCEP 225
Query: 301 NAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
NA ++ D + AL ++ E++ I YID + + RQ L + + F C C +C+ G
Sbjct: 226 NA-VVGFDEGLITVKALCKIKPDEQVFISYIDNTYPFEVRQKQLAERYFFTCKCSKCAQG 284
>gi|322698787|gb|EFY90554.1| hypothetical protein MAC_03332 [Metarhizium acridum CQMa 102]
Length = 1175
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 11/77 (14%)
Query: 285 NAIYMLPS-FYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAI 343
+++M P+ F NHDCD NA +M NA ++A+R +E GEE+ + Y ++ D
Sbjct: 182 TSLFMGPARFANHDCDANAKLMTTSNAGIEIVAIRPIESGEEITVTYGESYFGED----- 236
Query: 344 LTQGFGFQCNCLRCSSG 360
+C C C G
Sbjct: 237 -----NCECLCGTCEKG 248
>gi|357603389|gb|EHJ63740.1| hypothetical protein KGM_20849 [Danaus plexippus]
Length = 544
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/296 (21%), Positives = 114/296 (38%), Gaps = 50/296 (16%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSS-SQHF 137
P ++ GR + ATR I+A D+I P+IT P T VC C + +
Sbjct: 45 PAKIKELPGIGRYLEATRTIKAGDVIMKEAPLITGPAQIT-PPVCLGCYVLLEEGKTTRC 103
Query: 138 QHHNARFCGEVCKDNA----KAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISG 193
FC E C ++ + FY ++R + + + Y G+ +P
Sbjct: 104 DKCGWPFCSEKCTHSSVHEPECFY-TQQRGEKVSISTY----GIPHP------------- 145
Query: 194 AESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTN--VLAQ 251
+ C+ +L+ L + + L + + E K +K W + ++AQ
Sbjct: 146 --NYQCVTVLR--------CLYQRDHNMKLWNKLQALETHTEDRKGTDK-WENDRKMVAQ 194
Query: 252 IRINAFRIELAGGLYEDLLSSAAASIESE---------IAVGNAIYMLPSFYNHDCDPNA 302
+ F++E E + I +AV + I M+ H+C N
Sbjct: 195 FIWSFFKLERLFNEEEIMKCCGIIQINGHEVPLLEPEYVAVFDKISMV----EHNCRANC 250
Query: 303 HIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCS 358
+ + N + L A + G + +CY D +AR+ L+ F+C+C RCS
Sbjct: 251 NKSFTSNGEIILSAGVAIPRGSHISVCYTDPLWGTEARRHHLSDSKFFECSCERCS 306
>gi|195111430|ref|XP_002000282.1| GI10143 [Drosophila mojavensis]
gi|193916876|gb|EDW15743.1| GI10143 [Drosophila mojavensis]
Length = 392
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 7/97 (7%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEG 323
+YE + A + +E G+ +Y+L S NH C PNA + N L A+ +++G
Sbjct: 270 IYEKVGEFAGEFLNNE---GSGLYILQSKVNHSCVPNAQSTFPYSNDIVVLKAVAPIQKG 326
Query: 324 EELRICYIDA---SMARDARQAILTQGFGFQCNCLRC 357
EE+ I Y+D +R +R IL + + F C C +C
Sbjct: 327 EEICISYLDECQLERSRHSRHKILRENYIFVCQCSKC 363
>gi|194883134|ref|XP_001975659.1| GG20436 [Drosophila erecta]
gi|190658846|gb|EDV56059.1| GG20436 [Drosophila erecta]
Length = 515
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 105/296 (35%), Gaps = 37/296 (12%)
Query: 71 KPSQPSPPPIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKI 130
K +PS P ++ E GR + ATR I+ +++ P++ P + VC CL I
Sbjct: 34 KRHKPSCHPFKIDHNEQLGRHLVATRTIKPYEIVLKEAPLVRGPAQISA-PVCLGCLNGI 92
Query: 131 TSSSQ-HFQHHNARFCGEVCK--DNAKA----FYDVERRADWSAFNDYCRSQGLKYPLL- 182
+ + CG CK D KA D ++ + F G +PL
Sbjct: 93 EAEDHIECEQCGWPLCGPECKSLDEHKAECRLTKDRGQKVNVQEF-------GGPHPLYT 145
Query: 183 -VKRLACMIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLN 241
+ + C++I + SL + G ++ I KF
Sbjct: 146 CLSTVRCLLIGETSPEKASKFQELESLE-----STRRGSNQWKADL--VSIGQFIPKFFK 198
Query: 242 KQWYTNVLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN 301
Q +T + A +I + + A++ SF + C PN
Sbjct: 199 TQKFTEEEIMKAVGALQIN-------------GHEVPTTDPSHVAVFYTASFTENSCLPN 245
Query: 302 AHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
+ N L A ++++ L ICY DA RQ L Q F+C C RC
Sbjct: 246 LAKSFNKNGHCILWAPHEIKKNAHLSICYSDAVWGTADRQRHLMQTKLFKCACDRC 301
>gi|189204167|ref|XP_001938419.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985518|gb|EDU51006.1| MYND finger family protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 454
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 33/63 (52%), Gaps = 1/63 (1%)
Query: 295 NHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNC 354
NH CDPNA IM +D + LR + + +E+ I YID + RQ L + F C C
Sbjct: 155 NHSCDPNAFIM-MDGPSVSIRTLRPIRKDKEIFISYIDTTYPYHKRQEELQTRWFFTCRC 213
Query: 355 LRC 357
+C
Sbjct: 214 AKC 216
>gi|307195015|gb|EFN77083.1| SET and MYND domain-containing protein 5 [Harpegnathos saltator]
Length = 385
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 63/124 (50%), Gaps = 10/124 (8%)
Query: 243 QWYTNVLAQIRINAFRI---ELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCD 299
+W NV A RI +L +Y+++ + +++E G+ +Y+L S NH C
Sbjct: 246 RWVKNVSALDMHKDQRIFIDKLIDRIYDEMEEVVGSFLDNE---GSGLYILQSAVNHSCM 302
Query: 300 PNAHIMW-IDNADARLMALRDVEEGEELRICYIDA---SMARDARQAILTQGFGFQCNCL 355
PNA + + N L A+RD++ EE+ Y++ +R +RQ L+ + F C+C
Sbjct: 303 PNASVEFPYSNNTLVLRAIRDIQPEEEICTSYLEECELERSRYSRQKALSSLYLFVCHCD 362
Query: 356 RCSS 359
+C S
Sbjct: 363 KCQS 366
>gi|149041012|gb|EDL94969.1| SET and MYND domain containing 2, isoform CRA_b [Rattus norvegicus]
Length = 349
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 114/297 (38%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A R DL+ + T+ C C + S+ CG
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C S G + P RL I++ +
Sbjct: 68 CK---QAFYCDVECQKEDWPLHKLECSSMVVFGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTN--------- 247
+ P E +LA+ E S K +D+E+ + Q+Y+
Sbjct: 118 MHPERTPSEKLLAVRE----FESHLDK--LDNEKKDLIQSDIAALHQFYSKHLEFPDHSS 171
Query: 248 ---VLAQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
+ AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++++ G+E+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD 271
>gi|9295736|gb|AAF87042.1|AC006535_20 T24P13.14 [Arabidopsis thaliana]
Length = 969
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED +S E G ++ L SF NH C PNA + + + + A RD++ GE
Sbjct: 736 LVEDAVSGKVMGKNKEY-YGVGLWTLASFINHSCIPNARRLHVGDY-VIVHASRDIKTGE 793
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + R+ + + +GF C C RC
Sbjct: 794 EISFAYFDVLSPLEKRKE-MAESWGFCCGCSRC 825
>gi|148681077|gb|EDL13024.1| SET and MYND domain containing 2 [Mus musculus]
Length = 450
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 114/297 (38%), Gaps = 67/297 (22%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ A R DL+ + T+ C C + S+ CG
Sbjct: 18 GRGLRALRPFHVGDLLFSCPAYACVLTVGERGHHCECCFARKEGLSK---------CGR- 67
Query: 149 CKDNAKAFY-DVE-RRADWSAFNDYCRSQ---GLKY-PLLVKRLACMIISGAESADCIDI 202
CK +AFY DVE ++ DW C S G + P RL I++ +
Sbjct: 68 CK---QAFYCDVECQKEDWPLHKLECSSMVVLGENWNPSETVRLTARILAKQK------- 117
Query: 203 LQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNK------QWYTNVL------- 249
+ P E +LA+ E S K +D+E+ + Q+Y+ L
Sbjct: 118 IHPERTPSEKLLAVRE----FESHLDK--LDNEKKDLIQSDIAALHQFYSKYLEFPDHSS 171
Query: 250 -----AQIRINAFRIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHI 304
AQ+ N F IE ++ LS +G+AI+ + NH C PN I
Sbjct: 172 LVVLFAQVNCNGFTIE------DEELSH----------LGSAIFPDVALMNHSCCPNV-I 214
Query: 305 MWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSGD 361
+ A + A++++ G+E+ YID + R L + F C C C++ D
Sbjct: 215 VTYKGTLAEVRAVQEIHPGDEVFTSYIDLLYPTEDRNDRLRDSYFFTCECRECTTKD 271
>gi|367049710|ref|XP_003655234.1| hypothetical protein THITE_13029, partial [Thielavia terrestris
NRRL 8126]
gi|347002498|gb|AEO68898.1| hypothetical protein THITE_13029, partial [Thielavia terrestris
NRRL 8126]
Length = 378
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 37/65 (56%), Gaps = 1/65 (1%)
Query: 295 NHDCDPNAHIMWIDNADAR-LMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCN 353
NHDC PNA + A A+ + A RD+ GEEL + Y++ M R R L + +GF+C
Sbjct: 195 NHDCRPNAEYRFEHAALAQHVHAARDIAPGEELTLSYVNPLMTRAQRLQRLERNWGFRCG 254
Query: 354 CLRCS 358
C C+
Sbjct: 255 CPLCA 259
>gi|325091290|gb|EGC44600.1| MYND finger family protein [Ajellomyces capsulatus H88]
Length = 479
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 40/78 (51%), Gaps = 1/78 (1%)
Query: 283 VGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQA 342
+G + ++ NH C+PNA+I + D L AL+D+ EE+ I Y D + RQ
Sbjct: 169 IGLCLLPFAAYINHSCEPNAYIGF-DGQVMYLKALQDIAPDEEIFISYTDNTEPLKTRQT 227
Query: 343 ILTQGFGFQCNCLRCSSG 360
L + F+C C +C G
Sbjct: 228 ELQLRYFFECKCPKCLKG 245
>gi|195029731|ref|XP_001987725.1| GH19808 [Drosophila grimshawi]
gi|193903725|gb|EDW02592.1| GH19808 [Drosophila grimshawi]
Length = 515
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 116/298 (38%), Gaps = 57/298 (19%)
Query: 79 PIQVALTESAGRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQ 138
P ++A E GR + ATR I+ +++ P++ P + VC CL I ++
Sbjct: 42 PFKIAKNEVLGRHLVATRNIKPYEIVLKEAPLMRGPAQISA-PVCMGCLNCIEAND---- 96
Query: 139 HHNARFCG-EVCKDNAKAFYDVERRADWSAFNDYCRSQGLK-------YPLL--VKRLAC 188
H + CG +C + E RA+ + R Q + +PL V + C
Sbjct: 97 HISCEKCGWPLCGPECPSL--TEHRAECQLTEE--RGQKVNVTEFNGPHPLYTCVSTVRC 152
Query: 189 MIISGAESADCIDILQPASLSPELILAMEEGFVMLRSAFKKAGIDDEQMKFLNKQWYTNV 248
++++ A+ E F L S EQ + + QW ++
Sbjct: 153 LLLNTTNPAN------------------AEKFNQLESL--------EQTRRGSNQWKADL 186
Query: 249 --LAQIRINAFRIELAGGLYEDLLSSAAASIE---SEIAVGN----AIYMLPSFYNHDCD 299
+ Q FR + E+ + A ++ E+ + + A++ SF + C
Sbjct: 187 ASIGQFIPKFFRTQ---KFSEEEIMRAVGILQINGHEVPISDPPHVAVFYTASFTENSCV 243
Query: 300 PNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
PN + + L A +++++ L ICY DA RQ L Q F+C C RC
Sbjct: 244 PNLAKSFNKHGHCVLWAPKEIKKNSHLSICYSDAVWGTADRQRHLMQTKLFKCACERC 301
>gi|330799867|ref|XP_003287962.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
gi|325081986|gb|EGC35483.1| hypothetical protein DICPUDRAFT_33308 [Dictyostelium purpureum]
Length = 400
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 1/80 (1%)
Query: 278 ESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMAR 337
+++ + AI SF+NH C PN I D AL V++GE L I Y+D +
Sbjct: 250 KNDKCIAVAISPSSSFFNHSCIPNC-INIRDGNKMTFKALYPVKKGEPLAISYLDLDLPV 308
Query: 338 DARQAILTQGFGFQCNCLRC 357
++R+ L G+ F C C RC
Sbjct: 309 ESRKEYLKYGYYFDCGCPRC 328
>gi|405972122|gb|EKC36909.1| SET and MYND domain-containing protein 5 [Crassostrea gigas]
Length = 319
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 3/79 (3%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADA---RLMALRDVEEGEELRICYIDASMARD 338
AV I+ + S NH C N I+ D +A +++ R ++EG+EL ++D S+ R
Sbjct: 187 AVFTGIFRIQSSLNHSCANNVEIISGDVNEAPGIHVISKRPIKEGDELFTSFVDTSLNRQ 246
Query: 339 ARQAILTQGFGFQCNCLRC 357
R+ L + + F C C RC
Sbjct: 247 QRRGFLYRMYHFWCECPRC 265
>gi|186478896|ref|NP_173998.2| SET domain protein 35 [Arabidopsis thaliana]
gi|332192607|gb|AEE30728.1| SET domain protein 35 [Arabidopsis thaliana]
Length = 545
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 265 LYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGE 324
L ED +S E G ++ L SF NH C PNA + + + + A RD++ GE
Sbjct: 312 LVEDAVSGKVMGKNKEY-YGVGLWTLASFINHSCIPNARRLHVGDY-VIVHASRDIKTGE 369
Query: 325 ELRICYIDASMARDARQAILTQGFGFQCNCLRC 357
E+ Y D + R+ + + +GF C C RC
Sbjct: 370 EISFAYFDVLSPLEKRKE-MAESWGFCCGCSRC 401
>gi|441432095|ref|YP_007354137.1| SET domain protein [Acanthamoeba polyphaga moumouvirus]
gi|440383175|gb|AGC01701.1| SET domain protein [Acanthamoeba polyphaga moumouvirus]
Length = 233
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 40/76 (52%), Gaps = 1/76 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
+I + +NH C PN + N M + +V++GEEL Y+D + RQ L
Sbjct: 135 SILYVGRIFNHSCLPNIIFYRVKNVMC-FMTIVEVKKGEELVDNYVDITQNIKIRQNRLL 193
Query: 346 QGFGFQCNCLRCSSGD 361
+GF CNC+RC++ +
Sbjct: 194 NQYGFICNCIRCNNHN 209
>gi|357166126|ref|XP_003580607.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like
[Brachypodium distachyon]
Length = 473
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 4/77 (5%)
Query: 284 GNAIYMLPSFYNHDCDPNAHIMWID---NADARLMALRDVEEGEELRICYIDASMARDAR 340
G A + L S NH C PNA D + ++AL + + +E+ I YID + R
Sbjct: 392 GTAFFPLQSCMNHSCCPNAKAFKRDEDNDGHGVIIALEPISKDDEITISYIDEDLPYVER 451
Query: 341 QAILTQGFGFQCNCLRC 357
QA+L +GF C CL+C
Sbjct: 452 QALLAD-YGFTCACLKC 467
>gi|281210720|gb|EFA84886.1| hypothetical protein PPL_01879 [Polysphondylium pallidum PN500]
Length = 439
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 282 AVGNAIYMLPSFYNHDCDPNAHIMWIDNADA----RLMALRDVEEGEELRICYIDASMAR 337
++G+A+Y S +NH C PN + I+ A +++L D+ G EL YI S+
Sbjct: 279 SIGSAVYEKASLFNHSCQPN--VCRINRAGEWGALEMISLTDIAAGTELVYNYIQISLPT 336
Query: 338 DARQAILTQGFGFQCNCLRC 357
+ RQ+ L++ + F+C C C
Sbjct: 337 EDRQSKLSENYFFECKCNGC 356
>gi|410922900|ref|XP_003974920.1| PREDICTED: SET and MYND domain-containing protein 1-like [Takifugu
rubripes]
Length = 489
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/295 (20%), Positives = 106/295 (35%), Gaps = 53/295 (17%)
Query: 89 GRGVFATRRIRASDLIHTAKPIITHPTLSTLNSVCYFCLRKITSSSQHFQHHNARFCGEV 148
GRG+ AT+ A D+I + + S +C+ C R+ + Q A +C
Sbjct: 13 GRGLKATKEFWAGDVIFSEPSLAAVVFDSLAERICHSCFRRQEKLQKCSQCKFAHYCDRT 72
Query: 149 CKDNAKAFYDVERRADWSAFNDYCRSQGLKYPLLVKRLACMIISGAESADCIDILQPASL 208
C+ A E + + A Y ++ L+ + + + G+ +D + ++ L
Sbjct: 73 CQRAGWA----EHKQECGAIKAYGKAPNENIRLVARLMWRLDKEGSVVSD-MQLITVEEL 127
Query: 209 SPELILAMEEGFVMLRSAF-----------KKAGIDDEQMKFLNKQWYTNVLAQIRINAF 257
+ E+ L+ K+ IDD +++ I N F
Sbjct: 128 EDHVADMQEDEIKELKVDIHNFLDYWPRNSKQHTIDD----------ISHIFGVINCNGF 177
Query: 258 RIELAGGLYEDLLSSAAASIESEIAVGNAIYMLPSFYNHDCDPN------------AHIM 305
+ GL AVG ++ NH+C PN + M
Sbjct: 178 TVSDQRGLQ---------------AVGVGLFPNLCMVNHNCWPNCTVILNHGNQSAVNTM 222
Query: 306 WIDNADARLMALRDVEEGEELRICYIDASMARDARQAILTQGFGFQCNCLRCSSG 360
+ L +L + EGEEL + Y+D + R+ +L + F C C C +G
Sbjct: 223 FHSQRRIELRSLGKIAEGEELTVAYVDFLNLSEERRRLLKTQYFFDCQCDYCKNG 277
>gi|448825163|ref|YP_007418094.1| putative set domain-containing protein [Megavirus lba]
gi|444236348|gb|AGD92118.1| putative set domain-containing protein [Megavirus lba]
Length = 107
Score = 50.1 bits (118), Expect = 0.002, Method: Composition-based stats.
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 1/76 (1%)
Query: 286 AIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAILT 345
AI +NH C PN + + N + + +RD+ +GEEL Y++ + + RQ+ L
Sbjct: 12 AILFNGRIFNHSCLPNI-VFYRCNDEMCFITVRDIYKGEELLDSYVNITHDKKTRQSRLW 70
Query: 346 QGFGFQCNCLRCSSGD 361
+ F C+C RCSS +
Sbjct: 71 NQYRFHCDCQRCSSNN 86
>gi|302803777|ref|XP_002983641.1| hypothetical protein SELMODRAFT_422918 [Selaginella moellendorffii]
gi|300148478|gb|EFJ15137.1| hypothetical protein SELMODRAFT_422918 [Selaginella moellendorffii]
Length = 645
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 1/73 (1%)
Query: 285 NAIYMLPSFYNHDCDPNAHIMWIDNADARLMALRDVEEGEELRICYIDASMARDARQAIL 344
+ +++L +F NH C P+A + +A + + A RD++ G+E+ + Y+D M D R
Sbjct: 437 SGLWILTAFANHSCCPSATQKAVGSA-SLVRAARDLKAGDEVTLAYLDPFMPWDVRSRQT 495
Query: 345 TQGFGFQCNCLRC 357
+GF+C C RC
Sbjct: 496 ESRWGFECACERC 508
>gi|225559380|gb|EEH07663.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 517
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 44/89 (49%), Gaps = 9/89 (10%)
Query: 282 AVGNAIYMLPSFYNHDCDPNA-HIMWIDNADAR--------LMALRDVEEGEELRICYID 332
A G A+ + ++ NH C PNA H + ++ + A R +EEGEE+ I Y
Sbjct: 137 AKGRALGLDSAYLNHSCLPNAQHSLSAPASNGNRKERDFLTVYACRSIEEGEEITIPYES 196
Query: 333 ASMARDARQAILTQGFGFQCNCLRCSSGD 361
M +RQ L Q +GF+C+C C D
Sbjct: 197 LYMDIASRQQFLLQEYGFECSCRLCEKED 225
>gi|440463292|gb|ELQ32885.1| hypothetical protein OOU_Y34scaffold01018g15 [Magnaporthe oryzae
Y34]
gi|440490415|gb|ELQ69973.1| hypothetical protein OOW_P131scaffold00096g15 [Magnaporthe oryzae
P131]
Length = 428
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 40/70 (57%), Gaps = 4/70 (5%)
Query: 292 SFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYIDASM-ARDARQAILTQGFG 349
S NH C PN ++ + + + ++A RD+ G EL + Y +M +RD RQ +L G+G
Sbjct: 249 SRINHSCRPNVYVRYTRSTLELEVVAYRDISAGTELGVSYTPLNMLSRDRRQVLL--GWG 306
Query: 350 FQCNCLRCSS 359
F C C CS+
Sbjct: 307 FNCTCQLCSA 316
>gi|402222951|gb|EJU03016.1| SET domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 1/80 (1%)
Query: 280 EIAVGNAIYMLPSFYNHDCDPNAHIMW-IDNADARLMALRDVEEGEELRICYIDASMARD 338
E +V + +Y S NHDC PNA + W ++ + + R +++GEE+ I Y + +
Sbjct: 220 EDSVQSVVYDWMSRTNHDCTPNAQVSWDYESFCGSVWSTRLIKQGEEITISYTSIFKSAE 279
Query: 339 ARQAILTQGFGFQCNCLRCS 358
R++ L + + F+C C C+
Sbjct: 280 ERKSDLMEKYHFECKCRACT 299
>gi|302686744|ref|XP_003033052.1| hypothetical protein SCHCODRAFT_107538 [Schizophyllum commune H4-8]
gi|300106746|gb|EFI98149.1| hypothetical protein SCHCODRAFT_107538, partial [Schizophyllum
commune H4-8]
Length = 404
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 3/89 (3%)
Query: 275 ASIESEIAVGNAIYMLPSFYNHDCDPNAHIMWIDNA-DARLMALRDVEEGEELRICYI-D 332
A I+++ AI+ S NH C PNA W + L A+RD+E GEE+ + Y +
Sbjct: 209 AGIDAQDGQYLAIFDELSRMNHSCRPNALYHWDSSTFSGSLRAVRDIEPGEEITVSYCGE 268
Query: 333 ASMARDARQAILTQGFGFQCNCLRCSSGD 361
R+A+L +GF C+C C+ GD
Sbjct: 269 VDRPYSDRRALLAP-YGFGCDCRACAEGD 296
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.137 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,391,118,855
Number of Sequences: 23463169
Number of extensions: 209600991
Number of successful extensions: 669353
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 849
Number of HSP's successfully gapped in prelim test: 1515
Number of HSP's that attempted gapping in prelim test: 665500
Number of HSP's gapped (non-prelim): 3480
length of query: 361
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 217
effective length of database: 8,980,499,031
effective search space: 1948768289727
effective search space used: 1948768289727
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 77 (34.3 bits)