BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013485
(442 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 508
Score = 724 bits (1869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/443 (80%), Positives = 390/443 (88%), Gaps = 8/443 (1%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRI-VVHC---SVSTTND-ASRT 55
MAEASR F T LLP+FS L K P + P + +HC SVST++D +
Sbjct: 1 MAEASRIFQTTLLPTFSSLQK---PRLVSHHPPNLAHKKYQTIHCLSSSVSTSDDITTAK 57
Query: 56 KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGE 115
T M+PWGC+IDS +NA+ LQ+WLS++GLP QKMAI KV+VGERGLVALKNIRKGE
Sbjct: 58 AATTVTQMVPWGCDIDSSDNAAALQRWLSNNGLPDQKMAIDKVEVGERGLVALKNIRKGE 117
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
KLLFVPPSLVITADS+WSCPEAGEVLKQ SVPDWPLLA YLISEA+ +KSS+WSNYISAL
Sbjct: 118 KLLFVPPSLVITADSEWSCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISAL 177
Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN
Sbjct: 178 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 237
Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
+ETFKWSFGILFSRLVRLPSMDG+VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ
Sbjct: 238 LETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 297
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
Y+PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL LSLKKSDK YKEKLEAL+K+
Sbjct: 298 YEPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKH 357
Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
G SAS+CFP+++TGWP+EL+AYAYL VSPPSM KFEE+AAAASNK T KKD+ PEI+E
Sbjct: 358 GFSASQCFPVRVTGWPVELLAYAYLAVSPPSMSSKFEELAAAASNKTTIKKDVGFPEIEE 417
Query: 416 QALQFILDSCESSISKYSRFLQV 438
QALQFILDSCESSISKY++FLQ
Sbjct: 418 QALQFILDSCESSISKYTKFLQA 440
>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 722 bits (1864), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 356/440 (80%), Positives = 385/440 (87%), Gaps = 7/440 (1%)
Query: 1 MAEASRT-FHTILLPSFSHLHKAQSPAGFTD-FPRKRCGHRIVVHCSVSTTNDASRTKTT 58
MAEA R +T LPS LHK ++ F KR + CS+ST++D ++
Sbjct: 1 MAEACRIILNTTFLPSLHSLHKTHKKVSYSQPFLHKR---HPAIQCSISTSSD-TKAAAK 56
Query: 59 VTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLL 118
V++ +PWGC+IDSLENA LQKWLSDSGLPPQKMAIQKV+VGERGLVALKNIRKGE LL
Sbjct: 57 VSET-VPWGCDIDSLENAEALQKWLSDSGLPPQKMAIQKVEVGERGLVALKNIRKGEMLL 115
Query: 119 FVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
FVPPSLVI ADS+WSCPEAGEVLK+ SVPDWPLLATYLISEASFEKSSRWSNYISALPRQ
Sbjct: 116 FVPPSLVIAADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ 175
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
PYSLLYWTRAELD YLEASQIRERAIERITNV GTYNDLRLRIFSKYP LFPEEVFNMET
Sbjct: 176 PYSLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRLRIFSKYPHLFPEEVFNMET 235
Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS+GVVFTTDR YQP
Sbjct: 236 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSSEVETFLDYDKSSKGVVFTTDRPYQP 295
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
GEQVFISYG+KSNGELLLSYGFVPREGTNPSDSVEL LSLKKSDKCYKEKLEAL+K+GLS
Sbjct: 296 GEQVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLS 355
Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
S+CFP+Q+TGWPLELMAYAYL VSPPSM +FEEMAAAASNK T+ K I P+I+EQAL
Sbjct: 356 VSQCFPLQVTGWPLELMAYAYLAVSPPSMSRQFEEMAAAASNKTTTNKKITYPDIEEQAL 415
Query: 419 QFILDSCESSISKYSRFLQV 438
QFILDSCE SISKY++FLQ
Sbjct: 416 QFILDSCELSISKYTKFLQA 435
>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
Length = 497
Score = 715 bits (1845), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/442 (80%), Positives = 388/442 (87%), Gaps = 19/442 (4%)
Query: 1 MAEASRTFHTILL----PSFSHLHKAQSPAGFTD-FPRKRCGHRIVVHCSVSTTNDASRT 55
MAEA R FHT L PSFS Q+P+ + PR R + CS+STT+ A
Sbjct: 1 MAEACRMFHTALTLTLPPSFS-----QTPSRHSQPIPR-----RHPIRCSISTTDTA--- 47
Query: 56 KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGE 115
KT+VTQ IPWGCE+DSLENA+ LQKWLSDSGLPPQKM I++V+VGERGLVALKNIRKGE
Sbjct: 48 KTSVTQK-IPWGCEVDSLENAALLQKWLSDSGLPPQKMGIERVEVGERGLVALKNIRKGE 106
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
KLLFVPPSLVITADS+WSC EAGEVLK+ SVPDWPLLATYLI EASF +SSRWSNYISAL
Sbjct: 107 KLLFVPPSLVITADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISAL 166
Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
PRQPYSLLYWTRAELD+YLEASQIRERAIERI +V GTYNDLRLRIFSK+P LFPEEVFN
Sbjct: 167 PRQPYSLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRLRIFSKHPHLFPEEVFN 226
Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
METFKWSFGILFSRLVRLPSMD ++ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR
Sbjct: 227 METFKWSFGILFSRLVRLPSMDEKIALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRT 286
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
YQP EQVFISYGKKSNGELLLSYGFVPREGTNP+D VEL LSLKKSDKCYKEK EA++K+
Sbjct: 287 YQPSEQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKH 346
Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
GLS S+CFPIQITGWPLELMAYAYLVVSPPSM FEE+AA ASNK TSKKDI+ PE++E
Sbjct: 347 GLSTSQCFPIQITGWPLELMAYAYLVVSPPSMSQHFEEIAAVASNKTTSKKDIRYPELEE 406
Query: 416 QALQFILDSCESSISKYSRFLQ 437
QALQFILDSCE+SISKYS+FLQ
Sbjct: 407 QALQFILDSCEASISKYSKFLQ 428
>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
carboxylase/oxygenase large subunit N-methyltransferase,
chloroplastic-like [Cucumis sativus]
Length = 497
Score = 708 bits (1828), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 342/438 (78%), Positives = 387/438 (88%), Gaps = 9/438 (2%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVT 60
MAEASR F++ LLP+F L S P R ++CSVSTT D +R T
Sbjct: 1 MAEASRVFNSSLLPNFRPLQNTLSTK-----PHTATFRRHSINCSVSTT-DGARVAAT-- 52
Query: 61 QNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFV 120
IPWGCEIDSLENAS LQKWLS+SGLP QKM+IQ+V+VGERGLVALKN+RKGEKLLFV
Sbjct: 53 -GPIPWGCEIDSLENASALQKWLSESGLPDQKMSIQRVNVGERGLVALKNVRKGEKLLFV 111
Query: 121 PPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY 180
PPSLVI+A+S+WSCPEAGEVLK+ SVPDWPL+ATYLISEAS KSSRW+NYISALPRQPY
Sbjct: 112 PPSLVISAESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPY 171
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
SLLYWTR ELDRYLEAS+IRERAIERITNV+GTYNDL +R+FSK+P+LFPEEVFN+ETFK
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSIRVFSKHPELFPEEVFNIETFK 231
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
WSFGILFSRLVRLPSMDG+VALVPWADMLNH+CEVETFLDYDK+SQGVVFTTDR YQPGE
Sbjct: 232 WSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKASQGVVFTTDRAYQPGE 291
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
QVFISYGKKSNGELLLSYGFVP+EG+NPSDSVEL LSLKKSDKCYKEKLEAL+K+GL AS
Sbjct: 292 QVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRAS 351
Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQF 420
+CFPIQ+TGWPLEL A+AYL VSPPS+ +F+EMAAAASNK T+KKD+ P+I+E+ALQF
Sbjct: 352 QCFPIQVTGWPLELKAFAYLAVSPPSLSNQFDEMAAAASNKSTAKKDLNYPDIEEEALQF 411
Query: 421 ILDSCESSISKYSRFLQV 438
ILDSCE+SISKY++FLQ
Sbjct: 412 ILDSCETSISKYNKFLQA 429
>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 499
Score = 701 bits (1810), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/448 (77%), Positives = 378/448 (84%), Gaps = 31/448 (6%)
Query: 1 MAEASR-TFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIV---------VHCSVSTTN 50
MAEASR T H+ LLP F+ P+ GHR + V CSVS
Sbjct: 5 MAEASRITLHSTLLPFFT--------------PKTHVGHRHLSLSSSRKHQVQCSVSAGA 50
Query: 51 DASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKN 110
A N + WGCEIDSLEN+S LQ+WLS+SGLPPQKM I++V+VGERGLVALKN
Sbjct: 51 AAQ-------TNPVAWGCEIDSLENSSALQRWLSESGLPPQKMGIERVEVGERGLVALKN 103
Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
IRKGEKLLFVPPSLVIT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS +SSRWSN
Sbjct: 104 IRKGEKLLFVPPSLVITPDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSN 163
Query: 171 YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP 230
YISALPRQPYSLLYWT+AELDRYLEASQIRERAIERI NVIGTYNDLRLRIFSKYPDLFP
Sbjct: 164 YISALPRQPYSLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRLRIFSKYPDLFP 223
Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
+EVFN+E+FKWSFGILFSRLVRLPSM G VALVPWADMLNHSC+VETFLDYDK+S+G+VF
Sbjct: 224 DEVFNIESFKWSFGILFSRLVRLPSMGGNVALVPWADMLNHSCDVETFLDYDKTSKGIVF 283
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
TTDR YQPGEQVFISYGKKSNGELLLSYGFVP+EG NPSDSVEL LSLKKSD YKEKLE
Sbjct: 284 TTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLE 343
Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
L+ YGLSAS+CFPIQITGWPLELMAYAYL VSP SM+G FEEMAAAASN TSKKD++
Sbjct: 344 LLKNYGLSASQCFPIQITGWPLELMAYAYLAVSPSSMRGDFEEMAAAASNNTTSKKDLRY 403
Query: 411 PEIDEQALQFILDSCESSISKYSRFLQV 438
PEI+EQALQFILDSCESSISKY++FLQ
Sbjct: 404 PEIEEQALQFILDSCESSISKYNKFLQA 431
>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 504
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/440 (75%), Positives = 375/440 (85%), Gaps = 8/440 (1%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
MA+A + LLP++S LHK ++ F P RC R +HCSVS +R+
Sbjct: 1 MAKAC-VLQSTLLPAYSPLHKLRNQNFTLSFPPLPVSRC--RPGIHCSVSAGETTTRSVE 57
Query: 58 TVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKL 117
+ I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+RKGEKL
Sbjct: 58 EAPE--ISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLRKGEKL 115
Query: 118 LFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
LFVPPSLVI+ADS+W+ PEAGEV+K+ VPDWPLLATYLISEAS +KSSRW NYISALPR
Sbjct: 116 LFVPPSLVISADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPR 175
Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
QPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+EVFN E
Sbjct: 176 QPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPHLFPKEVFNDE 235
Query: 238 TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
TFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTTDR YQ
Sbjct: 236 TFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTTDRPYQ 295
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
PGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCYKEKL+AL+K+GL
Sbjct: 296 PGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGL 355
Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
S +CFP++ITGWP+ELMAYAYLVVSPP M FEEMA AASNK ++K D+K PEI+E A
Sbjct: 356 STPQCFPVRITGWPMELMAYAYLVVSPPDMGNNFEEMAKAASNKTSTKTDLKYPEIEEDA 415
Query: 418 LQFILDSCESSISKYSRFLQ 437
LQFILDSCE+SISKYSRFL+
Sbjct: 416 LQFILDSCETSISKYSRFLK 435
>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
Length = 497
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/433 (77%), Positives = 371/433 (85%), Gaps = 18/433 (4%)
Query: 7 TFHTILLPSFSH-LHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIP 65
T T L+P+F+ +HK T R H H ++S T+ T IP
Sbjct: 14 TNTTTLIPAFNQTIHKT------THLGLSRRNH---AHFTLSATSSLIET--------IP 56
Query: 66 WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
WGCE DS+EN+S+LQKWLS SGLP QKM+I KVDVGERGLVAL NIRKGEKLLFVPP LV
Sbjct: 57 WGCENDSIENSSSLQKWLSQSGLPSQKMSIDKVDVGERGLVALNNIRKGEKLLFVPPQLV 116
Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
IT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS KSSRW +YISALPRQPYSLLYW
Sbjct: 117 ITPDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYW 176
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
++AELDRYLEASQIRERAIER NVIGTYND+R+RIFSKYPD FPEEVFN+E+FKWSFGI
Sbjct: 177 SQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGI 236
Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
LFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQVFIS
Sbjct: 237 LFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFIS 296
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+CFPI
Sbjct: 297 YGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPI 356
Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFILDSC
Sbjct: 357 RVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFILDSC 416
Query: 426 ESSISKYSRFLQV 438
ESSISKY++FLQV
Sbjct: 417 ESSISKYNKFLQV 429
>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
Length = 494
Score = 681 bits (1756), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/433 (77%), Positives = 371/433 (85%), Gaps = 18/433 (4%)
Query: 7 TFHTILLPSFSH-LHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIP 65
T T L+P+F+ +HK T R H H ++S T+ T IP
Sbjct: 11 TNTTTLIPAFNQTIHKT------THLGLSRRNH---AHFTLSATSSLIET--------IP 53
Query: 66 WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
WGCE DS+EN+S+LQKWLS SGLP QKM+I KVDVGERGLVAL NIRKGEKLLFVPP LV
Sbjct: 54 WGCENDSIENSSSLQKWLSQSGLPSQKMSIDKVDVGERGLVALNNIRKGEKLLFVPPQLV 113
Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
IT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS KSSRW +YISALPRQPYSLLYW
Sbjct: 114 ITPDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYW 173
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
++AELDRYLEASQIRERAIER NVIGTYND+R+RIFSKYPD FPEEVFN+E+FKWSFGI
Sbjct: 174 SQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGI 233
Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
LFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQVFIS
Sbjct: 234 LFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFIS 293
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+CFPI
Sbjct: 294 YGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPI 353
Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFILDSC
Sbjct: 354 RVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFILDSC 413
Query: 426 ESSISKYSRFLQV 438
ESSISKY++FLQV
Sbjct: 414 ESSISKYNKFLQV 426
>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
Length = 504
Score = 678 bits (1749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/445 (74%), Positives = 376/445 (84%), Gaps = 18/445 (4%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
MA+A + LLP++S LHK ++ F+ P RC R +HCSVS
Sbjct: 1 MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50
Query: 58 TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
T Q+M I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51 TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110
Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
KGEKLLFVPPSLVI+ADS+W+ EAGEV+K+ VPDWPLLATYLISEAS +KSSRW NYI
Sbjct: 111 KGEKLLFVPPSLVISADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYI 170
Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTT 290
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350
Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
+K+GLS +CFP++ITGWP+ELMAYAYLVVSPP M+ FEEMA AASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPE 410
Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
I+E ALQFILDSCE+SISKYSRFL+
Sbjct: 411 IEEDALQFILDSCETSISKYSRFLK 435
>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Arabidopsis thaliana]
Length = 504
Score = 676 bits (1745), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/445 (74%), Positives = 376/445 (84%), Gaps = 18/445 (4%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
MA+A + LLP++S LHK ++ F+ P RC R +HCSVS
Sbjct: 1 MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50
Query: 58 TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
T Q+M I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51 TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110
Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
KGEKLLFVPPSLVI+ADS+W+ EAGEV+K+ VPDWPLLATYLISEA+ +KSSRW NYI
Sbjct: 111 KGEKLLFVPPSLVISADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYI 170
Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GV+FTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVIFTT 290
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350
Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
+K+GLS +CFP++ITGWP+ELMAYAYLVVSPP M+ FEEMA AASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPE 410
Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
I+E ALQFILDSCE+SISKYSRFL+
Sbjct: 411 IEEDALQFILDSCETSISKYSRFLK 435
>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
Length = 504
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 330/445 (74%), Positives = 372/445 (83%), Gaps = 18/445 (4%)
Query: 1 MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
MA+A + LLP++S LHK ++ F+ P RC R +HCSVS
Sbjct: 1 MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50
Query: 58 TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
T Q+M I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51 TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110
Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
KGEKLLFV PSLVI ADS+W+ EAGEV+K+ VPDWPLLATYLISEAS +KSSRW NYI
Sbjct: 111 KGEKLLFVSPSLVICADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYI 170
Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTT 290
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350
Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
+K+GLS +CFP++ITGWP+ELMAYAYLVVSPP M+ FEEMA ASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKRASNKTSTKNDLKYPE 410
Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
I+E ALQFILDSCE+SISK SRFL+
Sbjct: 411 IEEDALQFILDSCETSISKCSRFLK 435
>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
gi|194699272|gb|ACF83720.1| unknown [Zea mays]
gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 619 bits (1597), Expect = e-175, Method: Compositional matrix adjust.
Identities = 291/374 (77%), Positives = 341/374 (91%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 61 VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 120
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPGEQVF
Sbjct: 241 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVF 300
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSD CYKEKL+AL++ GLS SE F
Sbjct: 301 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M FEEMAAAASNK +SK + P+++EQALQFILD
Sbjct: 361 PLRVTGWPVELMAYAFLVVSPPDMSQCFEEMAAAASNKTSSKPGLNYPDLEEQALQFILD 420
Query: 424 SCESSISKYSRFLQ 437
CES+I KY+++L+
Sbjct: 421 CCESNIEKYTKYLE 434
>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
Length = 499
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 290/374 (77%), Positives = 340/374 (90%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 57 VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 116
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W PE GEV+K+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 117 LVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRWSSYIAALPRQPYSLL 176
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++ DLFPEEV+N+ETF WSF
Sbjct: 177 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHSDLFPEEVYNIETFLWSF 236
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRLPSMD +VALVPWADMLNHS EVETFLD+DKSSQG+VFTTDR YQPGEQVF
Sbjct: 237 GILFSRLVRLPSMDEKVALVPWADMLNHSPEVETFLDFDKSSQGIVFTTDRSYQPGEQVF 296
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS SE F
Sbjct: 297 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNGLSESESF 356
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M FEEMA AASNK +SK + P+++EQALQFILD
Sbjct: 357 PLRVTGWPVELMAYAFLVVSPPDMSQHFEEMAVAASNKSSSKPRLSYPDLEEQALQFILD 416
Query: 424 SCESSISKYSRFLQ 437
CE +I+KY+++L+
Sbjct: 417 CCEPNIAKYTKYLE 430
>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Zea mays]
Length = 503
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 289/374 (77%), Positives = 339/374 (90%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGE LLFVPPS
Sbjct: 61 VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGENLLFVPPS 120
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRLPSMDGRV LVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPGEQVF
Sbjct: 241 GILFSRLVRLPSMDGRVVLVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVF 300
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSD CYKEKL+AL++ GLS SE F
Sbjct: 301 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M FEEMAAAASNK +SK + P+++EQALQFILD
Sbjct: 361 PLRVTGWPVELMAYAFLVVSPPDMSQCFEEMAAAASNKTSSKPGLNYPDLEEQALQFILD 420
Query: 424 SCESSISKYSRFLQ 437
CES+I KY+++L+
Sbjct: 421 CCESNIEKYTKYLE 434
>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
Length = 502
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 295/374 (78%), Positives = 339/374 (90%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A +L++WL+DSGLP Q++ IQ+VDVGERGLVALKNIRKGEKLLFVPPS
Sbjct: 60 VPWGCEIESLESAVSLERWLTDSGLPEQRLGIQRVDVGERGLVALKNIRKGEKLLFVPPS 119
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W CPE G VLK+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 120 LVITADSEWGCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLL 179
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTR ELD YL AS IRERAI+RIT+V+GTYNDLR RIFSK+ DLFPEEV+N+ETF+WSF
Sbjct: 180 YWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSF 239
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS G+VFTTDR YQPGEQVF
Sbjct: 240 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVF 299
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS E F
Sbjct: 300 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M +FEEMA AASNK SK + PE++EQALQFILD
Sbjct: 360 PLRVTGWPVELMAYAFLVVSPPEMSQRFEEMAVAASNKSPSKPGLNYPELEEQALQFILD 419
Query: 424 SCESSISKYSRFLQ 437
CES+I+KY++FL+
Sbjct: 420 CCESNIAKYTKFLE 433
>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase [Oryza sativa Japonica
Group]
gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
Length = 502
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 295/374 (78%), Positives = 339/374 (90%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A +L++WL+DSGLP Q++ IQ+VDVGERGLVALKNIRKGEKLLFVPPS
Sbjct: 60 VPWGCEIESLESAVSLERWLTDSGLPEQRLGIQRVDVGERGLVALKNIRKGEKLLFVPPS 119
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W CPE G VLK+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 120 LVITADSEWGCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLL 179
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTR ELD YL AS IRERAI+RIT+V+GTYNDLR RIFSK+ DLFPEEV+N+ETF+WSF
Sbjct: 180 YWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSF 239
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS G+VFTTDR YQPGEQVF
Sbjct: 240 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVF 299
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS E F
Sbjct: 300 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M +FEEMA AASNK SK + PE++EQALQFILD
Sbjct: 360 PLRVTGWPVELMAYAFLVVSPPEMSQRFEEMAVAASNKSPSKPGLNYPELEEQALQFILD 419
Query: 424 SCESSISKYSRFLQ 437
CES+I+KY++FL+
Sbjct: 420 CCESNIAKYTKFLE 433
>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 507
Score = 609 bits (1571), Expect = e-172, Method: Compositional matrix adjust.
Identities = 284/374 (75%), Positives = 337/374 (90%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A++L++WL+ SGLP Q++A++KVD+GERGLVALKN+R GEKLLFVPP+
Sbjct: 64 VPWGCEIESLESAASLERWLTASGLPEQRLALEKVDIGERGLVALKNVRNGEKLLFVPPT 123
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W+ E G+V+K+ SVPDWPLLATYLISEAS E SSRWS+YI ALPRQPYSLL
Sbjct: 124 LVITADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQPYSLL 183
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTR E+D YL AS IRERAI RI++VIGTYNDLR RIFSK+PDLFPE+V+NME F+WSF
Sbjct: 184 YWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRDRIFSKHPDLFPEKVYNMENFRWSF 243
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
GILFSRLVRL SM G+VALVPWADMLNHS EV+ FLDYDKSSQG+VFTTDR YQPGEQVF
Sbjct: 244 GILFSRLVRLESMGGKVALVPWADMLNHSPEVDAFLDYDKSSQGIVFTTDRSYQPGEQVF 303
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYGKKS+GELLLSYGFVP+EGTNP+DSVE +SLKKSD+CYKEKL+AL+K+GLS SE F
Sbjct: 304 ISYGKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHGLSESESF 363
Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
P+++TGWP+ELMAYA+LVVSPP M +FEEMA AASNK +SK + PE+DEQALQFILD
Sbjct: 364 PLRVTGWPVELMAYAFLVVSPPEMIQRFEEMAVAASNKGSSKPAVNYPELDEQALQFILD 423
Query: 424 SCESSISKYSRFLQ 437
CESSI +Y+++L+
Sbjct: 424 CCESSIKRYTKYLE 437
>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 389
Score = 566 bits (1459), Expect = e-159, Method: Compositional matrix adjust.
Identities = 265/345 (76%), Positives = 308/345 (89%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
MA+Q+VDVGERGLVAL N+R GEKLLFVPPSLVI+ADS+WS E G+V+K SVPDWPLL
Sbjct: 1 MALQRVDVGERGLVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLL 60
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
ATYLISEAS E SSRWS+YI ALPRQPYSLLYWTR E+D YL AS IRERAI RI +VIG
Sbjct: 61 ATYLISEASLEGSSRWSSYIDALPRQPYSLLYWTRTEIDAYLVASPIRERAISRIGDVIG 120
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
TYNDLR RIFSK+P+LFPEEV+NME F+WSFGILFSRLVRLPSMDG+VALVPWADMLNH+
Sbjct: 121 TYNDLRDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLPSMDGKVALVPWADMLNHN 180
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
EV+ FLD+DKSSQG+VFTTDR YQPGEQVFISYGKKS+GELLLSYGFVP+EGTNP+DSV
Sbjct: 181 PEVDAFLDFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSV 240
Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFE 392
E +SL KSD CY+EKL+AL+++GLS SE FP+++TGWP+ELMAYA+LVVSPP M +FE
Sbjct: 241 EFSVSLNKSDDCYREKLQALKRHGLSESESFPLRVTGWPVELMAYAFLVVSPPDMIQRFE 300
Query: 393 EMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
EMA AASNK +SK + PE+DEQALQFILD CES+I+KY+++L+
Sbjct: 301 EMAVAASNKSSSKPAVNYPELDEQALQFILDCCESNITKYTKYLE 345
>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Sus
scrofa]
Length = 326
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 200/256 (78%), Positives = 228/256 (89%), Gaps = 8/256 (3%)
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
L WT D Y+ ++R R+ I ++ TYND+R+RIFSKYPD FPEEVFN+E+FKWS
Sbjct: 47 LCWT----DYYM---RMRXRSTNYIA-MMHTYNDMRVRIFSKYPDFFPEEVFNIESFKWS 98
Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
FGILFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQV
Sbjct: 99 FGILFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQV 158
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
FISYGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+C
Sbjct: 159 FISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQC 218
Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFIL 422
FPI++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFIL
Sbjct: 219 FPIRVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFIL 278
Query: 423 DSCESSISKYSRFLQV 438
DSCESSISKY++FLQ+
Sbjct: 279 DSCESSISKYNKFLQL 294
>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
Length = 306
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/236 (81%), Positives = 220/236 (93%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 61 VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 120
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
LVITADS+W PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPG
Sbjct: 241 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296
>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 431
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/382 (49%), Positives = 269/382 (70%), Gaps = 9/382 (2%)
Query: 64 IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
+ WGC+ S+E S LQ WL GL QK+ + +VD G RGLVA +++R+GE+LLFVP
Sbjct: 1 VNWGCDPQSIEKGSLLQDWLMKEGLAKQKLVLDRVDSGGRGLVATQSLRQGERLLFVPSG 60
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
L+ITADS+W C E G ++K+ +P+WP+LA +LISEAS E+SSRW Y + LP+ P S+L
Sbjct: 61 LLITADSEWGCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPKTPSSIL 120
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
WT E++ +L AS +RE+A+E I +V TY DLR IF K+P++FP +V+ + FKW+F
Sbjct: 121 QWTEEEVNTWLTASPVREKALECIRDVTETYRDLRATIFLKHPEVFPSQVYTLAAFKWAF 180
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK-SSQGVVFTTDRQYQPGEQV 302
GILFSRLVRLPS+ G++ALVPWADMLNHS +V++FLD+D+ +++ VV TDR YQ GEQV
Sbjct: 181 GILFSRLVRLPSV-GKLALVPWADMLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQV 239
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
FISYGK+S+GEL L+YGF+P E N DSVEL + + D ++ KL A + GLS+ +
Sbjct: 240 FISYGKRSSGELFLAYGFIPSE-LNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQR 298
Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD------IKCPEIDEQ 416
FP++ G+P +L+AYA L+ S S + +A AA+ ++KD + E +
Sbjct: 299 FPVRKDGFPAQLLAYARLIASRTSDPAQLSRIARAATEANATEKDDSDVIAMLSAEEETN 358
Query: 417 ALQFILDSCESSISKYSRFLQV 438
A + +L CE+SI++Y++FL+V
Sbjct: 359 AYERVLAVCENSIAEYTKFLEV 380
>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
Length = 467
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/390 (46%), Positives = 273/390 (70%), Gaps = 4/390 (1%)
Query: 54 RTKTTVTQNMIPWGCEIDS--LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNI 111
R + + +PWGC+ DS L + LQ+WLS +GLP QK+ ++ V G RGLV+ + +
Sbjct: 14 RPRLACRASSVPWGCDTDSSALNSGIALQQWLSQAGLPIQKVELKNVGAGGRGLVSKRML 73
Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
KG++LLF+P +L IT +S+W+C EAG+V++ +P+WP LA YLISEAS KSS W Y
Sbjct: 74 YKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPY 133
Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
I+ALPR+P S+L WT +++ +L A+ I++RA++ + V T+NDL ++F K + FP
Sbjct: 134 IAALPRRPGSILLWTALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPP 193
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
EVFN+E+FKW+FGILFSRLVRLPS+ ++AL+P+ DMLNH EV TFLD+D S+ + T
Sbjct: 194 EVFNLESFKWAFGILFSRLVRLPSLGQKLALIPFGDMLNHDTEVTTFLDFDSGSKSITCT 253
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
DR Y+ ++VFISYGK+SNGELL++YGFVP G N DSV + L L +D+ Y+ KL A
Sbjct: 254 LDRGYESNKEVFISYGKRSNGELLVAYGFVP-SGKNSEDSVSITLGLDPADEMYEAKLGA 312
Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK-KDIKC 410
L+++GLS + +PI++ GWP++L A+A L+ SPPS ++ E+ +AA+ K + + +
Sbjct: 313 LKEHGLSPQQSYPIKLKGWPVQLTAFARLITSPPSQLHRYSELTSAATEKQGRRMQPVFT 372
Query: 411 PEIDEQALQFILDSCESSISKYSRFLQVKE 440
E +A + IL +C+ +I+ +L+V++
Sbjct: 373 TEEQMKAYEMILSACKQAIAASKNYLEVEQ 402
>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
Length = 467
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/390 (45%), Positives = 273/390 (70%), Gaps = 4/390 (1%)
Query: 54 RTKTTVTQNMIPWGCEIDS--LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNI 111
R + + +PWGC+ DS L++ LQ+WLS +GLP QK+ ++ V G RGLV+ + +
Sbjct: 14 RPRLACRASSVPWGCDTDSSALDSGIALQQWLSQAGLPIQKVELKNVGAGGRGLVSKRML 73
Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
KG++LLF+P +L IT +S+W+C EAG+V++ +P+WP LA YLISEAS KSS W Y
Sbjct: 74 YKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPY 133
Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
I+ALPR+P S+L WT +++ +L A+ I++RA++ + V T+NDL ++F K + FP
Sbjct: 134 IAALPRRPGSILLWTALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPP 193
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
EVFN+++FKW+FGILFSRLVRLPS+ ++AL+P+ DMLNH EV TFLD+D S+ + T
Sbjct: 194 EVFNLKSFKWAFGILFSRLVRLPSLGQKLALIPFGDMLNHDTEVTTFLDFDSGSKSITCT 253
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
DR Y+ +VFISYGK+SNGELL++YGFVP G N DSV + L L +D+ Y+ KL
Sbjct: 254 LDRGYESNREVFISYGKRSNGELLVAYGFVP-SGKNSEDSVSITLGLDPADEMYEAKLGT 312
Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK-KDIKC 410
L+++GLS + +PI++ GWP++L A+A L+ SPPS ++ E+A+AA+ + + + +
Sbjct: 313 LKEHGLSPQQSYPIKLKGWPVQLTAFARLITSPPSQLHRYSELASAATEEQGRRMQPVFT 372
Query: 411 PEIDEQALQFILDSCESSISKYSRFLQVKE 440
E +A + IL +C+ +I+ +L+V++
Sbjct: 373 TEEQMKAYELILSACKQAIAASKNYLEVEQ 402
>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
Length = 271
Score = 364 bits (934), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 171/203 (84%), Positives = 188/203 (92%)
Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
ME+FKWSFGILFSR+VRLPSMDG+VALVPWADMLNHSC+VETFLDYDK S+G+VFTTDR
Sbjct: 1 MESFKWSFGILFSRMVRLPSMDGKVALVPWADMLNHSCDVETFLDYDKQSKGIVFTTDRP 60
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
YQPGEQVFISYGKKSNGELLLSYGFV REG NPSDSVEL LSLKKSD YKEKLE L+KY
Sbjct: 61 YQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLKKY 120
Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
GLS S+CFPI+ITGWPLELMAYAYL VSP SM+G+FE+MAAAASNK+TS KD K PEI+E
Sbjct: 121 GLSGSQCFPIRITGWPLELMAYAYLAVSPSSMRGQFEKMAAAASNKITSTKDFKYPEIEE 180
Query: 416 QALQFILDSCESSISKYSRFLQV 438
QALQFILDSCESS+SKY++FLQ
Sbjct: 181 QALQFILDSCESSMSKYNKFLQA 203
>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
Length = 543
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 189/339 (55%), Gaps = 18/339 (5%)
Query: 79 LQKWLSD-SGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L WL+D LP QKM ++ RGLVA ++I++GEK+L +P +IT +
Sbjct: 91 LDAWLADIMKLPEQKMKLEYFKEEGRGLVATESIKRGEKVLEIPQEAIITVEVALKESLL 150
Query: 138 GEVLKQCSVPDWPLLATYLISEA---SFEKSS----RWSNYISALPRQPYSLLYWTRAEL 190
E K + +W +LAT+L A S E +S R++ Y+ ALPR S+L W +++
Sbjct: 151 REKKKLAELQEWSILATFLAETAQNLSTEDNSSNKYRFATYVKALPRSTGSVLEWPESDV 210
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
L S A+ER +V ++R+ FPE N +T +W+F ILFSRL
Sbjct: 211 RTLLAGSPSLFSALERRASVAAAIAEIRVN--------FPE--LNEKTLQWAFDILFSRL 260
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+RL S+ G +ALVPWADMLNH E F+D D+ S+ V TTDR Y+PGEQV+ SYG++
Sbjct: 261 IRLESLGGNLALVPWADMLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRP 320
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
+ ELL+SYGF P G NP D L L + + D K+ AL + A E FP+++ G+
Sbjct: 321 SSELLISYGFAPAVGDNPDDEYALNLQIDEEDPFASAKVNALASQNIQAFETFPLRLNGY 380
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK 409
P +L+ YA + P K +E+ AA + + K ++
Sbjct: 381 PRQLLQYASFAMCTPDDPSKVDELCRAAFVDIQAPKSLR 419
>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
Length = 505
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 194/362 (53%), Gaps = 20/362 (5%)
Query: 40 IVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVD 99
+ V+ S T + T IP G + E+ L +WL+ +GL QKM ++
Sbjct: 31 VTVNEGASGTRGKNAEVTRYDDADIPRGVGSATRED---LTRWLASNGLRAQKMTLESNL 87
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
RGLVA + I++GE LL V S +IT + + EA + + +W +LAT+L +
Sbjct: 88 AEGRGLVATEEIKRGEALLGVDASCLITVER--AIAEAKLGPRHAELQEWSVLATFLAQQ 145
Query: 160 ASFEKSSR---WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
A +S + YI ALPR+ S+L W E++ L+ S R A ER +V +
Sbjct: 146 AMALESGNAGTFGEYIRALPRRTGSVLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAE 205
Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
+R S +PD+ +W+F ILFSRL+RL +M G +ALVPWADMLNH
Sbjct: 206 IR----SSFPDI------TEGALRWAFDILFSRLIRLDAMGGELALVPWADMLNHKPGCA 255
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
F+D + S+ V TTDR Y GEQV+ SYG++ + ELL+SYGF P G NP D L L
Sbjct: 256 AFIDLNGSA--VNLTTDRAYAAGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYSLTL 313
Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAA 396
+ +D + K + LR+ GLS E FP+++ G+P +L+ YA ++ P + E +A
Sbjct: 314 GVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLAR 373
Query: 397 AA 398
A
Sbjct: 374 TA 375
>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 524
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 136/363 (37%), Positives = 195/363 (53%), Gaps = 31/363 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L +WL LP QKMA++ V++ E RGLVA + I++GE LL VP + +IT + + EA
Sbjct: 84 LARWLEGRRLPGQKMALE-VNLAEGRGLVATEEIKRGEALLGVPRTTLITVER--AIAEA 140
Query: 138 GEVLKQCSVPDWPLLATYLISEA-SFEKSS--RWSNYISALPRQPYSLLYWTRAELDRYL 194
K + +W +LAT+L +A + E + + YI ALPR+ S+L W E+D+ L
Sbjct: 141 KLGPKHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTGSVLDWPEDEVDKLL 200
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
+ S R A ER +V +++R FPE + +W+F ILFSRL+RL
Sbjct: 201 KGSPSRLAAAERQDSVNAAIDEIR--------SYFPE--ITVGALRWAFDILFSRLIRLD 250
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+M G +ALVPWADMLNH F+D + + V TTDR Y GEQV+ SYG++ + EL
Sbjct: 251 AMGGELALVPWADMLNHKPGCAAFIDLNGDA--VNLTTDRSYVKGEQVWASYGQRPSSEL 308
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
L+SYGF P G NP D L L + +D K + LR GLS E FP+++ G+P +L
Sbjct: 309 LISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRLNGYPRQL 368
Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
+ YA ++ P K E+ A + T +I Q I DS + +R
Sbjct: 369 LQYASFILCNPE---KPSELKGLAQSAFTGSANIG---------QSIFDSVRGLTNGKAR 416
Query: 435 FLQ 437
Q
Sbjct: 417 GKQ 419
>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
Length = 485
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 203/370 (54%), Gaps = 12/370 (3%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT- 127
EI + L+ WL + GLPP K+A RGLVA + I KGE LL +P LV+T
Sbjct: 51 EIATDAEGEELKAWLIERGLPPPKLAAAATPGSGRGLVAAQPIGKGESLLSIPQQLVLTP 110
Query: 128 -ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
A + SC +L++ +P W +LA +L + + + W Y+ LP + +L W+
Sbjct: 111 AAALEQSCLR--PLLEEQPLPAWSVLALWLAEQRAAGSAGGWWPYVRLLPERTGCVLEWS 168
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS-KYPDLFPEE-VFNMETFKWSFG 244
E++ +L SQ+ A+E ++ +++ + + K P F +W+F
Sbjct: 169 EEEVE-WLCGSQLHSDALEIRAAAEASWAEMQAVLAAAKAQGRAPAHGAFGRAQLQWAFA 227
Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+L SRLVRL + + AL+PWAD+LNH C +FLD+ + VV +R+Y+ GEQ+ I
Sbjct: 228 VLLSRLVRLAGLGDQEALLPWADLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLI 287
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
SYG+K++GELLLSYGF P G+NP D L L L D K ALR++GL+AS+ FP
Sbjct: 288 SYGQKTSGELLLSYGFCPDLGSNPHDGCRLLLELAPGDAARNWKAAALRQHGLAASQLFP 347
Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
+++ P EL+ Y + + + E++A ++ + DI P + AL+ ++ +
Sbjct: 348 LRMAAAPFELVHYTAFSAAVVGSRQEAEQLA----RRLFEEGDIP-PALQTAALEAVVAA 402
Query: 425 CESSISKYSR 434
C+++++ Y R
Sbjct: 403 CKAALAAYPR 412
>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
Length = 425
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 112/303 (36%), Positives = 174/303 (57%), Gaps = 20/303 (6%)
Query: 88 LPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV 146
LP QK+ + VD+ E RGLVA + +R+GE LL +P S +IT + + G ++
Sbjct: 7 LPAQKLELV-VDLPEGRGLVATEEVRRGESLLDIPESTLITVERAIAESNLGPA--HANL 63
Query: 147 PDWPLLATYLISEA----SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
+W +LA +L +A + SR++ Y+ ALPR+ +L W ++ L S +
Sbjct: 64 QEWSVLAAFLAEQALAIDAGADGSRFATYVRALPRRTGGVLDWPEEDVKELLAGSPSQRA 123
Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
A+ER +V +++R + +P L P +W+F +LFSRL+RLP+ G +AL
Sbjct: 124 AMERQASVDAAIDEIR----ASFPQLTPG------ALRWAFDVLFSRLIRLPNRGGALAL 173
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
VPWADMLNH + ++D + V + DR+Y+PGEQV+ SYG + + ELL+SYGF P
Sbjct: 174 VPWADMLNHRPGCDAYID--DTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAP 231
Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
G NP D E+ L + +D+ K +ALR+ GLS E FP+++ G+P +L+ YA +
Sbjct: 232 AVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAFPLKLNGYPKQLLQYASFAL 291
Query: 383 SPP 385
P
Sbjct: 292 CDP 294
>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 308
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 119/323 (36%), Positives = 181/323 (56%), Gaps = 21/323 (6%)
Query: 82 WLSDSG-LPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
WL++S LP QK+ + VD+ E RGLVA +++++GE LL +P + +IT + + G
Sbjct: 1 WLTNSQRLPAQKLDL-VVDLPEGRGLVAREDVKRGEPLLEIPDASLITVERAVKESKLGP 59
Query: 140 VLKQCSVPDWPLLATYLISEA----SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
K + +W LLA +L +A + ++S ++ Y+ ALPR+ +L W ++ L
Sbjct: 60 --KHAELQEWSLLAAFLAEQALDIENGDESGVFAAYVKALPRRTGGVLDWPEEDVKTLLA 117
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
S + A ER +V G ++R +++P L P +W+F +LFSRL+RLP+
Sbjct: 118 GSPSQRAAYERQASVDGAIEEIR----AEFPQLTPG------ALRWAFDVLFSRLIRLPN 167
Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
G +ALVPWADMLNH ++D S V DR Y+PGEQVF SYG++ + ELL
Sbjct: 168 RGGELALVPWADMLNHKPGCNAYID--DSGGKVCLQPDRAYKPGEQVFASYGQRPSAELL 225
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
+SYGF P G NP D E+ L + +D+ K AL K GL E FP+++ G+P +L+
Sbjct: 226 ISYGFAPEVGENPDDEYEITLGIDPNDRYADAKAAALEKIGLRPVESFPLRLNGYPKQLL 285
Query: 376 AYAYLVVSPPSMKGKFEEMAAAA 398
YA + P + E +A A
Sbjct: 286 QYASFALCDPDDPKELEGLAEKA 308
>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 638
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 180/361 (49%), Gaps = 28/361 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA-----DSKWS 133
L +WLS G P Q + + GL A ++ ++GE L +P + +T +
Sbjct: 85 LSEWLSKQGFPTQDVILTGFGEEGVGLAAGRDFKEGEVALKIPENYTVTGVDVVNHPVVA 144
Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
P AG D L +L+ E S + S W Y+ P S + WT E +
Sbjct: 145 APAAGR-------GDVIGLTLWLMYERSLGEKSVWYPYLQTFPSTTLSPILWTAEEQQKL 197
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
L+ S E +R + G Y DL+ F+K P FP+E F++E FK +F ++ SR V L
Sbjct: 198 LKGSPALEEVQQRSAALEGEYEDLQ-SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYL 256
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK-KSNG 312
PS D ALVP+AD LNH + + +LDY Q VVF DR Y+ GEQVF SYG+ +SN
Sbjct: 257 PSAD-LFALVPYADALNHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNA 315
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
+LL++YGFV + N D ++L + L D+ K + L++ L + + FP+ + +P
Sbjct: 316 DLLITYGFV--DENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFPT 373
Query: 373 ELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESSISK 431
+L+ Y L + ++ A K+ KDI + +E + LQ ++ C + +
Sbjct: 374 QLLTYMRL--------SRLQD--PALFPKIVFDKDIMLDQANEYECLQLLMGECRTKLGN 423
Query: 432 Y 432
Y
Sbjct: 424 Y 424
>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
nagariensis]
Length = 508
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 171/354 (48%), Gaps = 44/354 (12%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
++D + Q WL GL Q + ++ RGLVA +++ +GE L+ +P LVITA
Sbjct: 15 DVDGPDLEPEFQSWLRSEGLSTQPLLLRHCGREGRGLVASRSLSRGEVLVKLPDHLVITA 74
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLI--------SEASFEKSSRWSNYISALPRQPY 180
+ AGE W LLA L + S ++RW Y++ LP++P
Sbjct: 75 ERA-----AGE---------WSLLALLLAEVKGRLAAGDRSSPAAARWGPYVAVLPQRPG 120
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD-LFPEEV-FNMET 238
+LL W E+ + L S ++ A + ++ +L I D L PE V +
Sbjct: 121 TLLDWPAKEVQQLLRGSPLQRLADSITSAASASWRELEPLIAQGRADGLVPEHVPLSKGD 180
Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQG--------- 287
+W+FG+L SR +RLPS L PWAD LNH S E LD+ G
Sbjct: 181 LEWAFGVLLSRCIRLPSRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGDRA 240
Query: 288 -------VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL-K 339
+V DR Y G+QV++SYG KS+GELLLSYGF P +NP L +++ +
Sbjct: 241 GGATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDR 300
Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAY-AYLVVSPPSMKGKFE 392
+ D K +AL ++GL + FP+++ G P L+ Y A+L P + FE
Sbjct: 301 QGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFE 354
>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Ectocarpus siliculosus]
Length = 521
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 196/371 (52%), Gaps = 39/371 (10%)
Query: 84 SDSGLPPQKM--AIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
SD G+ P + A+ VD E RG++A + I++G++L +P L++T D+ + E
Sbjct: 107 SDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLPIDLLLTKDA--AKKE 164
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYL 194
G + + ++ +A + E + K S WS+YI LP + Y W +L L
Sbjct: 165 FGADVITEDLSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVEEVYPTYLWAEEDL-ALL 223
Query: 195 EASQI--RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
E S + ++ R V Y + + K+P++ P EV E F+W+F +LFSR +R
Sbjct: 224 EGSPVIAATESMRRKLEV--EYATVENDLLDKFPEILPREVHTYEEFQWAFAMLFSRAIR 281
Query: 253 LPSMDG--RVALVPWADMLNHSCEVETFLD-------YDKSSQGVVFTTDRQYQPGEQVF 303
L + VALVP+AD+ NH+ +++D + K+ + VV+ DR Y+ EQV+
Sbjct: 282 LGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYA-DRSYKKMEQVY 340
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYG K N +LLL YGF NP +SV++ +SL ++D+ Y+ K L + GL ++ F
Sbjct: 341 ISYGPKGNSDLLLLYGFSLDR--NPYNSVDVTVSLDENDELYERKKAFLSEAGLPPTKAF 398
Query: 364 PIQITGWPLELMAYAYLV-VSPPSMKGK-FEEMAAAASNKMTSKKDIKCPEIDEQALQFI 421
P+ +P EL+ Y L+ ++ ++G+ E+++ KK E+ L +
Sbjct: 399 PLYNDRYPDELLQYLRLIQLNTDQLRGRTLEDLS-------FEKKQTDVNEL--MVLDSL 449
Query: 422 LDSCESSISKY 432
+++C+++I+ Y
Sbjct: 450 VEACKATIAGY 460
>gi|159465555|ref|XP_001690988.1| lysine N-methylase [Chlamydomonas reinhardtii]
gi|158279674|gb|EDP05434.1| lysine N-methylase [Chlamydomonas reinhardtii]
Length = 563
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 176/370 (47%), Gaps = 55/370 (14%)
Query: 80 QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD--SKWSCPEA 137
Q WL G+ Q + +++ RGLVA + + +GE LL +P SL++T ++ SC
Sbjct: 50 QAWLRGQGIITQPLVLRQCGREGRGLVADRPLGRGEALLQLPDSLLLTPQRAAEESC--L 107
Query: 138 GEVLKQCS---------------VPDWPLLATYLIS----EASFEKSSRWSNYISALPRQ 178
+L+Q S +P+W LLA YL A+ ++ SRW+ Y+ LP++
Sbjct: 108 APLLRQLSPAGASTSAAGAAALPLPEWSLLALYLAELRGRAAAGDRGSRWAAYVDMLPQR 167
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-LRIFSKYPDLFPEEV-FNM 236
P ++L W E + L S + A ++ +L L + L P V +
Sbjct: 168 PGTVLDWPAKETRQLLRGSPLLRLADSIAAAAAASWEELAPLIARGRAEGLVPAHVSLSK 227
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET------------------- 277
W+FG+L SR +RLP D L PWAD+LNH ET
Sbjct: 228 ADLDWAFGVLLSRCIRLPGRDQLQVLAPWADLLNHDVNAETGAAAAGAAGSGATGSGASG 287
Query: 278 ----FLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
LD++ +++G +V TDR Y G+QV++SYG KS+GELLLSYGF P NP
Sbjct: 288 SGGCHLDWEPTARGGAGALVLRTDRAYAAGQQVYVSYGPKSSGELLLSYGFCPPPAANPH 347
Query: 330 DSVELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
+L + + S D K E L K+GL S FP+++ G P L+ Y V + P
Sbjct: 348 QDYKLLVGVNDSAAADPLAALKAEVLAKHGLPPSLEFPLKLEGLPAGLLNYLAFVEAAPQ 407
Query: 387 MKGKFEEMAA 396
+ + ++ +
Sbjct: 408 VPQELHDLGS 417
>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 889
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 147/311 (47%), Gaps = 48/311 (15%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+L+ WL+ GLPPQK+AI RGLVA + +RK EKLL VP L++TAD
Sbjct: 32 GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKLLNVPAQLLLTADVALQHSA 91
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPRQPYSLLYWTRAELDRYL 194
G +L+ C VP W +LAT+L + + W Y+ ALP Q +L W E+D L
Sbjct: 92 YGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTGCVLEWASEEVD-LL 150
Query: 195 EASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+ A E I + +L LR + P + +W F +L SRL+R
Sbjct: 151 RGTAAMRAADEIIAACSASVAELAPILRESAS----MPGGPLTEQDLRWGFSMLLSRLIR 206
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
LP G+ L +C V++SYG+KS+
Sbjct: 207 LP---GKQDL--------EAC----------------------------VYVSYGQKSDT 227
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
+LLLSYGF+P +NP + L LSL++ D C+ K L + G SA FP+++ P
Sbjct: 228 QLLLSYGFMPAPLSNPHSACNLRLSLQRDDPCFDAKRALLEEAGHSACMEFPLRLDSLPQ 287
Query: 373 ELMAYAYLVVS 383
+L+ YA + +
Sbjct: 288 KLINYAAFLCT 298
>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
Length = 464
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/359 (28%), Positives = 177/359 (49%), Gaps = 27/359 (7%)
Query: 87 GLPPQKMAI--QKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
G P + + + D GE RGL+A K I++GEKL+ +P +L+++ D E + L
Sbjct: 80 GRAPHQCFVSNETTDEGEPCGRGLLAFKKIQQGEKLIEIPENLILSVDRDQIKNEGNDFL 139
Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR--YLEASQI 199
+ + L +LI + + S+W Y LPR+ L R L+ +L S+
Sbjct: 140 NE-----YDSLGIFLIQQMAMGDKSKWKIYFDILPREE-DLNLGFRWNLNDIVFLRGSKT 193
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR 259
++ + + L IFSK +P +FN+ ++W+ IL SR + L ++ +
Sbjct: 194 LNASLYLKEKIKIQFLRLEKTIFSKNRLKYPVSIFNLAQWEWALSILLSRAIFLQNL-KK 252
Query: 260 VALVPWADMLNHSCEVETFLDYDKSS----QGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
V+LVP+AD +NH+ ++++ K S +V D+ Y +Q+F +YG+K+N ELL
Sbjct: 253 VSLVPYADFMNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELL 312
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
L YGF+ NP DS+EL +SL D +++K + + + ++ FPI +P EL
Sbjct: 313 LLYGFILER--NPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKELY 370
Query: 376 AYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
+ +S EE+ + + D EI++ + +L SCE + YS+
Sbjct: 371 EFLRFCISNQ------EELGSTDLSDFNF-NDENNYEIEKIIRKLVLFSCEKLLKNYSK 422
>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Galdieria sulphuraria]
Length = 490
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/334 (31%), Positives = 162/334 (48%), Gaps = 35/334 (10%)
Query: 66 WGCEIDSLENASTLQKWLSDSGL----------PPQKMAI--QKVDVGE---RGLVALKN 110
W EI S WL ++G+ P ++ I + D GE RGL++ ++
Sbjct: 76 WSSEI------SAFYDWLKENGVYLSEKASWTHAPHRLVIAEETKDEGEYSGRGLLSSRS 129
Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
+ GEK+L +P L+ T K + + ++ + L+ E + S +
Sbjct: 130 VNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLLYEKAKGFDSFFKP 187
Query: 171 YISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL 228
Y+ LP + L W+ +LD L+ S + ++ Y L I + P+
Sbjct: 188 YLDILPTLDELNPLFLWSNKDLD-LLQGSPTLSACEQLRDKLLREYTYLGKNIIPQIPN- 245
Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG- 287
F + + + F+W+FGILFSR + PS R+ALVP+AD+LNHS F+D +K G
Sbjct: 246 FASKPIDFKQFQWAFGILFSRAICFPS-SKRIALVPYADLLNHSPFCSAFIDEEKIPFGN 304
Query: 288 ----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
V DR Y+P EQV++SYG +SN ELLL YGF NP D VE+ + L K+D
Sbjct: 305 GVTEAVVYVDRLYEPYEQVYVSYGPRSNQELLLLYGFSLER--NPFDCVEITIGLDKTDP 362
Query: 344 CYKEKLEALRKYGLSASECFPIQITGWPLELMAY 377
Y EK L YG S + FP+ + +P+E+ +
Sbjct: 363 LYLEKCRMLESYGKSPLQSFPLYMDRYPVEMAEF 396
>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 465
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 185/365 (50%), Gaps = 37/365 (10%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSK 131
+T++ WL +SG + + +++ + RG+ L+ ++GEK+L +P S++ T +
Sbjct: 8 NTMESWLKESG----AVGLDGLELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHA 63
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
++ P G L P P L TYL+ S E ++++ALP S +++T
Sbjct: 64 YADPLLGPALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEE 123
Query: 189 ELD------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
EL+ Y Q+ E++IE + L +++F ++ DLFP + F++E +KW+
Sbjct: 124 ELEVCAGTSLYTITKQL-EQSIE------DDHRALVMQLFIQHRDLFPLDKFSIEDYKWA 176
Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
++SR + DG+ L P+ADMLNHS E + YD SS + + Y+PG+
Sbjct: 177 LCTVWSRRMDFQLRDGKSMRLLAPFADMLNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGD 236
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
QVFI+YG N LL YGFV NP+D+ +L LS Y++K + GL ++
Sbjct: 237 QVFINYGSVPNSRLLRLYGFVIP--GNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDST 294
Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
P+ +T PL YL + + ++AA A +K D K + +E + LQ
Sbjct: 295 STIPLTLTD-PLPKNVLRYLRI----QRADASDLAAMALQN--AKADEKVSDSNEVEILQ 347
Query: 420 FILDS 424
F+++S
Sbjct: 348 FLVES 352
>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Chroomonas mesostigmatica
CCMP1168]
Length = 464
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 160/296 (54%), Gaps = 18/296 (6%)
Query: 99 DVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
D GE RGL+A + I++GEKL+ +P +L++ K + E L + + LA
Sbjct: 89 DEGESCGRGLLAFRKIQQGEKLIEIPENLILKKSLKENRSEDLSFLNE-----YDSLAIK 143
Query: 156 LISEASFEKSSRWSNYISALPRQP-YSLLY-WTRAELDRYLEASQIRERAIERITNVIGT 213
I E + + S+W Y LP++ +L++ W +++ +L S++ + +
Sbjct: 144 AIQERAIGEKSKWKVYYEILPKEKDLNLVFRWKISDI-VFLRGSKVLNASFYLKEKIKIQ 202
Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
+ + IFSK ++PE++FN+++++W+ +L SR + L +M ++ALVP+AD +NH+
Sbjct: 203 FLRIEKTIFSKNRLVYPEKIFNLQSWEWAISLLLSRAIFLQNM-KKIALVPYADFINHNP 261
Query: 274 EVETFLDYDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
++++ K + +V D+ Y +Q+F +YG+K+N ELL+ YGF+ NP
Sbjct: 262 FSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIER--NPF 319
Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPP 385
DS+EL ++L D+ Y +K + + + FP+ +P EL + L +S P
Sbjct: 320 DSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKELYEFMRLCLSGP 375
>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
Length = 456
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 183/363 (50%), Gaps = 37/363 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
++ WL +SG + + +++ + RG+ L+ ++GEK+L +P S++ T + ++
Sbjct: 1 MESWLKESG----AVGLDGLELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHAYA 56
Query: 134 CPEAGEVLKQCSVPDWP--LLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAEL 190
P G L P P L TYL+ S E ++++ALP S +++T EL
Sbjct: 57 DPLLGPALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEEEL 116
Query: 191 D------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
+ Y Q+ E++IE + L +++F ++ DLFP + F++E +KW+
Sbjct: 117 EVCAGTSLYTITKQL-EQSIED------DHRALVMQLFIQHRDLFPLDKFSIEDYKWALC 169
Query: 245 ILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
++SR + DG+ L P+ADMLNHS E + YD SS + + Y+PG+QV
Sbjct: 170 TVWSRRMDFQLRDGKSMRLLAPFADMLNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQV 229
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
FI+YG N LL YGFV NP+D+ +L LS Y++K + GL ++
Sbjct: 230 FINYGSVPNSRLLRLYGFVIP--GNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTST 287
Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFI 421
P+ +T PL YL + + ++AA A +K D K + +E + LQF+
Sbjct: 288 IPLTLTD-PLPKNVLRYLRI----QRADASDLAAMALQ--NAKADEKVSDSNEVEILQFL 340
Query: 422 LDS 424
++S
Sbjct: 341 VES 343
>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
[Thalassiosira pseudonana CCMP1335]
Length = 434
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 163/341 (47%), Gaps = 42/341 (12%)
Query: 87 GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
G P +AI V E RGL+A ++I G++LL +P L IT S G+
Sbjct: 23 GEAPHPLAISTETVDEITNESSGRGLLARRSINDGDELLKIPMDLCITRKSARKA--LGK 80
Query: 140 VLKQCSVPDWPLLATYLISEA-SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
Q + ++ +A LI E S W Y+ LP + W +L +L+
Sbjct: 81 DALQDGINEYLAIACQLIHEKYVLGDESEWDAYMGVLPEVEEVNPTFTWKDEDL-AFLDG 139
Query: 197 SQIRERAIERITNVIGTYNDL---RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
S + + Y+ L + + +K+PD FP E F E + W+F +LFSR +RL
Sbjct: 140 SPVVAATRSLQMKLRREYDALLGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRL 199
Query: 254 PSMD--GRVALVPWADMLNHSCEVETFLD--------YDKSSQGVVFTTDRQYQPGEQVF 303
++ R+A+VP+AD++NHS F+D + + V+ DR Y+ EQV+
Sbjct: 200 RNLQVGERLAMVPYADLINHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVY 259
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK-------------KSDKCYKEKLE 350
ISYG+KSN ELLL YGF NP +SV++ +S+ + D +EK+E
Sbjct: 260 ISYGQKSNAELLLLYGFALER--NPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVE 317
Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYL-VVSPPSMKGK 390
L G + FP +P+E++ + L +++P +GK
Sbjct: 318 FLASVGRDQTVDFPCYADRYPVEMLEFLRLMMMTPEDTRGK 358
>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
Length = 648
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 112/376 (29%), Positives = 182/376 (48%), Gaps = 53/376 (14%)
Query: 55 TKTTVTQNMIPWGCEIDSLENASTLQKWLSDS---GLPPQKMAIQKVDVGE-------RG 104
TK + +I W LE + +L++S G P +AI V E RG
Sbjct: 156 TKLSANARLISW------LEEEGGV--YLAESSTWGEAPHPLAISTETVDEITNESSGRG 207
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE- 163
L+A ++I G++LL +P L +T K + E G+ Q + ++ +A LI E +
Sbjct: 208 LLARRSINDGDELLKIPLDLCLT--RKSARRELGKDALQEGINEYLAVACQLIHEKFVKG 265
Query: 164 KSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR- 220
+ S ++ Y+ LP + W +L +LE S + + Y+DL
Sbjct: 266 EDSFYAAYMGVLPEVDEVNPTFTWPDEDL-AFLEGSPVVAATRSLQMKLRREYDDLLGGP 324
Query: 221 --IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD--GRVALVPWADMLNHSCEVE 276
+ +K+P FP E + E ++W+F +LFSR +RL ++ R+A+VP+AD++NHS +
Sbjct: 325 DGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMVPYADLINHSAFSQ 384
Query: 277 TFLD--------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
F+D + + V+ DR Y+ EQV+ISYG+KSN ELLL YGF NP
Sbjct: 385 AFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFALER--NP 442
Query: 329 SDSVELPLSLK-------------KSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
+SV++ +S+ + D EKLE L G + FP +P+E++
Sbjct: 443 YNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDFPCYADRYPVEML 502
Query: 376 AYAYL-VVSPPSMKGK 390
Y L +++P +GK
Sbjct: 503 EYLRLMMMTPEDTRGK 518
>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Phaeodactylum tricornutum CCAP
1055/1]
Length = 575
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 177/380 (46%), Gaps = 50/380 (13%)
Query: 50 NDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE------- 102
NDA+ K T N+I W + S W G P +AI E
Sbjct: 73 NDAN-PKFTANTNLIQW-LTTEGNVYLSEESSW----GEAPHPLAISTETKDEITNESSG 126
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL+A ++I G++LL +P +L +T + G+ + + ++ +A +LI E +
Sbjct: 127 RGLLARRDINDGDELLRIPMALCMTKSAARKA--VGKDVLPSEINEYLAMACHLIYERNV 184
Query: 163 E-KSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-- 217
+ S W Y+ LP + W +L +L S + + Y+ L
Sbjct: 185 RGEESPWKPYLDVLPDIDEVNPTFTWPDEDL-AFLNGSPVIAATKSLQMKLRREYDALLG 243
Query: 218 -RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSCE 274
+ +KYPD FP E FN + ++W+F +LFSR +RL S+ +ALVP+AD++NHS
Sbjct: 244 GEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYADLINHSPF 303
Query: 275 VETFLD--------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
+ ++D + + V+ DR Y+ EQ++ISYG KSN ELLL YGF
Sbjct: 304 SQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLLYGFAVER-- 361
Query: 327 NPSDSVELPLSLKKS---------------DKCYKEKLEALRKYGLSASECFPIQITGWP 371
NP +SV++ +S+ D +EK L + G A+ FP +P
Sbjct: 362 NPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFPCYADRYP 421
Query: 372 LELMAYAYLV-VSPPSMKGK 390
+E++ Y L+ ++P +GK
Sbjct: 422 VEMLEYLRLMQMTPEDTRGK 441
>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
Length = 600
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 176/361 (48%), Gaps = 20/361 (5%)
Query: 71 DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+ L+ A + KWL + G P Q + + + G A ++++ G+ L +P + +TA
Sbjct: 42 ERLDAARDMTKWLQEQGFPQQPLLVSSFEDKGLGCCATRDLQAGDAALSIPENFTVTAVD 101
Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
+ P + + LA +L+ E + S W Y+ P SLL W + E
Sbjct: 102 VANHPVISSAAE--GRDELVGLALWLMYEQERSQDSPWYPYVKVFPASTLSLLLWEQEEQ 159
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
+ L S + +++T++ T++ L+ + K FP E F FK +F ++ SR
Sbjct: 160 EELLRGSSALAKVKDQLTSLRQTFDALKDTL--KDNKDFPMEKFTFSAFKTAFSVVLSRA 217
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK- 309
V LPS + ALVP+ D++NH + LDYD Q V D++Y+ G+QVF SY +
Sbjct: 218 VYLPSAE-LFALVPFGDLINHESS-RSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNL 275
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
++ + L+ YGF+ + ++ +D +E+ + L D K E L++ GL+ + FP+ +
Sbjct: 276 TSADFLIRYGFL--DESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR 333
Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESS 428
+P +L+ Y L S G F K+T +KD+ + +E + L ++ C +
Sbjct: 334 FPTQLLTYTRLARIQDS--GLFA--------KITFEKDLIVSQTNEYETLMLLMADCRTK 383
Query: 429 I 429
+
Sbjct: 384 L 384
>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Cyanidioschyzon merolae strain 10D]
Length = 567
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 157/350 (44%), Gaps = 80/350 (22%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE--- 159
RG +A ++I+ GE L VP L T D A V + ++ LAT L+ E
Sbjct: 116 RGFLARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGL 175
Query: 160 -ASFEKSSR-----WSNYISALPRQPY-----------------SLLYWTRAELDRYLEA 196
S++KS R W Y+ LP P+ +L W E+ ++L+
Sbjct: 176 DESWKKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEM-QWLQG 234
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPS 255
S A + V Y + R++ ++P +F E F +E F W+FG+LFSR V LP+
Sbjct: 235 SPTLLSARALRSKVEREYAEACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPA 294
Query: 256 MDGRVALVPWADMLNHSCEVETFLD----------------------------------- 280
+G +ALVP+AD+ NHS +F+D
Sbjct: 295 ENGMLALVPYADLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSDDAGA 354
Query: 281 --------YDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
Y + +Q VV DR Y EQV++SYG+KSN ELLL YGFV NP +S
Sbjct: 355 VANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFV--SDRNPYNS 412
Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSAS------ECFPIQITGWPLELM 375
VE+ +SL S+ L+ R + L+ ECFP+ +PLELM
Sbjct: 413 VEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELM 462
>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
Length = 481
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 175/375 (46%), Gaps = 25/375 (6%)
Query: 66 WGCEI-----DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFV 120
W C + L+ A + KWL + G P Q + + + G A ++++ G+ L +
Sbjct: 32 WLCRASIADEERLDAARDMTKWLQEQGFPQQPLLVSSFEDKGLGCCATRDLQAGDAALSI 91
Query: 121 PPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY 180
P + +TA + P + + LA +L+ E + S W Y+ P
Sbjct: 92 PENFTVTAVDVANHPVISSAAE--GRDELVGLALWLMYEQERSQDSPWYPYLKVFPASTL 149
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
S L W + E + L S + +++T++ T++ L+ + K FP E F FK
Sbjct: 150 SPLLWEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL--KDNKDFPMEKFTFSAFK 207
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
+F ++ SR V LPS + ALVP+ D++NH + LDYD Q V D++Y+ G+
Sbjct: 208 AAFSVVLSRAVYLPSAE-LFALVPFGDLINHESS-RSLLDYDIEEQKVKLAVDKRYKKGD 265
Query: 301 QVFISYGKK-SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
QVF SY + ++ + L+ YGF+ N D +E+ + L D K E L++ GL+
Sbjct: 266 QVFASYAQNLTSADFLIRYGFLDESDEN--DFIEIEVGLVSGDSLAPLKREILQEVGLTV 323
Query: 360 SECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD-IKCPEIDEQAL 418
+ FP+ + +P +L+ Y L S G F K+T +KD I C + + L
Sbjct: 324 PQKFPVYLNRFPTQLLTYTRLARIQDS--GLFA--------KITFEKDLIVCQTNEYETL 373
Query: 419 QFILDSCESSISKYS 433
++ C + + +S
Sbjct: 374 MLLMADCRTKLLSFS 388
>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
Length = 1753
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 182/373 (48%), Gaps = 32/373 (8%)
Query: 76 ASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
+ +++ WL SG + + + V RG+ AL++ +KGE++L +P + + TA++ +
Sbjct: 882 SESMEAWLKHSGAVGVDAIEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARAD 941
Query: 135 PEAGEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTR 187
P G VL+ SV D LA +L+ S + + R +I+A+P++ + +++
Sbjct: 942 PLLGPVLRSAQPPLSVED--TLAIHLLFVKSRTAGYEGQRL--HIAAMPQRHSASIFFAE 997
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
EL + E S + + V + L +++ S++ DLFP + F +E +KW+ ++
Sbjct: 998 DEL-QVCEGSSLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDYKWALCTIW 1056
Query: 248 SRLVRLPSMDG-RVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
SR + D V LV P ADMLNHS +V+ YD +S + + YQ G+Q+FI
Sbjct: 1057 SRAMDFAVSDTTSVRLVAPLADMLNHSLDVKQCHAYDPTSGDLSILAAKDYQVGDQIFIY 1116
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG N LL YGFV + NP+DS +L L Y++K GL ++ P+
Sbjct: 1117 YGSVPNNRLLRLYGFVLLD--NPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPL 1174
Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMT----SKKDIKCPEIDE-QALQF 420
PL YL + + + AA MT + D K + +E Q LQF
Sbjct: 1175 -TAKHPLPKNVLRYL---------RTQRLDAADVADMTLQLLNGTDGKVNDGNEIQVLQF 1224
Query: 421 ILDSCESSISKYS 433
++DS S + +
Sbjct: 1225 LIDSLGSVLEGFG 1237
>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
2508]
gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 459
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 106/365 (29%), Positives = 176/365 (48%), Gaps = 40/365 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
+ WL SG + + +++ + RG+ L+ ++GEK+L +P ++ T ++
Sbjct: 1 MNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYA 56
Query: 134 CPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
P G L+ SV D LATY++ S E ++I+ALP S + +
Sbjct: 57 DPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPTSYSSSILFAED 114
Query: 189 ELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
+L+ Y Q+ E++IE + L +R+F ++PDLFP + F +E +KW+
Sbjct: 115 DLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDYKWA 167
Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
++SR + DG L P+ADMLNH+ EV+ YD SS + + Y+ G+
Sbjct: 168 LCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGD 227
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
QVFI+YG N LL YGFV NP+DS +L LS +++K + GL ++
Sbjct: 228 QVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDST 285
Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
P+ +T PL YL + G A + + T D K + +E + L+
Sbjct: 286 ATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSNEVEILR 339
Query: 420 FILDS 424
F+++S
Sbjct: 340 FLVES 344
>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 431
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 157/316 (49%), Gaps = 24/316 (7%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADSKWS--CPEAGEVLKQCSVPDWPLLAT----Y 155
R +VA +I GE LL VP SLV+ +AD+ + PE +L + ++PL AT
Sbjct: 57 RSVVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDE----EFPLSATNENAL 112
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ + +S W YI LP + L+++ EL YLE S + A +R + Y+
Sbjct: 113 LLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELS-YLEGSSLHHFARQRRRAIESQYD 171
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
+ +F YP+ F E F+++ +KW+ +++SR + +G+ LVPWADM N + E
Sbjct: 172 TIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVD--EGKRGLVPWADMFNMAPET 229
Query: 276 ETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDS 331
E + D ++++ + GEQ+F++YG+ SN +LL+ YGFV NP D+
Sbjct: 230 EQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLE--NNPHDA 287
Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSASECF--PIQITGWPLELMAYAYLVVSPPSMKG 389
V P++ S K L LR + L + F P + +P L+A + V+
Sbjct: 288 VVFPMTHSSSASPRKRGL--LRAHDLDRDQFFVGPPALGEFPEHLLAAFRVTVATEQELD 345
Query: 390 KFEEMAAAASNKMTSK 405
E +A ++ S+
Sbjct: 346 ALLEQSAQGRQRLPSR 361
>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 459
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/365 (29%), Positives = 176/365 (48%), Gaps = 40/365 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
+ WL SG + + +++ + RG+ L+ ++GEK+L +P ++ T ++
Sbjct: 1 MNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYA 56
Query: 134 CPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
P G L+ SV D LATY++ S E ++I+ALP S + +
Sbjct: 57 DPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPASYSSSILFAED 114
Query: 189 ELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
+L+ Y Q+ E++IE + L +R+F ++PDLFP + F +E +KW+
Sbjct: 115 DLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDYKWA 167
Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
++SR + DG L P+ADMLNH+ EV+ YD SS + + Y+ G+
Sbjct: 168 LCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGD 227
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
QVFI+YG N LL YGFV NP+DS +L LS +++K + GL ++
Sbjct: 228 QVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDST 285
Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
P+ +T PL YL + G A + + T D K + +E + L+
Sbjct: 286 ATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSNEVEILR 339
Query: 420 FILDS 424
F+++S
Sbjct: 340 FLVES 344
>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 492
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 189/386 (48%), Gaps = 40/386 (10%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++W+S G + V G GLVA +N+ +GE + VP L + AD+ A
Sbjct: 60 NFRRWISSQGADTGAASPTVVPEG-LGLVAARNLPRGEVVAEVPKKLWMDADAV----AA 114
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
++ + C + W ++ ++ EA+ S W+ Y++ LPRQ S ++W+ EL
Sbjct: 115 SDIGRACRSGGDLRPWVSVSLLILREAARGGDSLWAPYLAILPRQTDSTIFWSEEEL-LE 173
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
++ +Q+ + V ++++ +I DLFP+ + + F W+FGIL SR+
Sbjct: 174 IQGTQLLSTTMGVKEYVQSEFDNVEAKIIGPNKDLFPDTI-TFDDFLWAFGILRSRV--F 230
Query: 254 PSMDG-RVALVPWADMLNHSCEVET-----------FLDYDKSSQGVVFT--TDRQYQPG 299
P + G ++AL+P+AD++NHS ++ + FL D VVF+ T + + G
Sbjct: 231 PELRGDKLALIPFADLINHSADITSKQSCWEIQGKGFLGRD-----VVFSLRTPMEVKSG 285
Query: 300 EQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
EQV++ Y KSN EL L YGF E + DS L L + +SD Y +KL+ G+
Sbjct: 286 EQVYVQYDLDKSNAELALDYGFT--ETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMG 343
Query: 359 ASECFPIQIT-GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
+ F + + P +++ Y L+ + E A NK+ ++ +E++
Sbjct: 344 ETAYFDVVLGESLPPQMITYLRLLCLGGTDAFLLE---ALFRNKVWGFLELPVSRDNEES 400
Query: 418 L-QFILDSCESSISKYSRFLQVKELL 442
+ Q I +C+S+++ Y ++ E L
Sbjct: 401 ICQVIQTACKSALTAYHTTIEEDEEL 426
>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 437
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 24/370 (6%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL +SG + + + + RG+ LK ++GE +L +P ++ T + ++
Sbjct: 1 MESWLKESGAVGLDNLELADFPITGRGVRTLKCFKEGENILTIPSGILWTVEHAYADSIL 60
Query: 138 GEVLKQCSVP--DWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
G VL+ S+P LA Y++ S S R N+++ALP S +++ +L+
Sbjct: 61 GPVLRSTSLPLSVEDTLAIYILFVRSRKSGYDGPR--NHVAALPASYSSSIFFMEDQLEV 118
Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
S I ++ +RI + Y L +R+ +YPDLFP + F +E +KW+ ++SR
Sbjct: 119 CAGTSLYTITKQLEQRIED---DYRGLVVRMLGQYPDLFPLDKFTVEDYKWALCTVWSRA 175
Query: 251 VRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
+ DG+ L P+ADMLNHS E + YD SS + + Y+ G+QVFI+YG
Sbjct: 176 MDFVLPDGKSIRLLAPFADMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGP 235
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
N LL YGFV NP+DS +L L+ +K+K + GL ++ + T
Sbjct: 236 MPNNRLLRLYGFVVP--GNPNDSYDLVLATHPMAPFFKQKQKLWASAGLDSTTTITLTFT 293
Query: 369 GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCES 427
PL YL V E A + + T+ D K + +E + L+F+++S
Sbjct: 294 D-PLPKDVLRYLRVQRLD-----ESDVAVLALQQTNTTDAKISDSNEVEVLRFLVESIGG 347
Query: 428 SISKYSRFLQ 437
++ + ++
Sbjct: 348 LLNNFGTHVE 357
>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
gi|223949395|gb|ACN28781.1| unknown [Zea mays]
gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
Length = 489
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 179/376 (47%), Gaps = 21/376 (5%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
++WL+ G AI GLVA +++ +GE + VP L + AD+ +
Sbjct: 58 FRRWLASHGAGDGGKAIPAAVPEGLGLVAARDLPRGEVVAEVPKKLWMDADAVAASDIGR 117
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
+ W +A L+SE + S W+ Y++ LPRQ S ++W+ EL ++ +Q
Sbjct: 118 ACGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQ 176
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG 258
+ + V ++ ++ I S DLFP + + F W+FG+L SR+ P + G
Sbjct: 177 LLSTTVGVKEYVQSEFDSVQAEIISTNKDLFPGSI-TFDDFLWAFGMLRSRV--FPELRG 233
Query: 259 -RVALVPWADMLNHSCEVET-FLDYDKSSQGVV-------FTTDRQYQPGEQVFISYG-K 308
++AL+P+AD++NHS + + ++ +G+ T + G+Q++I Y
Sbjct: 234 DKLALIPFADLVNHSPNITSEGSSWEIKGKGLFGRELMFSLRTPVNVKSGQQIYIQYDLD 293
Query: 309 KSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
KSN EL L YGFV +NPS DS + L + +SD Y +KL+ GL + F + I
Sbjct: 294 KSNAELALDYGFVE---SNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDV-I 349
Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK-CPEIDEQALQFILDSCE 426
PL YL + F + A N + ++ P+ +E Q + D+C+
Sbjct: 350 LNEPLPPQMLPYLRLLCIGGTDAF-LLEALFRNSVWGHLELPLSPDNEESICQAMRDACK 408
Query: 427 SSISKYSRFLQVKELL 442
S+++ Y ++ E L
Sbjct: 409 SALADYHTTIEEDEEL 424
>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 523
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 163/350 (46%), Gaps = 33/350 (9%)
Query: 45 SVSTTNDASRTKTTVTQNMIPWG------CEIDSLENASTLQKWLSDSGLPPQKMAI--Q 96
+VS+ SR + T+T ++ + E+ L++W+ + GLP K+++
Sbjct: 53 AVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPECKVSLAEH 112
Query: 97 KVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL 151
+ G++G +VA ++++ GE L +P SLV+T + E+L + +
Sbjct: 113 QPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTTNKLSELAC 172
Query: 152 LATYLISEASFEKSSRWSNYISALPRQP-------YSLLYWTRAELDRYLEASQIRERAI 204
LA YL+ E K S W YI L RQ S L W+R EL+ Y S ++E +
Sbjct: 173 LALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTGSTMKEVVL 232
Query: 205 ERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SM 256
ER+ + Y +L +F +YP P E F+ E FK +F + S +V L S+
Sbjct: 233 ERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSL 292
Query: 257 DGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
R ALVP +L + + L VV DR Y+ G+ + + G + N +LL
Sbjct: 293 ARRFALVPLGPPLLAYKSNCKAML--KAVDDNVVLEVDRAYKAGDPIAVWCGPQPNSKLL 350
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
L+YGFV + NP D + + SL D Y++K ++K + F I
Sbjct: 351 LNYGFVDED--NPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQI 398
>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
Length = 509
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 95/321 (29%), Positives = 152/321 (47%), Gaps = 27/321 (8%)
Query: 68 CEIDSLENAS--TLQKWLSDSGLPPQKMAIQKVDV--GERG--LVALKNIRKGEKLLFVP 121
+DS A L WL G+ K++ VD G RG LVA ++I G+ +L +P
Sbjct: 48 ASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGGRGWGLVAAEDIGGGDAVLAIP 107
Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
SL +T D+ + P + W +A L+ E S + SRW+ Y++ALP Q +
Sbjct: 108 RSLWMTVDTALASPIGAHCGDEAG---WIAVALQLLHERSIGEKSRWAAYVNALPAQLDA 164
Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
L+W+ E+ L +Q+ + A + GT+ L+ F PD+FP + F+ +F W
Sbjct: 165 PLFWSAEEV-ATLTGTQLLDAAAGYDSYARGTWARLKESAFDANPDVFPSDAFDEPSFLW 223
Query: 242 SFGILFSRLVRLPSMDGRVALVPWADMLNHSC-----------EVETFLDYDKSSQGVVF 290
+FGIL SR +ALVP DM NHS V KS ++
Sbjct: 224 AFGILRSRCQAPVDQGADIALVPGLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLL 283
Query: 291 TTDRQYQ----PGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY 345
T++ + G +VF++YG +K + +L L YGF + P V P+++ +SD
Sbjct: 284 RTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYGFTDAFASRPG-YVLGPIAIPESDPNA 342
Query: 346 KEKLEALRKYGLSASECFPIQ 366
+K++ L GL + F ++
Sbjct: 343 FDKMDVLEVAGLREAPSFVLR 363
>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor
[Oryza sativa Japonica Group]
gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 177/354 (50%), Gaps = 22/354 (6%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
G GLVA +++ +GE L VP L + AD+ + G V + P W +A L+ EA
Sbjct: 86 GGLGLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGRGGLRP-WVAVALLLLREA 144
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S W+ Y++ LPRQ S ++W+ EL ++ +Q+ + V + +
Sbjct: 145 ARGAGSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFESVEAE 203
Query: 221 IFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET-F 278
I S+ +LFP V FN F W+FGIL SR+ D ++AL+P+AD++NHS ++ +
Sbjct: 204 IISENRELFPGTVTFN--DFLWAFGILRSRVFAELRGD-KLALIPFADLVNHSDDITSKE 260
Query: 279 LDYDKSSQG-----VVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSD 330
++ +G VVF+ T + GEQ++I Y KSN EL L YGF E + D
Sbjct: 261 SSWEIKGKGLFGRDVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFT--ESNSSRD 318
Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVSPPSMKG 389
+ L L + +SD Y +KL+ G+ + F I + P +++ Y L+ +
Sbjct: 319 AYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLGESLPPQMLPYLRLLCLGGTDAF 378
Query: 390 KFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
E A N + ++ + +E+A+ Q I ++C+S++ Y ++ E L
Sbjct: 379 LLE---ALFRNAVWGHLELPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEEL 429
>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
Length = 455
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/368 (29%), Positives = 177/368 (48%), Gaps = 28/368 (7%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL DSG + + + V RG+ AL++ ++GE++L +P + + T ++ P
Sbjct: 1 MEAWLKDSGAIGVDGIEVADFAVTGRGVKALRSFKEGERILTIPSACLWTVKKAYADPLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWS-----NYISALPRQPYSLLYWTRA 188
G VL+ SV D LA YL+ F KS ++I+A+P+ + +++T
Sbjct: 61 GPVLRAAQPPLSVEDS--LALYLL----FVKSRTLGYEGQRHHIAAMPQSYSASIFFTDD 114
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
EL + + S + + V Y L + + S++ DLFP + F +E +KW+ ++S
Sbjct: 115 EL-QVCKGSSLYALTPQLEQRVHDDYRQLLVALLSQHRDLFPLDQFTIEDYKWALCSIWS 173
Query: 249 RLVRLP-SMDGRVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
R + S V LV P ADMLNHS +V+ YD +S + + YQ G+Q+FI Y
Sbjct: 174 RAMDFAVSETASVRLVAPLADMLNHSPDVKQCHAYDPTSGDLSILAAKDYQVGDQIFIYY 233
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
G N LL YGFV + NP+DS +L L Y++K GL ++ P+
Sbjct: 234 GSVPNNRLLRLYGFVLPD--NPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLT 291
Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSC 425
+ PL YL + E + ++ + D K + +E Q LQF++DS
Sbjct: 292 VKD-PLPNNVLRYLRIQRLD-----ESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSI 345
Query: 426 ESSISKYS 433
S + +
Sbjct: 346 GSLLEGFG 353
>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 458
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 181/391 (46%), Gaps = 49/391 (12%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
S L WL G + A+ + +G R L+++++I++GE++L V L+IT +
Sbjct: 35 SPLLAWLESRG---ETEALTSLTIGNTNQGRALLSIRHIKRGEQVLRVSRELMITPNRLP 91
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAEL 190
SC E E L + V +W LA + + K+S W YI LP R + ++W EL
Sbjct: 92 SCVE--ESLSE-DVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRGLQNTVFWRDEEL 148
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
+ L S + ++ R T +I DL + +KYP+LF E V +E+FK ++ + SR
Sbjct: 149 E-LLRQSNVYDQTEHRKT-LISNQFDLVQAVVNKYPELFGETV-TLESFKHAYCVASSRS 205
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ ++ G + +VP+ DM NH L Y + D+ Y G QV I+YG
Sbjct: 206 WGVEAL-GSITMVPFVDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLP 264
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQITG 369
N L L +GF + NP D V++ + D EKL+ LR +G++ C + G
Sbjct: 265 NSSLALDFGFTLPD--NPHDEVQIWMEAPSGDPLRAEKLKLLRDHGIATDPFCDGTESGG 322
Query: 370 -WPLELMAYAYLVVSPPSMKGK--------------------FEEMAAAASNKMT--SKK 406
W + +S P+ +GK E MA A + +++
Sbjct: 323 AW------FGLREISSPAARGKGIPRALRTFVRVISASTTKELEAMAEDAKRRQGRLAQR 376
Query: 407 DIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
+K + + +AL+ +LD+ E +S + L+
Sbjct: 377 PLKDGK-EARALKLLLDNIEQCVSSHRSALK 406
>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
Length = 555
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 177/362 (48%), Gaps = 32/362 (8%)
Query: 78 TLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+++ WL G + + + V RG+ L+ +KGE++L +P ++ T + ++ P
Sbjct: 87 SMESWLKSCGAVGLDDLELADFPVTGRGVKTLRRFKKGERILTIPSGILWTVEHAYADPL 146
Query: 137 AGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELD 191
G VL+ SV D LATY++ S E ++++A P S +++ EL+
Sbjct: 147 VGPVLRSARPPLSVED--TLATYILFIRSRESGYDGLRSHVAAFPTSYPSSIFFAEEELE 204
Query: 192 RYLEAS-----QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
S + +R+IE Y L +R+ ++ DLFP + F++E +KW+ +
Sbjct: 205 VCAGTSLYTITKKLDRSIED------DYRTLVVRVLAQSRDLFPLDKFSIEDYKWALCTV 258
Query: 247 FSRLVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+SR + LP + + P+ADMLNHS EVE YD SS + + Y+ G+Q FI
Sbjct: 259 WSRAMDFVLPDGNSIRLVAPFADMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFI 318
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
YG N LL YGFV NP+DS +L +S S ++ K + GL ++
Sbjct: 319 YYGSIPNSRLLRLYGFV--MPGNPNDSYDLVISTHPSAPFFERKQKLWASAGLDSACTIS 376
Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMA-AAASNKMTSKKDIKCPEIDE-QALQFIL 422
+ +T PL YL + + +E AA +++ + D K + +E + L+F++
Sbjct: 377 LTLTD-PLPKNVLRYLRIQ------RLDESDFAAIAHRQLAAADEKINDSNEVEVLRFLV 429
Query: 423 DS 424
+S
Sbjct: 430 ES 431
>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
Length = 516
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 122/449 (27%), Positives = 185/449 (41%), Gaps = 71/449 (15%)
Query: 40 IVVHCSVSTTNDASRTKTTVTQNMIPWGCE-----------------------IDSLENA 76
+V C A R KTT+ + W E +D E A
Sbjct: 18 LVQTCGGFAATHAPRPKTTLAASDDSWAVELQQAAGGAEEKEAASWLKGKNAGVDGGERA 77
Query: 77 --STLQKWLSDS----------GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKL 117
L WL+D+ +PP MA+ E RGL+A + I + +L
Sbjct: 78 RNDALMAWLTDNDVWVSELSGWNVPPHSMALATTTFDELEGEDSGRGLLARRAITQDAEL 137
Query: 118 LFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
+ +P L +T S E L ++ +A LI E S S WS YI+ LP
Sbjct: 138 IRLPVRLCMTKASALKARELRGSLND-DTNEYIAIALLLILERSKGSRSFWSEYIAILPT 196
Query: 178 QP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
+ W EL YLE S + + + + L PE +F
Sbjct: 197 NEDVGATFTWPAEEL-AYLEGSPAASATASMMAKLRAEH----AAVLEGNSALDPE-IFT 250
Query: 236 METFKWSFGILFSRLVRL-PSMDGRV-ALVPWADMLNHSCEVETFLD---------YDKS 284
E ++W+F LFSR +RL S G + A+VP+ D +NHS +++D +++
Sbjct: 251 FEAWQWAFTNLFSRAIRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEK 310
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
VV DR Y+ EQVFISYG KSN +LLL YGF NP +SV+L + K D
Sbjct: 311 EDEVVLFADRAYKKFEQVFISYGPKSNADLLLLYGFALDR--NPFNSVDLAVGASKDDAL 368
Query: 345 YKEKLEALRKYGLS-ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMT 403
Y K R G +S FP+ +P EL+ + + + + + A +
Sbjct: 369 YDAKERFARGAGRDVSSAAFPLYADRFPDELVQFLRMACATE------DHLGARPLDDPD 422
Query: 404 SKKDIKCPEIDEQALQFILDSCESSISKY 432
+ DI + + L I D+C+++++ Y
Sbjct: 423 NYVDILSLDNELAVLDTIRDACDAAVAAY 451
>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 523
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 167/388 (43%), Gaps = 44/388 (11%)
Query: 5 SRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMI 64
S T H +LLP F L + P + C + C T +S + +
Sbjct: 29 SGTHHRLLLPCF--LRRLPQPGS------RSCSRLRLAACHADTLLSSSGAQGPPS---- 76
Query: 65 PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDV-GERG--------LVALKNIRKGE 115
P C S +A WL +GLPP K+AI + V RG + A +++ G+
Sbjct: 77 PAACL--SASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFVAAGQDLEAGD 134
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
VP SLV+T + E+L + + LA YL+ E + S W YI L
Sbjct: 135 VAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKEL 194
Query: 176 PRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIF 222
RQ S L WT +ELD YL S +R+ + R + YN+L +F
Sbjct: 195 DRQRGRGQLAVESPLLWTESELD-YLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLF 253
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL 279
+YP P E F E FK +F + S +V L S+ R ALVP +L + + L
Sbjct: 254 KQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAML 313
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
S V DR Y+ GE + + G + N LLL+YGFV + NP D + + SL
Sbjct: 314 TAVDGS--VRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLN 369
Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQI 367
D Y+EK ++ G A + F + +
Sbjct: 370 TEDPQYQEKRMVAQRNGKLAIQKFQVCV 397
>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 425
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 167/388 (43%), Gaps = 44/388 (11%)
Query: 5 SRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMI 64
S T H +LLP F L + P + C + C T +S + +
Sbjct: 24 SGTHHRLLLPCF--LRRLPQPGS------RSCSRLRLAACHADTLLSSSGAQGPPS---- 71
Query: 65 PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDV----GERG-----LVALKNIRKGE 115
P C S +A WL +GLPP K+AI + V G R + A +++ G+
Sbjct: 72 PAACL--SASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFVAAGQDLEAGD 129
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
VP SLV+T + E+L + + LA YL+ E + S W YI L
Sbjct: 130 VAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKEL 189
Query: 176 PRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIF 222
RQ S L WT +ELD YL S +R+ + R + YN+L +F
Sbjct: 190 DRQRGRGQLAVESPLLWTESELD-YLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLF 248
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL 279
+YP P E F E FK +F + S +V L S+ R ALVP +L + + L
Sbjct: 249 KQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAML 308
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
S V DR Y+ GE + + G + N LLL+YGFV + NP D + + SL
Sbjct: 309 TAVDGS--VRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLN 364
Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQI 367
D Y+EK ++ G A + F + +
Sbjct: 365 TEDPQYQEKRMVAQRNGKLAIQKFQVCV 392
>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
Length = 649
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 176/351 (50%), Gaps = 22/351 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
GLVA +++ +GE L VP L + AD+ + G V + P W +A L+ EA+
Sbjct: 243 GLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGRGGLRP-WVAVALLLLREAARG 301
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
S W+ Y++ LPRQ S ++W+ EL ++ +Q+ + V + + I S
Sbjct: 302 AGSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFESVEAEIIS 360
Query: 224 KYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF-LDY 281
+ +LFP V FN F W+FGIL SR+ D ++AL+P+AD++NHS ++ + +
Sbjct: 361 ENRELFPGTVTFN--DFLWAFGILRSRVFAELRGD-KLALIPFADLVNHSDDITSKESSW 417
Query: 282 DKSSQG-----VVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVE 333
+ +G VVF+ T + GEQ++I Y KSN EL L YGF E + D+
Sbjct: 418 EIKGKGLFGRDVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFT--ESNSSRDAYT 475
Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVSPPSMKGKFE 392
L L + +SD Y +KL+ G+ + F I + P +++ Y L+ + E
Sbjct: 476 LTLEISESDPFYDDKLDIAELNGMGETAYFDIVLGESLPPQMLPYLRLLCLGGTDAFLLE 535
Query: 393 EMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
A N + ++ + +E+A+ Q I ++C+S++ Y ++ E L
Sbjct: 536 ---ALFRNAVWGHLELPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEEL 583
>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
Length = 432
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 23/286 (8%)
Query: 82 WLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
W+ + GLPP K+++++ D+ R +VA ++++ G+ L VP SLV+T +
Sbjct: 3 WMLEQGLPPCKVSLKERDLNGKTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNETIA 62
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAELD 191
E+L + + LA YL+ E K S W +I L RQ S L WT ELD
Sbjct: 63 ELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELD 122
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGI 245
Y S+++E +ER+ + Y +L +F +YP P E F+ E FK +F
Sbjct: 123 EYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVA 182
Query: 246 LFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ S +V L S+ R ALVP +L + + L + V DR Y+ GEQ+
Sbjct: 183 VQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAML--KAAGDLVRLEVDRAYKKGEQI 240
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
+ G + N LLL+YGFV + NP D + + SL D Y+ K
Sbjct: 241 LVWCGPQPNTRLLLNYGFV--DPDNPHDRLSVEASLNTRDPFYQNK 284
>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
Length = 432
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 23/286 (8%)
Query: 82 WLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
W+ + GLPP K+++++ D+ R +VA ++++ G+ L VP SLV+T +
Sbjct: 3 WMLEQGLPPCKVSLKERDLNGKTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNETIA 62
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-------YSLLYWTRAELD 191
E+L + + LA YL+ E K S W +I L RQ S L WT ELD
Sbjct: 63 ELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELD 122
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGI 245
Y S+++E +ER+ + Y +L +F +YP P E F+ E FK +F
Sbjct: 123 EYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVA 182
Query: 246 LFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ S +V L S+ R ALVP +L + + L + V DR Y+ GEQ+
Sbjct: 183 VQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAML--KAAGDLVRLEVDRAYKKGEQI 240
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
+ G + N LLL+YGFV + NP D + + SL D Y+ K
Sbjct: 241 LVWCGPQPNTRLLLNYGFV--DPDNPHDRLSVEASLNTRDPFYQNK 284
>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
Length = 471
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 177/371 (47%), Gaps = 44/371 (11%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSK 131
S + WL SG + + +++ + RG+ L+ ++GEK+L +P ++ T
Sbjct: 7 SNMNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHA 62
Query: 132 WSCPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWT 186
++ P G L+ SV D LATY++ S E ++I+ALP S + +
Sbjct: 63 YADPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPASYSSSILFA 120
Query: 187 RAELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET-- 238
+L+ Y Q+ E++IE + L +R+F ++PDLFP + F +E
Sbjct: 121 EDDLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDVG 173
Query: 239 --FKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+KW+ ++SR + DG L P+ADMLNH+ EV+ YD SS + +
Sbjct: 174 LHYKWALCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSVFAGK 233
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
Y+ G+QVFI+YG N LL YGFV NP+DS +L LS +++K +
Sbjct: 234 DYEAGDQVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVS 291
Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
GL ++ P+ +T PL YL + G A + + T D K + +
Sbjct: 292 AGLDSTATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSN 345
Query: 415 E-QALQFILDS 424
E + L+F+++S
Sbjct: 346 EVEILRFLVES 356
>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
CQMa 102]
Length = 466
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 104/374 (27%), Positives = 169/374 (45%), Gaps = 24/374 (6%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL SG + + + V RG+ A + ++GE++L +P +L T +
Sbjct: 1 METWLEQSGAVGLDGLEVADFPVTGRGVKARRRFKQGERILTIPSALHWTVQHAQADSLL 60
Query: 138 GEVLKQCSVP--DWPLLATYLISEASFEKSSRW-SNYISALPRQPYSLLYWTRAELDRYL 194
G L+ P LA Y++ S E ++++ALP S +++T EL+
Sbjct: 61 GPALRSARPPLTVEDTLAVYVLFVRSRESGYNGPRSHVAALPTSYSSSIFFTEDELEVCA 120
Query: 195 EAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
S I ++ +RI + Y DL R+ PDLFP F + +KW+ ++SR +
Sbjct: 121 GTSLYTITKQLKQRIED---DYKDLIARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMD 177
Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
DG L P+ADMLNHS E + YD S+ + + Y+ G+QV+I YG
Sbjct: 178 FELYDGSSMRLLAPFADMLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHYGSIP 237
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N LL YGFV + NP+DS +L L+ +++K + GL A+ + +
Sbjct: 238 NSRLLRLYGFVIPD--NPNDSYDLVLATHPMAPFFEQKQKLWALAGLDATCTISLTLAN- 294
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCE 426
PL YL + + +E A N S D +I Q LQF+++S
Sbjct: 295 PLPKSVLCYLRIQ------RLDESDLAVINLQQSNTDTAFEKISNSNEVQVLQFLVESIT 348
Query: 427 SSISKYSRFLQVKE 440
S + + L+ E
Sbjct: 349 SLLDSFGTQLEKLE 362
>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 455
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 24/366 (6%)
Query: 79 LQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL DSG + + V G+ AL++ ++GE++L +P + + T + ++ P
Sbjct: 1 MEAWLKDSGARGVDGIEVANFAVTGSGVKALRSFKEGERILTIPSACLWTVEKAYADPLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
G VL+ SV D LA YL+ S S + R ++I+A+P+ + +++T EL
Sbjct: 61 GPVLRSAQPPLSVED--ALAVYLLFVRSRTSGYEGQR--HHIAAMPQSYSASIFFTEDEL 116
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
+ S + + V Y L + + S++ DLFP + F +E +KW+ ++SR
Sbjct: 117 -QVCAGSSLYALTRQLEQRVRDDYRQLLVPLLSQHRDLFPLDQFTIEDYKWALCSIWSRA 175
Query: 251 VRLP-SMDGRVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
+ S V LV P ADMLNHS +V+ YD +S + + YQ G+QVFI YG
Sbjct: 176 MDFAVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGDLSILAAKDYQVGDQVFIYYGS 235
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
N LL YGFV + NP+DS +L L Y++K GL ++ P+ +
Sbjct: 236 VPNNRLLRLYGFVLPD--NPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTVK 293
Query: 369 GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCES 427
PL YL + E + ++ + D K + +E Q LQF++DS S
Sbjct: 294 D-PLPNNVLRYLRIQRLD-----ESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGS 347
Query: 428 SISKYS 433
+ +
Sbjct: 348 LLEGFG 353
>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
Length = 470
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 183/380 (48%), Gaps = 34/380 (8%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL+ G + + + V RG+ L+ ++GE +L +P ++ + + +S P
Sbjct: 1 MESWLTKVGAVGLSDLELTDFPVTGRGVKTLRRFKQGEMILTIPSDVLWSVEHAYSDPNL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G L+ SV D +LATY++ S E ++SALP S ++++ EL+
Sbjct: 61 GPALRSVMPPLSVED--ILATYILFVRSRESGYDGLRTHVSALPGIYSSSIFFSEGELEV 118
Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET----------FK 240
S + ++ +RI + Y L +R+F+++PDLFP + F +E +K
Sbjct: 119 CAGTSLYTVTKQLEQRIKD---DYRQLAVRLFAQHPDLFPLQKFTIEDVRLLRRATDPYK 175
Query: 241 WSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
W+ ++SR + DG L P+ADMLNHS EV+ YD S + + Y+
Sbjct: 176 WALCTVWSRSMDFTLPDGSSIRLLAPFADMLNHSSEVKQCHAYDVKSGDLSVFAGKDYEI 235
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
G+QV+I YG N LL YGFV + NP+DS +L L+ Y++K + GL
Sbjct: 236 GDQVYIYYGPIPNNRLLRLYGFVIPD--NPNDSYDLVLTTHPMAPFYEQKQKLWVSAGLD 293
Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QA 417
++ + +T PL YL + + G AA + + D K + +E +
Sbjct: 294 STTTVALTLTD-PLPKNILRYLRIQRADVSG-----LAAITLQQIDGTDEKISDSNEMEI 347
Query: 418 LQFILDSCESSISKYSRFLQ 437
L+F+ +S S ++ ++ L+
Sbjct: 348 LRFLEESISSLLNCFATPLE 367
>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
24927]
Length = 468
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 176/357 (49%), Gaps = 19/357 (5%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL +SG + + + RGL +L++ ++GE++L +P S++ T + ++
Sbjct: 1 MESWLKESGAVGLDDLKLADFPATGRGLGSLRHFKEGERILTIPSSILWTVEHAYADSII 60
Query: 138 GEVLK--QCSVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYL 194
VL+ Q ++ LA Y++ S E + +++ ALP S +++T EL+
Sbjct: 61 RPVLQSMQGALSVDDTLAIYILFVRSRESGYNGLRSHVEALPTSYSSSIFFTDDELEVCA 120
Query: 195 EAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+S I ++ ++I + Y L R+F +Y D+F F +E +KW+ ++SR +
Sbjct: 121 GSSLYTITKQLKQQIQD---DYRTLVERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMD 177
Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
DG+ L P+ADMLNHS +V+ YD SS + + Y+PG+QVFI+YG
Sbjct: 178 FVQPDGKSIRLLAPFADMLNHSSDVKKCHVYDTSSGDLSILAGKDYEPGDQVFINYGSIP 237
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N LL YGFV NP+DS +L L + ++ K + GL + + +
Sbjct: 238 NNRLLRLYGFVV--PNNPNDSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISLSLND- 294
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS-CE 426
PL YL + + ++A A ++ + I +E+ LQ +++S CE
Sbjct: 295 PLPKSVLQYLRI----QRADESDLAIIALQQIDATDKILSNSNEEKVLQALIESFCE 347
>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
Length = 447
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/274 (32%), Positives = 134/274 (48%), Gaps = 28/274 (10%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
EN +L KW +G+ +K+ RG++A++ I E ++ VP L+ITA S
Sbjct: 48 ENYISLLKWAKRNGMVFKKIRPAIFSSTGRGMLAIERIHSSECVISVPERLLITASSVLE 107
Query: 134 CPEAGEVLKQC-----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
V ++ S D+ LL +L+ E EK S W+ YI LP + Y+TR
Sbjct: 108 SAIGNYVAERMKGGAKSSNDY-LLVLFLMYEKYLEKGSFWAPYIRTLPDTFNTPCYFTRK 166
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK-YPDLFPE------EVFNMETFKW 241
EL +L Q RE+A E++T + +Y + F+K Y D+ + + E+FKW
Sbjct: 167 EL--FLLPEQCREQAFEQVTQIKQSY-----KSFAKAYNDVLQDFDCNFWRTVDFESFKW 219
Query: 242 SFGILFSRLV-------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
++ ++ +R V R +DG AL P D+LNH + E ++ SS+
Sbjct: 220 AWCVVNTRSVYHDEPNRRAQPIDGNCALAPLLDLLNHCDKAEMCGRFNSSSKNYEINVIT 279
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFV-PREGTN 327
+YQ G QVFI+YG N L L YGFV PR N
Sbjct: 280 EYQKGTQVFINYGPHDNTRLFLEYGFVLPRNVHN 313
>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
206040]
Length = 463
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 185/368 (50%), Gaps = 29/368 (7%)
Query: 79 LQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL++SG + + + V RG+ + ++GE++L +P + T + S P
Sbjct: 1 MEDWLNESGAAGLNDLELAEFPVTGRGVRTRRRFQQGERILTIPGDSLWTVEHADSDPLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLI----SEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
G VL+ SV D LA YL+ E +E ++++A+P + S +++ E
Sbjct: 61 GPVLRSVQPPLSVED--TLAVYLLFVRLREHGYEGPR---SHVAAMPARYSSSIFFNEDE 115
Query: 190 LDRYLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
L+ S I ++ ERI + Y L +R+F+++PDL P +++ +KW+ ++
Sbjct: 116 LEVCAGTSLYTITKQLEERIED---DYRVLVMRVFTQHPDLLPLAKISIQDYKWALCTVW 172
Query: 248 SRLVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
SR + LP+ L P+ADM+NHS EV+ YD SS + + Y+ G+Q++IS
Sbjct: 173 SRAMDFVLPNGKPLRVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYIS 232
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG N LL YGFV E NP+DS +L LS Y++K + GL ++ P+
Sbjct: 233 YGSIPNNRLLRLYGFVIPE--NPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL 290
Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
+ PL YL + + ++AA A K+ + + I + + + LQF+++S
Sbjct: 291 TLID-PLPKSVLRYLRIQ----RLDASDLAAIALQKLDTNEKISNSK-EVEILQFLVESI 344
Query: 426 ESSISKYS 433
+ + ++
Sbjct: 345 SALLDGFN 352
>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
Length = 435
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 163/361 (45%), Gaps = 35/361 (9%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQ-------KVDVGERGLVALKNIRKGEKLLFVPPSL 124
++E S + +WL++SG P QK+ +Q +VD+ VA + ++ G+ L +P L
Sbjct: 2 TVERPSKMMQWLTESGAPQQKVKLQTVVREGTEVDI----TVAAEALQPGDVALRIPEHL 57
Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ------ 178
++T D E++ + + L YL E K W +I L R
Sbjct: 58 IVTLDRVLEDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQ 117
Query: 179 -PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPE 231
S L W + L S + R+ + Y +L +F++ P P
Sbjct: 118 GAKSPLLWDEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPT 177
Query: 232 EVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGV 288
E F+ F+ +F + S +V L ++ R ALVP +L +S + L +D S V
Sbjct: 178 EQFSFPVFRQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEV 237
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
DR YQPGE V G + N LL++YG V + +NP D + L +++ D Y+ K
Sbjct: 238 RLAVDRAYQPGEAVLAWCGPQPNSRLLINYGIV--DESNPYDKLPLSITIPSDDPLYRLK 295
Query: 349 LEALRKYGLSASECFPIQITG-WPLELMAYAYLVVSP--PSMKG-KFEEMAA--AASNKM 402
+ L + GLS + F +Q P +L+ Y LV S ++G K+EE A A N++
Sbjct: 296 RDRLAERGLSTQQTFQLQAAASLPAQLLPYLRLVHSTREADVEGVKWEEEAGPVAPENEL 355
Query: 403 T 403
T
Sbjct: 356 T 356
>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
Length = 424
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 83/250 (33%), Positives = 131/250 (52%), Gaps = 18/250 (7%)
Query: 81 KWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+WL D+G P + + + G RG VA + GE +L +P L+I+ D W P+ G
Sbjct: 16 QWLRDNGATFPKLQWPVTSPN-GLRGTVAAAAVASGEPMLCIPRRLLISEDLCWRDPQLG 74
Query: 139 EVL---KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
V + D P+LA +L+ E S + Y++ LP P S+ WT+AEL L
Sbjct: 75 RVFQDNRDVFTRDDPVLALFLVRELLLADRSFFHPYLAVLP-YPESVQDWTQAELGE-LH 132
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RL 253
++ + A R + + Y + +R+ +KYP FPE ++ + FK+++ + +R RL
Sbjct: 133 DERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQARTFGRRL 192
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF---TTDRQYQPGEQVFISYGKKS 310
P ALVP+AD LNH+ V T D+D + G+ + + G +VF SYG++S
Sbjct: 193 PW----TALVPFADCLNHT-NVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNSYGRRS 247
Query: 311 NGELLLSYGF 320
N +LLL YGF
Sbjct: 248 NFQLLLDYGF 257
>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
Length = 454
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 167/367 (45%), Gaps = 26/367 (7%)
Query: 79 LQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL SG + + RG+ AL+ + EK+L +P L+ T ++ P
Sbjct: 1 MEIWLKQSGASGLDDLELADFSDAGRGIRALRRFEEKEKILTIPHGLLWTVKRAYADPVL 60
Query: 138 GEVLKQC----SVPDWPLLATY-LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
G +L SV D LATY L A ++++ALP S +++ AEL+
Sbjct: 61 GPLLSSTRPPLSVDD--TLATYILFIRARKSGYDGPQSHVAALPASYSSSIFFADAELE- 117
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
S + + Y DL R+F ++ D+FP + F ++ +KW+ ++SR +
Sbjct: 118 ICAGSSLYTTTKHLARQIEVDYKDLVARLFGRHRDVFPSDKFTIDDYKWALCTVWSRAMD 177
Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
DG + P+ADMLNHS +V YD S + + Y+PG+QVFI+YG
Sbjct: 178 FKLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGNLSILAGKSYEPGDQVFINYGPIP 237
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N L YGFV NP+DS +L LS +++K + GL ++ + +T
Sbjct: 238 NNRLSRLYGFV--VPGNPNDSYDLVLSTHPMAPFFEQKHKLWIAAGLDSTSTVSLTLTD- 294
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCE 426
PL YL + + E AA T + D+ +I + + L F+++S
Sbjct: 295 PLPRSVLRYLRIQ------RLNETDLAAVG--TRQSDVAFEKISDSNETEVLTFLVESIS 346
Query: 427 SSISKYS 433
+ + ++
Sbjct: 347 ALLDGFT 353
>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
Length = 400
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 135/259 (52%), Gaps = 15/259 (5%)
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP-DWPLLATYLISEA 160
+RGL A ++IR GE+++ +P LV+TA+ C V K S DW L +++E
Sbjct: 9 KRGLFAARSIRAGEQIVRIPHELVLTAEKLDDC-----VKKLLSTEYDWCPLTLLILAEQ 63
Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
++SRW+ Y+S LP +S ++W + EL ++LE ++ ER + YN ++
Sbjct: 64 HKGEASRWAPYVSCLPSFGDHHSTIFWGKEEL-KFLECTRAFRGTAERREMISDEYNSVK 122
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF 278
+ S P +F E++ ++ F ++ + SR ++ +++ P+ D NH
Sbjct: 123 -DVISSCPHVFGEDI-SLFQFAHAYATVVSRAWN-GALSSEISMRPFVDFCNHDPVSHAT 179
Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ +D + +R Y GE+VFISYGK+SN L + YGFV N SD EL + +
Sbjct: 180 VSHDTCKDATIIA-ERDYTKGEEVFISYGKRSNAVLAVDYGFVL--PNNLSDQAELWMEI 236
Query: 339 KKSDKCYKEKLEALRKYGL 357
+D ++KLE +R + +
Sbjct: 237 PWNDPLREKKLELMRAFNM 255
>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 161/345 (46%), Gaps = 27/345 (7%)
Query: 37 GHRIVVHCSVSTTNDAS--RTKTTVTQNMIP-WGCEIDSLENASTLQKWLSDSGLPPQKM 93
G ++ H S ++ S K ++ N + G E+ T +WL + G+ K
Sbjct: 10 GSAVLFHSRNSFSSKGSFLHLKRPLSANCVASLGTEVSVSPAVDTFWQWLKEEGVVSGKT 69
Query: 94 AIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-VPDWPL 151
++ V E GLVALK+I + E +L VP L I D+ A E+ K CS + W
Sbjct: 70 PVKPGVVPEGLGLVALKDISRNEVVLQVPKRLWINPDA----VAASEIGKVCSGLKPWLA 125
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
+A +LI E S S W +Y S LP++ S +YW+ EL L+ +Q+ V
Sbjct: 126 VALFLIRERS-RSDSLWKHYFSILPKETDSTIYWSEEELSE-LQGTQLLNTTRSVKQYVQ 183
Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLPSMDGRVALVPWADMLN 270
+ L I LFP + ++ F W+FGIL SR RL + + + ++P AD++N
Sbjct: 184 NEFRRLEEEIIIPNKKLFPSSI-TLDDFFWAFGILRSRAFSRLRNEN--LVVIPLADLIN 240
Query: 271 HSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFISYG-KKSNGELLLSYGF 320
HS V T Y+ +F+ D + + G+QV+I Y KSN EL L YGF
Sbjct: 241 HSARVTTDDHAYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGF 300
Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
+ E ++ L L + +SD + +KL+ G + F I
Sbjct: 301 I--EPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDI 343
>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
C-169]
Length = 539
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/318 (28%), Positives = 144/318 (45%), Gaps = 22/318 (6%)
Query: 84 SDSGLPPQKMAIQKVDVGERGL---VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
+D G P Q ++I++V L VA ++++ GE L +P LVIT D + E+
Sbjct: 23 TDHGAPQQGVSIKEVVQEGNTLDVSVAARDLQAGELALRIPDHLVITLDRVFEDESLAEL 82
Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-------QPYSLLYWTRAELDRY 193
L + + L YL+ E + S W +I L R S L W ++D Y
Sbjct: 83 LTTDKLSELACLTLYLMYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEY 142
Query: 194 LEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILF 247
L S + ER+ + Y +L +F YP P E F+++ F+ F +
Sbjct: 143 LAGSPLVAEIKERLKGIEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQ 202
Query: 248 SRLVRLPS--MDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ +V L + R ALVP +L++S + L Y++ ++ V DR Y GE +
Sbjct: 203 ASVVHLQGVPLSKRFALVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEA 262
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
G + N LLL+YG V NP D + L ++L +D ++ K L++ LS + F
Sbjct: 263 WCGPQPNRRLLLNYGIVT--DNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQ 320
Query: 365 IQ-ITGWPLELMAYAYLV 381
+Q G P L+ Y L
Sbjct: 321 LQRDKGLPELLLPYLRLA 338
>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 174/352 (49%), Gaps = 43/352 (12%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWS------CPEAGEVLKQCSVPDWPLLATYLI 157
GLVA +N+ +GE + VP L + AD+ + C G++ W ++ ++
Sbjct: 42 GLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRP------WVSVSLLIL 95
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
EA+ S W+ Y++ LPRQ S ++W+ EL ++ +Q+ + V ++++
Sbjct: 96 REAARGGDSLWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFDNV 154
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVE 276
I + DLFP + + F W+FG+L SR+ P + G ++AL+P+AD++NH+ ++
Sbjct: 155 EAGIINVNKDLFPGTI-TFDDFLWAFGVLRSRV--FPELRGDKLALIPFADLINHNGDIT 211
Query: 277 T-----------FLDYDKSSQGVVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVP 322
+ FL D VF+ T + GEQ+++ Y KSN EL L YGF
Sbjct: 212 SKESCWEIKGKGFLGRD-----TVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFT- 265
Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLV 381
E + DS L L + +SD Y++KL+ G+ + F + + P +++ Y L+
Sbjct: 266 -ESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLL 324
Query: 382 VSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKY 432
+ E A NK+ ++ +E+++ Q I ++C+S+++ Y
Sbjct: 325 CLGGTDAFLLE---ALFRNKVWEHLELPVSRDNEESICQVIQNACKSALAAY 373
>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Glycine
max]
Length = 487
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 148/312 (47%), Gaps = 24/312 (7%)
Query: 67 GCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLV 125
G E+ T +WL + G+ K ++ V E GLVALK+I + E +L VP L
Sbjct: 43 GTEVSVSPAVDTFWQWLKEEGVVSAKTPVKPSVVPEGLGLVALKDISRNEVVLQVPKRLW 102
Query: 126 ITADSKWSCPEAGEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
I D+ A E+ K C + W +A +LI E S +S W +Y S LP++ S +Y
Sbjct: 103 INPDA----VAASEIGKVCIGLKPWLAVALFLIRERS-RSNSLWKHYFSVLPKETDSTIY 157
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
W+ EL L+ +Q+ V Y L I LFP + ++ F W+FG
Sbjct: 158 WSEEELSE-LQGTQLLNTTRSVKQYVENEYRRLEEEIILPNKKLFPSPL-TLDDFFWAFG 215
Query: 245 ILFSR-LVRLPSMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY------ 296
IL SR RL + + + ++P+AD +NHS V T Y+ +F+ D +
Sbjct: 216 ILRSRAFSRLRNEN--LVVIPFADFINHSARVTTEDHAYEIKGAAGLFSWDYLFSLRSPL 273
Query: 297 --QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ G+QV+I Y KSN EL L YGF+ E ++ L L + +SD + +KL+
Sbjct: 274 SLKAGDQVYIQYDLNKSNAELALDYGFI--EPNADRNAYTLTLQISESDPFFGDKLDIAE 331
Query: 354 KYGLSASECFPI 365
G + F I
Sbjct: 332 SNGFGETAYFDI 343
>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
Length = 512
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 96/311 (30%), Positives = 142/311 (45%), Gaps = 30/311 (9%)
Query: 82 WLSDSGLPPQKMAIQK---------VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
WL GLPP K+ I++ D+ R + A +++ G+ VP SLV+T +
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E K S W YI L R S L W
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
T +ELD YL S +++ + R + YN+L +F +YP P E F E F
Sbjct: 201 TESELD-YLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 259
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L + + L D S V DR Y
Sbjct: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDS--VRLVVDRPY 317
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE + I G ++N L+L+YGFV + NP D + + SL D Y+EK ++ G
Sbjct: 318 KAGEPIIIWCGPQTNSRLVLNYGFVDED--NPFDRIAIEASLNSEDPQYQEKRMVAQRNG 375
Query: 357 LSASECFPIQI 367
A + F + +
Sbjct: 376 KLAIQNFNVYV 386
>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
Length = 389
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 57/378 (15%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL A + I GE +L V L+IT + PE L V W LA +L++
Sbjct: 1 RGLFASRPIHTGECMLHVSHDLMITPEK---LPEEVTKLLSKDVSAWAKLALFLLAHQKK 57
Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNV----IGTYND 216
+++S W+ YIS LP +S ++WT+ EL YL+ S + ++R V N
Sbjct: 58 KETSAWAPYISCLPPFGSMHSTIFWTQDEL-VYLKVSPVYRETVQRKDVVRMEFAAAENA 116
Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
L L P +F V +E FK ++ + SR + ++ +ALVP+ D NH
Sbjct: 117 LLL-----CPHIFGSRVSALE-FKHAYATVCSRAWGIETIKS-LALVPFVDFFNHDANCR 169
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVE-L 334
L YD+ +DR Y G+QV ISYG+ SN L L +GF +P NP D V +
Sbjct: 170 AMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALP---FNPHDQVAGI 226
Query: 335 PLSLKKSDKCYKEKLEALRKYGL----------SASECFPIQIT--------GWPLELMA 376
LSL + D KL+ L + + +A F +Q G P L A
Sbjct: 227 WLSLSEKDPLRDSKLKLLHSHNMQTCVTREGVDTAGSSFSLQEVKSKAGRGKGIPQTLRA 286
Query: 377 YAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCESSISKY 432
+A +V + S + +EMA A++ T + + P ID+ +A+ + ++ I K+
Sbjct: 287 FARVVCATTS--EELDEMAKFAAD--TDGRLARRPSIDKTKEHKAMTLLQTVIDNRIQKH 342
Query: 433 SR---------FLQVKEL 441
+ L VKE+
Sbjct: 343 EQAALVLVSLFLLMVKEI 360
>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
Length = 514
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 149/316 (47%), Gaps = 27/316 (8%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
+++ L+ W+ +GLPP K+ +++ ++ + A ++++KG+ VP SLV+T
Sbjct: 78 DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
S L W+ AELD YL S + +ER + YN+L +F +YP P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256
Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
+ E FK +F + S +V L + + R ALVP +L + + L V
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 314
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
DR Y+ G+ + + G + N +LLL+YGFV + NP D V + +L D Y++K
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVIVEAALNTEDPQYQDKRMV 372
Query: 352 LRKYGLSASECFPIQI 367
++ G + + F +++
Sbjct: 373 AQRNGKLSQQVFQVRV 388
>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase I [Spinacia oleracea]
Length = 491
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 116/389 (29%), Positives = 174/389 (44%), Gaps = 29/389 (7%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT 127
E D+ WLSD G+ K ++ V E GLVA K+I + E +L VP I
Sbjct: 49 ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108
Query: 128 ADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
D+ A E+ C+ + W +A +L+ E SS W YI LP S +YW+
Sbjct: 109 PDTV----AASEIGSVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWS 164
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
EL L+ SQ+ + V + L + + LFP +V + F W+FG+L
Sbjct: 165 EEELSE-LQGSQLLNTTLGVKELVANEFAKLEEEVLVPHKQLFPFDV-TQDDFFWAFGML 222
Query: 247 FSRLVRLPSMDGR-VALVPWADMLNHSCEVETFLDYDKSSQG-------VVFTTDRQYQP 298
SR ++G+ + L+P AD+ NHS ++ T Y +G +VF+ R P
Sbjct: 223 RSR--AFTCLEGQSLVLIPLADLANHSPDI-TAPKYAWEIRGAGLFSRELVFSL-RNPTP 278
Query: 299 ---GEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
G+QV I Y KSN EL L YG E + ++ L L + +SD Y +KL+
Sbjct: 279 VKAGDQVLIQYDLNKSNAELALDYGLT--ESRSERNAYTLTLEIPESDSFYGDKLDIAES 336
Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC-PEI 413
G+ S F I + PL YL + + F + + N + D+ P
Sbjct: 337 NGMGESAYFDI-VLEQPLPANMLPYLRLVALGGEDAF-LLESIFRNSIWGHLDLPISPAN 394
Query: 414 DEQALQFILDSCESSISKYSRFLQVKELL 442
+E Q I D+C S++S YS + E L
Sbjct: 395 EELICQVIRDACTSALSGYSTTIAEDEKL 423
>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
Y34]
Length = 464
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 176/358 (49%), Gaps = 27/358 (7%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL ++G + + + + RG+ L++ ++GEK+L +P + T + +
Sbjct: 1 MESWLKETGAVGLDDLELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G L+ SV D +LATY++ S E ++++ALP S +++ EL+
Sbjct: 61 GPALRSIRPPLSVED--ILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAGEELEV 118
Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
S I ++ +RI + Y L +R+ ++ DLFP E F +E +KW+ ++SR
Sbjct: 119 CAGTSLYTITKQLEQRIED---DYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRA 175
Query: 251 VR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
+ LP + L P+ADMLNHS V+ YD SS+ + + Y+ G+QVFI YG
Sbjct: 176 MDFVLPGGNSIRLLAPFADMLNHSDNVKQCHAYDSSSKTLSVLAGKDYEAGDQVFIYYGP 235
Query: 309 KSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
SN LL YGFV P N +D+ +L L+ + KL+ L ++ + +
Sbjct: 236 VSNSRLLRLYGFVLP---GNSNDNYDLVLATHPEAPFFARKLKLWASARLDSTSTISLTL 292
Query: 368 TG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
T P +++ Y + S S E+A A ++ + + I + + L+F+++S
Sbjct: 293 TDPLPNDVLRYLRIQRSGAS------ELAGMACQRIDATEKIS-DSNEVEVLRFLVES 343
>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
Precursor
gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
N-methyltransferase [Pisum sativum]
Length = 489
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 26/302 (8%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
T KWL + G+ K ++ V E GLVALK+I + + +L VP L I D+
Sbjct: 56 TFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA----VA 111
Query: 137 AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
A E+ + CS + W + +LI E S E S W +Y LP++ S +YW+ EL L+
Sbjct: 112 ASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE-LQ 169
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLP 254
SQ+ + + V L I LFP+ V ++ F W+FGIL SR RL
Sbjct: 170 GSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSRLR 228
Query: 255 SMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFIS 305
+ + + +VP AD++NHS V T Y+ +F+ D + + GEQV+I
Sbjct: 229 NEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQ 286
Query: 306 YG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
Y KSN EL L YGF+ P E + + L L + +SD + +KL+ G + + F
Sbjct: 287 YDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTAYF 343
Query: 364 PI 365
I
Sbjct: 344 DI 345
>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
Length = 471
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 153/321 (47%), Gaps = 24/321 (7%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL SG + + + V RG+ L++ ++GE++L +P ++ T + ++
Sbjct: 1 MEGWLKQSGAVGLDVLELADFQVIGRGVKTLRHFKEGERILTIPSDVLWTVEHAYADSLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G L SV D LATY++ S E + ++++ALP+ S +++T EL+
Sbjct: 61 GPTLHSARPPLSVDD--TLATYILFVRSRESGYNGLRSHLAALPKSYSSSIFFTEDELEV 118
Query: 193 YLEASQIRERAIERITNVIG-----TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
S + IT +G Y L +R+ ++ DLFP F +E +KW+ ++
Sbjct: 119 CTGTS------LYAITKQLGRCIQDDYKALVVRLLIQHRDLFPLSKFTIEDYKWALCTVW 172
Query: 248 SRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
SR + DG+ L P+ADMLNHS +V YD S + + Y+ G+QVFI
Sbjct: 173 SRAMDFVLPDGKSIRLLAPFADMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIY 232
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG N LL YGF+ +NP+D+ EL L +++K + GL + +
Sbjct: 233 YGSIPNNRLLRLYGFIIP--SNPNDNYELVLETHPMAPFFEQKHKLWESAGLDLTSTISL 290
Query: 366 QITGWPLELMAYAYLVVSPPS 386
+T PL YL + S
Sbjct: 291 TLTD-PLPKNVLQYLRIQRSS 310
>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
Length = 500
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 152/309 (49%), Gaps = 20/309 (6%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KWL D G+ + I+K +VG GL A+K+I+ E + +P L++T ++ G +
Sbjct: 97 KWLEDHGVKSDAVTIEKFEVGGYGLKAVKDIKAEELFITIPRKLMLTTETARES-SLGPL 155
Query: 141 LKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
+K+ V LA +++ E + +S W+ YI+ P + LY+ E+ +L+ S
Sbjct: 156 IKKDRILQVMANVSLALHVLCE-KYSSNSFWAPYINIFPGTYTTPLYFEEGEM-LHLQGS 213
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLP 254
+ + ++ Y ++F P+ L +E F + ++W+ + +R ++P
Sbjct: 214 LNFSDVLNQYKSIARQYAYF-YKLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQVP 272
Query: 255 SMDGR---VALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ DGR AL+P DM NHS EV T +++ S R++ QV+I YG +S
Sbjct: 273 TSDGRHLITALIPMWDMCNHSNGEVST--EFNLGSDSAECLAMREFPTDSQVYIFYGMRS 330
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N E L+ GFV E N D V + L + K+D + K E L + G+ AS F +
Sbjct: 331 NAEFLIHNGFVYPE--NVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKD 388
Query: 371 PL--ELMAY 377
P+ EL+ +
Sbjct: 389 PIPPELLVF 397
>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
Adohcy
gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
Monomethyllysine
Length = 440
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 26/302 (8%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
T KWL + G+ K ++ V E GLVALK+I + + +L VP L I D+
Sbjct: 8 TFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA----VA 63
Query: 137 AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
A E+ + CS + W + +LI E S E S W +Y LP++ S +YW+ EL L+
Sbjct: 64 ASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE-LQ 121
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLP 254
SQ+ + + V L I LFP+ V ++ F W+FGIL SR RL
Sbjct: 122 GSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSRLR 180
Query: 255 SMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFIS 305
+ + + +VP AD++NHS V T Y+ +F+ D + + GEQV+I
Sbjct: 181 NEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQ 238
Query: 306 YG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
Y KSN EL L YGF+ P E + + L L + +SD + +KL+ G + + F
Sbjct: 239 YDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTAYF 295
Query: 364 PI 365
I
Sbjct: 296 DI 297
>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
Methyltransferase
gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Lysine And Adohcy
gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
Melysine And Adohcy
Length = 444
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 148/304 (48%), Gaps = 26/304 (8%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
T KWL + G+ K ++ V E GLVALK+I + + +L VP L I D+
Sbjct: 10 VQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA---- 65
Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
A E+ + CS + W + +LI E S E S W +Y LP++ S +YW+ EL
Sbjct: 66 VAASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE- 123
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
L+ SQ+ + + V L I LFP+ V ++ F W+FGIL SR R
Sbjct: 124 LQGSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSR 182
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTDRQY--------QPGEQVF 303
L + + + +VP AD++NHS V T Y+ +F+ D + + GEQV+
Sbjct: 183 LRNEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVY 240
Query: 304 ISYG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
I Y KSN EL L YGF+ P E + + L L + +SD + +KL+ G + +
Sbjct: 241 IQYDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTA 297
Query: 362 CFPI 365
F I
Sbjct: 298 YFDI 301
>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
Length = 479
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 111/397 (27%), Positives = 181/397 (45%), Gaps = 62/397 (15%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL SG + + + V RG+ AL++ +KGE++L +P + + TA++ + P
Sbjct: 1 MEAWLKHSGAVGVDAIEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARADPLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNY------ISALPRQPYSLLYWTR 187
G VL+ SV D LA +L+ F KS R + Y I+A+P++ + +++
Sbjct: 61 GPVLRSAQPPLSVED--TLAIHLL----FVKS-RTAGYEGQRLHIAAMPQRHSASIFFAE 113
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME---------- 237
EL + E S + + V + L +++ S++ DLFP + F +E
Sbjct: 114 DEL-QVCEGSSLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDVSYIAAFPR 172
Query: 238 --------------TFKWSFGILFSRLVRLPSMDG-RVALV-PWADMLNHSCEVETFLDY 281
+KW+ ++SR + D V LV P ADMLNHS +V+ Y
Sbjct: 173 PTRSISLMNLYFPFQYKWALCTIWSRAMDFAVSDTTSVRLVAPLADMLNHSLDVKQCHAY 232
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
D +S + + YQ G+Q+FI YG N LL YGFV + NP+DS +L L
Sbjct: 233 DPTSGDLSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLD--NPNDSYDLVLQTSPM 290
Query: 342 DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNK 401
Y++K GL ++ P+ PL YL + + + AA
Sbjct: 291 APLYEQKERLWALAGLDSTCTIPL-TAKHPLPKNVLRYL---------RTQRLDAADVAD 340
Query: 402 MT----SKKDIKCPEIDE-QALQFILDSCESSISKYS 433
MT + D K + +E Q LQF++DS S + +
Sbjct: 341 MTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFG 377
>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
methyltransferase; Short=rbcMT; Flags: Precursor
gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Nicotiana tabacum]
Length = 491
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 183/383 (47%), Gaps = 31/383 (8%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
T +WL G+ K ++ V E GLVA ++I KGE +L VP I D+ +
Sbjct: 57 VQTFWQWLCKEGVVTTKTPVKPGIVPEGLGLVAKRDIAKGETVLQVPKRFWINPDAV-AE 115
Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
E G V CS + W +A +L+ E + S+W Y+ LP+ S +YW+ EL
Sbjct: 116 SEIGNV---CSGLKPWISVALFLLRE-KWRDDSKWKYYMDVLPKSTDSTIYWSEEELSE- 170
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
++ +Q+ + V + + + + LFP + ++ F W+FGIL SR R
Sbjct: 171 IQGTQLLSTTMSVKDYVQNEFQKVEEEVILRNKQLFPFPI-TLDDFFWAFGILRSRAFSR 229
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG--VVFTTDRQY--------QPGEQV 302
L + + + LVP+AD+ NH+ V T D+ +G +F+ D + + G+Q+
Sbjct: 230 LRNQN--LILVPFADLTNHNARVTT-EDHAHEVRGPAGLFSWDLLFSLRSPLKLKAGDQL 286
Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
FI Y KSN ++ L YGF+ E ++ D+ L L + +SD+ Y +KL+ G+ +
Sbjct: 287 FIQYDLNKSNADMALDYGFI--EPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETA 344
Query: 362 CFPIQI-TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
F I+I P ++ Y LV + E + N + + +E+ + +
Sbjct: 345 YFDIKIGQSLPPTMIPYLRLVALGGTDAFLLESI---FRNSVWGHLGLPVSRANEELICK 401
Query: 420 FILDSCESSISKYSRFLQVKELL 442
+ D+C+S++S Y ++ E L
Sbjct: 402 VVRDACKSALSGYHTTIEEDEKL 424
>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 481
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 169/378 (44%), Gaps = 42/378 (11%)
Query: 79 LQKWLSDSG---LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
++ WL+DSG LP K+A + RG+ A + GE++L +P + T + ++
Sbjct: 1 MEAWLNDSGALGLPHLKVA--DFALTGRGVQAQRAFSAGERILTIPAQCLWTVEHAYADR 58
Query: 136 EAGEVLKQC----SVPDWPLLATYLI-------SEASFEKSSRWSNYISALPRQPYSLLY 184
G VL+ SV D L L+ + ++E R +++ LP + ++
Sbjct: 59 LLGPVLRALQPPLSVEDTLALHILLVRARRRPDDDGAYEAGRR--SHVDVLPDRYTMSIF 116
Query: 185 WTRAELD------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
++ E+ Y +Q+R R + Y L R+ ++ +LFP F +E
Sbjct: 117 FSDEEMQVCKGSSLYTLTTQLRGR-------IGDDYKKLLTRVLMRHRNLFPLSKFGIEH 169
Query: 239 FKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
+KW+ ++SR + +G L P+ADMLNHS +V+ YD ++ + + Y
Sbjct: 170 YKWALCTVWSRGMDFTVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGDLSILASKDY 229
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
G+QVFI YG N LL YGFV E NP DS +L L Y++K + G
Sbjct: 230 NVGDQVFIYYGPVPNNRLLRLYGFVLPE--NPHDSYDLVLQTSPMAPLYEQKERLWKLAG 287
Query: 357 LSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQ 416
L + P+ PL YL + E + A + ++ + D K + E
Sbjct: 288 LDTACTIPLTAND-PLPRSVLRYLRIQRLD-----ESLLGAMTMQIATGADEKISDDSET 341
Query: 417 -ALQFILDSCESSISKYS 433
LQF++DS + + +S
Sbjct: 342 LILQFLIDSISAILEGFS 359
>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 516
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 27/316 (8%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGE------RGLVALKNIRKGEKLLFVPPSLVIT 127
+++ L+ W+ +GLPP K+ +++ + + A ++++KG+ VP SLV+T
Sbjct: 80 DDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIHYVAASEDLQKGDVAFSVPDSLVVT 139
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 140 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 199
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
S L W+ AELD YL S + +ER + YN+L +F +YP P E F
Sbjct: 200 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 258
Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
+ E FK +F + S +V L + + R ALVP +L + + L V
Sbjct: 259 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 316
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
DR Y+ G+ + + G + N +LLL+YGFV + NP D + + +L D Y++K
Sbjct: 317 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRIIVEAALNTEDPQYQDKRMV 374
Query: 352 LRKYGLSASECFPIQI 367
++ G + + F +++
Sbjct: 375 AQRNGKLSQQVFQVRV 390
>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Brachypodium distachyon]
Length = 516
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 142/312 (45%), Gaps = 30/312 (9%)
Query: 81 KWLSDSGLPPQKMAIQKVDV-GERG--------LVALKNIRKGEKLLFVPPSLVITADSK 131
+WL GLPP K+AI + V RG + A +++ G+ +P SLV+T +
Sbjct: 84 EWLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERV 143
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLY 184
E+L + + LA YL+ E K S W YI L RQ S L
Sbjct: 144 LGDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLL 203
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMET 238
WT +ELD YL S +R+ + R + YN+L +F +YP P E F E
Sbjct: 204 WTESELD-YLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEI 262
Query: 239 FKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
FK +F + S +V L S+ R ALVP +L + + L S V DR
Sbjct: 263 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDS--VRLVVDRP 320
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
Y+ GE + + G + N LLL+YGFV + NP D + + SL D Y+EK ++
Sbjct: 321 YKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLNMEDPQYQEKRMVAQRN 378
Query: 356 GLSASECFPIQI 367
G A + F + +
Sbjct: 379 GKLAIQKFQVCV 390
>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 183/383 (47%), Gaps = 29/383 (7%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWS 133
N T +W+ G+ K ++ E GL KN+ K E +L VP I D+ +
Sbjct: 69 NVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAV-A 127
Query: 134 CPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
E G V CS + W +A +LI E + + SRW Y+ LP++ S ++W+ EL
Sbjct: 128 DSEIGNV---CSGLKPWISVALFLIRE-NLKGDSRWRRYLDILPQETDSTVFWSEEELAE 183
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LV 251
++ +Q+ + V + + I ++ DLFP + ++ F W+FGIL SR
Sbjct: 184 -IQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRI-TLDDFFWAFGILRSRAFS 241
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTDRQY--------QPGEQV 302
RL + + L+P+AD++NHS V T ++ +F+ D + + G+QV
Sbjct: 242 RLRGQN--LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQV 299
Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
FI Y KKSN +L L YGF+ E + ++ L L + +SD + +KL+ GL+ +
Sbjct: 300 FIQYDLKKSNADLALDYGFI--EQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTA 357
Query: 362 CFPIQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
F I + +P ++ + L+ + E + N + ++ +E+ + Q
Sbjct: 358 YFDIILERPFPPAMLPFLRLLALGGTDAFLLESL---FRNSVWGHLEMPVSRANEELICQ 414
Query: 420 FILDSCESSISKYSRFLQVKELL 442
+ ++CE+++S Y ++ E L
Sbjct: 415 VVRNACEAALSGYHTTIEEDEKL 437
>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 395
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 159/353 (45%), Gaps = 35/353 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A + I GE++L V L+IT + P + L V +W LA +++ E
Sbjct: 1 RTLFAARPIEVGEQVLRVSGDLMITPNK---LPTEVKELLPTGVTEWARLALFILVEQHL 57
Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
++S+W+ YI+ LP +S ++W + EL+ S RE R VIG+ L
Sbjct: 58 GQASQWAPYINCLPTCGALHSTVFWKKEELELVRFTSLHRETMQRRA--VIGSEFASVLP 115
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ K P +F E V + + FK ++ S L R S + R+ VP+ D NH L
Sbjct: 116 VLQKCPHIFGERVLHSK-FKQAYATGKS-LRR--SSNTRILTVPFVDFFNHDSNCRALLS 171
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
YD+ D+ Y GEQV ISYG+ N L L +GF NP D VE+ ++L
Sbjct: 172 YDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTI--SCNPYDQVEVWMALSH 229
Query: 341 SDKCYKEKLEALRKYGL----------SASECFPIQ----IT----GWPLELMAYAYLVV 382
D K KL L +G+ S F ++ +T G P L A+A ++
Sbjct: 230 RDPLRKMKLALLHAHGMPTVVHADGSDSGGNGFHLREVKSVTGRGKGIPHALRAFARVLC 289
Query: 383 SPPSMKGKFEEMAAAAS--NKMTSKKDIKCPEIDEQALQFILDSCESSISKYS 433
+ + + EMAA A + ++ K + Q + +L ES I++ S
Sbjct: 290 A--TTPQELSEMAAEAMKYDGRLARLPAKSRNKEAQVMNLLLARLESLINQRS 340
>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Guillardia theta]
Length = 460
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 145/294 (49%), Gaps = 16/294 (5%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL+A +NI K EK++ + +L+ E + ++ LA L+ E
Sbjct: 100 RGLIASRNILKNEKIIEISENLMFDK------FEHNLEINSNGSDNYSDLAIKLLVELFK 153
Query: 163 EKSSRWSNYISALPRQ-PYSLLY-WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
K S W YI LP + LL+ W EL +++ S++ + + + Y +
Sbjct: 154 NKKSFWFPYIGILPEEYDLKLLFRWPLKEL-FFIKGSRLSKASDYLKKKLKAQYEMVNKE 212
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+F + L+P ++FN + ++WS IL SR + L +V L+P+ D+LNH+ +F+
Sbjct: 213 VFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQET-KKVVLIPYIDLLNHNPFSSSFIS 271
Query: 281 YDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
Y K S+ +V +D+ +Q++ISYG+KSN ELL YGF+ NP DSV + +
Sbjct: 272 YRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAER--NPYDSVIIRI 329
Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
S+ D +KEK L +PI + +P E++ + + + ++ K
Sbjct: 330 SMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPDEMIEFIKICLFNTNINDK 383
>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
Length = 502
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 165/375 (44%), Gaps = 36/375 (9%)
Query: 14 PSFSHLHKAQSPAGFTDFPRKRCG-HRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDS 72
PS + L + F++ P++ HR + +T D RT V++ G E D
Sbjct: 13 PSLTVLSRVS--ISFSNLPKRAVSFHRRRRNLCFATLVDGKRTSEVVSKR---GGEEEDE 67
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKL------LFVPPSLVI 126
+ L+ W+ +GLPP K+ +++ ++ L + + E L + VP SLV+
Sbjct: 68 FGD---LKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVAASEDLQASDVAVSVPNSLVV 124
Query: 127 TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------P 179
T + E+L + + LA YL+ E K S W YI L RQ
Sbjct: 125 TLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAV 184
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEV 233
S L W+ AEL YL S + ++R + Y +L +F +YP P E
Sbjct: 185 ESPLLWSEAEL-AYLTGSPTKAEVLDRADGIKREYEELDTVWFMAGSLFQQYPYDIPTEA 243
Query: 234 FNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVF 290
F E FK +F + S +V L S+ R ALVP +L +S + L V
Sbjct: 244 FPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYSSNCKAMLT--AVDGAVEL 301
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
DR Y+ GE + + G + N +LLL+YGFV + NP D + + +L D Y++K
Sbjct: 302 VVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDED--NPYDRIAVEAALNTEDPQYQDKRM 359
Query: 351 ALRKYGLSASECFPI 365
++ G + + F +
Sbjct: 360 VAQRNGKLSVQVFQV 374
>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 503
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 182/383 (47%), Gaps = 29/383 (7%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWS 133
N T +W+ G+ K ++ E GL KN+ K E +L VP I D+ +
Sbjct: 69 NVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAV-A 127
Query: 134 CPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
E G V CS + W +A +LI E + + SRW Y+ LP++ S ++W+ EL
Sbjct: 128 DSEIGNV---CSGLKPWISVALFLIRE-NLKGDSRWRRYLDILPQETDSTVFWSEEELAE 183
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LV 251
++ +Q+ + V + + I ++ DLFP + ++ F W+FGIL SR
Sbjct: 184 -IQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRI-TLDDFFWAFGILRSRAFS 241
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTD--------RQYQPGEQV 302
RL + + L+P+AD++NHS V T ++ +F+ D + G+QV
Sbjct: 242 RLRGQN--LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQV 299
Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
FI Y KKSN +L L YGF+ E + ++ L L + +SD + +KL+ GL+ +
Sbjct: 300 FIQYDLKKSNADLALDYGFI--EQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTA 357
Query: 362 CFPIQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
F I + +P ++ + L+ + E + N + ++ +E+ + Q
Sbjct: 358 YFDIILERPFPPAMLPFLRLLALGGTDAFLLESL---FRNSVWGHLEMPVSRANEELICQ 414
Query: 420 FILDSCESSISKYSRFLQVKELL 442
+ ++CE+++S Y ++ E L
Sbjct: 415 VVRNACEAALSGYHTTIEEDEKL 437
>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
Length = 442
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 155/321 (48%), Gaps = 41/321 (12%)
Query: 60 TQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLL 118
T++++P+ D ++ +WL+ G+ Q + G RGL+A ++ GE +L
Sbjct: 32 TEDVVPFHDREDLDQDMKDFLEWLTAKGMNFQSQVDVAITNGTGRGLLARRSFMPGETML 91
Query: 119 FVPPSLVITADSKWSCPEAGEVLKQCSVPD-----------WPLLATYLISEASFEKSSR 167
VPP L+IT D E G ++ + D PLLA +L + + +S
Sbjct: 92 AVPPELLITPDMARRS-EVGRAFREHGLDDCSGGEDSTYECMPLLAMHL-TVLYYNESHD 149
Query: 168 WSNYISALPRQPYSLLYWTRAE--------LDRYLEASQIRERAIERIT-NVIGTYNDLR 218
+ ++ LPR+ + L+W+ E L L+ + + R T V+G +N
Sbjct: 150 FHPWMKILPRKLTTPLFWSDKEREELQGSNLYNMLDGWTMNVEKLHRSTARVLGQHN--- 206
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS----MDGRVALV-PWADMLNHSC 273
+PDL P+ +++++ FKW++ +F+R + GR ++ P AD+ NH
Sbjct: 207 -----VFPDL-PKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHG- 259
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
+V+T ++ +S T + + GEQ+F++Y K+N E LL YGFV +NP D V
Sbjct: 260 DVKTSYTFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIE--SNPHDYVG 317
Query: 334 LPLSLKKSDKCYKEK-LEALR 353
+ S+ Y++K L+ LR
Sbjct: 318 IAASIGNDQPFYRDKSLDCLR 338
>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
Length = 514
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 27/316 (8%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
+++ L+ W+ +GLPP K+ +++ ++ + A ++++KG+ VP SLV+T
Sbjct: 78 DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
S L W+ AELD YL S + +ER + YN+L +F +YP P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256
Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
+ E FK +F + S +V L + + R ALVP +L + + L V
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 314
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
DR Y+ G+ + + G + N +LLL+YGFV + NP D V + +L Y++K
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVIVEAALNTEGPQYQDKRMV 372
Query: 352 LRKYGLSASECFPIQI 367
++ G + + F +++
Sbjct: 373 AQRNGKLSQQVFQVRV 388
>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 512
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/311 (30%), Positives = 141/311 (45%), Gaps = 30/311 (9%)
Query: 82 WLSDSGLPPQKMAIQK---------VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
WL GLPP K+ I++ D R + A+ +++ G+ V SLV+T +
Sbjct: 81 WLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVSAVVDLQAGDVAFEVSMSLVVTLERVL 140
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-------YSLLYW 185
E+L + + LA YL+ E K S W YI L R S L W
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
T +ELD YL S +++ + R + YN+L +F +YP P E F E F
Sbjct: 201 TESELD-YLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 259
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L + + L D S V DR Y
Sbjct: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDS--VRLVVDRPY 317
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE + I G ++N L+L+YGFV + NP D V + SL D Y+EK ++ G
Sbjct: 318 KAGEPIIIWCGPQTNSRLVLNYGFVDED--NPFDRVAIEASLNTEDPQYQEKRMVAQRNG 375
Query: 357 LSASECFPIQI 367
A + F + +
Sbjct: 376 KLAIQNFNVYV 386
>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
Length = 458
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 183/376 (48%), Gaps = 25/376 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
++ WL +SG + + RG+ L+ ++GE++L +P + T + ++ G
Sbjct: 1 MEGWLRESGAELDGLELAHFPAIGRGVRTLRCFKQGERILTIPSGCLWTVEHAYADAVLG 60
Query: 139 EVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRY 193
VL+ SV D LA Y++ S E ++++ALP S +++ EL+
Sbjct: 61 PVLRSAQPPLSVED--TLAIYILFVRSRESGYDGLRSHVAALPASYSSSIFFEDDELEVC 118
Query: 194 LEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET--FKWSFGILFSR 249
+S I + +RI Y L +R+F + DLFP F +E +KW+ ++SR
Sbjct: 119 AGSSLYTITRQLEQRIEE---DYRGLVVRVFGLHLDLFPLNKFTIENVGYKWALCTVWSR 175
Query: 250 LVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
+ LP+ + L P+ADM+NHS EV+ YD SS + + Y+ +QVFI YG
Sbjct: 176 AMDFVLPNGNPLRLLAPFADMVNHSPEVKQCHVYDASSGNLSILAGKDYEAEDQVFIYYG 235
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
N LL YGFV + NP+DS +L LS Y++K + GL+++ + +
Sbjct: 236 PMPNSRLLRLYGFVIPD--NPNDSYDLVLSTHPLAPFYEQKQKLWASAGLNSTCTISLTL 293
Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCES 427
PL YL + + ++AA A K+ + + I + + + L+F+++S S
Sbjct: 294 DD-PLPKNVLRYLRIQ----RLDESDLAAIALQKIDTNEKISDSK-EVEILRFLVESIGS 347
Query: 428 SISKY-SRFLQVKELL 442
+ + +R +++E L
Sbjct: 348 LLDSFGTRLEKLQEQL 363
>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
23]
Length = 462
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 175/371 (47%), Gaps = 28/371 (7%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL +SG + + + + RG+ A + ++GE++L +P L T +
Sbjct: 1 METWLEESGAVGLDGLEVADFPLTGRGVKARRRFKQGERILTIPSGLHWTVKHAQNDSLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G L SV D LA +++ S E +++ LP S +++T EL+
Sbjct: 61 GPALCSAQPPLSVED--TLAVHILFVRSRESGYDGLRSHVERLPASYSSSIFFTDDELEV 118
Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
AS I ++ +RI + Y DL +R+ +YPDLFP + F + +KW+ ++SR
Sbjct: 119 CAGASLYTITKQLQQRIED---DYRDLVVRVLVQYPDLFPLDKFTLHHYKWALCAVWSRA 175
Query: 251 VRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
+ DG L P+ADMLNHS E + YD SS + + Y+ G+QV+I YG
Sbjct: 176 MDFQLSDGSSIRLLAPFADMLNHSSESKQCHVYDASSGDLSVLAGKDYEAGDQVYIHYGS 235
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY--KEKLEALRKYGLSASECFPIQ 366
N LL YGF+ NP+DS +L L+ + K+KL AL GL ++ +
Sbjct: 236 IPNHRLLRLYGFIIP--GNPNDSYDLVLATHPLAPFFELKQKLWALA--GLDSTCTISLT 291
Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCE 426
+T PL YL + + ++A+ A + +K E+ Q LQ +++S
Sbjct: 292 LTD-PLPKNVIRYLRI----QRLDESDLASIALGQAADEKISNSNEV--QVLQSLVESIA 344
Query: 427 SSISKYSRFLQ 437
S + + L+
Sbjct: 345 SLLGSFGTRLE 355
>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
str. Neff]
Length = 273
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/240 (32%), Positives = 126/240 (52%), Gaps = 20/240 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADS--KWSCPEAGEVLKQCSVPDWPLLAT----Y 155
R +VA +I GE LL VP SLV+ +AD+ S PE +L + ++PL T
Sbjct: 43 RSVVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDE----EFPLSPTNENAL 98
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ + +S W YI LP + L+++ EL YLE S + A +R + Y+
Sbjct: 99 LLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELS-YLEGSSLHYFARQRRRAIESQYD 157
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
+ +F YP+ F E F+++ +KW+ +++SR + +G+ LVPWADM N + E
Sbjct: 158 TIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVD--EGKSGLVPWADMFNMAPET 215
Query: 276 ETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDS 331
E + D ++++ + GEQ+F++YG+ SN +LL+ YGFV NP D+
Sbjct: 216 EQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLE--NNPHDA 273
>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
Length = 595
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW SD+G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 82 LMKWASDNGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYAQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P +E F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPQAHKLPLKESFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPVSAQLLAFLRV 387
>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
pulchellus]
Length = 485
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 152/308 (49%), Gaps = 35/308 (11%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGE 139
KW SD+G ++I+ + GE G VA ++I + + L VP L++T A +K S + G
Sbjct: 79 KWCSDNGAYLGSVSIKDLPDGEYGFVADEHIEESNQFLGVPLKLMMTTAAAKKS--KLGP 136
Query: 140 VLKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L+ D P+ LA +LI E +SS W YIS LP ++LY++ EL+
Sbjct: 137 LLR-----DDPIMMSMSNVALAMFLILEFCTGESSFWHPYISTLPASFNTVLYFSVEELE 191
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFS 248
L S + + A++ ++ Y+ +IF +P L ++ F + ++W+ + +
Sbjct: 192 -LLHGSTVLDEALKLHRSIARQYSYFH-KIFRTHPLAKSLPYKDCFTYDLYRWAVSAVMT 249
Query: 249 RLVRLP-----------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
R +P D A+VP DM NHS + + F DYD S+ + R ++
Sbjct: 250 RQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHS-DGKVFTDYDISANMLRCYAMRDFE 308
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
G++V I YG+++N E + GFV E N DSV++ L + K D Y K + + L
Sbjct: 309 KGQEVTIFYGRRTNAEFFIHNGFVFPE--NRHDSVDIKLGISKQDPLYAVKAKLCDDHEL 366
Query: 358 SASECFPI 365
+ S F +
Sbjct: 367 TPSGIFAL 374
>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
Group]
gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
Length = 509
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 142/311 (45%), Gaps = 30/311 (9%)
Query: 82 WLSDSGLPPQKMAI---------QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
WL + GLPP K+AI + D+ + A +++ G+ VP SLV+T +
Sbjct: 78 WLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLERVL 137
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E + S W YI L RQ S L W
Sbjct: 138 GDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLW 197
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
T +EL+ YL+ S I++ + R + YN+L +F +YP P E F E F
Sbjct: 198 TESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 256
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L + + L S V DR Y
Sbjct: 257 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDS--VRLVVDRPY 314
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE + + G + N LLL+YGF+ + NP D + + SL D ++EK ++ G
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVIEASLNIEDPQFQEKRMVAQRNG 372
Query: 357 LSASECFPIQI 367
A + F + +
Sbjct: 373 KLAIQNFHVCV 383
>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
Length = 505
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/311 (29%), Positives = 142/311 (45%), Gaps = 30/311 (9%)
Query: 82 WLSDSGLPPQKMAI---------QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
WL + GLPP K+AI + D+ + A +++ G+ VP SLV+T +
Sbjct: 74 WLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLERVL 133
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E + S W YI L RQ S L W
Sbjct: 134 GDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLW 193
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
T +EL+ YL+ S I++ + R + YN+L +F +YP P E F E F
Sbjct: 194 TESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 252
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L + + L S V DR Y
Sbjct: 253 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDS--VRLVVDRPY 310
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE + + G + N LLL+YGF+ + NP D + + SL D ++EK ++ G
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVIEASLNIEDPQFQEKRMVAQRNG 368
Query: 357 LSASECFPIQI 367
A + F + +
Sbjct: 369 KLAIQNFHVCV 379
>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
purpuratus]
Length = 455
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 135/284 (47%), Gaps = 21/284 (7%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKV---DVGERGLVALKNIRKGEKLLFVPPSLVI 126
+D E TL KW+ + G + ++ D G RGL+ KN+R G+ ++ +P L++
Sbjct: 37 VDHDEQYITLMKWMKEHGFNCKGCCLKPAVFSDTG-RGLMTKKNLRPGDSIVEIPRHLLV 95
Query: 127 TADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
TA + E G ++K+ P ++ +L++E S KSS W YI+ LP+ + +
Sbjct: 96 TAKDILNT-ELGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKDFTTPAF 154
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE--VFNMETFKWS 242
+ + D + + R RAI ++ ++ + +F FP+ F++++F W+
Sbjct: 155 GSTKQADFDVLPTIARSRAINQLQDIRAAFESASC-LFEDIERTFPQYRIFFSLDSFVWA 213
Query: 243 FGILFSRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
+ ++ SR V + P AL P+ D+LNHS E +D S T
Sbjct: 214 WFVINSRSVYIEPSGCEAFDPKASDDFALAPFLDLLNHSPGAEVTAGFDPVSNCYRIKTL 273
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
Y +QVFI YG N LLL YGFV +NP D+V L
Sbjct: 274 DSYHAYDQVFIHYGPHDNVNLLLEYGFVI--PSNPHDAVSFELG 315
>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 111/390 (28%), Positives = 176/390 (45%), Gaps = 36/390 (9%)
Query: 68 CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVI 126
C + + T+ W G+ Q A++ +V E GL+A + + G+++L VP S+ I
Sbjct: 35 CSVGAEAQVQTIWSWAQSHGI--QGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWI 92
Query: 127 T--ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
A S +A E LK W +A +LI E+S SS+W Y+ +LP+ S L+
Sbjct: 93 NLAAVQNSSLGKACEGLKP-----WVAVALFLIHESS-NPSSKWRPYLDSLPKSLDSPLF 146
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
W+ EL L +Q+ + + YN+L + +F V+ + FKW+FG
Sbjct: 147 WSDEELAE-LVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVYTFDGFKWAFG 205
Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEV------------ETFLDYDK-SSQGVVFT 291
IL SR P +ALVP AD++NH + F + K SS +
Sbjct: 206 ILRSRTFS-PLTGEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVR 264
Query: 292 TDRQYQPGEQVFISYGK-KSNGELLLSYGFVPRE-GTNPS-----DSVELPLSLKKSDKC 344
+ GEQV + YG KSN +L L YGFV R+ G+ S DS+ L L + D+
Sbjct: 265 ASANFSAGEQVLMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRF 324
Query: 345 YKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTS 404
+K + L G S F + P + M +L +S S F + A N+
Sbjct: 325 VDDKADILEINGFQCSMQFDLSRGQGPSDEM-ITFLRLSALSGPDSF-LLEALFRNEAWG 382
Query: 405 KKDIKCPEIDEQAL-QFILDSCESSISKYS 433
+ +E+AL +L+ ++++ YS
Sbjct: 383 HVSLPVSRDNEEALCTSMLEGLKAALDGYS 412
>gi|388493466|gb|AFK34799.1| unknown [Lotus japonicus]
Length = 132
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 51/64 (79%), Positives = 58/64 (90%)
Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
MAYAYL VSP SM+G+FE+MAAAASNK+TS KD K PEI+EQALQFILDSCESSISKY++
Sbjct: 1 MAYAYLAVSPSSMRGQFEKMAAAASNKITSTKDFKYPEIEEQALQFILDSCESSISKYNK 60
Query: 435 FLQV 438
FLQ
Sbjct: 61 FLQA 64
>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
Length = 406
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 168/370 (45%), Gaps = 27/370 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
+ WL+ SG + + +D+ + RG+ A + ++ E++L +P + + T ++
Sbjct: 1 MDAWLNKSG----AVGLGDLDLADFPETGRGVKAQRPFKEDERILTIPANCLWTVKGAYA 56
Query: 134 CPEAGEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWT 186
P G VL+ SV D LA Y++ S + +++ LP + +Y+T
Sbjct: 57 DPLFGPVLQSVQPPLSVED--TLALYILFVRSRGEDPAYAERQTHVAMLPSEYTLSMYFT 114
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
EL R S + V Y L +F ++ DLFP + F+ + +KW+ +
Sbjct: 115 DEEL-RVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLFPLDKFSFQHYKWALSSI 173
Query: 247 FSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+SR + +G + P+ADMLNH+ + + YD S+ + R Y+ G+QVFI
Sbjct: 174 WSRGMDFTISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVLACRDYEVGDQVFI 233
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
YG SN LL YGFV + NP+D+ EL L Y++K + GL P
Sbjct: 234 YYGNVSNSRLLRLYGFVLPD--NPNDNYELVLQTSSMAPLYEQKQRLWKLAGLDEISTIP 291
Query: 365 IQITGWPLELMAYAYLVV---SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFI 421
+ + PL YL + + ++A + K++ + + + Q+++ +
Sbjct: 292 LSLQN-PLPDSVLRYLRIQRLDASDLGTMTMQIATESYTKISDENESQILLFLSQSIEAL 350
Query: 422 LDSCESSISK 431
L+ E S+ K
Sbjct: 351 LEGFEISLEK 360
>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic [Vitis
vinifera]
gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 172/374 (45%), Gaps = 34/374 (9%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
T KWL D G+ K ++ V E GLVA ++I + E +L VP I D+
Sbjct: 51 VQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWINPDAV--- 107
Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
A E+ C + W +A +LI E S W +Y+ LP S +YW+ EL
Sbjct: 108 -AASEIGSVCGGLKPWVSVALFLIRE-KLRDESPWRSYLDILPEYTNSTIYWSEEELVE- 164
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
++ +Q+ + V + + + + LFP V ++ F W+FGIL SR R
Sbjct: 165 IQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPV-TLDDFLWAFGILRSRAFSR 223
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPGEQVF 303
L + + L+P AD++NHS + T +Y +G +F+ D+ + + GEQV
Sbjct: 224 LRGQN--LVLIPLADLINHSPSITT-EEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVL 280
Query: 304 ISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
I Y KSN EL L YGF+ E +S L L + +SD + +KL+ GLS
Sbjct: 281 IQYDLDKSNAELALDYGFI--ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAY 338
Query: 363 FPIQI-TGWPLELMAYAYLVV--SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL- 418
F I + P ++ Y LV P + + + N + ++ +E+ +
Sbjct: 339 FDIVLGQSLPAAMLPYLRLVALGGPDAFL-----LESIFRNTIWGHLELPVSRANEELIC 393
Query: 419 QFILDSCESSISKY 432
Q I D+C+S++S Y
Sbjct: 394 QVIQDACKSALSGY 407
>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
Length = 507
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 172/374 (45%), Gaps = 34/374 (9%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
T KWL D G+ K ++ V E GLVA ++I + E +L VP I D+
Sbjct: 51 VQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWINPDAV--- 107
Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
A E+ C + W +A +LI E S W +Y+ LP S +YW+ EL
Sbjct: 108 -AASEIGSVCGGLKPWVSVALFLIRE-KLRDESPWRSYLDILPEYTNSTIYWSEEELVE- 164
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
++ +Q+ + V + + + + LFP V ++ F W+FGIL SR R
Sbjct: 165 IQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPV-TLDDFLWAFGILRSRAFSR 223
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPGEQVF 303
L + + L+P AD++NHS + T +Y +G +F+ D+ + + GEQV
Sbjct: 224 LRGQN--LVLIPLADLINHSPSITTE-EYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVL 280
Query: 304 ISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
I Y KSN EL L YGF+ E +S L L + +SD + +KL+ GLS
Sbjct: 281 IQYDLDKSNAELALDYGFI--ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAY 338
Query: 363 FPIQI-TGWPLELMAYAYLVV--SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL- 418
F I + P ++ Y LV P + + + N + ++ +E+ +
Sbjct: 339 FDIVLGQSLPAAMLPYLRLVALGGPDAFL-----LESIFRNTIWGHLELPVSRANEELIC 393
Query: 419 QFILDSCESSISKY 432
Q I D+C+S++S Y
Sbjct: 394 QVIQDACKSALSGY 407
>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
Length = 327
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 89/320 (27%), Positives = 159/320 (49%), Gaps = 25/320 (7%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
S +++ L +WL +G + + RG+++ +N+++G+ ++ +P +L+IT +
Sbjct: 2 SRDDSIQLMRWLRRNGFRDSHLVLTDFPDTGRGVMSTRNLKEGDCIVSLPENLLITTTTV 61
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
+ G+ +K P +L+ YLI+E S K S W YI LP + Y++ AE
Sbjct: 62 VNS-HLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTLPTSYTTPSYFSTAE 120
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILF 247
+D + +RE + + +Y L+ + + P LFP+ VF +++++W++ ++
Sbjct: 121 VDAL--PALVREATLRHRKVLQNSYKSLQTSLHNLEP-LFPDWKTVFTLKSYRWAWATVY 177
Query: 248 SRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
+R V PS AL P+ DMLNHS V+T D++ SS+ T+ +
Sbjct: 178 TRSVYKRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVSSKCYEVKTEGACRK 237
Query: 299 GEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKK----SDKCYKEKLEALR 353
QVFI+Y NG LL+ YGFV PR NP V ++K+ S ++K+E L
Sbjct: 238 YRQVFINYDPYDNGRLLMEYGFVMPR---NPHSVVTFTAAVKQNGLSSKNLLQKKMELLS 294
Query: 354 KYGLSASECFPIQITGWPLE 373
+ L+ + + W L+
Sbjct: 295 QENLTVNLSCSGEGLSWRLQ 314
>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
nagariensis]
Length = 488
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 151/336 (44%), Gaps = 26/336 (7%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERG-----LVALKNIRKGEKLLFVPPSLVI 126
++ AS L WL ++G + ++ +DV G +VA +++ GE L VP L +
Sbjct: 37 AVHTASELVDWLRENGAKIDAVEVKTMDVPSAGRPLDVVVAGRSLAAGEVALSVPERLCL 96
Query: 127 TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL-------PRQP 179
T D + E+L + + LA YL+ E +K S W YI L P+
Sbjct: 97 TLDRIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAA 156
Query: 180 YSLLYWTRAELDRYLEASQI------RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
S L W ELD L+ S + R+ I + + T + +F+KYP P E
Sbjct: 157 ESPLLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTET 216
Query: 234 FNMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVF 290
F+ E F+ +F ++ + +V L + R ALVP ++ +S + + YD+ S+ V
Sbjct: 217 FSFELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRL 276
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE--LPLSLKKSDKCYKEK 348
+ G V G + N LLL+YG V + NP D ++ +L SD + K
Sbjct: 277 VVSGPVEAGRPVAAWCGPQPNSRLLLNYGVV--DEHNPFDKLQARFTFTLPTSDPLFPAK 334
Query: 349 LEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVS 383
L + GL+ + F + + P +L+ Y L ++
Sbjct: 335 RAVLSEAGLATQQSFDVSVARPLPPQLLPYMMLALA 370
>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
[Spinacia oleracea]
gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
N-methyltransferase II [Spinacia oleracea]
Length = 495
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 174/393 (44%), Gaps = 33/393 (8%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT 127
E D+ WLSD G+ K ++ V E GLVA K+I + E +L VP I
Sbjct: 49 ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108
Query: 128 ADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
D+ A E+ C+ + W +A +L+ E SS W YI LP S +YW+
Sbjct: 109 PDTV----AASEIGSVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWS 164
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
EL L+ SQ+ + V + L + + LFP +V + F W+FG+L
Sbjct: 165 EEELSE-LQGSQLLNTTLGVKELVANEFAKLEEEVLVPHKQLFPFDV-TQDDFFWAFGML 222
Query: 247 FSRLVRLPSMDGR-VALVPWADM----LNHSCEVETFLDYDKSSQG-------VVFTTDR 294
SR ++G+ + L+P AD+ NHS ++ T Y +G +VF+ R
Sbjct: 223 RSR--AFTCLEGQSLVLIPLADLWVQQANHSPDI-TAPKYAWEIRGAGLFSRELVFSL-R 278
Query: 295 QYQP---GEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
P G+QV I Y KSN EL L YG E + ++ L L + +SD Y +KL+
Sbjct: 279 NPTPVKAGDQVLIQYDLNKSNAELALDYGLT--ESRSERNAYTLTLEIPESDSFYGDKLD 336
Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
G+ S F I + PL YL + + F + + N + D+
Sbjct: 337 IAESNGMGESAYFDI-VLEQPLPANMLPYLRLVALGGEDAF-LLESIFRNSIWGHLDLPI 394
Query: 411 -PEIDEQALQFILDSCESSISKYSRFLQVKELL 442
P +E Q I D+C S++S YS + E L
Sbjct: 395 SPANEELICQVIRDACTSALSGYSTTIAEDEKL 427
>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
Length = 394
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 143/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S +G + + + GL A + I+ E L+VP L++T +S
Sbjct: 82 LMKWASANGASVEGFEVVNFEEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y R+ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R + GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFHAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
Length = 501
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 163/352 (46%), Gaps = 32/352 (9%)
Query: 38 HRIVVHCSVSTTNDASRTKTTVTQN---MIPWGCEIDSLENASTLQKWLSDSGLPPQKMA 94
HR+ S+ST + R N ++ + E+ L+ W+ +GLPP K+
Sbjct: 26 HRLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGKKKRDEDDGDLKTWMHKNGLPPCKVV 85
Query: 95 IQK---VDVGERGL---VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
++ +D + + A ++++KG+ VP SLV+T + E+L +
Sbjct: 86 LKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFSE 145
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAELDRYLEASQIRE 201
LA YL+ E K S W YI L RQ S L W+ +EL YLE S +++
Sbjct: 146 LACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL-AYLEGSPLKD 204
Query: 202 RAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP- 254
++RI + YN+L +F +YP P E F E FK +F + S +V L
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264
Query: 255 -SMDGRVALVPWA-DMLNHSCEVETFLD-YDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
S+ R ALVP +L + + L D + Q VV DR Y+ G+ + + G + N
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVV---DRPYKAGDPIVVWCGPQPN 321
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
+LL +YGFV + +N VE+ LS + D Y++K ++ G + + F
Sbjct: 322 TKLLTNYGFVDEDNSNDRLIVEVALSTE--DPQYQDKRIVAQRNGKLSIQTF 371
>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 453
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 169/347 (48%), Gaps = 43/347 (12%)
Query: 109 KNIRKGEKLLFVPPSLVITADSKWS------CPEAGEVLKQCSVPDWPLLATYLISEASF 162
+N+ +GE + VP L + AD+ + C G++ W ++ ++ EA+
Sbjct: 51 RNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRP------WVSVSLLILREAAR 104
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
S W+ Y++ LPRQ S ++W+ EL ++ +Q+ + V ++++ I
Sbjct: 105 GGDSLWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFDNVEAGII 163
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVET---- 277
+ DLFP + + F W+FG+L SR+ P + G ++AL+P+AD++NH ++ +
Sbjct: 164 NVNKDLFPGTI-TFDDFLWAFGVLRSRV--FPELRGDKLALIPFADLINHDGDITSKESC 220
Query: 278 -------FLDYDKSSQGVVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
FL D VF+ T + GEQ+++ Y KSN EL L YGF E +
Sbjct: 221 WEIKGKGFLGRD-----TVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFT--ESNS 273
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI-TGWPLELMAYAYLVVSPPS 386
DS L L + +SD Y++KL+ G+ + F + + P +++ Y L+ +
Sbjct: 274 SRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLLCLGGT 333
Query: 387 MKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKY 432
E A NK+ ++ +E+++ Q I ++C+S+++ Y
Sbjct: 334 DAFLLE---ALFRNKVWEHLELPVSRDNEESICQVIQNACKSALAAY 377
>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
Length = 537
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 148/334 (44%), Gaps = 45/334 (13%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
+++ L+ W+ +GLPP K+ +++ ++ + A ++++KG+ VP SLV+T
Sbjct: 78 DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
+ E+L + + LA YL+ E K S W YI L RQ
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
S L W+ AELD YL S + +ER + YN+L +F +YP P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256
Query: 235 NMETFKWSFGILFSRLVRLP--------------------SMDGRVALVPWA-DMLNHSC 273
+ E FK +F + S +V L + R ALVP +L +
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCTQNVGLARRFALVPLGPPLLAYCS 316
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
+ L V DR Y+ G+ + + G + N +LLL+YGFV + NP D V
Sbjct: 317 NCKAML--TAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVI 372
Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
+ +L D Y++K ++ G + + F +++
Sbjct: 373 VEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRV 406
>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
Length = 805
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 149/319 (46%), Gaps = 28/319 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
L KW S++G + + VD E G L A ++I+ E L+VP L++T +S
Sbjct: 261 LMKWASENGASVE--GFEMVDFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA----- 313
Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
VL D L LA +L+ E + + +S W YI LP + + LY+ E
Sbjct: 314 KNSVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDE 372
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
+ RYL+++Q + N Y ++ +P L ++ F E ++W+ +
Sbjct: 373 V-RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSV 430
Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++
Sbjct: 431 MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIY 489
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
I YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F
Sbjct: 490 IFYGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF 547
Query: 364 PIQITGWPLELMAYAYLVV 382
+ T P+ A+L V
Sbjct: 548 ALHFTEPPISAQLLAFLRV 566
>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
setd3 [Ovis aries]
Length = 596
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 89 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 143
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 144 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y R+ +P ++ F E ++W+ + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394
>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
sativus]
Length = 521
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 141/307 (45%), Gaps = 27/307 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
L+ W+ D+GLPP K+ +++ ++ + A +++ G+ VP SLV+T +
Sbjct: 90 LKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVVTLERVL 149
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E K S W YI L RQ S L W
Sbjct: 150 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLW 209
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
+ ELD YL S ++ +ER + YN+L +F +YP P E F+ E F
Sbjct: 210 SEDELD-YLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 268
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L + + L V DR Y
Sbjct: 269 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAML--TAVDGAVELVVDRPY 326
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE + + G + N +LLL+YGFV + N D + + +L D Y++K ++ G
Sbjct: 327 KAGESIAVWCGPQPNSKLLLNYGFVDED--NRYDRLVVEAALNTEDPQYQDKRMVAQRNG 384
Query: 357 LSASECF 363
+ + F
Sbjct: 385 RLSIQAF 391
>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
Length = 541
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 153/319 (47%), Gaps = 17/319 (5%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
+N WL + + + + IQ DVG G+ A ++ ++GE L +P S+++T D+ +
Sbjct: 75 KNFDGFMGWLKSNSVDAEAVEIQHFDVGGYGIKATRDFKEGELFLAIPRSVMMTTDTAKN 134
Query: 134 CPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+ + P LLA +++ E +S W Y+ LP S LY+ +L
Sbjct: 135 SALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPHSYSSPLYFNPEDL- 192
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKY--PDLFPEEVFNM--ETFKWSFGILF 247
+ L+AS I + N+ Y +F + P +V N+ + ++W+ +
Sbjct: 193 QLLKASPTLSEMINQFRNITRQYAYF-FNLFQGHELASKLPIQVKNICYDDYRWAVSSVM 251
Query: 248 SRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVF 303
+R ++P++DG+ AL+P DM NH+ + D+ K+ + F+ + G QVF
Sbjct: 252 TRQNQIPTLDGQRMISALIPLWDMCNHT-NGQITTDFSLKNDRSECFSLEGTV-AGAQVF 309
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
I YG +SN ELL+ GFV + N SD + + L + K+D + K E L + + AS F
Sbjct: 310 IFYGSRSNAELLIHNGFVYPQ--NHSDRLTIRLGISKNDPLFSMKSEVLSRLSMQASRLF 367
Query: 364 PIQITGWPLELMAYAYLVV 382
+ P++ A+L V
Sbjct: 368 SLHCGVNPVDSDTLAFLRV 386
>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 1 [Glycine max]
gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
isoform 2 [Glycine max]
Length = 502
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 141/309 (45%), Gaps = 27/309 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
L+ W+ GLPP K+ ++ + A ++++ G+ VP SLV+T +
Sbjct: 71 LKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERVL 130
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E K S W YI L RQ S L W
Sbjct: 131 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLW 190
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
++ELD YL S I++ I+R + YN+L +F +YP P E F+ E F
Sbjct: 191 LKSELD-YLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L++ + L V DR Y
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAML--TAVDGAVELAVDRPY 307
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ G+ + + G + N +LL++YGFV +N D + + +L D Y++K ++ G
Sbjct: 308 KAGDPIVVWCGPQPNSKLLINYGFVDENNSN--DRLIVEAALNTEDPQYQDKRMVAQRNG 365
Query: 357 LSASECFPI 365
+ + F +
Sbjct: 366 KLSVQVFHV 374
>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
Length = 591
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S +G + + + GL A + I+ E L+VP L++T +S +
Sbjct: 82 LMKWASANGASVEGFEVVNFEEEGFGLRATREIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y R+ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R + GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFHAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
Length = 489
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 169/350 (48%), Gaps = 39/350 (11%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-----VPDWPLLATYLIS 158
GLVA +++ +GE + VP L + AD+ A ++ + C + W +A L+S
Sbjct: 82 GLVAARDLPRGEVVAEVPKKLWMDADAV----AASDIGRACGGGGGGLRPWVAVALLLLS 137
Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI----GTY 214
E + S W+ Y++ LPRQ S ++ L+ S +R + + V +
Sbjct: 138 EVARGADSPWAPYLAILPRQTDSTIFCAG------LKKSSLRYKLLSTTVGVKEYVQSEF 191
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSC 273
+ ++ I S+ DLFP + + F W+FGIL SR+ P + G ++ALVP+AD++NHS
Sbjct: 192 DSVQAEIISRNKDLFPGSI-TFDDFLWAFGILRSRV--FPELRGDKLALVPFADLVNHSP 248
Query: 274 EVET-FLDYDKSSQGVV-------FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPRE 324
++ + ++ +G+ T + G+Q++I Y KSN EL L YGFV
Sbjct: 249 DITSEGSSWEIKGKGLFGREPMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGFVE-- 306
Query: 325 GTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVS 383
+NPS DS + L + +SD Y +KL+ L + F I I PL YL +
Sbjct: 307 -SNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDI-ILDEPLPPQMLPYLRLL 364
Query: 384 PPSMKGKFEEMAAAASNKMTSKKDIKC-PEIDEQALQFILDSCESSISKY 432
F + A N + ++ P+ +E Q + D+C+S+++ Y
Sbjct: 365 CIGGTDAF-ILEALFRNSVWGHLELPLSPDNEESICQVMRDACKSALAAY 413
>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
lyrata]
Length = 482
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 183/389 (47%), Gaps = 40/389 (10%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
EN KWL D G+ K + V E GLVA ++I + E +L +P L W
Sbjct: 47 ENVRNFWKWLGDQGVVSGKSPAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99
Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
PE A ++ C + W +A +LI E +E+ S W Y+ LP+ S ++W+
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRLYLDMLPQSTDSTVFWSEE 158
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
EL L+ +Q+ + V + L I DLF + ++ F W+FGIL S
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKS 216
Query: 249 R-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QP 298
R RL + + L+P AD++NH+ + T DY +G +F+ D + +
Sbjct: 217 RAFSRLRGQN--LVLIPLADLINHNPAITTE-DYAYEIKGAGLFSRDLLFSLKSPVYVKA 273
Query: 299 GEQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYG 356
GEQV+I Y KSN EL L YGFV +NP+ +S L + + +SD + +KL+
Sbjct: 274 GEQVYIQYDLNKSNAELALDYGFVE---SNPNRNSYTLTIEIPESDPFFGDKLDIAETNK 330
Query: 357 LSASECFPIQITG--WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
+ + F + + G P ++ Y LV S E + +N + ++ +
Sbjct: 331 MGETGYFDV-VDGQTLPAGMLQYLRLVALGGSDAFLLESI---FNNTIWGHLELPVSRSN 386
Query: 415 EQAL-QFILDSCESSISKYSRFLQVKELL 442
E+ + + + D+C+S++S +S ++ E L
Sbjct: 387 EELICRVVRDACKSALSGFSTTIEEDEKL 415
>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
N-methyltransferase, chloroplastic; AltName:
Full=Aldolases N-methyltransferase; AltName:
Full=[Ribulose-bisphosphate carboxylase]-lysine
N-methyltransferase-like; Short=AtLSMT-L;
Short=LSMT-like enzyme; Flags: Precursor
gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
subunit N-methyltransferase [Arabidopsis thaliana]
gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
[Arabidopsis thaliana]
Length = 482
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 180/387 (46%), Gaps = 36/387 (9%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
EN KWL D G+ K + V E GLVA ++I + E +L +P L W
Sbjct: 47 ENVRNFWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99
Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
PE A ++ C + W +A +LI E +E+ S W Y+ LP+ S ++W+
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRVYLDMLPQSTDSTVFWSEE 158
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
EL L+ +Q+ + V + L I DLF + ++ F W+FGIL S
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKS 216
Query: 249 R-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QP 298
R RL + + L+P AD++NH+ ++T DY +G +F+ D + +
Sbjct: 217 RAFSRLRGQN--LVLIPLADLINHNPAIKT-EDYAYEIKGAGLFSRDLLFSLKSPVYVKA 273
Query: 299 GEQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYG 356
GEQV+I Y KSN EL L YGFV +NP +S L + + +SD + +KL+
Sbjct: 274 GEQVYIQYDLNKSNAELALDYGFVE---SNPKRNSYTLTIEIPESDPFFGDKLDIAESNK 330
Query: 357 LSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQ 416
+ + F I + G L YL + F + + +N + ++ +E+
Sbjct: 331 MGETGYFDI-VDGQTLPAGMLQYLRLVALGGPDAF-LLESIFNNTIWGHLELPVSRTNEE 388
Query: 417 AL-QFILDSCESSISKYSRFLQVKELL 442
+ + + D+C+S++S + ++ E L
Sbjct: 389 LICRVVRDACKSALSGFDTTIEEDEKL 415
>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
Length = 488
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 89 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 143
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 144 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P ++ F E ++W+ + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394
>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
florea]
Length = 483
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 169/362 (46%), Gaps = 22/362 (6%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
WL ++G ++ + + GL A +N + E +L +P L+ + + + PE +
Sbjct: 87 WLKENGANVDGASVAEFPGYDLGLKAERNFLENELILRIPRGLIFSIHN--AAPELITLQ 144
Query: 142 KQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
+ P LA L+ E E +S+W Y+ LP ++LY T A++ L+ S
Sbjct: 145 NDPLIQHMPQVALAIALLIERHKE-NSKWKPYLDILPTTYTTVLYMTAADMIE-LKGSPT 202
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLPSM 256
E A+++ N+ Y+ ++F + +VF E + W+ + +R +PS
Sbjct: 203 LEAALKQCRNIARQYSYFN-KVFQNNNNAVSAILRDVFTYERYCWAVSTVMTRQNLIPSE 261
Query: 257 DGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
DG AL+P DM NH T D++ +S R ++ GEQ+FISYG ++N +
Sbjct: 262 DGSRMIHALIPMWDMCNHENGRIT-TDFNATSNYCECYALRDFKKGEQIFISYGPRTNSD 320
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
+ GFV E N D +L L + K+D KE++E L K L F +++ P+
Sbjct: 321 FFVHSGFVYME--NKQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLGTEPIS 378
Query: 374 LMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSIS 430
+ A+L V SM+ K E S+++ K + C ++E +F+L + I+
Sbjct: 379 DLLLAFLRVF--SMR-KAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIA 435
Query: 431 KY 432
Y
Sbjct: 436 NY 437
>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 427
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 141/306 (46%), Gaps = 27/306 (8%)
Query: 83 LSDSGLPPQKMAI--QKVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
+ + GLP +A+ ++ G++G +VA ++++ G+ L VP SLV+T +
Sbjct: 1 MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRA 188
E+L + + LA YL+ E K S W YI L RQ S L W+
Sbjct: 61 TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWS 242
EL+ Y S ++E +ER+ + Y +L +F +YP P E F+ E FK +
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180
Query: 243 FGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
F + S +V L S+ R ALVP +L + + L V D Y+ G
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAML--KAVGDNVQLEVDHAYKTG 238
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
+ + + G + N +LLL+YGFV + NP D + + SL D Y++K ++K
Sbjct: 239 DPIAVWCGPQPNSKLLLNYGFVDED--NPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLT 296
Query: 360 SECFPI 365
+ F I
Sbjct: 297 IQTFQI 302
>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
ferrumequinum]
Length = 594
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVSFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFGEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
CCE9901]
Length = 529
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 26/282 (9%)
Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD----WPLLATYLISEASF 162
A + + +G K + VP SL IT + + E G+ L+ V W LA L+ E
Sbjct: 93 ATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLALTLLKERER 152
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+ S+++ Y+ LP +S L+W EL ++ +Q+ + A V G Y LR +F
Sbjct: 153 GEESKYAAYVKTLPEVLHSPLFWNAEELSE-IQGTQLLDNAAGYDGYVRGVYETLRTGMF 211
Query: 223 SKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-------- 273
+K+ D+F E F+ + F+W+FGIL SR + P +ALVP D++NHS
Sbjct: 212 AKHADVFDVEGAFSEDNFRWAFGILRSRTM-APCDGANIALVPGVDLVNHSSLSQARWRV 270
Query: 274 ------EVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPRE 324
V K GV DR E ++++Y + ++ L +GFV +
Sbjct: 271 SGGVAGAVAGLFGGGKGDDGVSARVECDRALNVNEPLYVNYNPEGTDTSFALDFGFV--D 328
Query: 325 GTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
PS L LS+ + D +KL+ L GL + F ++
Sbjct: 329 TITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGETPTFTLR 370
>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
caballus]
Length = 609
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + GL A ++I+ E L+VP L++T +S +
Sbjct: 96 LMKWASENGASVDGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 151
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 152 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 208
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y R+ +P + P ++ F E ++W+ + +
Sbjct: 209 RYLQSTQAVHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 267
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 268 RQNQIPTEDGSRVTLALIPLWDMCNHTTGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 326
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 327 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 384
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 385 HFTEPPISAQLLAFLRV 401
>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
Length = 463
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 143/301 (47%), Gaps = 35/301 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L WL G ++ K +RGL A+++I+ GE +L V ++TAD P
Sbjct: 40 LVSWLKIRG-EHDACSLLKTGPDKRGLFAVRDIKAGECILRVSRDTMMTADR---LPLEF 95
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
+ L V +W LA L+ E ++S W+ YIS LPR +S +W + EL ++
Sbjct: 96 QQLLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWGTIHSTAFWRKEEL-AMIQE 154
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLP 254
S + + R + +N+++ IF +Y +F V + +FK ++ + SR R+
Sbjct: 155 SSLSYETMSRRAAIREEFNEMQ-PIFQRYEHVFGGPV-SYASFKHAYVTATVCSRAWRID 212
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV------VFT-------------TDRQ 295
++ ++A+VP+AD +NH L YD + V++ D+
Sbjct: 213 GLE-KLAMVPFADFMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALFCAQLFADKN 271
Query: 296 YQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
Y GEQV IS+G N L L +GF VP NP D V+L L + + D KEKL+ L
Sbjct: 272 YAAGEQVTISFGPLCNASLALDFGFTVP---YNPWDKVQLWLGISRRDSLRKEKLQYLHA 328
Query: 355 Y 355
+
Sbjct: 329 H 329
>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
Length = 601
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 89 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 144
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 145 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P ++ F E ++W+ + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394
>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
Length = 594
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
Length = 439
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 167/357 (46%), Gaps = 25/357 (7%)
Query: 90 PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
P + +++ ++G GL A +++ K ++++ +P +L + AD+ E GE + + W
Sbjct: 9 PGGVEVRRGELG-LGLFAKRSVSKNQEVVSIPKTLWMDADTV-RRSEIGECCE--GLRPW 64
Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
+A YL+ E + + S WS YI LPR S L+W+ EL L+ +Q+
Sbjct: 65 IAVALYLLHEKA-KPHSDWSAYIRVLPRTLDSPLFWSEEELAE-LKGTQLLSSMNGFKEF 122
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
+ Y+ + + PD+F ++ +E F W+FGIL SR P + +ALVP AD +
Sbjct: 123 LKREYDKVMTEVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFP-PLIGDNLALVPLADFV 181
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKK-SNGELLLSYGF 320
NH + K VF ++V I YGKK N +L YGF
Sbjct: 182 NHGFGLTNEDPGWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGF 241
Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWPLELMAYAY 379
V + N DS L L + S++ +K++ + GL ++ F + + G P +++AY
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLR 301
Query: 380 LVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSISKYS 433
L+ S + A + T ++ P E +E + +++ C +++ +YS
Sbjct: 302 LIALFGS-----DSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYS 353
>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
Length = 389
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 161/372 (43%), Gaps = 70/372 (18%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL A + I GE +L V L+IT + PE L V W LA +L++
Sbjct: 1 RGLFASRPIHTGECMLHVSHDLMITPEK---LPEEVTKLLSKDVSAWAKLALFLLAHQKK 57
Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIER----------ITNV 210
+++S W+ YIS LP +S ++WT+ EL YL+ S + ++R NV
Sbjct: 58 KETSAWAPYISCLPPFGSMHSTIFWTQDEL-VYLKVSPVYRETVQRKDVVRMEFAAAENV 116
Query: 211 IGTYNDLRL----RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWA 266
++L RI + Y + + +ET K +ALVP+
Sbjct: 117 CMLMQQVKLFVCSRILTDYITVC-SRAWGIETIK------------------SLALVPFV 157
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREG 325
D NH L YD+ +DR Y G+QV ISYG+ SN L L +GF +P
Sbjct: 158 DFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALP--- 214
Query: 326 TNPSDSVE-LPLSLKKSDKCYKEKLEALRKYGL----------SASECFPIQIT------ 368
NP D V + LSL + D KL+ L + + +A F +Q
Sbjct: 215 FNPHDQVAGIWLSLSEKDPLRDSKLKLLHSHNMQTCVTREGVDTAGSSFSLQEVKSKAGR 274
Query: 369 --GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFIL 422
G P L A+A +V + S + +EMA A++ T + + P ID+ +A+ +
Sbjct: 275 GKGIPQTLRAFARVVCATTS--EELDEMAKFAAD--TDGRLARRPSIDKTKEHKAMTLLQ 330
Query: 423 DSCESSISKYSR 434
++ I K+ +
Sbjct: 331 TVIDNRIQKHEQ 342
>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
Length = 1658
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 146/306 (47%), Gaps = 14/306 (4%)
Query: 79 LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+KWL+D G+ K+ I D RG+V K + + E ++ VP +I D P
Sbjct: 1185 FEKWLTDGGVHFPKLQIANFNDSTGRGVVTTKKVEENECVVSVPRKFLINVDCARKHPVL 1244
Query: 138 GEVL--KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+L + + D +L ++I E +S W + LP + +++T EL LE
Sbjct: 1245 NSILFEEATGLNDDTILFLFVIYEKE-NPNSFWRPFFDTLPSYFPTSIHYTTTELLE-LE 1302
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+ + E I+ ++ L + ++YPD+FPE +F ME F W+ + SR ++L
Sbjct: 1303 GTNLFEETIQIKEHLESIRELLFPELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-K 1361
Query: 256 MDGRVA--LVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
+DGR+ LVP ADM+NH + + Y D+ + + Q+F+ YG +
Sbjct: 1362 IDGRIVNCLVPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSW 1421
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSD--KCYKEKLEALRKYGLSASECFPIQITGW 370
EL L YGFV N DSV + + + D + +EK + L ++ L+ + + +
Sbjct: 1422 ELALYYGFVI--SNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHY-LHRSNI 1478
Query: 371 PLELMA 376
P +L+A
Sbjct: 1479 PSKLLA 1484
>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
Length = 492
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
boliviensis boliviensis]
Length = 513
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 144/315 (45%), Gaps = 24/315 (7%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KW S++G + + GL A ++I+ E L+VP L++T +S V
Sbjct: 2 KWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KNSV 56
Query: 141 LKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
L D L LA +L+ E + +S W YI LP + + LY+ E+ RY
Sbjct: 57 LGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RY 114
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL 250
L+++Q + N Y ++ +P L ++ F E ++W+ + +R
Sbjct: 115 LQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQ 173
Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
++P+ DG +AL+P DM NH+ + T Y+ + +Q GEQ++I YG
Sbjct: 174 NQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIFYG 232
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
+SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 233 TRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 290
Query: 368 TGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 291 TEPPISAQLLAFLRV 305
>gi|171678927|ref|XP_001904412.1| hypothetical protein [Podospora anserina S mat+]
gi|170937534|emb|CAP62192.1| unnamed protein product [Podospora anserina S mat+]
Length = 466
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 146/295 (49%), Gaps = 21/295 (7%)
Query: 72 SLENASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+L ++++ WL SG + + + V RG+ AL+ +KGE++L +P ++ T +
Sbjct: 16 TLSRPNSMESWLKLSGAVGLDDLELADFPVTGRGVRALRRFKKGERILTIPCGVLWTVEH 75
Query: 131 KWSCPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYW 185
++ P G L+ SV D +LATY++ S E ++++ALP S +++
Sbjct: 76 AFADPLLGPALRSARPPLSVED--ILATYILFIRSRESGYDGLRSHVAALPTSYSSSIFF 133
Query: 186 TRAELDRYLEASQIR-ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
++ EL+ S + ++R ++ Y L + + +++ DL P + F +E W+
Sbjct: 134 SKDELEVCAGTSLYTITKQLDR--SIDDDYRALVVGVLAQHRDLLPLDKFTIE--DWALC 189
Query: 245 ILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
++SR + DG L P+ADMLNHS EV+ YD SS + + Y+ G+Q
Sbjct: 190 TVWSRAMDFALPDGNSIRLLAPFADMLNHSSEVKPCHVYDVSSGNLSVLAGKDYEAGDQA 249
Query: 303 FISYGKKSNGELLLSYGFVPRE----GTNPSDSVELPLSLKKSDKCYKEKLEALR 353
FISYG N LL YGFV + + +PL+L +D K L LR
Sbjct: 250 FISYGPIPNSRLLRLYGFVQKHKLWVSAGLDSTCTIPLTL--TDPLPKNVLRYLR 302
>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
large subunit N-methyltransferase, chloroplastic-like
[Glycine max]
Length = 503
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 27/309 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
L+ W+ GLPP K+ ++ + A ++++ G+ VP SLV+T +
Sbjct: 71 LKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERVL 130
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E K S W YI L RQ S L W
Sbjct: 131 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLW 190
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
+++ELD YL S I++ I+R + Y +L +F +YP P E F+ E F
Sbjct: 191 SKSELD-YLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
K +F + S +V L S+ R ALVP +L++ + L V DR Y
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAML--TAVDGAVELAVDRPY 307
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ G+ + + G + N +LL++YGFV +N D + + +L D Y++K ++ G
Sbjct: 308 KAGDPIVVWCGPQPNSKLLINYGFVDENNSN--DRLIVEAALNTEDPQYQDKRMVAQRNG 365
Query: 357 LSASECFPI 365
+ + F +
Sbjct: 366 KLSVQVFHV 374
>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
Length = 459
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 149/318 (46%), Gaps = 37/318 (11%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KW S++G +AI+ G+ GLVA + I + + L +P LV+T S + G +
Sbjct: 49 KWCSENGAYLGSVAIKDRPDGDYGLVAEEKIEESMQFLGIPMKLVMTTASARKS-KLGPL 107
Query: 141 LKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
L+ D P+ LA +LI E S +SS W YIS LP ++LY+ EL+
Sbjct: 108 LR-----DDPIMKSMSNVALAIFLILELSAGESSFWHPYISVLPDSFNTVLYFNIEELE- 161
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSR 249
L S + + A++ ++ Y +IF +P L ++ F + ++W+ + +R
Sbjct: 162 LLSGSAVLDEALKLHRSIARQYAYFH-KIFRTHPLAKSLPFKDCFTYDLYRWAVSAVMTR 220
Query: 250 LVRLP------------SMDGRVA---LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+P +DG A LVP DM NHS + + DYD S+ V R
Sbjct: 221 QNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHS-DGKVLTDYDSSASMVRCYAMR 279
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+ GE+V I YGK++N E + GFV + N D+V++ L + K D + K +
Sbjct: 280 DFDKGEEVTIFYGKRTNAEFFIHNGFVFED--NRYDAVDIKLGVSKKDPLFAVKSKLCED 337
Query: 355 YGLSASECFPIQITGWPL 372
+ LS S F + P+
Sbjct: 338 HDLSLSGTFALVARDRPV 355
>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
Length = 595
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI +LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Callithrix jacchus]
Length = 610
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 97 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 152
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 153 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 209
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 210 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 268
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 269 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 327
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 328 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 385
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 386 HFTEPPISAQLLAFLRV 402
>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
Length = 597
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 84 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 138
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 139 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 196
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 197 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 255
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 256 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 314
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 315 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 372
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 373 HFTEPPISAQLLAFLRV 389
>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
Length = 593
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + I + GL A + I+ E L+VP L++T +S
Sbjct: 82 LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRRLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYNTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387
>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
Length = 595
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
Length = 595
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
Length = 595
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
Length = 430
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 165/357 (46%), Gaps = 25/357 (7%)
Query: 90 PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
P + +++ ++G GL A +++ K ++++ +P +L + D+ E GE + W
Sbjct: 9 PGGVEVRRGELG-LGLFAKRSVSKNQEVVSIPKTLWMDVDTV-RRSEIGECC--AGLRPW 64
Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
+A YL+ E + + S WS YI LPR S L+W+ EL L+ +Q+
Sbjct: 65 IAVALYLLHEKA-KPHSDWSAYIRVLPRTLDSPLFWSEEELAE-LKGTQLLSSINGFKEF 122
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
+ Y+ + + PD+F ++ +E F W+FGIL SR P + +ALVP AD +
Sbjct: 123 LKREYDKVMTEVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFP-PLIGDNLALVPLADFV 181
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKK-SNGELLLSYGF 320
NH + Y VF ++V + YGKK N +L YGF
Sbjct: 182 NHGFGLTNEDPYWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGF 241
Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWPLELMAYAY 379
V + N DS L L + S++ +K++ + GL ++ F + + G P +++AY
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLR 301
Query: 380 LVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSISKYS 433
L+ S + A + T ++ P E +E + +++ C +++ +YS
Sbjct: 302 LIALFGS-----DSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYS 353
>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
Length = 456
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 157/326 (48%), Gaps = 29/326 (8%)
Query: 79 LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
LQ+WL G + + A + D G RGL+A ++++ GE ++ +P + +IT ++
Sbjct: 36 LQRWLKGRGFQGRHLRAAEFADTG-RGLMATRDLKPGELIIALPETCLITTETVLQ-SYL 93
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD-RYL 194
G+ ++ PLLA T+LI+E + S+W Y+ +P +YW EL+ +L
Sbjct: 94 GKYIRLWRPHVSPLLALCTFLIAERFAGERSQWKPYLDVIPSTYSCPVYW---ELEIVHL 150
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLV 251
+ +R++A+E+ T V + + L F+ LF + V ++ T+ +W++ + +R V
Sbjct: 151 LPAPLRQKALEQKTEVQELHTE-SLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTV 209
Query: 252 --------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
RL + AL P+ D+LNHS EV+ ++ K + T+ + +Q F
Sbjct: 210 YMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAF 269
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
I YG N LLL YGFV NP SV + L DK +K L+++
Sbjct: 270 ICYGPHDNQRLLLEYGFVA--ANNPHRSVYVTKDAILAHLSPGDKQMPKKWALLKEHDFL 327
Query: 359 ASECFPIQITGWPLELMAYAYLVVSP 384
+ F ++ W L L A L + P
Sbjct: 328 VNLTFGLEGPSWKL-LTAVKLLCLRP 352
>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 502
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/391 (26%), Positives = 177/391 (45%), Gaps = 40/391 (10%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
+WLSD G+ K + V E GL+A ++I + E +L +P L I D+ A +
Sbjct: 57 QWLSDQGVVSGKSPAKPGVVKEGLGLIAERDIARNEVVLEIPKKLWINPDAV----AASD 112
Query: 140 VLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
+ CS + W +A +LI E ++ S W Y+ LP S +YW + Y+
Sbjct: 113 IGNVCSGLKPWISVALFLIREKLKKEGSTWWPYLDILPDTTNSTIYWWVLLVAFYVLVLS 172
Query: 199 IRERAIERITNVIGT----------------YNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
+ R+ E + + GT + + I + +LFP + ++ F W+
Sbjct: 173 FQRRSEEELAELQGTQLLRTTLGVKEYMQREFAKVEEEILLPHKELFPSPI-TLDDFLWA 231
Query: 243 FGILFSR-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY----- 296
FGIL SR RL + + L+P AD++NHS ++ T + G +F+ + +
Sbjct: 232 FGILRSRAFSRLRGQN--LVLIPLADLINHSPDITTEDYAYEIKGGGLFSRELLFSLRSP 289
Query: 297 ---QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
+ GEQV I Y KSN EL L YGF+ E T ++ L L + +SD + +KL+
Sbjct: 290 ISVKSGEQVLIQYDLNKSNAELALDYGFI--EKTPDRNTYTLTLQISESDPFFGDKLDIA 347
Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
G + F I + G PL YL + F + + N + ++
Sbjct: 348 ETNGSGETADFDI-VLGNPLPPAMLPYLRLVALGGTDAF-LLESIFRNTIWGHLELPISR 405
Query: 413 IDEQAL-QFILDSCESSISKYSRFLQVKELL 442
+E+ + + + D+C+S++S Y ++ E L
Sbjct: 406 ANEELICRVVRDACKSALSGYHTTIEEDEKL 436
>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Nomascus leucogenys]
gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Nomascus leucogenys]
Length = 595
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
Length = 585
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R L+ +Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTDPPVSAQLLAFLRV 387
>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
Length = 594
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
Length = 594
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 595
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Pongo abelii]
gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Pongo abelii]
Length = 595
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
[Vitis vinifera]
Length = 509
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 143/312 (45%), Gaps = 29/312 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
L+ W+ ++GLPP K+ +++ + A ++++ G+ VP SLV+T +
Sbjct: 78 LKSWMHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVL 137
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
E+L + + LA YL+ E K S W YI L RQ S L W
Sbjct: 138 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLW 197
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
+ +EL YL S + +ER + YN+L +F +YP P E F E F
Sbjct: 198 SESEL-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIF 256
Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 295
K +F + S +V L S+ R ALVP +L + + L D S Q VV DR
Sbjct: 257 KQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVV---DRP 313
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
Y+ GE + + G + N +LLL+YGFV + N D + + +L D Y++K ++
Sbjct: 314 YKAGESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDKRMVAQRN 371
Query: 356 GLSASECFPIQI 367
G + F + +
Sbjct: 372 GKLTVQKFHVSV 383
>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
gorilla gorilla]
Length = 594
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
troglodytes]
gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
troglodytes]
gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
paniscus]
gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
paniscus]
gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
Length = 594
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
Length = 597
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 84 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 138
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 139 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 196
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 197 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 255
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 256 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 314
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 315 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 372
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 373 HFTEPPISAQLLAFLRV 389
>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
Length = 588
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HYTDPPVSAQLLAFLRV 387
>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
melanoleuca]
Length = 591
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R L+ +Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTDPPVSAQLLAFLRV 387
>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
garnettii]
Length = 595
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI +LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIYDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
Length = 589
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HYTDPPVSAQLLAFLRV 387
>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
africana]
Length = 597
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEVVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R+L+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
pisum]
Length = 463
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 153/313 (48%), Gaps = 19/313 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-- 136
L KW + +G + I + + G+ A KNI G+KL+ VP +L++T ++ S P
Sbjct: 89 LTKWATKNGAILNGVEIHQFENYAYGMKANKNITVGDKLVTVPRALMMTEENIPSSPLWK 148
Query: 137 -AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ + ++P+ L L+ +K S W +Y++ LP + +Y+ A+L+ L+
Sbjct: 149 LHSQDMMLRNMPNVALAIFILVESLRKDKKSFWHSYLTTLPVTYSTPVYFDVADLEA-LK 207
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVR 252
S E A++ N+ Y + ++F D + ++ F E ++W+ L SR
Sbjct: 208 GSPAFEAALKLNRNIARQYAYFK-KLFQLSNDPASVILKDTFTYEYYRWAVSTLMSRQNT 266
Query: 253 LPSMDG----RVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
+PS D AL+P DM NH S + T D+ KSS V D Y EQV+I YG
Sbjct: 267 VPSSDNPSENVSALIPLWDMFNHRSGRLST--DFVKSSNVCVCYADGDYAADEQVYIFYG 324
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
++N + L+ GFV + N D+V++ L + +SD Y + L+ L A F +
Sbjct: 325 VRTNADFLVHNGFVYPD--NEHDAVKIRLGVSRSDPLYSLRYRLLQTLSLPALAEFYLTP 382
Query: 368 TGWPLE--LMAYA 378
+P++ L+A+
Sbjct: 383 GPFPVDGKLLAFV 395
>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
novemcinctus]
Length = 589
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 146/313 (46%), Gaps = 16/313 (5%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S + G
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSM-LG 140
Query: 139 EVLKQCSVPDWP---LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ Q + LA +L+ E + +S W YI +LP + + LY+ E+ RYL
Sbjct: 141 PLYSQDRILQAMGNITLAFHLLCERA-NPNSFWQPYIQSLPGEYDTPLYFEEDEV-RYLH 198
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVR 252
++Q + N Y ++ +P L ++ F E ++W+ + +R +
Sbjct: 199 STQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQ 257
Query: 253 LPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I YG +
Sbjct: 258 IPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIFYGTR 316
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F + T
Sbjct: 317 SNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTE 374
Query: 370 WPLELMAYAYLVV 382
P+ A+L V
Sbjct: 375 PPISAQLLAFLRV 387
>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
Length = 512
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 144/315 (45%), Gaps = 24/315 (7%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KW S++G + + GL A ++I+ E L+VP L++T +S V
Sbjct: 2 KWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KNSV 56
Query: 141 LKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
L D L LA +L+ E + +S W YI LP + + LY+ E+ RY
Sbjct: 57 LGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV-RY 114
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL 250
L+++Q + N Y ++ +P L ++ F E ++W+ + +R
Sbjct: 115 LQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQ 173
Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I YG
Sbjct: 174 NQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIFYG 232
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
+SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 233 TRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 290
Query: 368 TGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 291 TEPPISAQLLAFLRV 305
>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
Length = 585
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P + P ++ F E ++W+ + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HSTEPPVSAQLLAFLRV 387
>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Strongylocentrotus purpuratus]
Length = 682
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 160/317 (50%), Gaps = 22/317 (6%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
T KWL+ +G+ + + K D G GL A ++I+ ++L+ +P +++T + P
Sbjct: 82 TFFKWLNTNGVTTDAVKMAKFDEG-YGLQATQDIKMDQELMNIPRKVMMTDQNAVDSPTI 140
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDR 192
G++++ +P+ L A +++SE + S W Y+ LP YSL LY+T E+ +
Sbjct: 141 GDLVRGDRLLKGMPNVSL-AIFILSE-KLKSDSFWKPYLDVLP-SSYSLPLYFTPDEI-Q 196
Query: 193 YLEASQIRERAIERITNVIGTYNDL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
+ S + +++ N+ Y L +L + L E F + ++W+ + +R
Sbjct: 197 LFQGSTMYGECLKQHKNIARQYAYLFKLLNLPENSKLHIREYFTYDFYRWAVSTVMTRQN 256
Query: 252 RLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
++P+ DG+ ++L+P DM NH+ E D+ + V R + GEQ+FI YG+
Sbjct: 257 QIPAKDGKGMSLSLIPLWDMCNHA-NGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGR 315
Query: 309 KSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCY--KEKLEALRKYGLSASECFPI 365
+S+ +LLL GFV P N D + + L L SD+ Y K +L ++ K G+ S+ + I
Sbjct: 316 RSSADLLLYSGFVYP---GNVYDGMAIQLGLSSSDRLYAMKAQLCSVMKLGV-PSQNYHI 371
Query: 366 QITGWPLELMAYAYLVV 382
P+ L +L +
Sbjct: 372 SAGKEPVTLELLTFLRI 388
>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
Length = 1339
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/250 (29%), Positives = 118/250 (47%), Gaps = 13/250 (5%)
Query: 79 LQKWLSDSGLPPQKMAIQK-VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+ WL G+ K+ I D RG+V K + + E ++ VP +I D + P
Sbjct: 760 FENWLKAGGVQFPKLQIANFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVDVAKAHPIL 819
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
G + ++ + D +L ++I E +S W + LP + ++++ EL LE +
Sbjct: 820 GPIFEELHLNDDTILFLFVIYEKG-NANSFWRPFYDTLPSYFTTSIHYSATELLE-LEGT 877
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ E + + ++ D SK YPD+FPE F+ E F W+ +L SR ++L +
Sbjct: 878 NLFEETL-HTKQQLNSFRDYLFPELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KI 935
Query: 257 DGRVA--LVPWADMLNHSCEV---ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
DG + LVP ADM+NH E F D+D SQ + Q+F+ YG N
Sbjct: 936 DGSIKSCLVPMADMINHHTNAQISERFFDHD--SQSFKMISSCNIPANNQIFLHYGALQN 993
Query: 312 GELLLSYGFV 321
EL L YGF+
Sbjct: 994 WELALYYGFI 1003
>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 487
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 164/357 (45%), Gaps = 25/357 (7%)
Query: 82 WLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
WL +SG + + + V RG+ L+ ++GE++ +P ++ T + ++ G
Sbjct: 22 WLKESGAVGLNALELANFQVIGRGVRTLRCFKEGERIFTIPADVLWTVEHAYADSLLGPA 81
Query: 141 LKQC----SVPDWPLLATYLISEASFEKSSRW-SNYISALPRQPYSLLYWTRAELDRYLE 195
L+ SV D LA Y++ S E ++++ LP+ S +++T EL+ +
Sbjct: 82 LRSARPPLSVDD--TLAMYILFVRSRESGYDGPRSHLATLPKSYSSSIFFTDDELE--VC 137
Query: 196 ASQIRERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR-- 252
A +R+ I Y L +R+ ++ DLFP + F +E +KW+ ++SR +
Sbjct: 138 AGSSLYALTKRLGRCIEDDYRALVVRLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFV 197
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE-----QVFISYG 307
LP + P+ADMLNHS EV YD S + + Y+ G+ QVFI YG
Sbjct: 198 LPGGKSIRLMAPFADMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYG 257
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
N LL YGFV NP+DS +L L +++K + G ++ I +
Sbjct: 258 SIPNNRLLRLYGFV--MPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISITL 315
Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
T PL YL + + ++A+ A ++ K + + + LQ +++S
Sbjct: 316 TD-PLPKNVLGYLRIQ----RSDESDLASIARQRIDPKYEKISDSNEVEVLQSLIES 367
>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
Length = 456
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 89/312 (28%), Positives = 150/312 (48%), Gaps = 28/312 (8%)
Query: 79 LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
LQ+WL G + + A + D G RGL+A ++++ GE ++ +P + +IT ++
Sbjct: 36 LQRWLKGRGFQGRHLRAAEFADTG-RGLMATRDLKPGELIIALPETCLITTETVLQ-SYL 93
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-YL 194
G+ ++ PLLA T+LI+E S+W Y+ +P +YW EL+ +L
Sbjct: 94 GKYIRLWRPHVSPLLALCTFLIAERFAGDCSQWKPYLDVIPSTYSCPVYW---ELEIIHL 150
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLV 251
+ +R++A+E+ T V + + L FS LF + V ++ T+ +W++ + +R V
Sbjct: 151 LPAPLRKKALEQKTEVQELHTE-SLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTV 209
Query: 252 --------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
RL + AL P+ D+LNHS EV+ ++ K + T+ + +Q F
Sbjct: 210 YMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAF 269
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
I YG N LLL YGFV NP SV + L DK +K L+++
Sbjct: 270 ICYGPHDNQRLLLEYGFVA--ANNPHRSVYVTKDAILAHLSPGDKQMPKKWALLKEHDFL 327
Query: 359 ASECFPIQITGW 370
+ F I+ W
Sbjct: 328 VNLTFGIEGPSW 339
>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 507
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +W ++G + + GL A + I+ E L+VP L++T +S G
Sbjct: 7 LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 61
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 62 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 119
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL+++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 120 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 178
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 179 RQNQIPTEDGSRVTLALIPLWDMCNHTNSLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 237
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 238 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 295
Query: 366 QITGWPLELMAYAYLVV 382
+T P+ A+L V
Sbjct: 296 HVTEPPISAQLLAFLRV 312
>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
Length = 592
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 146/319 (45%), Gaps = 24/319 (7%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
S L KW ++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 80 SELIKWAGENGAFTDGFEVANFEEEGFGLKATRDIKAEELFLWVPRKLLMTVESA----- 134
Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
+L D L LA +L+ E + +S W YI LP + + LY+ E
Sbjct: 135 KNSILGSLYSQDRILQAMGNITLAFHLLCE-RYNPNSFWLPYIQTLPNEYNTALYFEEDE 193
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
+ +YL+++Q + N Y ++ +P+ L ++ F + ++W+ +
Sbjct: 194 V-QYLQSTQAIHDIFSQYKNTARQYAYF-YKVVQTHPNASKLPLKDSFTYDDYRWAVSSV 251
Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++
Sbjct: 252 MARQNQIPAEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLKDDRCECVALQDFKAGEQIY 310
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
I YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F
Sbjct: 311 IFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF 368
Query: 364 PIQITGWPLELMAYAYLVV 382
+ T P+ A+L V
Sbjct: 369 ALHSTEPPISAQLLAFLRV 387
>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
griseus]
gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
Length = 577
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
norvegicus]
Length = 416
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 21 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 75
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 76 SILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 133
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 134 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 192
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q G+Q++I
Sbjct: 193 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 251
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 252 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 309
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 310 HFTEPPISAQLLAFLRV 326
>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 445
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 164/354 (46%), Gaps = 38/354 (10%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL +SG + + + + V RG+ L+ +GE++L +P ++ T + ++ P
Sbjct: 1 MEGWLKESGAVGLDALELAEFPVIGRGVRTLRRFNEGERILTIPRDVLWTVEHAYADPLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G VL+ SV D LATY++ S E ++++A+P+ S +++T EL+
Sbjct: 61 GPVLRSARPPLSVDD--TLATYILFVRSRESGYDGLRSHLAAVPKSYSSSIFFTEDELEV 118
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
S + IT +G R E+ + +KW+ ++SR +
Sbjct: 119 CAGTS------LYAITKQLG-------RCI--------EDDYRALVYKWALCTVWSRAMD 157
Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
DG+ L P+ADMLNHS EV YD S + + Y+ G+QVFI YG
Sbjct: 158 FALPDGKSVRLLAPFADMLNHSSEVRQCHAYDPLSGNLSILAGKGYEAGDQVFIHYGSVP 217
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N LL YGFV +NP+DS +L L +++K + GL ++ + +T
Sbjct: 218 NNRLLRLYGFVIP--SNPNDSYDLVLETHPLAPFFEQKRKLWALAGLDSTSTISLTLTD- 274
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
PL YL + + ++AA A + K + + +ALQF+++S
Sbjct: 275 PLPNNVLRYLRI----QRSDESDLAAVALQQADPKYEKISNSSEVEALQFLIES 324
>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 140/292 (47%), Gaps = 24/292 (8%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-------LATYL 156
GL A K+I+ E L++P +++T +S VL D L LA +L
Sbjct: 107 GLKATKDIKAEELFLWIPRKMLMTVESA-----KNSVLGPLYSQDRILQAMGNVTLALHL 161
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
+ E + SS W YI LP + + LY+ E+ R+L A+Q + + + N Y
Sbjct: 162 LCERA-NPSSPWLPYIKTLPSEYDTPLYFEEEEV-RHLLATQAIQDVLSQYKNTARQYAY 219
Query: 217 LRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLN 270
++ +P+ L ++ F + ++W+ + +R ++P+ DG +AL+P DM N
Sbjct: 220 F-YKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
H+ + T Y+ + Y+ GEQ++I YG +SN E ++ GF + N D
Sbjct: 279 HTNGLIT-TGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFED--NAHD 335
Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
V++ L + KS++ Y K E L + G+ AS F + + P+ A+L V
Sbjct: 336 RVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCSEPPISAQLLAFLRV 387
>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase, chloroplast precursor, putative
[Ricinus communis]
Length = 456
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 170/359 (47%), Gaps = 34/359 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A K+I+ G+ +L VP S I +D+ PE ++L V LA L+ +
Sbjct: 47 RSLFASKSIQTGDCILRVPYSAQIASDNL--LPELSDLLGD-EVGSVAKLAIVLLVDQKV 103
Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S+W+ YIS LP+ + +S ++W+++ELD ++S +E I++ + + ++
Sbjct: 104 GQESKWAPYISRLPQLGEMHSTIFWSKSELDMIFQSSVYKE-TIKQKAQIEKDFLTIK-P 161
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ +P + F + F ++ ++ SR S G V+L+P+AD LNH E +
Sbjct: 162 VLEHFPQISRSITF--QDFMHAYALVKSR--AWGSTKG-VSLIPFADFLNHDGFSEAVVL 216
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLK 339
D+ Q DR Y P E+V I YGK SN LLL +GF +P N + VE+ +++
Sbjct: 217 NDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLP---YNIHEQVEIQINIP 273
Query: 340 KSDKCYKEKLEALRKYGLSAS----------ECFPIQIT--------GWPLELMAYAYLV 381
D + K E LR + + A+ + F I+ G P L A+A ++
Sbjct: 274 DHDTLREMKFEILRLHHIPATKDDNGFNSSWDSFLIKEVRSAGGKGKGLPQSLRAFARVL 333
Query: 382 VSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQVKE 440
+ AA ++ +++ ++ + QA + +L I +YS +++ E
Sbjct: 334 CCTSHQDLNDLVLEAAQTDGRLARRALENSSREIQAHEILLSRINQVIEEYSASIKLLE 392
>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 463
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 140/310 (45%), Gaps = 40/310 (12%)
Query: 54 RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
+T+ ++ N +PW I + +TL S G R L A K I
Sbjct: 37 QTQASLDNNFLPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIYA 81
Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
G+ +L VP + IT D P VL V + +LA LI E + SRW YIS
Sbjct: 82 GDCMLKVPFNAQITPDE---LPSDIRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYIS 138
Query: 174 ALPR--QPYSLLYWTRAELDRY----LEASQIRERA-IERITNVIGTYNDLRLRIFSKYP 226
LP+ + +S ++W EL + ++++A IE+ + + I ++ P
Sbjct: 139 RLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQAFKQHCPIVTERP 198
Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ 286
DL E F +++ ++ SR R++L+P+AD +NH + + D+ +Q
Sbjct: 199 DL--------EDFMYAYALVGSRAW---ENSKRISLIPFADFMNHDGLSASIVLRDEDNQ 247
Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCY 345
T DR Y PG++VFI YG+ SN L+L +GF P N D V++ + + D
Sbjct: 248 LSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFP---YNIHDEVQIQMDVPNDDPLR 304
Query: 346 KEKLEALRKY 355
KL L+ +
Sbjct: 305 NMKLGLLQTH 314
>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
tropicalis]
gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 581
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +W ++G + + GL A + I+ E L+VP L++T +S G
Sbjct: 81 LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 135
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 136 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 193
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL+++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 194 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 252
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 253 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 311
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 312 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 369
Query: 366 QITGWPLELMAYAYLVV 382
+T P+ A+L V
Sbjct: 370 HVTEPPISAQLLAFLRV 386
>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
terrestris]
Length = 484
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 170/363 (46%), Gaps = 24/363 (6%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
WL +G ++ + + GL A +N + E +L +P L+ + + + PE +
Sbjct: 88 WLKQNGANVYGASVAEFPGYDLGLKAERNFLENELILRIPRELIFSIHN--AAPELVALQ 145
Query: 142 KQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
+ P LA L+ E E S+W Y+ LP ++LY T A+++ L+ S
Sbjct: 146 NDPLLQLMPQVALAIALLIEKHKE-YSKWKPYLDILPTTYTTVLYMTAADMNE-LKGSPT 203
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLPSM 256
E A+++ N+ Y ++F K + +VF E + W+ + +R +PS
Sbjct: 204 LEAALKQCRNIARQYAYFN-KLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262
Query: 257 DGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
DG + AL+P DM NH + + D++ + R ++ EQ+FISYG ++N +
Sbjct: 263 DGSLMIHALIPMWDMCNHE-DSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSD 321
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
+ GFV + N D +L L + K+D +KE++E L K L A F ++ P+
Sbjct: 322 FFVHSGFVYMD--NEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPGTEPIS 379
Query: 374 LMAYAYLVVSPPSMKGKFEEMAA-AASNKMTSKKDIKCP---EIDEQALQFILDSCESSI 429
A+L V SM+ EE+A S+++ K + C ++E +F+L + I
Sbjct: 380 DTLLAFLRVF--SMRK--EELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLI 435
Query: 430 SKY 432
+ Y
Sbjct: 436 ANY 438
>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 495
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/362 (28%), Positives = 165/362 (45%), Gaps = 43/362 (11%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
V RG+ AL+++ GE +L VP SL++ + + P G +L + D +A +LI E
Sbjct: 74 VAGRGVFALRDLAAGETVLRVPLSLLLNVEHASASPLGG-ILDDFRLSDAEAMAFWLIYE 132
Query: 160 ASF-EKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
+ E++S W Y+ +LP L +++ E+ R L+AS + E R + +
Sbjct: 133 LTRPERASPWLPYLESLPASIKQLTMFYDPFEMKR-LQASPVAEFTSRRTVKMRNKFGKY 191
Query: 218 RLRIFSKYPDL-----FPEEVFNMETFKWSFGILFSRL----VRLPSMDGR----VALVP 264
R +I P FP E+ ++ F W+ + F+RL V+ P+ DG LVP
Sbjct: 192 REQISKHRPAHLAEIEFPVELITVDDFLWAMAVQFTRLITVQVKHPA-DGEWERTKCLVP 250
Query: 265 WADMLNHS------CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG---KKSNGELL 315
AD+LN + E T LD S T R G+++ YG + SNG+L+
Sbjct: 251 LADLLNTAPADQINVECATNLD----STHFECATIRPVAEGQELLTPYGGAEQLSNGQLI 306
Query: 316 LSYGFVPREGTNPSDSVELPL-SLKKSDKCYKEKLEALRKYGLSASECFPIQI----TGW 370
+ YG R NPSD V LP+ L+++ Y K+ L L + + +
Sbjct: 307 MDYGVTFR--NNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFESI 364
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSIS 430
P EL+A+A + VS PS E + M + I P + +AL+ +L I
Sbjct: 365 PKELLAFARVYVSTPSDLSDLEHVL----ELMKEHRAIN-PSNERRALELLLQLTNEMIL 419
Query: 431 KY 432
KY
Sbjct: 420 KY 421
>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
Length = 507
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 147/320 (45%), Gaps = 27/320 (8%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE------RGLVALKNIRKGEKLLFVPPS 123
I E L+ W+ GLPP ++ +++ + + + A ++++ G+ +P S
Sbjct: 67 IRGKEEEVDLKSWMHRHGLPPCRVMLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNS 126
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP---- 179
L++T + E+L + + LA YL+ E S W +I L RQ
Sbjct: 127 LIVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQ 186
Query: 180 ---YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFP 230
S L W+ EL +Y S ++E +ER + + Y +L +F +YP P
Sbjct: 187 LAVESPLLWSSEEL-KYFTGSPMKEIMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIP 245
Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMD--GRVALVPWAD-MLNHSCEVETFLDYDKSSQG 287
E F E FK +F + S +V L +++ R ALVP +L++ + L S
Sbjct: 246 TEAFPFEIFKQAFVAVQSCVVHLQNVNLARRFALVPLGPPLLSYKSNCKAMLKAVGDS-- 303
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
V DR+Y+ GE + + G + N LLL+YGFV + NP D + + +SL D Y++
Sbjct: 304 VQLEVDREYKAGEPIVVWCGPQPNARLLLNYGFVDED--NPHDRLIVEVSLDTKDPLYQD 361
Query: 348 KLEALRKYGLSASECFPIQI 367
K ++ G + + F I I
Sbjct: 362 KRIIAQRNGKLSVQTFNIYI 381
>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=Endothelial differentiation inhibitory protein D10;
AltName: Full=SET domain-containing protein 3
gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
Length = 594
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L +E F E ++W+ + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q G+Q++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387
>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
Length = 582
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +W ++G + + GL A + I+ E L+VP L++T +S G
Sbjct: 82 LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL+++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
+T P+ A+L V
Sbjct: 371 HVTEPPISAQLLAFLRV 387
>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
Length = 597
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 85 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 139
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 140 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 197
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L +E F E ++W+ + +
Sbjct: 198 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 256
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q G+Q++I
Sbjct: 257 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 315
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 316 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 373
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 374 HSTEPPISAQLLAFLRV 390
>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
rubripes]
Length = 598
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 146/313 (46%), Gaps = 16/313 (5%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L W D G + A+ GL A ++I+ E L++P +++T +S G
Sbjct: 82 LMSWARDHGASCEGFAVTNFGAEGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSV-LG 140
Query: 139 EVLKQCSV---PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ Q + D LA +L+ E + +S W YI LP++ + L++ + E+ + L+
Sbjct: 141 PLYNQDRILQAMDNVTLALHLLCERA-NPASFWLPYIRTLPQEYDTPLFYEQDEV-QLLQ 198
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVR 252
+Q + + + N Y ++ +P L ++ F + ++W+ + +R +
Sbjct: 199 GTQAVQDVLSQYRNTARQYAYF-YKLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQ 257
Query: 253 LPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+P+ DGR +AL+P DM NH + T Y+ + Y+ EQ++I YG +
Sbjct: 258 IPTEDGRQVTLALIPLWDMCNHRNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYGTR 316
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
SN E ++ GF +E N D V++ L + KS++ Y K E L + G+ S F +
Sbjct: 317 SNAEFVIHNGFFYQE--NAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCNE 374
Query: 370 WPLELMAYAYLVV 382
P+ A+L V
Sbjct: 375 QPISAQLLAFLRV 387
>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
Length = 585
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 146/318 (45%), Gaps = 24/318 (7%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 70 NLMKWASENGASVEGFEMFNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS--- 126
Query: 138 GEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 127 --VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV 183
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILF 247
R L+++Q + N Y ++ +P + P ++ F E ++W+ +
Sbjct: 184 -RSLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVM 241
Query: 248 SRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+R ++P+ DG +AL+P DM NH+ + T Y+ R ++ GEQ++I
Sbjct: 242 TRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYI 300
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F
Sbjct: 301 FYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 358
Query: 365 IQITGWPLELMAYAYLVV 382
+ P+ A+L V
Sbjct: 359 LHFMEPPISAQLLAFLRV 376
>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 492
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 151/327 (46%), Gaps = 31/327 (9%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
++ S L W+ ++G + I+ + GL A K++ + E + +P ++++T D+
Sbjct: 83 DHFSNLISWIKENGGVADNVTIKHFNEMGYGLEAAKDLEESELICAIPKNVMMTLDNVKV 142
Query: 134 CP-----EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
P E +LK LA +LI E ++S W +YIS+LP ++LY+
Sbjct: 143 SPLKYLYENNPILKNMGNV---ALALFLILEHVKNENSFWHHYISSLPSDYNTVLYF--- 196
Query: 189 ELDRYLEA--SQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSF 243
+L+ +LE S E A + N+ Y +F D L VF + ++W+
Sbjct: 197 DLNDFLEMKNSPTFEMATKHCKNIARQYAYFN-NLFQNSNDEASLILRNVFTYQLYRWAV 255
Query: 244 GILFSRLVRLPSM-------DGRVALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQ 295
+ +R +PS +G L+P DM NH+ + T D+S +
Sbjct: 256 STVMTRQNFIPSSSTSNDVENGINGLIPLWDMCNHTNGYLSTQYKVDRSECLAC----KP 311
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
++ GEQV I YG++SN + L+ GFV E NP DS L L + KSDK + + E L+
Sbjct: 312 FKKGEQVLIFYGERSNSDFLVHNGFVYDE--NPHDSFRLRLGISKSDKLHGLRCELLKDL 369
Query: 356 GLSASECFPIQITGWPLELMAYAYLVV 382
G+ S F + P+ A+L +
Sbjct: 370 GIPDSGDFYLYSGSEPVRENLLAFLRI 396
>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
max]
Length = 463
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 132/269 (49%), Gaps = 26/269 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A K I+ G+ +L VP + ITAD+ PE ++ + V + LAT ++ E
Sbjct: 61 RSLFASKIIQTGDCILKVPYRVQITADNL--LPEIRSLIGE-EVGNIAKLATVILIEKKL 117
Query: 163 EKSSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S W YIS LP+Q ++ ++WT +EL+ + S + + I++ + + + ++
Sbjct: 118 GQGSEWYPYISCLPQQGELHNTVFWTESELE-MIRPSSVYQETIDQKSQIEKDFLAIK-H 175
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLP-------SMDGRVALVPWADMLNHS 272
IF F + + + +LF V LP S +G +AL+P+AD LNH
Sbjct: 176 IFECSHQSFGDSTYKDFMHACTL-VLFDHFNVELPVGSRAWGSTNG-LALIPFADFLNHD 233
Query: 273 CEVETFL--DYDKSS---QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGT 326
E + D DK Q + DR Y PGEQV I YGK SN L+L +GF +P
Sbjct: 234 GVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIP---Y 290
Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKY 355
N D V++ + K D KLE L +Y
Sbjct: 291 NIYDQVQIQFDIPKHDPLRDMKLELLHQY 319
>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
[Vitis vinifera]
Length = 515
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/319 (29%), Positives = 140/319 (43%), Gaps = 37/319 (11%)
Query: 79 LQKWLSDSGLPP-------------QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
L+ W+ ++GLPP Q AI + E L ++ G+ VP SLV
Sbjct: 78 LKSWMHENGLPPCKVVLKERPSHHEQHKAIHYIAASE-DLQGFLLLQAGDVAFSVPDSLV 136
Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ------- 178
+T + E+L + + LA YL+ E K S W YI L RQ
Sbjct: 137 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLA 196
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEE 232
S L W+ +EL YL S + +ER + YN+L +F +YP P E
Sbjct: 197 VESPLLWSESEL-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 255
Query: 233 VFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLD-YDKSSQGV 288
F E FK +F + S +V L S+ R ALVP +L + + L D S Q V
Sbjct: 256 AFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLV 315
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
V DR Y+ GE + + G + N +LLL+YGFV + N D + + +L D Y++K
Sbjct: 316 V---DRPYKAGESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDK 370
Query: 349 LEALRKYGLSASECFPIQI 367
++ G + F + +
Sbjct: 371 RMVAQRNGKLTVQKFHVSV 389
>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
Length = 596
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 139/292 (47%), Gaps = 24/292 (8%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-------LATYL 156
GL A K+I+ E L++P +++T +S VL D L LA +L
Sbjct: 107 GLKATKDIKAEELFLWIPRKMLMTVESA-----KNSVLGPLYSQDRILQAMGNVTLALHL 161
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
+ E + SS W YI LP + + LY+ E+ R+L A+Q + + + N Y
Sbjct: 162 LCERA-NPSSPWLPYIKTLPSEYDTPLYFEEEEV-RHLLATQAIQDVLSQYKNTARQYAY 219
Query: 217 LRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLN 270
++ +P+ L ++ F + ++W+ + +R ++P+ DG +AL+P DM N
Sbjct: 220 F-YKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
H+ + T Y+ + Y+ GEQ++I YG +SN E ++ GF + N D
Sbjct: 279 HTNGLIT-TGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFED--NAHD 335
Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
V++ L + K ++ Y K E L + G+ AS F + + P+ A+L V
Sbjct: 336 RVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCSEPPISAQLLAFLRV 387
>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
Length = 497
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 146/319 (45%), Gaps = 28/319 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
L KW S++G + + V+ E G L A ++I+ E L+VP L+ T +S
Sbjct: 81 LXKWASENGASVE--GFEXVNFKEEGFGLRATRDIKAEELFLWVPRKLLXTVESA----- 133
Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
VL D L LA +L+ E + +S W YI LP + + LY+ E
Sbjct: 134 KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDE 192
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
+ RYL+++Q + N Y ++ +P L ++ F E ++W+ +
Sbjct: 193 V-RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSV 250
Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+R ++P+ DG +AL+P D NH+ + T Y+ + ++ GEQ++
Sbjct: 251 XTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIY 309
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
I YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F
Sbjct: 310 IFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVF 367
Query: 364 PIQITGWPLELMAYAYLVV 382
+ T P+ A+L V
Sbjct: 368 ALHFTEPPISAQLLAFLRV 386
>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
norvegicus]
gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
Length = 596
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q G+Q++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
Length = 255
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 120/252 (47%), Gaps = 25/252 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISE 159
RG++ + + G+ ++ +P SL+IT + + P + L C + L +L+ E
Sbjct: 6 RGVMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLP-CRLSPTETLVIFLLCE 64
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-----LEASQIRERAIERITNVIGTY 214
+ SS W Y+ LP +L+WT E+D A +R +A E + +
Sbjct: 65 RNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLRLKAEESFNRLCNGF 124
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL---------PSMDGRVALVPW 265
L +R ++ F + FKW++ + +R V + P + + AL P+
Sbjct: 125 LPLLVRQMPQF-----NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKSALAPF 179
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D+LNH+ +VE +D SS+ TT +P +QVFI+YG SN +LLL YGF
Sbjct: 180 LDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLEYGFT--LP 237
Query: 326 TNPSDSVELPLS 337
NP +++ L LS
Sbjct: 238 CNPHNNISLTLS 249
>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
3-like [Meleagris gallopavo]
Length = 593
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 145/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + I + GL A + I+ E L+VP L++T +S S
Sbjct: 82 LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESAKSS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPNEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL ++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
P+ A+L V
Sbjct: 371 HSIEPPISAQLLAFLRV 387
>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
Length = 486
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 145/291 (49%), Gaps = 22/291 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +WL +G +K+A+Q+ RG+ + K + GE++L +P +IT +
Sbjct: 45 LIQWLEGNGADTKKLALQEYAPEVRGVHSRKVLAPGERILVIPKKCLITVEMGKQTDIGR 104
Query: 139 EVLKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYL 194
++L + P L +L+++ ++S + NY S LP ++ ++W+ EL +L
Sbjct: 105 KLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLPSTLSNMPIFWSDEELG-WL 163
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
+ S I ++ ER + Y D+ R+ + F+++ F W+ I+ SR L
Sbjct: 164 KGSYIIQQIQERKAAIRKDY-DVICRVDPAFAR------FSLDRFSWARMIVCSRNFGL- 215
Query: 255 SMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
++DG ALVP+ADMLNH ET +D+S T+ G QV+ SYGKK N
Sbjct: 216 TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNH 275
Query: 313 ELLLSYGF-----VPREGTNPSDSVELPLSLKKSD-KCYKEKLEALRKYGL 357
LL+YGF +G NP++ V + L ++D + + +K L + G+
Sbjct: 276 RFLLNYGFAVEDNTEEDGRNPNE-VLIDFQLSQADGQLFYDKRAYLHESGI 325
>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
Length = 377
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/250 (30%), Positives = 121/250 (48%), Gaps = 14/250 (5%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL A + +R GE++L + L+I P+ ++ +V W LA ++ E
Sbjct: 68 RGLFASRPVRAGERVLEISLDLMIAPSD---LPDELSMVLPSTVKPWTKLALIVLMERYK 124
Query: 163 EKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
+SS W+ YIS LP QP L W EL YL+AS + + ER+ + + ++
Sbjct: 125 GQSSVWAPYISCLP-QPAELDNTFLWEDTELS-YLKASPLYGKTRERLEMITTEFGQVQ- 181
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFL 279
+ +P LF + ++E FK + +FSR + + D + ++P D NH+ L
Sbjct: 182 NALNVWPQLFGK--VSLEDFKHVYATVFSRSLAI-GEDSTLVMIPMLDFFNHNATSFAKL 238
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
++ V T DR Y +Q++I+YG SN EL L YGF E NP D +L
Sbjct: 239 SFNGLLNYAVVTADRAYTENDQIWINYGDLSNAELALDYGFTVPE--NPYDETDLLTQFP 296
Query: 340 KSDKCYKEKL 349
+ + K++L
Sbjct: 297 EMNTILKDQL 306
>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 524
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 117/447 (26%), Positives = 188/447 (42%), Gaps = 63/447 (14%)
Query: 26 AGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSD 85
+GF F KR G R V +N ++ TK + N+ +++ L N WL
Sbjct: 40 SGFRAFTHKRFGCRWV------QSNGSTHTKES---NVSISNTKVERLRN------WLKK 84
Query: 86 SGLPPQKMAIQKVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
+ +++ G G + G ++ VP ++T ++ C + G +
Sbjct: 85 LNHDDCNLKLERCPQGGSGSGYGAFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPL 144
Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEA 196
+K+ + W + +L+ E + ++S W YI+ LP++ + + W++ +LE
Sbjct: 145 VKKSDLTPWQAMCLHLLYERARGETSFWYPYIAVLPKELELIGIHPMLWSQKMRREWLEG 204
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-----EEVFNMET-FKWSFGILFSRL 250
S + + R+ Y + + + L P E + ET +W+ +L SR
Sbjct: 205 SPMLDVTERRLAICREDYEAM---LLAGAGRLTPRGNEGEPISITETAVQWAATMLLSRS 261
Query: 251 VRL--------PS--MDGRVALVPWADMLNHSCEV--ETFLDYDKSSQGVVFTTDRQYQP 298
L P + +ALVPWADMLNHS E+ L YD+ S R Y
Sbjct: 262 FSLNLQTQKLRPGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSE 321
Query: 299 GEQVFISYGKK-SNGELLLSYGFVPREGTNPSDSVELPLSLKK--SDKCYKEKLEALRKY 355
GEQVF SYG S LLL YGFV E TN SV+LP S+ + K + LEA+
Sbjct: 322 GEQVFDSYGPSCSPSRLLLDYGFVDEENTN--HSVDLPASVLGPVNSKANELLLEAM--- 376
Query: 356 GLSA-SECFPIQITGWPLELMAYAYLVVSP------PSMKGKFEEMAAAASNKMTSKKDI 408
GL F + G +MA+ + V+ K E AA + T
Sbjct: 377 GLPLDGAIFSLTSAGVDESVMAWTRVAVATRQELYDAGWKEGIRERAAGYPSAATVMFRF 436
Query: 409 KCP---EIDEQALQFILDSCESSISKY 432
P + + + L+ +L +CE + KY
Sbjct: 437 STPINRDNESEVLRRLLSTCEFLLQKY 463
>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
Length = 643
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 102/406 (25%), Positives = 184/406 (45%), Gaps = 59/406 (14%)
Query: 74 ENASTLQKWLSDSG--LPPQKMAIQKVDVG---ERGLVALKNIRKGEKLLFVPPSLVITA 128
E+ + Q+WLS+ L P +I VD+G R +VA NI+K E L+ +P +++T
Sbjct: 207 EDLKSFQQWLSNKNTYLNP---SIDIVDLGPPFGRSMVANTNIKKDEILVEIPKGIMMTP 263
Query: 129 DS------KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
S ++ E+ + S D +A I + + S W Y+S LP+Q +
Sbjct: 264 KSMIKNLPRFIIDWMDEM--KISRTDQQAIA---IIYSILHEDSYWYEYVSILPKQFTTT 318
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---------------- 226
+Y+TR E+ + L+AS + R+ V Y+ R+ Y
Sbjct: 319 VYFTREEMTQ-LQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDSTKTKTKSQLD 377
Query: 227 --DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK- 283
F ++ + ++ FKW+ G ++SR L DG +VP ADM N + + K
Sbjct: 378 AMKEFKDDRYTLDQFKWALGCVWSRAFSLSEEDG--GMVPLADMFNADTVISRSKVHPKI 435
Query: 284 --SSQGVVFTTDRQYQPGEQVFISYG---KKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
SS +V+T + + GEQ+F YG +G++L+ YGF+ +G++ ++ +
Sbjct: 436 SASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADSTIVTVAPI 495
Query: 339 KKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAA 398
S+ Y K ++ G+ + E F I EL +A + S+ K E A+A
Sbjct: 496 PPSEPLYDLKRHLMQSNGIESEE-FTITKNKLAKELFLFARI----KSINKK-ESDQASA 549
Query: 399 SNKMTSKKDIKCPEIDEQALQFI-------LDSCESSISKYSRFLQ 437
T + + P ++ AL+ + LD+ +++I + ++ L+
Sbjct: 550 HFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQDNQILK 595
>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
Length = 594
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
R L+++Q + N Y ++ +P L +E F E ++W+ + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + +Q G+Q++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAESVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387
>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
impatiens]
Length = 484
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 97/366 (26%), Positives = 173/366 (47%), Gaps = 30/366 (8%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----A 137
WL +G ++ + + GL A +N + E +L +P L+ + + + PE
Sbjct: 88 WLKQNGANVYGASVAEFPGYDLGLKAERNFLENELILRIPRELIFSIHN--AAPELVALQ 145
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
+ L Q +P L LI + ++ S+W Y+ LP ++LY T A+++ L+ S
Sbjct: 146 NDPLLQL-MPQVALAIALLIEK--HKEYSKWKPYLDILPTTYTTVLYMTAADMNE-LKGS 201
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLP 254
E A+++ N+ Y ++F K + +VF E + W+ + +R +P
Sbjct: 202 PTLEAALKQCRNIARQYAYFN-KLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIP 260
Query: 255 SMDGRV---ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
S DG + AL+P DM NH + ++ T D++ + R ++ EQ+FISYG ++
Sbjct: 261 SKDGSLMIHALIPMWDMCNHENSKITT--DFNATLNCCECYALRDFKKAEQIFISYGART 318
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
N + + GFV + N D +L L + K+D KE++E L K L A F ++
Sbjct: 319 NSDFFVHSGFVYMD--NEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPGTE 376
Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAA-AASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
P+ A+L V SM+ EE+A S+++ K + C ++E +F+L +
Sbjct: 377 PISDTLLAFLRVF--SMRK--EELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQ 432
Query: 427 SSISKY 432
I+ Y
Sbjct: 433 LLIANY 438
>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
Length = 2165
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 81/269 (30%), Positives = 129/269 (47%), Gaps = 22/269 (8%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
TL KWL G +K+ I+ RG+ A ++I+KGE +L+VP +IT + + P
Sbjct: 149 TLLKWLEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVG 208
Query: 138 GEV----LKQCSV-PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
++ L+Q + P L+TY++ E + S+W YI LP+ + + E
Sbjct: 209 KKMYEKGLRQRLISPKHSFLSTYIMQEKR-KPESQWQIYIDILPKNFSNFPIFFTEEERI 267
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN---METFKWSFGILFSR 249
+L+ S ++ +E+I ++ Y DL + +Y FP ++ M FGI
Sbjct: 268 WLKGSPFLDQILEKIEDIKADY-DLICKEVPEYVQ-FPIREYSEIRMMVSSRIFGIQIEG 325
Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+ DG VA +ADMLNH +T Y QG + Q GEQV+ SYGKK
Sbjct: 326 V----KTDGFVA---YADMLNHKRPRQTSWTYTDEKQGFIIEAMEDIQRGEQVYDSYGKK 378
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSL 338
N L+YGF+ +D+ E+P+ +
Sbjct: 379 CNSRFFLNYGFINLN----NDANEVPIKV 403
>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
harrisii]
Length = 602
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 143/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + GL A + I+ E L+VP L++T +S
Sbjct: 89 LIKWAAENGASTDGFELVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 143
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + SS W YI LP + + LY+ E+
Sbjct: 144 SVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSEYDTPLYFEEDEV- 201
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
++L+++Q + N Y ++ +P+ L ++ F E ++W+ + +
Sbjct: 202 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTYEDYRWAVSSVMT 260
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + + GEQ++I
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFNVGEQIYIF 319
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394
>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
Length = 437
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 15/206 (7%)
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR----LRI 221
S+W+ +I LPR ++ LY+ EL R LE S + A + V Y L+ L +
Sbjct: 112 SKWAKHIELLPRTYHNALYFGPEEL-RALEGSNVYFIAQQMEEKVAHDYARLKESVLLEL 170
Query: 222 FSKYP-----DLFPEEVFNMETFKWSFGILFSRLVRLP-SMDGRVALVPWADMLNHSCEV 275
F P DLF +E F++E +KW+ ++SR +P + A+VP DMLNH E
Sbjct: 171 FENVPEGINVDLF-DEFFSLENYKWALSTIWSRFGDVPVAKQSFKAMVPVFDMLNHDPEA 229
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
E +D S+Q + + + G Q+FI+YG SN +LL YGFV NP D+VE+
Sbjct: 230 EMSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLALYGFVII--GNPFDAVEMW 287
Query: 336 LSLKK-SDKCYKEKLEALRKYGLSAS 360
L + + S K ++EK + L GL +
Sbjct: 288 LPMDEASTKFFQEKEQLLLTNGLDHA 313
>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
gallus]
gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
Full=SET domain-containing protein 3
gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
Length = 593
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + I + GL A + I+ E L+VP L++T +S
Sbjct: 82 LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL ++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
P+ A+L V
Sbjct: 371 HSIEPPISAQLLAFLRV 387
>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
Length = 1234
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/269 (27%), Positives = 128/269 (47%), Gaps = 10/269 (3%)
Query: 79 LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+KWL+ G+ K+ I D RG+V K + + E ++ VP +I P
Sbjct: 740 FEKWLASDGVQCPKLQIANFQDSTGRGIVTTKKVEENEVIIKVPRKFLINVQVAREHPIL 799
Query: 138 GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
G + ++ S + D +L ++I E +S W + LP + +++T EL LE
Sbjct: 800 GRIFEEFSGLNDDTILFLFVIYEKE-NPNSFWRPFFDTLPSYFPTSIHYTSTELLE-LEG 857
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ + ++ ++ + L + +YP +FPE +F+ E F W+ + SR ++L +
Sbjct: 858 TNLFAETLQVKEHLQSIRDMLFPELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KI 916
Query: 257 DGRVA--LVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
D ++ LVP ADM+NH + + D++ Q + P Q+F+ YG N E
Sbjct: 917 DDKITNCLVPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRE 976
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSD 342
L L YGFV ++ NP DS+ + L D
Sbjct: 977 LALYYGFVIQD--NPYDSMLIGFDLPDED 1003
>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 561
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 131/276 (47%), Gaps = 23/276 (8%)
Query: 69 EIDSLEN---ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
++ LEN + L WL + G +K+ +Q+ RG+ + GE++LF+P + +
Sbjct: 107 DVADLENDVVGAELIDWLQNQGAETKKLMLQQYAPEVRGVHCRNELVPGERILFIPKNCL 166
Query: 126 ITADSKWSCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
IT + +VL + P L YL+++ + + + Y S LP ++
Sbjct: 167 ITVEMGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYYSTLPSTLKNM 226
Query: 183 -LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
++W+ EL +L+ S I + ER + Y+ I P F++E F W
Sbjct: 227 PIFWSDQEL-SWLKGSYILHQIQERKAAIRKDYD----AICRADPSF---SRFSLERFSW 278
Query: 242 SFGILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
+ I+ SR L ++DG ALVP+ADMLNH ET +D+ G T+ G
Sbjct: 279 ARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLESICSG 337
Query: 300 EQVFISYGKKSNGELLLSYGF-----VPREGTNPSD 330
QV+ SYGKK N LL+YGF +G+NP++
Sbjct: 338 AQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNE 373
>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
Length = 440
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 144/310 (46%), Gaps = 29/310 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + + RGL+A+K+++ GE ++ +P ++T D+ G
Sbjct: 36 LRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTVIK-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + + + P PLLA T+LISE + S W Y+ LP+ Y+ L ++ R L
Sbjct: 95 DYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKD-YTCLVCLEPQVVRLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ T V + R FS LF E+V F+ F W++ + +R V +
Sbjct: 153 KPLKIKAQEQKTQVQELFVSSR-GFFSSLQSLFTEDVKHIFHYHAFLWAWCTINTRTVYM 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ ++ T + E++FI
Sbjct: 212 KHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIRTTSSCKKYEELFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSAS 360
YG N LLL YGFV NP +V + + L D +KL L+++G S +
Sbjct: 272 YGPHDNHRLLLEYGFVAI--NNPHSAVYVSIDSLVDHLPSVDTQMNKKLSLLKEHGFSEN 329
Query: 361 ECFPIQITGW 370
F GW
Sbjct: 330 LTF-----GW 334
>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
Length = 355
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 129/314 (41%), Gaps = 32/314 (10%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV---- 125
I +KW+ L K+ ++ + G A +I GE + +P ++
Sbjct: 40 ISDARKIDAFEKWIQSQKLAVNKLEVKSIPGFRMGTTAKDDIADGELYIAIPDHMLMGPE 99
Query: 126 --------------ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
I S E +L + + +L +L+ + +K S W Y
Sbjct: 100 RVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKN----KVLMYFLLQMYNPKKESFWKPY 155
Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
+P S ++W+ EL L S++ A + Y++LR RIF F +
Sbjct: 156 FDIMPTNLTSPIFWSEDELQE-LAGSEVSNMARIEKKRLRAMYDELRERIFKHDRKTFLK 214
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLN-HSCEVETFLDYDKSSQG 287
+ F ++ + W+ G+ SR+++L G +P DM+N + +TF+ YDK +
Sbjct: 215 QAFTLKNWFWANGLYDSRVIQLNRQTGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRA 274
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY-- 345
V DR G QVF SYG KSN E LL GFV + N + P S + K Y
Sbjct: 275 AVMYADRAVSRGVQVFESYGNKSNYEYLLYNGFVMEDNPNDCVYISFPSSNARDAKSYLI 334
Query: 346 ---KEKLEALRKYG 356
+EK R++
Sbjct: 335 KHIEEKRRGYRRFA 348
>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
Length = 417
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 156/319 (48%), Gaps = 25/319 (7%)
Query: 73 LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+++ T ++W+ D G L P ++ D G R ++A I++G+ L+ VP +++++
Sbjct: 8 IDDLVTFKQWMDDEGIYLNPSLDIVKLEDYG-RSIIANTLIKEGDVLIRVPRNVMMSRTG 66
Query: 131 -KWSCPEAGEVLKQCSVPDWPLL---ATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
+ P+ + + D A YL+ + K S W Y S LP+Q + +Y+
Sbjct: 67 IELHIPKEIRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQFTTSIYFD 125
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
+ E+ + L+ S++R R++ + YN + + S D F ++ + E FKW+ +
Sbjct: 126 QDEM-KELQLSKLRYFTESRLSGIERHYN-VIFKKLSSLNDEFKKKEYTFELFKWALSCI 183
Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
+SR L S DG +VP ADM N + ++ + D + +++ + + GEQVF Y
Sbjct: 184 WSRAFSLSSDDG--GMVPLADMFNAIEKAKSKVRPDSRADQLIYYASKDIERGEQVFTPY 241
Query: 307 G---KKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKE-KLEALRKYGLSA 359
G N ++L+ YGF P EG D+++L L D+ Y + K++ L + L
Sbjct: 242 GVYKTIGNAQMLMDYGFAFDDPSEG----DTIQLTLDNFSDDELYIDTKIDLLEQ--LDI 295
Query: 360 SECFPIQITGWPLELMAYA 378
F ++ P EL+ YA
Sbjct: 296 VREFNLKRNQLPQELLIYA 314
>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 478
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 143/291 (49%), Gaps = 22/291 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +WL +G +K+ +Q+ RG+ + K + GE++L +P +IT +
Sbjct: 37 LIQWLETNGADSKKLTLQEYAPEVRGVHSRKVLVPGERILVIPKKCLITVEMGKQTDIGR 96
Query: 139 EVLKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYL 194
++L + P L +L+++ ++S + NY S LP ++ ++W+ EL +L
Sbjct: 97 KLLARNVDFVAPKHIFLMMFLLTDMEHVETSFFRNYYSTLPSTLSNMPIFWSEEELS-WL 155
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
+ S I ++ ER + Y D+ R+ + F+++ F W+ I+ SR L
Sbjct: 156 KGSYIIQQIQERKAAIRKDY-DVICRVDPSFAR------FSLDRFSWARMIVCSRNFGL- 207
Query: 255 SMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
++DG ALVP+ADMLNH ET +D+S T+ G QV+ SYGKK N
Sbjct: 208 TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNH 267
Query: 313 ELLLSYGF-----VPREGTNPSDSVELPLSLKKSD-KCYKEKLEALRKYGL 357
LL+YGF +G NP++ V + L +D + + +K L + G+
Sbjct: 268 RFLLNYGFAVEDNTEEDGRNPNE-VLIDFQLSPADGQLFYDKRAYLHESGI 317
>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Monodelphis domestica]
Length = 595
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 142/317 (44%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW + +G + GL A + I+ E L+VP L++T +S
Sbjct: 82 LIKWAAANGASTDGFELVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + SS W YI LP + + LY+ E+
Sbjct: 137 SVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
++L+++Q + N Y ++ +P+ L ++ F E ++W+ + +
Sbjct: 195 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + + GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFNVGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 538
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 140/323 (43%), Gaps = 39/323 (12%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDV--GERG--LVALKNIRKGEKLLFVPPSLVITADSKWS 133
L WL G ++ VD G RG LVA +++ G+ + VP +L +T ++ ++
Sbjct: 81 ALWTWLEREGADVASVSPALVDATPGGRGWGLVATRDVGGGDAAIVVPRALWMTKETAFA 140
Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELD 191
+ G L + P W LA L+ E S SRW+ YI LPR + L+W+ EL
Sbjct: 141 S-KIGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVEALDAPLFWSSEELA 199
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-------PEEVFNMETFKWSFG 244
L +Q+ A + V GT+ L+ F ++P LF F+ F W+FG
Sbjct: 200 E-LAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFG 258
Query: 245 ILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVF------------ 290
+L SR LP +D +AL+P DM NH + VF
Sbjct: 259 VLRSRA--LPPVDQGESIALIPGIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSV 316
Query: 291 ------TTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
T + GE++ +YG + + + L YGFV + P V PLS+ + D
Sbjct: 317 LLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFCSRPG-YVLGPLSIPEDDV 375
Query: 344 CYKEKLEALRKYGLSASECFPIQ 366
+K++ L GL S F I+
Sbjct: 376 NAFDKMDVLSVAGLKESPAFTIR 398
>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 140/308 (45%), Gaps = 29/308 (9%)
Query: 83 LSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ ++GLPP K+ +++ + A ++++ G+ VP SLV+T +
Sbjct: 1 MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 189
E+L + + LA YL+ E K S W YI L RQ S L W+ +E
Sbjct: 61 IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSF 243
L YL S + +ER + YN+L +F +YP P E F E FK +F
Sbjct: 121 L-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAF 179
Query: 244 GILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYQPG 299
+ S +V L S+ R ALVP +L + + L D S Q VV DR Y+ G
Sbjct: 180 VAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVV---DRPYKAG 236
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
E + + G + N +LLL+YGFV + N D + + +L D Y++K ++ G
Sbjct: 237 ESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLT 294
Query: 360 SECFPIQI 367
+ F + +
Sbjct: 295 VQKFHVSV 302
>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
Length = 536
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 125/273 (45%), Gaps = 21/273 (7%)
Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
++ G+ VP SLV+T + E+L + + LA YL+ E + S W
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202
Query: 171 YISALPRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL---- 219
YI L RQ S L WT +EL+ YL+ S I++ + R + YN+L
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFM 261
Query: 220 --RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCE 274
+F +YP P E F E FK +F + S +V L S+ R ALVP +L +
Sbjct: 262 AGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSN 321
Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
+ L S V DR Y+ GE + + G + N LLL+YGF+ + NP D + +
Sbjct: 322 CKAMLTAVGDS--VRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVI 377
Query: 335 PLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
SL D ++EK ++ G A + F + +
Sbjct: 378 EASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCV 410
>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
porcellus]
Length = 789
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SILGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+ L+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 QCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T P+ A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387
>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
guttata]
Length = 593
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + I + GL A + I+ E L+VP L++T +S
Sbjct: 82 LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + S W YI LP + + LY+ E+
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPHSFWLPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
++L+++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R ++P+ DG +AL+P DM NH+ + T Y+ + ++ GEQ++I
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+ S F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370
Query: 366 QITGWPLELMAYAYLVV 382
T + A+L V
Sbjct: 371 HSTEPAISAQLLAFLRV 387
>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
Length = 353
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 133/284 (46%), Gaps = 19/284 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +W+S +G + RGL+A K + G++++ +P L+IT + S G
Sbjct: 34 LVRWMSRNGFKGALLKPANFKETGRGLMATKPFQIGDQVISIPEMLLITTQNVLS-SYLG 92
Query: 139 EVLKQCSVPD---WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ +KQ + P ++ TYLI E S +K S W NYI LP+ + +Y+T E++
Sbjct: 93 DFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPKSYSNPVYFTNEEIN--WL 150
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETFKWSFGILFSRLVRL 253
+I+ + + + Y +L+ +FS F + +F F+W++ + +R V +
Sbjct: 151 PRRIKRKVFDECEKINTAYRELK-NLFSILESTFVSFKGIFEYSAFRWAWCTVNTRSVYM 209
Query: 254 -----PSMD---GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
P + AL P+ D+LNH+ VE Y+ S+ T + +Q+FI
Sbjct: 210 LQEQNPHLSIERDHYALAPFLDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMFIY 269
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
YG N +L + YGFV + N + VEL C + KL
Sbjct: 270 YGPHDNVKLFIEYGFVLPQ--NQHNVVELDFEDIYCKTCEERKL 311
>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
Length = 389
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 127/259 (49%), Gaps = 26/259 (10%)
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP-DWPLLATYLISEA 160
+RGL A ++IR GE+++ +P LV+TA+ C V K S DW L +++E
Sbjct: 9 KRGLFAARSIRAGEQIVRIPHDLVLTAEKLDDC-----VKKLLSTEYDWCPLTLLILAEQ 63
Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
++SRW+ Y+S LP +S ++W + EL ++LE ++ ER + Y ++
Sbjct: 64 HKGEASRWAPYVSCLPSFGDHHSTIFWEKEEL-KFLECTRAFRGTAERREMISDEYISVK 122
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF 278
+ S P +F E++ ++ F ++ + SR ++ +++ P+ D NH
Sbjct: 123 -NVISSCPHVFGEDI-SLFQFAHAYATVVSRAWN-GALSSEISMRPFVDFCNHDPVSHAT 179
Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ +D V VFISYGK+SN L + YGFV N SD EL + +
Sbjct: 180 VSHDSCKDATV------------VFISYGKRSNAVLAVDYGFVL--PNNLSDQAELWMEI 225
Query: 339 KKSDKCYKEKLEALRKYGL 357
+D ++KLE + + +
Sbjct: 226 PWNDPLREKKLELMGAFNM 244
>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 568
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 149/327 (45%), Gaps = 28/327 (8%)
Query: 65 PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-----GLVALKNIRKGEKLLF 119
P G D LE L+ WL +GL + ++ ++ G+VA K+ +KGE L
Sbjct: 60 PSGPTRDDLEQ---LRVWLLKNGLDSK--WLEGIEFAANLPEGSGVVAKKDFKKGEPFLQ 114
Query: 120 VPPSLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
VP L+ T + + P G++LK P LA +L+ E SS W+ YI L
Sbjct: 115 VPRKLMFTCQAMQNTP-LGQLLKVDKFLAQSPSL-CLALHLLVE-KHNHSSFWTPYIKTL 171
Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
P+ + LY+T EL+ L S AI+ I V Y + +F D+ F
Sbjct: 172 PKSYGTCLYFTLEELEG-LRGSPTFTSAIKVIATVAIQYTYIH-DLFQIRKDILHINAFT 229
Query: 236 METFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
+ F W+ + SR ++P AL+P DM NH +++TF +D +S
Sbjct: 230 WDEFIWAMSAVGSRQNQVPQWGHNALSEYALIPAWDMCNHDHGDLQTF--WDVNSDSTES 287
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
R Y+ GEQV+I YG + N +LLL GFV N D++ + + L + K+KL
Sbjct: 288 HAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYE--NNRFDALAIRVRLAPDAEHIKDKLR 345
Query: 351 ALRKYGLSASECFPIQITGWPLELMAY 377
L + + + G ++LMA+
Sbjct: 346 LLHLNNMKMDSQYYLYGLGLAVDLMAF 372
>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 1188
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 146/313 (46%), Gaps = 37/313 (11%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ--------CSVPDWPLL 152
G+RG A ++ G++L +P + +I+ P +L +P L+
Sbjct: 157 GDRGFFATCDLAPGDELASMPIATIISEQLASRSPVGMAMLSSPMLKRRGVTPIPGRTLI 216
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
YLI+ + S + +YI+ LP+ L+W AELD +L+ + I ER V
Sbjct: 217 CAYLIANRG-KLDSPFYHYINILPQTYSDPLWWNDAELD-HLDGTNIGGYIQERRNQVRN 274
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL-----------------PS 255
+ ++ + + P LFP++VF E + W+F SR L P
Sbjct: 275 QFLNVFPVLSREQPALFPKDVFTYEAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPM 334
Query: 256 MDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+ V L+P DM+NH D++S V F T + + GEQV+ +YG KSN EL
Sbjct: 335 KEPCVECLLPLLDMMNHQFGASITWFTDETS--VRFFTGAKVRKGEQVYNNYGPKSNEEL 392
Query: 315 LLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
L+ YGF +P N +D V++ L++ +D + KL LR +GLS + + P+E
Sbjct: 393 LMGYGFCLP---NNEADHVKIQLTV-GNDPDGEAKLAILRWHGLSLTHF--LHNRSVPVE 446
Query: 374 LMAYAYLVVSPPS 386
L + ++V P+
Sbjct: 447 LFSALRVLVMTPA 459
>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 590
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 130/285 (45%), Gaps = 16/285 (5%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L N +T+ WL D + +Q RG+ A + + E +LF+P S +IT +
Sbjct: 148 LVNHNTMINWLLDGKSEFDNLKLQWYSKNYRGVHARRKVYNKETILFIPKSHLITLEMAK 207
Query: 133 SCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRA 188
A +++ P L+T+L+ E K S+W Y+ LP ++++
Sbjct: 208 ETDVAKKIIAAKLNLLSPKHSFLSTFLLQERK-NKESKWKPYLDILPSDYNQFPIFFSED 266
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
+L +L+ S + + E+ ++ Y+D I S P+ F E F E F W+ S
Sbjct: 267 DLS-WLKGSPFQNQVREKKADIKRDYDD----ICSVAPE-FAEYTF--EDFCWARMTASS 318
Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
R+ L + + A VP ADMLNH +T YD +G V GEQV+ SYG
Sbjct: 319 RVFGLQINEQKTDAFVPLADMLNHRRPKQTSWQYDDQREGFVIQALEDIPRGEQVYDSYG 378
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
+K N L+YGF+ + N ++ V L L+ D + K E +
Sbjct: 379 RKCNSRFFLNYGFINLD--NDANEVALRLTFDAEDPTIERKKEMM 421
>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
carolinensis]
Length = 440
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 88/295 (29%), Positives = 142/295 (48%), Gaps = 20/295 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + G K+ + RGLV K ++ GE ++ +P ++T D+ +
Sbjct: 36 LKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTVLN-SYLR 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
E + + + P PL+A T+LI+E ++ S W Y+ LP + YS ++ L
Sbjct: 95 EYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLP-EIYSCPVCLEQKIVN-LFP 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
+R +A E+ V + + FS P LFP++V FN + FKW++ + +R V +
Sbjct: 153 EPLRRKAHEQRKLVQELFISSQQFFFSLQP-LFPKDVASVFNYQAFKWAWCTINTRTVYM 211
Query: 254 P-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
S D AL P+ D+LNH+ V+ +++ ++ TT Q +VFI
Sbjct: 212 KHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYEITTVTQCHHYNEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK---SDKCYKEKLEALRKYGL 357
YG N LLL YGFV R+ + S V LK DK +KL L+++ L
Sbjct: 272 YGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDKQRPKKLSILQEHKL 326
>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
latipes]
Length = 606
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 149/315 (47%), Gaps = 16/315 (5%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ L W ++G I GL ++I+ E L+VP +++T +S +
Sbjct: 80 ADLMSWAQENGASCDGFTITNFGTEGYGLRTTRDIKAEELFLWVPRKMLMTVESAQNSV- 138
Query: 137 AGEVLKQCSVPDW---PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
G + Q + LA +L+ E + +S WS YI +LP++ + LY+ + ++ +
Sbjct: 139 LGPIYSQDRILQAMGNVTLALHLLCERG-DPASFWSPYIRSLPQEYDTPLYYQQEDV-QL 196
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRL 250
L +Q + + + N Y ++ +P L ++ F+ + ++W+ + +R
Sbjct: 197 LLGTQAVQDVLNQYKNTARQYAYF-YKLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQ 255
Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
++P++DG +AL+P DM NH+ + T Y+ + Y+ EQ++I YG
Sbjct: 256 NQIPTVDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYG 314
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
+SN E ++ GF ++ N D V++ L + KS++ Y K E L + G+ AS F +
Sbjct: 315 TRSNAEFVIHNGFFFQD--NAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC 372
Query: 368 TGWPLELMAYAYLVV 382
P+ A+L V
Sbjct: 373 NDPPISAQLLAFLRV 387
>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
Length = 444
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/310 (24%), Positives = 145/310 (46%), Gaps = 16/310 (5%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
W + G + A+ GL A ++I+ E L++P +++T +S G +
Sbjct: 3 WAQEHGASCEGFAVTNFGAEGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSV-LGPLY 61
Query: 142 KQCSV---PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
Q + D LA +L+ E + + +S W YI LP++ + L++ + ++ + L +Q
Sbjct: 62 TQDRILQAMDNVTLALHLLCERA-DPASFWLPYIRTLPQEYDTPLFYQQQDV-QLLHGTQ 119
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+ + + N Y ++ +P L ++ F + ++W+ + +R ++P+
Sbjct: 120 AIQDVLSQYRNTARQYAYF-YKLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 178
Query: 256 MDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
DGR +AL+P DM NH + T Y+ + Y+ EQ++I YG +SN
Sbjct: 179 EDGRQVTLALIPLWDMCNHRNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNA 237
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
E ++ GF +E N D V++ L + KS++ Y K E L + G+ S F + P+
Sbjct: 238 EFVIHNGFFYQE--NAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCNEPPI 295
Query: 373 ELMAYAYLVV 382
A+L V
Sbjct: 296 SAQLLAFLRV 305
>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
Length = 577
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 153/321 (47%), Gaps = 35/321 (10%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT- 127
ID + WL + G+ + IQ+V E G++AL++I L+ +P ++T
Sbjct: 74 IDRTTAIPKFKSWLKEHGVEYSAIDIQEVSEEEGFGVIALQDIEIKCPLVTIPRKAMMTY 133
Query: 128 ADSKWS----CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
D+K S E EVL SV LA YL E F +S++ YI +P++ ++L
Sbjct: 134 EDAKSSYLAGLIEGNEVL---SVMPNVCLALYLHCE-RFTLNSKYQPYIDMIPQEFNTIL 189
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPDLFPEEVF 234
Y+ E+ +YL+ + AI + +++ + L ++F+ K P L F
Sbjct: 190 YFKPHEM-KYLKGTAALSVAINQFKSIVRQFA-LLYQVFNGSHQKEDVEKLP-LQARNAF 246
Query: 235 NMETFKWSFGILFSRLVRLPSMDGRV----------ALVPWADMLNHSCEVETFLDYDKS 284
+T++W + +R ++P+ G V AL+P DM NH+ + Y+
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLS-TAYNAL 305
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
++G+ + ++ GEQV I YG ++N +LL+ GFV +E +P D V + L + + D
Sbjct: 306 TRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKE--SPFDKVRIHLGVSQKDPL 363
Query: 345 YKEKLEALRKYGLSASECFPI 365
Y K + L K + S F +
Sbjct: 364 YSLKAKLLEKLNVEVSGQFAV 384
>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 451
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/254 (30%), Positives = 129/254 (50%), Gaps = 16/254 (6%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
L A K+I+ G+ +L VP SL +T D+ PE + + V + LAT L+ + +
Sbjct: 64 LFASKSIQTGDCILQVPYSLQLTPDNL--PPEIKPFISE-DVGNIAKLATVLLIHKNLGQ 120
Query: 165 SSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
S W YIS LP Q ++ ++W +EL+ + S + + I + + + + +++ +F
Sbjct: 121 DSEWHPYISCLPPQAEMHNTIFWNESELE-MIRQSSVYQETIYQKSQIEKDFLEIK-PVF 178
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
+ F + F + F + ++ SR S G ++L+P+AD LNH E + D
Sbjct: 179 QPFCQSFGD--FTWKDFMHACTLVGSR--AWGSTKG-LSLIPFADFLNHDGISEAIVMSD 233
Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
++ +DR Y PGEQV I YGK SN L+L +GF +P N D V++ + K
Sbjct: 234 DDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPY---NIYDQVQIQYDIPKY 290
Query: 342 DKCYKEKLEALRKY 355
D KLE L++Y
Sbjct: 291 DPLRHTKLELLQQY 304
>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
Length = 549
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 151/339 (44%), Gaps = 27/339 (7%)
Query: 17 SHLHKAQSPAGFTDFPRKRCGH---------RIVVHCSVSTTNDASRTKTTVTQNMIPWG 67
+ L S TD C H R + +S D + K V +N
Sbjct: 88 NELEALNSIVLLTDISLSTCTHLHTNILQGLRQTILDLISDFGDKNSVKGVVEKN----- 142
Query: 68 CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT 127
S + L +W +GL Q + I ++ RG +A K+++ G+ L +P S++I+
Sbjct: 143 ---HSCDQEERLLEWGESNGLMTQ-LKIAYIEGASRGAIARKDLKVGDIALEIPVSIIIS 198
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+ G + + + +L + + E + S++ Y LP + + L ++
Sbjct: 199 EELVHETDMYGVLKEIDGISSETILLLWSMKE-KYNCDSKFKIYFDTLPEKFNTGLSFSI 257
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
+ L+ + + E ++ ++ Y++L + + +PD+FP E++ E F W+ + +
Sbjct: 258 QAI-TMLDGTLLLEEIMQARQHLHAQYDELFPALCNNFPDIFPPELYTWEKFLWACELWY 316
Query: 248 SRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
S +++ DG R L+P A LNHS V + D ++ + F R + GE+
Sbjct: 317 SNSMKIMYSDGKLRTCLIPLAGFLNHSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECC 376
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
+SYG S+ L+ YGF+P +G N D + PL + SD
Sbjct: 377 LSYGNFSSSHLITFYGFLP-QGDNSYDVI--PLDIDGSD 412
>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
Length = 465
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 159/332 (47%), Gaps = 44/332 (13%)
Query: 69 EIDSLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
EI+SL+ ++WL ++ + P + I+ +D R +VA K+I+K +KL+ +P +++
Sbjct: 36 EIESLK---EFKEWLVNNNAYINPN-IDIELLDKYGRSIVAKKSIKKQDKLISIPKDIIM 91
Query: 127 TADSKWSCPEAGEVLKQC-SVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
+ + E+ +Q S+ P L I + + S W Y++ LP + LY
Sbjct: 92 SNIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMYSKLNEKSFWHPYVTVLPESFSTSLY 151
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFK 240
++ ELD L+ASQ++E I R + Y FS+ L PE ++N E F
Sbjct: 152 FSDNELDE-LQASQLKEFTIIRKDGIERHYE----STFSRLSKLVPEFSNLALYNQELFT 206
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---------VETFLDYDKSSQGVVFT 291
W+ ++SR L DG +VP ADM N +T LDY +
Sbjct: 207 WALSCVWSRAFSLAENDG--GMVPLADMFNAEDRSKSKVLPKVTDTTLDY--------YA 256
Query: 292 TDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK-CYKE 347
+D GEQ+F YG S+ ++L+ YGF+ EGT SD+V + + + +D+
Sbjct: 257 SD-DIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGT-VSDNVAITVPVFHNDEPNLST 314
Query: 348 KLEALRKYGLSASECFPIQITG-WPLELMAYA 378
K E L + + +E F +Q T P +L+ YA
Sbjct: 315 KQEILEENDI-INEVFLLQKTDPLPADLLLYA 345
>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplast precursor (ISS)
[Ostreococcus tauri]
Length = 588
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 126/283 (44%), Gaps = 25/283 (8%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD----WPLLATYLISE 159
G+ A +R+G + + +P + + A + G L+ W +A L+ E
Sbjct: 76 GVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALLLLKE 135
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
++ Y++ LP+ S L+W+ EL R + +Q+ + A V Y +L+
Sbjct: 136 RERGADGEFAAYVATLPKTLDSPLFWSADEL-RDIAGTQLLDNAAGYDAYVRAVYEELKN 194
Query: 220 RIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSC---- 273
+F +Y F + F+ +F+W+FGIL SR + +DG VALVP D++NHS
Sbjct: 195 GVFVEYASTFDVDGAFDEASFRWAFGILRSRT--MAPLDGANVALVPGLDLINHSSLSGA 252
Query: 274 ---------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPR 323
F S DR Y G ++F++Y + + + L YGF+
Sbjct: 253 RWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEIFVNYDPEGIDSKFALDYGFI-- 310
Query: 324 EGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
+ NPS L LS+ + D +KL+ L GL + F ++
Sbjct: 311 DVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTFTLR 353
>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
tropicalis]
Length = 442
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 133/268 (49%), Gaps = 21/268 (7%)
Query: 79 LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
LQ+WL + G + + A + D G RGL+A ++++ GE ++ +P S +IT ++
Sbjct: 36 LQRWLKERGFQGRHLRAAEFTDTG-RGLMATRDLQPGELIISLPDSCLITTETVLQS-YL 93
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
G+ ++ S P PLLA T+LI+E + S W Y+ LP +YW +E+ L
Sbjct: 94 GKYIRTWSPPVSPLLALCTFLIAERVARERSPWKPYLDVLPSSYSCPVYW-ESEIISLLP 152
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLVR 252
A +R++A+E+ T V + + F LF + ++ T+ +W++ + +R V
Sbjct: 153 AP-LRQKALEQQTEVKELHTE-SWSFFVSLQPLFGGNITDIYTYGALRWAWCTVNTRTVY 210
Query: 253 L--PSMDGR------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ P G A+ P+ D+LNHS V+ +++ + T+ + +Q FI
Sbjct: 211 MKHPRRHGLSAQQDVYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFI 270
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV 332
YG N LLL YGF+ NP SV
Sbjct: 271 CYGPHDNQRLLLEYGFI--AANNPHRSV 296
>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
domestica]
Length = 437
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 138/310 (44%), Gaps = 29/310 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + RGL+A+K+++ GE ++ +P ++T D+ G
Sbjct: 37 LRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTVIR-SYLG 95
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + + P PLLA +LISE S W Y+ LP+ Y+ L E+ R L
Sbjct: 96 DYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK-AYTCLVCLEPEVVRLL-P 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ V + R FS LF E+V F+ F W++ + +R V +
Sbjct: 154 RPLQMKAEEQRMQVQKLFISSR-GFFSSLQSLFTEDVKHVFHYHAFLWAWCTINTRTVYM 212
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V +++ + T + E++FI
Sbjct: 213 KHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFIC 272
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSAS 360
YG N LLL YGFV NP +V + + L DK +K+ L+++G S +
Sbjct: 273 YGPHDNHRLLLEYGFVA--SNNPHSAVYIAIDSLVDHLPSVDKQMNKKISLLKEHGFSEN 330
Query: 361 ECFPIQITGW 370
F GW
Sbjct: 331 LTF-----GW 335
>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloropl [Ectocarpus
siliculosus]
Length = 460
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/312 (29%), Positives = 146/312 (46%), Gaps = 18/312 (5%)
Query: 82 WLSDSGLPPQKMAI---QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
WL+ SG+ A+ + GERGLVA K I G+ +L +P SL +TA S A
Sbjct: 18 WLTKSGVRLTDNAVLAGRSPLAGERGLVAAKAIETGQSVLAIPQSLGLTATGLKSSGIAQ 77
Query: 139 EVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLE 195
V + + L+A ++ E + + S+ + +I+ LP++ L+W A+L +
Sbjct: 78 YVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKEGELEMPLFWGEADL-TLAD 136
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
AS R + + +V + L F+K+P +FP + F F+W+ G+ SR
Sbjct: 137 ASSTRGIS-GFVADVDEDFAWLSENAFAKHPKVFPADKFGPGDFRWAVGVALSRSF---F 192
Query: 256 MDGRVALVPWADMLNHSC-----EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+DG + L P D NHS E S+ VV + Y+ GE+ F+SYG K
Sbjct: 193 VDGELRLTPLVDFANHSSLRGVSEPTGGTTGLFGSKAVVLRAGKNYEEGEEFFVSYGPKG 252
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
L GFVP + + EL S+ + DK + +K + L + GL S F + G
Sbjct: 253 AAGYLEENGFVPPV-SGSEVTCELEFSIPEDDKFFDDKEDILERAGLRTSSTFDLTAVGL 311
Query: 371 P-LELMAYAYLV 381
P EL+ + L+
Sbjct: 312 PDAELVRFLRLL 323
>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
Length = 439
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 143/311 (45%), Gaps = 22/311 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA 137
L+KWL D G + + RGL+ K ++ GE ++ +P ++T + SC
Sbjct: 35 LKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTVLNSC--L 92
Query: 138 GEVLKQCSVPDWPLLAT--YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
GE + + P PL+A +LI+E + S W Y+ LP+ YS ++ + L
Sbjct: 93 GEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-TYSCPVCLEQDVVQLL- 150
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVR 252
+R++A E+ T V Y + FS LF E +FN +W++ + +R +
Sbjct: 151 PEPLRKQAQEQRTAVHELYMSSK-AFFSSLQSLFAENTATIFNYSALEWAWCTINTRTIY 209
Query: 253 LP-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ S++ V AL P+ D+LNHS V+ +++ S+ T+ Q + E+VFI
Sbjct: 210 MKHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFI 269
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASE 361
YG N LLL YGFV + + S V LK DK KL L+++ L +
Sbjct: 270 CYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDKQKNAKLSILKEHDLLENL 329
Query: 362 CFPIQITGWPL 372
F W L
Sbjct: 330 TFGWDGPSWRL 340
>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 475
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 129/291 (44%), Gaps = 26/291 (8%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA-----GEVLKQCSVPDWPLLATYLIS 158
LVA +++ GE L+ VP D+ W S P G++ + W LA L++
Sbjct: 71 LVASADVQPGESLIVVP-------DAAWVSVPNVAKTTVGKLASSAGLEPWLQLALVLVA 123
Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
E S + Y S+LP + L W+ E R L +Q+ +T T+ L+
Sbjct: 124 ERFGSAKSELAGYASSLPEDLGTPLLWSEEE-TRALAGTQVAGTLNSYLTFFRSTFAQLQ 182
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVET 277
+F+ P FP VF + F W+ + SR P ++G ++AL P D+++H T
Sbjct: 183 AGLFTANPAAFPPAVFTLPNFVWAVAAVRSR--SHPPLEGDKIALAPLVDLVSHRRAANT 240
Query: 278 FLDYDKS-----SQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDS 331
L S Q V R + GE + + Y K +G +LL YG + + +P
Sbjct: 241 KLSVRSSGLFGRGQVAVVEATRAIRKGEALGMDYAPGKLDGPVLLDYGVM--DTASPKPG 298
Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL-ELMAYAYLV 381
L L+L +SDK +K + + GL S + I P E+MA+ L+
Sbjct: 299 YSLTLTLDESDKFVDDKADIVEGAGLRPSMTYSITPDQQPGEEMMAFLRLM 349
>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
Length = 581
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 134/276 (48%), Gaps = 18/276 (6%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
LE TL +WL K+ I+ RG+ A + I E +LF+P S +IT +
Sbjct: 133 LERQKTLLEWLKHGKAQFPKIKIECYSESYRGVNAKQKINAKELILFIPKSHMITLEMAK 192
Query: 133 SCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
P A ++++ P L+T+L+ E S +S W Y+ LP+ P +++
Sbjct: 193 ETPVAKKMIQFRLDLLSPKHSFLSTFLLQEKS-RPNSFWKPYLDILPQSYPSFPIFFNNY 251
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILF 247
+L+ +L+ S ++ ++++++ YND+ ++ PE ++ F W+
Sbjct: 252 DLE-WLQGSPFLKQINDKLSDLKKDYNDI--------CNVAPEFSQYSFYEFCWARMTAS 302
Query: 248 SRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
SR+ + + A VP ADMLNH T Y + QG + TD + G+ +F SY
Sbjct: 303 SRIFGINIKGVKTDAFVPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKIDRGQMIFDSY 362
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
G+K N LL+YGFV + N ++ V + ++ + +D
Sbjct: 363 GRKCNSRFLLNYGFVVDD--NDANEVNVTVAAEFND 396
>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 139/305 (45%), Gaps = 30/305 (9%)
Query: 54 RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
+T+ ++ ++ IPW I + +TL S G R L A K I
Sbjct: 37 QTQASLDKDFIPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIHA 81
Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
G+ +L VP ++ IT D P+ L V + LA LI E + SRW YIS
Sbjct: 82 GDCMLKVPFNVQITPDE--LSPDIRVSLTD-EVGNIGKLAAVLIREKKKGQKSRWVPYIS 138
Query: 174 ALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
LP+ + +S ++W E + S + + +++ + ++ + YP +
Sbjct: 139 RLPQPAEMHSTIFWGEDEFS-MIRCSAVHKETVKQKAQIEKEFSFVAQAFKQHYPMVI-- 195
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
E +E F +++ ++ SR ++L+P+AD +NH + + D+ +Q T
Sbjct: 196 ERPYLEDFMYAYALVGSRAWE---TSKGISLIPFADFMNHDGLSASIVLSDEDNQLSEVT 252
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
DR Y PG++VFI YG+ SN L+L +GF VP N D V++ + + D KL
Sbjct: 253 ADRNYSPGDEVFIKYGEFSNATLMLDFGFTVP---YNIHDEVQIQMDVPNDDPLRDMKLG 309
Query: 351 ALRKY 355
L+ +
Sbjct: 310 LLQTH 314
>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
distachyon]
Length = 532
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 130/275 (47%), Gaps = 20/275 (7%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------ 127
+ +L KW D G+ K+ I RG+VA +NI G L +P SL+I+
Sbjct: 148 DTEDSLLKWGEDQGVK-SKLQIAFFQGAGRGMVASENIGVGHIALEIPESLIISEELLCQ 206
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+D + + + + + W + + SS + + LP + L +
Sbjct: 207 SDMFLALKDLNSITTETMLLLWSMRERH-------NPSSNFKMFFETLPSNFNTGLNFGI 259
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
L LE + + + ++ ++ Y++L + +K+P++F ++++ + F W+ + +
Sbjct: 260 GAL-AALEGTLLFDELMQARQHLHQQYDELFPMLCTKFPEIFTQDIYTWDNFLWACELWY 318
Query: 248 SR--LVRLPSMDGRVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
S +V L S L+P A +LNHS + + D++++ + F R + G+Q F
Sbjct: 319 SNSMMVVLSSGKLTTCLIPVAGLLNHSVYPHILNYGRVDQATKSLKFPLSRPCKAGQQCF 378
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+SYGK S L+ YGF+PRE NP D V L L +
Sbjct: 379 LSYGKHSGSHLITFYGFLPRE-DNPYDVVPLDLDM 412
>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
Length = 445
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 168/379 (44%), Gaps = 24/379 (6%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
S + S+L +W+ G KM I + GER ++A +I +GE +L +P + + T + +
Sbjct: 8 SEQKLSSLLRWMEQGGALFPKMHIVRQADGERSVLARTDIAEGEVVLQIPTTHLFTLE-R 66
Query: 132 WSCPEAGEVLKQCSVPD--WPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
+ G ++ PD + LA++L+ E S W ++ +LP P+ L+++
Sbjct: 67 AKASDIGRRIQSQLQPDNDFLYLASWLLEEKHRGADSFWKPFVDSLPEAYPHVPLFYSEQ 126
Query: 189 ELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
E R ++ SQ+ ER +E + + Y LR K P+ E F E + W+ L+
Sbjct: 127 ERAR-MKGSQL-ERLVEVQRQSFEQEYAQLR----EKLPEY---ERFGFEEYVWARISLY 177
Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
SRL L +LVP +DM NH + + Q R G ++ YG
Sbjct: 178 SRLFSLKGGLQGPSLVPLSDMFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYG 237
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL-SASECFPIQ 366
KS+ LL GFVP +G +D V L + L D K + +GL SA+ P +
Sbjct: 238 AKSSDVFLLHSGFVP-DGNEENDEVYLSVGLPPGDPLASVKQQM---FGLASATAKHPFK 293
Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILD 423
++ L +++ V S M + A SN++ S P +E+ L +
Sbjct: 294 VSRQGKYLASWS--VFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAA 351
Query: 424 SCESSISKYSRFLQVKELL 442
+CE + + L+ E L
Sbjct: 352 ACEERLKAFPTTLEEDERL 370
>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 719
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 140/293 (47%), Gaps = 23/293 (7%)
Query: 97 KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV--LKQCSVPDWPLLAT 154
+ + G RG VA ++I G+ ++ +P +L+++ + P+ G V L + LA
Sbjct: 43 ETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADPKYGHVHRLNTRLLGSDNGLAL 102
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
Y++ E E+ S + Y+ LP P +L W R L L+ ++ R R ++ Y
Sbjct: 103 YIMQEILKEERSFYWPYLRMLP-TPCNLRNWNRESL-LLLQDHKLVRRTAARSRQLLALY 160
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLPSMDGRVALVPWADMLNHS 272
+ + S YP+L+ + + E F +++ + +R RL S ALVP+AD LNH
Sbjct: 161 RETIEFLSSSYPELYTADRYTFELFDFAWRTIQARAFGKRLKSS----ALVPFADCLNHG 216
Query: 273 CEVETFLDYDKSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
V+T D+D G + + +Y +V SYG+++N LLL YGF + N
Sbjct: 217 -NVQTKYDFDVGGNGTFRLFPSGNNRYPRNSEVLNSYGRRANDNLLLDYGFAMLD--NEW 273
Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG-----WPLELMAY 377
D+ E+ SL S + L+ RK L AS ++I WP EL+ +
Sbjct: 274 DAAEVICSLPPSHD--QSPLDRRRKACLRASGQHTVRILRVRRDVWPEELLRF 324
>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
Length = 440
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 174/382 (45%), Gaps = 35/382 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L++WL++ G Q + RGL+A + I+ ++ +P ++T + A
Sbjct: 37 LRRWLNERGFTSQSLIPVNFHDTGRGLMATQTIKAKNSVISLPEECLLTTSTVLKSYMA- 95
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
+ +K+ P PLLA +LISE ++S W+ YI LP+ LY+ ++ R
Sbjct: 96 DYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDNVIELLPRS 155
Query: 194 LE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
L+ A+Q +E+ E ++ ++ L+ +F++ P EE+F+ + +W++ + +R V
Sbjct: 156 LQKKATQQKEQFQELFSSSQTFFHSLQ-PLFNQ-P---TEELFSQDALRWAWCSVNTRTV 210
Query: 252 RLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+ + AL P+ D+LNH V+ ++K ++ + + +Q F
Sbjct: 211 YMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAF 270
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
I+YG N LLL YGFV NP V L + L + DK KEKL L+
Sbjct: 271 INYGPHDNHRLLLEYGFVA--PCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFL 328
Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
+ F + W LM L+ P +++ + A+ ++ ++ C E AL
Sbjct: 329 RNLTFGMDGPSW--RLMTALRLLSLKPQQYTRWKSVLLGAA--VSQDREDWCI---ESAL 381
Query: 419 QFILDSCESSISKYSRFLQVKE 440
+ + E ++ R Q+KE
Sbjct: 382 KLCNNLTEDNVKALERLAQLKE 403
>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
Length = 567
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/386 (23%), Positives = 167/386 (43%), Gaps = 33/386 (8%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGER---GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+WL G K + K+D GLVA ++I++GE + +P +L IT +
Sbjct: 75 EWLKGKGFDESKCKV-KIDRNTSEGTGLVATQDIKEGEDFVEIPSNLFITTAVAFQGLGK 133
Query: 138 GEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
+L+ S+P LL+ +L+ E S +S W YI LP+Q ++ YW E ++
Sbjct: 134 PPILENDRLIQSIPGI-LLSIFLVKELS-NPTSEWGPYIKLLPKQYNTVYYWGLKEFTQF 191
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
+ + E A+ + + Y L I ++ P F + F W+ + SR +
Sbjct: 192 RGSPNL-EYAMRYVRGAMRQYCYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQSRQNPV 250
Query: 254 PSMDGR---VALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
+ +G +AL+P+ D NHS ++ +F Y S + + ++ GEQV++ YG
Sbjct: 251 YAGNGNGSIMALIPFWDFCNHSSTGSKITSF--YHMDSNCMTSGAIKDFKKGEQVYMFYG 308
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ- 366
+ N +LL+ GF + + S EL L L+ + + +K+ L + G+ +
Sbjct: 309 PRDNTQLLMHAGFATKTNLHDSYPFELHL-LEGNHEIRHDKVHLLEERGIRDGVVVNLNQ 367
Query: 367 ---ITGWPLELMAYAYL---------VVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
PLEL+ + + ++PP + G+ I E +
Sbjct: 368 NPTSNELPLELIPFYRIYALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENE 427
Query: 415 EQALQFILDSCESSISKYSRFLQVKE 440
E+A ++ + + ++ Y L+ E
Sbjct: 428 EKAYSNLVQALKGKLASYPTTLEEDE 453
>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
Length = 199
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 21/200 (10%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE-----ASQIRERAIER 206
L +L+ E + SS W Y+ LP +L+WT E+D + A +R +A E
Sbjct: 1 LVIFLLCERNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLRLKAEES 60
Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL---------PSMD 257
+ + L +R ++ F + FKW++ + +R V + P +
Sbjct: 61 FNRLCNGFLPLLVRQMPQF-----NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEE 115
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+ AL P+ D+LNH+ +VE +D SS+ TT +P +QVFI+YG SN +LLL
Sbjct: 116 DKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLE 175
Query: 318 YGFVPREGTNPSDSVELPLS 337
YGF NP +++ L LS
Sbjct: 176 YGFT--LPCNPHNNISLTLS 193
>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
reading frame 18 (C21orf18)) [Danio rerio]
Length = 440
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 173/382 (45%), Gaps = 35/382 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L++WL++ G Q + RGL++ + I+ L+ +P ++T + A
Sbjct: 37 LRRWLNERGFTSQSLIPVNFHGNGRGLMSTQTIKAKNSLISLPEECLLTTSTVLKSYMA- 95
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
+ +K+ P PLLA +LISE ++S W+ YI LP+ LY+ ++ R
Sbjct: 96 DYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDNVIELLPRS 155
Query: 194 LE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
L+ A+Q +E+ E ++ ++ L+ +F++ P EE+F+ + +W++ + +R V
Sbjct: 156 LQKKATQQKEQFQELFSSSQTFFHSLQ-PLFNQ-P---TEELFSQDALRWAWCSVNTRTV 210
Query: 252 RLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+ + AL P+ D+LNH V+ ++K ++ + + +Q F
Sbjct: 211 YMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAF 270
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
I+YG N LLL YGFV NP V L + L + DK KEKL L+
Sbjct: 271 INYGPHDNHRLLLEYGFVA--PCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFL 328
Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
+ F + W LM L+ P ++ + A+ ++ ++ C E AL
Sbjct: 329 RNLTFGMDGPSW--RLMTALRLLSLKPQQYTSWKSVLLGAA--VSQDREDWCI---ESAL 381
Query: 419 QFILDSCESSISKYSRFLQVKE 440
+ + E ++ R Q+KE
Sbjct: 382 KLCNNLTEDNVKALERLAQLKE 403
>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
Length = 583
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 140/303 (46%), Gaps = 20/303 (6%)
Query: 46 VSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGL 105
+ D +R +T + +++ S+E + +W + + K+ I V+ RG
Sbjct: 159 IQEVGDKNRLETRIVEDL--------SIEKEECILQWGERNDVR-TKLKIAYVEGAGRGA 209
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS 165
+A ++++ G+ L +P S+VI+ + + K + +L + + E +
Sbjct: 210 IATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKH-NSN 268
Query: 166 SRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
S+++ Y +ALP + L + E D L + + E IE ++ Y +L +
Sbjct: 269 SKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 325
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFL 279
+PD+FP E + E F W+ + +S +++ DG R L+P A LNHS + +
Sbjct: 326 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHYG 385
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
D + + F + GEQ ++SYG S+ L+ YGF+P +G N D++ L +
Sbjct: 386 KVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDNP 444
Query: 340 KSD 342
+ D
Sbjct: 445 QGD 447
>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
Length = 1335
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 132/289 (45%), Gaps = 9/289 (3%)
Query: 37 GHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQ 96
H ++ SV ++ T T +P+ D + + WL G+ K+ I
Sbjct: 766 SHLSIIEQSVILLSNLKNQFTKPTIKSVPFIKPTDEI--YRRFENWLKQGGVQFPKLQIA 823
Query: 97 K-VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
D RG+V K + + E ++ VP +I D S P G + ++ + D +L +
Sbjct: 824 NFTDSTGRGVVTTKKVDEDEVVVSVPRKYLINVDVAKSNPILGPIFEELHLNDETILFLF 883
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
+I E ++ W + LP + ++++ EL LE + + + + +
Sbjct: 884 VIYEKE-NPNTFWRPFYDTLPSYFTTSIHYSSTELLE-LEGTNLFAETLAVKQQLQAFRD 941
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSC 273
L + ++YPD+FPE VF+ E F W+ +L SR ++L +DG++ LVP ADM+NH
Sbjct: 942 YLFPELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMADMINHHT 1000
Query: 274 EVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ + +D+ S + Q+F+ YG N +L L YGFV
Sbjct: 1001 NAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGFV 1049
>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
Length = 482
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 128/279 (45%), Gaps = 17/279 (6%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
L +WL D K+ I+ G R L A + IR+GE +LF+P + ++ + K SC
Sbjct: 43 NLIEWLKDGKAEISKVQIEVQSEGHRTLRATQFIRQGEWVLFIPRTQYLSLEEVKKSCLI 102
Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
++++ P+ + TY ++ + + K S W YI LP+ + AE D
Sbjct: 103 NRKMIQINYKPN--NIQTYFVNHLLQENRRKYSFWKPYIDVLPKDVSGFPTYFDAEQDAL 160
Query: 194 LEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGILFSRL 250
L+ S I + Y +L+ ++ F KY + + + F + T SF
Sbjct: 161 LKGSPTLFTVINQRKVFKEEYENLKEAVKEFQKYGYTYDDFIKFRILTISRSFT------ 214
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
V++ + + LVP AD +NH Y K + G R Q GE++F +YG+ S
Sbjct: 215 VQIGEKEQQQLLVPLADFINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWS 274
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
N ++YGF TNP + +L + L K+D+ + K+
Sbjct: 275 NKYFFMNYGFASL--TNPMNQFDLDICLNKNDRLFNLKI 311
>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
dendrobatidis JAM81]
Length = 966
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/307 (27%), Positives = 145/307 (47%), Gaps = 24/307 (7%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVD----VGERGLVALKNIRKGEKLLFVPPSL 124
++D L + + +WL +G+ ++I+KVD VG G+ + + I KGE L+ +P L
Sbjct: 551 KLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVG-LGIFSTRQIHKGECLVKIPLKL 609
Query: 125 VITADSKWSCPEAGEVLKQCSV--PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
+++ D+ + P ++K + D ++ + + S W Y LPR
Sbjct: 610 ILSNDTS-AMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYFDLLPRVFTIP 668
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKW 241
+ + +L Y S I E + + ++ Y L+ IF P+ P F F W
Sbjct: 669 VLGSAQDLAAYTGTSIIDE-VVHDMIALMRQYLYLQ-HIFKSIPEPPIPLADFTFAAFSW 726
Query: 242 SFGILFSRLVRL----PS---MDGRVALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTD 293
+ I+ +R + PS M + L+P DM NH T D + + + D
Sbjct: 727 ARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHKPGNSTTQFDTKEYCSETIASCD 786
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEAL 352
PGEQ+FI YGK+SN E+LL GFV + TN D ++L +S+ +SD ++++ L
Sbjct: 787 --VSPGEQIFIHYGKRSNQEMLLYSGFV--DPTNIEYDHIKLSVSIPQSDPIRNQRVQLL 842
Query: 353 RKYGLSA 359
+ + LS+
Sbjct: 843 KLFNLSS 849
>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
Length = 566
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 140/303 (46%), Gaps = 20/303 (6%)
Query: 46 VSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGL 105
+ D +R +T + +++ S+E + +W + + K+ I V+ RG
Sbjct: 142 IQEVGDKNRLETRIVEDL--------SIEKEECILQWGERNDVR-TKLKIAYVEGAGRGA 192
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS 165
+A ++++ G+ L +P S+VI+ + + K + +L + + E +
Sbjct: 193 IATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKE-KHNSN 251
Query: 166 SRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
S+++ Y +ALP + L + E D L + + E IE ++ Y +L +
Sbjct: 252 SKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 308
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFL 279
+PD+FP E + E F W+ + +S +++ DG R L+P A LNHS + +
Sbjct: 309 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHYG 368
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
D + + F + GEQ ++SYG S+ L+ YGF+P +G N D++ L +
Sbjct: 369 KVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDNP 427
Query: 340 KSD 342
+ D
Sbjct: 428 QGD 430
>gi|325186836|emb|CCA21381.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 473
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 111/231 (48%), Gaps = 32/231 (13%)
Query: 151 LLATYLISEA-SFEKSSRWSNYISALPRQPYSLLYWTRAEL------DRYLEASQIRERA 203
LLA L+ E + S+W++++ LP++ +LLY++ E+ + Y A +++ER
Sbjct: 118 LLAIILLFEMYVLQSESKWAHHLEILPKEHRNLLYYSSDEVKALDGTNLYYVAHEMQERL 177
Query: 204 IERI----TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL------ 253
E T V+ + I S P + VF+ +KW+ I++SR V +
Sbjct: 178 HEDYEFIETRVLPELKHILKHILS--PSVSATTVFSFANYKWALSIIWSRFVSIEIDQEL 235
Query: 254 ---------PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
P+ V A+VP DMLNH + E YD +S TT + G Q+
Sbjct: 236 VSTLPFTIDPTKKHCVKAMVPVFDMLNHDPKAEMTHKYDAASGMFQLTTHQHLAAGTQLH 295
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC-YKEKLEALR 353
I+YG SN LL YGF+ NP D+VE+ L ++ + Y+EK E LR
Sbjct: 296 INYGPLSNHALLALYGFM--HSHNPHDTVEVHLQMESDNTSFYEEKEEFLR 344
>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
Length = 572
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 137/283 (48%), Gaps = 20/283 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L +WL K+ I+ RG+ A + I E +LF+P S +IT + A
Sbjct: 139 LLEWLKIGKAIFPKIKIECYSEDYRGVNAKQTINAKELILFIPKSHMITLEMAKETTVAK 198
Query: 139 EVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYL 194
++++ P L+T+L+ E F +S W YI LP P +++ ++L+ +L
Sbjct: 199 KMMQFRLDLLSPKHSFLSTFLLQE-KFRPNSFWKPYIDILPSSYPSFPIFYNNSDLE-WL 256
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVRL 253
+ S ++ +++ ++ YND+ ++ PE + F W+ SR+ +
Sbjct: 257 KGSPFLKQIKDKLADLQKDYNDI--------CNVVPEFTQYQFHEFCWARMTASSRIFGI 308
Query: 254 PSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
+++G A VP ADMLNH T Y QG + TD + + G+ +F SYG+K N
Sbjct: 309 -NINGVKTDAFVPLADMLNHKRPKLTSWCYSDEKQGFIIETDEKIERGQMIFDSYGRKCN 367
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
L+YGFV EG N ++ V L + ++D + K +A+++
Sbjct: 368 SRFFLNYGFVV-EG-NDANEVNLAVEADQNDPLLQLKEQAIKE 408
>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Cucumis
sativus]
Length = 499
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 116/233 (49%), Gaps = 19/233 (8%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
++I K +G R L A + IR G+ +L VP ++ I+ DS P L + + L
Sbjct: 67 LSIGKSSIG-RFLFASETIRAGDCILKVPFNVQISPDS---LPLPIRDLLGNEIGNVAKL 122
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
A ++ E S W+ YI LP QP+ + ++W +EL+ + S + E ++ + +
Sbjct: 123 AVVVLLEHKLGLGSEWAPYIIRLP-QPWEMHNTIFWKESELE-MIRKSSLYEESLNQRSQ 180
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
+ + +R K + FPE + + + F ++ ++ SR R S +G V+L+P+AD
Sbjct: 181 IKREFLAIR-----KALEAFPEIIDRISCDDFMHAYALVTSRAWR--STEG-VSLIPFAD 232
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
LNH E L D Q DR + PGE V I YGK SN L+L +GF
Sbjct: 233 FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGF 285
>gi|302818853|ref|XP_002991099.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
gi|300141193|gb|EFJ07907.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
Length = 428
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 38/322 (11%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
D G RGL +N+ +GE +L VP + +I PE G+VL + +L YL+
Sbjct: 36 DQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILTAYLL 95
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYN 215
SE + +SSRW +Y+ P+ ++L +++ E + + +A + + + E +
Sbjct: 96 SEVAKSRSSRWFSYLRHNPQVHHNLPHFSAMEAEELQVEDAISMAKSSFEDTQRQWRETS 155
Query: 216 DL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
L RLR+ P + + + W+ + SR + +P D V L P D+ N+
Sbjct: 156 SLLSRLRL--------PRKFTTFKAWLWAAATISSRTLHVPWDDAGV-LCPIGDLFNYDA 206
Query: 274 EVETFLD------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+E L Y+ S F R Y+ G+Q I YG+ +N ELL
Sbjct: 207 PIERTLSSRNEDDEHKFTSRLTDGGYETSISSYCFYARRSYKNGQQALICYGQYTNLELL 266
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LEL 374
YGF+ + NP D + +PL + ++ R++ L+A + I+ +G P L
Sbjct: 267 EHYGFLLPD--NPCDVIYIPLPSSEEFGLKSTGDKSERQHNLAA---YCIEASGKPSFSL 321
Query: 375 MAYAYLVVSPPSMKGKFEEMAA 396
+ L P S++ MA+
Sbjct: 322 LQQLRLRAVPASLRKSHGYMAS 343
>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
Length = 605
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 130/276 (47%), Gaps = 22/276 (7%)
Query: 53 SRTKTTVTQNMIPWGCEIDSLENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNI 111
++ KT Q+ I G L+ L +WL S L P K+ I+ RG+ A K I
Sbjct: 142 NQYKTQALQSSIDSG----ELDKQKRLLEWLKSGQALFP-KIKIECYAEDYRGVNARKAI 196
Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRW 168
E +LFVP S +IT + P A ++++ P L+T+L+ E + S W
Sbjct: 197 SSKEVILFVPRSHMITLEMAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQEKKIQ-DSFW 255
Query: 169 SNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
Y+ LP+ + +++ ++L+ +L+ S ++ ++IT++ Y D+
Sbjct: 256 KPYLDVLPKSYSNFPIFFNDSDLE-WLKGSPFLKQVKDKITDLKKDYCDI--------CQ 306
Query: 228 LFPEEVFN-METFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSS 285
+ PE + N + F W+ SR+ + + A VP ADMLNH T Y
Sbjct: 307 VAPEFLQNSFDEFCWARMTASSRIFGINIKGVKTDAFVPLADMLNHKRPKLTSWCYSDER 366
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
QG + TD + G+ +F SYG K N LL+YGFV
Sbjct: 367 QGFIIETDENIEKGQMIFDSYGSKCNSRFLLNYGFV 402
>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
Length = 474
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 174/386 (45%), Gaps = 42/386 (10%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
EN KWL D G+ K + V E GLVA ++I + E +L +P L W
Sbjct: 47 ENVRNFWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99
Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
PE A ++ C + W +A +LI E +E+ S W Y+ LP+ S ++W+
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRVYLDMLPQSTDSTVFWSEE 158
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
EL L+ +Q+ + V + L I DLF + ++ F W+FGIL
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGIL-- 214
Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPG 299
+ + ++ + + +NH+ ++T DY +G +F+ D + + G
Sbjct: 215 ------NRESLTSMFEF-EQINHNPAIKT-EDYAYEIKGAGLFSRDLLFSLKSPVYVKAG 266
Query: 300 EQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGL 357
EQV+I Y KSN EL L YGFV +NP +S L + + +SD + +KL+ +
Sbjct: 267 EQVYIQYDLNKSNAELALDYGFVE---SNPKRNSYTLTIEIPESDPFFGDKLDIAESNKM 323
Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
+ F I + G L YL + F + + +N + ++ +E+
Sbjct: 324 GETGYFDI-VDGQTLPAGMLQYLRLVALGGPDAF-LLESIFNNTIWGHLELPVSRTNEEL 381
Query: 418 L-QFILDSCESSISKYSRFLQVKELL 442
+ + + D+C+S++S + ++ E L
Sbjct: 382 ICRVVRDACKSALSGFDTTIEEDEKL 407
>gi|320170264|gb|EFW47163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 938
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 138/305 (45%), Gaps = 38/305 (12%)
Query: 98 VDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPEAGEVLKQCSVPDWP 150
VDV + R L+A ++ +++ +P L I+ D+ + P A + S
Sbjct: 55 VDVSQDWHQGRRLIADNPLKPDDRIAAIPTLLTISLDTALQVGLPRAFTTIWHESGSQDD 114
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
LLA +L+ E + S W+ YI LP++ +LL++ EL + L+ Q+ E+ ++ + +
Sbjct: 115 LLALFLLREKALGARSAWAPYIEILPKKLSNLLFFNDGELAQ-LQNEQLVEQVSQQKSEL 173
Query: 211 IGTYNDLRLR---IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
G + LR IF +L + F W+ I+ SR ++ R L+P+AD
Sbjct: 174 QGRFLALRQHEADIFGGKAELV------LSDFLWARAIVLSRAF---TIHARRYLIPFAD 224
Query: 268 MLNHSCEVETFLD---------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
+LNH LD +D + + T DR E+V YG SN + L Y
Sbjct: 225 LLNHRFHPTRGLDESGEFFYRHHDFQNGMFLLTCDRPVNENEEVEDDYGNLSNAQFLQLY 284
Query: 319 GFVPREGTNPSDSVELPLS--LKKSDKCYKEKLEALRKYGLSASECF----PIQITGWPL 372
GFVP +NP + VE+ L+ L + K E K G+ C P +TG L
Sbjct: 285 GFVPE--SNPHECVEINLADLLHGEREALLLKSEYAFKLGIPHIVCIGATRPPSVTG-AL 341
Query: 373 ELMAY 377
E +AY
Sbjct: 342 EAIAY 346
>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
anatinus]
Length = 499
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 135/297 (45%), Gaps = 24/297 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + RGL+A K+++ GE ++ +P + ++T D+ P G
Sbjct: 36 LKKWLKGRRFDGSNLRPARFPDTGRGLMATKSLKAGEMIISLPEACLLTTDTVLKSP-LG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + + P PLLA T+LI+E S W Y+ LP Q Y+ A + L
Sbjct: 95 DYIWKWKPPVSPLLALCTFLIAEKQAGARSLWQPYLGVLP-QAYTCPVGLDAAVLSLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
+ RA E+ T V + R FS LF E+V F ++ W++ + +R V +
Sbjct: 153 QPLGRRAREQRTAVRELFAASRA-FFSSLQPLFSEDVERVFTLDALGWAWCTVNTRTVYM 211
Query: 254 P-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
S + + AL P+ D+LNHS + ++K ++ T + + E+V I
Sbjct: 212 EHAQRDCFSAEADIYALAPYLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGL 357
YG N LLL YGFV NP +V L L DK +KL L+++G
Sbjct: 272 YGPHDNRRLLLEYGFVC--SNNPHSNVVVSPDVLVRHLPSGDKQMTKKLSLLKEHGF 326
>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
Length = 466
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 113/237 (47%), Gaps = 12/237 (5%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA +L+ E + +S W YI LP + + LY+ E+ R L+++Q + N
Sbjct: 29 LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 86
Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
Y ++ +P L +E F E ++W+ + +R ++P+ DG +AL+P
Sbjct: 87 RQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 145
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
DM NH+ + T Y+ + +Q G+Q++I YG +SN E ++ GF
Sbjct: 146 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 202
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
N D V++ L + KSD+ Y K E L + G+ S F + T P+ A+L V
Sbjct: 203 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSTEPPISAQLLAFLRV 259
>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
Length = 450
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 113/237 (47%), Gaps = 12/237 (5%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA +L+ E + +S W YI LP + + LY+ E+ R L+++Q + N
Sbjct: 11 LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 68
Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
Y ++ +P L ++ F E ++W+ + +R ++P+ DG +AL+P
Sbjct: 69 RQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 127
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
DM NH+ + T Y+ + +Q G+Q++I YG +SN E ++ GF
Sbjct: 128 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 184
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
N D V++ L + KSD+ Y K E L + G+ S F + T P+ A+L V
Sbjct: 185 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLAFLRV 241
>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
Length = 490
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 139/323 (43%), Gaps = 54/323 (16%)
Query: 76 ASTLQKWLSDS-----------GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
T +WLSD GL PQ + GLVA ++I + E +L +P L
Sbjct: 50 VQTFWQWLSDQDVVSAKTPARPGLVPQGL----------GLVAQRDISRNEVVLEIPKKL 99
Query: 125 VITADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
I D A E+ C V W +A +LI E ++ S W Y+ LP S +
Sbjct: 100 WINPD----VVAASEIGNVCGGVKPWVSVALFLIRE-KLKEDSTWRPYLDVLPESTNSTI 154
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
+W+ EL L+ +Q+ + + + + + I + LFP V ++ F W+F
Sbjct: 155 FWSEEELAE-LQGTQLLSTTLGVKSYLRREFLKVEEEILVPHKQLFPSPV-TLDDFSWAF 212
Query: 244 GILFSR---------LVRLPSMDGRVALVPW-ADMLNHSCEVETFLD--YDKSSQGVVFT 291
GIL SR LV +P D L W D +NHS ++ T D Y+ G +F+
Sbjct: 213 GILRSRSFSRLRGQNLVLIPLADLCNFLHTWLLDQVNHSPDI-TIEDGVYEIKGAG-LFS 270
Query: 292 TDRQY--------QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
D + + GEQV I Y SN EL + YGF+ E + + L L + +SD
Sbjct: 271 RDLIFSLRSPISLKAGEQVLIQYNLNLSNAELAVDYGFI--EAKSDRNMYTLTLQISESD 328
Query: 343 KCYKEKLEALRKYGLSASECFPI 365
+ +KL+ GL F I
Sbjct: 329 PFFGDKLDIAETNGLGEIADFDI 351
>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
Length = 439
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 25/308 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 35 LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+K+ P PLLA T+L+SE S W +Y+ LP+ Y+ E+ L
Sbjct: 94 PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
S ++ +A E+ V + R FS LF E VF+ F W++ + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
S AL P+ D+LNHS V+ +++ ++ T + + ++VFI
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270
Query: 306 YGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
YG N LLL YGFV + P + L L +DK K+ L+ +G + +
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLT 330
Query: 363 FPIQITGW 370
F GW
Sbjct: 331 F-----GW 333
>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
Length = 439
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 25/308 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 35 LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+K+ P PLLA T+L+SE S W +Y+ LP+ Y+ E+ L
Sbjct: 94 PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
S ++ +A E+ V + R FS LF E VF+ F W++ + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
S AL P+ D+LNHS V+ +++ ++ T + + ++VFI
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270
Query: 306 YGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
YG N LLL YGFV + P + L L +DK K+ L+ +G + +
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLT 330
Query: 363 FPIQITGW 370
F GW
Sbjct: 331 F-----GW 333
>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
Length = 468
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 113/237 (47%), Gaps = 12/237 (5%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA +L+ E + +S W YI LP + + LY+ E+ R L+++Q + N
Sbjct: 29 LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 86
Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
Y ++ +P L ++ F E ++W+ + +R ++P+ DG +AL+P
Sbjct: 87 RQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 145
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
DM NH+ + T Y+ + +Q G+Q++I YG +SN E ++ GF
Sbjct: 146 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 202
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
N D V++ L + KSD+ Y K E L + G+ S F + T P+ A+L V
Sbjct: 203 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLAFLRV 259
>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
Length = 371
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 67/258 (25%), Positives = 116/258 (44%), Gaps = 26/258 (10%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A + + G + L +P +IT ++ P L S P L+ +L+SE
Sbjct: 6 RALFATRRVPAGSRFLEIPRIAIITPEN---VPSQVSHLLSTSNPK-TRLSLFLLSEKHK 61
Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S+W+ Y+ LP+ S ++W EL +L+ S +E + + ++ L
Sbjct: 62 AQESQWAPYLRCLPQLGDIESTMFWKDEEL-AWLKHSPTYRETMECLKIIKSEFHVLEAN 120
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+F D+ E + + + S D +P+AD NH +T L
Sbjct: 121 VFPWCRDVLGE-------------VSLTDFMHAYSTDQ----IPFADFFNHDHNCQTRLS 163
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
YDK V D+ Y+ G+++F+SYG N L + YGF +NP + VE+P+ +
Sbjct: 164 YDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAV--ASNPHEQVEVPMGVSL 221
Query: 341 SDKCYKEKLEALRKYGLS 358
+D KL+ L ++ +S
Sbjct: 222 TDPLRDLKLQTLSRHNMS 239
>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
Length = 430
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 22/303 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L +WL G + I+ + E G+ A + + G+ L VP L + +S + A
Sbjct: 10 LLEWLEAHGAADSLLDIRYLGKLEGHGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLA 69
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD-----R 192
+ K +PD +LA +L+ E S S ++ +I+++P ++WT AEL+
Sbjct: 70 PVLEKYPQIPDDEVLALHLMHERSKGGESFFAPFIASMPTTFDLPVFWTEAELNELKGTN 129
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLV 251
L +Q+ ++ +ER + ++ + + +PD+F ++ + W+ +++SR
Sbjct: 130 VLLLTQLMKQHLER------DFENIHQAVAADFPDIFASLPTLTIDDYMWAMSVIWSRAF 183
Query: 252 RLPSMDGRV--ALVPWADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ S G+ L P DM NH V + F+ +++ Q + G V IS
Sbjct: 184 GV-SKGGKYLHVLCPAMDMFNHDVTVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHIS 242
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
YG+ SN +LL SYGFV E N V+ + + SD +K K L L+ + +
Sbjct: 243 YGQYSNAKLLYSYGFVSPE--NFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDF 300
Query: 366 QIT 368
T
Sbjct: 301 HGT 303
>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
Length = 439
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 32/312 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L+KWL + + + G RGL++ ++++G+ ++ +P S ++T D+
Sbjct: 34 LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSV 92
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
G +K+ P PLLA T+L+SE S W +Y+ LP+ P L L
Sbjct: 93 GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 152
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
L A +RA R+ ++ + D FS LF E V F+ F W++ + +R
Sbjct: 153 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 206
Query: 250 LVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
V L S AL P+ D+LNHS V+ +++ ++ T + + ++
Sbjct: 207 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQE 266
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLS 358
FI YG N LLL YGFV + V + LK +DK +KL L +G +
Sbjct: 267 AFICYGPHDNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFT 326
Query: 359 ASECFPIQITGW 370
+ F GW
Sbjct: 327 GNLTF-----GW 333
>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
Length = 440
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 133/296 (44%), Gaps = 24/296 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKDRKFQDSNLVPARFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIQ-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPI 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYG 356
YG N L L YGFV G NP V L L +DK +K+ L+ +G
Sbjct: 272 YGPHDNHRLFLEYGFV--SGHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
variabilis]
Length = 398
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 119/268 (44%), Gaps = 19/268 (7%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
LV K + KGE+L VP + ITAD+ + G L + W +A +L+ E +
Sbjct: 1 LVCSKAVNKGEQLFAVPEAAWITADTAQQS-QIGSHL--TGLESWLAIALFLLHERAMGN 57
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
+SRW+ YI+ LP S + W A+L L+ SQ+ ++ L+ +F
Sbjct: 58 ASRWAPYIALLPADSGSPVQWEEADLAE-LQGSQVLGTVQGYRAYFQQRFDQLQAEVFGP 116
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH-----------SC 273
F VFN + F W+ + +R P G +ALVP ADM+
Sbjct: 117 NSQAFDPIVFNFDAFLWAACTVRAR-AHPPLDGGNIALVPLADMVRSQPSWPPDSAGWQL 175
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSV 332
+ L S+Q +V G+ + + +G +KS+G+LL+ +G + P S
Sbjct: 176 KQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPLVNQP--SY 233
Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSAS 360
L L L K D+ Y +K + L L+ S
Sbjct: 234 ALTLELSKEDRNYDDKADILELNELAES 261
>gi|302819975|ref|XP_002991656.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
gi|300140505|gb|EFJ07227.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
Length = 428
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 38/322 (11%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
D G RGL +N+ +GE +L VP + +I PE G+VL + +L YL+
Sbjct: 36 DQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILTAYLL 95
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYN 215
SE + SSRW +Y+ P+ +SL +++ E + + +A + + ++E +
Sbjct: 96 SEVAKSCSSRWFSYLRHNPQVHHSLPHFSAMEAEELQVEDAISMAKSSLEDTQRQWRETS 155
Query: 216 DL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
L RLR+ P + + + W+ + SR + +P D V L P D+ N+
Sbjct: 156 SLLSRLRL--------PRKFTTFKAWLWAAATISSRTLHVPWDDAGV-LCPIGDLFNYDA 206
Query: 274 EVETFLD------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+E + Y+ S F R Y+ G+Q I YG+ +N ELL
Sbjct: 207 PIERTMSSRNEDDELEFTNRLTDGGYETSISSYCFYARRSYKKGQQALICYGQYTNLELL 266
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LEL 374
YGF+ + NP D + +PL + ++ R++ L+A + I+ +G P L
Sbjct: 267 EHYGFLLPD--NPCDVIYIPLPSPEEFGLKSTGDKSERQHNLAA---YCIEASGKPSFSL 321
Query: 375 MAYAYLVVSPPSMKGKFEEMAA 396
+ L P S++ MA+
Sbjct: 322 LQQLRLRAVPASLRKSHGYMAS 343
>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
Length = 475
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 32/312 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L+KWL + + + G RGL++ ++++G+ ++ +P S ++T D+
Sbjct: 70 LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSV 128
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
G +K+ P PLLA T+L+SE S W +Y+ LP+ P L L
Sbjct: 129 GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 188
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
L A +RA R+ ++ + D FS LF E V F+ F W++ + +R
Sbjct: 189 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 242
Query: 250 LVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
V L S AL P+ D+LNHS V+ +++ ++ T + + ++
Sbjct: 243 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQE 302
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLS 358
FI YG N LLL YGFV + V + LK +DK +KL L +G +
Sbjct: 303 AFICYGPHDNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFT 362
Query: 359 ASECFPIQITGW 370
+ F GW
Sbjct: 363 GNLTF-----GW 369
>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
Length = 266
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 110/235 (46%), Gaps = 18/235 (7%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE--- 159
RGL A + +R GE++L + L+I P + +V W LA ++ E
Sbjct: 21 RGLFASRPVRAGERVLEISLDLMIAPSD---LPGELSTVLSSTVKPWTKLALIVLMERYK 77
Query: 160 -ASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
+ +SS W+ YIS LP +P L W EL YL AS + + ER+ + +
Sbjct: 78 GQAKLQSSAWAPYISCLP-EPAELDNTFLWEDTELS-YLRASPLYGKTRERLEMITTEFG 135
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
++ +P LF + ++E FK + +FSR + + D + ++P D NH+
Sbjct: 136 QVQ-NALDVWPQLFGK--VSLEDFKHVYATVFSRSLAIGE-DSTLVMIPMLDFFNHNATS 191
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
L ++ V T DR Y +Q++I+YG SN EL L YGF E NP D
Sbjct: 192 FAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPE--NPYD 244
>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
Length = 374
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/262 (26%), Positives = 116/262 (44%), Gaps = 31/262 (11%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A + + G + L +P +IT ++ P L S P L+ +L+SE
Sbjct: 6 RALFATRRVPAGSRFLEIPRIAIITPEN---VPSQVSHLLSTSNPK-TRLSLFLLSEKHK 61
Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S+W+ Y+ LP+ S ++W EL +L+ S +E + + ++ L L
Sbjct: 62 AQESQWAPYLRCLPQLGDIESTMFWKAEEL-AWLKHSPTYRETMECLKIIKSEFHLLTLA 120
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
N + F W L + + S D +P+AD NH +
Sbjct: 121 --------------NKQVFPWCRDALGEVSLTDFMHAYSTDQ----IPFADFFNHDHNCQ 162
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
T L YDK V D+ Y+ G+++F+SYG N L + YGF +NP + VE+P+
Sbjct: 163 TRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAV--ASNPHEQVEVPM 220
Query: 337 SLKKSDKCYKEKLEALRKYGLS 358
+ +D KL+ L ++ +S
Sbjct: 221 GVSLTDPLRDLKLQTLSRHNMS 242
>gi|389622275|ref|XP_003708791.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
gi|351648320|gb|EHA56179.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
gi|440464619|gb|ELQ34017.1| hypothetical protein OOU_Y34scaffold00823g1 [Magnaporthe oryzae
Y34]
Length = 419
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 119/234 (50%), Gaps = 15/234 (6%)
Query: 79 LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
++ WL ++G + + + + RG+ L++ ++GEK+L +P + T + +
Sbjct: 1 MENWLKETGAVGLDNLELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLL 60
Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
G L+ SV D +LATY++ S E ++++ALP S +++ EL+
Sbjct: 61 GPALRSVRPPLSVED--ILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAEEELEV 118
Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
S + ++ +RI + Y L +R+ ++ DLFP E F +E +KW+ ++SR
Sbjct: 119 CAGTSLYTVTKQLEQRIED---DYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRA 175
Query: 251 VR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ LP + L P+ADMLNHS V+ YD SS+ + + Y+ G+Q+
Sbjct: 176 MDFVLPGGNSIRLLAPFADMLNHSDNVKQCHAYDSSSKTLSVLAGKDYEAGDQL 229
>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 8/275 (2%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
S + L+KW +G+ + I V+ RG +A K+++ G+ L +P S++I+ +
Sbjct: 115 SCDKEKCLEKWGESNGVK-TSLKIACVEGAGRGAIATKDLKVGDIALEIPVSIIISEEHV 173
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+ K + +L + + E SS++ Y LP + + L + +
Sbjct: 174 HKSDMYHILEKIDGITSETMLLLWSMKE-RHNCSSKFKIYFDTLPEEFKTGLSFGVDAI- 231
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
L+ + + E ++ ++ Y++L + YPD+F E++ E F W+ + +S +
Sbjct: 232 MALDGTLLLEEIMQAKEHLRVQYDELVPPLCKNYPDVFLPELYTWEQFLWACELWYSNSM 291
Query: 252 RLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
++ +DG R L+P A LNHS + + D ++ + F R GEQ +SYG
Sbjct: 292 KVMFVDGKLRTCLIPIAGFLNHSLYPHIVHYGKVDSATNTLKFPLTRPCCFGEQCCLSYG 351
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
S+ L+ YGF+P +G NP D + L + + +D
Sbjct: 352 NFSSSHLITFYGFMP-QGDNPCDVIPLDIDVGDAD 385
>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 132/308 (42%), Gaps = 57/308 (18%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
RG+V +N+ KGE L+ +P ++ S K + EA + + V ++A +L+ E
Sbjct: 149 RGVVTTRNVTKGETLVAIPLEKCLSTFSARKSAIGEALKTITSREVTIDAVIALHLLHEL 208
Query: 161 SFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
++ S W ++S LPR + L WT EL + LE S + I V+ + R
Sbjct: 209 YVQREKSEWWPWVSILPRDVETPLLWTPRELAQ-LEGSNL----IGFRDAVLKGWTTQRD 263
Query: 220 RIF----SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRVA 261
+F K+P LFPEE F E + W+ I++SR +P S + RV
Sbjct: 264 ALFPKLTQKFPSLFPEEHFRTERWAWAMAIVWSRAADVPVPRPEAIFPSGDDKSRELRV- 322
Query: 262 LVPWADMLNHSCEVETFL---------------------------DYDKSSQGVVFTTDR 294
+VP DM+NH + +D S + V
Sbjct: 323 IVPLFDMINHGYDHAPVTPGGVKGGGGEGREKGGVGVDDSPALIPSWDPSRRMVAIRAGV 382
Query: 295 QY-QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ P +V +YG K + +LL YGFVP NP +SVE+ + DK K E LR
Sbjct: 383 PFPGPNYEVRFNYGAKPSQHVLLQYGFVPM--NNPDESVEVAMHAGSRDKLKSLKSELLR 440
Query: 354 KYGLSASE 361
+ LS E
Sbjct: 441 THELSPRE 448
>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
Length = 397
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 138/300 (46%), Gaps = 31/300 (10%)
Query: 87 GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV 146
GLP + + + G RGL++ ++++G+ ++ +P S ++T D+ G +K+
Sbjct: 7 GLP-----VIRTEAG-RGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLGPYIKKWKP 59
Query: 147 PDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
P PLLA T+L+SE S W +Y+ LP+ Y+ E+ L S ++ +A
Sbjct: 60 PVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-PSPLKAKAE 117
Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
E+ V + R FS LF E VF+ F W++ + +R V L S
Sbjct: 118 EQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECL 176
Query: 261 -------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
AL P+ D+LNHS V+ +++ ++ T + + ++VFI YG N
Sbjct: 177 SAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQR 236
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
LLL YGFV + V + +K +DK K+ L+ +G + + F GW
Sbjct: 237 LLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLTF-----GW 291
>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 424
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 102/203 (50%), Gaps = 14/203 (6%)
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR----LRI 221
S+W+ +I LP+ ++ LY+ E+ + LE S + A + V Y L+ +
Sbjct: 108 SKWAKHIELLPKTYHNALYFEAGEI-KALEGSNLFFIAQQMEEKVASDYAVLKESVLFEL 166
Query: 222 FSKYP-----DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
F DLF +E+F+++ +KW+ ++SR V + A+VP DMLNH E E
Sbjct: 167 FENITEGITVDLF-DEIFSLDNYKWALSTIWSRFVLPVAKQSFKAMVPVFDMLNHDPEAE 225
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+D +Q + + + G Q+FI+YG SN +LL YGFV N D+V++ L
Sbjct: 226 MSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKLLSLYGFVII--GNLFDAVDMWL 283
Query: 337 SLKK-SDKCYKEKLEALRKYGLS 358
+ + S K Y EK + L GL
Sbjct: 284 PMDEASTKFYHEKEQLLLVNGLD 306
>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
Length = 370
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/280 (30%), Positives = 131/280 (46%), Gaps = 17/280 (6%)
Query: 77 STLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS--KW 132
+ ++WL + Q +++++ D R +A K I KG LL +P L+IT + KW
Sbjct: 6 NQFERWLEAHQVSQWKQLLSLERYDNNYRTFLAKKPITKGSILLEIPDPLLITGNKVCKW 65
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR-AELD 191
+Q S LL + + S + S W Y+ LP Y LL+ R L
Sbjct: 66 LERNNWIGHQQISSVQGVLLVSIFLFFESRQSDSFWKPYLQVLP-TSYDLLFLYRDGLLL 124
Query: 192 RYLEASQIRERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
Y+ + I + +E + ++ T+ + FS D V E +W ++ SR+
Sbjct: 125 SYVTEADIMQ-MVESVRRILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVV-SRI 182
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLD--YDKSSQGV-VFTTDRQYQPGEQVFISYG 307
LP D ALVP D+ NH V+T +D Y K +G VF R + G QVF+SYG
Sbjct: 183 CYLPD-DIAGALVPLGDIFNHEA-VDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYG 240
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYK 346
SN EL++ YGF + NP D++ P L +S K Y+
Sbjct: 241 ALSNTELMMYYGFTLND--NPWDTLSFYPHELDESIKFYE 278
>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
occidentalis]
Length = 381
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 127/287 (44%), Gaps = 36/287 (12%)
Query: 79 LQKWLSDSGLPPQK-MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L W+ G P + + RG+V L NI G+ ++ +P +L+IT D
Sbjct: 24 LYSWIQRLGFKPTSVLRLACTPASGRGIVCLSNIEAGDVIIDLPSTLLITPD----LVRK 79
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
+ + ++ +L +++SE S + S+W YI ++P +
Sbjct: 80 ELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESIPD---------------VFDGL 124
Query: 198 QIRE--RAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRL 253
Q R+ R R+ I +N R +FS+ F N ETF W++ + +R + +
Sbjct: 125 QCRKSVRLPRRLAQAIDRWNAERRNVFSRLRMFFRGRGIDLNFETFSWAWSAVNTRCIYV 184
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
L P+ D+LNH + ++ + + ++ Y+ G +VFI YG N
Sbjct: 185 EGHGS--TLAPFLDLLNHHWKAS--IETSFVNNHFIIRSNVGYEAGSEVFIGYGSHDNRT 240
Query: 314 LLLSYGFVPREGTNPSDSVELPLS----LKKSDKCYK--EKLEALRK 354
L L+YGFV E NP+D + + L LK+S ++ K+E LR+
Sbjct: 241 LFLNYGFVLDE--NPNDCITVELEHLEKLKRSRNIHEFARKIEFLRQ 285
>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
Length = 514
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 155/341 (45%), Gaps = 25/341 (7%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+L KW G+ ++ I RG++A ++I G+ L +P SL+I+ D E
Sbjct: 125 SLLKWGEHLGIK-SRLQIAYFQGAGRGMIASESIGVGDIALEIPESLIIS-DELLCQSEV 182
Query: 138 GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
LK + + +L + + E + S++ Y LP + L + L LE
Sbjct: 183 FLSLKDFNNITSETMLLLWSMRE-RYNLGSKFKPYFDTLPANFNTGLSFGIDAL-AALEG 240
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ + + I+ ++ Y++L + + +P++F ++V + F W+ + +S + +
Sbjct: 241 TLLFDEIIQARQHLRQQYDELFPLLCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLS 300
Query: 257 DGRVA--LVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
G+++ LVP A +LNHS + + D++++ + F R GEQ F+SYGK
Sbjct: 301 SGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGS 360
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--------LRKYGLSASECFP 364
LL YGF+PR G NP D + L L D+ + A +R LS S FP
Sbjct: 361 HLLTFYGFLPR-GDNPYDVIPLDLDTSADDEDITAQSSATTSQTTHMVRGTWLSTSGGFP 419
Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK 405
G P L+ Y ++ + +E AA A K + K
Sbjct: 420 TY--GLPQPLLTYLR-----AALGCEVDEPAAEADMKESDK 453
>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 426
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 71/300 (23%), Positives = 139/300 (46%), Gaps = 20/300 (6%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
+TL +WL +G + I+ + E G+ A + + G+ L +P L + +S
Sbjct: 7 VTTLLEWLKANGGVDNLLDIRYLGKLEGHGVFAKQALTSGQVTLRIPFKLTMNIESAARS 66
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--- 191
A + K +PD +LA +L+ E S S ++ +I++LP ++W+ +EL+
Sbjct: 67 DLARVLEKYPQIPDDEVLALHLMHERSKRSDSFFAPFIASLPTTFDLPVFWSESELNELK 126
Query: 192 --RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFS 248
L +Q+ ++ ++R + ++ + +P++F +E + W+ +++S
Sbjct: 127 GTNVLLLTQLMKQQLQR------DFENIHQAVVEDFPEVFALLPTLTLEDYTWAMSVIWS 180
Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVETFLD----YDKSSQGVVFTTDRQYQPGEQVF 303
R + + L P DM NH + LD +D+ +Q + ++ G +
Sbjct: 181 RAFGVTREKKYLRVLCPAMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQ 240
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
ISYG+ SN +LL SYGFV +E N +V+ + + +D K K L L+ + +
Sbjct: 241 ISYGQYSNAKLLFSYGFVAKE--NSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTY 298
>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 129/259 (49%), Gaps = 21/259 (8%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
++I K G R L A K+I+ G+ +L VP ++ I+ D+ P L V + L
Sbjct: 61 LSIGKSTYGSRSLFASKSIQTGDCILKVPYNVQISPDN---VPSKINSLLGDEVGNIAKL 117
Query: 153 ATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
A + E + S W+ YI+ LP+ + +S ++W+ EL + ++ S + + I + +
Sbjct: 118 AIVISVEWKMGQDSEWAPYINRLPQPGEMHSTIFWSEGEL-KMIQQSSVYQETINQKAQI 176
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
+ ++ + +LF + +++ F + ++ SR S G ++L+P+AD +N
Sbjct: 177 QKDFLAIKPVLHHFSENLFKD--ISLKEFMHACALVGSR--AWGSTKG-LSLIPFADFVN 231
Query: 271 HSCEVETFL--DYDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPR 323
H ++ L D DK SS + DR Y PGEQV I YGK N LLL +GF +P
Sbjct: 232 HDGFSDSVLLGDEDKQLSESSSTLEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLP- 290
Query: 324 EGTNPSDSVELPLSLKKSD 342
N D V++ +++ D
Sbjct: 291 --YNIYDQVQIQVNIPHHD 307
>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
Length = 399
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 129/287 (44%), Gaps = 31/287 (10%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEA 160
RGL++ ++++G+ ++ +P S ++T D+ G +K+ P PLLA T+L+SE
Sbjct: 19 RGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSVGPYIKKWKPPVSPLLALCTFLVSER 77
Query: 161 SFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
S W +Y+ LP+ P L L L A +RA R+ ++ + D
Sbjct: 78 HAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPGPLRAKAEEQRA--RVQDLFASSRDF 135
Query: 218 RLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWA 266
FS LF E V F+ F W++ + +R V L S AL P+
Sbjct: 136 ----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFL 191
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
D+LNHS V+ +++ ++ T + + ++ FI YG N LLL YGFV
Sbjct: 192 DLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEAFICYGPHDNQRLLLEYGFVAFGNP 251
Query: 327 NPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
+ V + LK +DK +KL L +G + + F GW
Sbjct: 252 HACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTGNLTF-----GW 293
>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Oreochromis niloticus]
Length = 607
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 145/327 (44%), Gaps = 37/327 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L W ++G + + GL A ++I+ E L++P +++T +S +
Sbjct: 82 LMSWAKENGASCECFTVANFGKEGYGLRATRDIKAEELFLWIPRKMLMTVESAQNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L D L LA +L+ E + +S W YI +LP Q Y + + + E
Sbjct: 138 -ILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLP-QEYDIPLYYQQEDV 194
Query: 192 RYLEASQIRERAIERITNVI-------------GTYNDLRLRIFSKYPDLFPEEVFNMET 238
+ L +Q + + + N G + LR+F+ + ++F+
Sbjct: 195 QLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFASLTPVMGGKLFD--- 251
Query: 239 FKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
+W+ + +R ++P+ DG +AL+P DM NH+ + T Y+ +
Sbjct: 252 -QWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQD 309
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
Y+ EQ++I YG +SN E ++ GF ++ + D V++ L + KS++ Y K E L +
Sbjct: 310 YKENEQIYIFYGTRSNAEFVIHNGFFFQD--DAHDRVKIKLGVSKSERLYAMKAEVLARA 367
Query: 356 GLSASECFPIQITGWPLELMAYAYLVV 382
G+ AS F + P+ A+L V
Sbjct: 368 GIPASYVFALHCNEPPISAQLLAFLRV 394
>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
Length = 559
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 150/324 (46%), Gaps = 46/324 (14%)
Query: 71 DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+S E + L +W +G+ + I V+ RG +A +++ G+ +L +P +++I+
Sbjct: 154 NSCEKGNCLLEWGESNGVR-TSLKIAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISE-- 210
Query: 131 KWSCPEAGEVLKQCSVPDWPLLA--------TYLISEASFEK---SSRWSNYISALPRQP 179
E++++ ++ +P+L+ T ++ + EK S + Y LP
Sbjct: 211 --------ELVQKSTM--YPVLSKVEGMLPETMMLLWSMKEKHIVDSEFRVYFDTLPEAF 260
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
+ L + + L + + + ++ ++ YN+L + + +PD+FPEE ++ E F
Sbjct: 261 NTGLSFGVGAMTT-LVGTLLFDELMQAKEHLRKQYNELFPALCNNHPDIFPEEFYSWEEF 319
Query: 240 KWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQ 295
W+ + +S +++ DG R LVP A LNHS + + D + + F R
Sbjct: 320 LWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGKVDSDTDSLKFRLSRP 379
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK------------ 343
+ GE+ ++SYG S L+ YGF+P EG N +D + L + D
Sbjct: 380 CRAGEECYLSYGNYSGSHLVTFYGFLP-EGDNVNDVIPLDIDFGDDDNNNITSDWSTHMV 438
Query: 344 --CYKEKLEALRKYGLSAS--ECF 363
+ K++++ YGL + ECF
Sbjct: 439 RGTWLSKIQSIFHYGLPSPFLECF 462
>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
Length = 353
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 126/273 (46%), Gaps = 27/273 (9%)
Query: 79 LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L +W+ +G ++ +++ RGL L+ + GE L VP L+IT + S
Sbjct: 29 LLEWMIANGFELHVQLCVREFTETGRGLATLQKVTAGETFLRVPTCLLITTTTALSSSLH 88
Query: 138 GEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
G +++ + +L +LI+E S W +I +LP + ++ L R E
Sbjct: 89 GFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSLPVSYTTPVFLGSKLLARLPET 148
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM-ETFKWSFGI---------- 245
+ +A +++ + T+ +RL+I K L + N+ E F W +
Sbjct: 149 --MCRKAEAQVSRIRRTF--VRLQILLKRALLDDSALLNLSENFTWHLFVWAWTAVNTRC 204
Query: 246 LFSRLVRLPSM--DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF--TTDRQYQPGEQ 301
+FS+ S D AL P+ D LNH + D + + +G F T+ Y+P +Q
Sbjct: 205 IFSKHRTDHSFWDDDYCALAPFLDCLNHHWKA----DVETTVEGSYFEIVTNNNYEPNDQ 260
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
VFISYG N +LLL YGFV + NP+D V +
Sbjct: 261 VFISYGSHDNKKLLLEYGFVLAD--NPNDVVAI 291
>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 133/282 (47%), Gaps = 21/282 (7%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
L +WL D K++I+ G R L A + IR+GE +LFVP + ++ + K SC
Sbjct: 42 NLIQWLKDGKAEVSKVSIEVKSEGYRTLRASQFIRQGEWVLFVPRTHYLSLEEVKKSCLI 101
Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
++++ +P+ + TY ++ + + ++S W YI LP+ AE D
Sbjct: 102 NRKMIQLNYIPN--NIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTNFDAEQDAL 159
Query: 194 LEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGILFSRL 250
L+ S + + Y++L+ ++ F +Y + + V F T SF
Sbjct: 160 LKGSPTLFTVMNQRKTFQEEYDNLKEAVKEFQRYGYTYNDFVKFRTLTISRSFP------ 213
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS--SQGVVFTTDRQYQPGEQVFISYGK 308
V + + + LVP AD +NH + FL Y S + G R Q GE++F +YG+
Sbjct: 214 VYIGENEQQQLLVPLADFINH--DNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQ 271
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
SN ++YGF TNP + + + L ++D+ +K K+E
Sbjct: 272 WSNKYFFMNYGFASL--TNPMNQFDFDICLDRNDRMFKMKVE 311
>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
Length = 440
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 135/310 (43%), Gaps = 29/310 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + + RGL++ ++R+G+ ++ +P S ++T D+
Sbjct: 36 LKKWLKDRNFEDTNLMPARFPGTGRGLMSKTSLREGQMIISLPGSCLLTTDTVIRSSLGA 95
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
++K P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 96 YIIKWKPPPS-PLLALCTFLVSEKHAGDQSVWKPYLDILPKS-YTCPVCLEPEVVNLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
++ +A E+ +V + R FS LF E VF+ W++ + +R V L
Sbjct: 153 EPLKAKAEEQRMSVQQFFASSR-DFFSSLQPLFEEATDSVFSYSALLWAWCTVNTRAVYL 211
Query: 254 PSMD--------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T Y+ ++VFI
Sbjct: 212 RTRRRDCLSLEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYGLSAS 360
YG N LLL YGFV NP V L L +DK +K+ L+ +G +
Sbjct: 272 YGPHDNHRLLLEYGFVSL--CNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLEN 329
Query: 361 ECFPIQITGW 370
F GW
Sbjct: 330 LTF-----GW 334
>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
Length = 440
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 124/270 (45%), Gaps = 24/270 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLF-VPPSLVITADSKWS-CP 135
+KWL D G + + D G RGL+ K ++ L+ +P ++T D+ S C
Sbjct: 35 FRKWLKDRGFEDSHLRPAEFWDTG-RGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSSC- 92
Query: 136 EAGEVLKQCSVPDWPL--LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
GE + + P PL L T+LI+E + S W Y+ LP+ YS ++
Sbjct: 93 -LGEYIMKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPKT-YSCPVCLEHDVVSL 150
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRL 250
L +R++A E+ T V Y + FS LF E +FN +W++ + +R
Sbjct: 151 L-PEPLRKKAQEQRTKVHELYISSK-AFFSSLQPLFAENTETIFNYSALEWAWCTINTRT 208
Query: 251 VRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ + + AL P+ D+LNHS V+ +++ ++ T+ + E+V
Sbjct: 209 IYMKHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEV 268
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSV 332
FI YG N LLL YGFV + NP SV
Sbjct: 269 FICYGPHDNQRLLLEYGFVAMD--NPHSSV 296
>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
Length = 559
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 75/319 (23%), Positives = 143/319 (44%), Gaps = 36/319 (11%)
Query: 71 DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD- 129
+S E + L +W +G+ + I V+ RG +A +++ G+ +L +P +++I+ +
Sbjct: 154 NSCEKGNCLLEWGESNGVR-TSLKIAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISEEL 212
Query: 130 --SKWSCPEAGEV---LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
P +V L + W + +++ S + Y LP + L
Sbjct: 213 VQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVD-------SEFRVYFDTLPEAFNTGLS 265
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
+ + L + + + ++ ++ YN+L + + +PD+FPEE ++ E F W+
Sbjct: 266 FGVGAMTT-LVGTLLFDELMQAKEHLRKQYNELFPALCNNHPDIFPEEFYSWEEFLWACE 324
Query: 245 ILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGE 300
+ +S +++ DG R LVP A LNHS + + D + + F R + GE
Sbjct: 325 LWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGKVDSDTDSLKFRLSRPCRAGE 384
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK--------------CYK 346
+ ++SYG S L+ YGF+P EG N +D + L + D +
Sbjct: 385 ECYLSYGNYSGSHLVTFYGFLP-EGDNVNDVIPLDIDFGDDDNNNITSDWSTHMVRGTWL 443
Query: 347 EKLEALRKYGLSAS--ECF 363
K++++ YGL + ECF
Sbjct: 444 SKIQSIFHYGLPSPFLECF 462
>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
boliviensis]
Length = 440
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 132/296 (44%), Gaps = 24/296 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKDRKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYG 356
YG N L L YGFV NP V L L +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFV--SAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
Length = 527
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 110/243 (45%), Gaps = 17/243 (6%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
++A I KG L VP ++ + C G +L++ W + +L+ E S K
Sbjct: 65 MIASGAIDKGSVLAEVPLQAFLSEKTAERCLLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124
Query: 165 SSRWSNYISALPR-QPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
S W YISALP + S+ L W + L+ S + + R+ + L
Sbjct: 125 ESFWHPYISALPSVEELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADMLNH--SC 273
K+ L E + +W+ +L SR L +D + LVPWADMLNH S
Sbjct: 185 GIEKF--LPGGETLSEGDVRWASAVLLSRAFSL-ELDVDDDFDTLCLVPWADMLNHCSSA 241
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG-ELLLSYGFVPREGTNPSDSV 332
E+ L +D+ ++ + Y G++VF SYG G +L L YGFV E N + +V
Sbjct: 242 GEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE--NENYAV 299
Query: 333 ELP 335
+LP
Sbjct: 300 DLP 302
>gi|291410015|ref|XP_002721306.1| PREDICTED: SET domain containing 4 [Oryctolagus cuniculus]
Length = 440
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 137/316 (43%), Gaps = 41/316 (12%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
LQKWL D + +A + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LQKWLKDRKFEDKNLAPARFPGTGRGLMSTVSLQEGQMIISLPESCLLTTDTVIES-YLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-- 194
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 PYITKWKPPPSPLLALCTFLVSEKHAGDRSPWQPYLEILPKA-YTCPVCLDPEVVNLLPK 153
Query: 195 ----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILF 247
+A + R R E + G FS LF E +F+ W++ +
Sbjct: 154 PLQMKAEEQRARLWEFFASSRG--------FFSSLQPLFVEPIDSIFSYSALLWAWCTVN 205
Query: 248 SRLVRL--------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
+R V L + AL P+ D+LNHS V+ +++ ++ T +++
Sbjct: 206 TRAVYLRRRPRECLSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASRFRKH 265
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRK 354
E+VFI YG N LLL YGFV NP V L L +DK +K+ L+
Sbjct: 266 EEVFICYGPHDNQRLLLEYGFV--SVRNPHACVYVSGEILVKYLPPTDKQLNKKVAILKD 323
Query: 355 YGLSASECFPIQITGW 370
+G + F GW
Sbjct: 324 HGFIENLTF-----GW 334
>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 753
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/257 (31%), Positives = 124/257 (48%), Gaps = 24/257 (9%)
Query: 96 QKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK---QCSVPDW 149
Q++D G RG+ A +I + +P S +IT + + P ++L + P
Sbjct: 115 QQLDDGSKEMRGVHAKTSIPPNTICVSIPKSCLITVEMGQATPIGRKILTSDLELDAPKH 174
Query: 150 PLLATYLISEASFE-KSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI 207
L Y++ + ++S ++ Y LP ++ ++WTR ELD LE S + + +R
Sbjct: 175 IFLMIYILWDRKVNGETSFFAPYYKILPETLRNMPIFWTREELD-ALEGSYLLLQIADRA 233
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWA 266
+ Y + I S P+ ++ +E F+W+ I+ SR L R ALVP A
Sbjct: 234 EAIKEDY----ISICSIAPEF--GDIATLEEFQWARMIVCSRNFGLLINGHRTSALVPHA 287
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-----V 321
DMLNH ET + + SQ TT ++ GEQVF SYG+K N LL+YGF V
Sbjct: 288 DMLNHLRPRETKWTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERNV 347
Query: 322 PREGTNPSDSVELPLSL 338
+G P+ E+PL L
Sbjct: 348 EVDGFCPN---EVPLEL 361
>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
Length = 540
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/384 (24%), Positives = 166/384 (43%), Gaps = 37/384 (9%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER-----GLVALKNIRKGEKLLFVPPSL---VIT 127
S +WL +SG K KV +G GLV+ +I++GE+ L +P L ++T
Sbjct: 70 VSNFMEWLKNSGFDETK---SKVKIGRNLAEGSGLVSTCDIKEGEEFLEIPEKLFIDIMT 126
Query: 128 ADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
A + +L+ + P +LA YLI E++ SS + Y+ LP+ ++ YW
Sbjct: 127 ALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSS-IAPYLKVLPKTYSTIGYW 185
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
+ + LE S + + A+ + Y ++F P + F E F W+
Sbjct: 186 GIEDF-KQLEGSPVFQTAVNYTRGSMRQYCYF-YQLFDNNPGILQTSNFTYEAFIWAVAT 243
Query: 246 LFSRLVRLPSMDGR-VALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ SR + P G+ +AL+P+ D NHS ++ TF+D K + + + Y+ GEQ
Sbjct: 244 VQSR--QNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKHV--LTCSAAKSYKKGEQ 299
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL-RKYGLSAS 360
V++ YG + N + L GF + N S ++ L + +K+ L + GL
Sbjct: 300 VYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVG 359
Query: 361 ECFPIQIT----GWPLELMA-YAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK------ 409
+ + P E++ Y +SP K + D+K
Sbjct: 360 QTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNI 419
Query: 410 -CPEIDEQALQFILDSCESSISKY 432
E +++A + +LDS ++ +S Y
Sbjct: 420 ISEENEKKAFKLLLDSLKARLSGY 443
>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Entamoeba nuttalli P19]
Length = 791
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 148/316 (46%), Gaps = 33/316 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
++KW+ +G + ++ D RGL A K +K E ++ +P S+ I +
Sbjct: 4 IKKWVIQNGGVIDGVDVKTFDGYGRGLCANKEFKKDEIIMSIPYSIQINRIN------LN 57
Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+ + +P + L+ YL + K W YI+ LP L +T
Sbjct: 58 HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTI 116
Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
EL+ ++ +++ A+E+I V+ YN+ ++ F +Y F +++F + +W+
Sbjct: 117 DELNL-MKGTKLY-AAVEKINAFLMKVVDYYNNKLIQQFPQYFQPF-DDLF--KRLQWAH 171
Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
+SR LV P G V +L+P+ D NH + + + ++ F T+ +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPG 231
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
EQ+F +Y +SN +LLL YGFV E NP D++ L + + D Y E E L++ + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289
Query: 360 SECFPIQITGWPLELM 375
+ F PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305
>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
Length = 440
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 134/308 (43%), Gaps = 25/308 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + + RGL++ +++ G+ ++ +P S +++ D+ G
Sbjct: 36 LKKWLKDRKFEDTNLIPARFPGTGRGLMSKTSLQVGQMIISLPESCLLSTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+ E S W Y+ LP+ + W ++ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLPKTYTCPVCWEPEVVN--LLP 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK---WSFGILFSRLVRL 253
+R +A E+ T V + R FS LF E V N+ T+ W++ + +R V L
Sbjct: 153 RPLRAKAQEQRTRVQEFFTSFR-DFFSSLQPLFSEAVENIFTYSALLWAWCTVNTRAVYL 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R AL P+ D+LNHS +V+ +++ ++ + E+VFI
Sbjct: 212 RHRQLRCFSAEPDTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASEC 362
YG N LLL YGFV + V + +K +DK +K+ L+ + +
Sbjct: 272 YGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQMNKKISILKDHDFIENLT 331
Query: 363 FPIQITGW 370
F GW
Sbjct: 332 F-----GW 334
>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 791
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 149/316 (47%), Gaps = 33/316 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
++KW+ +G + ++ + RGL A K +K E ++ +P S+ I +
Sbjct: 4 IKKWVIQNGGVIDGVDVKTFEGYGRGLCANKEFKKDEVIMSIPYSIQINRIN------LN 57
Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+ + +P + L+ YL + K W YI+ LP L +T
Sbjct: 58 HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTI 116
Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
EL+ ++ +++ A+E+I V+ YN+ ++ F +Y F +++F + +W+
Sbjct: 117 DELNL-MKGTKLY-AAVEKINAFLMKVVDYYNNKLIQQFPQYFQSF-DDLF--KRLQWAH 171
Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
+SR LV P G V +L+P+ D NH + + + ++ F T+ + +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPG 231
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
EQ+F +Y +SN +LLL YGFV E NP D++ L + + D Y E E L++ + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289
Query: 360 SECFPIQITGWPLELM 375
+ F PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305
>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
Length = 403
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 133/287 (46%), Gaps = 15/287 (5%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL---VITADSKWSCPEAG 138
W +G + + I GL A ++++ + + VP L V+TA P
Sbjct: 1 WFKANGGTAEHVEIHDFGDQGLGLRATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLIS 60
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
S+P +LA +++ E E S+ W+ Y++ LPR + LY++ ++ L+ S
Sbjct: 61 REHGLRSMPH-VVLALHVLCERLHEDST-WAPYLNILPRSYSTCLYFSPDDM-MALQGSP 117
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL--VRL 253
A+++ ++ Y R+ P+ L + F + F+W+ + +R V++
Sbjct: 118 SMGEALKQFRGIVKQYVYF-FRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKV 176
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
S + AL+P DM NH C +D S++ V + + G+QVFI YG+++N +
Sbjct: 177 SSNETVKALIPMWDMCNH-CNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNAD 235
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
L GFV E D V + L + K+D+ Y K + L GL AS
Sbjct: 236 RLFHNGFVYTEAE--EDWVNIQLGVSKNDRLYAMKAQILAMVGLDAS 280
>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
Length = 440
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 134/311 (43%), Gaps = 31/311 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++++G+ ++ +P + ++T D+ G
Sbjct: 36 LKKWLKDRKFEDTNLMPAHFPGTGRGLMSKTSLQEGQMIISLPENCLLTTDTVIE-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRY 193
+ + P PLLA T+L+SE S W Y+ LP+ P L L +
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDQSPWKPYLEILPKAYTCPVCLEPEVVNLLPKP 154
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRL 250
L+A +RA + + D FS LF E V F+ W++ + +R
Sbjct: 155 LKAKAEEQRA--HVQEFFASSRDF----FSSLQPLFAEAVDSIFSYSALLWAWCTVNTRA 208
Query: 251 VRL--------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
V L + AL P+ D+LNHS V+ +++ ++ T ++ E+V
Sbjct: 209 VYLRHRRRECLSAEPDTCALAPYLDLLNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEV 268
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSA 359
FI YG N LLL YGFV + + V + +K +DK +K+ L+ +G
Sbjct: 269 FICYGHHDNQRLLLEYGFVSIQNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFIE 328
Query: 360 SECFPIQITGW 370
+ F GW
Sbjct: 329 NLTF-----GW 334
>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
Length = 459
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 72/280 (25%), Positives = 135/280 (48%), Gaps = 20/280 (7%)
Query: 79 LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
KWL ++ + K+ I+ ++ R +VA ++I+K EKL+ VP ++++ +S
Sbjct: 38 FNKWLINNKVYKNPKIEIKVLEKYGRSIVAKQSIKKNEKLISVPKLIIMSNMGGFSHHLP 97
Query: 138 GEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
E+ + + L A +L+ S W Y+S LP++ + +Y++ ELD L
Sbjct: 98 NEIYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKEFTTSIYFSEEELDE-L 155
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--------EEVFNMETFKWSFGIL 246
++S+++E I R + YN R+ ++ F ++ + +E F W+ +
Sbjct: 156 QSSKLKEFTIIRKDGIERHYNSTFTRLSNRGIAEFSPTSTQTLQQKGYTLELFTWALSCV 215
Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
+SR L DG +VP ADM N ++ + + + + + GEQ+F Y
Sbjct: 216 WSRAFSLSDSDG--GMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPY 273
Query: 307 GKK---SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
G S+ ++L+ YGFV GT PSD+V + + + D+
Sbjct: 274 GVYKPLSSSQMLMDYGFVFDHGT-PSDNVAISVPIFHPDE 312
>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase, chloroplastic-like [Vitis
vinifera]
Length = 480
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 126/253 (49%), Gaps = 16/253 (6%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
++I K G R L A K+I+ G+ +L VP ++ I+ D+ P L V + L
Sbjct: 61 LSIGKSTYG-RSLFASKSIQTGDCILKVPYNVQISPDN---VPSKINSLLGDEVGNIAKL 116
Query: 153 ATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
A + E + S W+ YI+ LP+ + +S ++W+ EL + ++ S + + I + +
Sbjct: 117 AIVISVEWKMGQDSEWAPYINRLPQPGEMHSTIFWSEGEL-KMIQQSSVYQETINQKAQI 175
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
+ ++ + +LF + +++ F + ++ SR S G ++L+P+AD +N
Sbjct: 176 QKDFLAIKPVLHHFSENLFKD--ISLKEFMHACALVGSR--AWGSTKG-LSLIPFADFVN 230
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPS 329
H ++ L D+ Q DR Y PGEQV I YGK N LLL +GF +P N
Sbjct: 231 HDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLP---YNIY 287
Query: 330 DSVELPLSLKKSD 342
D V++ +++ D
Sbjct: 288 DQVQIQVNIPHHD 300
>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 353
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 27/218 (12%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY----LEASQIRERA- 203
+LA LI E + SRW YIS LP+ + +S ++W EL + ++++A
Sbjct: 1 MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ 60
Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
IE+ + + I ++ PDL E F +++ ++ SR R++L+
Sbjct: 61 IEKDFSFVAQAFKQHCPIVTERPDL--------EDFMYAYALVGSRAW---ENSKRISLI 109
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTT-----DRQYQPGEQVFISYGKKSNGELLLSY 318
P+AD +NH + + D+ +Q F+T DR Y PG++VFI YG+ SN L+L +
Sbjct: 110 PFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDF 169
Query: 319 GFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
GF P N D V++ + + D KL L+ +
Sbjct: 170 GFTFP---YNIHDEVQIQMDVPNDDPLRNMKLGLLQTH 204
>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
gorilla]
Length = 440
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 132/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE + S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGRRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
Length = 390
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 124/298 (41%), Gaps = 22/298 (7%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E L WL G K+A+ + +GL A GE LL +P + +++ +S
Sbjct: 25 EEYDELVDWLKQCGATVDKVAVDHFNGMGQGLKATAEAAPGETLLRIPEACMLSEESARR 84
Query: 134 CPEAGEVLKQCSVPDWP-LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
+ + P + + + + S W YI+ LP LYW +L
Sbjct: 85 STLGAYMDSDTMLKLMPNVTLAFHLLLELHDLDSFWRPYIACLPVSYSVPLYWDLPDL-M 143
Query: 193 YLEASQIRERAIERITNVIGTY----NDLRLR------IFSKYPDLFPEEVFNMETFKWS 242
L S + AI +V Y N L +R F L PE F E ++W+
Sbjct: 144 SLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCFPLTLGLSPE-AFTFEDWRWA 202
Query: 243 FGILFSRLVRLPSM--DGR----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
+ +R +P DG+ +AL+P DM+NH+ + +D + + F
Sbjct: 203 VATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS-TQFDSERECLEFVCPAPA 261
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+PG Q+ + YG ++NG+ LL GF N D V +P SL ++D YK K LR
Sbjct: 262 KPGSQITMWYGDRNNGQFLLHQGFFFAGHAN--DYVNVPFSLDETDSLYKIKALLLRN 317
>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
Length = 593
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 135/296 (45%), Gaps = 24/296 (8%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
E L +W G+ + I + GL A ++I+ GE++L VP L+
Sbjct: 168 EQTRLSKIEAFNEWARAGGVKTDCVEIATFPGYQLGLRATRDIKAGEQVLSVPRKLIF-- 225
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
S+ PE L + + P + LI E S W +I LP + ++LY+T
Sbjct: 226 -SEELLPEKQRQLFR-NFPTHLKVTYTLIMEKLRGADSPWQPFIDTLPSRYNTVLYFTVE 283
Query: 189 ELDRYLEASQ----IRE-RAIERITNVI--GTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
++ R S +R R I R+ + + L + +LF + E ++W
Sbjct: 284 QMQRLRGTSACSAAVRHCRVIARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRW 343
Query: 242 SFGILFSR--LV---RLPSMDGRV---ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTT 292
+ + +R LV +PS + AL+P+ DM NH S ++ +F YD+++ + T
Sbjct: 344 AVSTVTTRQNLVPRQEIPSDAANLPISALIPYWDMANHRSGKITSF--YDQAAGQMECTA 401
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
Y+ GEQ FI YG +SN + L+ GFV + NP D V++ L L +D +++
Sbjct: 402 QEAYKSGEQYFIYYGDRSNADRLVHNGFVDMQ--NPKDYVQIRLGLSPTDALAEQR 455
>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
Length = 536
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 109/243 (44%), Gaps = 17/243 (6%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
++A I KG L VP ++ + C G +L++ W + +L+ E S K
Sbjct: 65 MIASGAIDKGSVLAEVPLQAFLSEKTAERCRLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124
Query: 165 SSRWSNYISALPR-QPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
S W YI+ALP S+ L W + L+ S + + R+ + L
Sbjct: 125 ESFWHPYIAALPSVDELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADMLNH--SC 273
K+ L E + +W+ +L SR L +D + LVPWADMLNH S
Sbjct: 185 GIEKF--LPGGETLSEGDVRWASAVLLSRAFSL-ELDVDDDFDTLCLVPWADMLNHCSSA 241
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG-ELLLSYGFVPREGTNPSDSV 332
E+ L +D+ ++ + Y G++VF SYG G +L L YGFV E N + +V
Sbjct: 242 GEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE--NENYAV 299
Query: 333 ELP 335
+LP
Sbjct: 300 DLP 302
>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
Length = 421
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 121/284 (42%), Gaps = 43/284 (15%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L WL G ++ K +RGL A+++I+ GE +L V ++TAD P
Sbjct: 40 LVSWLKIRG-EHDACSLLKTGPDKRGLFAVRDIKAGECILRVSRDTMMTADR---LPLEF 95
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
+ L V +W LA L+ E ++S W+ YIS LPR +S +W + EL ++
Sbjct: 96 QQLLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWGTIHSTAFWRKEELT-MIQE 154
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
S + + R + +N+++ F+ + + WS + +
Sbjct: 155 SSLSYETMSRRAAIREEFNEMQSVPFADFMN-----------HDWSSNAMLTY------- 196
Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT----DRQYQPGEQVFISYGKKSNG 312
D N S EVE Y +F D+ Y GEQV IS+G N
Sbjct: 197 ----------DTDNGSTEVEEVKVYSDCLYIALFCAQLFADKNYAAGEQVTISFGPLCNA 246
Query: 313 ELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
L L +GF VP NP D V+L L + + D KEKL+ L +
Sbjct: 247 SLALDFGFTVP---YNPWDKVQLWLGISRRDSLRKEKLQYLHSH 287
>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 104/234 (44%), Gaps = 27/234 (11%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
L + K +R+G FVP +LV+ + P A + PD L+A AS +K
Sbjct: 83 LASNKALREGSS--FVPSALVLGVHMLVNFPHAED-------PDGLLMAM-----ASVDK 128
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
Y+SALPR LYW DR + Q E A + + Y+ + +F
Sbjct: 129 PPLDELYVSALPRYVDLPLYWD----DRKFKELQGCEEARRAVQHGARFYSQVYQHLFGT 184
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS 284
+ + E F W+ IL SR + AL+P+ D NH+ + +
Sbjct: 185 N-----NQFVSAEAFFWAISILMSRATS--GQNQPFALIPFFDWFNHADNGDECVQEFDP 237
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+G T + Y+PGEQ++I+YG SN LL +YGF NP D V LP+ +
Sbjct: 238 QKGFTVHTTKAYEPGEQLYINYGSHSNLRLLRNYGFT--TPNNPYDVVTLPMPI 289
>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
Length = 416
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 12 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 70
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 71 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 129
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 130 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 187
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 188 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 247
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 248 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 301
>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
Length = 440
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 139/311 (44%), Gaps = 31/311 (9%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LKKWLKDRNFEDTNLIPARFPGTGRGLMSKTSLQEGQLVIALPESCLLTTDTVLRS-YLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 PYIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVLPKT-YTCPVCLEPEVVNLLPG 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ T V ++ R S P LFPE V F+ W++ + +R V +
Sbjct: 154 P-LKSKAREQRTRVWEFFSSSRDFFSSLQP-LFPEAVESIFSYSALLWAWCTVNTRAVYM 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ S+ T + E+VFI
Sbjct: 212 KQRPRQCFSTEPDTCALAPYLDLLNHSPAVQVKAAFNEESRCYEIRTGTSCRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFV-PREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSA 359
YG + LLL YGFV PR NP V +P L +DK +K+ L+ +
Sbjct: 272 YGPHGSHRLLLEYGFVSPR---NPHACVYVPKDILVKYLPSTDKQMNKKISILKDHDFIE 328
Query: 360 SECFPIQITGW 370
+ F GW
Sbjct: 329 NLTF-----GW 334
>gi|308812738|ref|XP_003083676.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
gi|116055557|emb|CAL58225.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
Length = 483
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 123/256 (48%), Gaps = 30/256 (11%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE----VLKQCSVPDWPLLA-TY 155
G+RG RK L+ P + V++A + P+ GE +++ ++PD + A +
Sbjct: 81 GKRGW-----FRKNRVLMTSPDAAVVSARTATMDPKLGEKYADLMRDGTLPDERVAAMVF 135
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ E ++S W YI ALPR + L + EL+R L+ + + + A+ + V ++
Sbjct: 136 LMVERRKGEASAWGGYIDALPRSYDAPLSLSDVELERELKGTNVYDAAVAQRAKVREMFD 195
Query: 216 -DLR--LRIFSKYPDLFPEEVF-------NMETFKWSFGILFSRLVRLPSMD-GRV--AL 262
++R +R S+ + ++ FKW+F ++R + +P D G V +
Sbjct: 196 ENVRPAMRGLSEVAAASGDAKLATSLNNATIDEFKWAFQTFWTRALAIPVNDTGEVVEGI 255
Query: 263 VPWADMLNHS-----CEVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYGKKSNGELL 315
VP DM+NHS E D + GV+ + ++ G+++FI YG+ S+ L
Sbjct: 256 VPGIDMVNHSRTKANARWEHVDDNTRPDGGVIALVSNGKKLGHGDEIFIDYGESSSEALF 315
Query: 316 LSYGFVPREGTNPSDS 331
++GFVP + SD
Sbjct: 316 FTHGFVPEDDDTVSDG 331
>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 789
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 132/282 (46%), Gaps = 18/282 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KWLSD+ K+ + + RG+ A + I+KGE +LF+P +IT + P
Sbjct: 351 LLKWLSDTSSEFNKIKMVYYN-NYRGVHARQKIKKGECILFIPVDNMITLELSKELP-IC 408
Query: 139 EVLKQCSV----PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRY 193
++++ ++ P L+ Y+I E KS W ++ LP + + + +T EL +
Sbjct: 409 QLIESKNIRLLSPKHTFLSIYIIIEKKNHKSF-WKPFLDILPVEYTTFPILYTDEEL-FW 466
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
L+ S + ER + Y I SK P+ ++ ++ F W+ + SR+ L
Sbjct: 467 LKGSPFLNQVKERRECITQDYQ----AIVSKIPEF--AKLCTLDEFAWARMMAASRIYGL 520
Query: 254 PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
R A VP ADM NH T + + G + + G+Q++ S G+K N
Sbjct: 521 FINKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDIRRGDQIYYSCGRKCNS 580
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
LL+YGFV + N ++ ++L + K D+ KL+ + K
Sbjct: 581 RFLLNYGFVVK--NNEANEIQLRVDFDKKDETLPIKLQMIGK 620
>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
Length = 440
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
Length = 440
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
troglodytes]
gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
Length = 440
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
Length = 278
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 129/278 (46%), Gaps = 34/278 (12%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
+ + +W +G + I+ + G +GL A + R+ E ++ +P L+ITA
Sbjct: 2 DCTGFMEWAVGNGAYHSGIDIRDCSNEGGKGLFATTDFRENETIISIPVGLIITAGFIAE 61
Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
P+ +V K+ + + L + + E E++S+W+ Y+ LP+ + T A L
Sbjct: 62 MPDYCDVFKRYCLKPFEALVYFFLVEK--EQNSKWTPYLEVLPKS-----FSTPASLHPS 114
Query: 194 LEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
L+ ++ +++ N+L++ ++ K+ + + + F W++ I+ +
Sbjct: 115 LKPEDFPYCLRKQWYVQK--------NELKI-MYEKFVTILADNTI-WDHFLWAWHIVNT 164
Query: 249 RLV----RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
R + +L + D +A+VP DMLNHS + + +D R + G
Sbjct: 165 RCIYRNNKLHPLIDNTEDDSLAIVPLIDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKG 224
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
EQ+FI YG +NG L + YGF ++ N D VE+ L
Sbjct: 225 EQIFICYGSHTNGSLWIEYGFYLKD--NICDKVEISLG 260
>gi|198417784|ref|XP_002130734.1| PREDICTED: similar to SET domain-containing protein 4 [Ciona
intestinalis]
Length = 473
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/311 (25%), Positives = 134/311 (43%), Gaps = 42/311 (13%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--------SVPDWP 150
D G RG++A I +G+ +L +P + ++ +S ++ + + + +
Sbjct: 55 DTG-RGMMAKTRICEGDVILSIPQAAMVGVNSAFNLSKFAQSISSVYHSMHDGLKLSGIQ 113
Query: 151 LLATYLISE----ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-RERAIE 205
+L +LI E + SS W Y+ LP+ LYW E+ + QI + I+
Sbjct: 114 ILCIFLIEEKRKLGKNKPSSTWGYYVKVLPQTFTHPLYWEMEEIHTLPKQLQICVNKTID 173
Query: 206 RITNVIGTYNDL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-- 261
+ N++ +L++ S DL E + ++W++ + +R V D +
Sbjct: 174 CVKQQFKELNEMIKKLKLGS---DLNYHEEISWIEYRWAWCCVNTRCVYSTHDDPTIMKC 230
Query: 262 -----------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
LVP+ D+LNHS EV T +++ +++ T +++ QVFISYG S
Sbjct: 231 CYQSSAADKYFLVPYLDLLNHSNEVNTKAEFNNTNKCFELRTHCKFKRFAQVFISYGALS 290
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK---------LEALRKYGLSASE 361
N LL+ YGFV + N D V L + S Y K L L+KY L
Sbjct: 291 NSTLLVEYGFVCKT-PNKHDVVALDVGHVLSFIKYSVKTAICPSNSLLNQLKKYDLDKGL 349
Query: 362 CFPIQITGWPL 372
F I W L
Sbjct: 350 AFTIHGPSWTL 360
>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
Length = 440
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
Length = 490
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 120/244 (49%), Gaps = 9/244 (3%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPLLATYLISEAS 161
RG+VA ++I GE L +P SL+I+ D E LK S+ +L + + E
Sbjct: 180 RGMVASESIGVGEIALEIPESLIIS-DELLCQSEVFLALKDFNSITSETMLLLWSMRE-R 237
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
+ +S++ Y LP + L + L LE + + + ++ ++ Y++L +
Sbjct: 238 YNLASKFKPYFDTLPANFNTGLSFGIDGL-AALEGTLLFDEIMQAKQHLRQQYDELFPLL 296
Query: 222 FSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNHSC--EVET 277
+ +P++F ++V + F W+ + +S +V L S LVP A +LNHS +
Sbjct: 297 CTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLSTCLVPVAGLLNHSVSPHILN 356
Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
+ D++++ + F R GEQ F+SYGK L+ YGF+PR G NP D + L
Sbjct: 357 YGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPR-GDNPYDVIPLGCD 415
Query: 338 LKKS 341
+ +S
Sbjct: 416 IDES 419
>gi|145537195|ref|XP_001454314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422069|emb|CAK86917.1| unnamed protein product [Paramecium tetraurelia]
Length = 481
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 136/287 (47%), Gaps = 31/287 (10%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
L +WL D K+ I+ G R L A + IR+GE +LF+P + ++ + K SC
Sbjct: 42 NLIQWLKDGKAEVSKVQIEVKSEGYRTLRASQFIRQGEWVLFIPRTHYLSLEEVKKSCLI 101
Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
++++ +P+ + TY ++ + + ++S W YI LP+ + AE D
Sbjct: 102 NRKMIQLNYIPN--NIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTYFDAEQDAL 159
Query: 194 LEAS-----QIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGI 245
L+ S + +R I R Y++L+ ++ F +Y + + + F + T SF
Sbjct: 160 LKGSPTLFTVMNQRKIFR-----EEYDNLKEAVKEFQRYGYTYNDFIKFRILTISRSFP- 213
Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS--SQGVVFTTDRQYQPGEQVF 303
V + + + LVP AD +NH + FL Y S + G R Q GE++F
Sbjct: 214 -----VYIGENEQQQLLVPLADFVNH--DNNGFLQYGYSPDADGFFMQAVRNIQKGEELF 266
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
+YG+ SN ++YGF TNP + + + L ++D+ + K++
Sbjct: 267 YNYGQWSNKYFFMNYGFASL--TNPMNQFDFDVCLDRNDRLFNLKVD 311
>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 791
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 150/316 (47%), Gaps = 33/316 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
++KW+ +G + ++ + RGL A K ++ E ++ +P S+ I +
Sbjct: 4 IKKWVIQNGGIIDGVDVKTFEGYGRGLCANKEFKQDEIIMSIPYSIQINRIN------LN 57
Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+ + +P + L+ YL + K W YI+ LP+ L +T
Sbjct: 58 HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWP-YINVLPKTYDCPLSYTI 116
Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
EL+ ++ +++ A+E+I V+ YN+ ++ F +Y F +++F + +W+
Sbjct: 117 DELN-IMKGTKLY-VAVEKINAFLMKVVDYYNNKLIQQFPQYFQPF-DDLF--KRLQWAH 171
Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
+SR LV P G V +L+P+ D NH + + + ++ F T+ + +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPG 231
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
EQ+F +Y +SN +LLL YGFV E NP D++ L + + D Y E E L++ + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289
Query: 360 SECFPIQITGWPLELM 375
+ F PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305
>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
Length = 440
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/308 (25%), Positives = 137/308 (44%), Gaps = 25/308 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LKKWLKERKFEDMNLTPARFPGTGRGLMSKISLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L++E S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWKPYLEVLPKA-YTCPVCLEPEVVDLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ T + + R FS LF E V F+ F W++ + +R V +
Sbjct: 153 KPLKAKAREQRTRLQAFFTSSR-DFFSSLRPLFSEAVESIFSYSAFLWAWCTVNTRAVYM 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R AL P+ D+LNHS +V+ +++ ++ T + E+VFI
Sbjct: 212 KPRRRRCFSAEPDTYALAPYLDLLNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASEC 362
YG N LLL YGFV + V + +K +DK K+K+ L+ + +
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSKDILVKYLPSTDKQMKKKISILKDHDFIENLT 331
Query: 363 FPIQITGW 370
F GW
Sbjct: 332 F-----GW 334
>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
Length = 440
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 132/295 (44%), Gaps = 22/295 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPQ 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 ---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+D AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRHRECLSAELDT-CALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFI 270
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K +DK +K+ L+ +G
Sbjct: 271 CYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325
>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
Length = 350
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 79/155 (50%), Gaps = 6/155 (3%)
Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQG 287
+E F E ++W+ + +R ++P+ DG +AL+P DM NH+ + T Y+
Sbjct: 11 KESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDR 69
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
+ ++ GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y
Sbjct: 70 CECVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFF--DNNSHDRVKIKLGVSKSDRLYAM 127
Query: 348 KLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
K E L + G+ S F + +T P+ A+L V
Sbjct: 128 KAEVLARAGIPTSSVFALHVTELPISAQLLAFLRV 162
>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
Length = 529
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 95/365 (26%), Positives = 158/365 (43%), Gaps = 55/365 (15%)
Query: 69 EIDSLENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKN-IRKGEKLLFVPPSLVI 126
++D+ A L WL+ + GLP A + V G+ G+ L N +R GE L+ +P +L +
Sbjct: 43 DVDT-RTARELVAWLTVEKGLP--GGAAKAVSFGDGGVAKLVNDVRAGEPLIEIPQNLAV 99
Query: 127 T----ADSKWSCPEA---GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP 179
T ADS A GE++ LA +L E S W+ Y++ LP
Sbjct: 100 TSVDVADSPIVAGLAAGRGELVG---------LALWLCLERHKGPLSEWAPYVATLPSAG 150
Query: 180 YSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNM 236
L WT EL L+ S +RE+A+ R+ + Y + +I S D P+ E
Sbjct: 151 SDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASIADQIRSNPNDFPPDAYEFLTR 210
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADML-----------------------NHSC 273
+ F + + +R V L + + A+VP D+L
Sbjct: 211 DAFVDALATVLARAVWLNAANC-YAMVPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLA 269
Query: 274 EVETFLDYDKSSQGV-VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
+DYD +++ V V + + Q V + ++ G+L L+ G V + ++ D +
Sbjct: 270 AAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLARNAGDLFLATGAV--DESHCGDYL 327
Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWPLELMAYA-YLVVSPPS--MK 388
S ++D+ Y+ K + L G+SA + FP+ P++L+AY + V P M
Sbjct: 328 AFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADRMPMQLLAYMRFARVQDPGELMS 387
Query: 389 GKFEE 393
FEE
Sbjct: 388 VSFEE 392
>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
Length = 604
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 113/222 (50%), Gaps = 21/222 (9%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
+L+ +L+ E + K S W YI +LP + +Y+T +EL+ + ++E+A + +
Sbjct: 238 VLSLFLLLEKNKGKDSFWYPYIRSLPNSFTTPVYFTESELNAL--SPSLQEKARDLKKEL 295
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMDGR----VA 261
+ +NDL + S P+L + F + F+W++ +L +R + R P + +
Sbjct: 296 LHAFNDLEPFVTSCLPEL--DSTFTFDAFRWAWSVLKTRTLYQEDCRSPYLSNKEPQTST 353
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
LVP D++NHS + Y+ ++ Y+ +QVFISYG + N EL+L +GF
Sbjct: 354 LVPMLDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFISYGFEENTELMLKFGFF 413
Query: 322 PREGTNPSDSVELPLSL------KKSDKCYKEKLEALRKYGL 357
E NP D +++ LS + +D+ K K++ L GL
Sbjct: 414 VPE--NPKDFMKINLSEMLESLPQINDEERKNKVDLLFDSGL 453
>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Brachypodium distachyon]
Length = 316
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 59/187 (31%), Positives = 96/187 (51%), Gaps = 10/187 (5%)
Query: 158 SEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
S A+ K S W+ Y+ +LPR Q +++++W EL + S I + AIER + ++
Sbjct: 32 SAAATPKKSGWAPYVRSLPRNDQMHNMMFWDLNEL-HMVRISSICDEAIERRERAMKEFS 90
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
++ + +P LF E +E F + ++ SR + V+L+P+AD LNH
Sbjct: 91 AVKPSL-ECFPHLFGE--IKLEDFMHASALVSSRAWQTSR---GVSLIPFADFLNHDGVS 144
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVEL 334
++ L YD +DR Y GEQV + YGK SN L L++GF +PR + + + E+
Sbjct: 145 DSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYDQNGNREV 204
Query: 335 PLSLKKS 341
S K
Sbjct: 205 KYSGGKG 211
>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
troglodytes]
gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
Length = 307
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 17/256 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFV 321
YG N L L YGFV
Sbjct: 272 YGPHDNQRLFLEYGFV 287
>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
Length = 307
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 17/256 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFV 321
YG N L L YGFV
Sbjct: 272 YGPHDNQRLFLEYGFV 287
>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 490
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 145/353 (41%), Gaps = 52/353 (14%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KWL D+ + + + + RG+ A +++R GE ++ VP V+T D+ E GE
Sbjct: 33 KWLRDNKVELDGVEVARFARTGRGVRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEF 92
Query: 141 L----KQCSVP--DWPLLATYLISEASFEKSSRWSNYISALP---RQPYSLLYWTRAEL- 190
+ + P D LL ++ E KSS W Y+ + R +S+L W ++
Sbjct: 93 VGDGDDEAPSPRLDKELLVIAVMCEMCAGKSSAWCEYLETVHEAVRVGHSVLAWDDEQVT 152
Query: 191 -----DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFG 244
D + +A + + ++ + ++ F +P L V + ++
Sbjct: 153 ALFGTDAWRDAYENDDETLDLPMMTEEHFENVVTLFFKLFPKLASGLSVEALRELHFAAT 212
Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ + D A+VP+ DMLNH+ CE L +D+ + + T R + GE+V
Sbjct: 213 AMVAGYSFTLGDDEIQAMVPFWDMLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEV 272
Query: 303 FISYGKKSNGELLLSYGFV-PRE-----------------GTNPSDSVELPLSLKKSDKC 344
F +YG N ELL YGFV PR G NP ELPL
Sbjct: 273 FNTYGPLRNAELLRRYGFVLPRNPHGGTTVGLAEVIQAAMGANPLVVDELPL-------- 324
Query: 345 YKEKLEALRKYGLSASEC---FPIQITGWPLE--LMAYAYLVVSPPSMKGKFE 392
+L L GL+ E F + TG P + L+A L + P M E
Sbjct: 325 ---RLAWLESRGLADEELSTRFFVHRTGRPSDKLLIAMRLLTLKPEEMTALIE 374
>gi|443699166|gb|ELT98776.1| hypothetical protein CAPTEDRAFT_151537 [Capitella teleta]
Length = 413
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 117/261 (44%), Gaps = 39/261 (14%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWS---CPEAGEVLKQCSVPDW-PLLATYLISEA 160
+VA +I +G+ + +P SL++T + E + L++ S W PLL T +
Sbjct: 1 MVATSDISQGDTIFEIPRSLLLTPQNSTIGVLLNEEADSLQEAS--RWVPLLITLMYEYT 58
Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
S SSRW Y +P Q ++W+ E+ R L+ + I + N+ +NDL
Sbjct: 59 S--PSSRWKPYFDLVPDFDQLDLPMFWSSDEVKRELKGTGIPSLVESDLLNISKEFNDLV 116
Query: 219 LRIFSKYPDLFPEEVFNMETFK--WSFGILFS---------------------RLVRLPS 255
L K+ ++F +E ++ +K +F + +S L+ P
Sbjct: 117 LPFIQKHSNVFSDECKCLKFYKKMVAFVMAYSFTEPPPSPDLDDSDDLSGDEHDLMPQPM 176
Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
M VP AD+LNH + LD+ K S + + Q GE++F +YG+ +N LL
Sbjct: 177 M------VPMADILNHVAKNSARLDFPKGSSSLKMVATQDIQKGEEIFNTYGELANMNLL 230
Query: 316 LSYGFVPREGTNPSDSVELPL 336
YGF G N D E+P+
Sbjct: 231 HMYGFAEDIGCNEYDIAEIPV 251
>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
Length = 440
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 130/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTINTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHG 325
>gi|145349778|ref|XP_001419305.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579536|gb|ABO97598.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 457
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/295 (25%), Positives = 124/295 (42%), Gaps = 41/295 (13%)
Query: 78 TLQKWLSDSGLPPQKM--AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
L +W + G + A++ RGL A + +R GE +L + + I D+K
Sbjct: 19 ALVRWCVERGARGSGLTVALETGAGAGRGLEATRALRAGEGVLELKLASGIVDDAKGHPE 78
Query: 136 EAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRY 193
A + +K+ W + LA L+ E + S ++ Y LP R P S +++ D
Sbjct: 79 SARDAMKEAP---WGVRLACRLLQEKKLGEGSAYAAYARTLPERVPTSPIHY-----DEK 130
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
A A+ I + + K P+ + F+ E F + G++ SR +
Sbjct: 131 AIADVQYPPAMSEIREMQAACRKWHETLREKAPEALGDAYFDYEAFANAVGVVHSRTYGV 190
Query: 254 PSMDGRVA----LVPWADMLNHSCEVETFLDYDKSS----------------------QG 287
S + L+P ADMLNH ++ T L D+++ +G
Sbjct: 191 ASAEDNAGYFRVLLPLADMLNHGGDIVTSLTRDETTGELTDMTTAATDNIAWSTLDAEEG 250
Query: 288 VV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
V+ F R + GE+ +SYG++SN L+ YGF P NP D L +L+ +
Sbjct: 251 VIQFAATRDIEEGEEALMSYGERSNDHFLIYYGFAPD--NNPHDDCVLFSNLEHA 303
>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
Length = 440
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 130/294 (44%), Gaps = 20/294 (6%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL +A RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ + T +++ E+VFI
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
YG N L L YGFV + V + +K DK +K+ L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHG 325
>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
Length = 425
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 140/287 (48%), Gaps = 33/287 (11%)
Query: 70 IDSLENASTLQKWLSDSG--LPPQKMAIQK---VDVGERGLVALKNIRKGEKLLFVPPSL 124
IDS L KW+S +G + + K + RGL+A+ NI L+ +P SL
Sbjct: 24 IDSHSEFVELCKWMSANGWNAVSKNCLVTKPALFNSTGRGLMAMSNIAPNHLLVQIPQSL 83
Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
+IT + + E ++L Q S+ L T+ I + F + +S+YIS LP+ +S+
Sbjct: 84 LITKEKVLA--EISDLL-QFSMTTAECL-TFFILNSKF--NGLYSSYISTLPK-SFSVGG 136
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
+++ L S ++E+ + V+ Y +IF+ + ++ + ++E F+W++
Sbjct: 137 LCKSQEIAAL-PSFLQEKIMCNQNFVLKKYE----KIFAIWRKIYGSTL-SLELFQWAWF 190
Query: 245 ILFSRLV-------------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
+ +R V ++ M+ +AL P+ DM NH EV ++K++Q
Sbjct: 191 CVNTRAVFYQDSKQHSHGLNKVDGMENNMALAPYLDMFNHDAEVVVEAGFNKTTQCYEIR 250
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+DR + +QVFI+YG N +L L YGF+ + N +VE + +
Sbjct: 251 SDRHIKKYQQVFINYGPHDNMKLFLEYGFLATK--NLHKAVEFDIDV 295
>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
Length = 378
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 126/277 (45%), Gaps = 25/277 (9%)
Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSR 167
++++G+ ++ +P S ++T D+ G +K+ P PLLA T+L+SE S
Sbjct: 5 SLQEGQVMISLPESCLLTTDTVIR-SSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSL 63
Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
W +Y+ LP+ Y+ E+ L S ++ +A E+ V + R FS
Sbjct: 64 WKSYLDILPKS-YTCPVCLEPEVVDLL-PSPLKAKAEEQRARVQDLFTSAR-GFFSTLQP 120
Query: 228 LFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCEVE 276
LF E VF+ F W++ + +R V L S AL P+ D+LNHS V+
Sbjct: 121 LFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDLLNHSPHVQ 180
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+++ ++ T + + ++VFI YG N LLL YGFV + V +
Sbjct: 181 VKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHACVPVSADM 240
Query: 337 SLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
+K +DK K+ L+ +G + + F GW
Sbjct: 241 LVKFLPAADKQLHRKITILKDHGFTGNLTF-----GW 272
>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
Length = 536
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 143/313 (45%), Gaps = 20/313 (6%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+L KW G+ ++ I RG++A ++I G+ L +P L+I+ D E
Sbjct: 156 SLLKWGEHLGIK-SRLQIAYFQGAGRGMIASESIGVGDIALEIPEFLIIS-DELLCQSEV 213
Query: 138 GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
LK + + +L + + E + S++ Y LP + L + L LE
Sbjct: 214 FLALKDFNNITSETMLLLWSMRE-RYNLGSKFKPYFDTLPANFNTGLSFGIDAL-AALEG 271
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ + + I+ ++ Y++L + + +P++F ++V + F W+ + +S + +
Sbjct: 272 TLLFDEIIQARQHLRQQYDELFPLLCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLS 331
Query: 257 DGRVA--LVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
G+++ LVP A +LNHS + + D++++ + F R GEQ F+SYGK
Sbjct: 332 SGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGS 391
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--------LRKYGLSASECFP 364
L+ YGF+PR G NP D + L L D+ + A +R LS S FP
Sbjct: 392 HLVTFYGFLPR-GDNPYDVIPLDLDTSVDDEDIAAQSSATTSQTTHMVRGTWLSTSGGFP 450
Query: 365 IQITGWPLELMAY 377
G P L+ +
Sbjct: 451 TY--GLPQPLLTH 461
>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
Length = 543
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E S L +W D+G+ K+ I ++D RG +A ++++ G+ L +P S +I+ + ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKLGDVALEIPVSSIISEEYVYN 206
Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
+P+L T+ + SE EK S++ Y +L +
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254
Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
L + ELD L E Q +E ER Y++L + + S + ++FP E++
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306
Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
E + W+ + +S +++ DG++ L+P A LNHS + + D + + F
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
R GEQ F+SYG S+ LL YGF+P+ G NP D + L +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411
>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 515
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 164/384 (42%), Gaps = 41/384 (10%)
Query: 72 SLENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+ E+A L WLS D G+ + ++ GE + ++ G ++L VP +T+
Sbjct: 45 TAEDARELAAWLSYDKGVDASGLVFKEGARGEVEVALRGDVDAGARVLAVPQDCAVTSVD 104
Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
+ P + K P+ LA +L +E +S W+ Y+ L P + L+WT AE
Sbjct: 105 VDAHPIVSGLAK--GRPELVGLALWLCAERIKGGASDWAPYVKTLAANPDAPLFWTEAED 162
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILF 247
L+ S I A+ER + Y + + + P FP E F E F + +
Sbjct: 163 FALLKGSPIVNDAVERSRSAREEYAAI-VEVIKGDPTAFPAEAYEFFTEERFVDALATVC 221
Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCE--------------VETFLDYDKSSQGVVFTTD 293
++ LP+ ALVP D++ + DYD S VV +
Sbjct: 222 AKATWLPTASC-YALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAV 280
Query: 294 RQYQPGEQVF-ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
+ +V + +++NGEL L+ G V ++ +P D + + ++ SD+ + K + L
Sbjct: 281 VKAPANSRVVQLDPLQRNNGELFLNTGRVDQK--HPGDYLYMRTEIQPSDRLFSAKKQVL 338
Query: 353 RKYGLSA-SECFPIQITGWPLELMAY-AYLVVSPPS--MKGKFEEMAAAASNKMTSKKDI 408
G +A ++ FP+ P +L +Y + V P M FEE +K+ S +
Sbjct: 339 EGMGFTAENQYFPVYEDRMPTQLYSYLRFARVQDPGEMMAVSFEE------DKIVSVMN- 391
Query: 409 KCPEIDEQALQFILDSCESSISKY 432
+ + LQ ++ C +S+Y
Sbjct: 392 -----EYEILQLLMGDCRELMSEY 410
>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 543
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E S L +W D+G+ K+ I ++D RG +A ++++ G+ L +P S +I+ + ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYN 206
Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
+P+L T+ + SE EK S++ Y +L +
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254
Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
L + ELD L E Q +E ER Y++L + + S + ++FP E++
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306
Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
E + W+ + +S +++ DG++ L+P A LNHS + + D + + F
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
R GEQ F+SYG S+ LL YGF+P+ G NP D + L +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411
>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
Length = 489
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 148/312 (47%), Gaps = 28/312 (8%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
WL + G + + I + D GL A K+ +G +L VP ++++ P+A ++
Sbjct: 89 WLHEHGAEFEGVEISEFDGYGFGLKATKDFSEGSLILTVPGKVMMSEKD----PKASDLS 144
Query: 142 KQCSVPDWPLL--------ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
+ ++ PLL A +L+ E + +S W YI LP + ++LY+ EL
Sbjct: 145 EFINID--PLLQNMPNVTLALFLLLEKN-NPNSFWKPYIDVLPEKYSTVLYFNSEELAE- 200
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFS-KYPDLFP-EEVFNMETFKWSFGILFSR-- 249
L S + E +++ +++ Y +I + P L +++F + ++W+ + +R
Sbjct: 201 LRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQDIFTFDNYRWAVSTVMTRQN 260
Query: 250 -LVRLPSMDGRVALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYG 307
+V+ + A +P DM NH T D++ + ++G + + Y+ EQ+FI YG
Sbjct: 261 NIVQGTAFTLTNAFIPLWDMCNHKHGKIT-TDFNLELNRGECYAL-QDYRRDEQIFIFYG 318
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
+ N +L L GFV + N DS+ + L + +D K+ L K GLS F +
Sbjct: 319 ARPNSDLFLHNGFVYPD--NDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYK 376
Query: 368 TGWPL--ELMAY 377
P+ EL+A+
Sbjct: 377 GASPISVELLAF 388
>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
Length = 536
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E S L +W D+G+ K+ I ++D RG +A ++++ G+ L +P S +I+ + ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYN 206
Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
+P+L T+ + SE EK S++ Y +L +
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254
Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
L + ELD L E Q +E ER Y++L + + S + ++FP E++
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306
Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
E + W+ + +S +++ DG++ L+P A LNHS + + D + + F
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
R GEQ F+SYG S+ LL YGF+P+ G NP D + L +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411
>gi|390367697|ref|XP_787519.3| PREDICTED: N-lysine methyltransferase setd6-like
[Strongylocentrotus purpuratus]
Length = 466
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 113/256 (44%), Gaps = 32/256 (12%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-----SVPDWPLLATYLIS 158
G++AL +I KGE L VP S+++ + P + L++ + W L ++
Sbjct: 62 GMIALDDISKGETLFTVPRSVLLHPAT--CSPVVAQRLEEDEDSLETESGWVPLILAVMY 119
Query: 159 EASFEKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
E + +SSRW Y+ P QP ++W + L + I E + N+
Sbjct: 120 EHT-NRSSRWRPYLDLFPDYSELDQP---MFWDSNYMQPELRGTGIAEAVQRDLRNIDRD 175
Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS-RLVRLPSMDGRVA----------- 261
Y+D+ L K DLF EE N++ +K + + + P D
Sbjct: 176 YHDVALPFIKKNADLFSEEKHNLDLYKRTVSFIMAYSFTESPDYDEDDDDSDDDDEETHP 235
Query: 262 --LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
+VP AD LNH + L + K S +V T D + G +VF +YG+ +N +LL YG
Sbjct: 236 PMMVPLADALNHIAKNNAQLKFGKESLRMVATED--IKKGSEVFNTYGEIANWQLLHMYG 293
Query: 320 FVPREGTNPSDSVELP 335
F N D+V++P
Sbjct: 294 FAEEYPENIYDTVDIP 309
>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
Length = 287
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 114/247 (46%), Gaps = 24/247 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK-QCSVPDWPLLATYLISEAS 161
RG++A K ++ E +L +P L+IT D+ A + + + LA +L+ E
Sbjct: 48 RGMMATKALKHEELMLVIPQRLLITMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKY 107
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
+ S W YI LP + ++T E D L + +R +A + Y +L
Sbjct: 108 RREKSFWRPYIDILPEEYSCPTFFT--EDDFRLLPNSLRGKAKAKKYECHKEYKEL-APF 164
Query: 222 FSKYPDLFP--EEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------------LVPW 265
F DLFP E+ FN + FKW++ + +R + +P GR + + P
Sbjct: 165 FKMLADLFPDQEDAFNFKDFKWAWSAIKTRALDVPI--GRESCRHLRDAEDTPTPTMFPL 222
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D +NH+ + + Y++ S+ + T+ Y+ +V SYG+ N LLL +GFV
Sbjct: 223 VDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFV--VP 280
Query: 326 TNPSDSV 332
NP D+V
Sbjct: 281 GNPEDTV 287
>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
Length = 459
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 125/274 (45%), Gaps = 28/274 (10%)
Query: 85 DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC 144
D L P+ ++ + G+VA + + +GE L V S V++ ++ E +LK+
Sbjct: 33 DFQLNPKVHVGREGSCAQYGMVAQEELEEGECLFKVDKSAVLSTETT----EIAHLLKEE 88
Query: 145 SV---------PDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRY 193
+ W L+ E + +SRW Y+ +P Q ++WT E++R
Sbjct: 89 TSLHGDSLHGDSGWVPQILALMYEYT-NPNSRWRPYLQLVPDFSQLDQPMFWTEDEIERD 147
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
L + I E + +T + Y L L K+ +F EEV + E +K + +
Sbjct: 148 LCNTGIPEASSSDLTKMKLEYTSLALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFFE 207
Query: 254 PSMDGRVA---------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
P ++GR +VP AD+LNH + L++D +V T R GE+VF
Sbjct: 208 P-VNGREDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMV--TTRTVAAGEEVFN 264
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
++G+ +N +LL YGF N D+V++P+ +
Sbjct: 265 TFGQLANWQLLHMYGFAEAWPENIYDTVDIPMQV 298
>gi|440802833|gb|ELR23759.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 518
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 56/108 (51%), Gaps = 6/108 (5%)
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---- 274
+ IF YPD+F V + W+F ++SR L D A+VP ADMLNH+ E
Sbjct: 196 ISIFKDYPDMFSPAVHTCDELMWAFATIWSRGYWLDGDDTMPAIVPLADMLNHNTEKGGE 255
Query: 275 -VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
V + YD +Q + Y+PG+QV YG K+NG L YGFV
Sbjct: 256 RVAHYF-YDADAQIFKVISKTSYEPGQQVLTHYGNKANGNFLEDYGFV 302
>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
Length = 509
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 110/257 (42%), Gaps = 32/257 (12%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
R L A + I++G+ ++ VP + +T D P+ L +V D LA LI E
Sbjct: 59 RSLFASEPIQEGDCIMQVPYHVQLTLDK---LPQKFNTLLDHAVGDTSKLAALLIMEQHL 115
Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
S W+ YI +LP Q ++++ W EL ++ S I + AIE + L+
Sbjct: 116 GNESGWAPYIKSLPTKDQMHNMVLWDLNEL-HAVQNSSIYDEAIEHKEQAKKEFLALKPA 174
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ +P LF E V+L AL D LNH + L
Sbjct: 175 L-DHFPHLFGE-------------------VKLGDFMHASAL----DFLNHDGVFGSVLI 210
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
YD+ DR Y GEQV I YGK SN L L++GF N D + + +
Sbjct: 211 YDEQKDVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFT--LARNIYDQALIRIDMPV 268
Query: 341 SDKCYKEKLEALRKYGL 357
D YK+KL+ +K+ L
Sbjct: 269 QDPLYKKKLDIWQKHRL 285
>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
Length = 380
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 6/152 (3%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
F E ++W+ + +R ++P+ DG +AL+P DM NH+ + T Y+
Sbjct: 25 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCEC 83
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
+ ++ GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y K E
Sbjct: 84 VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAE 141
Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
L + G+ S F + T P+ A+L V
Sbjct: 142 VLARAGIPTSSVFALHFTEPPISAQLLAFLRV 173
>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
Length = 460
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 115/254 (45%), Gaps = 22/254 (8%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDW-PLLATYLI 157
E G++A ++I +G +LF P + + E K+C S W PLL + +
Sbjct: 47 AEYGMLAKEDIEEGH-VLFTIPREALLHQGTTKVKKVLEEGKKCLESASGWVPLLLSLMY 105
Query: 158 SEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
S +S W Y+S P R ++W+ E D+ L+ + I E I + + YN
Sbjct: 106 EYTS--STSHWKPYLSLWPDFRTLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEYN 163
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA------------LV 263
+ L +PDL+ E N+E +K + + + P D +V
Sbjct: 164 SVVLPFMKSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKKPNLPMMV 223
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
P ADMLNH + L+Y + + + + R+ GE+VF +YG+ +N +LL YGF
Sbjct: 224 PMADMLNHISKHNANLEY--TPECLKMVSIRRIGKGEEVFNTYGQMANWQLLHMYGFAEP 281
Query: 324 EGTNPSDSVELPLS 337
N +++ ++ ++
Sbjct: 282 FPNNINETADIKMA 295
>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
Length = 510
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 101/217 (46%), Gaps = 25/217 (11%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKV-------DVGER-----------GLVALKNIRKGE 115
E + KWL D+G+ I+ V DV + G++AL++++
Sbjct: 10 EQLDIVVKWLDDNGVKINHKLIEIVCQKQSVDDVTNKNTPHEQVVEGLGVIALQDLKIDH 69
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
+ +P S ++T + LK+ + D + L+ EAS S+W YI +L
Sbjct: 70 TVAIIPKSCLLTPHT----TSISAYLKKYKIKDATATSIALLYEASIGSQSKWYGYIKSL 125
Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND-LRLRIFSKYPDLFPEEVF 234
P + W A+L + L+ + I E V TYN ++ ++ + +PD+F E VF
Sbjct: 126 PLSVDLPILWNDADL-KNLKGTSIETVVYENKETVDATYNKYIKSKLIANHPDVFNEHVF 184
Query: 235 NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
+++ FK + ++ SR + + G ++VP AD+ NH
Sbjct: 185 SLDNFKRASCLVSSRAFNIDTYHGD-SMVPLADIFNH 220
>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
Length = 539
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 32/301 (10%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
E L + +W G+ + I + GL +++ +GE +L VP L+
Sbjct: 112 EQKRLAKVAAFNEWARAGGVQTDCVEITTFPGYQLGLRVTRDLAEGELVLTVPRQLIF-- 169
Query: 129 DSKWSCPEAGEVLKQCSVPDWP--LLATY-LISEASFEKSSRWSNYISALPRQPYSLLYW 185
S+ PEA L D+P L TY LI E +S W +I LP + ++LY+
Sbjct: 170 -SEELLPEAQRKL----FIDFPTHLNVTYMLIIEKVRGAASNWQPFIDTLPTRYNTVLYF 224
Query: 186 TRAELDRYLEASQ----IRE-RAIERITNVI--GTYNDLRLRIFSKYPDLFPEEVFNMET 238
T ++ R S +R R I RI + Y + + +LF E E
Sbjct: 225 TVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYEL 284
Query: 239 FKWSFGILFSRLVRLPSM-----DG-----RVALVPWADMLNHSC-EVETFLDYDKSSQG 287
++W+ + +R +P DG AL+P+ DM NH C ++ ++ Y S+Q
Sbjct: 285 YRWAVSTVTTRQNLVPRQLATDSDGVRNSPMSALIPFWDMANHRCGKITSY--YKPSAQQ 342
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
+ ++ GEQ FI YG + N + L+ +GF+ + N D V + L L +D ++
Sbjct: 343 MECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFL--DMNNLKDYVHIRLGLSPTDALAEQ 400
Query: 348 K 348
+
Sbjct: 401 R 401
>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 1 [Galdieria
sulphuraria]
Length = 487
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 128/298 (42%), Gaps = 32/298 (10%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
WL+ + K+ + + G RG+VA++ I E L VP L + C + V
Sbjct: 80 WLTRENVYMPKIKLDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQVTEHEECTMSEFVD 139
Query: 142 KQC-SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYS-LLYWTRAELDRYLEASQ 198
+ S +W + + + + + S W YI LP + L+YW+ +EL +Q
Sbjct: 140 PELWSQENWYVKLSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSEL------AQ 193
Query: 199 IRERAIERITNVIGTYND-LRLRIFSKYPDLFPEEVF----NMETFKWSFGILFSRLVRL 253
++ R + + Y + L R+F P V+ F W+ ++ SR +
Sbjct: 194 LQYRPLIEEVKINQYYREALYTRVFESLSS--PVRVWLQNEKENVFFWALDMVQSRAFGI 251
Query: 254 PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
P + + AL+P DMLNH +T YD + T + PG ++ISYG N
Sbjct: 252 PDVGNKTYALLPMMDMLNHRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDND 311
Query: 313 ELLLSYGFVPREGTNPSDSVE-------LPLSLKKSD------KCYKEKLEALRKYGL 357
LL YGF+ + NPSD + L L ++ + +EKL LRKY +
Sbjct: 312 HLLHFYGFL--QTNNPSDYFQVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHI 367
>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
Length = 453
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 114/250 (45%), Gaps = 22/250 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEAS 161
RGL+A K ++ E +L +P L+IT D+ A + + S + LA +L+ E
Sbjct: 57 RGLMATKALKHEELILVIPKRLLITIDAIMDSYLAPYIERADSQLTPSQALAVFLMCEKC 116
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
+ S W YI LP + ++T E D L + +R +A + + +L
Sbjct: 117 RREKSFWRPYIDILPEEYTCPAFFT--EEDFRLLPNSLRGKAKAKKYECHKEFMEL-APF 173
Query: 222 FSKYPDLFP--EEVFNMETFKWSFGILFSRLVRLPSMDGRV-------------ALVPWA 266
F DLFP E+ FN + FKW++ + +R +P + G + P
Sbjct: 174 FKMLADLFPDQEDAFNFKDFKWAWSAIKTRAFDVP-LGGETCYRLRDSEDTSNPTMFPLV 232
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
D +NH+ + + Y++ + + T+ Y+ +V SYG+ N LLL +GFV
Sbjct: 233 DSINHAAQAKIRHRYNEKRRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVV--PG 290
Query: 327 NPSDSVELPL 336
NP+D+V L
Sbjct: 291 NPADTVTFHL 300
>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
CCMP526]
Length = 452
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 154/347 (44%), Gaps = 24/347 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEAS 161
GLVA I++GE L VP +L + DS + P G+V+ + ++ D L+A L+ EA
Sbjct: 94 GLVATAPIKQGETLATVPLNLCFSMDSVRASP-LGKVIGEFEPALGDASLIALQLLYEAH 152
Query: 162 FEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
S+++ YI +LPR L+W+ AE L S R I V Y +
Sbjct: 153 MGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE-QGVLAKSSTRNLGETLIDAVAEDYGWI 211
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN--HSC-- 273
+ + + F++ F+W+ ++ SR + + L P DM N C
Sbjct: 212 QSALARGGISGLQADSFDLSDFEWAVAVVLSRSFF---AENGLRLAPLLDMANRGEGCTN 268
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDS 331
E + +G+ DR G+++ ISYG KS E L +GFV P EG N
Sbjct: 269 EPQIGGLGIFGGKGLKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFVPPPLEG-NALVG 327
Query: 332 VELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
L+ K S D+ Y +K + + GL + F ++ G ++ +L K
Sbjct: 328 GMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDG-DVDPEMLQFLRFLKLGGK 386
Query: 389 GKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSR 434
F + A N++ ++ E +EQA+ + I+D+C +++ ++
Sbjct: 387 DAF-LLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAE 432
>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
Length = 350
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 21/219 (9%)
Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ--- 295
F W+FGIL SR P + +A+VP+AD++NH + ++ + V DRQ
Sbjct: 72 FLWAFGILRSRAFP-PFIGDNLAMVPFADLVNHGFSIN--VEEPSWERKVTGLFDRQEAL 128
Query: 296 -------YQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPS--DSVELPLSLKKSDKCY 345
++ GEQV + YG KSNG+L L YGFV R N S D L L + +SD +
Sbjct: 129 TMRAPAAFRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFF 188
Query: 346 KEKLEALRKYGLSASECFPI-QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTS 404
+KL+ G+ + F I Q G P ++ + L+ + E + + + S
Sbjct: 189 ADKLDIAELNGMETTAYFDITQGQGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS 248
Query: 405 KKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
+ + +E A+ + +LD C+S++S Y ++ E L
Sbjct: 249 ---LPVSQENEAAICKVVLDGCQSTLSGYGTTIEEDEAL 284
>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
gaditana CCMP526]
Length = 446
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 91/347 (26%), Positives = 154/347 (44%), Gaps = 24/347 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEAS 161
GLVA I++GE L VP +L + DS + P G+V+ + ++ D L+A L+ EA
Sbjct: 88 GLVATAPIKQGETLATVPLNLCFSMDSVRASP-LGKVIGEFEPALGDASLIALQLLYEAH 146
Query: 162 FEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
S+++ YI +LPR L+W+ AE L S R I V Y +
Sbjct: 147 MGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE-QGVLAKSSTRNLGETLIDAVAEDYGWI 205
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN--HSC-- 273
+ + + F++ F+W+ ++ SR + + L P DM N C
Sbjct: 206 QSALARGGISGLQADSFDLSDFEWAVAVVLSRSFF---AENGLRLAPLLDMANRGEGCTN 262
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDS 331
E + +G+ DR G+++ ISYG KS E L +GFV P EG N
Sbjct: 263 EPQIGGLGIFGGKGLKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFVPPPLEG-NALVG 321
Query: 332 VELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
L+ K S D+ Y +K + + GL + F ++ G ++ +L K
Sbjct: 322 GMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDG-DVDPEMLQFLRFLKLGGK 380
Query: 389 GKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSR 434
F + A N++ ++ E +EQA+ + I+D+C +++ ++
Sbjct: 381 DAF-LLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAE 426
>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 519
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 133/295 (45%), Gaps = 22/295 (7%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVIT 127
E D L +++T + D + + +V E RG+ A ++I + +P +IT
Sbjct: 29 EYDCLNSSNTSASEVDDEEKKESPLTVSLEEVSEMRGVHARRSIPPHTTCVSIPRRCLIT 88
Query: 128 ADSKWSCPEAGEVLK---QCSVPDWPLLATYLI-SEASFEKSSRWSNYISALPRQPYSL- 182
+ + P +L+ P L YL+ + SS + Y LP ++
Sbjct: 89 VEMGQATPIGRAILQADLDLDAPKHIFLMIYLLWDRKTHGSSSFFHPYYEILPPTLRNMP 148
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
++W+ EL LE S + + +R + Y I P L + ++ FKW+
Sbjct: 149 IFWSAFELQE-LEGSHLLSQIADRGQAIQDDYE----AILEVAPSLGT--LCTLDEFKWA 201
Query: 243 FGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
+ SR L +DG ALVP ADMLNH ET +D+ +Q T+ + Q G
Sbjct: 202 RMCVCSRNFGL-QIDGHRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGA 260
Query: 301 QVFISYGKKSNGELLLSYGFVPR-----EGTNPSDSVELPLSLKKSDKCYKEKLE 350
QV+ SYG+K N LL+YGF +G P++ V L L + +D +++KLE
Sbjct: 261 QVYDSYGQKCNHRFLLNYGFAVEDNRELDGFCPNE-VPLELYVDPADILFQDKLE 314
>gi|432862431|ref|XP_004069852.1| PREDICTED: N-lysine methyltransferase setd6-like [Oryzias latipes]
Length = 450
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 68/256 (26%), Positives = 119/256 (46%), Gaps = 22/256 (8%)
Query: 96 QKVDVGERGLVALKNIRKGEKLLFVPPSLVI----TADSKWSCPEAGEVLKQCSVPDWPL 151
Q+ V + G++A +I +GE L +P S ++ TA S EA + Q S PL
Sbjct: 39 QEGTVADYGMLAKADIEEGEVLFTIPRSALLHQRTTAVSALLQKEAASL--QSSSCWVPL 96
Query: 152 LATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
L L S S W Y+S P R+ ++W++ E DR L + + E + ++N
Sbjct: 97 LLALLYEYTS--PQSDWKPYLSLWPDLRRLDHPMFWSKEERDRLLRGTGVPEAVDKDLSN 154
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS----------MDGR 259
+ Y D+ L +++PDL+ + +E + + + + P
Sbjct: 155 IQREYEDVVLPFMTRHPDLWNPKTHTLELYTELVAFVMAYSFQEPQEDEDDDEEEKPPNP 214
Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
+VP ADMLNH + L++ S+ + + R+ GE+VF +YG+ +N +LL YG
Sbjct: 215 PMMVPMADMLNHVSDHNANLEF--SADSLKMVSVRRIHAGEEVFNTYGQMANWQLLHMYG 272
Query: 320 FVPREGTNPSDSVELP 335
F N +++ ++P
Sbjct: 273 FTEPYPNNSNETADIP 288
>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
Length = 440
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 120/261 (45%), Gaps = 27/261 (10%)
Query: 79 LQKWLS-----DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
L++WL D+GL P G RGL++ +++G+ ++ +P S ++T ++
Sbjct: 34 LRRWLKGRKFEDTGLVPACFP----GTG-RGLMSKTALQEGQMIISLPESCLLTTNTVIR 88
Query: 134 CPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
G +K+ P PLLA T+LISE S W +Y+ LP+ Y+ ++
Sbjct: 89 -SSLGPYMKKWKPPPSPLLALCTFLISERHAGGQSLWKSYLDILPKS-YTCPVCLEPDVV 146
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFS 248
L ++ +A E+ +V + R FS LF E V F+ F W++ + +
Sbjct: 147 DLL-PQPLKAKAEEQRADVQDFFASSR-AFFSTLQPLFVEPVDGIFSYSAFLWAWCTVNT 204
Query: 249 RLVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
R V L S AL P+ D+LNHS V+ + + + T + + E
Sbjct: 205 RAVYLRSTRQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFSEKTGCYEIRTASRCRKHE 264
Query: 301 QVFISYGKKSNGELLLSYGFV 321
QVFI YG N LLL YGFV
Sbjct: 265 QVFICYGPYDNQRLLLEYGFV 285
>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
Length = 403
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 114/262 (43%), Gaps = 37/262 (14%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA L+ + + S+RW Y ALP SL+ W+ EL+ L+ S +R+RA+ R
Sbjct: 49 LAVALMQQTNGGASARWRAYCDALPAAVDSLMMWSDEELE-VLQGSALRQRAVFRRDLCK 107
Query: 212 GTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
Y+ L + P+ F + E ++ + F+W++ + +R LP + +AL+P D+ N
Sbjct: 108 REYDALFPALARADPETFGDVEAYSFDVFRWAYATVMARAFVLPDLQC-MALLPGLDIYN 166
Query: 271 H--------------SCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFISYGKKSNGELL 315
+CEV+ +D+S V Q G Q+F Y ++G L
Sbjct: 167 SARDAEKCVVERDEGACEVDDSSSFDESEARVTLRVGVGGVQAGSQLFHDYADHASGGAL 226
Query: 316 LSYGFV---PREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSASECFPIQI 367
L +GFV RE + D++++ L L + + + K+ + S F I
Sbjct: 227 LEFGFVYHGERERGSGVDALDVCLKPALARLDARSRAFLVDEDVFHKFNVRKSLTFEISN 286
Query: 368 TGWPLELMAYAYLVVSPPSMKG 389
G V P++KG
Sbjct: 287 VGG-----------VYKPALKG 297
>gi|259155405|ref|NP_001158764.1| N-lysine methyltransferase setd6 [Salmo salar]
gi|325530257|sp|C0H8I2.1|SETD6_SALSA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|223647186|gb|ACN10351.1| SET domain-containing protein 6 [Salmo salar]
gi|223673059|gb|ACN12711.1| SET domain-containing protein 6 [Salmo salar]
Length = 449
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 122/271 (45%), Gaps = 31/271 (11%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSK---WSCPEAGEVLKQCSVPDWPLLATYL 156
V E G++A ++I +GE LLF P + + + E G+ + + PLL +
Sbjct: 42 VAEYGMLAKEDIDEGE-LLFTIPRMALLHQGTTKVLAVLEEGKASLENTSGWVPLLLALM 100
Query: 157 ISEASFEKSSRWSNYIS------ALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
S S W Y+S AL ++W++ E DR L+ + I E +TN+
Sbjct: 101 YEYTS--PQSHWRPYLSLWSDFTALDHP----MFWSKDERDRLLKGTGIPEAVDTDLTNI 154
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------L 262
Y D+ L + +PDL+ E ++ ++ + + + P + +
Sbjct: 155 QKEYKDIVLPFITLHPDLWDPERHTLDLYRSLVAFVMAYSFQEPLDEEDEDEKDPNPPMM 214
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
VP ADMLNH L+Y + + + + R + GE+VF +YG+ +N +LL YG
Sbjct: 215 VPIADMLNHVSNHNANLEY--TPECLKMVSVRSIRKGEEVFNTYGQMANWQLLHMYGLXE 272
Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+N +D+ ++P+S YK ++ R
Sbjct: 273 PYQSNSNDTADIPMS-----NVYKAAVQVTR 298
>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 503
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 69/263 (26%), Positives = 125/263 (47%), Gaps = 23/263 (8%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
+E L W+ GLP +K+ ++ ++ G+ LV K +KG+ L+ VP S +T
Sbjct: 49 VETLPPLSAWVEQRGLPLKKLNVRPEIVEGDLCLVVSKPTKKGQPLVAVPSSAWLTQQVV 108
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
S G +++ + W +A +L+ E S + + W ++ ++P P L+W+ EL
Sbjct: 109 RSS-SIGSLVE--DLEPWLQIALFLLHERS-KPDAAWQGFLDSIPAAPDVPLFWSEEELS 164
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
+ LE +Q+ Y +L ++F+ + + FP + ++ F W+ + SR V
Sbjct: 165 Q-LEGTQLLSSVQGYRQFFEAKYAELEEQLFAPHREAFPPKSHQLDDFLWAVATVRSR-V 222
Query: 252 RLPSMDGR-VALVPWADMLNH------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
P +DG VALVP AD++ H +++ +Q +V R Y GE V +
Sbjct: 223 HSP-LDGEDVALVPLADLVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRDYAEGEVVTM 281
Query: 305 SYG--------KKSNGELLLSYG 319
+G +K + ++LL YG
Sbjct: 282 DFGAPLTEEDQEKLDSQVLLDYG 304
>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
nagariensis]
Length = 484
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/285 (24%), Positives = 131/285 (45%), Gaps = 14/285 (4%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
L+A + ++G+ L VP S ++A+S P W +A L+++
Sbjct: 71 LIASTDAQQGDVLFSVPDSAWLSAESVKKAAVGKLAAAAGLEP-WLQIALQLVADRFGST 129
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
S S Y +++P + L W+ EL + L+ +Q+ + +T T+ L+ +F+
Sbjct: 130 KSELSAYAASIPEDLDTPLLWSEDEL-QELQGTQVLQTLGGYLTFFRSTFQQLQSGLFTS 188
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVETFLDYDK 283
P FP +F + F W+ + SR P +DG ++AL P ++++H + L
Sbjct: 189 NPAAFPPSIFTLPRFLWAVAAVRSR--SHPPLDGPKIALAPLTELVSHRRAANSKLSVRS 246
Query: 284 S-----SQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLS 337
+ Q +V R + GE + + YG K +G +L+ YG + + T+P L L
Sbjct: 247 AGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVM--DVTSPKPGYSLTLK 304
Query: 338 LKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LELMAYAYLV 381
+ SD+ +KL+ L L S + + P +E++A+ L+
Sbjct: 305 MPDSDRFIDDKLDILESNDLPQSVVYNLTPDEQPTIEMLAFLRLM 349
>gi|302854198|ref|XP_002958609.1| hypothetical protein VOLCADRAFT_108207 [Volvox carteri f.
nagariensis]
gi|300256070|gb|EFJ40346.1| hypothetical protein VOLCADRAFT_108207 [Volvox carteri f.
nagariensis]
Length = 360
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 114/272 (41%), Gaps = 41/272 (15%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
W+S G K+ I + G RG+ +++RKGE L+++P LV + + + A +L
Sbjct: 37 WISQEG-GEFKVTISRTSAGVRGVFTTQDVRKGELLIYIPDHLVFSVRNVPAAEGAPLLL 95
Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE------LDRYLE 195
K+ P SR + Y+ LPR+ L + E D LE
Sbjct: 96 KELFTP-----------------CSRLTPYLRVLPRETQVLTGYNFPEEYIKFLADDNLE 138
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
Q+R + + ND + + P+ ++ F++ +L SR L
Sbjct: 139 L-QVRGSFKKHCRSTFEGQNDENM--MTTIPEAIGSVNISLPYFEYVVSMLSSRTFSLRR 195
Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+++VP D++NH LD ++ +GV + GE+V I+YG + ELL
Sbjct: 196 --DALSMVPLLDLMNHDIRDINQLDSSRAYRGVRVVAGKDLAKGEEVTITYGNMRSDELL 253
Query: 316 LSYGFV------PR------EGTNPSDSVELP 335
L YGF+ PR NP D ELP
Sbjct: 254 LYYGFLDTITDPPRLLAVDHRNYNPQDGAELP 285
>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
putative [Ricinus communis]
Length = 558
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 133/277 (48%), Gaps = 12/277 (4%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
S + ++ +W +G+ ++ I V+ RG +A ++++ G+ L +P S++I+ +
Sbjct: 156 SCDKEKSIAEWGQRNGVH-SRLEIVYVEGAGRGAIATEDLKVGDIALEIPVSIIISEELV 214
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+ K + +L + + E +S+ Y LP++ + L + +D
Sbjct: 215 RHSDMYHILEKIDGISSETMLLLWSMKE-RHNCNSKSKIYFDTLPKEFNTGLSFG---VD 270
Query: 192 RYL--EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
+ + + + + ++ ++ Y++L + + YPD+FP E++ E F W+ + +S
Sbjct: 271 AIMASDGTLLFDEIMQAKEHLRVQYDELVPALCNNYPDVFPPELYTWEQFLWACELWYSN 330
Query: 250 LVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+++ +DG R L+P A LNHS + + D + + F R + GEQ +S
Sbjct: 331 SMKIKFLDGKLRTCLIPIAGFLNHSLHPHIIHYGKVDSITNTLKFPLSRPCRVGEQCCLS 390
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
YG S L+ YGF+P +G N D + L + ++D
Sbjct: 391 YGNFSGAHLITFYGFLP-QGDNRYDIIPLDIDAGEAD 426
>gi|147777505|emb|CAN60498.1| hypothetical protein VITISV_027869 [Vitis vinifera]
Length = 2077
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/184 (28%), Positives = 90/184 (48%), Gaps = 10/184 (5%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+S+++ Y +ALP + L + E D L + + E IE ++ Y +L +
Sbjct: 1470 NSKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALC 1526
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETF 278
+PD+FP E + E F W+ + +S +++ DG R L+P A LNHS + +
Sbjct: 1527 KDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHY 1586
Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
D + + F + GEQ ++SYG S+ L+ YGF+P +G N D++ L +
Sbjct: 1587 GKVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDN 1645
Query: 339 KKSD 342
+ D
Sbjct: 1646 PQGD 1649
>gi|313216417|emb|CBY37730.1| unnamed protein product [Oikopleura dioica]
gi|313234608|emb|CBY10563.1| unnamed protein product [Oikopleura dioica]
Length = 432
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 125/266 (46%), Gaps = 37/266 (13%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT--------ADSKWSCPEAGEVLKQCSVPDWPLLAT 154
RG+ A++ ++KG+ + + +IT ++K+ C AG + + +V + LL
Sbjct: 33 RGVQAVERVQKGKNIFHITSDWLITPRFVVENLKEAKYFCEAAGRIGR--AVDAFDLLIL 90
Query: 155 YLISEAS----FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIR---ERAIERI 207
++++E++ F + +R++ Y LP + YS+ Y+ + R L +Q+R E+ + ++
Sbjct: 91 WIVTESTRETYFGRKTRFAGYFETLPIK-YSVPYFVPEKYRRLL-TNQVRTDVEKELNKL 148
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL-------PSMDG-- 258
+ T+ +R I S Y E + F+W+ + +R + + +DG
Sbjct: 149 YDRHETFEIIRKEIRSTYHQEIISEC-SWVKFRWAAATIKTRQIYIFDEKYEELKIDGLQ 207
Query: 259 ------RVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
L PW DMLNH V + G+V R PGEQ+ ISY ++S
Sbjct: 208 IGTPFDSSGLAPWFDMLNHGDVAHVNCKFYCTSPADGLVCEALRDILPGEQLLISYDERS 267
Query: 311 NGELLLSYGFVPREGTNPSDSVELPL 336
+ ++L+ YGF G N +E+ L
Sbjct: 268 DDQMLVDYGFSLGPGENQRTFLEITL 293
>gi|66806627|ref|XP_637036.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
gi|60465490|gb|EAL63575.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
Length = 532
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 154/371 (41%), Gaps = 94/371 (25%)
Query: 81 KWLSDSGLP-PQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+W ++G+ +K+++ D+G RG++A I++ E L+ +P +I + SK+S +
Sbjct: 3 EWGINNGIEWNEKLSVHDFEDIG-RGVIANHEIKQDEVLISIPEKFLIHSKSKFSLEKLN 61
Query: 139 ----------------EVLKQCSVPDWPL------------LATYLISEASFEKSSRWSN 170
L S+ P ++ +LI E +K+S W N
Sbjct: 62 PPIIKKIKSYIKTFVENNLSPSSIFYKPFHDSVNQFNSKQRISFHLIIEKLLKKNSIWYN 121
Query: 171 YISALPRQPYSLLYWTRAELDR-----YLE-ASQIRERAIERITN----VIGTY-NDLRL 219
Y++ LP + + E++ Y+E +++ +E + ++ Y NDL
Sbjct: 122 YLNDLPTEYNITSTYDDDEIEHLGYPIYVEKVLELKNEMLESFDSFKEILMDNYKNDLN- 180
Query: 220 RIFSKYPD------------------------------LFPEEVFNMETFKWSFGILFSR 249
RI K D + +E+ + ++W +G + SR
Sbjct: 181 RIVIKLNDNSNDDDDDGGGGGGGGGGGGGGGGDDENITIKLKEIIDFNLYQWCWGTIQSR 240
Query: 250 -------LVRLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
+ LP ALVP AD+ NHS +V T +D+ Q T +++
Sbjct: 241 TYYYDRNMKELPKHLQLEDKDDCALVPLADLFNHSSDVNTETKFDEKKQCYQVITKTKFE 300
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPS------DSVE---LPLSLKKSDKCYKEK 348
QVFISYGK SN L+ YGF+ +N S D++ L +K+ K Y+ K
Sbjct: 301 KDSQVFISYGKHSNFTLMNYYGFIIENNSNDSIPLVQEDAIPDIILEKEMKQDLKSYERK 360
Query: 349 LEALRKYGLSA 359
+ L +YGLS
Sbjct: 361 MSILEQYGLSV 371
>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
Length = 400
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 160/359 (44%), Gaps = 61/359 (16%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RG+ A K+I++G+ L +P S+++ S+ + +V + + W L ++ E
Sbjct: 39 RGVTANKDIKEGDLLFSLPRSILL---SQLTSSLKDQVSELSELSGWSPLILCMMYEIE- 94
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+ S W Y LPR+ + ++W + +L + LE + I + ++ + + +N+L I
Sbjct: 95 KPDSFWKPYFDVLPREFTTPMFWNQEDL-KELEGTDIISKIGKKESEEL-FHNELE-PII 151
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFS-----RLVRLPSMD-------------------- 257
KYP+LF E+ +E F ++ + L + P +
Sbjct: 152 KKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEEEEEEEEEEEE 211
Query: 258 ---GRVALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
G +++VP ADMLNH F + D + + + GEQ++ +YG N
Sbjct: 212 EEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAI----KDIKEGEQIYNTYGDLCN 267
Query: 312 GELLLSYGFVPREGTNPSDSVEL--PLSL----KKSDKCYKE-KLEALRKYGLSASECFP 364
+LL YGFV + N D VEL PL + + D+ KE K++ L + G+ ECF
Sbjct: 268 ADLLRKYGFVDEK--NDFDLVELDGPLLVEVCCEDQDEALKERKIDFLMEEGV-LDECFV 324
Query: 365 IQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFIL 422
I P EL+ +++ + K EE K+T K+IK + LQ IL
Sbjct: 325 IDKEHEIPPELIVSVHVLCTTADDFQKMEEKQKLPKPKLT--KEIK------EKLQIIL 375
>gi|428174289|gb|EKX43186.1| hypothetical protein GUITHDRAFT_110913 [Guillardia theta CCMP2712]
Length = 437
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 90/188 (47%), Gaps = 21/188 (11%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWT--RAELDRYL-EASQIRERA-IER 206
LA +L+ E+ KSS W Y+ +LP+ P + Y R +L L E +++ A +E
Sbjct: 168 LAVFLLLESQ-NKSSFWRPYLCSLPKHVPLPMFYSKERRQQLKEQLPEDQRVKFDALVEA 226
Query: 207 ITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---- 261
+V+ Y L +F KYP LF EVF+ E F W+ I+ SR D +
Sbjct: 227 RRDVVDLHYMQLLPVLFLKYPTLFSPEVFSYEKFAWAISIIMSRTWGKTYFDSALGPRGR 286
Query: 262 ------LVPWADMLNHSCEVETFLDYDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGEL 314
L P ADM NH + L+ ++ +G + + GEQ FISYG K + E
Sbjct: 287 NITVHTLAPAADMPNHDS---SGLEANRDPRGRMTLNAQKNLSVGEQFFISYGSKCDAEF 343
Query: 315 LLSYGFVP 322
L YGFVP
Sbjct: 344 LAHYGFVP 351
>gi|308806756|ref|XP_003080689.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
gi|116059150|emb|CAL54857.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
Length = 472
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 126/301 (41%), Gaps = 42/301 (13%)
Query: 71 DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
D TL +W +D G + + + RGL +++R GE++L + I D
Sbjct: 31 DDASRVETLARWCADRGTYARALRVDLDQGSGRGLELSRDVRAGERVLGASLTSGIV-DE 89
Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS--LLYWTRA 188
PE P LA ++ E +S ++ Y++ LP + S LY RA
Sbjct: 90 ARGHPERTRA-AMAEAPWGVRLACRVLQERKKGGASAYAAYVATLPERVESSPALYDARA 148
Query: 189 -ELDRYLEA-SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
E +Y A ++IRE + R T + ++ P+ + VF+ + F + ++
Sbjct: 149 IEEVQYPPAMTEIRE--MRRATR------EWHEKLQKTAPEALGDAVFDYDAFVDAVSVV 200
Query: 247 FSRLVRLPSMDGRV----ALVPWADMLNHSCEVETFLDYDKSS----------------- 285
SR + S + AL+P ADM+NH ++ T L D+ +
Sbjct: 201 HSRTYGIASANDNAGLFRALLPLADMINHGGDIVTGLTKDEETGAVTNVETTATDNIAWS 260
Query: 286 ----QGVV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
GVV F R GE +SYG++SN L+ YGF P NP D L +L+
Sbjct: 261 ELDDDGVVHFAATRDIAEGEAALMSYGERSNDHFLIYYGFAP--DNNPHDDCVLFSNLEH 318
Query: 341 S 341
+
Sbjct: 319 A 319
>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
Length = 440
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 33/312 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++R+G+ ++ +P S +IT D+ G
Sbjct: 36 LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLITTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-- 194
+ + P PLLA T+L+SE S W Y+ LP Q Y+ E+
Sbjct: 95 TYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILP-QAYTCPVCLEPEVVNLFPK 153
Query: 195 ----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
+A + R R E ++ ++ L+ +FS+ E +F+ W++ + +R
Sbjct: 154 PLKAKAEEQRARVQEFFSSSRDFFSSLQ-PLFSEAV----ESIFSYRALLWAWCTVNTRA 208
Query: 251 VRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
V + + AL P+ D+LNHS EV+ +++ ++ T + E+V
Sbjct: 209 VYVKHRQRQCFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEV 268
Query: 303 FISYGKKSNGELLLSYGFV----PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
FI YG N LLL YGFV P S+ + L L +DK +K+ L+ +
Sbjct: 269 FICYGPHDNQRLLLEYGFVSIHNPHACVYVSEDI-LVKYLPTTDKQMNKKISILKDHDFI 327
Query: 359 ASECFPIQITGW 370
+ F GW
Sbjct: 328 ENLTF-----GW 334
>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
Length = 493
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 131/311 (42%), Gaps = 27/311 (8%)
Query: 73 LENASTLQKWLSDSGLPPQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
+E + +W + G + + + + + G GL + I GE ++ VP S+ ++
Sbjct: 69 METVAHFMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNE 128
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
+ E++ + + +LA LI E F S W Y+ LP + + LY+T +
Sbjct: 129 PRYRQLLELMPGAMMSEQGNIMLALALIME-RFRAKSDWKPYLDLLPDRYTTPLYYTTED 187
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
+ E A++ ++ Y +R + K +L + F + F+W+ + +R
Sbjct: 188 MGELAETDAFLP-ALKLCKHIARQYGFIRRFVQEKVDEL--RDCFTYDVFRWAVSTVMTR 244
Query: 250 LVRLP-------SMDGRVALVPWADMLNHS---------CEVETFLDYDKSSQGVVFTTD 293
++P MD +AL+P DM NH+ C ET + T +
Sbjct: 245 QNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTRE 304
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
+FI YG +++ E L+ GFV PR NP +V+ +L + YKE+ L
Sbjct: 305 VSDIASVPIFIVYGTRTDAEFLVHNGFVCPR---NPHANVQKRFTLVPAIPLYKERAHLL 361
Query: 353 RKYGLSASECF 363
G+ + F
Sbjct: 362 ELLGMPTTGTF 372
>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Metaseiulus occidentalis]
Length = 278
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 74/133 (55%), Gaps = 6/133 (4%)
Query: 237 ETFKWSFGILFSRLVRLPSMD-GRV--ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
E +W+ + +R LPS+ GR+ ALVP DM NH + + DYD +SQ +V
Sbjct: 37 EPDEWACSTVMTRQNELPSLTPGRMQMALVPLWDMCNHDT-LRSGTDYDVASQQLVSFAT 95
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
R+Y+ EQV I YG ++N + +L GFVP E N DS+ + + L K+DK ++ K
Sbjct: 96 REYKKNEQVNIFYGNRANAQFMLHNGFVPDE--NQWDSLAIKIGLSKADKLFEMKRRLCE 153
Query: 354 KYGLSASECFPIQ 366
+ + S+ F ++
Sbjct: 154 QMKIPTSDVFELK 166
>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
Length = 452
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 112/264 (42%), Gaps = 44/264 (16%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
L + + +R+G FVP +LV+ + P + P+ T L++ AS
Sbjct: 83 LASSRVLREGSS--FVPSALVLGVHMLVNFPHSEN-------PE-----TSLMAMASMNT 128
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
Y++ALPR LYW D+ E Q E A + + Y+ + +F
Sbjct: 129 PPLDELYVNALPRYVDLPLYWD----DKQFEELQGCEEARRAMQHGARFYSQVYKHLFGA 184
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH----SCEVETFLD 280
+ N E F W+ IL SR + AL+P+ D NH S LD
Sbjct: 185 N-----NQFVNAEAFFWAISILMSRATS--GQNQPFALIPFFDWFNHAGNGSDNCRHALD 237
Query: 281 YDKSSQ------GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVE 333
D+ Q G T R Y+PGEQ+FI+YG N LL +YGF +P NP D V
Sbjct: 238 SDECVQDFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMP---NNPYDVVN 294
Query: 334 LPLSL-----KKSDKCYKEKLEAL 352
LP+ ++D + +K + L
Sbjct: 295 LPMPAALQQPNEADPAFAQKRDLL 318
>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
Length = 320
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/246 (28%), Positives = 116/246 (47%), Gaps = 24/246 (9%)
Query: 77 STLQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
S L +WL G+ + A+ + +G L A K I GE L VP L++ +
Sbjct: 2 SALLRWLEGGGVQLGGVEAVWREGMGW-ALRASKRISPGETFLKVPRHLLL-GPHQLRAS 59
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+L+ +PD LL L+ S SS + Y+ LP + + W++ E + L
Sbjct: 60 SLDRLLEGWQLPDCMLL---LLMCESVNSSSFFRPYLDLLPDTVDTPITWSKEEA-KELV 115
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
S + RA++ + ++ +++ ++F KYPD FP +F+ E ++W++ IL SR
Sbjct: 116 GSPVLHRAVKLRHELARSFQEMKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAF---- 171
Query: 256 MDGRVALVPWADMLNHSCE---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
G L+P D++NH + T L S R+Y V+ YG+KS+
Sbjct: 172 --GNYTLMPLIDLMNHHPDSRLAPTLL----SDGSDALIARREY----NVWGFYGRKSDA 221
Query: 313 ELLLSY 318
+LLL+Y
Sbjct: 222 DLLLNY 227
>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
Length = 520
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/385 (24%), Positives = 163/385 (42%), Gaps = 46/385 (11%)
Query: 74 ENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
E+A L WLS D G+ +A ++ G ++ + G L VP S +T+
Sbjct: 48 EDARELAAWLSYDKGVDASALAFKEDAKGGVRVILKADAEAGATALRVPQSAAVTSVDVG 107
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
P E+ P+ LA +L +E +S W+ Y+ L P + L+WT A+
Sbjct: 108 EHPIVSEL--ASGRPELIGLALWLCAERIKGGASEWAPYVKTLRANPDAPLFWTDAKDFA 165
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET---FKWSFGILFSR 249
L+ S + AIER + Y + + P +P E + T F + + ++
Sbjct: 166 LLKGSPVAADAIERSKSARTEYASI-TEVIKSDPSSYPPEAYEFLTEARFVDALATVCAK 224
Query: 250 LVRLPSMDGRVALVPWADMLN-HSCEVETFL--------------DYDKSSQGVVFTTDR 294
LP+ ALVP D+++ V L DYD + VV
Sbjct: 225 ATWLPTAQC-YALVPLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCAT 283
Query: 295 QYQPGEQVF-ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ +V + +++NGEL L+ G+V ++ +P D + + ++ SD+ + K + L
Sbjct: 284 KAAANSEVIQLDALQRNNGELFLNTGYVDQK--HPGDYIYMKTDIQTSDRLFTAKKQVLE 341
Query: 354 KYGLSASE-CFPIQITGWPLELMAYAYL----VVSPPSMKG-KFEEMAAAASNKMTSKKD 407
G +A++ FP+ P +L Y+YL V P M FEE +K+ S +
Sbjct: 342 GMGFTAADQYFPVYKDRMPTQL--YSYLRFSRVQDPGEMMAVSFEE------DKIVSVMN 393
Query: 408 IKCPEIDEQALQFILDSCESSISKY 432
+ + LQ ++ C +++Y
Sbjct: 394 ------EYEILQILMGDCRELMAEY 412
>gi|357479689|ref|XP_003610130.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
truncatula]
gi|355511185|gb|AES92327.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
truncatula]
Length = 689
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 71/133 (53%), Gaps = 7/133 (5%)
Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNH 271
Y++L + + +PD+FP E++ E F W+ + +S +++ DG R L+P A LNH
Sbjct: 423 YDELVPALCNGFPDIFPPEIYTWENFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFLNH 482
Query: 272 SC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
S + + D S+ + F R + GE+ +SYG S+ + YGF+P +G NP
Sbjct: 483 SLCPHITHYGKVDPSTNSLKFCLSRSCRSGEECCLSYGNFSSSHFITFYGFLP-QGDNPY 541
Query: 330 DSVELPLSLKKSD 342
D + PL + SD
Sbjct: 542 DVI--PLDIDSSD 552
>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
Length = 415
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 129/285 (45%), Gaps = 24/285 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++R+G+ ++ +P S ++T D+ G
Sbjct: 12 LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIR-SYLG 70
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 71 AYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPKA-YTCPVCLEPEVVN-LFP 128
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ V G ++ R S P LF E V F+ W++ + +R V +
Sbjct: 129 KPLKAKAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVESIFSYSALLWAWCTVNTRAVYV 187
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ ++ T + E+VFI
Sbjct: 188 KHRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFIC 247
Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSV---ELPLSLKKSDK 343
YG N +LLL YGFV P S+ V LPL+ K+ +K
Sbjct: 248 YGPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQMNK 292
>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
Length = 336
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 137/304 (45%), Gaps = 22/304 (7%)
Query: 82 WLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
WL G +A+ + R L A + + GE ++ LV+T + K C E +
Sbjct: 45 WLRSRGEDMNSIAVAIGMSKHGRALFAHRPMCAGECMIKFSQDLVLTPE-KLPC-EVIAL 102
Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
L Q + ++ ++ +++E ++S W+ YI LP + +S ++W EL LE S
Sbjct: 103 LDQAN--EFTRVSLLVMAEKRKGQNSAWAPYIECLPSFGEIHSTIFWDPKEL-ACLECSP 159
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLPSMD 257
I ER + Y +++ ++ P L+ +V ++E FK + + SR + P D
Sbjct: 160 IHRGTGERNALLQSEYREVK-KVVESCPHLYDPDV-SLEQFKHEYATVSSRAWGQGPHSD 217
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF-----ISYGKKSNG 312
+ ++P D NH T + + VV + R YQ G++ F I YG SN
Sbjct: 218 --MTMIPLVDFANHDPRSRTLFSHADDNCTVVVAS-RDYQTGDENFHLKVHICYGDHSNA 274
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWP 371
L L YGFV + NP D E+ L + D + KL+ + + ++ + Q G P
Sbjct: 275 VLALDYGFVVPD--NPFDEAEIFLEIPSEDPLREIKLQYMAQNNMNTLQDSNGTQTGGRP 332
Query: 372 LELM 375
+M
Sbjct: 333 FTIM 336
>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
Length = 475
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/262 (26%), Positives = 117/262 (44%), Gaps = 37/262 (14%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
G RGL A++++R+GE +L VP S ++T ++ + + V + S+ +L L+ E
Sbjct: 51 GGRGLGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYE 110
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL----EASQIRERAIERITNVIGTYN 215
K+SRW Y+ LP Y +L E +++ EA + E+A+ + + +
Sbjct: 111 MGKGKTSRWHPYLMHLP-HTYDVLA-MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAH 168
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-- 273
L + +F + F + + W+ + SR + +P D L P D+ N+
Sbjct: 169 SLMQDL------MFKPQFFTFKAWVWAAATISSRTLHIP-WDEAGCLCPVGDLFNYDAPG 221
Query: 274 -EVETFLDYDKSSQ------------------GVVFTTDRQYQPGEQVFISYGKKSNGEL 314
E D D + Q F Y+ G+QV + YG +N EL
Sbjct: 222 IEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLEL 281
Query: 315 LLSYGFVPREGTNPSDSVELPL 336
L YGF+ +E NP+D V +PL
Sbjct: 282 LEHYGFLLQE--NPNDKVFIPL 301
>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
Length = 629
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 117/258 (45%), Gaps = 14/258 (5%)
Query: 71 DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
++L+ + L W+ + + ++ + R +V+ + I+ E ++ +P VIT D
Sbjct: 145 ETLKKSENLLSWVQANKGEFSSIKLKYLSTHNRSIVSKRIIQADETVISIPQEQVITLDV 204
Query: 131 KWS---CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
S C E Q A +L+ E + +S + YI +LP S
Sbjct: 205 ASSSDFCKILTEKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYIDSLPTDLSSFPALFS 264
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
E +YLE + + E+ ++ Y + I P+ E F+ E F+W+F
Sbjct: 265 EEELQYLEGTAALKLVQEQKEDIKTDYESISQVI----PEFKSE--FSFEQFRWAFLCSH 318
Query: 248 SRL--VRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
SR+ +++ + V +VP ADMLNH S + ++ +D ++ ++ Q +Q+
Sbjct: 319 SRVFGIKVKGVKTSV-MVPLADMLNHKHSGQEDSEWVFDDATNCFTVKALKKIQRNQQIH 377
Query: 304 ISYGKKSNGELLLSYGFV 321
SYG K N +L L+YGFV
Sbjct: 378 FSYGSKCNSKLFLNYGFV 395
>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
melanoleuca]
Length = 440
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 129/285 (45%), Gaps = 24/285 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++R+G+ ++ +P S ++T D+ G
Sbjct: 36 LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPK-AYTCPVCLEPEVVN-LFP 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
++ +A E+ V G ++ R S P LF E V F+ W++ + +R V +
Sbjct: 153 KPLKAKAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVESIFSYSALLWAWCTVNTRAVYV 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
+ AL P+ D+LNHS V+ +++ ++ T + E+VFI
Sbjct: 212 KHRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSV---ELPLSLKKSDK 343
YG N +LLL YGFV P S+ V LPL+ K+ +K
Sbjct: 272 YGPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQMNK 316
>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
Length = 749
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 117/233 (50%), Gaps = 16/233 (6%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT-- 208
L+ YL + K W YI+ LP L +T EL+ ++ +++ A+E+I
Sbjct: 39 LVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTIDELNL-MKGTKLY-AAVEKINAF 95
Query: 209 --NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRV-ALV 263
V+ YN+ ++ F +Y F +++F + +W+ +SR LV P G V +L+
Sbjct: 96 LMKVVDYYNNKLIQQFPQYFQSF-DDLF--KRLQWAHQSFWSRAFLVIYPQPFGEVGSLI 152
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPGEQVFISYGKKSNGELLLSYGFVP 322
P+ D NH + + + ++ F T+ + +PGEQ+F +Y +SN +LLL YGFV
Sbjct: 153 PFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGFV- 211
Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
E NP D++ L + + D Y E E L++ + + + F PLELM
Sbjct: 212 -EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFFLKLDEDIPLELM 263
>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
Length = 403
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 138/304 (45%), Gaps = 22/304 (7%)
Query: 82 WLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
WL G +A+ + R L A + + GE ++ +LV+T + K C E +
Sbjct: 45 WLRRRGEDMNSIAVAIGMSKHGRALFAHRPMCAGECMIKFSQNLVLTPE-KLPC-EVIAL 102
Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
L Q + ++ ++ +++E ++S W+ YI LP + +S ++W EL LE S
Sbjct: 103 LDQAN--EFTRVSLLVMAEKRKGQNSAWAPYIECLPSFGEIHSTIFWDPKEL-ACLECSP 159
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLPSMD 257
I ER + Y +++ ++ P L+ +V ++E FK + + SR + P D
Sbjct: 160 IHRGTGERNALLQSEYREVK-KVVESCPHLYDPDV-SLEQFKHEYATVSSRAWGQGPHSD 217
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF-----ISYGKKSNG 312
+ ++P D NH T + + VV + R YQ G++ F I YG SN
Sbjct: 218 --MTMIPLVDFANHDPRSRTLFSHADDNCTVVVAS-RDYQTGDENFHLKVHICYGDHSNA 274
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWP 371
L L YGFV + NP D E+ L + D + KL+ + + ++ + Q G P
Sbjct: 275 VLALDYGFVVPD--NPFDEAEIFLEIPSEDPLREIKLQYMAQNNMNTLRDSNGTQTGGRP 332
Query: 372 LELM 375
+M
Sbjct: 333 FTIM 336
>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
Length = 439
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 126/267 (47%), Gaps = 19/267 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LKKWLKDRRFEDATLIPARFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKA-YTCPVCLEPEVVNLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLV-- 251
+ ++ +A E+ ++V ++ R S P LF E + F+ +W++ + +R V
Sbjct: 153 NPLKTKAWEQRSHVQEFFSSSRGFFSSLQP-LFSEAIETIFSYRALRWAWCTVNTRAVYM 211
Query: 252 -RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R P + AL P+ D+LNHS +V+ +++ ++ T + ++VFI
Sbjct: 212 KRPPQLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFIC 271
Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSV 332
YG N LLL YGFV +NP V
Sbjct: 272 YGPHDNHRLLLEYGFV--SVSNPHACV 296
>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
Length = 386
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 128/288 (44%), Gaps = 21/288 (7%)
Query: 80 QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
Q+ S + L P A D G RGL L+N++ G+ L+ +P S ++T + + G
Sbjct: 40 QRGFSTTLLHPAAFA----DTG-RGLQVLRNVKPGDMLISLPESCLLTTSTVLNS-YLGS 93
Query: 140 VLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
+K PLLA +L+ E ++S W YI LP+ Y+T + +
Sbjct: 94 FIKSWKPHLSPLLALCVFLVCERHRGEASDWFPYIDVLPKSYTCPAYFTDEVMALLPPSV 153
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS-- 255
Q + R I + N R E+V E +W++ + +R V +
Sbjct: 154 QRKAREQREAVREIHSSNKAFFRSLQPVLTQPAEDVLTYEALRWAWCSVNTRSVFMLHSS 213
Query: 256 ---MDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ G+ AL P+ D+LNH +V+ +++ ++ + + +Q FI+YG
Sbjct: 214 NDFLRGQDVYALAPFLDLLNHCPDVQVKASFNEETKCYEIRSVSRMLQYQQAFINYGSHD 273
Query: 311 NGELLLSYGFVPREGTNPSDSV----ELPLSLKKSDKCYKEKLEALRK 354
N L+L YGFV NP V +L + + D+ ++KL+ LR+
Sbjct: 274 NQRLMLEYGFV--APCNPHSVVYVDKDLIADVLRGDQSLEQKLKFLRE 319
>gi|312098619|ref|XP_003149111.1| hypothetical protein LOAG_13557 [Loa loa]
gi|307755724|gb|EFO14958.1| hypothetical protein LOAG_13557 [Loa loa]
Length = 288
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 124/292 (42%), Gaps = 37/292 (12%)
Query: 77 STLQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
+ +W+ ++G + I+ + G +GL A + R+ E ++ +P ++ITA P
Sbjct: 4 TDFMEWVIENGGEHFGVDIRDCSNEGGKGLYATTDFRENETVICIPMEIIITAGFVAEMP 63
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+V K+ + + L + + E EK+S+W Y LP+ + T A L LE
Sbjct: 64 GYCDVFKRYRLKPFEALVYFFLVEK--EKNSKWDPYFKVLPKS-----FSTPASLHPVLE 116
Query: 196 ASQ----IRERAIERITNVIGTYNDLRL-------------RIFSKYPDLFPEEVFNMET 238
+R++ + + Y R R +S++ + +
Sbjct: 117 PEDFPYCLRKQWCIQKNELKTMYEKARFVTEGTAGEFVPHNRFYSQFVAILADNTI-WGH 175
Query: 239 FKWSFGILFSRLVR-----LPSMDG----RVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
F W++ I+ +R + P +D +A+VP DMLNHS + + +D
Sbjct: 176 FLWAWHIVNTRCIYRDNKPHPLIDNTEGDSLAIVPLIDMLNHSNDSQCCAIWDSKLNLYK 235
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
R GEQ+FI YG +NG L + YGF ++ N + VE+ L S
Sbjct: 236 AIVTRPIHEGEQIFICYGSHTNGSLWIEYGFYLKD--NICNKVEISLGWFNS 285
>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 724
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/307 (26%), Positives = 127/307 (41%), Gaps = 47/307 (15%)
Query: 72 SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITAD 129
S E Q+W SG+ + + + G RG+ A NI KGE L+ +P +LV++
Sbjct: 70 SGEGPLGFQEWALQSGITSPSLRLAEF-AGLRGMAAADNIAKGEVLVSLPVAAALVVSPK 128
Query: 130 SKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
+ P S W + +A L+ E +S+ + Y++ALP + L W+ A
Sbjct: 129 ERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALPVDFSTPLSWSEA 188
Query: 189 ELDRYLEASQIRERAIER------------------IT--NVIGTYNDLRLRIFSKYPDL 228
+L IRE A +R IT ++I +R R FS P
Sbjct: 189 QLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSG-PYA 247
Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------------------LVPWADM 268
P ++TF + + + ++G +A + P D
Sbjct: 248 GPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDF 307
Query: 269 LNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
LNH V++ ++Y+ + + GEQVFISYGK+SN LL YGFV E P
Sbjct: 308 LNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDSLLQYYGFV--EPGIP 365
Query: 329 SDSVELP 335
D+ +P
Sbjct: 366 HDTYTIP 372
>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
Length = 441
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 132/297 (44%), Gaps = 26/297 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L K+L G + +RGL L+ I+ G L+ +P S ++T + G
Sbjct: 36 LMKFLHGRGFTSTPLQPALFSDTDRGLQTLQPIQPGGMLVSLPESCLLTTSTVLH-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
LK L+A +L+ E ++S W YI LP Y+T + +
Sbjct: 95 PFLKSWKPRPSSLVALCVFLVCERHRGEASDWFPYIDVLPCSYCCPPYFTDTVMA--VLP 152
Query: 197 SQIRERAIER---ITNVIGTYND--LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
S +R RA E+ + ++ + D + L+ +P PEEV E +W++ + +R V
Sbjct: 153 SGVRRRAEEQREGLQHLYAVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSV 209
Query: 252 RL--PS---MDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ PS + G AL P+ D+LNH +V+ ++++S + Q Q FI
Sbjct: 210 FMDRPSSSFLSGPDNYALAPFLDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFI 269
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV----ELPLSLKKSDKCYKEKLEALRKYGL 357
+YG N LLL YGFV NP + +L + + D+ EK++ LR+ G
Sbjct: 270 NYGSHDNQRLLLEYGFV--SSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGF 324
>gi|156538697|ref|XP_001607787.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Nasonia
vitripennis]
Length = 403
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/193 (27%), Positives = 96/193 (49%), Gaps = 10/193 (5%)
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNM 236
Y L+ + + L+ S E A+++ N+ Y+ + ++F + + +VF
Sbjct: 104 YGLVLYMSMDDMMELKESPALETALKQCRNIARQYSYFK-KLFHNSKNSVSKLLADVFTY 162
Query: 237 ETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
E ++W+ + +R +PS + +L+P DM NHS E + ++++ S
Sbjct: 163 EEYRWAVSTIMTRQNVIPSENQSAMVHSLIPMWDMCNHS-EGKITTNFNEISNCCECYAM 221
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ ++ +Q+FI YG ++N E + GFV + N DS EL L + SDK EK+E L
Sbjct: 222 KSFKTDDQIFIYYGSRTNAEFFVHSGFVYPDNAN--DSYELHLGIGSSDKLRSEKVELLS 279
Query: 354 KYGLSASECFPIQ 366
K GL S FP++
Sbjct: 280 KIGLPVSNQFPLK 292
>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 440
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 124/255 (48%), Gaps = 24/255 (9%)
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASF 162
+ +N+ G LL +P S V++ +S + G +L+ PD P L +L+ E +
Sbjct: 37 ITTENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLLEVN--PDLPSAIALGLHLLEERAL 93
Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+S +S++++ LP S L+++ E+ + LE SQ++ + R V Y+ L
Sbjct: 94 GAASNFSDFVATLPTIEAINSTLFYSEDEM-KGLEGSQLQRFTLGRAQAVDAFYDALVQP 152
Query: 221 IFSKY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML-------- 269
+ S+ P +F + F ++ F+W+ G+++S + + V L P + +
Sbjct: 153 VTSREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQFGENEDDVILAPVLNTIGICTDLNQ 212
Query: 270 --NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
N +C ET + D +Q + Y G++V +S KS+ +L+LS+GF R +
Sbjct: 213 EGNEACP-ETSIKVDTDTQRLTVYASVAYSKGQEVRLSMPGKSSTQLMLSHGFA-RARAS 270
Query: 328 PSDSVELPLSLKKSD 342
D ++L ++L SD
Sbjct: 271 KLDKLDLTVTLDSSD 285
>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
Length = 467
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 78/341 (22%), Positives = 137/341 (40%), Gaps = 51/341 (14%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+ AL+++ GE + +P + +T + A + +++ + L L+ E S
Sbjct: 33 GVRALRDLHHGELIATIPKAACLTLLT----TAARDAIERARLGGGLGLTVALMYERSKG 88
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
K S+W Y+ LPRQ W+ E+D L +++ + E + + + +
Sbjct: 89 KGSKWYRYLKTLPRQESVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEENIAPLTK 148
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF----- 278
+ P FP + F E++ + ++ SR + + G +VP AD+ NH + E
Sbjct: 149 EDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHG-YGMVPLADLFNHKTDAEDVHFMLN 207
Query: 279 ------------------------LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+ DKS +V D G ++F +YG+ N L
Sbjct: 208 ASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKD--VAAGSEIFNTYGQLGNAAL 265
Query: 315 LLSYGFVPREGTNPSDSVELPLSL-------KKSDKCYKEKLEALRKYGLS-----ASEC 362
L YGF E NP D V L + + K +++ RK G S SE
Sbjct: 266 LHRYGFT--EPNNPHDIVNLDMDCVLEVLLSRFQKKRVRKRGRVWRKAGFSGCESQGSEY 323
Query: 363 FPIQITGWP-LELMAYAYLVVSPPSMKGKFEEMAAAASNKM 402
F I +G P +EL+ +++ S E+ AA ++
Sbjct: 324 FEISASGKPQIELLLLLFVIQSRARDCKALEDAAAKVKGRV 364
>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
Length = 440
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 121/257 (47%), Gaps = 19/257 (7%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 36 LKKWLKDRRFEDTTLIPAHFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKA-YTCPVCLEPEVVNLL-P 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
+ ++ +A E+ ++V ++ R S P LF E V F+ +W++ + +R V +
Sbjct: 153 NPLKTKAWEQRSHVWEFFSSSRGFFSSLQP-LFSEAVETIFSYRALRWAWCAVNTRAVYM 211
Query: 254 ---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
P D AL P+ D+LNHS +V+ +++ ++ T + ++VFI
Sbjct: 212 KRPPLLCLSPEPDT-CALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFI 270
Query: 305 SYGKKSNGELLLSYGFV 321
YG N LLL YGFV
Sbjct: 271 CYGPHDNHRLLLEYGFV 287
>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
Length = 449
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 73/265 (27%), Positives = 117/265 (44%), Gaps = 41/265 (15%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP-LLATYLIS 158
G RGL A +++ +GE +L VP S ++T+ S + +K+ + P +L L++
Sbjct: 46 AGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLA 105
Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERI----TNVIG 212
E S KSS W Y+ LPR +L +++ E + +A + ERAI + I
Sbjct: 106 EMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIP 165
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
+L+L+ ++ N + W+ + SR + +P D L P D N++
Sbjct: 166 LMEELKLK----------PQLQNFRAWLWASSTVSSRTMHIP-WDDAGCLCPVGDFYNYA 214
Query: 273 ------CEVETFLD---------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
C E D Y + F + Y+ GEQV +SYG +N
Sbjct: 215 APGEEPCGWEDLKDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTN 274
Query: 312 GELLLSYGFVPREGTNPSDSVELPL 336
ELL YGF+ E NP+D +PL
Sbjct: 275 LELLEHYGFLLDE--NPNDKAFIPL 297
>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
Length = 447
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 131/293 (44%), Gaps = 30/293 (10%)
Query: 82 WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
W +G L P+ ++ V GL+A ++ GE L VP S ++ S+ +C
Sbjct: 24 WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80
Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
+A E L+ SV W L L+ E + +SRW Y S Q +S L +W E
Sbjct: 81 LHDAQESLQSQSV--WVPLLLALLHEYT-TGTSRWRPYFSLW--QDFSSLDHPMFWPEEE 135
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
R L+ + I E + + N+ Y+ + L +PD+F E+ +E +K + +
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195
Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ P + +VP AD+LNH L+Y + +V T + G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLEYAPTCLRMV--TTQPISKGQE 253
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+F +YG+ +N +LL YGF N +D+ ++ + + + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKNEAQQQ 306
>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
Length = 447
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 131/293 (44%), Gaps = 30/293 (10%)
Query: 82 WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
W +G L P+ ++ V GL+A ++ GE L VP S ++ S+ +C
Sbjct: 24 WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80
Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
+A E L+ SV W L L+ E + +SRW Y S Q +S L +W E
Sbjct: 81 LHDAQESLQSQSV--WVPLLLALLHEYT-TGTSRWRPYFSLW--QDFSSLDHPMFWPEEE 135
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
R L+ + I E + + N+ Y+ + L +PD+F E+ +E +K + +
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195
Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ P + +VP AD+LNH L+Y + +V T + G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLEYAPTCLRMV--TTQPISKGQE 253
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+F +YG+ +N +LL YGF N +D+ ++ + + + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKSEAQQQ 306
>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
Length = 393
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 14/185 (7%)
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRE 201
+V W LA ++ E ++ + W+ YIS LP QP L W EL YL AS +
Sbjct: 160 TVKPWTKLALIVLME-RYKGQAIWAPYISCLP-QPAELDNTFRWEDTELS-YLRASPLYG 216
Query: 202 RAIERITNVIGTY----NDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPS 255
+A ER+ + + ND + + D++P+ ++E K + +FSR L
Sbjct: 217 KARERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGKVSLEDLKHVYATVFSR--SLAI 274
Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+ L+P D NH+ L ++ V T DR Y +Q++I+YG SN EL
Sbjct: 275 GEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELA 334
Query: 316 LSYGF 320
L YGF
Sbjct: 335 LDYGF 339
>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
Length = 705
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 76/155 (49%), Gaps = 9/155 (5%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
F E ++W+ + +R ++P+ DG +AL+P DM NH+ + T Y+
Sbjct: 348 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCEC 406
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
+ +Q GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y K E
Sbjct: 407 VALQDFQAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAE 464
Query: 351 ALRKYGLSA---SECFPIQITGWPLELMAYAYLVV 382
L + G+ S F + T P+ A+L V
Sbjct: 465 VLARAGIPTYVWSSVFALHFTEPPISAQLLAFLRV 499
>gi|414881221|tpg|DAA58352.1| TPA: hypothetical protein ZEAMMB73_556117 [Zea mays]
Length = 270
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/55 (65%), Positives = 42/55 (76%)
Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSI 429
MAYA+LVVSPP M FEEMAAA SNK +SK + P ++EQ LQFILD CES+I
Sbjct: 1 MAYAFLVVSPPDMSQCFEEMAAAKSNKTSSKPGLNYPGLEEQTLQFILDCCESNI 55
>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
Length = 245
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 99/206 (48%), Gaps = 13/206 (6%)
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVV 289
+FN E W+ + +R +PS DG AL+P DM NH T D++ +S
Sbjct: 1 MFN-EFSSWAVSTVMTRQNLVPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDRCE 58
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
R +Q GEQ+FISYG ++N + + GFV + N D +L L + K+D KE+
Sbjct: 59 CYALRNFQKGEQIFISYGPRTNSDFFVHSGFVYMD--NEQDGFKLRLGISKADSLQKERT 116
Query: 350 EALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK 409
E L K L + F ++ P+ M A+L V SM+ K E S+K+ K +
Sbjct: 117 ELLGKLDLPSVGEFLLKPGTEPISDMLLAFLRVF--SMR-KAELAHWLRSDKVFDLKHMD 173
Query: 410 CP---EIDEQALQFILDSCESSISKY 432
C ++E +F+L + I+ Y
Sbjct: 174 CALETVVEENVRKFLLTRLQLLIANY 199
>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
Length = 449
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 165/375 (44%), Gaps = 26/375 (6%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
S L +WL + G K+ + + + GER ++A I GE +L VP + ++T + +
Sbjct: 19 SNLLRWLEEGGARFPKLQLVRREDGERAVLAQAPISAGETVLQVPRTHMLTLELARES-D 77
Query: 137 AGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
G + + PD LA++L+ E E S W YI +LP + + +E L
Sbjct: 78 IGRAIAEGLDPDNEDLYLASFLLQEKHRE-GSFWKPYIDSLPESYSQMPLFYGSEEHALL 136
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
+ A+ +T+ + + L + P E F F W+ + SRL L
Sbjct: 137 KGC----FALTLLTHQAQSLREDYLSLCQNVPGY---ERFTPGEFVWARLSVSSRLFSLK 189
Query: 255 SMD--GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
G+ LVP ADMLNH + + + + V + G++V SYG KSN
Sbjct: 190 KGGFLGQT-LVPMADMLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSND 248
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWP 371
+LL +GFV + N D L L + D K L +A+ F I +
Sbjct: 249 LMLLHFGFVTDD--NEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHT 306
Query: 372 LELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESS 428
MA+++L ++ ++ E++ S+++ S + P E +E L+ + +C++
Sbjct: 307 TTRMAFSFLRIA-AAVPNDIEDI----SSRVMSGERALGPLSVENEENVLELLAATCQAR 361
Query: 429 ISKY-SRFLQVKELL 442
+S + + Q +ELL
Sbjct: 362 LSIFPTSLAQDEELL 376
>gi|219110715|ref|XP_002177109.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411644|gb|EEC51572.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 531
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 140/337 (41%), Gaps = 58/337 (17%)
Query: 92 KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL----------------VITADSKWSCP 135
K+ + V GLVA + IRKGE L +P + V++ D
Sbjct: 73 KVTVAPSSVNRLGLVATEKIRKGEVFLAMPYDVRYELSADLARNVVFKDVLSEDYNSWTG 132
Query: 136 EAGEVL-----KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTR 187
+AG + + C D L I + S + + S +++ALP P + L W+
Sbjct: 133 DAGLIALLILNEVCLAADTGLGTKEPIRQNSLQ--AFMSAWVAALP-GPEDINHPLLWS- 188
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-------EEV--FNMET 238
E D+ + S R + ++ L+ +F K + FP EE+ F++
Sbjct: 189 -EEDQEILQSSSTNRIYRVLDDIEEDVTWLKTNVFEKDGNRFPVSIPWNGEEIPCFSLTG 247
Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV-----ETFLDYDKSSQGVVFTTD 293
FKW+ + SR + D V L+P D NH+ E F+ +++G
Sbjct: 248 FKWAMALAQSRSFFV---DNAVRLLPLMDFCNHADEGTEEARAGFMGTFGTTKGAELVAG 304
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ Y+ GE+VFI YG KS + LL + F P + + S EL + D+ Y +KL+ L
Sbjct: 305 QSYEVGEEVFICYGPKSAADYLLEHAFCPEQSWKTAVS-ELFFEVDPKDRFYDDKLDILE 363
Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
AS P+Q ++ VVS P G+
Sbjct: 364 FETYDASPMDPVQ-----------SFDVVSAPGRDGE 389
>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
Length = 591
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 48/324 (14%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITN----------VIGTY---NDLRLRIFSKYPDLFPEEVFNMET 238
R L+++Q + N VI T+ N L L+ Y D + + ++
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDAFTYED-YRLGLVSLAL 253
Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
+W+ G+ + + G+ + ++ + CE D+ +
Sbjct: 254 GRWALGLECGVGI---ARCGKPQITTGYNLEDDRCECVALQDF---------------RA 295
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y K E L + G+
Sbjct: 296 GEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIP 353
Query: 359 ASECFPIQITGWPLELMAYAYLVV 382
S F + T P+ A+L V
Sbjct: 354 TSSVFALHFTEPPVSAQLLAFLRV 377
>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
lyrata]
Length = 541
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 126/276 (45%), Gaps = 23/276 (8%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E S L +W D+G+ K+ I +D RG +A ++++ G+ L +P S +I+ + ++
Sbjct: 141 EKESRLVEWGQDNGVK-TKLQIAHIDGYGRGAIASEDLKFGDVALEIPISSIISEEYVFN 199
Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAEL 190
+ K + ++ + + E S++ Y +L S EL
Sbjct: 200 SDMYPILEKIDGITSETMVLLWTMREKH-NLDSKFKPYFDSLQENFCTGMSFGVNAIMEL 258
Query: 191 DRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
D L E Q +E ER Y++L + + S + +FP E + E + W+ + +
Sbjct: 259 DGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHRHVFPPEHYTWEHYLWACELYY 310
Query: 248 SRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYDK---SSQGVVFTTDRQYQPGEQV 302
S +++ DG++ L+P A LNHS + Y K + + F R GEQ
Sbjct: 311 SNSMQIKFPDGKLKTCLIPVAGFLNHSI-YPHIVKYGKVCVETSSLKFPVSRPCNKGEQC 369
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
F+SYG S+ LL YGF+P+ G NP D + L +
Sbjct: 370 FLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 404
>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 377
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 101/221 (45%), Gaps = 15/221 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+ A +++++GE +L+VP LV S + PE LLA L E
Sbjct: 32 GMAAGRHLKEGEDILYVPTGLV---RSLHTVPEHVSRKLPSDTSIHALLAADLTVNGMTE 88
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
+ W + + L + + +L L A E + N +G ++ R+
Sbjct: 89 LAL-WRDCLPTLADFSTGMPFMWHKKLQELLPKP-----ARELLENQLGNFHRDWARVTK 142
Query: 224 KYPDLFPEEVFN--METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY 281
+PDL E+ + + SF ++ P D R+ALVP AD+ NH+ +T
Sbjct: 143 AFPDLQQEDYLHNWLAVSTRSFYYWTPQMELYPPAD-RLALVPIADLFNHA---DTGCGA 198
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
+ G V +TDR+Y G++++ISYG +N LL YGFVP
Sbjct: 199 SFTPDGFVVSTDRKYHVGQEIYISYGTHTNDLLLAEYGFVP 239
>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 563
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 71/142 (50%), Gaps = 12/142 (8%)
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETFK 240
++WT E+ R L+ S + + ER + G Y + DL+P +V +E FK
Sbjct: 213 IFWTEEEM-RLLQGSYLVTQVEERNQAIEGDYGVI--------CDLYPPFRDVATLEEFK 263
Query: 241 WSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
W+ + SR L R ALVP+ADMLNH ET YD + G TT + G
Sbjct: 264 WARMCVCSRNFGLDINGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTLHRILGG 323
Query: 300 EQVFISYGKKSNGELLLSYGFV 321
QV+ SYG+K N LL+YGF
Sbjct: 324 AQVYDSYGQKCNHRFLLNYGFA 345
>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 514
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 115/270 (42%), Gaps = 24/270 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA-- 160
RG+ +N+ GE L VP + A S + +LA +++ EA
Sbjct: 83 RGVATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFD 142
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT-----YN 215
+ KS+ W ++ LPR S + W EL S++ + T I Y+
Sbjct: 143 AGSKSAYWP-WLRLLPRDVDSTVGWNEDEL------SELSGSNVVVFTRAIKAQWRMEYD 195
Query: 216 DLRL-RIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWAD 267
L + + K+PD+F E + + F W+ I++SR + L + LVP D
Sbjct: 196 ALDVPTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESAEAPTIRVLVPLLD 255
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
M NH+ + ++D S V ++ ++ +Y K + LL YGF+P TN
Sbjct: 256 MANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGFIPE--TN 313
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
P++ VE + + D K E LR +GL
Sbjct: 314 PAECVEATVRVSDHDSLRDAKEELLRLHGL 343
>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
Length = 457
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/285 (25%), Positives = 126/285 (44%), Gaps = 37/285 (12%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + G + + RGL+ K ++ G+ ++ +P ++T + S G
Sbjct: 35 LKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISLPEKCLLTTGTVLSSCLGG 94
Query: 139 EVLKQCSVPDWPLLA--TYLI------------------SEASFEKSSRWSNYISALPRQ 178
+ K P PLLA T+LI +E + S W Y+ LP+
Sbjct: 95 HIEKW-KPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKHAGQKSPWKPYLDVLPK- 152
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFN 235
Y+ ++ L ++++A E+ + + R FS LF E+ +FN
Sbjct: 153 AYTCPACLEPDIINLL-PKPLQKKAQEQKMLIQELFQSSR-AFFSSLQPLFAEDTGNIFN 210
Query: 236 METFKWSFGILFSRLVRLP-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQG 287
+W++ + +R + + S++ V AL P+ D+LNHS V+ +++ ++
Sbjct: 211 FSALQWAWCTVNTRTIYMKHPHRECFSLEPDVYALAPYLDLLNHSPNVQVKAGFNEQTRS 270
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
TD Q + ++V I YG N LLL YGFV + NP SV
Sbjct: 271 YEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGFVATD--NPHSSV 313
>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
magnipapillata]
Length = 819
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 145/334 (43%), Gaps = 50/334 (14%)
Query: 32 PRKRCGHRIVVHCSVSTT-----NDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDS 86
P+K C R + +T N+ K+ QN+ W E L +W +
Sbjct: 55 PKKFCNIRDIPIMQPNTVITFSDNEKDNDKSVGGQNIEQWTVEKKLL----NFLQWCKAN 110
Query: 87 GLP-PQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADS-----------KWS 133
L K+ + R G++A ++I+KGE L VP L++ ++ KW
Sbjct: 111 NLNLSSKVKVDFNGTSHRYGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKW- 169
Query: 134 CPEAGEVLKQCSVPDW-PLLATYLISEASFEKSSRWSNYISALPR-----QPYSLLYWTR 187
G+ L S W PLL T L+ E + +K S W++Y+ +P P L+W
Sbjct: 170 LDTHGKSLNDSS--GWLPLLIT-LMWEFN-QKDSFWASYLLLVPEISEFGHP---LFWKE 222
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGIL 246
E + + + I N+ Y + L + DLF E +++E FK +
Sbjct: 223 EEYNLEFQGMPLLNDIIVDRENIETEYAEFVLLFLRRNKDLFGSLENYSLEFFKRMVAFV 282
Query: 247 ----FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
F+ PSM VP AD+LNH L + KS+ ++ + R+ + GE+V
Sbjct: 283 MAYSFTEDEESPSM------VPMADILNHHSNNNAHLVFHKSNLQMI--SIRRIKKGEEV 334
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
F ++GK N ELL YG+V +N DS+ LP+
Sbjct: 335 FNTFGKLGNTELLQMYGYVEI-PSNQYDSLLLPV 367
>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
Length = 433
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 129/289 (44%), Gaps = 37/289 (12%)
Query: 74 ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
E L W +G + P D+G G+ A N+R +L +P SLV++ +
Sbjct: 5 EGLKVLLDWFKSNGGSVQPHVEFASYPDMG-CGMRATSNLRSETELFSIPRSLVLSVHTS 63
Query: 132 ---WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
S P+ E+ S W L L+ E + +S W Y++++P SL++W+
Sbjct: 64 PLPKSLPDWSEI----STQGWVGLILCLMYE-QIDPASHWKRYLNSMPTCFDSLMFWSDD 118
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PEEVFNMETFKWSFGILF 247
EL R L+ S + ++ I R G+Y + + SK+ D+F P E +++ + ++
Sbjct: 119 EL-RELQGSSVLDK-IGR-EEAEGSYYSILVPYLSKHADIFKPLEAYSLALYHRCGSLIL 175
Query: 248 SRLVRLPSMDGR-----------------VALVPWADMLN-HSCEVETFLDYDKSSQGVV 289
SR + + D V +VP AD+LN S L Y +V
Sbjct: 176 SRSFHVSNQDDSASDASDDDDAAYHEVETVGMVPMADVLNAKSGSANACLVY--HPDALV 233
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
TT ++ GEQ+F +Y N +LL YG V + N +D+VE+ L
Sbjct: 234 MTTTKEIAAGEQIFNTYNDPPNADLLRRYGHV--DEVNLNDNVEISADL 280
>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
Length = 451
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 108/252 (42%), Gaps = 66/252 (26%)
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPR---QPY-----SLLYWTRAELDRYLEAS 197
+P LLA YL KS+ + Y+ +LP+ PY L+Y + L R +E +
Sbjct: 109 LPFQALLAFYL----CVTKSAHFDAYLQSLPQTFSNPYFCTKQELVYLSEVLLQRMVEQN 164
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
+ + +ERI +V+ D + + V +E FKW++ ++ +R V L M
Sbjct: 165 GLIKSGLERINSVL--------------RDEWKDTV-ELERFKWAYFVVNTRSVFLDPMA 209
Query: 258 GRV---------------------ALVPWADMLNHSCEVETFLDYDKSSQGVV------- 289
++ AL P+ D NH C +T S+ +
Sbjct: 210 VKMINSFLPSGSLFEDFLADEPSMALAPFLDFFNHRCGAKTVNGLSLSTSQIRDCLLKER 269
Query: 290 -------FTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
TD Y+ GEQ+FISYG +N +LLL YGF +P +NP D VEL + +
Sbjct: 270 PLELYYNLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIP---SNPDDFVELTIGTINA 326
Query: 342 DKCYKEKLEALR 353
+ +L LR
Sbjct: 327 FMKHDPELRCLR 338
>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 487
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 114/272 (41%), Gaps = 49/272 (18%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
P ++A + RG+VA NI +GE+L VP ++V+T + GE L++ P
Sbjct: 34 PKLRLADLRATGAGRGVVAQSNISEGEELFSVPRAMVLTVQNSELRTLLGENLEEQMGP- 92
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER-----A 203
W L ++ E + SRW+ Y LP + +L++W+ AEL + L+AS I E+ A
Sbjct: 93 WLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSPAEL-QELQASTIVEKIGRSGA 151
Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFP--------------EEVFNMETFKWSFGILFSR 249
E I N I I +K PDLFP + + S + ++
Sbjct: 152 EESIRNSIAP-------ILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAF 204
Query: 250 LVRLPSMDGR--------------------VALVPWADMLNHSCEVETFLDYDKSSQGVV 289
+ DG +VP AD+LN + Y + +V
Sbjct: 205 DIEKSEDDGDEGEANDESYMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEG-ALV 263
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ Q GE++F YG+ +LL YG+V
Sbjct: 264 MKAIKPIQQGEEIFNDYGEIPRADLLRRYGYV 295
>gi|242009061|ref|XP_002425311.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
gi|212509085|gb|EEB12573.1| SET domain-containing protein, putative [Pediculus humanus
corporis]
Length = 399
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 122/232 (52%), Gaps = 17/232 (7%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLI 157
D G RG+ K + KG+ L+ +P +L+IT S+ +A + L ++ D L L+ +L+
Sbjct: 45 DTG-RGVKCRKKLEKGDLLIALPLNLLITPTSQ---SDAYKFLNDENIVDPQLRLSIFLM 100
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYW-TRAELDRYLEASQIRERAIERITNVIGTY-- 214
E + S++ NYI LP Q YS +Y+ T +E+ L I++ + + T++ +
Sbjct: 101 YENHLKNDSKYFNYIQTLP-QSYSNVYFCTDSEIQ--LLPDLIKKLVVTQKTDLEFLFEK 157
Query: 215 --NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADML 269
N+L I S + D ++++N F W++ + +R V R +AL P+ DM
Sbjct: 158 LQNNLNDEICS-HCDKSIKKLYNRYEFIWAWFTVNTRSVYYEDKSMRKKSLALAPFLDMF 216
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
NHS + T + D ++ + T ++ +Q+FI YG SN +LL+ YGF+
Sbjct: 217 NHSSDANTKMYIDFDNELYILKTLNSFRKHQQIFIKYGPHSNLKLLIEYGFI 268
>gi|268573124|ref|XP_002641539.1| C. briggsae CBR-SET-27 protein [Caenorhabditis briggsae]
Length = 483
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 130/314 (41%), Gaps = 65/314 (20%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA +L + S+W +YIS LP + L++T +L + L+ S I E A+ +
Sbjct: 136 LALFLAVHWLQNEKSKWHSYISILPNSFPTPLFYTEEQLLQ-LKPSPIFEEALTFYRTIA 194
Query: 212 GTY-----------------------NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
+ N + + IF P F F + WS G + +
Sbjct: 195 RQFCYFLLAVAKNKIYESAQRRKDARNTMDVPIFYNAP--FTVYNFTPRLYFWSVGTVTT 252
Query: 249 RLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLD----YDKSSQGVVFTTDRQ 295
R+ +PS +G AL+P DM NH V +D Y + + V T+
Sbjct: 253 RVNMVPSENGSGDDGKAIMIPALIPLLDMANHESVVTDPVDDLVCYAPADECAVITSHCD 312
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
+ G++V I YG +S GE L+ GF+P D +L + + KSDK + K + + KY
Sbjct: 313 LEAGKEVTIFYGCRSKGEHLIHNGFIPINHQK-QDFFKLKIGIPKSDKTLEAKKKLIEKY 371
Query: 356 GLSA---SECFPIQITGW-----PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD 407
+ F + + + PL+L+ +A + V P E AA K
Sbjct: 372 VQNVYCTGNIFHVDLYNYPEQPFPLDLLMFAAIFVCP--------EATDAAITK------ 417
Query: 408 IKCPEIDEQALQFI 421
PEI ++ L+F+
Sbjct: 418 ---PEIRKKGLEFL 428
>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
Length = 502
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/328 (22%), Positives = 135/328 (41%), Gaps = 51/328 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW------PLLATYLI 157
GL A + ++ +LL VP +++ W +LK+C D + ++
Sbjct: 109 GLEATHSFKQDAELLRVPRKAMLS----WDQARKSAMLKKCFEQDMIVKTMDNVALALMV 164
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-----LEASQIRERAIER--ITNV 210
S W Y+ ALP+ + LY++ EL + E S I R + R + +
Sbjct: 165 CCQKLSPDSSWLPYLDALPQTFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQFVYFL 224
Query: 211 IGTYNDLRLR---------------IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
R R +F P F F + ++W+ + +R+ +PS
Sbjct: 225 AAVQRSERSRSAKKDKNHAAVGMEPLFLNAP--FTVSNFTFDLYRWAVACVTTRINFIPS 282
Query: 256 MDGRVA---------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
+ + L+P DM NH + + + + Y+ G++V I Y
Sbjct: 283 QYAKDSNGQPVAVPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFY 342
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPI 365
G ++N + L GFVP +G N +D+ +L + + DK + +L+ + G +A S F
Sbjct: 343 GIRTNRQFFLHNGFVP-DGENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVF 401
Query: 366 QITG----WPLELMAYA--YLVVSPPSM 387
++ PL L+ +A +LV +P S+
Sbjct: 402 EVNASERPVPLSLLDFARVFLVENPDSV 429
>gi|189237481|ref|XP_001810520.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270006984|gb|EFA03432.1| hypothetical protein TcasGA2_TC013422 [Tribolium castaneum]
Length = 413
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 150/338 (44%), Gaps = 44/338 (13%)
Query: 70 IDSLENASTLQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
+D ENA L+K+++ +G P + ++ RG+ +N+++ + L+ VP L+I+
Sbjct: 17 LDHEENAINLRKFMARNGFNDPINLKLRNFPDTGRGVATPRNLKESDVLITVPYELMISY 76
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY---SLLYW 185
+ + + LL +L+ E E +S W +YI +LP QP +LL
Sbjct: 77 TTLQKSNFLHLFTPESRLSIVDLLTAFLVIERDKE-NSFWRDYIKSLPPQPPWIPALLSQ 135
Query: 186 TRAEL---DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
R EL D L A + R R +E +++ LR I + V ++ +F W
Sbjct: 136 DRVELLPADLRLAAKKSR-RLLEE------SWSRLRKSIRRE-----ASCVIDLHSFIWG 183
Query: 243 FGILFSRLVRLPSMDGR---------------VALVPWADMLNHSCEVET--FLDYDKSS 285
+ ++ +R V + R +AL P+ DM NHS E +T L D+
Sbjct: 184 YVLVNTRAVYVNPRIVRELCDCGSDILSDEPCMALCPFLDMFNHSHEAKTEATLMNDQGK 243
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSD 342
TT + EQVFISYG N +LL+ YGF +N P S E+ L+ +
Sbjct: 244 FVYQLTTLVGTRKHEQVFISYGDHDNVKLLIEYGFFIPGNSNDSIPIQSEEVFRVLEPNL 303
Query: 343 KCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYL 380
++ K +R + L +C + G L A+ ++
Sbjct: 304 NDFQYKF--IRSHNL--DKCLYLTEAGASFNLKAFLFV 337
>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 510
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 124/255 (48%), Gaps = 24/255 (9%)
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASF 162
+ +N+ G +LL +P S V++ +S + G +L+ PD P L +L+ E +
Sbjct: 107 ITAENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEVN--PDLPSAIALGLHLLEERAL 163
Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+S +S++++ LP S L+++ E++ LE SQ++ + R V Y+ L
Sbjct: 164 GAASNFSDFVATLPTIEAINSTLFYSEDEMNE-LEGSQLQRFTLGRAQAVEAFYDALVQP 222
Query: 221 IFSKY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML-------- 269
+ S+ P +F + F ++ F+W+ G+++S + + V L P + +
Sbjct: 223 VTSREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQFGENEDDVILAPVLNTIGICTDLNQ 282
Query: 270 --NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
N +C ET + D +Q + Y ++V +S KS+ +L+LS+GF R +
Sbjct: 283 EGNEACP-ETSIKVDTDTQRLTVYASVAYSKSQEVRLSMPGKSSTQLMLSHGFA-RARAS 340
Query: 328 PSDSVELPLSLKKSD 342
D ++L ++L SD
Sbjct: 341 KLDKLDLTVTLDPSD 355
>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
Length = 403
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 66/254 (25%), Positives = 118/254 (46%), Gaps = 14/254 (5%)
Query: 81 KWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
+WL+++G + ++ D RG+ A +++ E L+ VP +IT + +
Sbjct: 36 QWLTENGGKFADCVELRSYDDEVRGVHATRDLETEEILVEVPLKCLITVEMGKATDVGRA 95
Query: 140 VLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLE 195
VL+ + P L +++ + + S+ ++ Y LP ++ ++W EL+ +L+
Sbjct: 96 VLEAELELDAPKHVFLMLFVLLDRR-DSSTFFAPYYDILPSTLSNMPIFWQPDELE-WLK 153
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
S + + ER + Y I +P +V +E FKW+ + SR +
Sbjct: 154 GSYLLTQIEERKRAIKADYE----AICGIWPSFI--DVCTLEEFKWARMCVCSRNFGVVV 207
Query: 256 MDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
R A+VP+ADMLNH ET +D S T+ ++ G Q++ SYG+K N
Sbjct: 208 NGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKISVGSQIYDSYGQKCNHRF 267
Query: 315 LLSYGFVPREGTNP 328
LL+YGF + P
Sbjct: 268 LLNYGFAIEDNKEP 281
>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
Length = 353
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 109/243 (44%), Gaps = 43/243 (17%)
Query: 101 GERGL--VALKNIRKGEKLLFVPPSLVITADSKWSCP----EAGEVLKQCSVPDWPLLAT 154
G+ GL +K++++GE LL +P ++ D +CP GE+ + ++P + LA
Sbjct: 44 GDNGLELRLVKDVKRGEVLLAIPRRAILEIDDAATCPCKEYITGEMWQ--AIPSYAKLAI 101
Query: 155 YLI-SEASFEKSSR-WSNYISALPRQPYSLLYWTRAEL----DRYL-EASQIRERAIERI 207
YL+ S E+ R +Y LP+Q S W+ + D Y+ E Q R R I+R+
Sbjct: 102 YLLYSIDHAEQDPRPLRDYFDVLPKQVLSTFSWSEEAIQELQDPYMIEQIQTRRRKIQRL 161
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
+ I L RI + W+ I+ SR G D
Sbjct: 162 FHEI--QKGLSPRI-------------TYDRLLWAIEIVLSRAFAFSRTGG-------DD 199
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
++ V+ YD S Q ++ ++ G+ V ISYG KSN ELLLSYGF+ + N
Sbjct: 200 LVFSGTSVK----YDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHELLLSYGFILPD--N 253
Query: 328 PSD 330
P D
Sbjct: 254 PED 256
>gi|308810511|ref|XP_003082564.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116061033|emb|CAL56421.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
Length = 1472
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 117/245 (47%), Gaps = 15/245 (6%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RG ++++++G+ LL VP + D + E E+ K C D ++A ++ E
Sbjct: 692 RGHGVVRDVQRGDVLLEVPLRRGFSYDDAMADDEMREIAKACVRRD-DVVALHVCLERYR 750
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE-RAIERITNVIGTYNDL--RL 219
K ++ + ++ ALP+ W+ EL + + +++ RA+ I Y+ + RL
Sbjct: 751 GKEAKHAAHVEALPKTFDCAFNWSEDELSELVGTTCLKDTRAL--IEETREDYDAIGRRL 808
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVET 277
K L E + E + W+ L+SR L DG A++P+ D+ NHS E
Sbjct: 809 MAMGKGGWLL-ERGVDYERYAWARQCLWSRQCDLMRPDGTRTRAMIPYFDIFNHSPEAPL 867
Query: 278 FLDYDKSSQG--VVFTTDRQYQPGEQVFISY--GKKSNGELLLSYGFVPREGTNPSDSVE 333
+ +++ V R Y+ GEQ FISY G+ +N +LL YGF NP + ++
Sbjct: 868 GKTHKLNAERNCVTVYAGRDYKEGEQAFISYGSGEAANAKLLTWYGFCIE--NNPYEELD 925
Query: 334 LPLSL 338
L L++
Sbjct: 926 LTLTI 930
>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
Length = 447
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 139/323 (43%), Gaps = 52/323 (16%)
Query: 73 LENASTLQKWLSDSGLPPQKMAI---QKVDVGERGLVALKNIRKGEKLLFVPPSLVI-TA 128
+ +++ +W + +G K + + VD RGLVA+ +I+ L +P +V+ T
Sbjct: 1 MTDSAKFLEWFTTNGGEFSKDIVAIGENVDGMGRGLVAVADIKAQTSLFTIPRDIVLSTR 60
Query: 129 DSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
S + +V KQ ++ W L + E + SS+W Y LP+Q SL++W
Sbjct: 61 TSSFKEKVGQDVYKQLENDNIGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQFTSLMFW 120
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
++ EL + + + +E I N D+ + + + D+ + ++ FK +
Sbjct: 121 SKEELSLLKGTTVVDKIGLEDIENEFERVRDIVKQNENVFGDIAN---YTLDLFKRMGSL 177
Query: 246 LFSR-------------------------LVRLPSMDGRVALVPWADMLNHSCE-VETFL 279
+ SR + L + VA+VP AD+LN + V
Sbjct: 178 ILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRTDSVNAHT 237
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--------PREGTNPSDS 331
+Y+++ ++ D + G+Q+F +Y N +L+ YG V P N +D
Sbjct: 238 EYEENCLRMISLQD--IKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMGNKNDV 295
Query: 332 VELP------LSLKKSDKCYKEK 348
VELP L+L + + +KE+
Sbjct: 296 VELPADILLELALPDAKESHKER 318
>gi|341877649|gb|EGT33584.1| CBN-SET-27 protein [Caenorhabditis brenneri]
Length = 501
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 155/365 (42%), Gaps = 55/365 (15%)
Query: 71 DSLENASTLQ---KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT 127
+S+ +A T++ KW ++G+ + I L A I KG + VP + ++T
Sbjct: 66 NSVRDAETIKAFLKWSDENGIARNNVTIGPTKTSGLSLQATGPIPKGHIVARVPRNAMMT 125
Query: 128 ADSKW---SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
D+ S +A E + + D LA +L + + SR+S YI+ LP + L+
Sbjct: 126 LDNARKSNSLRKAFEKDQIVAGMDNVGLALFLATHWMQNEKSRFSPYIAILPNCFPTPLF 185
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTY-----------------------NDLRLRI 221
+T +L + L+ S I E A+ + + N + + I
Sbjct: 186 YTEEQLLQ-LKPSPIFEEALTFYRTISRQFCYFLMAVSKNKMYEAAQRRKDARNTMEVPI 244
Query: 222 FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS---MD------GRVALVPWADMLNH- 271
F P F F + W+ G++ +R+ +PS +D AL+P+ DM NH
Sbjct: 245 FYNSP--FTVANFTSRLYFWAVGVVTTRVNMVPSETLIDKDEKPIAIPALIPFLDMANHE 302
Query: 272 ----SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
+E + Y + V T+ G +V I YG +S GE L+ GFVP
Sbjct: 303 NFETDGPIEDLVCYSPLEECAVITSHCDMDAGREVTIFYGCRSKGEHLIHNGFVPL-NHG 361
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSA---SECFPIQITG-----WPLELMAYAY 379
+ +++ + + K+DK K + + KY + F + + +PL+L+ +A
Sbjct: 362 KQEIMKMKIGIPKTDKNLDVKKKLIEKYVANVFCTGNIFHVDLYNHPEHPFPLDLLMFAA 421
Query: 380 LVVSP 384
+ VSP
Sbjct: 422 IFVSP 426
>gi|255086705|ref|XP_002509319.1| predicted protein [Micromonas sp. RCC299]
gi|226524597|gb|ACO70577.1| predicted protein [Micromonas sp. RCC299]
Length = 784
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 89/211 (42%), Gaps = 45/211 (21%)
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
++ S + W Y+ LPR+ SL+ W+ ELD L+ S++ +RA ERI Y++
Sbjct: 279 LAAGSAPHDAHWKAYVDLLPREVDSLIEWSENELD-ALQGSRLADRARERIALADSVYDE 337
Query: 217 LRLRIFSKYPDLF----------------------------PEEVFNMETFKWSFGILFS 248
+ R+ P L+ ++ + E F+W++ + +
Sbjct: 338 VFPRLNDADPTLWMSGKLGSAVAGGTGIDVTAAARKKGERARDKYTSKEAFRWAWATVLA 397
Query: 249 RLVRLPSM--DGRVALVPWADMLNHS-----CEVETFL------DYDKSSQG---VVFTT 292
R LP + DG + L P D+ NH CEV L D D G V+
Sbjct: 398 RAFSLPDVGEDGEMGLCPGLDLFNHGSEAEKCEVRGVLGASLELDEDDPQVGPRIVLRAG 457
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
+ GEQ+F Y +++G LL +GF R
Sbjct: 458 VGGAESGEQLFHDYADRASGGSLLEFGFTHR 488
>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
Length = 428
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/268 (23%), Positives = 118/268 (44%), Gaps = 21/268 (7%)
Query: 79 LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L W+ +G K+ ++ RG+VAL+ + GE L +P SL+I+ +
Sbjct: 31 LLTWMEANGFRLHSKLGLRDFPDTGRGVVALEKLVGGETFLKLPTSLLISTRTALQSLLH 90
Query: 138 GEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + + +L +++ + ++SRW ++ +LPR + ++ R +
Sbjct: 91 SFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPRTFTTPVFLRRTVFESL--P 148
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-----EVFNMETFKWSFGILFSRLV 251
+RE RIT++ T+ L++ + + + PE F F W++ + +R +
Sbjct: 149 KDLREEVHTRITSIQRTFLKLKV-LLGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCI 207
Query: 252 RLPSMDG-------RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ AL P+ D LNH + ++ + + + + EQVFI
Sbjct: 208 FAQGSNSSSLWENDHCALAPFLDCLNHHWKAS--IETAMVGENFEILSHKSHDANEQVFI 265
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV 332
SYG SN L L YGFV + NP+D V
Sbjct: 266 SYGPHSNRRLFLDYGFVLPD--NPNDVV 291
>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1033
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 116/265 (43%), Gaps = 43/265 (16%)
Query: 82 WLSDSGLP-PQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
W+ +G +K++I + G+ RG+V LKNIR+GE L +P + + +
Sbjct: 535 WMEGNGFSISEKLSITHLLAGDGKLVRGVVVLKNIRRGETLCNLPLDMGLYDN------- 587
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
E + V W A L+ E + SS W++YI+ LP+ + EL
Sbjct: 588 --ETIVAGEVDSWDRAAARLLREKAKGSSSAWASYINILPQNMTVPILLEDHELHEVQWW 645
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLP 254
+RE +R I + L +++ + E ++W+ ++ SR LP
Sbjct: 646 PVLRELV------------QVRKSIRESFSLLSVDDLAGADFEEYRWAAMMVHSRAFTLP 693
Query: 255 SM-DGRVA---LVPWADMLNHSCEVETFLDYDKSSQ-----GVVFTTDRQYQPGEQVFIS 305
D A ++P+ DM+NH + D SQ V R + GE++F S
Sbjct: 694 VFADDHYAPYVMMPYMDMINHHYHYQA----DWMSQPIWGGKVEIVARRDIKKGEELFAS 749
Query: 306 YGKKSNGELLLSYGFVPREGTNPSD 330
+G ++N L L YGFV ++ NP D
Sbjct: 750 FGPRANDNLFLYYGFVLKD--NPFD 772
>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
tauri]
gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
[Ostreococcus tauri]
Length = 446
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 156/352 (44%), Gaps = 30/352 (8%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--AGEVLKQCSVPDWPLLATYLIS 158
ERG+ +++ +GE L VP ++ S + G + + D +LA +++
Sbjct: 21 AERGVATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLD-GILAAHVLR 79
Query: 159 EASF--EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE--RAIERITNVIGTY 214
EA E+S+ W ++ LP + + + W EL R L+ S + RAI++ + Y
Sbjct: 80 EAFGLGERSAFWP-WLRLLPSETDAAVGWDEDEL-RELQGSNVVAFARAIKK--SWREEY 135
Query: 215 NDLRLRIFS-KYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRVA-----LVPW 265
+ L +P+ F E + E F W+ +++SR + L + D A LVP
Sbjct: 136 DALDFAGLGVDFPEAFGGEHAAHYTFEKFTWARFVVWSRAIDLKT-DSTSAPVIRMLVPI 194
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
DM NH+ + +D + V ++ ++ +Y K + LL YGF+P
Sbjct: 195 LDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLLQYGFIPE-- 252
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQITGWPLELMAYAYLVVSP 384
NP++ VE+ + L + D + K LR++GL ++ F ++ G +L+A A ++
Sbjct: 253 ANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLDYDLLAAARIIAMD 312
Query: 385 PSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFL 436
S +A + S S K+ D + +L S +S+ Y L
Sbjct: 313 ESELDDDTSVALSVSGASVSAKN------DARTKAVLLKSLITSLDGYGTTL 358
>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
Length = 984
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 128/294 (43%), Gaps = 40/294 (13%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA- 137
+ WL + ++ ++G RG +A ++I GE+L +P LV+T +
Sbjct: 4 FEGWLQANEARYPRLTFAVSELGGRGGIATEDILPGEELCSIPVRLVLTTEIARKSEVGR 63
Query: 138 ----------GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
GE L+ + +L YLI + + + + W Y+ +LP+ + R
Sbjct: 64 LVAAHLNAVQGERLRVSA--GRAILCAYLIHQRA-AQDAFWGPYLRSLPK------HDDR 114
Query: 188 AELD-RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
+ D ++L + + E+ + +++ L + +P +FP ++F + F W+F
Sbjct: 115 PDEDIQHLAGTNLFYAMQEKQQQIRESFDLLFPALCHAHPTVFPPDLFTWDHFLWTFTAC 174
Query: 247 FSR-----LVRLPS------------MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
SR LV+ P+ ++ L+P DMLNH + D S+ +
Sbjct: 175 SSRSFPQTLVQQPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLK 234
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
F T+ + G + F +YG K N ELL+ YGF + N D V + LS + K
Sbjct: 235 FVTEDTVEKGTEAFNNYGPKGNEELLMGYGFCIED--NEQDYVMIRLSFSPAGK 286
>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
Length = 688
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 107/237 (45%), Gaps = 31/237 (13%)
Query: 103 RGLVALKNIRKGEKLLFV-----------PPSLVITADSKW----SC-PEAGEVLKQCSV 146
RGL A + +R GE++L + P L S W SC PE E+ +V
Sbjct: 437 RGLFASRPVRAGERVLEISLDLMIAPTRLPDQLSTLQSSAWAPYISCLPEPAEL--DNTV 494
Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERA 203
D+ + +S+ +SS W+ YIS LP +P L W EL YL AS + +
Sbjct: 495 LDYRVF----VSQKFQLQSSAWAPYISCLP-EPAELDNTFLWEDTELS-YLRASPLYGKT 548
Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
ER+ + + ++ +P LF + ++E F + +FSR + + D + ++
Sbjct: 549 RERLEIITTEFGQVQ-NALDVWPQLFGK--VSVEDFMHVYATVFSRPLAI-GEDSTLVMI 604
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
P D NH+ L ++ V T DR +Q++I+ G SN EL L YGF
Sbjct: 605 PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661
>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
reinhardtii]
Length = 411
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 151/353 (42%), Gaps = 72/353 (20%)
Query: 87 GLPPQKMAIQKV--DVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ 143
GL +Q++ DVGER +VA +++R E ++ +P +L +T S P G + +
Sbjct: 19 GLKADACGVQRMTGDVGERVAIVAARDVRDKETVMVIPENLAVTRVDAESHPVVGPLAAE 78
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
S + L +L++E + S ++ ++ LP S L W+ AEL+ + S + A
Sbjct: 79 AS--ELTALTLWLLAERAAGAGSNYAGLLATLPESTLSPLLWSDAELEELMAGSPVLPEA 136
Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
R + T+ L ++ + P FP GR A
Sbjct: 137 RSRKKALADTWAALAPKLAAD-PARFPA--------------------------GRRAA- 168
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
+ +GVV G ++ ++ G+ NGELLL+ G +
Sbjct: 169 -------------------GARKGVVVWDG----AGSEMLLNDGR-PNGELLLATGTL-- 202
Query: 324 EGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLV-V 382
+ N SD + P L +D+ Y K + L G SA+E FP+ P++L+AY L V
Sbjct: 203 QDNNSSDFLSWPAGLVPADRYYMMKSQVLESMGYSAAEEFPVYADRMPIQLLAYLRLSRV 262
Query: 383 SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESSISKYSR 434
+ P++ K T + D++ +++E + LQ ++ C ++ Y++
Sbjct: 263 ADPALLAKC-----------TFEADVELSQMNEYEILQILMGDCRERLASYTK 304
>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
Length = 591
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 115/251 (45%), Gaps = 23/251 (9%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV-----PDWPLLATY 155
GERG+ + +I E++ +P ++++ S + A K V + L
Sbjct: 39 GERGVRVISDIAPCEEMFSIPEKILMSRKSCMASSIAHVFRKHKDVLFSSRDELALTLLI 98
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L + +S W I LP P + W+ EL + S ++ A+ + ++ TY
Sbjct: 99 LYEKLDQGNASFWKPMIDILPADPGAASKWSEEELQELQDES-LKAEAMIVVASMQQTYQ 157
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNH 271
+ I ++ D+F + + E F+W+ + F R + PS +VP+AD+LNH
Sbjct: 158 RVLRPILVQHGDVFSVDRYTWEEFRWALLCVESRTFGRFLPHPS------IVPFADLLNH 211
Query: 272 SCEVETFLDYDKSSQGVVFTTD----RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
V+T + + + D ++ GE+ F+SYG +SN ELLL YGF + +N
Sbjct: 212 -VNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFMSYGPRSNAELLLHYGFALQ--SN 268
Query: 328 PSDSVELPLSL 338
++VEL +
Sbjct: 269 RYEAVELNFRI 279
>gi|351697762|gb|EHB00681.1| SET domain-containing protein 6 [Heterocephalus glaber]
Length = 486
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 123/279 (44%), Gaps = 19/279 (6%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV- 146
L P+ ++ V G+VA ++++ GE L VP + ++ S +C G + ++ V
Sbjct: 73 LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SPHTCSIGGLLERERDVL 129
Query: 147 ---PDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
W L L+ E +S WS Y + P + ++W E R L+ + + E
Sbjct: 130 QSQSGWVPLLMALLHELQ-APASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVPE 188
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
+ + N+ G Y + L +PDLF V ++E ++ ++ + + P +
Sbjct: 189 AVDKDLVNIRGEYYAIVLPFMEAHPDLFGPSVRSLELYRQLVALVMAYSFQEPLEEEEEE 248
Query: 262 -------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+VP AD+LNH + L+Y +V T R G ++F +YG+ +N +L
Sbjct: 249 KEPNSPLMVPAADILNHLANHNSNLEYSADYLRMVAT--RSIPKGHEIFNTYGQMANWQL 306
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ YGFV N D+ ++ + + K EA R
Sbjct: 307 IHMYGFVEPYPHNTDDTADIQMVTVREAALQGVKEEAER 345
>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
74030]
Length = 483
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 124/266 (46%), Gaps = 34/266 (12%)
Query: 81 KWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+WLS G+ P + K + RG+VA + + E + +P + V+ ++ ++ ++
Sbjct: 15 EWLSKIGVRINPKMTLKDLKSEGRGRGVVAAADFEEDEVVFCIPRTAVLNVNNVFAGQDS 74
Query: 138 G---EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
G E L Q +P+W L ++SE + SRW+ Y++ LP++ SL++W+ EL L
Sbjct: 75 GASKEALLQ--MPNWLALTATMMSEGQ-QSDSRWAPYLAVLPQKLDSLVFWSEEELAE-L 130
Query: 195 EASQI-----RERAIERITNVI-----GTYN-DLRLRIFS---KYPDLFPEEVFNMETFK 240
+AS + R A E T I G +N +L ++ S Y PEE E K
Sbjct: 131 QASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAK 186
Query: 241 WSFGILFSRLVRLPSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
G L S DG ++++P ADMLN E Y ++ + T +
Sbjct: 187 QENGGAEGETDDLVSDDGEDEKTILSMIPLADMLNADAERNNARIY-YENEDLEMRTIKP 245
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFV 321
GE++F YG+ +LL YG+V
Sbjct: 246 IMAGEEIFNDYGQLPRSDLLRRYGYV 271
>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
nagariensis]
Length = 593
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 132/301 (43%), Gaps = 17/301 (5%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
RGL A + G+ +L VP L+I+ ++ + G+VL + D + + E
Sbjct: 193 RGLRADTAVAPGDVVLHVPADLLISYETAKKS-DLGKVLSALPLDLSDDSIALIWTCVE- 250
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE---RAIERITNVIGTYNDL 217
E + + + +ALP +S E LE + + RA + ++ + +
Sbjct: 251 RHEPEAPHAPFWAALPHS-FSTALSASQEDVALLEGTPLHGDAVRARQHLSEAFESSSPA 309
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNHSC-- 273
+ YPD F E F+ E++ W+ + +S V+ S D R L P+ ++NH
Sbjct: 310 FRSLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLP 369
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
V F D + + R + G Q+F+SYG SN +LLL YGF R+ NP+D VE
Sbjct: 370 HVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYGPYSNAKLLLFYGFAVRD--NPADEVE 427
Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE 393
L L + + AL GLS P L++ A L+ +PP + ++
Sbjct: 428 LVLQVPPGAAATDRR--ALLAAGLSLEHRLRAGGRLAP-PLLSCARLLAAPPPLLKQWRR 484
Query: 394 M 394
M
Sbjct: 485 M 485
>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
prasinos]
Length = 866
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 130/291 (44%), Gaps = 40/291 (13%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW------------- 149
RG +++R+G+ LL +P S + +S + E+L +
Sbjct: 25 RGNAVTEDVRRGDVLLEIPLSRCFSLESA----QKSEMLTKAMAKAAAAAAGTRFTPTHD 80
Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-YLEASQIRERAIERIT 208
+A +++ E + K S +I ++P+ L+W+ E R L + +
Sbjct: 81 QYMAMFILLEQNLGKQSSHYEHILSIPKAYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL----PSMDGRVALVP 264
VI Y L+ + D F E+ M+ FKW L+SR L P L+P
Sbjct: 141 EVIQDYELLKHHLGE---DFFREQNITMDRFKWVRATLWSRQCDLLRPAPETTRLRVLIP 197
Query: 265 WADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQP-GEQVFISY--GKKSNGELLLS 317
DM NHS +V L+Y S+G+V P GEQ +ISY G+ S+ +LLL
Sbjct: 198 EFDMFNHSSKVPLGSSHKLNY---SRGLVTAFATANVPKGEQAYISYGSGEASSSKLLLW 254
Query: 318 YGFVP-REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
YGF P EG NP + +++ L + +C ++ E L++ ++++ + +I
Sbjct: 255 YGFAPLNEGENPFEQLDVTL----TSQCSADRAECLKQALFASAQVYLRKI 301
>gi|395839524|ref|XP_003792639.1| PREDICTED: N-lysine methyltransferase SETD6 [Otolemur garnettii]
Length = 448
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 134/305 (43%), Gaps = 30/305 (9%)
Query: 67 GCEIDSLENASTLQKWLSDSGLP-PQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSL 124
GC+ D + + W GL K+A+ ++ V G+VAL++++ GE L VP +
Sbjct: 16 GCDADPV---AGFLSWCGQVGLELSSKVAVTRQGTVAGYGMVALESVQPGELLFAVPRAA 72
Query: 125 VITADSKWSCPEAG----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--Q 178
++ S+ +C +G E + S W L L+ E +S W Y + P +
Sbjct: 73 LL---SQHTCSISGLLEQERVALQSQSGWVPLLLALLHEVQ-APASPWRPYFALWPELGR 128
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
++W E R L+ + + E + +TN+ Y + L +P+LF V ++E
Sbjct: 129 LEHPMFWPEEERHRLLQGTGVPEAVEKDLTNIRSEYCSIVLPFMEAHPELFSPRVRSLEL 188
Query: 239 FKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
+ ++ + + P + +VP AD+LNH L+Y + +V T
Sbjct: 189 YHQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSANYLRMVAT 248
Query: 292 TDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
QP G ++F +YG+ +N +L+ YGFV N D+ ++ + + K
Sbjct: 249 -----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTK 303
Query: 349 LEALR 353
EA R
Sbjct: 304 GEAER 308
>gi|330798760|ref|XP_003287418.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
gi|325082565|gb|EGC36043.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
Length = 479
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 132/329 (40%), Gaps = 83/329 (25%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG-------------------- 138
D G RG++A +I++ E L+ +P +I + SK+S P
Sbjct: 23 DTG-RGVIANNDIKENEILISIPSKYLIHSHSKFSIPSLNIPELNNSDSSNSSSSSDDIY 81
Query: 139 ----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-- 192
LK+ + ++ LI E +K S W NY++ LP ++ E++
Sbjct: 82 TPFHNCLKKLNSKQ--RISLILIIEKLIKKHSIWFNYLNELPDDYTITSTYSDEEIESLS 139
Query: 193 ---YLEASQIRERAI----------------ERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
Y+E+S+ + + + V+ NDL++++ ++
Sbjct: 140 YPIYVESSKKLKNEMLNSFKLFCEIFQLYYGTDLDRVVIELNDLQVKL---------SDI 190
Query: 234 FNMETFKWSFGILFSR-------LVRLPSMDGR-----VALVPWADMLNHSCEVETFLDY 281
N E + W +G + +R + + S + LVP AD+ NH+ VET +
Sbjct: 191 LNKELYIWCWGTIQTRTYFYDKNMKKNNSKENNEEKDDCTLVPLADLFNHTSNVETEALF 250
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL------- 334
+ T + G QVFISYGK SN L+ YGF+ N DS+ L
Sbjct: 251 NDELNCYQVKTKTPFSKGSQVFISYGKHSNFTLMNYYGFIIE--NNDQDSIPLLQSNCIP 308
Query: 335 -----PLSLKKSDKCYKEKLEALRKYGLS 358
P + K Y++K+ L YGLS
Sbjct: 309 TEFAVPPTSSDEAKLYEKKIGILNNYGLS 337
>gi|145355885|ref|XP_001422177.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582417|gb|ABP00494.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 70/262 (26%), Positives = 107/262 (40%), Gaps = 49/262 (18%)
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF-EK 164
V I +G+ ++ +P ++ A S A E + + + LL L+ E +
Sbjct: 88 VCDDGIARGDVIVAIPRDAMLDARSALG-DAAFERARARGLSSFQLLTVSLLREWRLKDT 146
Query: 165 SSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVI-GTYNDLRL-R 220
+SRW Y+ LP + L W +++++L A+ A R+ +I D RL R
Sbjct: 147 TSRWKPYLDTLPEDDGRWHPLLWRDEDVEQHLPANSTHAGA--RLRGLIRACEEDTRLFR 204
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR--------------------- 259
++ E +M +W+ I+ SR RL +D
Sbjct: 205 SIVDELNIDDENWPSMRHVRWAVSIVISRAFRLNELDDEECLREVRDDALLETLNDLDAD 264
Query: 260 -----------------VALVPWADMLNHSCEV--ETFLDYDKSSQGVVFTTDRQYQPGE 300
+ALVPWAD LNHS + E L YD SQ + Y GE
Sbjct: 265 CWEGSGGDSGEDDEFSVMALVPWADGLNHSSDAGDEAILTYDTLSQTATLRAHKAYACGE 324
Query: 301 QVFISYGKK-SNGELLLSYGFV 321
QVF SYG S+ +L ++YGFV
Sbjct: 325 QVFDSYGSNLSDEDLFVNYGFV 346
>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
Length = 511
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 123/285 (43%), Gaps = 24/285 (8%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L + +W G+ + I + GL A K+I +++L VP + + + +
Sbjct: 84 LAKIAAFSEWAKAGGIHSDGVEIAIFPGYQMGLRATKDINADQQVLRVPRKKIFS-EEQL 142
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
S E C+ LA L+ E S S W YI LP + ++LY+T ++ R
Sbjct: 143 SKTERESF---CNFTTNFNLANALVVEKSRGADSIWKPYIDVLPSRYNTVLYFTVEQM-R 198
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIF--SKY--PD--LFPEEVFNMETFKWSFGIL 246
L + + A+ + + Y L + S Y PD LF + E ++W+ +
Sbjct: 199 RLRGTSVCSSALRQCRMIARKYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTV 258
Query: 247 FSRLVRLPSM-----DGRV---ALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTDRQYQ 297
+R +P DG AL+P DM NH ++ +F YD ++ + T +
Sbjct: 259 MTRQNLVPREIATKDDGNSPISALIPCWDMANHRPGKITSF--YDSNAHQMECTAQEFCK 316
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
G Q FI YG + N +LL+ GFV + N D V + L L +D
Sbjct: 317 AGNQFFIYYGDRPNADLLVHNGFV--DPNNNKDFVNIRLGLSPTD 359
>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
Length = 418
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 94/211 (44%), Gaps = 10/211 (4%)
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
+V W LA ++ E + ++ + R + W EL YL AS + +A
Sbjct: 162 TVKPWTKLALIVLMERYKGQYGHHTSRVFLNQRSSTTRFRWEDTELS-YLRASPLYGKAR 220
Query: 205 ERITNVIGTY----NDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDG 258
ER+ + + ND + + D++P+ ++E K + +FSR + + D
Sbjct: 221 ERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGKVSLEDLKHVYATVFSRSLAI-GEDS 279
Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
+ ++P D NH+ L ++ V T DR Y +Q++I+YG SN EL L Y
Sbjct: 280 TLVMIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDY 339
Query: 319 GFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
GF E NP D EL + + K++L
Sbjct: 340 GFTVPE--NPYDETELLTQFPEMNTIIKDQL 368
>gi|412991339|emb|CCO16184.1| predicted protein [Bathycoccus prasinos]
Length = 519
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 129/317 (40%), Gaps = 54/317 (17%)
Query: 42 VHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVG 101
V C A + T+ Q+++ W ++ S SGL + K
Sbjct: 46 VKCQSKELETALIREQTIVQDLVGW-----------CIENGFSGSGLGVRPSTSGKG--- 91
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
RGL A + + K E +L +P I ++K EV+++ P LA L+ E
Sbjct: 92 -RGLEATRLVEKDECVLTLPLRSGIVDEAKGHPEHTREVIEK--APWGVRLACRLLQERK 148
Query: 162 FEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGT-YNDLRL 219
S ++ Y+ +P S L++ E+ R E+ IE + + Y+DL
Sbjct: 149 KGAESAYAAYLELIPENVETSPLHYASEEVSRICYPPM--EKEIEEMRKAVKKWYDDLNA 206
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHSCEV 275
+ EE FK + ++ SR + S D AL+P AD+LNH +
Sbjct: 207 GEGKEALAGASEE-----EFKCAVAVVHSRTYGVSSGDTGEGYFRALLPLADLLNHGGD- 260
Query: 276 ETFLDY--------------------DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
++D ++ + FT + +PGE+ +SYG++SN L
Sbjct: 261 -EYIDETRSSTSTVSTETVAWSEITDEEDESEIAFTAQKTLEPGEEALMSYGERSNDHFL 319
Query: 316 LSYGFVPREGTNPSDSV 332
L YGFVPR+ NP D V
Sbjct: 320 LYYGFVPRK--NPHDDV 334
>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
[Acanthamoeba castellanii str. Neff]
Length = 400
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 106/242 (43%), Gaps = 23/242 (9%)
Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV-----PDWPLLATYL-ISEASFEKS 165
++GE L VP ++ + ++ C A ++ K + +LA L + E+
Sbjct: 3 QEGELLAEVPARFILHSRNERVC-HAADLRKALAAHPRVASHRHMLAAVLWLLESVNCAQ 61
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
S W Y+S LP ++ W + EL + E + + Y + L +
Sbjct: 62 SFWQPYLSELPDAVATVDRWNQEELAEVGHTLMLYEMVEYKKKKIAADYAAILLPFLQEN 121
Query: 226 PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
LF + + E ++ + +++SR + G +P+ D LNHS D K++
Sbjct: 122 TQLFGGSIPSEEEYRRALSLVYSRTFDFSELIGEHVFIPFVDFLNHSIN-----DTGKAA 176
Query: 286 QGVVFTTDR---------QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+ D+ Y GE+VFISYG+K++ +LL SYGF+ N D+V++
Sbjct: 177 CTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSSQLLASYGFMYE--NNAEDTVDITA 234
Query: 337 SL 338
SL
Sbjct: 235 SL 236
>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
Length = 395
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 125/307 (40%), Gaps = 31/307 (10%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L W D G+ + + I + GL A + + K E +L VP L+ + +S
Sbjct: 25 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKDELVLSVPRKLIFSEESNS 84
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
C G++ + + LA L+ E + S W YI LP + ++LY+T +++
Sbjct: 85 DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQME- 139
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
L + A+ + + Y L + +P F + E +
Sbjct: 140 LLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGERGLFFTQHGLCYELY 199
Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
+W+ + +R +PS AL+P+ DM NH ++ +F Y + +
Sbjct: 200 RWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRPGKITSF--YAAVPRQLEC 257
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
T GEQ FI YG +SN +LL+ GFV + N D V + + L +D ++
Sbjct: 258 TAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV--DDNNLKDYVNIRVGLSLTDALAAKRAS 315
Query: 351 ALRKYGL 357
L K +
Sbjct: 316 ILDKLNI 322
>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
Length = 440
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 27/309 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL D + RGL++ ++++G+ ++ +P + ++T D+ G
Sbjct: 36 LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLQEGQVIISLPETCLLTTDTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWKPYLEILPKA-YTCPVCLEPEVVN-LFP 152
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
+R +A E+ V ++ R S P LF E V F+ W++ + +R V +
Sbjct: 153 KPLRAKAEEQRARVREFFSSSRGFFSSLQP-LFSEAVGSIFSYRALLWAWCTVNTRAVYV 211
Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
R AL P+ D+LNHS V+ +++ ++ T + E+VFI
Sbjct: 212 KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFIC 271
Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
YG N LLL YGFV P S+ + L L +DK +K+ L+ + +
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSEDI-LVKYLPSTDKQMNKKISILKDHDFIENL 330
Query: 362 CFPIQITGW 370
F GW
Sbjct: 331 TF-----GW 334
>gi|353236313|emb|CCA68310.1| related to SET7-Regulatory protein [Piriformospora indica DSM
11827]
Length = 493
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 138/344 (40%), Gaps = 97/344 (28%)
Query: 74 ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV------ 125
E+ KW DSG L P + I+ + RG VAL +I+K L VP S++
Sbjct: 4 ESTQEFLKWFRDSGATLHP-AVGIKDFEGVGRGAVALHDIQKDTVLFTVPRSILLSTRTA 62
Query: 126 ----ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
I D WS ++ W L ++ E S K S WS Y+ LP + +
Sbjct: 63 PLRDILGDEDWS-----------TLKGWEGLILSMMYEDSRVKDSPWSGYLQDLPTKFDT 111
Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP----EEVFNME 237
L++WT EL++ L+AS +R++ + T +++ L + + D+F + F +E
Sbjct: 112 LMFWTDEELEQ-LQASTVRDKIGKAATE--KDFHERVLPLLQRRTDVFEPALRDTFFTLE 168
Query: 238 TFKWSFGILFSR----------------------------LVRLP-SMD---GR------ 259
F + + SR + R P +MD GR
Sbjct: 169 RFHINGSRILSRSFHVEEWHDEHASDDESIPSEPDHKPVEMSRDPDAMDTDEGRPEGGED 228
Query: 260 ------------VALVPWADMLN--HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
V +VP AD+LN + C Y+K ++ T D GEQ++ +
Sbjct: 229 DDAESDTESPDDVNMVPMADILNARYGCHNAKLF-YEKDHLNMIATKD--IPAGEQIWNT 285
Query: 306 YGKKSNGELLLSYGFV-------PREGT----NPSDSVELPLSL 338
YG N +LL YG + P G NP+D VE+ L
Sbjct: 286 YGDPPNADLLRQYGHIDRIPILNPEVGVYPFENPADEVEIRADL 329
>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
Length = 539
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 125/311 (40%), Gaps = 70/311 (22%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + GL A + I+ E L+VP L++T +S +
Sbjct: 82 LMKWASENGASVDGFEMVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
RYL+++Q AI + FS+Y + + + + + + G
Sbjct: 195 RYLQSTQ----AIHDV--------------FSQYKNTARQYAYFYKVIQITTGY------ 230
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
++ + CE D+ +PGEQ++I YG +SN
Sbjct: 231 ---------------NLEDDRCECVALQDF---------------RPGEQIYIFYGTRSN 260
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
E ++ GF N D V++ L + KSD+ Y K E L + G+ S F + T P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTDPP 318
Query: 372 LELMAYAYLVV 382
+ A+L V
Sbjct: 319 ISAQLLAFLRV 329
>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
Length = 487
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 115/267 (43%), Gaps = 39/267 (14%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
P ++A + RG+VA NI +GE+L +P ++V+T + E L++ P
Sbjct: 34 PKLRLADLRATGAGRGVVAQSNIVEGEELFSIPRTMVLTVQNSELRTLLAENLEEQMGP- 92
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
W L ++ E + SRW+ Y LP + +L++W+ AEL + L+AS I E+ I R +
Sbjct: 93 WLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSPAEL-QELQASTIVEK-IGR-S 149
Query: 209 NVIGTYNDLRLRIFSKYPDLFPE-------EVFNMETFKWSFGILFSRLVRLPSMD---- 257
N + D I +K PDLFP E + G + L+ + D
Sbjct: 150 NAEESIRDSIAPILAKRPDLFPPPPGLASWEGIAGDAALIQVGHVMGSLIMAYAFDIEKA 209
Query: 258 ------GRV-----------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
G V +VP AD+LN + Y + +V +
Sbjct: 210 EDDDDEGEVNDESYMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEG-ALVMKAIK 268
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFV 321
Q G+++F YG+ +LL YG+V
Sbjct: 269 PIQKGDEIFNDYGEIPRADLLRRYGYV 295
>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
Length = 504
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 93/435 (21%), Positives = 174/435 (40%), Gaps = 87/435 (20%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVD--------VGERGLVALKNIRKGEKLLFVPPSLVITA 128
+ +++WL D+ + + I+ VD V G++A ++++ E + +P V++
Sbjct: 8 TIIKQWLRDNCVVIDESKIEIVDTTTHPHVIVEGLGIIAKQDLKVDEIIAVIPKRCVLSP 67
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
+ P +L++ + + + L+ E S S+W +YI ++P + W +
Sbjct: 68 KTTSIAP----ILEKYELEEAVATSIALMYETSKGVQSKWYSYIQSMPTVIDLPILWDKE 123
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
++ YL + + E IE I + Y + I +P+ F E +F +E+FK + I+ S
Sbjct: 124 SIE-YLVGTDLEEIVIENIETLEEQYREDVEPIIKNHPETFKENIFTLESFKIASTIVSS 182
Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETF-LDYDKS----------------------- 284
R + G +LVP AD+ NH E ++ D +
Sbjct: 183 RAFNIDQYHGE-SLVPLADIFNHKTGRENVHVEADGNVCKQCGELDGCEHKKKKGGKKVV 241
Query: 285 --------------SQGVVFTTDRQYQPGEQVFIS--------------YGKKSNGELLL 316
+ F + P + +FI+ YG N LL
Sbjct: 242 KGAPSLKKATPQDIEKKTTFKDRIELLPKDSLFITIVKPVNKDCEVFNTYGDHDNSLLLS 301
Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDK-------CYKEKLEALRK--------YGLSASE 361
YGF+ E NP D + + L DK YK + AL K + + E
Sbjct: 302 KYGFL--EMDNPCDLLRIDRQL--VDKLLFSYLDLYKIEQPALMKRLEFYCENFDEDSRE 357
Query: 362 CFPIQITGWPLE-LMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD-IKCPEIDEQALQ 419
++ G + L+ Y+ ++PP ++++M A K+ KK I + +Q +
Sbjct: 358 HHAFELDGHGDDALITTLYIALAPPETFNQWKKMKQAQFYKIFEKKQAIDMVKEFQQVRK 417
Query: 420 FILDSCESSISKYSR 434
ILD + + KY++
Sbjct: 418 AILDVIDKRLEKYNQ 432
>gi|221131915|ref|XP_002160713.1| PREDICTED: SET domain-containing protein 4-like [Hydra
magnipapillata]
Length = 429
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/273 (23%), Positives = 121/273 (44%), Gaps = 28/273 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L W ++GL + + + RGL K++ G+ ++ +P +L+IT D+ +
Sbjct: 33 LFSWSLNNGLVLKAVTPKVFKKTGRGLKTTKSVSPGDLIIALPLNLLITFDTILENNDLN 92
Query: 139 EVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL----DRY 193
+ + S+ L +L+ E ++S + +Y++ LP + Y ++ E+ +
Sbjct: 93 FIFRNHPSICQKYLFILFLLIEKKKGENSYFFHYLNTLPENFSTPSYISQDEMQLCPNFI 152
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
E + ++ R I I + L S EV KW++ ++ +R V
Sbjct: 153 QEETGLQNRQILNAIKHISCIHSLIANDLS----CIDSEV------KWAWNVINTRSVYF 202
Query: 252 ---------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ S++ AL P D+LNH+ ++KS++ T+ Y PG Q+
Sbjct: 203 NAKHLKCFKNISSINVDFALAPVLDLLNHNDTANVVAGFNKSTKHYEVHTNDIYTPGSQL 262
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
FI+YG SN +L YGFV N D++ +P
Sbjct: 263 FINYGPHSNRKLFCEYGFVLPFNMN--DTIPIP 293
>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
Length = 232
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 90/186 (48%), Gaps = 12/186 (6%)
Query: 253 LPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+PS DG AL+P DM NH T D++ +S R ++ GEQVFISYG +
Sbjct: 7 IPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDHCECYALRNFKKGEQVFISYGPR 65
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
+N + + GFV N D +L L + K+D KE++E L K GL + F ++
Sbjct: 66 TNSDFFVHSGFVYM--NNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPGT 123
Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
P+ A+L V SM+ K E S+K+ K + C ++E +F+L +
Sbjct: 124 EPISDTLLAFLRVF--SMR-KAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQ 180
Query: 427 SSISKY 432
I+ Y
Sbjct: 181 LLIANY 186
>gi|159473090|ref|XP_001694672.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276484|gb|EDP02256.1| predicted protein [Chlamydomonas reinhardtii]
Length = 515
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 128/311 (41%), Gaps = 67/311 (21%)
Query: 76 ASTLQKWLSDSGLPPQKMA----IQKVDVGERGLVALKNIRKGEKLLFVPP--SLVITAD 129
A ++WL G +K A ++++ G+RG+VA I +GE LL +P +L + D
Sbjct: 2 AEAFEEWLDKHG--GKKHASLDLVKELPNGDRGVVATAPIAEGELLLLLPINCALYMPND 59
Query: 130 SKW-----SCPEAGEVLKQCSVPDWPLLAT--YLISEASFEKSSRWSNYISALPRQ-PYS 181
+W S PEA L + P LAT L+SE + S W+ Y+ LP P
Sbjct: 60 EEWAKRGSSFPEAVGYLHEHHRTLSPFLATTLALMSEVARGGESAWAAYVGTLPPSCPDC 119
Query: 182 LLYWTRAELDRYLEASQIRE-------RAIER-ITNVIGTYNDL---------------- 217
LL W++ E + LE + + E A +R + ++ DL
Sbjct: 120 LLNWSKEE-KKDLEGTALEELGPDPAADAFKRHVAPILAARRDLWPLQQQGEGEGGAAAD 178
Query: 218 ----RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
L +F + L F++E W G + +L +V L+P DM+NHS
Sbjct: 179 EAAADLALFVRVAGLVQSRAFHLEAENWVSGA--KEISKLEGGGTQVFLLPGIDMINHSH 236
Query: 274 E----------------VETFLDYDKSSQGV----VFTTDRQYQPGEQVFISYGKKSNGE 313
L + +GV V D+ GE+V +YG S+ +
Sbjct: 237 NPARRNAHLQRLNVAQAAAAKLTEGGAPEGVEAFFVMRADKPIAEGEEVLHTYGNLSDAQ 296
Query: 314 LLLSYGFVPRE 324
LL +YGF+ E
Sbjct: 297 LLQTYGFLDSE 307
>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
Length = 429
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/270 (24%), Positives = 118/270 (43%), Gaps = 25/270 (9%)
Query: 79 LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L W+ +G K+ ++ RG+VAL+ + GE L +P +L+I+ +
Sbjct: 32 LLTWMEANGFRLHSKLGLRDFPDTGRGVVALEKLVGGETFLKLPATLLISTRTALQSRLH 91
Query: 138 GEVLKQ-CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+++ + +L +++ + ++SRW ++ +LPR + ++ R +
Sbjct: 92 SFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPRTFTTPVFLRRKVFESL--P 149
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-----EVFNMETFKWSFGILFSRLV 251
+RE IT + T+ L++ + + + PE F F W++ + +R +
Sbjct: 150 KDLREEVQTGITFIQRTFLKLKV-LLGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCI 208
Query: 252 RLPSM-------DGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
D AL P+ D LNH +ET + + + + + EQV
Sbjct: 209 FAQGSNSSSLWEDDHCALAPFLDCLNHHWKASIETAM----VGENFEILSHKSHDANEQV 264
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSV 332
FISYG SN L L YGFV + NP+D V
Sbjct: 265 FISYGPHSNRRLFLDYGFVLPD--NPNDVV 292
>gi|308501895|ref|XP_003113132.1| CRE-SET-27 protein [Caenorhabditis remanei]
gi|308265433|gb|EFP09386.1| CRE-SET-27 protein [Caenorhabditis remanei]
Length = 501
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 116/279 (41%), Gaps = 51/279 (18%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI------- 204
LA +L ++ + S+W YIS LP + L++T +L + L+ S I E A+
Sbjct: 153 LALFLATQWLLNEKSKWLPYISILPNSFPTPLFYTDEQLLQ-LKPSPIFEEALLFYRTIS 211
Query: 205 ----------------ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
E N + +F P F + W+ G++ +
Sbjct: 212 RQFCYFLMAVAKNKIYESAQRRKDARNTMETPLFYNAP--FTVANLTPGLYFWAVGVVTT 269
Query: 249 RLVRLPSMDGR---------VALVPWADMLNHSC-----EVETFLDYDKSSQGVVFTTDR 294
R+ +PS AL+P+ DM NH VE + Y + + V T+
Sbjct: 270 RVNMVPSEHSTDKDEKPNLIAALIPFLDMANHENVVTEDPVEDLVCYSPAEECAVITSHC 329
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+ G QV I YG +S GE LL GFVP D ++L + + K+DK K + + K
Sbjct: 330 DLEAGNQVTIFYGCRSRGEHLLHNGFVPIHHQR-QDVLKLKIGIPKTDKTLDSKTKLIEK 388
Query: 355 YGLSASEC----FPIQITGW-----PLELMAYAYLVVSP 384
Y + +C F + + + PL+L+ +A + V P
Sbjct: 389 Y-VQNVQCNGNIFQVDLYNYPEQPFPLDLLMFAAIFVCP 426
>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
Length = 504
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 132/320 (41%), Gaps = 34/320 (10%)
Query: 73 LENASTLQKWLSDSGLPPQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
+E + +W + G + + + + + G GL + I GE ++ VP S+ ++
Sbjct: 69 METVAHFMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNE 128
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
+ E++ + + +LA LI E F S W Y+ LP + + LY+T +
Sbjct: 129 PRYRQLLELMPGAMMSEQGNIMLALALIME-RFRAKSDWKPYLDLLPDRYTTPLYYTTED 187
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL---FPEEVFNMETFK------ 240
+ E A++ ++ Y +R + K +L F +VF + F
Sbjct: 188 MGELAETDAFLP-ALKLCKHIARQYGFIRRFVQEKVDELRDCFTYDVFRLLLFSLLIPHS 246
Query: 241 WSFGILFSRLVRLP-------SMDGRVALVPWADMLNHS---------CEVETFLDYDKS 284
W+ + +R ++P MD +AL+P DM NH+ C ET +
Sbjct: 247 WAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNE 306
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDK 343
T + +FI YG +++ E L+ GFV PR NP +V+ +L +
Sbjct: 307 QLECSLTREVSDIASVPIFIVYGTRTDAEFLVHNGFVCPR---NPHANVQKRFTLVPAIP 363
Query: 344 CYKEKLEALRKYGLSASECF 363
YKE+ L G+ + F
Sbjct: 364 LYKERAHLLELLGMPTTGTF 383
>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
Length = 454
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 63/286 (22%), Positives = 127/286 (44%), Gaps = 25/286 (8%)
Query: 71 DSLEN---ASTLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
D L+N S W GL P+ + V + G++A +++ GE L +P S +
Sbjct: 14 DHLQNDLPVSCFLAWCKKVGLELNPKVYISTEGTVSQYGMLAREDLSDGELLFSIPRSAI 73
Query: 126 ITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPY 180
++ ++ + + + L+ CS W L L+ EA+ + SS W+ Y P P
Sbjct: 74 LSQNTTRIRDLIEKEQDSLQSCS--GWVPLLISLLYEAT-DSSSHWAPYFGLWPELDPPD 130
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
++W+ E + L+ + I E + + N+ YN + L + P+ F ++ +K
Sbjct: 131 MPMFWSEEEQTKLLQGTGILEAVHKDLKNIEKEYNSIVLPFIRRNPEKFCPMKHTLDLYK 190
Query: 241 WSFGILFSRLVRLPSMDGRVA----------LVPWADMLNHSCEVETFLDYDKSSQGVVF 290
+ + + P + +VP AD+LNH + L++ + + +
Sbjct: 191 RLVAFVMAYSFQEPQEEDEEEDIEKDILPPMMVPVADLLNHVAQHNAHLEF--TPECLRM 248
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
T + G+++F +YG+ +N +LL YGF N +++ ++ +
Sbjct: 249 ITTKSVCAGQELFNTYGQMANWQLLHMYGFAEPHPQNCNETADIQM 294
>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
Length = 236
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 97/222 (43%), Gaps = 34/222 (15%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RGL A + +R GE++L + L+I +PD + S
Sbjct: 38 RGLFASRPVRAGERMLEISLDLMIAP---------------SDLPD----------QLST 72
Query: 163 EKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
+SS W+ YIS LP +P L W EL YL AS + + ER+ + + ++
Sbjct: 73 LQSSAWAPYISCLP-EPAGLDNTFLWEDTEL-SYLRASPLYGKTRERLEIITTEFGQVQ- 129
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFL 279
+P LF + ++E F + +FSR + + D + ++P D NH+ L
Sbjct: 130 NALDVWPQLFGK--VSVEDFMHVYATVFSRPLAI-GEDSTLVMIPMLDFFNHNAASFAKL 186
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
++ V T DR +Q++I+ G SN EL L YGF
Sbjct: 187 SFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT 228
>gi|348679693|gb|EGZ19509.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase small
subunit N-methyltransferase I [Phytophthora sojae]
Length = 606
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/251 (24%), Positives = 118/251 (47%), Gaps = 21/251 (8%)
Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASFEKS 165
+++ +G +LL +P S V++ S G +L+ PD P L +L+ E +
Sbjct: 205 EDVEQGAELLSLPMSKVMSVASAARG-RVGLLLEVN--PDLPPAIALGLHLLEEQALGAK 261
Query: 166 SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
S +S ++S+LP S L+++ +L + +E SQ+ + R V Y+ L + S
Sbjct: 262 SNFSEFVSSLPGVEAINSTLFYSENQL-KEMEGSQLLRYTLGRAQAVEAFYDALLQPVTS 320
Query: 224 KY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE---- 276
P +F E+ F ++ F+W+ G++++ + + V L P D + +V
Sbjct: 321 PEAVDPPIFKEQDFTLDKFRWAMGVVWASAFPVGEDEADVVLAPVLDTIGICTDVADEGD 380
Query: 277 -----TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
++ D+SSQ +V + G +V +S KS+ + +L+ GF + D
Sbjct: 381 EACPPNQIEVDQSSQRLVVHASSPLEKGREVRLSMPGKSSAQFMLNNGFARDRASKKLDK 440
Query: 332 VELPLSLKKSD 342
++L ++L SD
Sbjct: 441 LDLTVTLDPSD 451
>gi|24640264|ref|NP_727144.1| CG32732 [Drosophila melanogaster]
gi|22831862|gb|AAF46222.2| CG32732 [Drosophila melanogaster]
gi|28316927|gb|AAO39485.1| RE55639p [Drosophila melanogaster]
gi|220957744|gb|ACL91415.1| CG32732-PA [synthetic construct]
Length = 537
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 129/307 (42%), Gaps = 31/307 (10%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L W D G+ + + I + GL A + + K E +L VP L+++ ++
Sbjct: 115 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKDELVLSVPRKLILSEENNS 174
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
C G++ + + LA L+ E + S W YI LP + ++LY+T +++
Sbjct: 175 DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRPYIDVLPAKYNTVLYFTTKQME- 229
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
L + A+ + + Y L + +P F + + +
Sbjct: 230 LLRGTAAAALAMRQCRVIAKQYAFLYKYAHTMTEPSTGNRSHPGERGLFFTQHGLCYKLY 289
Query: 240 KWSFGILFSRLVRLPSM-----DG---RVALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
+W+ + +R +PS DG AL+P+ DM NH ++ +F Y S+ +
Sbjct: 290 RWAVSTVMTRQNLVPSEKQESEDGPKLISALIPYWDMANHRPGKITSF--YATVSRQLEC 347
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
T GEQ FI YG +SN +LL+ GFV + N D V + + L +D ++
Sbjct: 348 TAQEAVNTGEQFFIYYGDRSNTDLLVHNGFV--DPNNTKDYVNIRVGLSLTDALAAKRAS 405
Query: 351 ALRKYGL 357
L K +
Sbjct: 406 ILDKLNI 412
>gi|307107214|gb|EFN55457.1| hypothetical protein CHLNCDRAFT_52262 [Chlorella variabilis]
Length = 478
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/371 (24%), Positives = 137/371 (36%), Gaps = 83/371 (22%)
Query: 14 PSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSL 73
P F LH + A PR+R G S + S+ + VT++ + W E L
Sbjct: 22 PDFRQLHSCRPAAAL---PRRRHGTAAAAATPQSGSGSTSK-EAAVTKDYLSWATEAGIL 77
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSK 131
P+ M Q G RG AL +I E + VP +LV+ + +
Sbjct: 78 S---------------PKLM--QAYFGGLRGGQALSDIAADEVFVTVPRGAALVVAPNER 120
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
CP+ + P + +A L+ E + S YI LP + + W A+L
Sbjct: 121 CPCPDFVDPGFYKEAPWFVKMAVLLLWERRKGRGSSVWGYIEQLPSSIDTPVRWEEADLA 180
Query: 192 RYLEASQIRERAIERIT----------------------NVIGTYNDLRLRIFS------ 223
I+E ++ N + ++R R FS
Sbjct: 181 ELQYQPAIKEIKQQQTAWRQQYNRFCAAVRPGQGNYSWDNFLWAAENVRSRAFSGPYTGS 240
Query: 224 --------------------KYPDLFPEEVFN----METFKWSFGILFSRLVRLPSMDGR 259
+ L E+V N + F + +L S+ ++
Sbjct: 241 SVGEKARTLGLLLAAGGGYAAWQQLPLEQVLNGFISVLLFNIIYDVLISKKLKW------ 294
Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
AL P D LNHS VE+ + Y+ V +T Y+ G+QVFISYG ++NG LL Y
Sbjct: 295 YALCPVVDALNHSSLVESDVAYEYFKDTFVLSTKSAYKAGQQVFISYGAQANGSLLQYYA 354
Query: 320 FVPREGTNPSD 330
F E NP+D
Sbjct: 355 FT--EPGNPND 363
>gi|405953717|gb|EKC21325.1| SET domain-containing protein 6 [Crassostrea gigas]
Length = 384
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 111/281 (39%), Gaps = 22/281 (7%)
Query: 76 ASTLQKWLS--DSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
A+ KW S D K+ I + G+VA+ ++++GE L + +++ S
Sbjct: 30 ANAFLKWFSSNDDNFFSGKVTIGPDGSCAQNGMVAIADVQEGESLFRISRKILLHPKSS- 88
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
S E S W L ++ E + K S W Y LP ++WT E ++
Sbjct: 89 SISALFEKDPVNSESGWSELLICMMQEYN-TKDSPWKPYFDVLPETVDLPMFWTEEEREK 147
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
L + + E ++ + + K+ D E ++E +K + +
Sbjct: 148 LLTGTGVVEAVNRDNKKILTEFQSVVSPYLKKHKDTISESCDDLELYKRMVSYVMAYSFT 207
Query: 253 LP---------------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
P + +VP ADMLNH L + ++ T D +
Sbjct: 208 EPPKDDDSDDFGEEDEEEEKSTIYMVPMADMLNHIANNNAHLSFKPDCLEMIATKD--IK 265
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
GE+VF +YG+ +N LL YGF N D+V++PL L
Sbjct: 266 KGEEVFNTYGELANWHLLHMYGFSEAYPANHYDTVDIPLDL 306
>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 640
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 121/267 (45%), Gaps = 31/267 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L W+ ++GL A + DV E L A I + VP LV+ ++ E
Sbjct: 173 LTAWIDNAGLEINSNARPGLNDVDELYLFASNPIEAATLVATVPAPLVMF-ETYLRTLEN 231
Query: 138 GEVL------KQCSVPDWP-LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
+L K SVPD LA L+ E S+E S W +IS+LP+ S ++W+ E
Sbjct: 232 PMILAIDRRFKTMSVPDPSYALAMALLYE-SYEPKSMWREWISSLPQTLDSTVFWSAEEQ 290
Query: 191 DRYL-----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
D +QI ER ++++ YN R+ + +P +F ++ E FKW++ I
Sbjct: 291 DALQSLPLKRKTQILERHLQQL------YNATTPRLLAAFPHIFAGGNYSYEMFKWAYMI 344
Query: 246 LFSRLVRL---PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV-----FTTDRQYQ 297
+ SR + P ++ L P D+L+H V+T + + V+ T R +
Sbjct: 345 VDSRSLTFSTGPDTLPQIMLAPLVDLLHHD-PVQTNIQLGVHPEEVLGFEISLKTTRAIK 403
Query: 298 PGEQVFISYGKKSNGELLLSYGF-VPR 323
GE + G+ N +LLL +G +PR
Sbjct: 404 KGEPLVRHIGELPNHQLLLRFGLAMPR 430
>gi|406603886|emb|CCH44637.1| hypothetical protein BN7_4206 [Wickerhamomyces ciferrii]
Length = 477
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 133/287 (46%), Gaps = 25/287 (8%)
Query: 74 ENASTLQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
E+ Q WL +SG+ P K+ + RGL++L++I + E L +P ++++ ++
Sbjct: 5 EDTHNFQNWLINSGVQISPKIKIEDLRYLSQGRGLISLQDINQDEILFKIPRNVLLNIET 64
Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
K + D W L ++ E S S+W Y LP + +SL++W + E
Sbjct: 65 GSLSQINNNKEKLLTNYDHWEGLILTILYELSLGNESKWFQYFKILPNEFHSLMFWEKDE 124
Query: 190 LDRYLEASQI-----RERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
L+ L+ S + +E+A+E +I DL + + DLF + + +SF
Sbjct: 125 LE-LLKPSLVLDRIGQEKALETFNKLIPNALVDLGINHLNISLDLFHKVASTI--LSYSF 181
Query: 244 GILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQ 295
+ D +V ++V AD+LN + L Y+ ++ ++ + +
Sbjct: 182 DVERPDFNEDMEDDEQVQYDGYFKSMVTLADLLNADTNLSNANLFYE--TEFLIMKSIKP 239
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS-LKKS 341
G+Q++ +YG N ELL YG+V G+ D ELP+S +KK+
Sbjct: 240 IPQGQQIYNTYGDHPNSELLRRYGYVEYNGS-KFDFGELPISTIKKT 285
>gi|327291705|ref|XP_003230561.1| PREDICTED: n-lysine methyltransferase SETD6-like, partial [Anolis
carolinensis]
Length = 324
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 78/162 (48%), Gaps = 10/162 (6%)
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-- 240
++W+R E + L+ + + E + + ++ ++ + L +PDLF +V N+E +K
Sbjct: 6 MFWSREEQKQLLQGTGVPEAVEKDLASIQEEFSSVVLPFMKAHPDLFNPKVHNLELYKRL 65
Query: 241 ------WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+SF L + +VP AD+LNH L++ S + + R
Sbjct: 66 VAFVMAYSFQELLDEEEEEEGKPSPLVMVPLADLLNHVANHNANLEF--SPEHLQMVATR 123
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
G++VF +YGK SN +LL YGF N +D+ ++P+
Sbjct: 124 TIPKGQEVFNTYGKLSNWQLLHMYGFAEPYPGNTNDAADIPM 165
>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
Length = 986
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 123/270 (45%), Gaps = 32/270 (11%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK----GEKLLFVPPSLVITADSKWS 133
TL +W++ G K + D R ++A +N+ G+ + +P + ++T + ++
Sbjct: 20 TLWEWVTRHGGSAPKARLS--DAYPRTVIAAENVNGAQDGGDTIFSIPITCLMTPAAAFA 77
Query: 134 CPEAGEVLK----QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
G+V + SV D +L +L E +S W YI LP + L W+RAE
Sbjct: 78 DVTYGKVFELFAAHQSVEDRTVLVFFLAIERQRGMTSHWGPYIRELPSIFSNPLNWSRAE 137
Query: 190 L-----DRYLEASQIRERAIERITNVIGTYNDLRLR---IFSKYPDL-------FPEEVF 234
R A++ + A+ ++T V LR I S ++
Sbjct: 138 TLRLAGTRLGGATKFHDCALLQLTEVCVPAFIAILRAQLILSANTKAIASGAISLAQDAL 197
Query: 235 NMETFKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHS--CEVETFLDYDKSSQGVVF 290
+ + WS + SR L ++G+ +ALVP DML+HS ++E D D + Q ++
Sbjct: 198 SPDRLAWSHSCVSSRAFSL-FLNGQRTIALVPLGDMLDHSPDAQIEWRTD-DTAGQFLII 255
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
+ DR G +F +YG KSN EL+L YGF
Sbjct: 256 SHDR-LPAGSIMFNNYGAKSNEELILGYGF 284
>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
Length = 447
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 128/293 (43%), Gaps = 30/293 (10%)
Query: 82 WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
W +G L P+ ++ V GL+A ++ GE L VP S ++ S+ +C
Sbjct: 24 WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80
Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
+A E L+ S W L L+ E + +S W Y S Q +S L +W E
Sbjct: 81 LHDAQESLQSQS--GWVPLLLALLHEYT-TGTSHWRPYFSLW--QDFSSLDHPMFWPEEE 135
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
R L+ + I E + + N+ Y+ + L +PD+F E+ +E +K + +
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195
Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ P + +VP AD+LNH L Y + +V T + G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLKYAPTCLRMV--TTQPISKGQE 253
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+F +YG+ +N +LL YGF N +D+ ++ + + + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKSEAQQQ 306
>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
Length = 606
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 96/208 (46%), Gaps = 26/208 (12%)
Query: 155 YLISEASFEK-SSRWSNYISALPRQPYSLLYWTRAE-----LDRYLEASQIRERAIERIT 208
+LI E ++ +SRW Y LP + + + + E LD L + ++ +
Sbjct: 281 FLIHEMKTKRETSRWKTYFDFLPGKFETGICFEEEEGGGLNLDEELAGTGFVQKRWKERE 340
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLPS-MDGRVA---- 261
V TYN L + ++P +F E F+ ++F W+ G+ +R V + P+ G+V
Sbjct: 341 VVEHTYNMLFPWLTEEFPQVFDREHFDFQSFMWARGVFDTRCVTVKFPAEKTGKVGVDNN 400
Query: 262 ------------LVPWADMLNHSCEVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
LVPWADM NH + D + + + F T + G QVF++YG
Sbjct: 401 GEGEKGTRDVTCLVPWADMCNHHPYAQLNKPSLDPTRKFLQFCTMAPIKQGSQVFLNYGP 460
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPL 336
N +LLL YG+ ++ + ++EL L
Sbjct: 461 LDNTQLLLYYGYAEQDNPYQTYAIELEL 488
>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
str. Neff]
Length = 408
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 123/287 (42%), Gaps = 26/287 (9%)
Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-----SVPDWPLLATYLISEASFEKS 165
+ E++L VP SL++ A + + G V +V + LA +++ E +
Sbjct: 2 VLASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELR-KPD 60
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
S W Y ALP + ++W ++ + L S + + + + + + I +Y
Sbjct: 61 SFWRPYFDALPSRVNLPMFWADEDM-QLLAGSPLHAAVLAQKKQARDWHTEHIVPIVRRY 119
Query: 226 PDLFP--------EEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSCEV 275
P F E +++ F+W ++ SR + +VP AD++NHS
Sbjct: 120 PRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPMADLINHSLTN 179
Query: 276 ETFLDY--DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
+ Y D +Q + + Y GEQVFI+Y SN ELL +Y + + N +
Sbjct: 180 DNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVEDNYNKYTEIR 239
Query: 334 LPLSLKKSDKCYKE-----KLEALRKYGLSASECFPIQITGWPLELM 375
L + C E K AL + GL A + +P++ +PL+L+
Sbjct: 240 LD-ETTIARICPDEVERLTKTRALTQRGL-AKQTYPVKSEEFPLDLV 284
>gi|348676124|gb|EGZ15942.1| hypothetical protein PHYSODRAFT_561656 [Phytophthora sojae]
Length = 429
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 89/183 (48%), Gaps = 13/183 (7%)
Query: 156 LISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
L++E + + S + YI LP P+S +R E+ R+ A I + + V+
Sbjct: 95 LLAELARGEESGFHGYIQQLPTSISLPFSWGAESR-EMLRHTTAHLILDDKL-----VLK 148
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNH 271
Y D + ++ ++P EV +E F+W++ ++ SR ++ DG+ L+P DM NH
Sbjct: 149 MYADYAEPLMKEFSTIWPAEVSTLEKFQWAYSMVSSRAFKV--TDGQEPTLLPVIDMANH 206
Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
+ E + TT R+ + E V ISYG SN +LL YGFV PSDS
Sbjct: 207 AAENPAAHIVKTETGSFQLTTLRKVEKDESVTISYGDLSNAQLLCRYGFVLPTSV-PSDS 265
Query: 332 VEL 334
+ +
Sbjct: 266 IHI 268
>gi|325530255|sp|E1BI64.1|SETD6_BOVIN RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
Length = 450
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 135/310 (43%), Gaps = 34/310 (10%)
Query: 65 PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
P G + D AS L W GL P+ ++ V G+VA ++++ GE L VP
Sbjct: 13 PAGSDDDPAPVASFL-SWCQRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 71
Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALP-- 176
+ ++ S+ +C +G + ++ S W L L+ E +S WS Y + P
Sbjct: 72 AALL---SQHTCSISGVLERERGALQSQSGWVPLLLALLHEMQ-APASPWSPYFALWPEL 127
Query: 177 ---RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
+ P ++W E R L+ + + E + + N+ Y + L +PDLF V
Sbjct: 128 GRLQHP---MFWPEEERRRLLQGTGVPEAVEKDLVNIRSEYYSIVLPFMDAHPDLFSPRV 184
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQ 286
++E ++ ++ + + P + +VP AD+LNH L+Y +
Sbjct: 185 RSLELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPTCL 244
Query: 287 GVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
+V QP G ++F +YG+ +N +L+ YGF N +D+ ++ + +
Sbjct: 245 RMV-----AIQPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTNDTADIQMVTVREAA 299
Query: 344 CYKEKLEALR 353
K+EA R
Sbjct: 300 LQGTKVEAER 309
>gi|308813462|ref|XP_003084037.1| unnamed protein product [Ostreococcus tauri]
gi|116055920|emb|CAL58453.1| unnamed protein product [Ostreococcus tauri]
Length = 467
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 117/313 (37%), Gaps = 75/313 (23%)
Query: 68 CEIDSLENASTLQKWLSD-SGLPPQKMAIQKVDVG---ERGLVALKNIRKGEKLLFVPPS 123
C+ +L+ L WLS+ G + + + + G V IR+GE ++ +P
Sbjct: 46 CDAAALDR---LHAWLSNIEGFSDRGLRLTTANDGFGVRATCVRESGIRRGEVIIQIPRE 102
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP--YS 181
+TA + E G D+ L ++ E +++SRW Y+ LP +
Sbjct: 103 AFLTAHPRM---EYG--------SDFRRLVAAVLEEMKRKEASRWWPYLQTLPHHGDWHP 151
Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR--------LRIFSKYPDLFPEEV 233
LL+ A R + +R ER+ N +D R + K D +P E
Sbjct: 152 LLWSDEARTSRLPTWTVASKRLTERLRNCANDAHDFRSMGLGKDVKDVSGKNDDEWPSEA 211
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV--------------------------------- 260
+W+ I SR RL D V
Sbjct: 212 ----DVRWASAICASRAFRLEFFDDDVFENFDETDARAFAVLERLSDVDEDVWGPGPSED 267
Query: 261 -------ALVPWADMLNHSCEVE--TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK-S 310
LVPWAD LNHS + E + L YD S + Y G+QVF SYG S
Sbjct: 268 DFDTSVLVLVPWADGLNHSSDAEESSILRYDAGSATATLRAHKSYARGDQVFDSYGVHLS 327
Query: 311 NGELLLSYGFVPR 323
+ ++ L +GFV R
Sbjct: 328 DVDVFLDFGFVVR 340
>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
Length = 1078
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 129/282 (45%), Gaps = 55/282 (19%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSK 131
++ +Q W DS P + + E+G+ + ++I++GE+LL +P SL + +
Sbjct: 27 DDGINIQTWKQDSKQP-----LLSLTPNEKGIFSSRDIKEGEELLSLPWYNSLSMNKVQQ 81
Query: 132 ---WSCPEAGEVLKQCSVPDWPLLA----TYLISEASFEKSSRWSNYISALPRQPYSLLY 184
W + ++ + + D ++A Y + + SF+ +S + SA+P S L+
Sbjct: 82 QLPWLFNKIQDL--ELTAEDGLVVALLYYRYCMDDLSFD----YSEWFSAMPEVLNSGLF 135
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDL---RL---RIFSKYPDLFPEEVFN--- 235
++ AE + + N Y DL RL +F + LF E+ F+
Sbjct: 136 FSDAEAE---------------LLNGSPAYIDLMNQRLDAKELFGRLKSLFKEQQFSKCA 180
Query: 236 --METFKWSFGILFSRLV--RLPSMDGR------VALVPWADMLNHSCEVETFLDYDKSS 285
+ KW++ ++ SR + P++D V L P+ D NH+ + + D+D
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAPFLDYFNHAEDAQAAYDFDYDE 240
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPREGT 326
+ + + GEQ+F++YG + N +LL+ YGF+ + T
Sbjct: 241 SAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSST 282
>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
Length = 588
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 121/295 (41%), Gaps = 33/295 (11%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA-GEVLKQ-CSVPDWPLLATYLISEA 160
RG A +I G+ +P + T P G+ + ++ + + A +LI+E
Sbjct: 186 RGAAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFAALGEDTIAALWLIAER 245
Query: 161 SFEKSSRWSNYISALP--------RQPY---SLLYWTRAELDRYLEASQIRERAIERITN 209
+ ++S W I++LP P + + W R D L + + AI
Sbjct: 246 ALGEASPWHAVIASLPWPEGGEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-----W-SFGILFSRLVRLPSMDGRVALV 263
+ + L + D+FP + ++ F+ W S+G+ P L
Sbjct: 306 LARQHAALFPALSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQAS---PGEPAATCLP 362
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGF-V 321
P A + NH+ + Y + G + R GE+VF+SYG KSN ELLL YGF +
Sbjct: 363 PVAMLCNHALWPH-VVRYSRLRDGTLRLPVARSVHAGEEVFVSYGAKSNAELLLFYGFAL 421
Query: 322 PREGTNPSDSVELPLSLKKSD--KCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
P NP D V L L L + K + AL + GL+ S P + PL L
Sbjct: 422 P---GNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLS---PHAVRAGPLPL 470
>gi|449472508|ref|XP_002187588.2| PREDICTED: N-lysine methyltransferase SETD6 [Taeniopygia guttata]
Length = 383
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 64/263 (24%), Positives = 113/263 (42%), Gaps = 18/263 (6%)
Query: 105 LVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
++A + + GE L +P + ++ T EA E L+ S W L L+ E +
Sbjct: 1 MLAAEELEAGEVLFTIPRTALLSQHTTSIHALLQEAQESLQ--SQSGWVPLLLALLHEYT 58
Query: 162 FEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
+S W Y S R ++W + E R L+ + I E + + N+ YN + L
Sbjct: 59 -ASNSHWQPYFSLWQDFRSLDHPMFWPQEERTRLLQGTGIPEAVDKDLANIQLEYNSIIL 117
Query: 220 RIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
+PD+F ++ +E +K +SF +VP AD+LNH
Sbjct: 118 PFMETHPDIFDPKLHTLELYKELVAFVMAYSFQEPLEEEEEDEKGPNPPMMVPVADILNH 177
Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
L+Y S Q + T + + G+++F +YG+ +N +LL YGF N D+
Sbjct: 178 VANHNANLEY--SPQCLRMVTTQPVRKGQEIFNTYGQMANWQLLHMYGFAEPYPGNSHDT 235
Query: 332 VELPLSLKKSDKCYKEKLEALRK 354
++ + + + K EA ++
Sbjct: 236 ADIQMVTLRRAALQRAKSEAQQQ 258
>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
Length = 391
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 14/134 (10%)
Query: 214 YNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLP------SMDGRVALVPW 265
+ DL I + DLF + N TF W + L + RLP + D A+ P+
Sbjct: 122 WKDLHPHIPAISKDLFTYTWLIVNTRTFYWEYPDLPNSHPRLPKKRKQLTADDCYAMCPF 181
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D NHS + D + S+G T DR+Y+ GE+VF+SYG +N LL+ YGF+
Sbjct: 182 MDYFNHS---DVGCDPESDSKGYSVTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSN 238
Query: 326 TN---PSDSVELPL 336
N P D + LPL
Sbjct: 239 RNDAIPLDHLILPL 252
>gi|116200882|ref|XP_001226253.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
gi|88175700|gb|EAQ83168.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
Length = 400
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 119/260 (45%), Gaps = 36/260 (13%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--AGEVLKQCSVPDWPLLATYLISEAS 161
G+VA + +++G+K+L VP LV S + PE +G + S+ LLA L +
Sbjct: 54 GMVAHRRLKRGQKILRVPTQLV---HSLHTVPERISGRLPPDMSI--HALLAANLTVDG- 107
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
S W + + L L + EL L RE ++ + +N ++
Sbjct: 108 MAGLSTWKDSLPTLGDFNTGLPFMWHKELQELL-PKPARELLKKQQDSFHRDWN----KV 162
Query: 222 FSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS--- 272
+PDL ++ V N TF ++ R + P +D R+A+VP AD NH+
Sbjct: 163 AKAFPDLRQDDYLHSWFVINTRTFYYAT----PRTEKYPPVD-RLAIVPIADFFNHADTG 217
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
CEV +DK G + + DR Y ++V+ISYG +N LL YGF+P N D V
Sbjct: 218 CEVT----FDKD--GFIVSADRDYHGDQEVYISYGAHTNDFLLAEYGFLP--AANRWDEV 269
Query: 333 ELP-LSLKKSDKCYKEKLEA 351
+ + L K +KE L+
Sbjct: 270 CVDEVILPKPSTAHKELLQG 289
>gi|255083504|ref|XP_002504738.1| predicted protein [Micromonas sp. RCC299]
gi|226520006|gb|ACO65996.1| predicted protein [Micromonas sp. RCC299]
Length = 453
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/307 (25%), Positives = 132/307 (42%), Gaps = 61/307 (19%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSC 134
+ + +WL +G + + + VG RG L A +N+R GE ++ +P IT D
Sbjct: 43 TDIAEWLVANG---GECSAVRAGVGSRGRGLFAARNLRAGESIVRIPLKACIT-DIASPN 98
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
P G P LA +++E SSRW+ Y+++LP++ A D L
Sbjct: 99 PYPG-------CPYSVTLAAAILTERDAGSSSRWAQYVASLPKEVVGY-----ANCDEAL 146
Query: 195 EASQIRERAI----ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
+ RA + + + + TY L + + + + +N + W+ + SR
Sbjct: 147 VGDEDVIRAAVGGDDALVDELQTYASL---VIGSHAAIV-QRGWNSRDWTWAMSQVHSRT 202
Query: 251 VRL----PSMDG-RVA-----------LVPWADMLNHS-------CE-------VETFLD 280
R+ P+ G RV L P+AD+LNH CE V L
Sbjct: 203 FRVDLEVPAAHGARVGNDGNRERTVRLLAPFADLLNHDSDQNEVCCEWGVEQRAVGNELG 262
Query: 281 YDKSSQGVVFT--TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
D S+ GV F R Q G + +SYG++S+ + YGF+P+ NP + L +L
Sbjct: 263 SDLSN-GVDFVVKASRDIQEGSEALVSYGERSDPHFFMYYGFLPK--INPFNRAPLFRTL 319
Query: 339 KKSDKCY 345
+++ + Y
Sbjct: 320 REASRWY 326
>gi|198470241|ref|XP_001355267.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
gi|198145358|gb|EAL32324.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
Length = 568
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 132/308 (42%), Gaps = 40/308 (12%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ +W G+ + I + GL A ++I + +L VP +L+ S+ PE
Sbjct: 137 TAFSEWAKAGGVKTDCLEIAIFPGYQLGLRATQDIAAEQPVLSVPRTLIF---SEEHLPE 193
Query: 137 AGEVLKQCSVPDWPLLATY-----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
L ++PLL + L+ E S W YI LP + ++LY++ ++
Sbjct: 194 TDRKL----FCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQ 249
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---------LFPEEVFNMETFKWS 242
R L + A+ + + Y ++ + PD LF + E ++W+
Sbjct: 250 R-LRGTAACTSALRQCRVIARQYANM-YKCAHIRPDASSASSMGVLFTQHGLCYELYRWA 307
Query: 243 FGILFSRLVRLP-----SMDGR-------VALVPWADMLNHS-CEVETFLDYDKSSQGVV 289
+ +R +P + DG AL+P+ DM NH ++ ++ YD +
Sbjct: 308 VSTVMTRQNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSY--YDSGVHQMD 365
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
T + GEQ FI YG +SN +LL+ GF+ + N D V++ L L SD +++
Sbjct: 366 CTAQEACKAGEQFFIYYGDRSNADLLVHNGFI--DVNNRKDYVKIRLGLGLSDALVEQRA 423
Query: 350 EALRKYGL 357
+ L + +
Sbjct: 424 KILARLNI 431
>gi|195168946|ref|XP_002025291.1| GL13316 [Drosophila persimilis]
gi|194108747|gb|EDW30790.1| GL13316 [Drosophila persimilis]
Length = 568
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 132/308 (42%), Gaps = 40/308 (12%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ +W G+ + I + GL A ++I + +L VP +L+ S+ PE
Sbjct: 137 TAFSEWAKAGGVKTDCLEIAIFPGYQLGLRATQDIAAEQPVLSVPRTLIF---SEEHLPE 193
Query: 137 AGEVLKQCSVPDWPLLATY-----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
L ++PLL + L+ E S W YI LP + ++LY++ ++
Sbjct: 194 TDRKL----FCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQ 249
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---------LFPEEVFNMETFKWS 242
R L + A+ + + Y ++ + PD LF + E ++W+
Sbjct: 250 R-LRGTAACTSALRQCRVIARQYANM-YKCAHIRPDASSASSMGVLFTQHGLCYELYRWA 307
Query: 243 FGILFSRLVRLP-----SMDGR-------VALVPWADMLNHS-CEVETFLDYDKSSQGVV 289
+ +R +P + DG AL+P+ DM NH ++ ++ YD +
Sbjct: 308 VSTVMTRQNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSY--YDSGVHQMD 365
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
T + GEQ FI YG +SN +LL+ GF+ + N D V++ L L SD +++
Sbjct: 366 CTAQEACKAGEQFFIYYGDRSNADLLVHNGFI--DVNNRKDYVKIRLGLGLSDALVEQRA 423
Query: 350 EALRKYGL 357
+ L + +
Sbjct: 424 KILARLNI 431
>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
Length = 548
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 127/307 (41%), Gaps = 31/307 (10%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L W D G+ + + I + GL A + + K E +L VP L+ + ++
Sbjct: 119 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRANRPLAKEELVLSVPRKLIFSEENNS 178
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
C G++ + + LA L+ E + S W YI LP + ++LY+T +++R
Sbjct: 179 DCRLFGKMTQATHLN----LAYDLLIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQMER 234
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
L + A+ + + Y L + +P F + + +
Sbjct: 235 -LRGTAACSLALRQCRVIAKQYAFLYRYAHTLAESSTGNRSHPGERGLFFTQRGLCYKLY 293
Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
+W+ + +R +PS AL+P+ DM NH ++ +F Y S+ +
Sbjct: 294 RWAVSTVMTRQNLVPSEKQEAQDSPKFISALIPYWDMANHRPGKITSF--YAAVSRQLEC 351
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
T GEQ FI YG +SN +LL+ GFV + N D V + + L +D ++
Sbjct: 352 TAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV--DVNNLKDYVNIRVGLSPTDALAAKRAS 409
Query: 351 ALRKYGL 357
L K +
Sbjct: 410 ILDKLNI 416
>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 110/271 (40%), Gaps = 50/271 (18%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RG+VA NI++GE L +P +V+ + + LK W L +I E S
Sbjct: 48 RGVVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKNLGP--WLSLVVVMIYEYSL 105
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+ S W Y LP + +L++W+ E + L+AS + ++ +R D IF
Sbjct: 106 GEQSNWKQYFQVLPSKFDTLMFWSEEEFSQ-LQASAVVDKVGKR---------DAEEDIF 155
Query: 223 SK-------YPDLFP------------------EEVFNMETF--KWSFGILFSRLVRLPS 255
K +PDLFP E M + ++F I +
Sbjct: 156 EKVLPLVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEG 215
Query: 256 MDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
DG + +VP AD+LN + + + +V + + G+++F Y
Sbjct: 216 EDGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEEG-ALVMRAIKPIKAGDEIFNDY 274
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
G+ +LL YG+V + D VELPL+
Sbjct: 275 GELPRSDLLRRYGYVT-DNYAQYDVVELPLT 304
>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
Length = 537
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 71/307 (23%), Positives = 125/307 (40%), Gaps = 31/307 (10%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
L W D G+ + + I + GL + + + K E +L VP L+ + +S
Sbjct: 115 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRSTRPLAKDELVLSVPRKLIFSEESNS 174
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
C G++ + + LA L+ E + S W YI LP + ++LY+T +++
Sbjct: 175 DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRTYIDVLPAKYSTVLYFTTKQME- 229
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
L + A+ + + Y L + +P F + E +
Sbjct: 230 LLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGERGLFFTQHGLCYELY 289
Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNH-SCEVETFLDYDKSSQGVVF 290
+W+ + +R +PS AL+P+ DM NH ++ +F Y + +
Sbjct: 290 RWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRQGKITSF--YAAVPRQLEC 347
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
T GEQ FI YG +SN +LL+ GFV + N D V + + L +D ++
Sbjct: 348 TAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV--DDYNLKDYVNIRVGLSLTDALAAKRAS 405
Query: 351 ALRKYGL 357
L K +
Sbjct: 406 ILDKLNI 412
>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
Length = 963
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 109/264 (41%), Gaps = 32/264 (12%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE-----AGEVLKQCSVPDWPLLAT 154
RG+VAL++I L VP S ++++++ K PE A EV + W L
Sbjct: 535 RGIVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLII 594
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------------- 201
++ E S+W YI LP + ++W+ AELD L+AS R
Sbjct: 595 VMMYEYFKGSESKWKPYIDVLPSSFETPMFWSDAELDE-LQASATRSKVGKASAEEMFQD 653
Query: 202 ------RAIERITNVIGTYNDLRL-RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
RA + + TY+D L ++ + F+ + V
Sbjct: 654 KVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEEWVEER 713
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+ +VP AD+LN E ++Y + T R + GE++F YG N EL
Sbjct: 714 EAKSTMGMVPMADILNADAEYNAHVNY--GDDALTVTALRTIKAGEEIFNYYGPHPNSEL 771
Query: 315 LLSYGFVPREGTNPSDSVELPLSL 338
L YG+V + + D VELP +L
Sbjct: 772 LRRYGYVTPKHSR-YDVVELPWTL 794
>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 441
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 19/285 (6%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
++L +WL + + QK D E G+ A K+++KGE + +P L I+ +
Sbjct: 10 ASLLQWLRSKSVTTDSLHFQKSDGHEGVGVYAAKSLQKGEITMEIPFHLTISKVTAMQSD 69
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ + + ++A +L+ E S + +I +LP Q ++W ++ LE
Sbjct: 70 LRQILQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQFDLPIFWNDSDFAE-LE 128
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET-------FKWSFGILFS 248
+ + A + + + + + Y EE N+ T ++W+ I+++
Sbjct: 129 GTNVALLAKIMRKQIEADFQAIHIPLLRAY-----EERLNLRTSEISISDYEWALSIIWT 183
Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVET----FLDYDKSSQGVVFTTDRQYQPGEQVF 303
R + + L P DM NHS V+ F+ YD + + + +
Sbjct: 184 RAFGITRYGEYLRVLCPALDMFNHSVLVQEPLDEFIKYDHMKDVLAHCVVMETSANDPFY 243
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
ISYG S+ +LL SYGFV N + ++L + + +D +K K
Sbjct: 244 ISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLK 288
>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
Length = 762
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 102/252 (40%), Gaps = 27/252 (10%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
S WL G K+AI+ +G RG+VA GE LL VP +L++T D +
Sbjct: 18 SEFVAWLRAGGASFDKLAIKHTALG-RGVVATAAYEPGETLLSVPEALLLTVDKASRRAD 76
Query: 137 AGEVLKQCSVPDWPL------LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAE 189
L LA +L + +S W Y + + R L +W A+
Sbjct: 77 VAASLGAARARGVDANGGNLALALFLAGD----RSEAWRPYRNVISRSVSHLPCFWPTAD 132
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
+ L S + E + R + L L V + + F ++ + SR
Sbjct: 133 -EALLAGSPLGEDVVRRRDEIRRDCRSLGL-----------TAVEDRQAFAFAEAQVLSR 180
Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+ +G A+VP+AD++N + E +D+ V R+ GE V SYG K
Sbjct: 181 AF---AFNGTRAMVPFADLMNTARHHERHVDFAFERGAFVMRAVRRGAAGEPVTDSYGPK 237
Query: 310 SNGELLLSYGFV 321
SN LL+YGF
Sbjct: 238 SNARYLLNYGFA 249
>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 470
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 149/320 (46%), Gaps = 46/320 (14%)
Query: 69 EIDSLE-NASTLQKWLSDSGLPPQ-KMAIQKVDVGE----RGLVALKNIRKGEKLLFVPP 122
E+D E +T WL + G+ KMA+ VD+ + RG+VA +I E + +P
Sbjct: 2 EVDDFEARTATFSSWLKEMGVRTNPKMAL--VDLRQEGRGRGVVATGDIDDDEIIFSIPR 59
Query: 123 SLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
+ V+ A + P + + ++ +P W +L + L++EA E +S+W+ Y++ LP + SL
Sbjct: 60 NAVLNAQNVAPLPVSRRLFEK--MPSWLVLTSILMTEAQME-NSKWAPYLAVLPERLDSL 116
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTY------NDLRLRIFSKYPDLFPEEVFNM 236
++W+ +EL ++ +++ + ++ +Y + K + F++
Sbjct: 117 VFWSDSELAELQASAVVKKIGKKDAEDMFKSYIAPQGLKHSSTEMCHKVASVIMAYAFDI 176
Query: 237 ------ETFKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGV 288
T G LV D + ++++P ADMLN D D+++ +
Sbjct: 177 PDPSDAPTSGGKGGEAGDDLVSDDGEDEKTILSMIPLADMLN--------ADADRNNARL 228
Query: 289 VFTTD----RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSD----SVELPLS 337
+ + R +P GE++F YG+ +LL YG+V +G + D S EL +S
Sbjct: 229 ICDNEELEMRAIKPISKGEEIFNDYGQLPRSDLLRRYGYVT-DGYSAYDVAEISAELIVS 287
Query: 338 LKKSDKCYKEKLEALRKYGL 357
L ++ K + L L + GL
Sbjct: 288 LFRNGKVHP-SLHKLTQDGL 306
>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
Length = 232
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 90/186 (48%), Gaps = 12/186 (6%)
Query: 253 LPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
+PS DG AL+P DM NH T D++ +S R ++ GEQVFISYG +
Sbjct: 7 VPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDRCECYALRDFKKGEQVFISYGPR 65
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
+N + + GFV + N D +L L + K+D KE++E L K L + F ++
Sbjct: 66 TNSDFFVHSGFVCMD--NEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPGT 123
Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
P+ A+L V SM+ K E S+K+ K + C ++E +F+L +
Sbjct: 124 EPISDTLLAFLRVF--SMR-KAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQ 180
Query: 427 SSISKY 432
I+ Y
Sbjct: 181 LLIANY 186
>gi|452825745|gb|EME32740.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
N-methyltransferase, putative isoform 2 [Galdieria
sulphuraria]
Length = 331
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 74/295 (25%), Positives = 124/295 (42%), Gaps = 25/295 (8%)
Query: 34 KRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKM 93
+RC H+ ++ +V T +SR ++ C L+ WL+ + K+
Sbjct: 40 RRCFHKPIL--NVPTLRPSSRLCKPSRLFLV---CSSGRLD---LFYHWLTRENVYMPKI 91
Query: 94 AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPLL 152
+ + G RG+VA++ I E L VP L + C + V + S +W +
Sbjct: 92 KLDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQVTEHEECTMSEFVDPELWSQENWYVK 151
Query: 153 ATYLISEASFEKS-SRWSNYISALPRQPYS-LLYWTRAELDRYLEASQIRERAIERITNV 210
+ + + + S W YI LP + L+YW+ +EL +Q++ R + +
Sbjct: 152 LSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSEL------AQLQYRPLIEEVKI 205
Query: 211 IGTYND-LRLRIFSKYPDLFPEEVF----NMETFKWSFGILFSRLVRLPSMDGRV-ALVP 264
Y + L R+F P V+ F W+ ++ SR +P + + AL+P
Sbjct: 206 NQYYREALYTRVFESLSS--PVRVWLQNEKENVFFWALDMVQSRAFGIPDVGNKTYALLP 263
Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
DMLNH +T YD + T + PG ++ISYG N LL YG
Sbjct: 264 MMDMLNHRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYG 318
>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
Length = 544
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 134/318 (42%), Gaps = 45/318 (14%)
Query: 71 DSLENASTLQK------WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
DS ++ S L K W D G+ + + I + GL A + + K E +L VP L
Sbjct: 109 DSPDDQSRLAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKEELVLTVPRKL 168
Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS----SRWSNYISALPRQPY 180
+ + ++ C G++ + AT+ + + EK S W YI LP +
Sbjct: 169 IFSEENNSDCRLFGKMPQ----------ATHWVYDLVIEKIRGEFSEWRPYIDILPAKYS 218
Query: 181 SLLYWTRAELDRY-------LEASQIRERA-----IERITNVIGTYNDLRLRIFSKYPDL 228
++LY+T +++R L Q R A + R + + +D +
Sbjct: 219 TVLYFTIKQMERLRGTAACSLALRQCRVIAKQYAFLYRYAHTLTEPSDGNRSHPGERGLF 278
Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFL 279
F + + ++W+ + +R +PS AL+P+ DM NH ++ +F
Sbjct: 279 FTQHGLCYKLYRWAVSTVMTRQNLVPSEKQESQDSPKFISALIPYWDMANHKPGKITSF- 337
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
Y S+ + T + GEQ FI YG +SN +LL+ GFV + N D V + + L
Sbjct: 338 -YAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV--DVNNLKDYVNIRVGLS 394
Query: 340 KSDKCYKEKLEALRKYGL 357
+D ++ L K +
Sbjct: 395 PTDALAAKRASILDKLNI 412
>gi|343470335|emb|CCD16940.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 593
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 10/152 (6%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--------FNMETF 239
A L ++L + RE+ +E VI + I + Y + E F +E F
Sbjct: 206 AYLQQFLCFRRHREKVLEEQDCVIEEFRTFLSLISTYYSHVCCEASKTRLETFSFTLEQF 265
Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
W++ L SR SM A++PW D NHS + +DK VF T G
Sbjct: 266 TWAYNTLMSRAFAYDSM--VWAVMPWVDYFNHSTLNNATMRFDKRLNCYVFVTVVPIAKG 323
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
EQ+F+ YG ++ ELLL YGF PS S
Sbjct: 324 EQIFLQYGSYTDAELLLWYGFTVTPSLFPSYS 355
>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
Length = 448
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 108/244 (44%), Gaps = 27/244 (11%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC---PEAGEVLKQCSVPDWPLLATY 155
D G RGL+ + I++G+ ++ +P ++++ + P Q + + T+
Sbjct: 54 DTG-RGLMVPRKIKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQ--AITTF 110
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
LI + ++S W Y+ LP + +Y+ E D +R + I +Y
Sbjct: 111 LIYQKHIGETSFWKPYLDILPNEYTHPVYF--GEEDFLYLPHSLRANIKAKKQECIKSYE 168
Query: 216 DLRLRIFSKYPDLFP--EEVFNMETFKWSFGILFSRLVR--------LPSMD----GRVA 261
+L+ F L P E +F + ++W++ + +R + L ++D G +
Sbjct: 169 ELK-PFFPSLEPLLPNWEGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTS 227
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
LVP D+LNHS T L KS + T + Y+ G+QV Y + N LLL+
Sbjct: 228 LVPMVDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYRRADNQTLLLN 287
Query: 318 YGFV 321
YGFV
Sbjct: 288 YGFV 291
>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 372
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 121/265 (45%), Gaps = 35/265 (13%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
++ L W ++ G+ + Q + G++A ++I+ GE +LFVP L T
Sbjct: 1 MDTYEELLSWATERGIKLSGIKPQNILSRGTGIIATRDIQAGETILFVPFKLFRTLKH-- 58
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP---YSLLYWTRAE 189
P+A ++ LLATYL S +K+ ++ LP + + AE
Sbjct: 59 -VPKAISRRLPRNMSLHALLATYL----SLDKTDTFAIPNKTLPDLSSFEAGMPFLWPAE 113
Query: 190 LDRYLEASQI-----RERAIERITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
L +L + ++R+ +R +++ Y+++ S+ L + N +F +
Sbjct: 114 LHPFLPKPALDLLMKQQRSFKRDWDIVSKAYSNI-----SQDQYLHAWLLVNTRSFYCTT 168
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGE 300
I+ RLP D R+A++P AD+ NH+ CE +S+ F DR Y+ GE
Sbjct: 169 PIM----ERLPH-DDRLAILPVADLFNHADVGCEARF------ASENYSFIADRDYRTGE 217
Query: 301 QVFISYGKKSNGELLLSYGFVPREG 325
++ ISYG S LL YGFVP E
Sbjct: 218 ELHISYGSHSTDFLLTEYGFVPTEN 242
>gi|452000836|gb|EMD93296.1| hypothetical protein COCHEDRAFT_1170833 [Cochliobolus
heterostrophus C5]
Length = 643
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 105/237 (44%), Gaps = 20/237 (8%)
Query: 141 LKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
L+QC +PD L L+ + S WS Y++ LP R + L++ + +L
Sbjct: 88 LQQCRDKIPDHILAYLLLLEQRDKGNDSPWSAYLACLPGPRDMTTPLWFDDVDF-AFLAG 146
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR---- 252
+ + A ER + + I K+ DL +V ++E+ +W+ I SR
Sbjct: 147 TSLAPAAKERKAELRQQWEHALQVI--KHLDLHLADVISLESLQWAATIFTSRAFISTHI 204
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
LP + L P D+LNHS + D++ S + +PGE++F +Y K N
Sbjct: 205 LPGRETIPMLFPVIDILNHSVTAKVEWDFEPHRSFALKCLQADSVKPGEELFNNYAPKQN 264
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
ELLL YGF + NP + L L+ + + Y +L GL + P ++T
Sbjct: 265 DELLLGYGFCLED--NPIEQFALKLAFQPQLQQYAHQL------GLLDGKNVPFEMT 313
>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
Length = 823
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 123/273 (45%), Gaps = 20/273 (7%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS----------K 131
W+ + G + ++ V GL + K +G+ LL +P L +
Sbjct: 7 WVKEHGGHIDGVYVKNFPVYGNGLCSSKEFHEGDTLLSIPYHLQLNTIELHNVFESMVPG 66
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+ P GE K + ++ YL + EK + YI+ LP L ++ EL
Sbjct: 67 FEVPRLGEGAKNRD-DENSVVYLYLAMNKTNEKCFHFP-YINTLPTTFSCPLSYSENEL- 123
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-- 249
+ L+ +++ +E+ + +D + +YP F + + W+ + +SR
Sbjct: 124 KMLKGTKLL-VTVEKTKTFLKKLSDYYETLTHQYPTRFQQFDDFYQRLVWAHQVFWSRAF 182
Query: 250 LVRLPSMDGRVA-LVPWADMLNHSCEVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
LV P G VA L+P+AD NH+ E + T++ ++ + T ++ GEQ+F +Y
Sbjct: 183 LVIYPDPIGDVASLIPFADFSNHNTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYR 242
Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
+ N ++LL YGFV E NP D V L ++ K+
Sbjct: 243 IRPNEKMLLGYGFVISE--NPYDEVLLRINFKE 273
>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
Length = 573
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/320 (23%), Positives = 132/320 (41%), Gaps = 49/320 (15%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
E L + +W G+ + I + GL A ++I E +L VP L+
Sbjct: 120 EKTRLAKVAAFNEWARAGGVQSDCVEITTFPGYQLGLRAKRDIAAEELVLSVPRKLIF-- 177
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
S+ PE L + + P + LI E +S W +I LP + ++LY+T
Sbjct: 178 -SEELLPEWKRELFR-NFPTHLNVTYTLIIEKVRGAASAWQPFIDTLPTRYSTVLYFTVD 235
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRL--------RIFSKYPDLFPEEVFNMETFK 240
++ R L + A+ + Y + + + +LF E E ++
Sbjct: 236 QMQR-LRGTSACSAAMRHCLVIARLYASMYKCAYIQPGDNVMAAKANLFTEYGLCYELYR 294
Query: 241 WSFGILFSRLVRLP---SMDGRV----------------------------ALVPWADML 269
W+ + +R +P S G V AL+P+ DM
Sbjct: 295 WAVSTVTTRQNLVPRELSTVGEVDQVCQLGGFEGTEIKRDAETGARNAPISALIPYWDMT 354
Query: 270 NHSC-EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
NH C ++ ++ YD+++Q + T ++ GEQ FI YG +SN + L+ +GF+ + N
Sbjct: 355 NHRCGKITSY--YDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADRLVHHGFL--DMHNL 410
Query: 329 SDSVELPLSLKKSDKCYKEK 348
D V++ L L +D +++
Sbjct: 411 KDYVQIRLGLSPTDPLVEQR 430
>gi|357504157|ref|XP_003622367.1| SET domain-containing protein [Medicago truncatula]
gi|355497382|gb|AES78585.1| SET domain-containing protein [Medicago truncatula]
Length = 497
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/310 (22%), Positives = 123/310 (39%), Gaps = 57/310 (18%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
L++W+ +G A+Q VD E G+ AL I G+ + +P +T + +C
Sbjct: 9 LKRWMKSNGFEWSS-ALQFVDTPEEGISVKALCEINAGDVVAKMPKKACLTIKTSGAC-- 65
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
E+++ + + LA ++ E S + S W Y+ LP+Q L W+ E+D+ L
Sbjct: 66 --EIIENACLGGYLGLAVAIMYERSLAEESPWEGYLQLLPQQECLPLVWSVEEVDQLLCG 123
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+++ + E V + + L + P F +E + + ++ SR +
Sbjct: 124 TELHQTVQEDKALVYEDWRENILPLLDSEPSKLNPAFFGVEQYFAAKSLISSRSFEIDDY 183
Query: 257 DGRVALVPWADMLNHSCEVET-------------------FLD---------YDKSSQGV 288
G +VP AD+ NH E +D DK+ +GV
Sbjct: 184 HG-FGMVPLADLFNHKTGAEDVHFTALSSNNESEDDTDDEIVDEEALAQNSSMDKTEKGV 242
Query: 289 VFTTDRQY-------------------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
+D +Y G +VF +YG N LL YGF ++ T
Sbjct: 243 --DSDMEYSSITEDDTSMLEMVMIKDVSSGAEVFNTYGILGNAALLHRYGFTEQDNTYDI 300
Query: 330 DSVELPLSLK 339
+++L L L+
Sbjct: 301 VNIDLELVLQ 310
>gi|343475275|emb|CCD13287.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 593
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 10/152 (6%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--------FNMETF 239
A L ++L + RE+ +E VI + I + Y + E F +E F
Sbjct: 206 AYLQQFLCFRRHREKVLEEQDCVIEEFRTFLSLISTYYSHVSCEASKTRLETFSFTLEQF 265
Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
W++ L SR SM A++PW D NHS + +DK VF T G
Sbjct: 266 TWAYNTLMSRAFAYDSM--VWAVMPWVDYFNHSTLNNATMRFDKRLNCYVFVTVVPIAKG 323
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
EQ+F+ YG ++ ELLL YGF PS S
Sbjct: 324 EQIFLQYGSYTDAELLLWYGFTVTPSLFPSYS 355
>gi|241955755|ref|XP_002420598.1| SET domain-containing protein, putative; lysine methyltransferase,
putative [Candida dubliniensis CD36]
gi|223643940|emb|CAX41679.1| SET domain-containing protein, putative [Candida dubliniensis CD36]
Length = 542
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 133/316 (42%), Gaps = 68/316 (21%)
Query: 79 LQKWLSDSGLPPQ-KMAIQKV-DVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
Q WL + + K+AI D + RG++AL+NI E + +P S+V+ D+
Sbjct: 11 FQDWLIKNNVEISPKIAIHDYRDTNQGRGIIALQNINPDEMIFKLPRSIVLNIDNNSLIK 70
Query: 136 EAGEVLKQCSVPD-WPLLATYLISEASFE---------KSSRWSNYISALPRQPYSLLYW 185
+ LK+ + D W L L E F+ +S W Y++ LP L+YW
Sbjct: 71 QYPSALKKLRLLDQWIGLIIVLSFEMKFKFNPSDNDDNNNSFWYEYLNILPNDFNQLIYW 130
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
EL+ +L+ S I +R IG N+L + +++ + +++ +E FK S +
Sbjct: 131 NDEELN-HLQPSCILDR--------IGKENNLNM--YNQIISIINQDLSIIEEFKSS-PL 178
Query: 246 LFSRLVRLP------SMDGRV---------------------------------ALVPWA 266
F ++ S D V ++VP+A
Sbjct: 179 TFEEYNKVATIIMSYSFDVEVPKSKPNKEMTENGTNEEDDEDDYEEEEDHEYYKSMVPFA 238
Query: 267 DMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D LN + L Y S+ ++ T + GEQV+ +Y N ELL YG+V G
Sbjct: 239 DTLNADTHLNNAILIY--STDQLIMTCIKVIAKGEQVYNTYSDHPNSELLRRYGYVELNG 296
Query: 326 TNPSDSVELPLSLKKS 341
+ D E+PLS+ K+
Sbjct: 297 S-KYDFGEIPLSIIKT 311
>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
Length = 486
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 77/164 (46%), Gaps = 11/164 (6%)
Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY----NDLRLRIFSKYPDL 228
S LP Q S W EL YL AS + +A ER+ + + ND + + D+
Sbjct: 306 SDLPDQ-LSTFRWEDTELS-YLRASPLYGKARERLEMITTEFGQVQNDFCTCVLEQALDV 363
Query: 229 FPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ 286
+P+ ++E K + +FSR + + D + ++P D NH+ L ++
Sbjct: 364 WPQLFGKVSLEDLKHVYATVFSRSLAIGE-DSTLVMIPMLDFFNHNATSFAKLSFNGLLN 422
Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
V T DR Y +Q++I+YG SN EL L YGF E NP D
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPE--NPYD 464
>gi|412992279|emb|CCO19992.1| predicted protein [Bathycoccus prasinos]
Length = 640
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 135/325 (41%), Gaps = 43/325 (13%)
Query: 68 CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPS 123
C+I++ A L KW + G + + RGL A ++IR GE +L +P
Sbjct: 195 CKIETKGYAVALTKWAASQGKDANVSKVAPCLLSSMNDARGLCATEDIRAGENILEIPRR 254
Query: 124 LVITADSKWSCPEAG------EVLKQCSVPDWPLLATYLISEASFEKSSR---WSNYISA 174
+++ A + E G +L++C ++ +++ E K+ + WS Y +
Sbjct: 255 MLLDAGT-ICISEQGPFGDLLRILERCGAD--TIMTLWIMKERMKMKTKQETFWSLYFLS 311
Query: 175 LP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
LP Q + L W + L + I E + V Y+ L + + P+ F
Sbjct: 312 LPDGSQKLTPLSWPEDIVRVGLGNTPIFETVMHERQKVRNGYDALLPSLLANCPESFEG- 370
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV------------ALVPWADMLNHSC--EVETF 278
N E F WS+ S L S V L P A NH +
Sbjct: 371 --NQEEF-WSYDQYISALELWMSYAMTVKPVHNSDSGTIDVLSPVAFFCNHGIYPHCVHY 427
Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL-- 336
S + +VF R + E++ +SYG KSNGELLL YGF + NP DS+++ L
Sbjct: 428 SQLRLSDECLVFPAMRDIEKNEEIMLSYGAKSNGELLLFYGFCIDD--NPYDSIDITLDF 485
Query: 337 -SLKKSDK--CYKEKLEALRKYGLS 358
SL +K K + E L K+ L+
Sbjct: 486 DSLNGVEKPEVRKRREELLVKHDLT 510
>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 813
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 131/285 (45%), Gaps = 20/285 (7%)
Query: 85 DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC 144
D+ L + A D G RG AL++ +GE LL +P T + V C
Sbjct: 5 DATLARRVEAHDFADTG-RGQRALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCC 63
Query: 145 SVPDWPLLATYLISEA-SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
+ D ++A ++ +E EK++R + +++ LPR + +W+ EL + +RE
Sbjct: 64 ARHD-DVVALHVCAERFRGEKATR-AAHVATLPRSFDTAFFWSEEELRELTGTTCLRETM 121
Query: 204 IERITNVIGTYNDLRLRIFS-KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-- 260
R Y L ++ + E + E + W+ L+SR L DG+
Sbjct: 122 NLR-EETKNDYETLTKKMEAIGEGGWMREHEVDYERYAWARSNLWSRQCDLLMPDGKRTR 180
Query: 261 ALVPWADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQPGEQVFISY--GKKSNGEL 314
A+VP D+ NHS + L+ +K+ V D Y+ GEQ FISY G+ +N +L
Sbjct: 181 AMVPTFDIFNHSAKAPLGKTHKLNAEKNCVTVYAADD--YKAGEQAFISYGSGEAANSKL 238
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE-ALRKYGLS 358
L YGF + NP + +++ L++ DK K LE ALR ++
Sbjct: 239 LTWYGFCIDD--NPYEELDVTLTI-TVDKLRKTVLETALRASAVA 280
>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
domain-containing protein 6
gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
Length = 455
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/295 (22%), Positives = 128/295 (43%), Gaps = 40/295 (13%)
Query: 70 IDSLENASTLQK---WLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
+D L+N + + W GL P+ + V + G++A ++I GE L VP S
Sbjct: 13 VDHLQNGFPVTRFLAWCEKVGLELNPKVYISTEGTVSQYGMLAREDIADGELLFTVPRSA 72
Query: 125 VITADSKWSCPEAGEVLKQ-----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR-- 177
+++ ++ E+L++ S W L L+ EA+ + SS W+ Y P
Sbjct: 73 ILSQNTT----RIQELLEKEQESLQSTSGWVPLLISLLYEAT-DSSSLWAPYFGLWPELD 127
Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
P ++W+ E + L+ + + E + N+ YN + L ++ P+ F ++
Sbjct: 128 PPDMPMFWSEEEQTKLLQGTGVLEAIRNDLKNIEEEYNSIVLPFITRNPEKFCPMKHTLD 187
Query: 238 TFKWSFGILFSRLVR----------------LPSMDGRVALVPWADMLNHSCEVETFLDY 281
+K + + + LP M +VP AD+LNH L++
Sbjct: 188 LYKRLVAFVMAYSFQEPLEENDEEDEDEKDILPPM-----MVPVADLLNHVAHHNAHLEF 242
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+ + + T + G+++F +YG+ +N +LL YGF N +++ ++ +
Sbjct: 243 --TPECLRMVTTKSVHAGQELFNTYGEMANWQLLHMYGFAEPHPQNSNETADIQM 295
>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 510
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 121/288 (42%), Gaps = 59/288 (20%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL----KQCSVPDWPLLAT 154
D G RGL A ++++KGE +L VP S ++T D S + G +L ++ L
Sbjct: 51 DAGGRGLGAARDLKKGELVLRVPKSALLTKD---SFLKDGLLLSAINNHSALSPTQTLTV 107
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
L+ E S +SS W Y+ LPR Y +L T +E ++ +A Q+ + AI I
Sbjct: 108 CLLYEMSKGQSSFWYPYLMHLPRS-YEILA-TFSEFEK--QALQV-DDAIWTAEKAISKA 162
Query: 215 NDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH-- 271
R +S +L + + + W+ + SR + +P D L P D N+
Sbjct: 163 ELDRKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIP-WDEAGCLCPVGDFFNYAA 221
Query: 272 -----------------SC----------------------EVETFLD--YDKSSQGVVF 290
SC ++++ D +D+ F
Sbjct: 222 PGEESSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSLTDGGFDEDKAAYCF 281
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ Y+ G QV +SYG +N ELL YGF+ E NP+D V +PL L
Sbjct: 282 YARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNE--NPNDKVFIPLEL 327
>gi|428182191|gb|EKX51052.1| hypothetical protein GUITHDRAFT_134587 [Guillardia theta CCMP2712]
Length = 365
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/271 (24%), Positives = 116/271 (42%), Gaps = 32/271 (11%)
Query: 86 SGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC- 144
+G+ K+ +++ + G RG+ A K I +GE ++ VPPSL+ + ++ AG LK
Sbjct: 2 AGVKFPKLEVRR-EGGVRGMYATKKIDRGEVIVSVPPSLLFSYET------AGGALKDVW 54
Query: 145 -SVPDWPLLATYLISEASFEKS--SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQI 199
D L + F SRW ++ +P + + W+ +L+ E +
Sbjct: 55 KRTKDMQELDRLTLLLLYFSSKVRSRWDFFLCGIPGMNELGPAVLWSPKKLNETCEREEY 114
Query: 200 RERAIERITNVIGTYNDL-RLRIF---SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+ N Y L R + K+P +F ++ + W+ + SR+ +
Sbjct: 115 SS-LCSFVENRRSMYKRLWRTEVAPLPRKFPHIFSQQDTGYSNYLWAIAAVLSRMWLMRR 173
Query: 256 MD-------------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ + + P A++LNH + + +D Y+PGEQV
Sbjct: 174 FEEPEFYPNGTWIGPAKWVMAPVAELLNHKPRAGHIRWGSQRRPHLEVVSDVSYRPGEQV 233
Query: 303 FISYGKKSNGELLLSYGF-VPREGTNPSDSV 332
F+SYG K N ELLL YGF +P T +D +
Sbjct: 234 FVSYGNKCNLELLLEYGFEIPGNPTKCADEI 264
>gi|303284022|ref|XP_003061302.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226457653|gb|EEH54952.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 536
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/335 (25%), Positives = 124/335 (37%), Gaps = 55/335 (16%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW--PL-----LA 153
G RG+VA I G LL VP +L+++ + + L + D PL LA
Sbjct: 50 GWRGVVATAPIAAGATLLRVPTALLMSGRTAAADDVLARALSEHHERDGEPPLTPTDRLA 109
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
+L+ E S S W Y+ LPR WT AE + A Q+ A++
Sbjct: 110 VHLLRELSRGAESFWHLYLRQLPRSYALTCGWTAAERN----ALQL-PHAVDAADRSAAA 164
Query: 214 YNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
D R + P + + W+ + SR V +P D AL P D+ N+
Sbjct: 165 CRDAWARATPVMEKIGLPATYRSFGAWAWAAATISSRTVFVP-FDAAGALCPVGDLFNYV 223
Query: 273 CE--------VETFLD----------------------------YDKSSQGVVFTTDRQY 296
V T L+ + ++S VF R Y
Sbjct: 224 PPTPPHVPKVVGTPLEGPSDERDDEEDDENDSYFLRRGVGGDGAWHEASDAWVFVARRDY 283
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
+ GE++ + YG+ +N LL YGF G N D L + K D + + + Y
Sbjct: 284 RKGEEISLCYGQHTNLGLLTHYGFTMSHGENAHDEAPLVVDAKTLDVDFPAAVGDVGSYR 343
Query: 357 LSASEC-FPIQITGWP----LELMAYAYLVVSPPS 386
+ C FP W L L A+A LV PP
Sbjct: 344 NAHEICVFPRGGVSWSTLERLRLAAHATLVGKPPG 378
>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 467
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 125/291 (42%), Gaps = 54/291 (18%)
Query: 74 ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLV---ALKNIRKGEKLLFVPPSLVIT- 127
+ STL +W S G L P Q G V A ++ E ++ +P SL ++
Sbjct: 5 DTISTLVEWASSHGATLHPSIEVYQDPQTGLSFRVKPSAKSPVQPYEPIVQLPTSLTLSY 64
Query: 128 ----ADSKWSCPEA--GEVLKQCS---VPDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
++ + P+A GE L + + V L+ YL + SF W YI ALP Q
Sbjct: 65 FNAVSEQGTASPDAFRGEFLARAAPHVVGRLFLIKEYLKRDKSF-----WWPYIRALP-Q 118
Query: 179 P-------YSLL-YWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKY 225
P ++L +W E + LE + + E I++I N + DL+ LR+
Sbjct: 119 PGQGNKSQWALAPFWDDDEAE-LLEGTNV-EVGIDKIRNDV--RRDLQEAQELLRLHGDA 174
Query: 226 PDLFPEEVFNMETFKWSFGILFSRLVR------------LP---SMDGRVALVPWADMLN 270
F + E ++W++ I SR R LP +MD L+P D+ N
Sbjct: 175 DGAF-GKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRSLPRGVTMDDFSVLLPLFDIGN 233
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
H E D D Q R + PG+QVF +Y K+N ELLL YGF+
Sbjct: 234 HDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMKTNAELLLGYGFM 284
>gi|145344075|ref|XP_001416564.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576790|gb|ABO94857.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 398
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 110/247 (44%), Gaps = 24/247 (9%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
L+ L S ++ Y +P + + +W+ + + L S E I +
Sbjct: 45 LIVCLLYERYELGDRSAFAEYFRTMPGEFDTPTHWS-DDTAKELRGSDTYEVDIVDEYKL 103
Query: 211 IGT-YNDLRLRIFSKYPDLF-PEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADM 268
+ T +N LR+R+F Y D+F + ++ +W++ + +R R+ DG +ALVP DM
Sbjct: 104 LNTVWNALRVRVFDVYTDVFVGKAARSLYALRWAWTVAHARATRVSGKDG-LALVPVIDM 162
Query: 269 LNHS-----------CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+ + E F YD + VV R Y PGE++ +G +NGE +
Sbjct: 163 IRECGKDVDADKTDIVDDEGFAVYDPHADEVVVYAKRDYAPGEELCERFGGWNNGESVQH 222
Query: 318 YGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSA--SECFPIQITGWPLEL 374
G++P TN + + V + L+ +K K E +RK G C P + L++
Sbjct: 223 LGYLPDVHTNSTRNCVLMVLTPEK-----KRNEEKVRKAGFDVPWRVCVPSAASESSLDM 277
Query: 375 M-AYAYL 380
+ AYA L
Sbjct: 278 LSAYAEL 284
>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
Length = 377
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 116/252 (46%), Gaps = 13/252 (5%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
+E L +W D G+ +A +++ G+VA K ++ E+LL VP S + + ++
Sbjct: 1 MEAYDELLRWAQDRGVEIHGIAPREIPGKGVGMVATKPLKANERLLHVPTSALRSLET-- 58
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELD 191
P+ + L + L A + + + +K + W+ + + +L L W+ L
Sbjct: 59 IRPKVKKALPADTRVHALLAADLALDKPTTKKYAPWNAIVPSAADLATALPLAWSSPVLH 118
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN--METFKWSFGILFSR 249
YL RA+ R + + + +P L P+ + + T +F +R
Sbjct: 119 NYLPPPA---RALLRAQQ--AKFARDWAAVSAAFPALAPDAFRHAWLLTNTRTFYHETAR 173
Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
RLP D R+ L P AD+ NH+ + + + +S + T DR Y GE+V I YG+
Sbjct: 174 TARLPH-DDRMVLQPVADLFNHAADGGCEVAFTPASFAI--TADRAYAEGEEVLICYGRH 230
Query: 310 SNGELLLSYGFV 321
SN LL+ YGFV
Sbjct: 231 SNDFLLVEYGFV 242
>gi|66819805|ref|XP_643561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
gi|60471605|gb|EAL69561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
Length = 526
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 106/225 (47%), Gaps = 35/225 (15%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGER---------------------------GLVAL 108
A + KWL D+G+ I+ V +G+ G+++L
Sbjct: 13 AEIINKWLIDNGVRINNKLIKIVYLGKENNFEQTENTTATTTTSERINDSIVSGLGVISL 72
Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRW 168
K ++ + + +P S++++ + +L++ + + + LI EAS + S+W
Sbjct: 73 KELKVDDIVAKIPKSIILSIHT----SSISNILEKYKIENNIGTSIALIHEASLGEKSKW 128
Query: 169 SNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK-YPD 227
YIS+LPR+ + W +E + L+ + I + + + Y D+ I SK +P+
Sbjct: 129 YGYISSLPRKVDVPILWD-SESRKLLKGTAIEDVLNDDDILINQVYADVIESILSKNHPE 187
Query: 228 LFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
+F + E++++E FK + I+ SR + S G +LVP AD+ NH
Sbjct: 188 IFGDKELYSIENFKIANSIISSRAFCVDSYHGD-SLVPLADIFNH 231
>gi|348519120|ref|XP_003447079.1| PREDICTED: N-lysine methyltransferase setd6-like [Oreochromis
niloticus]
Length = 460
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/304 (25%), Positives = 133/304 (43%), Gaps = 29/304 (9%)
Query: 52 ASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLP-PQKMAIQKVD-VGERGLVALK 109
A+R K ++ G E+ L+ + +W GL K+ + K V E G++A
Sbjct: 2 AARAKRAKVED----GSELSPLQ---SFLQWCDGVGLELSSKVCVSKEGIVAEYGMLAKD 54
Query: 110 NIRKGEKLLFVPPS-LVITADSKWSCPEAGEVLKQCSVPDW-PLLATYLISEASFEKSSR 167
+I +GE L +P S L+ +K S E S W PLL L S + S
Sbjct: 55 DIEEGEVLFTIPRSALLHQGTTKVSTLLEKEQSSLQSSSGWVPLLLALLYEYTSSQ--SH 112
Query: 168 WSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
W Y+S + ++W++ E R L+ + + E + N+ Y D+ L +++
Sbjct: 113 WKAYLSLWTDFKTLDHPMFWSKEERGRLLKGTGVPEAVDRDLANIQREYTDVVLPFMTRH 172
Query: 226 PDLFPEEVFNMETFKW--SFGILFS----------RLVRLPSMDGRVALVPWADMLNHSC 273
PDL+ ++ + +F + +S +VP ADMLNH
Sbjct: 173 PDLWNPGTHTLDLYTQLVAFVMAYSFQEPQDEEDEEEEEEEKPPNPPMMVPMADMLNHVS 232
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
L++ + S +V R G++VF +YG+ +N +LL YGF +N +D+ +
Sbjct: 233 NHNANLEFTQDSLKMVCV--RPIHKGQEVFNTYGQIANWQLLHMYGFTEPCSSNSNDTAD 290
Query: 334 LPLS 337
+P++
Sbjct: 291 IPVA 294
>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
reinhardtii]
Length = 656
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 112/245 (45%), Gaps = 21/245 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
RGL A + G+ +L VP L+++ ++ + G+VL + D + + E
Sbjct: 75 RGLRADTGVGPGDVVLHVPVELLMSYETAKES-DLGKVLTALPLGLGDDSVALIWTCVER 133
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-------EASQIRERAIERITNVIGT 213
++ + + S LP++ ++L + A++D L EA Q R E
Sbjct: 134 REPEAPAAAWWAS-LPQRFSTVLSASDADVDAALAGSPLAAEAGQARRHLAEAFAASQPA 192
Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNH 271
+ L YPD F F+ E++ W+ + +S V++ + D R LVP+ ++NH
Sbjct: 193 FESL----LKAYPDYFQPHWFSWESYLWAAELWYSYGIQVQVAAGDIRTCLVPYLGLMNH 248
Query: 272 SC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
V F D +S+G+ R G Q+F+SYG N +LLL YGF + NP
Sbjct: 249 HPLPHVVHFSKVDPASRGLRVRAFRPCARGRQLFLSYGPYPNSKLLLFYGFALPD--NPV 306
Query: 330 DSVEL 334
D VEL
Sbjct: 307 DEVEL 311
>gi|118353077|ref|XP_001009809.1| hypothetical protein TTHERM_00160790 [Tetrahymena thermophila]
gi|89291576|gb|EAR89564.1| hypothetical protein TTHERM_00160790 [Tetrahymena thermophila
SB210]
Length = 409
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/298 (24%), Positives = 126/298 (42%), Gaps = 44/298 (14%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDV-------GERGLVALKNIRKGEKLLFVPPSLV--- 125
L KWL+++G + +V++ G G A +KG+ L +P +
Sbjct: 14 VDNLFKWLNENG----ATGLDQVEIKPSPECNGSIGCFAKIEFKKGDILAKIPKKCILGL 69
Query: 126 --------ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
I+ ++++ E G+ L + + +L Y EK + W Y+ +LP
Sbjct: 70 GQAVKSPLISKLNEYAQQEYGKKLDKQVFSNEFMLWIYEGQCLIEEKDNHWKAYLESLPS 129
Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
+ + W L + + + A+E+ + + I SK+PDL E+ +
Sbjct: 130 ESPIVCSWDNNILQKISKTN--LGSAVEKELAIFQKQIEFLQSIQSKFPDLLHPEI--TK 185
Query: 238 TFKWSFGILFSR-LVRLPSMDGRVA-----------LVPWADMLNHSCEVETFLDYDKSS 285
+WS G SR V ++DG + +VP+ D+LNH + + +D+
Sbjct: 186 YIEWSKGNYLSRRFVGKLAIDGEGSGLEQYGGKMGCMVPFFDLLNHKNDHKVNFQHDEEY 245
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
V + + + GE+VF +Y K SN ELL +YGF N D LPL L DK
Sbjct: 246 --VWYVCEYDIKAGEEVFNNYCKASNEELLFTYGFAVE--NNQLDV--LPLKLMACDK 297
>gi|313234004|emb|CBY19580.1| unnamed protein product [Oikopleura dioica]
Length = 791
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 106/221 (47%), Gaps = 42/221 (19%)
Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL--EASQI 199
++ +VP + +L +E +K+SR + +IS+LPR+ Y+L + A+L + + ++ Q+
Sbjct: 137 EKVAVPAEDVFIHFLCTEEKQKKASRVAGWISSLPRESYNLPFNWPADLQKCVCDDSLQL 196
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM--- 256
+A + +N L R F Y + + F+ +T W++ + +RL LP
Sbjct: 197 AIKA------QVADFNTLIAR-FEHYSTVIDDVKFSRDTLAWAYSVWSARLFELPQYPES 249
Query: 257 ----------------DGR-VALVPWADMLNHSCEVETFLD------YDKSS-----QGV 288
D R A +P D+LNHS FLD +DK+ +
Sbjct: 250 SSNSVNLPKWLDNDPNDLRSFAFLPIFDLLNHSSTPNVFLDIREKHVWDKTKKIHPEEKF 309
Query: 289 VFTTDRQYQ--PGEQVFISYGKKSNGELLLSYGFVPREGTN 327
V + + + + GE++ +SYG S+ +L L YGFV ++G N
Sbjct: 310 VLSLEAKTKIAKGEELRMSYGNLSDRDLFLKYGFVLKKGEN 350
>gi|255714603|ref|XP_002553583.1| KLTH0E02156p [Lachancea thermotolerans]
gi|238934965|emb|CAR23146.1| KLTH0E02156p [Lachancea thermotolerans CBS 6340]
Length = 498
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 141/334 (42%), Gaps = 64/334 (19%)
Query: 58 TVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGE 115
T T N + W + L W+SD K+ + G++G +VA + I KGE
Sbjct: 8 TKTNNFLRWASKDAKL--------WISD------KVKLLGTRDGDQGRYMVATQEIIKGE 53
Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCS--VPDWPLLATYLISE-ASFEKSSRWSNYI 172
KL VP + + +V K+ + V W L ++ E ++S+W Y
Sbjct: 54 KLFEVPRGSALNVATLSLSMRDKQVYKKLTTEVGHWEGLVIAILYEFKVMNQNSKWWPYF 113
Query: 173 SALPR--QPYSLLYWTRAELDRYLEASQIRER-----AIERITNVIGTYNDLRLRIFSKY 225
LP + SL+YWT AEL+ YL+ S + ER A E V+ DL++ ++
Sbjct: 114 EVLPEPARLNSLMYWTGAELE-YLKPSGVYERVDREGAEEMYARVMKCAEDLKI---TEL 169
Query: 226 PDLFPEEVFNMETFKWSFGILFSR-------------------LVRLPSMDGRV-ALVPW 265
++ EE ++ + ++ R DG ++VP
Sbjct: 170 TNITWEEFMHVASIIMAYSFDMERPDYEDSDEEVEESDEEEEEEKNTVWNDGYFKSMVPM 229
Query: 266 ADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
ADMLN H C L Y S + ++ GEQ++ +YG+ N ELL YG+V
Sbjct: 230 ADMLNSDTHKCNAN--LTY--SPEALIMVAVADIPSGEQIYNNYGEYPNSELLRRYGYVE 285
Query: 323 REGTNPSDSVELPLS--LKKSDKCYKEKLEALRK 354
G+ D E+PL LK + C LEA R+
Sbjct: 286 WSGSK-FDCGEMPLETLLKAIEVC----LEAPRR 314
>gi|303288325|ref|XP_003063451.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455283|gb|EEH52587.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 478
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 90/193 (46%), Gaps = 24/193 (12%)
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-RERA-IERITNVIGTY 214
IS A + S + ++ALP + W+ +++R L + R RA + RI V
Sbjct: 113 ISRARYVLSLPGAELVNALP------IGWSDEDIERRLRGDLLSRTRATLARIHAVADAI 166
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--VRLP----SMDGRV-ALVPWAD 267
R + F D F VF+++ KW+ + +SR +R P + G V ALVP D
Sbjct: 167 ARSRSKAFGDADDAFF--VFSVDDLKWAHAVFWSRAMTLRFPRKGFTGGGDVDALVPLVD 224
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQ---YQPGEQVFISYGKKSNGELLLSYGFV-PR 323
M NH L+ + G F R + G++VFI+YG K N ELL +GFV P
Sbjct: 225 MCNHRAGSTATLEIVEDDAGDAFYELRAGVATKAGDEVFINYGAKGNEELLRCHGFVIP- 283
Query: 324 EGTNPSDSVELPL 336
NP D + + L
Sbjct: 284 --NNPCDVLAVDL 294
>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
Length = 504
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 119/287 (41%), Gaps = 63/287 (21%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP-LLATYLIS 158
G RGL A +++ +GE +L VP S ++T+ S + +K+ + P +L L++
Sbjct: 46 AGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLA 105
Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERI----TNVIG 212
E S KSS W Y+ LPR +L +++ E + +A + ERAI + I
Sbjct: 106 EMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIP 165
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
+L+L+ ++ N + W+ + SR + +P D L P D N++
Sbjct: 166 LMEELKLK----------PQLQNFRAWLWASSTVSSRTMHIP-WDDAGCLCPVGDFYNYA 214
Query: 273 CEVE--------------------TFLDYDKSSQ-----------------------GVV 289
E +F + D +S
Sbjct: 215 APGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAYC 274
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
F + Y+ GEQV +SYG +N ELL YGF+ E NP+D +PL
Sbjct: 275 FYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDE--NPNDKAFIPL 319
>gi|71995786|ref|NP_497604.2| Protein SET-27 [Caenorhabditis elegans]
gi|373220599|emb|CCD73865.1| Protein SET-27 [Caenorhabditis elegans]
Length = 502
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/384 (24%), Positives = 151/384 (39%), Gaps = 69/384 (17%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
++ T W G+ + I L A I K + VP +IT D
Sbjct: 72 DSIKTFLAWADGVGIARNNVTIGSTKTAGLSLQATGPIPKSHIVARVPRHAMITLD---- 127
Query: 134 CPEAGEVLKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
+ +LK+ D P+ LA +L + + S+W +YIS LP + L++
Sbjct: 128 LAKKSSLLKKAFEKD-PIVGGMDNVGLALFLACQWIQNEKSKWKSYISILPTTFPTPLFY 186
Query: 186 TRAELDRYLEASQIRERAI-----------------------ERITNVIGTYNDLRLRIF 222
+ +L + L+ S I E AI E N + IF
Sbjct: 187 SEEQLLQ-LKPSPIFEEAILFYRTISRQFCYFLLAIAKNKIYEAAQRRKDARNAMETPIF 245
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-----DGRV----ALVPWADMLNH-- 271
P F F + + W+ G++ +R+ +PS DG AL+P DM NH
Sbjct: 246 YNVP--FNVANFTPKLYFWAVGVVTTRVNMVPSENQVGEDGNPVIIPALIPVLDMANHEN 303
Query: 272 ------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
+ +E + Y + V T+ + G +V I YG +S GE LL GFVP
Sbjct: 304 VLTDVLTEPIEDLVCYSPEEECAVITSHCDVKAGNEVTIFYGCRSKGEHLLHNGFVPIYH 363
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKY---GLSASECFPIQITGW-----PLELMAY 377
D ++L + + K+DK K + ++K+ A F + + + PL+L+ +
Sbjct: 364 -GKFDVLKLKIGIPKTDKTLDAKKKLIQKFVKKVYCAGNIFHVDLYNYHEQPFPLDLLMF 422
Query: 378 AYLVVSPPSMKGKFEEMAAAASNK 401
A + VS EE +A N+
Sbjct: 423 AAIFVSTTPT----EEAVSAPENR 442
>gi|18041979|gb|AAL57769.1|AF388528_1 hypothetical protein RDA279 [Rattus norvegicus]
Length = 328
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 98/233 (42%), Gaps = 28/233 (12%)
Query: 155 YLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
+L+SE S W +Y+ LP+ P L L L A +RA R+ ++
Sbjct: 1 FLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPGPLRAKAEEQRA--RVQDLF 58
Query: 212 GTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV-------- 260
+ D FS LF E V F+ F W++ + +R V L S
Sbjct: 59 ASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTC 114
Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
AL P+ D+LNHS V+ +++ ++ T + + ++ FI YG N LLL YGF
Sbjct: 115 ALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEAFICYGPHDNQRLLLEYGF 174
Query: 321 VPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
V + V + LK +DK +KL L +G + + F GW
Sbjct: 175 VAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTGNLTF-----GW 222
>gi|302805649|ref|XP_002984575.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
gi|300147557|gb|EFJ14220.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
Length = 530
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 133/362 (36%), Gaps = 77/362 (21%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPP---------- 122
E +W + G+ + AI++V E GL + +G+ L F P
Sbjct: 3 ERLERFSRWSQEHGIQFRGCAIKRVSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLV 62
Query: 123 ----SLVITADSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISAL 175
L +T + P G V ++ + D L+ +LI E + ++S W+ Y+ L
Sbjct: 63 VTPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEML 122
Query: 176 PRQPYSLLYWTRAEL-----DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY---PD 227
P + L++ EL EA++ ++R + + IGT +L + S Y PD
Sbjct: 123 PSGFGTPLWFEDEELMELDGTTLFEATKAQQRCLPSV--YIGTLC-CQLFLVSLYLFRPD 179
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRV------------- 260
+ + F W+ I ++R + +P DG
Sbjct: 180 ---DRELEFQEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISC 236
Query: 261 -----------ALVPWADMLNHSCEVETFLDYDKSS-------QGVVFTTDRQYQPGEQV 302
LVP D NH+ + D S + D + PG +V
Sbjct: 237 SSKDVSTIWIEGLVPGIDFCNHTRRASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEV 296
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
I+YG K N ELL YGFV + +N V P D KL+ LR+ LS
Sbjct: 297 LINYGDKGNEELLFLYGFVEEDNSNDYVMVHFPKMFLDEDNTMDFKLQLLRELDLSLKWL 356
Query: 363 FP 364
P
Sbjct: 357 LP 358
>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
Length = 447
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/343 (22%), Positives = 132/343 (38%), Gaps = 53/343 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+ AL+++ GE + +P + +T + + + L ++ E S
Sbjct: 9 GVRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKG 64
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
K S+W Y+ LP Q W+ E+D L +++ + E + + + +
Sbjct: 65 KGSKWYRYLKTLPCQESVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEENIAPLTK 124
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF----- 278
+ P FP + F E++ + ++ SR + + G +VP AD+ NH + E
Sbjct: 125 EDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHG-YGMVPLADLFNHKTDAEDVHFMLN 183
Query: 279 --------------------------LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
+ DKS +V D G ++F +YG+ N
Sbjct: 184 ASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKD--VAAGSEIFNTYGQLGNA 241
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSL-------KKSDKCYKEKLEALRKYGLS-----AS 360
LL YGF E NP D V L + + K +++ RK G S S
Sbjct: 242 ALLHRYGFT--EPNNPHDIVNLDMDCLLEVLLSRFQKKRVRKRGRVWRKAGFSGCESQGS 299
Query: 361 ECFPIQITGWP-LELMAYAYLVVSPPSMKGKFEEMAAAASNKM 402
E F I G P +EL+ +++ SP E+ AA ++
Sbjct: 300 EYFEISAAGKPQIELLLLLFVIQSPARDCEALEDAAAKVKGRV 342
>gi|238882716|gb|EEQ46354.1| hypothetical protein CAWG_04701 [Candida albicans WO-1]
Length = 549
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 136/327 (41%), Gaps = 73/327 (22%)
Query: 74 ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
E + Q WL + + K+AI D + RG++AL++I E + +P S+V+ D+
Sbjct: 6 EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65
Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
VLK+ V D W L L E F+ S W Y++ LP Q
Sbjct: 66 NSLIKLYPSVLKKLRVLDQWIGLIIVLGFEMKFKFNPNNNNDNNNKSFWYEYLNILPDQF 125
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
L+YW EL+ +L+ S I +R IG N+L + +++ + +++ +E F
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDR--------IGKENNLNM--YNQIISIINQDLSGVEEF 174
Query: 240 KWS------------FGILFSRLVRLP-----SMDGR----------------------- 259
K S + +S V +P + +G
Sbjct: 175 KSSPLTFEEYNKVATIIMSYSFDVEVPKSKKVTKNGTNEKGNDEDKEDDGDDDDDDEEED 234
Query: 260 ----VALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
++VP+AD LN + L Y S+ ++ T + GEQV+ +Y N EL
Sbjct: 235 NEYYKSMVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSEL 292
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
L YG+V G+ D E+PLS KS
Sbjct: 293 LRRYGYVELNGS-KYDFGEIPLSTIKS 318
>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
NZE10]
Length = 445
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 131/303 (43%), Gaps = 52/303 (17%)
Query: 71 DSLENASTLQKWLSDSGLP-PQKMAIQKV---DVGERGLVALKNIRKGEKLLFVPPSLVI 126
D E + WL D+G K+ + + + G RG+VA++++ + E+L VP S ++
Sbjct: 6 DFQERSRAFVNWLRDNGASISAKITLDDLRQQNAG-RGIVAVEDLDEDEELFSVPRSTML 64
Query: 127 TADSKWSCPEAGE-VLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
T ++ + GE VL++ P W L + E SRW Y LP +L++W
Sbjct: 65 TTETSRN----GEAVLQEVDDP-WLSLIVVMALEYLDGSQSRWKPYFDVLPVSFDNLMFW 119
Query: 186 TRAELDRYLEASQI-------------RER---AIERITNVIGTYNDLRLRIFSKYPDLF 229
+ EL R+LE S + RE+ IERI+ N+ LR+ +
Sbjct: 120 SDREL-RHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTI 178
Query: 230 PEEVFNMETF---------KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---VET 277
F++ET +W LP +VP ADMLN + +
Sbjct: 179 MAYGFDLETSSDQAKNDGEEWEEDSDAGET--LPK-----GMVPLADMLNADADRNNAKL 231
Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
F + DK VV T + + GE+++ +G +LL YG++ + D VE+P
Sbjct: 232 FYEDDK----VVMKTIKPVKAGEELYNDFGSLPRADLLRRYGYLT-DNYAQYDVVEIPAD 286
Query: 338 LKK 340
L K
Sbjct: 287 LIK 289
>gi|400602527|gb|EJP70129.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 493
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 17/105 (16%)
Query: 232 EVFNMETFKWSFGILFSRLVR------------LP---SMDGRVALVPWADMLNHSCEVE 276
E F E ++W+F I SR R LP ++D L+P D+ NH V
Sbjct: 193 EQFRPELYRWAFAIFSSRSFRPSLVLSDEQARLLPPGVAIDDFSVLLPLFDIGNHDMTVP 252
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ + + G T R +QPGEQVF +YG K+N ELLL YGF+
Sbjct: 253 --VRWQRDGDGCALRTGRAHQPGEQVFNNYGLKTNAELLLGYGFM 295
>gi|428177025|gb|EKX45907.1| hypothetical protein GUITHDRAFT_138732 [Guillardia theta CCMP2712]
Length = 505
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 97/201 (48%), Gaps = 12/201 (5%)
Query: 166 SRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
S + YI LP Y + + E + L + +I + ++ + + ++ ++ + +
Sbjct: 184 SEFKPYIDLLPEYHDYEMTWLWSVEEQQDLLSGKILKDSMSITSQIEREHHTIK-EVLGR 242
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLP--------SMDGRVALVPWADMLNHSCEVE 276
+ D F++E++KW+ + SR L + + LVP DM+NHS +
Sbjct: 243 FQDCAEFGEFSLESYKWAQATIMSRAFDLDEGQETARRQGEQNLLLVPLCDMVNHSPDAS 302
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+D D + +F ++ Y+ G++V I+YG SN +LLLS+GFV EG + E+ L
Sbjct: 303 FSIDCDAAGNVNLFASE-NYKAGQEVHINYGSSSNEQLLLSFGFV-LEGGWQAQETEITL 360
Query: 337 SLKKSDKCYKEKLEALRKYGL 357
+ + + ++ K L GL
Sbjct: 361 EVPQDVEGFEIKRNLLFNGGL 381
>gi|313242187|emb|CBY34355.1| unnamed protein product [Oikopleura dioica]
Length = 563
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 106/221 (47%), Gaps = 42/221 (19%)
Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL--EASQI 199
++ +VP + +L +E +K+SR + +IS+LPR+ Y+L + A+L + + ++ Q+
Sbjct: 32 EKVAVPAEDVFIHFLCTEEKQKKASRVAGWISSLPRESYNLPFNWPADLQKCVCDDSLQL 91
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM--- 256
+A + +N L R F Y + + F+ +T W++ + +RL LP
Sbjct: 92 AIKA------QVADFNTLIAR-FEHYSTVIDDVKFSRDTLAWAYSVWSARLFELPQYPES 144
Query: 257 ----------------DGR-VALVPWADMLNHSCEVETFLD------YDKSS-----QGV 288
D R A +P D+LNHS FLD +DK+ +
Sbjct: 145 SSNSVNLPKWLDNDPNDLRSFAFLPIFDLLNHSSTPNVFLDIREKHVWDKTKKIHPEEKF 204
Query: 289 VFTTDRQYQ--PGEQVFISYGKKSNGELLLSYGFVPREGTN 327
V + + + + GE++ +SYG S+ +L L YGFV ++G N
Sbjct: 205 VLSLEAKTKIAKGEELRMSYGNLSDRDLFLKYGFVLKKGEN 245
>gi|255089515|ref|XP_002506679.1| predicted protein [Micromonas sp. RCC299]
gi|226521952|gb|ACO67937.1| predicted protein [Micromonas sp. RCC299]
Length = 584
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/340 (23%), Positives = 130/340 (38%), Gaps = 87/340 (25%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD 129
+D + + +W G+ + + V G RG+VA ++I + +L VP ++++A
Sbjct: 9 LDGPDADADFWRWARARGVVAVRCEARDVAEGWRGIVATEDIERDAVVLRVPGDILMSAR 68
Query: 130 SKWSCPEAGEVLKQC--------SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
S E E L+ S+ LA +L+ EAS + SRW YIS LPR
Sbjct: 69 SM----ERDEQLRDALRTHGETRSMTPADKLAVHLLLEASRGRGSRWHEYISRLPRAYNL 124
Query: 182 LLYWTRAE---------LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
L WTR E + A ++ ER +V+ L L + +++
Sbjct: 125 LCCWTRRERAMLQDPAAIAVARRARDATRQSWERARDVLAA---LGLTLANRWG------ 175
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE------------------ 274
+++ ++W+ + SR V +P D AL P D+ N++
Sbjct: 176 --SIDAWRWARCTVSSRTVYVP-YDAAGALCPVGDLFNYAPPPPPHRHAIVGTPLEGGDE 232
Query: 275 ----------------------------VETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
V +D++S+ VF R Y GEQ+ + Y
Sbjct: 233 GKDDDEGKDDDKRGEDDDKGGEEGKRWTVSGDGAWDEASREYVFRARRPYVAGEQIMLCY 292
Query: 307 GKKSNGELLLSYGFV--------PREGTNPSDSVELPLSL 338
G+ +N LL YGF+ E NP D+ L
Sbjct: 293 GRHTNLGLLEHYGFLLDEKESETGDEPGNPHDAAAFALGF 332
>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
atroviride IMI 206040]
Length = 956
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 108/265 (40%), Gaps = 38/265 (14%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPE-------AGEVLKQCSVPDWPLL 152
RG+VAL++I L VP S ++ T++ + P+ A EV + W L
Sbjct: 529 RGIVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTL 588
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
LI E S W Y+ LP + ++W+ AE+D L+AS R + + TN
Sbjct: 589 IIVLIYEYFKGDQSSWKPYLDVLPASFETPMFWSDAEVDE-LQASATRSKIGK--TNAEE 645
Query: 213 TYNDLRLRIFSKYPDLF-------PEEVFN----METFKWSFGILFSRLVRLP------- 254
++ L + PD+F EE+ M + S+ F
Sbjct: 646 MFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFDFQNEDEEEEDDSEEW 705
Query: 255 ----SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ +VP AD+LN E ++Y + T R + GE++ YG
Sbjct: 706 VEDREAKSTMGMVPMADILNADAEYNAHVNY--GDDALTVATLRTIKAGEEILNYYGPHP 763
Query: 311 NGELLLSYGFVPREGTNPSDSVELP 335
N ELL YG+V + + D VELP
Sbjct: 764 NSELLRRYGYVTPKHSR-YDVVELP 787
>gi|291390222|ref|XP_002711632.1| PREDICTED: SET domain containing 6 [Oryctolagus cuniculus]
Length = 817
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/292 (23%), Positives = 129/292 (44%), Gaps = 29/292 (9%)
Query: 81 KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
KW GL P+ ++ V G+VA +++++GE L VP + ++ S+ +C G
Sbjct: 394 KWGLQVGLELSPKVAVSRQGTVAGYGMVARESVQRGELLFAVPRAAIL---SQHTC-SIG 449
Query: 139 EVLKQ-----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELD 191
++L++ S W + + +S WS Y + P + ++W E
Sbjct: 450 DLLERERGALQSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERR 508
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
R L+ + + E + + ++ Y + L +PDLF +V ++E + ++ +
Sbjct: 509 RLLQGTGVPEAVEKDLASIRSEYYSIVLPFMEAHPDLFSPKVHSLELYHQLVALVMAYSF 568
Query: 252 RLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQ 301
+ P + +VP AD+LNH L+Y +V T QP G +
Sbjct: 569 QEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSADYLRMVAT-----QPIPKGHE 623
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+F +YG+ +N +L+ YGFV N D+ ++ + + K K+EA R
Sbjct: 624 IFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQKAKVEAER 675
>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 247
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 78/167 (46%), Gaps = 13/167 (7%)
Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
++LV+L R AL P D++NH +++ + Y+ TT R + GEQV ISY
Sbjct: 1 MTKLVQLK----RYALTPVVDLINHQSGIDSDVSYNYFYGYFAVTTQRGWTAGEQVLISY 56
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK-SDKCYKEKLEALRKYGLSASECFPI 365
G +SN LL YGFV ++ NP+D + + K SD K+ + LR+ G +
Sbjct: 57 GPRSNDHLLRRYGFVEQD--NPNDVYRITGLIDKLSDVLGKDSVRVLRESG------GKL 108
Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
TG E + V + G+ EE ++ KD + PE
Sbjct: 109 GTTGDNAEGRPVESVTVGRSGLLGEKEEGRVMPVFRLAVVKDDQLPE 155
>gi|85099007|ref|XP_960703.1| hypothetical protein NCU06658 [Neurospora crassa OR74A]
gi|28922220|gb|EAA31467.1| predicted protein [Neurospora crassa OR74A]
gi|28950107|emb|CAD70887.1| conserved hypothetical protein [Neurospora crassa]
Length = 469
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 26/261 (9%)
Query: 111 IRKGE-----KLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEASFE 163
+R GE L+ VP SLV+ + + + KQ +V +LA+ +A
Sbjct: 38 VRDGELQPEVPLMTVPNSLVLNVQAVDEYAKEDKNFKQLLGAVGHHLVLASK-THQAPVG 96
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIF 222
S+ W+ YI LP+ WT E R L E A+ ++T + ++ +R
Sbjct: 97 VSNPWTEYIKFLPKTVLVPTLWTEDE--RLLLRGTSLESAVNAKMTAITAEFDAVR-EAA 153
Query: 223 SKYPD----LFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNHSCE 274
S P L+P E N S+ +L SR++ LP ++VP DM+NHS
Sbjct: 154 SSLPSWNDVLWPYEDGNSSASLRSWILLDALYRSRVLELPK--SGESMVPCIDMINHSTR 211
Query: 275 VETFLDYDKSSQGVVF-TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSV 332
+ D + + V+ D PGE+V ISYG K E+L SYGF+ E T +S+
Sbjct: 212 ASAYYDENAKDEVVLLPRPDSSISPGEEVTISYGDAKPAAEMLFSYGFIDPEAT--VESL 269
Query: 333 ELPLSLKKSDKCYKEKLEALR 353
LPL + D K KL A +
Sbjct: 270 VLPLEPFEDDPLAKAKLFAFK 290
>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
Length = 370
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 20/151 (13%)
Query: 222 FSKYPDLFPEEVFNMETFKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS--- 272
++ + D FP+ + T+ W SF ++ P D R+AL+P AD+ NHS
Sbjct: 130 WNAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYPWED-RLALIPVADLFNHSDDG 188
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS--- 329
C+V S+ G DR+Y+ GE++FISY SN +LL YGF+P E +
Sbjct: 189 CKVYY------SADGYHIVADREYKKGEELFISYSSHSNDYILLEYGFIPDESLDDDVYI 242
Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
D P L + K EK + L +Y L +S
Sbjct: 243 DDAVFP-KLSEGQKEELEKRDLLGEYPLESS 272
>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
Length = 619
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 89/197 (45%), Gaps = 22/197 (11%)
Query: 155 YLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAEL------DRYLEASQIRERAIERI 207
+LI E EK S Y++ LPR+ + LY+ E+ + Y IR+
Sbjct: 112 FLIYELHVEKEKSTHFPYLNLLPREFTTALYFDEDEMAALRSTNLYKSVQSIRQ------ 165
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--VRLPSMDGRVA---- 261
N+ Y + +KYP F +VF+ E F W+F ++SR+ + P+ +G
Sbjct: 166 -NLKQIYETKVEYLMNKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEYPAENGEGVEIVP 224
Query: 262 -LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
L+P D+LNH + + S + T + G+ V +YG KSN LLSYGF
Sbjct: 225 TLLPTVDILNHKFNAKITY-FTGSDRRFYLKTRESLKSGDYVCNNYGAKSNDSFLLSYGF 283
Query: 321 VPREGTNPSDSVELPLS 337
V + + V+ +S
Sbjct: 284 VIPNNSEDTLYVQFGIS 300
>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
Length = 457
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 159/402 (39%), Gaps = 75/402 (18%)
Query: 79 LQKWLSDSGLPPQKMAIQ---KVDVG------ERGLVALKNIRKGEKLLFVPPSLVITAD 129
+ WL +G + AI+ ++D G G+ A ++I GE + +P S T +
Sbjct: 4 FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63
Query: 130 SKWSCPEAGEVLK---QCSVPDW-PLLATYLISEASFEKSSRWSNYISALPR-QPYSLLY 184
+ C A V K +W L T L+ E + SSRW+ Y+ +LP +P ++
Sbjct: 64 N---CAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSEPDVVMM 120
Query: 185 WTR-AELDRYLEASQIR-----ERAIER------ITNVIGTYNDLRLRIFSKYPDLFPEE 232
W+ E RYL + I ERA R + V+ T LR +K
Sbjct: 121 WSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDT-----LRGAAKD------- 168
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
+ F + + SR + G LVP AD+ NH D V
Sbjct: 169 -VGFDDFLAARSVASSRAFTVNPRVG-AGLVPIADLFNHRTGGHHVYLSDARGTAAVSER 226
Query: 293 D-------------RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
D + + GE+VF +YGK N +LL SYGF + NP+D V + +
Sbjct: 227 DEGSDDDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLD--NPADKVTIGVPAL 284
Query: 340 K--------SDKCYKEKLEALRKYGLSASE-CFPIQITGWPLELMAYAYLVVSPP----- 385
+ S +L GL E F +++ P +++ V++
Sbjct: 285 RAAAALRGVSGAQIATRLAWCDAIGLCDDETTFELRLGADPPDVLLLVSWVLASTDDRFD 344
Query: 386 ---SMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
++K +E + AA+ K T + + DE AL IL++
Sbjct: 345 AVRAVKTGGDETSIAAALKETIESGARGGLKDESALGIILET 386
>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
Length = 495
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/357 (22%), Positives = 151/357 (42%), Gaps = 46/357 (12%)
Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEK 164
A K + E L +P L+++ ++ S + + P LA ++++E ++
Sbjct: 108 ATKQVSTDELLFSIPQKLMLSNETANSSTIGHFINNDPILSQMPNVALAFHVLNEL-YDP 166
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
S W Y+ ALP +++Y+T E+ L+ S + A+ N+ R +S
Sbjct: 167 KSFWKPYLDALPSSYDTVMYFTPDEITE-LKGSPAFDDALRMCRNIA--------RQYSY 217
Query: 225 YPDLFPEEV----------FNMETFKWSFGILFSRLVRLPSMDGRV-----------ALV 263
+ L + V F ++W+ + +R +PS + AL+
Sbjct: 218 FYSLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALI 277
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
P D NH + + ++ S+ V R + PGEQVFI YG ++ E + GFV
Sbjct: 278 PLWDFCNHQ-DGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFV-- 334
Query: 324 EGTNPSDSVELPLSLKKSDKCYKE------KLEALRKYGLSASECFPIQITGWPLE--LM 375
+ N D++ L + L KSD + KL L +S F ++ P++ L+
Sbjct: 335 DINNAHDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAGPQPVDGKLL 394
Query: 376 AYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKY 432
A+ L ++ + + ASN M + I+ E+D+++ F+ C+ + Y
Sbjct: 395 AFLRLFCMTKDSLDRWLQ-SDNASNLMHEECGIET-EVDDKSWSFLKARCQLLLQLY 449
>gi|336472467|gb|EGO60627.1| hypothetical protein NEUTE1DRAFT_75928 [Neurospora tetrasperma FGSC
2508]
gi|350294307|gb|EGZ75392.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 469
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/290 (27%), Positives = 128/290 (44%), Gaps = 23/290 (7%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L W +G+ + + ++ G+V ++ L+ VP SLV+ + +
Sbjct: 10 ALPAWALLNGIAFPHVKVANIEGKGFGVVRDGELKPEVPLMTVPNSLVLNVQTVDEYAKE 69
Query: 138 GEVLKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
+ KQ +V +LA+ +A S+ W+ YI LP+ WT E R L
Sbjct: 70 DKNFKQLLGAVGHHLVLASK-THQAPVGVSNPWTEYIKFLPKTVLVPTLWTEDE--RLLL 126
Query: 196 ASQIRERAIE-RITNVIGTYNDLR-----LRIFSKYPDLFPEEVFNMETF--KWSF--GI 245
E A+ ++T + ++ +R L I++ L+P E N +W +
Sbjct: 127 RGTSLESAVNAKMTAITAEFDAVREAASSLPIWNDI--LWPYEDGNSSASLRRWILLDAL 184
Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF-TTDRQYQPGEQVFI 304
SR++ LP ++VP DM+NHS + D + + V+ D PGE+V I
Sbjct: 185 YRSRVLELPK--SGESMVPCIDMINHSTRASAYYDENAKDEVVLLPRPDSSISPGEEVTI 242
Query: 305 SYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
SYG K E+L SYGF+ E T +S+ LPL + D K KL A +
Sbjct: 243 SYGDAKPAAEMLFSYGFIDPEAT--VESLVLPLEPFEDDPLAKAKLFAFK 290
>gi|428177623|gb|EKX46502.1| hypothetical protein GUITHDRAFT_138238 [Guillardia theta CCMP2712]
Length = 486
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 124/272 (45%), Gaps = 34/272 (12%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E L +W + G+ +K+++Q+ + G + A +++ + E + VP S+ I +S W
Sbjct: 168 ERREKLLEWAREHGIGFEKISLQEDEFGGTAMFASEDLEEDEVIGVVPFSISIGRESLWR 227
Query: 134 CPEAGEVLKQC-----SVPDWPLLATYLISEASFEKSSRWSNYISALPR----------Q 178
GE+L Q + PD L++ + SS + Y+ LP
Sbjct: 228 S-RHGELLGQLYEDERTPPD--LISCIFLLLERRSSSSFFRPYLDMLPTPSGVSNVFHWD 284
Query: 179 PYSLLYWTRAELDRYLEASQIR--ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
++L ++ E R L A+ + ER +R V+ + + R F K+ +E+F+
Sbjct: 285 AHALSAFSPHEEARSLAAAHLSLFERTYQRYFTVVNKNEEFQ-RQFGKH-----QEIFSR 338
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH--SCEVETFLDYD-KSSQGVVF 290
+ W++ +L SR P + R + ++P AD+ NH S ++ + ++ QG VF
Sbjct: 339 DQVLWAYSLLISRAWEHPDYNYRTSFHRMLPIADIANHKMSPTGSGWMSVEFRNQQGAVF 398
Query: 291 TTDR--QYQPGEQVFISYGKKSNGELLLSYGF 320
R + G+++ SY N LL+ YGF
Sbjct: 399 LVTRGGAIRRGQEIVTSYSNAGNALLLVQYGF 430
>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
C-169]
Length = 685
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 11/224 (4%)
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
G+ L + PLL Y + + +K S ++ + ++LP + L T E+ LE +
Sbjct: 18 GQALAALGIEKDPLLLLYTMID-RHDKDSDFAPFWASLPEVFMTGLSATEEEVS-MLEGT 75
Query: 198 QIRERAIERITNVIGTYNDLR---LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
+E ++ Y + + + YPD ++ + F W+ + +S + +
Sbjct: 76 PAHTTFVEARQHIREQYRAAQPVLQALTAAYPDDITPDLVTEDKFIWACELWYSYAIEVE 135
Query: 255 SMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+DG R LVP A +LNHS + + D ++ + R GEQ F+SYG
Sbjct: 136 YVDGAVRQTLVPIAHLLNHSPWPHIVRYGRLDAATDSLRLRAFRHCAAGEQCFLSYGPLP 195
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
N +LLL YGF + NP D+V + +K++ + LEA K
Sbjct: 196 NLKLLLFYGFALPD--NPHDTVPITFEAEKNEGDVTDMLEACLK 237
>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
Length = 495
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 120/312 (38%), Gaps = 80/312 (25%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA--GEVLKQCSVPDWPLLATYLISEA 160
RG VA+ +I E+L +P SLV+T + S P + E+ + + WP L +I E
Sbjct: 48 RGAVAIADIASDEELFAIPRSLVLTTATS-SIPRSVLKELEDKGATGAWPPLIVTIIYEY 106
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN 215
+SS W Y LP +L++W AEL L+AS + R +A E N I
Sbjct: 107 LRGESSPWHPYFKILPTTFNTLMFWNDAELAE-LQASAVVDKIGRRQAEEEWQNTI---- 161
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------------- 260
+ + +PDLFP G ++L+ L M G +
Sbjct: 162 ---IPTMADHPDLFP------------VGGSSAKLIELAHMAGSLIMAYAFDIDRDDMED 206
Query: 261 --------------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+VP+ADMLN + + + ++ +
Sbjct: 207 DNDNDKDGADSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDY-LIMKATK 265
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL---------KKSDKCY 345
GEQ+F YG +LL YG+V + D VE L K D+ +
Sbjct: 266 PISAGEQIFNDYGPLPRSDLLRMYGYV-TDNYAQYDVVEFSHDLLLEVAGKHSKSKDQVW 324
Query: 346 KEKLEALRKYGL 357
+E+ + L + G+
Sbjct: 325 REREQQLDELGV 336
>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
Length = 1118
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 115/281 (40%), Gaps = 51/281 (18%)
Query: 94 AIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE----AGEVLK 142
AI+ VD+ RG+VAL++I L +P +I +D+ + PE G+ +
Sbjct: 679 AIEIVDLRSRDAGRGIVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDE 738
Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
Q ++ W L ++ E S+W YI LP + ++W+ EL YL+AS
Sbjct: 739 QQALDSWSALILIMMYEFFLGHQSKWKPYIDVLPLTFDTPMFWSEEELS-YLQASA---- 793
Query: 203 AIERITNVIGTYND---LRLR----------IFSKYPDLFPEEVFNME------TFKWSF 243
N IG + R R +F+ D E++ + ++F
Sbjct: 794 ----TVNKIGKADAEEMFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAF 849
Query: 244 GILFSRLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+ DG V +V AD+LN E +++ + + T+ R
Sbjct: 850 DLENEEAENDDESDGWVEDREGKSMMGMVAMADILNADAEFNAHVNH--GDEELTVTSIR 907
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
+ GE++ YG N ELL YG++ E + D VE+P
Sbjct: 908 DIKAGEEILNYYGPHPNSELLRRYGYIT-EKHSRYDVVEIP 947
>gi|311257193|ref|XP_003127001.1| PREDICTED: N-lysine methyltransferase SETD6 [Sus scrofa]
gi|335289289|ref|XP_003355838.1| PREDICTED: N-lysine methyltransferase SETD6-like [Sus scrofa]
Length = 448
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/278 (23%), Positives = 122/278 (43%), Gaps = 18/278 (6%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + V+ S+ +C +G + ++
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQPGELLFVVPRAAVL---SQHTCSISGLLERERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +S WS Y + P + ++W E R L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEDEK 211
Query: 262 ------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+VP AD+LNH L+Y + +V T + G ++F +YG+ +N +L+
Sbjct: 212 EPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT--QSIPKGHEIFNTYGQMANWQLI 269
Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
YGFV N D+ ++ + + K+EA R
Sbjct: 270 HMYGFVEPYPDNKDDTADIQMVTVREAALQGTKIEAER 307
>gi|344301751|gb|EGW32056.1| hypothetical protein SPAPADRAFT_138237 [Spathaspora passalidarum
NRRL Y-27907]
Length = 483
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 116/260 (44%), Gaps = 26/260 (10%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSK---WSCPEAGEVLKQCSVPDWPLLATYLISE 159
RG++AL++I E L +P +++I + + PE + L +W L L+ E
Sbjct: 38 RGIIALEDIEIDETLFTIPRTVLINSLNNSLVQDQPELADKLAGLE-NEWDALILVLLYE 96
Query: 160 ASFEKSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRER--------AIERI 207
K S+W++Y + LP + + LL+W +L L+ S + +R ER+
Sbjct: 97 YK-RKESKWTDYFNVLPDLDTFEFHELLFWNDEQLSD-LKPSLVLDRIGKDKTVEMYERL 154
Query: 208 TNVIGTYN-----DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
++ +N + + F+K + F++ ++ ++
Sbjct: 155 VAIVNQWNLEELKGMTMEEFTKIATIIMSYSFDVAQGTEDEDEDEDDEEEEEEVEYIKSM 214
Query: 263 VPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
VP AD LN + L Y+K+ Q +V T + + GEQV+ +Y N E+L YG+V
Sbjct: 215 VPLADTLNADTHLNNAILTYNKN-QDLVMTCIKPIKKGEQVYNTYSDHPNCEILRRYGYV 273
Query: 322 PREGTNPSDSVELPLSLKKS 341
G+ D E+PL+L S
Sbjct: 274 ETTGS-KYDFGEIPLTLITS 292
>gi|390477743|ref|XP_003735352.1| PREDICTED: N-lysine methyltransferase SETD6 [Callithrix jacchus]
Length = 449
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 121/283 (42%), Gaps = 27/283 (9%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+++C G + ++
Sbjct: 36 LSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAIL---SQYTCSIGGLLERERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRE 201
S W + + +S W Y + P + ++W E R L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASHWRPYFALWPELGHLKHPMFWPEEERRRLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF--------KWSFGILFSRLVRL 253
+ + ++ Y+ + L +PDLF V ++E + +SF
Sbjct: 152 AVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLVALVMAYSFQEPLEEEEDE 211
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
+ V +VP AD+LNH L+Y +V T QP G ++F +YG+ +
Sbjct: 212 KEPNSPV-MVPAADILNHLANHNANLEYSADCLRMVAT-----QPIPKGHEIFNTYGQMA 265
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
N +L+ YGFV N D+ ++ + + + K EA R
Sbjct: 266 NWQLIHMYGFVEPYPDNTDDTADIQMVIVREAALQGTKTEAER 308
>gi|115472017|ref|NP_001059607.1| Os07g0471100 [Oryza sativa Japonica Group]
gi|22093661|dbj|BAC06955.1| SET-domain transcriptional regulator family-like protein [Oryza
sativa Japonica Group]
gi|50510036|dbj|BAD30661.1| SET-domain transcriptional regulator family-like protein [Oryza
sativa Japonica Group]
gi|113611143|dbj|BAF21521.1| Os07g0471100 [Oryza sativa Japonica Group]
gi|218199573|gb|EEC82000.1| hypothetical protein OsI_25940 [Oryza sativa Indica Group]
Length = 479
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 121/279 (43%), Gaps = 51/279 (18%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITA------DSKWSCPEAGEVLKQCSVPDWPLL 152
D G RGL A +++R+GE +L P + ++T+ D + + A + + SV L
Sbjct: 37 DAGGRGLAAARDLRRGELVLRAPRAALLTSGRVMDDDPRIASSVASHLPRLSSVQT---L 93
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
L+SE KSS W Y+S LP Y++L A + + EA Q+ E +
Sbjct: 94 IICLLSEVGKGKSSNWYLYLSQLPSY-YTIL----ATFNDFETEALQVDEAIWVAQKALR 148
Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
G +D F ++ +++ W+F + SR + + + D L P D+ N+
Sbjct: 149 GIRSDWEEATPLMKGLGFKPKLLMFKSWIWAFATVSSRTLHI-AWDDAGCLCPIGDLFNY 207
Query: 272 SC------------------EVETFLD--------------YDKSSQGVVFTTDRQYQPG 299
+ E LD Y+ ++ ++ R Y+ G
Sbjct: 208 AAPNDDNSSTDEDRDDMMHQETNKMLDQTDFDSSEKLTDGGYEDVNEYRLYARKR-YRKG 266
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
EQV ++YG +N ELL YGF+ G NP++ + +PL L
Sbjct: 267 EQVLLAYGTYTNLELLEHYGFLL--GENPNEKIYIPLDL 303
>gi|10437194|dbj|BAB15011.1| unnamed protein product [Homo sapiens]
Length = 449
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 118/282 (41%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QGQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP------- 254
+ + N+ Y + L +PDLF V ++E + ++ + ++P
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQVPLEEEEDE 211
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
Length = 339
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 101/225 (44%), Gaps = 21/225 (9%)
Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSR 167
R+G+ ++ +P S ++T D+ G + + P PLLA T+L++E S
Sbjct: 29 GAREGQVIISLPESCLLTTDTVIRS-YLGAYIAKWQPPPSPLLALCTFLVAEKHAGDRSP 87
Query: 168 WSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
W Y+ LP+ P L A L R LEA +A E+ T V + R R S
Sbjct: 88 WKPYLEVLPKAYTCPVCLEPEVVALLPRPLEA-----KAREQRTRVRELFTSSRGRFSSL 142
Query: 225 YPDL--FPEEVFNMETFKWSFGILFSRLV--------RLPSMDGRVALVPWADMLNHSCE 274
P L VF+ F+W++ + +R V L + AL P+ D+LN+S
Sbjct: 143 QPLLSEAAASVFSYRAFRWAWCTVNTRAVYMERGRRQGLSAEPDTCALAPYLDLLNNSPA 202
Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
V+ +++ ++ T + E+VFI YG + LLL YG
Sbjct: 203 VQVKAAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSRRLLLEYG 247
>gi|158254422|dbj|BAF83184.1| unnamed protein product [Homo sapiens]
Length = 473
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)
Query: 89 PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
PP ++A+ + V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 60 PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHACSIGGLLERERVAL 116
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 176 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 235
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332
>gi|238550107|ref|NP_001153777.1| N-lysine methyltransferase SETD6 isoform a [Homo sapiens]
gi|308153495|sp|Q8TBK2.2|SETD6_HUMAN RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
gi|119603387|gb|EAW82981.1| SET domain containing 6, isoform CRA_b [Homo sapiens]
Length = 473
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)
Query: 89 PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
PP ++A+ + V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 60 PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 116
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 176 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 235
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332
>gi|255553959|ref|XP_002518020.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
gi|223543002|gb|EEF44538.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
Length = 471
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 138/335 (41%), Gaps = 51/335 (15%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLV---ALKNIRKGEKLLFVPPSLVITADSKWSCP 135
++W+ G+ A++ +D ++ + AL+ +++GE + +P + +T+ +
Sbjct: 12 FKRWMKSQGISWCSDALELIDAPDQDGIFVKALRALKEGEVVASIPKAACLTSRT----S 67
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
A +++ S L+ L+ E S S W++Y+ LP L WT E+D +L
Sbjct: 68 GARHIIEATSFTGCLGLSFALMYEISLGHLSPWASYLHLLPDSECLPLVWTLDEVDYFLS 127
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+++ + E + + + L + + L P+ F + + ++ SR ++
Sbjct: 128 GTELHKIVKEDKALIYDDWKECILPLVDVHH-LNPQ-YFGAHQYFAARTLIASRSFQIDD 185
Query: 256 MDGRVALVPWADMLNHSCEVETFL------------------DYDKSSQGVVFTTDRQ-- 295
G + +VP AD+ NH E + +++ V + DR+
Sbjct: 186 YHG-IGMVPLADLFNHKTGAEDVHFTCGSSDSDSDDNSNGNHSFTENTVDEVPSDDREIL 244
Query: 296 -------YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK-------KS 341
+ G +VF +YG N LL YGF E NP D V + L L S
Sbjct: 245 EMIMVKDVKSGAEVFNTYGSAGNAGLLHRYGFT--EPDNPYDIVNIDLDLVFKWSSSLFS 302
Query: 342 DKCYKEKLEALRKYGLSA-----SECFPIQITGWP 371
D+ + +L RK G S +E F I G P
Sbjct: 303 DRYTRARLSLWRKLGYSGCVSENAEYFEISFDGDP 337
>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 269
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 66/109 (60%), Gaps = 5/109 (4%)
Query: 262 LVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
L+P DM NH + ++ T Y++++Q V Y+ GEQ+FI YG ++N + L+ GF
Sbjct: 41 LIPLWDMANHVNGQITT--GYNEAAQQVESLALGDYRKGEQIFIYYGNRTNADFLVHNGF 98
Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
V + N + +V +PLSL +++ ++++ + L K GL++S F +Q G
Sbjct: 99 VYPD--NANSAVAIPLSLNPTEEQFEQRKQLLEKLGLASSGDFNVQRGG 145
>gi|412991387|emb|CCO16232.1| predicted protein [Bathycoccus prasinos]
Length = 622
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 28/174 (16%)
Query: 111 IRKGEKLLFVPPSLVITA-----DSKWSCPEAGEVLKQCSV-----PDWPLLATYLISEA 160
I+K E +L V S+ +TA D+ C G++LK+ + P W L+ YL+ E
Sbjct: 156 IKKNESILKVGDSVWMTAEKAREDADGKC---GKILKRLAAQGEAAPAWVELSVYLVCEL 212
Query: 161 SFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
+SS ++ Y+S L S L+W+ +++ + SQ+ + A + V GTY L
Sbjct: 213 EKGESSFYAPYLSYLREATVLESPLFWSTEDVNA-IAGSQLLDDAAGYDSYVRGTYESLN 271
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNH 271
L D PE+ TF W+FGIL SR + P DG V LVP DMLNH
Sbjct: 272 LS-----NDGVPED-----TFLWAFGILRSR-AQQPMRDGSEVTLVPGLDMLNH 314
>gi|126305181|ref|XP_001376097.1| PREDICTED: n-lysine methyltransferase SETD6-like [Monodelphis
domestica]
Length = 453
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/257 (22%), Positives = 111/257 (43%), Gaps = 33/257 (12%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVI----TADSKWSCPEAGEVLKQCS-VPDWPLLAT 154
V G+VAL+++++GE L VP ++++ TA E G + Q VP
Sbjct: 50 VAGYGMVALEDVQRGELLFVVPRAVLLSQKTTAIRDLLEKEHGALQSQSGWVP-----LL 104
Query: 155 YLISEASFEKSSRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
+ + S WS Y S P + P ++W+ EL + L+ + + E + N
Sbjct: 105 LALLYEYLAEDSPWSCYFSLWPDLGSLQHP---MFWSEGELRQLLQGTGVPEAVQRDLAN 161
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS-------RLVRLPSMDGRVAL 262
+ Y+ + +P++FP + ++E ++ ++ + +
Sbjct: 162 ISQEYDAIVQPFLEAHPEIFPPQARSLELYRRLVAMVMAYSFQEPLEEEEDEKEPNPPMM 221
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYG 319
VP AD+LNH L+Y +V T QP G+++F +YG+ +N +L+ YG
Sbjct: 222 VPAADILNHVANHNANLEYSPEYLRMVAT-----QPILKGQEIFNTYGQMANWQLVHMYG 276
Query: 320 FVPREGTNPSDSVELPL 336
F N D+ ++ +
Sbjct: 277 FAEPYPGNTDDTADIQM 293
>gi|268535512|ref|XP_002632889.1| C. briggsae CBR-SET-29 protein [Caenorhabditis briggsae]
Length = 319
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 83/173 (47%), Gaps = 19/173 (10%)
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------RAIERITNVIGTYNDL 217
++S WS Y+ LP++ + + + D IR+ + I I+ +G + ++
Sbjct: 84 ETSAWSPYLKVLPKE-FDTPAFKGIDYDVNTLPLSIRKFWVDQKKEISEISEKVGDHYEV 142
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADM 268
R +I + LFPE + W++ ++ +R + + + DG +A++P+ DM
Sbjct: 143 RKKIVFQLRRLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDM 200
Query: 269 LNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
LNH + + ++K + V RQ GEQVF+ YG N LL+ YGF
Sbjct: 201 LNHDPQKYQGVAIHEKRNGRYVVQAKRQIMEGEQVFVCYGAHDNARLLVEYGF 253
>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
6054]
gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 453
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 54/282 (19%)
Query: 92 KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------------------ 127
K+ I++ + RG+ A +I++ +++L +P S ++
Sbjct: 34 KIEIKQFEQSGRGIAAKDDIKRSQQILRIPHSFLLNFTTVVSHITRHNSNIKLKEPYYLG 93
Query: 128 ----------ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
D + ++ E+ ++ + LL+ YL E SS W ++ LP
Sbjct: 94 IYVPLESTNNNDKFTNIYKSLELQDLLALTSFQLLSLYLCFERQRIHSSFWKPFLEMLPD 153
Query: 178 -QPYSL--LYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDL- 228
+SL L W ++D++ E Q + RA + + Y +R + DL
Sbjct: 154 ISDFSLNPLIWQVLQVDQWEELIQFLPESAKRRAEDVYERFLEDYVVVRALVSRILDDLK 213
Query: 229 ----FPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-----ALVPWADMLNHSCEVETFL 279
+E ++ F W++ + SR + + G+ + P+ D LNHSC E +
Sbjct: 214 LSESSADEYIPVDLFLWAWMCINSRCLYMTIPQGKTNADNFTMAPYVDFLNHSCNDECSI 273
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
D + V TT Y PG+Q+F+SYG N LL YGFV
Sbjct: 274 LIDTTGFHVRTTT--PYMPGDQLFLSYGPHCNEFLLCEYGFV 313
>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
Length = 538
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 74/311 (23%), Positives = 128/311 (41%), Gaps = 70/311 (22%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+L D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 138 -ILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
R L+++Q AI + FS+Y + + + F +++
Sbjct: 195 RCLQSTQ----AIHDV--------------FSQYKNTARQYAY------------FYKVI 224
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
++ + ++ + CE D+ Q G+Q++I YG +SN
Sbjct: 225 QITT---------GYNLEDDRCECVALQDF---------------QAGDQIYIFYGTRSN 260
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
E ++ GF N D V++ L + KSD+ Y K E L + G+ S F + T P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPP 318
Query: 372 LELMAYAYLVV 382
+ A+L V
Sbjct: 319 ISAQLLAFLRV 329
>gi|109128729|ref|XP_001102146.1| PREDICTED: SET domain-containing protein 6-like isoform 1 [Macaca
mulatta]
Length = 481
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 124/282 (43%), Gaps = 26/282 (9%)
Query: 89 PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
PP ++A+ + V G+VA ++++ GE L VP + ++ S+++C G + ++
Sbjct: 61 PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQYTCSIGGLLERERGAL 117
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W + L+ + + E
Sbjct: 118 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPE 176
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y+ + L +PDLF V ++E + ++ + + P +
Sbjct: 177 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 236
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 237 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 291
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 292 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 333
>gi|408390178|gb|EKJ69586.1| hypothetical protein FPSE_10234 [Fusarium pseudograminearum CS3096]
Length = 456
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 120/280 (42%), Gaps = 42/280 (15%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSK 131
E A+ L +W + +G ++Q + E GL A + ++ +PPSL ++ +
Sbjct: 8 ERAAALVQWATSNGATINP-SVQVSHLPETGLSFCATAPTSPFDTIVSIPPSLTLSYLNT 66
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--- 186
+ + + P ++ +++ + + S W+ YI ALP QP + W+
Sbjct: 67 LPGRDDPKPFSSNFLAKTPPHVIGRFVLIKHFLLRESFWTPYIQALP-QPNDVDSWSLPP 125
Query: 187 -----RAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPE--EVFNM 236
AEL E + I NV+ + DL R D PE + F +
Sbjct: 126 FWPDEDAEL---FEGTNIEVGVANIKANVMREFRAGCDLLDR-----DDWEPELLKQFTL 177
Query: 237 ETFKWSFGILFSR-----LV-------RLPS---MDGRVALVPWADMLNHSCEVETFLDY 281
++W++ I SR LV RLP +D L+P D+ NH + +
Sbjct: 178 PLYQWAYSIFSSRSFRPSLVLGLEDQQRLPENVKLDDFSVLMPLFDVGNHDMTTQVRWER 237
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
D+ S + YQPGEQ+F +Y K+N ELLL YGF+
Sbjct: 238 DEKSNDCSLKVGKAYQPGEQIFNNYSMKTNAELLLGYGFM 277
>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 524
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 135/325 (41%), Gaps = 58/325 (17%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADSKWSCPE 136
L W + G K+ ++ + GE L L + + KGE ++ +P SL +T DS
Sbjct: 31 LLSWFVEHGGSMTKLCLEDLG-GEMSLSLLTGQALNKGEVVMSIPISLCMTVDS------ 83
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
+LA +L++E S W Y+ LP + L W + + R
Sbjct: 84 --------------VLALHLMAERRKGDGSFWKQYLRTLPDDVDTPLRWLVEQAEEEFRL 129
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
L+ + + + + V + + L + +P++ F E + W+ ++SR
Sbjct: 130 LDGTMVGLLSRMMHSQVRKDWEEFHLPLVEAHPEILGGVTF--EDYLWAMSSIWSRSFDY 187
Query: 252 RLPSMD----GRVALVPWADMLNH----SCEVETFLDYDKSSQGVVF------------- 290
+ P D R A+VP + NH + + +++ G+
Sbjct: 188 QEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGTLR 247
Query: 291 -TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
+ R Y EQ FI YG+ SN +LL SYGFV +NP ++ + + ++D + K
Sbjct: 248 VSAGRDYAAREQFFILYGRYSNAKLLYSYGFVL--ASNPYGGLDYWVRVPQTDPGFAWKQ 305
Query: 350 EALRKYGLSASECF----PIQITGW 370
L ++ L+A++ + ++ GW
Sbjct: 306 ALLDEHPLTAAQAYDFSGTVRAGGW 330
>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
dendrobatidis JAM81]
Length = 648
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 121/305 (39%), Gaps = 68/305 (22%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL---------------KQCSVP 147
RG A K+I ++ F+P S ++ ++S E G+ + + P
Sbjct: 65 RGAYASKDIPPNSEICFIP-STILLSESDVRASEIGKAILTYIDEHQDAKQKISDKIKHP 123
Query: 148 DWPLL---ATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
+L A +++ + S + S W Y+++LP+ L WTR + L + +
Sbjct: 124 HAEILLAMAAFIVHQVSLPTADSHWLPYLASLPKNYALPLMWTRDRIQNLLGGTSLLYMM 183
Query: 204 IER----------ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
IER + N G Y FP +++ +W+ ++SR
Sbjct: 184 IERLEWIQNSTKVVENACGHY--------------FPTGALTVQSMQWATCSIWSRAFPK 229
Query: 252 RLPSMD---------------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
PS+D + L P DM NH +++ + +GV F T
Sbjct: 230 AKPSLDLQDGSHQDVQDWIGLSEICLFPILDMFNHKRGYR--VEWRMTEKGVSFITPDGI 287
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK---LEALR 353
G ++ +YG K N LL +YGFV NP D ++ L L++ D Y K LE +
Sbjct: 288 CKGSELLNNYGPKGNENLLSNYGFVIE--NNPEDYFKVFLGLQQEDPLYTAKKAVLEVVS 345
Query: 354 KYGLS 358
+ LS
Sbjct: 346 ENDLS 350
>gi|18490888|gb|AAH22451.1| SETD6 protein [Homo sapiens]
Length = 473
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)
Query: 89 PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
PP ++A+ + V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 60 PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 116
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 176 AVEKDLANISSEYQSIVLPFMEAHPDLFSLGVRSLELYHQLVALVMAYSFQEPLEEEEDE 235
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332
>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
Length = 903
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 122/285 (42%), Gaps = 30/285 (10%)
Query: 70 IDSLENASTLQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
++S + L +WL ++G+ ++I + RG++A + I G ++L +P L+I
Sbjct: 381 LESRKIGDNLLQWLHNAGMTSIAENHLSIADFEHTGRGVLANERIEAGVEVLHLPQHLLI 440
Query: 127 TADSKW--SCPEAGEVLKQC--SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYS 181
S P G VL D LL Y++ E S SRW+ + LP S
Sbjct: 441 NIHVALDESHP-IGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWAPFFETLPATYNS 499
Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
L + EL LE +++ + E I + + ++ + YP LFP + F E W
Sbjct: 500 PLLFHVTEL-LELEGTRLIDETFE-IKDGLRVLHESLGPLAEAYPALFPTDAFTYENLLW 557
Query: 242 SFGILFSRLVRLPSMDGRVA-----------------LVPWADMLNHSCEVE-TFLDYDK 283
++ SR ++LP A L+P+ DM+NH + YD
Sbjct: 558 VRAMIDSRAMKLPVPAAAAAVAAAAPEDATETPFVANLIPFVDMINHEEHSHISVRRYDT 617
Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
S++ +V TT G Q+ + Y + + LL YG + E NP
Sbjct: 618 SAKALVLTTLGACAAGTQLSLHYSTLPSWQQLLYYGMLSTE-LNP 661
>gi|345565943|gb|EGX48890.1| hypothetical protein AOL_s00079g111 [Arthrobotrys oligospora ATCC
24927]
Length = 445
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 116/244 (47%), Gaps = 30/244 (12%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+VA + ++K E++ F+P SL++ P + + V LA Y+ S+ F
Sbjct: 52 GIVASERVKKNEEITFIPKSLLVNLHD-IPFPNSSPIDHPTKVH--SSLAAYIASQ--FH 106
Query: 164 KSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
KS +IS LP + L+W+ LD + +R AI++ + Y
Sbjct: 107 KSDNNDPFISILPSFSSFKSSMPLFWSNEVLDNC--SPWVRSFAIKQQEKLKDDY----A 160
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDGRVALVPWADMLNH 271
+ + E F+ E ++W++ + +R + ++P+ D + + P+ D NH
Sbjct: 161 HALKMHGERGVE--FSKEEYEWAWAAVNTRTIYYRPKKWYKVPAEDC-MTMCPFIDYYNH 217
Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSD 330
+ + S G+ TT ++Y GE++F++YG+ +N LL+ YGF +P+ N +D
Sbjct: 218 DAKGDESCTVSFSIDGLRVTTQKEYSVGEEIFVTYGEYNNDHLLVEYGFTLPK---NQAD 274
Query: 331 SVEL 334
++ +
Sbjct: 275 NMNI 278
>gi|322700433|gb|EFY92188.1| hypothetical protein MAC_01789 [Metarhizium acridum CQMa 102]
Length = 469
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 85/197 (43%), Gaps = 36/197 (18%)
Query: 155 YLISEASFEKSSRWSNYISALPRQP-------YSLL-YWTRAELDRYLEASQIRERAIER 206
+LI E S W YI ALP QP ++L +W E + LE + + E I++
Sbjct: 96 FLIKEYLKRDKSFWHPYIQALP-QPGQGNKSQWALAPFWDDDEAE-LLEGTNV-EVGIDK 152
Query: 207 ITNVIGTYNDLR-----LRIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSM---- 256
I N + DL+ LR+ D E E ++W++ I SR R PS+
Sbjct: 153 IRNDVK--RDLKEARELLRLHGDGADGGAFNEALTTELYQWAYCIFSSRSFR-PSLVLSD 209
Query: 257 ------------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
D L+P D+ NH E D D Q + + PG+QVF
Sbjct: 210 KQRRMLPEGVDTDDFSVLLPLFDIGNHDMTTEVRWDLDNERQNCELRVGKTHMPGQQVFN 269
Query: 305 SYGKKSNGELLLSYGFV 321
+Y K+N ELLL YGF+
Sbjct: 270 NYSMKTNAELLLGYGFM 286
>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 275
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 116/275 (42%), Gaps = 19/275 (6%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
+ S + WL +G+ +I D RG+VA K+I GE ++ VP V+ ++ SC
Sbjct: 2 DVSKVLAWLRLAGIECSCCSIDVFDGSGRGVVATKDISCGEVVVHVPDESVLMPEN-CSC 60
Query: 135 PEAGEVLKQCSVP-DWPL----LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
EA E + D + L L++E KSS+W Y+ LP+ P L+W
Sbjct: 61 SEALEDAGLTNASGDAEMESIGLILALMTEKKLGKSSKWKGYLDFLPKSIPGMPLFWDSE 120
Query: 189 ELDRYLEASQIRERAI------ERITNVIGTYNDLRLRIFSKYPDL-FPEEVFNM-ETFK 240
+L + LE + + E+ +R +N + L L P + +
Sbjct: 121 QL-QSLEGTSLIEKMNGCKAMPDRPLEPPCKFNSVVLPFLQSNAHLKLPHNAASTRRLYV 179
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPG 299
W+ ++ + + D A+VP D LNH + L + + G
Sbjct: 180 WATAMVSAYSFTI-GEDRFQAMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKG 238
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
EQV SYG N ELL YGFV + NP D +E+
Sbjct: 239 EQVINSYGDLPNSELLRRYGFVETD-PNPHDCLEV 272
>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 499
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 107/251 (42%), Gaps = 42/251 (16%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA----ELD-RYLEASQIRERAIE 205
L+ YL E SF W YI+ LP QP + WT E D ++LE + E
Sbjct: 113 LIKEYLKGENSF-----WWPYIATLP-QPEQVNSWTLPAFWPEDDIQFLEGTNAHVAIGE 166
Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR------------ 252
N+ Y R ++ + + FP + ++ +KW+F I SR R
Sbjct: 167 IQANIKREYKQAR-KVLKE--ENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDY 223
Query: 253 ----LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
LPS +D L P D+ NHS D Y+PG+QVF +
Sbjct: 224 VSTLLPSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNN 283
Query: 306 YGKKSNGELLLSYGFV-PREGTNPSDSVEL-------PLSLKKSDKCYKEKLEALRKYGL 357
YG K+N ELLL+YGF+ P T +D V + + KSD K+ L +LR
Sbjct: 284 YGFKTNSELLLAYGFILPETDTLHNDYVHVRKRQQPEGENASKSDDQPKDFLISLRSMND 343
Query: 358 SASECFPIQIT 368
++S I++T
Sbjct: 344 ASSLAGKIRLT 354
>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
Length = 454
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/266 (26%), Positives = 117/266 (43%), Gaps = 49/266 (18%)
Query: 92 KMAIQ--KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK----WSCPEAGEVLKQCS 145
K+AI + D RG++A ++I + E L +P S ++ ++ PEA ++
Sbjct: 27 KIAIHDYRSDHQGRGVIASEDIEEDEVLFKIPRSSFLSVENDPDFIKQVPEAKKL----- 81
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
W L Y++ S ++W Y LP Q SL+ WT EL+ L+ S I ++
Sbjct: 82 -NSWLQLILYMMKAGSM---TKWKPYFDVLPTQLDSLMMWTDDELEG-LKGSMI----VK 132
Query: 206 RITNVIGTYNDLRLR---IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS------- 255
+I G D + + I +P+ F + ++E+F G++ + P
Sbjct: 133 KIGKA-GAEEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAPDSFSEDEE 191
Query: 256 -----------MDGRV-ALVPWADMLN-HS--CEVETFLDYDKSSQGVVFTTDRQYQPGE 300
+G V A+VP AD LN H+ C + D G T + + GE
Sbjct: 192 DDEDIEHDDLYNEGLVKAMVPLADTLNAHTRFCNANLIAEDD---GGFSMTAIQPIKKGE 248
Query: 301 QVFISYGKKSNGELLLSYGFVPREGT 326
QV+ +YG+ N + L YG+V EGT
Sbjct: 249 QVYNTYGELPNCDFLRRYGYVENEGT 274
>gi|341883062|gb|EGT38997.1| CBN-SET-29 protein [Caenorhabditis brenneri]
Length = 414
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 101/228 (44%), Gaps = 23/228 (10%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
G+ A R G+ ++ +P +I + P + L + + P+ + T S F
Sbjct: 30 GIYATTGFRTGKPIITLPEHDMINSALVVDLPFYKKKLAKINEKMKPMEILTMFFSFEDF 89
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
E+S+ WS Y+ LP+ + + + D IR+ I++ + LR
Sbjct: 90 EQSA-WSPYLKVLPKT-FDTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISEKLR---- 143
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNHSC 273
LFPE + W++ ++ +R + + + DG +A++P+ DMLNH
Sbjct: 144 ----HLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNHDP 197
Query: 274 E-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
E + ++K + V RQ Q GEQ+F+ YG N LL+ YGF
Sbjct: 198 EKYQGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 245
>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
Length = 536
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 125/311 (40%), Gaps = 70/311 (22%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S +
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
R L+++Q AI + FS+Y + + + + + + G
Sbjct: 195 RCLQSTQ----AIHDV--------------FSQYKNTARQYAYFYKVIQITTGY------ 230
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
++ + CE D+ Q G+Q++I YG +SN
Sbjct: 231 ---------------NLEDDRCECVALQDF---------------QAGDQIYIFYGTRSN 260
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
E ++ GF N D V++ L + KSD+ Y K E L + G+ S F + T P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSTEPP 318
Query: 372 LELMAYAYLVV 382
+ A+L V
Sbjct: 319 ISAQLLAFLRV 329
>gi|121719466|ref|XP_001276432.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119404630|gb|EAW15006.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 426
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 93/406 (22%), Positives = 168/406 (41%), Gaps = 68/406 (16%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
E + +W G+ + + G++A + I++GE ++ VP +++IT D S
Sbjct: 8 EEHTQFMQWAKSHGVKINGITPAHIPGRGAGMIATRCIQEGEVMISVPLNIMITID---S 64
Query: 134 CPEAGEVLKQCSVPDWPLLATYL-ISEASFEKSSRWSNYISALP-RQPY--SLLYWTRAE 189
P + +LA +L + + F K +W ++ P R+ + S+
Sbjct: 65 IPASFIKRFPSGTSIHGILAAFLTVGDQKFLK--KWDSWRKVWPSRKDFEESMPILWPGH 122
Query: 190 LDRYLEASQIRERAIER-------ITNVIGTYNDLR-----------------LRIFSKY 225
L R S+ + + +ER + + T+++++ R+ +
Sbjct: 123 LRR--SNSRFQAQPLERPYLLPQPASGIWNTFDNIQRDSTSVPKCQSLLSQQETRLQGAW 180
Query: 226 PDL---FPEEVFNMETFKW----SFGILFSRLVRLP--SMDGRVALVPWADMLNHSCEVE 276
++ FP ++ +F W S + + R P + + LVP+AD NH+ + +
Sbjct: 181 RNVLAVFPNMDWDAFSFHWLILNSRSFYYVKPGRQPPDEWNDAIGLVPFADYFNHADDAD 240
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-- 334
T + +D + FT RQ++ GE++F+SYG SN L + YGF N SD + L
Sbjct: 241 TEVVFD--GRKYTFTATRQFEKGEEIFMSYGAHSNDFLFVEYGFFLDH--NESDVIFLDD 296
Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFE 392
L + ++ E + L Y ++ + P +T L+ M+ F
Sbjct: 297 IISKELSEDERKELESQQGLEDYQVTMAGICPRTLTAACLKYMSTE-----------DFR 345
Query: 393 EMAAAASNK-MTSKKDIKC----PEIDEQALQFILDSCESSISKYS 433
E A S K SKK K E+ +Q Q LDS E + K S
Sbjct: 346 EYAHGHSTKAFDSKKTWKIIHEWVELYQQECQATLDSLEGILKKRS 391
>gi|332846060|ref|XP_003315172.1| PREDICTED: N-lysine methyltransferase SETD6 [Pan troglodytes]
Length = 474
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)
Query: 89 PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
PP ++A+ + V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 61 PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 117
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 118 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 176
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 177 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 236
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 237 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 291
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 292 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 333
>gi|320170563|gb|EFW47462.1| hypothetical protein CAOG_05400 [Capsaspora owczarzaki ATCC 30864]
Length = 479
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 128/314 (40%), Gaps = 29/314 (9%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLIS---- 158
RGLVA + I +P + +I + + G V+ + D +++ +L
Sbjct: 70 RGLVAKQAIPPKTVFARIPLTALINIEHAM-VSDLGPVIDASDLSDQEIMSVFLWHQLHG 128
Query: 159 ----EASFEKSSRWSNYISALP-RQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIG 212
E S W ++ LP RQ L + WT +L +L+ S +R+ + RI +
Sbjct: 129 CGQVEDGGVAESNWQPFLDTLPDRQEMHLTMLWTPEQL-AHLDGSLLRDFSERRIQVLEA 187
Query: 213 TYNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRVA------LVP 264
++ + F K+P + F +E F W I +SR + DG A LVP
Sbjct: 188 SFKRHQQSTFGKFPSAESCDWTKFTLEDFLWGMAIGWSRTHAVRVRDGEGAWQTANCLVP 247
Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFT--TDRQYQPGEQVFISYGKKS----NGELLLSY 318
AD+LN + + + + F T Q E++ Y S N LL+ Y
Sbjct: 248 VADLLNTDIASKVNAECYTNDESTHFECRTRHQLAQSEELLAQYNADSASIDNHHLLMDY 307
Query: 319 GFVPREGT-NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAY 377
GFV + + + ++ P+ L D+ ++L L+++ + P+ T + L L Y
Sbjct: 308 GFVLNDDSARRAATIGRPIPLDDPDRA--KRLSVLKQHKMQMGMSLPLSFTDFELPLTYY 365
Query: 378 AYLVVSPPSMKGKF 391
L + K+
Sbjct: 366 RILCARSQVLNSKW 379
>gi|145349216|ref|XP_001419036.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579266|gb|ABO97329.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 476
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 112/244 (45%), Gaps = 22/244 (9%)
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSR----WSNYISALPRQPYSLLYWTRAELDRYL 194
EVLKQ + ++ T ++ A + SR W+ + ALPR P + L W + +
Sbjct: 118 EVLKQLTAMGDQIIMTIWLAAAMSGQDSRLYEAWAPTLRALPRAPCTALAWDVDTMRLVM 177
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRL 253
+ Q+ ER I+ V Y+ L + + P+ FP VF + F ++ I S +++
Sbjct: 178 DHDQV-ERLIDYQRKVRVQYDALFPALCEQVPEAFPASVFGDYSRFALAYDIWTSYAMKV 236
Query: 254 P---SMDGRVALVPWADMLNHSCEVET--FLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
++ +VP + NH+ + + ++ ++ R +PG+ + ISYG+
Sbjct: 237 QDPQTLQIYEVIVPGVFLCNHALYAHSVRYTSLERGTRAFRLELARGARPGDAITISYGR 296
Query: 309 KSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
N +L+ YGF +P +NP D V L + + +E+ AL + ASE + +
Sbjct: 297 LDNADLMAYYGFTLP---SNPYDRVVLS---SLASQANEEQTAALAR----ASEMCGVDL 346
Query: 368 TGWP 371
T P
Sbjct: 347 TELP 350
>gi|145354720|ref|XP_001421625.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581863|gb|ABO99918.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 375
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 119/277 (42%), Gaps = 32/277 (11%)
Query: 118 LFVPPSLVITADSKWSCPEAGE----VLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
+ +P + V+ ++ + P G +L++ V + + + + E + S W YI
Sbjct: 1 MTIPDAAVVNWNTAAAHPTLGSTFESLLRRGVVDERLAVMCFFMIERRRGEESAWKEYID 60
Query: 174 ALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL---RLRIFSKYPD--- 227
+LPR + L ++ EL+R L + + + +V + + +R ++ +
Sbjct: 61 SLPRAYDAPLSFSDEELERELSGTTVYAPVKAQKAHVKKMFEECVRPAMRELTQADNAAG 120
Query: 228 ----LFPEEVFNMETFKWSFGILFSRLVRLP-SMDGRV---ALVPWADMLNHSCEV---- 275
+ P+ + + F W+F +SR + +P G V ++VP DM+NH+
Sbjct: 121 SSLHMLPD--VSEKEFAWAFQTFWSRALAIPVGAGGSVTVDSVVPGVDMVNHAPRARANA 178
Query: 276 --ETFLDYDKSSQGVVFTT----DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
E D + G V +R + GE++FI+YG KSN ELL +YGF ++
Sbjct: 179 RWEHVEDSSRPDGGYVALVSAPPNRTMKDGEEIFINYGDKSNEELLFTYGFALKDNAVEE 238
Query: 330 DSVELP--LSLKKSDKCYKEKLEALRKYGLSASECFP 364
V P + + ++E LR GL P
Sbjct: 239 RMVFFPPWAGDAEHSEDVTRRIELLRAKGLPQHVVLP 275
>gi|307107162|gb|EFN55406.1| hypothetical protein CHLNCDRAFT_134525 [Chlorella variabilis]
Length = 705
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 49/203 (24%), Positives = 91/203 (44%), Gaps = 12/203 (5%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGER------GLVALKNIRKGEKLLFVPPSLVITADSKW 132
Q W+ G+ P AI+ VD + G+ A++++ +GE+L +P + ++ +
Sbjct: 9 FQAWMESVGIEPNSDAIELVDAAQGCSGLALGVRAVRDVAEGERLCAIPKAACLSIRTT- 67
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
+ +V++ + L ++ E S SRW Y +ALP + Y L+W+ A+L R
Sbjct: 68 ---QLADVIEAEELGGGLGLVLAVMHEMSLGAESRWHGYFAALPPREYLPLFWSDAQL-R 123
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
L +++ A + + L + KYP ++ F+ + + SR
Sbjct: 124 LLAGTELEGSAESDREASAEDFEEHVLPLLHKYPGRLRPAACTLDRFRVAASFVGSRAFC 183
Query: 253 LPSMDGRVALVPWADMLNHSCEV 275
+ G A+VP AD+ NH V
Sbjct: 184 VDEWHGD-AMVPLADIFNHKASV 205
>gi|397614297|gb|EJK62711.1| hypothetical protein THAOC_16665 [Thalassiosira oceanica]
Length = 467
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/276 (24%), Positives = 118/276 (42%), Gaps = 51/276 (18%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKVD----VGERGL----VALKNIRKGEKLLFVPPSLVI 126
N + L+ W + +G IQKVD E GL + +I G ++F+P + +
Sbjct: 54 NMAQLEDWGAQNG-------IQKVDGLELYSEDGLDWQYITTVDIPAGTTIMFIPAGVCL 106
Query: 127 TADS-------------KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
+ + + + + G + Q SV D+ L L++E E +S + +I
Sbjct: 107 ASSAVEAELKAASNGGMQAAIDQLGRIGGQNSVADFNLFVK-LLAEYEQEDNSPFLPWID 165
Query: 174 ALPRQPYSLLYWTRAELD----RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF 229
+LPR Y+ + T + S++ + + V+G ++ I S Y
Sbjct: 166 SLPRLFYNAVSMTDFCYECLPPLVFSLSRLEKVKFDNFKQVLG-----KVDIVSDYVKQ- 219
Query: 230 PEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQ 286
N E KW+F +++R DG+ V L+P DM NH E E + +D+
Sbjct: 220 -----NDEVLKWAFNSVYTR--AYADKDGQGSDVTLIPMGDMFNHGTETEIEVYFDEGGN 272
Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLS-YGFV 321
G+V+T+ + ISYG +N L + YGF+
Sbjct: 273 GMVYTS-ADVAANSPLRISYGCPTNPSFLFARYGFL 307
>gi|428162643|gb|EKX31766.1| hypothetical protein GUITHDRAFT_149078 [Guillardia theta CCMP2712]
Length = 581
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 118/264 (44%), Gaps = 28/264 (10%)
Query: 82 WLSDSGLPPQKMAIQKVDVGERGL-VALKNIRKGEKLLFVPPSLVITA---DSKWSCPEA 137
W+ + G + +++ + G+ G+ + I KG +L+ VP ++ D + +
Sbjct: 32 WVRERGGEVGPIVLREGEGGDCGVFTSSAKINKGHELVKVPTCCLLLGRQEDIEGMKLKL 91
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-YSLL--YWTRAELDRYL 194
+ C LA L+ + ++SS + YIS LP Q ++ L +W+R E + L
Sbjct: 92 NRGERDCERD--VALALALLHHRNLKESSAFHAYISTLPPQDLFTSLPAWWSREEREELL 149
Query: 195 EASQIRERAIERITNVIGTYNDLRL--RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+S++ + A +N Y +L+ R+ S + F W+ + SR
Sbjct: 150 GSSELADAATTMASNADQDYEELKAAGRMSSSKGE-----------FLWALACVSSRSFD 198
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDK----SSQGVVFTTDRQYQPGEQVFISYGK 308
+ G V +VP D NH +T Y + + G V T+ R E+V+I+YG
Sbjct: 199 ADEL-GEV-MVPILDCFNHKRPRDTAYSYRREEAPARAGFVLTSLRDLGEEEEVYIAYGA 256
Query: 309 KSNGELLLSYGFVPREGTNPSDSV 332
K + ELLL+YGF + P S+
Sbjct: 257 KGSRELLLNYGFCVMDNVEPDGSM 280
>gi|255088291|ref|XP_002506068.1| set domain protein [Micromonas sp. RCC299]
gi|226521339|gb|ACO67326.1| set domain protein [Micromonas sp. RCC299]
Length = 513
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 122/292 (41%), Gaps = 30/292 (10%)
Query: 72 SLENA--STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-- 127
+L+NA S+ Q+W G +++ + GL A +++ GE+L+ +P L +T
Sbjct: 41 ALQNAMLSSFQRWFEAYGGTTSGISMVQTPSIGWGLTASRDVDVGERLILLPRVLQMTYS 100
Query: 128 ---ADSKWSCPEAGEVLKQ------------CSVPD--WPL-LATYLISEASFEKSSRWS 169
+S S +A L + +PD W + L L+ E + S +
Sbjct: 101 LQDRESTSSSDQATAELDREPDTPLYLKELIAQIPDELWSVRLGLALLHERALGGKSPFF 160
Query: 170 NYISALPRQPYSL-LYWTRAELD--RYLE-ASQIRERAIERITNVIGTYNDLRLRIFSKY 225
YIS LP L L++ +D +YL Q++ R+ I G ++ +
Sbjct: 161 QYISLLPAMHRGLPLFFGPEAVDALQYLPLVVQVKRRSRFLIDYSSGPLKNVTAGKNGET 220
Query: 226 PDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV--ETFLDYD 282
+ F + W+F SR R+ A++P D+ NHS E E
Sbjct: 221 ESVPFNGYSVGADALGWAFACASSRAFRVAGEGKPAAMLPLIDVANHSFEASAEVRAAMG 280
Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
+ + R + G++V ++YG SN LL YGFVP +G N D+ L
Sbjct: 281 EGPGAIEMVASRPLRAGDEVTLNYGNLSNDHFLLDYGFVP-QGINKHDTASL 331
>gi|354495008|ref|XP_003509624.1| PREDICTED: N-lysine methyltransferase SETD6-like [Cricetulus
griseus]
Length = 492
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/306 (22%), Positives = 129/306 (42%), Gaps = 29/306 (9%)
Query: 65 PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
P G + D A L +W + GL P+ ++ V G+VA ++++ GE L VP
Sbjct: 50 PRGGQSDGDAVAGFL-RWCAGVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 108
Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
S ++ S +C +G + ++ S+ W + + +S WS Y + P
Sbjct: 109 SALL---SPHTCSISGLLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPEL 164
Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
+ ++W E R L+ + + E + + N+ Y+ + L + DLF V ++
Sbjct: 165 GRLEHPMFWPEEERRRLLQGTGVPEAVEKDLVNITSEYHSIVLPFMEAHSDLFSPTVRSL 224
Query: 237 ETFKWSFGILFSRLVRLP--------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
E ++ ++ + + P +VP AD+LNH L+Y +
Sbjct: 225 ELYRQLVALVMAYSFQEPLEEEDDDEKEPNSPLMVPAADLLNHIANHNANLEYSADYLRM 284
Query: 289 VFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY 345
V T QP G ++F +YG+ +N +L+ YGF N D+ ++ + +
Sbjct: 285 VAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVRDAALQ 339
Query: 346 KEKLEA 351
K EA
Sbjct: 340 GTKDEA 345
>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
Length = 518
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 112/284 (39%), Gaps = 70/284 (24%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---CSVPDWPLLATY 155
D G RGL A+++++KGE +L VP S++IT DS + + S+ +LA
Sbjct: 78 DAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSLSPTQILAVC 137
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ E KSS W Y+ LPR Y +L A + + ++ + + +
Sbjct: 138 LLYEMGKGKSSWWYPYLMHLPRS-YDVL----ASFKKAVSKAKSEWKEANSLMDA----- 187
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
L+L+ ++ + W+ + SR + +P D L P D+ N++
Sbjct: 188 -LKLK----------PQLLTFRAWIWASATISSRALHIP-WDEAGCLCPVGDLFNYAAPG 235
Query: 276 ETFLD-------------------------------------------YDKSSQGVVFTT 292
E D +D++ F
Sbjct: 236 EESNDLENVVHWMNASSLEDSSLSNGETTDDFIGDQPDIGLERLTDGGFDENMAAYCFYA 295
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+ Y+ G QV + YG +N ELL YGF+ E NP+D V +PL
Sbjct: 296 RKNYKKGTQVLLGYGTYTNLELLEHYGFLLNE--NPNDKVFIPL 337
>gi|449453201|ref|XP_004144347.1| PREDICTED: N-lysine methyltransferase setd6-like [Cucumis sativus]
Length = 500
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/200 (22%), Positives = 95/200 (47%), Gaps = 8/200 (4%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
++W++ G+ A+Q D + G+ AL ++R+G+ + VP +T +
Sbjct: 9 FKRWMTSQGIQCSD-ALQFTDTPDNGISVKALYDLREGDVVANVPKLACLTVKT----TS 63
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
A ++++ + + L+ L+ E S ++S W+ Y+ LP + L W+ ++D++L
Sbjct: 64 ASSIIEEVGLGGYLGLSVALMYERSLGENSNWAGYLQLLPDKECVPLLWSLQDVDQFLCG 123
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+++ + E T + + + L + P +F E F +E + + ++ SR +
Sbjct: 124 TELHKTVKEDKTLMYEDWKENILPLMMSAPLMFSPEFFGIEQYFSARSLISSRSFDIDDF 183
Query: 257 DGRVALVPWADMLNHSCEVE 276
G +VP AD+ NH E
Sbjct: 184 HG-FGMVPLADLFNHKTNAE 202
>gi|422293679|gb|EKU20979.1| set domain containing protein, partial [Nannochloropsis gaditana
CCMP526]
Length = 193
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 239 FKWSFGILFSRLVRLPSMDGRVA-LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
FKW+ + SR L R A LVP+ADMLNH ET +D + Q TT +
Sbjct: 77 FKWARMCVCSRNFGLEVNRIRTAALVPYADMLNHQRPRETKWTFDNARQAFTITTLQPIA 136
Query: 298 PGEQVFISYGKKSNGELLLSYGFV 321
PG QV+ SYG+K N LL+YGF
Sbjct: 137 PGAQVYDSYGQKCNHRFLLNYGFA 160
>gi|72389450|ref|XP_845020.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62176703|gb|AAX70803.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801554|gb|AAZ11461.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 586
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 8/97 (8%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
F M+ F W++ L SR S D V A++PW D NHS + +D+ +F T
Sbjct: 255 FTMQQFIWAYNTLMSRGF---SYDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIFVT 311
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
G+QVF+ YG ++ EL+L YGF+ T PS
Sbjct: 312 TAPVSKGDQVFLQYGSYTDAELVLWYGFI----TTPS 344
>gi|302417794|ref|XP_003006728.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261354330|gb|EEY16758.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 457
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/294 (26%), Positives = 120/294 (40%), Gaps = 43/294 (14%)
Query: 79 LQKWLSDSG--LPPQKMAIQKVDVGERGLVAL-KNIRKGEKLLFVPPSLVIT----ADSK 131
L W S G L P D G V + + +R GE ++ P SL ++ D K
Sbjct: 4 LTSWASSHGAELHPAIEIFNDNDTGNSFRVKVGQQLRPGETIVTCPFSLTLSFLNALDLK 63
Query: 132 WSCPEAGEVLKQC------SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
E+ + + +VP + +LI + + S W YI LP QP L W
Sbjct: 64 SHGHESHDDTQPLPREFVETVPPHIVARFFLIKQYLLGRESFWYPYICTLP-QPDQLSSW 122
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETF---- 239
+ L + + + TN+ +++ R+ ++Y P E + N +
Sbjct: 123 SLPPLWPSDDIELLED------TNIHTAVAEIKARLKAEYKQATPLLEALPNANDYTRLL 176
Query: 240 -KWSFGILFSRLVR------------LP---SMDGRVALVPWADMLNHSCEVETFLDYDK 283
W++ I SR R LP ++D L+P D+ NHS + D
Sbjct: 177 YHWAYSIFTSRSFRPSRVVPDHESLPLPEGCAIDDFHILMPLFDVGNHSHSAKISWDIAP 236
Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPL 336
+ V T Y G QVF +YG K+N EL+L+YGF +P T +D V L L
Sbjct: 237 GTSTTVLKTLDAYDSGAQVFNNYGSKTNAELMLAYGFLIPESPTVHNDFVHLQL 290
>gi|297698886|ref|XP_002826530.1| PREDICTED: N-lysine methyltransferase SETD6 isoform 2 [Pongo
abelii]
Length = 449
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 120/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFMVPRAALL---SQHTCSIGGLLERERVAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E ++ ++ + + P +
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYRQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|410983655|ref|XP_003998153.1| PREDICTED: N-lysine methyltransferase SETD6, partial [Felis catus]
Length = 417
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 123/283 (43%), Gaps = 27/283 (9%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G +L+Q
Sbjct: 4 LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SQHTC-SIGGLLEQERGA 59
Query: 144 -CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
S W + + +S WS Y + P + ++W E R L+ + +
Sbjct: 60 LQSQSGW-VPLLLALLHELQAPASPWSPYFAMWPELGRLEHPMFWPEEERRRLLQGTGVP 118
Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
E + + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 119 EAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEED 178
Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +
Sbjct: 179 EKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPKGHEIFNTYGQMA 233
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
N +L+ YGFV N D+ ++ + + K+EA R
Sbjct: 234 NWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALLGTKVEAER 276
>gi|426243560|ref|XP_004015620.1| PREDICTED: N-lysine methyltransferase SETD6 [Ovis aries]
Length = 450
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/307 (23%), Positives = 132/307 (42%), Gaps = 28/307 (9%)
Query: 65 PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
P G + D AS L W GL P+ ++ V G+VA ++++ GE L VP
Sbjct: 13 PAGSDEDPAPVASFL-SWCQRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 71
Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
+ ++ S+ +C +G + ++ S W L L+ E +S W Y + P
Sbjct: 72 AALL---SQHTCSISGVLERERGALQSQSGWVPLLLALLHEMQ-APASLWRPYFALWPEL 127
Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
+ ++W E R L+ + + E + + N+ Y + L + DLF V ++
Sbjct: 128 GRLEHPMFWPEEERRRLLQGTGVPEAVEKDLANIRSEYYSIVLPFMDAHADLFSPRVRSL 187
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVV 289
E ++ ++ + + P + +VP AD+LNH L+Y + +V
Sbjct: 188 ELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPTCLRMV 247
Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYK 346
QP G ++F +YG+ +N +L+ YGF N +D+ ++ + +
Sbjct: 248 -----AIQPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTNDTADIQMVTVREAALQG 302
Query: 347 EKLEALR 353
K+EA R
Sbjct: 303 TKVEAER 309
>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 451
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 113/275 (41%), Gaps = 24/275 (8%)
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
ERG+ +NI ++ VP ++T DS P G +++ + D L L
Sbjct: 38 ERGVFCEENIPAETIVVSVPWEALMTVDSAKGTPFEG-LMEAGAREDDVLCLLLLYHRHI 96
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
++ S ++ LPR+ + ++++ EL+ +R ++ +T D R
Sbjct: 97 LKERSPLKGHMDVLPREYHQTIFYSDDELE------LLRGTSLHAVTVQWKAQVDTDFRE 150
Query: 222 FSKYPDLFP--------------EEVFNMETFKWSFGILFSRLVRLPSMD-GRVALVPWA 266
P P E E + W+ G ++SR V + G A+ P
Sbjct: 151 LEALPLPSPRSEEGGSSTARDALEGFLTKEEYLWALGTVWSRFVTVERAGRGLKAMAPVF 210
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
DM NH T Y +S+ + T + + G +V SYG N LLL +GF +
Sbjct: 211 DMFNHGPLSSTVHGYQESNDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPD-- 268
Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
NP +SVEL ++ + EK + + G+ S+
Sbjct: 269 NPFESVELWAMMEPGAPGFAEKNKIMLDNGVDPSK 303
>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 514
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 111/265 (41%), Gaps = 24/265 (9%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC----------SVPDWPLLA 153
GLV R+GE ++ +PP +++ P L+ ++ LA
Sbjct: 88 GLVLNAPARRGEAIVTLPPR------ARFRVPAFDSALRSLIDEFNEQHDNAIDPMTALA 141
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
L+ E S S W ++ LP S+L W EL +E ++E ERI N+
Sbjct: 142 LGLMYERS-RADSPWRAWLRMLPDPIESMLEWNDVEL-WPVEQLYVKELREERIRNLEAV 199
Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
Y + Y F +E F W+ I +R + +G ++L+P DM+NH
Sbjct: 200 YESVITPFIDTYESDLVGVDFTIEAFVWAAVIAQTRGLHESEKNG-LSLLPIVDMINHHR 258
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
E + S ++ T + GE++ I Y + S+ LLL YGFV E + D
Sbjct: 259 EPNAVV--VASGPNILVRTKTSLKAGEEITIDY-EMSSHVLLLLYGFV--EMSENLDFYP 313
Query: 334 LPLSLKKSDKCYKEKLEALRKYGLS 358
+ LS + D Y +L L GLS
Sbjct: 314 IRLSWESKDIDYPRRLRLLEGRGLS 338
>gi|46117158|ref|XP_384597.1| hypothetical protein FG04421.1 [Gibberella zeae PH-1]
Length = 456
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 118/278 (42%), Gaps = 38/278 (13%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSK 131
E AS L +W + +G ++Q + E GL A + ++ +PP+L ++
Sbjct: 8 ERASALVQWATSNGATINP-SVQVSHLPETGLSFCATAPTSPFDTIVSIPPTLTLSYLDT 66
Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--- 186
+ + + P ++ +++ + + S W+ YI ALP QP + W+
Sbjct: 67 LPGRDDPKPFSSNFLVKTPPHVIGRFVLIKHFLLRESFWTPYIQALP-QPNDVDSWSLPP 125
Query: 187 -----RAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMET 238
AEL E + I NV+ + DL R P L + F +
Sbjct: 126 FWPDEDAEL---FEGTNIEVGVANIKANVMREFRAGCDLLDRD-DWEPQLLKQ--FTLPL 179
Query: 239 FKWSFGILFSR-----LV-------RLPS---MDGRVALVPWADMLNHSCEVETFLDYDK 283
++W++ I SR LV RLP +D L+P D+ NH + + D+
Sbjct: 180 YQWAYSIFSSRSFRPSLVLGPEDQQRLPEGVKLDDFSVLMPLFDVGNHDMTTQVRWERDE 239
Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
S + YQPGEQ+F +Y K+N ELLL YGF+
Sbjct: 240 KSSDCSLKVGKAYQPGEQIFNNYSMKTNAELLLGYGFM 277
>gi|261328372|emb|CBH11349.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 586
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 8/97 (8%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
F M+ F W++ L SR S D V A++PW D NHS + +D+ +F T
Sbjct: 255 FTMQQFIWAYNTLMSRGF---SYDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIFET 311
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
G+QVF+ YG ++ EL+L YGF+ T PS
Sbjct: 312 TAPVSKGDQVFLQYGSYTDAELVLWYGFI----TTPS 344
>gi|344290687|ref|XP_003417069.1| PREDICTED: N-lysine methyltransferase SETD6-like [Loxodonta
africana]
Length = 452
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/290 (23%), Positives = 125/290 (43%), Gaps = 27/290 (9%)
Query: 82 WLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
W GL P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G
Sbjct: 31 WCGGVGLELSPKVAVSRQGTVAGYGMVAQESVQPGELLFAVPRAAIL---SQHTCCIGGL 87
Query: 140 VLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY 193
+ ++ S W + + SS WS Y + P + ++W E R
Sbjct: 88 LERERGALQSQSGW-VPLLLALLHELQAPSSPWSPYFALWPELSRLEHPMFWPEEEWRRL 146
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
L+ + + E + + N+ Y + L +P+LF V ++E ++ ++ + +
Sbjct: 147 LQGTGVPEAVEKDLANIRSEYYSIVLPFMEAHPELFSPCVRSLELYQQLVALVMAYSFQE 206
Query: 254 PSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVF 303
P + +VP AD+LNH L+Y +V T QP G+++F
Sbjct: 207 PLEEEEDEKEPNSPLMVPAADILNHLANHNAHLEYSPDCLRMVAT-----QPIPKGQEIF 261
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+YG+ +N +L+ YGFV N D+ ++ + + K+EA R
Sbjct: 262 NTYGQMANWQLIHMYGFVEPYPGNTDDTADIQMVTVREAALQGTKVEAER 311
>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
Length = 573
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 141/336 (41%), Gaps = 49/336 (14%)
Query: 72 SLENASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+L+ T +W D G + ++ + +A I+ E L+ VP +L+IT +
Sbjct: 3 NLDQLKTCVEWCKDHGAIIDDRLEFKVTQAAGVTAIAKSVIKTTEPLISVPANLLITKE- 61
Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPR---QPYSLLYW 185
+ E G S + L ++ F+ S+R Y LP QPY +W
Sbjct: 62 -LAEKEFGSASGAVSSENPNALVQLFTAKMKFDPSARPFHKPYFDILPTKLDQPY---FW 117
Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDL--RLRIFSKYPDLFPE-EVFNMETFK-- 240
E++ L+ + I + + ++ ++ L +L++ + +L+ + E + + K
Sbjct: 118 KLQEVE-LLKGTDIYLLMKQNLRKIVKEWHVLLDQLKLKPEDGELYEQSEAQDFDILKYI 176
Query: 241 -------------------WSFGILFSR------LVRLPSMDGRVALVPWADMLNHSCEV 275
W+ GI SR L S L P D+LNH +
Sbjct: 177 CEYREQHKSISWKSFVGYLWATGIFTSRAFPKLILEEKCSSINEAFLYPLVDLLNHKND- 235
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
T + + ++ V F + + GE+VF +YG+KSN +LLLSYGFV + NP D L
Sbjct: 236 -TKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFV--QDQNPYDLTRLT 292
Query: 336 LSLKKS--DKCYKEKLEALRKYGLSASECFPIQITG 369
L L K D+ +L K + A +C QIT
Sbjct: 293 LRLTKEMIDEALNAELGFSEKNKV-ADDCVQFQITA 327
>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
Length = 480
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 103/233 (44%), Gaps = 32/233 (13%)
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPR-QPYSL--LYWTRAELDRYLEASQI----RE 201
+ LL+ Y+ E SS W +I LP + L L W ++D Y E ++ +
Sbjct: 146 FQLLSFYICFEKQRGSSSFWKPFIDMLPETSDFDLAPLVWKVLKVDHYEELLKLLPNSTK 205
Query: 202 RAIERITNVIGT-YNDLRLRIFSKYPDLFPEEVFN-----------METFKWSFGILFSR 249
R +++I + T YN ++ I K ++ E N +E + WS+ + SR
Sbjct: 206 RHMDKIYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWSWMCINSR 265
Query: 250 LVRLPSMDGRVA-----LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ + + A + P+ D LNHSC+ + L D + G T Y P EQ+F+
Sbjct: 266 CLYMEIPQSKNAADNFTMAPYVDFLNHSCDDQCGLKIDGT--GFQVYTTCSYNPDEQLFL 323
Query: 305 SYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
SYG SN LL YGF +P N D + L L K +++E L Y
Sbjct: 324 SYGPHSNEFLLCEYGFTLPENKWNDLDVSDYILPLMKP-----QQIEFLNDYS 371
>gi|302845036|ref|XP_002954057.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
nagariensis]
gi|300260556|gb|EFJ44774.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
nagariensis]
Length = 598
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 66/143 (46%), Gaps = 30/143 (20%)
Query: 239 FKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNHSC----------------EVET-- 277
F+W+ ++ SR + G V LVP DMLNH EV T
Sbjct: 173 FRWALSVVHSRTFANAAPGGGVGVRMLVPLVDMLNHGGDTAAQGSLGLVGPGGGEVATDN 232
Query: 278 ----FLDYDKSSQG---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
L D+SS G + + R PG+++ +SYG++ N + L YGFVPR NP D
Sbjct: 233 VRWDLLPPDRSSAGGWSMAVSATRDIHPGQELLLSYGERPNDDFFLHYGFVPR--ANPHD 290
Query: 331 SVELPLSLKKSDKCYKEKLEALR 353
L L+ + + + E+L AL+
Sbjct: 291 DAVLWPDLEAALEWHYERLGALQ 313
>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
chloroplastic-like, partial [Cucumis sativus]
Length = 923
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 103/214 (48%), Gaps = 19/214 (8%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
++I K +G R L A + IR G+ +L VP ++ I+ DS P L + + L
Sbjct: 724 LSIGKSSIG-RFLFASETIRAGDCILKVPFNVQISPDS---LPLPIRDLLGNEIGNVAKL 779
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
A ++ E S W+ YI LP QP+ + ++W +EL+ + S + E ++ + +
Sbjct: 780 AVVVLLEHKLGLGSEWAPYIIRLP-QPWEMHNTIFWKESELEM-IRKSSLYEESLNQRSQ 837
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
+ + +R K + FPE + + + F ++ ++ SR R S +G V+L+P+AD
Sbjct: 838 IKREFLAIR-----KALEAFPEIIDRISCDDFMHAYALVTSRAWR--STEG-VSLIPFAD 889
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
LNH E L D Q DR + PGE
Sbjct: 890 FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEH 923
>gi|332227974|ref|XP_003263165.1| PREDICTED: N-lysine methyltransferase SETD6 [Nomascus leucogenys]
Length = 449
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 121/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE LL VP + ++ S+ +C G + ++
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLLVVPRAALL---SQHTCSIGGLLDRERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 450
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 113/286 (39%), Gaps = 42/286 (14%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGER---GLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
+ W+ +G+ + I+ GE GL A K+ +G L+ P L IT + P
Sbjct: 18 FRDWMQINGVQSRFCEIRPSSNGENAGFGLFATKDNAQG-VLMVTPLLLAITPMTVLQDP 76
Query: 136 EAG----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
E G +++++ V D L+ +L+ E + + S W+ Y+ LP + + L ++ EL
Sbjct: 77 ELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFGTPLSFSEEELS 136
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM---ETFKWSFGILFS 248
+ + T +I LR + + +F N+ +F F +
Sbjct: 137 ELKGTHLFQATQQQSTTGLI-----LRCPVLDRANSVFWTRALNIPCPHSFNNRFAVDLD 191
Query: 249 RL---------------VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
V++PS LVP D NH + + D V +
Sbjct: 192 STTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVALWEVDGPEGSVTGVPN 251
Query: 294 RQY---------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
Y G ++FISYG KSN ELL YGFV E NP D
Sbjct: 252 SMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVE--NPDD 295
>gi|118357514|ref|XP_001012006.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila]
gi|89293773|gb|EAR91761.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila
SB210]
Length = 454
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 28/235 (11%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWS-CPEAGEVLKQCSVPDWPLLATYLISEASFE 163
LVA +++ +GE+LL +P +L IT + P EV +V +LA +L E +
Sbjct: 39 LVAKQSVNEGEELLRIPETLFITLSVAITKLPILREVKSNLNVQKKSILAFFLFKEKK-D 97
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELD--------RYLEASQIRERAIERITNVIGTYN 215
SS + Y++++P+Q + + W + + ++ + Q + I N I +
Sbjct: 98 ASSFYHCYLNSIPKQYTNTITWQEIQFNLLRDELKTKHQKKQQKLLSEFDAIKNYISSNK 157
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNH 271
D Y +F E N F ++ SR + + A++P+ D+ NH
Sbjct: 158 D--------YSHIF--EGINEAEFLQLVAMIESRTLFFKNEQDSTSEVGAMIPFYDLANH 207
Query: 272 S----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
+ + + +D+ S+ V + + EQ+FI+YG +N L YGF+P
Sbjct: 208 TFMEGIDHFKYFYFDQISKEYVMRAYKHFVAEEQIFITYGNYNNEHFLDYYGFIP 262
>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
206040]
Length = 481
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 89/194 (45%), Gaps = 33/194 (17%)
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRERAIERI 207
LI E + S W YI ALP QP + W AEL LE + + E +++I
Sbjct: 92 LIKELLRGEESFWWPYIQALP-QPEDVDDWALPPFWPEEEAEL---LEGTNV-EVGLDKI 146
Query: 208 TNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR------------ 252
+ + + + + + + D + E+ E + W++ I SR R
Sbjct: 147 RDDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQA 206
Query: 253 LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYG 307
LP S+D L+P D+ NH V+ + D ++ G R++QPG+Q+F +Y
Sbjct: 207 LPEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYS 266
Query: 308 KKSNGELLLSYGFV 321
K+N ELLL YGF+
Sbjct: 267 PKTNAELLLGYGFM 280
>gi|238550105|ref|NP_079136.2| N-lysine methyltransferase SETD6 isoform b [Homo sapiens]
gi|333944471|pdb|3QXY|A Chain A, Human Setd6 In Complex With Rela Lys310
gi|333944473|pdb|3QXY|B Chain B, Human Setd6 In Complex With Rela Lys310
gi|333944524|pdb|3RC0|A Chain A, Human Setd6 In Complex With Rela Lys310 Peptide
gi|333944526|pdb|3RC0|B Chain B, Human Setd6 In Complex With Rela Lys310 Peptide
gi|119603386|gb|EAW82980.1| SET domain containing 6, isoform CRA_a [Homo sapiens]
gi|307686123|dbj|BAJ20992.1| SET domain containing 6 [synthetic construct]
Length = 449
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
Length = 1114
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 75/273 (27%), Positives = 114/273 (41%), Gaps = 35/273 (12%)
Query: 116 KLLFVPPSLVITADSKWSCP---EA------GEVLKQCSVPDWPLLATYLISEASFEKSS 166
+L+FVP ++ A S C EA G +L + + D LA L+ E S
Sbjct: 57 ELVFVPFDAMLHARSPLVCSGEREANDARALGALLGKVTRED-DALALRLLYERRKGAKS 115
Query: 167 RWSNYISALPRQP-YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-------- 217
RW +I+ LP P ++LL W+ AEL L S E A + V ++++
Sbjct: 116 RWGPHIALLPATPPHALLRWSEAELAE-LAGSDALELANRWRSQVSSDFSEIVDKSRAAV 174
Query: 218 -------RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWAD 267
+L K FP ++E F W+ +++SR V + S G A +P D
Sbjct: 175 EESDPGKQLSAAVKASLRFP--WLDLEGFSWAVSMIWSRCVSV-SRKGAPPIKAFLPVVD 231
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
M NH +D + G V + G+++ + Y N LLL YGF +
Sbjct: 232 MHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAAH 291
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
+ PLS + Y+ K AL K GL A+
Sbjct: 292 AGRDLYAPLSPEAPH--YEAKRAALEKLGLGAT 322
>gi|307173810|gb|EFN64588.1| SET domain-containing protein 4 [Camponotus floridanus]
Length = 376
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/296 (22%), Positives = 134/296 (45%), Gaps = 29/296 (9%)
Query: 74 ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
E+ +L+ WL +++ L + + + + RGL LK+I E L+ +P ++IT D+
Sbjct: 2 ESLISLKSWLLNENCLSIRHLIPEYFPLTGRGLKTLKHIECNEVLIQLPFRMLITTDTLL 61
Query: 133 SCPEAGEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE-- 189
L S +LAT+L+ E S+W Y+ LP+ + + + E
Sbjct: 62 QSNIRFLFLNTTDSFNPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEKR 121
Query: 190 -LDRYLEASQIRERAIERITNVIGT---YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
L ++ S + +E +++ + D+ + + +L ++ E +KW++ +
Sbjct: 122 ILPSFILNSLHQAHRLESNFSLLMKAVKHLDIINKNHCSHCNLHLRKIITFEKYKWAYYV 181
Query: 246 LFSRLVRLPSMDGR------------VALVPWADMLNHSCEVE---TFLDYDKSSQGVVF 290
+ +R V + + R +AL P+ D+ NH+ + + + + +Q
Sbjct: 182 VNTRAVYIDTKLLREKNIFNIKQPNNLALAPFLDLFNHNVDTAVKVSIITDNNQNQFYQI 241
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYG-FVPREGTNPSDSVELPLSLKKSDKCY 345
T + + QVFI+YG +N +L + YG F+P NP D E+ + + +C+
Sbjct: 242 ITLKPFDRESQVFINYGAHNNLKLYIDYGFFIP---CNPLD--EIYFDILEIQRCF 292
>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
Length = 533
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 39/269 (14%)
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND----- 216
F+ S W + PR+ +W L L+ + IR+ AI ++ +I D
Sbjct: 159 FDPDSFWQPWFQLFPRELDCAGFWDDLLL-MELDNTSIRD-AIRQLEALIEYEYDQLDLP 216
Query: 217 -LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR-VALVPWADMLNHSCE 274
LRLR +PD F + F+ + FKW+F +L SR + + + ++P+ D NH+
Sbjct: 217 ALRLR----FPDSFVADRFSYDDFKWAFMVLASRGLTMSVNNAPCTVMIPFVDFFNHNGA 272
Query: 275 VETFL---------------DYDKSSQGV---VFTTDRQYQPGEQVFISYGKKSNGELLL 316
+YD S + + V + + + PGEQ+F++Y SN LLL
Sbjct: 273 KSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHSNEVLLL 332
Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDK---CYKEKLEALRKYGLSASECFPIQITGWPLE 373
YGF + + V L +K++ +E L LR G+ + F ++ G ++
Sbjct: 333 HYGFALPHNEHDTFLVRLHFDREKTNDPLMDLREHLLELR--GIQENHPFLLRWHGDIID 390
Query: 374 ---LMAYAYLVVSPPSMKGKFEEMAAAAS 399
L A ++ + + EE A+ S
Sbjct: 391 PDILFALRVMIATKDQLDFLLEEGIASNS 419
>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
Length = 481
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 76/192 (39%), Gaps = 29/192 (15%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRA-----ELDRYLEASQIRERAIERITN 209
+LI E + S W+ YI+ LP QP + W E YL + E N
Sbjct: 108 FLIKEYLKGRDSFWAPYIATLP-QPEHVSAWALPAFWPEEDIAYLAGTNAHVAIAEIQAN 166
Query: 210 VIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR---------------- 252
V + R + + FP + + +KW+F I SR R
Sbjct: 167 VKSEFKQARKALKAAG---FPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAEL 223
Query: 253 LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
LP +D L P D+ NHS D YQPGEQV+ +YG K
Sbjct: 224 LPPGCQLDDFSILQPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYGLK 283
Query: 310 SNGELLLSYGFV 321
+N ELLL+YGF+
Sbjct: 284 TNSELLLAYGFI 295
>gi|109128727|ref|XP_001102235.1| PREDICTED: SET domain-containing protein 6-like isoform 2 [Macaca
mulatta]
Length = 456
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/282 (22%), Positives = 122/282 (43%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+++C G + ++
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQYTCSIGGLLERERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W + L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y+ + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 49/273 (17%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA--GEVLKQCSVPDWPLLATYLI 157
G RGL A++ ++KGE +L VP + ++T +S + V+ S+ +L+ L+
Sbjct: 49 AGGRGLGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLL 108
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
E K S W Y+ LPR Y LL T E ++ +A Q+ E A+ I
Sbjct: 109 YEMGKGKRSFWYPYLVHLPRD-YDLLA-TFGEFEK--QALQV-EDAVWATEKAIA----- 158
Query: 218 RLRIFSKYPDLFPEEVFNMETFK------WSFGILFSRLVRLPSMDGRVALVPWADMLN- 270
+ + K L EE+ F+ W+ + SR + +P D L P D+ N
Sbjct: 159 KCQFEWKEVGLLMEELELKSKFRSFQAWLWASATISSRTLHVP-WDSAGCLCPVGDLFNY 217
Query: 271 -------HSCE--------------VETFLD------YDKSSQGVVFTTDRQYQPGEQVF 303
H+ E VET + +++ R YQ GEQV
Sbjct: 218 DAPGDDLHTLEGPESANDVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVL 277
Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+ YG +N ELL YGF+ E +N D V +PL
Sbjct: 278 LCYGTYTNLELLEHYGFMLEENSN--DKVFIPL 308
>gi|133902101|ref|NP_490849.4| Protein SET-29 [Caenorhabditis elegans]
gi|373219869|emb|CCD70787.1| Protein SET-29 [Caenorhabditis elegans]
Length = 401
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 36/235 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--------AGEVLKQCSVPDWPLLATY 155
G+ A R G+ + +P + +I A P GE LK + L +
Sbjct: 31 GIYATTGFRTGKAFITLPETDMINAALVVDLPVYRKKLAKIGGEKLKPMEI-----LTMF 85
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
E + + S WS Y+ LP++ ++ + + D IR+ I++ +
Sbjct: 86 FAFEDT--EHSAWSPYLKVLPKE-FNTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISE 142
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWA 266
LR LFPE + + W++ ++ +R + + + DG +A++P+
Sbjct: 143 KLR--------RLFPE--LSHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYV 192
Query: 267 DMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
DMLNH E + ++K + V RQ Q GEQ+F+ YG N LL+ YGF
Sbjct: 193 DMLNHDPEKYQGLALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 247
>gi|68488193|ref|XP_712057.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
gi|46433419|gb|EAK92860.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
Length = 552
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 58/321 (18%)
Query: 74 ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
E + Q WL + + K+AI D + RG++AL++I E + +P S+V+ D+
Sbjct: 6 EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65
Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
VLK+ V D W L L E F+ S W Y++ LP Q
Sbjct: 66 NSLIKSYPSVLKKLRVLDQWIGLIIVLGFEIKFKFNPSDNNDNHNRSFWYEYLNILPDQF 125
Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN-DLR-LRIFSKYPDLFPEE 232
L+YW EL+ +L+ S I +E + +I N DL + F P F EE
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDRIGKENNLNMYNQIISIINQDLSGVEEFKSSPLTF-EE 183
Query: 233 VFNMETF--KWSFGILFSRLVRLP----SMDGRV-------------------------A 261
+ T +SF + + ++ + G +
Sbjct: 184 YNKVATIIMSYSFDVEVPKSKKMTKNGTNEKGNDEEDEEEDEDKEDDDDDEEEDNEYYKS 243
Query: 262 LVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
+VP+AD LN + L Y S+ ++ T + GEQV+ +Y N ELL YG+
Sbjct: 244 MVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSELLRRYGY 301
Query: 321 VPREGTNPSDSVELPLSLKKS 341
V G+ D E+PLS KS
Sbjct: 302 VELNGS-KYDFGEIPLSTIKS 321
>gi|403306046|ref|XP_003943557.1| PREDICTED: N-lysine methyltransferase SETD6 [Saimiri boliviensis
boliviensis]
Length = 449
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/283 (22%), Positives = 118/283 (41%), Gaps = 27/283 (9%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + ++ S +C G + ++
Sbjct: 36 LSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAIL---SPHTCSIGGLLERERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRE 201
S W + + +S W Y + P + ++W E R L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAAASHWRPYFALWPELGHLEHPMFWPEEERRRLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF--------KWSFGILFSRLVRL 253
+ + ++ Y+ + L +PDLF V ++E + +SF
Sbjct: 152 AVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLVALVMAYSFQEPLEEEEDE 211
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
+ + +VP AD+LNH L+Y +V T QP G ++F +YG+ +
Sbjct: 212 KEPNSPI-MVPAADILNHLANHNANLEYSADCLRMVAT-----QPIPKGHEIFNTYGQMA 265
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
N +L+ YGFV N D+ ++ + + K EA R
Sbjct: 266 NWQLIHMYGFVEPYPNNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
Length = 254
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 12/190 (6%)
Query: 258 GRVA--LVPWADMLNHSC-EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
G VA LVP D++NH +V T D+D S ++F + Q+ + YGK+++ E
Sbjct: 13 GAVAMCLVPIWDLINHKLGQVTT--DFDPESGELIFYSMEFTPKNTQILMDYGKRTSAEF 70
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
L+ GFVP TNP ++V + L + KSD+ ++ + L L + + ITG L
Sbjct: 71 LMFSGFVP--ATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSP--LILHITGDLSSL 126
Query: 375 ---MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISK 431
+A+A + V M +A + + + ID+QA+ F++ E +S
Sbjct: 127 SDAIAFARVFVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIAFLIMRFELLVSA 186
Query: 432 YSRFLQVKEL 441
Y + E+
Sbjct: 187 YGPMVSEDEV 196
>gi|322802325|gb|EFZ22721.1| hypothetical protein SINV_12919 [Solenopsis invicta]
Length = 435
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 72/326 (22%), Positives = 137/326 (42%), Gaps = 49/326 (15%)
Query: 74 ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
E+ L+ WL S++ + + + RGL LK I K E L+ +P ++IT D
Sbjct: 25 ESLICLKSWLLSENCMSISYFIPEHFPLSGRGLKTLKRIEKNEVLIQLPLRMLITTDILM 84
Query: 133 SCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE- 189
L + P +LAT+L+ E S+W Y+ LP+ + + + E
Sbjct: 85 QSDVKTLFLYSTTDSFSPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEK 144
Query: 190 -------LDRYLEASQIRE------RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
L +A ++++ +A++R+ D+ R + + +++
Sbjct: 145 AILPDFILHPLHQAHKLQKDFSLLMKAVKRL--------DINSRNSCPHCNACLQKIITF 196
Query: 237 ETFKWSFGILFSRLVRLPS-----------MDGRVALVPWADMLNH----SCEVETFLDY 281
+KW++ ++ +R V + + +AL P+ D+ NH + +V
Sbjct: 197 AKYKWAYYVVNTRAVYIDNGVCKENVFNIKQPNNLALAPFLDLFNHDINTAVKVSIVTVS 256
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG-FVPREGTNPSDSVELPLSLKK 340
D ++ T + + G QVFI+YG + +L + YG F+P NP D E+ +
Sbjct: 257 DCQNKFYQIVTLKPFDKGSQVFINYGAHDSLKLYIDYGFFIPH---NPLD--EIKFDIFD 311
Query: 341 SDKCY---KEKLEALRKYGLSASECF 363
+C+ + KL+ + G S F
Sbjct: 312 IQRCFDVSRNKLDFIMLNGFHKSMSF 337
>gi|307103410|gb|EFN51670.1| hypothetical protein CHLNCDRAFT_139898 [Chlorella variabilis]
Length = 543
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 113/291 (38%), Gaps = 39/291 (13%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-- 136
L W+ SG +A+++ + G GL A ++ G L+ +P +T D S P
Sbjct: 50 LVAWVESSGGSAAGVAVRRNEAGF-GLAASRDCGAGSTLVSLPQRCHLTYDDS-SDPRLL 107
Query: 137 --AGEV----LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
G+V L+ + P P ++ L+ A + ++AL P + R
Sbjct: 108 ALIGQVVAHRLQGATSPFAPYISNLLLGVAGLPMFF-GGDALAALQYPPVTEQVKRRC-- 164
Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
R+L A RE A R D F + W+ ++ SR
Sbjct: 165 -RWLLAFAQRELAAARRGG----------------GDPFGGADVDANALGWALAVVTSRA 207
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
R D A++P DM NHS + + R Q GE V ISYG S
Sbjct: 208 FRTRGPDQPAAMLPLIDMANHSFQAANAKIAPGPGGSMCMVATRALQAGEPVLISYGALS 267
Query: 311 NGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
N LL+ YGF VP NP D+V+L + D+ E +A+ G +
Sbjct: 268 NDFLLMDYGFIVP---GNPHDTVQL-----RFDRGLIEAAKAVAGVGCTGG 310
>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
Length = 449
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 83/395 (21%), Positives = 163/395 (41%), Gaps = 49/395 (12%)
Query: 75 NASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKW 132
+ S + +W + +G K+ ++ D GE G L A ++I GE +L +P +L+
Sbjct: 25 DGSDVYEWAAANGANVSKVVLR--DDGEAGPILHAKEDIEAGEVILSLPANLLFPTRVSD 82
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
P +++ ++ + YLISE + + SS W ++ +LP + + L ++ ++
Sbjct: 83 HSPVV-HMIENTTIGRITAICLYLISERA-DSSSHWKPWLQSLPPRFFHALSYSEDDM-L 139
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP------------EEVFNMETFK 240
+ +AS +E + NV Y + K P P E F E F+
Sbjct: 140 HFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVTREDFTYEAFE 199
Query: 241 WSFGILFSRLVRLPSMDGR---------VALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
W++ ++ +R + P + G + L P AD H + YD VF+
Sbjct: 200 WAYSVVTTRGI-FPGLLGEEEREGEVPLLVLGPLADSFIHGAS-GVKISYDAQEHRCVFS 257
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
+ + I G SN ELL + GF+ + N ++ V + L ++ + E+
Sbjct: 258 ALHKVAKNSPISIGVGMSSNMELLANRGFMMQ--NNGNNFVLMKFQLDRNSDMHASARES 315
Query: 352 -LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
+++ LS + ++ P L+A + P G + K +
Sbjct: 316 MMKQLNLSNPMTYVVRYGEMPQGLLASLRIQSLSPVEFGSY-------------GKALAT 362
Query: 411 P---EIDEQALQFILDSCESSISKYSRFLQVKELL 442
P E + +A + ++ SC S ++ Y ++ E++
Sbjct: 363 PVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIV 397
>gi|58261130|ref|XP_567975.1| nucleus protein [Cryptococcus neoformans var. neoformans JEC21]
gi|134115865|ref|XP_773415.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256040|gb|EAL18768.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230057|gb|AAW46458.1| nucleus protein, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 495
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 106/264 (40%), Gaps = 63/264 (23%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----------DSKWSCPEAGEVLKQCSVPDWPLLA 153
G VA+K+I +G L VP L+++A S+W G W L
Sbjct: 44 GAVAVKDIEEGTPLFHVPDDLILSAYTSDLKDHLDASEWDQLNKG----------WAQLI 93
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWT---RAELDRYLEASQI-RERAIERITN 209
++ E SRW+ Y++ +P + ++WT R +L A +I RE A T+
Sbjct: 94 LVMMWETIKGSKSRWAGYLANMPVLFETPMFWTERQREQLSGTDIADRIGREDAEAEYTS 153
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLP-----------SM 256
V+ + +PDLFP + M+ F + SR +P
Sbjct: 154 VLAPF-------IKAHPDLFPVDSPHITMDAFHIQGSRILSRSFTVPLHRFGRSHSQSRS 206
Query: 257 DGR-------------VALVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQ 297
DG V ++P+ADMLN + ++D D +GVV + + +
Sbjct: 207 DGNSEKESDDEDEEEMVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTQLVK 266
Query: 298 PGEQVFISYGKKSNGELLLSYGFV 321
EQ++ +Y N ELL YG V
Sbjct: 267 QSEQIYNTYDSPPNSELLRKYGHV 290
>gi|145511243|ref|XP_001441549.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408799|emb|CAK74152.1| unnamed protein product [Paramecium tetraurelia]
Length = 731
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 15/201 (7%)
Query: 92 KMAIQKVDVGER--GLVALKNIRKGEKLLFVPPSLVITADSKWSCP------EAGEVLKQ 143
K AI K G + GLVA + I E L+ VP L++T + P + +
Sbjct: 50 KYAIFKTKNGLKYPGLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFLDHPQYFSN 109
Query: 144 CSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
P W L +++ E S W IS LPR L++W E + L+ Q+ +
Sbjct: 110 QFYPSWEDHQLMAFILYEYQRGPESEWHLLISNLPRDIDYLVFWNPEEQE-LLDDQQLVK 168
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
A ++ + Y L+ I KYP LF E E +W + L +R V
Sbjct: 169 LARKQYQEFVIEYETLKC-ITDKYPQLFKPETVTFENARWVYTHLVTRC--FGKYLAYVT 225
Query: 262 LVPWADMLNHSCEVETFLDYD 282
+VP+ ++ NH C + F D++
Sbjct: 226 MVPFCELFNHEC-TDVFYDFE 245
Score = 41.6 bits (96), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 27/154 (17%)
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--- 351
Q++ G QV+ YG+ SN +L+ YG N D L + K Y + +EA
Sbjct: 516 QFEKGAQVYFCYGRLSNRMMLMRYGMTLE--YNKYDHAHLRIDYLK----YVQNIEAVWL 569
Query: 352 LRKYGLSASECFPIQITGWPLELMAYA---YLVVSPPSMKGKFEEMAAAASNKMTSKKDI 408
+ KY LS + F ++ T +P++ + + Y + S+ F+ I
Sbjct: 570 VHKYQLSKYKRFKLKHTTFPIDFIVFCKSIYWTFNVHSLDTFFK---------------I 614
Query: 409 KCPEIDEQALQFILDSCESSISKYSRFLQVKELL 442
+ +++ +ALQ L+ ISK++ L+ E L
Sbjct: 615 QDLKLERKALQLALEILVEEISKFTDKLEDNEKL 648
>gi|326927087|ref|XP_003209726.1| PREDICTED: n-lysine methyltransferase SETD6-like [Meleagris
gallopavo]
Length = 410
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 61/241 (25%), Positives = 108/241 (44%), Gaps = 28/241 (11%)
Query: 114 GEKLLFVPPSLVITADSKWSCP------EAGEVLKQCSVPDWPLLATYLISEASFEKSSR 167
GE L VP S ++ S+ +C +A E L+ S W L L+ E + +SR
Sbjct: 12 GELLFSVPRSALL---SQHTCAIRALLHDAQESLQSQS--GWVPLLLALLHEYT-TSTSR 65
Query: 168 WSNYISALPRQPYSLL----YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
W Y S Q +S L +W E + L+ + I E + + N+ Y+ + L
Sbjct: 66 WQPYFSLW--QDFSSLDHPMFWPEEERTKLLQGTGIPEAVDKDLANIQLEYSSIILPFMK 123
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------LVPWADMLNHSCEV 275
+PD+F E+ +E +K + + + P + +VP AD+LNH
Sbjct: 124 SHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPMMVPVADILNHVANH 183
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
L+Y + + + T + G+++F +YG+ +N +LL YGF N +D+ ++
Sbjct: 184 NASLEY--APRCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQ 241
Query: 336 L 336
+
Sbjct: 242 M 242
>gi|68488236|ref|XP_712036.1| hypothetical protein CaO19.2654 [Candida albicans SC5314]
gi|46433396|gb|EAK92838.1| hypothetical protein CaO19.2654 [Candida albicans SC5314]
Length = 552
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 58/321 (18%)
Query: 74 ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
E + Q WL + + K+AI D + RG++AL++I E + +P S+V+ D+
Sbjct: 6 EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65
Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
VLK+ V D W L L E F+ S W Y++ LP Q
Sbjct: 66 NSLIKSYPSVLKKLRVLDQWIGLIIVLGFEIKFKFNPSDNNDNHNRSFWYEYLNILPDQF 125
Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN-DLR-LRIFSKYPDLFPEE 232
L+YW EL+ +L+ S I +E + +I N DL + F P F EE
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDRIGKENNLNMYNQIISIINQDLSGVEEFKSSPLTF-EE 183
Query: 233 VFNMETF--KWSFGILFSRLVRLP----SMDGRV-------------------------A 261
+ T +SF + + ++ + G +
Sbjct: 184 YNKIATIIMSYSFDVEVPKSKKVTENGTNEKGNDEEDDDEDEDKEDDDDDEEEDNEYYKS 243
Query: 262 LVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
+VP+AD LN + L Y S+ ++ T + GEQV+ +Y N ELL YG+
Sbjct: 244 MVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSELLRRYGY 301
Query: 321 VPREGTNPSDSVELPLSLKKS 341
V G+ D E+PLS KS
Sbjct: 302 VELNGS-KYDFGEIPLSTIKS 321
>gi|145545977|ref|XP_001458672.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426493|emb|CAK91275.1| unnamed protein product [Paramecium tetraurelia]
Length = 666
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 48/176 (27%), Positives = 82/176 (46%), Gaps = 12/176 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSVPDWP--LLATY 155
GL ++ I L+ VP L++T +D + + Q W +L TY
Sbjct: 69 GLKTIEKIESDSILVSVPRELMLTTKIAYFSDIQEIFDAYPQFFSQHCAGGWQDRILLTY 128
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ ++ + S+W + I+ LPR L++W+ EL + L ++ +A + + +
Sbjct: 129 LLYQSQLGRQSQWYHLIANLPRDIDYLIFWSDEEL-KLLNDEKLVLKAKRELQDFLLIQK 187
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
L I +YP F +E +++E KW F L SR S +VA VP+ +M NH
Sbjct: 188 TLT-HILDQYPQHFKKETYSLENIKWIFIHLVSRC--FGSTLEQVAFVPFCEMFNH 240
>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
Length = 339
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/238 (27%), Positives = 109/238 (45%), Gaps = 21/238 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISE 159
R LVA +NI+ GE ++ VP V+ A++ + E G + + S + L L+
Sbjct: 66 RALVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEG--MTKDSADEEILEVQGLVIA 123
Query: 160 ASFEK----SSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI------- 207
+E+ SRW+ Y++ LP + LYW R E R L + ++ + R
Sbjct: 124 VMWERWRGPESRWAPYLALLPDDMTHMPLYWKRREF-RELRGTAAYDKMLGRAQHPSDAP 182
Query: 208 TNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWA 266
T V ++++ +++P+L P E ++W+ + S L D A+VP
Sbjct: 183 TQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD-DKYQAMVPVW 241
Query: 267 DMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
D+LNH + +V L + + R G ++ +YG+ SN ELL YGFV R
Sbjct: 242 DLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVER 299
>gi|402908594|ref|XP_003917022.1| PREDICTED: N-lysine methyltransferase SETD6 [Papio anubis]
Length = 456
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 64/282 (22%), Positives = 121/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G + ++
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERGAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y+ + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
Length = 436
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 131/322 (40%), Gaps = 33/322 (10%)
Query: 68 CEIDSLENASTLQKWLSDS-GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
E+D + L+ WL + G+ K+ +Q+ + G+ A + + GE L +P S I
Sbjct: 18 VEVDGGLRGNALRIWLEEEHGVDMSKVDLQRSPLEGLGVFANRRLEPGETLFMIPKSCCI 77
Query: 127 TADSKWSCPEAGEVLKQCSVP-----DWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
+ + + G+ +++ + + LAT+L E S + +I LP
Sbjct: 78 YPELVFEDRQLGKSMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVLPWDSLH 137
Query: 182 LLYWTRAELD------RYLEASQIRER---AIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
L WT E+D + E RE+ A E V+ + + + PEE
Sbjct: 138 PLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKGWKQFFQTIETEKMTPEE 197
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVAL-----VPWADMLNH-----SCEVETFLDYD 282
M + +F + SR G L +P D+ NH S +T L+ D
Sbjct: 198 FGFM--MRGAFASVLSRAFDSKIGRGDKGLEERVVIPLLDIFNHGSYGPSITFDTALERD 255
Query: 283 KSSQGVVFTTDR--QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS-DSVELPLSLK 339
V D+ + GE++F YG K N +L +YGFV NP L +S+
Sbjct: 256 NEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFV---SPNPKCQETTLSVSID 312
Query: 340 KSDKCYKEKLEALRKYGLSASE 361
+ D + +K E L+ G+ A E
Sbjct: 313 EKDPYFAQKEEILKARGMVAVE 334
>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
n-methyltransferase i [Nannochloropsis gaditana CCMP526]
Length = 385
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 107/251 (42%), Gaps = 35/251 (13%)
Query: 82 WLSDS---GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKLLFVPPSLVITADSK 131
W+ D G+PP + + + E RGL+ I G L +P S+VI +
Sbjct: 121 WMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIVIDKEKS 180
Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAE 189
+ + + ++ +A LI E + SS W+ YI LP + L W +
Sbjct: 181 LAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTEEVNPTLVWPEGD 237
Query: 190 LDRYLEASQI--RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
L LEAS + R+++R + + L + D+F VF E + W+F +F
Sbjct: 238 L-ALLEASPLVAATRSLKR--KLAAEFALLEEQYMRARSDVFDPSVFTFEAYLWAFINIF 294
Query: 248 SRLVRLPSMDGR---------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
SR +R+ R + + P+AD++NH+ T++ +K + +F R
Sbjct: 295 SRAIRVKIGGKRGPSGEEEESIIMCPYADLINHNPFANTYIVAEKPFK--MFNPIR---- 348
Query: 299 GEQVFISYGKK 309
GE+V Y K
Sbjct: 349 GEEVITIYADK 359
>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
Length = 540
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 123/313 (39%), Gaps = 64/313 (20%)
Query: 71 DSLENAST--LQKWLSDSGLPPQKMAIQKVDVGERG----LVALKNIRKGEKLLFVPPSL 124
DSL N T WL+ G I+ D+ G ++A K+I E L +P S
Sbjct: 50 DSLFNEQTESFLSWLTTDGKVTVSSKIKIEDLRSEGQGRCIIASKDIDTDELLFEIPRSS 109
Query: 125 VITADSKWSCPEAGEVL-KQCSVPDWPLLATYLISEAS-FEKSSRWSNYISALPRQP--Y 180
++ + C + + K + W L ++ E + SRWS+Y + LP
Sbjct: 110 ILNVTTSQLCVDFPHITGKLMELSQWDSLIICMMYEMKVLQHESRWSSYFNVLPSSESLN 169
Query: 181 SLLYWTRAELDRYLEASQIRERA--------IERITNVIGTYNDLRLRIFSKYPDLFPEE 232
+L+YW EL +L S + R RI + I +N+ D+ E+
Sbjct: 170 TLMYWNDKELS-FLTPSLVVNRVGKGDAETMYRRILDTINEFNE----------DILTEK 218
Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV------------------------ALVPWADM 268
+ + W + ++ S D + +++P AD
Sbjct: 219 ---LGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEFDEKEEEPELLKSMIPLADT 275
Query: 269 LN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
LN H C L YDK S ++ + + GEQV+ +YG+ N ELL YG+V G
Sbjct: 276 LNADTHKCNAN--LTYDKDSLKMLAI--KPIKKGEQVYNTYGELPNSELLRKYGYVEWGG 331
Query: 326 TNPSDSVELPLSL 338
+ D E+P L
Sbjct: 332 SQ-FDYGEVPFDL 343
>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
Length = 442
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 83/203 (40%), Gaps = 31/203 (15%)
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL------LYWTRAELDRYLEASQ 198
SVP L +LI E K S W Y++ LP P + +W ++ YLE +
Sbjct: 104 SVPPHVLGRFFLIKEYLKGKDSFWWPYLATLP-SPDQVNAWVLPAFWPEDDI-AYLECTN 161
Query: 199 IRERAIERITNVIGTYNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVR----- 252
E NV G + R + ++ +PD+ + +KW+F I SR R
Sbjct: 162 AHVAIQEIQANVKGEFKQARKILKNENFPDV---AAYTSLMYKWAFTIFTSRSFRPSLIL 218
Query: 253 -----------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
LP +D L P D+ NHS D + +Y P
Sbjct: 219 SDTTKRHISTLLPQSVELDDFSILQPLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPP 278
Query: 299 GEQVFISYGKKSNGELLLSYGFV 321
G QVF +YG K+N ELLL YGF+
Sbjct: 279 GAQVFNNYGLKTNSELLLGYGFI 301
>gi|410215066|gb|JAA04752.1| SET domain containing 6 [Pan troglodytes]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|410257726|gb|JAA16830.1| SET domain containing 6 [Pan troglodytes]
gi|410351697|gb|JAA42452.1| SET domain containing 6 [Pan troglodytes]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
L P+ ++ V G+VA ++++ GE L VP + ++ S+ +C G E +
Sbjct: 36 LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 92
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
S W + + +SRW Y + P + ++W E L+ + + E
Sbjct: 93 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
+ + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211
Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266
Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+L+ YGFV N D+ ++ + + K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308
>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
nagariensis]
Length = 510
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 65/133 (48%), Gaps = 6/133 (4%)
Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
A+ P D+ NHS V++ + Y+ R+++ GEQVFISYG +SN L+ YGF
Sbjct: 294 AICPLIDLFNHSSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGF 353
Query: 321 VPREGTNPSD---SVELPLSLKKSDKCYKEKLEALRKYGLSAS-ECFPIQITGWPLELMA 376
E NP D ++ L + +L+AL+ L+ S + IQ G+P E +
Sbjct: 354 A--EANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFPSETLQ 411
Query: 377 YAYLVVSPPSMKG 389
+++ S G
Sbjct: 412 AVRFLLAADSEAG 424
>gi|308498155|ref|XP_003111264.1| CRE-SET-29 protein [Caenorhabditis remanei]
gi|308240812|gb|EFO84764.1| CRE-SET-29 protein [Caenorhabditis remanei]
Length = 401
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 101/228 (44%), Gaps = 23/228 (10%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
G+ A ++ R G ++ +P +I + P + + + P+ + T F
Sbjct: 30 GIYATRSFRSGLPIITLPEYDMINSALVLDLPFYRKKMANVNEKLKPMEILTMFFCFEDF 89
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
E+S+ WS Y+ LP++ + + R + D IR+ I++ + LR
Sbjct: 90 EQSA-WSPYLKILPKE-FDTPAFKRIDYDVNTLPLSIRKYWIDQKKEISEISEKLR---- 143
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNHSC 273
LFPE + W++ ++ +R + + + DG +A++P+ DMLNH
Sbjct: 144 ----RLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNTDGDTIAVIPYVDMLNHDP 197
Query: 274 E-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
E + ++K + V RQ GEQVF+ YG N LL+ YGF
Sbjct: 198 EKYQGVALHEKRNGRYVVQARRQILEGEQVFVCYGAHDNARLLVEYGF 245
>gi|348572449|ref|XP_003472005.1| PREDICTED: LOW QUALITY PROTEIN: N-lysine methyltransferase
SETD6-like [Cavia porcellus]
Length = 466
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 112/249 (44%), Gaps = 23/249 (9%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ + ++ V G+VA ++++ GE L VP + ++ S +C G++L++
Sbjct: 77 LSPKVVVSKQGTVAGYGMVARESVQPGELLFAVPRAALL---SPHTC-SIGDLLERERSA 132
Query: 144 -CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
S W + + +S WS Y + P + ++W E R L+ + +
Sbjct: 133 LQSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVP 191
Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
E + + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 192 EAVDKDLANIRSEYYAIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEEE 251
Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP-GEQVFISYGKKSNG 312
+VP AD+LNH L+Y +V T Q+ P G ++F +YG+ +N
Sbjct: 252 EKDPNSPLMVPGADILNHLANHNANLEYSADYLRMVAT---QFIPKGHEIFNTYGQMANW 308
Query: 313 ELLLSYGFV 321
+L+ YGFV
Sbjct: 309 QLIHMYGFV 317
>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
gallopavo]
Length = 241
Score = 62.4 bits (150), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 102/212 (48%), Gaps = 19/212 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA 137
L+KWL D G + + RGL+ + ++ GE ++ +P ++T ++ SC
Sbjct: 35 LKKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVLNSC--L 92
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
GE + + P PL+A T+LI+E + S W Y+ LP+ YS ++ + L
Sbjct: 93 GEYIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPKT-YSCPVCLEQDVIQ-LF 150
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVR 252
+R++A E+ T V Y + FS LF E +FN +W++ + +R +
Sbjct: 151 PEPLRKQAQEQRTTVHELYMSSK-AFFSSLQSLFAENTATIFNHSALEWAWCTINTRTIY 209
Query: 253 LP-------SMDGRV-ALVPWADMLNHSCEVE 276
+ S++ V AL P+ D+LNHS V+
Sbjct: 210 MKHSQRECFSLEPDVYALAPYLDLLNHSPNVQ 241
>gi|428178458|gb|EKX47333.1| hypothetical protein GUITHDRAFT_152084 [Guillardia theta CCMP2712]
Length = 294
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/262 (23%), Positives = 107/262 (40%), Gaps = 40/262 (15%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL-KQCSVPDWPLLATYLISEAS 161
RG+ L+ ++ + ++ VP S ++ S PE ++ ++ S+ + L L+ EA+
Sbjct: 17 RGVAVLQEMKSDDVIVEVPASSFLSIWSVKDVPELSKIFGEEKSIDSFTGLMILLLHEAN 76
Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAEL-DRYLEASQIRERAIERITNVIGTYNDLRLR 220
E S+ W Y+ +LP W+ A++ ++ ++ E + +YN
Sbjct: 77 KETSA-WRKYLCSLPLYMPLPFMWSDADIPADFMRMPEVVEERKMLLEYTSLSYNSTIAP 135
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA------------------- 261
+ KYP +FPE+ F + W+ I+ SR + + G +
Sbjct: 136 LILKYPQVFPEDRFTKSKWAWALSIVVSRSIAMKRTGGVLGYSWSLADPEVLDVANVLEA 195
Query: 262 -------------LVPWADMLNHSCEVETFLDYDKSSQGVVFTT--DRQYQPGEQVFISY 306
LVP DM+NH + + G + T D Q G +V I+Y
Sbjct: 196 LKSGKSDAHVAPVLVPVVDMMNHDSNSSLACKMKQKTDGTIIVTAADEGLQRGYEVAINY 255
Query: 307 GKKSNGELLLS-YGFV--PREG 325
K G L+ +GFV P EG
Sbjct: 256 SPKLCGNKPLNRWGFVLPPCEG 277
>gi|431912319|gb|ELK14453.1| SET domain-containing protein 6, partial [Pteropus alecto]
Length = 847
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 138/337 (40%), Gaps = 39/337 (11%)
Query: 44 CSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVG 101
C+ S +++ + P G L+ + W GL P+ ++ V
Sbjct: 382 CTHSESSEGLGQVNHTMEVAGPVGARDPDLDPVAGFLSWCRRVGLELSPKVAVSRQGTVA 441
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLI 157
G+VA ++++ GE L VP ++++ S+ +C +G + ++ S W + +
Sbjct: 442 GYGMVARESVQPGELLFAVPRAVLL---SQHTCSISGLLERERGALQSQSGW-VPLLLAL 497
Query: 158 SEASFEKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
+S W+ Y + P P ++W E R L+ + + E + + N+
Sbjct: 498 LHELQAPASPWTPYFALWPELGSLEHP---MFWPEEERRRLLQGTGVPEAVEKDLANIRS 554
Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVA--- 261
Y + L +PDLF V ++E + +S + S L D
Sbjct: 555 EYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSQALYGSFQEPLEEEDDEKEPNS 614
Query: 262 --LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLL 316
+VP AD+LNH L+Y + +V T QP G ++F +YG+ +N +L+
Sbjct: 615 PLMVPAADILNHLASHNANLEYSPNYLRMVAT-----QPIPKGHEIFNTYGQMANWQLIH 669
Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
YGFV N D+ ++ + + ++EA R
Sbjct: 670 MYGFVEPYPNNTDDTADIQMVTVREAALQGTEVEAER 706
>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
dendrobatidis JAM81]
Length = 419
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 76/335 (22%), Positives = 134/335 (40%), Gaps = 29/335 (8%)
Query: 67 GCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
G I+ E + QKWL + + + RGL+A + + G+ ++ +P L++
Sbjct: 9 GTIIEDNECWALFQKWLVLNNCSISSLVLAHFSDTGRGLMATSDFQIGDPVVRIPARLLL 68
Query: 127 T---ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
++ A LKQ P +A + I+ + WS YI LPR ++
Sbjct: 69 VPRRTHKLFNNHPAIVALKQH-----PSIALF-IAWQKIHPTPEWSPYIDILPRSFDTMP 122
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
+L L I+E A + + + Y + + ++ P+++F KW++
Sbjct: 123 LCIDLKLLAML-PYDIQEIAKNQQSKLDTDYAFVCTALAVSGYEMIPKDIF-----KWAW 176
Query: 244 GILFSRLVRL-------PSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
++ +R + + P + + L P+ D LNH+ YD + +
Sbjct: 177 IVVNTRCITMNTNAISKPQLSHIHQQPIITLAPFLDCLNHTSTARISAGYDTVEKAYIIR 236
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
T Y+ G QVFI+YG N LL YGF + NP + V L + + + +
Sbjct: 237 TLVPYKKGSQVFINYGPHDNNFLLAEYGFAILK--NPFNHVVLDREVDFMMQHFGTVSDL 294
Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
L+ GL G+ L Y+ VS S
Sbjct: 295 LKSEGLYGEFIIANDDLGYRLMNAMRLYVAVSQGS 329
>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
Length = 480
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 86/385 (22%), Positives = 152/385 (39%), Gaps = 83/385 (21%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RG+VA +I++GE L +P +V+ + + LK ++ W L +I E S
Sbjct: 48 RGVVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADELK--NLGPWLSLVVVMIYEYSL 105
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+ S W+ Y LP + +L++W+ EL SQ++ A + + IG D IF
Sbjct: 106 GERSNWNQYFQVLPTKFDTLMFWSGEEL------SQLQASA---VIHKIGK-KDAEEDIF 155
Query: 223 SK-------YPDLFP------------------EEVFNMETF--KWSFGI---------- 245
K +PDLFP E M + ++F I
Sbjct: 156 EKIIPLVRSHPDLFPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEG 215
Query: 246 ----LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
L +LP +VP AD+LN + + + +V + + G++
Sbjct: 216 EDGYLTDDEEQLPK-----GMVPLADLLNADADRNNARLFQEDG-ALVMRAIKPIKTGDE 269
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
+F YG+ +LL YG+V + D VELPL + C+ L+ + S+
Sbjct: 270 IFNDYGELPRSDLLRRYGYVT-DNYAQYDVVELPL----TGICHAAGLDNIE------SQ 318
Query: 362 CFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAAS-------------NKMTSKKDI 408
+P LE++ Y ++ P + + + ++ SK+
Sbjct: 319 EYPHLKLLHELEILEDGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKP 378
Query: 409 KCPEIDEQALQFILDSCESSISKYS 433
P + + + +LDS +S +S+Y
Sbjct: 379 PKPILAAREARILLDSVKSKLSQYG 403
>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
Length = 491
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 107/259 (41%), Gaps = 54/259 (20%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVI---TAD-------SKWSCPEAGEVLKQCSVPDWPLLA 153
G VA+K+I +G L V +L++ T+D S+W G W L
Sbjct: 43 GAVAVKDIEEGTPLFHVTDNLILSPYTSDLKDHLDASEWDQLNKG----------WAQLI 92
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
++ E SRW+ Y++ +P + ++WT + D+ L + I +R I R +
Sbjct: 93 LVMMWETIKGSKSRWAGYLTNMPVMFETPMFWTEQQRDQ-LSGTDIADR-IGR-EDAEAE 149
Query: 214 YNDLRLRIFSKYPDLFPEE-------VFNME---TFKWSFGILFSRLVRLPSM---DGRV 260
Y L +PDLFP + F+++ SF + R R S DG
Sbjct: 150 YTSLLAPFIKAHPDLFPVDSPHTTIDAFHIQGSRILSRSFTVPLHRFGRSQSQSQSDGNE 209
Query: 261 A------------LVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQPGEQV 302
++P+ADMLN + ++D D +GVV + R + EQ+
Sbjct: 210 TESDDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRLVKQSEQI 269
Query: 303 FISYGKKSNGELLLSYGFV 321
+ +Y N ELL YG V
Sbjct: 270 YNTYDSPPNSELLRKYGHV 288
>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
Length = 348
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 84/190 (44%), Gaps = 20/190 (10%)
Query: 83 LSDSGLPPQKMAIQKVDVGERGL------VALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ +GLPP K+ +++ + L A ++++ G+ VP SLV+T +
Sbjct: 1 MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 189
E+L + + LA YL+ E K S W YI L RQ S L W+ AE
Sbjct: 61 VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSF 243
L YL S + +ER + Y++L +F +YP P E F E FK +F
Sbjct: 121 L-AYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAF 179
Query: 244 GILFSRLVRL 253
+ S +V L
Sbjct: 180 VAIQSCVVHL 189
>gi|46136815|ref|XP_390099.1| hypothetical protein FG09923.1 [Gibberella zeae PH-1]
Length = 484
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 11/190 (5%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDL--RLRI 221
S+ W+ Y+ LPR W+ E++R L E A+E + ++ + DL + +
Sbjct: 132 STPWTEYLKFLPRDVPVPTMWS--EVERALLQGTSLEAALEAKFASLSKEFEDLTDKSSV 189
Query: 222 FSKYPDLFPEE-VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ LF E+ ++ + SR + LP G A+VP DM NHS + D
Sbjct: 190 LPFWNSLFWEKGTVTIQDWILVDAWYRSRCLELPR--GGDAMVPGLDMANHSHHPTAYYD 247
Query: 281 YDKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
D V+ + GE+V ISYG K+ E+L SYGF+ E T + + LP+ +
Sbjct: 248 EDDKDDVVLLVRPETTVSAGEEVNISYGDKNPAEMLFSYGFIDNEST--VEGLNLPVKVL 305
Query: 340 KSDKCYKEKL 349
D K KL
Sbjct: 306 PDDPLGKAKL 315
>gi|73950321|ref|XP_544379.2| PREDICTED: N-lysine methyltransferase SETD6 isoform 2 [Canis lupus
familiaris]
Length = 453
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 128/303 (42%), Gaps = 35/303 (11%)
Query: 73 LENASTLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
L+ + W GL P+ ++ V G+VA ++++ GE L VP + ++ S
Sbjct: 23 LDPVAGFLSWCPQVGLELIPKVTVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---S 79
Query: 131 KWSCP-------EAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPR--QPY 180
+ +C E G + Q VP L L + AS WS Y + P +
Sbjct: 80 QHTCSIGGLLERERGALQSQSGWVPLLLALLHELQTPASL-----WSPYFALWPELGRLE 134
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
++W E + L+ + + E + + N+ Y + L +PDLF V +++ ++
Sbjct: 135 HPMFWPEEERRQLLQGTGVPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLDLYR 194
Query: 241 WSFGILFSRLVRLPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
++ + + P + +VP AD+LNH L+Y + +V T
Sbjct: 195 QLVALVMAYSFQEPLEEEDDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-- 252
Query: 294 RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
QP G ++F +YG+ +N +L+ YGF N D+ ++ + + K+E
Sbjct: 253 ---QPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVREAALQGTKVE 309
Query: 351 ALR 353
A R
Sbjct: 310 AER 312
>gi|68467835|ref|XP_722076.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|68468152|ref|XP_721915.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46443858|gb|EAL03137.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
gi|46444024|gb|EAL03302.1| potential protein lysine methyltransferase [Candida albicans
SC5314]
Length = 433
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/276 (23%), Positives = 111/276 (40%), Gaps = 36/276 (13%)
Query: 80 QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------ 127
+K +S+ K+ ++ V RG+ A++ ++KGE +L +P S ++
Sbjct: 22 EKKISNHTYISPKIDVKDVRSSGRGIYAVEPLKKGELILNIPHSFLLNFTTVMAHIAKYN 81
Query: 128 ---ADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYI 172
DS P E E+ + + + + LL+ YL E S W ++
Sbjct: 82 GMAIDSHIHVPFDKSEDEYTEIYRTLTKEEILELSSFQLLSLYLTFERKRSHKSFWKPFL 141
Query: 173 SALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPD- 227
LP + L+ W + ++++R + + R N +L K D
Sbjct: 142 DMLPSMDDFELMPIDWPQEVCTLLPSSTEVRNKKVRSRFDNDYQVICELIKTKIDKDGDV 201
Query: 228 --LFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
L P + + + L+ L + + P+ D +NHSC+ L D
Sbjct: 202 TTLLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDDHCTLKID--G 259
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+G T QY G+QV++SYG SN LL YGFV
Sbjct: 260 KGFQVRTTSQYNTGDQVYLSYGPHSNDFLLCEYGFV 295
>gi|345794208|ref|XP_003433871.1| PREDICTED: N-lysine methyltransferase SETD6 isoform 1 [Canis lupus
familiaris]
Length = 476
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 122/285 (42%), Gaps = 33/285 (11%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP-------EAGEVL 141
P Q ++ V G+VA ++++ GE L VP + ++ S+ +C E G +
Sbjct: 64 PVQVTVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SQHTCSIGGLLERERGALQ 120
Query: 142 KQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
Q VP L L + AS WS Y + P + ++W E + L+ +
Sbjct: 121 SQSGWVPLLLALLHELQTPASL-----WSPYFALWPELGRLEHPMFWPEEERRQLLQGTG 175
Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD- 257
+ E + + N+ Y + L +PDLF V +++ ++ ++ + + P +
Sbjct: 176 VPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLDLYRQLVALVMAYSFQEPLEEE 235
Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGK 308
+VP AD+LNH L+Y + +V T QP G ++F +YG+
Sbjct: 236 DDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPKGHEIFNTYGQ 290
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+N +L+ YGF N D+ ++ + + K+EA R
Sbjct: 291 MANWQLIHMYGFAEPYPDNTDDTADIQMVTVREAALQGTKVEAER 335
>gi|393245275|gb|EJD52786.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
Length = 519
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 80/160 (50%), Gaps = 20/160 (12%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY----- 155
G RGLVA+K I+ GE L VP +L+++ P ++ + DW L +
Sbjct: 29 GGRGLVAVKEIQVGETLFAVPRTLLLS-------PRTCQLPQLIGAQDWKRLNLHKGWSG 81
Query: 156 ----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
++ E + +S+W+ Y +A+P + +L++WT EL+ L+ S I E+ + +V
Sbjct: 82 LILCMLWEEAQGPASQWAGYFAAMPTEFSTLMFWTPEELED-LKGSSITEKIGKE--DVE 138
Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
Y+D L PDLFP E + T + F I SR++
Sbjct: 139 SEYHDRVLPAVKARPDLFPPEQADRYTLE-RFHIAGSRIL 177
>gi|302900929|ref|XP_003048357.1| hypothetical protein NECHADRAFT_106330 [Nectria haematococca mpVI
77-13-4]
gi|256729290|gb|EEU42644.1| hypothetical protein NECHADRAFT_106330 [Nectria haematococca mpVI
77-13-4]
Length = 460
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 86/196 (43%), Gaps = 36/196 (18%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRER 202
L+ YL+ + SF WS YI ALP QP W+ AEL E + I E
Sbjct: 91 LVKHYLLGDESF-----WSPYIRALP-QPEDQDSWSLPPFWPDDDAEL---FEGTNI-EV 140
Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSR-----LV---- 251
+ RI + L + D PE + F + ++W++ I SR LV
Sbjct: 141 GVGRIKADVKRDFKAALDALTA-EDWEPELRKGFTLGLYQWAYSIFSSRSFRPSLVLGPE 199
Query: 252 ---RLPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
RLP +D L+P D+ NH E + D+ + + Y+ GEQVF +
Sbjct: 200 DQKRLPEGVKIDDFSVLMPLFDVGNHDMRTEVRWELDEEKKHCSLKVSKAYEAGEQVFNN 259
Query: 306 YGKKSNGELLLSYGFV 321
Y K+N ELLL YGF+
Sbjct: 260 YSMKTNAELLLGYGFM 275
>gi|417410782|gb|JAA51857.1| Hypothetical protein, partial [Desmodus rotundus]
Length = 447
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 125/283 (44%), Gaps = 27/283 (9%)
Query: 88 LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
L P+ ++ V G+VA + ++ G+ L VP + ++ S+++C +G + ++
Sbjct: 34 LSPKVAVSRQGTVAGYGMVAREYVQPGDLLFAVPRAALL---SQYTCSISGLLERERGAL 90
Query: 144 CSVPDW-PLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
S W PLL L + +S WS Y + P + ++W E R L+ + +
Sbjct: 91 QSQSGWVPLLLALLHELQA--PASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVP 148
Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
E + + N+ Y + L +PDLF V ++E + ++ + + P +
Sbjct: 149 EAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEED 208
Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
+VP AD+LNH L+Y + +V T QP G ++F +YGK +
Sbjct: 209 EKEPNPPLMVPAADILNHLANHNANLEYSSNCLRMVAT-----QPIPKGHEIFNTYGKMA 263
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
N +L+ YGFV N D+ ++ + + K +A R
Sbjct: 264 NWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKGDAER 306
>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
DL-1]
Length = 499
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 119/268 (44%), Gaps = 39/268 (14%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
RGL A +I+K L + ++ TA P EVL+ ++ W L L E
Sbjct: 38 RGLRARNDIQKDTVLFRLARDHILNIRTAALGKLKPGNQEVLE--TLNQWEALILCLAYE 95
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTY 214
+ SRWS+Y++ LP + SL++W+ EL++ L+ S + RE+A + + ++ Y
Sbjct: 96 MMLGEESRWSSYLAVLPEKFNSLMFWSSEELEK-LKPSNVLQRIGREQAEQMYSKLVPEY 154
Query: 215 NDLRLRIFSK----YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-------------- 256
LR+ SK Y + V +SF +
Sbjct: 155 C---LRLGSKKLVEYLTIDRFHVVASIIMSYSFDVDDPEDDPEDDEDEEEDFDEIEQECI 211
Query: 257 --DGRV-ALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
DG + ++VP AD LN + V L Y+ + +V T + + GEQ++ YG+ N
Sbjct: 212 KYDGYLKSMVPLADTLNSNTNLVNANLSYE--NDALVMTATKDIKKGEQIYNIYGELPNS 269
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKK 340
E+L YG+V + + ELPL++ K
Sbjct: 270 EILRKYGYVELPAS-KYEFAELPLTVIK 296
>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 468
Score = 61.6 bits (148), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 102/249 (40%), Gaps = 44/249 (17%)
Query: 130 SKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSL-LYWTR 187
S +C A E L+ + L ++ E + SRW +Y + LP R +L ++WT
Sbjct: 80 SARTCSVAKE-LRDARLGGGLALNVAVMVERALGSESRWRDYFAVLPSRGERTLPMFWTE 138
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNMETFKWSFGIL 246
A L+ L+ + + E N+ Y++ + + +P+ F E E + + +
Sbjct: 139 ARLE-ALKGTDLATHVREDAENLRADYDEEVVNGLCVAHPEKFRREELTFERYLEAASLS 197
Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK----------------------- 283
SR + G ALVPWADM NH + ET
Sbjct: 198 ASRAFYIGEECGE-ALVPWADMFNHRTDDETVRVLGADEEEEEDEEEEEEEEEDEEEDDE 256
Query: 284 --------------SSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
+ QG +V T + G+++F ++G+++N LL YGF R +
Sbjct: 257 EEDDDAAAPPPPLTTPQGALVIHTHKAVSKGDELFNTFGQQNNASLLHKYGFCERGNAHA 316
Query: 329 SDSVELPLS 337
+ +V+L L+
Sbjct: 317 TIAVDLALA 325
>gi|302307608|ref|NP_984333.2| ADR237Cp [Ashbya gossypii ATCC 10895]
gi|299789080|gb|AAS52157.2| ADR237Cp [Ashbya gossypii ATCC 10895]
gi|374107548|gb|AEY96456.1| FADR237Cp [Ashbya gossypii FDAG1]
Length = 574
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 86/182 (47%), Gaps = 32/182 (17%)
Query: 171 YISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL----RLRIFSK 224
Y+S LP ++ ++ +WT +EL L I +A + + ++ +++L LR +K
Sbjct: 108 YLSVLPTHKEMHTPYFWTNSEL-LLLRGMDIYLKAKKNLRQLVNEWHELVTAGELRNDTK 166
Query: 225 YPDLF-PEEVFN-------------------METFKWSFGILFSR----LVRLPSMDGRV 260
+ DLF E F+ + W+ I SR L+ + D
Sbjct: 167 FYDLFNSSENFDAGEYISNQLADPTTTDWTDFPAYLWASSIFSSRAFPTLILGTTTDLNE 226
Query: 261 ALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
A + P D+LNHS Y++ V F+T + + G++++ +YG KSN ELLL+YG
Sbjct: 227 AFLNPIIDLLNHSAGTNVTWSYNEQVAAVTFSTAQTLETGDELYNNYGDKSNDELLLNYG 286
Query: 320 FV 321
FV
Sbjct: 287 FV 288
>gi|395508683|ref|XP_003758639.1| PREDICTED: N-lysine methyltransferase SETD6 [Sarcophilus harrisii]
Length = 396
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 85/189 (44%), Gaps = 26/189 (13%)
Query: 166 SRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAIER-ITNVIGTYNDLRL 219
S W Y S P R P ++W+ E + L+ + + E A+ER + ++ Y + L
Sbjct: 59 SPWKGYFSLWPELGSLRHP---MFWSEEERKQLLQGTGVPE-AVERDLASISYEYGTIVL 114
Query: 220 RIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
+PD+FP + ++E ++ +SF +VP AD+LNH
Sbjct: 115 PFLEAHPDVFPLQAQSLELYRQLVAMVMAYSFQEPLEEEEEEEEEPNPPMMVPAADILNH 174
Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNP 328
L+Y +V T QP G+++F +YG+ +N +L+ YGF N
Sbjct: 175 VANHNANLEYSPECLKMVAT-----QPIPKGQEIFNTYGQMANWQLIHMYGFAEPYPGNT 229
Query: 329 SDSVELPLS 337
+DS ++ ++
Sbjct: 230 NDSADIQMA 238
>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
Length = 1000
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 114/267 (42%), Gaps = 34/267 (12%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKW---SCPEAGEVLK--QCSVP---DWPLLAT 154
RG++ALK+I L +P +I ++ P+ ++ K + VP W L
Sbjct: 576 RGIIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLIL 635
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
+I E SS+W +Y LP + ++W+ ELD+ L+AS +R + + + +
Sbjct: 636 IMIYEYLQGDSSQWKSYFDVLPSSFDTPMFWSENELDQ-LQASHMRHKIGK--ADAEDMF 692
Query: 215 NDLRLRIFSKYPDLFPEE------------VFNMETFKWSFGILFSRLVRLP------SM 256
+ I P +F E ++F +
Sbjct: 693 KKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDR 752
Query: 257 DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
+G+ + +VP AD+LN E +++++ S + T+ R + GE++ YG N EL
Sbjct: 753 EGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGEEILNYYGPHPNSEL 810
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
L YG+V E + D VE+P + +S
Sbjct: 811 LRRYGYV-TEKHSRYDVVEIPWDIVES 836
>gi|358332734|dbj|GAA51355.1| SET domain-containing protein 4 [Clonorchis sinensis]
Length = 493
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/84 (33%), Positives = 45/84 (53%)
Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
R+ +P AL+P+ D LNH V++ L+ D++ + + + + PGEQV I+YG
Sbjct: 234 RIKLIPDRYSDTALIPFFDFLNHCPLVDSRLEVDRTGKAIQLFVQQSFGPGEQVLINYGP 293
Query: 309 KSNGELLLSYGFVPREGTNPSDSV 332
N L + YGF NP ++V
Sbjct: 294 HDNLTLFIEYGFSLLPSENPHNAV 317
>gi|428176276|gb|EKX45161.1| hypothetical protein GUITHDRAFT_139093 [Guillardia theta CCMP2712]
Length = 281
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 69/255 (27%), Positives = 109/255 (42%), Gaps = 21/255 (8%)
Query: 78 TLQKWLSDSG--LPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADSKWS 133
LQ W+S +G + P A Q D+ GL K++R+GE ++ +PP L ++ +
Sbjct: 27 ALQTWISSNGGSIHPSVCAKQAGDMQGVGLFVKEGKSVRRGEVMVSIPPKLHLSYEKVVG 86
Query: 134 CPEAGEVLKQCSVPDWPL----LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRA 188
+ K+ W + + S AS + +W Y+ +LP+ +L +++ A
Sbjct: 87 KDLNTLIDKEVPAEKWDVKLALALLSVASSASAAEGQQWGPYLESLPQTLNNLPIFYKGA 146
Query: 189 ELDRYLEASQIRERAIERITN-VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
L +E I++ V+G L+ S E ++ W++GI
Sbjct: 147 ALKE-------KEETYPGISSEVVGRAALLKTVSSSLANAHACLEGLSVRRLAWAYGIAT 199
Query: 248 SRLVRL-PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ-GVVFTTDRQYQPGEQVFIS 305
SR VRL DG L+P+ D NH E + SS R EQ+ I
Sbjct: 200 SRSVRLDKKRDG--LLLPFVDFANHDFEPNAQIRRSGSSSPSAELVAQRDLSASEQITIC 257
Query: 306 YGKKSNGELLLSYGF 320
YG N ELLL+YGF
Sbjct: 258 YGNLGNQELLLNYGF 272
>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
Y34]
Length = 373
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 128/303 (42%), Gaps = 45/303 (14%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV--ITADSKWSC 134
S L W S GL + ++ +G+VA + ++ E +L P + IT S+
Sbjct: 5 SELLGWASAEGLILNGIQPAWINGCGKGIVACRELKAEEAILIAPIQAIRSITTVSR--- 61
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSR---WSNYISALPRQPYSLLYWTRAELD 191
+++K+ P PL L +E + +S W + A+ +L + EL
Sbjct: 62 ----DLIKRLP-PSLPLHGI-LAAELALTDTSTPSPWQKSLPAMADITATLPFMWPKELQ 115
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL- 250
+ L S A + N YN + P + + E F++ + I+ +R
Sbjct: 116 KLLPTS-----ARVFLENQQTKYNHEWNTVSQAMPSI------SEERFQYYWHIVNTRTF 164
Query: 251 ------VRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
S + R+ALVP AD+ NH+ C V ++ V TTDR Y+ GE+
Sbjct: 165 LYEVSETECYSWEDRLALVPLADIFNHADEGCRVSYMPEH------YVITTDRAYEAGEE 218
Query: 302 VFISYGKKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
+FISYG SN LL YGF+ R D V LP L +S K + + L Y L
Sbjct: 219 LFISYGDHSNDCLLTEYGFLLPKNRWDIICIDEVVLP-RLDESAKELLRQRDLLGDYTLH 277
Query: 359 ASE 361
A +
Sbjct: 278 AEK 280
>gi|255719552|ref|XP_002556056.1| KLTH0H04004p [Lachancea thermotolerans]
gi|238942022|emb|CAR30194.1| KLTH0H04004p [Lachancea thermotolerans CBS 6340]
Length = 585
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 127/315 (40%), Gaps = 69/315 (21%)
Query: 74 ENASTLQKWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
E L W D+G+ P ++ V+VG +G + E +P SL+I +
Sbjct: 4 EKLKVLLDWGLDNGVKCPDD--VEFVNVGGKGFACIAKSDITEAEFIIPESLIIKSSLAV 61
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSS----------RWSNYISALPRQPYSL 182
S + Q S W L LI++ F+KSS +++ YI ALP + S
Sbjct: 62 SFFKVNS--NQTS---WLKL---LIAKLKFDKSSTTVDDENLKAKFAPYIDALPDEIDSP 113
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME----- 237
L W +ELD L + +R ++ ++ + L + K+ + E+ N+E
Sbjct: 114 LVWNPSELD-LLGNTNLRSSLRIKLYSIFNEWK-LIMETLKKHRNEVQAEILNIEETLGQ 171
Query: 238 --------------------------TFKWSFGILFSR----LVRLPSMD-GRVALVPWA 266
F WS + SR V PS D V L+P
Sbjct: 172 SEDHVYRNITSKVFQHSSETDWWSFPAFLWSHMMFLSRAFPEYVINPSTDPSNVVLLPII 231
Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPR 323
D+LNH + + + GV R+ + GE++F +YG K N ELL YGFV
Sbjct: 232 DLLNHDYRSKVEWNQRDGAFGV-----RKLETVLRGEEIFNNYGGKGNEELLSGYGFVLE 286
Query: 324 EGTNPSDSVELPLSL 338
E N D+V L + L
Sbjct: 287 E--NIFDTVALKIQL 299
>gi|146419922|ref|XP_001485920.1| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
6260]
Length = 592
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 129/311 (41%), Gaps = 57/311 (18%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEK-LLFVPPSLVITADSKW-S 133
+ L +W GL + I+ +GE A GEK + +P L IT DS S
Sbjct: 2 VAELVQWAKTQGLELNE-GIEFRGIGENNTGAFYTTNNGEKPYIRLPVELAITVDSALRS 60
Query: 134 CPEAGEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELD 191
+ E L+ QC + +L L E S K+S Y+ LP Q + Y AE
Sbjct: 61 FGQDLEALRDQCDSSN-TVLKLCLARERSRLKNSTIKKYLECLPTLQQMNTPYCWDAETK 119
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF--PEEVF-NME----------- 237
RYL+ + + E I ++ + +I + PDL PE+ F NM+
Sbjct: 120 RYLQGTNLGSSLKENIGVLVEEW----WKIINLLPDLVQKPEQHFVNMKYYYESKFYTDD 175
Query: 238 -------------------TFKWSFGILFSR----LVRLPSMDGRV-----ALVPWADML 269
F W+ IL SR + ++D V L+P D+L
Sbjct: 176 DAYAYFVTNEDPANWTSFPNFLWASIILKSRSFPAYLIADAVDWDVKRHDTMLLPVIDLL 235
Query: 270 NH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
NH S VE L+ +S VF +D + G Q+F +YG K N ELLL+YGF + N
Sbjct: 236 NHLPSAHVEWGLERKESKSYFVFKSD-DVKSGSQLFNNYGMKGNEELLLAYGFCLED--N 292
Query: 328 PSDSVELPLSL 338
SD L + +
Sbjct: 293 SSDVSALKIKV 303
>gi|449456212|ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
Length = 483
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 122/279 (43%), Gaps = 53/279 (18%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV-LKQC-SVPDWPLLATYL 156
D G RGL A++ ++KGE +L P S+++T S E ++ LK+ S+ L L
Sbjct: 43 DTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCL 102
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
+ E S SS W Y+ LP Q Y +L T E ++ +A Q+ + + D
Sbjct: 103 LYEISKGPSSWWFPYLKHLP-QSYDILA-TFGEFEK--QALQVDYAIWATEKAALKSRTD 158
Query: 217 LRLRIFSKYPDLFPEEVF--NMETFK---WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
R L E ++TFK W+ + SR + +P D L P D+ N+
Sbjct: 159 WR-----GVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVP-WDEAGCLCPVGDLFNY 212
Query: 272 SC-EVETF--LD-------------------------------YDKSSQGVVFTTDRQYQ 297
+ E E+F +D +++++ F Y+
Sbjct: 213 AAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARESYR 272
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
GEQV +SYG +N ELL YGF+ +E NP+D V +P+
Sbjct: 273 KGEQVLLSYGTYTNLELLEYYGFLLQE--NPNDKVFIPI 309
>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
206040]
Length = 378
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 60/115 (52%), Gaps = 16/115 (13%)
Query: 257 DGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
+ R+AL+P AD+ NH+ C V S +G DR Y+ GE+++ISY SN
Sbjct: 177 EDRLALIPVADLFNHADAGCRVYY------SPEGYHIVADRDYKRGEELYISYSSHSNDY 230
Query: 314 LLLSYGFVPREGTNPSDSVELP----LSLKKSDKCYKEKLEALRKYGLS-ASECF 363
L+ YGFVP E NPSD V + L +S K EK + L Y L A+E F
Sbjct: 231 NLVEYGFVPDE--NPSDDVYIDDVIFPKLSESQKADLEKRDLLGVYPLGEATEEF 283
>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
CIRAD86]
Length = 391
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 71/292 (24%), Positives = 122/292 (41%), Gaps = 46/292 (15%)
Query: 55 TKTTVTQNMIPWGCEI-DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNI 111
TK T + + EI S + +W D G+ Q +++ + RG LV NI
Sbjct: 8 TKRRRTSSGVAVDAEIRHSQDEHEHFTQWAKDRGV--QIGSVKPAHIAGRGVGLVTTANI 65
Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSR-- 167
++ EKL+F P + +T ++ +C P+ P + + FE S
Sbjct: 66 KQDEKLIFASPHVQLTL----------SIMAECEEPESPYNVWRSTWPGPQDFESSMPLF 115
Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
WS+ + L P SL +LD + + ++ R + +LR K D
Sbjct: 116 WSHKLRDL--LPPSLQQPLDRQLDDWRKDAEFRRTIVA----------NLRDNSARKEQD 163
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLP---SMDGRVALVPWADMLNHSCEVETFLDYDKS 284
+ FK+ + I+ SR + G + L P+ D +NH T ++ ++
Sbjct: 164 ---------DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPS-GTGVNVRQT 213
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDSVEL 334
++G T +R Y GE+V +YG N +LL+ YGF+ + G D + L
Sbjct: 214 AKGYEVTANRDYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRL 265
>gi|223994225|ref|XP_002286796.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220978111|gb|EED96437.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 346
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 102/246 (41%), Gaps = 58/246 (23%)
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
G R + +++ G+ LL +P S ITAD+ S LA L E
Sbjct: 30 AGGRYVTCRQDVTAGDDLLQIPLSSCITADNLES------------------LAERLAYE 71
Query: 160 ASFEKSSRWSNYISALPR-QPYSLL----YWTRAELDRYLEASQIRERAIERITNVIGTY 214
S+++ YI+ LP + SLL +W + LD + Q+ R
Sbjct: 72 RELGSKSKFTAYINVLPTLESKSLLELPRFWKGSRLDLVTDGGQLEAR------------ 119
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
+ +E +++ +W+ + SR L D ++ P DM+NH
Sbjct: 120 -------------MSKDERKDLD--QWALACVDSRANFLG--DEGYSMTPMLDMINHDAS 162
Query: 275 VETF--LDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
V+T ++ DK G + T+ + Y GE+ FISYG +N + L YGFV +
Sbjct: 163 VQTRARIEEDKGFAGDGDVLHLTSGKSYSKGEEAFISYGNLANLDTLADYGFVTEKNPCN 222
Query: 329 SDSVEL 334
+S+E+
Sbjct: 223 VESIEV 228
>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
Length = 491
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 87/318 (27%), Positives = 129/318 (40%), Gaps = 63/318 (19%)
Query: 61 QNMIPWGCEI---DSLENA----STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
+ + W EI DS++++ S L LS S P D G RGL A + ++K
Sbjct: 9 ETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFP---------DAGGRGLGAARELKK 59
Query: 114 GEKLLFVPPSLVITADSKWS--CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
GE +L VP ++T +S + + V S+ +L+ L+ E S EK S W Y
Sbjct: 60 GELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPY 119
Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
+ +PR Y LL A + + + E A+ S +L +
Sbjct: 120 LFHIPRD-YDLL----ATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELK 174
Query: 232 EVF-NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG--- 287
F + + + W+ + SR + +P D L P D+ N+ DY + QG
Sbjct: 175 PKFRSFQAWLWASATISSRTLHVP-WDSAGCLCPVGDLFNYDAPG----DYSNTPQGPES 229
Query: 288 ---------VVFT-----TD---------------RQYQPGEQVFISYGKKSNGELLLSY 318
VV T TD R YQ GEQV + YG +N ELL Y
Sbjct: 230 ANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHY 289
Query: 319 GFVPREGTNPSDSVELPL 336
GF+ E +N D V +PL
Sbjct: 290 GFMLEENSN--DKVFIPL 305
>gi|241956097|ref|XP_002420769.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
gi|223644111|emb|CAX41854.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
Length = 435
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/290 (23%), Positives = 117/290 (40%), Gaps = 37/290 (12%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI--- 126
+D ++N +K + + + PQ + ++ V RG+ A++ ++K E +L +P S ++
Sbjct: 13 LDWVKNTDDEKKISNHTYISPQ-IDVKDVRSSGRGIYAVRPLKKAELILNIPHSFLLNFT 71
Query: 127 ------------TADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASF 162
T DS P E E+ + + + + LL+ YL E
Sbjct: 72 TVMAHIAKYNGMTIDSHIHVPFDKHKDEYTEIYRMLTKEEILDLSSFQLLSLYLTFERRR 131
Query: 163 EKSSRWSNYISALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLR 218
S W ++ LP + + L+ W ++ +R R + R N +L
Sbjct: 132 SSKSFWKPFLDMLPSMEDFELMPIDWPHEIYTLLPSSTGVRNRKVRSRFENDYRVICELI 191
Query: 219 LRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
K D L P + + + L+ L + + P+ D +NHSC+
Sbjct: 192 KTKIDKAGDVTTLLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDD 251
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
L D +G T QY G+QV++SYG SN LL YGFV E
Sbjct: 252 HCTLKID--GKGFQVRTTSQYNIGDQVYLSYGPHSNEFLLCEYGFVIPEN 299
>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
Length = 497
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/284 (22%), Positives = 119/284 (41%), Gaps = 59/284 (20%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
G RGL A++++R+GE +L VP S ++T ++ + + V + S+ +L L+ E
Sbjct: 51 GGRGLGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYE 110
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL----EASQIRERAIERITNVIGTYN 215
K+SRW Y+ LP Y +L E +++ EA + E+A+ + + +
Sbjct: 111 MGKGKTSRWHPYLMHLP-HTYDVLA-MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAH 168
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-- 273
L + +F + F + + + + SR + +P D L P D+ N+
Sbjct: 169 SLMQDL------MFKPQFFTFKAWVRAAATISSRTLHIP-WDEAGCLCPVGDLFNYDAPG 221
Query: 274 -------EVETFLD----------------------------------YDKSSQGVVFTT 292
+++ L +++ + F
Sbjct: 222 IEPSGIEDLDRLLSNTSIPDTIVLNGDKNIVVDAEQLDSHSWRLTDGGFEEDANAYCFYA 281
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
Y+ G+QV + YG +N ELL YGF+ +E NP+D V +PL
Sbjct: 282 REHYKKGDQVLLCYGTYTNLELLEHYGFLLQE--NPNDKVFIPL 323
>gi|255720552|ref|XP_002556556.1| KLTH0H16126p [Lachancea thermotolerans]
gi|238942522|emb|CAR30694.1| KLTH0H16126p [Lachancea thermotolerans CBS 6340]
Length = 571
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 8/97 (8%)
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L P D LNH + +K GV F++ Q + G+++F +YG KSN ELLL+YGF
Sbjct: 224 LYPIVDFLNHHSGQKVQWQLNKDRNGVSFSSGNQIEKGQEIFNNYGDKSNEELLLNYGFA 283
Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
+ N S ++ L L +LE+L+ Y L+
Sbjct: 284 IQNNMNDSSTLTLRLP--------PGQLESLKSYDLT 312
>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
Length = 372
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/77 (42%), Positives = 45/77 (58%), Gaps = 10/77 (12%)
Query: 252 RLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
RLP D R+A++P AD+ NH+ CE + +S+ F DR Y+ GE+++ISYG
Sbjct: 173 RLPH-DDRLAILPVADLFNHADVGCEAQF------ASENYSFIADRTYRAGEELYISYGT 225
Query: 309 KSNGELLLSYGFVPREG 325
S LL YGFVP E
Sbjct: 226 HSTDFLLAEYGFVPAEN 242
>gi|301122457|ref|XP_002908955.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099717|gb|EEY57769.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 423
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 79/166 (47%), Gaps = 4/166 (2%)
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L++E + + +S + YI LP L W + + L+ + +++ V+ Y
Sbjct: 94 LLAELARKDTSDFHGYIQQLPTAISLPLSWDENQ-RKMLKDTTAFPILDDKL--VLKLYE 150
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
D + +++P ++P EV ++ F+W++ I+ SR ++ + L+P DM NHS E
Sbjct: 151 DYAVPFANEFPVIWPTEVSTLKKFQWAYSIVSSRAFKVAN-GLEPTLLPVIDMANHSAEN 209
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
S R+ + E V ISYG SN +LL YGFV
Sbjct: 210 PAAHIVKTESGSFQLVALREVEKKEPVTISYGDLSNAQLLCRYGFV 255
>gi|195018080|ref|XP_001984717.1| GH16622 [Drosophila grimshawi]
gi|193898199|gb|EDV97065.1| GH16622 [Drosophila grimshawi]
Length = 455
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 119/275 (43%), Gaps = 49/275 (17%)
Query: 103 RGLVA-LKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW----------PL 151
RGL + + R G++L+ +P +I+ + E+ E K PD L
Sbjct: 51 RGLCSKTQCFRAGDELIRLPAGCLISIAT----LESDEEFKALFDPDLFDKDSRISFQAL 106
Query: 152 LATYLISEASFEKS---SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
+A YL+ ++ S ++ Y+ +LPR + + +EL EA I ER + +
Sbjct: 107 IACYLLHLQHLHEARQESPFAAYLDSLPRSYTTPYFCAVSELQCLPEA--ILERTVSQNR 164
Query: 209 NVIGTYNDLRLRIFSKYPDL----FPEEVFNMETFKWSFGILFSRLVRLPSMD---GR-- 259
+ Y L+ + +++ + EE++ + ++ ++ + SR V L S GR
Sbjct: 165 QIRDCYQVLKSLVGAQHCQCCGQRYCEEIWTLAEYRRAYFAVNSRSVYLSSRQLYTGRSH 224
Query: 260 ----------VALVPWADMLNHSCEVETFLDYD----KSSQGVVFTTDR----QYQPGEQ 301
+AL P+ D+ NHS V+T + Q V T D Q +P EQ
Sbjct: 225 FQELLSGTNNLALAPFLDLFNHSDTVQTTAELQLLASSKCQDYVLTLDSLAAAQLKPYEQ 284
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+FISYG N +LL YGF + N D E L
Sbjct: 285 LFISYGALPNLKLLTEYGFYLKR--NAHDYFEFSL 317
>gi|78097104|ref|NP_001030295.1| N-lysine methyltransferase SETD6 [Mus musculus]
gi|81904260|sp|Q9CWY3.1|SETD6_MOUSE RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
gi|12845648|dbj|BAB26837.1| unnamed protein product [Mus musculus]
gi|74198625|dbj|BAE39788.1| unnamed protein product [Mus musculus]
gi|148679234|gb|EDL11181.1| RIKEN cDNA 0610039J04 [Mus musculus]
gi|187951385|gb|AAI39199.1| SET domain containing 6 [Mus musculus]
gi|187952351|gb|AAI39200.1| SET domain containing 6 [Mus musculus]
Length = 473
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 118/274 (43%), Gaps = 27/274 (9%)
Query: 81 KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+W GL P+ ++ V G+VA +++R GE L VP S ++ S +C +G
Sbjct: 51 RWCRRVGLELSPKVTVSRQGTVAGYGMVARESVRAGELLFAVPRSALL---SPHTCSISG 107
Query: 139 EVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
+ ++ S+ W + + +S WS Y + P + ++W E R
Sbjct: 108 LLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERLR 166
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
L+ + + E + + N+ Y + L + DLF V ++E ++ ++ + +
Sbjct: 167 LLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPSVRSLELYQQLVALVMAYSFQ 226
Query: 253 LPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQV 302
P + +VP AD+LNH L+Y +V T QP G ++
Sbjct: 227 EPLEEDDDEKEPNSPLMVPAADILNHIANHNANLEYSADYLRMVAT-----QPILEGHEI 281
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
F +YG+ +N +L+ YGF N D+ ++ +
Sbjct: 282 FNTYGQMANWQLIHMYGFAEPYPNNTDDTADIQM 315
>gi|448092000|ref|XP_004197467.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|448096594|ref|XP_004198498.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|359378889|emb|CCE85148.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
gi|359379920|emb|CCE84117.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
Length = 595
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 94/230 (40%), Gaps = 45/230 (19%)
Query: 152 LATYLISEASFEKSSRWSN--YISALPRQPYSLLYWTRAE------------LDRYL--- 194
L T LIS+ F+KS S YI LP + YW E LDR
Sbjct: 80 LTTLLISKLKFDKSCEHSFGPYIDILPDKLSLPFYWNHQERSLVEDTDLKVILDRNFQKL 139
Query: 195 --EASQIRERAIERITNV---IGTYNDLRLR---IFSKYPDLFPEEVFN--------MET 238
E + E I++ ++ G DL I KY + E N
Sbjct: 140 VEEWHSLVESLIDKEKHLSFEAGLKADLNFYEEYITGKYDEYRLYEYLNKKIQSWTSFSA 199
Query: 239 FKWSFGILFSRLV--RLPSMDG------RVALVPWADMLNHSCE--VETFLDYDKSSQGV 288
+ WS IL SR L + D + L+P D+LNH + + + V
Sbjct: 200 YVWSRSILMSRGFPYLLVAEDNSKPNLTKACLIPLFDILNHKSNSPIRWTPVMESGTGNV 259
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+F +R + GEQ+F +YG KSN ELLLSYGF E NP DS + L +
Sbjct: 260 IFQLERGVKKGEQLFNNYGNKSNCELLLSYGFA--EEKNPHDSASITLKI 307
>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
Length = 353
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 103/230 (44%), Gaps = 32/230 (13%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
G G A +I +GE+LLFVP S +T P + L + V +LA L+
Sbjct: 41 GMIGCTATADICQGERLLFVPHSACVT-------PSGVQGLYEPQV----MLAASLVKHR 89
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ + +S + +Y+ +LP + L W+ EL L+ + + E + L L
Sbjct: 90 T-DPNSPFHDYLQSLPSEFEHPLEWSADEL-VCLKGTTVWE------------MHQLSLE 135
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ +L P M +W+ ++ SR S + ++P AD NHS +
Sbjct: 136 VVDSVAELCPNSPRAM--IRWAVEVMMSRA--FESEVCGLCVIPLADQFNHS-STKWHTR 190
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
+ +G ++ + GE++F +YG +N LLL++GF+ E NP D
Sbjct: 191 VREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGFI--EFDNPHD 238
>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
Length = 256
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 107/243 (44%), Gaps = 27/243 (11%)
Query: 147 PDWPLLA-----TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
P PLLA T+L+SE S W Y+ LP+ + ++ + E ++
Sbjct: 3 PPSPLLALCTLCTFLVSEKHAGDQSLWKPYLDILPKAYTCPVCLEPKVVNLFPEP--LKA 60
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK---WSFGILFSRLVRLPSMDG 258
+A E+ V G ++ R S P LF E V N+ ++ W++ + +R V +
Sbjct: 61 KAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVENIFSYSALLWAWCTVNTRAVYMKHGQR 119
Query: 259 RV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ AL P+ D+LNHS +V+ +++ ++ T + EQVFI YG
Sbjct: 120 KCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGPHD 179
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQI 367
N LLL YGFV + + V L +K +DK +K+ L+ + + F
Sbjct: 180 NQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQMNKKISILKDHDFIENLTF---- 235
Query: 368 TGW 370
GW
Sbjct: 236 -GW 237
>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
Length = 523
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/354 (22%), Positives = 127/354 (35%), Gaps = 68/354 (19%)
Query: 74 ENASTLQKWLSDSGLPPQKMAIQK-VDVGERGLVALKNIRKGEKLLFVPP---------- 122
E +W + G+ + AI++ D GL + +G+ L F P
Sbjct: 3 ERLERFSRWSQEHGIQFRGCAIKRGSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLV 62
Query: 123 ----SLVITADSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISAL 175
L +T + P G V ++ + D L+ +LI E + ++S W+ Y+ L
Sbjct: 63 VTPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEML 122
Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
P + L++ EL L+ + + E A + T+ + ++ PD +
Sbjct: 123 PSGFGTPLWFEDEEL-MELDGTTLFE-ATKAQVFFPSTFVSTCMSLYLFRPD---DRELE 177
Query: 236 METFKWSFGILFSRLVRLP--------------SMDGRV--------------------- 260
+ F W+ I ++R + +P DG
Sbjct: 178 FQEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISCSAKDVSTI 237
Query: 261 ---ALVPWADMLNHSCEVETFLDYDKSS-------QGVVFTTDRQYQPGEQVFISYGKKS 310
LVP D NH+ + D S + D + PG +V I+YG K
Sbjct: 238 WIEGLVPGIDFCNHTRRASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYGDKG 297
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
N ELL YGFV + +N V P D KL+ LR+ LS P
Sbjct: 298 NEELLFLYGFVEEDNSNDYVMVHFPKMFLDEDNTMDFKLQLLRELDLSLQWLLP 351
>gi|303272869|ref|XP_003055796.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463770|gb|EEH61048.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 677
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 62/135 (45%), Gaps = 22/135 (16%)
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP-----SMDGRV-ALVPWADML 269
DLR R +K P P M+ F W++ +SR + LP G V A+VP D
Sbjct: 327 DLRRRRATK-PSAKP---ITMDEFLWAYATFWSRALALPIGPDPEASGAVEAIVPGIDFA 382
Query: 270 NHSC-------EVETFLDYDKSSQG---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
NHSC V + ++ G V PGE+V ISYG K N ELL +G
Sbjct: 383 NHSCARPNARWAVANASGREGATAGEPTVTLECLSVPGPGEEVLISYGDKPNEELLFVHG 442
Query: 320 FVPREGTNPSDSVEL 334
F RE NP D++ L
Sbjct: 443 FAERE--NPHDALVL 455
>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
Length = 492
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 112/268 (41%), Gaps = 42/268 (15%)
Query: 81 KWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
KWL+ G+ ++ + D R LVA + + E + VP + ++ K + PE
Sbjct: 14 KWLNHVGVRISAKAELTCLRADGRGRALVAKGDFAEDELIFSVPRTSTLSV--KAALPEM 71
Query: 138 GEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+ S +P W L +ISE S+W+ Y + LP + SL++W+ EL
Sbjct: 72 LSGRQDISPEDIESMPGWAALTAVIISEG-LRPESKWAPYFNVLPTKLDSLVFWSPEELA 130
Query: 192 RYLEASQI-----RERAIERITNVI------GTYNDLRLRIFS-------KYPDLFPEEV 233
L+AS + +++A E I GT D+ R+ S PD+ E+
Sbjct: 131 E-LQASAVLKKVGKDKAEEIFHQSISKVTPEGTDVDIFHRVASTIMAYAFDIPDIEQED- 188
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
G LV +A++P ADMLN + L YD + + T
Sbjct: 189 --------EEGANEDDLVDDDEQKTSLAMIPLADMLNADADNNARLHYD--GEELEMRTI 238
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ GE++ YG+ +LL YG+V
Sbjct: 239 NPIKTGEEILNDYGQLPRSDLLRRYGYV 266
>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
CIRAD86]
Length = 449
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/284 (23%), Positives = 125/284 (44%), Gaps = 37/284 (13%)
Query: 69 EIDSLENAST-LQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
+ID ++ S WL ++G P ++A + RG+VA ++ E++ +P +
Sbjct: 2 DIDDFQSMSDKFLTWLKNTGATISPKIQLADLRDRAAGRGVVATSDLTSDEEIFRIPRTS 61
Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
++T ++ P+ E+L+Q + P W L +I E +SR+ Y+ LP +L++
Sbjct: 62 ILTTETT-DLPQ--EILQQLTDP-WLSLILAMIFEYLLGTNSRFKPYLDILPESFNTLMF 117
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF----- 239
WT EL +YL+ S I + + + T+++ L I +K P++F N +
Sbjct: 118 WTDNEL-QYLQGSAILSKIGKEEAD--NTFSEQLLPIITKNPEIFKIGTCNNQDLLALCH 174
Query: 240 -------KWSFGILFSRLVRLPSMDGRV-------------ALVPWADMLNHSCEVETFL 279
++F + S + AL+P ADMLN + ++ T
Sbjct: 175 RMGSIIMSYAFDLDPPPTTTTSSSEEWESDSDSENEKISPKALIPLADMLNANGDL-TNS 233
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
SS + T + GE++ +G +LL YGFV +
Sbjct: 234 KLFFSSDSFIMKTLQPVAAGEELLNDFGPLPPADLLRRYGFVTK 277
>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
Length = 465
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/267 (24%), Positives = 115/267 (43%), Gaps = 34/267 (12%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLK--QCSVP---DWPLLAT 154
RG++AL++I L +P I T++ P+ ++ K + VP W L
Sbjct: 41 RGIIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLIL 100
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
+I E SS+W +Y LP + ++W+ ELD+ L+AS +R + + + +
Sbjct: 101 IMIYEYLRGDSSQWKSYFDVLPSSFDTPMFWSENELDQ-LQASHMRHKIGK--ADAENMF 157
Query: 215 NDLRLRIFSKYPDLFPEEV------------FNMETFKWSFGILFSRLVRLPSM------ 256
+ I P +F E ++F + +
Sbjct: 158 KKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDR 217
Query: 257 DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
DG+ + +VP AD+LN E +++++ S + T+ R + GE++ YG N EL
Sbjct: 218 DGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGEEILNYYGPHPNSEL 275
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
L YG+V E + D VE+P + +S
Sbjct: 276 LRRYGYVT-EKHSRYDVVEIPWDIVES 301
>gi|238882888|gb|EEQ46526.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 433
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 111/276 (40%), Gaps = 36/276 (13%)
Query: 80 QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------ 127
+K +S+ K+ ++ V RG+ A++ ++KGE +L +P S ++
Sbjct: 22 EKKISNHTYISPKIDVKDVRSSGRGIYAVEPLKKGELILNIPHSFLLNFTTVMAHIAKYN 81
Query: 128 ---ADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYI 172
DS P E E+ + + + + LL+ YL E S W ++
Sbjct: 82 GMAIDSHIHVPFDKSEDEYTEIYRTLTKEEILELSSFQLLSLYLTFERKRSHKSFWKPFL 141
Query: 173 SALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPDL 228
LP + L+ W + ++++R + + R N +L K D+
Sbjct: 142 DMLPSMDDFELMPIDWPQEVCTLLPSSTEVRNKKVRSRFDNDYQVICELIKTKIDKDGDV 201
Query: 229 ---FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
P + + + L+ L + + P+ D +NHSC+ L D
Sbjct: 202 TTFLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDDHCTLKID--G 259
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+G T QY G+QV++SYG SN LL YGFV
Sbjct: 260 KGFQVRTTSQYNTGDQVYLSYGPHSNDFLLCEYGFV 295
>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
Length = 468
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 94/218 (43%), Gaps = 34/218 (15%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL-----DRYLEASQIRERAIERITN 209
+L+ E EK S W YIS LP QP + W + LE + E N
Sbjct: 105 FLVKEYLKEKDSYWWPYISTLP-QPDRVDTWALPAVWPEDDIECLEETNAHVAVREIQAN 163
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-------DGRVAL 262
+ Y R + K D + + +KW+F I SR R PS+ D + L
Sbjct: 164 IKKEYKHARKLL--KEVDFPGWQEYTQLLYKWAFCIFTSRSFR-PSLILSQETQDHVLGL 220
Query: 263 VPWADMLNHSCEVETFLD---YDKSSQ---------GVVFTTDRQYQPGEQVFISYGKKS 310
P ++ ++ LD +D +SQ + YQPG+QVF +YG KS
Sbjct: 221 TPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLEVDGTCQLICNNAYQPGQQVFNNYGLKS 280
Query: 311 NGELLLSYGFV-PREGTNPSDSVEL-----PLSLKKSD 342
N ELLL YGF+ P T +D V + P +L+K++
Sbjct: 281 NSELLLGYGFILPVTDTLHNDYVHVKSRRPPSTLQKNE 318
>gi|412989087|emb|CCO15678.1| predicted protein [Bathycoccus prasinos]
Length = 640
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 4/89 (4%)
Query: 261 ALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
ALVP+ DMLNH+ L +D S+ + R + G+++F +YG+ S+GELL Y
Sbjct: 392 ALVPFWDMLNHAHPALASVKLSHDASTNRLNMIAVRDIRKGDEIFNTYGELSDGELLRRY 451
Query: 319 GFVPREGTNPSDSVELPLS--LKKSDKCY 345
GF+P NP +SV + DK Y
Sbjct: 452 GFLPTSSRNPHNSVTISFKELFAACDKVY 480
>gi|408397548|gb|EKJ76689.1| hypothetical protein FPSE_03100 [Fusarium pseudograminearum CS3096]
Length = 467
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 11/197 (5%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDL--RLRI 221
S+ W+ Y+ LPR W+ E++R L E A+E + ++ + DL + +
Sbjct: 132 STPWTEYLKFLPRDVPVPTMWS--EVERALLQGTSLEAALEAKFASLSKEFEDLTEKSSV 189
Query: 222 FSKYPDLFPEE-VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ LF E+ ++ + SR + LP G +VP DM NHS + D
Sbjct: 190 LPFWNSLFWEKGTVAIQDWILVDAWYRSRCLELPR--GGDVMVPGLDMANHSHHPTAYYD 247
Query: 281 YDKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
D V+ + GE+V ISYG K+ E+L SYGF+ E T + + LP+ +
Sbjct: 248 EDDKDDVVLLVRPGTKVSAGEEVNISYGDKNPAEMLFSYGFIDNEST--VEGLNLPVKVL 305
Query: 340 KSDKCYKEKLEALRKYG 356
D K KL G
Sbjct: 306 PDDPLGKAKLHIFGSSG 322
>gi|303272707|ref|XP_003055715.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463689|gb|EEH60967.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 647
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 110/284 (38%), Gaps = 57/284 (20%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL---------KQCSVP--DW 149
G RGLVA+ I G L VPPS +TA + E G+ L K+ +P D
Sbjct: 101 GGRGLVAVAPIPAGAILFRVPPSRTLTAAAALES-ETGKALLAAIDASPTKKTRLPVGDL 159
Query: 150 PLLATYLISEASF-------EKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIR 200
L +S +F + +S + Y + L +P S ++W E +R L S +
Sbjct: 160 ALAVRVALSTRAFGGKHVCVDDASPFRGYFALLADEPLDESPVWWDEDERERRLRGSMLL 219
Query: 201 ERAIERITNVIGTYNDLR---LRIFSKYPDLFPEEV-----------------FNMETFK 240
A+ +V Y + +R K P +V N E F+
Sbjct: 220 HDAVALAADVRADYEGVVEGVMRDAEKTPARRDVDVDAAGNRRAAERFTRTKTGNYERFR 279
Query: 241 WSFGILFSRLVRLPSMDGRV----------ALVPWADMLNHSC---EVETFLDYDKSSQG 287
+ ++SR + + G+ ALVP D NH E +D D
Sbjct: 280 RALAAIWSRSFNVGGVRGQTQTDEDESFAGALVPLLDCANHHRKPRECSWTIDEDGC--- 336
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
V+ R + G + I+YG + N +L+L YGF + T P S
Sbjct: 337 VLVVAIRAFDAGGAIRIAYGARGNHDLMLRYGFAVEDNTEPDGS 380
>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
Length = 537
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 147/344 (42%), Gaps = 49/344 (14%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADS---- 130
S LQ+ ++ + Q+ D E +A+ K+I + L+ P S +IT
Sbjct: 3 SFLQEAFNNGCYLHPGIQFQRSDNVEGTFIAIASKDIDGDQVLISCPESYIITLQKAKNE 62
Query: 131 --KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
+ S A E + ++ T+ E + S+W+ YI LP+ + LY+T
Sbjct: 63 LCRLSPKFADEKMHT-------IVCTFFALERLKGEKSQWAKYIEYLPKTFDTPLYFTDD 115
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF----G 244
EL +++E TN+ ND R RI+ + + + N + F W+
Sbjct: 116 EL-----------KSLEH-TNIFYGCND-RKRIWKEEHATAAKLLDNPDNFSWNMYLWAA 162
Query: 245 ILFSR-------LVRLPSMDGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQ 295
+FS L + D L+P D LNH C + + K S V + +
Sbjct: 163 TVFSSRCFSSALLGEEDTDDAAPILIPLVDSLNHKPRCPI-IWNKVTKESHAVQLVSVKP 221
Query: 296 YQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
G QV+ +YG K N ELL+ YGF +P N ++ L LSL K+ ++K L
Sbjct: 222 ISSGGQVYNNYGPKGNEELLMGYGFCLPN---NEFETFALRLSLDKAVYNSEKKRSILAS 278
Query: 355 YGLSASECF-PIQITGWPLELMAYAYLVV--SPPSMKGKFEEMA 395
+GLS + P Q+ L+ + A +V+ SP +K E +A
Sbjct: 279 HGLSKLNFWIPKQVDFSHLQNILDALVVITASPFELKTLEEHLA 322
>gi|313234617|emb|CBY10572.1| unnamed protein product [Oikopleura dioica]
Length = 395
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 126/293 (43%), Gaps = 40/293 (13%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---CSVPDWPLLATYLISE 159
RG+ + + E ++ P ++T K C E + +Q + + +L YL
Sbjct: 34 RGMKSDVDFEANEFIISFPNEALLTG--KRMCQELPHLEEQRVKHHLTNELVLVFYL--- 88
Query: 160 ASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRER-AIE--RITNVIGTYN 215
+F + Y ++LP++ P LYW++ + D E +R + +E R +V+G
Sbjct: 89 -AFYCEEHFPEYYASLPKEFPNYWLYWSKEDWDSLYEDRVLRRKIKVEKRRYHSVMG--- 144
Query: 216 DLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVRLPSMDGRV------------ 260
R+ + +K + + + ME FK ++ L +R V L V
Sbjct: 145 --RVEMSNKEAIVMDQNHHFLLFMEPFKLAWAQLNTRTVYLSDQWHDVGKNENNDDSTLN 202
Query: 261 -ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
AL P+ D NH + +T + D S+ ++++ GEQ+FISYG+ + LLL YG
Sbjct: 203 WALAPFLDQFNHHHDAKTVIHDD--SEKFAIKVEKKHGKGEQLFISYGEHPDVFLLLEYG 260
Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
F+ +NP + +EL + D K + +G +Q W L
Sbjct: 261 FIIG-NSNPHNLIELTNTTVLRDPVTK---KMAIDFGCGDGHAISLQGASWSL 309
>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
thermophila ATCC 42464]
Length = 357
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 78/202 (38%), Gaps = 29/202 (14%)
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL--DR---YLEASQI 199
SVP L +L+ E K S W YI+ LP P + W D YLE +
Sbjct: 108 SVPPHVLGRFFLVKEYLKGKDSFWWPYIATLP-PPEQVAVWALPPFWPDHDIAYLEGTNA 166
Query: 200 RERAIERITNVIGTYNDLR-LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR------ 252
E NV + R L +PDL + +KW+F I SR R
Sbjct: 167 HVAIQEIQENVKREFKQARKLLKEEDFPDL---PAYTQLLYKWAFCIFTSRSFRPSLVLS 223
Query: 253 ----------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
LP +D L P D+ NHS D YQPG
Sbjct: 224 DATKRRLSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPG 283
Query: 300 EQVFISYGKKSNGELLLSYGFV 321
QV+ +YG K+N ELLL+YGF+
Sbjct: 284 TQVYNNYGLKTNSELLLAYGFI 305
>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 110/252 (43%), Gaps = 26/252 (10%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPD--WP-LLATYLISE 159
L A +I GE+L+ +PP L++ DS + E LK VP+ W + L+ E
Sbjct: 84 LEADGDIADGERLVSLPPKLMLRCDSD----DVSEPLKNVVDRVPNEFWSSKVGLVLLRE 139
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY--NDL 217
S ++ YI+ LP + + R LE + I ++ I + +GT+ N L
Sbjct: 140 RVAGAHSAFAPYITLLPAVHEGSPTFFPPDAVRALEYAPIVQQ-INKRARFLGTFAGNAL 198
Query: 218 RLRIFSKYPD-LFP-----EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
+ Y D P E + W+ SR ++ + A++P D+ NH
Sbjct: 199 TVDDGESYVDEAHPGRQRVEMTIDANALGWATACASSRAFKV-GANSAPAMLPVIDICNH 257
Query: 272 S----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
S V + D + GV R GE + +SYG SN ELLL YGF+ ++ N
Sbjct: 258 SFNPSVSVRAIEEGDNAG-GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKD--N 314
Query: 328 PSDSVELPLSLK 339
P D V+L LK
Sbjct: 315 PFDCVKLRWDLK 326
>gi|308802351|ref|XP_003078489.1| unnamed protein product [Ostreococcus tauri]
gi|116056941|emb|CAL53230.1| unnamed protein product [Ostreococcus tauri]
Length = 433
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 63/116 (54%), Gaps = 7/116 (6%)
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA----LVPWADMLNHS-CEVETFLDYDKSSQGVVFT 291
+ ++W+ ++ SR R+ GR A L+ AD+LNHS E D+ + V T
Sbjct: 197 DEWRWALSMVHSRTFRIEDEYGRRATRRALIAAADLLNHSSVRGEVNCDWSANDDYFVVT 256
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
T R + GE++ ISYG++ + L YGF+P + NP + V+L + +++ Y+E
Sbjct: 257 TTRDVRAGEELCISYGEQCDRHFALFYGFLPSQ--NPFNRVKLFFNGREALDWYQE 310
>gi|366992371|ref|XP_003675951.1| hypothetical protein NCAS_0C05970 [Naumovozyma castellii CBS 4309]
gi|342301816|emb|CCC69587.1| hypothetical protein NCAS_0C05970 [Naumovozyma castellii CBS 4309]
Length = 580
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 50/101 (49%), Gaps = 1/101 (0%)
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L P D+LNH + +++K + F + + +++F +YG KS ELLL YGF+
Sbjct: 225 LYPVVDLLNHKNDTNVKWEFEKDEERADFIFNETLKANDELFNNYGDKSKEELLLGYGFI 284
Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
P E NP D+ L L L ++ L L L+A C
Sbjct: 285 P-EDINPYDTSSLTLRLDENHISQARLLAKLPDVNLAADNC 324
>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
Length = 489
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 114/282 (40%), Gaps = 59/282 (20%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP------LL 152
+ G RGL A +++R+GE +L P + ++T+D + C P +L
Sbjct: 51 NAGGRGLAAARDLRRGELVLRAPRAALLTSDR---VTADDPRIAACVSAHRPRLSSVQIL 107
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
L++E ++S W Y+S LP Y++L A D + +EA Q+ + I
Sbjct: 108 IVCLLAEVGKGRNSVWYPYLSQLPSY-YTIL----ATFDDFEVEALQV--------DDAI 154
Query: 212 GTYNDLRLRIFSKYPDL--------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
+ I S + D+ F ++ +++ W+F + SR + + + D L
Sbjct: 155 WVAQKAKSAIKSDWEDVTPLMKELEFKPKLLMFKSWLWAFATVSSRTLHI-AWDEAGCLC 213
Query: 264 PWADMLNHSC----------EVETFLDYDK-----------------SSQGVVFTTDRQY 296
P D+ N++ + +Y + S + Y
Sbjct: 214 PVGDLFNYAAPDDDTSLEAEDTAELTNYQQKNEMINSSERLTDGGYEDSNAYCLYARKNY 273
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ GEQV + YG +N ELL YGF+ E N +EL L +
Sbjct: 274 KQGEQVLLGYGTYTNLELLEHYGFLLGENPNEKTFIELDLDI 315
>gi|346978889|gb|EGY22341.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
Length = 457
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 87/209 (41%), Gaps = 35/209 (16%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
L+ YL+ SF W YI LP QP L W+ L + + + TN+
Sbjct: 94 LMKQYLLGRDSF-----WYPYICTLP-QPDQLSSWSLPPLWPSDDIELLED------TNI 141
Query: 211 IGTYNDLRLRIFSKYPDLFP-------EEVFNMETFKWSFGILFSRLVR----------- 252
+++ R+ ++Y P + + W++ I SR R
Sbjct: 142 HTAVAEIKARLKAEYKQATPLLAALPNANDYTRLLYNWAYSIFTSRSFRPSRVVPDHESL 201
Query: 253 -LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
LP ++D L+P D+ NHS D + V T Y+ G QVF +YG
Sbjct: 202 PLPEGCAIDDFHILMPLFDIGNHSHSAGISWDIAPGTSTTVLKTLDAYESGAQVFNNYGS 261
Query: 309 KSNGELLLSYGF-VPREGTNPSDSVELPL 336
K+N EL+L+YGF +P T +D V L L
Sbjct: 262 KTNAELMLAYGFLIPESPTLHNDFVHLQL 290
>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
Length = 478
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 120/292 (41%), Gaps = 60/292 (20%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI-TADSKWSCPEAGEVLKQC--S 145
P ++A + RG+VA +I +GE+L +P V+ T +SK ++L Q
Sbjct: 34 PKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQNSKLK-----DLLSQDVEE 88
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----R 200
+ W L +I E S W++Y LPR+ +L++W+ +EL + L+ S I +
Sbjct: 89 LGPWLSLMLVMIYEYLLGDQSAWASYFKILPRKFDTLMFWSPSEL-QELQGSAIVDRIGK 147
Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
E A E I +I I P LFP V + ++ G L+ L + G +
Sbjct: 148 EGAEESILEMIAP-------IVRANPSLFP-PVDGLASYDGDAGT--QALLNLAHVMGSL 197
Query: 261 ----------------------------------ALVPWADMLNHSCEVETFLDYDKSSQ 286
+VP AD+LN + + + +
Sbjct: 198 IMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSKGMVPLADLLNADADQNNARLFQEET- 256
Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
G+V + G ++F YG+ +LL YG+V + +P D VEL L L
Sbjct: 257 GLVMKAIKPISAGAEIFNDYGEIPRADLLRRYGYV-TDNYSPYDVVELSLEL 307
>gi|145501218|ref|XP_001436591.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403732|emb|CAK69194.1| unnamed protein product [Paramecium tetraurelia]
Length = 716
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 85/193 (44%), Gaps = 15/193 (7%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK---------QCSVPDWPLLAT 154
GLVA + I E L+ VP L++T + P + Q S D L+A
Sbjct: 64 GLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFIDHPQYFSNQFQSSWEDHQLMA- 122
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
+++ E S W IS LPR L++W+ E L+ ++ + A ++ + + Y
Sbjct: 123 FILYEYQRGPESEWHLLISNLPRDIDYLVFWSHEE-QELLDDEKLIKLARKQYSEFLLEY 181
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
L+ I KYP F E +E +W + L +R V +VP+ ++ NH C
Sbjct: 182 ETLKC-ITDKYPQHFKPETVTLENARWVYTHLVTRC--FGKYLAYVTMVPFCELFNHEC- 237
Query: 275 VETFLDYDKSSQG 287
+ F D++ ++
Sbjct: 238 TDVFYDFEYNADN 250
Score = 41.6 bits (96), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 67/151 (44%), Gaps = 21/151 (13%)
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
Q+ G QV+ YG+ SN +L+ YG N D V L + K + E + + K
Sbjct: 501 QFDKGAQVYFCYGRLSNRMMLMRYGMSLE--YNKYDHVHLRIEYLKYLQS-NEAIWLVHK 557
Query: 355 YGLSASECFPIQITGWPLELMAYA---YLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP 411
Y +S + F ++ T +PL+ + + Y + S+ F+ I+
Sbjct: 558 YQISKFKKFKLKHTTFPLDFIVFCKSIYWTFNVHSLDSFFK---------------IQDL 602
Query: 412 EIDEQALQFILDSCESSISKYSRFLQVKELL 442
+++ +ALQ L+ ISK+S L+ E L
Sbjct: 603 KLERKALQLALEILVEEISKFSDKLEDNEKL 633
>gi|242813336|ref|XP_002486146.1| hypothetical protein TSTA_101480 [Talaromyces stipitatus ATCC
10500]
gi|218714485|gb|EED13908.1| hypothetical protein TSTA_101480 [Talaromyces stipitatus ATCC
10500]
Length = 426
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 83/343 (24%), Positives = 141/343 (41%), Gaps = 64/343 (18%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
+ +W D G+ + ++ G++A ++I++ E L+ VP S +++ DS S +
Sbjct: 12 TAFMQWAIDEGVKVNGVEPARITGRGLGMIATRDIQEHEMLIDVPLSAMLSVDSVPS--D 69
Query: 137 AGEVLKQCSVPDWPLLATYLI--SEASFEKSSRW-------SNYISALP----------- 176
+ S+ LLA YL +K W S++ +P
Sbjct: 70 FVNLFSGISIQG--LLAAYLTHGDPRCLKKYDLWKATWPTYSDFEEGMPILWPKELGGSG 127
Query: 177 -RQPYSLLYWTRAELDRYLEAS------QIRERAI----ERITNVIGTYNDLRLR----- 220
+ P S T D L S IR++A+ E I + RL+
Sbjct: 128 LKHPISPTATTHHPPDGKLPPSISGSWTTIRKKALVEEYETKHQNILFQQEKRLQDAWRD 187
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVR--LPSM------DGRVALVPWADMLNHS 272
+ + +PD + ETF + + +L +R +P + +A+VP+AD NH+
Sbjct: 188 VLAVFPDT------DWETFSYHWLVLNTRCFYYVMPGTEPPEDTNDAIAMVPFADYFNHT 241
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
E E + +D + F R Y+ GE++++SYG N L + YGF N SDS+
Sbjct: 242 DETECDVKFD--GKNYTFRAMRAYKKGEEIYMSYGPHPNDFLFVEYGFYLDH--NKSDSL 297
Query: 333 EL-PLSLKKSDKCYKEKLEALRKYG-----LSASECFPIQITG 369
L + K KE+L R YG L + CF ++
Sbjct: 298 FLDDIIFKDFTVAEKEELIHHRYYGNYQITLESGPCFRTEVAA 340
>gi|384246167|gb|EIE19658.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 523
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 109/270 (40%), Gaps = 18/270 (6%)
Query: 72 SLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD 129
S +A L +W+S SG + P V GL A K R GE L+ +P S ++ D
Sbjct: 37 SASSADRLVQWVSSSGGTVSPTVHVSPPDSVMGAGLRASKACRSGELLVSLPRSCQLSYD 96
Query: 130 SKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
+ P +++ + W LA ++ E S + +YI LP + +
Sbjct: 97 GS-TEPNLLQLISKVPEELWGAKLALRVLKERIMGPDSPFHSYIDNLPMGVPGIPMFFSP 155
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP-DLFPEEVFNMETFKWSFGILF 247
+ R LE +++ + +++ L P D F + W+ +
Sbjct: 156 DAIRALEQYPPLSEQVKKRCRWLLSFSSEHLSALPGSPADPFLGTPVDANILGWALAMTT 215
Query: 248 SRLVRLPSMDGRVALVPWADMLNHS----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
SR R+ AL+P DM NHS CEV+ V R + E +
Sbjct: 216 SRAFRVQGPQHPAALLPLIDMSNHSFAPNCEVKP-----GPGGSVEMVASRDIRAEEDLL 270
Query: 304 ISYGKKSNGELLLSYGF-VPREGTNPSDSV 332
+SYGK N LLL YGF VP NP D+V
Sbjct: 271 LSYGKLDNTFLLLDYGFMVP---GNPHDTV 297
>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
JN3]
Length = 415
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 17/132 (12%)
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRV------ 260
N+ ++D++ I S DLF V N TF W + L + RLP ++
Sbjct: 138 NLEQDWSDVKADIPSIDKDLFTYVWLVVNTRTFYWDYPDLSNAHPRLPKRRAKLTSADCY 197
Query: 261 ALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
A+ P+ D NHS CE + ++ G DR Y+ GE+V++SYG +N LL+
Sbjct: 198 AMCPFMDYFNHSDSGCEPQ------HNAHGYSVLADRAYRAGEEVYVSYGPHTNDFLLVE 251
Query: 318 YGFVPREGTNPS 329
YGF+ +N S
Sbjct: 252 YGFLLDANSNDS 263
>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
Length = 1037
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 119/281 (42%), Gaps = 37/281 (13%)
Query: 94 AIQKVDVGER----GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS---- 145
AI+ VD+ +R G++AL++I L +P +I ++ + +V
Sbjct: 595 AIKIVDLRDRNAGRGIIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDD 654
Query: 146 ------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
+ W L L+ E + S+W Y LP + ++W+ +ELD+ L+AS +
Sbjct: 655 DDEAPRLDSWSSLILVLMYEYLQGEKSQWKPYFDVLPSSFDTPMFWSESELDQ-LQASHM 713
Query: 200 RERA----------------IERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMETFK 240
R + I + ++V G N D + I + F++E +
Sbjct: 714 RHKIGKADAESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDE 773
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
V + +VP AD+LN E +++++ S + T+ R + GE
Sbjct: 774 DEEEEETDGWVEDREGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGE 831
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
++F YG N ELL YG+V E + D VE+P + +S
Sbjct: 832 EIFNYYGPHPNSELLRRYGYVT-ERHSRYDVVEIPWDVVES 871
>gi|313228180|emb|CBY23330.1| unnamed protein product [Oikopleura dioica]
Length = 421
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/256 (21%), Positives = 109/256 (42%), Gaps = 29/256 (11%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT----ADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
RG +A K+I +G+ +L +P + VIT ++ SC VL + + L+ +L+
Sbjct: 115 RGFIAKKSINRGQMVLEIPKAAVITPNWIYNNAISCVSNIVVLNEDFKIDSTDLMIIWLV 174
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
E S +YI+++P L + L L Q+R++ + ++ + L
Sbjct: 175 KEKQKGMQSPVRDYITSMPSLCTPLFNYRPRHLK--LFPKQLRQKVENQKKELLARFEYL 232
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM----------------DGRVA 261
+ F ++ + F+W+ ++ +R ++ + +
Sbjct: 233 D-KCFRRHG-----RGISFHEFQWAASMVLTRSIQAKGLTLCTELFEAPWFNNDSSHEIG 286
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L P+ D+LNHS E D++ + V + +Q++I Y + + +L++YGF
Sbjct: 287 LCPFFDLLNHSSENNCDWDFNPVTGSVWVEAVGDIKNSDQLYIDYDQGCDDYMLMNYGFC 346
Query: 322 PREGTNPSDSVELPLS 337
NP ++EL S
Sbjct: 347 METAANPKTALELSWS 362
>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
Length = 377
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 74/301 (24%), Positives = 128/301 (42%), Gaps = 44/301 (14%)
Query: 77 STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
L W G+ + K+ G++A + I+ E++L VPP ++ C E
Sbjct: 5 QNLMTWAKAQGVAINGIQPSKIPGRGTGILATRKIKAQEEILRVPPRVL-------RCLE 57
Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR----QPYSLLYWTRAELDR 192
+ + + +P + L ++ ++S+ + + LP+ + + W R EL +
Sbjct: 58 SVPLRVREKLPADSTIQALLAADLVLDRSANSKPWKAVLPKMADFEAGMPMLWPR-ELKQ 116
Query: 193 YL--EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW------SFG 244
L E+ ER R ++D + + FP+ + T+ W +F
Sbjct: 117 LLPLESQVTLER---REKEFQDNWDDFK--------EAFPDVPRDDYTYAWLVVNTRTFY 165
Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
++ P D R+AL+P AD+ NH+ C V Y DR Y+ GE+
Sbjct: 166 HETPETLKYPWED-RLALIPVADLFNHAAGGCRV-----YYSPEGCYHVVADRAYKKGEE 219
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPS---DSVELPLSLKKSDKCYKEKLEALRKYGLS 358
+FISY SN LL YGF+P E + D V +P L +S K ++ + L +Y L
Sbjct: 220 LFISYSSHSNDYNLLEYGFIPDENSLDDVYIDDVVMP-KLSESHKAELQRRDLLGEYPLG 278
Query: 359 A 359
+
Sbjct: 279 S 279
>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
nagariensis]
Length = 517
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 91/197 (46%), Gaps = 14/197 (7%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI--- 207
L ++ E S + SRW+ Y++ +P + LYW E + L + ++ + ++
Sbjct: 187 LIIAVMYEKSRGRQSRWAPYLNLIPDDMTHMPLYWKHREF-KELRGTAAYDKMMGKVQCP 245
Query: 208 ----TNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
T V ++++ ++P+L PE + ++W+ + S L D A+
Sbjct: 246 ADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASYSFILGD-DKYQAM 304
Query: 263 VPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
VP D+LNH + V L + + R GE++ +YG+ SN ELL YGFV
Sbjct: 305 VPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNYGELSNAELLRGYGFV 364
Query: 322 PREGTNPSDSVELPLSL 338
E N ++ V++PL
Sbjct: 365 --EARNRNNHVQVPLGF 379
>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
Length = 371
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 116/266 (43%), Gaps = 39/266 (14%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-AGEVLKQCSVPDWPLLATYLISEASF 162
G+VA ++I+ E +L VP + T D+ P+ E L+ SV L +E +
Sbjct: 32 GIVATRDIKPNETILSVPMKALRTIDT---VPKNITEALQGVSV------HGILAAEIAL 82
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDR---YLEASQIRERAIERITNVIGTYNDLRL 219
+KS +S + + LP TR +L+ + S+++ +R +++ N
Sbjct: 83 DKSDDFSVWKTVLP---------TREDLEAGVPMMWPSELQALLPKRAKDILDNQNTTFR 133
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-----RVALVPWADMLNHS-- 272
R FP+ + + W + +P M R+ +P AD+ NH+
Sbjct: 134 RECEIVLKAFPKLTRDEYLYSWVLINTRTFYNSMPKMKSYAHVDRLVCMPTADLFNHADQ 193
Query: 273 -CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
C++ S+ G DR Y GE+V++SYG SN LL YGF+ TN D
Sbjct: 194 GCKLAY------SALGYSVQADRVYHQGEEVYVSYGPHSNDFLLSEYGFIL--DTNRWDE 245
Query: 332 VELP-LSLKKSDKCYKEKLEALRKYG 356
V L + L K +K + LE++ G
Sbjct: 246 VYLDEVILPKLNKTQRADLESINFLG 271
>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
Length = 227
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 96/218 (44%), Gaps = 34/218 (15%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
KW D+G + +Q + E G A ++I+ G L VP +++T + G +
Sbjct: 10 KWCLDNGATINGITLQALPDDEYGFAAEQDIQVGPVFLGVPLGMMMTTIGARK-SKLGAL 68
Query: 141 LKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
LK D P+ L+ +LI E +S W YIS LPR ++LY++ EL +
Sbjct: 69 LK-----DDPIMKSMENVALSMFLILELCAGSASFWHPYISILPRSFNTVLYFSVDEL-Q 122
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSR 249
L S + + A++ ++ Y +IF +P L ++ F + ++W+ + +R
Sbjct: 123 LLTGSSVLDEALKLHRSIARQYAYFH-KIFRTHPLAKSLPYKDCFTYDLYRWAVSAVMTR 181
Query: 250 LVRLP---------------SMDGRVALVPWADMLNHS 272
+P S G ALVP D+ NHS
Sbjct: 182 QNAVPRAVVCGGADDACARGSGSGVAALVPLFDLCNHS 219
>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
Length = 292
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
R ++P+ DG +AL+P DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280
>gi|342879010|gb|EGU80287.1| hypothetical protein FOXB_09214 [Fusarium oxysporum Fo5176]
Length = 530
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 82/191 (42%), Gaps = 11/191 (5%)
Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP- 226
W+ Y+ LPR W+ EL+R L E A++ + + D + S P
Sbjct: 158 WTEYLKFLPRDIPVPTMWS--ELERALLQGTSLEVALDAKLSALNKEFDELIERSSALPF 215
Query: 227 --DLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK 283
F E E ++ + SR + LP A+VP DM NHS + D D
Sbjct: 216 WNSFFWEREAVTIDDWVLVDAWYRSRCLELPRSGH--AMVPVLDMANHSHSQTAYYDEDD 273
Query: 284 SSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
V+ + G++V ISYG+KS E++ SYGF+ RE T + + LPL D
Sbjct: 274 EDNVVLLPRPGMEISIGDEVTISYGEKSPAEMIFSYGFIDREST--VEGLTLPLESLADD 331
Query: 343 KCYKEKLEALR 353
K KL R
Sbjct: 332 PLGKAKLHIFR 342
>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
Length = 376
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 103/236 (43%), Gaps = 37/236 (15%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLIS------ 158
++A ++I GE ++ VP + +IT +S L T+ +S
Sbjct: 1 MMATEDIEAGEVIVSVPRNFLITNESLTK-----------------LYGTHSLSPHQLLA 43
Query: 159 ----EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
+ +K S W Y LP ++ +EL +L S +++ +++ N+ Y
Sbjct: 44 LHLVLLTRDKQSWWKPYTDLLPMHFNTMPVNYPSELLSHLPNS-LKQETMQQKDNIHTDY 102
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADML 269
+ F K L P+++ E FKW++ + +R + + D +AL P D L
Sbjct: 103 --VTCLKFCKSKQL-PQDI-TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLDFL 158
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
NH+ E + ++ +Q T Y+ GEQV+I+YG N +L YGFV E
Sbjct: 159 NHTTEAKIESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVLNEN 214
>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 490
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 6/98 (6%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG---VVF 290
F + +K+ + SR+V LP A+VP DM NH+CE YD+ G +
Sbjct: 198 FTFDDWKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDSVKAKYDEEGAGNAVLQL 255
Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
T ++ + GE+V ISYG +K E++ SYGFV E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293
>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
Length = 284
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
R ++P+ DG +AL+P DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280
>gi|451852073|gb|EMD65368.1| hypothetical protein COCSADRAFT_159025 [Cochliobolus sativus
ND90Pr]
Length = 408
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 113/284 (39%), Gaps = 40/284 (14%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP-PSLVITA 128
+D N + W +G+ +A + G+VA ++I+KG+KL+ V SLV A
Sbjct: 5 LDPGTNHTDFVAWAKSNGVEINGIAPARFVGRGMGIVAAQDIKKGDKLVHVSNKSLVHVA 64
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR-------WSN-------YISA 174
P ++ +PD + L + + R W N + S
Sbjct: 65 -----LPS----IRSLKLPDTITVHGKLALSLALWYTGRKDHDYTLWQNVWPTSSDFKST 115
Query: 175 LPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF 234
+P L R L Q++ +ER I +N + Y L +
Sbjct: 116 MPLYYPPSLQPLLPPAARTLLTKQLQN--LERDWTSIAPHNPGITKETYTYTWL----II 169
Query: 235 NMETFKWSFGILFSRLVRLP------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
N TF WS+ L + LP + D + P+ D NHS ++ D S G
Sbjct: 170 NTRTFYWSYPDLPNASALLPKRRAKLTADDCYCMCPFTDYFNHS---DSGCDPQMSPSGY 226
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
T DR Y GE+VF++YG +N LL YGF+ +E N D V
Sbjct: 227 TVTADRAYVAGEEVFVTYGPHTNDFLLTEYGFILQE-KNRHDGV 269
>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 499
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 80/200 (40%), Gaps = 45/200 (22%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE--RITNVIG 212
YLI + KSS W+ YIS L ++LD++ E IE R TN
Sbjct: 108 YLIQQYLKGKSSLWAPYISTLT---------DPSQLDKWALPPFWTEHDIELLRGTNAYV 158
Query: 213 TYNDLRLRIFSKY------------PDLFPEEVFNMETFKWSFGILFSRLVR-------- 252
+++ + S+Y PD + + W++ + SR R
Sbjct: 159 AIQEIQDNVKSEYKQARKILKQEGSPDY---RAYTQVLYNWAYCMFTSRSFRPSLILSES 215
Query: 253 --------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
LP +D L P D+ NHS E E + Y+PG+Q
Sbjct: 216 AREYVERLLPEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQ 275
Query: 302 VFISYGKKSNGELLLSYGFV 321
VF +YGKK+N ELLL YGFV
Sbjct: 276 VFNNYGKKTNSELLLGYGFV 295
>gi|242804795|ref|XP_002484448.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218717793|gb|EED17214.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 409
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 74/303 (24%), Positives = 125/303 (41%), Gaps = 38/303 (12%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
+W G+ ++ + G++A + I GE ++ VP ++T DS P +
Sbjct: 12 RWCESQGIKIHGVSPAGIPGRRLGMIATRRISAGETIVTVPLVAMLTIDS---VPPSFVR 68
Query: 141 LKQCSVPDWPLLATYLI--SEASFEKSSRW-------SNYISALPRQPYSLLYWTRAELD 191
+ + P +LA + EK W ++ +LP +L + L
Sbjct: 69 MFSKATPLHAILAAFFTHGDPVLLEKWEYWRRVWPLRHDFEKSLPLFWSEMLPANESILP 128
Query: 192 RYLEAS-QIRERAIE------RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
+ S R + E R TN++ +S+ +FP +N ++ W
Sbjct: 129 PSVSGSWSFRNKKPEDIEYGSRYTNILSHQKKRLQDAWSEVLLVFPHTDWNFFSYNWLIL 188
Query: 245 ILFSRLVRLPSMD------GRVALVPWADMLNHS----CEVETFLDYDKSSQGVVFTTDR 294
S P D +ALVP+AD NH CEV +Y F R
Sbjct: 189 NTRSFFYVSPEKDEPEDWNDAIALVPFADYFNHDDKAPCEVNFNGEY------YTFKASR 242
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALR 353
+++ GE++FISYG SN LL+ YGF+ + N SD++ L + L + K++L + +
Sbjct: 243 RFEKGEELFISYGSHSNDFLLVEYGFLLDD--NKSDAIFLDDIVLPELATANKKELLSRQ 300
Query: 354 KYG 356
YG
Sbjct: 301 LYG 303
>gi|85090666|ref|XP_958526.1| hypothetical protein NCU09827 [Neurospora crassa OR74A]
gi|28919896|gb|EAA29290.1| predicted protein [Neurospora crassa OR74A]
Length = 532
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 93/233 (39%), Gaps = 39/233 (16%)
Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
PS AD + PE E S W LL L+ E + SS WS Y+S LP Q +
Sbjct: 99 PSHTADADDEPPSPENDEEDDSQSQDSWTLLILILMHE-YLQGSSNWSPYLSILPTQFDT 157
Query: 182 LLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
++WT AEL L+AS + +E A + I I +F YP P+
Sbjct: 158 PMFWTEAELSE-LQASALVAKVGKEEADKMIRTKIVKVVQEHEEVF--YPADTPKTQRLE 214
Query: 237 ETFKWSFGILFSRLV----------------------------RLPSMDGRVALVPWADM 268
E G + ++ M+ + +VP ADM
Sbjct: 215 EGELLKLGHRMGSAIMAYAFDLANDDEDEDEEEEEEEDGWVEDKIAGMNDSMGMVPMADM 274
Query: 269 LNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
LN +++ ++ + T+ R+ + GE++ YG S+ ELL YG+V
Sbjct: 275 LNADAVFNAHINHGEAC--LTATSLREIKEGEEILNYYGPLSSAELLRRYGYV 325
>gi|209489216|gb|ACI49001.1| hypothetical protein Cbre_JD01.008 [Caenorhabditis brenneri]
Length = 333
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 82/170 (48%), Gaps = 22/170 (12%)
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
FE+S+ WS Y+ LP+ + + + D IR+ I++ + LR R
Sbjct: 7 DFEQSA-WSPYLKVLPK-TFDTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISEKLR-R 63
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNH 271
+F P+L ++V W++ ++ +R + + + DG +A++P+ DMLNH
Sbjct: 64 LF---PELTHDKVL------WAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNH 114
Query: 272 SCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
E + ++K + V RQ Q GEQ+F+ YG N LL+ YGF
Sbjct: 115 DPEKYQGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 164
>gi|357445001|ref|XP_003592778.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
gi|355481826|gb|AES63029.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
N-methyltransferase [Medicago truncatula]
Length = 243
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 51/99 (51%), Gaps = 14/99 (14%)
Query: 268 MLNHSCEVETFLDYDKSSQGVVFT----------TDRQYQPGEQVFISYGKKSNGELLLS 317
M H + FL++D S+ +V + +DR Y PGEQV I YGK SN L+L
Sbjct: 1 MGKHKGFISDFLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLD 60
Query: 318 YGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
+GF +P N D V++ + K D KLE L++Y
Sbjct: 61 FGFTIP---YNIYDQVQIQYDIPKYDPLRHTKLELLQQY 96
>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 612
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 131/325 (40%), Gaps = 41/325 (12%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA-GEVLKQ-CSVPDWPLLATYLISEA 160
RG A ++ G L +P S ++T+ P A G+ + + + L+ +L+ E
Sbjct: 193 RGAAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEK 252
Query: 161 -SFEKSSRWSNYISALP----------RQPYSLLYWTRAE-----LDRYLEASQIRERAI 204
+ S W+ +++LP R L T A D L + + + A+
Sbjct: 253 YALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGLTPASWPAEVTDALLRGAPLLDDAV 312
Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-----W-SFGILF-SRLVRLPSMD 257
+ + L + +P++FP E++ + F+ W ++G+ + V S
Sbjct: 313 KARETTARQHAALFPALGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVGGASGG 372
Query: 258 GR-------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
G L P A + NH+ + R + GE++F+SYG KS
Sbjct: 373 GEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVSYGAKS 432
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD----KCYKEKLEALRKYGLSASECFPIQ 366
N ELLL YGF R+ NP D V L L L + + +E++ K LS ++
Sbjct: 433 NAELLLFYGFGVRD--NPYDDVPLSLELPQGEVRDVSALRERVLHRAKLSLSPHS---VR 487
Query: 367 ITGWPLELMAYAYLVVSPPSMKGKF 391
PL L+ ++ + S G +
Sbjct: 488 CGALPLPLVGTLRVLTADASTLGTY 512
>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
Length = 228
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 14 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 68
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 69 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 126
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 127 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 185
Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
R ++P+ DG +AL+P DM NH+
Sbjct: 186 RQNQIPTEDGSRVTLALIPLWDMCNHT 212
>gi|50557274|ref|XP_506045.1| YALI0F30327p [Yarrowia lipolytica]
gi|49651915|emb|CAG78858.1| YALI0F30327p [Yarrowia lipolytica CLIB122]
Length = 430
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 86/187 (45%), Gaps = 21/187 (11%)
Query: 163 EKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
+ SS S+Y+S+LP+ QP+S W E+ L+ + + + + +V Y
Sbjct: 93 QSSSGMSDYVSSLPKSTEMDQPWS---WPETEIFDSLKGTSLLMACVHKKMHVQAKY--- 146
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR----VALVPWADMLNHSC 273
L I K E++ + + F S + SR + +P G + +VP D +NHS
Sbjct: 147 -LAIVGKKGG---EKLVSEQQFYLSEQWVVSRSLEIPESPGSETLALTMVPVLDYVNHSP 202
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFV-PREGTNPSDS 331
+ + D ++ D + G++VFI+YG KS E L YGF+ G S +
Sbjct: 203 KANCRFEVDSGEVVLIVNEDVLIKAGDEVFINYGPDKSAAEFLFCYGFIDAAHGVTKSIT 262
Query: 332 VELPLSL 338
+E PL L
Sbjct: 263 LETPLML 269
>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
Length = 296
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW S++G + + GL A ++I+ E L+VP L++T +S
Sbjct: 82 LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
RYL+++Q + N Y ++ +P L ++ F E ++W+ + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
R ++P+ DG +AL+P DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280
>gi|320586350|gb|EFW99029.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 537
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 92/238 (38%), Gaps = 63/238 (26%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL-----------DRYLEASQIRERA 203
+L+ + + S W YI++LP QP + W L LE + A
Sbjct: 113 FLMQQYLMGQQSHWHAYIASLP-QPEHVSSWNLPALWPRNGEDAESGPALLEGTNAGIAA 171
Query: 204 IERITNVIGTYNDLRLRIFS-KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM------ 256
E NV Y R + S YP L F + W+F I SR R PS+
Sbjct: 172 AEMQANVAQEYRQARRLLKSVAYPALAE---FTRLLYHWAFCIFTSRSFR-PSLVLSAEA 227
Query: 257 ------DGRVA-------------------LVPWADMLNH------SCEVETFLDYDKSS 285
DG + L+P D+ NH + E + D S+
Sbjct: 228 QTELGTDGEASGGRSVPRLSSGCKTDDFSILLPVLDLANHDPTARATWEATSAHLADGSA 287
Query: 286 QG--------VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVEL 334
G V F ++Y PG+QVF +YG K+N ELLL YGFV P T +D V L
Sbjct: 288 SGHEAASVTGVSFRVQQRYAPGQQVFNNYGMKTNSELLLGYGFVLPPTATLHNDYVHL 345
>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
Length = 490
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 6/98 (6%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG---VVF 290
F + +K+ + SR+V LP A+VP DM NH+CE YD+ G +
Sbjct: 198 FTFDDWKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDSVKARYDEEGAGNAVLQL 255
Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
T ++ + GE+V ISYG +K E++ SYGFV E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293
>gi|358374896|dbj|GAA91484.1| ribosomal N-lysine methyltransferase [Aspergillus kawachii IFO
4308]
Length = 445
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 70/145 (48%), Gaps = 23/145 (15%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
++PE + M + W I+ SR S + + +VP+AD NH + +
Sbjct: 197 VYPETEWKMFAYYWC--IINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACDV 254
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSL 338
++D + F R+Y+ GE+V++SYG SN LL+ YGF +P TNPSDS+ L
Sbjct: 255 NFD--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLVEYGFTLP---TNPSDSIYL---- 305
Query: 339 KKSDKCYKEKLEALRKYGLSASECF 363
D + L +K L+ E F
Sbjct: 306 ---DDIIFQDLSISQKQELAKQEIF 327
>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
Length = 402
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 132/311 (42%), Gaps = 41/311 (13%)
Query: 73 LENAST-LQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
+EN+ L +W+ S+ G K+ I+++ RGLV + E+L+ + ++ S
Sbjct: 1 MENSHVQLIEWITSNGGYISPKLEIRELPGRGRGLVVNDRVYPNERLIHLKLRQLLNYSS 60
Query: 131 KWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL---- 183
+ G L + S+ +LA +L+ + S S W ++ LP + L
Sbjct: 61 IHNAMVDGGHLSETEYRSMSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEGFLDVPL 120
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMETFK 240
W++ + D + I + + T+ D +KY D P + +
Sbjct: 121 QWSKEDQD------SLTPEGIVVLKKTLDTFEADYDKTKTFVAKY-DSDPRDAY-----L 168
Query: 241 WSFGILFSRLV--RLPSMDGR---------VALVPWADMLNHSCEVE-TFLDYDKSSQGV 288
W++ + SR + L G+ + L P+ D++NHS E T SS G
Sbjct: 169 WAWLCVNSRCLYFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGF 228
Query: 289 -VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS--DKCY 345
+ R Y E++F+ YG +SN LL YGF E NP D V++ +L+ + K +
Sbjct: 229 EILCGQRGYTADEEIFLCYGPRSNSVLLCEYGFTVPE--NPWDDVDISDALENTFLTKQH 286
Query: 346 KEKLEALRKYG 356
+ L + YG
Sbjct: 287 ETVLREMGYYG 297
>gi|412990233|emb|CCO19551.1| predicted protein [Bathycoccus prasinos]
Length = 417
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 72/297 (24%), Positives = 125/297 (42%), Gaps = 36/297 (12%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--------LVALKNIRKGEKLLFVP 121
++ +E S W +G+ Q + +VD G +V+ +++ EKLL +P
Sbjct: 11 LEVIERLSKFLSWSVSNGI--QVLDAVRVDARWDGVNKKYTLCIVSTRHLHCFEKLLSIP 68
Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPY 180
++ + A + C + E++ + L + E + S W +Y+ LP +
Sbjct: 69 KTVCLGAKT---CSISKELV-AVGLGGGLALNFAIAQELALGPDSCWFDYLCILPSKGEQ 124
Query: 181 SL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
SL ++W++ E + L+ + + I + Y + ++ KY + F + N E +
Sbjct: 125 SLPMFWSKQERKK-LKGTSLYSHIIMDDQSFADDY-EFGFKLLQKYIN-FKSDRVNFELY 181
Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY-DKSSQGVVFTTD----- 293
K + I SR + G L+PWAD+ NHS Y KSS F D
Sbjct: 182 KKAVSIAASRAFYIDEYFGE-CLIPWADLFNHSTHNMHVKVYCSKSSSRNTFDMDENEVL 240
Query: 294 -------RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
R+YQ ++F ++G +SN LL YGF N S+ P + DK
Sbjct: 241 IRSVQSVRKYQ---ELFNTFGLQSNSSLLHKYGFCELSNKNGFVSIYSPFGKLRRDK 294
>gi|159468798|ref|XP_001692561.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278274|gb|EDP04039.1| predicted protein [Chlamydomonas reinhardtii]
Length = 724
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 46/183 (25%), Positives = 82/183 (44%), Gaps = 15/183 (8%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
L ++ EA+ +S+W Y+ +LP + Y ++W+ +L + L + + ++A E ++
Sbjct: 93 LVAAVMYEAARGPASKWHGYLRSLPAREYLPVFWSARQLQQ-LAGTDLADKAEEDRASMA 151
Query: 212 GTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
++ + S+YP +++E F + + SR + G ALVP AD+
Sbjct: 152 ADFSTHLAPLLSRYPGRLGHLAAGWSLEAFMHAASWVASRAFYVDDTHGD-ALVPLADVF 210
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQY--------QP---GEQVFISYGKKSNGELLLSY 318
NH + S G V QP G +V+ +YG+ SN EL+ Y
Sbjct: 211 NHKAARVDLGEGSGWSAGFVVAEQEGVELLDIVAAQPLAGGTEVYNTYGEHSNAELVNKY 270
Query: 319 GFV 321
GF
Sbjct: 271 GFA 273
>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
carolinensis]
Length = 311
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ ++ GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y K E L
Sbjct: 18 QDFKAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 75
Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
+ G+ S F + T P+ A+L V
Sbjct: 76 RAGIPTSSVFALHATEPPISAQLLAFLRV 104
>gi|313230936|emb|CBY18934.1| unnamed protein product [Oikopleura dioica]
Length = 303
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%)
Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
R AL PW DMLNHS ++D ++ ++ T + GEQ+FISYG +++ +LLL Y
Sbjct: 66 RAALAPWFDMLNHSRSNNAEFEFDFTTGRLLVTCVSPIKAGEQLFISYGARADDDLLLEY 125
Query: 319 GF 320
GF
Sbjct: 126 GF 127
>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
distachyon]
Length = 480
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 53/282 (18%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS-CPEAGEVLKQCSVPDWPLLATY-- 155
D G RG A +++R+GE +L VP + ++T+D + PE + C P L++
Sbjct: 38 DAGGRGFAAARDLRRGELVLRVPRAALLTSDRVMADDPE----IASCIAARHPRLSSVQR 93
Query: 156 ----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITN 209
L++E KSS W Y+S LP L + E++ + +A I ++++ I +
Sbjct: 94 LIVCLLAEVGKGKSSSWYLYLSQLPSYYTVLATFNDFEIEALQVDDAIWIAQKSLSAIRS 153
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
L + F ++ +T+ W+F + SR + + + D L P D+
Sbjct: 154 EWEDATPLMQGL------KFKPKLLIFKTWLWAFATVSSRTLHV-AWDDAGCLCPVGDLF 206
Query: 270 NHSC----------------------EVETFLDYDKS-----------SQGVVFTTDRQY 296
N++ E+ + + +S S+ + Y
Sbjct: 207 NYAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYARKCY 266
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
GEQV + YG +N ELL YGF+ E N ++L L L
Sbjct: 267 TKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQLDLDL 308
>gi|26344391|dbj|BAC35846.1| unnamed protein product [Mus musculus]
Length = 462
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/272 (22%), Positives = 116/272 (42%), Gaps = 27/272 (9%)
Query: 81 KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+W GL P+ ++ V G+VA +++R G L VP S ++ S +C +G
Sbjct: 51 RWCRRVGLELSPKVTVSRQGTVAGYGMVARESVRAGTLLFAVPRSALL---SPHTCSISG 107
Query: 139 EVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
+ ++ S+ W + + +S WS Y + P + ++W E R
Sbjct: 108 LLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERLR 166
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
L+ + + E + + N+ Y + L + DLF V ++E ++ ++ + +
Sbjct: 167 LLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPSVRSLELYQQLVALVMAYSFQ 226
Query: 253 LPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQV 302
P + +VP AD+LNH L+Y +V T QP G ++
Sbjct: 227 EPLEEDDDEKEPNSPLMVPAADILNHIANHNANLEYSADYLRMVAT-----QPILEGHEI 281
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
F +YG+ +N +L+ YGF N D+ ++
Sbjct: 282 FNTYGQMANWQLIHMYGFAEPYPNNTDDTADI 313
>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
Length = 467
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 65/281 (23%), Positives = 113/281 (40%), Gaps = 51/281 (18%)
Query: 94 AIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE----AGEVLK 142
AI+ VD+ RG+ AL++I L +P +I +++ + P+ G+ +
Sbjct: 28 AIEIVDLRSRDAGRGITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDE 87
Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
+ ++ W L ++ E S+W YI LP + ++W+ EL YL+AS
Sbjct: 88 EQALDSWSALILIMMYEFFLGDESKWKPYIDVLPLTFDTPMFWSEEEL-SYLQASA---- 142
Query: 203 AIERITNVIGTYND---LRLR----------IFSKYPDLFPEEVFNME------TFKWSF 243
N IG + R R +F D E++ + ++F
Sbjct: 143 ----TVNKIGKADAEEMFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAF 198
Query: 244 GILFSRLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+ DG V +V AD+LN E +++ + + T+ R
Sbjct: 199 DLENEEAENDEESDGWVEDREGKSMMGMVAMADILNADAEFNAHVNH--GDEELTVTSIR 256
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
+ GE++ YG N ELL YG++ E + D VE+P
Sbjct: 257 DIKAGEEILNYYGPHPNSELLRRYGYIT-EKHSRYDVVEIP 296
>gi|313216036|emb|CBY37421.1| unnamed protein product [Oikopleura dioica]
gi|313219606|emb|CBY30528.1| unnamed protein product [Oikopleura dioica]
Length = 346
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 131/332 (39%), Gaps = 65/332 (19%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI-- 126
E++ + S L+ ++ ++G ++ + K + G++A +I++ E L+ +P S I
Sbjct: 2 EVEKDKAISHLRSFIEENGFVDPRIQVCKTAF-DLGIIASASIKEDEVLIRIPRSCQINL 60
Query: 127 -TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----PY 180
+ DS + K + +LA YL E S +I LP Y
Sbjct: 61 HSFDSSQTSISKFIRQKNIKLTCHQILALYLALERRNPLSPAMKKFIPTLPSSCCLPLNY 120
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
S + YL + +++ E + +G L E
Sbjct: 121 SPKAMANLPCEVYLLSVALQKNVTE-LCFALGKQIGL-----------------TKEDLT 162
Query: 241 WSFGILFSRLVRLPSMDGR--------------VALVPWADMLNHSCEVETFLDYDKSSQ 286
W+F ++ SR LP D L P+ D++NHS + + D +
Sbjct: 163 WAFSMVLSRTFSLPKYDKSSDFDYCSQVDSSKSAFLCPFMDLINHSSAPNCYYETDSETG 222
Query: 287 GVVFTTDRQYQPGEQVFISY-GKKSNGELLLSYGFVPREGTN-------------PSD-- 330
V DR+ Q E++FI+Y G KS+ LL YGF G N PS
Sbjct: 223 DFVLRADRELQQKEELFITYGGSKSDHVLLAFYGFCLPPGVNRNSYIVFSPNFIGPSSHS 282
Query: 331 ---SVELPLSLKKSDKCYKE----KLEALRKY 355
+ + L+ KK+ KC+KE KLE+ RK+
Sbjct: 283 KFTAFKFFLNSKKA-KCFKESLNKKLESWRKF 313
>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
Length = 736
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 11/192 (5%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNV 210
LA L++E S W Y+ LP + + +++ +E S +R ++ +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEKYAHMPIFYNNSEFGSIQIPSLMR--TVQSRCRM 317
Query: 211 IGTYNDLRLRIFSK---YPDLFPEEVFNMETFKWSFGILFSRLVR-LPSMDGRVALVPWA 266
+ +D LR S + F ++V + W SR +R +P + +VP
Sbjct: 318 LVNISDGYLRQLSHGGPAENPFLDDV-HANDMGWGLCAASSRALRNIPGLGSTPLMVPVI 376
Query: 267 DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D H+ ++ DY KS + R QPG+ + ISYG +N +LLL YGF +
Sbjct: 377 DFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNPQLLLDYGFTLSD- 435
Query: 326 TNPSDSVELPLS 337
NP D E+ LS
Sbjct: 436 -NPHDRFEVTLS 446
>gi|190345582|gb|EDK37493.2| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
6260]
Length = 592
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 128/311 (41%), Gaps = 57/311 (18%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEK-LLFVPPSLVITADSKW-S 133
+ L +W GL + I+ +GE A GEK + +P L IT DS S
Sbjct: 2 VAELVQWAKTQGLELNE-GIEFRGIGENNTGAFYTTNNGEKPYIRLPVELAITVDSALRS 60
Query: 134 CPEAGEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELD 191
+ E L+ QC + +L L E S K+S Y+ LP Q + Y AE
Sbjct: 61 FGQDLEALRDQCDSSN-TVLKLCLARERSRLKNSTIKKYLECLPTLQQMNTPYCWDAETK 119
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF--PEEVF-NME----------- 237
RYL+ + + E I ++ + +I + PD PE+ F NM+
Sbjct: 120 RYLQGTNLGSSLKENIGVLVEEW----WKIINLLPDSVQKPEQHFVNMKYYYESKFYTDD 175
Query: 238 -------------------TFKWSFGILFSR----LVRLPSMDGRV-----ALVPWADML 269
F W+ IL SR + ++D V L+P D+L
Sbjct: 176 DAYAYFVTNEDPANWTSFPNFLWASIILKSRSFPAYLIADAVDWDVKRHDTMLLPVIDLL 235
Query: 270 NHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
NHS VE L+ +S VF +D + G Q+F +YG K N ELLL+YGF + N
Sbjct: 236 NHSPSAHVEWGLERKESKSYFVFKSD-DVKSGSQLFNNYGMKGNEELLLAYGFCLED--N 292
Query: 328 PSDSVELPLSL 338
SD L + +
Sbjct: 293 SSDVSALKIKV 303
>gi|380490713|emb|CCF35823.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 403
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 61/119 (51%), Gaps = 14/119 (11%)
Query: 257 DGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
D + L P AD+LNH+ C V +D S ++ DR Y PG+++ I YG+ SN
Sbjct: 206 DDHMVLQPVADLLNHAPRGCSVA----FDARSFTIL--ADRDYSPGDEIHICYGRHSNDF 259
Query: 314 LLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEA---LRKYGLSA-SECFPIQI 367
LL+ YGFV G N D L + L + ++ +LE L KY L A + C+ Q+
Sbjct: 260 LLVEYGFVMAPGENDWDEACLDDVLLPRLSDAHRRRLEERGFLGKYMLDAETVCYRTQV 318
>gi|145524165|ref|XP_001447910.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415443|emb|CAK80513.1| unnamed protein product [Paramecium tetraurelia]
Length = 717
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 42/176 (23%), Positives = 83/176 (47%), Gaps = 11/176 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCP------EAGEVLKQCSVPDWP--LLATY 155
GL+A + I + L+ +P ++T + P E + +P W ++ ++
Sbjct: 64 GLIATEKIVPNDNLVILPRETLLTTRQAFESPLKPMFLEFPQFFSPKFMPSWQYHIILSF 123
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ E S+W I+ P+ ++W +L+ L+ ++ + AI++ +I T+
Sbjct: 124 LLYEYQKGAESKWHLLINNFPKDIDYAVFWKSEDLE-LLQDKKMAKHAIQKNRYLITTFQ 182
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
L+ I SK+PDLF EV +E W + + +R + V +VP+ ++ NH
Sbjct: 183 TLQY-ITSKFPDLFKPEVVTLENIIWIYTSIVTRCFGGQGL-KYVTMVPFCELFNH 236
>gi|357491725|ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula]
gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP [Medicago truncatula]
Length = 532
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 127/319 (39%), Gaps = 94/319 (29%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADS-----KWSC------------------PEA 137
G RGL A++++++GE +L VP S ++T++S K C P+
Sbjct: 51 GGRGLGAVRDLKRGEIILRVPKSALMTSESVIMEDKKLCLAVNRHSSLSSVQRNTPNPKR 110
Query: 138 GEVLKQ----------CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
V ++ C +L L+ E K+SRW Y+ LP Q Y LL
Sbjct: 111 CHVTERSRVQVLETASCVKQGKAILTVCLLYEVGKGKTSRWHPYLVHLP-QSYDLLA-MF 168
Query: 188 AELDRYL----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
E ++ EA + E+A+++ + + L + +F ++ + + W+
Sbjct: 169 GEFEKQALQVDEAMWVTEKAVQKAKSEWKEAHALMEDL------MFKPQLLTFKAWVWAA 222
Query: 244 -------------GILFSRLVRLPSMDGRVALVPWADMLNHSC---------EVETFLD- 280
G++ SR + +P D L P D+ N+ +V+ FL
Sbjct: 223 ATGRTVPETFHLPGLISSRTLHIP-WDEAGCLCPVGDLFNYDAPGEELSGVEDVDHFLSN 281
Query: 281 -----------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+++ + F Y+ G+QV + YG +N ELL
Sbjct: 282 GDMNVVIDEGQIDFNSQRLTDGGFEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEH 341
Query: 318 YGFVPREGTNPSDSVELPL 336
YGF+ +E NP+D + +PL
Sbjct: 342 YGFLLQE--NPNDKIFIPL 358
>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
[Ornithorhynchus anatinus]
Length = 345
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ + GEQ++I YG +SN E ++ GF N D V++ L + KSD+ Y K E L
Sbjct: 51 QDFTAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 108
Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
+ G+ S F + T P+ A+L V
Sbjct: 109 RAGIPTSSVFALHFTEPPISAQLLAFLRV 137
>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
Length = 474
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 89/407 (21%), Positives = 154/407 (37%), Gaps = 54/407 (13%)
Query: 29 TDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGL 88
+D P +R + SV+ T A + ++ + N + + +G+
Sbjct: 22 SDAPARRRAPVVAARASVTPTEPAPQAPQQQPPAVLLDDSRTQTFMNWARGPASIRFAGV 81
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSKWSCPEAGEVLKQCSV 146
P A G RGL A +I ++ VP ++V+ + SCP +
Sbjct: 82 KPSTFA------GIRGLAASSDIANDALIVEVPRHSAVVLAPKQRNSCPGMVNDEWWKNA 135
Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR----YLEASQIRER 202
P + + L+ S + +I+ LP L W+ +L YL A Q++E+
Sbjct: 136 PWFAKMGAMLLWHKRQGSQSPLAPWIAQLPADTGVPLNWSDKQLAALQYPYLVA-QVKEQ 194
Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG---- 258
E T + T + + P + E F W+ G++ SR P +
Sbjct: 195 QRE-WTALYDTLRGSGMAAGAAPP--------SREEFWWAMGVVRSRTFSGPYIGSTLSD 245
Query: 259 --------------------RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
+ A+ P D+ NH+ ++ + Y+ R ++
Sbjct: 246 RLRLAGLVAALVVILSRSLKQYAICPLIDLFNHTSAAQSEVSYNYFGDSYSVVASRDFKK 305
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV---ELPLSLKKSDKCYKEKLEALRKY 355
GEQVFI+YG +SN L+ YGF E NP D+ ++ L+ +++AL+
Sbjct: 306 GEQVFITYGAQSNDSLMQYYGFA--EADNPQDTYVISDVLRWLQGFRPLPPGRVQALQGS 363
Query: 356 GLSAS--ECFPIQITGWPLE-LMAYAYLVVSPPSMKGKFEEMAAAAS 399
L A+ +Q G+P E L A +L+ S A A S
Sbjct: 364 SLGAACLSNVAVQRAGFPAEALQALRFLLASDAEAAAGVSAFAKAGS 410
>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
Length = 429
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 16/154 (10%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSMD------GRVALVPWADMLNHSCEVETFLDY 281
+FP+ + ++ W S +P D +A+VP+AD NH+ + E + +
Sbjct: 194 VFPDTDWEKFSYHWLIVNTRSFYYLMPGQDPPEDTNDAMAMVPFADYFNHTDDAECEVHF 253
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKK 340
D S F R Y+ GE++++SYG N L + YGF TN SD++ L + K
Sbjct: 254 DGKS--YTFRATRLYKKGEEIYMSYGPHPNDFLFVEYGFYLE--TNESDAIFLDDIIFKD 309
Query: 341 SDKCYKEKLEALRKYG-----LSASECFPIQITG 369
KE+L R YG L + CF ++
Sbjct: 310 FTVAEKEELIRQRYYGNYQITLESGPCFRTEVAA 343
>gi|444314545|ref|XP_004177930.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
gi|387510969|emb|CCH58411.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
Length = 550
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 82/184 (44%), Gaps = 19/184 (10%)
Query: 167 RWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP 226
R+ YI+ALP S L W EL + L+ + + ER NV + + KY
Sbjct: 100 RFEPYINALPEIIDSPLNWNEDEL-KLLQNTNLGNCLKERFQNVYDEW----FKFLEKYQ 154
Query: 227 DLFPEEV------FNMETFKWSFGILFSR-----LVRLPSMDGRVALVPWADMLNHSCEV 275
+ E +N F W+ I+ SR ++ V L+P D+LNHS
Sbjct: 155 NYQEFETQSETSWYNFSNFLWAHLIITSRSFPEYIINPNCPRDSVMLLPVLDLLNHSNYS 214
Query: 276 ETFLDYDKSSQGVVFTTDRQ-YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
+ D +K + D Q + G++++ +YG K N ELL YGFV + N DSV L
Sbjct: 215 KVEWDGNKGGNFIYKKLDLQEIEIGDEIYNNYGGKGNEELLNGYGFVIED--NLFDSVLL 272
Query: 335 PLSL 338
+ +
Sbjct: 273 KIKI 276
>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
Length = 565
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 106/245 (43%), Gaps = 27/245 (11%)
Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEA 196
D PLL LI E ++S W+ + + P + P +W +L D +
Sbjct: 197 DEPLLVLSLIYERYVAETSHWNELLLSCPGEYPNVPSFWDWEDLAELEGLDVLDDVLAKK 256
Query: 197 SQIRERAIERITNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+Q+ + E + + + L F K L E F++E W+ SR L +
Sbjct: 257 AQLAQFQTETMAVLPFIHEALAGGCRFGKDEFL---ECFSIEAMMWARATFDSRAFNL-N 312
Query: 256 MDGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKK 309
+DGRV ALVP ADM+NH + + + + G + + G ++++SYG
Sbjct: 313 VDGRVVIALVPVADMINHHNRSDVLVRRVEPNGGDFVMQIGASLTAQDIGREIWMSYGPL 372
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQ 366
N ELL YGFV EG N D + PL ++ D+ + + KYGL + C I
Sbjct: 373 QNWELLQFYGFV-LEG-NEHDRLPFPLDFPEAAVGDEWDGRRAALVAKYGLHLAGCCWIC 430
Query: 367 ITGWP 371
G P
Sbjct: 431 HDGRP 435
>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
Length = 525
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 86/179 (48%), Gaps = 9/179 (5%)
Query: 95 IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLAT 154
+ K + G+++ K+++ + +P ++++ + +L + ++ A
Sbjct: 72 LNKTIISGLGIISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAI 127
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG-T 213
LI EAS + S+W YIS+LP + + W + + L + E I+ +I
Sbjct: 128 ALIYEASIGEKSKWYGYISSLPLKVDIPILWDKES--QQLLNGTVMEDVIQDDNILINHA 185
Query: 214 YNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
Y D+ + K +P+ F +E+F+ E FK + I+ SR + S G +LVP AD+ NH
Sbjct: 186 YADIVESLLIKNHPEYFSKEIFSFENFKIANSIVSSRAFCIDSYHGD-SLVPLADIFNH 243
>gi|310794069|gb|EFQ29530.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 375
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 59/116 (50%), Gaps = 8/116 (6%)
Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
D + L P AD+LNH+ + D+S +R Y PGE++ I YG+ SN LL+
Sbjct: 176 DDHMILQPVADLLNHASRGCSVAFDDRS---FTIKAERDYAPGEEMHICYGRHSNDFLLV 232
Query: 317 SYGFVPREGTNPSDSVELPLS-LKKSDKCYKEKLEA---LRKYGLSA-SECFPIQI 367
YGFV +G N D L + L + D + +LE L KY L A + C+ Q+
Sbjct: 233 EYGFVMAQGENEWDEACLDDAILPRLDAACRRRLEERGFLGKYMLDAETVCYRTQV 288
>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
Length = 492
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 37/98 (37%), Positives = 54/98 (55%), Gaps = 6/98 (6%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE--VETFLDYDKSSQGVV-F 290
F + K+ + SR+V LP A+VP DM NH+CE V+ D D + V+
Sbjct: 198 FTFDDLKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDLVKARYDEDGAGNAVLQL 255
Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
T ++ + GE+V ISYG +K E++ SYGFV E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293
>gi|294896472|ref|XP_002775574.1| Protein SET DOMAIN GROUP, putative [Perkinsus marinus ATCC 50983]
gi|239881797|gb|EER07390.1| Protein SET DOMAIN GROUP, putative [Perkinsus marinus ATCC 50983]
Length = 416
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 104/266 (39%), Gaps = 50/266 (18%)
Query: 87 GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC-PEAGEVLKQCS 145
G ++A+++ RG+ I+ G+ + +P +ITA+ C + EV ++
Sbjct: 18 GAELNRVAVEEFSGAGRGVRVKTAIQSGQVAIGIPHKFIITANVSSPCLSDDSEVYRK-- 75
Query: 146 VPDWPLLATYLISEASFEKS---------------SRW-SNYISALPRQPYSLLYWTRAE 189
W + ++S A S W S Y+ LP + ++ YW+ A+
Sbjct: 76 ---WIMKMGKILSGAELLSLILLRLLERSRSNLSPSDWRSLYLRTLPLEYTTISYWSEAD 132
Query: 190 LDRYLEASQIRERAIERITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
+ AS + +E + +R +Y ME +W++ + +
Sbjct: 133 KKMFSAASSVLAEELETAERTCKISCEKIRQATGGRYA---------MEDIEWAYWTIET 183
Query: 249 RLV--RLPSMDGRVALVPWADMLNHSCEV------------ETFLDYDKSSQGVVFTTDR 294
R RL G V LVP DM+NHS E +Y+ FT R
Sbjct: 184 RGCYHRL----GGVCLVPLGDMVNHSAEAYSTENCGKCGMQNRCGNYNVREHRYEFTAMR 239
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGF 320
Y EQ F+ Y ++ +LL+ YGF
Sbjct: 240 DYNENEQFFVVYSGCASTDLLMRYGF 265
>gi|149059900|gb|EDM10783.1| hypothetical protein RDA279, isoform CRA_c [Rattus norvegicus]
Length = 314
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 97/216 (44%), Gaps = 24/216 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
L+KWL + + + G RGL++ ++++G+ ++ +P S ++T D+
Sbjct: 70 LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRS-SV 128
Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
G +K+ P PLLA T+L+SE S W +Y+ LP+ P L L
Sbjct: 129 GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 188
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
L A +RA R+ ++ + D FS LF E V F+ F W++ + +R
Sbjct: 189 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 242
Query: 250 LVRLPSMDGR--------VALVPWADMLNHSCEVET 277
V L S AL P+ D+LNHS V+
Sbjct: 243 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQV 278
>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 797
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 52/105 (49%), Gaps = 20/105 (19%)
Query: 226 PDLFPEEVFNMETFKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVE 276
PDL +E + W SF + PS D R+AL+P AD+LNH+ C V
Sbjct: 559 PDLEKQEYL----YSWFLVGTRSFYYEIEETLSYPSHD-RLALLPVADVLNHANAGCSVA 613
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
S++ T DR YQ GE+V+ SYG SN LL YGFV
Sbjct: 614 F------STEAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYGFV 652
>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
Length = 366
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 20/227 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP----DWPLLATYLIS 158
RG+ ++I++G K++ +P L+++ D C E L++ +V A +L+
Sbjct: 2 RGVYLNRDIQRGTKIIKIPKKLIMSCDMGRDC-ELSANLREQNVDFEKYKHVYFANFLLE 60
Query: 159 EASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
+ E S + Y LP ++ + WT +E+++ L S R+ + Y +
Sbjct: 61 DMENEDSF-YKPYYDTLPEDISNIPVIWTNSEINQ-LHGSYFSICIRSRVVEIYRDYQKM 118
Query: 218 --RLRIFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
F +YP F + + + +FG F+ L LVP ADMLNH+
Sbjct: 119 CDVNSFFCRYP--FDQYLRVRLLIGSRNFGSFFNSL-------NNGILVPLADMLNHTRP 169
Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+T +YD + V T+ + G QV SYG++ N LL SYGFV
Sbjct: 170 RQTTWEYDDKEKAFVITSLLNLRQGAQVMDSYGRRDNRRLLFSYGFV 216
>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
Length = 330
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 102/230 (44%), Gaps = 32/230 (13%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
G G A +I +GE+LL+VP S +T P + L + V +LA L+
Sbjct: 41 GMIGCTATADICQGERLLYVPHSACVT-------PSGVQGLYEPQV----MLAASLVKHR 89
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ + +S + +Y+ +LP + L W+ EL L+ + + E + L L
Sbjct: 90 T-DPNSPFHDYLQSLPSEFDHPLEWSADEL-VCLKGTTVWE------------MHQLSLE 135
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ +L P M +W+ ++ SR S + ++P AD NHS +
Sbjct: 136 VVDSVVELCPNSPRAM--IRWAVEVMMSRA--FESEVCGLCVIPLADQFNHS-STKWHTR 190
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
+ G ++ + GE++F +YG +N LLL++GF+ E NP D
Sbjct: 191 VREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGFI--EFDNPHD 238
>gi|145551877|ref|XP_001461615.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429450|emb|CAK94242.1| unnamed protein product [Paramecium tetraurelia]
Length = 666
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 81/176 (46%), Gaps = 12/176 (6%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSVPDWP--LLATY 155
GL ++ I L+ VP L++T +D + + Q W +L TY
Sbjct: 69 GLQTIQKIETDSILVSVPRELMLTTKIAYFSDIQEIFDAYPQFFCQHCAGGWQDRILLTY 128
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
++ ++ + S+W + IS LP+ L++W+ EL + L ++ +A + + +
Sbjct: 129 ILYQSQLGRQSQWYHLISNLPKDIDYLIFWSEQEL-KLLNDEKLILKAKRDLQDFLLIQK 187
Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
L I ++P F +E ++ E KW F L SR S +VA VP+ +M NH
Sbjct: 188 TLT-HILDQFPQHFQKETYSFENIKWIFIHLVSRC--FGSTLEQVAFVPFCEMFNH 240
>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
Length = 464
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 118/248 (47%), Gaps = 21/248 (8%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----DSKWSCPEAGEVLKQCS-VPD--WPL-LATY 155
GLVA +++ +G ++ +P + + ++ P G + + + VP+ W + L
Sbjct: 49 GLVATQDLPQGSTIITLPRRIPMPMPDPENAAVLAPSEGVICEIANRVPEELWAMRLGLK 108
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ E + +K S W YIS LP ++++ +++ ++ + + + +R ++ +
Sbjct: 109 LLYERA-QKGSYWWPYISMLPHSFTLPIFFSGVDIES-IDYAPVTHQVKKRCRFLLQFSS 166
Query: 216 DL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH 271
+L +L + F + + W+ + SR R+ + ++ ++P DM NH
Sbjct: 167 ELAKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGVTNKLCSAMMLPLIDMCNH 226
Query: 272 SCEVETFLDYD--KSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
S + ++ D + +Q V F T R + G + ++YG SN LLL YGFV +
Sbjct: 227 SFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPD-- 284
Query: 327 NPSDSVEL 334
NP D +EL
Sbjct: 285 NPHDRIEL 292
>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
Length = 464
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 127/300 (42%), Gaps = 52/300 (17%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCP----------EAGEVLKQCSVPDWPLLA 153
G++A ++I G+ + VP +L++TA+ ++ E+ + D LL
Sbjct: 169 GVIARRDIPAGQTFINVPEALMMTAEKARKSETFQLITSGALDSTELSPAMAKLDNFLLR 228
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
+LI E +S WS YI LP++ LY+T AEL+ L+ S + A ++ NV+
Sbjct: 229 MFLIVERRRGGNSYWSPYIDLLPQRFRLPLYFTEAELE-LLKPSPALQEAFVQLRNVVRQ 287
Query: 214 Y----------------------NDLRLRIFSK------YPDLFPEEVFNMETFKWSFGI 245
Y D +I + P + E +++ F W+
Sbjct: 288 YAAWKQYLMMLELARAAELPSGSGDAHQKILDQRRRAQAMPVRYNELTYDL--FCWASSA 345
Query: 246 LFSRLVRLPSMDGR--------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
+ +R ++ + R +AL+P DM NH+ + YD ++ +
Sbjct: 346 VATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGGASSF-YDTQTRSLECVAVAPIA 404
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
GE V + YG +S+ + FVP + +P+D + L++ K D +K K L+ G+
Sbjct: 405 KGEPVLLHYGDRSSMAYFGNSEFVPAD--HPTDQYLILLAVGKQDPLFKSKSTILQALGV 462
>gi|397589374|gb|EJK54638.1| hypothetical protein THAOC_25717 [Thalassiosira oceanica]
Length = 468
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 134/316 (42%), Gaps = 46/316 (14%)
Query: 45 SVSTTNDASRTKTTVTQNMI----PWGC------EIDSLENASTLQKWLSDSGLPPQK-- 92
VS + A RT+TT + P +I + + ++++W + G+ QK
Sbjct: 28 GVSVFDSARRTRTTANAAALNQYAPQAAGLVDVSDIYAQRDVYSMEEWAAQFGM--QKAP 85
Query: 93 -MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS------ 145
+ I D + L A + I G+ +++VP +V+ + S E GE L Q
Sbjct: 86 GVEIASEDGVDYSLQATQPISTGQSVVYVPSDIVLNSASIQQ--EFGESLAQAEAVLVQG 143
Query: 146 --------------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
+P + L+A L+ ++S+ + ++++LPRQ ++ + T+A
Sbjct: 144 IKVRVRVKEGINYRLPLFRLMAKILVEYEKGQESAFYP-WLNSLPRQFFNGVSMTKAC-- 200
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
S + A +N Y + + L E V N E KW++ + F+R
Sbjct: 201 ----TSCLPPYAGWLTSNEKINYARFAQALRQGWVPLSQETVSNEEVVKWAYNVAFTRFH 256
Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
+ + + P ADMLNH+ + ++ D S + R+ G + ISYG +N
Sbjct: 257 EVWQPERAKLIGPMADMLNHAADPNCAIEVDYSG-NINVVALREIPAGSALTISYGDPTN 315
Query: 312 -GELLLSYGFVPREGT 326
L YGF+P++ T
Sbjct: 316 PTPLFAQYGFLPQDCT 331
>gi|328866266|gb|EGG14651.1| hypothetical protein DFA_10909 [Dictyostelium fasciculatum]
Length = 581
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 43/78 (55%), Gaps = 2/78 (2%)
Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
+ LVP AD+ NH+ V+T Y + + TD +++ GEQVFISYG +N LL
Sbjct: 352 NDNACLVPLADLFNHNPNVKTMASYCAADRCYRVYTDTRFEKGEQVFISYGLHNNATLLH 411
Query: 317 SYGFVPREGTNPSDSVEL 334
YGFV N D +E+
Sbjct: 412 YYGFVI--DNNHLDGIEI 427
Score = 40.0 bits (92), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 91/198 (45%), Gaps = 21/198 (10%)
Query: 63 MIPWGCEIDSLENASTLQ-KWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKL 117
MI EI + E+ L +W +G+ + ++ D G+ RG++A + I G+ L
Sbjct: 1 MISNNKEISNQEHQERLMIEWGKKNGVKWDEEMMEIHDFGDSGGGRGVIAKRTIESGDLL 60
Query: 118 LFVPPSLVITADSKWSC-PEAGEVLKQCSVPDWPLLATY-LISEASFEKSSRWSNYISAL 175
+ VP SL+I + S P + + D + LI E SRW Y+ +
Sbjct: 61 VEVPLSLLIHSLPILSVVPPFEHIETVLKLLDSKQTICFQLIYERLIRNRSRWYGYLDCI 120
Query: 176 PRQPYSLLYWTRAELDRYL------EASQIRERAIE---RITNVIGTYNDLRLRIFSKYP 226
P++ + + +T AE+ EA+++R+ ++ + ++ ++ L L++ S +
Sbjct: 121 PKEYNTTVSYTDAEIGELSYPYYKNEATKLRKEMLDSHKQYKEILQSH--LTLKVLSSHS 178
Query: 227 DLFPEEVFNMETFKWSFG 244
+L + N TFK S G
Sbjct: 179 EL---DNNNNSTFKSSGG 193
>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 493
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/273 (27%), Positives = 117/273 (42%), Gaps = 33/273 (12%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVI----TADSK--W------SCPEAGEVLK-QCSVP 147
G RG+VA K+I + L+ + S + T +SK W S +A + + + S P
Sbjct: 66 GYRGVVATKDIPRDAVLVRIARSCCLGPETTDESKNSWTKAMSTSAVDATKTTQGESSKP 125
Query: 148 DWP------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
P L L+ E +SS + +Y+S LP+ L WT AE+ S
Sbjct: 126 PAPRLTRACLTVLRLLHERGLGESSPFHSYLSVLPQDHRLPLEWTEAEVGLLQGTSAEPL 185
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
+ + + + +++P ++ V F + SR ++ G
Sbjct: 186 VGAGSLDSQFEAFQS----VVAQHPTVWEPSVCTKAAFAKGVNWVRSRGF---TVMGDPH 238
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
++P ADM NH ++ + V T + + GE+VF S+G SN +LL SYGFV
Sbjct: 239 MIPGADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQLLNSYGFV 298
Query: 322 -PREGTNPSDSVELPLSLKKSDKCYKE--KLEA 351
P N D+V +P L + CY KLEA
Sbjct: 299 LP---GNSFDTVLIPTQL-VVNTCYATFVKLEA 327
>gi|397569514|gb|EJK46791.1| hypothetical protein THAOC_34522 [Thalassiosira oceanica]
Length = 702
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 60/264 (22%), Positives = 109/264 (41%), Gaps = 25/264 (9%)
Query: 78 TLQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK----W 132
++++W +G+ + + + D + GL ++ G +++VP S VI++D
Sbjct: 99 SMEEWCVQNGVERIEGIQLYTEDGADYGLSTQVDLPAGSTIVYVPSSTVISSDQVAEDLG 158
Query: 133 SCPEAGE----VLKQCSVPDWPLLATYLISEASFEKSSRWSNY--ISALPRQPYSLLYWT 186
EA E ++ + PL + +EK +Y + +LPRQ Y+ + T
Sbjct: 159 GSLEAAENALIQMETLTARRIPLFRLMVFVLKEYEKGVNSKHYAWLQSLPRQYYNGVSMT 218
Query: 187 R---AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
L Y ER YN+ + Y D+ E V + W++
Sbjct: 219 EDCFGVLPPYAAKLAKSERE---------NYNNFVAGLREGYVDIADEIVDDDTIVNWAY 269
Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
+ +R + + + + +VP DM+NHS E + +D +V TT G +
Sbjct: 270 NVALTRFIEVWQPNRQKKIVPMVDMINHSSEPNVDISFDDDGNCLV-TTLYDIPAGSALT 328
Query: 304 ISYGKKSN-GELLLSYGFVPREGT 326
IS G +N + YGF+P + T
Sbjct: 329 ISLGDPTNPTPIFAQYGFLPLDCT 352
>gi|392864101|gb|EJB10745.1| hypothetical protein CIMG_00433 [Coccidioides immitis RS]
Length = 435
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 94/421 (22%), Positives = 157/421 (37%), Gaps = 71/421 (16%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+W D G+ +A + G+ AL+ I GE ++ VP S ++T D P
Sbjct: 16 FTQWAKDQGIQINGVAAVRFPGRGIGIAALRGIDAGETIVSVPTSSLLTLDK---IPSTF 72
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----------PYSLLYWTR 187
Q P + A YL + + SR++ + + P P L+
Sbjct: 73 REKFQGDTPVQGIFAAYLACDD--DARSRYAPWRATWPTMRDFEDSIPLLWPKYLIGTPG 130
Query: 188 AELDRYLEASQIRERAIERIT--NVIGTYNDLRLRIFSKYPDLFPEEV------------ 233
EL E + + + ++ G +N L R+ D P+
Sbjct: 131 DELKGQGETTGRGQEVFPSLLPPSISGHFN-LSNRVGRFSGDYTPDHQNLLENQRSRFRK 189
Query: 234 -----------FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCE 274
N+E F + + +R + D V AL P+AD NHS
Sbjct: 190 AFSRVKLACPGINLEIFTYYWFATHTRCFFYVAKDSEVPEDRNDAMALCPFADYFNHSSN 249
Query: 275 ---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
+ D D G FT + Y GE+VF+ YG ++ LL YGFVP E N D+
Sbjct: 250 DPGCKASFDGD----GYTFTATKSYAKGEEVFVCYGNHTSDVLLTDYGFVPDE--NKWDA 303
Query: 332 VELP----LSLKKSDKCYKEKLEALRKYGLS-ASECFPIQITGWPLELMAYA----YLVV 382
+ L + K + Y ++ L Y ++ A CF ++ L M+ Y+
Sbjct: 304 IFLDDIVLQDINKIKRRYLKEDNYLGNYQVTRAGPCFRTEVAA-SLTYMSVDDWSLYVGG 362
Query: 383 SPPSM--KGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQVKE 440
+ PS K + + A+ K + D+ ++ I ++ S R+ Q++E
Sbjct: 363 TIPSSFDSEKTDRIVASWIRKYRDEADLAIARVEGMVRSGIYETANRLKSIILRWKQIRE 422
Query: 441 L 441
L
Sbjct: 423 L 423
>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
Length = 479
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 86/205 (41%), Gaps = 43/205 (20%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRERAIER 206
+LI E + S W YI ALP QP W AEL LE + + E +E+
Sbjct: 92 FLIKELLRGQESFWYPYIQALP-QPEDFDDWALPPFWPEEEAEL---LEGTNV-EIGLEK 146
Query: 207 ITNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR----------- 252
I +G + D R + + D + + E ++W++ I SR R
Sbjct: 147 IREDLGREFRDARNLLIASQKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSLVLSEEQQQ 206
Query: 253 -LP---SMDGRVALVPWADMLNHSCEVETFLDYDK------------SSQGVVFTTDRQY 296
LP S++ L+P D+ NH V D S V R++
Sbjct: 207 SLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGAAVQLKVGREH 266
Query: 297 QPGEQVFISYGKKSNGELLLSYGFV 321
+PG+Q+F +Y K+N ELLL YGF+
Sbjct: 267 KPGQQIFNNYSPKTNAELLLGYGFM 291
>gi|345328941|ref|XP_001507526.2| PREDICTED: LOW QUALITY PROTEIN: N-lysine methyltransferase
SETD6-like [Ornithorhynchus anatinus]
Length = 495
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 86/198 (43%), Gaps = 11/198 (5%)
Query: 166 SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
S W +Y S P ++W + E R L+ + + E + + N+ Y+ + L
Sbjct: 160 SPWHHYFSLWPDLNDLDHPMFWPKEERGRLLQGTGVPEAVEKDLANISHEYSSIVLPFTE 219
Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVE 276
+PDLFP ++E + ++ + + P + +VP AD+LNH
Sbjct: 220 AHPDLFPAGSCSLELYCRLVAVVMAYSFQEPLEEEEEDEEPNPPLMVPVADILNHVANHN 279
Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
L+Y +V T R G ++F +YG+ +N +L+ YGF N D+ ++ +
Sbjct: 280 ANLEYAPECLRMVAT--RPIPKGHEIFNTYGQMANWQLVHMYGFAEPYPGNTDDTADIQM 337
Query: 337 SLKKSDKCYKEKLEALRK 354
++ + EA R+
Sbjct: 338 VTVRAAALQGAETEAERQ 355
>gi|320580679|gb|EFW94901.1| hypothetical protein HPODL_3273 [Ogataea parapolymorpha DL-1]
Length = 423
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 90/383 (23%)
Query: 71 DSLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKN-IRKGEKLLFVPPSLVIT 127
+SLEN L KW G + AI + D G G LKN + E L+ VP +L IT
Sbjct: 5 NSLEN---LLKWSKSHGAYFDNVEFAIDR-DKGVHG--TLKNEVTFDEPLIIVPEALFIT 58
Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATY--------------LISEASFEKSSRWSNYIS 173
E+ K D P+ LI S S+ + YI+
Sbjct: 59 P----------ELAKGVFGLDNPIDDLSLLLLAKLKFDKKETLIDGNSL--STMYEPYIA 106
Query: 174 ALPRQPYSL---LYWTRAE------LDRYLEASQIRERAIERITNVIGTYNDLR-LRIFS 223
LP + L+WT E D Y + RE ER ++++ N+ + L +
Sbjct: 107 FLPDSCLEVGLPLFWTDHEQELLKGTDAYPRLKRTREELFERWSSLMSLLNEQKKLDLVM 166
Query: 224 KYPDLFPEEVF--NMETFKWSFGILFSRLVRLPSMDGRVA--------LVPWADMLNH-- 271
K L + + + E F W++ I +R P+ + + L P D+LNH
Sbjct: 167 KEAPLCKDSLSWKSFEAFSWAYSIYCTR--AFPNFLRKQSERSLNIGFLCPIVDLLNHKN 224
Query: 272 ------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
+CE +F V + ++ + GE+++ +YG KSN +LLL+YGF+ +
Sbjct: 225 GEKVTWTCEDNSF---------VFKASAKRIRAGEEIYNNYGNKSNTDLLLNYGFILND- 274
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-------CFPIQ-ITGWPLELMAY 377
N S++ L L +++S +EA K+GL E CF + +T P +L+ +
Sbjct: 275 -NESETTTLTLKVEES------VIEAGTKFGLKLPEGTSANGICFNLSLVTPLPKDLLRF 327
Query: 378 AYLVVSPPSMKGKFEEMAAAASN 400
+ P+ ++ A N
Sbjct: 328 MGFLHQLPAGDSPLRFISDAYEN 350
>gi|317035930|ref|XP_001397212.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
Length = 434
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 14/115 (12%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
++PE + M + W I+ SR S + + +VP+AD NH + +
Sbjct: 186 VYPETEWKMFAYYWC--IINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACEV 243
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
++D + F R+Y+ GE+V++SYG SN LL+ YGF TNPSD + L
Sbjct: 244 NFD--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLIEYGFTL--STNPSDCIYL 294
>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
Length = 482
Score = 58.2 bits (139), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 94/241 (39%), Gaps = 58/241 (24%)
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY--- 193
E ++C P W L L+ E + + SR +Y+ LPR + + + W+ +LDR
Sbjct: 60 EAWQRC--PWWVRLGVRLLKERADGEGSRLQDYVGMLPRPGETGAPVNWSAEQLDRLHYP 117
Query: 194 --LEASQIRERAIE-----------RITNVIGTY-NDLRLRIFSKYPDLFPEEVFNMETF 239
L +++ R E R N T D R+ S D F
Sbjct: 118 RLLSQIKLQRRLFEGFRKFLLADARRGDNAPSTREGDGVNRLVSALAD--------PAMF 169
Query: 240 KWSFGILFSRLVRLPSMDG--------------------------RVALVPWADMLNHSC 273
W+ + SR +LP R+AL+P D +NH
Sbjct: 170 SWALECVLSRAFQLPPRSAAALVVEEGDDVPVKAPEVTPPAPDEMRMALLPLIDSINHYS 229
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
+ T + Y ++ + + + PG+ F SYG SN +LL YGFV E NPSD+
Sbjct: 230 RMPTHM-YWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYGFV--EQDNPSDTYV 286
Query: 334 L 334
L
Sbjct: 287 L 287
>gi|397506651|ref|XP_003823836.1| PREDICTED: N-lysine methyltransferase SETD6 [Pan paniscus]
Length = 386
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 86/201 (42%), Gaps = 17/201 (8%)
Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+SRW Y + P + ++W E L+ + + E + + N+ Y + L
Sbjct: 50 ASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFM 109
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
+PDLF V ++E + ++ + + P + +VP AD+LNH
Sbjct: 110 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 169
Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
L+Y + +V T QP G ++F +YG+ +N +L+ YGFV N D+
Sbjct: 170 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 224
Query: 333 ELPLSLKKSDKCYKEKLEALR 353
++ + + K EA R
Sbjct: 225 DIQMVTVREAALQGTKTEAER 245
>gi|148671822|gb|EDL03769.1| SET domain containing 4, isoform CRA_c [Mus musculus]
Length = 269
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 17/212 (8%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + + RGL++ ++++G+ ++ +P S ++T D+ G
Sbjct: 35 LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+K+ P PLLA T+L+SE S W +Y+ LP+ Y+ E+ L
Sbjct: 94 PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
S ++ +A E+ V + R FS LF E VF+ F W++ + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210
Query: 254 PSMDGR--------VALVPWADMLNHSCEVET 277
S AL P+ D+LNHS V+
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQV 242
>gi|350636529|gb|EHA24889.1| hypothetical protein ASPNIDRAFT_40813 [Aspergillus niger ATCC 1015]
Length = 437
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 64/142 (45%), Gaps = 17/142 (11%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSMD------GRVALVPWADMLNHSCEVETFLDY 281
++PE + M + W S P D + +VP+AD NH + +++
Sbjct: 197 VYPETEWKMFAYYWCIINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACEVNF 256
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
D + F R+Y+ GE+V++SYG SN LL+ YGF TNPSD + L
Sbjct: 257 D--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLIEYGFTL--STNPSDCIYL------- 305
Query: 342 DKCYKEKLEALRKYGLSASECF 363
D + L +K L+ E F
Sbjct: 306 DDIIFQDLSISQKQELAKQEIF 327
>gi|426382401|ref|XP_004057794.1| PREDICTED: N-lysine methyltransferase SETD6 [Gorilla gorilla
gorilla]
Length = 541
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 19/202 (9%)
Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+SRW Y + P + ++W E L+ + + E + + N+ Y + L
Sbjct: 205 ASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFM 264
Query: 223 SKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
+PDLF V ++E + +SF + V +VP AD+LNH
Sbjct: 265 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPV-MVPAADILNHLAN 323
Query: 275 VETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
L+Y + +V T QP G ++F +YG+ +N +L+ YGFV N D+
Sbjct: 324 HNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDT 378
Query: 332 VELPLSLKKSDKCYKEKLEALR 353
++ + + K EA R
Sbjct: 379 ADIQMVTVREAALQGTKTEAER 400
>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
Length = 408
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ +Q G+Q++I YG +SN E ++ GF N D V++ L + KSD+ Y K E L
Sbjct: 115 QDFQAGDQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 172
Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
+ G+ S F + T P+ A+L V
Sbjct: 173 RAGIPTSSVFALHSTEPPISAQLLAFLRV 201
>gi|317033156|ref|XP_001394952.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
Length = 415
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 139/337 (41%), Gaps = 59/337 (17%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G++A + I K L+ VP S ++T P P L A YL + AS
Sbjct: 39 GMIATRKIEKDSILVKVPHSAMLTPSK---LPSTFTSRFPADTPTHTLYAAYL-TNASPS 94
Query: 164 KSSRWSN-------YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN- 215
W N + S++P +L+ + + L + S+I++ I+N T
Sbjct: 95 HLKPWRNTWPTMEDFTSSMP-----ILWSSTSPLTPNSKTSKIQDLLPPSISNTWSTITP 149
Query: 216 ------------------DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ RLR + +FPE + T+ W S LP
Sbjct: 150 GKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPETDKELFTYHWVIVNTRSFFYLLPGA 209
Query: 257 ------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ +ALVP+AD NHS +V + +D + VF ++Y GE++++SYG S
Sbjct: 210 EMPEDRNDAMALVPFADYFNHS-DVACNVKFD--GEEYVFRAAKEYNEGEEIYMSYGPHS 266
Query: 311 NGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG----LSASECFPI 365
N L YGF TN S+++ L + L+ + +E+LE + YG S C+
Sbjct: 267 NDFLFTEYGFY--LDTNASETLYLDEIILQDLNASKQEELEFHQYYGNYQLTSDGVCYRT 324
Query: 366 QI----TGWPLELMAYAYLVVSPPSMKGKFEEMAAAA 398
+I T PL L L S +G E+M+AA
Sbjct: 325 EIAAGLTYMPLRLWQDYVLGY---STEGVDEKMSAAV 358
>gi|407396203|gb|EKF27396.1| hypothetical protein MOQ_008884 [Trypanosoma cruzi marinkellei]
Length = 572
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 9/140 (6%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
A L YL+ + R + + NV + + L P EE +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANVEAEFQLCKSVLSFLQTMPHSNGEERSMPVTVEQFLWA 256
Query: 243 FGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ L +R + D V +L+PW D NH+ + YD+ + +F + GEQ
Sbjct: 257 YNTLMTRGF---AYDPEVWSLMPWVDYFNHALTSNATMKYDERRRAYIFEALFPIETGEQ 313
Query: 302 VFISYGKKSNGELLLSYGFV 321
+F+ YG ++ ELLL YGF
Sbjct: 314 IFLPYGAYTDMELLLWYGFT 333
>gi|149238199|ref|XP_001524976.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451573|gb|EDK45829.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 488
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/95 (36%), Positives = 48/95 (50%), Gaps = 9/95 (9%)
Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
+ P+ D +NHSC+ L D ++G TT Y+PG+Q+++SYG SN LL YGF
Sbjct: 285 TMAPYIDFINHSCDDHCTLKID--AKGFQITTTTAYKPGDQLYLSYGPHSNEFLLCEYGF 342
Query: 321 V---PREGTNPSD----SVELPLSLKKSDKCYKEK 348
V P E +D S LP+ C KE
Sbjct: 343 VVTLPEEENRWNDLDISSYLLPMFNANQIDCLKEN 377
>gi|255078794|ref|XP_002502977.1| set domain protein [Micromonas sp. RCC299]
gi|226518243|gb|ACO64235.1| set domain protein [Micromonas sp. RCC299]
Length = 536
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/259 (25%), Positives = 107/259 (41%), Gaps = 51/259 (19%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEAS 161
RGL A ++I GE +L +P + I EA EV+ W + LA L+ E +
Sbjct: 107 RGLEASRDIENGEPVLRLPLEMGICDYQDGHPAEAWEVMSNAP---WGVRLACRLLQERA 163
Query: 162 FEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
+ S ++ YI+ +P P S L WT E+ + E + + + T+
Sbjct: 164 KGEDSDYAPYIALIPESVPGSPLMWTDDEVASLQYPPAVAE--AREMRDAVATW------ 215
Query: 221 IFSKYPDLFPEEV--FNMETFKWSFGILFSR---LVRLPSMDGRV-ALVPWADMLNHSCE 274
F K P + +++ FK + ++ SR + S +G AL+P AD+LNH +
Sbjct: 216 -FRKLSAEAPVALAGADLDAFKSAVSVVHSRTYGVASSASGEGYFRALLPLADLLNHGGD 274
Query: 275 ------------------------------VETFLDYDKSSQGVV-FTTDRQYQPGEQVF 303
+ + S +GV+ F R P E+
Sbjct: 275 EYPESASSPANRGGKANKSPASPKWPPAGCSDNIAWSELSDEGVIEFAATRAIAPHEEAA 334
Query: 304 ISYGKKSNGELLLSYGFVP 322
+SYG++SN L+ YGFVP
Sbjct: 335 MSYGERSNDHFLVYYGFVP 353
>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
Length = 854
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/256 (24%), Positives = 119/256 (46%), Gaps = 35/256 (13%)
Query: 102 ERGLVALKNIRKGEKLL------FVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
+RGLVA++ I+ L+ + PS V+ S PE+ + L ++ D +LA +
Sbjct: 59 DRGLVAVEEIKPNSTLIELDLDDVIYPSTVLK-----SVPESEKNLF-LAMSDDLMLAAF 112
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
LI E ++SRW ++ LP+ P +T++E+ + + + I+ I+R ++ TY
Sbjct: 113 LIQERIKGRASRWYPWLQTLPKHPTVPSSFTQSEIKEFEDPAIIQRLNIQR-SDYYSTYF 171
Query: 216 DLRLRIFSKYPDL---FPEEVF--NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
+ + + + F + ++ + F+W + ++ +R V + R+ L+P D N
Sbjct: 172 AFTRHMCTYFLQVDAPFRDRLWACSYSGFEWGYTMVITRTV----TENRL-LIPLMDYRN 226
Query: 271 HSCEVETFLDYDKSSQGVVF----------TTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
F D+S + F TD++ + G QV++ Y + L +GF
Sbjct: 227 FISTDSPFEAVDRSHERTHFIINEQNQLRVVTDKRVKRGRQVYLDYEAFPSHYYLQHFGF 286
Query: 321 VPREGTNPSDSVELPL 336
VP +N D + +PL
Sbjct: 287 VP--ISNIHDCLLIPL 300
>gi|224001788|ref|XP_002290566.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973988|gb|EED92318.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 595
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/159 (26%), Positives = 73/159 (45%), Gaps = 31/159 (19%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK---------- 283
F+ E F+++ ++ SR + DG + L+P+ D NH D+D
Sbjct: 296 FSQEGFRYAVSLVRSRSFFV---DGSLRLLPYLDFANHD-------DFDSLELVGGGIGT 345
Query: 284 ---SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP-----REGTNPSDSVELP 335
S++G + + + + G+++ ISYG K + LL +GFVP GT + + EL
Sbjct: 346 LWGSAKGALMKSGKALEVGDEIRISYGPKGPADYLLDHGFVPPMCQTTSGTGGAITAELS 405
Query: 336 LSLKKSDKCYKEKLEALRKYGLSASECFPIQ---ITGWP 371
+ SD+ +KL+ L + P+Q +TG P
Sbjct: 406 FEIDDSDRFRDDKLDVLEYETYDLAPMEPLQVFDVTGGP 444
>gi|219126019|ref|XP_002183265.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405540|gb|EEC45483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 344
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 7/101 (6%)
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY-----DKSSQGVVFTTDRQ 295
W+ + SR L M ++ P DMLNH C V T D+ + + ++
Sbjct: 132 WAMACVCSRSNFLNDMS--YSMTPLLDMLNHDCTVRTSAKVSKNKLDEDDKWLSLQIEQC 189
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
Y+ G+QVFISYG SN E L YGFV R + +S+++ +
Sbjct: 190 YRAGDQVFISYGSLSNLETLCDYGFVDRSNSCNFESIQVQM 230
>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
Length = 595
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 140/351 (39%), Gaps = 51/351 (14%)
Query: 38 HRIVVHCSVST----------TNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSG 87
H+ +H +V T T ASR V ++M P + S T W +
Sbjct: 66 HKAPLHTAVCTDTAGTPSRKQTEFASRASPEVNKHM-PSKPSMSSQLPIDTFPAWAHLND 124
Query: 88 LPPQKMAIQKVDVGER-GLVALKNIRKGEK------LLFVPPSLVITADSKWSCPEAGE- 139
+ + +Q V G+ GLVA ++ E + +P LV++A++ +
Sbjct: 125 VQFTHVNLQDVGEGKGFGLVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHN 184
Query: 140 -------VLKQCSVPDWPLLATYLISEASFEKSSR--------WSNYISALPRQPYSLLY 184
V +Q + D + YL+S+ F +SSR W+ YI LPR
Sbjct: 185 FKQLLEAVGRQSTRGD---IMLYLVSQ--FAQSSRPKGLSPTPWTEYIRLLPRPIPVPTM 239
Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD----LFPEEVFNMETFK 240
WT E R L E A+E +G D + +P L+ E ++E +
Sbjct: 240 WTEPE--RLLLNGTSLEAALEAKLLSLGKEFDTLREVSEDFPFWNEFLWSGEEVSLEDWV 297
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR-QYQPG 299
SR + LP A+VP DM+NHS + + + D V+ + G
Sbjct: 298 LVDAWYRSRCLELPR--SGTAMVPGLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSG 355
Query: 300 EQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
E+V ISYG K E+L SYGF+ + N D + L L D + KL
Sbjct: 356 EEVTISYGDAKPASEMLFSYGFI--DPNNIVDKLTLRLDPFPDDPLARAKL 404
>gi|134079652|emb|CAK97078.1| unnamed protein product [Aspergillus niger]
Length = 443
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 121/291 (41%), Gaps = 48/291 (16%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G++A + I K L+ VP S ++T P P L A YL + AS
Sbjct: 39 GMIATRKIEKDSILVKVPHSAMLTPSK---LPSTFTSRFPADTPTHTLYAAYL-TNASPS 94
Query: 164 KSSRWSN-------YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN- 215
W N + S++P +L+ + + L + S+I++ I+N T
Sbjct: 95 HLKPWRNTWPTMEDFTSSMP-----ILWSSTSPLTPNSKTSKIQDLLPPSISNTWSTITP 149
Query: 216 ------------------DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+ RLR + +FPE + T+ W S LP
Sbjct: 150 GKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPETDKELFTYHWVIVNTRSFFYLLPGA 209
Query: 257 ------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
+ +ALVP+AD NHS +V + +D + VF ++Y GE++++SYG S
Sbjct: 210 EMPEDRNDAMALVPFADYFNHS-DVACNVKFD--GEEYVFRAAKEYNEGEEIYMSYGPHS 266
Query: 311 NGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYGLSAS 360
N L YGF TN S+++ L + L+ + +E+LE + YG S
Sbjct: 267 NDFLFTEYGFY--LDTNASETLYLDEIILQDLNASKQEELEFHQYYGYVTS 315
>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
FGSC 2508]
gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 504
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 83/201 (41%), Gaps = 33/201 (16%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIRERAIERI-TN 209
YLI + KSS W+ YIS L P L W AE D L AI+ I +N
Sbjct: 108 YLIQQYLKGKSSFWAPYISTLA-DPSQLDKWALPPFWAEDDIELLKGTNAYVAIQEIQSN 166
Query: 210 VIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR---------------- 252
V Y R +I K + FP+ + + W++ + SR R
Sbjct: 167 VKSEYKQAR-KILKK--EGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERL 223
Query: 253 LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
LP +D L P D+ NHS + + + Y PG+QVF +YG K
Sbjct: 224 LPEGSKIDDFSILQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFK 283
Query: 310 SNGELLLSYGFVPREGTNPSD 330
+N ELLL YGF+ NP D
Sbjct: 284 TNSELLLGYGFI----INPKD 300
>gi|303270905|ref|XP_003054814.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462788|gb|EEH60066.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 16/122 (13%)
Query: 239 FKWSFGILFSRLVRL-------PSMDGRVALVPWADMLNHS-------CEVETFLDYDKS 284
++W+ ++ SR RL + R +VP+ D+LNH CE + D
Sbjct: 222 WRWAMSMVHSRTFRLEEPAAGVAGFETRRVMVPYVDLLNHDSRADVWQCEWDCEWDLGGG 281
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
V T R + GE+V +SYG++ + L +GF+P NP ++V L + +++
Sbjct: 282 GGTFVVTATRDVRAGEEVLVSYGERCDRHFFLFFGFLP--APNPHNTVALFANAREAAAW 339
Query: 345 YK 346
Y+
Sbjct: 340 YE 341
>gi|313225781|emb|CBY07255.1| unnamed protein product [Oikopleura dioica]
Length = 346
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 68/156 (43%), Gaps = 38/156 (24%)
Query: 237 ETFKWSFGILFSRLVRLPSMDGR--------------VALVPWADMLNHSCEVETFLDYD 282
E W+F ++ SR LP D L P+ D++NHS + + D
Sbjct: 159 EDLTWAFSMVLSRTFSLPKYDKSSDFDYCSQVDSSKSAFLCPFMDLINHSSAPNCYYETD 218
Query: 283 KSSQGVVFTTDRQYQPGEQVFISY-GKKSNGELLLSYGFVPREGTN-------------P 328
+ V DR+ Q E++FI+Y G KS+ LL YGF G N P
Sbjct: 219 SETGDFVLRADRELQQKEELFITYGGSKSDHVLLAFYGFCLPPGVNRNSYIVFSPNFIGP 278
Query: 329 SD-----SVELPLSLKKSDKCYKE----KLEALRKY 355
S + + L+ KK+ KC+KE KLE+ RK+
Sbjct: 279 SSHSKFTAFKFFLNSKKA-KCFKESLNKKLESWRKF 313
>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
Length = 458
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
+ +Q G+Q++I YG +SN E ++ GF N D V++ L + KSD+ Y K E L
Sbjct: 165 QDFQAGDQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 222
Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
+ G+ S F + T P+ A+L V
Sbjct: 223 RAGIPTSSVFALHSTEPPISAQLLAFLRV 251
>gi|428165190|gb|EKX34191.1| hypothetical protein GUITHDRAFT_147375 [Guillardia theta CCMP2712]
Length = 681
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/300 (24%), Positives = 123/300 (41%), Gaps = 27/300 (9%)
Query: 42 VHCSVSTTNDASRT---KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKV 98
V VS ND R+ + M P + L++W +G+ + I+++
Sbjct: 39 VVSGVSKLNDRGRSECPRACPRTVMAPLMASGTGARREAMLEEWARANGIFCM-LNIKRM 97
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS----VPDWPL-LA 153
GER VA + GE+++ VP + + P + Q + W LA
Sbjct: 98 ADGERKAVAANALAGGERVVRVPREVSFVTFQGDASPLPPSFVDQDTWQQLDEHWNAKLA 157
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWT---RAELDRYLEASQIR-----ERAIE 205
L+ E + W++ A + P + + R+ +EAS+ R + A E
Sbjct: 158 LMLLHE--MRRGVHWTDEELAELQNPRLVAAASDSKRSHAGLTIEASEWRYFDRMQAAGE 215
Query: 206 RITNVIG--TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR--VA 261
+ T+++ T+ R+ P ++ +W+ SR + + +G
Sbjct: 216 QETSMLDQETFQRDLHRLLCDRMTAPP----SLAELRWAMDCAQSRSFGVSTTEGVKCFC 271
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L P ADMLNH D+D ++ T R + GE+V ISYG+ SN +LL YGFV
Sbjct: 272 LCPLADMLNHDPSSPALFDFDPATSCFAIRTSRAWSEGEEVTISYGELSNEDLLQFYGFV 331
>gi|388578758|gb|EIM19096.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
Length = 413
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 122/292 (41%), Gaps = 39/292 (13%)
Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE------ 159
+A +I + +++ V ++ IT + C EA + L +P+ L+A Y+
Sbjct: 38 LARNDIPEDAEIVSVNKNICITEST---CKEAFKNLNNEGLPEKLLIAVYISLHYIYDQL 94
Query: 160 -ASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
S + Y+ LP Q + LYWT EL+ Y + + + ER Y
Sbjct: 95 PESLKSKLHHRRYVDILPEIGQTLTTLYWTDDELE-YTKPTSLFNATKEREIQWKSDY-- 151
Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS---MDGRVA---LVPWADMLN 270
+ K+ EVF + FK S ++ SR PS D ++ LVP D+ N
Sbjct: 152 ---EVVKKWSRANDVEVFTWDVFKHSLTMISSR--AFPSKLIQDDEISSPMLVPLWDIGN 206
Query: 271 HSCE---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
H + V T + Y + + + Q +VF +YG K ELLL+YGF +
Sbjct: 207 HKSQSAIVWTDVKY-TGTDNIGMKLPQGAQKDNEVFNNYGGKPTNELLLAYGF----AVD 261
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC----FPIQIT-GWPLEL 374
+ +P + + K + L+K+GL +C F I + G PL L
Sbjct: 262 NINYDVVPFRIGAGVSLSESKKDILKKHGLLNEDCTLKTFNINLNEGLPLGL 313
>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
Length = 396
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 5/71 (7%)
Query: 255 SMDGRVALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
+ D ++AL P AD+LNHS E E D + + DR+Y+ GE+V+I YG SN
Sbjct: 197 TKDDKMALQPVADLLNHSDEGCEVVFD----TGCYTISADREYKQGEEVYICYGTHSNDF 252
Query: 314 LLLSYGFVPRE 324
L++ YGF P E
Sbjct: 253 LMVEYGFCPEE 263
>gi|154290554|ref|XP_001545870.1| hypothetical protein BC1G_15621 [Botryotinia fuckeliana B05.10]
Length = 336
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 71/157 (45%), Gaps = 22/157 (14%)
Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
FPE + F +++ I+ SR PS + R+AL P+AD +NHS E
Sbjct: 101 FPEPPITYDEFIYNYSIVNSRTFYYLSPTIKPSKPQPSKENRLALNPFADYINHSSE--P 158
Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--- 334
+D S G T + + G +V ISYG +N LL+ YGF+ + N D V L
Sbjct: 159 TVDATLSRAGYTLTASQPIKQGSEVHISYGSHNNDFLLVEYGFILED--NRWDEVTLDPW 216
Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQIT 368
PL L K + E+ L KY L C+ Q+
Sbjct: 217 ITPL-LSVEQKEHLEETGFLGKYLLDRDTICYRTQVV 252
>gi|159474448|ref|XP_001695337.1| predicted protein [Chlamydomonas reinhardtii]
gi|158275820|gb|EDP01595.1| predicted protein [Chlamydomonas reinhardtii]
Length = 360
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/236 (23%), Positives = 101/236 (42%), Gaps = 31/236 (13%)
Query: 92 KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL 151
++ I + + G RGL + ++KGE ++ +P +V++ + + + ++LK+ P
Sbjct: 45 RVTISRDEAGVRGLYTTQPVKKGEVIVSIPQHIVLSVKNVAAAEASPQLLKEIHSP---- 100
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLL--YWTRAELDRYLEASQIRERAIE---- 205
SR Y+ LP P +L Y E +YL + E+
Sbjct: 101 -------------CSRLRPYLDTLP-GPDGVLTAYNWPEEYIKYLADPAMEEQLKNSFKL 146
Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPW 265
N +ND + + P+ + ++ ++ +L SR + G ++LVP
Sbjct: 147 HARNTWLGHNDDEMEV--TIPEAIGRKNITLKEWEHVVSLLSSRTFSI--RKGALSLVPV 202
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
D++NH ++ +S V + GEQV I+YG N ELL+ YGFV
Sbjct: 203 LDLVNHDVR---DINQLGNSSTVDLVAGKDLAAGEQVTITYGSMRNDELLMYYGFV 255
>gi|451997605|gb|EMD90070.1| hypothetical protein COCHEDRAFT_1022164 [Cochliobolus
heterostrophus C5]
Length = 408
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 116/279 (41%), Gaps = 30/279 (10%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP-PSLVITA 128
+D N + W +G+ +A + G+VA ++I+KG+KL+ V SLV A
Sbjct: 5 LDPGTNHTDFVSWAKSNGVEINGIAPARFVGRGMGIVAAQDIKKGDKLVHVSNKSLVHVA 64
Query: 129 ---DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS---SRW---SNYISALPRQP 179
P+ V + ++ LA + + + + W S++ S +P
Sbjct: 65 LPSIHSLKLPDTITVHGKLALA----LALWYTGRKDHDYTLWQNVWPTASDFKSTMPFYY 120
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
S L R L Q++ +ER I +N + Y L + N TF
Sbjct: 121 PSPLQSLLPPAARTLLTKQLQN--LERDWTSITPHNPGITKETYTYTWL----IVNTRTF 174
Query: 240 KWSFGILFSRLVRLP------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
WS+ L + LP + D + P+ D NHS ++ D S G T D
Sbjct: 175 YWSYPDLPNASPLLPKRRAKLTADDCYCMCPFTDYFNHS---DSGCDPQMSPSGYTVTAD 231
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
R Y GE+VF++YG +N LL YGF+ +E N D V
Sbjct: 232 RAYAAGEEVFVTYGPHTNDFLLTEYGFILQE-KNRHDGV 269
>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
Length = 478
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/270 (25%), Positives = 116/270 (42%), Gaps = 43/270 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+VA+ +IR + +L VP V T D+ + L SV L +E + +
Sbjct: 32 GIVAVCDIRANQTILSVPTRAVRTIDTVPK--HIKDALHGVSV------HGILAAEIALD 83
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
S ++ + + LP TR +L+ + S+++ +R +++ N R
Sbjct: 84 DSDDFAIWRTVLP---------TREDLEGGMPMMWPSELQALLPKRAKDLLDNQNTTFRR 134
Query: 221 ----IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
+ +P L +E + N TF S + ++ + R+ +P AD+ N
Sbjct: 135 ECDIVLKAFPTLTRDEYMLSWVLINTRTFYNSMPKM-----KIYAHSDRLVCMPVADLFN 189
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGTN 327
H + S+ G TDR Y+ GE+V++SYG SN LL YGF+ R
Sbjct: 190 HDQGCKLVY----SALGYSVQTDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDEV 245
Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
D V LPL L K+ + E + L +Y L
Sbjct: 246 YLDEVILPL-LNKTQRAELESVGFLGRYTL 274
>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
Length = 481
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/273 (26%), Positives = 113/273 (41%), Gaps = 57/273 (20%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGEVLKQCSVP 147
P K+A + + RG+VAL NI + E+L +P +LV++ +SK + L S
Sbjct: 34 PKIKIADLRSEGAGRGIVALSNINEDEELFAIPQNLVLSFQNSKL------KDLLHISEK 87
Query: 148 D---WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
D W L +I E +S WS Y LP + +L++WT EL RE +
Sbjct: 88 DLGPWLCLILVMIYEYLQGGASPWSRYFQVLPTEFDTLMFWTDEEL---------RELSG 138
Query: 205 ERITNVIGTYND----LR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM 256
+ N IG + LR I S P LFP + + ++ G L S R+ S+
Sbjct: 139 SAVLNKIGKSDAEAAILRDIFPIVSTNPHLFP-PISGLGSYDSPDGRATLLSLAHRMGSL 197
Query: 257 -------------------DGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGV 288
DG + +VP AD+LN + + + +
Sbjct: 198 IMAYAFDIEKGEDEEGEVQDGYITDEGEELTKGMVPLADLLNADADRNNARLFQEDGY-L 256
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ + + GE++F YG+ +LL YG+V
Sbjct: 257 AMKSIKPIRNGEEIFNDYGELPRADLLRRYGYV 289
>gi|168021415|ref|XP_001763237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685720|gb|EDQ72114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 489
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/275 (23%), Positives = 111/275 (40%), Gaps = 44/275 (16%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK-QCSVPDWPLLATYLI 157
D G RGL A ++++ GE +L VP ++ S E L S+ +L +L+
Sbjct: 51 DAGGRGLAAARDLKLGELILRVPEKALMNGRSARLDAELTRALALYPSLSHVQVLCVHLL 110
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
E + ++S Y+ LPR ++ +++ E +A Q+++ A+ V+ +
Sbjct: 111 REIAKGRTSERFPYLVHLPRYYHTASFYSPFE----AQALQVKD-AVSMAEGVVQNSREE 165
Query: 218 RLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC--- 273
L+ L F ++ + W+F + SR + +P D L P D N++C
Sbjct: 166 WLQARPVLEKLGLGRRFCTLQGWLWAFATISSRTLYVP-WDEAGTLCPVGDFFNYACPGV 224
Query: 274 --------------EVETFLDYDKSSQGVVFTTDR-------------------QYQPGE 300
E + + D + G + DR YQ G+
Sbjct: 225 PYNLPPTAQDTQMREGDLISEEDVDTSGGIEIRDRLRDGGFEDERGEYCFYARQDYQEGQ 284
Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
QV + YG +N ELL YGF+ N +ELP
Sbjct: 285 QVLLCYGTYTNLELLEHYGFLLPFNPNDKVHIELP 319
>gi|118383229|ref|XP_001024769.1| hypothetical protein TTHERM_00237390 [Tetrahymena thermophila]
gi|89306536|gb|EAS04524.1| hypothetical protein TTHERM_00237390 [Tetrahymena thermophila
SB210]
Length = 840
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/177 (28%), Positives = 84/177 (47%), Gaps = 14/177 (7%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVL--KQCSVPDWPLLATY 155
G A + I + L+ VP L++ +D + E K+ + + ++ Y
Sbjct: 81 GFRATEEINPSDILIKVPRKLILNTRTCMFSDIQKVVKENLNFFTAKKGGLVEDHIMLVY 140
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ + K+S W + IS L R ++ +W E +YL+ RER R+ Y
Sbjct: 141 LLRQYQLGKASPWYHLISNLSRYIDTVDFWEDEEY-KYLDDPIFRERI--RLQRNYFNYI 197
Query: 216 DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
LR I KYPDLF E+ ++ E +W + ++SR SMD +++VP +M+NH
Sbjct: 198 AKNLREILPKYPDLFEEQTYSEENIRWIYIHIWSRSFG-GSMD-YISMVPIVEMMNH 252
>gi|356574815|ref|XP_003555540.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like [Glycine
max]
Length = 506
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 72/360 (20%), Positives = 137/360 (38%), Gaps = 72/360 (20%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCP 135
++W+ GL A++ VD E G+ AL +++G+ + +P +T +
Sbjct: 8 AFKRWMKSKGLEWSD-ALEFVDTPEEGVEVRALCQLKEGDVVAKMPKEACLTTKTSG--- 63
Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
A +++++ + LA ++ E S + S ++ Y+ LP Q + WT E++ L
Sbjct: 64 -ARKIIEEAGLDGHLGLAFAIMYERSLDGDSPFAGYLQLLPHQECVPIVWTLDEVNELLC 122
Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
+++ + E + + + L + P + F +E + + ++ SR +
Sbjct: 123 GTELHQTVQEDKALIYDDWKENILPLLDLAPLKLNPKFFGVEQYFAAKSLISSRSFEIDD 182
Query: 256 MDGRVALVPWADMLNHSC--------------EVETFLDYDKSSQGVV------------ 289
G +VP AD+ NH E +T +D +G+V
Sbjct: 183 YHG-FGMVPLADLFNHKTGAEDVHFTAMSSNDESDTDVDGCNDDEGIVKEETLAQNSSID 241
Query: 290 ----------------------------FTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
+ G +VF +YG N LL YGF
Sbjct: 242 MTVLNNGNCNVSDSDSSSVSDGDTSMLEMIMIKDVSSGTEVFNTYGLLGNAALLHRYGFT 301
Query: 322 PREGTNPSDSVELPLSLK-----KSDKCYKEKLEALRKYGLSA-----SECFPIQITGWP 371
++ + ++++ L L+ SD+ + ++ RK G SA SE F I G P
Sbjct: 302 EQDNSYDIVNIDMELVLQWCTSVFSDRHSRARVSLWRKLGYSACGSQNSEYFEISFDGEP 361
>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
Length = 483
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 115/282 (40%), Gaps = 59/282 (20%)
Query: 99 DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP------LL 152
D G RGL A +++R+GE +L +P + ++T+D + C P +L
Sbjct: 45 DAGGRGLAAARDLRRGELVLRLPRAALLTSDR---VTADDPRIAACVSAHKPRLSSVQIL 101
Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
L++E +S W Y+ LP Y++L A + + +EA Q+ + I
Sbjct: 102 IVCLLAEVGKGSNSVWYPYLCQLPSY-YTIL----ATFNDFEVEALQV--------DDAI 148
Query: 212 GTYNDLRLRIFSKYPDL--------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
+ I S + D F ++ +++ W+F + SR + + + D L
Sbjct: 149 WVAQKAKSAIKSDWEDATPLMKELEFKPKLLMFKSWLWAFATVSSRTLHI-AWDEAGCLC 207
Query: 264 PWADMLNHSC-------------EVETFLDYDKSSQGVVFTTD--------------RQY 296
P D+ N++ E+ + + + TD + Y
Sbjct: 208 PVGDLFNYAAPDDDTLLEDEDTAELTNYQQKNGMTNSSERLTDGGYEDCNAYCLYARKNY 267
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ GEQV ++YG +N ELL YGF+ E N +EL L +
Sbjct: 268 KKGEQVLLAYGTYTNLELLEHYGFLLGENPNEKTFIELDLDI 309
>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 470
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 141/308 (45%), Gaps = 45/308 (14%)
Query: 69 EIDSLE-NASTLQKWLSDSGLPPQ-KMAIQKVDVGE----RGLVALKNIRKGEKLLFVPP 122
++D E +T WL + G+ KMA+ VD+ + RG+VA+++I E + +P
Sbjct: 2 DVDDFEARTATFSAWLQEMGIRTNPKMAL--VDLRQEGRGRGVVAIEDIDDDEIIFSIPR 59
Query: 123 SLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
S V+ A + + P A +P W L + L++E + S+W+ Y++ LP Q SL
Sbjct: 60 SAVLNAQN--AKPLAISKRLAEKMPSWLALTSILMAEGQVD-DSKWAPYLAILPEQLNSL 116
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTY------NDLRLRIFSKYPDLFPEEVFNM 236
++W+ +EL ++ +++ + ++ TY + K + F++
Sbjct: 117 VFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEMCHKVASVIMAYAFDI 176
Query: 237 ---ETFKWSFGILFSRLVRLPSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGV 288
S G L S DG ++++P ADMLN D D+++ +
Sbjct: 177 PDPSEGPTSGGKGEEAADDLVSDDGEDEKTILSMIPLADMLNA--------DADRNNARL 228
Query: 289 VFTTD----RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSD----SVELPLS 337
+ + R +P GE++F YG+ +LL YG+V +G + D S EL +S
Sbjct: 229 ICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYVT-DGYSAYDVAEISAELIVS 287
Query: 338 LKKSDKCY 345
L ++ K +
Sbjct: 288 LFRNGKVH 295
>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
Length = 498
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 81/174 (46%), Gaps = 16/174 (9%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI--- 221
S+ W+ YI +P ++T AEL+ L + +R ++ ++ + LR
Sbjct: 131 SNPWTEYIRFMPPSIRLPTFYTEAELE-LLRGTSLRTAVFAKLASLEKEFERLRQSTEGI 189
Query: 222 --FSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCEVET 277
KY + E+ + W + + SR+V LP + A+VP DM NH+ E
Sbjct: 190 PWCQKY--WWDEDTGRLTFDDWKYVDAVYRSRVVELP--ESGHAIVPCVDMANHASEDSV 245
Query: 278 FLDYDKSSQGVVFTTDRQYQ---PGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
YD+SS RQ + GE+V ISYG +K E++ SYGFV E T+
Sbjct: 246 KARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTD 299
>gi|340053796|emb|CCC48089.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 587
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 10/138 (7%)
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKYPDLFPEEVFN-METFKWSF 243
L ++L S+ R++ I NV ++ LR ++ D F+ +E F W++
Sbjct: 209 LQQFLHFSRHRKKVILEQENVKRKFDHCLSVLSVLRFLLRFDDENKLSRFSSLEKFVWAY 268
Query: 244 GILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
L SR S V L+PW D NHS + YD + VF + GEQ+
Sbjct: 269 NTLMSRGF---SYHTEVWVLMPWVDYFNHSSVNNATMRYDSCRRSYVFESRLAISKGEQI 325
Query: 303 FISYGKKSNGELLLSYGF 320
++ YG ++ ELLL YGF
Sbjct: 326 WLQYGSYNDIELLLWYGF 343
>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
Length = 464
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 117/248 (47%), Gaps = 21/248 (8%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----DSKWSCPEAGEVLKQCS-VPD--WPL-LATY 155
GLVA +++ +G ++ +P + + ++ P G + + + VP+ W + L
Sbjct: 49 GLVATQDLPQGSTIITLPRRVPMPMPDPENAAVLAPSEGVICEIANRVPEELWAMRLGLK 108
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
L+ E + +K S W YIS LP ++++ +++ ++ + + + +R ++
Sbjct: 109 LLYERA-QKGSYWWPYISMLPHSFTLPIFFSGVDIES-IDYAPVTHQVKKRCRFLLQFSA 166
Query: 216 DL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH 271
+L +L + F + + W+ + SR R+ + ++ ++P DM NH
Sbjct: 167 ELAKLESLPEEVHPFAGQSVDSGALGWAMAAVSSRAFRIHGVTNKLCSAMMLPLIDMCNH 226
Query: 272 SCEVETFLDYD--KSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
S + ++ D + +Q V F T R + G + ++YG SN LLL YGFV +
Sbjct: 227 SFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPD-- 284
Query: 327 NPSDSVEL 334
NP D +EL
Sbjct: 285 NPHDRIEL 292
>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
Length = 504
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 82/201 (40%), Gaps = 27/201 (13%)
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIR 200
S+P + YLI + KSS W+ YIS L P L W AE D L
Sbjct: 98 SLPPHVIGRFYLIQQYLKGKSSFWAPYISTLA-DPSQLDKWALPPFWAEDDIELLQGTNA 156
Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR------- 252
AI+ I N + + +I K + FP+ + + W++ + SR R
Sbjct: 157 YIAIQEIQNNVKSEYKQARKILKK--EGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSE 214
Query: 253 ---------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
LP +D L P D+ NHS + + + Y PG+
Sbjct: 215 SAREYVERLLPEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQ 274
Query: 301 QVFISYGKKSNGELLLSYGFV 321
QVF +YG K+N ELLL YGF+
Sbjct: 275 QVFNNYGFKTNSELLLGYGFI 295
>gi|347841961|emb|CCD56533.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
Length = 377
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 71/157 (45%), Gaps = 22/157 (14%)
Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
FPE + F +++ I+ SR PS + R+AL P+AD +NHS E
Sbjct: 142 FPEPPITYDGFIYNYSIVNSRTFYYLSPTIKPSKPQPSKENRLALNPFADYINHSSE--P 199
Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--- 334
+D S G T + + G +V ISYG +N LL+ YGF+ + N D V L
Sbjct: 200 TVDATLSRAGYTLTASQPIKQGSEVHISYGSHNNDFLLVEYGFILED--NRWDEVTLDPW 257
Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQIT 368
PL L K + E+ L KY L C+ Q+
Sbjct: 258 ITPL-LSVEQKEHLEETGFLGKYLLDRDTICYRTQVV 293
>gi|358369129|dbj|GAA85744.1| SET domain-containing protein [Aspergillus kawachii IFO 4308]
Length = 416
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/186 (30%), Positives = 86/186 (46%), Gaps = 23/186 (12%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSM------DGRVALVPWADMLNHSCEVETFLDY 281
+FPE + T+ W S LP + +ALVP+AD NHS +V + +
Sbjct: 183 VFPETDKELFTYHWVIVNTRSFFYLLPGAEMPEDRNDAMALVPFADYFNHS-DVACNVKF 241
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKK 340
D + VF ++Y GE++++SYG SN L YGF TN S+++ L + L+
Sbjct: 242 D--GEEYVFRAAKEYNEGEEIYMSYGPHSNDFLFTEYGFY--LDTNASETLYLDEIILQD 297
Query: 341 SDKCYKEKLEALRKYG----LSASECFPIQI----TGWPLELMAYAYLVVSPPSMKGKFE 392
+ +E+LE + YG S C+ +I T PL L L S G E
Sbjct: 298 LNASKQEELEFHQYYGNYQLTSEGVCYRTEIAAGLTYMPLRLWQDYVLGY---STDGVDE 354
Query: 393 EMAAAA 398
+M+AA
Sbjct: 355 KMSAAV 360
>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
42464]
Length = 496
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 108/278 (38%), Gaps = 47/278 (16%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-----------------TADSKWSCPEAGEVLKQCS 145
RG+VA +I L +P S +I D + GE S
Sbjct: 47 RGIVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSS 106
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----R 200
W LL LI E +S+W Y+ LP + ++W+ EL L+AS + R
Sbjct: 107 QDSWTLLILILIYEYLQGDASQWKPYLDVLPSAFDTPMFWSPTEL-AELQASALVTKVGR 165
Query: 201 ERAIERITN----VIGTYN-------------DLRLRIFSKYPDLFPEEVFNMETFKWSF 243
E A I + VI ++ D + + F++E +
Sbjct: 166 EEADRMIRSKILPVIRGHDHVFFPHGRQRLDDDQLFELAHRMGSAIMAYAFDLEKDDDAN 225
Query: 244 GILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ + +GR + +VP ADMLN E ++++ S + T R + GE+
Sbjct: 226 EEASEQDEWVDDREGRTMLGMVPMADMLNADAEFNAYINHGADS--LTATALRTIKAGEE 283
Query: 302 VFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSL 338
+ YG NGELL YG+V P+ D VELP L
Sbjct: 284 ILNYYGPLPNGELLRRYGYVTPKHAR--YDVVELPWDL 319
>gi|340516784|gb|EGR47031.1| predicted protein [Trichoderma reesei QM6a]
Length = 483
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 89/210 (42%), Gaps = 49/210 (23%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRER 202
L+ YL SF W YI ALP QP + W AEL LE + + E
Sbjct: 92 LIKEYLRGAESF-----WHPYIQALP-QPEDVDDWALPPLWPEEEAEL---LEGTNV-EI 141
Query: 203 AIERITNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR------- 252
+++I +G + D + + + D + + E ++W++ I SR R
Sbjct: 142 GLDKIREDLGREFRDAQKLLLASNGDAEDDFSSLLTRELYQWAYCIFSSRSFRPSLVLSR 201
Query: 253 ------LP---SMDGRVALVPWADMLNHSCEVETFLDY------DKSSQG------VVFT 291
LP S + L+P D+ NH V D D SS G V
Sbjct: 202 EQQEALLPPGVSANDFSVLLPVFDIGNHDMTVHVRWDVTSGGQADASSTGGSAAAAVQLK 261
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
R+++PGEQ+F +Y K+N ELLL YGF+
Sbjct: 262 VGREHKPGEQIFNNYSPKTNAELLLGYGFM 291
>gi|302842147|ref|XP_002952617.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
nagariensis]
gi|300261961|gb|EFJ46170.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
nagariensis]
Length = 713
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 75/320 (23%), Positives = 123/320 (38%), Gaps = 83/320 (25%)
Query: 82 WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP--SLVITADSKW----- 132
WL G + P+ + G+RG++A +I +GE LL +P ++ I D ++
Sbjct: 6 WLRSRGGRIHPELDLFHTLPSGDRGVIARSDIAEGELLLLLPIDCAIYIPTDEEFKKHPN 65
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLI--SEASFEKSSRWSNYISALPRQ-PYSLLYWTRAE 189
P+A L++ P LAT L+ SE + S W+ Y++ LP P LL WT E
Sbjct: 66 DFPDAVRYLREAHPGLSPFLATTLVLMSEMTRGSVSPWAAYVATLPASCPDCLLNWTEEE 125
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP------------------- 230
L+ + + + + +V Y I + DL+P
Sbjct: 126 -KLELKGTSLEQSGPDPAVDV---YRRHVAPILACRTDLWPGLAAKEPPAAEATGATLDA 181
Query: 231 -------------EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS----- 272
F++E W G + L + +V L+P DM+NHS
Sbjct: 182 GLAAFARAAGLVQSRAFHLEAENWVSGA--KEIAHLENGGTQVFLLPGIDMINHSHNPDR 239
Query: 273 --CEVETF-----------------LDYDKSSQGV---------VFTTDRQYQPGEQVFI 304
+E D + ++GV V D+ + GE+V
Sbjct: 240 RNAHLERLNVAQAAAAKLLEREPGEEDAREGAKGVGVRGVEAFFVMRADKPIKAGEEVLH 299
Query: 305 SYGKKSNGELLLSYGFVPRE 324
+YG S+ +LL +YGF+ E
Sbjct: 300 TYGNLSDAQLLQTYGFLDSE 319
>gi|326427686|gb|EGD73256.1| hypothetical protein PTSG_04969 [Salpingoeca sp. ATCC 50818]
Length = 455
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/325 (22%), Positives = 135/325 (41%), Gaps = 56/325 (17%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPL 151
+AI+ + G+ A +++ + VP SL++ + P+ G + + D +
Sbjct: 50 VAIRTSPLTGNGVYATADMKAHTTVFAVPFSLMMNVEHALVDPDLGRLWDMLPDLSDLEV 109
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LA +L E S + W Y+++L P + +++ E+D ++ + + A +R ++
Sbjct: 110 LAGFLAFE-SLRGTGFWQPYLASLGPPPTTPTLFSQEEMDLLAPSAAVFDIAQQRHLDLS 168
Query: 212 GTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG------RV 260
Y DL ++ + K + + + + W+ +L SR +L D +
Sbjct: 169 EVY-DLIVKAATNGLNKKEKEAWQQLGMRKSDYLWAHVVLRSRSHKLSIKDAVGQWHDAM 227
Query: 261 ALVPWADMLN-----HSCEVETFLDYDKSSQGVVF--TTDRQYQPGEQVFISY----GKK 309
LVP AD+ N ++ V + + F T R E++ + Y ++
Sbjct: 228 CLVPLADLFNTDLRNNTANVACYTGEEGHGTASTFYCETTRDINHSEELLVEYIGDAMRR 287
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLS-----------------------LKKSDKCYK 346
S+G+LLL YGFVP T+ SDSV L L L+ SD C K
Sbjct: 288 SSGKLLLDYGFVPT--THDSDSVLLHLPKLSETAEQRLKHFHFREYEPLPWLENSDACLK 345
Query: 347 EKL------EALRKYGLSASECFPI 365
+L + L K G S+ F +
Sbjct: 346 LRLVIYFAIDVLDKNGFDISDDFKV 370
>gi|310798181|gb|EFQ33074.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 485
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 88/226 (38%), Gaps = 45/226 (19%)
Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------D 191
SVP + +LI + K+S W YIS LP QP L W L +
Sbjct: 96 ASVPPHVIGRFFLIHQYLLGKASFWHPYISTLP-QPEHLQSWILPPLWPSDDVELLEDTN 154
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
++ ++ + R + + D Y L + W++ I SR
Sbjct: 155 VHVAVAETKARLKAEFKHAVAALGDDTAAARRGYTRLL---------YHWAYCIFASRSF 205
Query: 252 R--------------LPS---MDGRVALVPWADMLNHSCEV----ETFLDYDKSSQGVVF 290
R LP+ +D L+P D+ NH+ +T D D +S
Sbjct: 206 RPSLVIPAARKATLALPAGCAVDDFSLLMPLLDVGNHAPTAAVAWDTDADGDGASNSCAL 265
Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGF-VPREGTNPSDSVEL 334
T Y PG QVF +YG K+N EL+L+YGF VP +D V +
Sbjct: 266 RTLDPYAPGAQVFNNYGTSKTNAELMLAYGFCVPESAGLHNDYVHV 311
>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
10762]
Length = 381
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/321 (23%), Positives = 127/321 (39%), Gaps = 52/321 (16%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-- 130
++ +W G+ K+ ++ GLV +++K +++LFVP + DS
Sbjct: 1 MQRHEAFTEWAQARGVEIGKVKPARLPGRGLGLVTTASVKKNQRILFVPEKAMFKPDSAL 60
Query: 131 ------KWSCPEAG---EVLKQCS--VPDWPLLATYLISEASFEKSS--RWSNYISALPR 177
+ + P+A VL C+ V L ++ FE+ RW + L
Sbjct: 61 LKQHNLQHASPQAQLAVSVLAACATQVSAIALWEATWPTDTDFEQGMPMRWDGCLQDL-- 118
Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
P S+ + + D Y++ +V + LR ++ + + N
Sbjct: 119 LPPSVQQPLQRQQDDYMK-------------DVGSVHTFLRHARVTEQRFRYYWSIVNSR 165
Query: 238 TFKWSFGILFSRLVRLP-SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
+F W + P G + + P+ D NH T + + S G +R Y
Sbjct: 166 SFHW----------KPPKGKAGSMVMCPFIDYTNHG-PTGTGCNVSQRSNGYEMLANRDY 214
Query: 297 QPGEQVFISYGKKSNGELLLSYGFV---PREGTNPSDSVELP-LSLKKSDKCYKEKLEA- 351
GE+V +YG SN +LL+ YGF+ P N D + L L + K D ++KL+
Sbjct: 215 DAGEEVLFTYGAHSNDKLLVHYGFICESPPGLRNKDDDIRLDHLLIPKLDSHVRDKLQDV 274
Query: 352 --LRKYGLSASE---CFPIQI 367
L Y L +E CF Q+
Sbjct: 275 GFLGAYALLPAENELCFKTQV 295
>gi|355710254|gb|EHH31718.1| hypothetical protein EGK_12845 [Macaca mulatta]
Length = 379
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 17/201 (8%)
Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+SRW Y + P + ++W + L+ + + E + + N+ Y+ + L
Sbjct: 36 ASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPEAVEKDLANIRSEYHSIVLPFM 95
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
+PDLF V ++E + ++ + + P + +VP AD+LNH
Sbjct: 96 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 155
Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
L+Y + +V T QP G ++F +YG+ +N +L+ YGFV N D+
Sbjct: 156 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 210
Query: 333 ELPLSLKKSDKCYKEKLEALR 353
++ + + K EA R
Sbjct: 211 DIQMVTVREAALQGTKTEAER 231
>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
HHB-10118-sp]
Length = 513
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 89/197 (45%), Gaps = 26/197 (13%)
Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
VA+VP ADMLN ET + + T + + GEQ++ +YG N +LL YG
Sbjct: 270 VAMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYG 329
Query: 320 FV-------PREGT-NPSDSVELPLSL--KKSDKCYKEKLEALRKYGLSASECFPIQITG 369
F+ P G NP+D VE+P +L + + K K + + L +E + + G
Sbjct: 330 FIDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDRVDWWLEEAED-DVFVVG 388
Query: 370 ----WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
P E+++ A L++ P K ++E K +K + P +D +D
Sbjct: 389 TDCELPPEMVSLARLLLQP---KAEWE--------KTKAKGKVPKPTMDTTIAAIAMDVL 437
Query: 426 ESSISKYSRFLQVKELL 442
+S + +Y ++ E L
Sbjct: 438 QSRLKEYPTSVEEDERL 454
>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
972h-]
gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
pombe]
Length = 547
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 14/192 (7%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
L T+L E+ S+W YI LP+ + LY+ + + +L ++ A ER+
Sbjct: 82 LCTFLALESLKGIQSKWYGYIEYLPKTFNTPLYFNEND-NAFLISTNAYSAAQERLHIWK 140
Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWAD 267
Y + S +P P E F + + WS + FS + + L+P D
Sbjct: 141 HEYQE----ALSLHPS--PTERFTFDLYIWSATVFSSRCFSSNLIYKDSESTPILLPLID 194
Query: 268 MLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
LNH + + D + + V + G Q+F +YG K N ELL+ YGF +
Sbjct: 195 SLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGYGFCLPD-- 252
Query: 327 NPSDSVELPLSL 338
NP D+V L +++
Sbjct: 253 NPFDTVTLKVAI 264
>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 229
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 84/194 (43%), Gaps = 24/194 (12%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
+WL G + V G RG++A NI +G L+ VP L+++A S E
Sbjct: 8 EWLQKGGALIADIEPGAVAEGFRGVIAKANIEEGTLLVAVPERLLLSAHSAKKDRAFAEA 67
Query: 141 L---KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
L + S+ +LA +L+ EAS + S W Y++ LPRQ Y+ L Y
Sbjct: 68 LLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ-YTCL--------SYFSPE 118
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
IRE +E + I S + + +++ + + SR + LP D
Sbjct: 119 DIRELQVEYAMD-----------IASSVVEALRSDHTSVKPLLNALATVASRTMYLPD-D 166
Query: 258 GRVALVPWADMLNH 271
AL+P+ D+ NH
Sbjct: 167 AAGALMPFGDLHNH 180
>gi|194864902|ref|XP_001971164.1| GG14807 [Drosophila erecta]
gi|190652947|gb|EDV50190.1| GG14807 [Drosophila erecta]
Length = 1183
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 35/214 (16%)
Query: 151 LLATYLISEASFEK------SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
L+A Y++ +K SS +S Y+ LPR + + + EL E+ + ER +
Sbjct: 154 LVACYVLYHKHLQKCSLGTRSSPYSAYLDTLPRGYTTPYFCSIPELQCLPES--LLERTV 211
Query: 205 ERITNVIGTYNDLRLRIFSK--YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM------ 256
+ + G + ++ + + + +E++ + FKW++ + +R V L S
Sbjct: 212 AQNRQIRGYFEIIKNLVLNCDCCAKSYGQEIWTLADFKWAYFTVNTRSVHLSSRFLKKQS 271
Query: 257 ---------DGRVALVPWADMLNHSCEVETFL-----DYDKSSQGVVFTTDRQYQPGEQV 302
D +AL P+ D+ NHS V+ DY + + + F+ + Y +Q+
Sbjct: 272 NYFQPLISGDTNMALAPFLDLFNHSDSVQITAEIEGPDYVLTLKSLPFSKTKPY---DQL 328
Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
FISYG SN +LL YGF +E N D E+ L
Sbjct: 329 FISYGALSNFKLLTEYGFWLQE--NKHDYFEVSL 360
>gi|412987807|emb|CCO19203.1| predicted protein [Bathycoccus prasinos]
Length = 638
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 91/222 (40%), Gaps = 48/222 (21%)
Query: 163 EKSSR------WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
E+SSR W Y+ LP SLL W+ EL +L+ S+I ERA R V +
Sbjct: 234 EESSRDDYSPHWQAYVDLLPTNVDSLLLWSTEEL-AFLQNSRIGERAKRRKKLVAEEFQV 292
Query: 217 L-----------RLRI---------FSKYPDLFPEEVFNMET----FKWSFGILFSRLVR 252
L RL + + L P F ET F+W++ + +R
Sbjct: 293 LFSNASLREEFERLMMPQTKQTQTQTRQRKRLTP---FRFETLREWFEWAYATVLARAFT 349
Query: 253 LPSM-DGRVALVPWADMLNHS-----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
LP + +G + L P D+ N + CEV D S V Y+ G Q F Y
Sbjct: 350 LPEIENGALLLCPGLDIFNSARDAAKCEVRLSPHDDISLHATVGG----YRAGTQAFHDY 405
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
KS+G LL +GF+ + + + PL + D+ K K
Sbjct: 406 ADKSSGGALLEFGFI----YDDDERLNFPLFMDDDDEKEKAK 443
>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
Length = 471
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/229 (22%), Positives = 103/229 (44%), Gaps = 7/229 (3%)
Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSR 167
+++++G +L+ +P + +T + P ++++ W LA LI++ S+
Sbjct: 29 RDVQQGHRLITLPNAAHLTYGAN-DDPRLLALIEKVPSELWGAKLALQLIAQRLQGGESQ 87
Query: 168 WSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP 226
+++Y++ LP+ P +++ R LD ++ ++ +R + ++ R+
Sbjct: 88 FASYVAELPKGFPGIPVFFPRTALD-MIDYPPCSQQVKKRCKWLYEFSTEVLARLPGSPE 146
Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET-FLDYDKSS 285
D F ++ W+ + SR R A++P DM NH+ L +
Sbjct: 147 DPFGGVAVDINALGWAMAAVSSRAFRTRGPTQPAAMLPLIDMANHTFSPNAEVLPLEGGG 206
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
V R GE + +SYG+ SN L + YGF+ + NP DSV+L
Sbjct: 207 GAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVED--NPYDSVQL 253
>gi|157818191|ref|NP_001099637.1| N-lysine methyltransferase SETD6 [Rattus norvegicus]
gi|325530256|sp|D3ZSK5.1|SETD6_RAT RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
domain-containing protein 6
gi|149032380|gb|EDL87271.1| similar to hypothetical protein FLJ21148 (predicted) [Rattus
norvegicus]
Length = 474
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/288 (23%), Positives = 125/288 (43%), Gaps = 28/288 (9%)
Query: 65 PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
P G E +S A L +W + GL P+ + ++ V G+VA ++++ GE L VP
Sbjct: 37 PSGHEPESDAVAGFL-RWCTRVGLELSPKVLVSRQGTVAGYGMVARESVQPGELLFAVPR 95
Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
S ++ S +C + + ++ S+ W + + +S WS Y + P
Sbjct: 96 SALL---SPHTCSISDLLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPEL 151
Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
+ ++W E R L+ + + E + + N+ Y + L + DLF V ++
Sbjct: 152 GRLEHPMFWPEEERLRLLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPTVRSL 211
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVV 289
E ++ ++ + + P + +VP AD+LNH L+Y +V
Sbjct: 212 ELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHIANHNANLEYSAEYLRMV 271
Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
T QP G ++F +YG+ +N +L+ YGF N D+ ++
Sbjct: 272 AT-----QPILKGHEIFNTYGQMANWQLIHMYGFAEPYPNNTDDTADI 314
>gi|429851283|gb|ELA26485.1| set domain-containing protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 196
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 42/80 (52%), Gaps = 9/80 (11%)
Query: 262 LVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
L P AD+LNH+ C V +D S DR Y PGE++ I YG+ SN LL+ Y
Sbjct: 3 LQPVADLLNHASRGCNVA----FDTES--FTIRADRDYSPGEEIHICYGRHSNDFLLVEY 56
Query: 319 GFVPREGTNPSDSVELPLSL 338
GFV EG N D L +L
Sbjct: 57 GFVMGEGENEWDEACLDEAL 76
>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
Length = 527
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 135/322 (41%), Gaps = 52/322 (16%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+VA+ +IR + +L VP V T D+ + L SV L +E + +
Sbjct: 32 GIVAVYDIRANQTILSVPTRAVRTIDTVPK--HIKDALHGVSV------HGILAAEIALD 83
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
S ++ + + LP TR +L+ + S+++ +R +++ N R
Sbjct: 84 DSDDFAIWRTVLP---------TREDLEGGMPMMWPSELQALLPKRAKDLLDNQNTTFRR 134
Query: 221 ----IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
+ +P L +E + N TF S + S + R+ +P D+ N
Sbjct: 135 ECDIVLKAFPTLTRDEYMLSWVLINTRTFYNSMPKMKSY-----AHSDRLVCMPVLDLFN 189
Query: 271 HSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGT 326
H + + L Y S+ G TDR Y+ GE+VF+SYG SN LL YGF+ R
Sbjct: 190 HEDQSQGCKLVY--SALGYSVQTDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDE 247
Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
D V LPL L K+ + E + L +Y L Q G +A L SP
Sbjct: 248 VYLDEVILPL-LNKTQRAELESVGFLGRYTLDD------QTPGCHRTQVALRMLCCSP-- 298
Query: 387 MKGKFEEMAAAASNKMTSKKDI 408
G+++ A + +S+ ++
Sbjct: 299 --GQWQRFFDACEDGRSSQVEV 318
>gi|355756831|gb|EHH60439.1| SET domain-containing protein 6, partial [Macaca fascicularis]
Length = 371
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 17/201 (8%)
Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
+SRW Y + P + ++W + L+ + + E + + N+ Y+ + L
Sbjct: 36 ASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPEAVEKDLANIRSEYHSIVLPFM 95
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
+PDLF V ++E + ++ + + P + +VP AD+LNH
Sbjct: 96 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 155
Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
L+Y + +V T QP G ++F +YG+ +N +L+ YGFV N D+
Sbjct: 156 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 210
Query: 333 ELPLSLKKSDKCYKEKLEALR 353
++ + + K EA R
Sbjct: 211 DIQMVTVREAALQGTKTEAER 231
>gi|68479052|ref|XP_716460.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
gi|46438129|gb|EAK97465.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
Length = 579
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 143/332 (43%), Gaps = 71/332 (21%)
Query: 73 LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL-------LFVPPS 123
+E+ + L KW +G + P V+ E + I KG K+ + +P
Sbjct: 4 IESINKLLKWAESNGAQISPD------VEFKEISKNYIGAIYKGNKVPDSPFCPISIPSK 57
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----Q 178
L+IT + + E + LK + D +L +L E +S + Y++ LP
Sbjct: 58 LIITPQTAFK--EFSKSLKNTDINDNSILKLHLCHE-RLNGNSFFYPYLNLLPSLSEIDS 114
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVI--------GTYNDL--------RLRIF 222
PY+ W+ A YL+ + + E + ++ ++DL ++ +
Sbjct: 115 PYT---WS-ANDKSYLQGTNLGNSLKENLVTLVEEWWKAINALHDDLPKPEQHYINMKFY 170
Query: 223 SKYP---------DLFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPW 265
+Y L E + N +F W+ IL SR L+ + L+P
Sbjct: 171 YEYKFYTDDDLNKYLNDENIENWTSFPNYLWASLILKSRSFPAYLIDKNNKQDSAMLLPV 230
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D+LNH+ + + + +D S F+++ PG+++F +YG K N ELLL+YGF
Sbjct: 231 VDLLNHNSKSK--VHWDVSDNYFKFSSE-SIVPGKEIFNNYGLKGNEELLLAYGFCIE-- 285
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
N DSV L + + +EK++A+ +YG+
Sbjct: 286 NNSQDSVALKIKMP------EEKIKAIEEYGI 311
>gi|302848348|ref|XP_002955706.1| hypothetical protein VOLCADRAFT_106928 [Volvox carteri f.
nagariensis]
gi|300258899|gb|EFJ43131.1| hypothetical protein VOLCADRAFT_106928 [Volvox carteri f.
nagariensis]
Length = 542
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/238 (23%), Positives = 111/238 (46%), Gaps = 17/238 (7%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
GL A +++ G +L+ +P + +T +K P ++++ W LA L+S+
Sbjct: 101 GLQASQDLEPGRRLIVLPAACHLTYGAK-DDPRLLALIEKVPNELWGAKLALQLLSQRLR 159
Query: 163 EKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
S ++ YIS LPR P +++++ LD ++ + ++ +++ + T++ ++
Sbjct: 160 GADSLFAAYISNLPRGIPGIPMFFSKRALD-LIDYPPVTQQ-VQKRCRWLHTFSQ---QV 214
Query: 222 FSKYP----DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET 277
+K P D F ++ W+ + SR R A++P DM NH+
Sbjct: 215 MAKLPGSPEDPFGGVTVDINALGWALACVTSRAFRTRGPAHPAAMLPLIDMANHTFTPNA 274
Query: 278 -FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
L G+ + + GE + +SYGK +N L + YGF+ + NP D+V+L
Sbjct: 275 EVLPLPGGDMGLFAKS--KVATGEPLLLSYGKLNNDFLFMDYGFIVPD--NPYDTVQL 328
>gi|342877200|gb|EGU78693.1| hypothetical protein FOXB_10798 [Fusarium oxysporum Fo5176]
Length = 456
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/194 (29%), Positives = 81/194 (41%), Gaps = 32/194 (16%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
L+ +L+ SF W+ YI ALP QP W+ +A IE V
Sbjct: 91 LIKHFLLKGESF-----WAPYIQALP-QPDDHDSWSLPPFWPDEDAELFEGTNIE--VGV 142
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFK--------WSFGILFSR-----LV------ 251
+++ R F DL E + +E K W++ I SR LV
Sbjct: 143 TSIRANVK-REFKTAHDLLAAESWELELLKQFTLPLYQWAYSIFSSRSFRPSLVLGPEDQ 201
Query: 252 -RLPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
RLP +D L+P D+ NH + ++ G ++ YQ GEQVF +Y
Sbjct: 202 QRLPEGVKLDDFSVLMPLFDVGNHDMTTKVEWVRNERINGCSLKVEKAYQAGEQVFNNYS 261
Query: 308 KKSNGELLLSYGFV 321
K+N ELLL YGF+
Sbjct: 262 MKTNAELLLGYGFM 275
>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 576
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 9/97 (9%)
Query: 243 FGILFSRLVRLPSMDG----RVALVPWADMLNHSCEVETFLDYDKSSQGVVFT-TDRQYQ 297
F L+ R+ + SM G + + P+ D+ NHS V++ + Y+ + ++R
Sbjct: 384 FDALYPRIYQ--SMQGAPLKKYVIAPFIDLFNHSSRVQSKVAYEYFYDAFSLSISNRDTH 441
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
G+QVFISYG +N ELL YGFV E NP D+ +L
Sbjct: 442 AGDQVFISYGTLTNDELLALYGFV--EEDNPHDTYKL 476
>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
[Gallus gallus]
Length = 283
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 94/207 (45%), Gaps = 21/207 (10%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L KW +++G + I + GL A + I+ E L+VP L++T +S
Sbjct: 82 LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136
Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
VL D L LA +L+ E + +S W YI LP + + LY+ E+
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEDEV- 194
Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
+YL ++Q + N Y ++ +P+ L ++ F + ++W+ + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253
Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
R ++P+ DG +AL+P DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280
>gi|313239023|emb|CBY14007.1| unnamed protein product [Oikopleura dioica]
Length = 299
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 11/107 (10%)
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
GR L P D+LNH+ E ++ D++ + R+ + GE++F S+G+ SN LL
Sbjct: 74 GRFML-PVGDLLNHTAENNARIELDENDHTI--NAIREIESGEEIFNSFGEMSNSRLLHM 130
Query: 318 YGFVPREGTNPSDSVELPLSLKKSDKCYKE-------KLEALRKYGL 357
YGF +E + D+ P S+KK + +K+ KL+ L K+G
Sbjct: 131 YGFAEKE-NDADDAFLDPNSIKKGIEEFKDEIPAIQAKLKLLEKHGF 176
>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
Length = 706
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 102/227 (44%), Gaps = 28/227 (12%)
Query: 151 LLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
L+ YL+ + SF W+ YI +LP Q L Y+T +L ++LE + + + + +
Sbjct: 130 LMDQYLLGDESF-----WAPYIQSLPDDSQFTRLEYYTGDDL-KWLEGTNLLKLREKLLE 183
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--------------VRLP 254
+ Y + LR+ ++P+ + + E F W+ I+ SR R+
Sbjct: 184 RLKAKY-ETGLRLLKEFPNKNTPK-YTWERFLWASSIILSRAFSSEVLKDYIKGTPTRVK 241
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
++ LVP D+ NH + +++ S + + + PGE+V +YG +SN L
Sbjct: 242 PLEDFSVLVPLVDISNHQPLAQ--VEWATSLEKIGLIVHKTLLPGEEVPNNYGPRSNERL 299
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
+++YGF R N D E+ L K E ++G S S+
Sbjct: 300 MMNYGFCIR--GNVCDYREMNLRAPPDSPLAIAKQEQQTRFGASKSK 344
>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
Length = 401
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/160 (27%), Positives = 75/160 (46%), Gaps = 10/160 (6%)
Query: 125 VITADSKWSCPEA-----GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQ 178
V+TA K +C A E L++ + L ++ E S + SRW+ Y + LP R
Sbjct: 22 VVTAIPKAACLSARTCSVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLPARG 81
Query: 179 PYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNM 236
+L ++WT A+L+ +L + + E ++ +N+ + + +P FP +
Sbjct: 82 ERTLPMFWTSAQLE-HLRGTDLLRHVTEDAESMRLDFNENVVDGLCVTHPVAFPPGKHTL 140
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
E + + + SR + G ALVPWADM NH + E
Sbjct: 141 EAYMEAASLAASRAFYIGEECGE-ALVPWADMFNHKTDGE 179
>gi|328867430|gb|EGG15812.1| hypothetical protein DFA_09480 [Dictyostelium fasciculatum]
Length = 466
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 55/250 (22%), Positives = 112/250 (44%), Gaps = 33/250 (13%)
Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPEAGEVLKQCSVPDWPLLATYLI--S 158
ERG+ ++I++G ++ + + + D K P ++L++ V LA L+
Sbjct: 75 ERGIFVNEDIKEGSDIVNLQWINIFSLDRIKHETPHLYQLLQENDVSAEIGLAISLMYYR 134
Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
+ +S + ++++LP + + LY+ E+ + L+ S + +I Y L
Sbjct: 135 YCKDDTTSEYYQWMNSLPTELNTGLYFNPDEI-QLLKGSPAFVHLMLQIDQTREMYQKLN 193
Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLP---------------------- 254
+F +F N + +KW+ I+ SR + LP
Sbjct: 194 GEMFK--DKIFDGCAINWDRYKWAVSIVGSRGIYTELPIDKELEKKKETKEKEEKEEEEE 251
Query: 255 SMDGRVALV--PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS-N 311
S+ R+ +V P+ D NH+ + + D+D S + + + + GEQ+F++YG ++ +
Sbjct: 252 SLQQRLTIVLAPFIDYFNHNNDAQATYDFDHESSSIRVSLLKSVKSGEQIFLNYGNQNCD 311
Query: 312 GELLLSYGFV 321
+ L+ YGFV
Sbjct: 312 SDFLIDYGFV 321
>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
Length = 578
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 68/269 (25%), Positives = 113/269 (42%), Gaps = 26/269 (9%)
Query: 115 EKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISA 174
EK+ F+ ++V D +GE+ S D PLL LI E ++S W++ + +
Sbjct: 166 EKMFFID-TVVKYCDLGRVVHASGELSSMIS-GDEPLLVLSLIYERYVAETSHWNDLLCS 223
Query: 175 LPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-- 231
P P +W +L LE + + + + + + + + Y L
Sbjct: 224 CPVDYPNVPSFWDWEDLAE-LEGLDVLDDVLAKKAQLAQFHTETMAVLPFIYEALAGSCR 282
Query: 232 -------EVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYD 282
E F++E W+ SR L ++DGRV ALVP ADM+NH + +
Sbjct: 283 LGKDEFLECFSIEAMMWARATFDSRAFNL-NVDGRVVIALVPVADMINHHNRSDVLVRKV 341
Query: 283 KSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+ + G + + G ++++SYG N ELL YGFV E N D + P
Sbjct: 342 EPNGGDFVMQIGASLTAQDIGRELWMSYGPLQNWELLQFYGFVVEE--NEHDRLPFPFDF 399
Query: 339 KK---SDKCYKEKLEALRKYGLS-ASECF 363
+ D+ + + + YGL A C+
Sbjct: 400 PEGVAGDEWDRRRATLVATYGLHLAGRCW 428
>gi|330924024|ref|XP_003300479.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
gi|311325361|gb|EFQ91406.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
Length = 642
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 91/214 (42%), Gaps = 28/214 (13%)
Query: 141 LKQCS--VPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
L+QC +PD L LI + ++S W YI+ LP R + L++ ++ +L
Sbjct: 84 LRQCQGRIPDHILTYLLLIEQRDRGQASPWHAYIACLPSPRDMTTPLWFNEGDM-AFLAG 142
Query: 197 SQIRERAIERITNV-------IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
+ + A ER + + +L + + + ++E+ W+ + SR
Sbjct: 143 TSLAPAAKERRAELQQQWERAVAVMEELSIPL---------AKGIDIESLLWAATVFTSR 193
Query: 250 LVR----LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFI 304
LP + L P D+LNHS + D+ + D +QP +++F
Sbjct: 194 AFISTHILPEKETVPILFPVVDILNHSVSAKVEWDFQPRQSFALKCLDGHSFQPRQELFN 253
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+Y K N ELLL YGF + NP + L L+
Sbjct: 254 NYAPKQNDELLLGYGFCLED--NPIEQFALKLAF 285
>gi|397574384|gb|EJK49180.1| hypothetical protein THAOC_31979 [Thalassiosira oceanica]
Length = 462
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/259 (22%), Positives = 115/259 (44%), Gaps = 23/259 (8%)
Query: 79 LQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-----KW 132
+++W GL + + D + LVA + G+ +L+VP ++++++
Sbjct: 75 MEQWAQQYGLQKADGVELYSDDGADYQLVAQNGVAAGQMILYVPSDIILSSNGVAEEFGG 134
Query: 133 SCPEAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
+ +A +VL Q +P + L+A +++E ++S + ++++LPRQ Y+ + T
Sbjct: 135 AIAQAEQVLVQMDQGTQKRLPLFRLMAK-ILNEYDQGENSPFYPWLASLPRQYYNGVSMT 193
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
A + + A +N Y+ + Y L V + + W++ +
Sbjct: 194 DACF------ACLPPYAGWLTSNERNNYHFFVAALRKGYVQL--NSVHDEKAVMWAYNVA 245
Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
F+R + S + + P ADM+NHS E + +D V T PG + SY
Sbjct: 246 FTRFEEVWSPSRQKLIAPMADMINHSAEPNCQISFDDMGNCQV-TALYDIPPGTPITKSY 304
Query: 307 GKKSN-GELLLSYGFVPRE 324
G +N + YGF+P++
Sbjct: 305 GDPTNPTPIFAQYGFLPQD 323
>gi|302914506|ref|XP_003051150.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
77-13-4]
gi|256732088|gb|EEU45437.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
77-13-4]
Length = 499
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 86/193 (44%), Gaps = 9/193 (4%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI--F 222
S+ W+ Y+ LPR W+ E L+ + + ++ + +++LR +
Sbjct: 134 STPWTEYLKFLPRHVPVPTMWSEVER-ALLQGTSLEAALEAKLAALNNEFDELREKSSGL 192
Query: 223 SKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY 281
+ + LF E E ++ + + SR + LP A+VP DM NHS + + +
Sbjct: 193 AFWNSLFWEKETATIQDWILIDALYRSRCLELPRAGD--AMVPGLDMANHSHDPTAYYEE 250
Query: 282 DKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
D V+ + GE+V ISYG KS E+L SYGF+ R+ + + LPL
Sbjct: 251 DDKDDVVLLLRLGVEVTGGEEVSISYGDKSPAEMLFSYGFIDRDSA--AHDLTLPLEALP 308
Query: 341 SDKCYKEKLEALR 353
D K KL +
Sbjct: 309 DDPLGKAKLHIFK 321
>gi|358384957|gb|EHK22554.1| hypothetical protein TRIVIDRAFT_83966 [Trichoderma virens Gv29-8]
Length = 378
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 73/143 (51%), Gaps = 14/143 (9%)
Query: 221 IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
+ + YP+L ++ + N TF ++ +LP D R+AL P AD+ NH+ E
Sbjct: 140 VTAAYPELHKDDYLYSWLLINTRTFYYTD----RGTDKLPRED-RMALQPVADLFNHTPE 194
Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
++D + FTT R +QPGE+VFI YG +N +LL+ YGF NP D L
Sbjct: 195 GYCVANFD--DKFFTFTTTRTHQPGEEVFIRYGPHANDKLLVEYGFTLPSSVNPWDETCL 252
Query: 335 PLSLKKSDKC-YKEKLEALRKYG 356
+ S +++LEA+ +G
Sbjct: 253 DSYICPSMTVDQRDRLEAVGFWG 275
>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
Length = 410
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 93/228 (40%), Gaps = 27/228 (11%)
Query: 134 CPEAGEVLKQCSVPD-----WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
C EV+K S D LL+ +L+ E + SRW +I ++P +L W+
Sbjct: 3 CRFVREVVKAASQEDEQLCERQLLSLHLLVEKWKAERSRWWRFIRSIPPSYDTLENWSEQ 62
Query: 189 ELDR-----YLEASQIRERAI-ERITNVIGTYNDLRLRIFSKYPDLFPE-------EVFN 235
+ R +L + R+R + + + + + + R +++ F+
Sbjct: 63 SVARLQYKPFLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFS 122
Query: 236 METFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEVETFLDYDKSSQGV 288
E + W+ G + +R G LVP D LNHS + K +
Sbjct: 123 REDYLWAAGTVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVY 182
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
T R Y+ GEQV I YG SN LL YGFV + NP DS L L
Sbjct: 183 RVTCLRSYEEGEQVMIHYGNWSNAGLLEHYGFVLED--NPLDSCMLWL 228
>gi|258569485|ref|XP_002543546.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903816|gb|EEP78217.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 480
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 90/407 (22%), Positives = 156/407 (38%), Gaps = 85/407 (20%)
Query: 92 KMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
++A + + RG+ VA + I + E+L +P +LV++ + K+ W
Sbjct: 42 RIADLRSNAAGRGVETVACEEIAQDEELFAIPENLVLSVQNSKLKDHLNFTDKELD--SW 99
Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAI 204
L +I E +SRWS+Y + LP +L++W++ EL R L+ S + R+ A
Sbjct: 100 LSLIVTMIYEYLHGGASRWSSYFAVLPTDFDTLMFWSQDEL-RELQGSSVLSKIGRQEAD 158
Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFKWSFGILFSRLVRL------- 253
E I + + YP LFP FN + K + L R+ L
Sbjct: 159 EMIMGKV-------YPLILDYPGLFPTPKELSSFNSQQGKEAILHLAHRMGTLIMAYAFD 211
Query: 254 ------------PSMDGRV---------ALVPWADMLN---HSCEVETFLDYDKSSQGVV 289
DG + +VP ADMLN H F + +
Sbjct: 212 IENEMDREEEDQDGEDGYITDNEQETAKGMVPLADMLNADAHRNNARLF----QEDGYFI 267
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
+ E++F YG+ +LL YG++ E +P D VE+ L
Sbjct: 268 MKSIVPISMEEEIFNDYGELPRADLLRRYGYIT-ENYSPYDVVEI-------------SL 313
Query: 350 EALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE-------------MAA 396
EA+ K C +++ E++ Y ++ P + E M
Sbjct: 314 EAICKVAGVEKNCPQLELLETA-EILEDGYSLLRPETDANLVEAISPELIVLLRTLTMTP 372
Query: 397 AASNKMTSKKDIKCPEIDEQALQFILDSCESSISKY-SRFLQVKELL 442
N+M K + P++D+ + + +++ +S + Y + Q ELL
Sbjct: 373 DNLNQMRVKGKLPKPQLDQASAKLLIEVLQSRQNDYPTTIAQDDELL 419
>gi|380477696|emb|CCF44010.1| SET domain-containing protein, partial [Colletotrichum
higginsianum]
Length = 448
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 93/197 (47%), Gaps = 16/197 (8%)
Query: 165 SSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR--LRI 221
S+ W+ Y+ LP R P L WT E D L+ + + +I + +++LR
Sbjct: 118 STPWTEYVKYLPPRVPVPTL-WTEQERD-MLQGTSLESATAAKIVALTDEFDELRETSST 175
Query: 222 FSKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
+ + +LF E E ++ + SR + LP A+VP D+ NHS E +
Sbjct: 176 LTFWNELFWESEKISLIDWVRVDAWFRSRCLELPK--SGEAMVPVLDLANHSSEANAY-- 231
Query: 281 YDKSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPL 336
Y+++ + V R + GE++ ISYG KS E+L SYGF+ + + +D + LPL
Sbjct: 232 YEENGKDEVVLLLRPGCRVSSGEEMTISYGDAKSGAEMLFSYGFI--DPVSAADRMTLPL 289
Query: 337 SLKKSDKCYKEKLEALR 353
+ D K KL +
Sbjct: 290 MPLEDDPLGKAKLHIFK 306
>gi|428182388|gb|EKX51249.1| hypothetical protein GUITHDRAFT_103166 [Guillardia theta CCMP2712]
Length = 386
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 84/195 (43%), Gaps = 29/195 (14%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW----PLLATYLIS 158
RGL A I E L +PP L+I + P EV K S+P + LA YLI
Sbjct: 148 RGLKANGTIHDHEILFRIPPKLLINVGTVSQDPNFAEVWK--SIPQFHKGLSGLAVYLIH 205
Query: 159 EASFEKSSRWSNYISALPRQ-PYSLLYWTRAE--------LDRYLEASQIRERAIERITN 209
E S KSS W Y+ ALPR P + Y R L E + IER +
Sbjct: 206 E-SLNKSSFWRPYLCALPRHVPLPIFYSERKFERERREKILKPLPEQVTRFDDLIERARD 264
Query: 210 VIGT-YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR----------LVRLPSMDG 258
V+ Y +L ++FS++P F + + W+ I+ SR L ++ +
Sbjct: 265 VLEVHYVELMPKLFSQFPLKFSPADYTYARWAWACSIIMSRTWGRKFKDDVLGKMTGENV 324
Query: 259 RVA--LVPWADMLNH 271
V LVP ADM NH
Sbjct: 325 SVVHTLVPAADMPNH 339
>gi|405119695|gb|AFR94467.1| nuclear protein [Cryptococcus neoformans var. grubii H99]
Length = 495
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 63/260 (24%), Positives = 108/260 (41%), Gaps = 55/260 (21%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVI---TAD-------SKWSCPEAGEVLKQCSVPDWPLLA 153
G VA+K+I +G L +P L++ T+D S+W G W L
Sbjct: 44 GAVAVKDIEEGTPLFHIPDDLILSPYTSDLKDHLDASEWDQLNKG----------WAQLI 93
Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
L+ E SRW+ Y++ +P + ++WT + ++ L + I +R I R +
Sbjct: 94 LVLMWETIKGSKSRWAGYLANMPVTFETPMFWTEQQREQ-LAGTDIADR-IGR-EDAEAE 150
Query: 214 YNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMD-GR----------- 259
Y + +PDLFP + ++ F + SR +P GR
Sbjct: 151 YTSVLAPFIKAHPDLFPIDSPHITIDAFHIQGSRILSRSFTVPLHRFGRSQSQTRSDSNS 210
Query: 260 ------------VALVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQPGEQ 301
V ++P+ADMLN + ++D D + ++GVV + + + EQ
Sbjct: 211 EMEGDGEEEEEVVVMIPFADMLNAAWGKDNAHLYIDEDTNEGFNEGVVMKSIQLVKQSEQ 270
Query: 302 VFISYGKKSNGELLLSYGFV 321
++ +Y N ELL YG +
Sbjct: 271 IYNTYDSPPNSELLRKYGHI 290
>gi|336262426|ref|XP_003345997.1| hypothetical protein SMAC_06551 [Sordaria macrospora k-hell]
gi|380089590|emb|CCC12472.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 482
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 90/200 (45%), Gaps = 18/200 (9%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFS 223
S+ W+ YI LP+ WT E R L E A+ ++T + ++ +R S
Sbjct: 111 SNPWTEYIKFLPKTVLVPTLWTEDE--RLLLRGTSLESAVNAKMTALTAEFDAVR-EAAS 167
Query: 224 KYPD----LFPEEVFN----METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
P L+P E N + + + SR++ LP ++VP DM+NHS
Sbjct: 168 SLPTWNDVLWPFENGNSPASLRNWILLDALYRSRVLELPK--SGESMVPCIDMINHSTCA 225
Query: 276 ETFLDYDKSSQGVVF-TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
+ D + + ++ DR G++V ISYG K E+L SYGF+ E T +S+
Sbjct: 226 SAYYDENTKDEVILLPRPDRTISSGKEVTISYGDAKPAAEMLFSYGFIDPETT--VESLV 283
Query: 334 LPLSLKKSDKCYKEKLEALR 353
LPL D K KL A +
Sbjct: 284 LPLEPFGDDPLEKAKLFAFK 303
>gi|119493213|ref|XP_001263813.1| hypothetical protein NFIA_070870 [Neosartorya fischeri NRRL 181]
gi|119411973|gb|EAW21916.1| hypothetical protein NFIA_070870 [Neosartorya fischeri NRRL 181]
Length = 362
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 19/122 (15%)
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------DGRVALVPWADMLNHS 272
+ S +PD+ + ET+ +++ I+ +R +P + +AL+P+AD NHS
Sbjct: 124 VVSVFPDV------DWETYSYNWLIVNTRSFYYLMPGQKPPEDRNDAMALLPFADYFNHS 177
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
+VE + +D Q VF R+Y GE++++SYG N LL+ YGF E N SD++
Sbjct: 178 -DVEDDVKFD--GQKYVFRATRRYDSGEEIYMSYGPHPNDFLLVEYGFYLDE--NGSDAI 232
Query: 333 EL 334
L
Sbjct: 233 YL 234
>gi|444320075|ref|XP_004180694.1| hypothetical protein TBLA_0E01160 [Tetrapisispora blattae CBS 6284]
gi|387513737|emb|CCH61175.1| hypothetical protein TBLA_0E01160 [Tetrapisispora blattae CBS 6284]
Length = 615
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 8/109 (7%)
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLL 316
V L P D+LNHS + + + + + F T + +++F +YG KS + LL
Sbjct: 244 NSVFLFPIVDLLNHSNNSNVIWNLNPNDKNSICFNTIDPIEKSQELFNNYGNKSTEDFLL 303
Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
SYGF+ +E T P D L L L KS ++ L+ +GL ++ F +
Sbjct: 304 SYGFILKEET-PFDYASLTLRLDKS------IIQNLKNFGLGLNDNFIV 345
>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
Length = 456
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 72/310 (23%), Positives = 125/310 (40%), Gaps = 36/310 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
L+KWL + RGL++ ++++G+ ++ +P S ++TAD+ G
Sbjct: 36 LRKWLKARKFQDSHLVPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTADTVIR-SYLG 94
Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
+ + P PLLA T+L+SE S W Y+ LP+ Y+ E+ L
Sbjct: 95 AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
S ++ +A E+ +V + R FS LF E V F+ W++ + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYL 211
Query: 254 -PSMDGRVALVPWADMLNHSC-----------------------EVETFLDYDKSSQGVV 289
P ++ P L +C + +++ +
Sbjct: 212 RPRQRECLSAEPDTCALAPACLCPFSPLCSSGLQNLMLHPLSGLSQQVKAAFNEETHSYE 271
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYK 346
T +++ E+VFI YG N L L YGFV + V + +K DK
Sbjct: 272 IRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMD 331
Query: 347 EKLEALRKYG 356
+K+ L+ +G
Sbjct: 332 KKISILKDHG 341
>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
Length = 667
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 9/145 (6%)
Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHS-CEVETFLD--YDKSSQGVVFTTDRQYQ 297
W+ + +R +P + AL+P DM NH+ + T + D+S V + ++
Sbjct: 432 WAVSTVMTRQNTIPFQEDYYALIPLWDMCNHTNGTISTAYNPVLDRSECLAV----KNFK 487
Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
GEQ+FI YG +SN +L + GFV N D + L + KSD +++ L K +
Sbjct: 488 AGEQLFIFYGSRSNADLFVHNGFVFE--NNDYDVYWIRLGISKSDPLQQKRGHLLGKLSI 545
Query: 358 SASECFPIQITGWPLELMAYAYLVV 382
+++ F I+ P++ A+L V
Sbjct: 546 ASTCDFSIRKGASPIDGQLLAFLRV 570
>gi|115492035|ref|XP_001210645.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114197505|gb|EAU39205.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 514
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 7/119 (5%)
Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
+++ +K+ + SRLV LP A+VP DM NH+ + Y++ ++G
Sbjct: 196 LDIDDWKYVDALYRSRLVDLPRSGH--AMVPCVDMANHASDDTVKALYEEDAEGNALLQL 253
Query: 294 RQYQ---PGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
R+ Q PG++V ISYG +K E+L SYGF+P E + V L LS+ D K
Sbjct: 254 REGQVLHPGDEVTISYGSEKPAAEMLFSYGFLP-EDKEDAGQVFLDLSIPDDDPLRNHK 311
>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 484
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 108/280 (38%), Gaps = 47/280 (16%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT--------------ADSKWSCPEAG----EVLKQC 144
RG+VAL++I L +P +I +D + P+ G + L
Sbjct: 40 RGIVALRDIAPETVLFTIPRQSIINVETSGLRSQLPQLFSDEEGLAPQHGVADDDPLSSS 99
Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
+ W L L+ E +S W Y+ LP + ++WT AEL + +
Sbjct: 100 PLDAWGALILVLLYEHLRGAASAWRPYLDVLPATFETPMFWTGAELGALQAGATAGKVGR 159
Query: 205 ERITNVIGTYNDLRLRIFSKYPDLF------PEEVFNMETFKWSFGIL------------ 246
E + T+ + L + +PD+F +E + I+
Sbjct: 160 ESAED---TFRGILLPVVRAHPDVFQGSAALSDEALVALAHRMGSTIMAYAFDLENDEER 216
Query: 247 ---FSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+ DG+ + +VP AD+LN E +++ + + T R + GE+
Sbjct: 217 EDEEDEDGWVEDRDGKAMMGMVPMADILNADAEFNAHVNHGDNE--LTVTALRPIKAGEE 274
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
+ YG N ELL YG+V E + D VE+P L ++
Sbjct: 275 ILNYYGPHPNSELLRRYGYV-TERHSRYDVVEIPWELVEA 313
>gi|45552859|ref|NP_995955.1| CG33230 [Drosophila melanogaster]
gi|45445739|gb|AAS64931.1| CG33230 [Drosophila melanogaster]
gi|223364426|gb|ACM86246.1| MIP03820p [Drosophila melanogaster]
Length = 446
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 98/215 (45%), Gaps = 37/215 (17%)
Query: 151 LLATYLISEASFEK------SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
L+A Y++ ++ SS ++ Y+ LPR + + + EL E+ + ER +
Sbjct: 105 LIACYILYHKHLQECTLGTHSSTYAAYLDTLPRCYSTPYFCSIPELQCLPES--LLERTV 162
Query: 205 ERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPSM----- 256
+ + G + ++ I K + +E++ + FKW++ + +R V L S
Sbjct: 163 AQNRQIRGYFEIIK-NIVHKCDCCGKSYGQEIWTLADFKWAYFSVNTRSVHLSSRFLKKQ 221
Query: 257 ----------DGRVALVPWADMLNHSCEVETFL-----DYDKSSQGVVFTTDRQYQPGEQ 301
D +AL P+ D+ NHS +VE DY + + + F+ + P +Q
Sbjct: 222 SNYFQPLISGDTNLALAPFLDLFNHSDQVEITAGIEGPDYVLTLKSLPFSETK---PYDQ 278
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
+FISYG N +LL YGF RE N D E+ L
Sbjct: 279 LFISYGALPNFKLLTEYGFWLRE--NKHDYFEVSL 311
>gi|213408453|ref|XP_002174997.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
gi|212003044|gb|EEB08704.1| SET domain-containing protein [Schizosaccharomyces japonicus
yFS275]
Length = 441
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 65/267 (24%), Positives = 102/267 (38%), Gaps = 57/267 (21%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS----VPDWPLLATYLISE 159
G+VA+ NI+ E ++F P V+ +G L+ +P+W L +++E
Sbjct: 39 GIVAVDNIKADETVVFFPKDSVMKV--------SGSYLQHLEGIEELPNWAALLLLMMNE 90
Query: 160 ASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
+ S W YIS P + SL YW AE + L S + E +R + V + +
Sbjct: 91 KN-NPESFWKPYISVFPTKERITSLFYWD-AEKQKRLLKSTVLENMQDR-SEVKTVWKET 147
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFS-----RLVRLPSMDGRVA----------- 261
L K E +E F+ ++ S + ++ + D + A
Sbjct: 148 VLPFIDKNKSKL-REGLTLEDFEHMAAVMSSYSFDVKRIKTENNDSQKASKQMDVDNSEH 206
Query: 262 ----------------------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
+ P ADM N E+ YD G R + G
Sbjct: 207 SENNEDDSDLESEYDPEVFEKAMCPIADMFNGDDELCNVRMYD-LEDGYHMMVTRDIEKG 265
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGT 326
EQ++ +YG NGELL YGF +GT
Sbjct: 266 EQLWNTYGDIDNGELLRKYGFTKPDGT 292
>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 252
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 5/114 (4%)
Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
S+ R ALVP +L + + L D S V DR Y+ GE + I G ++N
Sbjct: 17 SLARRFALVPLGPPLLTYKSNCKAMLTVDGES--VRLVVDRPYKAGEPIIIWCGPQTNSR 74
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
L+L+YGFV + NP D + + SL D Y+EK ++ G A + F + +
Sbjct: 75 LVLNYGFV--DENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYV 126
>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
Length = 499
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 126/333 (37%), Gaps = 68/333 (20%)
Query: 73 LENASTLQKWLSDS---GLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSL---V 125
L+N WL++S L P+ D + R ++A ++I+ E L +P V
Sbjct: 7 LKNTENFHSWLTNSVGYKLSPKIKIADGRDTNQGRFILATEDIKTDELLFEIPRESILNV 66
Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS-SRWSNYISALPRQP--YSL 182
+T+ P +L V W L ++ E +K+ S+W+ Y LP SL
Sbjct: 67 LTSSLVSEYPAWENILLDGDVGHWEGLIICMLFEIKVKKNMSKWAPYFDVLPESTDLNSL 126
Query: 183 LYWTRAEL---------DRYLE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDL-FP 230
+YWT EL DR A Q+ E+ +E I R F K +
Sbjct: 127 MYWTAEELEALKPSLVLDRIGNDGAHQMHEKVMELI------------RTFEKDHSVDLS 174
Query: 231 EEVFNMETFKWSFGIL--FSRLVRLPSMDGRV----------------------ALVPWA 266
E F + I+ +S V LP +++P A
Sbjct: 175 FGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEVEEDVEQTVRNEGSLKSMIPLA 234
Query: 267 DMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
D LN + C D D + + GEQV+ YG N E+L YG+V
Sbjct: 235 DTLNSDTNKCNAHLIYDEDSLKMRAI----SNIKAGEQVYNIYGNHPNAEILRRYGYVEW 290
Query: 324 EGTNPSDSVELPLS--LKKSDKCYKEKLEALRK 354
EG+ D ELPL ++ + Y +E +RK
Sbjct: 291 EGSK-YDFGELPLEVIIETLHEQYDIPIEKIRK 322
>gi|298712711|emb|CBJ48736.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 1030
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 68/267 (25%), Positives = 111/267 (41%), Gaps = 34/267 (12%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
W P +K+ + G R GLVA + I +G+ L VP S+++ A + PE
Sbjct: 486 FNAWTRTVDFPIRKVEAGLIGNGMRLGLVATEAIPQGQTYLSVPGSIILDASKARTDPEL 545
Query: 138 GEVLKQCSVPDWPL---------LATYLISEASFEKS--SRWSNYISALPR----QPYSL 182
G L + PL L LI+E +F ++ S W+ Y+ LP + Y
Sbjct: 546 GPPLARLEASLGPLGLWDQDTDSLRVLLIAE-TFVRADLSPWAPYLRLLPTLSEMEAYHP 604
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYND---LRLRIFSKYPDLFPEEVFNMETF 239
L++ A + + E S ++ ER + + + + + D+ +E +
Sbjct: 605 LFFDNATIASF-EGSDVQASLRERRDSEMAGFTTKFAVEGSAGRELQDVLGVGWITLERY 663
Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV------ETFLDYDKSSQGVVFTTD 293
W+ I+ SR + GR LVP D++N + + ET D D S+ V
Sbjct: 664 LWAAAIVDSRCIW---WGGRKHLVPLLDLVNDARDSPLDFVHETLQDSDGSA---VTAAA 717
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGF 320
R G+QV YG N L+ +GF
Sbjct: 718 RNVDKGDQVMEDYGHP-NHVLIFEHGF 743
>gi|440636605|gb|ELR06524.1| hypothetical protein GMDG_02159 [Geomyces destructans 20631-21]
Length = 682
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/185 (29%), Positives = 81/185 (43%), Gaps = 33/185 (17%)
Query: 155 YLISEASFEKSSRWSNYISALPR--QPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
+LI + SS W+ YI +LP+ +P+ L LY+ E R+L + + +R
Sbjct: 137 FLIQQYLRGSSSHWAPYIRSLPQPDEPHKLATPLYYPE-EARRWLGGTNLPAAIAQREGM 195
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-------L 262
G + SK L P+EV+ +L + PS ++A L
Sbjct: 196 WRGDF-------VSK---LIPDEVYG--------DVLDQPVDGYPSWREKIAEEGPYPVL 237
Query: 263 VPWADMLNHSCE--VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
P D+ NH + VE F++ + TD GEQVF +Y K N ELLL YGF
Sbjct: 238 FPLLDIANHDAKAWVEWFVNAQGPVKDFSIITDAAIGEGEQVFNNYAPKGNTELLLGYGF 297
Query: 321 VPREG 325
+ R G
Sbjct: 298 LRRRG 302
>gi|432119396|gb|ELK38474.1| N-lysine methyltransferase SETD6 [Myotis davidii]
Length = 322
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 84/187 (44%), Gaps = 21/187 (11%)
Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
++W E R L+ + + E + + N+ Y+ + L +PDLF V ++E ++
Sbjct: 1 MFWPEGERRRLLQGTGVPEAVEKDLANIRSEYHSVVLPFMEAHPDLFSPRVRSLELYQQL 60
Query: 243 FGIL--FSRLVR------LPSMDGRVA-----LVPWADMLNHSCEVETFLDYDKSSQGVV 289
++ +S+++ L D +VP AD+LNH + L+Y + +V
Sbjct: 61 VALVMAYSQVLSGSFQEPLEEEDDEKEPNPPLMVPAADILNHVAKHNANLEYSPNCLQMV 120
Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYK 346
T QP G ++F +YG+ +N +L+ YGF N D+ ++ + +
Sbjct: 121 AT-----QPIPKGREIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVRKAALQG 175
Query: 347 EKLEALR 353
K EA R
Sbjct: 176 TKDEAER 182
>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
Length = 321
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 111/260 (42%), Gaps = 32/260 (12%)
Query: 74 ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
E+ + KW S+ G+ K+ + KV G +G+ + +R+GE ++ P +L + +
Sbjct: 69 EDWTAFVKWFRSNGGIISSKLTV-KVRNGRQGVYFKERMRRGETIVSFPRNLRLDEKTAM 127
Query: 133 SCPEAGEVLKQCS----VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
+AG V ++ PD ++ +++ E K S W Y L RQ ++++ T
Sbjct: 128 KG-KAGHVFQRLKQDKCYPDL-MVILHVVHEDKLGKDSFWFPYFKLLRRQYNNIMFLTEP 185
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF---------PEEVFNMETF 239
++ L R E N+ + R F+ + + + PE F +
Sbjct: 186 QMKTLL-----RRPGCENTYNL----GVMMRRTFNNFYEWYKKNIEPWAPPEFQFTRDEI 236
Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
W F L + + +G +VP++D+ NH E K+S + ++YQ G
Sbjct: 237 LWGFNTLVTCGWGQQNGNGDKLMVPFSDIPNHRRESA-----QKASNRGFISAAKEYQAG 291
Query: 300 EQVFISYGKKSNGELLLSYG 319
E++ YG N +L YG
Sbjct: 292 EELTFDYG-LLNDAVLAYYG 310
>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
[Ostreococcus tauri]
Length = 394
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 66/268 (24%), Positives = 117/268 (43%), Gaps = 22/268 (8%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPE-AGEVLKQCS-VPDWPLLATYLIS 158
R + A++ + GE + VP ++ + + S P E+LKQ + + D ++ +L +
Sbjct: 2 REVRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTA 61
Query: 159 EA-SFE-----KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
+FE W+ + ALP + S L W +L + + R E +V
Sbjct: 62 ALDAFECGDASAYEEWAPALRALPTRASSSLAWNADDLG-AVAGEDLANRLREYRRSVKV 120
Query: 213 TYNDLRLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRLPSMDG---RVALVPWADM 268
Y+ L + + P+ FP F + F+ ++ I S +++ D R +VP +
Sbjct: 121 QYDALFPALCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFL 180
Query: 269 LNHSCEVET--FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
NHS + + ++ ++ R GE + ISYG+ N +LL+ YGF
Sbjct: 181 CNHSLSAHSVRYTSLERGTKAFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLE--N 238
Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRK 354
NP D V L +++ +LEALR
Sbjct: 239 NPYDRVSLHSITGDANET---QLEALRH 263
>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
heterostrophus C5]
Length = 476
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 74/293 (25%), Positives = 117/293 (39%), Gaps = 44/293 (15%)
Query: 82 WLSDSG--LPP--QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
WL +G + P Q ++ D G RG+ A ++I + E L +P S +++ ++ E
Sbjct: 14 WLKHTGAQINPKIQLEDLRAKDAG-RGVAAKQDIAEHELLFSIPRSSILSVENSILSTEI 72
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
P W L ++ E +S W+ Y + LP +L++WT EL L+AS
Sbjct: 73 PPTTFALLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELTE-LQAS 130
Query: 198 QI---------RERAIERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFKWSFG 244
+ E IE++ VI + D+ + DL E E + S
Sbjct: 131 AVVNKIGKEGANEVFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLI 190
Query: 245 ILFSRLVRLPSMDGRV----------------ALVPWADMLNHS---CEVETFLDYDKSS 285
+ ++ V D V +VP ADMLN C F + D
Sbjct: 191 MAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADADRCNARLFYEKD--- 247
Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
G+ + Q GE++F YG +LL YG++ E D VE+P L
Sbjct: 248 -GLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYIT-ENYAQYDVVEIPADL 298
>gi|308804211|ref|XP_003079418.1| N-methyltransferase (ISS) [Ostreococcus tauri]
gi|116057873|emb|CAL54076.1| N-methyltransferase (ISS), partial [Ostreococcus tauri]
Length = 305
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 13/123 (10%)
Query: 261 ALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
+VP+ DMLNH+ C L++D +G+ T R+ + GE+VF +YG N ELL Y
Sbjct: 171 GMVPFWDMLNHAPPCAASVRLNHD-PKRGLQMITVREVKKGEEVFNTYGPLRNAELLRRY 229
Query: 319 GFV----PREGTNPSDSVELPLSLKKSDKCYKE---KLEALRKYGLSASEC---FPIQIT 368
GFV P GT + ++ + Y+E +L L GL+ E F + T
Sbjct: 230 GFVLARNPHGGTTVGLDEVIEAAMMANPDLYEELPLRLAWLESRGLADEELSTRFFVHQT 289
Query: 369 GWP 371
G P
Sbjct: 290 GRP 292
>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
Length = 281
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 5/114 (4%)
Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
S+ R ALVP +L + + L D S V DR Y+ GE + I G ++N
Sbjct: 17 SLARRFALVPLGPPLLTYKSNCKAMLTVDGES--VRLVVDRPYKAGEPIIIWCGPQTNSR 74
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
L+L+YGFV + NP D + + SL D Y+EK ++ G A + F + +
Sbjct: 75 LVLNYGFV--DENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYV 126
>gi|384501024|gb|EIE91515.1| hypothetical protein RO3G_16226 [Rhizopus delemar RA 99-880]
Length = 354
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 115/254 (45%), Gaps = 41/254 (16%)
Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRER- 202
D +L +LI F ++++W Y+ LP Q +L+ LE S IR +
Sbjct: 6 DRTILCLFLIYYRFFNENTKWKPYMDILPTLEFFQKTHVLFNPGTVKGTCLENS-IRSKI 64
Query: 203 -AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
++ER I Y R+ ++ + W+ ++SR+V + + +A
Sbjct: 65 SSLERELEEINQYWPTRIE---------------LDMYLWADCTVWSRVVGIT--ETEIA 107
Query: 262 LVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQP-GEQVFISYGKKSNGELLLSYG 319
LVP+ D+ NHS E+ + ++ +G++ T + + E++ + YG KSN ELL +G
Sbjct: 108 LVPYFDLANHSLN-ESNIKWELTDDEGLMLVTTKDIKSQDEELTLFYGSKSNQELLFLHG 166
Query: 320 FVPREGTNPSDS-VELPLS--LKKSDKCYKEKLEALRKYGLS---------ASECFPIQI 367
F ++ NP S + +PL L SD K++ L+ G S P+
Sbjct: 167 FCIQD--NPETSRITIPLMPFLDLSDPVDISKIQWLKSVGAKPILTLMGSRTSNLDPLVA 224
Query: 368 TGWPLELMAYAYLV 381
GW ++ +A YL+
Sbjct: 225 DGWTVDSVAALYLI 238
>gi|281208586|gb|EFA82762.1| hypothetical protein PPL_04457 [Polysphondylium pallidum PN500]
Length = 534
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 66/310 (21%), Positives = 126/310 (40%), Gaps = 63/310 (20%)
Query: 81 KWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
+W ++G+ + I + + + + A+ +I G L+ VP SL+I + P + +
Sbjct: 3 EWSKENGIVWNDHLEIYQDNTSGQSVRAINDIAAGSLLVSVPESLLIHINK----PVSEQ 58
Query: 140 VLK-QCSVPDWPLLATY--LISEASFEKS----SRWSNYISALPRQPYSLLYWTRAELDR 192
+L C LL LI + EK+ S+W Y++ +P+Q + +T E++
Sbjct: 59 LLSLSCIDKVLDLLDNVQRLIFHINVEKAIGEKSKWYRYLNDIPQQYDTSSMYTDEEIED 118
Query: 193 YLEASQIRERAIERITNVIGTYNDL----------------------------RLRIFSK 224
L +E A + ++ +YN + F+
Sbjct: 119 -LTYPYYKEEAYKLKHELLQSYNSFIDIIDNHFNIDNNNNSNISNNNNNNNNNSIDSFTL 177
Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL-------------------VPW 265
++ + + N++ FKW++ ++ +R + + +P
Sbjct: 178 LKEI-KQNLTNLDNFKWAWAVIQTRTYYFNNNNYVNNNNNNNKRKYSNSNNNNNNSSIPM 236
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
AD+ NH +V T ++ + T +++ EQV+ISYG SN LL YGF
Sbjct: 237 ADLFNHRYDVVTRAVFNDEERCFQVFTGTEFKKNEQVYISYGNHSNATLLHFYGFAI--D 294
Query: 326 TNPSDSVELP 335
NP DS+ +P
Sbjct: 295 NNPLDSIVIP 304
>gi|322696758|gb|EFY88546.1| 2-hydroxyacid dehydrogenase, putative [Metarhizium acridum CQMa
102]
Length = 1025
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 109/278 (39%), Gaps = 44/278 (15%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
+E L W G+ +A Q++ G VA ++I+ G+ L+ +P ++ DS
Sbjct: 427 MEGLEPLINWARTRGVELDGVAPQQMPGRGIGAVATRSIKAGQVLMTIPARAILRLDSVL 486
Query: 133 SC-----PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
+ P A + LLA L + + E + R S + L+W R
Sbjct: 487 ASISSRLPSASSIHG--------LLAAQLAASSDAETTLRRDAMPSLQSFAATTPLFWHR 538
Query: 188 AELDRYLEASQIR-----ERAIERITNVI-----GTYNDLRLRI-FSKYPDLFPEEVFNM 236
L L A R E A+ER G D LR F F E
Sbjct: 539 -RLQDLLPAGARRLVDRQEAALERDWAAFHEAFPGVARDAYLRCWFLVGTRAFYHETDAT 597
Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
+ W + R+AL+P ADM NH+ + + S + T R
Sbjct: 598 LLYPW---------------EDRLALLPVADMFNHAGVPGCSVAF--SPEAYTVTATRAC 640
Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
G++VF+SYG+ SN LL YGF+ + N DSV+L
Sbjct: 641 ARGDEVFLSYGEHSNDFLLAEYGFLLDD--NQWDSVDL 676
>gi|255723006|ref|XP_002546437.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240130954|gb|EER30516.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 578
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 136/327 (41%), Gaps = 60/327 (18%)
Query: 73 LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL--LFVPPSLVITA 128
LE ++L KW +G L P + +++ G + I + + +P L+IT
Sbjct: 4 LEKINSLVKWAESNGAELSPD-VQFKEITTNNIGAIYDGKIAPSDNGYPISIPFKLIITT 62
Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----QPYSLL 183
+ + E G+ LK + + + + S + Y+ LP PY+
Sbjct: 63 QN--AITEFGKYLKSTEDKNSNAILKFYLCHERINADSFYHPYLKLLPSLAAIDSPYT-- 118
Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTY----NDLRLRI---------------FSK 224
W+ A+ YL+ + + E + +++ + N L+ + F
Sbjct: 119 -WS-AQDKTYLKGTNLGNSLKENLGSLVEEWWEVINLLKDEVSKPEQHFINMKFYYDFKF 176
Query: 225 YPD------LFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPWADMLN 270
Y D L E++ N +F W+ IL SR L+ L+P D+LN
Sbjct: 177 YTDDDLDKYLNEEDINNWTSFPNYLWASLILKSRSFPAYLIDKSCNKNDAMLLPVVDLLN 236
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
H+ + + D S G QPG+++F +YG K N ELLL+YGF EG NP D
Sbjct: 237 HNPQAKVNWD---VSDGFFRFKSESIQPGKEIFNNYGLKGNEELLLAYGFCI-EG-NPRD 291
Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGL 357
SV L + K +EKL+ + + G+
Sbjct: 292 SVALKI------KMPEEKLKEIEEQGI 312
>gi|313239201|emb|CBY14158.1| unnamed protein product [Oikopleura dioica]
Length = 393
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 60/270 (22%), Positives = 114/270 (42%), Gaps = 39/270 (14%)
Query: 92 KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCS-- 145
K+ I D G RG+ + I + E L+ VP ++T E A +VL+ S
Sbjct: 22 KLKISDGDCG-RGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTC 80
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
+ LL L E E++S+WS ++S++P Q ++ EL+ ++ + +
Sbjct: 81 LNGTDLLVCALFLELENEENSKWSAFLSSIPNQLWNPFMLDEKELNLLTAKCRLPSKCFK 140
Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMD--- 257
+ +++I +++ E+ N E W F ++ SR R + +
Sbjct: 141 Q-----------KIKISTEFLKALGFEI-NEEILNWCFSVVLSRSFGGSSERCETRNHFK 188
Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKS 310
L P D++NH E +++ + ++ G+++F++YG KS
Sbjct: 189 IEIDNSANFCLCPAIDLINHEKEYNCEYRWNEDKTAFQVFSRKKILQGQELFVNYGTTKS 248
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKK 340
E+ YGFV PSD ++ L++
Sbjct: 249 EYEIYNFYGFVL-----PSDDFQVEFELQR 273
>gi|428170888|gb|EKX39809.1| hypothetical protein GUITHDRAFT_114060 [Guillardia theta CCMP2712]
Length = 476
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 104/254 (40%), Gaps = 42/254 (16%)
Query: 103 RGLVALKNIRKGEKLLFVP--PSLVITAD---SKWSCPEAGEVLK--QCSVPDWPLLATY 155
RG+ AL+ + +GE ++ + S I+ D S E + L+ + + D LL Y
Sbjct: 28 RGIWALEQVEEGEVVMRLANSASFRISGDFIRQSSSLAEGVQALEASEGRLADDLLLTLY 87
Query: 156 LISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
L S E S +S Y+ +LP +P ++W AEL + A+ Y
Sbjct: 88 LASSRKKEGSFPFSEYVRSLPLEKPDLPIFWDSAELQTLPRMTMTLVEAMRE------EY 141
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA----------LVP 264
D +I K + +V + E KW + ++ SR +R + G P
Sbjct: 142 EDYSSKI-RKVTEALQVQVDD-EDVKWGYAMVKSRALREKVVKGSSGQDVPETLERYCFP 199
Query: 265 WADMLNHSCEV----------ETFLDYDKSSQGVV------FTTDRQYQPGEQVFISYGK 308
ADM NH + + S +G V FT R G +V +YG+
Sbjct: 200 LADMFNHEPSAVPPPASDLLRQADVHRGPSVRGSVVGDHFEFTATRAIPAGSEVSWTYGQ 259
Query: 309 KSNGELLLSYGFVP 322
+N ELLL YGFVP
Sbjct: 260 LTNEELLLRYGFVP 273
>gi|328873307|gb|EGG21674.1| SET domain-containing protein [Dictyostelium fasciculatum]
Length = 514
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 76/158 (48%), Gaps = 15/158 (9%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
+ KWLSD+G K A KV + GLVAL +I +G++ + VP L +T ++
Sbjct: 65 SFTKWLSDNGC---KEAFDKVKIVRGLTEGSGLVALGDIGEGDEFIAVPSKLFMTQET-- 119
Query: 133 SCPEAGE-VLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
+ GE V+++ P LL +LI E S W+ YI LPR ++L +T +
Sbjct: 120 AIKSIGEKVIREPLFRYIPSLLLTVHLIQEQLIMPKSFWAPYIRMLPRTYRTILQFTMDD 179
Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
R L S + E AI N + Y L F+K PD
Sbjct: 180 F-RALLGSAVLEEAISTYRNTLRQYCFL-YDFFNKTPD 215
>gi|312101598|ref|XP_003149686.1| hypothetical protein LOAG_14135 [Loa loa]
Length = 314
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 70/275 (25%), Positives = 108/275 (39%), Gaps = 43/275 (15%)
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
S W YI LP + L++T +L + L S + E ++ NV + L I
Sbjct: 25 SHWQPYIKVLPECFDTPLFFTVEQL-QCLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 83
Query: 223 ----------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR----- 259
K +L P V F ++WS + +R+ +PS +
Sbjct: 84 EFRHRKKKSKDKISELEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVWKDDIGQ 143
Query: 260 ----VALVPWADMLNHSCEVETF---LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
L+P+ DM NHS F + + R Y+P E V I YG +SN
Sbjct: 144 PRMIPGLIPFLDMANHSYTESAFHEAVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNR 203
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGW- 370
+ LL GF+P E N D +L + L KS + ++ G A S F +I+
Sbjct: 204 DFLLHNGFIPLE-KNIRDIYKLKIGLPKSKR-EDARMRLFHALGFVAESTIFAFEISVCE 261
Query: 371 -----PLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
L A Y++ PS + EE A+++ N
Sbjct: 262 PYFHDSLFRFAQIYILDEVPSAAEQVEEAASSSDN 296
>gi|302422352|ref|XP_003009006.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261352152|gb|EEY14580.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 485
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 89/196 (45%), Gaps = 20/196 (10%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
S+ W+ Y+ LPR+ W+ E D +L+ + + +I + + LR K
Sbjct: 116 STPWTEYVKFLPREVPVPTMWSEQERD-FLQGTSLELAVSAKIQALTNEFEALR----EK 170
Query: 225 YPDL-FPEEVF----NMETFKWSFGILF--SRLVRLPSMDGRVALVPWADMLNHSCEVET 277
DL F +F N+ W + SR + LP + V++VP D+ NH+
Sbjct: 171 SSDLPFWNAIFWDKNNVILADWFLVDAWYRSRSLELPGVG--VSMVPVLDLANHAPTPNA 228
Query: 278 FLDYDKSSQG---VVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
+ + +G ++ G++V ISYG KS E+L SYGF+ + +D+V
Sbjct: 229 YYEESARREGDVELLLRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFI--DPARSTDTVA 286
Query: 334 LPLSLKKSDKCYKEKL 349
LPL+ + D K K+
Sbjct: 287 LPLAPLEDDPLSKAKV 302
>gi|71652808|ref|XP_815053.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70880079|gb|EAN93202.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 572
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 7/139 (5%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
A L YL+ + R + + N + + L F P EE +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSTLSFFQTMPHSDCEERSMPITLEQFLWA 256
Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ L +R +L+PW D N++ + YD+ VF + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDERRGAYVFEVLFPIESGEQI 314
Query: 303 FISYGKKSNGELLLSYGFV 321
F+ YG ++ ELLL YGF
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333
>gi|310800174|gb|EFQ35067.1| SET domain-containing protein [Glomerella graminicola M1.001]
Length = 485
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 88/191 (46%), Gaps = 18/191 (9%)
Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP- 226
W+ YI LP Q WT E + L + + +I + +++LR + S P
Sbjct: 121 WTEYIKYLPPQVPVTTLWTVRE-RQMLNGTSLESATAAKIVALSDEFDELR-EVSSSLPL 178
Query: 227 --DLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
+LF E + W SR + LP A+VP D+ NHS + + Y+
Sbjct: 179 WNELFWES-GKVSLIDWVRVDAWFRSRCLELPK--SGEAMVPVLDLANHSSKANAY--YE 233
Query: 283 KSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSL 338
++S+ V R + GE++ ISYG KS E+L SYGF+ + + +D + LPL+
Sbjct: 234 QNSKDEVVLLLRPGCRVSSGEEMTISYGDAKSGAEMLFSYGFI--DPASAADRITLPLTP 291
Query: 339 KKSDKCYKEKL 349
+ D K KL
Sbjct: 292 LEDDPLGKAKL 302
>gi|238880307|gb|EEQ43945.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 579
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 79/332 (23%), Positives = 143/332 (43%), Gaps = 71/332 (21%)
Query: 73 LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL-------LFVPPS 123
+E+ + L KW +G + P V+ E + I KG K+ + +P
Sbjct: 4 IESINKLLKWAESNGAQISPD------VEFKEISKNYIGAIYKGNKVPDSPFCPISIPSK 57
Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----Q 178
L+IT + + E + LK + D +L +L E +S + Y++ LP
Sbjct: 58 LIITPQTAFK--EFSKSLKNTDINDNSILKLHLCHE-RLNGNSFFYPYLNLLPSLSEIDS 114
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVI--------GTYNDL--------RLRIF 222
PY+ W+ A YL+ + + + + ++ ++DL ++ +
Sbjct: 115 PYT---WS-ANDKSYLQGTNLGNSLKKNLVTLVEEWWKAINALHDDLPKPEQHYINMKFY 170
Query: 223 SKYP---------DLFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPW 265
+Y L E + N +F W+ IL SR L+ + L+P
Sbjct: 171 YEYKFYTDDDLNKYLNDENIENWTSFPNYLWASLILKSRSFPAYLIDKNNKQDSAMLLPV 230
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
D+LNH+ + + + +D S F+++ PG+++F +YG K N ELLL+YGF
Sbjct: 231 VDLLNHNSKSK--VHWDVSDNYFKFSSE-SIVPGKEIFNNYGLKGNEELLLAYGFCIE-- 285
Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
N DSV L + + +EK++A+ +YG+
Sbjct: 286 NNSQDSVALKIKMP------EEKIKAIEEYGI 311
>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
Length = 331
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 56/256 (21%), Positives = 114/256 (44%), Gaps = 30/256 (11%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-------- 130
L +W G+ + + I ++ G G+VA + I + ++ +P L I ++
Sbjct: 4 LLQWFESEGIHTESIKIAELTHGCNGVVATQPIPSDQIVIKIPLHLCIFSEDLLKNHYQR 63
Query: 131 -KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPRQPYSLLYWTR 187
K P + ++ L Y++ + E S + +Y+ + P ++L WT+
Sbjct: 64 YKKFYPHIFNI-NLNEDAEFNSLVLYILQQRDNEMSLHKPYFDYV----KDPQNILSWTQ 118
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFG 244
+++ ++ + ++ I+R+ +G L+L F ++ F E+ N + F +++
Sbjct: 119 EQVNTIMDEN--LKKTIQRMR--VG----LQLN-FVRFVTFFKEQFKKGLNYDQFLYAYQ 169
Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+ +R LVP+ DMLNH + +T +VF T +Q Q E+++
Sbjct: 170 FVMTRCFGGDDHLQSPCLVPFGDMLNHHDKCQT--KQKIIGTDLVFITTKQIQENEEIYN 227
Query: 305 SYGKKSNGELLLSYGF 320
+G+ N LL YGF
Sbjct: 228 FFGEHGNSFLLCWYGF 243
>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
Length = 496
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 111/283 (39%), Gaps = 46/283 (16%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITA-----------------DSKWSCPEAGE---VLK 142
RG++A +I L +P S ++ A D ++G+
Sbjct: 40 RGIIAKADIPADTVLFTIPRSAILCAATSALRDKIPDVFDLEGDHGAGHSDSGDEDGAAS 99
Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT---------------- 186
S W LL LI E ++SRW Y+ LP + ++W+
Sbjct: 100 SSSQDSWTLLILVLIYEHLQGEASRWRPYLDVLPPTFDTPMFWSPTELSELQASALVAKV 159
Query: 187 -RAELDRYLEASQIRE-RAIERITNVIG--TYNDLRL-RIFSKYPDLFPEEVFNMETFKW 241
RAE DR +EA + RA E + G +D +L + + F++E
Sbjct: 160 GRAEADRMIEAKVLPVIRAHEEVFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLENDDS 219
Query: 242 SFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
+ +GR + +VP ADMLN E +++ + T R + G
Sbjct: 220 DNDEADEDDEWVEDREGRTMLGMVPMADMLNADAEFNAHINH--GDDALTATALRPIRAG 277
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
+++ YG NGELL YG+V + + D VELP L +++
Sbjct: 278 DEILNYYGPLPNGELLRRYGYVTPKHSR-YDVVELPWELVEAE 319
>gi|380482827|emb|CCF40997.1| SET domain-containing protein [Colletotrichum higginsianum]
Length = 472
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 105/258 (40%), Gaps = 34/258 (13%)
Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAG-------------EVLKQCSVPDWPLLATYL 156
+ GE ++ P L ++ + S P G SVP + +L
Sbjct: 39 GVEPGETIVTCPLDLTLSYLNAASTPSPGFHHEGAAPSSSPFPPSFLASVPPHVIGRFFL 98
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------DRYLEASQIRERAI 204
I + K S W YI LP QP+ L W L + ++ ++I+ R
Sbjct: 99 IHQYLLGKESFWYPYIKTLP-QPHHLQSWILPPLWPADDLELLEDTNVHVAVAEIKSRLK 157
Query: 205 ERITNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLP---SMDG 258
+ I ++ D R +++ + +F +F+ S I +R + LP ++D
Sbjct: 158 AEFKHAIASFADDPARHDYTRLLYNWAYCIFTSRSFRPSLVIPAARQPTLSLPEGCAIDX 217
Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLS 317
L+P D+ NH+ D D + T Y PG QVF +YG K+N EL+L+
Sbjct: 218 FSLLLPLFDVGNHAPTAAIAWDADADTNKCTLRTLHPYVPGAQVFNNYGTTKTNAELMLA 277
Query: 318 YGF-VPREGTNPSDSVEL 334
YGF +P +D V +
Sbjct: 278 YGFCIPESAHLHNDYVHV 295
>gi|392580059|gb|EIW73186.1| hypothetical protein TREMEDRAFT_59348 [Tremella mesenterica DSM
1558]
Length = 503
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 108/285 (37%), Gaps = 84/285 (29%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVIT----------ADSKWSCPEAGEVLKQCSVPDWPLLA 153
G VAL +I L +P +L++T AD +W + G W L
Sbjct: 44 GAVALIDIPVDTPLFHIPSNLLLTPYTSRLASLLADEEWDKLDIG----------WARLI 93
Query: 154 TYLISEASFEKSSRWSNYISAL--PRQPYSL--------------LYWT---RAEL-DRY 193
++ E S + S+W Y+S L P +P L ++W RAEL
Sbjct: 94 LVMMYETSLGQKSKWYQYLSKLFLPCRPTWLIAETESMPTKFDTPMFWDETRRAELIGTD 153
Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN--METFKWSFGILFSRLV 251
LE RE A ++ Y ++ L I +PD+FP + +E + + SR
Sbjct: 154 LEGRIGREDADKQ-------YFEILLPIIQAHPDIFPPNSIDTSLEAYHLQGSRILSRSF 206
Query: 252 RLP----------------------SMDGRVALVPW-ADMLNHSCEVETFLDYDKSSQ-- 286
+P +G VA + W ADMLN + E++ YD S
Sbjct: 207 TIPRSKAGGPAPHVVESDDSDSDSDEEEGGVAAMVWMADMLNAAYELDNARLYDTSENTS 266
Query: 287 ----------GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
G + + + GEQ++ +Y N ELL YG V
Sbjct: 267 ATSTEPWDRPGYTMRSTKLIKAGEQIYNTYDSPPNSELLRKYGHV 311
>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
Length = 327
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 49/194 (25%), Positives = 88/194 (45%), Gaps = 23/194 (11%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
RG VAL +I E ++ +P L++T D P+ G+V + D +L L+ E
Sbjct: 98 RGAVALDDINSNEDMVSIPEPLLLTPDVALKDPDIGKVFEDNLEDFSDEDMLLILLMHER 157
Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-RL 219
++S + Y++ LPR P +LL W L +L+ + R + + Y L
Sbjct: 158 GKGETSFFYPYLATLPRLPDTLLNWNEEGLS-WLQDEGLSLEVFLRESQLTAHYTRLVEE 216
Query: 220 RIFSKYPDLFPE-------------EVFNMETFKWSFGILFSRLV--RLPSMDGRVALVP 264
++ + +P LF E + +++E F++++ + +R RLP AL+P
Sbjct: 217 KLKAGWPGLFGEAPDDASDSESKGADPYSLENFRFAWLTIQARAFGRRLPY----SALIP 272
Query: 265 WADMLNHSCEVETF 278
D NH+ T+
Sbjct: 273 LCDSFNHANVAVTY 286
>gi|393904017|gb|EJD73630.1| SET domain-containing protein 3 [Loa loa]
Length = 444
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 71/275 (25%), Positives = 108/275 (39%), Gaps = 43/275 (15%)
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
S W YI LP + L++T +L + L S + E ++ NV + L I
Sbjct: 19 SHWQPYIKVLPECFDTPLFFTVEQL-QCLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 77
Query: 223 ----------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR----- 259
K +L P V F ++WS + +R+ +PS +
Sbjct: 78 EFRHRKKKSKDKISELEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVWKDDIGQ 137
Query: 260 ----VALVPWADMLNHSCEVETF---LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
L+P+ DM NHS F + + R Y+P E V I YG +SN
Sbjct: 138 PRMIPGLIPFLDMANHSYTESAFHEAVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNR 197
Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGW- 370
+ LL GF+P E N D +L + L KS K ++ G A S F +I+
Sbjct: 198 DFLLHNGFIPLE-KNIRDIYKLKIGLPKS-KREDARMRLFHALGFVAESTIFAFEISVCE 255
Query: 371 -----PLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
L A Y++ PS + EE A+++ N
Sbjct: 256 PYFHDSLFRFAQIYILDEVPSAAEQVEEAASSSDN 290
>gi|302921343|ref|XP_003053266.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
77-13-4]
gi|256734206|gb|EEU47553.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
77-13-4]
Length = 371
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 113/269 (42%), Gaps = 38/269 (14%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+VA ++IR+ E +L VP + T D+ + + L SV L +E + +
Sbjct: 32 GVVATRDIRENEAILTVPMKALRTIDT--VPKQISKALHGVSV------HGILAAEIALD 83
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
KS ++ + + LP T+ +L+ + S+++ R N++ R
Sbjct: 84 KSDDFAVWKTVLP---------TKEDLESGMPMMWPSELQLLLPRRAKNLLDKQTTTFRR 134
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-----RVALVPWADMLNHS--- 272
+ FP + + W + +P M R+ +P AD+ NH+
Sbjct: 135 ECEIVLNAFPNLTRDDYLYAWVLINTRTFYNSMPKMKAYAQADRLVCMPAADLFNHADQG 194
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGTNPS 329
C++ S G DR Y+ GE+V++SYG SN LL YGF+ R
Sbjct: 195 CQLSF------SPLGYTIKADRVYRQGEEVYVSYGPHSNDFLLTEYGFILGPNRWDEVYL 248
Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLS 358
D V LPL L K+ + ++ L ++ L
Sbjct: 249 DDVILPL-LNKTQRAELASVDFLGRFTLD 276
>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
Length = 480
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 120/277 (43%), Gaps = 33/277 (11%)
Query: 78 TLQKWLSDSGLPPQKMAIQKV---DVGERG--LVALKNIRKG---------EKLLFVPPS 123
TL W+ +G+ +A +K+ D ++G +V + G E LL VP
Sbjct: 10 TLPSWIKLNGVSVNGIAFRKLQADDGTDKGSAIVGTEVKSTGNAEGSEVEPEVLLRVPTD 69
Query: 124 LVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
LV++ D ++ L++ +V D+ + Y SS W+ Y+ +P
Sbjct: 70 LVLSFDFVEEYSKSDRQLREVLEAVGDFGRV-DYASERHQIGLSSPWTEYMKYMPPAISL 128
Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKYPDLFPEEVFNM 236
+++ EL+ L S +R +I ++ + LR L KY + E+ +
Sbjct: 129 PTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGLDWCEKY--WWDEDTGKL 185
Query: 237 ETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--VETFLDYDKSSQGVV-FT 291
W + + SR+V LP A+VP DM NH+ E V+ D D V+
Sbjct: 186 TFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTVKALYDEDADGNAVLQLR 243
Query: 292 TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
R + E+V ISYG +KS EL+ SYGF+ T+
Sbjct: 244 EGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 280
>gi|189193345|ref|XP_001933011.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187978575|gb|EDU45201.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 642
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 89/214 (41%), Gaps = 28/214 (13%)
Query: 141 LKQCS--VPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
L+QC +PD L LI + ++S W YI+ LP R + L++ ++ +L
Sbjct: 84 LRQCRGRIPDHILTYLLLIEQRDKGQASPWHAYIACLPNSRDMTTPLWFDEGDM-AFLAG 142
Query: 197 SQIRERAIERITNV-------IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
+ + A ER + I +L + + + + E+ W+ I SR
Sbjct: 143 TSLVPAAKERKAELQQQWEGAIAVMEELSIPL---------AKGIDTESLLWAATIFTSR 193
Query: 250 LVR----LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFI 304
LP + L P D+LNHS + D+ + D +QP +++F
Sbjct: 194 AFISTHILPERETVPILFPVVDILNHSVSAKVEWDFQPGQSFALKCLDGDSFQPEQELFN 253
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
+Y K N ELLL YGF NP + L L+
Sbjct: 254 NYAPKQNDELLLGYGFCLE--NNPIEQFALKLAF 285
>gi|392863014|gb|EAS36291.2| hypothetical protein CIMG_01513 [Coccidioides immitis RS]
Length = 746
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 32/205 (15%)
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
PE +K+ + L+ YL+ + SF W+ YI +LP Q L Y++ +L+
Sbjct: 132 PEFLPAVKEKGASAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 185
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+LE + + + + + TY ++ L++ + P+ + + E F W+ I+ SR
Sbjct: 186 WLEGTNLLKLRENMLIKLKTTY-EVGLQMLKESPNKNTKN-YTWERFLWASSIIISRAFS 243
Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
+ V LVP DM NH +VE ++SQGVV +
Sbjct: 244 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 298
Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
PG++V +YG ++N L+L+YGF
Sbjct: 299 LLPGQEVPNNYGPRNNERLMLNYGF 323
>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
gi|194693232|gb|ACF80700.1| unknown [Zea mays]
gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
Length = 252
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 5/114 (4%)
Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
S+ R ALVP +L + + L D S V DR Y+ GE + I G ++N
Sbjct: 17 SLARRFALVPLGPPLLTYRSNCKAMLTADGDS--VRLVVDRPYKAGEPIIIWCGPQTNSR 74
Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
L+L+YGFV + NP D V + SL D Y+EK ++ G A + F + +
Sbjct: 75 LVLNYGFVDED--NPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYV 126
>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
SO2202]
Length = 307
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 56/297 (18%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
GL+ I K ++++FVP + + T + S + S LA ++SE
Sbjct: 31 GLLTTAKISKDDQIIFVPKNAMFTPKTAESHTKKPSPSPSPSPSPQAHLAISIMSECLSP 90
Query: 164 KS------SRW---SNYISALPRQPYSLLYWTRAELDRYLEAS------QIRERAIERIT 208
S W S++ S +P L+W+ EL +L S ++RE + +T
Sbjct: 91 SSPYLTWKKTWPTLSDFESGMP------LFWS-PELCHHLPESVKQPLERMREDYEKDLT 143
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMD-GRVALVPW 265
++ D ++ + E FK+ + I+ SR P + G + L P+
Sbjct: 144 YMLSLNCD--------------DQTWKEEDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPF 189
Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---- 321
D +NH T + +S +G DR Y+P ++ +YG N +LL+ YGF
Sbjct: 190 IDYMNHG-PTGTGVKVSQSPKGYEVVADRDYEPNTEILATYGSHPNDKLLVHYGFCLSYK 248
Query: 322 PREGTNPS---DSVELPLSLKKSDKCYKEKLEALRKYGL--------SASECFPIQI 367
P E ++ D + LP ++ + K + + L Y L A CF Q+
Sbjct: 249 PNEPSDDDIRLDHILLP-AMSANTKSQLQDVGMLGSYALLPPSAQRQQAELCFKTQV 304
>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
Length = 634
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 87/213 (40%), Gaps = 10/213 (4%)
Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRE 201
Q +PD L LI + + K S W YI+ LP + + W E +L + +
Sbjct: 94 QGKIPDHILTYLLLIEQRNKGKESPWHAYIACLPGAESMTTPLWFDDEDMAFLAGTSLAP 153
Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR----LPSMD 257
A ER + + L I +EV + E+ W+ I SR LP +
Sbjct: 154 AAKERKSLYYQQWEQ-ALGIMKDAGVALADEV-DFESLLWAATIFTSRAFISTHILPDHE 211
Query: 258 GRVALVPWADMLNHSCEVETFLDYDK-SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
L P D+LNHS + ++ +S + + G+++F +Y K N ELLL
Sbjct: 212 TVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKLLEGDTFTAGQELFNNYAPKQNDELLL 271
Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
YGF NP + L L+ + Y E +
Sbjct: 272 GYGFCLEH--NPIEQFPLKLAFPPMLQEYAEAM 302
>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
Length = 1313
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/212 (26%), Positives = 95/212 (44%), Gaps = 14/212 (6%)
Query: 81 KWLSDSGLPPQKMAIQKVDVGE--RGLVALKNIRKGEKLLFVPPSLVITADSKWSCP-EA 137
W + +G AI D+ E RG VAL++I +GEKL +P SL+++ + S P
Sbjct: 819 NWFTSAGGTFDSSAIGIEDLPETGRGAVALRDIYEGEKLFTIPRSLLLSTRTS-SLPFLL 877
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
GE W L ++ E + + S W Y+ ++P + +L++WT EL L+ S
Sbjct: 878 GEEDWNALGDGWAGLILCMMWEEARAEESPWRGYLESMPTEFSTLMFWTDEELG-LLKGS 936
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
+ ++ I R YN+ L + K DLF +F ++ I SR+ +
Sbjct: 937 LVLDK-IGR-AGAEKDYNEKVLPLLQKRTDLFAPSLFQTRYTLQNYHIQGSRI-----LS 989
Query: 258 GRVALVPWADML--NHSCEVETFLDYDKSSQG 287
+ PW+ + N E +D +S G
Sbjct: 990 RSFTVSPWSGAVPENDEDEAPELVDTSMASAG 1021
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 8/81 (9%)
Query: 260 VALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
VA+VP ADMLN C L Y + ++ T + GEQ++ +YG N +LL Y
Sbjct: 1059 VAMVPMADMLNARCGCNNAKLFYTRDDLQMMAT--KPIAKGEQIWNTYGDPPNSDLLRRY 1116
Query: 319 GFV-----PREGTNPSDSVEL 334
G+V P +PSD VE+
Sbjct: 1117 GYVDALTLPDGVGSPSDVVEI 1137
>gi|367021574|ref|XP_003660072.1| hypothetical protein MYCTH_2297882 [Myceliophthora thermophila ATCC
42464]
gi|347007339|gb|AEO54827.1| hypothetical protein MYCTH_2297882 [Myceliophthora thermophila ATCC
42464]
Length = 426
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT--DRQYQPGEQVFISYGKKSNG 312
+ D R+ L P AD+LNH+ +D + FT DR Y PGE+V I YG+ N
Sbjct: 221 ARDDRMVLQPVADLLNHAAAGYATAGFDGAGGIGWFTVAADRAYAPGEEVHICYGRHHND 280
Query: 313 ELLLSYGFV 321
LL+ YGF+
Sbjct: 281 LLLVEYGFL 289
>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
Length = 486
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/285 (24%), Positives = 115/285 (40%), Gaps = 48/285 (16%)
Query: 54 RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
+T+ ++ N +PW I + +TL S G R L A K I
Sbjct: 37 QTQASLDNNFLPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIYA 81
Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
G+ +L VP + IT D P VL V + +LA LI E + SRW YIS
Sbjct: 82 GDCMLKVPFNAQITPDE---LPSDIRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYIS 138
Query: 174 ALPR--QPYSLLYWTRAELDRY----LEASQIRERA-IERITNVIGTYNDLRLRIFSKYP 226
LP+ + +S ++W EL + ++++A IE+ + + I ++ P
Sbjct: 139 RLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQAFKQHCPIVTERP 198
Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRVALVPWADMLNHS 272
DL E F +++ + L + S G++ D +NH
Sbjct: 199 DL--------EDFMYAYALGEKVLCIVLFLLNLDNLLLDSGISCVGKLK-THITDFMNHD 249
Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+ + D+ +Q T DR Y PG++V +S K G L L+
Sbjct: 250 GLSASIVLRDEDNQLSEVTADRNYSPGDEVDLSDWLKLMGLLKLT 294
>gi|412994115|emb|CCO14626.1| unnamed protein product [Bathycoccus prasinos]
Length = 390
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/125 (29%), Positives = 62/125 (49%), Gaps = 16/125 (12%)
Query: 225 YPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRV-----ALVPWADMLNH----SC 273
+ +F EE + E F W+ + SR + + S + + + +P D+LNH +C
Sbjct: 172 HAGIFGEENKAVSYEMFAWAISTVLSRALSVSSENKNIDSLFYSFIPGVDLLNHDANANC 231
Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGF-VPREGTNPSDS 331
E+ + + +S + R + E+ ISYG +SN ELL YGF VP N +DS
Sbjct: 232 EIRLVSNKNNASTSIEVYAIRDIENDEECTISYGNHRSNDELLRKYGFCVP---NNRNDS 288
Query: 332 VELPL 336
+++ L
Sbjct: 289 IDVRL 293
>gi|71406326|ref|XP_805712.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70869221|gb|EAN83861.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 572
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 7/139 (5%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
A L YL+ + R + + N + + L F P EE +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSALSFFQTMPHSDCEERSMPVTLEQFLWA 256
Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ L +R +L+PW D N++ + YD+ +F + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDERRGAYIFEVLFPIESGEQI 314
Query: 303 FISYGKKSNGELLLSYGFV 321
F+ YG ++ ELLL YGF
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333
>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/304 (25%), Positives = 117/304 (38%), Gaps = 59/304 (19%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCSVPDWPL-LATYLI 157
RG+ A +I + L+ +P + P + E+ + W L +A L+
Sbjct: 107 RGVKATSDIASEDDLVRLPREATMLVVEGQENPHEEYISNELWAKAGDERWALRVALVLL 166
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAEL--------DRYLEASQI-RERAIERIT 208
E S S++ YI LP+ +L WT E+ +++ + ++ E+A E I
Sbjct: 167 YEKSLGSRSKFYEYIEQLPKSFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQ 226
Query: 209 N--------------VIGTYNDLRLRIFS-KYPD-------LFPEEVFNMETF------- 239
VI + +R R+FS K D L P + F
Sbjct: 227 EYARDGGLKTIEREEVIWALDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFASFLTAQ 286
Query: 240 ----KWSFGILFSRLVRLPSM-------DGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
KW LV S D L+P D NH ++T ++ S
Sbjct: 287 TTELKWLCVFALLALVVFDSTKENDVKTDTAYVLMPLIDAFNHQTMLKTEFEFTNSE--F 344
Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE--LPLSLKKSDKCYK 346
+ + Y+ GE+V ISYG N ELLL YGFV + + E LP L ++D K
Sbjct: 345 ALKSPKSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLLPY-LTQNDPTLK 403
Query: 347 EKLE 350
E LE
Sbjct: 404 ENLE 407
>gi|412986734|emb|CCO15160.1| predicted protein [Bathycoccus prasinos]
Length = 450
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 9/101 (8%)
Query: 239 FKWSFGILFSRLVRLPSMDGRVA----LVPWADMLNHSC---EVETFLDYDKSSQGVVFT 291
+ W+ +FSR R+ GR A ++P D+LNHS EV + +
Sbjct: 207 YGWALSQVFSRTFRIEDARGRRAPRRVMIPIVDLLNHSSVEEEVNVTWRVKEDLSAFIVE 266
Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
R E++ +SYG++++ LL YGF+P NP +SV
Sbjct: 267 AKRNVGKDEELILSYGERNDQHFLLFYGFLP--SMNPCNSV 305
>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
Length = 508
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 108/251 (43%), Gaps = 37/251 (14%)
Query: 103 RGLVALKNIRKGEKLLFVPPSL---VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
RG++A+K+I +GE L +P V+T+ + E L+ S+ W L L+ E
Sbjct: 39 RGVIAVKDIAEGEVLFEIPRDSILNVLTSSLSSDFSDLEETLQ--SIGSWEGLILCLLYE 96
Query: 160 ASFEKS-SRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
+K S+W Y + LP L+YW EL+ +L S + +R ++ + +
Sbjct: 97 WKGKKEKSKWWKYFNVLPSSNAMNGLMYWNEQELE-HLRPSLVLDRIGKKSAKNM-YHKV 154
Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV---------------- 260
L L SK+P++ E F ++ ++ + + + + +
Sbjct: 155 LTLVKESKFPEVLCN--VEWEDFVYAASVIMAYSFDVENGESQTLNEEDDDQDEEENTGY 212
Query: 261 --ALVPWADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
+++P AD LN H C D DK + + + GEQVF YG N E+L
Sbjct: 213 IKSMIPLADTLNSDTHQCNANLMYD-DKFLKMYAI---KPIKKGEQVFNIYGNHPNAEIL 268
Query: 316 LSYGFVPREGT 326
YG+V G+
Sbjct: 269 RRYGYVEWSGS 279
>gi|410082986|ref|XP_003959071.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
gi|372465661|emb|CCF59936.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
Length = 584
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 52/108 (48%), Gaps = 16/108 (14%)
Query: 241 WSFGILFSR----LVRLP---SMDGRVALVPWADMLNHSCEVE---TFLDYDKSSQGVVF 290
WSFGI SR ++ P S + L P D+LNH TF D Q F
Sbjct: 202 WSFGIFTSRAFPEILINPDNCSNVNQAFLYPIVDLLNHKNGTSVKWTFED----DQAHFF 257
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
T ++ + ++F +YG KSN ELLL YGFV + N D +L L L
Sbjct: 258 TNEKNLKKHTELFNNYGDKSNEELLLGYGFV--QSNNAHDDTKLTLKL 303
>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
C-169]
Length = 306
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/178 (23%), Positives = 84/178 (47%), Gaps = 8/178 (4%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G+ A++++ +G++L +P + V++ + ++L+Q + L ++ E S
Sbjct: 3 GVFAVQDLCEGQRLCEIPKTAVLSVQNTG----IADILEQHRIRGGLGLIIAIMYELSIG 58
Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDR-YLEASQIRERAIERITNVIGTYNDLRLRIF 222
K S W Y+ L ++ Y L+W AE +R L+ ++ R E + +
Sbjct: 59 KESFWHGYLEELHKREYLPLFW--AEQERSLLQGTEAEHRPQEDEELTQEDFETHVPPLV 116
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
++ D + F +E+F+ + + SR + S G +++VP AD+ NH + F D
Sbjct: 117 EQHADRLRADSFTLESFRVAASWVASRAFGVDSFHG-MSMVPLADIFNHKAAIVQFSD 173
>gi|406602709|emb|CCH45757.1| hypothetical protein BN7_5343 [Wickerhamomyces ciferrii]
Length = 569
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 65/134 (48%), Gaps = 15/134 (11%)
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQG--VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
LVP D+LNH E D SS G +F T+++ + G++++ SYG K+N EL+ YG
Sbjct: 231 LVPIFDLLNHDNEANVKWDSLDSSNGKNFIFKTEQKLKNGDEIYNSYGPKTNQELMFGYG 290
Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE---CFPIQITG-WPLELM 375
F N D L L + +++ +E+ +GL + +PI P L+
Sbjct: 291 FAIE--NNKEDRATLALRIPEAN------IESANTFGLKLTTNEVSYPITKENPLPTPLI 342
Query: 376 -AYAYLVVSPPSMK 388
+AYLV S K
Sbjct: 343 DLFAYLVKSDEETK 356
>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 486
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 111/279 (39%), Gaps = 61/279 (21%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADSKWSCPEAGEVLKQC--SVPDWPLLATYLISE 159
RG+VA +I + E+L +P LV+ T +SK ++L Q + W L ++ E
Sbjct: 48 RGVVAQTDIPENEELFTIPRDLVLSTQNSKLK-----DLLSQDLEELGPWLSLMLVMMYE 102
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTY 214
S W+ Y LPR+ +L++WT +EL L+ S + R+ A E I +I
Sbjct: 103 YLLGDQSTWAAYFKVLPRKFDTLMFWTPSEL-LELQGSAVIDKIGRQGADESILEMIAP- 160
Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------------- 260
I +P LFP V + ++ G L+ L G +
Sbjct: 161 ------IVRAHPSLFP-PVDGLPSYDGDAGT--QALLHLAHTMGSLIMAYAFDIEKPEDE 211
Query: 261 ---------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
+VP AD+LN + + + +V + G
Sbjct: 212 DEEGDGEGGYMTDEEEEQLSKGMVPLADLLNADADRNNARLF-QDENALVMKAIKPIAKG 270
Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
E++F YG+ +LL YG+V + P D VE+ L +
Sbjct: 271 EEIFNDYGEIPRADLLRRYGYV-TDNYAPYDVVEVSLDV 308
>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 465
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 108/260 (41%), Gaps = 30/260 (11%)
Query: 93 MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSV 146
A+QK + +RG+ A + + +G L +P +T AD S EA ++K +
Sbjct: 36 FAVQKAN-KDRGVTAKRALERGAILAVIPFEACLTLKTCSRADVAASVEEA--LVKTKTE 92
Query: 147 PDWPL-LATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAI 204
W L L E S SR+ Y LPR + + W E YL +++
Sbjct: 93 ASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCAWNDGERS-YLAGTEVETSLR 151
Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVP 264
+ + + +F ++ E F E F + ++ SR L G V LVP
Sbjct: 152 DEAAAAKNEWERVVAPVFKEHG---VECSF--EQFIEARTVVSSRAFTLSPNAG-VGLVP 205
Query: 265 WADMLNH--SCEVETFLDYD---KSSQG-----VVFTTDRQYQPGEQVFISYGKKSNGEL 314
AD NH D D +S G V T ++ + G+++F +YG N +L
Sbjct: 206 IADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVRRGDEIFNTYGFHGNAKL 265
Query: 315 LLSYGFVPREGTNPSDSVEL 334
L SYGF + NP+D V L
Sbjct: 266 LNSYGFTQND--NPADEVRL 283
>gi|145345009|ref|XP_001417016.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577242|gb|ABO95309.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 390
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 38/60 (63%), Gaps = 2/60 (3%)
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L+P D LNH ++T +++ S V + R+Y+ GE+VFISYG +N EL+ YGFV
Sbjct: 178 LMPLIDALNHKTMLKT--EFEFSGGAFVLRSPREYKTGEEVFISYGVLNNDELITRYGFV 235
>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 485
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 113/274 (41%), Gaps = 59/274 (21%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL--KQCSV 146
P K+A + + RG+VA +I + E+L +P SLV++ + ++L +
Sbjct: 34 PKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLS----FQNSRLKDLLDFNERDF 89
Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIER 206
W L +I E +S WS Y LP +L++WT EL R L S + + +
Sbjct: 90 DPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL-RELSGSAV----LNK 144
Query: 207 ITNVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM------ 256
I N R L + S P LFP + + +F G L S R+ S+
Sbjct: 145 IGRSDAEANIFRNILPLVSGNPSLFP-PMSGVASFDSPEGKAALLSLAHRMGSLVMAYAF 203
Query: 257 -------DGR---------------VALVPWADMLNHSCEVETFLDYDKSS----QGVVF 290
DGR +VP AD+LN D D+++ Q +
Sbjct: 204 DIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNA--------DADRNNARLFQEDCY 255
Query: 291 TTDRQYQP---GEQVFISYGKKSNGELLLSYGFV 321
+ R +P GE++F YG+ +LL YG+V
Sbjct: 256 LSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYV 289
>gi|171692069|ref|XP_001910959.1| hypothetical protein [Podospora anserina S mat+]
gi|170945983|emb|CAP72784.1| unnamed protein product [Podospora anserina S mat+]
Length = 454
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 56/109 (51%), Gaps = 6/109 (5%)
Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT-DRQYQPGEQVFISY 306
SR + LP ++VP DM+NHS + + D + + V+ G++V ISY
Sbjct: 178 SRCLELPK--SGESMVPCIDMINHSSDPSAYYDQNSDYEAVLLLRPGASMSKGQEVTISY 235
Query: 307 GK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
G KS E+L SYGF+ E T S+S+ LPL+ D K KL A K
Sbjct: 236 GDTKSAAEMLFSYGFIDPEST--SESLVLPLAPFPDDPLAKAKLVAFGK 282
>gi|303313087|ref|XP_003066555.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
gi|240106217|gb|EER24410.1| SET domain containing protein [Coccidioides posadasii C735 delta
SOWgp]
Length = 329
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 66/289 (22%), Positives = 100/289 (34%), Gaps = 49/289 (16%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
+W D G+ +A + G+ AL+ I GE ++ VP S ++T D S
Sbjct: 16 FTQWAKDQGIQINGVAAVRFPGRGMGIAALRGIDAGETIVSVPTSSLLTLDKIRSTFRE- 74
Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----------PYSLLYWTR 187
Q P + A YL + + SR++ + + P P L+
Sbjct: 75 --KFQGDTPVQGIFAAYLACDD--DARSRYAPWRATWPTMRDFEDSIPLLWPKYLIGTPG 130
Query: 188 AELDRYLEASQIRERAI---------------ERITNVIGTYNDLRLRIFSKYPDLFPEE 232
EL E + E R+ G Y + F +
Sbjct: 131 DELKGQGETTARGEEVFASLLPPSISGHFTLSNRVGRFSGDYTPDHQNLLENQRSRFRKA 190
Query: 233 V---------FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCEV 275
N+E F + + +R + D V AL P+AD NHS
Sbjct: 191 FSRVKLACPGINLEIFTYYWFATHTRCFFYVAKDSEVPEDRNDAMALCPFADYFNHSSN- 249
Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPRE 324
+ G FT + Y GE+VF+ YG ++ LL YGFVP E
Sbjct: 250 DPGCKASFDGGGYTFTATKSYAKGEEVFVCYGNHTSDVLLTDYGFVPDE 298
>gi|449016030|dbj|BAM79432.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
subunit N-methyltransferase [Cyanidioschyzon merolae
strain 10D]
Length = 458
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 141/353 (39%), Gaps = 55/353 (15%)
Query: 82 WLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
+L ++ P +A VD RGLVA IR GE + +P L I S+ P
Sbjct: 139 YLREAHFPKVALAEVPVDGASSLKMRGLVATAAIRAGEVICRIPRRLAICLGSEGENP-- 196
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL---YWTRAELDRYL 194
+P LL EA ++ Y LPR + ++ EL +
Sbjct: 197 -------GLPALHLLRMMTDGEAV----HKYKAYFDVLPRPEMCQMTTDFYNDEELGQIA 245
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKY--PDLFPEEVFNMETFK---WSFGILFSR 249
+ E R + T+ LR + Y P + + + +M F+ W+ ++ SR
Sbjct: 246 HTPTVEE-TRRRRQQLRDTFLQEFLRTGADYLHPQVAAQNLDHMPEFQRYLWAVHLVVSR 304
Query: 250 LVRLPSMD-GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
+ + + D + L+P DM+N + + L Y ++ V + E++ I YG
Sbjct: 305 ALAVRTGDEAQRYLIPLLDMINCRMDSKHELRYRIATDEFVLIAGESVRRSEEIRIPYGG 364
Query: 309 K--SNGELLLSYGFVPREGTNPSDSVE-LPLS-LKKSDKCYKEKLEALRKYGLSASECFP 364
SN L+ YGF+ NP+D + LP ++++D E+ E +R+ EC
Sbjct: 365 GFVSNDRLIQDYGFIVER--NPADLLLFLPRHCVQRADLLSSEERENVRQ------ECAA 416
Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
+ +A V+S PS + + A N+ KC E+ E A
Sbjct: 417 V---------LARQQQVLSVPSERIR-------AFNRAVVDAARKCIELLETA 453
>gi|407832777|gb|EKF98587.1| hypothetical protein TCSYLVIO_010514 [Trypanosoma cruzi]
Length = 572
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 7/139 (5%)
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
A L YL+ + R + + N + + L F P EE +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSTLSFFQTMPHSDCEERSMPITLEHFLWA 256
Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
+ L +R +L+PW D N++ + YD+ +F + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDELRGAYIFEVLFPIESGEQI 314
Query: 303 FISYGKKSNGELLLSYGFV 321
F+ YG ++ ELLL YGF
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333
>gi|261328667|emb|CBH11645.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 583
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 91/212 (42%), Gaps = 28/212 (13%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEASQI 199
LL LI E ++S W + + + P P YW+ +L D + ++
Sbjct: 202 LLVLALIYERFVARTSHWKDLLLSCPTDFPTVPSYWSWNDLSGLYGLDVLDDVLAKQERL 261
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSM 256
R+ E +T+V+ D + EE F +E W+ + SR L ++
Sbjct: 262 RQFHTE-VTSVLPLIYD----ALEGCSGIEREEFMGHFTIENIMWARAVFDSRAFNL-NV 315
Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
DGRV ALVP ADM+NHS + + + G V + G ++ +SYG
Sbjct: 316 DGRVVLALVPCADMINHSNHPDVLIRRVEPCGGDFVMQVGAGLAREDVGRELGMSYGPLQ 375
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
N ELL YGFV + N D + P + ++D
Sbjct: 376 NWELLQHYGFVLDD--NEHDKLPFPFDVHEAD 405
>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
Length = 679
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 93/204 (45%), Gaps = 29/204 (14%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERI 207
L+A YL F W Y+ LP QP L L++ ++D +++ + I E A+ERI
Sbjct: 110 LMAQYLRGPEGF-----WYPYLRTLP-QPGQLTTPLFFGEEDVD-WIQGTGIPEAAVERI 162
Query: 208 TNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV------ 260
Y+ L++ + +PD E + E + W+ I+ SR + G V
Sbjct: 163 KIWEEKYDSGYLQLGATGFPDC---ETYTWELYLWASTIITSRAFSAKVLSGAVQPGDLP 219
Query: 261 -----ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
AL+P D+ NH + G++ D + G+++ +YG ++N +LL
Sbjct: 220 EDGVSALLPLIDLPNHRPMAKVEWRAGDKDIGLLVLED--HSAGQEISNNYGPRNNEQLL 277
Query: 316 LSYGFVPREGTNPSDSVELPLSLK 339
++YGF NP+D + L +K
Sbjct: 278 INYGFC--IAGNPTDYRIVHLGVK 299
>gi|119194277|ref|XP_001247742.1| hypothetical protein CIMG_01513 [Coccidioides immitis RS]
Length = 718
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/205 (28%), Positives = 90/205 (43%), Gaps = 46/205 (22%)
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
PE +K+ + L+ YL+ + SF W+ YI +LP Q L Y++ +L+
Sbjct: 118 PEFLPAVKEKGASAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 171
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+LE TN++ LR + K P ++F E F W+ I+ SR
Sbjct: 172 WLEG-----------TNLL----KLRENMLIKLKTTVPNKIFR-ERFLWASSIIISRAFS 215
Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
+ V LVP DM NH +VE ++SQGVV +
Sbjct: 216 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 270
Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
PG++V +YG ++N L+L+YGF
Sbjct: 271 LLPGQEVPNNYGPRNNERLMLNYGF 295
>gi|428179814|gb|EKX48683.1| hypothetical protein GUITHDRAFT_136380 [Guillardia theta CCMP2712]
Length = 335
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)
Query: 76 ASTLQKWLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
+ L +WL +G + +AI++ D G RG+ A +IR+G++LL VP SL + K
Sbjct: 11 SDQLLEWLQRNGGQAESIAIRQFDHGGEKVRGVGASSSIRRGQELLRVPRSLFLAPGKK- 69
Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSLL---YWTRA 188
AG L+Q V LA + + ++ Y+ ALP R+ L Y +
Sbjct: 70 ----AG--LEQQEVT----LAAEIAKQFQLGSDGQYERYLQALPGREELDGLHPFYASNE 119
Query: 189 ELDRYLE---ASQIR-ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
+L + + S IR +R + R + D F D F+ F +F
Sbjct: 120 DLVLFSDIPYVSAIRKKRTVLRDAWLKHKNGDSASSSFDWEGD------FDWPEFLHAFI 173
Query: 245 ILFSRLVR--LPSMDG-----RVALVPWADMLN-----HSCEVETFLDYDKSSQGVVFTT 292
+ SR +R LP+ G +ALVP DM+N + VE L D + +V +
Sbjct: 174 LQLSRRMRIVLPAEKGGTSEETIALVPIIDMINFCGSKEAANVELKLVDDGDA--LVVVS 231
Query: 293 DRQYQPGEQVFI-----SYGKKSNGELLLSYGFVPREGTNPSDSVE 333
R GE++ + KK NG L+ YG + + PS +E
Sbjct: 232 KRSINEGEELLLYAAAAEQKKKENGALVYQYGVMMGDNDVPSQQLE 277
>gi|313214063|emb|CBY42615.1| unnamed protein product [Oikopleura dioica]
Length = 393
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/273 (21%), Positives = 118/273 (43%), Gaps = 39/273 (14%)
Query: 92 KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCS-- 145
K+ I D G RG+ + I + E L+ VP ++T E A +VL+ S
Sbjct: 22 KLKISDGDCG-RGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTC 80
Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
+ LL L E ++S+W+ ++S++P+Q ++ EL+ ++ + ++
Sbjct: 81 LNGTDLLVCALFLELETGENSKWTAFLSSIPKQLWNPFMLDEKELNLLTAKCRLPSKCLK 140
Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMD--- 257
+ +++I +++ E+ N E W F ++ SR R + +
Sbjct: 141 Q-----------KIKISTEFLKALGFEI-NEEILSWCFSVVLSRSFGGSPERCQTRNHFK 188
Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKS 310
L P D++NH E ++++ + ++ G+++F++YG KS
Sbjct: 189 IEVDNSANFCLCPAIDLINHEKEYNCEYRWNENKTAFQVFSRKKILQGQELFVNYGTTKS 248
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
E+ YGF+ PSD+ ++ L++ K
Sbjct: 249 EYEIYSFYGFIL-----PSDNFQVEFELQRIRK 276
>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
Length = 777
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 95/235 (40%), Gaps = 43/235 (18%)
Query: 90 PQKMAIQKVDVGER----GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG------E 139
PQ+ A Q ++ E G+ A K I E L++P L+I D + A
Sbjct: 93 PQQFADQTENLSEIPYLIGVAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPN 152
Query: 140 VLKQCSVPDWPLLATYLISEASFEKSSRWSNYI-----SALPRQPYSLLYWTRAELDRYL 194
K D L ++ E + S W Y S LP+ +W +D L
Sbjct: 153 EFKNTEKSDQTSLIFFVALELLKGEESYWHPYFETAQDSDLPQ------FWEDQNIDE-L 205
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--R 252
E + I+ + IG Y ++ I + YPDL E F +E +K ++ I+ +R
Sbjct: 206 EDALIKAELQMHQVDFIGDY-EIAHGIANHYPDLVHAEKFTIEIYKRAYNIVTTRCFGWS 264
Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-----RQYQPGEQV 302
PS LVP+AD NH ++ +Q +F +D R Y P ++V
Sbjct: 265 CPS----TCLVPFADCFNH---------FNLDNQYEIFNSDLHFKLRDYDPKKKV 306
>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
Length = 476
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 96/411 (23%), Positives = 168/411 (40%), Gaps = 81/411 (19%)
Query: 82 WLSDSG--LPP--QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
WL +G + P Q ++ D G RG+VA ++I + E L +P S ++ ++ E
Sbjct: 14 WLKHTGAQINPKIQLEDLRAKDAG-RGVVAKQDIAEHELLFSIPRSSILGVENSILSTEI 72
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
P W L ++ E +S W+ Y + LP +L++WT EL L+AS
Sbjct: 73 PPATFAHLGP-WLSLILIMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELAE-LQAS 130
Query: 198 QI---------RERAIERITNVIGTYNDLRLRIFS-------KYPDLFPEEVFNMETFKW 241
+ E IE++ VI + D+ IFS K ++ E +
Sbjct: 131 AVVNKIGKEGANEVFIEQLLPVIEEFADV---IFSGDERAKHKAKEMRAPENLELMHKMG 187
Query: 242 SFGILFSRLVRLPSMDGRV----------------ALVPWADMLNHS---CEVETFLDYD 282
S + ++ V D V +VP ADMLN C F + D
Sbjct: 188 SLIMAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADGDRCNARLFYEKD 247
Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL-----S 337
G+ + Q G+++F YG +LL YG++ + D VE+P+ +
Sbjct: 248 ----GLEMKALKPIQAGDEIFNDYGPLPRSDLLRRYGYIT-DNYAQYDVVEIPVDLVSQT 302
Query: 338 LKKSDKCYKEKLEALRK-------YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
L ++E++E L + Y ++AS F ++ + P EL+ ++ P + +
Sbjct: 303 LAHDGLWHEERIEYLDEQEIVDTGYDIAASIPFSLEESLSP-ELVILVETMLLP---REE 358
Query: 391 FEEMAAAA----SNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
FE + + + KMT K A +F+ ++ I++Y L+
Sbjct: 359 FERLQSKGRLPKAEKMTGK-----------AAKFLYKIVQARIAQYPTTLE 398
>gi|340054011|emb|CCC48305.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 572
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 63/228 (27%), Positives = 96/228 (42%), Gaps = 25/228 (10%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITN 209
LL LI E S W + A P + P YW +L L + + + +
Sbjct: 200 LLILALIYERFVVDLSHWHELLVACPSEYPTVPSYWEFDDLSE-LHGLDVLDDVLTKRAR 258
Query: 210 VIGTYNDLRL------RIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGR- 259
V Y+++ L + + L EE F++E W+ SR L ++DGR
Sbjct: 259 VHDFYSEIMLVLPVIHSLVAGSSGLEREEFLRRFSVENIMWARATFDSRAFNL-NVDGRT 317
Query: 260 -VALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGEL 314
+ALVP ADM+NHS + + + G + Q G ++ +SYG N EL
Sbjct: 318 LLALVPNADMVNHSNRADVLVRMVEPDGGDFVMRIGAGLTQEDIGRELSMSYGPLQNWEL 377
Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR-----KYGL 357
L YGFV + N D + PL L + +++ +A R KY L
Sbjct: 378 LQHYGFVLED--NEHDKLPFPLDLPGTADEDRDEWDARRAVLIEKYAL 423
>gi|255723423|ref|XP_002546645.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
gi|240130776|gb|EER30339.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
Length = 428
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 63/291 (21%), Positives = 119/291 (40%), Gaps = 48/291 (16%)
Query: 70 IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI--- 126
+D ++N +K + S + P K+ ++ RG+ A +++KGE ++ +P S ++
Sbjct: 12 LDWIKNTQDEKKPSTHSYISP-KIDVKDARSSGRGIYATNSLKKGELIMNIPHSFLLNFT 70
Query: 127 TADSKWSCPEAGEVLKQCSVP-------------------------DWPLLATYLISEAS 161
T + S + VP + LL+ YL E
Sbjct: 71 TVMAHISRYNGMQDESHIYVPFDNSDGDQFTNIYSKLTREEILELSSFQLLSIYLTFEKQ 130
Query: 162 FEKSSRWSNYISALPR-QPYSLL--YWTRAELDRYLEASQ-----IRER---AIERITNV 210
+S W ++ LP + ++L+ +W+ +++Q +R+R + I ++
Sbjct: 131 RGTNSFWKPFLDMLPSMEDFALMPIHWSDETCKLAPDSTQKSSLKVRDRFENDYKLICDL 190
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
I T DL + L P + + + L+ L + + P+ D +N
Sbjct: 191 IQTKTDLDVTT------LLPRQDVLLSWLCINSRCLYMNLPTSKNTADNFTMAPYVDFMN 244
Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
HSC+ L D +G ++ Y +QV++SYG SN LL YGF+
Sbjct: 245 HSCDDHCTLKID--GKGFQVSSTCSYNIDDQVYLSYGPHSNDFLLCEYGFI 293
>gi|294882647|ref|XP_002769782.1| hypothetical protein Pmar_PMAR004863 [Perkinsus marinus ATCC 50983]
gi|239873531|gb|EER02500.1| hypothetical protein Pmar_PMAR004863 [Perkinsus marinus ATCC 50983]
Length = 433
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/351 (23%), Positives = 147/351 (41%), Gaps = 57/351 (16%)
Query: 99 DVGERG--LVALKNIRKGEKLLFVP--PSLVITADSKWSCPEAGEV---LKQCSVPDWPL 151
D+G +G LV +I+ G ++ +P ++I D+ P+ G+V LK + +
Sbjct: 51 DMGTKGMGLVVSTDIKAGTAMITIPRKSKVLINIDTACDDPDFGKVICYLKGAGLDERGC 110
Query: 152 LATYLI-----SEASFEKSSRWSNYISALPR----QPYSLLY---WTRAELDRYLEAS-- 197
LA +L+ S S + +RW Y + LP + + LL A + L AS
Sbjct: 111 LAFWLVLQKLASTRSSKVKTRWCPYAAMLPTAQKLRDHPLLLDDSGMSAIANTALHASVT 170
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYP-DLFPEEVFNMETFKWSFGILFSR----LVR 252
++E + ++ +++G + L ++ + P D F + + W+ +L SR L
Sbjct: 171 SMKENTLRQLGHLLGILSALEVK--DEGPVDTFIRHTKIKQLWLWAHAVLLSRSGFGLSG 228
Query: 253 LPSMDGRVA-----LVPWADMLNH---SCEVETFLD------YDKSSQGVVFTTDRQYQP 298
P G +A ++P D NH E + + SS+ + R +
Sbjct: 229 EPGDSGVMAGEGLLMIPLVDFANHDSSGGNAEIRIQHASTGWFGGSSESISLVAKRDIKA 288
Query: 299 GEQVFISYGKKSNGELL------LSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
GE++ ISY + G++L +YGF + R G D +P K ++ +
Sbjct: 289 GEEILISY---TGGDILSAEQSIFTYGFRMDRLGA--PDKFAVPTVSKPDGSDMRDVIRR 343
Query: 352 LRKYGLSASEC--FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
L ++ E IQ +P L+AY + + +GK E + A SN
Sbjct: 344 LVHMDVAKDEADVITIQKDTYPEALVAYMCIDILAEK-RGKLESLCQAYSN 393
>gi|72389967|ref|XP_845278.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62359268|gb|AAX79710.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801813|gb|AAZ11719.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 583
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 90/212 (42%), Gaps = 28/212 (13%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEASQI 199
LL LI E ++S W + + + P P YW +L D + ++
Sbjct: 202 LLVLALIYERFVARTSHWKDLLLSCPTDFPTVPSYWNWNDLSGLYGLDVLDDVLAKQERL 261
Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSM 256
R+ E +T+V+ D + EE F +E W+ + SR L ++
Sbjct: 262 RQFHTE-VTSVLPLIYD----ALEGCSGIEREEFMGHFTIENIMWARAVFDSRAFNL-NV 315
Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
DGRV ALVP ADM+NHS + + + G V + G ++ +SYG
Sbjct: 316 DGRVVLALVPCADMINHSNHPDVLIRRVEPCGGDFVMQVGAGLTREDVGRELGMSYGPLQ 375
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
N ELL YGFV + N D + P + ++D
Sbjct: 376 NWELLQHYGFVLDD--NEHDKLPFPFDVHEAD 405
>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
Length = 371
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/308 (25%), Positives = 122/308 (39%), Gaps = 62/308 (20%)
Query: 73 LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
++ +L +W +D G+ + ++ G+VA ++I KGE L+ VP + S+
Sbjct: 1 MDTIDSLLQWAADQGVVLDGVRPSRIPGRGLGMVATRHIHKGEVLIAVPTPAI---RSRH 57
Query: 133 SCPEA--GEVLKQCSV------------PDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
+ P++ G+ ++ PD T + S A FE S+ +
Sbjct: 58 TLPKSLMGKAPTNMTLHGLLAADLLLHPPDVAAWGTLVPSLADFESSTPFF--------W 109
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
P +L E + L Q R R R +S FP
Sbjct: 110 PETLQDLLPPEAKKLLRTQQQRFR-----------------RDWSHAHAGFPSVAEQDYL 152
Query: 239 FKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVV 289
+ W SF + P D R+AL+P ADM NH+ C V S++
Sbjct: 153 YAWFLVGTRSFYYQVDETLPYPWHD-RLALLPVADMFNHASVGCAVAF------STEVYD 205
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
T DR Y+ E+++ SYG SN LL YGF+ ++ NP D + L L + E
Sbjct: 206 VTADRDYEADEELYTSYGAHSNDFLLAEYGFMLQD--NPHDQLCLDAVLLA--RLSAEHK 261
Query: 350 EALRKYGL 357
AL + GL
Sbjct: 262 AALLQRGL 269
>gi|358395796|gb|EHK45183.1| hypothetical protein TRIATDRAFT_39811 [Trichoderma atroviride IMI
206040]
Length = 484
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 82/166 (49%), Gaps = 12/166 (7%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL--RLRIF 222
S+ W+ YI LPR WT E + +L+ + + +++ + Y++L +
Sbjct: 119 STPWTEYIKFLPRSISVPTMWTSEERE-FLQGTSLESSVNAKLSVLSREYDELSEKASTL 177
Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
+ DL E +E + + + SR + LP +A+VP DM NHS + YD
Sbjct: 178 PFWNDLLSESGM-LEDWILADALYRSRCLELPH--AGLAMVPGLDMANHSPKY--LARYD 232
Query: 283 KSSQGVVF---TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPRE 324
++ +G V ++ GE++ ISYG+ KS E+L SYGF+ +E
Sbjct: 233 ETPEGDVVLLPSSGSGVSSGEEITISYGEAKSAAEMLFSYGFIDQE 278
>gi|303311395|ref|XP_003065709.1| hypothetical protein CPC735_049340 [Coccidioides posadasii C735
delta SOWgp]
gi|240105371|gb|EER23564.1| hypothetical protein CPC735_049340 [Coccidioides posadasii C735
delta SOWgp]
gi|320039566|gb|EFW21500.1| hypothetical protein CPSG_01657 [Coccidioides posadasii str.
Silveira]
Length = 636
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 32/205 (15%)
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
PE +K+ + L+ YL+ + SF W+ YI +LP Q L Y++ +L+
Sbjct: 22 PEFLPAVKEKGALAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 75
Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
+LE + + + + + TY ++ L++ + P+ + + E F W+ I+ SR
Sbjct: 76 WLEGTNLLKLRENMLIKLKTTY-EVGLQMLKESPNKNTKN-YTWERFLWASSIIISRAFS 133
Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
+ V LVP DM NH +VE ++SQGVV +
Sbjct: 134 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 188
Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
PG++V +YG ++N L+L+YGF
Sbjct: 189 LLPGQEVPNNYGPRNNERLMLNYGF 213
>gi|303284299|ref|XP_003061440.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456770|gb|EEH54070.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 644
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/204 (25%), Positives = 79/204 (38%), Gaps = 39/204 (19%)
Query: 151 LLATYLISEASFEKS---SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERI 207
LLA ++ E++ + S W+ Y++ LPR+ SL+ W EL L+ S+ RA ERI
Sbjct: 214 LLALGVLHESAHSRDDPPSHWATYVTLLPRRVGSLMEWDERELS-ALQGSRHATRARERI 272
Query: 208 TNVIGTYNDLRLRIFSKY----PDLFPEEVFNMET------FKWSFGILFSRLVRLPSMD 257
++D R + F LF E+ ++W+ + +R P +
Sbjct: 273 A----LFDDARAKCFPALLRADESLFGEDEATRRAHESPRAWRWAVATVLARAFYFPDAN 328
Query: 258 GRVALVPWADMLNHSCEVETFLDYD-------------------KSSQGVVFTTDRQYQP 298
L P D+ NH E E + D K + V P
Sbjct: 329 -EHGLCPGLDLFNHCSEAEKCVVEDGTADDEGDEGDEGEGKYAHKEAPRVTLRAGVGGVP 387
Query: 299 -GEQVFISYGKKSNGELLLSYGFV 321
G Q+F Y + G LL +GF
Sbjct: 388 AGTQIFHDYADHARGGCLLEFGFT 411
>gi|406868331|gb|EKD21368.1| SET domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 480
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 92/208 (44%), Gaps = 26/208 (12%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFS 223
S+ W+ Y+ LP W+ E R + E A+ + ++I DLR
Sbjct: 117 SNPWTEYVRMLPESIPVPTMWSEEE--RVMLTGTSLETAVSAKCASLISEIEDLR----G 170
Query: 224 KYPDL-------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
K ++ + EE E WS + R L + +++P DM+NH+ E
Sbjct: 171 KTAEIAWCQKCWWEEESLRYEN--WSLLDAWYRSRSLEVPNAGESMIPCVDMVNHAAEAN 228
Query: 277 TFLDYDKSSQ---GVVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSV 332
++ Y+++S ++ D Q + +V ISYG KS E+L SYGF+ +GT+ ++
Sbjct: 229 SY--YERTSDNNIALLLRPDTQLEAESEVTISYGSSKSEAEMLFSYGFIDEQGTSKGLTL 286
Query: 333 ELPLS----LKKSDKCYKEKLEALRKYG 356
+ S L K+ K + LR +G
Sbjct: 287 NIDPSPDDPLGKAKAAAFSKSKTLRIFG 314
>gi|50287013|ref|XP_445936.1| hypothetical protein [Candida glabrata CBS 138]
gi|49525242|emb|CAG58855.1| unnamed protein product [Candida glabrata]
Length = 599
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/216 (24%), Positives = 96/216 (44%), Gaps = 40/216 (18%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY-------NDL 217
S ++ YI ALP+ +S L W +EL+ L + + E++ +VI + DL
Sbjct: 101 SKKFHPYIQALPKLIHSPLVWNPSELETLLVGTNLGGSVKEKLCSVIKEWIVLIESREDL 160
Query: 218 RLRI-------FSKYPDLFPEEVFNM-----------------ETFKWSFGILFSR---- 249
+ ++ F Y DL E+++N+ F +S + SR
Sbjct: 161 KSKVDGKYLINFENYNDLVYEDIYNIFVKPVEFEFADLVWLSFPAFLYSHLVFTSRAFPE 220
Query: 250 -LVRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
++ + + V L+P D++NH + +V+ F ++ + + G+++ +Y
Sbjct: 221 YVIDKNANEFSVILLPILDLMNHNYNSKVQWFPKEHQNGTSFCYQCLADMKAGDELDNNY 280
Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
G K N ELL YGFV + N D+V L + L + +
Sbjct: 281 GGKGNEELLNGYGFVIDD--NIFDTVALRIKLSEEE 314
>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
Length = 281
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 123/281 (43%), Gaps = 25/281 (8%)
Query: 69 EIDSLENASTLQKWLSDSGLPPQKM--AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
E+DS S+ W D+ L ++QK G+VA+++I E L VP L++
Sbjct: 11 ELDSA--ISSFLLWCHDNDLKLNNKVSSMQKGSCHRYGMVAMEDISPDECLFKVPRGLLL 68
Query: 127 TADS-KWSCPEAGEVLKQ--CSVPDWPLLATYLISEASFEKSSRWSNYISALP-----RQ 178
+ S G+V++ W L L+ E + +S W Y+ +P Q
Sbjct: 69 EPKTCGISKILTGKVIQNMLSQHEGWVPLLLALMYEYT-NPTSLWKPYMDIVPGIDILDQ 127
Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
P ++W L+ + + + + Y + + I K+ F + ++
Sbjct: 128 P---MFWPDETRQSLLQGTGFEDDVEDDKQRIERQYFTVAVPIMKKFKKFFDLKRHSLSL 184
Query: 239 FKW--SFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
+K +F + +S PS G +VP AD+LNH L++ + +V T +
Sbjct: 185 YKHMAAFIMAYSFTEDSPSFHGNNVPVMVPMADILNHHSNNNARLEFGEEELSMVST--Q 242
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREG-TNPSDSVEL 334
G +VF +YG+ +N LL SYGFV EG NP+D+V L
Sbjct: 243 HILKGGEVFNTYGQLANCHLLQSYGFV--EGPDNPNDTVSL 281
>gi|397563943|gb|EJK44003.1| hypothetical protein THAOC_37500 [Thalassiosira oceanica]
Length = 595
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 31/155 (20%)
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK-------- 283
+ F ++ F+++ ++ SR + DG + L+P+ D NH DYD
Sbjct: 296 QCFTVDGFRYAVAVVRSRSFFV---DGALRLLPYVDYANHD-------DYDSNELVGGGI 345
Query: 284 -----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP----REGTNPSDSVEL 334
SS+GV+ + + + G++V ISYG K + +L +GFVP + + EL
Sbjct: 346 GMLWGSSKGVLLKSGKALRVGDEVRISYGPKGPADYILDHGFVPPMCQLSTQGGAITAEL 405
Query: 335 PLSLKKSDKCYKEKLEAL--RKYGLSASECFPIQI 367
+ +SD+ +KL+ L Y L+ E P Q+
Sbjct: 406 SFEVDESDRFRDDKLDILEFETYDLAPME--PAQV 438
>gi|402581480|gb|EJW75428.1| hypothetical protein WUBG_13665, partial [Wuchereria bancrofti]
Length = 118
Score = 53.5 bits (127), Expect = 2e-04, Method: Composition-based stats.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 11/107 (10%)
Query: 239 FKWSFGILFSRLV----RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
F W++ I+ +R + +L + D +A+VP DMLNHS + + +D
Sbjct: 14 FLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPLIDMLNHSNDSQCCAIWDGKLNLCK 73
Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
R + GEQ+FI YG +NG L + YGF ++ N + VE+ L
Sbjct: 74 VIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKD--NICNKVEISL 118
>gi|365989204|ref|XP_003671432.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
gi|343770205|emb|CCD26189.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
Length = 589
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 114/272 (41%), Gaps = 58/272 (21%)
Query: 117 LLFVPPSLVITADS-----KWSCPEAGEVLKQC--SVPDWPLLATYLISEASFEKSSRWS 169
L+ VP +L++T + WS + G + ++ L L EA K S+W
Sbjct: 51 LISVPTNLLLTYQNATDFFNWSIAKGGNLPTNNPNAITQLYLSHLKLNPEA---KKSQWD 107
Query: 170 NYISALP---RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL--------- 217
Y+ L QPY +WT +L + L+ + + + + +I Y +L
Sbjct: 108 KYVEILSLDLNQPY---FWTVDQLQQ-LKGTDLYIKIQQDFATIIQEYIELLQILKVDIL 163
Query: 218 ---RLR-------IFSKYPDLFPEEV--FNMETFKWSFGILFSR------LVRLPSMDGR 259
+L+ I S P L ++ + ++ WS I SR L S G
Sbjct: 164 DQEKLQTATISHYINSHLPTLLDGKLPWNHFVSYLWSHCIFKSRAFPQLLLNNAGSDVGN 223
Query: 260 VALV---PWADMLNHSCEVETFLDYDKSSQG--------VVFTTDRQYQPGEQVFISYGK 308
+ L P D+LNH +V + ++ S+ + F T G+Q+F +YG
Sbjct: 224 INLAFLFPIVDLLNHKNDV--VVKWESSNDINNKNDNKVLTFITQETLHVGDQIFNNYGN 281
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
KSN ELLL YGF+ +E N D EL L L +
Sbjct: 282 KSNEELLLGYGFI-QENNNNYDYSELTLKLNE 312
>gi|348679311|gb|EGZ19127.1| hypothetical protein PHYSODRAFT_493969 [Phytophthora sojae]
Length = 776
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/202 (25%), Positives = 87/202 (43%), Gaps = 27/202 (13%)
Query: 151 LLATYLISEASFEKSSRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAI- 204
LL +L+ E SRW+ Y+S LP S L+++ E+ L+ ++ + A
Sbjct: 11 LLTAFLLWEELSGHESRWTPYLSLLPPLSSRDDVVSPLFFSSDEVVEALQDERMVKTARA 70
Query: 205 --ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
+R G + R+F +P L + + W+ ++ SR S+ G+ L
Sbjct: 71 ERQRAKKAHGRFK----RLFRSFPAL---KALEWPQYAWARFLVNSRAF---SIQGQRVL 120
Query: 263 VPWADMLNHSCEVET--------FLD-YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
VP+ D+ N + + FL +D S G+ DR G+Q+F YG SN
Sbjct: 121 VPFGDIFNGKPDDDAREHDNGQRFLQLHDLQSMGMTIRADRGAAKGKQLFEDYGDNSNYV 180
Query: 314 LLLSYGFVPREGTNPSDSVELP 335
L +GF+ +G + LP
Sbjct: 181 YFLHHGFLMGDGCFDCAAFRLP 202
>gi|294657576|ref|XP_459875.2| DEHA2E13090p [Debaryomyces hansenii CBS767]
gi|199432797|emb|CAG88116.2| DEHA2E13090p [Debaryomyces hansenii CBS767]
Length = 505
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 127/298 (42%), Gaps = 53/298 (17%)
Query: 81 KWLSDSGLP-PQKMAIQKV--DVGERGLVALKNIRKGEKLLFVPPSLVITADS---KWSC 134
+WLS+ + K+ ++ + D RG+VA ++I + E+L +P +I D+ +
Sbjct: 13 EWLSEENVTISSKLVVKDLRKDNQGRGMVANEDIEEDEELFSIPRETIINIDNCSLTKTN 72
Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP---RQPY---SLLYWTRA 188
+A + L S+ W L L+ E S+WS Y + LP Q Y L++W+
Sbjct: 73 SKARDGL--LSLNQWEALIIVLLYELKVNGKSKWSAYFNTLPIKDTQNYKFNQLMFWSHE 130
Query: 189 ELDRYLEASQIRERAIERI--TNVIGTYNDLRLRIFS--KYPDLFP---EEVFNMETFKW 241
+L L S I I+RI YN L ++ P+LF EE + +
Sbjct: 131 QLAD-LSPSLI----IDRIGKDEAEAMYNKLFPKVVEDLNIPELFKVTLEEYHKVASLIM 185
Query: 242 SFGILFSR-------------------LVRLPSMDGRV--ALVPWADMLNHSCEVETF-L 279
S+ R ++G ++VP AD+LN ++ L
Sbjct: 186 SYSFDVERPEFNQVEDDEAEDDEEEDDEGDGTILNGNYYKSMVPLADILNADTKLHNASL 245
Query: 280 DYDKSSQGV-VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
Y + GV V + + + GEQ++ +Y N E+L YG+V G+ D E+PL
Sbjct: 246 VY---TPGVLVMKSVKPIKKGEQIYNTYSDHPNSEILRRYGYVETNGS-ELDFGEIPL 299
>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
VaMs.102]
Length = 469
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 69/296 (23%), Positives = 124/296 (41%), Gaps = 41/296 (13%)
Query: 81 KWLSDSGLPPQKMAIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVI---TADSKWS 133
+W +G + +Q VD+ RG++A ++I + L +P +I T++
Sbjct: 13 QWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPEETTLFTIPRQAIINVLTSELPQK 72
Query: 134 CPEA--GEV--LKQCSVP--DWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
P+ G + + + P W L ++ E SSRW Y LP+Q + ++W+
Sbjct: 73 LPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQFDTPIFWSD 132
Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PE--------EVFNME- 237
EL L+ + + I ++ + + L I P +F PE E+ ++
Sbjct: 133 GEL-LELQGTSLTAEKIGKVES-DAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAH 190
Query: 238 -----TFKWSFGILFSRLVR------LPSMDGR--VALVPWADMLNHSCEVETFLDYDKS 284
++F + + +GR + +VP AD LN + E +++ +S
Sbjct: 191 RMGSTIMAYAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGES 250
Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
+ D + G+QV YG ELL YG+V E + D VE+P +L K
Sbjct: 251 LEATAIRAD--IRAGDQVLNYYGPLPTSELLRRYGYVTPEHSR-YDVVEVPWTLVK 303
>gi|346970168|gb|EGY13620.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
Length = 485
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 88/199 (44%), Gaps = 20/199 (10%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
S+ W+ Y+ LPR+ W+ E + L+ + + +I + + LR K
Sbjct: 116 STPWTEYVKFLPREVPVPTMWSEQERE-LLQGTSLELAVSAKIQALTSEFEALR----EK 170
Query: 225 YPDL-FPEEVF----NMETFKWSFGILF--SRLVRLPSMDGRVALVPWADMLNHSCEVET 277
DL F +F N+ W + SR + LPS V++VP D+ NH+
Sbjct: 171 SSDLPFWHAIFWDTNNVSLADWFLVDAWYRSRSLELPS--AGVSMVPVLDLANHAPAPSA 228
Query: 278 FLDYDKSSQGVV---FTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
+ + +G V G++V ISYG KS E+L SYGF+ + +D+V
Sbjct: 229 YYEESARREGDVELRLRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFI--DPARSTDTVA 286
Query: 334 LPLSLKKSDKCYKEKLEAL 352
LPL+ + D K K+ +
Sbjct: 287 LPLAPLEDDPLGKAKVHSF 305
>gi|302839507|ref|XP_002951310.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
nagariensis]
gi|300263285|gb|EFJ47486.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
nagariensis]
Length = 730
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--PLSLK 339
D +S G +PGE+++ISYG+KSN ELL+ YGF + NP D + L PL +
Sbjct: 374 DSASLGATLHGGAHVRPGEELYISYGEKSNEELLMLYGFALED--NPHDHLMLYCPLPPR 431
Query: 340 KS-DKCYKEKLEALRKYGL 357
D ++E L+ YGL
Sbjct: 432 AEWDDVMYARMELLQAYGL 450
>gi|67900706|ref|XP_680609.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
gi|40742521|gb|EAA61711.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
gi|259483305|tpe|CBF78584.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 441
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 66/144 (45%), Gaps = 25/144 (17%)
Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
+FPE + + W+ I+ SR S + + +VP+AD NH + +
Sbjct: 212 VFPETDWKAVAYNWA--IINSRSFYYVSPGKNEPSDWNDAIGMVPFADYFNHRDDASCEV 269
Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
+D+ S +F ++ GE++++SYG SN LL+ YGF + NPSD V L
Sbjct: 270 TFDRDS--YIFRAEK----GEEIYMSYGPHSNDFLLVEYGFYLDD--NPSDRVYL----- 316
Query: 340 KSDKCYKEKLEALRKYGLSASECF 363
D KL K L+ ECF
Sbjct: 317 --DDIILPKLTRSEKKELAERECF 338
>gi|294950065|ref|XP_002786443.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239900735|gb|EER18239.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 551
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/86 (38%), Positives = 44/86 (51%), Gaps = 2/86 (2%)
Query: 253 LPSMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
LP +VP AD+LNH + F +DK S+ V T + G ++FI+YG N
Sbjct: 328 LPPDQSITCVVPGADLLNHHQRGQCGFPRFDKKSRSFVITAEANVPAGSELFINYGGLQN 387
Query: 312 GELLLSYGFVPREGTNPSDSVELPLS 337
E L+ YGF NP DSV L L+
Sbjct: 388 WEQLMYYGFC-EFAQNPYDSVTLDLA 412
>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
Length = 565
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 54/96 (56%), Gaps = 11/96 (11%)
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
L+P D+LNH+ + T + + + G +F +D GE++F +YG+K N ELLL+YGF
Sbjct: 216 LLPVVDLLNHNPK--TKVQWSGTDGGFLFQSDDA-SSGEELFNNYGQKGNEELLLAYGFA 272
Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
NP+DS L + + S KL+ ++ G+
Sbjct: 273 IE--NNPADSAALKIKIPDS------KLQVVKDLGI 300
>gi|340509072|gb|EGR34645.1| SET domain protein [Ichthyophthirius multifiliis]
Length = 326
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 78/174 (44%), Gaps = 14/174 (8%)
Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIE 205
P ++ Y+ E S WS YI+ LP +++ +L+ +L S + +E
Sbjct: 157 PIVKIIEEYIHEEKFINPDSLWSIYINILPSDYNQYPIFFPEEDLE-WLSGSPFLNQVLE 215
Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFN-METFKWSFGILFSRLVRLPSMDGRV--AL 262
+ ++ Y+D+ + PE N + F W+ SR+ L +DG+ A
Sbjct: 216 KKADIKRDYDDI--------CSIAPEFAINTFQDFCWARITASSRVFGL-QIDGQKTDAF 266
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
VP ADMLNH +T YD + QG + G+QV+ SYG+K N L
Sbjct: 267 VPLADMLNHRRPKQTSWQYDDNRQGFIIEALEDIPRGDQVYDSYGQKCNSRFSL 320
>gi|317151155|ref|XP_001824477.2| ribosomal N-lysine methyltransferase [Aspergillus oryzae RIB40]
gi|391868702|gb|EIT77912.1| hypothetical protein Ao3042_05894 [Aspergillus oryzae 3.042]
Length = 407
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 123/280 (43%), Gaps = 43/280 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G++A +NI + E ++ VP ++T++ P P L A +L + + E
Sbjct: 39 GMIATRNIEEDEAIVTVPLKAMLTSER---IPSYFTSKFPDGTPTHALYAAFL-TNGNAE 94
Query: 164 KSSRWSNYISALP-RQPYSL---LYWTRAELDRYLEAS--------QIRER-AIERITNV 210
++ + P RQ + + W+ + L YL S Q R++ E
Sbjct: 95 DLEEFNAWRKTWPSRQDFEDSMPILWSES-LRNYLPPSISSHWHSIQSRDKLQYETTHQN 153
Query: 211 IGTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------D 257
+ + RLR + S +PD + ETF + + I+ +R +P +
Sbjct: 154 LLAQQEQRLRTAWDIVVSVFPDT------DWETFSYHWLIVNTRSFFYLMPGQEPPEDRN 207
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+AL+P+AD NHS +V + +D + VF + Y GE++++SYG N L
Sbjct: 208 DAMALLPFADYFNHS-DVACNVKFD--GENYVFRATKHYDEGEEIYMSYGPHPNDFLFAE 264
Query: 318 YGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG 356
YGF E N S+++ L + LK +E+LE + YG
Sbjct: 265 YGFYLDE--NESETLYLDDIILKDLSTSLQEELEFQQYYG 302
>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 476
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/298 (24%), Positives = 115/298 (38%), Gaps = 54/298 (18%)
Query: 82 WLSDSGL---PPQKMA-IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
WL SG P K+ ++ D G RG+VA ++I + E L +P + +++ ++ E
Sbjct: 14 WLRQSGAEISPKIKLEDLRNKDAG-RGVVASQDIAEHELLFRIPRASILSVENSILSTEI 72
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
P W L ++ E +S W+ Y + LP + +L++WT EL L+AS
Sbjct: 73 PAATLSLLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAE-LQAS 130
Query: 198 QI---------RERAIERITNVIGTYNDL-------------------RLRIFSKYPDLF 229
+ E +E++ VI + D+ L + K L
Sbjct: 131 AVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLI 190
Query: 230 PEEVFNME------TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLD 280
F++E LP +VP ADMLN C F +
Sbjct: 191 MAYAFDVEPATPTKEVDEEGFAEEEEDAALPK-----GMVPLADMLNADADRCNARLFYE 245
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
D + + Q GE++F YG +LL YG+V + D VE+P L
Sbjct: 246 KDCLEMKAL----KPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDVVEIPTDL 298
>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 31/224 (13%)
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYDKSSQG-- 287
E F++E W+ SR L ++DGRV ALVP ADM+NH + + + + G
Sbjct: 290 ECFSIEAMMWARATFDSRAFNL-NVDGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDF 348
Query: 288 --VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK---SD 342
+ + G ++++SYG N ELL YGFV EG N D + P + D
Sbjct: 349 VMQIGASLTAQDIGREIWMSYGPLQNWELLQFYGFV-LEG-NEHDRLPFPFDFPEGVVGD 406
Query: 343 KCYKEKLEALRKYGLS-ASECFPIQITGWPLELMAYAYLVVSPP------SMKGKFEEMA 395
+ + + YGL A C+ P L+A + ++ KG F +
Sbjct: 407 EWDGRRAALVATYGLHLAGRCWICHDGRPPPALVALLRVHLAEAEEFDTMERKGPFASLG 466
Query: 396 AAASNKM--TSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
A ++ T I+C ILD +S+ + R L+
Sbjct: 467 AGTEARVVATIADTIRC----------ILDLFSTSLEEDERLLE 500
>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
Length = 430
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/270 (24%), Positives = 109/270 (40%), Gaps = 22/270 (8%)
Query: 151 LLATYLISEASFEKSSRWSNYISALPR-QPYSL--LYWTRAELDRYLEASQIRER-AIER 206
LLA YL+ E +S W +I LP + SL + W ++ + ++ R A +
Sbjct: 122 LLAIYLVLEKERGAASFWKPFIDMLPSIEELSLAPVVWKVLQVPHCDDLWRMLSRSARKH 181
Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP-----SMDGRVA 261
+V+ + + ++ DL F +F W++ + SR + +
Sbjct: 182 AESVVARFE----KDYAVVCDLPSVPAFERSSFLWAWMCINSRCLYMSMPQAKDTSDNFT 237
Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF- 320
+ P+ D LNHS E + + D G T Y+P E+++ SYG SN LL YGF
Sbjct: 238 MAPYVDFLNHSNEDQCGIKIDP--HGFHVLTSSAYKPQEELYFSYGPHSNEFLLCEYGFT 295
Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYL 380
+P N D + L L + E++ L+ G + + E+ A A L
Sbjct: 296 LPHNKWNYIDITDFILPLLR-----PEQVSFLKDMGYYGDYTVNTEGMSFRTEI-ALATL 349
Query: 381 VVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
S P K + + S+ +K K
Sbjct: 350 QESEPQQSRKLKALVEGMSDGAVFEKQSKV 379
>gi|156035929|ref|XP_001586076.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980]
gi|154698573|gb|EDN98311.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 291
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
FPE + + F +++ I+ SR P + R+AL P+AD +NHS +
Sbjct: 140 FPEPPISYDEFMYNYSIVNSRTFYYLSPTIKSSKLQPPKEDRLALNPFADYMNHSSQ--P 197
Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
++ S G T + + G +V ISYG +N LL+ YG R G N D
Sbjct: 198 TVNATLSRAGYTLTASQPIKEGSEVHISYGSHNNDFLLIEYG---RHGNNSED 247
>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
Length = 509
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 90/205 (43%), Gaps = 39/205 (19%)
Query: 258 GRVALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
G VA+VP ADMLN E E L YD+ +V T + + GEQ++ +YG N +LL
Sbjct: 261 GDVAMVPMADMLNARFESENAKLFYDERELKMVST--KPIKAGEQIWNTYGDPPNSDLLR 318
Query: 317 SYGFV-------PREG-TNPSDSVELPLSL------KKSDKCYKEKL-----EALRKYGL 357
YG V P G NP D VE+ L KK KE++ EA +
Sbjct: 319 RYGHVDLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFV 378
Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
++C + EL+++ L++ P K ++E K K + P++D+
Sbjct: 379 LRTDCELAE------ELVSFVRLLLLP---KDEWE--------KAAQKSKLPKPKLDKDV 421
Query: 418 LQFILDSCESSISKYSRFLQVKELL 442
L +D E + Y L+ E L
Sbjct: 422 LTIAVDVLEKRLKDYPTTLEEDEAL 446
Score = 38.9 bits (89), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 3/119 (2%)
Query: 76 ASTLQKW--LSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
AS W L L +K+ I + RG +AL++I + L +P L ++ +
Sbjct: 6 ASEFVHWFQLQHGNLDTEKVGIVEFPEHGRGAIALQDIPEDYTLFTIPRELTLSTRTCSL 65
Query: 134 CPEAGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
G+ K+ + + W L +I E S S+WS Y++ LP + ++W + +L+
Sbjct: 66 PTLMGQAWKEHGLHEGWAGLILCMIWEESRGSDSKWSGYLATLPSSFDTPMFWGQEDLN 124
>gi|167534011|ref|XP_001748684.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772925|gb|EDQ86571.1| predicted protein [Monosiga brevicollis MX1]
Length = 945
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/286 (23%), Positives = 119/286 (41%), Gaps = 46/286 (16%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITAD------------SKWSCPEAGEVLKQCSVPDWPL 151
G+VA ++ +G+ L +P S +IT + + W+ E W
Sbjct: 523 GMVAATSLAEGDVLFEIPRSALITVNNSQINQQLSEMAAAWAEEEDEPEDGDGDPRQWTQ 582
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSL--LYWTRAELDRYLEASQIRERAIERITN 209
L ++ E + + +SR+ Y+ LP + WT AE D+ L ++ + +
Sbjct: 583 LVCAMMVENT-DPASRFRPYLDFLPDHTTLAHPMLWTSAERDQLLAGLRLAQDVENDLEM 641
Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMD----GRVALVP 264
+ + +L L ++ FP + E + +F + F+ +V S G V +VP
Sbjct: 642 INSHFQELALPFLRRHA--FPALAELSDEDLRRNF-MAFAAVVMAYSFTDDTTGEVCMVP 698
Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPRE 324
AD+LNH K + + + D Q+F +YG N +LL +GFV
Sbjct: 699 VADILNHVT--------GKCNAKLYYAKD-----ALQIFNTYGSLDNQQLLQKHGFVEPT 745
Query: 325 GTNPSDSV----ELPLSLKKS------DKCYKEKLEALRKYGLSAS 360
GT +S+ EL +L+ S D + KL+ L + G ++
Sbjct: 746 GTPFDESILPVEELVAALRPSFEGVLDDAAVERKLDLLLERGFASG 791
>gi|429850390|gb|ELA25672.1| set domain-containing protein, partial [Colletotrichum
gloeosporioides Nara gc5]
Length = 443
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/204 (28%), Positives = 91/204 (44%), Gaps = 30/204 (14%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAEL---------DRYLEASQIRERAIERITNVIGTYN 215
S+ W+ Y+ LP Q WT E D L +SQ A +I + ++
Sbjct: 105 STPWTEYVKFLPPQVPVTTLWTEQEREMLVGTSLEDHVLTSSQSATAA--KIVTLTDEFD 162
Query: 216 DLRLRIFSKYPDLFPEEVF----NMETFKWSF--GILFSRLVRLPSMDGRVALVPWADML 269
+LR P F E+F + W+ SR + LP A+VP D+
Sbjct: 163 ELR-ETSEALP--FWNELFWESDKVSLIDWARVDAWFRSRCLELPK--SGEAMVPVLDLA 217
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREG 325
NHS + + Y+++S+ V R + G+++ ISYG KS E+L SYGF+ +
Sbjct: 218 NHSAQANAY--YEENSKDEVVLLLRPGCRVLSGDEMTISYGDAKSGAEMLFSYGFI--DP 273
Query: 326 TNPSDSVELPLSLKKSDKCYKEKL 349
+ +D + LPL+ + D K KL
Sbjct: 274 ASAADRITLPLAPLEDDPLGKAKL 297
>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 485
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 47/268 (17%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
P K+A + + RG+VA +I + E+L +P +LV++ + S + + ++ P
Sbjct: 34 PKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQNS-SLKDLLDFNERDFDP- 91
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
W L +I E +S WS Y LP +L++WT EL R L S + + +I
Sbjct: 92 WLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL-RELSGSAV----LNKIG 146
Query: 209 NVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM-------- 256
N LR L + S P FP + + +F G L S R+ S+
Sbjct: 147 RSDAEANILRNILPLVSGNPSHFP-PMSGVASFDSPEGKAALLSLAHRMGSLIMAYAFDI 205
Query: 257 -----------DGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
DG V +VP AD+LN +T + + Q + + R
Sbjct: 206 EKGENDGGEGQDGYVTDDEEELSKGMVPLADLLN----ADTDRNNARLFQEDCYLSMRSI 261
Query: 297 QP---GEQVFISYGKKSNGELLLSYGFV 321
+P GE++F YG+ +LL YG+V
Sbjct: 262 KPIRKGEEIFNDYGELPRADLLRRYGYV 289
>gi|320167915|gb|EFW44814.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 614
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 71/317 (22%), Positives = 131/317 (41%), Gaps = 45/317 (14%)
Query: 79 LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP--E 136
LQ + +++ + K I+ + GL A +I +G++++ P +L I + P E
Sbjct: 106 LQDFYTENSIELIKANIRYSPETDFGLYATADIDQGDEIVRAPVTLTIASQYLEDSPLTE 165
Query: 137 AGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYIS---------------ALPRQPY 180
+ L PD +A +++ E + S +S +I +
Sbjct: 166 EMQRLFGDQQPDELTAIALHILHEKVHKSQSFYSRWIHIGAHNCSMISNGFDCVAVEELN 225
Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF---------SKYPDLFPE 231
S + W E++ QI E + + +++ + R F + + +
Sbjct: 226 STVMWDFNEVNEL----QISEEFVAMMQSLVDHMQEQYHRYFEPVSKARALAGFLSIMDG 281
Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQG-- 287
+ E F+W++ +R V + S G V+ +VP D +NH+ + LD+ S QG
Sbjct: 282 IIVKPEVFQWAYLTAIARGVPMKSKTGDVSYGIVPGIDWVNHAYDNNAHLDF--SMQGRM 339
Query: 288 ---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD--SVELPLSLKKSD 342
+ R GEQ+ +Y N +LLL +GF R+ NP D SV L ++ +
Sbjct: 340 LGSMTLRATRDIAAGEQIVRNYVPMPNNQLLLRFGFAIRD--NPHDFVSVFLDQAVGATQ 397
Query: 343 KCYKEKLEALRKYGLSA 359
+ K LR++ L A
Sbjct: 398 MAARRK-AILRRHQLDA 413
>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
2508]
gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 533
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/234 (25%), Positives = 92/234 (39%), Gaps = 39/234 (16%)
Query: 122 PSLVITADSKWSCPEAGE--VLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP 179
PS AD + PE + S W LL L+ E SS WS Y+S LP Q
Sbjct: 99 PSHTADADDEPPSPENDDDDAEDSQSQDSWTLLILILMHEYLQGSSSNWSPYLSILPHQF 158
Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF 234
+ ++WT AEL L+AS + ++ A + I I +F YP P+
Sbjct: 159 DTPMFWTEAELAE-LQASALVAKVGKDEADKMIRTKIVKVVQENEDVF--YPAGTPKTQR 215
Query: 235 NMETFKWSFGILFSRLV---------------------------RLPSMDGRVALVPWAD 267
E G + ++ M+ + +VP AD
Sbjct: 216 LDEGELLKLGHRMGSAIMAYAFDLAKEEDDDEDEEEEEDGWVEDKIGGMNDTMGMVPMAD 275
Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
MLN +++ ++ + T+ R+ + GE++ YG S+ ELL YG+V
Sbjct: 276 MLNADAVFNAHINHGEAC--LTATSLREIKEGEEILNYYGPLSSAELLRRYGYV 327
>gi|389639446|ref|XP_003717356.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
gi|351643175|gb|EHA51037.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
gi|440465360|gb|ELQ34683.1| hypothetical protein OOU_Y34scaffold00748g2 [Magnaporthe oryzae
Y34]
gi|440490993|gb|ELQ70482.1| hypothetical protein OOW_P131scaffold00027g18 [Magnaporthe oryzae
P131]
Length = 500
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 84/199 (42%), Gaps = 36/199 (18%)
Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIRERAIERITNV 210
+LI + S W YI++LP QP L W E D + A+ A + + +
Sbjct: 110 FLIQQYLLGPKSHWHPYIASLP-QPEHLASWNLPPFWPEEDAAVLAATNAGVAAKEMAGI 168
Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS--------------- 255
G + R + L ++ ++W++ I SR R PS
Sbjct: 169 AGRESKQGRRAL-RESGLENWREYSPLLYRWAYCIFTSRSFR-PSLVVPPAVWESLRNEH 226
Query: 256 --------MDGRVALVPWADMLNHSCEVETFLDYDKSSQG-----VVFTTDRQYQPGEQV 302
MD L+P D+ NH V+ D +S G V + + Y+PGEQ+
Sbjct: 227 AKYLDGCEMDDFSILMPLFDIANHDPLVQATWD-SESVPGECRLLVNGRSGQGYRPGEQI 285
Query: 303 FISYGKKSNGELLLSYGFV 321
F +YG K+N ELL++YGFV
Sbjct: 286 FNNYGLKTNSELLVAYGFV 304
>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
70294]
Length = 494
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 68/272 (25%), Positives = 115/272 (42%), Gaps = 56/272 (20%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKW---SCPEAGEVLKQCSVPDWPLLATYLISE 159
R +VA++++ +GE L +P ++ ++ P G + +W L ++ E
Sbjct: 41 RAMVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSFG----TSQLGEWEELILCMLYE 96
Query: 160 A-SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRER-----AIERITNVI 211
++SRW Y + LP + SL+YW+ EL L+ S + ER + E + V+
Sbjct: 97 MFVLGENSRWYPYFNVLPSSAELNSLIYWSDRELG-LLKPSFVIERIGRGKSQEMFSKVL 155
Query: 212 G--TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------- 260
D L + +KY E F + I+ S + ++ +
Sbjct: 156 SYIENQDSDLSLIAKY--------LTWENFVYVASIIMSYSFDVEDLNPQSDEDDEIEDD 207
Query: 261 -------------ALVPWADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+++P AD LN H C L YDK + + T + + GE+VF
Sbjct: 208 DNDSEMSPDKSIKSMIPLADTLNSDTHLCNAN--LMYDKET--LKMTAIKPIRAGEEVFN 263
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
YG+ N E+L YG+V +G+ D ELPL
Sbjct: 264 IYGEHPNSEILRRYGYVEWKGS-KYDFAELPL 294
>gi|356499056|ref|XP_003518360.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40-like
[Glycine max]
Length = 497
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 116/281 (41%), Gaps = 56/281 (19%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
G RGL A +++ +GE +L VP S ++T +S + + V + S+ +L L+ E
Sbjct: 64 GRRGLGAARDLGRGEIVLRVPKSALMTRESVMEDEKLCDAVNRHSSLSPAQMLIVCLLYE 123
Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL---EASQIRERAIERITNVIGTYND 216
+ +SRW Y+ +P Q Y +L R L EA + E+A+ + + +
Sbjct: 124 MG-KXTSRWHPYLVHMP-QTYDILAMFGEFEKRALQVDEAMWVTEKAMLKAKSEWKEAHA 181
Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC--- 273
L + +F + + + W+ + S+ + +P D L D+ N+
Sbjct: 182 LMEDL------MFKPQFLTFKAWVWAAATISSQTMHIP-WDEAGCLCLVGDLFNYDAPGM 234
Query: 274 ------EVETFLD--------------------------------YDKSSQGVVFTTDRQ 295
++E FL +++ F +
Sbjct: 235 EPSGIEDLEHFLSNSSIHDTSLLNGDNNIMMQTTLILTQRLTDGWFEEDVNAYCFYSRAH 294
Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
Y+ G+QV + YG +N EL+ YGF+ +E NP+D V +PL
Sbjct: 295 YKKGDQVLLCYGIYTNLELVEHYGFLLQE--NPNDKVFIPL 333
>gi|402584499|gb|EJW78440.1| hypothetical protein WUBG_10651 [Wuchereria bancrofti]
Length = 362
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 88/214 (41%), Gaps = 42/214 (19%)
Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
S W YI LP + L++T +L ++L S + E ++ NV + L I
Sbjct: 19 SHWQPYIKVLPENFNTPLFFTVEQL-QFLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 77
Query: 223 ---------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR------ 259
+ L P V F ++WS + +R+ +PS R
Sbjct: 78 QFRHRKKKSKEMSKLEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVLRDDIGQP 137
Query: 260 ---VALVPWADMLNHSC-------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
L+P+ DM NHS V +++D + V R Y+P E V I YG +
Sbjct: 138 RLIPGLIPFLDMANHSYIEGAFHESVHFSVEFDCAEIIAV----RDYKPLEPVNIFYGWR 193
Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
SN + LL GFVP E N D +L + L KS +
Sbjct: 194 SNRDFLLHNGFVPSE-KNIRDIYKLKIGLPKSKR 226
>gi|238505934|ref|XP_002384169.1| SET domain-containing protein, putative [Aspergillus flavus
NRRL3357]
gi|220690283|gb|EED46633.1| SET domain-containing protein, putative [Aspergillus flavus
NRRL3357]
Length = 418
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 123/280 (43%), Gaps = 43/280 (15%)
Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
G++A +NI + E ++ VP ++T++ P P L A +L + + E
Sbjct: 39 GMIATRNIEEDEAIVTVPLKAMLTSER---IPSYFTSKFPDGTPTHALYAAFL-TNGNAE 94
Query: 164 KSSRWSNYISALP-RQPYSL---LYWTRAELDRYLEAS--------QIRER-AIERITNV 210
++ + P RQ + + W+ + L YL S Q R++ E
Sbjct: 95 DLEEFNAWRKTWPSRQDFEDSMPILWSES-LRNYLPPSISSHWHSIQSRDKLQYETTHQN 153
Query: 211 IGTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------D 257
+ + RLR + S +PD + ETF + + I+ +R +P +
Sbjct: 154 LLAQQEQRLRTAWDIVVSIFPDT------DWETFSYHWLIVNTRSFFYLMPGQEPPEDRN 207
Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
+AL+P+AD NHS +V + +D + VF + Y GE++++SYG N L
Sbjct: 208 DAMALLPFADYFNHS-DVACNVKFD--GENYVFRATKHYDEGEEIYMSYGPHPNDFLFAE 264
Query: 318 YGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG 356
YGF E N S+++ L + LK +E+LE + YG
Sbjct: 265 YGFYLDE--NESETLYLDDIILKDLSTSLQEELEFQQYYG 302
>gi|223995875|ref|XP_002287611.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976727|gb|EED95054.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 538
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 72/311 (23%), Positives = 124/311 (39%), Gaps = 64/311 (20%)
Query: 43 HCSVSTTNDASRTKTTVTQNMIP---WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVD 99
H + +T A T +N+ P W E+ ++ L + D G+ MAI VD
Sbjct: 58 HLTTTTALAAISTDENTPRNIPPFQSWCAEM-GVQQMDGLDLYTQDGGV--DYMAITTVD 114
Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITAD---------SKWSCPEA----GEVLKQCSV 146
I G +L+VP +V+++D S +A G + SV
Sbjct: 115 -----------IPAGTTILYVPSGMVLSSDRVAEELNAISNGGVADAVNQLGRIGGGSSV 163
Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY---------LEAS 197
P + L L+ E +++S + ++ +LPR LY+ + + S
Sbjct: 164 PKFYLFIKMLM-EYENQENSPFYPWLDSLPR-----LYFNAVSMTDFCYECLPPLVFNLS 217
Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
+ + + VI ++ + S Y N E KW+F +++R D
Sbjct: 218 RTEKVKFDNFLQVIK-----KVDVVSDYVKS------NEEVLKWAFNSVYTR--TYCHKD 264
Query: 258 GR----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
G+ VAL P+AD NH + E + +D+ + +TT + ISYG +N
Sbjct: 265 GQNEDDVALTPFADYFNHGTDTEVEVCFDEEGNCMAYTT-TDVAANSPLRISYGCPTNPS 323
Query: 314 LLLS-YGFVPR 323
L + YGF+ +
Sbjct: 324 FLFARYGFLDQ 334
>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
Length = 915
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 57/232 (24%), Positives = 91/232 (39%), Gaps = 29/232 (12%)
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------- 201
W +L ++ E S+W Y+ LP + ++W+ AELD L+AS R
Sbjct: 542 WSILIIIMMFEYFKGDESKWKPYMDVLPASFETPMFWSGAELDE-LQASATRTKVGKADA 600
Query: 202 ------------RAIERITNVIGTYNDLRL-RIFSKYPDLFPEEVFNMETFKWSFGILFS 248
RA I +Y+D L ++ + F+ +
Sbjct: 601 EEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEEDEEDEE 660
Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
V + +VP AD+LN E ++Y + T R + GE++ YG
Sbjct: 661 EWVEDRESKSTMGMVPMADILNADAEYNAHVNY--GDDALTVTALRTIKAGEEILNYYGP 718
Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPL-----SLKKSDKCYKEKLEALRKY 355
N ELL YG+V + + D VELP SL S +++L+ R+Y
Sbjct: 719 HPNSELLRRYGYVTPKHSR-YDVVELPWKLVENSLAASLGLSEQQLDNAREY 769
>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/299 (22%), Positives = 120/299 (40%), Gaps = 62/299 (20%)
Query: 79 LQKWLSD-----SGLPPQKMAIQKVDV-----GERGLVALKNIRKGEKLLFVPPSLVITA 128
L +WLS S P KV+V RG+ A +N+ E L+ +P S ++
Sbjct: 37 LVQWLSSPPAPFSAAPNVTHISSKVNVLSNETSGRGVYATQNVSAKETLVRIPHSFLMNT 96
Query: 129 DS----------KWSCPEAG---------------EVLKQCSVPDWPLL------ATYLI 157
++ K S P+ G E+ + + W L A Y+
Sbjct: 97 NTIIKHISRFNGKESVPDLGYSVLLPSEYTTDQWTELYAKIPISKWLQLTAFQRTALYIC 156
Query: 158 SEASFEKSSRWSNYISALPRQP---YSLLYWTRAELDRYLEASQIRE-------RAIERI 207
E +++S W +IS+LP+ ++ + W E++ L S+ + +
Sbjct: 157 LEKKRKENSFWCAFISSLPKLEELDFAPIVW---EVESELTGSKAADFFELLPRSSRNHA 213
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMDGRV---AL 262
V+ +N+ + S++ E N F W++ + SR + + PS L
Sbjct: 214 KKVLVRFNEDYTAV-SEFLTAAKSEPLNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTL 272
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
P+ D LNH C+ + + D S+G + + + G+++ SYG SN LL Y F
Sbjct: 273 APYVDFLNHDCDEKCAIKID--SRGFLVISCVDHAAGQELLFSYGPHSNEFLLCEYAFT 329
>gi|425767698|gb|EKV06264.1| hypothetical protein PDIG_78250 [Penicillium digitatum PHI26]
gi|425780393|gb|EKV18400.1| hypothetical protein PDIP_26670 [Penicillium digitatum Pd1]
Length = 494
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 128/305 (41%), Gaps = 44/305 (14%)
Query: 78 TLQKWLSDSGLPPQKMAIQKVDVGERG------LVALK------NIRKGEKLLFVPPSLV 125
TL W +G+ Q +++ K+ + G L+A + N K + LL VPP LV
Sbjct: 10 TLPAWQRLNGIVTQGISVHKIGSDQHGADKGSALIATETQMSSENDAKPKILLQVPPELV 69
Query: 126 ITADSKWSCPEAGEVLKQC--SVPDWPLLA-----TYLISEASFEKS------------S 166
++ ++ + + L+ ++ D+ A +LI + S S +
Sbjct: 70 LSLETVQNQAKTDGHLRDVLEAIGDFGRTARGAILIFLIIQISHSSSDLHPKHETIGISN 129
Query: 167 RWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR---IFS 223
W+ Y+ LP + L + AE L + + E + ++ + + LR I
Sbjct: 130 PWTEYVKFLP-PSFPLPTFYTAEEQELLRGTSLAEPLVAKLAFLEREFEQLRQATGGIAW 188
Query: 224 KYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNH--SCEVETFL 279
+ E + W + SRL+ LP +A+VP DM NH S V+
Sbjct: 189 CQRSWWHERTGALTIDDWKYVDAAYRSRLLDLPGSG--LAMVPCIDMANHVSSDGVKALY 246
Query: 280 DYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLS 337
D D V+ + QPGE++ ISYG +KS E++ SYGF+ G + + L L
Sbjct: 247 DTDSEGNAVLQLRWGKTIQPGEEITISYGNEKSASEMIFSYGFL-ESGITQAREMFLNLE 305
Query: 338 LKKSD 342
+ + D
Sbjct: 306 IPEDD 310
>gi|444727366|gb|ELW67864.1| SET domain-containing protein 4 [Tupaia chinensis]
Length = 268
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 11/107 (10%)
Query: 226 PDLFPEEVFNMETFK---WSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCE 274
P FP+ V N+ ++ W++ + +R V L AL P+ D+LNHS
Sbjct: 116 PACFPDAVDNVFSYSALLWAWCTVNTRAVYLRHRQRECLSAEPDTYALAPYLDLLNHSPN 175
Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
V+ +++ ++ Y+ E+VFI YG N LLL YGFV
Sbjct: 176 VQVKAAFNEETRCYEIQAASNYRKYEEVFICYGPHDNQRLLLEYGFV 222
>gi|145344581|ref|XP_001416808.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577034|gb|ABO95101.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 303
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 6/96 (6%)
Query: 237 ETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHSCEVETF-LDYDKSSQGVVF- 290
+ ++W+ ++ SR RL GR ALV AD++NHS E D+ + V F
Sbjct: 66 DEWRWAVSLVHSRTFRLEDERGRRPTRRALVAGADLINHSSVPEDVNCDWVANDADVFFI 125
Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
+ + + GE+ F+SYG++ + L YGF+PR +
Sbjct: 126 SATKDVRKGEEFFLSYGEQCDRHFALFYGFLPRRNS 161
>gi|428163640|gb|EKX32701.1| hypothetical protein GUITHDRAFT_121141 [Guillardia theta CCMP2712]
Length = 135
Score = 52.8 bits (125), Expect = 4e-04, Method: Composition-based stats.
Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 12/110 (10%)
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRL----VRLPSMDGRVALVPWADMLNHSCEV 275
+ F++ P LFP + ++ + W+ I+ SR D LVP ADM+NH +
Sbjct: 11 KFFAENPGLFPSPI-DVREWMWASAIIMSRSWGQKAERAGNDKMHILVPLADMVNHDAKA 69
Query: 276 ETFLDYDKSSQG--VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG-FVP 322
S+G +V R GE+V I+YG K N EL+ YG FVP
Sbjct: 70 RKV----GISEGGAIVIYAGRNLAAGEEVCITYGDKCNMELMAHYGFFVP 115
>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
Length = 467
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/289 (23%), Positives = 119/289 (41%), Gaps = 52/289 (17%)
Query: 94 AIQKVDVGERG--------LVALKNIRKGEKLLFVPPSLVITADSKW---SCPEA----- 137
AI+ VD+ +R VAL++I L +P +I ++ P+A
Sbjct: 28 AIKIVDLRDRNAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDK 87
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
+ + W L +I E ++S+W Y LP + ++W+ ELD+ L+AS
Sbjct: 88 PDDDDAPGLDSWSSLILIMIYEYLQGENSKWKPYFDVLPSSFDTPMFWSDNELDQ-LQAS 146
Query: 198 QIRE-------------------RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
+R R+ I N + + I + F++E
Sbjct: 147 HMRHKIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLEN 206
Query: 239 ----FKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
+ + G + R DG+ + +VP AD+LN E +++++ S + T+
Sbjct: 207 DEEEEEEADGWVEDR-------DGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTS 257
Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
R + GE++ YG N ELL YG+V E + D VE+P + +S
Sbjct: 258 LRPIKAGEEILNYYGPHPNSELLRRYGYVT-EKHSRYDVVEIPWDIVES 305
>gi|429857094|gb|ELA31976.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
Length = 466
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 105/245 (42%), Gaps = 34/245 (13%)
Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAG-----------EVLKQCSVPDWPLLATY 155
A ++ G+ ++ P L ++ + + P G LK SVP + +
Sbjct: 36 ASAGVKPGDTVVTCPLGLTLSYLNATTTPNPGFHHEDTPPFPPSFLK--SVPPHVIGRFF 93
Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------DRYLEASQIRERA 203
LI++ K S W YI LP QP L W L + ++ ++I+ R
Sbjct: 94 LINQYLLGKDSFWYPYIRTLP-QPEHLQSWALPPLWPSDDIELLEDTNIHVAITEIKARL 152
Query: 204 IERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLP---SMD 257
I + + +R +++ + +F +F+ S I SR + LP ++D
Sbjct: 153 KSEYKQAIAAFGEDPVRKDYTRLLYNWAYCIFTSRSFRPSLVIPASRQHTLSLPEGCAID 212
Query: 258 GRVALVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELL 315
L+P D+ NHS + D+ + + T Y PG+QV+ +YG K+N EL+
Sbjct: 213 DFSLLLPLFDVGNHSTLAKISWDHPEDAVDTCALRTLDAYGPGDQVYNNYGTNKTNAELM 272
Query: 316 LSYGF 320
L+YGF
Sbjct: 273 LAYGF 277
>gi|351695156|gb|EHA98074.1| SET domain-containing protein 4 [Heterocephalus glaber]
Length = 449
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 65/154 (42%), Gaps = 22/154 (14%)
Query: 231 EEVFNMETFKWSFGILFSRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDY 281
+ VF+ W++ + +R V L P D AL P+ D+LNHS V+ +
Sbjct: 198 DRVFSYSALLWAWCTVNTRAVYLRTRRRDCLSPEPDT-CALAPYLDLLNHSPHVQVKAAF 256
Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE-----LPL 336
++ + T + E+VFI YG N LLL YGFV NP V L
Sbjct: 257 NEETGCYEIRTASSCRKHEEVFICYGPHDNHRLLLEYGFVSLR--NPHACVYVSREILVR 314
Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
L +DK K+ L+ +G + F GW
Sbjct: 315 YLPSTDKQMNRKIAILKDHGFMENLTF-----GW 343
>gi|320034953|gb|EFW16895.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 469
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 130/330 (39%), Gaps = 64/330 (19%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGEVLKQCSVPD-----WPLLATYL 156
RG+VA + I + E+L +P LV++ A+SK + + + D W L +
Sbjct: 44 RGVVACEEIVQDEELFAIPEDLVLSVANSK--------IKDRINFADENFDTWLSLIVTM 95
Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
I E S+WS Y LP +L++WT EL R L+ S + ++ ++ T+ + D
Sbjct: 96 IFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENEL-RELQGSSVLDKIGKQETDQV--ILD 152
Query: 217 LRLRIFSKYPDLFP-------------EEV----------------FNMETFKWSFGILF 247
L + ++PDLFP +EV F++E +
Sbjct: 153 KVLPVVLEHPDLFPPVNGLASFDSPSGKEVVLQLAHRMGTLIMAYAFDIEMDQDEDQDGE 212
Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE---QVFI 304
V + +VP AD+LN + + ++ R P ++F
Sbjct: 213 DGYVTDDEQEKAKGMVPLADLLNADAHRNNARLFQEDGYFIM----RSIAPISIEMEIFN 268
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE---KLEALRK------- 354
YG+ +LL YG++ E P D VE+ L + +E +LE L
Sbjct: 269 DYGELPRSDLLRRYGYIT-ENYAPYDVVEISLEAICNIAGVEEGCCQLELLEDAGVLEDG 327
Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSP 384
Y LS E I P EL+ + SP
Sbjct: 328 YALSRPEGDAITTEAIPAELLILLRTLRSP 357
>gi|367036851|ref|XP_003648806.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
gi|346996067|gb|AEO62470.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
Length = 479
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 57/109 (52%), Gaps = 10/109 (9%)
Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFI 304
SR + LP + G ++VP DMLNHS + YD++ Q V R G+++ I
Sbjct: 202 SRCLELP-VHGE-SMVPCIDMLNHSATPSAY--YDENPQDDVVLLLRPGISLAEGDEITI 257
Query: 305 SYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
SYG KS E+L SYGF+ T +DS+ LPLS D K KL A
Sbjct: 258 SYGDAKSAAEMLFSYGFIDPRST--ADSLVLPLSPFPDDPLAKAKLVAF 304
>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
6260]
Length = 466
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 115/298 (38%), Gaps = 60/298 (20%)
Query: 79 LQKWLSD-----SGLPPQKMAIQKVDV-----GERGLVALKNIRKGEKLLFVPPSLVITA 128
L +WLS S P KV+V RG+ A +N+ E L+ +P S ++
Sbjct: 37 LVQWLSSPPAPFSAAPNVTHISSKVNVLSNETSGRGVYATQNVSAKETLVRIPHSFLMNT 96
Query: 129 DS----------KWSCPEAG---------------EVLKQCSVPDWPLL------ATYLI 157
++ K S P+ G E+ + + W L A Y+
Sbjct: 97 NTIIKHISRFNGKESVPDLGYSVSLPSEYTTDQWTELYAKIPISKWLQLTAFQRTALYIC 156
Query: 158 SEASFEKSSRWSNYISALPRQP---YSLLYWTRAELDRYLEASQIRE------RAIERIT 208
E +++S W +IS+LP+ ++ + W E++ L S+ + R+
Sbjct: 157 LEKKRKENSFWCAFISSLPKLEELDFAPIVW---EVESELTGSKAADFFELLPRSSRNHA 213
Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMDGRV---ALV 263
+ + S++ E N F W++ + SR + + PS L
Sbjct: 214 KKVSVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLA 273
Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
P+ D LNH C+ + + D V+ D + G+++ SYG SN LL Y F
Sbjct: 274 PYVDFLNHDCDEKCAIKIDSRGFSVISCVD--HAAGQELLFSYGPHSNEFLLCEYAFT 329
>gi|358371988|dbj|GAA88594.1| SET domain protein [Aspergillus kawachii IFO 4308]
Length = 497
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 81/174 (46%), Gaps = 16/174 (9%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
SS WS Y+ LP +++ EL+ L S +R +I ++ + LR L
Sbjct: 130 SSPWSEYMKYLPSSIPLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRQSTEGL 188
Query: 220 RIFSKY--PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
+Y D + FN +K+ + SR+V LP A+VP DM NH+ E V
Sbjct: 189 DWCKRYWWDDDTGKLTFN--DWKYVDALYRSRMVDLPQHGH--AMVPCIDMANHAPEGTV 244
Query: 276 ETFLDYDKSSQGVVFTTD-RQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
+ D D V+ D R + E+V ISYG +KS E++ SYGF+ T+
Sbjct: 245 KALYDEDADGNAVLQLRDGRSLRADEEVTISYGDEKSASEMIFSYGFLDEHTTD 298
>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
Length = 473
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 79/174 (45%), Gaps = 16/174 (9%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
SS W+ Y+ +P +++ EL+ L S +R +I ++ + LR L
Sbjct: 130 SSPWTEYMKYMPPAISLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGL 188
Query: 220 RIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
KY + E+ + W + + SR+V LP A+VP DM NH+ E V
Sbjct: 189 DWCEKY--WWDEDTGKLTFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTV 244
Query: 276 ETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
+ D D V+ R + E+V ISYG +KS EL+ SYGF+ T+
Sbjct: 245 KALYDEDADGNAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 298
>gi|320586761|gb|EFW99424.1| set domain containing protein [Grosmannia clavigera kw1407]
Length = 437
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
P D +A+ P AD+ NH+ + + + + G F DR+Y G +V ISYG ++
Sbjct: 208 PQRDDHMAMQPVADLFNHTADGGCRVAFGPA--GFTFVADRRYDAGVEVPISYGAHADDA 265
Query: 314 LLLSYGFV 321
LL+ YGFV
Sbjct: 266 LLVEYGFV 273
>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
Length = 476
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 96/410 (23%), Positives = 157/410 (38%), Gaps = 79/410 (19%)
Query: 82 WLSDSGL---PPQKMA-IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
WL SG P K+ ++ D G RG+VA + I + E L +P + +++ ++ E
Sbjct: 14 WLRKSGAEISPKIKLEDLRNKDAG-RGVVASQEIAEHELLFRIPRTSILSVENSILSTEI 72
Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
P W L ++ E +S W+ Y + LP + +L++WT EL L+AS
Sbjct: 73 PAATLSLLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAE-LQAS 130
Query: 198 QI---------RERAIERITNVIGTYNDL-------------------RLRIFSKYPDLF 229
+ E +E++ VI + D+ L + K L
Sbjct: 131 AVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLI 190
Query: 230 PEEVFNME------TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLD 280
F++E LP +VP ADMLN C F +
Sbjct: 191 MAYAFDVEPATPTKEVDEEGFAEEEEDAALPK-----GMVPLADMLNADADRCNARLFYE 245
Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
D + + Q GE++F YG +LL YG+V + D VE+P L
Sbjct: 246 KDCLEMKAL----KPIQAGEEIFNDYGPLPRSDLLRRYGYV-TDNYAQYDVVEIPTDLVS 300
Query: 341 SDKCY-----KEKLEALRK-------YGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
+ + +LE L + Y ++AS F +Q + P LVV SM
Sbjct: 301 EVLVHEGVWQENRLEYLDEQEVLDTGYDIAASTPFTLQESLSP-------ELVVLVESML 353
Query: 389 GKFEEMAAAASNKMTSKKDIKCPE-IDEQALQFILDSCESSISKYSRFLQ 437
E+ ++ SK + PE I Q + ++ I++Y+ L+
Sbjct: 354 LSDEDF-----ERLKSKGKLPKPEKITAQGADMLHKMLQARIAQYATTLE 398
>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
Y34]
gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
P131]
Length = 390
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 59/134 (44%), Gaps = 15/134 (11%)
Query: 242 SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQP 298
+F + R RL D R+ L P AD+ NH+ C V F D D DR Y
Sbjct: 164 TFYFVCPRTERLGKED-RMVLQPVADLFNHADAGCAV-AFNDED-----FTIRADRDYDA 216
Query: 299 GEQVFISYGKKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
GE+V I YG SN LL YGFV R D LPL K + KE+ L Y
Sbjct: 217 GEEVLICYGNHSNDFLLAEYGFVLAANRWDEVCIDDAILPLLTKAQRELLKER-NFLGNY 275
Query: 356 GLSASE-CFPIQIT 368
L A+ C+ ++
Sbjct: 276 MLDAATVCYRTEVA 289
>gi|440636170|gb|ELR06089.1| hypothetical protein GMDG_07800 [Geomyces destructans 20631-21]
Length = 373
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 4/80 (5%)
Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
+AL P+AD NH+ +V + + + S+ G TDR+ + GE+++ISYG SN LL YG
Sbjct: 180 LALAPFADCFNHA-DVASKVTF--STSGYDICTDRRIEKGEEIYISYGNHSNDFLLAEYG 236
Query: 320 FVPREGTNPSDSV-ELPLSL 338
F+ E S+ E+ LSL
Sbjct: 237 FILDENKWDEISIDEVILSL 256
>gi|312377428|gb|EFR24260.1| hypothetical protein AND_11267 [Anopheles darlingi]
Length = 273
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 28/141 (19%)
Query: 241 WSFGILFSRLVRLP-------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV-FTT 292
W+ + +R ++P +D +AL+P DM NH + D +SS G T
Sbjct: 5 WAVSTVMTRQNKVPVNLSTFEELDFTLALIPLWDMANHITPEQR--DGHRSSNGSSPLVT 62
Query: 293 DRQY----------------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
D Y + GE VFI+YGK+++ E L+ GF G NP+ +
Sbjct: 63 DTTYCSKLEKLESILQADCTKAGEPVFINYGKRTDAEFLVHNGF--SFGKNPNTRITKLF 120
Query: 337 SLKKSDKCYKEKLEALRKYGL 357
+L ++D YK++ L G+
Sbjct: 121 ALNRTDSLYKKRARLLELLGV 141
>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
Length = 323
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 23/119 (19%)
Query: 237 ETFKWSFGILFSRLVRLP-----SMDGRVALVPWADMLNHSC-------EVETFLD---- 280
+ F+W++ +SR + LP S A+VP D NHSC EV
Sbjct: 116 DEFRWAYSAYWSRALSLPIGADPSAPTVEAIVPGIDFANHSCGAPNARWEVRGVRGGAPD 175
Query: 281 -YDKSSQGVVFTTDRQY----QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
D S G ++ PGE+V ISYG K+N ELL +GF R+ NP D++ L
Sbjct: 176 PNDPSGSGPRVELLGEFGSLPAPGEEVVISYGDKTNEELLFVHGFADRD--NPHDALVL 232
>gi|428182869|gb|EKX51728.1| hypothetical protein GUITHDRAFT_102333 [Guillardia theta CCMP2712]
Length = 398
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 4/93 (4%)
Query: 244 GILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
G + + LP+ G + P DM+NH + ++ + Y + + G+Q
Sbjct: 203 GGILANTFLLPNFLGLTHYVIAPMIDMINHDGQSKSIVTYQALQAAFEVQSSSNFNVGDQ 262
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
VFISYG +SN +LL YGFV E N D E+
Sbjct: 263 VFISYGDRSNDQLLQYYGFV--EMDNVHDLYEI 293
>gi|355718768|gb|AES06378.1| SET domain containing 6 [Mustela putorius furo]
Length = 216
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 15/173 (8%)
Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
E R L+ + + E + + N+ Y + L +PDLF V ++E ++ ++ +
Sbjct: 4 ERRRLLQGTGVPEAVEKDLANIRSEYYSIVLPFMETHPDLFSPRVRSLELYRQLVALVMA 63
Query: 249 RLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP--- 298
+ P + +VP AD+LNH L+Y + +V T QP
Sbjct: 64 YSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPK 118
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
G ++F +YG+ +N +L+ YGFV N D+ ++ + + K EA
Sbjct: 119 GHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKAEA 171
>gi|256079856|ref|XP_002576200.1| hypothetical protein [Schistosoma mansoni]
Length = 330
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 42/72 (58%), Gaps = 2/72 (2%)
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
++F T Y+PG+Q+F+ YG +SN + + GF+P+ N ++ + + L + SD
Sbjct: 117 LIFCTMEAYKPGDQIFMDYGNRSNDDFFMFSGFIPQ--VNLNNKLTITLGISSSDSLALT 174
Query: 348 KLEALRKYGLSA 359
+ + L+ +GLS
Sbjct: 175 RKQLLQTFGLSV 186
>gi|241959368|ref|XP_002422403.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
gi|223645748|emb|CAX40410.1| ribosomal N-lysine methyltransferase, putative [Candida
dubliniensis CD36]
Length = 579
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 47/183 (25%)
Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
Y ++T +LD+YL ND + ++ +P+ +
Sbjct: 171 YEYKFYTDDDLDKYL--------------------NDENIENWTSFPN-----------Y 199
Query: 240 KWSFGILFSR-----LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
W+ IL SR L+ + L+P D+LNH+ + + + +D F ++
Sbjct: 200 LWASLILKSRSFPAYLIDKNNKQDSAMLLPVVDLLNHNSKSK--VHWDIFENHFKFGSE- 256
Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
+PG+++F +YG K N ELLL+YGF N DSV L + K +EK++A+ +
Sbjct: 257 SIEPGKEIFNNYGLKGNEELLLAYGFCIE--NNLQDSVALKI------KMPEEKIKAIEE 308
Query: 355 YGL 357
YG+
Sbjct: 309 YGV 311
>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
Length = 230
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 9/138 (6%)
Query: 248 SRLVRLPSMDGRVALVPWADMLNHS-CEVETFLD--YDKSSQGVVFTTDRQYQPGEQVFI 304
+R +P + AL+P DM NH+ + T + D+S V + ++ GEQ+FI
Sbjct: 2 TRQNTIPFQEDYYALIPLWDMCNHTNGTISTAYNPVLDRSECLAV----KNFKAGEQLFI 57
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
YG +SN +L + GFV N D + L + KSD +++ L K ++++ F
Sbjct: 58 FYGSRSNADLFVHNGFVFE--NNDYDVYWIRLGISKSDPLQQKRGHLLGKLSIASTCDFS 115
Query: 365 IQITGWPLELMAYAYLVV 382
I+ P++ A+L V
Sbjct: 116 IRKGASPIDGQLLAFLRV 133
>gi|145343084|ref|XP_001416296.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576521|gb|ABO94589.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1280
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 48/263 (18%)
Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
G RG+ A++++ GE LL VP LV+ + S + + + S +A L+ E
Sbjct: 76 GTRGVEAVRDLAPGETLLRVPWGLVVESASASGDDDGDDDARWSSA-----MAMTLLEEL 130
Query: 161 SFEKSSRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
S +S+ + +++ LPR P + LD EA +RE E + +
Sbjct: 131 SEGESNERAAWLTRLPRPAPKT------PALDFDDEA--LREIEDESVVDEALAVRRAHE 182
Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-----LVPWADMLNHSC- 273
R Y + +E KW+ ++ SR+ DG LVP DM NH
Sbjct: 183 RAREAYGERLAAIGATVEDLKWATAVVHSRV--FTRRDGAAERATRLLVPGVDMCNHDAA 240
Query: 274 -------------------------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
E+ DK + VV + GE++ ISYG
Sbjct: 241 RFNAIVRVVTSPETCQGAAATEEIAEIAPSSTMDKFFELVVDPDGETVEAGEEILISYGS 300
Query: 309 KSNGELLLSYGFVPREGTNPSDS 331
N L+ +GF+PR GTN +D+
Sbjct: 301 FPNDVWLMYFGFIPR-GTNVNDT 322
>gi|71656153|ref|XP_816628.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70881769|gb|EAN94777.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 565
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 101/244 (41%), Gaps = 25/244 (10%)
Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEA 196
D PLL LI E ++S W+ + + P + P +W +L D +
Sbjct: 197 DEPLLVLSLIYERYVAETSHWNELLFSCPGEYPNVPTFWDWEDLAELEGLDVLDDVLAKK 256
Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
+Q+ + E + + + L + E F++E W+ SR L ++
Sbjct: 257 AQLAQFQTETMAVLPFIHEALAGSCRLGKDEFL--ECFSIEAMMWARTTFDSRAFNL-NV 313
Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
DGRV ALVP ADM+NH + + + + G + + G ++++SYG
Sbjct: 314 DGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDFVMQIGASLTAQDIGREIWMSYGPLQ 373
Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKK---SDKCYKEKLEALRKYGLSASECFPIQI 367
N ELL YGFV EG N + + P + D+ + + YGL + C I
Sbjct: 374 NWELLQFYGFVV-EG-NEHERLPFPFDFPEGAVGDEWDGRRAALVATYGLHLAGCCWICH 431
Query: 368 TGWP 371
G P
Sbjct: 432 DGRP 435
>gi|440633283|gb|ELR03202.1| hypothetical protein GMDG_01185 [Geomyces destructans 20631-21]
Length = 372
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 38/65 (58%), Gaps = 9/65 (13%)
Query: 260 VALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
+AL P+AD NH+ C VE +G TT++ Y GE+++ISYGK SN LL
Sbjct: 180 MALNPFADYFNHASQGCTVEF------GPEGFEITTNKVYGEGEEIYISYGKHSNDFLLA 233
Query: 317 SYGFV 321
YGF+
Sbjct: 234 EYGFI 238
>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
Length = 482
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 67/260 (25%), Positives = 111/260 (42%), Gaps = 34/260 (13%)
Query: 95 IQKVDVGERGLVALKNIRKGEK-------LLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
I+ ++ GLVA +I + +L +P LV++A++ + + KQ
Sbjct: 25 IRNIEGKGFGLVAKHDITDESRDASGPATILRIPRDLVLSAEAVEEYAKVDQNFKQLLDV 84
Query: 144 ----CSVPDWPL-LATYLISEASFEKSSR------WSNYISALPRQPYSLLYWTRAELDR 192
+ D L L T+L+ + +R W+ YI LPR WT E +
Sbjct: 85 AGHQSTRGDIMLYLLTHLVQSKATSPGTRAFASTPWTEYIRFLPRPIPVPTMWTNDERE- 143
Query: 193 YLEASQIRERAIERITNVIGTYNDL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
L+ + + +++ + Y+ L S + L E +E + + SR
Sbjct: 144 LLKGTSLEAAVSAKLSALSSEYDKLCEEASALSFWSTLLSESA-TLEDWVLADAWYRSRC 202
Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFISYG 307
+ LP A+VP DM NHS + YD+SS G V R + G ++ ISYG
Sbjct: 203 LELPRAGH--AMVPGLDMANHSQSHSAY--YDESSDGDVVLLPRPGSKIPAGAEITISYG 258
Query: 308 K-KSNGELLLSYGFVPREGT 326
+ K E+L SYGF+ ++ T
Sbjct: 259 EAKPAAEMLFSYGFIDKDST 278
>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 579
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 52/112 (46%), Gaps = 12/112 (10%)
Query: 259 RVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
R + P DM NH S + + ++ + TD+ G++V+ISYG +SN +LL
Sbjct: 333 RYVICPMIDMANHQSVKFAGQVSFEYFANAYSLATDQAIPSGDEVYISYGPRSNDQLLQY 392
Query: 318 YGFVPREGTNPSDSVELP---------LSLKKSDKCYKEKLEALRKYGLSAS 360
YGFV R NP+D +P L K +LE L + GL S
Sbjct: 393 YGFVER--NNPNDVYVMPPLREWDIEALERATDRKFAVGRLEKLNRAGLLGS 442
>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
Length = 498
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/174 (29%), Positives = 79/174 (45%), Gaps = 16/174 (9%)
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
SS W+ Y+ +P +++ EL+ L S +R +I ++ + LR L
Sbjct: 130 SSPWTEYMKYMPPAISLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGL 188
Query: 220 RIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
KY + E+ + W + + SR+V LP A+VP DM NH+ E V
Sbjct: 189 DWCEKY--WWDEDTGKLTFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTV 244
Query: 276 ETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
+ D D V+ R + E+V ISYG +KS EL+ SYGF+ T+
Sbjct: 245 KALYDEDADGNAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 298
>gi|413925566|gb|AFW65498.1| hypothetical protein ZEAMMB73_874532 [Zea mays]
Length = 450
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 27/34 (79%)
Query: 360 SECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE 393
SE FP+ +TGW +ELMAYA+LVVSPP M FE+
Sbjct: 201 SESFPLWVTGWSVELMAYAFLVVSPPDMSQCFED 234
>gi|384249279|gb|EIE22761.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 438
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 36/165 (21%)
Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--------SMDGR-VALVPWADMLNH 271
+ + PDL+P + F + G++ SR + + DG + L+P DM+NH
Sbjct: 35 VLEQRPDLWPPASCGYDAFVHAAGMVQSRAFHMKKENWITGENEDGEELYLIPGIDMINH 94
Query: 272 SCEV-ETFLDYDKSSQGVVF------------------TTDRQYQPGEQVFISYGKKSNG 312
S + E ++SS GV F R+ GEQ+ +YG S+
Sbjct: 95 SSRLQERNTALEQSSDGVTFRRKPDLPPEEYNGGLFVMKAGRKVAAGEQILHTYGDLSDA 154
Query: 313 ELLLSYGFV--PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
+LL +YGFV P + +V LP S ++ + ALR++
Sbjct: 155 QLLQTYGFVEDPARPNRHNRNVRLPTS------DLQKGIRALRRH 193
>gi|116180202|ref|XP_001219950.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
gi|88185026|gb|EAQ92494.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
Length = 510
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 94/210 (44%), Gaps = 29/210 (13%)
Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
S A+ S+ W+ Y+ LP WT E R L E A++ + + DL
Sbjct: 110 SHANVGLSNPWTEYLKFLPETVLVPTLWTEDE--RLLLRGTSLEAAVDAKISALDAEFDL 167
Query: 218 RLRIFSKYPDLFP-EEVFNMETFKWSF-------GILFSRLVRLPSMDGRVALVPWADML 269
+ K D+ ++ ME SF + SR + LP+ ++VP DM+
Sbjct: 168 ---VREKSSDIIAWNDLLWMEGVPVSFTDWIRLDALYRSRCLELPT--SGESMVPCIDMI 222
Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPG------EQVFISYGK-KSNGELLLSYGFVP 322
NHS + YD++++ V R PG +++ ISYG KS E+L SYGF+
Sbjct: 223 NHSATPSAY--YDENTKDEVVLLPR---PGVGISKDDEITISYGDAKSAAEMLFSYGFI- 276
Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
+ + ++S+ LPL+ D K KL A
Sbjct: 277 -DPASSAESLVLPLNPFPDDPLAKAKLVAF 305
>gi|326427099|gb|EGD72669.1| hypothetical protein PTSG_04400 [Salpingoeca sp. ATCC 50818]
Length = 1063
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 90/235 (38%), Gaps = 54/235 (22%)
Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
LAT ++ AS +S+W Y+S+LP+ + + + L L + + I+R
Sbjct: 158 LATAIVFHAS-NPTSKWHGYLSSLPKHNLTTMTFDERAL-HLLRGTNLHHATIDRRNATA 215
Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--------------LVRLPSMD 257
T + + K+P F ++ + W+ + SR L+ L +D
Sbjct: 216 RTAATICRWLQHKWPQ--HAAAFTLDAYVWAAETISSRALSGRVSQPDTVIHLLHLGIVD 273
Query: 258 GRV------------------------------ALVPWADMLNHSCEVETFLDYDKSSQG 287
G L+P D+ +H + + + + +
Sbjct: 274 GDTPTSASDEQSSTRTSKSVVTVAPFPVLAHTPCLLPLLDLFDHDPQAD--VTWRNTGTH 331
Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
V T PGE VF +YG K N EL+L+YGF +P N D + + L L S
Sbjct: 332 VRLITREAVAPGEPVFNNYGGKGNEELMLAYGFALP---NNKHDDMHVMLGLPAS 383
>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
Full=SET domain-containing protein 7
gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 494
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D V +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|157167893|ref|XP_001662890.1| hypothetical protein AaeL_AAEL002998 [Aedes aegypti]
gi|108881507|gb|EAT45732.1| AAEL002998-PA [Aedes aegypti]
Length = 259
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 5/106 (4%)
Query: 261 ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
L+P DM NH + ++ T Y++ Q V T + + GEQ+FI YG ++N + L+ G
Sbjct: 33 VLIPLWDMANHINGQITT--GYNEELQRVESQTLKAFAKGEQIFIHYGNRTNADFLVHNG 90
Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
FV + +N V + L+L + + ++ E L K + + F +
Sbjct: 91 FVFPDNSNT--EVTIQLALNSGEDLFDQRKELLEKLNVPIAGEFTV 134
>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
Length = 494
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D V +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
Length = 494
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 62/273 (22%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D + +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSINENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
Length = 469
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 126/297 (42%), Gaps = 43/297 (14%)
Query: 81 KWLSDSGLPPQKMAIQKVDV----GERGLVALKNIRKGEKLLFVPPSL----VITADSKW 132
+W +G + +Q VD+ RG++A ++I + E +LF P V+T++
Sbjct: 13 QWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPE-ETILFTIPRQAIINVLTSELPQ 71
Query: 133 SCPEA--GEV--LKQCSVP--DWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
P+ G + + + P W L ++ E +SRW Y LP+Q + ++W+
Sbjct: 72 KLPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDASRWKPYFDILPQQFDTPIFWS 131
Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PE--------EVFNME 237
EL L+ + + I ++ + + L I P +F PE E+ ++
Sbjct: 132 DGEL-LELQGTSLTAEKIGKVES-DAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLA 189
Query: 238 ------TFKWSFGILFSRLVR------LPSMDGR--VALVPWADMLNHSCEVETFLDYDK 283
++F + + +GR + +VP AD LN + E +++ +
Sbjct: 190 HRMGSTIMAYAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGE 249
Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
S + D + G+Q+ YG ELL YG+V E + D VE+P +L K
Sbjct: 250 SLEATAIRAD--IKAGDQILNYYGPLPTSELLRRYGYVTPEHSR-YDVVEVPWTLVK 303
>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 494
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 62/273 (22%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D + +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSINENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
Length = 515
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISA--LPRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D V +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
Length = 515
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ R+ + F + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D V +++P ADMLN + + L YD + +V
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284
>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
Length = 491
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 52/241 (21%), Positives = 94/241 (39%), Gaps = 27/241 (11%)
Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
L A+++I + E+L +P L+++ ++ + G L + + W L +I E +
Sbjct: 61 LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118
Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRY--------LEASQIRERAIERITNVIGTY-- 214
SRW Y LP +L++WT A+L + S E ++++ +I
Sbjct: 119 QSRWEPYFHILPTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVPLIQANPH 178
Query: 215 -------------NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
ND L + + L F++E + + D
Sbjct: 179 HFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTDDEDEPAK 238
Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
+VP AD+ N + + + V R Q GE++F YG+ +LL YG+
Sbjct: 239 GMVPLADIFNADAQRNNARLFQEEG-SFVMKAIRNIQAGEEIFNDYGELPRADLLRRYGY 297
Query: 321 V 321
V
Sbjct: 298 V 298
>gi|308798945|ref|XP_003074252.1| unnamed protein product [Ostreococcus tauri]
gi|116000424|emb|CAL50104.1| unnamed protein product [Ostreococcus tauri]
Length = 405
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 70/165 (42%), Gaps = 25/165 (15%)
Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNM-ETFKWSFGILFSRLVR---LPSMDGRVAL 262
+T + GT RLR + M + W+ G + S ++ +P AL
Sbjct: 202 LTALDGTVTAARLRKRGDFVRALASSTGLMVKDVSWAIGAVSSHAMKSEIVP-----YAL 256
Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK-SNGELLLSYGFV 321
VP D+L+HS + D+++ V + R PGE++ ISYGK N L YGF
Sbjct: 257 VPGCDLLDHSTTPNCVVRRDETTNDVFCASTRDVAPGEKLTISYGKSLCNDRALRMYGFA 316
Query: 322 PRE--------------GTNPSDSVELPLSLKKSDKCYKEKLEAL 352
RE G +PS+ S+ +S+K + + AL
Sbjct: 317 SRELYSNDARVLPGGFRGVHPSNEA-FDASVDESEKVFGGRKAAL 360
>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
Length = 496
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 63/273 (23%), Positives = 105/273 (38%), Gaps = 74/273 (27%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
R +VA + I+K E L +P S V++ + +++K D+P L ++E
Sbjct: 39 RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86
Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
++ SRW+ Y P +L++W EL
Sbjct: 87 WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146
Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
EA ++ ER I+ I + G ++ SK+ + F + I+ S L
Sbjct: 147 EAKEMHERIIKSIKQIGGEFSTCVANCPSKF-----------DNFAYIASIILSYSFDLE 195
Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
D V +++P ADMLN + + L YD + +V
Sbjct: 196 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 253
Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
R + EQV+ YG+ N ELL YG+V +G+
Sbjct: 254 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 286
>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
Length = 492
Score = 51.6 bits (122), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 126/325 (38%), Gaps = 70/325 (21%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
RG+VA +I GE+L +P LV++A + + L++ W L ++ E
Sbjct: 48 RGVVARSDIFDGEELFSIPRGLVLSAQNSKLKDLLSQDLEELGP--WLSLILVMMYEYLL 105
Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDL 217
+ S W+ Y LP+ +L++W+ +EL R L+ S I +E A + I +I
Sbjct: 106 GEQSAWAPYFKILPKSFDTLMFWSPSEL-RELQGSAIVSKIGKEGAEDSIMQMIAP---- 160
Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----------------- 260
+ P LFP V + ++ G L+RL + G +
Sbjct: 161 ---VVRANPSLFP-SVDGLASWDGEAG--SHALLRLAHIMGSLIMAYAFDIEKVEDEDDE 214
Query: 261 -------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
+VP AD+LN + + + +V + + GE+
Sbjct: 215 NNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLF-QEDDSLVMKAIKPIRVGEE 273
Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS- 360
+F YG+ +LL YG+V + D VEL L + R GL +
Sbjct: 274 IFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD------------QICRSAGLQNAD 320
Query: 361 -ECFPIQITGWPLELMAYAYLVVSP 384
E +P LEL+ Y++ P
Sbjct: 321 IESYPPLAFLEDLELLDDGYVIPRP 345
>gi|326435209|gb|EGD80779.1| hypothetical protein PTSG_01368 [Salpingoeca sp. ATCC 50818]
Length = 627
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 79/185 (42%), Gaps = 24/185 (12%)
Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERI 207
PLL ++ + E +S ++ Y + LP + WT E L+ S+++E +
Sbjct: 179 PLLLAMMLDMDAGE-ASEFAPYFNILPEDDELHHPHVWTDRERSTLLKDSRLQEDVARDL 237
Query: 208 TNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRVALVP 264
T + Y+ + ++P +FP+ + F+ + I+ DGRV LVP
Sbjct: 238 TLMKREYDTIAKPFMIRHPKIFPQPGKKAFSFRKYAQCAAIVMGYSF-TDEEDGRVCLVP 296
Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKKSNGELLL 316
AD+LNH + +F +D+ Q G ++F +YG N +L+
Sbjct: 297 VADILNH---------VTGKNNARLFFSDKTLQMRSIKRIPAGAEIFNTYGDLDNLQLVQ 347
Query: 317 SYGFV 321
+GF
Sbjct: 348 QHGFA 352
>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
Length = 488
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 37/278 (13%)
Query: 89 PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
P K+A + + RG+VA +I + E+L +P LV++ + + E+ + +
Sbjct: 34 PKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLSFQNS-KLKDLMEI-NERDLGQ 91
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
W L +I E +S W+ Y LP +L++WT AEL L+ S + R I + T
Sbjct: 92 WLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWTDAEL-LELKGSAVLGR-IGKST 149
Query: 209 NVIGTYNDLRLRIFSKYPDLFP--------------------EEVFNMETFKWSFGILFS 248
DL L + SK +LFP ++F +
Sbjct: 150 AEEVFLRDL-LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVEND 208
Query: 249 RLVRLPSMDGRV----------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
+ DG V ++P AD+LN + + + + + + +
Sbjct: 209 EAEEVEGEDGYVTDDEERQLPKGMIPLADLLNADADRNNARLFQEDGY-LSMKSIKSIRK 267
Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
GE++F YG+ ELL YG+V + D E+P+
Sbjct: 268 GEEIFNDYGELPRAELLRRYGYV-TDSYAQYDEAEVPI 304
>gi|171676308|ref|XP_001903107.1| hypothetical protein [Podospora anserina S mat+]
gi|170936220|emb|CAP60879.1| unnamed protein product [Podospora anserina S mat+]
Length = 495
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 121/287 (42%), Gaps = 44/287 (15%)
Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE-----------AGEVLKQCSVPD 148
RG+VA +I L +P + ++ A + K PE +G+ +
Sbjct: 46 RGIVAQADIAADTVLFTIPRNSILCAATSPLKDILPEIFDLDNDDEDESGDESDGDNQNS 105
Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERA 203
W LL LI E SS+W Y+ LP + ++WT ++L +L+AS + +E A
Sbjct: 106 WTLLILILIHEYLQGSSSQWKPYLDVLPSTFNTPMFWTPSQL-SFLQASAVTSKIGQEEA 164
Query: 204 IERITNVIGTYNDLRLRIF---SKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPSM- 256
+ I + I +IF S P D + M + S+ + + +P
Sbjct: 165 DKMIASKILPVIRSHPQIFFPSSATPLSDDQLIQLAHRMGSTIMSYAFDLEQDMEIPEQL 224
Query: 257 ----------DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
+G+ + +VP AD+LN E +++ + + + R + GE++
Sbjct: 225 ENDDEWEEDREGKTMLGMVPMADILNADAEFNAHINH--AEDALTAVSLRPIRKGEEILN 282
Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
YG S+ ELL YG+V E D VEL SL S + KL+A
Sbjct: 283 FYGPLSSAELLRRYGYVT-EKHARWDVVELSWSLISS--ALQSKLQA 326
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.133 0.400
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,702,432,964
Number of Sequences: 23463169
Number of extensions: 270060229
Number of successful extensions: 625139
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 460
Number of HSP's successfully gapped in prelim test: 1116
Number of HSP's that attempted gapping in prelim test: 622485
Number of HSP's gapped (non-prelim): 1997
length of query: 442
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 296
effective length of database: 8,933,572,693
effective search space: 2644337517128
effective search space used: 2644337517128
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)