BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013485
         (442 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255582876|ref|XP_002532210.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223528106|gb|EEF30179.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 508

 Score =  724 bits (1869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 355/443 (80%), Positives = 390/443 (88%), Gaps = 8/443 (1%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRI-VVHC---SVSTTND-ASRT 55
           MAEASR F T LLP+FS L K   P   +  P      +   +HC   SVST++D  +  
Sbjct: 1   MAEASRIFQTTLLPTFSSLQK---PRLVSHHPPNLAHKKYQTIHCLSSSVSTSDDITTAK 57

Query: 56  KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGE 115
             T    M+PWGC+IDS +NA+ LQ+WLS++GLP QKMAI KV+VGERGLVALKNIRKGE
Sbjct: 58  AATTVTQMVPWGCDIDSSDNAAALQRWLSNNGLPDQKMAIDKVEVGERGLVALKNIRKGE 117

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
           KLLFVPPSLVITADS+WSCPEAGEVLKQ SVPDWPLLA YLISEA+ +KSS+WSNYISAL
Sbjct: 118 KLLFVPPSLVITADSEWSCPEAGEVLKQYSVPDWPLLAIYLISEANLQKSSKWSNYISAL 177

Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
           PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN
Sbjct: 178 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 237

Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
           +ETFKWSFGILFSRLVRLPSMDG+VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ
Sbjct: 238 LETFKWSFGILFSRLVRLPSMDGKVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 297

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           Y+PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL LSLKKSDK YKEKLEAL+K+
Sbjct: 298 YEPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKSYKEKLEALKKH 357

Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
           G SAS+CFP+++TGWP+EL+AYAYL VSPPSM  KFEE+AAAASNK T KKD+  PEI+E
Sbjct: 358 GFSASQCFPVRVTGWPVELLAYAYLAVSPPSMSSKFEELAAAASNKTTIKKDVGFPEIEE 417

Query: 416 QALQFILDSCESSISKYSRFLQV 438
           QALQFILDSCESSISKY++FLQ 
Sbjct: 418 QALQFILDSCESSISKYTKFLQA 440


>gi|224129218|ref|XP_002320530.1| predicted protein [Populus trichocarpa]
 gi|222861303|gb|EEE98845.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  722 bits (1864), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/440 (80%), Positives = 385/440 (87%), Gaps = 7/440 (1%)

Query: 1   MAEASRT-FHTILLPSFSHLHKAQSPAGFTD-FPRKRCGHRIVVHCSVSTTNDASRTKTT 58
           MAEA R   +T  LPS   LHK      ++  F  KR      + CS+ST++D ++    
Sbjct: 1   MAEACRIILNTTFLPSLHSLHKTHKKVSYSQPFLHKR---HPAIQCSISTSSD-TKAAAK 56

Query: 59  VTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLL 118
           V++  +PWGC+IDSLENA  LQKWLSDSGLPPQKMAIQKV+VGERGLVALKNIRKGE LL
Sbjct: 57  VSET-VPWGCDIDSLENAEALQKWLSDSGLPPQKMAIQKVEVGERGLVALKNIRKGEMLL 115

Query: 119 FVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
           FVPPSLVI ADS+WSCPEAGEVLK+ SVPDWPLLATYLISEASFEKSSRWSNYISALPRQ
Sbjct: 116 FVPPSLVIAADSEWSCPEAGEVLKKYSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ 175

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
           PYSLLYWTRAELD YLEASQIRERAIERITNV GTYNDLRLRIFSKYP LFPEEVFNMET
Sbjct: 176 PYSLLYWTRAELDTYLEASQIRERAIERITNVTGTYNDLRLRIFSKYPHLFPEEVFNMET 235

Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
           FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS+GVVFTTDR YQP
Sbjct: 236 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSSEVETFLDYDKSSKGVVFTTDRPYQP 295

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           GEQVFISYG+KSNGELLLSYGFVPREGTNPSDSVEL LSLKKSDKCYKEKLEAL+K+GLS
Sbjct: 296 GEQVFISYGRKSNGELLLSYGFVPREGTNPSDSVELSLSLKKSDKCYKEKLEALKKHGLS 355

Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
            S+CFP+Q+TGWPLELMAYAYL VSPPSM  +FEEMAAAASNK T+ K I  P+I+EQAL
Sbjct: 356 VSQCFPLQVTGWPLELMAYAYLAVSPPSMSRQFEEMAAAASNKTTTNKKITYPDIEEQAL 415

Query: 419 QFILDSCESSISKYSRFLQV 438
           QFILDSCE SISKY++FLQ 
Sbjct: 416 QFILDSCELSISKYTKFLQA 435


>gi|225447500|ref|XP_002267469.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296085051|emb|CBI28466.3| unnamed protein product [Vitis vinifera]
          Length = 497

 Score =  715 bits (1845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/442 (80%), Positives = 388/442 (87%), Gaps = 19/442 (4%)

Query: 1   MAEASRTFHTILL----PSFSHLHKAQSPAGFTD-FPRKRCGHRIVVHCSVSTTNDASRT 55
           MAEA R FHT L     PSFS     Q+P+  +   PR     R  + CS+STT+ A   
Sbjct: 1   MAEACRMFHTALTLTLPPSFS-----QTPSRHSQPIPR-----RHPIRCSISTTDTA--- 47

Query: 56  KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGE 115
           KT+VTQ  IPWGCE+DSLENA+ LQKWLSDSGLPPQKM I++V+VGERGLVALKNIRKGE
Sbjct: 48  KTSVTQK-IPWGCEVDSLENAALLQKWLSDSGLPPQKMGIERVEVGERGLVALKNIRKGE 106

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
           KLLFVPPSLVITADS+WSC EAGEVLK+ SVPDWPLLATYLI EASF +SSRWSNYISAL
Sbjct: 107 KLLFVPPSLVITADSEWSCTEAGEVLKRNSVPDWPLLATYLIGEASFMQSSRWSNYISAL 166

Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
           PRQPYSLLYWTRAELD+YLEASQIRERAIERI +V GTYNDLRLRIFSK+P LFPEEVFN
Sbjct: 167 PRQPYSLLYWTRAELDKYLEASQIRERAIERINDVTGTYNDLRLRIFSKHPHLFPEEVFN 226

Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
           METFKWSFGILFSRLVRLPSMD ++ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 
Sbjct: 227 METFKWSFGILFSRLVRLPSMDEKIALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRT 286

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           YQP EQVFISYGKKSNGELLLSYGFVPREGTNP+D VEL LSLKKSDKCYKEK EA++K+
Sbjct: 287 YQPSEQVFISYGKKSNGELLLSYGFVPREGTNPNDKVELLLSLKKSDKCYKEKSEAMKKH 346

Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
           GLS S+CFPIQITGWPLELMAYAYLVVSPPSM   FEE+AA ASNK TSKKDI+ PE++E
Sbjct: 347 GLSTSQCFPIQITGWPLELMAYAYLVVSPPSMSQHFEEIAAVASNKTTSKKDIRYPELEE 406

Query: 416 QALQFILDSCESSISKYSRFLQ 437
           QALQFILDSCE+SISKYS+FLQ
Sbjct: 407 QALQFILDSCEASISKYSKFLQ 428


>gi|449453618|ref|XP_004144553.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
 gi|449511789|ref|XP_004164054.1| PREDICTED: LOW QUALITY PROTEIN: ribulose-1,5 bisphosphate
           carboxylase/oxygenase large subunit N-methyltransferase,
           chloroplastic-like [Cucumis sativus]
          Length = 497

 Score =  708 bits (1828), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/438 (78%), Positives = 387/438 (88%), Gaps = 9/438 (2%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVT 60
           MAEASR F++ LLP+F  L    S       P      R  ++CSVSTT D +R   T  
Sbjct: 1   MAEASRVFNSSLLPNFRPLQNTLSTK-----PHTATFRRHSINCSVSTT-DGARVAAT-- 52

Query: 61  QNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFV 120
              IPWGCEIDSLENAS LQKWLS+SGLP QKM+IQ+V+VGERGLVALKN+RKGEKLLFV
Sbjct: 53  -GPIPWGCEIDSLENASALQKWLSESGLPDQKMSIQRVNVGERGLVALKNVRKGEKLLFV 111

Query: 121 PPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY 180
           PPSLVI+A+S+WSCPEAGEVLK+ SVPDWPL+ATYLISEAS  KSSRW+NYISALPRQPY
Sbjct: 112 PPSLVISAESEWSCPEAGEVLKRNSVPDWPLIATYLISEASLMKSSRWNNYISALPRQPY 171

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
           SLLYWTR ELDRYLEAS+IRERAIERITNV+GTYNDL +R+FSK+P+LFPEEVFN+ETFK
Sbjct: 172 SLLYWTREELDRYLEASEIRERAIERITNVVGTYNDLSIRVFSKHPELFPEEVFNIETFK 231

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
           WSFGILFSRLVRLPSMDG+VALVPWADMLNH+CEVETFLDYDK+SQGVVFTTDR YQPGE
Sbjct: 232 WSFGILFSRLVRLPSMDGKVALVPWADMLNHNCEVETFLDYDKASQGVVFTTDRAYQPGE 291

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           QVFISYGKKSNGELLLSYGFVP+EG+NPSDSVEL LSLKKSDKCYKEKLEAL+K+GL AS
Sbjct: 292 QVFISYGKKSNGELLLSYGFVPKEGSNPSDSVELLLSLKKSDKCYKEKLEALKKHGLRAS 351

Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQF 420
           +CFPIQ+TGWPLEL A+AYL VSPPS+  +F+EMAAAASNK T+KKD+  P+I+E+ALQF
Sbjct: 352 QCFPIQVTGWPLELKAFAYLAVSPPSLSNQFDEMAAAASNKSTAKKDLNYPDIEEEALQF 411

Query: 421 ILDSCESSISKYSRFLQV 438
           ILDSCE+SISKY++FLQ 
Sbjct: 412 ILDSCETSISKYNKFLQA 429


>gi|356547583|ref|XP_003542190.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 499

 Score =  701 bits (1810), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/448 (77%), Positives = 378/448 (84%), Gaps = 31/448 (6%)

Query: 1   MAEASR-TFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIV---------VHCSVSTTN 50
           MAEASR T H+ LLP F+              P+   GHR +         V CSVS   
Sbjct: 5   MAEASRITLHSTLLPFFT--------------PKTHVGHRHLSLSSSRKHQVQCSVSAGA 50

Query: 51  DASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKN 110
            A         N + WGCEIDSLEN+S LQ+WLS+SGLPPQKM I++V+VGERGLVALKN
Sbjct: 51  AAQ-------TNPVAWGCEIDSLENSSALQRWLSESGLPPQKMGIERVEVGERGLVALKN 103

Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
           IRKGEKLLFVPPSLVIT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS  +SSRWSN
Sbjct: 104 IRKGEKLLFVPPSLVITPDSEWSCPEAGEVLKRNSVPDWPLLATYLISEASLMESSRWSN 163

Query: 171 YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP 230
           YISALPRQPYSLLYWT+AELDRYLEASQIRERAIERI NVIGTYNDLRLRIFSKYPDLFP
Sbjct: 164 YISALPRQPYSLLYWTQAELDRYLEASQIRERAIERINNVIGTYNDLRLRIFSKYPDLFP 223

Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
           +EVFN+E+FKWSFGILFSRLVRLPSM G VALVPWADMLNHSC+VETFLDYDK+S+G+VF
Sbjct: 224 DEVFNIESFKWSFGILFSRLVRLPSMGGNVALVPWADMLNHSCDVETFLDYDKTSKGIVF 283

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
           TTDR YQPGEQVFISYGKKSNGELLLSYGFVP+EG NPSDSVEL LSLKKSD  YKEKLE
Sbjct: 284 TTDRPYQPGEQVFISYGKKSNGELLLSYGFVPKEGANPSDSVELSLSLKKSDASYKEKLE 343

Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
            L+ YGLSAS+CFPIQITGWPLELMAYAYL VSP SM+G FEEMAAAASN  TSKKD++ 
Sbjct: 344 LLKNYGLSASQCFPIQITGWPLELMAYAYLAVSPSSMRGDFEEMAAAASNNTTSKKDLRY 403

Query: 411 PEIDEQALQFILDSCESSISKYSRFLQV 438
           PEI+EQALQFILDSCESSISKY++FLQ 
Sbjct: 404 PEIEEQALQFILDSCESSISKYNKFLQA 431


>gi|297829320|ref|XP_002882542.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328382|gb|EFH58801.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 504

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/440 (75%), Positives = 375/440 (85%), Gaps = 8/440 (1%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
           MA+A     + LLP++S LHK ++      F   P  RC  R  +HCSVS     +R+  
Sbjct: 1   MAKAC-VLQSTLLPAYSPLHKLRNQNFTLSFPPLPVSRC--RPGIHCSVSAGETTTRSVE 57

Query: 58  TVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKL 117
              +  I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+RKGEKL
Sbjct: 58  EAPE--ISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLRKGEKL 115

Query: 118 LFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
           LFVPPSLVI+ADS+W+ PEAGEV+K+  VPDWPLLATYLISEAS +KSSRW NYISALPR
Sbjct: 116 LFVPPSLVISADSEWTNPEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWYNYISALPR 175

Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
           QPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+EVFN E
Sbjct: 176 QPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPHLFPKEVFNDE 235

Query: 238 TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
           TFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTTDR YQ
Sbjct: 236 TFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTTDRPYQ 295

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
           PGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCYKEKL+AL+K+GL
Sbjct: 296 PGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYKEKLDALKKHGL 355

Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
           S  +CFP++ITGWP+ELMAYAYLVVSPP M   FEEMA AASNK ++K D+K PEI+E A
Sbjct: 356 STPQCFPVRITGWPMELMAYAYLVVSPPDMGNNFEEMAKAASNKTSTKTDLKYPEIEEDA 415

Query: 418 LQFILDSCESSISKYSRFLQ 437
           LQFILDSCE+SISKYSRFL+
Sbjct: 416 LQFILDSCETSISKYSRFLK 435


>gi|357462493|ref|XP_003601528.1| SET domain-containing protein [Medicago truncatula]
 gi|355490576|gb|AES71779.1| SET domain-containing protein [Medicago truncatula]
 gi|388500078|gb|AFK38105.1| unknown [Medicago truncatula]
          Length = 497

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/433 (77%), Positives = 371/433 (85%), Gaps = 18/433 (4%)

Query: 7   TFHTILLPSFSH-LHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIP 65
           T  T L+P+F+  +HK       T     R  H    H ++S T+    T        IP
Sbjct: 14  TNTTTLIPAFNQTIHKT------THLGLSRRNH---AHFTLSATSSLIET--------IP 56

Query: 66  WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
           WGCE DS+EN+S+LQKWLS SGLP QKM+I KVDVGERGLVAL NIRKGEKLLFVPP LV
Sbjct: 57  WGCENDSIENSSSLQKWLSQSGLPSQKMSIDKVDVGERGLVALNNIRKGEKLLFVPPQLV 116

Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
           IT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS  KSSRW +YISALPRQPYSLLYW
Sbjct: 117 ITPDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYW 176

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
           ++AELDRYLEASQIRERAIER  NVIGTYND+R+RIFSKYPD FPEEVFN+E+FKWSFGI
Sbjct: 177 SQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGI 236

Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           LFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQVFIS
Sbjct: 237 LFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFIS 296

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+CFPI
Sbjct: 297 YGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPI 356

Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
           ++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFILDSC
Sbjct: 357 RVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFILDSC 416

Query: 426 ESSISKYSRFLQV 438
           ESSISKY++FLQV
Sbjct: 417 ESSISKYNKFLQV 429


>gi|357469947|ref|XP_003605258.1| SET domain-containing protein [Medicago truncatula]
 gi|355506313|gb|AES87455.1| SET domain-containing protein [Medicago truncatula]
          Length = 494

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/433 (77%), Positives = 371/433 (85%), Gaps = 18/433 (4%)

Query: 7   TFHTILLPSFSH-LHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIP 65
           T  T L+P+F+  +HK       T     R  H    H ++S T+    T        IP
Sbjct: 11  TNTTTLIPAFNQTIHKT------THLGLSRRNH---AHFTLSATSSLIET--------IP 53

Query: 66  WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
           WGCE DS+EN+S+LQKWLS SGLP QKM+I KVDVGERGLVAL NIRKGEKLLFVPP LV
Sbjct: 54  WGCENDSIENSSSLQKWLSQSGLPSQKMSIDKVDVGERGLVALNNIRKGEKLLFVPPQLV 113

Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
           IT DS+WSCPEAGEVLK+ SVPDWPLLATYLISEAS  KSSRW +YISALPRQPYSLLYW
Sbjct: 114 ITPDSEWSCPEAGEVLKKNSVPDWPLLATYLISEASLMKSSRWFSYISALPRQPYSLLYW 173

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
           ++AELDRYLEASQIRERAIER  NVIGTYND+R+RIFSKYPD FPEEVFN+E+FKWSFGI
Sbjct: 174 SQAELDRYLEASQIRERAIERTNNVIGTYNDMRVRIFSKYPDFFPEEVFNIESFKWSFGI 233

Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           LFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQVFIS
Sbjct: 234 LFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQVFIS 293

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+CFPI
Sbjct: 294 YGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQCFPI 353

Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
           ++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFILDSC
Sbjct: 354 RVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFILDSC 413

Query: 426 ESSISKYSRFLQV 438
           ESSISKY++FLQV
Sbjct: 414 ESSISKYNKFLQV 426


>gi|15231493|ref|NP_187424.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
 gi|6466950|gb|AAF13085.1|AC009176_12 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|6648179|gb|AAF21177.1|AC013483_1 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|15028205|gb|AAK76599.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|19310671|gb|AAL85066.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
 gi|332641064|gb|AEE74585.1| rubisco methyltransferase-like protein [Arabidopsis thaliana]
          Length = 504

 Score =  678 bits (1749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/445 (74%), Positives = 376/445 (84%), Gaps = 18/445 (4%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
           MA+A     + LLP++S LHK ++      F+  P  RC  R  +HCSVS          
Sbjct: 1   MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50

Query: 58  TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
           T  Q+M     I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51  TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110

Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
           KGEKLLFVPPSLVI+ADS+W+  EAGEV+K+  VPDWPLLATYLISEAS +KSSRW NYI
Sbjct: 111 KGEKLLFVPPSLVISADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYI 170

Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
           VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTT 290

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
           DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350

Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
           +K+GLS  +CFP++ITGWP+ELMAYAYLVVSPP M+  FEEMA AASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPE 410

Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
           I+E ALQFILDSCE+SISKYSRFL+
Sbjct: 411 IEEDALQFILDSCETSISKYSRFLK 435


>gi|21537309|gb|AAM61650.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Arabidopsis thaliana]
          Length = 504

 Score =  676 bits (1745), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/445 (74%), Positives = 376/445 (84%), Gaps = 18/445 (4%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
           MA+A     + LLP++S LHK ++      F+  P  RC  R  +HCSVS          
Sbjct: 1   MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50

Query: 58  TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
           T  Q+M     I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51  TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110

Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
           KGEKLLFVPPSLVI+ADS+W+  EAGEV+K+  VPDWPLLATYLISEA+ +KSSRW NYI
Sbjct: 111 KGEKLLFVPPSLVISADSEWTNAEAGEVMKRYDVPDWPLLATYLISEANLQKSSRWFNYI 170

Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
           VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GV+FTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVIFTT 290

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
           DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350

Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
           +K+GLS  +CFP++ITGWP+ELMAYAYLVVSPP M+  FEEMA AASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKAASNKTSTKNDLKYPE 410

Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
           I+E ALQFILDSCE+SISKYSRFL+
Sbjct: 411 IEEDALQFILDSCETSISKYSRFLK 435


>gi|3065835|gb|AAC14296.1| putative methyltransferase [Arabidopsis thaliana]
          Length = 504

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 330/445 (74%), Positives = 372/445 (83%), Gaps = 18/445 (4%)

Query: 1   MAEASRTFHTILLPSFSHLHKAQSPA---GFTDFPRKRCGHRIVVHCSVSTTNDASRTKT 57
           MA+A     + LLP++S LHK ++      F+  P  RC  R  +HCSVS          
Sbjct: 1   MAKAC-LLQSTLLPAYSPLHKLRNQNITLSFSPLPLSRC--RPGIHCSVSAGE------- 50

Query: 58  TVTQNM-----IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIR 112
           T  Q+M     I WGCEIDSLENA++LQ WLSDSGLPPQKMAI +VD+GERGLVA +N+R
Sbjct: 51  TTIQSMEEAPKISWGCEIDSLENATSLQNWLSDSGLPPQKMAIDRVDIGERGLVASQNLR 110

Query: 113 KGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYI 172
           KGEKLLFV PSLVI ADS+W+  EAGEV+K+  VPDWPLLATYLISEAS +KSSRW NYI
Sbjct: 111 KGEKLLFVSPSLVICADSEWTNAEAGEVMKRYDVPDWPLLATYLISEASLQKSSRWFNYI 170

Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           SALPRQPYSLLYWTR ELD YLEASQIRERAIERITNV+GTY DLR RIFSK+P LFP+E
Sbjct: 171 SALPRQPYSLLYWTRTELDMYLEASQIRERAIERITNVVGTYEDLRSRIFSKHPQLFPKE 230

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
           VFN ETFKWSFGILFSRLVRLPSMDGR ALVPWADMLNH+CEVETFLDYDKSS+GVVFTT
Sbjct: 231 VFNDETFKWSFGILFSRLVRLPSMDGRFALVPWADMLNHNCEVETFLDYDKSSKGVVFTT 290

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
           DR YQPGEQVFISYG KSNGELLLSYGFVPREGTNPSDSVEL LSL+K+DKCY+EKL+AL
Sbjct: 291 DRPYQPGEQVFISYGNKSNGELLLSYGFVPREGTNPSDSVELALSLRKNDKCYEEKLDAL 350

Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
           +K+GLS  +CFP++ITGWP+ELMAYAYLVVSPP M+  FEEMA  ASNK ++K D+K PE
Sbjct: 351 KKHGLSTPQCFPVRITGWPMELMAYAYLVVSPPDMRNNFEEMAKRASNKTSTKNDLKYPE 410

Query: 413 IDEQALQFILDSCESSISKYSRFLQ 437
           I+E ALQFILDSCE+SISK SRFL+
Sbjct: 411 IEEDALQFILDSCETSISKCSRFLK 435


>gi|226501968|ref|NP_001140387.1| uncharacterized protein LOC100272441 [Zea mays]
 gi|194699272|gb|ACF83720.1| unknown [Zea mays]
 gi|413923744|gb|AFW63676.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  619 bits (1597), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 291/374 (77%), Positives = 341/374 (91%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 61  VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 120

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W  PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPGEQVF
Sbjct: 241 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVF 300

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSD CYKEKL+AL++ GLS SE F
Sbjct: 301 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M   FEEMAAAASNK +SK  +  P+++EQALQFILD
Sbjct: 361 PLRVTGWPVELMAYAFLVVSPPDMSQCFEEMAAAASNKTSSKPGLNYPDLEEQALQFILD 420

Query: 424 SCESSISKYSRFLQ 437
            CES+I KY+++L+
Sbjct: 421 CCESNIEKYTKYLE 434


>gi|242066146|ref|XP_002454362.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
 gi|241934193|gb|EES07338.1| hypothetical protein SORBIDRAFT_04g029430 [Sorghum bicolor]
          Length = 499

 Score =  617 bits (1591), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 290/374 (77%), Positives = 340/374 (90%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 57  VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 116

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W  PE GEV+K+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 117 LVITADSEWGRPEVGEVMKRNSVPDWPLIATYLISEASLEGSSRWSSYIAALPRQPYSLL 176

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++ DLFPEEV+N+ETF WSF
Sbjct: 177 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHSDLFPEEVYNIETFLWSF 236

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRLPSMD +VALVPWADMLNHS EVETFLD+DKSSQG+VFTTDR YQPGEQVF
Sbjct: 237 GILFSRLVRLPSMDEKVALVPWADMLNHSPEVETFLDFDKSSQGIVFTTDRSYQPGEQVF 296

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS SE F
Sbjct: 297 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDKCYKEKLQALKRNGLSESESF 356

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M   FEEMA AASNK +SK  +  P+++EQALQFILD
Sbjct: 357 PLRVTGWPVELMAYAFLVVSPPDMSQHFEEMAVAASNKSSSKPRLSYPDLEEQALQFILD 416

Query: 424 SCESSISKYSRFLQ 437
            CE +I+KY+++L+
Sbjct: 417 CCEPNIAKYTKYLE 430


>gi|195651313|gb|ACG45124.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Zea mays]
          Length = 503

 Score =  615 bits (1587), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 289/374 (77%), Positives = 339/374 (90%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGE LLFVPPS
Sbjct: 61  VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGENLLFVPPS 120

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W  PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRLPSMDGRV LVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPGEQVF
Sbjct: 241 GILFSRLVRLPSMDGRVVLVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPGEQVF 300

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSD CYKEKL+AL++ GLS SE F
Sbjct: 301 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLDKSDNCYKEKLQALKRNGLSESESF 360

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M   FEEMAAAASNK +SK  +  P+++EQALQFILD
Sbjct: 361 PLRVTGWPVELMAYAFLVVSPPDMSQCFEEMAAAASNKTSSKPGLNYPDLEEQALQFILD 420

Query: 424 SCESSISKYSRFLQ 437
            CES+I KY+++L+
Sbjct: 421 CCESNIEKYTKYLE 434


>gi|218191491|gb|EEC73918.1| hypothetical protein OsI_08761 [Oryza sativa Indica Group]
          Length = 502

 Score =  610 bits (1572), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 295/374 (78%), Positives = 339/374 (90%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A +L++WL+DSGLP Q++ IQ+VDVGERGLVALKNIRKGEKLLFVPPS
Sbjct: 60  VPWGCEIESLESAVSLERWLTDSGLPEQRLGIQRVDVGERGLVALKNIRKGEKLLFVPPS 119

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W CPE G VLK+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 120 LVITADSEWGCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLL 179

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTR ELD YL AS IRERAI+RIT+V+GTYNDLR RIFSK+ DLFPEEV+N+ETF+WSF
Sbjct: 180 YWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSF 239

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS G+VFTTDR YQPGEQVF
Sbjct: 240 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVF 299

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS  E F
Sbjct: 300 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M  +FEEMA AASNK  SK  +  PE++EQALQFILD
Sbjct: 360 PLRVTGWPVELMAYAFLVVSPPEMSQRFEEMAVAASNKSPSKPGLNYPELEEQALQFILD 419

Query: 424 SCESSISKYSRFLQ 437
            CES+I+KY++FL+
Sbjct: 420 CCESNIAKYTKFLE 433


>gi|115448405|ref|NP_001047982.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|45735887|dbj|BAD12920.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|45736017|dbj|BAD13045.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase [Oryza sativa Japonica
           Group]
 gi|113537513|dbj|BAF09896.1| Os02g0725200 [Oryza sativa Japonica Group]
 gi|215737236|dbj|BAG96165.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623589|gb|EEE57721.1| hypothetical protein OsJ_08208 [Oryza sativa Japonica Group]
          Length = 502

 Score =  610 bits (1572), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 295/374 (78%), Positives = 339/374 (90%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A +L++WL+DSGLP Q++ IQ+VDVGERGLVALKNIRKGEKLLFVPPS
Sbjct: 60  VPWGCEIESLESAVSLERWLTDSGLPEQRLGIQRVDVGERGLVALKNIRKGEKLLFVPPS 119

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W CPE G VLK+ SVPDWPL+ATYLISEAS E SSRWS+YI+ALPRQPYSLL
Sbjct: 120 LVITADSEWGCPEVGNVLKRNSVPDWPLIATYLISEASLESSSRWSSYIAALPRQPYSLL 179

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTR ELD YL AS IRERAI+RIT+V+GTYNDLR RIFSK+ DLFPEEV+N+ETF+WSF
Sbjct: 180 YWTRPELDAYLVASPIRERAIQRITDVVGTYNDLRDRIFSKHSDLFPEEVYNLETFRWSF 239

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLDYDKSS G+VFTTDR YQPGEQVF
Sbjct: 240 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDYDKSSGGIVFTTDRSYQPGEQVF 299

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVEL +SL KSDKCYKEKL+AL++ GLS  E F
Sbjct: 300 ISYGKKSSGELLLSYGFVPKEGTNPNDSVELLVSLNKSDKCYKEKLQALKRNGLSEFESF 359

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M  +FEEMA AASNK  SK  +  PE++EQALQFILD
Sbjct: 360 PLRVTGWPVELMAYAFLVVSPPEMSQRFEEMAVAASNKSPSKPGLNYPELEEQALQFILD 419

Query: 424 SCESSISKYSRFLQ 437
            CES+I+KY++FL+
Sbjct: 420 CCESNIAKYTKFLE 433


>gi|326495906|dbj|BAJ90575.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 507

 Score =  609 bits (1571), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 284/374 (75%), Positives = 337/374 (90%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A++L++WL+ SGLP Q++A++KVD+GERGLVALKN+R GEKLLFVPP+
Sbjct: 64  VPWGCEIESLESAASLERWLTASGLPEQRLALEKVDIGERGLVALKNVRNGEKLLFVPPT 123

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W+  E G+V+K+ SVPDWPLLATYLISEAS E SSRWS+YI ALPRQPYSLL
Sbjct: 124 LVITADSEWTNREVGDVMKRYSVPDWPLLATYLISEASLEGSSRWSSYIDALPRQPYSLL 183

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTR E+D YL AS IRERAI RI++VIGTYNDLR RIFSK+PDLFPE+V+NME F+WSF
Sbjct: 184 YWTRTEIDAYLVASPIRERAISRISDVIGTYNDLRDRIFSKHPDLFPEKVYNMENFRWSF 243

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           GILFSRLVRL SM G+VALVPWADMLNHS EV+ FLDYDKSSQG+VFTTDR YQPGEQVF
Sbjct: 244 GILFSRLVRLESMGGKVALVPWADMLNHSPEVDAFLDYDKSSQGIVFTTDRSYQPGEQVF 303

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYGKKS+GELLLSYGFVP+EGTNP+DSVE  +SLKKSD+CYKEKL+AL+K+GLS SE F
Sbjct: 304 ISYGKKSSGELLLSYGFVPKEGTNPNDSVEFLVSLKKSDECYKEKLQALKKHGLSESESF 363

Query: 364 PIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILD 423
           P+++TGWP+ELMAYA+LVVSPP M  +FEEMA AASNK +SK  +  PE+DEQALQFILD
Sbjct: 364 PLRVTGWPVELMAYAFLVVSPPEMIQRFEEMAVAASNKGSSKPAVNYPELDEQALQFILD 423

Query: 424 SCESSISKYSRFLQ 437
            CESSI +Y+++L+
Sbjct: 424 CCESSIKRYTKYLE 437


>gi|357137766|ref|XP_003570470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 389

 Score =  566 bits (1459), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 265/345 (76%), Positives = 308/345 (89%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
           MA+Q+VDVGERGLVAL N+R GEKLLFVPPSLVI+ADS+WS  E G+V+K  SVPDWPLL
Sbjct: 1   MALQRVDVGERGLVALTNVRNGEKLLFVPPSLVISADSEWSNREVGDVMKSYSVPDWPLL 60

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
           ATYLISEAS E SSRWS+YI ALPRQPYSLLYWTR E+D YL AS IRERAI RI +VIG
Sbjct: 61  ATYLISEASLEGSSRWSSYIDALPRQPYSLLYWTRTEIDAYLVASPIRERAISRIGDVIG 120

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
           TYNDLR RIFSK+P+LFPEEV+NME F+WSFGILFSRLVRLPSMDG+VALVPWADMLNH+
Sbjct: 121 TYNDLRDRIFSKHPELFPEEVYNMENFRWSFGILFSRLVRLPSMDGKVALVPWADMLNHN 180

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
            EV+ FLD+DKSSQG+VFTTDR YQPGEQVFISYGKKS+GELLLSYGFVP+EGTNP+DSV
Sbjct: 181 PEVDAFLDFDKSSQGIVFTTDRSYQPGEQVFISYGKKSSGELLLSYGFVPKEGTNPNDSV 240

Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFE 392
           E  +SL KSD CY+EKL+AL+++GLS SE FP+++TGWP+ELMAYA+LVVSPP M  +FE
Sbjct: 241 EFSVSLNKSDDCYREKLQALKRHGLSESESFPLRVTGWPVELMAYAFLVVSPPDMIQRFE 300

Query: 393 EMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
           EMA AASNK +SK  +  PE+DEQALQFILD CES+I+KY+++L+
Sbjct: 301 EMAVAASNKSSSKPAVNYPELDEQALQFILDCCESNITKYTKYLE 345


>gi|350595011|ref|XP_003484025.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Sus
           scrofa]
          Length = 326

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 200/256 (78%), Positives = 228/256 (89%), Gaps = 8/256 (3%)

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           L WT    D Y+   ++R R+   I  ++ TYND+R+RIFSKYPD FPEEVFN+E+FKWS
Sbjct: 47  LCWT----DYYM---RMRXRSTNYIA-MMHTYNDMRVRIFSKYPDFFPEEVFNIESFKWS 98

Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           FGILFSR+VRLPSMDG+ ALVPWADM+NHSCEVETFLDYDKSS+G+VF TDR YQPGEQV
Sbjct: 99  FGILFSRMVRLPSMDGKNALVPWADMMNHSCEVETFLDYDKSSKGIVFPTDRPYQPGEQV 158

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           FISYGKKSNGELLLSYGFVP+EGTNPSDSVEL LSLKKSD+ YKEKLE L+KYGLS S+C
Sbjct: 159 FISYGKKSNGELLLSYGFVPKEGTNPSDSVELSLSLKKSDESYKEKLELLKKYGLSGSQC 218

Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFIL 422
           FPI++TGWPLELMAYAYL VSP SM+GKFEEMAAAASNK TSKKD++ PEI+EQALQFIL
Sbjct: 219 FPIRVTGWPLELMAYAYLAVSPSSMRGKFEEMAAAASNKTTSKKDLRYPEIEEQALQFIL 278

Query: 423 DSCESSISKYSRFLQV 438
           DSCESSISKY++FLQ+
Sbjct: 279 DSCESSISKYNKFLQL 294


>gi|413923745|gb|AFW63677.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
 gi|413923746|gb|AFW63678.1| hypothetical protein ZEAMMB73_839660 [Zea mays]
          Length = 306

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 192/236 (81%), Positives = 220/236 (93%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           +PWGCEI+SLE+A++L++WL DSGLP Q++AIQ+VD+GERGLVALKNIRKGEKLLFVPPS
Sbjct: 61  VPWGCEIESLESAASLERWLIDSGLPEQRLAIQRVDIGERGLVALKNIRKGEKLLFVPPS 120

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           LVITADS+W  PE G+V+K+ SVPDWPL+ATYLISEAS E SSRW +YI+ALPRQPYSLL
Sbjct: 121 LVITADSEWGRPEVGDVMKRNSVPDWPLIATYLISEASLEGSSRWISYIAALPRQPYSLL 180

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           YWTRAELD YL AS IR+RAI+RIT+VIGTYNDLR RIFS++PDLFPEEV+N+ETF WSF
Sbjct: 181 YWTRAELDAYLVASPIRKRAIQRITDVIGTYNDLRDRIFSRHPDLFPEEVYNIETFLWSF 240

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           GILFSRLVRLPSMDGRVALVPWADMLNHS EVETFLD+DKSS+G+VFTTDR YQPG
Sbjct: 241 GILFSRLVRLPSMDGRVALVPWADMLNHSPEVETFLDFDKSSRGIVFTTDRSYQPG 296


>gi|168003103|ref|XP_001754252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694354|gb|EDQ80702.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 431

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/382 (49%), Positives = 269/382 (70%), Gaps = 9/382 (2%)

Query: 64  IPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPS 123
           + WGC+  S+E  S LQ WL   GL  QK+ + +VD G RGLVA +++R+GE+LLFVP  
Sbjct: 1   VNWGCDPQSIEKGSLLQDWLMKEGLAKQKLVLDRVDSGGRGLVATQSLRQGERLLFVPSG 60

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
           L+ITADS+W C E G ++K+  +P+WP+LA +LISEAS E+SSRW  Y + LP+ P S+L
Sbjct: 61  LLITADSEWGCAETGRIIKEAGLPEWPMLAIFLISEASREESSRWFPYFATLPKTPSSIL 120

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
            WT  E++ +L AS +RE+A+E I +V  TY DLR  IF K+P++FP +V+ +  FKW+F
Sbjct: 121 QWTEEEVNTWLTASPVREKALECIRDVTETYRDLRATIFLKHPEVFPSQVYTLAAFKWAF 180

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK-SSQGVVFTTDRQYQPGEQV 302
           GILFSRLVRLPS+ G++ALVPWADMLNHS +V++FLD+D+ +++ VV  TDR YQ GEQV
Sbjct: 181 GILFSRLVRLPSV-GKLALVPWADMLNHSPQVDSFLDFDQNNAKSVVTVTDRAYQSGEQV 239

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           FISYGK+S+GEL L+YGF+P E  N  DSVEL + +   D  ++ KL A  + GLS+ + 
Sbjct: 240 FISYGKRSSGELFLAYGFIPSE-LNVHDSVELEMEIDSDDPSFEAKLRAANEQGLSSPQR 298

Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD------IKCPEIDEQ 416
           FP++  G+P +L+AYA L+ S  S   +   +A AA+    ++KD      +   E +  
Sbjct: 299 FPVRKDGFPAQLLAYARLIASRTSDPAQLSRIARAATEANATEKDDSDVIAMLSAEEETN 358

Query: 417 ALQFILDSCESSISKYSRFLQV 438
           A + +L  CE+SI++Y++FL+V
Sbjct: 359 AYERVLAVCENSIAEYTKFLEV 380


>gi|302785554|ref|XP_002974548.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
 gi|300157443|gb|EFJ24068.1| hypothetical protein SELMODRAFT_101776 [Selaginella moellendorffii]
          Length = 467

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 181/390 (46%), Positives = 273/390 (70%), Gaps = 4/390 (1%)

Query: 54  RTKTTVTQNMIPWGCEIDS--LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNI 111
           R +     + +PWGC+ DS  L +   LQ+WLS +GLP QK+ ++ V  G RGLV+ + +
Sbjct: 14  RPRLACRASSVPWGCDTDSSALNSGIALQQWLSQAGLPIQKVELKNVGAGGRGLVSKRML 73

Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
            KG++LLF+P +L IT +S+W+C EAG+V++   +P+WP LA YLISEAS  KSS W  Y
Sbjct: 74  YKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPY 133

Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
           I+ALPR+P S+L WT  +++ +L A+ I++RA++ +  V  T+NDL  ++F K  + FP 
Sbjct: 134 IAALPRRPGSILLWTALDVEAHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPP 193

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
           EVFN+E+FKW+FGILFSRLVRLPS+  ++AL+P+ DMLNH  EV TFLD+D  S+ +  T
Sbjct: 194 EVFNLESFKWAFGILFSRLVRLPSLGQKLALIPFGDMLNHDTEVTTFLDFDSGSKSITCT 253

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            DR Y+  ++VFISYGK+SNGELL++YGFVP  G N  DSV + L L  +D+ Y+ KL A
Sbjct: 254 LDRGYESNKEVFISYGKRSNGELLVAYGFVP-SGKNSEDSVSITLGLDPADEMYEAKLGA 312

Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK-KDIKC 410
           L+++GLS  + +PI++ GWP++L A+A L+ SPPS   ++ E+ +AA+ K   + + +  
Sbjct: 313 LKEHGLSPQQSYPIKLKGWPVQLTAFARLITSPPSQLHRYSELTSAATEKQGRRMQPVFT 372

Query: 411 PEIDEQALQFILDSCESSISKYSRFLQVKE 440
            E   +A + IL +C+ +I+    +L+V++
Sbjct: 373 TEEQMKAYEMILSACKQAIAASKNYLEVEQ 402


>gi|302759643|ref|XP_002963244.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
 gi|300168512|gb|EFJ35115.1| hypothetical protein SELMODRAFT_80789 [Selaginella moellendorffii]
          Length = 467

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 179/390 (45%), Positives = 273/390 (70%), Gaps = 4/390 (1%)

Query: 54  RTKTTVTQNMIPWGCEIDS--LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNI 111
           R +     + +PWGC+ DS  L++   LQ+WLS +GLP QK+ ++ V  G RGLV+ + +
Sbjct: 14  RPRLACRASSVPWGCDTDSSALDSGIALQQWLSQAGLPIQKVELKNVGAGGRGLVSKRML 73

Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
            KG++LLF+P +L IT +S+W+C EAG+V++   +P+WP LA YLISEAS  KSS W  Y
Sbjct: 74  YKGDRLLFLPATLAITTESEWACAEAGKVIRAKDLPEWPFLACYLISEASLGKSSPWYPY 133

Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
           I+ALPR+P S+L WT  +++ +L A+ I++RA++ +  V  T+NDL  ++F K  + FP 
Sbjct: 134 IAALPRRPGSILLWTALDVETHLSATSIKDRALQCVREVEDTFNDLNKQVFMKNREEFPP 193

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
           EVFN+++FKW+FGILFSRLVRLPS+  ++AL+P+ DMLNH  EV TFLD+D  S+ +  T
Sbjct: 194 EVFNLKSFKWAFGILFSRLVRLPSLGQKLALIPFGDMLNHDTEVTTFLDFDSGSKSITCT 253

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            DR Y+   +VFISYGK+SNGELL++YGFVP  G N  DSV + L L  +D+ Y+ KL  
Sbjct: 254 LDRGYESNREVFISYGKRSNGELLVAYGFVP-SGKNSEDSVSITLGLDPADEMYEAKLGT 312

Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK-KDIKC 410
           L+++GLS  + +PI++ GWP++L A+A L+ SPPS   ++ E+A+AA+ +   + + +  
Sbjct: 313 LKEHGLSPQQSYPIKLKGWPVQLTAFARLITSPPSQLHRYSELASAATEEQGRRMQPVFT 372

Query: 411 PEIDEQALQFILDSCESSISKYSRFLQVKE 440
            E   +A + IL +C+ +I+    +L+V++
Sbjct: 373 TEEQMKAYELILSACKQAIAASKNYLEVEQ 402


>gi|388516285|gb|AFK46204.1| unknown [Lotus japonicus]
          Length = 271

 Score =  364 bits (934), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 171/203 (84%), Positives = 188/203 (92%)

Query: 236 METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
           ME+FKWSFGILFSR+VRLPSMDG+VALVPWADMLNHSC+VETFLDYDK S+G+VFTTDR 
Sbjct: 1   MESFKWSFGILFSRMVRLPSMDGKVALVPWADMLNHSCDVETFLDYDKQSKGIVFTTDRP 60

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           YQPGEQVFISYGKKSNGELLLSYGFV REG NPSDSVEL LSLKKSD  YKEKLE L+KY
Sbjct: 61  YQPGEQVFISYGKKSNGELLLSYGFVTREGANPSDSVELSLSLKKSDGSYKEKLELLKKY 120

Query: 356 GLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE 415
           GLS S+CFPI+ITGWPLELMAYAYL VSP SM+G+FE+MAAAASNK+TS KD K PEI+E
Sbjct: 121 GLSGSQCFPIRITGWPLELMAYAYLAVSPSSMRGQFEKMAAAASNKITSTKDFKYPEIEE 180

Query: 416 QALQFILDSCESSISKYSRFLQV 438
           QALQFILDSCESS+SKY++FLQ 
Sbjct: 181 QALQFILDSCESSMSKYNKFLQA 203


>gi|412990750|emb|CCO18122.1| predicted protein [Bathycoccus prasinos]
          Length = 543

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 189/339 (55%), Gaps = 18/339 (5%)

Query: 79  LQKWLSD-SGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L  WL+D   LP QKM ++      RGLVA ++I++GEK+L +P   +IT +        
Sbjct: 91  LDAWLADIMKLPEQKMKLEYFKEEGRGLVATESIKRGEKVLEIPQEAIITVEVALKESLL 150

Query: 138 GEVLKQCSVPDWPLLATYLISEA---SFEKSS----RWSNYISALPRQPYSLLYWTRAEL 190
            E  K   + +W +LAT+L   A   S E +S    R++ Y+ ALPR   S+L W  +++
Sbjct: 151 REKKKLAELQEWSILATFLAETAQNLSTEDNSSNKYRFATYVKALPRSTGSVLEWPESDV 210

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
              L  S     A+ER  +V     ++R+         FPE   N +T +W+F ILFSRL
Sbjct: 211 RTLLAGSPSLFSALERRASVAAAIAEIRVN--------FPE--LNEKTLQWAFDILFSRL 260

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           +RL S+ G +ALVPWADMLNH    E F+D D+ S+ V  TTDR Y+PGEQV+ SYG++ 
Sbjct: 261 IRLESLGGNLALVPWADMLNHQPGCEAFIDLDRGSRKVCLTTDRSYEPGEQVWASYGQRP 320

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           + ELL+SYGF P  G NP D   L L + + D     K+ AL    + A E FP+++ G+
Sbjct: 321 SSELLISYGFAPAVGDNPDDEYALNLQIDEEDPFASAKVNALASQNIQAFETFPLRLNGY 380

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK 409
           P +L+ YA   +  P    K +E+  AA   + + K ++
Sbjct: 381 PRQLLQYASFAMCTPDDPSKVDELCRAAFVDIQAPKSLR 419


>gi|308807993|ref|XP_003081307.1| putative methyltransferase (ISS) [Ostreococcus tauri]
 gi|116059769|emb|CAL55476.1| putative methyltransferase (ISS) [Ostreococcus tauri]
          Length = 505

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 194/362 (53%), Gaps = 20/362 (5%)

Query: 40  IVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVD 99
           + V+   S T   +   T      IP G    + E+   L +WL+ +GL  QKM ++   
Sbjct: 31  VTVNEGASGTRGKNAEVTRYDDADIPRGVGSATRED---LTRWLASNGLRAQKMTLESNL 87

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
              RGLVA + I++GE LL V  S +IT +   +  EA    +   + +W +LAT+L  +
Sbjct: 88  AEGRGLVATEEIKRGEALLGVDASCLITVER--AIAEAKLGPRHAELQEWSVLATFLAQQ 145

Query: 160 ASFEKSSR---WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           A   +S     +  YI ALPR+  S+L W   E++  L+ S  R  A ER  +V     +
Sbjct: 146 AMALESGNAGTFGEYIRALPRRTGSVLDWPEDEVETLLKGSPSRLAAAERQESVNAAIAE 205

Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
           +R    S +PD+           +W+F ILFSRL+RL +M G +ALVPWADMLNH     
Sbjct: 206 IR----SSFPDI------TEGALRWAFDILFSRLIRLDAMGGELALVPWADMLNHKPGCA 255

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            F+D + S+  V  TTDR Y  GEQV+ SYG++ + ELL+SYGF P  G NP D   L L
Sbjct: 256 AFIDLNGSA--VNLTTDRAYAAGEQVWASYGQRPSSELLISYGFAPEVGENPDDEYSLTL 313

Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAA 396
            +  +D   + K + LR+ GLS  E FP+++ G+P +L+ YA  ++  P    + E +A 
Sbjct: 314 GVDVNDPYAQAKADVLRRMGLSPVETFPLRLNGYPRQLLQYASFILCNPDKPSELEGLAR 373

Query: 397 AA 398
            A
Sbjct: 374 TA 375


>gi|145350419|ref|XP_001419603.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579835|gb|ABO97896.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 524

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 136/363 (37%), Positives = 195/363 (53%), Gaps = 31/363 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L +WL    LP QKMA++ V++ E RGLVA + I++GE LL VP + +IT +   +  EA
Sbjct: 84  LARWLEGRRLPGQKMALE-VNLAEGRGLVATEEIKRGEALLGVPRTTLITVER--AIAEA 140

Query: 138 GEVLKQCSVPDWPLLATYLISEA-SFEKSS--RWSNYISALPRQPYSLLYWTRAELDRYL 194
               K   + +W +LAT+L  +A + E  +   +  YI ALPR+  S+L W   E+D+ L
Sbjct: 141 KLGPKHAELQEWSVLATFLAQQALALESGTAGTFGEYIRALPRRTGSVLDWPEDEVDKLL 200

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           + S  R  A ER  +V    +++R          FPE    +   +W+F ILFSRL+RL 
Sbjct: 201 KGSPSRLAAAERQDSVNAAIDEIR--------SYFPE--ITVGALRWAFDILFSRLIRLD 250

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
           +M G +ALVPWADMLNH      F+D +  +  V  TTDR Y  GEQV+ SYG++ + EL
Sbjct: 251 AMGGELALVPWADMLNHKPGCAAFIDLNGDA--VNLTTDRSYVKGEQVWASYGQRPSSEL 308

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
           L+SYGF P  G NP D   L L +  +D     K + LR  GLS  E FP+++ G+P +L
Sbjct: 309 LISYGFAPEVGENPDDEYALTLGVDVNDPLADAKAQVLRDMGLSPVETFPLRLNGYPRQL 368

Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
           + YA  ++  P    K  E+   A +  T   +I          Q I DS     +  +R
Sbjct: 369 LQYASFILCNPE---KPSELKGLAQSAFTGSANIG---------QSIFDSVRGLTNGKAR 416

Query: 435 FLQ 437
             Q
Sbjct: 417 GKQ 419


>gi|307109960|gb|EFN58197.1| hypothetical protein CHLNCDRAFT_142047 [Chlorella variabilis]
          Length = 485

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 203/370 (54%), Gaps = 12/370 (3%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT- 127
           EI +      L+ WL + GLPP K+A        RGLVA + I KGE LL +P  LV+T 
Sbjct: 51  EIATDAEGEELKAWLIERGLPPPKLAAAATPGSGRGLVAAQPIGKGESLLSIPQQLVLTP 110

Query: 128 -ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
            A  + SC     +L++  +P W +LA +L  + +   +  W  Y+  LP +   +L W+
Sbjct: 111 AAALEQSCLR--PLLEEQPLPAWSVLALWLAEQRAAGSAGGWWPYVRLLPERTGCVLEWS 168

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS-KYPDLFPEE-VFNMETFKWSFG 244
             E++ +L  SQ+   A+E       ++ +++  + + K     P    F     +W+F 
Sbjct: 169 EEEVE-WLCGSQLHSDALEIRAAAEASWAEMQAVLAAAKAQGRAPAHGAFGRAQLQWAFA 227

Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +L SRLVRL  +  + AL+PWAD+LNH C   +FLD+  +   VV   +R+Y+ GEQ+ I
Sbjct: 228 VLLSRLVRLAGLGDQEALLPWADLLNHDCAAASFLDWSATEAAVVLRAERRYRAGEQLLI 287

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
           SYG+K++GELLLSYGF P  G+NP D   L L L   D     K  ALR++GL+AS+ FP
Sbjct: 288 SYGQKTSGELLLSYGFCPDLGSNPHDGCRLLLELAPGDAARNWKAAALRQHGLAASQLFP 347

Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
           +++   P EL+ Y     +    + + E++A     ++  + DI  P +   AL+ ++ +
Sbjct: 348 LRMAAAPFELVHYTAFSAAVVGSRQEAEQLA----RRLFEEGDIP-PALQTAALEAVVAA 402

Query: 425 CESSISKYSR 434
           C+++++ Y R
Sbjct: 403 CKAALAAYPR 412


>gi|255083899|ref|XP_002508524.1| set domain protein [Micromonas sp. RCC299]
 gi|226523801|gb|ACO69782.1| set domain protein [Micromonas sp. RCC299]
          Length = 425

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 112/303 (36%), Positives = 174/303 (57%), Gaps = 20/303 (6%)

Query: 88  LPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV 146
           LP QK+ +  VD+ E RGLVA + +R+GE LL +P S +IT +   +    G      ++
Sbjct: 7   LPAQKLELV-VDLPEGRGLVATEEVRRGESLLDIPESTLITVERAIAESNLGPA--HANL 63

Query: 147 PDWPLLATYLISEA----SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
            +W +LA +L  +A    +    SR++ Y+ ALPR+   +L W   ++   L  S  +  
Sbjct: 64  QEWSVLAAFLAEQALAIDAGADGSRFATYVRALPRRTGGVLDWPEEDVKELLAGSPSQRA 123

Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
           A+ER  +V    +++R    + +P L P         +W+F +LFSRL+RLP+  G +AL
Sbjct: 124 AMERQASVDAAIDEIR----ASFPQLTPG------ALRWAFDVLFSRLIRLPNRGGALAL 173

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
           VPWADMLNH    + ++D   +   V  + DR+Y+PGEQV+ SYG + + ELL+SYGF P
Sbjct: 174 VPWADMLNHRPGCDAYID--DTGGAVCLSPDRRYKPGEQVYASYGPRPSSELLISYGFAP 231

Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
             G NP D  E+ L +  +D+    K +ALR+ GLS  E FP+++ G+P +L+ YA   +
Sbjct: 232 AVGENPDDEFEVVLGIDPNDRHADAKADALRRIGLSPVEAFPLKLNGYPKQLLQYASFAL 291

Query: 383 SPP 385
             P
Sbjct: 292 CDP 294


>gi|303275964|ref|XP_003057276.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226461628|gb|EEH58921.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 308

 Score =  200 bits (509), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 119/323 (36%), Positives = 181/323 (56%), Gaps = 21/323 (6%)

Query: 82  WLSDSG-LPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           WL++S  LP QK+ +  VD+ E RGLVA +++++GE LL +P + +IT +      + G 
Sbjct: 1   WLTNSQRLPAQKLDL-VVDLPEGRGLVAREDVKRGEPLLEIPDASLITVERAVKESKLGP 59

Query: 140 VLKQCSVPDWPLLATYLISEA----SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
             K   + +W LLA +L  +A    + ++S  ++ Y+ ALPR+   +L W   ++   L 
Sbjct: 60  --KHAELQEWSLLAAFLAEQALDIENGDESGVFAAYVKALPRRTGGVLDWPEEDVKTLLA 117

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
            S  +  A ER  +V G   ++R    +++P L P         +W+F +LFSRL+RLP+
Sbjct: 118 GSPSQRAAYERQASVDGAIEEIR----AEFPQLTPG------ALRWAFDVLFSRLIRLPN 167

Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
             G +ALVPWADMLNH      ++D   S   V    DR Y+PGEQVF SYG++ + ELL
Sbjct: 168 RGGELALVPWADMLNHKPGCNAYID--DSGGKVCLQPDRAYKPGEQVFASYGQRPSAELL 225

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
           +SYGF P  G NP D  E+ L +  +D+    K  AL K GL   E FP+++ G+P +L+
Sbjct: 226 ISYGFAPEVGENPDDEYEITLGIDPNDRYADAKAAALEKIGLRPVESFPLRLNGYPKQLL 285

Query: 376 AYAYLVVSPPSMKGKFEEMAAAA 398
            YA   +  P    + E +A  A
Sbjct: 286 QYASFALCDPDDPKELEGLAEKA 308


>gi|168002824|ref|XP_001754113.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694667|gb|EDQ81014.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 638

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 180/361 (49%), Gaps = 28/361 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA-----DSKWS 133
           L +WLS  G P Q + +        GL A ++ ++GE  L +P +  +T          +
Sbjct: 85  LSEWLSKQGFPTQDVILTGFGEEGVGLAAGRDFKEGEVALKIPENYTVTGVDVVNHPVVA 144

Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
            P AG         D   L  +L+ E S  + S W  Y+   P    S + WT  E  + 
Sbjct: 145 APAAGR-------GDVIGLTLWLMYERSLGEKSVWYPYLQTFPSTTLSPILWTAEEQQKL 197

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           L+ S   E   +R   + G Y DL+   F+K P  FP+E F++E FK +F ++ SR V L
Sbjct: 198 LKGSPALEEVQQRSAALEGEYEDLQ-SYFTKDPQAFPQEYFSLEAFKSAFSVILSRAVYL 256

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK-KSNG 312
           PS D   ALVP+AD LNH  + + +LDY    Q VVF  DR Y+ GEQVF SYG+ +SN 
Sbjct: 257 PSAD-LFALVPYADALNHRADSQAYLDYSMEDQAVVFPVDRNYKEGEQVFTSYGRERSNA 315

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
           +LL++YGFV  +  N  D ++L + L   D+    K + L++  L + + FP+ +  +P 
Sbjct: 316 DLLITYGFV--DENNAMDYLDLEVGLVDGDRLLVLKQQILQQAMLDSPQTFPLYLDRFPT 373

Query: 373 ELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESSISK 431
           +L+ Y  L         + ++   A   K+   KDI   + +E + LQ ++  C + +  
Sbjct: 374 QLLTYMRL--------SRLQD--PALFPKIVFDKDIMLDQANEYECLQLLMGECRTKLGN 423

Query: 432 Y 432
           Y
Sbjct: 424 Y 424


>gi|302832548|ref|XP_002947838.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
 gi|300266640|gb|EFJ50826.1| hypothetical protein VOLCADRAFT_88145 [Volvox carteri f.
           nagariensis]
          Length = 508

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 171/354 (48%), Gaps = 44/354 (12%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
           ++D  +     Q WL   GL  Q + ++      RGLVA +++ +GE L+ +P  LVITA
Sbjct: 15  DVDGPDLEPEFQSWLRSEGLSTQPLLLRHCGREGRGLVASRSLSRGEVLVKLPDHLVITA 74

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLI--------SEASFEKSSRWSNYISALPRQPY 180
           +       AGE         W LLA  L          + S   ++RW  Y++ LP++P 
Sbjct: 75  ERA-----AGE---------WSLLALLLAEVKGRLAAGDRSSPAAARWGPYVAVLPQRPG 120

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD-LFPEEV-FNMET 238
           +LL W   E+ + L  S ++  A    +    ++ +L   I     D L PE V  +   
Sbjct: 121 TLLDWPAKEVQQLLRGSPLQRLADSITSAASASWRELEPLIAQGRADGLVPEHVPLSKGD 180

Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQG--------- 287
            +W+FG+L SR +RLPS      L PWAD LNH  S E    LD+     G         
Sbjct: 181 LEWAFGVLLSRCIRLPSRGDLQVLAPWADQLNHDVSAEEGCHLDWSWDVAGPAVPGGDRA 240

Query: 288 -------VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL-K 339
                  +V   DR Y  G+QV++SYG KS+GELLLSYGF P   +NP     L +++ +
Sbjct: 241 GGATKGALVLRADRPYAAGQQVYVSYGPKSSGELLLSYGFCPPPASNPHQDCRLRVAVDR 300

Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAY-AYLVVSPPSMKGKFE 392
           + D     K +AL ++GL +   FP+++ G P  L+ Y A+L   P   +  FE
Sbjct: 301 QGDPLADLKEQALARHGLPSELEFPLKLEGIPEGLLQYLAFLDARPKVAQETFE 354


>gi|298706765|emb|CBJ29688.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Ectocarpus siliculosus]
          Length = 521

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 196/371 (52%), Gaps = 39/371 (10%)

Query: 84  SDSGLPPQKM--AIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           SD G+ P  +  A+  VD  E     RG++A + I++G++L  +P  L++T D+  +  E
Sbjct: 107 SDWGVGPHALSVAVDTVDENENETAGRGMIANREIKEGDELFTLPIDLLLTKDA--AKKE 164

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYL 194
            G  +    + ++  +A   + E +  K S WS+YI  LP   + Y    W   +L   L
Sbjct: 165 FGADVITEDLSEYIAIALLAVHEKAKGKESFWSSYIGVLPTVEEVYPTYLWAEEDL-ALL 223

Query: 195 EASQI--RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
           E S +     ++ R   V   Y  +   +  K+P++ P EV   E F+W+F +LFSR +R
Sbjct: 224 EGSPVIAATESMRRKLEV--EYATVENDLLDKFPEILPREVHTYEEFQWAFAMLFSRAIR 281

Query: 253 LPSMDG--RVALVPWADMLNHSCEVETFLD-------YDKSSQGVVFTTDRQYQPGEQVF 303
           L  +     VALVP+AD+ NH+    +++D       + K+ + VV+  DR Y+  EQV+
Sbjct: 282 LGGLSTGEAVALVPYADLFNHNPFANSYIDARQQGLFFSKTDEVVVYA-DRSYKKMEQVY 340

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYG K N +LLL YGF      NP +SV++ +SL ++D+ Y+ K   L + GL  ++ F
Sbjct: 341 ISYGPKGNSDLLLLYGFSLDR--NPYNSVDVTVSLDENDELYERKKAFLSEAGLPPTKAF 398

Query: 364 PIQITGWPLELMAYAYLV-VSPPSMKGK-FEEMAAAASNKMTSKKDIKCPEIDEQALQFI 421
           P+    +P EL+ Y  L+ ++   ++G+  E+++         KK     E+    L  +
Sbjct: 399 PLYNDRYPDELLQYLRLIQLNTDQLRGRTLEDLS-------FEKKQTDVNEL--MVLDSL 449

Query: 422 LDSCESSISKY 432
           +++C+++I+ Y
Sbjct: 450 VEACKATIAGY 460


>gi|159465555|ref|XP_001690988.1| lysine N-methylase [Chlamydomonas reinhardtii]
 gi|158279674|gb|EDP05434.1| lysine N-methylase [Chlamydomonas reinhardtii]
          Length = 563

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 176/370 (47%), Gaps = 55/370 (14%)

Query: 80  QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD--SKWSCPEA 137
           Q WL   G+  Q + +++     RGLVA + + +GE LL +P SL++T    ++ SC   
Sbjct: 50  QAWLRGQGIITQPLVLRQCGREGRGLVADRPLGRGEALLQLPDSLLLTPQRAAEESC--L 107

Query: 138 GEVLKQCS---------------VPDWPLLATYLIS----EASFEKSSRWSNYISALPRQ 178
             +L+Q S               +P+W LLA YL       A+ ++ SRW+ Y+  LP++
Sbjct: 108 APLLRQLSPAGASTSAAGAAALPLPEWSLLALYLAELRGRAAAGDRGSRWAAYVDMLPQR 167

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-LRIFSKYPDLFPEEV-FNM 236
           P ++L W   E  + L  S +   A         ++ +L  L    +   L P  V  + 
Sbjct: 168 PGTVLDWPAKETRQLLRGSPLLRLADSIAAAAAASWEELAPLIARGRAEGLVPAHVSLSK 227

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET------------------- 277
               W+FG+L SR +RLP  D    L PWAD+LNH    ET                   
Sbjct: 228 ADLDWAFGVLLSRCIRLPGRDQLQVLAPWADLLNHDVNAETGAAAAGAAGSGATGSGASG 287

Query: 278 ----FLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
                LD++ +++G    +V  TDR Y  G+QV++SYG KS+GELLLSYGF P    NP 
Sbjct: 288 SGGCHLDWEPTARGGAGALVLRTDRAYAAGQQVYVSYGPKSSGELLLSYGFCPPPAANPH 347

Query: 330 DSVELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
              +L + +  S   D     K E L K+GL  S  FP+++ G P  L+ Y   V + P 
Sbjct: 348 QDYKLLVGVNDSAAADPLAALKAEVLAKHGLPPSLEFPLKLEGLPAGLLNYLAFVEAAPQ 407

Query: 387 MKGKFEEMAA 396
           +  +  ++ +
Sbjct: 408 VPQELHDLGS 417


>gi|384251962|gb|EIE25439.1| ResB-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 889

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 101/311 (32%), Positives = 147/311 (47%), Gaps = 48/311 (15%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
            +L+ WL+  GLPPQK+AI       RGLVA + +RK EKLL VP  L++TAD       
Sbjct: 32  GSLEDWLTHRGLPPQKVAISHEIPEGRGLVATRRVRKHEKLLNVPAQLLLTADVALQHSA 91

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPRQPYSLLYWTRAELDRYL 194
            G +L+ C VP W +LAT+L       +  +  W  Y+ ALP Q   +L W   E+D  L
Sbjct: 92  YGGLLESCGVPAWSVLATFLAETRRQPEGDKNVWGQYVDALPSQTGCVLEWASEEVD-LL 150

Query: 195 EASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
             +     A E I     +  +L   LR  +      P      +  +W F +L SRL+R
Sbjct: 151 RGTAAMRAADEIIAACSASVAELAPILRESAS----MPGGPLTEQDLRWGFSMLLSRLIR 206

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
           LP   G+  L         +C                            V++SYG+KS+ 
Sbjct: 207 LP---GKQDL--------EAC----------------------------VYVSYGQKSDT 227

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
           +LLLSYGF+P   +NP  +  L LSL++ D C+  K   L + G SA   FP+++   P 
Sbjct: 228 QLLLSYGFMPAPLSNPHSACNLRLSLQRDDPCFDAKRALLEEAGHSACMEFPLRLDSLPQ 287

Query: 373 ELMAYAYLVVS 383
           +L+ YA  + +
Sbjct: 288 KLINYAAFLCT 298


>gi|160331079|ref|XP_001712247.1| met [Hemiselmis andersenii]
 gi|159765694|gb|ABW97922.1| met [Hemiselmis andersenii]
          Length = 464

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/359 (28%), Positives = 177/359 (49%), Gaps = 27/359 (7%)

Query: 87  GLPPQKMAI--QKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           G  P +  +  +  D GE   RGL+A K I++GEKL+ +P +L+++ D      E  + L
Sbjct: 80  GRAPHQCFVSNETTDEGEPCGRGLLAFKKIQQGEKLIEIPENLILSVDRDQIKNEGNDFL 139

Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR--YLEASQI 199
            +     +  L  +LI + +    S+W  Y   LPR+   L    R  L+   +L  S+ 
Sbjct: 140 NE-----YDSLGIFLIQQMAMGDKSKWKIYFDILPREE-DLNLGFRWNLNDIVFLRGSKT 193

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR 259
              ++     +   +  L   IFSK    +P  +FN+  ++W+  IL SR + L ++  +
Sbjct: 194 LNASLYLKEKIKIQFLRLEKTIFSKNRLKYPVSIFNLAQWEWALSILLSRAIFLQNL-KK 252

Query: 260 VALVPWADMLNHSCEVETFLDYDKSS----QGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
           V+LVP+AD +NH+    ++++  K S      +V   D+ Y   +Q+F +YG+K+N ELL
Sbjct: 253 VSLVPYADFMNHNPFSTSYINSKKISFSKNHEIVMYADKDYNKFDQIFTTYGQKTNLELL 312

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
           L YGF+     NP DS+EL +SL   D  +++K + + +   ++   FPI    +P EL 
Sbjct: 313 LLYGFILER--NPFDSIELRISLSDKDSFFEKKKQFMIECEKTSEITFPIFYYKYPKELY 370

Query: 376 AYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
            +    +S        EE+ +   +      D    EI++   + +L SCE  +  YS+
Sbjct: 371 EFLRFCISNQ------EELGSTDLSDFNF-NDENNYEIEKIIRKLVLFSCEKLLKNYSK 422


>gi|452821842|gb|EME28868.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Galdieria sulphuraria]
          Length = 490

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 105/334 (31%), Positives = 162/334 (48%), Gaps = 35/334 (10%)

Query: 66  WGCEIDSLENASTLQKWLSDSGL----------PPQKMAI--QKVDVGE---RGLVALKN 110
           W  EI      S    WL ++G+           P ++ I  +  D GE   RGL++ ++
Sbjct: 76  WSSEI------SAFYDWLKENGVYLSEKASWTHAPHRLVIAEETKDEGEYSGRGLLSSRS 129

Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
           +  GEK+L +P  L+ T   K +       +      ++  +   L+ E +    S +  
Sbjct: 130 VNLGEKVLEIPEKLMFT--RKLALETFPTSIIASIEDEYVSIGLLLLYEKAKGFDSFFKP 187

Query: 171 YISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL 228
           Y+  LP   +   L  W+  +LD  L+ S       +    ++  Y  L   I  + P+ 
Sbjct: 188 YLDILPTLDELNPLFLWSNKDLD-LLQGSPTLSACEQLRDKLLREYTYLGKNIIPQIPN- 245

Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG- 287
           F  +  + + F+W+FGILFSR +  PS   R+ALVP+AD+LNHS     F+D +K   G 
Sbjct: 246 FASKPIDFKQFQWAFGILFSRAICFPS-SKRIALVPYADLLNHSPFCSAFIDEEKIPFGN 304

Query: 288 ----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
                V   DR Y+P EQV++SYG +SN ELLL YGF      NP D VE+ + L K+D 
Sbjct: 305 GVTEAVVYVDRLYEPYEQVYVSYGPRSNQELLLLYGFSLER--NPFDCVEITIGLDKTDP 362

Query: 344 CYKEKLEALRKYGLSASECFPIQITGWPLELMAY 377
            Y EK   L  YG S  + FP+ +  +P+E+  +
Sbjct: 363 LYLEKCRMLESYGKSPLQSFPLYMDRYPVEMAEF 396


>gi|380089029|emb|CCC12973.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 465

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 109/365 (29%), Positives = 185/365 (50%), Gaps = 37/365 (10%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSK 131
           +T++ WL +SG     + +  +++ +     RG+  L+  ++GEK+L +P S++ T +  
Sbjct: 8   NTMESWLKESG----AVGLDGLELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHA 63

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
           ++ P  G  L     P  P   L TYL+   S E       ++++ALP    S +++T  
Sbjct: 64  YADPLLGPALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEE 123

Query: 189 ELD------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           EL+       Y    Q+ E++IE        +  L +++F ++ DLFP + F++E +KW+
Sbjct: 124 ELEVCAGTSLYTITKQL-EQSIE------DDHRALVMQLFIQHRDLFPLDKFSIEDYKWA 176

Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
              ++SR +     DG+    L P+ADMLNHS E +    YD SS  +     + Y+PG+
Sbjct: 177 LCTVWSRRMDFQLRDGKSMRLLAPFADMLNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGD 236

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           QVFI+YG   N  LL  YGFV     NP+D+ +L LS       Y++K +     GL ++
Sbjct: 237 QVFINYGSVPNSRLLRLYGFVIP--GNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDST 294

Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
              P+ +T  PL      YL +     +    ++AA A     +K D K  + +E + LQ
Sbjct: 295 STIPLTLTD-PLPKNVLRYLRI----QRADASDLAAMALQN--AKADEKVSDSNEVEILQ 347

Query: 420 FILDS 424
           F+++S
Sbjct: 348 FLVES 352


>gi|399949805|gb|AFP65462.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Chroomonas mesostigmatica
           CCMP1168]
          Length = 464

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 160/296 (54%), Gaps = 18/296 (6%)

Query: 99  DVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
           D GE   RGL+A + I++GEKL+ +P +L++    K +  E    L +     +  LA  
Sbjct: 89  DEGESCGRGLLAFRKIQQGEKLIEIPENLILKKSLKENRSEDLSFLNE-----YDSLAIK 143

Query: 156 LISEASFEKSSRWSNYISALPRQP-YSLLY-WTRAELDRYLEASQIRERAIERITNVIGT 213
            I E +  + S+W  Y   LP++   +L++ W  +++  +L  S++   +      +   
Sbjct: 144 AIQERAIGEKSKWKVYYEILPKEKDLNLVFRWKISDI-VFLRGSKVLNASFYLKEKIKIQ 202

Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
           +  +   IFSK   ++PE++FN+++++W+  +L SR + L +M  ++ALVP+AD +NH+ 
Sbjct: 203 FLRIEKTIFSKNRLVYPEKIFNLQSWEWAISLLLSRAIFLQNM-KKIALVPYADFINHNP 261

Query: 274 EVETFLDYDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
              ++++  K     +  +V   D+ Y   +Q+F +YG+K+N ELL+ YGF+     NP 
Sbjct: 262 FSTSYINSKKIAFSENNEIVMYADKDYNKFDQIFTTYGQKTNLELLVLYGFIIER--NPF 319

Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPP 385
           DS+EL ++L   D+ Y +K + +     +    FP+    +P EL  +  L +S P
Sbjct: 320 DSIELRVALSTKDELYNKKEKFINDCEKTEQITFPVFYYKYPKELYEFMRLCLSGP 375


>gi|336260071|ref|XP_003344832.1| hypothetical protein SMAC_06115 [Sordaria macrospora k-hell]
          Length = 456

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 183/363 (50%), Gaps = 37/363 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           ++ WL +SG     + +  +++ +     RG+  L+  ++GEK+L +P S++ T +  ++
Sbjct: 1   MESWLKESG----AVGLDGLELADFPDTGRGVKTLRPFKEGEKILTIPSSILWTVEHAYA 56

Query: 134 CPEAGEVLKQCSVPDWP--LLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAEL 190
            P  G  L     P  P   L TYL+   S E       ++++ALP    S +++T  EL
Sbjct: 57  DPLLGPALCSVQPPLSPEDTLTTYLLFVRSRESGYDGQRSHVAALPTSYSSSIFFTEEEL 116

Query: 191 D------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           +       Y    Q+ E++IE        +  L +++F ++ DLFP + F++E +KW+  
Sbjct: 117 EVCAGTSLYTITKQL-EQSIED------DHRALVMQLFIQHRDLFPLDKFSIEDYKWALC 169

Query: 245 ILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
            ++SR +     DG+    L P+ADMLNHS E +    YD SS  +     + Y+PG+QV
Sbjct: 170 TVWSRRMDFQLRDGKSMRLLAPFADMLNHSSEAKPCHVYDVSSGNLSVLAGKDYEPGDQV 229

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           FI+YG   N  LL  YGFV     NP+D+ +L LS       Y++K +     GL ++  
Sbjct: 230 FINYGSVPNSRLLRLYGFVIP--GNPNDTYDLVLSTHPQAPFYEQKHKLWVSAGLDSTST 287

Query: 363 FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFI 421
            P+ +T  PL      YL +     +    ++AA A     +K D K  + +E + LQF+
Sbjct: 288 IPLTLTD-PLPKNVLRYLRI----QRADASDLAAMALQ--NAKADEKVSDSNEVEILQFL 340

Query: 422 LDS 424
           ++S
Sbjct: 341 VES 343


>gi|223992783|ref|XP_002286075.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
 gi|220977390|gb|EED95716.1| rubisco small subunit small subunit n-methyltransferase
           [Thalassiosira pseudonana CCMP1335]
          Length = 434

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 163/341 (47%), Gaps = 42/341 (12%)

Query: 87  GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           G  P  +AI    V E       RGL+A ++I  G++LL +P  L IT  S       G+
Sbjct: 23  GEAPHPLAISTETVDEITNESSGRGLLARRSINDGDELLKIPMDLCITRKSARKA--LGK 80

Query: 140 VLKQCSVPDWPLLATYLISEA-SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
              Q  + ++  +A  LI E       S W  Y+  LP   +      W   +L  +L+ 
Sbjct: 81  DALQDGINEYLAIACQLIHEKYVLGDESEWDAYMGVLPEVEEVNPTFTWKDEDL-AFLDG 139

Query: 197 SQIRERAIERITNVIGTYNDL---RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           S +          +   Y+ L   +  + +K+PD FP E F  E + W+F +LFSR +RL
Sbjct: 140 SPVVAATRSLQMKLRREYDALLGGQDGLIAKFPDRFPAEHFTYENWVWAFTMLFSRAIRL 199

Query: 254 PSMD--GRVALVPWADMLNHSCEVETFLD--------YDKSSQGVVFTTDRQYQPGEQVF 303
            ++    R+A+VP+AD++NHS     F+D        +    + V+   DR Y+  EQV+
Sbjct: 200 RNLQVGERLAMVPYADLINHSAFSGAFIDARESGDWLFKNGEEEVILYADRGYRQMEQVY 259

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK-------------KSDKCYKEKLE 350
           ISYG+KSN ELLL YGF      NP +SV++ +S+              + D   +EK+E
Sbjct: 260 ISYGQKSNAELLLLYGFALER--NPYNSVDVTVSIAPRTAALAAANEGIEVDPLAQEKVE 317

Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYL-VVSPPSMKGK 390
            L   G   +  FP     +P+E++ +  L +++P   +GK
Sbjct: 318 FLASVGRDQTVDFPCYADRYPVEMLEFLRLMMMTPEDTRGK 358


>gi|397613505|gb|EJK62256.1| hypothetical protein THAOC_17139 [Thalassiosira oceanica]
          Length = 648

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/376 (29%), Positives = 182/376 (48%), Gaps = 53/376 (14%)

Query: 55  TKTTVTQNMIPWGCEIDSLENASTLQKWLSDS---GLPPQKMAIQKVDVGE-------RG 104
           TK +    +I W      LE    +  +L++S   G  P  +AI    V E       RG
Sbjct: 156 TKLSANARLISW------LEEEGGV--YLAESSTWGEAPHPLAISTETVDEITNESSGRG 207

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE- 163
           L+A ++I  G++LL +P  L +T   K +  E G+   Q  + ++  +A  LI E   + 
Sbjct: 208 LLARRSINDGDELLKIPLDLCLT--RKSARRELGKDALQEGINEYLAVACQLIHEKFVKG 265

Query: 164 KSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR- 220
           + S ++ Y+  LP   +      W   +L  +LE S +          +   Y+DL    
Sbjct: 266 EDSFYAAYMGVLPEVDEVNPTFTWPDEDL-AFLEGSPVVAATRSLQMKLRREYDDLLGGP 324

Query: 221 --IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD--GRVALVPWADMLNHSCEVE 276
             + +K+P  FP E +  E ++W+F +LFSR +RL ++    R+A+VP+AD++NHS   +
Sbjct: 325 DGLVAKFPLRFPAEHYTFENWEWAFTMLFSRAIRLRNLQVGERLAMVPYADLINHSAFSQ 384

Query: 277 TFLD--------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
            F+D        +    + V+   DR Y+  EQV+ISYG+KSN ELLL YGF      NP
Sbjct: 385 AFIDARESGDWLFKSGEEEVILYADRGYRQMEQVYISYGQKSNAELLLLYGFALER--NP 442

Query: 329 SDSVELPLSLK-------------KSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
            +SV++ +S+              + D    EKLE L   G   +  FP     +P+E++
Sbjct: 443 YNSVDVTVSIAPRTKQIAEANEGVEEDPLADEKLEFLLSVGRDQTVDFPCYADRYPVEML 502

Query: 376 AYAYL-VVSPPSMKGK 390
            Y  L +++P   +GK
Sbjct: 503 EYLRLMMMTPEDTRGK 518


>gi|219121061|ref|XP_002185762.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
 gi|209582611|gb|ACI65232.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Phaeodactylum tricornutum CCAP
           1055/1]
          Length = 575

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 177/380 (46%), Gaps = 50/380 (13%)

Query: 50  NDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE------- 102
           NDA+  K T   N+I W    +     S    W    G  P  +AI      E       
Sbjct: 73  NDAN-PKFTANTNLIQW-LTTEGNVYLSEESSW----GEAPHPLAISTETKDEITNESSG 126

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL+A ++I  G++LL +P +L +T  +       G+ +    + ++  +A +LI E + 
Sbjct: 127 RGLLARRDINDGDELLRIPMALCMTKSAARKA--VGKDVLPSEINEYLAMACHLIYERNV 184

Query: 163 E-KSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-- 217
             + S W  Y+  LP   +      W   +L  +L  S +          +   Y+ L  
Sbjct: 185 RGEESPWKPYLDVLPDIDEVNPTFTWPDEDL-AFLNGSPVIAATKSLQMKLRREYDALLG 243

Query: 218 -RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSCE 274
               + +KYPD FP E FN + ++W+F +LFSR +RL S+     +ALVP+AD++NHS  
Sbjct: 244 GEDGLLAKYPDRFPAEAFNFKAWEWAFTMLFSRAIRLRSLKQGETLALVPYADLINHSPF 303

Query: 275 VETFLD--------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
            + ++D        +    + V+   DR Y+  EQ++ISYG KSN ELLL YGF      
Sbjct: 304 SQAYIDARQNGDWLFKSGDEEVILYADRGYRRMEQIYISYGPKSNAELLLLYGFAVER-- 361

Query: 327 NPSDSVELPLSLKKS---------------DKCYKEKLEALRKYGLSASECFPIQITGWP 371
           NP +SV++ +S+                  D   +EK   L + G  A+  FP     +P
Sbjct: 362 NPFNSVDVTVSIAPRTASFVKELDDDTIPVDPLAEEKAAFLEQVGRDATVDFPCYADRYP 421

Query: 372 LELMAYAYLV-VSPPSMKGK 390
           +E++ Y  L+ ++P   +GK
Sbjct: 422 VEMLEYLRLMQMTPEDTRGK 441


>gi|302823067|ref|XP_002993188.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
 gi|300138958|gb|EFJ05708.1| hypothetical protein SELMODRAFT_449044 [Selaginella moellendorffii]
          Length = 600

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 176/361 (48%), Gaps = 20/361 (5%)

Query: 71  DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           + L+ A  + KWL + G P Q + +   +    G  A ++++ G+  L +P +  +TA  
Sbjct: 42  ERLDAARDMTKWLQEQGFPQQPLLVSSFEDKGLGCCATRDLQAGDAALSIPENFTVTAVD 101

Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
             + P      +     +   LA +L+ E    + S W  Y+   P    SLL W + E 
Sbjct: 102 VANHPVISSAAE--GRDELVGLALWLMYEQERSQDSPWYPYVKVFPASTLSLLLWEQEEQ 159

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
           +  L  S    +  +++T++  T++ L+  +  K    FP E F    FK +F ++ SR 
Sbjct: 160 EELLRGSSALAKVKDQLTSLRQTFDALKDTL--KDNKDFPMEKFTFSAFKTAFSVVLSRA 217

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK- 309
           V LPS +   ALVP+ D++NH     + LDYD   Q V    D++Y+ G+QVF SY +  
Sbjct: 218 VYLPSAE-LFALVPFGDLINHESS-RSLLDYDIEEQKVKLAVDKRYKKGDQVFASYAQNL 275

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           ++ + L+ YGF+  + ++ +D +E+ + L   D     K E L++ GL+  + FP+ +  
Sbjct: 276 TSADFLIRYGFL--DESDENDCIEIEVGLVSGDSLAPLKREILQEVGLTVPQKFPLYLNR 333

Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESS 428
           +P +L+ Y  L     S  G F         K+T +KD+   + +E + L  ++  C + 
Sbjct: 334 FPTQLLTYTRLARIQDS--GLFA--------KITFEKDLIVSQTNEYETLMLLMADCRTK 383

Query: 429 I 429
           +
Sbjct: 384 L 384


>gi|449017905|dbj|BAM81307.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Cyanidioschyzon merolae strain 10D]
          Length = 567

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 157/350 (44%), Gaps = 80/350 (22%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE--- 159
           RG +A ++I+ GE L  VP  L  T D       A  V +     ++  LAT L+ E   
Sbjct: 116 RGFLARRDIQAGEVLFQVPFHLCFTKDVAVRRFAALNVPELADEEEFFALATLLLYERGL 175

Query: 160 -ASFEKSSR-----WSNYISALPRQPY-----------------SLLYWTRAELDRYLEA 196
             S++KS R     W  Y+  LP  P+                 +L  W   E+ ++L+ 
Sbjct: 176 DESWKKSGRGPGSFWGPYLDILPPVPWEFKGAEPAESLSMDPLDALWLWAEDEM-QWLQG 234

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPS 255
           S     A    + V   Y +   R++ ++P +F  E  F +E F W+FG+LFSR V LP+
Sbjct: 235 SPTLLSARALRSKVEREYAEACERLYRRHPHIFDLEGAFRLERFLWAFGVLFSRAVSLPA 294

Query: 256 MDGRVALVPWADMLNHSCEVETFLD----------------------------------- 280
            +G +ALVP+AD+ NHS    +F+D                                   
Sbjct: 295 ENGMLALVPYADLANHSAFCVSFIDARTAAFPYAFRASSKQKRGQWWQRFLAPNSDDAGA 354

Query: 281 --------YDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
                   Y + +Q  VV   DR Y   EQV++SYG+KSN ELLL YGFV     NP +S
Sbjct: 355 VANTDSSHYREDAQREVVAYADRFYDKFEQVYVSYGQKSNAELLLLYGFV--SDRNPYNS 412

Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSAS------ECFPIQITGWPLELM 375
           VE+ +SL  S+      L+  R + L+        ECFP+    +PLELM
Sbjct: 413 VEVCVSLSGSEAAGAGLLDRKRSFLLACGRDPDKPECFPLYADRYPLELM 462


>gi|302764082|ref|XP_002965462.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
 gi|300166276|gb|EFJ32882.1| hypothetical protein SELMODRAFT_406852 [Selaginella moellendorffii]
          Length = 481

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 175/375 (46%), Gaps = 25/375 (6%)

Query: 66  WGCEI-----DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFV 120
           W C       + L+ A  + KWL + G P Q + +   +    G  A ++++ G+  L +
Sbjct: 32  WLCRASIADEERLDAARDMTKWLQEQGFPQQPLLVSSFEDKGLGCCATRDLQAGDAALSI 91

Query: 121 PPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY 180
           P +  +TA    + P      +     +   LA +L+ E    + S W  Y+   P    
Sbjct: 92  PENFTVTAVDVANHPVISSAAE--GRDELVGLALWLMYEQERSQDSPWYPYLKVFPASTL 149

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
           S L W + E +  L  S    +  +++T++  T++ L+  +  K    FP E F    FK
Sbjct: 150 SPLLWEQEEQEELLRGSSALAKVKDQLTSLRQTFDALKDTL--KDNKDFPMEKFTFSAFK 207

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
            +F ++ SR V LPS +   ALVP+ D++NH     + LDYD   Q V    D++Y+ G+
Sbjct: 208 AAFSVVLSRAVYLPSAE-LFALVPFGDLINHESS-RSLLDYDIEEQKVKLAVDKRYKKGD 265

Query: 301 QVFISYGKK-SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           QVF SY +  ++ + L+ YGF+     N  D +E+ + L   D     K E L++ GL+ 
Sbjct: 266 QVFASYAQNLTSADFLIRYGFLDESDEN--DFIEIEVGLVSGDSLAPLKREILQEVGLTV 323

Query: 360 SECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD-IKCPEIDEQAL 418
            + FP+ +  +P +L+ Y  L     S  G F         K+T +KD I C   + + L
Sbjct: 324 PQKFPVYLNRFPTQLLTYTRLARIQDS--GLFA--------KITFEKDLIVCQTNEYETL 373

Query: 419 QFILDSCESSISKYS 433
             ++  C + +  +S
Sbjct: 374 MLLMADCRTKLLSFS 388


>gi|346319394|gb|EGX88996.1| Protein kinase-like domain [Cordyceps militaris CM01]
          Length = 1753

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 182/373 (48%), Gaps = 32/373 (8%)

Query: 76   ASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
            + +++ WL  SG +    + +    V  RG+ AL++ +KGE++L +P + + TA++  + 
Sbjct: 882  SESMEAWLKHSGAVGVDAIEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARAD 941

Query: 135  PEAGEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            P  G VL+      SV D   LA +L+   S  +  +  R   +I+A+P++  + +++  
Sbjct: 942  PLLGPVLRSAQPPLSVED--TLAIHLLFVKSRTAGYEGQRL--HIAAMPQRHSASIFFAE 997

Query: 188  AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
             EL +  E S +     +    V   +  L +++ S++ DLFP + F +E +KW+   ++
Sbjct: 998  DEL-QVCEGSSLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDYKWALCTIW 1056

Query: 248  SRLVRLPSMDG-RVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            SR +     D   V LV P ADMLNHS +V+    YD +S  +     + YQ G+Q+FI 
Sbjct: 1057 SRAMDFAVSDTTSVRLVAPLADMLNHSLDVKQCHAYDPTSGDLSILAAKDYQVGDQIFIY 1116

Query: 306  YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
            YG   N  LL  YGFV  +  NP+DS +L L        Y++K       GL ++   P+
Sbjct: 1117 YGSVPNNRLLRLYGFVLLD--NPNDSYDLVLQTSPMAPLYEQKERLWALAGLDSTCTIPL 1174

Query: 366  QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMT----SKKDIKCPEIDE-QALQF 420
                 PL      YL         + + + AA    MT    +  D K  + +E Q LQF
Sbjct: 1175 -TAKHPLPKNVLRYL---------RTQRLDAADVADMTLQLLNGTDGKVNDGNEIQVLQF 1224

Query: 421  ILDSCESSISKYS 433
            ++DS  S +  + 
Sbjct: 1225 LIDSLGSVLEGFG 1237


>gi|336468018|gb|EGO56181.1| hypothetical protein NEUTE1DRAFT_83233 [Neurospora tetrasperma FGSC
           2508]
 gi|350289741|gb|EGZ70966.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 459

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 176/365 (48%), Gaps = 40/365 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           +  WL  SG     + +  +++ +     RG+  L+  ++GEK+L +P  ++ T    ++
Sbjct: 1   MNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYA 56

Query: 134 CPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
            P  G  L+      SV D   LATY++   S E       ++I+ALP    S + +   
Sbjct: 57  DPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPTSYSSSILFAED 114

Query: 189 ELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           +L+       Y    Q+ E++IE        +  L +R+F ++PDLFP + F +E +KW+
Sbjct: 115 DLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDYKWA 167

Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
              ++SR +     DG     L P+ADMLNH+ EV+    YD SS  +     + Y+ G+
Sbjct: 168 LCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGNLSVLAGKDYEAGD 227

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           QVFI+YG   N  LL  YGFV     NP+DS +L LS       +++K +     GL ++
Sbjct: 228 QVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDST 285

Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
              P+ +T  PL      YL +      G      A  + + T   D K  + +E + L+
Sbjct: 286 ATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSNEVEILR 339

Query: 420 FILDS 424
           F+++S
Sbjct: 340 FLVES 344


>gi|440797255|gb|ELR18348.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 431

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 95/316 (30%), Positives = 157/316 (49%), Gaps = 24/316 (7%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADSKWS--CPEAGEVLKQCSVPDWPLLAT----Y 155
           R +VA  +I  GE LL VP SLV+ +AD+  +   PE   +L +    ++PL AT     
Sbjct: 57  RSVVAAHDIATGETLLSVPFSLVVDSADAPLATAAPEIRRILDE----EFPLSATNENAL 112

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+     + +S W  YI  LP    + L+++  EL  YLE S +   A +R   +   Y+
Sbjct: 113 LLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELS-YLEGSSLHHFARQRRRAIESQYD 171

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
            +   +F  YP+ F  E F+++ +KW+  +++SR   +   +G+  LVPWADM N + E 
Sbjct: 172 TIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVD--EGKRGLVPWADMFNMAPET 229

Query: 276 ETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDS 331
           E   +  D     ++++     + GEQ+F++YG+    SN +LL+ YGFV     NP D+
Sbjct: 230 EQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLE--NNPHDA 287

Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSASECF--PIQITGWPLELMAYAYLVVSPPSMKG 389
           V  P++   S    K  L  LR + L   + F  P  +  +P  L+A   + V+      
Sbjct: 288 VVFPMTHSSSASPRKRGL--LRAHDLDRDQFFVGPPALGEFPEHLLAAFRVTVATEQELD 345

Query: 390 KFEEMAAAASNKMTSK 405
              E +A    ++ S+
Sbjct: 346 ALLEQSAQGRQRLPSR 361


>gi|164423408|ref|XP_963594.2| hypothetical protein NCU08733 [Neurospora crassa OR74A]
 gi|157070080|gb|EAA34358.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 459

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 176/365 (48%), Gaps = 40/365 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           +  WL  SG     + +  +++ +     RG+  L+  ++GEK+L +P  ++ T    ++
Sbjct: 1   MNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHAYA 56

Query: 134 CPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRA 188
            P  G  L+      SV D   LATY++   S E       ++I+ALP    S + +   
Sbjct: 57  DPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPASYSSSILFAED 114

Query: 189 ELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           +L+       Y    Q+ E++IE        +  L +R+F ++PDLFP + F +E +KW+
Sbjct: 115 DLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDYKWA 167

Query: 243 FGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
              ++SR +     DG     L P+ADMLNH+ EV+    YD SS  +     + Y+ G+
Sbjct: 168 LCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSVFAGKDYEAGD 227

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           QVFI+YG   N  LL  YGFV     NP+DS +L LS       +++K +     GL ++
Sbjct: 228 QVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVSAGLDST 285

Query: 361 ECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQ 419
              P+ +T  PL      YL +      G      A  + + T   D K  + +E + L+
Sbjct: 286 ATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSNEVEILR 339

Query: 420 FILDS 424
           F+++S
Sbjct: 340 FLVES 344


>gi|357153645|ref|XP_003576520.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 492

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/386 (27%), Positives = 189/386 (48%), Gaps = 40/386 (10%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
             ++W+S  G      +   V  G  GLVA +N+ +GE +  VP  L + AD+      A
Sbjct: 60  NFRRWISSQGADTGAASPTVVPEG-LGLVAARNLPRGEVVAEVPKKLWMDADAV----AA 114

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
            ++ + C     +  W  ++  ++ EA+    S W+ Y++ LPRQ  S ++W+  EL   
Sbjct: 115 SDIGRACRSGGDLRPWVSVSLLILREAARGGDSLWAPYLAILPRQTDSTIFWSEEEL-LE 173

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           ++ +Q+    +     V   ++++  +I     DLFP+ +   + F W+FGIL SR+   
Sbjct: 174 IQGTQLLSTTMGVKEYVQSEFDNVEAKIIGPNKDLFPDTI-TFDDFLWAFGILRSRV--F 230

Query: 254 PSMDG-RVALVPWADMLNHSCEVET-----------FLDYDKSSQGVVFT--TDRQYQPG 299
           P + G ++AL+P+AD++NHS ++ +           FL  D     VVF+  T  + + G
Sbjct: 231 PELRGDKLALIPFADLINHSADITSKQSCWEIQGKGFLGRD-----VVFSLRTPMEVKSG 285

Query: 300 EQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           EQV++ Y   KSN EL L YGF   E  +  DS  L L + +SD  Y +KL+     G+ 
Sbjct: 286 EQVYVQYDLDKSNAELALDYGFT--ETNSTRDSYTLTLEISESDPFYGDKLDIAELNGMG 343

Query: 359 ASECFPIQIT-GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
            +  F + +    P +++ Y  L+    +     E   A   NK+    ++     +E++
Sbjct: 344 ETAYFDVVLGESLPPQMITYLRLLCLGGTDAFLLE---ALFRNKVWGFLELPVSRDNEES 400

Query: 418 L-QFILDSCESSISKYSRFLQVKELL 442
           + Q I  +C+S+++ Y   ++  E L
Sbjct: 401 ICQVIQTACKSALTAYHTTIEEDEEL 426


>gi|322698908|gb|EFY90674.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 437

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 24/370 (6%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL +SG +    + +    +  RG+  LK  ++GE +L +P  ++ T +  ++    
Sbjct: 1   MESWLKESGAVGLDNLELADFPITGRGVRTLKCFKEGENILTIPSGILWTVEHAYADSIL 60

Query: 138 GEVLKQCSVP--DWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           G VL+  S+P      LA Y++   S  S     R  N+++ALP    S +++   +L+ 
Sbjct: 61  GPVLRSTSLPLSVEDTLAIYILFVRSRKSGYDGPR--NHVAALPASYSSSIFFMEDQLEV 118

Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
               S   I ++  +RI +    Y  L +R+  +YPDLFP + F +E +KW+   ++SR 
Sbjct: 119 CAGTSLYTITKQLEQRIED---DYRGLVVRMLGQYPDLFPLDKFTVEDYKWALCTVWSRA 175

Query: 251 VRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           +     DG+    L P+ADMLNHS E +    YD SS  +     + Y+ G+QVFI+YG 
Sbjct: 176 MDFVLPDGKSIRLLAPFADMLNHSSEAKQCHVYDASSGNLSVLAGKDYEAGDQVFINYGP 235

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
             N  LL  YGFV     NP+DS +L L+       +K+K +     GL ++    +  T
Sbjct: 236 MPNNRLLRLYGFVVP--GNPNDSYDLVLATHPMAPFFKQKQKLWASAGLDSTTTITLTFT 293

Query: 369 GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCES 427
             PL      YL V         E   A  + + T+  D K  + +E + L+F+++S   
Sbjct: 294 D-PLPKDVLRYLRVQRLD-----ESDVAVLALQQTNTTDAKISDSNEVEVLRFLVESIGG 347

Query: 428 SISKYSRFLQ 437
            ++ +   ++
Sbjct: 348 LLNNFGTHVE 357


>gi|293333172|ref|NP_001168589.1| uncharacterized protein LOC100382373 [Zea mays]
 gi|223949395|gb|ACN28781.1| unknown [Zea mays]
 gi|414885391|tpg|DAA61405.1| TPA: hypothetical protein ZEAMMB73_723554 [Zea mays]
          Length = 489

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 179/376 (47%), Gaps = 21/376 (5%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
            ++WL+  G      AI        GLVA +++ +GE +  VP  L + AD+  +     
Sbjct: 58  FRRWLASHGAGDGGKAIPAAVPEGLGLVAARDLPRGEVVAEVPKKLWMDADAVAASDIGR 117

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
                  +  W  +A  L+SE +    S W+ Y++ LPRQ  S ++W+  EL   ++ +Q
Sbjct: 118 ACGGGGGLRPWVAVALLLLSEVARGADSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQ 176

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG 258
           +    +     V   ++ ++  I S   DLFP  +   + F W+FG+L SR+   P + G
Sbjct: 177 LLSTTVGVKEYVQSEFDSVQAEIISTNKDLFPGSI-TFDDFLWAFGMLRSRV--FPELRG 233

Query: 259 -RVALVPWADMLNHSCEVET-FLDYDKSSQGVV-------FTTDRQYQPGEQVFISYG-K 308
            ++AL+P+AD++NHS  + +    ++   +G+          T    + G+Q++I Y   
Sbjct: 234 DKLALIPFADLVNHSPNITSEGSSWEIKGKGLFGRELMFSLRTPVNVKSGQQIYIQYDLD 293

Query: 309 KSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           KSN EL L YGFV    +NPS DS  + L + +SD  Y +KL+     GL  +  F + I
Sbjct: 294 KSNAELALDYGFVE---SNPSRDSFTVTLEISESDPFYGDKLDIAEANGLGETAYFDV-I 349

Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK-CPEIDEQALQFILDSCE 426
              PL      YL +        F  + A   N +    ++   P+ +E   Q + D+C+
Sbjct: 350 LNEPLPPQMLPYLRLLCIGGTDAF-LLEALFRNSVWGHLELPLSPDNEESICQAMRDACK 408

Query: 427 SSISKYSRFLQVKELL 442
           S+++ Y   ++  E L
Sbjct: 409 SALADYHTTIEEDEEL 424


>gi|168044593|ref|XP_001774765.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673920|gb|EDQ60436.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 523

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 163/350 (46%), Gaps = 33/350 (9%)

Query: 45  SVSTTNDASRTKTTVTQNMIPWG------CEIDSLENASTLQKWLSDSGLPPQKMAI--Q 96
           +VS+    SR + T+T ++           +    E+   L++W+ + GLP  K+++   
Sbjct: 53  AVSSEKRGSRCRNTLTTDVYKQDENDLAQSKKQEHESGIDLKQWMEEQGLPECKVSLAEH 112

Query: 97  KVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL 151
           +   G++G     +VA ++++ GE  L +P SLV+T +         E+L    + +   
Sbjct: 113 QPSEGDKGKPIHYVVASEDLQPGELALTIPKSLVVTLERVLGDETIAELLTTNKLSELAC 172

Query: 152 LATYLISEASFEKSSRWSNYISALPRQP-------YSLLYWTRAELDRYLEASQIRERAI 204
           LA YL+ E    K S W  YI  L RQ         S L W+R EL+ Y   S ++E  +
Sbjct: 173 LALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSREELNEYFTGSTMKEVVL 232

Query: 205 ERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SM 256
           ER+  +   Y +L         +F +YP   P E F+ E FK +F  + S +V L   S+
Sbjct: 233 ERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQAFVAVQSCVVHLQGVSL 292

Query: 257 DGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
             R ALVP    +L +    +  L        VV   DR Y+ G+ + +  G + N +LL
Sbjct: 293 ARRFALVPLGPPLLAYKSNCKAML--KAVDDNVVLEVDRAYKAGDPIAVWCGPQPNSKLL 350

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           L+YGFV  +  NP D + +  SL   D  Y++K   ++K      + F I
Sbjct: 351 LNYGFVDED--NPYDRLAVEASLDTEDPLYQQKRAIVQKNNRLTIQTFQI 398


>gi|255087300|ref|XP_002505573.1| set domain protein [Micromonas sp. RCC299]
 gi|226520843|gb|ACO66831.1| set domain protein [Micromonas sp. RCC299]
          Length = 509

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 95/321 (29%), Positives = 152/321 (47%), Gaps = 27/321 (8%)

Query: 68  CEIDSLENAS--TLQKWLSDSGLPPQKMAIQKVDV--GERG--LVALKNIRKGEKLLFVP 121
             +DS   A    L  WL   G+   K++   VD   G RG  LVA ++I  G+ +L +P
Sbjct: 48  ASVDSRTQADFDALWAWLGSEGVDVSKVSPALVDAAPGGRGWGLVAAEDIGGGDAVLAIP 107

Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
            SL +T D+  + P       +     W  +A  L+ E S  + SRW+ Y++ALP Q  +
Sbjct: 108 RSLWMTVDTALASPIGAHCGDEAG---WIAVALQLLHERSIGEKSRWAAYVNALPAQLDA 164

Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
            L+W+  E+   L  +Q+ + A    +   GT+  L+   F   PD+FP + F+  +F W
Sbjct: 165 PLFWSAEEV-ATLTGTQLLDAAAGYDSYARGTWARLKESAFDANPDVFPSDAFDEPSFLW 223

Query: 242 SFGILFSRLVRLPSMDGRVALVPWADMLNHSC-----------EVETFLDYDKSSQGVVF 290
           +FGIL SR          +ALVP  DM NHS             V       KS   ++ 
Sbjct: 224 AFGILRSRCQAPVDQGADIALVPGLDMANHSGLSSQTWTLNNGGVAAVFGGGKSGGSMLL 283

Query: 291 TTDRQYQ----PGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY 345
            T++  +     G +VF++YG +K + +L L YGF     + P   V  P+++ +SD   
Sbjct: 284 RTEKGAKGLLAKGAEVFMNYGQRKIDNQLALDYGFTDAFASRPG-YVLGPIAIPESDPNA 342

Query: 346 KEKLEALRKYGLSASECFPIQ 366
            +K++ L   GL  +  F ++
Sbjct: 343 FDKMDVLEVAGLREAPSFVLR 363


>gi|50252331|dbj|BAD28364.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor
           [Oryza sativa Japonica Group]
 gi|215769445|dbj|BAH01674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 105/354 (29%), Positives = 177/354 (50%), Gaps = 22/354 (6%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
           G  GLVA +++ +GE L  VP  L + AD+  +    G V +    P W  +A  L+ EA
Sbjct: 86  GGLGLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGRGGLRP-WVAVALLLLREA 144

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
           +    S W+ Y++ LPRQ  S ++W+  EL   ++ +Q+    +     V   +  +   
Sbjct: 145 ARGAGSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFESVEAE 203

Query: 221 IFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET-F 278
           I S+  +LFP  V FN   F W+FGIL SR+      D ++AL+P+AD++NHS ++ +  
Sbjct: 204 IISENRELFPGTVTFN--DFLWAFGILRSRVFAELRGD-KLALIPFADLVNHSDDITSKE 260

Query: 279 LDYDKSSQG-----VVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSD 330
             ++   +G     VVF+  T    + GEQ++I Y   KSN EL L YGF   E  +  D
Sbjct: 261 SSWEIKGKGLFGRDVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFT--ESNSSRD 318

Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVSPPSMKG 389
           +  L L + +SD  Y +KL+     G+  +  F I +    P +++ Y  L+    +   
Sbjct: 319 AYTLTLEISESDPFYDDKLDIAELNGMGETAYFDIVLGESLPPQMLPYLRLLCLGGTDAF 378

Query: 390 KFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
             E   A   N +    ++   + +E+A+ Q I ++C+S++  Y   ++  E L
Sbjct: 379 LLE---ALFRNAVWGHLELPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEEL 429


>gi|315039895|ref|XP_003169325.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
 gi|311337746|gb|EFQ96948.1| hypothetical protein MGYG_08872 [Arthroderma gypseum CBS 118893]
          Length = 455

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 177/368 (48%), Gaps = 28/368 (7%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL DSG +    + +    V  RG+ AL++ ++GE++L +P + + T    ++ P  
Sbjct: 1   MEAWLKDSGAIGVDGIEVADFAVTGRGVKALRSFKEGERILTIPSACLWTVKKAYADPLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWS-----NYISALPRQPYSLLYWTRA 188
           G VL+      SV D   LA YL+    F KS         ++I+A+P+   + +++T  
Sbjct: 61  GPVLRAAQPPLSVEDS--LALYLL----FVKSRTLGYEGQRHHIAAMPQSYSASIFFTDD 114

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           EL +  + S +     +    V   Y  L + + S++ DLFP + F +E +KW+   ++S
Sbjct: 115 EL-QVCKGSSLYALTPQLEQRVHDDYRQLLVALLSQHRDLFPLDQFTIEDYKWALCSIWS 173

Query: 249 RLVRLP-SMDGRVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           R +    S    V LV P ADMLNHS +V+    YD +S  +     + YQ G+Q+FI Y
Sbjct: 174 RAMDFAVSETASVRLVAPLADMLNHSPDVKQCHAYDPTSGDLSILAAKDYQVGDQIFIYY 233

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
           G   N  LL  YGFV  +  NP+DS +L L        Y++K       GL ++   P+ 
Sbjct: 234 GSVPNNRLLRLYGFVLPD--NPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLT 291

Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSC 425
           +   PL      YL +         E      + ++ +  D K  + +E Q LQF++DS 
Sbjct: 292 VKD-PLPNNVLRYLRIQRLD-----ESNITDITLRLVNGTDGKVNDGNEIQVLQFLVDSI 345

Query: 426 ESSISKYS 433
            S +  + 
Sbjct: 346 GSLLEGFG 353


>gi|168043570|ref|XP_001774257.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674384|gb|EDQ60893.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 458

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 108/391 (27%), Positives = 181/391 (46%), Gaps = 49/391 (12%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           S L  WL   G   +  A+  + +G     R L+++++I++GE++L V   L+IT +   
Sbjct: 35  SPLLAWLESRG---ETEALTSLTIGNTNQGRALLSIRHIKRGEQVLRVSRELMITPNRLP 91

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAEL 190
           SC E  E L +  V +W  LA + +      K+S W  YI  LP  R   + ++W   EL
Sbjct: 92  SCVE--ESLSE-DVNEWSRLALFQLLHKHAGKASPWEPYIRCLPPLRGLQNTVFWRDEEL 148

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
           +  L  S + ++   R T +I    DL   + +KYP+LF E V  +E+FK ++ +  SR 
Sbjct: 149 E-LLRQSNVYDQTEHRKT-LISNQFDLVQAVVNKYPELFGETV-TLESFKHAYCVASSRS 205

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
             + ++ G + +VP+ DM NH       L Y +         D+ Y  G QV I+YG   
Sbjct: 206 WGVEAL-GSITMVPFVDMFNHDSSARALLAYYEEEGYAEVVADKDYNQGSQVVITYGTLP 264

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQITG 369
           N  L L +GF   +  NP D V++ +     D    EKL+ LR +G++    C   +  G
Sbjct: 265 NSSLALDFGFTLPD--NPHDEVQIWMEAPSGDPLRAEKLKLLRDHGIATDPFCDGTESGG 322

Query: 370 -WPLELMAYAYLVVSPPSMKGK--------------------FEEMAAAASNKMT--SKK 406
            W      +    +S P+ +GK                     E MA  A  +    +++
Sbjct: 323 AW------FGLREISSPAARGKGIPRALRTFVRVISASTTKELEAMAEDAKRRQGRLAQR 376

Query: 407 DIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
            +K  + + +AL+ +LD+ E  +S +   L+
Sbjct: 377 PLKDGK-EARALKLLLDNIEQCVSSHRSALK 406


>gi|116197927|ref|XP_001224775.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
 gi|88178398|gb|EAQ85866.1| hypothetical protein CHGG_07119 [Chaetomium globosum CBS 148.51]
          Length = 555

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 177/362 (48%), Gaps = 32/362 (8%)

Query: 78  TLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           +++ WL   G +    + +    V  RG+  L+  +KGE++L +P  ++ T +  ++ P 
Sbjct: 87  SMESWLKSCGAVGLDDLELADFPVTGRGVKTLRRFKKGERILTIPSGILWTVEHAYADPL 146

Query: 137 AGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELD 191
            G VL+      SV D   LATY++   S E       ++++A P    S +++   EL+
Sbjct: 147 VGPVLRSARPPLSVED--TLATYILFIRSRESGYDGLRSHVAAFPTSYPSSIFFAEEELE 204

Query: 192 RYLEAS-----QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
                S     +  +R+IE        Y  L +R+ ++  DLFP + F++E +KW+   +
Sbjct: 205 VCAGTSLYTITKKLDRSIED------DYRTLVVRVLAQSRDLFPLDKFSIEDYKWALCTV 258

Query: 247 FSRLVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +SR +   LP  +    + P+ADMLNHS EVE    YD SS  +     + Y+ G+Q FI
Sbjct: 259 WSRAMDFVLPDGNSIRLVAPFADMLNHSSEVEPCHIYDASSGNLSVLAGKDYEAGDQAFI 318

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
            YG   N  LL  YGFV     NP+DS +L +S   S   ++ K +     GL ++    
Sbjct: 319 YYGSIPNSRLLRLYGFV--MPGNPNDSYDLVISTHPSAPFFERKQKLWASAGLDSACTIS 376

Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMA-AAASNKMTSKKDIKCPEIDE-QALQFIL 422
           + +T  PL      YL +       + +E   AA +++  +  D K  + +E + L+F++
Sbjct: 377 LTLTD-PLPKNVLRYLRIQ------RLDESDFAAIAHRQLAAADEKINDSNEVEVLRFLV 429

Query: 423 DS 424
           +S
Sbjct: 430 ES 431


>gi|323456050|gb|EGB11917.1| hypothetical protein AURANDRAFT_61181 [Aureococcus anophagefferens]
          Length = 516

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 122/449 (27%), Positives = 185/449 (41%), Gaps = 71/449 (15%)

Query: 40  IVVHCSVSTTNDASRTKTTVTQNMIPWGCE-----------------------IDSLENA 76
           +V  C       A R KTT+  +   W  E                       +D  E A
Sbjct: 18  LVQTCGGFAATHAPRPKTTLAASDDSWAVELQQAAGGAEEKEAASWLKGKNAGVDGGERA 77

Query: 77  --STLQKWLSDS----------GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKL 117
               L  WL+D+           +PP  MA+      E       RGL+A + I +  +L
Sbjct: 78  RNDALMAWLTDNDVWVSELSGWNVPPHSMALATTTFDELEGEDSGRGLLARRAITQDAEL 137

Query: 118 LFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
           + +P  L +T  S     E    L      ++  +A  LI E S    S WS YI+ LP 
Sbjct: 138 IRLPVRLCMTKASALKARELRGSLND-DTNEYIAIALLLILERSKGSRSFWSEYIAILPT 196

Query: 178 QP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
                +   W   EL  YLE S         +  +   +      +      L PE +F 
Sbjct: 197 NEDVGATFTWPAEEL-AYLEGSPAASATASMMAKLRAEH----AAVLEGNSALDPE-IFT 250

Query: 236 METFKWSFGILFSRLVRL-PSMDGRV-ALVPWADMLNHSCEVETFLD---------YDKS 284
            E ++W+F  LFSR +RL  S  G + A+VP+ D +NHS    +++D         +++ 
Sbjct: 251 FEAWQWAFTNLFSRAIRLKASRAGELLAMVPYVDFINHSPFSSSYVDAREVPKAFPWEEK 310

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
              VV   DR Y+  EQVFISYG KSN +LLL YGF      NP +SV+L +   K D  
Sbjct: 311 EDEVVLFADRAYKKFEQVFISYGPKSNADLLLLYGFALDR--NPFNSVDLAVGASKDDAL 368

Query: 345 YKEKLEALRKYGLS-ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMT 403
           Y  K    R  G   +S  FP+    +P EL+ +  +  +        + + A   +   
Sbjct: 369 YDAKERFARGAGRDVSSAAFPLYADRFPDELVQFLRMACATE------DHLGARPLDDPD 422

Query: 404 SKKDIKCPEIDEQALQFILDSCESSISKY 432
           +  DI   + +   L  I D+C+++++ Y
Sbjct: 423 NYVDILSLDNELAVLDTIRDACDAAVAAY 451


>gi|326510275|dbj|BAJ87354.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525555|dbj|BAJ88824.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 523

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 167/388 (43%), Gaps = 44/388 (11%)

Query: 5   SRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMI 64
           S T H +LLP F  L +   P        + C    +  C   T   +S  +   +    
Sbjct: 29  SGTHHRLLLPCF--LRRLPQPGS------RSCSRLRLAACHADTLLSSSGAQGPPS---- 76

Query: 65  PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDV-GERG--------LVALKNIRKGE 115
           P  C   S  +A     WL  +GLPP K+AI +  V   RG        + A +++  G+
Sbjct: 77  PAACL--SASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFVAAGQDLEAGD 134

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
               VP SLV+T +         E+L    + +   LA YL+ E    + S W  YI  L
Sbjct: 135 VAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKEL 194

Query: 176 PRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIF 222
            RQ         S L WT +ELD YL  S +R+  + R   +   YN+L         +F
Sbjct: 195 DRQRGRGQLAVESPLLWTESELD-YLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLF 253

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL 279
            +YP   P E F  E FK +F  + S +V L   S+  R ALVP    +L +    +  L
Sbjct: 254 KQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAML 313

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
                S  V    DR Y+ GE + +  G + N  LLL+YGFV  +  NP D + +  SL 
Sbjct: 314 TAVDGS--VRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLN 369

Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQI 367
             D  Y+EK    ++ G  A + F + +
Sbjct: 370 TEDPQYQEKRMVAQRNGKLAIQKFQVCV 397


>gi|326503142|dbj|BAJ99196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 425

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 167/388 (43%), Gaps = 44/388 (11%)

Query: 5   SRTFHTILLPSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMI 64
           S T H +LLP F  L +   P        + C    +  C   T   +S  +   +    
Sbjct: 24  SGTHHRLLLPCF--LRRLPQPGS------RSCSRLRLAACHADTLLSSSGAQGPPS---- 71

Query: 65  PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDV----GERG-----LVALKNIRKGE 115
           P  C   S  +A     WL  +GLPP K+AI +  V    G R      + A +++  G+
Sbjct: 72  PAACL--SASSAGGFSDWLLTNGLPPGKLAILERPVPCSRGGRDRPLHFVAAGQDLEAGD 129

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
               VP SLV+T +         E+L    + +   LA YL+ E    + S W  YI  L
Sbjct: 130 VAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGRDSLWYPYIKEL 189

Query: 176 PRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIF 222
            RQ         S L WT +ELD YL  S +R+  + R   +   YN+L         +F
Sbjct: 190 DRQRGRGQLAVESPLLWTESELD-YLNGSPMRDEVVVRDEGIKKEYNELDTLWFMAGSLF 248

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL 279
            +YP   P E F  E FK +F  + S +V L   S+  R ALVP    +L +    +  L
Sbjct: 249 KQYPFDVPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAML 308

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
                S  V    DR Y+ GE + +  G + N  LLL+YGFV  +  NP D + +  SL 
Sbjct: 309 TAVDGS--VRLLVDRPYKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLN 364

Query: 340 KSDKCYKEKLEALRKYGLSASECFPIQI 367
             D  Y+EK    ++ G  A + F + +
Sbjct: 365 TEDPQYQEKRMVAQRNGKLAIQKFQVCV 392


>gi|218202140|gb|EEC84567.1| hypothetical protein OsI_31339 [Oryza sativa Indica Group]
          Length = 649

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/351 (29%), Positives = 176/351 (50%), Gaps = 22/351 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           GLVA +++ +GE L  VP  L + AD+  +    G V +    P W  +A  L+ EA+  
Sbjct: 243 GLVAARDLPRGEVLAEVPKKLWLDADAVAASDLGGAVGRGGLRP-WVAVALLLLREAARG 301

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
             S W+ Y++ LPRQ  S ++W+  EL   ++ +Q+    +     V   +  +   I S
Sbjct: 302 AGSPWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFESVEAEIIS 360

Query: 224 KYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF-LDY 281
           +  +LFP  V FN   F W+FGIL SR+      D ++AL+P+AD++NHS ++ +    +
Sbjct: 361 ENRELFPGTVTFN--DFLWAFGILRSRVFAELRGD-KLALIPFADLVNHSDDITSKESSW 417

Query: 282 DKSSQG-----VVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVE 333
           +   +G     VVF+  T    + GEQ++I Y   KSN EL L YGF   E  +  D+  
Sbjct: 418 EIKGKGLFGRDVVFSLRTPVNVKSGEQIYIQYDLDKSNAELALDYGFT--ESNSSRDAYT 475

Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVSPPSMKGKFE 392
           L L + +SD  Y +KL+     G+  +  F I +    P +++ Y  L+    +     E
Sbjct: 476 LTLEISESDPFYDDKLDIAELNGMGETAYFDIVLGESLPPQMLPYLRLLCLGGTDAFLLE 535

Query: 393 EMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
              A   N +    ++   + +E+A+ Q I ++C+S++  Y   ++  E L
Sbjct: 536 ---ALFRNAVWGHLELPVSQDNEEAICQVIRNACKSALGAYHTTIEEDEEL 583


>gi|302809535|ref|XP_002986460.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
 gi|300145643|gb|EFJ12317.1| hypothetical protein SELMODRAFT_269129 [Selaginella moellendorffii]
          Length = 432

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 23/286 (8%)

Query: 82  WLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           W+ + GLPP K+++++ D+     R +VA ++++ G+  L VP SLV+T +         
Sbjct: 3   WMLEQGLPPCKVSLKERDLNGKTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNETIA 62

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAELD 191
           E+L    + +   LA YL+ E    K S W  +I  L RQ         S L WT  ELD
Sbjct: 63  ELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELD 122

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGI 245
            Y   S+++E  +ER+  +   Y +L         +F +YP   P E F+ E FK +F  
Sbjct: 123 EYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVA 182

Query: 246 LFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           + S +V L   S+  R ALVP    +L +    +  L    +   V    DR Y+ GEQ+
Sbjct: 183 VQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAML--KAAGDLVRLEVDRAYKKGEQI 240

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
            +  G + N  LLL+YGFV  +  NP D + +  SL   D  Y+ K
Sbjct: 241 LVWCGPQPNTRLLLNYGFV--DPDNPHDRLSVEASLNTRDPFYQNK 284


>gi|302794360|ref|XP_002978944.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
 gi|300153262|gb|EFJ19901.1| hypothetical protein SELMODRAFT_110000 [Selaginella moellendorffii]
          Length = 432

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 23/286 (8%)

Query: 82  WLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           W+ + GLPP K+++++ D+     R +VA ++++ G+  L VP SLV+T +         
Sbjct: 3   WMLEQGLPPCKVSLKERDLNGKTIRYVVASEDLKPGDLALSVPMSLVVTLERVLGNETIA 62

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-------YSLLYWTRAELD 191
           E+L    + +   LA YL+ E    K S W  +I  L RQ         S L WT  ELD
Sbjct: 63  ELLTTNKLSELACLALYLMYEKKRGKESFWYPFIRELDRQRGRGQVAVESPLLWTSEELD 122

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGI 245
            Y   S+++E  +ER+  +   Y +L         +F +YP   P E F+ E FK +F  
Sbjct: 123 EYFTGSRMKEVVLERLEGIKREYQELDTVWFMAGSLFKEYPFDIPTEAFSFEIFKQAFVA 182

Query: 246 LFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           + S +V L   S+  R ALVP    +L +    +  L    +   V    DR Y+ GEQ+
Sbjct: 183 VQSCVVHLQGVSLPRRFALVPLGPPLLAYKSNCKAML--KAAGDLVRLEVDRAYKKGEQI 240

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
            +  G + N  LLL+YGFV  +  NP D + +  SL   D  Y+ K
Sbjct: 241 LVWCGPQPNTRLLLNYGFV--DPDNPHDRLSVEASLNTRDPFYQNK 284


>gi|12718364|emb|CAC28558.1| related to histone-lysine N-methyltransferase [Neurospora crassa]
          Length = 471

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 177/371 (47%), Gaps = 44/371 (11%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSK 131
           S +  WL  SG     + +  +++ +     RG+  L+  ++GEK+L +P  ++ T    
Sbjct: 7   SNMNSWLKQSG----AVGLDSLELADFPDTGRGVKTLRPFKEGEKILTIPAGILWTVKHA 62

Query: 132 WSCPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWT 186
           ++ P  G  L+      SV D   LATY++   S E       ++I+ALP    S + + 
Sbjct: 63  YADPLLGPALRSAQPPLSVED--TLATYILFVKSRESGYDGQRSHIAALPASYSSSILFA 120

Query: 187 RAELDR------YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET-- 238
             +L+       Y    Q+ E++IE        +  L +R+F ++PDLFP + F +E   
Sbjct: 121 EDDLEACAGTSLYTITKQL-EQSIED------DHRALVVRLFVQHPDLFPLDKFTVEDVG 173

Query: 239 --FKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
             +KW+   ++SR +     DG     L P+ADMLNH+ EV+    YD SS  +     +
Sbjct: 174 LHYKWALCTVWSRAMDFVLADGNSIRLLAPFADMLNHTSEVKQCHVYDPSSGTLSVFAGK 233

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
            Y+ G+QVFI+YG   N  LL  YGFV     NP+DS +L LS       +++K +    
Sbjct: 234 DYEAGDQVFINYGPVPNSRLLRLYGFVIP--GNPNDSYDLVLSTHPQAPFFEQKQKLWVS 291

Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
            GL ++   P+ +T  PL      YL +      G      A  + + T   D K  + +
Sbjct: 292 AGLDSTATIPLTLTD-PLPKKVLRYLRIQRLDASG-----LAVIARQQTDATDGKISDSN 345

Query: 415 E-QALQFILDS 424
           E + L+F+++S
Sbjct: 346 EVEILRFLVES 356


>gi|322697804|gb|EFY89580.1| putative histone-lysine N-methyltransferase [Metarhizium acridum
           CQMa 102]
          Length = 466

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 169/374 (45%), Gaps = 24/374 (6%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL  SG +    + +    V  RG+ A +  ++GE++L +P +L  T     +    
Sbjct: 1   METWLEQSGAVGLDGLEVADFPVTGRGVKARRRFKQGERILTIPSALHWTVQHAQADSLL 60

Query: 138 GEVLKQCSVP--DWPLLATYLISEASFEKSSRW-SNYISALPRQPYSLLYWTRAELDRYL 194
           G  L+    P      LA Y++   S E       ++++ALP    S +++T  EL+   
Sbjct: 61  GPALRSARPPLTVEDTLAVYVLFVRSRESGYNGPRSHVAALPTSYSSSIFFTEDELEVCA 120

Query: 195 EAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
             S   I ++  +RI +    Y DL  R+    PDLFP   F +  +KW+   ++SR + 
Sbjct: 121 GTSLYTITKQLKQRIED---DYKDLIARVLGPRPDLFPLNKFTIHHYKWALCTVWSRAMD 177

Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
               DG     L P+ADMLNHS E +    YD S+  +     + Y+ G+QV+I YG   
Sbjct: 178 FELYDGSSMRLLAPFADMLNHSSESKQCHVYDASTGNLSILAGKDYEAGDQVYIHYGSIP 237

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N  LL  YGFV  +  NP+DS +L L+       +++K +     GL A+    + +   
Sbjct: 238 NSRLLRLYGFVIPD--NPNDSYDLVLATHPMAPFFEQKQKLWALAGLDATCTISLTLAN- 294

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCE 426
           PL      YL +       + +E   A  N   S  D    +I      Q LQF+++S  
Sbjct: 295 PLPKSVLCYLRIQ------RLDESDLAVINLQQSNTDTAFEKISNSNEVQVLQFLVESIT 348

Query: 427 SSISKYSRFLQVKE 440
           S +  +   L+  E
Sbjct: 349 SLLDSFGTQLEKLE 362


>gi|296804474|ref|XP_002843089.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845691|gb|EEQ35353.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 455

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 176/366 (48%), Gaps = 24/366 (6%)

Query: 79  LQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL DSG      + +    V   G+ AL++ ++GE++L +P + + T +  ++ P  
Sbjct: 1   MEAWLKDSGARGVDGIEVANFAVTGSGVKALRSFKEGERILTIPSACLWTVEKAYADPLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
           G VL+      SV D   LA YL+   S  S  +  R  ++I+A+P+   + +++T  EL
Sbjct: 61  GPVLRSAQPPLSVED--ALAVYLLFVRSRTSGYEGQR--HHIAAMPQSYSASIFFTEDEL 116

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
            +    S +     +    V   Y  L + + S++ DLFP + F +E +KW+   ++SR 
Sbjct: 117 -QVCAGSSLYALTRQLEQRVRDDYRQLLVPLLSQHRDLFPLDQFTIEDYKWALCSIWSRA 175

Query: 251 VRLP-SMDGRVALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           +    S    V LV P ADMLNHS +V+    YD +S  +     + YQ G+QVFI YG 
Sbjct: 176 MDFAVSGTTSVRLVAPLADMLNHSPDVKQCHAYDPTSGDLSILAAKDYQVGDQVFIYYGS 235

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
             N  LL  YGFV  +  NP+DS +L L        Y++K       GL ++   P+ + 
Sbjct: 236 VPNNRLLRLYGFVLPD--NPNDSYDLVLQTSPLAPLYEQKERLWALAGLDSTCTIPLTVK 293

Query: 369 GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCES 427
             PL      YL +         E      + ++ +  D K  + +E Q LQF++DS  S
Sbjct: 294 D-PLPNNVLRYLRIQRLD-----ESNITDITLQLVNGTDGKVSDGNEMQVLQFLVDSIGS 347

Query: 428 SISKYS 433
            +  + 
Sbjct: 348 LLEGFG 353


>gi|340520781|gb|EGR51016.1| N-methyltransferase [Trichoderma reesei QM6a]
          Length = 470

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 183/380 (48%), Gaps = 34/380 (8%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL+  G +    + +    V  RG+  L+  ++GE +L +P  ++ + +  +S P  
Sbjct: 1   MESWLTKVGAVGLSDLELTDFPVTGRGVKTLRRFKQGEMILTIPSDVLWSVEHAYSDPNL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G  L+      SV D  +LATY++   S E        ++SALP    S ++++  EL+ 
Sbjct: 61  GPALRSVMPPLSVED--ILATYILFVRSRESGYDGLRTHVSALPGIYSSSIFFSEGELEV 118

Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET----------FK 240
               S   + ++  +RI +    Y  L +R+F+++PDLFP + F +E           +K
Sbjct: 119 CAGTSLYTVTKQLEQRIKD---DYRQLAVRLFAQHPDLFPLQKFTIEDVRLLRRATDPYK 175

Query: 241 WSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
           W+   ++SR +     DG     L P+ADMLNHS EV+    YD  S  +     + Y+ 
Sbjct: 176 WALCTVWSRSMDFTLPDGSSIRLLAPFADMLNHSSEVKQCHAYDVKSGDLSVFAGKDYEI 235

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           G+QV+I YG   N  LL  YGFV  +  NP+DS +L L+       Y++K +     GL 
Sbjct: 236 GDQVYIYYGPIPNNRLLRLYGFVIPD--NPNDSYDLVLTTHPMAPFYEQKQKLWVSAGLD 293

Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QA 417
           ++    + +T  PL      YL +    + G      AA + +     D K  + +E + 
Sbjct: 294 STTTVALTLTD-PLPKNILRYLRIQRADVSG-----LAAITLQQIDGTDEKISDSNEMEI 347

Query: 418 LQFILDSCESSISKYSRFLQ 437
           L+F+ +S  S ++ ++  L+
Sbjct: 348 LRFLEESISSLLNCFATPLE 367


>gi|345561352|gb|EGX44442.1| hypothetical protein AOL_s00188g347 [Arthrobotrys oligospora ATCC
           24927]
          Length = 468

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 176/357 (49%), Gaps = 19/357 (5%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL +SG +    + +       RGL +L++ ++GE++L +P S++ T +  ++    
Sbjct: 1   MESWLKESGAVGLDDLKLADFPATGRGLGSLRHFKEGERILTIPSSILWTVEHAYADSII 60

Query: 138 GEVLK--QCSVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYL 194
             VL+  Q ++     LA Y++   S E   +   +++ ALP    S +++T  EL+   
Sbjct: 61  RPVLQSMQGALSVDDTLAIYILFVRSRESGYNGLRSHVEALPTSYSSSIFFTDDELEVCA 120

Query: 195 EAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            +S   I ++  ++I +    Y  L  R+F +Y D+F    F +E +KW+   ++SR + 
Sbjct: 121 GSSLYTITKQLKQQIQD---DYRTLVERLFGQYLDIFSLGKFTIEDYKWALCTVWSRAMD 177

Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
               DG+    L P+ADMLNHS +V+    YD SS  +     + Y+PG+QVFI+YG   
Sbjct: 178 FVQPDGKSIRLLAPFADMLNHSSDVKKCHVYDTSSGDLSILAGKDYEPGDQVFINYGSIP 237

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N  LL  YGFV     NP+DS +L L  +     ++ K +     GL +     + +   
Sbjct: 238 NNRLLRLYGFVV--PNNPNDSYDLVLMTQPEAPFFELKQKLWVSAGLDSVSTISLSLND- 294

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS-CE 426
           PL      YL +     +    ++A  A  ++ +   I     +E+ LQ +++S CE
Sbjct: 295 PLPKSVLQYLRI----QRADESDLAIIALQQIDATDKILSNSNEEKVLQALIESFCE 347


>gi|156361027|ref|XP_001625323.1| predicted protein [Nematostella vectensis]
 gi|156212150|gb|EDO33223.1| predicted protein [Nematostella vectensis]
          Length = 447

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/274 (32%), Positives = 134/274 (48%), Gaps = 28/274 (10%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           EN  +L KW   +G+  +K+         RG++A++ I   E ++ VP  L+ITA S   
Sbjct: 48  ENYISLLKWAKRNGMVFKKIRPAIFSSTGRGMLAIERIHSSECVISVPERLLITASSVLE 107

Query: 134 CPEAGEVLKQC-----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
                 V ++      S  D+ LL  +L+ E   EK S W+ YI  LP    +  Y+TR 
Sbjct: 108 SAIGNYVAERMKGGAKSSNDY-LLVLFLMYEKYLEKGSFWAPYIRTLPDTFNTPCYFTRK 166

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK-YPDLFPE------EVFNMETFKW 241
           EL  +L   Q RE+A E++T +  +Y     + F+K Y D+  +         + E+FKW
Sbjct: 167 EL--FLLPEQCREQAFEQVTQIKQSY-----KSFAKAYNDVLQDFDCNFWRTVDFESFKW 219

Query: 242 SFGILFSRLV-------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
           ++ ++ +R V       R   +DG  AL P  D+LNH  + E    ++ SS+        
Sbjct: 220 AWCVVNTRSVYHDEPNRRAQPIDGNCALAPLLDLLNHCDKAEMCGRFNSSSKNYEINVIT 279

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFV-PREGTN 327
           +YQ G QVFI+YG   N  L L YGFV PR   N
Sbjct: 280 EYQKGTQVFINYGPHDNTRLFLEYGFVLPRNVHN 313


>gi|358395377|gb|EHK44764.1| hypothetical protein TRIATDRAFT_80097 [Trichoderma atroviride IMI
           206040]
          Length = 463

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 185/368 (50%), Gaps = 29/368 (7%)

Query: 79  LQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL++SG      + + +  V  RG+   +  ++GE++L +P   + T +   S P  
Sbjct: 1   MEDWLNESGAAGLNDLELAEFPVTGRGVRTRRRFQQGERILTIPGDSLWTVEHADSDPLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLI----SEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
           G VL+      SV D   LA YL+     E  +E      ++++A+P +  S +++   E
Sbjct: 61  GPVLRSVQPPLSVED--TLAVYLLFVRLREHGYEGPR---SHVAAMPARYSSSIFFNEDE 115

Query: 190 LDRYLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
           L+     S   I ++  ERI +    Y  L +R+F+++PDL P    +++ +KW+   ++
Sbjct: 116 LEVCAGTSLYTITKQLEERIED---DYRVLVMRVFTQHPDLLPLAKISIQDYKWALCTVW 172

Query: 248 SRLVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           SR +   LP+      L P+ADM+NHS EV+    YD SS  +     + Y+ G+Q++IS
Sbjct: 173 SRAMDFVLPNGKPLRVLAPFADMINHSPEVKQCHAYDPSSGNLSVLAGKDYEIGDQIYIS 232

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG   N  LL  YGFV  E  NP+DS +L LS       Y++K +     GL ++   P+
Sbjct: 233 YGSIPNNRLLRLYGFVIPE--NPNDSYDLVLSTHPMAPFYEQKQKLWASAGLDSASTIPL 290

Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
            +   PL      YL +     +    ++AA A  K+ + + I   + + + LQF+++S 
Sbjct: 291 TLID-PLPKSVLRYLRIQ----RLDASDLAAIALQKLDTNEKISNSK-EVEILQFLVESI 344

Query: 426 ESSISKYS 433
            + +  ++
Sbjct: 345 SALLDGFN 352


>gi|307107385|gb|EFN55628.1| hypothetical protein CHLNCDRAFT_57818 [Chlorella variabilis]
          Length = 435

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 163/361 (45%), Gaps = 35/361 (9%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQ-------KVDVGERGLVALKNIRKGEKLLFVPPSL 124
           ++E  S + +WL++SG P QK+ +Q       +VD+     VA + ++ G+  L +P  L
Sbjct: 2   TVERPSKMMQWLTESGAPQQKVKLQTVVREGTEVDI----TVAAEALQPGDVALRIPEHL 57

Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ------ 178
           ++T D         E++    + +   L  YL  E    K   W  +I  L R       
Sbjct: 58  IVTLDRVLEDNTLAELVTTGKLSELACLTLYLAYEKKRGKEGCWYRFIKELDRMQGRGSQ 117

Query: 179 -PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPE 231
              S L W   +    L  S +      R+  +   Y +L         +F++ P   P 
Sbjct: 118 GAKSPLLWDEGQAAELLAGSPVVGEIEARLQGIRKEYEELDTVWYLAGSLFNRQPFSPPT 177

Query: 232 EVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGV 288
           E F+   F+ +F  + S +V L   ++  R ALVP    +L +S   +  L +D  S  V
Sbjct: 178 EQFSFPVFRQAFTAVQSSVVHLQGVALGKRFALVPMGPPLLTYSSTAKAMLKFDPESHEV 237

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
               DR YQPGE V    G + N  LL++YG V  + +NP D + L +++   D  Y+ K
Sbjct: 238 RLAVDRAYQPGEAVLAWCGPQPNSRLLINYGIV--DESNPYDKLPLSITIPSDDPLYRLK 295

Query: 349 LEALRKYGLSASECFPIQITG-WPLELMAYAYLVVSP--PSMKG-KFEEMAA--AASNKM 402
            + L + GLS  + F +Q     P +L+ Y  LV S     ++G K+EE A   A  N++
Sbjct: 296 RDRLAERGLSTQQTFQLQAAASLPAQLLPYLRLVHSTREADVEGVKWEEEAGPVAPENEL 355

Query: 403 T 403
           T
Sbjct: 356 T 356


>gi|348676999|gb|EGZ16816.1| hypothetical protein PHYSODRAFT_251772 [Phytophthora sojae]
          Length = 424

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 83/250 (33%), Positives = 131/250 (52%), Gaps = 18/250 (7%)

Query: 81  KWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           +WL D+G   P  +  +   + G RG VA   +  GE +L +P  L+I+ D  W  P+ G
Sbjct: 16  QWLRDNGATFPKLQWPVTSPN-GLRGTVAAAAVASGEPMLCIPRRLLISEDLCWRDPQLG 74

Query: 139 EVL---KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            V    +     D P+LA +L+ E      S +  Y++ LP  P S+  WT+AEL   L 
Sbjct: 75  RVFQDNRDVFTRDDPVLALFLVRELLLADRSFFHPYLAVLP-YPESVQDWTQAELGE-LH 132

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RL 253
             ++ + A  R + +   Y  + +R+ +KYP  FPE ++  + FK+++  + +R    RL
Sbjct: 133 DERLVDAAARRTSEIDVYYRRVMVRLQTKYPGEFPEALYTFDRFKFAWKTIQARTFGRRL 192

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF---TTDRQYQPGEQVFISYGKKS 310
           P      ALVP+AD LNH+  V T  D+D +  G+     +    +  G +VF SYG++S
Sbjct: 193 PW----TALVPFADCLNHT-NVATKYDFDVNDNGLFRLYPSGATSFAQGAEVFNSYGRRS 247

Query: 311 NGELLLSYGF 320
           N +LLL YGF
Sbjct: 248 NFQLLLDYGF 257


>gi|453087416|gb|EMF15457.1| SET domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 454

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 167/367 (45%), Gaps = 26/367 (7%)

Query: 79  LQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL  SG      + +       RG+ AL+   + EK+L +P  L+ T    ++ P  
Sbjct: 1   MEIWLKQSGASGLDDLELADFSDAGRGIRALRRFEEKEKILTIPHGLLWTVKRAYADPVL 60

Query: 138 GEVLKQC----SVPDWPLLATY-LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           G +L       SV D   LATY L   A         ++++ALP    S +++  AEL+ 
Sbjct: 61  GPLLSSTRPPLSVDD--TLATYILFIRARKSGYDGPQSHVAALPASYSSSIFFADAELE- 117

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
               S +          +   Y DL  R+F ++ D+FP + F ++ +KW+   ++SR + 
Sbjct: 118 ICAGSSLYTTTKHLARQIEVDYKDLVARLFGRHRDVFPSDKFTIDDYKWALCTVWSRAMD 177

Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
               DG     + P+ADMLNHS +V     YD  S  +     + Y+PG+QVFI+YG   
Sbjct: 178 FKLRDGESIRLMAPFADMLNHSPDVGQCHVYDPQSGNLSILAGKSYEPGDQVFINYGPIP 237

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N  L   YGFV     NP+DS +L LS       +++K +     GL ++    + +T  
Sbjct: 238 NNRLSRLYGFV--VPGNPNDSYDLVLSTHPMAPFFEQKHKLWIAAGLDSTSTVSLTLTD- 294

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCE 426
           PL      YL +       +  E   AA    T + D+   +I +    + L F+++S  
Sbjct: 295 PLPRSVLRYLRIQ------RLNETDLAAVG--TRQSDVAFEKISDSNETEVLTFLVESIS 346

Query: 427 SSISKYS 433
           + +  ++
Sbjct: 347 ALLDGFT 353


>gi|302816067|ref|XP_002989713.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
 gi|300142490|gb|EFJ09190.1| hypothetical protein SELMODRAFT_447801 [Selaginella moellendorffii]
          Length = 400

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 135/259 (52%), Gaps = 15/259 (5%)

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP-DWPLLATYLISEA 160
           +RGL A ++IR GE+++ +P  LV+TA+    C     V K  S   DW  L   +++E 
Sbjct: 9   KRGLFAARSIRAGEQIVRIPHELVLTAEKLDDC-----VKKLLSTEYDWCPLTLLILAEQ 63

Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
              ++SRW+ Y+S LP     +S ++W + EL ++LE ++      ER   +   YN ++
Sbjct: 64  HKGEASRWAPYVSCLPSFGDHHSTIFWGKEEL-KFLECTRAFRGTAERREMISDEYNSVK 122

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF 278
             + S  P +F E++ ++  F  ++  + SR     ++   +++ P+ D  NH       
Sbjct: 123 -DVISSCPHVFGEDI-SLFQFAHAYATVVSRAWN-GALSSEISMRPFVDFCNHDPVSHAT 179

Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           + +D      +   +R Y  GE+VFISYGK+SN  L + YGFV     N SD  EL + +
Sbjct: 180 VSHDTCKDATIIA-ERDYTKGEEVFISYGKRSNAVLAVDYGFVL--PNNLSDQAELWMEI 236

Query: 339 KKSDKCYKEKLEALRKYGL 357
             +D   ++KLE +R + +
Sbjct: 237 PWNDPLREKKLELMRAFNM 255


>gi|356521657|ref|XP_003529470.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/345 (30%), Positives = 161/345 (46%), Gaps = 27/345 (7%)

Query: 37  GHRIVVHCSVSTTNDAS--RTKTTVTQNMIP-WGCEIDSLENASTLQKWLSDSGLPPQKM 93
           G  ++ H   S ++  S    K  ++ N +   G E+       T  +WL + G+   K 
Sbjct: 10  GSAVLFHSRNSFSSKGSFLHLKRPLSANCVASLGTEVSVSPAVDTFWQWLKEEGVVSGKT 69

Query: 94  AIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-VPDWPL 151
            ++   V E  GLVALK+I + E +L VP  L I  D+      A E+ K CS +  W  
Sbjct: 70  PVKPGVVPEGLGLVALKDISRNEVVLQVPKRLWINPDA----VAASEIGKVCSGLKPWLA 125

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           +A +LI E S    S W +Y S LP++  S +YW+  EL   L+ +Q+          V 
Sbjct: 126 VALFLIRERS-RSDSLWKHYFSILPKETDSTIYWSEEELSE-LQGTQLLNTTRSVKQYVQ 183

Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLPSMDGRVALVPWADMLN 270
             +  L   I      LFP  +  ++ F W+FGIL SR   RL + +  + ++P AD++N
Sbjct: 184 NEFRRLEEEIIIPNKKLFPSSI-TLDDFFWAFGILRSRAFSRLRNEN--LVVIPLADLIN 240

Query: 271 HSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFISYG-KKSNGELLLSYGF 320
           HS  V T    Y+      +F+ D  +        + G+QV+I Y   KSN EL L YGF
Sbjct: 241 HSARVTTDDHAYEIKGAAGLFSWDYLFSLRSPLSLKAGDQVYIQYDLNKSNAELALDYGF 300

Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           +  E     ++  L L + +SD  + +KL+     G   +  F I
Sbjct: 301 I--EPNTDRNAYTLTLQISESDPFFGDKLDIAESNGFGETAYFDI 343


>gi|384246822|gb|EIE20311.1| hypothetical protein COCSUDRAFT_48681 [Coccomyxa subellipsoidea
           C-169]
          Length = 539

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 90/318 (28%), Positives = 144/318 (45%), Gaps = 22/318 (6%)

Query: 84  SDSGLPPQKMAIQKVDVGERGL---VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           +D G P Q ++I++V      L   VA ++++ GE  L +P  LVIT D  +      E+
Sbjct: 23  TDHGAPQQGVSIKEVVQEGNTLDVSVAARDLQAGELALRIPDHLVITLDRVFEDESLAEL 82

Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-------QPYSLLYWTRAELDRY 193
           L    + +   L  YL+ E    + S W  +I  L R          S L W   ++D Y
Sbjct: 83  LTTDKLSELACLTLYLMYEKKNGRQSVWYEFIKELDRIQGRGQMGAKSPLLWDEGQVDEY 142

Query: 194 LEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILF 247
           L  S +     ER+  +   Y +L         +F  YP   P E F+++ F+  F  + 
Sbjct: 143 LAGSPLVAEIKERLKGIEKEYAELDTVWFMAGSLFKSYPYDVPTEAFSLKLFRQGFAAVQ 202

Query: 248 SRLVRLPS--MDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           + +V L    +  R ALVP    +L++S   +  L Y++ ++ V    DR Y  GE +  
Sbjct: 203 ASVVHLQGVPLSKRFALVPLGPPLLSYSSTAKAMLTYNREAKEVQLAVDRSYTKGEPIEA 262

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
             G + N  LLL+YG V     NP D + L ++L  +D  ++ K   L++  LS  + F 
Sbjct: 263 WCGPQPNRRLLLNYGIVT--DNNPHDKMALTVTLPHADPLFQAKRAVLQQNNLSTQQTFQ 320

Query: 365 IQ-ITGWPLELMAYAYLV 381
           +Q   G P  L+ Y  L 
Sbjct: 321 LQRDKGLPELLLPYLRLA 338


>gi|326492674|dbj|BAJ90193.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 174/352 (49%), Gaps = 43/352 (12%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWS------CPEAGEVLKQCSVPDWPLLATYLI 157
           GLVA +N+ +GE +  VP  L + AD+  +      C   G++        W  ++  ++
Sbjct: 42  GLVAERNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRP------WVSVSLLIL 95

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            EA+    S W+ Y++ LPRQ  S ++W+  EL   ++ +Q+    +     V   ++++
Sbjct: 96  REAARGGDSLWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFDNV 154

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVE 276
              I +   DLFP  +   + F W+FG+L SR+   P + G ++AL+P+AD++NH+ ++ 
Sbjct: 155 EAGIINVNKDLFPGTI-TFDDFLWAFGVLRSRV--FPELRGDKLALIPFADLINHNGDIT 211

Query: 277 T-----------FLDYDKSSQGVVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVP 322
           +           FL  D      VF+  T    + GEQ+++ Y   KSN EL L YGF  
Sbjct: 212 SKESCWEIKGKGFLGRD-----TVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFT- 265

Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT-GWPLELMAYAYLV 381
            E  +  DS  L L + +SD  Y++KL+     G+  +  F + +    P +++ Y  L+
Sbjct: 266 -ESNSSRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLL 324

Query: 382 VSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKY 432
               +     E   A   NK+    ++     +E+++ Q I ++C+S+++ Y
Sbjct: 325 CLGGTDAFLLE---ALFRNKVWEHLELPVSRDNEESICQVIQNACKSALAAY 373


>gi|356577306|ref|XP_003556768.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Glycine
           max]
          Length = 487

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 148/312 (47%), Gaps = 24/312 (7%)

Query: 67  GCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLV 125
           G E+       T  +WL + G+   K  ++   V E  GLVALK+I + E +L VP  L 
Sbjct: 43  GTEVSVSPAVDTFWQWLKEEGVVSAKTPVKPSVVPEGLGLVALKDISRNEVVLQVPKRLW 102

Query: 126 ITADSKWSCPEAGEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
           I  D+      A E+ K C  +  W  +A +LI E S   +S W +Y S LP++  S +Y
Sbjct: 103 INPDA----VAASEIGKVCIGLKPWLAVALFLIRERS-RSNSLWKHYFSVLPKETDSTIY 157

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           W+  EL   L+ +Q+          V   Y  L   I      LFP  +  ++ F W+FG
Sbjct: 158 WSEEELSE-LQGTQLLNTTRSVKQYVENEYRRLEEEIILPNKKLFPSPL-TLDDFFWAFG 215

Query: 245 ILFSR-LVRLPSMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY------ 296
           IL SR   RL + +  + ++P+AD +NHS  V T    Y+      +F+ D  +      
Sbjct: 216 ILRSRAFSRLRNEN--LVVIPFADFINHSARVTTEDHAYEIKGAAGLFSWDYLFSLRSPL 273

Query: 297 --QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
             + G+QV+I Y   KSN EL L YGF+  E     ++  L L + +SD  + +KL+   
Sbjct: 274 SLKAGDQVYIQYDLNKSNAELALDYGFI--EPNADRNAYTLTLQISESDPFFGDKLDIAE 331

Query: 354 KYGLSASECFPI 365
             G   +  F I
Sbjct: 332 SNGFGETAYFDI 343


>gi|242053769|ref|XP_002456030.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
 gi|241928005|gb|EES01150.1| hypothetical protein SORBIDRAFT_03g029140 [Sorghum bicolor]
          Length = 512

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 142/311 (45%), Gaps = 30/311 (9%)

Query: 82  WLSDSGLPPQKMAIQK---------VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           WL   GLPP K+ I++          D+  R + A  +++ G+    VP SLV+T +   
Sbjct: 81  WLRARGLPPGKVDIRERPVPCLLNGKDLPLRYVAAGVDLQAGDVAFEVPMSLVVTLERVL 140

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L R          S L W
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           T +ELD YL  S +++  + R   +   YN+L         +F +YP   P E F  E F
Sbjct: 201 TESELD-YLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 259

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L +    +  L  D  S  V    DR Y
Sbjct: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTADGDS--VRLVVDRPY 317

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE + I  G ++N  L+L+YGFV  +  NP D + +  SL   D  Y+EK    ++ G
Sbjct: 318 KAGEPIIIWCGPQTNSRLVLNYGFVDED--NPFDRIAIEASLNSEDPQYQEKRMVAQRNG 375

Query: 357 LSASECFPIQI 367
             A + F + +
Sbjct: 376 KLAIQNFNVYV 386


>gi|302814473|ref|XP_002988920.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
 gi|300143257|gb|EFJ09949.1| hypothetical protein SELMODRAFT_129035 [Selaginella moellendorffii]
          Length = 389

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 171/378 (45%), Gaps = 57/378 (15%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL A + I  GE +L V   L+IT +     PE    L    V  W  LA +L++    
Sbjct: 1   RGLFASRPIHTGECMLHVSHDLMITPEK---LPEEVTKLLSKDVSAWAKLALFLLAHQKK 57

Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNV----IGTYND 216
           +++S W+ YIS LP     +S ++WT+ EL  YL+ S +    ++R   V        N 
Sbjct: 58  KETSAWAPYISCLPPFGSMHSTIFWTQDEL-VYLKVSPVYRETVQRKDVVRMEFAAAENA 116

Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
           L L      P +F   V  +E FK ++  + SR   + ++   +ALVP+ D  NH     
Sbjct: 117 LLL-----CPHIFGSRVSALE-FKHAYATVCSRAWGIETIKS-LALVPFVDFFNHDANCR 169

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVE-L 334
             L YD+        +DR Y  G+QV ISYG+ SN  L L +GF +P    NP D V  +
Sbjct: 170 AMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALP---FNPHDQVAGI 226

Query: 335 PLSLKKSDKCYKEKLEALRKYGL----------SASECFPIQIT--------GWPLELMA 376
            LSL + D     KL+ L  + +          +A   F +Q          G P  L A
Sbjct: 227 WLSLSEKDPLRDSKLKLLHSHNMQTCVTREGVDTAGSSFSLQEVKSKAGRGKGIPQTLRA 286

Query: 377 YAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFILDSCESSISKY 432
           +A +V +  S   + +EMA  A++  T  +  + P ID+    +A+  +    ++ I K+
Sbjct: 287 FARVVCATTS--EELDEMAKFAAD--TDGRLARRPSIDKTKEHKAMTLLQTVIDNRIQKH 342

Query: 433 SR---------FLQVKEL 441
            +          L VKE+
Sbjct: 343 EQAALVLVSLFLLMVKEI 360


>gi|22326803|ref|NP_196930.2| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|30684815|ref|NP_851038.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|42573363|ref|NP_974778.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|17473570|gb|AAL38260.1| putative protein [Arabidopsis thaliana]
 gi|23297671|gb|AAN13005.1| unknown protein [Arabidopsis thaliana]
 gi|332004624|gb|AED92007.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004625|gb|AED92008.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
 gi|332004626|gb|AED92009.1| Rubisco methyltransferase family protein [Arabidopsis thaliana]
          Length = 514

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/316 (28%), Positives = 149/316 (47%), Gaps = 27/316 (8%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
           +++  L+ W+  +GLPP K+ +++    ++       + A ++++KG+    VP SLV+T
Sbjct: 78  DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
           S L W+ AELD YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256

Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
           + E FK +F  + S +V L +  +  R ALVP    +L +    +  L        V   
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 314

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            DR Y+ G+ + +  G + N +LLL+YGFV  +  NP D V +  +L   D  Y++K   
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVIVEAALNTEDPQYQDKRMV 372

Query: 352 LRKYGLSASECFPIQI 367
            ++ G  + + F +++
Sbjct: 373 AQRNGKLSQQVFQVRV 388


>gi|3403236|gb|AAC29137.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase I [Spinacia oleracea]
          Length = 491

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 174/389 (44%), Gaps = 29/389 (7%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT 127
           E D+         WLSD G+   K  ++   V E  GLVA K+I + E +L VP    I 
Sbjct: 49  ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108

Query: 128 ADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
            D+      A E+   C+ +  W  +A +L+ E     SS W  YI  LP    S +YW+
Sbjct: 109 PDTV----AASEIGSVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWS 164

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
             EL   L+ SQ+    +     V   +  L   +   +  LFP +V   + F W+FG+L
Sbjct: 165 EEELSE-LQGSQLLNTTLGVKELVANEFAKLEEEVLVPHKQLFPFDV-TQDDFFWAFGML 222

Query: 247 FSRLVRLPSMDGR-VALVPWADMLNHSCEVETFLDYDKSSQG-------VVFTTDRQYQP 298
            SR      ++G+ + L+P AD+ NHS ++ T   Y    +G       +VF+  R   P
Sbjct: 223 RSR--AFTCLEGQSLVLIPLADLANHSPDI-TAPKYAWEIRGAGLFSRELVFSL-RNPTP 278

Query: 299 ---GEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
              G+QV I Y   KSN EL L YG    E  +  ++  L L + +SD  Y +KL+    
Sbjct: 279 VKAGDQVLIQYDLNKSNAELALDYGLT--ESRSERNAYTLTLEIPESDSFYGDKLDIAES 336

Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC-PEI 413
            G+  S  F I +   PL      YL +     +  F  + +   N +    D+   P  
Sbjct: 337 NGMGESAYFDI-VLEQPLPANMLPYLRLVALGGEDAF-LLESIFRNSIWGHLDLPISPAN 394

Query: 414 DEQALQFILDSCESSISKYSRFLQVKELL 442
           +E   Q I D+C S++S YS  +   E L
Sbjct: 395 EELICQVIRDACTSALSGYSTTIAEDEKL 423


>gi|440464432|gb|ELQ33864.1| hypothetical protein OOU_Y34scaffold00857g1 [Magnaporthe oryzae
           Y34]
          Length = 464

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 99/358 (27%), Positives = 176/358 (49%), Gaps = 27/358 (7%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL ++G +    + +    +  RG+  L++ ++GEK+L +P   + T +   +    
Sbjct: 1   MESWLKETGAVGLDDLELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G  L+      SV D  +LATY++   S E       ++++ALP    S +++   EL+ 
Sbjct: 61  GPALRSIRPPLSVED--ILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAGEELEV 118

Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
               S   I ++  +RI +    Y  L +R+  ++ DLFP E F +E +KW+   ++SR 
Sbjct: 119 CAGTSLYTITKQLEQRIED---DYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRA 175

Query: 251 VR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           +   LP  +    L P+ADMLNHS  V+    YD SS+ +     + Y+ G+QVFI YG 
Sbjct: 176 MDFVLPGGNSIRLLAPFADMLNHSDNVKQCHAYDSSSKTLSVLAGKDYEAGDQVFIYYGP 235

Query: 309 KSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            SN  LL  YGFV P    N +D+ +L L+       +  KL+      L ++    + +
Sbjct: 236 VSNSRLLRLYGFVLP---GNSNDNYDLVLATHPEAPFFARKLKLWASARLDSTSTISLTL 292

Query: 368 TG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
           T   P +++ Y  +  S  S      E+A  A  ++ + + I     + + L+F+++S
Sbjct: 293 TDPLPNDVLRYLRIQRSGAS------ELAGMACQRIDATEKIS-DSNEVEVLRFLVES 343


>gi|17367341|sp|Q43088.1|RBCMT_PEA RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=PsLSMT; Short=RuBisCO LSMT;
           Short=RuBisCO methyltransferase; Short=rbcMT; Flags:
           Precursor
 gi|508551|gb|AAA69903.1| ribulose-1,5 bisphosphate carboxylase large subunit
           N-methyltransferase [Pisum sativum]
          Length = 489

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 26/302 (8%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           T  KWL + G+   K  ++   V E  GLVALK+I + + +L VP  L I  D+      
Sbjct: 56  TFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA----VA 111

Query: 137 AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           A E+ + CS +  W  +  +LI E S E S  W +Y   LP++  S +YW+  EL   L+
Sbjct: 112 ASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE-LQ 169

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLP 254
            SQ+ +  +     V      L   I      LFP+ V  ++ F W+FGIL SR   RL 
Sbjct: 170 GSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSRLR 228

Query: 255 SMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFIS 305
           + +  + +VP AD++NHS  V T    Y+      +F+ D  +        + GEQV+I 
Sbjct: 229 NEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQ 286

Query: 306 YG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           Y   KSN EL L YGF+ P E  +   +  L L + +SD  + +KL+     G + +  F
Sbjct: 287 YDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTAYF 343

Query: 364 PI 365
            I
Sbjct: 344 DI 345


>gi|240278777|gb|EER42283.1| conserved hypothetical protein [Ajellomyces capsulatus H143]
 gi|325090312|gb|EGC43622.1| conserved hypothetical protein [Ajellomyces capsulatus H88]
          Length = 471

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 153/321 (47%), Gaps = 24/321 (7%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL  SG +    + +    V  RG+  L++ ++GE++L +P  ++ T +  ++    
Sbjct: 1   MEGWLKQSGAVGLDVLELADFQVIGRGVKTLRHFKEGERILTIPSDVLWTVEHAYADSLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G  L       SV D   LATY++   S E   +   ++++ALP+   S +++T  EL+ 
Sbjct: 61  GPTLHSARPPLSVDD--TLATYILFVRSRESGYNGLRSHLAALPKSYSSSIFFTEDELEV 118

Query: 193 YLEASQIRERAIERITNVIG-----TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
               S      +  IT  +G      Y  L +R+  ++ DLFP   F +E +KW+   ++
Sbjct: 119 CTGTS------LYAITKQLGRCIQDDYKALVVRLLIQHRDLFPLSKFTIEDYKWALCTVW 172

Query: 248 SRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           SR +     DG+    L P+ADMLNHS +V     YD  S  +     + Y+ G+QVFI 
Sbjct: 173 SRAMDFVLPDGKSIRLLAPFADMLNHSSDVRQCHAYDPLSGNLSILAGKDYKAGDQVFIY 232

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG   N  LL  YGF+    +NP+D+ EL L        +++K +     GL  +    +
Sbjct: 233 YGSIPNNRLLRLYGFIIP--SNPNDNYELVLETHPMAPFFEQKHKLWESAGLDLTSTISL 290

Query: 366 QITGWPLELMAYAYLVVSPPS 386
            +T  PL      YL +   S
Sbjct: 291 TLTD-PLPKNVLQYLRIQRSS 310


>gi|260803924|ref|XP_002596839.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
 gi|229282099|gb|EEN52851.1| hypothetical protein BRAFLDRAFT_284593 [Branchiostoma floridae]
          Length = 500

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 88/309 (28%), Positives = 152/309 (49%), Gaps = 20/309 (6%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KWL D G+    + I+K +VG  GL A+K+I+  E  + +P  L++T ++       G +
Sbjct: 97  KWLEDHGVKSDAVTIEKFEVGGYGLKAVKDIKAEELFITIPRKLMLTTETARES-SLGPL 155

Query: 141 LKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
           +K+     V     LA +++ E  +  +S W+ YI+  P    + LY+   E+  +L+ S
Sbjct: 156 IKKDRILQVMANVSLALHVLCE-KYSSNSFWAPYINIFPGTYTTPLYFEEGEM-LHLQGS 213

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLP 254
                 + +  ++   Y     ++F   P+   L  +E F  + ++W+   + +R  ++P
Sbjct: 214 LNFSDVLNQYKSIARQYAYF-YKLFQTQPEAAGLPLKECFTFDEYRWAVSTVMTRQNQVP 272

Query: 255 SMDGR---VALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           + DGR    AL+P  DM NHS  EV T  +++  S        R++    QV+I YG +S
Sbjct: 273 TSDGRHLITALIPMWDMCNHSNGEVST--EFNLGSDSAECLAMREFPTDSQVYIFYGMRS 330

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N E L+  GFV  E  N  D V + L + K+D  +  K E L + G+ AS  F +     
Sbjct: 331 NAEFLIHNGFVYPE--NVHDRVNVKLGVSKNDSLFAMKAEVLSRAGIHASTSFQVHCGKD 388

Query: 371 PL--ELMAY 377
           P+  EL+ +
Sbjct: 389 PIPPELLVF 397


>gi|109158151|pdb|2H21|A Chain A, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158152|pdb|2H21|B Chain B, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158153|pdb|2H21|C Chain C, Structure Of Rubisco Lsmt Bound To Adomet
 gi|109158154|pdb|2H23|A Chain A, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158155|pdb|2H23|B Chain B, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158156|pdb|2H23|C Chain C, Structure Of Rubisco Lsmt Bound To Trimethyllysine And
           Adohcy
 gi|109158157|pdb|2H2E|A Chain A, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158158|pdb|2H2E|B Chain B, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158159|pdb|2H2E|C Chain C, Structure Of Rubisco Lsmt Bound To Azaadomet And Lysine
 gi|109158160|pdb|2H2J|A Chain A, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158161|pdb|2H2J|B Chain B, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
 gi|109158162|pdb|2H2J|C Chain C, Structure Of Rubisco Lsmt Bound To Sinefungin And
           Monomethyllysine
          Length = 440

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 148/302 (49%), Gaps = 26/302 (8%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           T  KWL + G+   K  ++   V E  GLVALK+I + + +L VP  L I  D+      
Sbjct: 8   TFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA----VA 63

Query: 137 AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           A E+ + CS +  W  +  +LI E S E S  W +Y   LP++  S +YW+  EL   L+
Sbjct: 64  ASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE-LQ 121

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVRLP 254
            SQ+ +  +     V      L   I      LFP+ V  ++ F W+FGIL SR   RL 
Sbjct: 122 GSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSRLR 180

Query: 255 SMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQY--------QPGEQVFIS 305
           + +  + +VP AD++NHS  V T    Y+      +F+ D  +        + GEQV+I 
Sbjct: 181 NEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVYIQ 238

Query: 306 YG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           Y   KSN EL L YGF+ P E  +   +  L L + +SD  + +KL+     G + +  F
Sbjct: 239 YDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTAYF 295

Query: 364 PI 365
            I
Sbjct: 296 DI 297


>gi|24987776|pdb|1MLV|A Chain A, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987777|pdb|1MLV|B Chain B, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|24987778|pdb|1MLV|C Chain C, Structure And Catalytic Mechanism Of A Set Domain Protein
           Methyltransferase
 gi|33357815|pdb|1OZV|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357816|pdb|1OZV|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357817|pdb|1OZV|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Lysine And Adohcy
 gi|33357822|pdb|1P0Y|A Chain A, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357823|pdb|1P0Y|B Chain B, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
 gi|33357824|pdb|1P0Y|C Chain C, Crystal Structure Of The Set Domain Of Lsmt Bound To
           Melysine And Adohcy
          Length = 444

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/304 (32%), Positives = 148/304 (48%), Gaps = 26/304 (8%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
             T  KWL + G+   K  ++   V E  GLVALK+I + + +L VP  L I  D+    
Sbjct: 10  VQTFWKWLQEEGVITAKTPVKASVVTEGLGLVALKDISRNDVILQVPKRLWINPDA---- 65

Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             A E+ + CS +  W  +  +LI E S E S  W +Y   LP++  S +YW+  EL   
Sbjct: 66  VAASEIGRVCSELKPWLSVILFLIRERSREDSV-WKHYFGILPQETDSTIYWSEEELQE- 123

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
           L+ SQ+ +  +     V      L   I      LFP+ V  ++ F W+FGIL SR   R
Sbjct: 124 LQGSQLLKTTVSVKEYVKNECLKLEQEIILPNKRLFPDPV-TLDDFFWAFGILRSRAFSR 182

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTDRQY--------QPGEQVF 303
           L + +  + +VP AD++NHS  V T    Y+      +F+ D  +        + GEQV+
Sbjct: 183 LRNEN--LVVVPMADLINHSAGVTTEDHAYEVKGAAGLFSWDYLFSLKSPLSVKAGEQVY 240

Query: 304 ISYG-KKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           I Y   KSN EL L YGF+ P E  +   +  L L + +SD  + +KL+     G + + 
Sbjct: 241 IQYDLNKSNAELALDYGFIEPNENRH---AYTLTLEISESDPFFDDKLDVAESNGFAQTA 297

Query: 362 CFPI 365
            F I
Sbjct: 298 YFDI 301


>gi|388250581|gb|AFK23406.1| histone-lysine N-methyltransferase [Cordyceps militaris]
          Length = 479

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 111/397 (27%), Positives = 181/397 (45%), Gaps = 62/397 (15%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL  SG +    + +    V  RG+ AL++ +KGE++L +P + + TA++  + P  
Sbjct: 1   MEAWLKHSGAVGVDAIEVADFPVTGRGVKALRSFKKGERILTIPSACLWTAEAARADPLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNY------ISALPRQPYSLLYWTR 187
           G VL+      SV D   LA +L+    F KS R + Y      I+A+P++  + +++  
Sbjct: 61  GPVLRSAQPPLSVED--TLAIHLL----FVKS-RTAGYEGQRLHIAAMPQRHSASIFFAE 113

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME---------- 237
            EL +  E S +     +    V   +  L +++ S++ DLFP + F +E          
Sbjct: 114 DEL-QVCEGSSLHTLTTQLEQRVQDDFRQLLVQLLSQHRDLFPLDQFTIEDVSYIAAFPR 172

Query: 238 --------------TFKWSFGILFSRLVRLPSMDG-RVALV-PWADMLNHSCEVETFLDY 281
                          +KW+   ++SR +     D   V LV P ADMLNHS +V+    Y
Sbjct: 173 PTRSISLMNLYFPFQYKWALCTIWSRAMDFAVSDTTSVRLVAPLADMLNHSLDVKQCHAY 232

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
           D +S  +     + YQ G+Q+FI YG   N  LL  YGFV  +  NP+DS +L L     
Sbjct: 233 DPTSGDLSILAAKDYQVGDQIFIYYGSVPNNRLLRLYGFVLLD--NPNDSYDLVLQTSPM 290

Query: 342 DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNK 401
              Y++K       GL ++   P+     PL      YL         + + + AA    
Sbjct: 291 APLYEQKERLWALAGLDSTCTIPL-TAKHPLPKNVLRYL---------RTQRLDAADVAD 340

Query: 402 MT----SKKDIKCPEIDE-QALQFILDSCESSISKYS 433
           MT    +  D K  + +E Q LQF++DS  S +  + 
Sbjct: 341 MTLQLLNGTDGKVNDGNEIQVLQFLIDSLGSVLEGFG 377


>gi|17368377|sp|P94026.1|RBCMT_TOBAC RecName: Full=Ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase; Short=RuBisCO LSMT; Short=RuBisCO
           methyltransferase; Short=rbcMT; Flags: Precursor
 gi|1731475|gb|AAC49565.1| ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
 gi|1731477|gb|AAC49566.1| ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Nicotiana tabacum]
          Length = 491

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 106/383 (27%), Positives = 183/383 (47%), Gaps = 31/383 (8%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
             T  +WL   G+   K  ++   V E  GLVA ++I KGE +L VP    I  D+  + 
Sbjct: 57  VQTFWQWLCKEGVVTTKTPVKPGIVPEGLGLVAKRDIAKGETVLQVPKRFWINPDAV-AE 115

Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
            E G V   CS +  W  +A +L+ E  +   S+W  Y+  LP+   S +YW+  EL   
Sbjct: 116 SEIGNV---CSGLKPWISVALFLLRE-KWRDDSKWKYYMDVLPKSTDSTIYWSEEELSE- 170

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
           ++ +Q+    +     V   +  +   +  +   LFP  +  ++ F W+FGIL SR   R
Sbjct: 171 IQGTQLLSTTMSVKDYVQNEFQKVEEEVILRNKQLFPFPI-TLDDFFWAFGILRSRAFSR 229

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG--VVFTTDRQY--------QPGEQV 302
           L + +  + LVP+AD+ NH+  V T  D+    +G   +F+ D  +        + G+Q+
Sbjct: 230 LRNQN--LILVPFADLTNHNARVTT-EDHAHEVRGPAGLFSWDLLFSLRSPLKLKAGDQL 286

Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           FI Y   KSN ++ L YGF+  E ++  D+  L L + +SD+ Y +KL+     G+  + 
Sbjct: 287 FIQYDLNKSNADMALDYGFI--EPSSARDAFTLTLEISESDEFYGDKLDIAETNGIGETA 344

Query: 362 CFPIQI-TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
            F I+I    P  ++ Y  LV    +     E +     N +     +     +E+ + +
Sbjct: 345 YFDIKIGQSLPPTMIPYLRLVALGGTDAFLLESI---FRNSVWGHLGLPVSRANEELICK 401

Query: 420 FILDSCESSISKYSRFLQVKELL 442
            + D+C+S++S Y   ++  E L
Sbjct: 402 VVRDACKSALSGYHTTIEEDEKL 424


>gi|400594002|gb|EJP61885.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 481

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 169/378 (44%), Gaps = 42/378 (11%)

Query: 79  LQKWLSDSG---LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
           ++ WL+DSG   LP  K+A     +  RG+ A +    GE++L +P   + T +  ++  
Sbjct: 1   MEAWLNDSGALGLPHLKVA--DFALTGRGVQAQRAFSAGERILTIPAQCLWTVEHAYADR 58

Query: 136 EAGEVLKQC----SVPDWPLLATYLI-------SEASFEKSSRWSNYISALPRQPYSLLY 184
             G VL+      SV D   L   L+        + ++E   R  +++  LP +    ++
Sbjct: 59  LLGPVLRALQPPLSVEDTLALHILLVRARRRPDDDGAYEAGRR--SHVDVLPDRYTMSIF 116

Query: 185 WTRAELD------RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
           ++  E+        Y   +Q+R R       +   Y  L  R+  ++ +LFP   F +E 
Sbjct: 117 FSDEEMQVCKGSSLYTLTTQLRGR-------IGDDYKKLLTRVLMRHRNLFPLSKFGIEH 169

Query: 239 FKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           +KW+   ++SR +     +G     L P+ADMLNHS +V+    YD ++  +     + Y
Sbjct: 170 YKWALCTVWSRGMDFTVSEGNSLRLLAPFADMLNHSSDVKQCHAYDPTTGDLSILASKDY 229

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
             G+QVFI YG   N  LL  YGFV  E  NP DS +L L        Y++K    +  G
Sbjct: 230 NVGDQVFIYYGPVPNNRLLRLYGFVLPE--NPHDSYDLVLQTSPMAPLYEQKERLWKLAG 287

Query: 357 LSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQ 416
           L  +   P+     PL      YL +         E +  A + ++ +  D K  +  E 
Sbjct: 288 LDTACTIPLTAND-PLPRSVLRYLRIQRLD-----ESLLGAMTMQIATGADEKISDDSET 341

Query: 417 -ALQFILDSCESSISKYS 433
             LQF++DS  + +  +S
Sbjct: 342 LILQFLIDSISAILEGFS 359


>gi|297807453|ref|XP_002871610.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317447|gb|EFH47869.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 516

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 27/316 (8%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGE------RGLVALKNIRKGEKLLFVPPSLVIT 127
           +++  L+ W+  +GLPP K+ +++    +        + A ++++KG+    VP SLV+T
Sbjct: 80  DDSEDLKFWMDKNGLPPCKVLLKERPAHDLKYKPIHYVAASEDLQKGDVAFSVPDSLVVT 139

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 140 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 199

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
           S L W+ AELD YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 200 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 258

Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
           + E FK +F  + S +V L +  +  R ALVP    +L +    +  L        V   
Sbjct: 259 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 316

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            DR Y+ G+ + +  G + N +LLL+YGFV  +  NP D + +  +L   D  Y++K   
Sbjct: 317 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRIIVEAALNTEDPQYQDKRMV 374

Query: 352 LRKYGLSASECFPIQI 367
            ++ G  + + F +++
Sbjct: 375 AQRNGKLSQQVFQVRV 390


>gi|357160358|ref|XP_003578740.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Brachypodium distachyon]
          Length = 516

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 142/312 (45%), Gaps = 30/312 (9%)

Query: 81  KWLSDSGLPPQKMAIQKVDV-GERG--------LVALKNIRKGEKLLFVPPSLVITADSK 131
           +WL   GLPP K+AI +  V   RG        + A +++  G+    +P SLV+T +  
Sbjct: 84  EWLLTHGLPPGKVAILERPVPCSRGGKDRPLHFVAAGQDLEVGDVAFEMPMSLVVTLERV 143

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLY 184
                  E+L    + +   LA YL+ E    K S W  YI  L RQ         S L 
Sbjct: 144 LGDESVAELLTTNKLSELACLALYLMYEKKQGKDSLWYPYIKELDRQRGRGQLAVESPLL 203

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMET 238
           WT +ELD YL  S +R+  + R   +   YN+L         +F +YP   P E F  E 
Sbjct: 204 WTESELD-YLNGSPMRDEVVVRDEGIRREYNELDTLWFMAGSLFKQYPFDVPTEAFPFEI 262

Query: 239 FKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
           FK +F  + S +V L   S+  R ALVP    +L +    +  L     S  V    DR 
Sbjct: 263 FKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVDDS--VRLVVDRP 320

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           Y+ GE + +  G + N  LLL+YGFV  +  NP D + +  SL   D  Y+EK    ++ 
Sbjct: 321 YKAGEPIIVWCGPQPNSRLLLNYGFVDED--NPYDRIAIEASLNMEDPQYQEKRMVAQRN 378

Query: 356 GLSASECFPIQI 367
           G  A + F + +
Sbjct: 379 GKLAIQKFQVCV 390


>gi|449442309|ref|XP_004138924.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 183/383 (47%), Gaps = 29/383 (7%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           N  T  +W+   G+   K  ++     E  GL   KN+ K E +L VP    I  D+  +
Sbjct: 69  NVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAV-A 127

Query: 134 CPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
             E G V   CS +  W  +A +LI E + +  SRW  Y+  LP++  S ++W+  EL  
Sbjct: 128 DSEIGNV---CSGLKPWISVALFLIRE-NLKGDSRWRRYLDILPQETDSTVFWSEEELAE 183

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LV 251
            ++ +Q+    +     V   +  +   I  ++ DLFP  +  ++ F W+FGIL SR   
Sbjct: 184 -IQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRI-TLDDFFWAFGILRSRAFS 241

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTDRQY--------QPGEQV 302
           RL   +  + L+P+AD++NHS  V T    ++      +F+ D  +        + G+QV
Sbjct: 242 RLRGQN--LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLFSLRSPLSVKAGDQV 299

Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           FI Y  KKSN +L L YGF+  E  +  ++  L L + +SD  + +KL+     GL+ + 
Sbjct: 300 FIQYDLKKSNADLALDYGFI--EQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTA 357

Query: 362 CFPIQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
            F I +   +P  ++ +  L+    +     E +     N +    ++     +E+ + Q
Sbjct: 358 YFDIILERPFPPAMLPFLRLLALGGTDAFLLESL---FRNSVWGHLEMPVSRANEELICQ 414

Query: 420 FILDSCESSISKYSRFLQVKELL 442
            + ++CE+++S Y   ++  E L
Sbjct: 415 VVRNACEAALSGYHTTIEEDEKL 437


>gi|168063638|ref|XP_001783777.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664720|gb|EDQ51429.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 395

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 159/353 (45%), Gaps = 35/353 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A + I  GE++L V   L+IT +     P   + L    V +W  LA +++ E   
Sbjct: 1   RTLFAARPIEVGEQVLRVSGDLMITPNK---LPTEVKELLPTGVTEWARLALFILVEQHL 57

Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            ++S+W+ YI+ LP     +S ++W + EL+     S  RE    R   VIG+     L 
Sbjct: 58  GQASQWAPYINCLPTCGALHSTVFWKKEELELVRFTSLHRETMQRRA--VIGSEFASVLP 115

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +  K P +F E V + + FK ++    S L R  S + R+  VP+ D  NH       L 
Sbjct: 116 VLQKCPHIFGERVLHSK-FKQAYATGKS-LRR--SSNTRILTVPFVDFFNHDSNCRALLS 171

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           YD+         D+ Y  GEQV ISYG+  N  L L +GF      NP D VE+ ++L  
Sbjct: 172 YDEERACAEVIADKNYARGEQVVISYGRLPNTTLALDFGFTI--SCNPYDQVEVWMALSH 229

Query: 341 SDKCYKEKLEALRKYGL----------SASECFPIQ----IT----GWPLELMAYAYLVV 382
            D   K KL  L  +G+          S    F ++    +T    G P  L A+A ++ 
Sbjct: 230 RDPLRKMKLALLHAHGMPTVVHADGSDSGGNGFHLREVKSVTGRGKGIPHALRAFARVLC 289

Query: 383 SPPSMKGKFEEMAAAAS--NKMTSKKDIKCPEIDEQALQFILDSCESSISKYS 433
           +  +   +  EMAA A   +   ++   K    + Q +  +L   ES I++ S
Sbjct: 290 A--TTPQELSEMAAEAMKYDGRLARLPAKSRNKEAQVMNLLLARLESLINQRS 340


>gi|162606198|ref|XP_001713614.1| putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
 gi|13794534|gb|AAK39909.1|AF165818_117 putative ribulose-1,5-bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Guillardia theta]
          Length = 460

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 84/294 (28%), Positives = 145/294 (49%), Gaps = 16/294 (5%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL+A +NI K EK++ +  +L+          E    +      ++  LA  L+ E   
Sbjct: 100 RGLIASRNILKNEKIIEISENLMFDK------FEHNLEINSNGSDNYSDLAIKLLVELFK 153

Query: 163 EKSSRWSNYISALPRQ-PYSLLY-WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            K S W  YI  LP +    LL+ W   EL  +++ S++ + +      +   Y  +   
Sbjct: 154 NKKSFWFPYIGILPEEYDLKLLFRWPLKEL-FFIKGSRLSKASDYLKKKLKAQYEMVNKE 212

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +F +   L+P ++FN + ++WS  IL SR + L     +V L+P+ D+LNH+    +F+ 
Sbjct: 213 VFQRNRLLYPSKIFNYQNWEWSMSILLSRTISLQET-KKVVLIPYIDLLNHNPFSSSFIS 271

Query: 281 YDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           Y K     S+ +V  +D+     +Q++ISYG+KSN ELL  YGF+     NP DSV + +
Sbjct: 272 YRKIPLSDSKEIVVYSDKNCNKFDQLYISYGQKSNLELLNLYGFIAER--NPYDSVIIRI 329

Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
           S+   D  +KEK   L          +PI +  +P E++ +  + +   ++  K
Sbjct: 330 SMSPKDIFFKEKKSFLFSNKKFFYNSYPIFLYKYPDEMIEFIKICLFNTNINDK 383


>gi|224117488|ref|XP_002331687.1| SET domain protein [Populus trichocarpa]
 gi|222874165|gb|EEF11296.1| SET domain protein [Populus trichocarpa]
          Length = 502

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 165/375 (44%), Gaps = 36/375 (9%)

Query: 14  PSFSHLHKAQSPAGFTDFPRKRCG-HRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDS 72
           PS + L +      F++ P++    HR   +   +T  D  RT   V++     G E D 
Sbjct: 13  PSLTVLSRVS--ISFSNLPKRAVSFHRRRRNLCFATLVDGKRTSEVVSKR---GGEEEDE 67

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKL------LFVPPSLVI 126
             +   L+ W+  +GLPP K+ +++    ++ L  +  +   E L      + VP SLV+
Sbjct: 68  FGD---LKSWMHKNGLPPCKVVLKERPSHDKKLRPIHYVAASEDLQASDVAVSVPNSLVV 124

Query: 127 TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------P 179
           T +         E+L    + +   LA YL+ E    K S W  YI  L RQ        
Sbjct: 125 TLERVLGNETLAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAV 184

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEV 233
            S L W+ AEL  YL  S  +   ++R   +   Y +L         +F +YP   P E 
Sbjct: 185 ESPLLWSEAEL-AYLTGSPTKAEVLDRADGIKREYEELDTVWFMAGSLFQQYPYDIPTEA 243

Query: 234 FNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVF 290
           F  E FK +F  + S +V L   S+  R ALVP    +L +S   +  L        V  
Sbjct: 244 FPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYSSNCKAMLT--AVDGAVEL 301

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
             DR Y+ GE + +  G + N +LLL+YGFV  +  NP D + +  +L   D  Y++K  
Sbjct: 302 VVDRPYKAGEPIVVWCGPQPNSKLLLNYGFVDED--NPYDRIAVEAALNTEDPQYQDKRM 359

Query: 351 ALRKYGLSASECFPI 365
             ++ G  + + F +
Sbjct: 360 VAQRNGKLSVQVFQV 374


>gi|449495943|ref|XP_004159992.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 503

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 182/383 (47%), Gaps = 29/383 (7%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           N  T  +W+   G+   K  ++     E  GL   KN+ K E +L VP    I  D+  +
Sbjct: 69  NVHTFWQWVRQEGMVSYKTHVKPAIFPEGLGLATTKNLSKNEVVLEVPKRFWINPDAV-A 127

Query: 134 CPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
             E G V   CS +  W  +A +LI E + +  SRW  Y+  LP++  S ++W+  EL  
Sbjct: 128 DSEIGNV---CSGLKPWISVALFLIRE-NLKGDSRWRRYLDILPQETDSTVFWSEEELAE 183

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LV 251
            ++ +Q+    +     V   +  +   I  ++ DLFP  +  ++ F W+FGIL SR   
Sbjct: 184 -IQGTQLLSTTLNVKEYVKSEFLKVEEEILLRHKDLFPSRI-TLDDFFWAFGILRSRAFS 241

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLD-YDKSSQGVVFTTD--------RQYQPGEQV 302
           RL   +  + L+P+AD++NHS  V T    ++      +F+ D           + G+QV
Sbjct: 242 RLRGQN--LVLIPFADLVNHSANVTTEEHAWEVKGPAGLFSWDVLCSLRSPLSVKAGDQV 299

Query: 303 FISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           FI Y  KKSN +L L YGF+  E  +  ++  L L + +SD  + +KL+     GL+ + 
Sbjct: 300 FIQYDLKKSNADLALDYGFI--EQKSDRNAYTLTLEIPESDLFFDDKLDIAETNGLNQTA 357

Query: 362 CFPIQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-Q 419
            F I +   +P  ++ +  L+    +     E +     N +    ++     +E+ + Q
Sbjct: 358 YFDIILERPFPPAMLPFLRLLALGGTDAFLLESL---FRNSVWGHLEMPVSRANEELICQ 414

Query: 420 FILDSCESSISKYSRFLQVKELL 442
            + ++CE+++S Y   ++  E L
Sbjct: 415 VVRNACEAALSGYHTTIEEDEKL 437


>gi|428175234|gb|EKX44125.1| hypothetical protein GUITHDRAFT_109909 [Guillardia theta CCMP2712]
          Length = 442

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 88/321 (27%), Positives = 155/321 (48%), Gaps = 41/321 (12%)

Query: 60  TQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLL 118
           T++++P+    D  ++     +WL+  G+  Q      +  G  RGL+A ++   GE +L
Sbjct: 32  TEDVVPFHDREDLDQDMKDFLEWLTAKGMNFQSQVDVAITNGTGRGLLARRSFMPGETML 91

Query: 119 FVPPSLVITADSKWSCPEAGEVLKQCSVPD-----------WPLLATYLISEASFEKSSR 167
            VPP L+IT D      E G   ++  + D            PLLA +L +   + +S  
Sbjct: 92  AVPPELLITPDMARRS-EVGRAFREHGLDDCSGGEDSTYECMPLLAMHL-TVLYYNESHD 149

Query: 168 WSNYISALPRQPYSLLYWTRAE--------LDRYLEASQIRERAIERIT-NVIGTYNDLR 218
           +  ++  LPR+  + L+W+  E        L   L+   +    + R T  V+G +N   
Sbjct: 150 FHPWMKILPRKLTTPLFWSDKEREELQGSNLYNMLDGWTMNVEKLHRSTARVLGQHN--- 206

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS----MDGRVALV-PWADMLNHSC 273
                 +PDL P+ +++++ FKW++  +F+R   +        GR  ++ P AD+ NH  
Sbjct: 207 -----VFPDL-PKAIYSLKEFKWAYATIFARAFDVDGKSFGFSGRQRIMAPMADLFNHG- 259

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
           +V+T   ++ +S      T + +  GEQ+F++Y  K+N E LL YGFV    +NP D V 
Sbjct: 260 DVKTSYTFNAASGHFELFTQQFFSRGEQIFMNYDSKNNAEFLLQYGFVIE--SNPHDYVG 317

Query: 334 LPLSLKKSDKCYKEK-LEALR 353
           +  S+      Y++K L+ LR
Sbjct: 318 IAASIGNDQPFYRDKSLDCLR 338


>gi|18377718|gb|AAL67009.1| unknown protein [Arabidopsis thaliana]
          Length = 514

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 27/316 (8%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
           +++  L+ W+  +GLPP K+ +++    ++       + A ++++KG+    VP SLV+T
Sbjct: 78  DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
           S L W+ AELD YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256

Query: 235 NMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFT 291
           + E FK +F  + S +V L +  +  R ALVP    +L +    +  L        V   
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQNVGLARRFALVPLGPPLLAYCSNCKAML--TAVDGAVELV 314

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            DR Y+ G+ + +  G + N +LLL+YGFV  +  NP D V +  +L      Y++K   
Sbjct: 315 VDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVIVEAALNTEGPQYQDKRMV 372

Query: 352 LRKYGLSASECFPIQI 367
            ++ G  + + F +++
Sbjct: 373 AQRNGKLSQQVFQVRV 388


>gi|414881266|tpg|DAA58397.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 512

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/311 (30%), Positives = 141/311 (45%), Gaps = 30/311 (9%)

Query: 82  WLSDSGLPPQKMAIQK---------VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           WL   GLPP K+ I++          D   R + A+ +++ G+    V  SLV+T +   
Sbjct: 81  WLRARGLPPGKVDIRERPVPCLRDGKDQPLRYVSAVVDLQAGDVAFEVSMSLVVTLERVL 140

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-------YSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L R          S L W
Sbjct: 141 GDESIAELLTNNKLSELACLALYLMYEKKQGKDSFWYPYIKELDRHRGRGQLAVESPLLW 200

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           T +ELD YL  S +++  + R   +   YN+L         +F +YP   P E F  E F
Sbjct: 201 TESELD-YLTGSPLKDEVVARDEAIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 259

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L +    +  L  D  S  V    DR Y
Sbjct: 260 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYRSNCKAMLTADGDS--VRLVVDRPY 317

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE + I  G ++N  L+L+YGFV  +  NP D V +  SL   D  Y+EK    ++ G
Sbjct: 318 KAGEPIIIWCGPQTNSRLVLNYGFVDED--NPFDRVAIEASLNTEDPQYQEKRMVAQRNG 375

Query: 357 LSASECFPIQI 367
             A + F + +
Sbjct: 376 KLAIQNFNVYV 386


>gi|358384831|gb|EHK22428.1| hypothetical protein TRIVIDRAFT_84056 [Trichoderma virens Gv29-8]
          Length = 458

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 183/376 (48%), Gaps = 25/376 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           ++ WL +SG     + +       RG+  L+  ++GE++L +P   + T +  ++    G
Sbjct: 1   MEGWLRESGAELDGLELAHFPAIGRGVRTLRCFKQGERILTIPSGCLWTVEHAYADAVLG 60

Query: 139 EVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRY 193
            VL+      SV D   LA Y++   S E       ++++ALP    S +++   EL+  
Sbjct: 61  PVLRSAQPPLSVED--TLAIYILFVRSRESGYDGLRSHVAALPASYSSSIFFEDDELEVC 118

Query: 194 LEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET--FKWSFGILFSR 249
             +S   I  +  +RI      Y  L +R+F  + DLFP   F +E   +KW+   ++SR
Sbjct: 119 AGSSLYTITRQLEQRIEE---DYRGLVVRVFGLHLDLFPLNKFTIENVGYKWALCTVWSR 175

Query: 250 LVR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            +   LP+ +    L P+ADM+NHS EV+    YD SS  +     + Y+  +QVFI YG
Sbjct: 176 AMDFVLPNGNPLRLLAPFADMVNHSPEVKQCHVYDASSGNLSILAGKDYEAEDQVFIYYG 235

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
              N  LL  YGFV  +  NP+DS +L LS       Y++K +     GL+++    + +
Sbjct: 236 PMPNSRLLRLYGFVIPD--NPNDSYDLVLSTHPLAPFYEQKQKLWASAGLNSTCTISLTL 293

Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCES 427
              PL      YL +     +    ++AA A  K+ + + I   + + + L+F+++S  S
Sbjct: 294 DD-PLPKNVLRYLRIQ----RLDESDLAAIALQKIDTNEKISDSK-EVEILRFLVESIGS 347

Query: 428 SISKY-SRFLQVKELL 442
            +  + +R  +++E L
Sbjct: 348 LLDSFGTRLEKLQEQL 363


>gi|322712432|gb|EFZ04005.1| histone-lysine N-methyltransferase [Metarhizium anisopliae ARSEF
           23]
          Length = 462

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 175/371 (47%), Gaps = 28/371 (7%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL +SG +    + +    +  RG+ A +  ++GE++L +P  L  T     +    
Sbjct: 1   METWLEESGAVGLDGLEVADFPLTGRGVKARRRFKQGERILTIPSGLHWTVKHAQNDSLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G  L       SV D   LA +++   S E       +++  LP    S +++T  EL+ 
Sbjct: 61  GPALCSAQPPLSVED--TLAVHILFVRSRESGYDGLRSHVERLPASYSSSIFFTDDELEV 118

Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
              AS   I ++  +RI +    Y DL +R+  +YPDLFP + F +  +KW+   ++SR 
Sbjct: 119 CAGASLYTITKQLQQRIED---DYRDLVVRVLVQYPDLFPLDKFTLHHYKWALCAVWSRA 175

Query: 251 VRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           +     DG     L P+ADMLNHS E +    YD SS  +     + Y+ G+QV+I YG 
Sbjct: 176 MDFQLSDGSSIRLLAPFADMLNHSSESKQCHVYDASSGDLSVLAGKDYEAGDQVYIHYGS 235

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY--KEKLEALRKYGLSASECFPIQ 366
             N  LL  YGF+     NP+DS +L L+       +  K+KL AL   GL ++    + 
Sbjct: 236 IPNHRLLRLYGFIIP--GNPNDSYDLVLATHPLAPFFELKQKLWALA--GLDSTCTISLT 291

Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCE 426
           +T  PL      YL +     +    ++A+ A  +   +K     E+  Q LQ +++S  
Sbjct: 292 LTD-PLPKNVIRYLRI----QRLDESDLASIALGQAADEKISNSNEV--QVLQSLVESIA 344

Query: 427 SSISKYSRFLQ 437
           S +  +   L+
Sbjct: 345 SLLGSFGTRLE 355


>gi|440804743|gb|ELR25614.1| SET domain containing protein, partial [Acanthamoeba castellanii
           str. Neff]
          Length = 273

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 78/240 (32%), Positives = 126/240 (52%), Gaps = 20/240 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADS--KWSCPEAGEVLKQCSVPDWPLLAT----Y 155
           R +VA  +I  GE LL VP SLV+ +AD+    S PE   +L +    ++PL  T     
Sbjct: 43  RSVVAAHDIAAGETLLSVPFSLVVDSADALLATSAPEIRRILDE----EFPLSPTNENAL 98

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+     + +S W  YI  LP    + L+++  EL  YLE S +   A +R   +   Y+
Sbjct: 99  LLLVHKNDPNSPWQRYIDVLPSTFSTTLFFSDDELS-YLEGSSLHYFARQRRRAIESQYD 157

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
            +   +F  YP+ F  E F+++ +KW+  +++SR   +   +G+  LVPWADM N + E 
Sbjct: 158 TIFTPLFVDYPEHFAPEQFSLDAWKWALSVIWSRSFVVD--EGKSGLVPWADMFNMAPET 215

Query: 276 ETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDS 331
           E   +  D     ++++     + GEQ+F++YG+    SN +LL+ YGFV     NP D+
Sbjct: 216 EQVKVAVDAVDHHLIYSARSPIKKGEQIFVAYGQSRQMSNAQLLMDYGFVLE--NNPHDA 273


>gi|194038089|ref|XP_001925323.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sus scrofa]
 gi|456754196|gb|JAA74239.1| SET domain containing 3 [Sus scrofa]
          Length = 595

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW SD+G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASDNGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYAQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P       +E F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPQAHKLPLKESFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPVSAQLLAFLRV 387


>gi|427784595|gb|JAA57749.1| Putative histone-lysine n-methyltransferase setd3 [Rhipicephalus
           pulchellus]
          Length = 485

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 152/308 (49%), Gaps = 35/308 (11%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGE 139
           KW SD+G     ++I+ +  GE G VA ++I +  + L VP  L++T A +K S  + G 
Sbjct: 79  KWCSDNGAYLGSVSIKDLPDGEYGFVADEHIEESNQFLGVPLKLMMTTAAAKKS--KLGP 136

Query: 140 VLKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
           +L+     D P+        LA +LI E    +SS W  YIS LP    ++LY++  EL+
Sbjct: 137 LLR-----DDPIMMSMSNVALAMFLILEFCTGESSFWHPYISTLPASFNTVLYFSVEELE 191

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFS 248
             L  S + + A++   ++   Y+    +IF  +P    L  ++ F  + ++W+   + +
Sbjct: 192 -LLHGSTVLDEALKLHRSIARQYSYFH-KIFRTHPLAKSLPYKDCFTYDLYRWAVSAVMT 249

Query: 249 RLVRLP-----------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
           R   +P             D   A+VP  DM NHS + + F DYD S+  +     R ++
Sbjct: 250 RQNAVPLTDTAGGDDEDGTDAMTAMVPLWDMCNHS-DGKVFTDYDISANMLRCYAMRDFE 308

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
            G++V I YG+++N E  +  GFV  E  N  DSV++ L + K D  Y  K +    + L
Sbjct: 309 KGQEVTIFYGRRTNAEFFIHNGFVFPE--NRHDSVDIKLGISKQDPLYAVKAKLCDDHEL 366

Query: 358 SASECFPI 365
           + S  F +
Sbjct: 367 TPSGIFAL 374


>gi|115487958|ref|NP_001066466.1| Os12g0236900 [Oryza sativa Japonica Group]
 gi|77554044|gb|ABA96840.1| SET domain containing protein, expressed [Oryza sativa Japonica
           Group]
 gi|113648973|dbj|BAF29485.1| Os12g0236900 [Oryza sativa Japonica Group]
          Length = 509

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 142/311 (45%), Gaps = 30/311 (9%)

Query: 82  WLSDSGLPPQKMAI---------QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           WL + GLPP K+AI         +  D+    + A +++  G+    VP SLV+T +   
Sbjct: 78  WLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLERVL 137

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    + S W  YI  L RQ         S L W
Sbjct: 138 GDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLW 197

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           T +EL+ YL+ S I++  + R   +   YN+L         +F +YP   P E F  E F
Sbjct: 198 TESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 256

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L +    +  L     S  V    DR Y
Sbjct: 257 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDS--VRLVVDRPY 314

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE + +  G + N  LLL+YGF+  +  NP D + +  SL   D  ++EK    ++ G
Sbjct: 315 KAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVIEASLNIEDPQFQEKRMVAQRNG 372

Query: 357 LSASECFPIQI 367
             A + F + +
Sbjct: 373 KLAIQNFHVCV 383


>gi|125536207|gb|EAY82695.1| hypothetical protein OsI_37912 [Oryza sativa Indica Group]
          Length = 505

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 93/311 (29%), Positives = 142/311 (45%), Gaps = 30/311 (9%)

Query: 82  WLSDSGLPPQKMAI---------QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           WL + GLPP K+AI         +  D+    + A +++  G+    VP SLV+T +   
Sbjct: 74  WLREHGLPPGKVAILDRPVPCFREGKDLPLHYVAAGQDLEAGDVAFEVPMSLVVTLERVL 133

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    + S W  YI  L RQ         S L W
Sbjct: 134 GDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYPYIKELDRQRGRGQLAVESPLLW 193

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           T +EL+ YL+ S I++  + R   +   YN+L         +F +YP   P E F  E F
Sbjct: 194 TESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFMAGSLFQQYPFDIPTEAFPFEIF 252

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L +    +  L     S  V    DR Y
Sbjct: 253 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSNCKAMLTAVGDS--VRLVVDRPY 310

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE + +  G + N  LLL+YGF+  +  NP D + +  SL   D  ++EK    ++ G
Sbjct: 311 KAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVIEASLNIEDPQFQEKRMVAQRNG 368

Query: 357 LSASECFPIQI 367
             A + F + +
Sbjct: 369 KLAIQNFHVCV 379


>gi|390354259|ref|XP_001201449.2| PREDICTED: SET domain-containing protein 4-like [Strongylocentrotus
           purpuratus]
          Length = 455

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 135/284 (47%), Gaps = 21/284 (7%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKV---DVGERGLVALKNIRKGEKLLFVPPSLVI 126
           +D  E   TL KW+ + G   +   ++     D G RGL+  KN+R G+ ++ +P  L++
Sbjct: 37  VDHDEQYITLMKWMKEHGFNCKGCCLKPAVFSDTG-RGLMTKKNLRPGDSIVEIPRHLLV 95

Query: 127 TADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
           TA    +  E G ++K+      P  ++  +L++E S  KSS W  YI+ LP+   +  +
Sbjct: 96  TAKDILNT-ELGPIIKRQRQKPTPYQVVCAFLLTERSKGKSSFWYPYINVLPKDFTTPAF 154

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE--VFNMETFKWS 242
            +  + D  +  +  R RAI ++ ++   +      +F      FP+    F++++F W+
Sbjct: 155 GSTKQADFDVLPTIARSRAINQLQDIRAAFESASC-LFEDIERTFPQYRIFFSLDSFVWA 213

Query: 243 FGILFSRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
           + ++ SR V +         P      AL P+ D+LNHS   E    +D  S      T 
Sbjct: 214 WFVINSRSVYIEPSGCEAFDPKASDDFALAPFLDLLNHSPGAEVTAGFDPVSNCYRIKTL 273

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
             Y   +QVFI YG   N  LLL YGFV    +NP D+V   L 
Sbjct: 274 DSYHAYDQVFIHYGPHDNVNLLLEYGFVI--PSNPHDAVSFELG 315


>gi|168067849|ref|XP_001785817.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662541|gb|EDQ49381.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 111/390 (28%), Positives = 176/390 (45%), Gaps = 36/390 (9%)

Query: 68  CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVI 126
           C + +     T+  W    G+  Q  A++  +V E  GL+A + +  G+++L VP S+ I
Sbjct: 35  CSVGAEAQVQTIWSWAQSHGI--QGEAVKPAEVSEGLGLIAQRPVNAGDEILNVPESVWI 92

Query: 127 T--ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
              A    S  +A E LK      W  +A +LI E+S   SS+W  Y+ +LP+   S L+
Sbjct: 93  NLAAVQNSSLGKACEGLKP-----WVAVALFLIHESS-NPSSKWRPYLDSLPKSLDSPLF 146

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           W+  EL   L  +Q+       +  +   YN+L   +      +F   V+  + FKW+FG
Sbjct: 147 WSDEELAE-LVGTQLLGSVTGYLEFLENEYNNLVEEVLEPNNKIFNPAVYTFDGFKWAFG 205

Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEV------------ETFLDYDK-SSQGVVFT 291
           IL SR    P     +ALVP AD++NH   +              F +  K SS  +   
Sbjct: 206 ILRSRTFS-PLTGEDIALVPIADLVNHGKGLGDGSPSWVRKGTSQFWNIGKGSSDLLTVR 264

Query: 292 TDRQYQPGEQVFISYGK-KSNGELLLSYGFVPRE-GTNPS-----DSVELPLSLKKSDKC 344
               +  GEQV + YG  KSN +L L YGFV R+ G+  S     DS+ L L +   D+ 
Sbjct: 265 ASANFSAGEQVLMQYGATKSNADLALDYGFVERDRGSQFSPGIERDSLALSLEISPDDRF 324

Query: 345 YKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTS 404
             +K + L   G   S  F +     P + M   +L +S  S    F  + A   N+   
Sbjct: 325 VDDKADILEINGFQCSMQFDLSRGQGPSDEM-ITFLRLSALSGPDSF-LLEALFRNEAWG 382

Query: 405 KKDIKCPEIDEQAL-QFILDSCESSISKYS 433
              +     +E+AL   +L+  ++++  YS
Sbjct: 383 HVSLPVSRDNEEALCTSMLEGLKAALDGYS 412


>gi|388493466|gb|AFK34799.1| unknown [Lotus japonicus]
          Length = 132

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 51/64 (79%), Positives = 58/64 (90%)

Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSR 434
           MAYAYL VSP SM+G+FE+MAAAASNK+TS KD K PEI+EQALQFILDSCESSISKY++
Sbjct: 1   MAYAYLAVSPSSMRGQFEKMAAAASNKITSTKDFKYPEIEEQALQFILDSCESSISKYNK 60

Query: 435 FLQV 438
           FLQ 
Sbjct: 61  FLQA 64


>gi|400596811|gb|EJP64567.1| histone-lysine N-methyltransferase [Beauveria bassiana ARSEF 2860]
          Length = 406

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 168/370 (45%), Gaps = 27/370 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           +  WL+ SG     + +  +D+ +     RG+ A +  ++ E++L +P + + T    ++
Sbjct: 1   MDAWLNKSG----AVGLGDLDLADFPETGRGVKAQRPFKEDERILTIPANCLWTVKGAYA 56

Query: 134 CPEAGEVLKQC----SVPDWPLLATYLI---SEASFEKSSRWSNYISALPRQPYSLLYWT 186
            P  G VL+      SV D   LA Y++   S       +    +++ LP +    +Y+T
Sbjct: 57  DPLFGPVLQSVQPPLSVED--TLALYILFVRSRGEDPAYAERQTHVAMLPSEYTLSMYFT 114

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
             EL R    S +          V   Y  L   +F ++ DLFP + F+ + +KW+   +
Sbjct: 115 DEEL-RVCAGSSLYTLTTHLRGRVGDDYKKLLTGVFMRHRDLFPLDKFSFQHYKWALSSI 173

Query: 247 FSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +SR +     +G     + P+ADMLNH+ + +    YD S+  +     R Y+ G+QVFI
Sbjct: 174 WSRGMDFTISEGNSVRLMAPFADMLNHASDAKQCHAYDPSTGSLTVLACRDYEVGDQVFI 233

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
            YG  SN  LL  YGFV  +  NP+D+ EL L        Y++K    +  GL      P
Sbjct: 234 YYGNVSNSRLLRLYGFVLPD--NPNDNYELVLQTSSMAPLYEQKQRLWKLAGLDEISTIP 291

Query: 365 IQITGWPLELMAYAYLVV---SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFI 421
           + +   PL      YL +       +     ++A  +  K++ + + +      Q+++ +
Sbjct: 292 LSLQN-PLPDSVLRYLRIQRLDASDLGTMTMQIATESYTKISDENESQILLFLSQSIEAL 350

Query: 422 LDSCESSISK 431
           L+  E S+ K
Sbjct: 351 LEGFEISLEK 360


>gi|225462926|ref|XP_002267249.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic [Vitis
           vinifera]
 gi|296087793|emb|CBI35049.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 172/374 (45%), Gaps = 34/374 (9%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
             T  KWL D G+   K  ++   V E  GLVA ++I + E +L VP    I  D+    
Sbjct: 51  VQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWINPDAV--- 107

Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             A E+   C  +  W  +A +LI E      S W +Y+  LP    S +YW+  EL   
Sbjct: 108 -AASEIGSVCGGLKPWVSVALFLIRE-KLRDESPWRSYLDILPEYTNSTIYWSEEELVE- 164

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
           ++ +Q+    +     V   +  +   +   +  LFP  V  ++ F W+FGIL SR   R
Sbjct: 165 IQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPV-TLDDFLWAFGILRSRAFSR 223

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPGEQVF 303
           L   +  + L+P AD++NHS  + T  +Y    +G  +F+ D+ +        + GEQV 
Sbjct: 224 LRGQN--LVLIPLADLINHSPSITT-EEYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVL 280

Query: 304 ISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           I Y   KSN EL L YGF+  E     +S  L L + +SD  + +KL+     GLS    
Sbjct: 281 IQYDLDKSNAELALDYGFI--ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAY 338

Query: 363 FPIQI-TGWPLELMAYAYLVV--SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL- 418
           F I +    P  ++ Y  LV    P +       + +   N +    ++     +E+ + 
Sbjct: 339 FDIVLGQSLPAAMLPYLRLVALGGPDAFL-----LESIFRNTIWGHLELPVSRANEELIC 393

Query: 419 QFILDSCESSISKY 432
           Q I D+C+S++S Y
Sbjct: 394 QVIQDACKSALSGY 407


>gi|147843303|emb|CAN82664.1| hypothetical protein VITISV_015206 [Vitis vinifera]
          Length = 507

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 172/374 (45%), Gaps = 34/374 (9%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
             T  KWL D G+   K  ++   V E  GLVA ++I + E +L VP    I  D+    
Sbjct: 51  VQTFWKWLFDQGVVSGKTPVKPGIVPEGLGLVAQRDIARNEAVLEVPKRFWINPDAV--- 107

Query: 135 PEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             A E+   C  +  W  +A +LI E      S W +Y+  LP    S +YW+  EL   
Sbjct: 108 -AASEIGSVCGGLKPWVSVALFLIRE-KLRDESPWRSYLDILPEYTNSTIYWSEEELVE- 164

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-LVR 252
           ++ +Q+    +     V   +  +   +   +  LFP  V  ++ F W+FGIL SR   R
Sbjct: 165 IQGTQLSNTTLGVKEYVQSEFLKVEEEVILPHSQLFPFPV-TLDDFLWAFGILRSRAFSR 223

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPGEQVF 303
           L   +  + L+P AD++NHS  + T  +Y    +G  +F+ D+ +        + GEQV 
Sbjct: 224 LRGQN--LVLIPLADLINHSPSITTE-EYAWEIKGAGLFSRDQLFSLRTPVSVKAGEQVL 280

Query: 304 ISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           I Y   KSN EL L YGF+  E     +S  L L + +SD  + +KL+     GLS    
Sbjct: 281 IQYDLDKSNAELALDYGFI--ESRPNRNSYTLTLEISESDPFFGDKLDIAESNGLSEIAY 338

Query: 363 FPIQI-TGWPLELMAYAYLVV--SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL- 418
           F I +    P  ++ Y  LV    P +       + +   N +    ++     +E+ + 
Sbjct: 339 FDIVLGQSLPAAMLPYLRLVALGGPDAFL-----LESIFRNTIWGHLELPVSRANEELIC 393

Query: 419 QFILDSCESSISKY 432
           Q I D+C+S++S Y
Sbjct: 394 QVIQDACKSALSGY 407


>gi|260835124|ref|XP_002612559.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
 gi|229297937|gb|EEN68568.1| hypothetical protein BRAFLDRAFT_219602 [Branchiostoma floridae]
          Length = 327

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 89/320 (27%), Positives = 159/320 (49%), Gaps = 25/320 (7%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           S +++  L +WL  +G     + +       RG+++ +N+++G+ ++ +P +L+IT  + 
Sbjct: 2   SRDDSIQLMRWLRRNGFRDSHLVLTDFPDTGRGVMSTRNLKEGDCIVSLPENLLITTTTV 61

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
            +    G+ +K       P  +L+ YLI+E S  K S W  YI  LP    +  Y++ AE
Sbjct: 62  VNS-HLGQYIKTWKPRLTPKQVLSLYLIAEKSRGKDSFWYPYIQTLPTSYTTPSYFSTAE 120

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILF 247
           +D     + +RE  +     +  +Y  L+  + +  P LFP+   VF +++++W++  ++
Sbjct: 121 VDAL--PALVREATLRHRKVLQNSYKSLQTSLHNLEP-LFPDWKTVFTLKSYRWAWATVY 177

Query: 248 SRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
           +R V           PS     AL P+ DMLNHS  V+T  D++ SS+     T+   + 
Sbjct: 178 TRSVYKRGPGWEFLDPSDPDVYALAPFLDMLNHSPLVQTDTDFNVSSKCYEVKTEGACRK 237

Query: 299 GEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKK----SDKCYKEKLEALR 353
             QVFI+Y    NG LL+ YGFV PR   NP   V    ++K+    S    ++K+E L 
Sbjct: 238 YRQVFINYDPYDNGRLLMEYGFVMPR---NPHSVVTFTAAVKQNGLSSKNLLQKKMELLS 294

Query: 354 KYGLSASECFPIQITGWPLE 373
           +  L+ +     +   W L+
Sbjct: 295 QENLTVNLSCSGEGLSWRLQ 314


>gi|302847476|ref|XP_002955272.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
 gi|300259344|gb|EFJ43572.1| hypothetical protein VOLCADRAFT_76643 [Volvox carteri f.
           nagariensis]
          Length = 488

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/336 (27%), Positives = 151/336 (44%), Gaps = 26/336 (7%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERG-----LVALKNIRKGEKLLFVPPSLVI 126
           ++  AS L  WL ++G     + ++ +DV   G     +VA +++  GE  L VP  L +
Sbjct: 37  AVHTASELVDWLRENGAKIDAVEVKTMDVPSAGRPLDVVVAGRSLAAGEVALSVPERLCL 96

Query: 127 TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL-------PRQP 179
           T D  +      E+L    + +   LA YL+ E   +K S W  YI  L       P+  
Sbjct: 97  TLDRIFESEFVAELLTTDKLSELACLALYLMYEKKLKKKSFWYPYIKELDKQQARGPQAA 156

Query: 180 YSLLYWTRAELDRYLEASQI------RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
            S L W   ELD  L+ S +      R+  I +    + T   +   +F+KYP   P E 
Sbjct: 157 ESPLLWGDQELDSLLKGSPLLPAVRQRQAGIRKEYEALDTVWFMAGSLFNKYPFDLPTET 216

Query: 234 FNMETFKWSFGILFSRLVRLPS--MDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVF 290
           F+ E F+ +F ++ + +V L    +  R ALVP    ++ +S   +  + YD+ S+ V  
Sbjct: 217 FSFELFQQAFAVVQASIVHLQGVPIAKRFALVPLGPPLMAYSSTSKNMMTYDEDSRSVRL 276

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE--LPLSLKKSDKCYKEK 348
                 + G  V    G + N  LLL+YG V  +  NP D ++     +L  SD  +  K
Sbjct: 277 VVSGPVEAGRPVAAWCGPQPNSRLLLNYGVV--DEHNPFDKLQARFTFTLPTSDPLFPAK 334

Query: 349 LEALRKYGLSASECFPIQIT-GWPLELMAYAYLVVS 383
              L + GL+  + F + +    P +L+ Y  L ++
Sbjct: 335 RAVLSEAGLATQQSFDVSVARPLPPQLLPYMMLALA 370


>gi|3403234|gb|AAC29136.1| ribulose-1,5-bisphosphate carboxylase/oxygenase N-methyltransferase
           [Spinacia oleracea]
 gi|3403238|gb|AAC29138.1| ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit
           N-methyltransferase II [Spinacia oleracea]
          Length = 495

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 116/393 (29%), Positives = 174/393 (44%), Gaps = 33/393 (8%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT 127
           E D+         WLSD G+   K  ++   V E  GLVA K+I + E +L VP    I 
Sbjct: 49  ETDTPPEIQKFWGWLSDKGIISPKCPVKPGIVPEGLGLVAQKDISRNEVVLEVPQKFWIN 108

Query: 128 ADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
            D+      A E+   C+ +  W  +A +L+ E     SS W  YI  LP    S +YW+
Sbjct: 109 PDTV----AASEIGSVCNGLKPWVSVALFLMREKKLGNSSSWKPYIDILPDSTNSTIYWS 164

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
             EL   L+ SQ+    +     V   +  L   +   +  LFP +V   + F W+FG+L
Sbjct: 165 EEELSE-LQGSQLLNTTLGVKELVANEFAKLEEEVLVPHKQLFPFDV-TQDDFFWAFGML 222

Query: 247 FSRLVRLPSMDGR-VALVPWADM----LNHSCEVETFLDYDKSSQG-------VVFTTDR 294
            SR      ++G+ + L+P AD+     NHS ++ T   Y    +G       +VF+  R
Sbjct: 223 RSR--AFTCLEGQSLVLIPLADLWVQQANHSPDI-TAPKYAWEIRGAGLFSRELVFSL-R 278

Query: 295 QYQP---GEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
              P   G+QV I Y   KSN EL L YG    E  +  ++  L L + +SD  Y +KL+
Sbjct: 279 NPTPVKAGDQVLIQYDLNKSNAELALDYGLT--ESRSERNAYTLTLEIPESDSFYGDKLD 336

Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
                G+  S  F I +   PL      YL +     +  F  + +   N +    D+  
Sbjct: 337 IAESNGMGESAYFDI-VLEQPLPANMLPYLRLVALGGEDAF-LLESIFRNSIWGHLDLPI 394

Query: 411 -PEIDEQALQFILDSCESSISKYSRFLQVKELL 442
            P  +E   Q I D+C S++S YS  +   E L
Sbjct: 395 SPANEELICQVIRDACTSALSGYSTTIAEDEKL 427


>gi|217038301|gb|ACJ76599.1| SET domain-containing protein 3 (predicted) [Oryctolagus cuniculus]
          Length = 394

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 143/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S +G   +   +   +    GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASANGASVEGFEVVNFEEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     R+   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R +  GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFHAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|357497055|ref|XP_003618816.1| SET domain protein [Medicago truncatula]
 gi|355493831|gb|AES75034.1| SET domain protein [Medicago truncatula]
          Length = 501

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/352 (28%), Positives = 163/352 (46%), Gaps = 32/352 (9%)

Query: 38  HRIVVHCSVSTTNDASRTKTTVTQN---MIPWGCEIDSLENASTLQKWLSDSGLPPQKMA 94
           HR+    S+ST +   R       N   ++    +    E+   L+ W+  +GLPP K+ 
Sbjct: 26  HRLPSFLSLSTNHRRRRRSFCSASNSDTLVAATGKKKRDEDDGDLKTWMHKNGLPPCKVV 85

Query: 95  IQK---VDVGERGL---VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
           ++    +D   + +    A ++++KG+    VP SLV+T +         E+L      +
Sbjct: 86  LKDKPSLDDSVKPIHYVAASEDLQKGDIAFSVPNSLVVTLERVLGNETIAELLTTNKFSE 145

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAELDRYLEASQIRE 201
              LA YL+ E    K S W  YI  L RQ         S L W+ +EL  YLE S +++
Sbjct: 146 LACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESEL-AYLEGSPLKD 204

Query: 202 RAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP- 254
             ++RI  +   YN+L         +F +YP   P E F  E FK +F  + S +V L  
Sbjct: 205 EIVKRIEGIRKEYNELDTVWFMSGSLFQQYPYDLPTEAFPFEIFKQAFAAVQSCVVHLQN 264

Query: 255 -SMDGRVALVPWA-DMLNHSCEVETFLD-YDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
            S+  R ALVP    +L +    +  L   D + Q VV   DR Y+ G+ + +  G + N
Sbjct: 265 VSLARRFALVPLGPPLLAYCSNCKAMLTAVDGAVQLVV---DRPYKAGDPIVVWCGPQPN 321

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
            +LL +YGFV  + +N    VE+ LS +  D  Y++K    ++ G  + + F
Sbjct: 322 TKLLTNYGFVDEDNSNDRLIVEVALSTE--DPQYQDKRIVAQRNGKLSIQTF 371


>gi|326496433|dbj|BAJ94678.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 453

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 169/347 (48%), Gaps = 43/347 (12%)

Query: 109 KNIRKGEKLLFVPPSLVITADSKWS------CPEAGEVLKQCSVPDWPLLATYLISEASF 162
           +N+ +GE +  VP  L + AD+  +      C   G++        W  ++  ++ EA+ 
Sbjct: 51  RNLPRGEVVAEVPKKLWLDADAVAASVLGRVCGSGGDLRP------WVSVSLLILREAAR 104

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
              S W+ Y++ LPRQ  S ++W+  EL   ++ +Q+    +     V   ++++   I 
Sbjct: 105 GGDSLWAPYLAILPRQTDSTIFWSEEEL-LEIQGTQLLSTTMGVKEYVQSEFDNVEAGII 163

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVET---- 277
           +   DLFP  +   + F W+FG+L SR+   P + G ++AL+P+AD++NH  ++ +    
Sbjct: 164 NVNKDLFPGTI-TFDDFLWAFGVLRSRV--FPELRGDKLALIPFADLINHDGDITSKESC 220

Query: 278 -------FLDYDKSSQGVVFT--TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
                  FL  D      VF+  T    + GEQ+++ Y   KSN EL L YGF   E  +
Sbjct: 221 WEIKGKGFLGRD-----TVFSLRTPVDVKSGEQIYVQYDLDKSNAELALDYGFT--ESNS 273

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI-TGWPLELMAYAYLVVSPPS 386
             DS  L L + +SD  Y++KL+     G+  +  F + +    P +++ Y  L+    +
Sbjct: 274 SRDSYTLTLEISESDPFYEDKLDIAELNGMGETAYFDVVLGESLPPQMITYLRLLCLGGT 333

Query: 387 MKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKY 432
                E   A   NK+    ++     +E+++ Q I ++C+S+++ Y
Sbjct: 334 DAFLLE---ALFRNKVWEHLELPVSRDNEESICQVIQNACKSALAAY 377


>gi|7573451|emb|CAB87765.1| putative protein [Arabidopsis thaliana]
          Length = 537

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 90/334 (26%), Positives = 148/334 (44%), Gaps = 45/334 (13%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVIT 127
           +++  L+ W+  +GLPP K+ +++    ++       + A ++++KG+    VP SLV+T
Sbjct: 78  DDSEDLKFWMDKNGLPPCKVILKERPAHDQKHKPIHYVAASEDLQKGDVAFSVPDSLVVT 137

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PY 180
            +         E+L    + +   LA YL+ E    K S W  YI  L RQ         
Sbjct: 138 LERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSVWYPYIRELDRQRGRGQLDAE 197

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVF 234
           S L W+ AELD YL  S  +   +ER   +   YN+L         +F +YP   P E F
Sbjct: 198 SPLLWSEAELD-YLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPFDIPTEAF 256

Query: 235 NMETFKWSFGILFSRLVRLP--------------------SMDGRVALVPWA-DMLNHSC 273
           + E FK +F  + S +V L                      +  R ALVP    +L +  
Sbjct: 257 SFEIFKQAFVAIQSCVVHLQVVLVASSNLDCYASSCTQNVGLARRFALVPLGPPLLAYCS 316

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
             +  L        V    DR Y+ G+ + +  G + N +LLL+YGFV  +  NP D V 
Sbjct: 317 NCKAML--TAVDGAVELVVDRPYKAGDPIVVWCGPQPNAKLLLNYGFVDED--NPYDRVI 372

Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           +  +L   D  Y++K    ++ G  + + F +++
Sbjct: 373 VEAALNTEDPQYQDKRMVAQRNGKLSQQVFQVRV 406


>gi|431839268|gb|ELK01195.1| SET domain-containing protein 3 [Pteropus alecto]
          Length = 805

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 91/319 (28%), Positives = 149/319 (46%), Gaps = 28/319 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           L KW S++G   +    + VD  E G  L A ++I+  E  L+VP  L++T +S      
Sbjct: 261 LMKWASENGASVE--GFEMVDFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA----- 313

Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
              VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E
Sbjct: 314 KNSVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDE 372

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
           + RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   +
Sbjct: 373 V-RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSV 430

Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++
Sbjct: 431 MTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIY 489

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F
Sbjct: 490 IFYGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF 547

Query: 364 PIQITGWPLELMAYAYLVV 382
            +  T  P+     A+L V
Sbjct: 548 ALHFTEPPISAQLLAFLRV 566


>gi|426248573|ref|XP_004018037.1| PREDICTED: LOW QUALITY PROTEIN: histone-lysine N-methyltransferase
           setd3 [Ovis aries]
          Length = 596

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 89  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 143

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 144 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     R+   +P       ++ F  E ++W+   + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394


>gi|449455876|ref|XP_004145676.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
 gi|449492872|ref|XP_004159127.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cucumis
           sativus]
          Length = 521

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 89/307 (28%), Positives = 141/307 (45%), Gaps = 27/307 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
           L+ W+ D+GLPP K+ +++    ++       + A +++  G+    VP SLV+T +   
Sbjct: 90  LKAWMHDNGLPPCKVILEEKPSHDKNHRPIHYVAASEDLEVGDVAFSVPNSLVVTLERVL 149

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W
Sbjct: 150 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLW 209

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           +  ELD YL  S  ++  +ER   +   YN+L         +F +YP   P E F+ E F
Sbjct: 210 SEDELD-YLSGSPTKKEVLERAEGIKKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 268

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L +    +  L        V    DR Y
Sbjct: 269 KQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAML--TAVDGAVELVVDRPY 326

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE + +  G + N +LLL+YGFV  +  N  D + +  +L   D  Y++K    ++ G
Sbjct: 327 KAGESIAVWCGPQPNSKLLLNYGFVDED--NRYDRLVVEAALNTEDPQYQDKRMVAQRNG 384

Query: 357 LSASECF 363
             + + F
Sbjct: 385 RLSIQAF 391


>gi|443722302|gb|ELU11224.1| hypothetical protein CAPTEDRAFT_181634 [Capitella teleta]
          Length = 541

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 153/319 (47%), Gaps = 17/319 (5%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           +N      WL  + +  + + IQ  DVG  G+ A ++ ++GE  L +P S+++T D+  +
Sbjct: 75  KNFDGFMGWLKSNSVDAEAVEIQHFDVGGYGIKATRDFKEGELFLAIPRSVMMTTDTAKN 134

Query: 134 CPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
                 +     +   P  LLA +++ E     +S W  Y+  LP    S LY+   +L 
Sbjct: 135 SALGALIADNRILQTMPNILLALHVLCELC-SPASFWLPYLKILPHSYSSPLYFNPEDL- 192

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKY--PDLFPEEVFNM--ETFKWSFGILF 247
           + L+AS      I +  N+   Y      +F  +      P +V N+  + ++W+   + 
Sbjct: 193 QLLKASPTLSEMINQFRNITRQYAYF-FNLFQGHELASKLPIQVKNICYDDYRWAVSSVM 251

Query: 248 SRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVF 303
           +R  ++P++DG+    AL+P  DM NH+   +   D+  K+ +   F+ +     G QVF
Sbjct: 252 TRQNQIPTLDGQRMISALIPLWDMCNHT-NGQITTDFSLKNDRSECFSLEGTV-AGAQVF 309

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           I YG +SN ELL+  GFV  +  N SD + + L + K+D  +  K E L +  + AS  F
Sbjct: 310 IFYGSRSNAELLIHNGFVYPQ--NHSDRLTIRLGISKNDPLFSMKSEVLSRLSMQASRLF 367

Query: 364 PIQITGWPLELMAYAYLVV 382
            +     P++    A+L V
Sbjct: 368 SLHCGVNPVDSDTLAFLRV 386


>gi|356571407|ref|XP_003553868.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 1 [Glycine max]
 gi|356571409|ref|XP_003553869.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           isoform 2 [Glycine max]
          Length = 502

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/309 (28%), Positives = 141/309 (45%), Gaps = 27/309 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
           L+ W+   GLPP K+ ++              + A ++++ G+    VP SLV+T +   
Sbjct: 71  LKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERVL 130

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W
Sbjct: 131 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLW 190

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
            ++ELD YL  S I++  I+R   +   YN+L         +F +YP   P E F+ E F
Sbjct: 191 LKSELD-YLSGSPIKDEVIQREEAIRKEYNELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L++    +  L        V    DR Y
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAML--TAVDGAVELAVDRPY 307

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + G+ + +  G + N +LL++YGFV    +N  D + +  +L   D  Y++K    ++ G
Sbjct: 308 KAGDPIVVWCGPQPNSKLLINYGFVDENNSN--DRLIVEAALNTEDPQYQDKRMVAQRNG 365

Query: 357 LSASECFPI 365
             + + F +
Sbjct: 366 KLSVQVFHV 374


>gi|291411315|ref|XP_002721936.1| PREDICTED: SET domain containing 3 [Oryctolagus cuniculus]
          Length = 591

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 89/317 (28%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S +G   +   +   +    GL A + I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASANGASVEGFEVVNFEEEGFGLRATREIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     R+   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R +  GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFHAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|242049248|ref|XP_002462368.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
 gi|241925745|gb|EER98889.1| hypothetical protein SORBIDRAFT_02g024510 [Sorghum bicolor]
          Length = 489

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 100/350 (28%), Positives = 169/350 (48%), Gaps = 39/350 (11%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-----VPDWPLLATYLIS 158
           GLVA +++ +GE +  VP  L + AD+      A ++ + C      +  W  +A  L+S
Sbjct: 82  GLVAARDLPRGEVVAEVPKKLWMDADAV----AASDIGRACGGGGGGLRPWVAVALLLLS 137

Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI----GTY 214
           E +    S W+ Y++ LPRQ  S ++         L+ S +R + +     V       +
Sbjct: 138 EVARGADSPWAPYLAILPRQTDSTIFCAG------LKKSSLRYKLLSTTVGVKEYVQSEF 191

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSC 273
           + ++  I S+  DLFP  +   + F W+FGIL SR+   P + G ++ALVP+AD++NHS 
Sbjct: 192 DSVQAEIISRNKDLFPGSI-TFDDFLWAFGILRSRV--FPELRGDKLALVPFADLVNHSP 248

Query: 274 EVET-FLDYDKSSQGVV-------FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPRE 324
           ++ +    ++   +G+          T    + G+Q++I Y   KSN EL L YGFV   
Sbjct: 249 DITSEGSSWEIKGKGLFGREPMFSLRTPVDVKSGQQIYIQYDLDKSNAELALDYGFVE-- 306

Query: 325 GTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVS 383
            +NPS DS  + L + +SD  Y +KL+      L  +  F I I   PL      YL + 
Sbjct: 307 -SNPSRDSYTVTLEISESDPFYGDKLDIAELNELGETAYFDI-ILDEPLPPQMLPYLRLL 364

Query: 384 PPSMKGKFEEMAAAASNKMTSKKDIKC-PEIDEQALQFILDSCESSISKY 432
                  F  + A   N +    ++   P+ +E   Q + D+C+S+++ Y
Sbjct: 365 CIGGTDAF-ILEALFRNSVWGHLELPLSPDNEESICQVMRDACKSALAAY 413


>gi|297849804|ref|XP_002892783.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297338625|gb|EFH69042.1| hypothetical protein ARALYDRAFT_471564 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 482

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 183/389 (47%), Gaps = 40/389 (10%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
           EN     KWL D G+   K   +   V E  GLVA ++I + E +L +P  L       W
Sbjct: 47  ENVRNFWKWLGDQGVVSGKSPAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99

Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
             PE   A ++   C  +  W  +A +LI E  +E+ S W  Y+  LP+   S ++W+  
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRLYLDMLPQSTDSTVFWSEE 158

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           EL   L+ +Q+    +     V   +  L   I     DLF   +  ++ F W+FGIL S
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKS 216

Query: 249 R-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QP 298
           R   RL   +  + L+P AD++NH+  + T  DY    +G  +F+ D  +        + 
Sbjct: 217 RAFSRLRGQN--LVLIPLADLINHNPAITTE-DYAYEIKGAGLFSRDLLFSLKSPVYVKA 273

Query: 299 GEQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYG 356
           GEQV+I Y   KSN EL L YGFV    +NP+ +S  L + + +SD  + +KL+      
Sbjct: 274 GEQVYIQYDLNKSNAELALDYGFVE---SNPNRNSYTLTIEIPESDPFFGDKLDIAETNK 330

Query: 357 LSASECFPIQITG--WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
           +  +  F + + G   P  ++ Y  LV    S     E +    +N +    ++     +
Sbjct: 331 MGETGYFDV-VDGQTLPAGMLQYLRLVALGGSDAFLLESI---FNNTIWGHLELPVSRSN 386

Query: 415 EQAL-QFILDSCESSISKYSRFLQVKELL 442
           E+ + + + D+C+S++S +S  ++  E L
Sbjct: 387 EELICRVVRDACKSALSGFSTTIEEDEKL 415


>gi|15223054|ref|NP_172856.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
 gi|17369870|sp|Q9XI84.1|RBCMT_ARATH RecName: Full=[Fructose-bisphosphate aldolase]-lysine
           N-methyltransferase, chloroplastic; AltName:
           Full=Aldolases N-methyltransferase; AltName:
           Full=[Ribulose-bisphosphate carboxylase]-lysine
           N-methyltransferase-like; Short=AtLSMT-L;
           Short=LSMT-like enzyme; Flags: Precursor
 gi|5080779|gb|AAD39289.1|AC007576_12 Putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|28973755|gb|AAO64193.1| putative ribulose-1,5 bisphosphate carboxylase oxygenase large
           subunit N-methyltransferase [Arabidopsis thaliana]
 gi|332190979|gb|AEE29100.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase
           [Arabidopsis thaliana]
          Length = 482

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 180/387 (46%), Gaps = 36/387 (9%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
           EN     KWL D G+   K   +   V E  GLVA ++I + E +L +P  L       W
Sbjct: 47  ENVRNFWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99

Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
             PE   A ++   C  +  W  +A +LI E  +E+ S W  Y+  LP+   S ++W+  
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRVYLDMLPQSTDSTVFWSEE 158

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           EL   L+ +Q+    +     V   +  L   I     DLF   +  ++ F W+FGIL S
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGILKS 216

Query: 249 R-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QP 298
           R   RL   +  + L+P AD++NH+  ++T  DY    +G  +F+ D  +        + 
Sbjct: 217 RAFSRLRGQN--LVLIPLADLINHNPAIKT-EDYAYEIKGAGLFSRDLLFSLKSPVYVKA 273

Query: 299 GEQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYG 356
           GEQV+I Y   KSN EL L YGFV    +NP  +S  L + + +SD  + +KL+      
Sbjct: 274 GEQVYIQYDLNKSNAELALDYGFVE---SNPKRNSYTLTIEIPESDPFFGDKLDIAESNK 330

Query: 357 LSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQ 416
           +  +  F I + G  L      YL +        F  + +  +N +    ++     +E+
Sbjct: 331 MGETGYFDI-VDGQTLPAGMLQYLRLVALGGPDAF-LLESIFNNTIWGHLELPVSRTNEE 388

Query: 417 AL-QFILDSCESSISKYSRFLQVKELL 442
            + + + D+C+S++S +   ++  E L
Sbjct: 389 LICRVVRDACKSALSGFDTTIEEDEKL 415


>gi|110331827|gb|ABG67019.1| hypothetical protein LOC84193 [Bos taurus]
          Length = 488

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 89  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 143

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 144 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P       ++ F  E ++W+   + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394


>gi|380015248|ref|XP_003691619.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Apis
           florea]
          Length = 483

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 169/362 (46%), Gaps = 22/362 (6%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           WL ++G      ++ +    + GL A +N  + E +L +P  L+ +  +  + PE   + 
Sbjct: 87  WLKENGANVDGASVAEFPGYDLGLKAERNFLENELILRIPRGLIFSIHN--AAPELITLQ 144

Query: 142 KQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
               +   P   LA  L+ E   E +S+W  Y+  LP    ++LY T A++   L+ S  
Sbjct: 145 NDPLIQHMPQVALAIALLIERHKE-NSKWKPYLDILPTTYTTVLYMTAADMIE-LKGSPT 202

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLPSM 256
            E A+++  N+   Y+    ++F    +       +VF  E + W+   + +R   +PS 
Sbjct: 203 LEAALKQCRNIARQYSYFN-KVFQNNNNAVSAILRDVFTYERYCWAVSTVMTRQNLIPSE 261

Query: 257 DGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           DG     AL+P  DM NH     T  D++ +S        R ++ GEQ+FISYG ++N +
Sbjct: 262 DGSRMIHALIPMWDMCNHENGRIT-TDFNATSNYCECYALRDFKKGEQIFISYGPRTNSD 320

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
             +  GFV  E  N  D  +L L + K+D   KE++E L K  L     F +++   P+ 
Sbjct: 321 FFVHSGFVYME--NKQDGFKLRLGISKADSLQKERIELLNKLDLPTVGEFLLKLGTEPIS 378

Query: 374 LMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSIS 430
            +  A+L V   SM+ K E      S+++   K + C     ++E   +F+L   +  I+
Sbjct: 379 DLLLAFLRVF--SMR-KAELAHWIRSDRVNDLKHMDCALETVVEENVRKFLLTRLQLLIA 435

Query: 431 KY 432
            Y
Sbjct: 436 NY 437


>gi|168020073|ref|XP_001762568.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686301|gb|EDQ72691.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 427

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 88/306 (28%), Positives = 141/306 (46%), Gaps = 27/306 (8%)

Query: 83  LSDSGLPPQKMAI--QKVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
           + + GLP   +A+   ++  G++G     +VA ++++ G+  L VP SLV+T +      
Sbjct: 1   MEEQGLPKCNVALVEHQLAEGDKGKPIHYVVASQDLQPGDVALTVPKSLVVTLERVLGDE 60

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRA 188
              E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W+  
Sbjct: 61  TIAELLTTNKLSELACLALYLMYEKKQGKESYWYPYIRELDRQRGRGQLSVASPLLWSPE 120

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWS 242
           EL+ Y   S ++E  +ER+  +   Y +L         +F +YP   P E F+ E FK +
Sbjct: 121 ELNEYFTGSTMKEVVLERLAGIKREYEELDTVWFMAGSLFKQYPFDLPTEAFSFEIFKQA 180

Query: 243 FGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           F  + S +V L   S+  R ALVP    +L +    +  L        V    D  Y+ G
Sbjct: 181 FVAVQSCVVHLQGVSLARRFALVPLGPPLLAYKSNCKAML--KAVGDNVQLEVDHAYKTG 238

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           + + +  G + N +LLL+YGFV  +  NP D + +  SL   D  Y++K   ++K     
Sbjct: 239 DPIAVWCGPQPNSKLLLNYGFVDED--NPFDRLAVEASLNTEDPLYQQKRAVVQKNNRLT 296

Query: 360 SECFPI 365
            + F I
Sbjct: 297 IQTFQI 302


>gi|332321746|sp|B2KI88.1|SETD3_RHIFE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|183637154|gb|ACC64548.1| SET domain containing 3 isoform a (predicted) [Rhinolophus
           ferrumequinum]
          Length = 594

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVSFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFGEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|145356486|ref|XP_001422460.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
 gi|144582703|gb|ABP00777.1| chloroplast lysine N-methyltransferase [Ostreococcus lucimarinus
           CCE9901]
          Length = 529

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 26/282 (9%)

Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD----WPLLATYLISEASF 162
           A + + +G K + VP SL IT +   +  E G+ L+   V      W  LA  L+ E   
Sbjct: 93  ATRALARGAKAIVVPKSLWITPEVGMNDDELGKALRDEDVAGGLARWTTLALTLLKERER 152

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
            + S+++ Y+  LP   +S L+W   EL   ++ +Q+ + A      V G Y  LR  +F
Sbjct: 153 GEESKYAAYVKTLPEVLHSPLFWNAEELSE-IQGTQLLDNAAGYDGYVRGVYETLRTGMF 211

Query: 223 SKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-------- 273
           +K+ D+F  E  F+ + F+W+FGIL SR +  P     +ALVP  D++NHS         
Sbjct: 212 AKHADVFDVEGAFSEDNFRWAFGILRSRTM-APCDGANIALVPGVDLVNHSSLSQARWRV 270

Query: 274 ------EVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPRE 324
                  V       K   GV      DR     E ++++Y  + ++    L +GFV  +
Sbjct: 271 SGGVAGAVAGLFGGGKGDDGVSARVECDRALNVNEPLYVNYNPEGTDTSFALDFGFV--D 328

Query: 325 GTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
              PS    L LS+ + D    +KL+ L   GL  +  F ++
Sbjct: 329 TITPSPGYALSLSVPEDDPNVFDKLDVLDVCGLGETPTFTLR 370


>gi|338719872|ref|XP_001488117.2| PREDICTED: histone-lysine N-methyltransferase setd3-like [Equus
           caballus]
          Length = 609

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G       +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 96  LMKWASENGASVDGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 151

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 152 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 208

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     R+   +P  +  P ++ F  E ++W+   + +
Sbjct: 209 RYLQSTQAVHDVFSQYKNTARQYAYF-YRVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 267

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 268 RQNQIPTEDGSRVTLALIPLWDMCNHTTGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 326

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 327 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 384

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 385 HFTEPPISAQLLAFLRV 401


>gi|302821397|ref|XP_002992361.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
 gi|300139777|gb|EFJ06511.1| hypothetical protein SELMODRAFT_430576 [Selaginella moellendorffii]
          Length = 463

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 143/301 (47%), Gaps = 35/301 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L  WL   G      ++ K    +RGL A+++I+ GE +L V    ++TAD     P   
Sbjct: 40  LVSWLKIRG-EHDACSLLKTGPDKRGLFAVRDIKAGECILRVSRDTMMTADR---LPLEF 95

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
           + L    V +W  LA  L+ E    ++S W+ YIS LPR    +S  +W + EL   ++ 
Sbjct: 96  QQLLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWGTIHSTAFWRKEEL-AMIQE 154

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLP 254
           S +    + R   +   +N+++  IF +Y  +F   V +  +FK ++    + SR  R+ 
Sbjct: 155 SSLSYETMSRRAAIREEFNEMQ-PIFQRYEHVFGGPV-SYASFKHAYVTATVCSRAWRID 212

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV------VFT-------------TDRQ 295
            ++ ++A+VP+AD +NH       L YD  +         V++              D+ 
Sbjct: 213 GLE-KLAMVPFADFMNHDWSSNAMLTYDTDNGSTEVEEVKVYSDCLDIALFCAQLFADKN 271

Query: 296 YQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           Y  GEQV IS+G   N  L L +GF VP    NP D V+L L + + D   KEKL+ L  
Sbjct: 272 YAAGEQVTISFGPLCNASLALDFGFTVP---YNPWDKVQLWLGISRRDSLRKEKLQYLHA 328

Query: 355 Y 355
           +
Sbjct: 329 H 329


>gi|119914085|ref|XP_589822.3| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|297488270|ref|XP_002696879.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Bos taurus]
 gi|296475307|tpg|DAA17422.1| TPA: SET domain containing 3 [Bos taurus]
          Length = 601

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 89  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 144

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 145 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 201

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P       ++ F  E ++W+   + +
Sbjct: 202 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 260

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 319

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394


>gi|440907688|gb|ELR57800.1| SET domain-containing protein 3 [Bos grunniens mutus]
          Length = 594

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P       ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHAHKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTSGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|302804384|ref|XP_002983944.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
 gi|300148296|gb|EFJ14956.1| hypothetical protein SELMODRAFT_119151 [Selaginella moellendorffii]
          Length = 439

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 167/357 (46%), Gaps = 25/357 (7%)

Query: 90  PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
           P  + +++ ++G  GL A +++ K ++++ +P +L + AD+     E GE  +   +  W
Sbjct: 9   PGGVEVRRGELG-LGLFAKRSVSKNQEVVSIPKTLWMDADTV-RRSEIGECCE--GLRPW 64

Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
             +A YL+ E + +  S WS YI  LPR   S L+W+  EL   L+ +Q+          
Sbjct: 65  IAVALYLLHEKA-KPHSDWSAYIRVLPRTLDSPLFWSEEELAE-LKGTQLLSSMNGFKEF 122

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
           +   Y+ +   +    PD+F   ++ +E F W+FGIL SR    P +   +ALVP AD +
Sbjct: 123 LKREYDKVMTEVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFP-PLIGDNLALVPLADFV 181

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKK-SNGELLLSYGF 320
           NH   +       K     VF                 ++V I YGKK  N +L   YGF
Sbjct: 182 NHGFGLTNEDPGWKVKSAGVFARQETLTLQAAANCAEKQEVLIQYGKKKGNAQLATDYGF 241

Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWPLELMAYAY 379
           V  +  N  DS  L L +  S++   +K++  +  GL ++  F + +  G P +++AY  
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLR 301

Query: 380 LVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSISKYS 433
           L+    S     +     A  + T    ++ P   E +E   + +++ C +++ +YS
Sbjct: 302 LIALFGS-----DSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYS 353


>gi|302786274|ref|XP_002974908.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
 gi|300157067|gb|EFJ23693.1| hypothetical protein SELMODRAFT_102436 [Selaginella moellendorffii]
          Length = 389

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 161/372 (43%), Gaps = 70/372 (18%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL A + I  GE +L V   L+IT +     PE    L    V  W  LA +L++    
Sbjct: 1   RGLFASRPIHTGECMLHVSHDLMITPEK---LPEEVTKLLSKDVSAWAKLALFLLAHQKK 57

Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIER----------ITNV 210
           +++S W+ YIS LP     +S ++WT+ EL  YL+ S +    ++R            NV
Sbjct: 58  KETSAWAPYISCLPPFGSMHSTIFWTQDEL-VYLKVSPVYRETVQRKDVVRMEFAAAENV 116

Query: 211 IGTYNDLRL----RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWA 266
                 ++L    RI + Y  +     + +ET K                   +ALVP+ 
Sbjct: 117 CMLMQQVKLFVCSRILTDYITVC-SRAWGIETIK------------------SLALVPFV 157

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREG 325
           D  NH       L YD+        +DR Y  G+QV ISYG+ SN  L L +GF +P   
Sbjct: 158 DFFNHDANCRAMLSYDEDRHCAEVVSDRDYATGDQVVISYGQLSNATLALDFGFALP--- 214

Query: 326 TNPSDSVE-LPLSLKKSDKCYKEKLEALRKYGL----------SASECFPIQIT------ 368
            NP D V  + LSL + D     KL+ L  + +          +A   F +Q        
Sbjct: 215 FNPHDQVAGIWLSLSEKDPLRDSKLKLLHSHNMQTCVTREGVDTAGSSFSLQEVKSKAGR 274

Query: 369 --GWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE----QALQFIL 422
             G P  L A+A +V +  S   + +EMA  A++  T  +  + P ID+    +A+  + 
Sbjct: 275 GKGIPQTLRAFARVVCATTS--EELDEMAKFAAD--TDGRLARRPSIDKTKEHKAMTLLQ 330

Query: 423 DSCESSISKYSR 434
              ++ I K+ +
Sbjct: 331 TVIDNRIQKHEQ 342


>gi|328864871|gb|EGG13257.1| hypothetical protein DFA_11018 [Dictyostelium fasciculatum]
          Length = 1658

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 146/306 (47%), Gaps = 14/306 (4%)

Query: 79   LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
             +KWL+D G+   K+ I    D   RG+V  K + + E ++ VP   +I  D     P  
Sbjct: 1185 FEKWLTDGGVHFPKLQIANFNDSTGRGVVTTKKVEENECVVSVPRKFLINVDCARKHPVL 1244

Query: 138  GEVL--KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
              +L  +   + D  +L  ++I E     +S W  +   LP    + +++T  EL   LE
Sbjct: 1245 NSILFEEATGLNDDTILFLFVIYEKE-NPNSFWRPFFDTLPSYFPTSIHYTTTELLE-LE 1302

Query: 196  ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
             + + E  I+   ++      L   + ++YPD+FPE +F ME F W+  +  SR ++L  
Sbjct: 1303 GTNLFEETIQIKEHLESIRELLFPELSNQYPDVFPESLFTMENFLWARSLFDSRAIQL-K 1361

Query: 256  MDGRVA--LVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
            +DGR+   LVP ADM+NH  + +    Y D+ +      +        Q+F+ YG   + 
Sbjct: 1362 IDGRIVNCLVPMADMINHHDQAQISQRYFDQENDCFRMISCCNIPATSQIFLQYGALQSW 1421

Query: 313  ELLLSYGFVPREGTNPSDSVELPLSLKKSD--KCYKEKLEALRKYGLSASECFPIQITGW 370
            EL L YGFV     N  DSV +   + + D  +  +EK + L ++ L+    + +  +  
Sbjct: 1422 ELALYYGFVI--SNNHYDSVHIGFDMPEEDTPELREEKQKLLDRHLLTVDHHY-LHRSNI 1478

Query: 371  PLELMA 376
            P +L+A
Sbjct: 1479 PSKLLA 1484


>gi|343961019|dbj|BAK62099.1| SET domain containing 3 isoform a [Pan troglodytes]
          Length = 492

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|403274243|ref|XP_003928891.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Saimiri
           boliviensis boliviensis]
          Length = 513

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/315 (27%), Positives = 144/315 (45%), Gaps = 24/315 (7%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S         V
Sbjct: 2   KWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KNSV 56

Query: 141 LKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
           L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ RY
Sbjct: 57  LGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RY 114

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL 250
           L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +R 
Sbjct: 115 LQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQ 173

Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q GEQ++I YG
Sbjct: 174 NQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIFYG 232

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  
Sbjct: 233 TRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 290

Query: 368 TGWPLELMAYAYLVV 382
           T  P+     A+L V
Sbjct: 291 TEPPISAQLLAFLRV 305


>gi|171678927|ref|XP_001904412.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937534|emb|CAP62192.1| unnamed protein product [Podospora anserina S mat+]
          Length = 466

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/295 (29%), Positives = 146/295 (49%), Gaps = 21/295 (7%)

Query: 72  SLENASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           +L   ++++ WL  SG +    + +    V  RG+ AL+  +KGE++L +P  ++ T + 
Sbjct: 16  TLSRPNSMESWLKLSGAVGLDDLELADFPVTGRGVRALRRFKKGERILTIPCGVLWTVEH 75

Query: 131 KWSCPEAGEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYW 185
            ++ P  G  L+      SV D  +LATY++   S E       ++++ALP    S +++
Sbjct: 76  AFADPLLGPALRSARPPLSVED--ILATYILFIRSRESGYDGLRSHVAALPTSYSSSIFF 133

Query: 186 TRAELDRYLEASQIR-ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           ++ EL+     S     + ++R  ++   Y  L + + +++ DL P + F +E   W+  
Sbjct: 134 SKDELEVCAGTSLYTITKQLDR--SIDDDYRALVVGVLAQHRDLLPLDKFTIE--DWALC 189

Query: 245 ILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
            ++SR +     DG     L P+ADMLNHS EV+    YD SS  +     + Y+ G+Q 
Sbjct: 190 TVWSRAMDFALPDGNSIRLLAPFADMLNHSSEVKPCHVYDVSSGNLSVLAGKDYEAGDQA 249

Query: 303 FISYGKKSNGELLLSYGFVPRE----GTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           FISYG   N  LL  YGFV +           +  +PL+L  +D   K  L  LR
Sbjct: 250 FISYGPIPNSRLLRLYGFVQKHKLWVSAGLDSTCTIPLTL--TDPLPKNVLRYLR 302


>gi|356511552|ref|XP_003524489.1| PREDICTED: probable ribulose-1,5 bisphosphate carboxylase/oxygenase
           large subunit N-methyltransferase, chloroplastic-like
           [Glycine max]
          Length = 503

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 141/309 (45%), Gaps = 27/309 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
           L+ W+   GLPP K+ ++              + A ++++ G+    VP SLV+T +   
Sbjct: 71  LKSWMHKHGLPPCKVVLKDKPCPNDSHKPIHYVAASQDLQVGDVAFSVPNSLVVTLERVL 130

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W
Sbjct: 131 GNETVAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLSVESPLLW 190

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           +++ELD YL  S I++  I+R   +   Y +L         +F +YP   P E F+ E F
Sbjct: 191 SKSELD-YLSGSPIKDEVIQREEAIRKEYKELDTVWFMAGSLFQQYPYDIPTEAFSFEIF 249

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           K +F  + S +V L   S+  R ALVP    +L++    +  L        V    DR Y
Sbjct: 250 KQAFAAIQSCVVHLQKVSLARRFALVPLGPPLLSYQSNCKAML--TAVDGAVELAVDRPY 307

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + G+ + +  G + N +LL++YGFV    +N  D + +  +L   D  Y++K    ++ G
Sbjct: 308 KAGDPIVVWCGPQPNSKLLINYGFVDENNSN--DRLIVEAALNTEDPQYQDKRMVAQRNG 365

Query: 357 LSASECFPI 365
             + + F +
Sbjct: 366 KLSVQVFHV 374


>gi|346474100|gb|AEO36894.1| hypothetical protein [Amblyomma maculatum]
          Length = 459

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/318 (29%), Positives = 149/318 (46%), Gaps = 37/318 (11%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KW S++G     +AI+    G+ GLVA + I +  + L +P  LV+T  S     + G +
Sbjct: 49  KWCSENGAYLGSVAIKDRPDGDYGLVAEEKIEESMQFLGIPMKLVMTTASARKS-KLGPL 107

Query: 141 LKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           L+     D P+        LA +LI E S  +SS W  YIS LP    ++LY+   EL+ 
Sbjct: 108 LR-----DDPIMKSMSNVALAIFLILELSAGESSFWHPYISVLPDSFNTVLYFNIEELE- 161

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSR 249
            L  S + + A++   ++   Y     +IF  +P    L  ++ F  + ++W+   + +R
Sbjct: 162 LLSGSAVLDEALKLHRSIARQYAYFH-KIFRTHPLAKSLPFKDCFTYDLYRWAVSAVMTR 220

Query: 250 LVRLP------------SMDGRVA---LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
              +P             +DG  A   LVP  DM NHS + +   DYD S+  V     R
Sbjct: 221 QNAVPWTESDGLGGDDVEIDGTAAVTALVPLWDMCNHS-DGKVLTDYDSSASMVRCYAMR 279

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
            +  GE+V I YGK++N E  +  GFV  +  N  D+V++ L + K D  +  K +    
Sbjct: 280 DFDKGEEVTIFYGKRTNAEFFIHNGFVFED--NRYDAVDIKLGVSKKDPLFAVKSKLCED 337

Query: 355 YGLSASECFPIQITGWPL 372
           + LS S  F +     P+
Sbjct: 338 HDLSLSGTFALVARDRPV 355


>gi|332321744|sp|B5FW36.1|SETD3_OTOGA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|197215622|gb|ACH53017.1| SET domain containing 3 isoform a (predicted) [Otolemur garnettii]
          Length = 595

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI +LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|296215874|ref|XP_002754318.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Callithrix jacchus]
          Length = 610

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 97  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 152

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 153 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 209

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 210 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 268

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 269 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 327

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 328 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 385

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 386 HFTEPPISAQLLAFLRV 402


>gi|332320543|sp|B0VX69.2|SETD3_CALJA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|168986666|gb|ACA35060.1| SET domain containing 3 isoform a (predicted) [Callithrix jacchus]
          Length = 597

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 84  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 138

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 139 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 196

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 197 RYLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 255

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 256 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 314

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 315 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 372

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 373 HFTEPPISAQLLAFLRV 389


>gi|449280698|gb|EMC87934.1| SET domain-containing protein 3 [Columba livia]
          Length = 593

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G   +   I   +    GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRRLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYNTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387


>gi|281182452|ref|NP_001162549.1| histone-lysine N-methyltransferase setd3 [Papio anubis]
 gi|332321745|sp|A9X1D0.1|SETD3_PAPAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|163781076|gb|ABY40825.1| SET domain containing 3, isoform 1 (predicted) [Papio anubis]
          Length = 595

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|355778846|gb|EHH63882.1| hypothetical protein EGM_16943 [Macaca fascicularis]
          Length = 595

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|386780935|ref|NP_001247800.1| SET domain containing 3 [Macaca mulatta]
 gi|355693560|gb|EHH28163.1| hypothetical protein EGK_18532 [Macaca mulatta]
 gi|380817110|gb|AFE80429.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|383422129|gb|AFH34278.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
 gi|384949778|gb|AFI38494.1| histone-lysine N-methyltransferase setd3 isoform a [Macaca mulatta]
          Length = 595

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|302754606|ref|XP_002960727.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
 gi|300171666|gb|EFJ38266.1| hypothetical protein SELMODRAFT_449995 [Selaginella moellendorffii]
          Length = 430

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 165/357 (46%), Gaps = 25/357 (7%)

Query: 90  PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
           P  + +++ ++G  GL A +++ K ++++ +P +L +  D+     E GE      +  W
Sbjct: 9   PGGVEVRRGELG-LGLFAKRSVSKNQEVVSIPKTLWMDVDTV-RRSEIGECC--AGLRPW 64

Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
             +A YL+ E + +  S WS YI  LPR   S L+W+  EL   L+ +Q+          
Sbjct: 65  IAVALYLLHEKA-KPHSDWSAYIRVLPRTLDSPLFWSEEELAE-LKGTQLLSSINGFKEF 122

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
           +   Y+ +   +    PD+F   ++ +E F W+FGIL SR    P +   +ALVP AD +
Sbjct: 123 LKREYDKVMTEVIEPRPDVFDRSLYTLEAFTWAFGILRSRTFP-PLIGDNLALVPLADFV 181

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKK-SNGELLLSYGF 320
           NH   +     Y       VF                 ++V + YGKK  N +L   YGF
Sbjct: 182 NHGFGLTNEDPYWHVKSAGVFARQETLTLQAAANCAEKQEVLMQYGKKKGNAQLATDYGF 241

Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWPLELMAYAY 379
           V  +  N  DS  L L +  S++   +K++  +  GL ++  F + +  G P +++AY  
Sbjct: 242 VDSDEKNNRDSFTLTLQVSLSERFADDKVDIAQMAGLDSTAYFNLYRNQGPPEDMIAYLR 301

Query: 380 LVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESSISKYS 433
           L+    S     +     A  + T    ++ P   E +E   + +++ C +++ +YS
Sbjct: 302 LIALFGS-----DSFLLEALFRNTVWDHLRLPISRENEEAICEAMIEGCRATLREYS 353


>gi|80479475|gb|AAI08868.1| Unknown (protein for MGC:132347) [Xenopus laevis]
          Length = 456

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 157/326 (48%), Gaps = 29/326 (8%)

Query: 79  LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           LQ+WL   G   + + A +  D G RGL+A ++++ GE ++ +P + +IT ++       
Sbjct: 36  LQRWLKGRGFQGRHLRAAEFADTG-RGLMATRDLKPGELIIALPETCLITTETVLQ-SYL 93

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD-RYL 194
           G+ ++       PLLA  T+LI+E    + S+W  Y+  +P      +YW   EL+  +L
Sbjct: 94  GKYIRLWRPHVSPLLALCTFLIAERFAGERSQWKPYLDVIPSTYSCPVYW---ELEIVHL 150

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLV 251
             + +R++A+E+ T V   + +  L  F+    LF + V ++ T+   +W++  + +R V
Sbjct: 151 LPAPLRQKALEQKTEVQELHTE-SLAFFNSLQPLFCDNVADIYTYDALRWAWCTVNTRTV 209

Query: 252 --------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
                   RL +     AL P+ D+LNHS EV+   ++ K  +     T+   +  +Q F
Sbjct: 210 YMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAF 269

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
           I YG   N  LLL YGFV     NP  SV      +   L   DK   +K   L+++   
Sbjct: 270 ICYGPHDNQRLLLEYGFVA--ANNPHRSVYVTKDAILAHLSPGDKQMPKKWALLKEHDFL 327

Query: 359 ASECFPIQITGWPLELMAYAYLVVSP 384
            +  F ++   W L L A   L + P
Sbjct: 328 VNLTFGLEGPSWKL-LTAVKLLCLRP 352


>gi|255562948|ref|XP_002522479.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223538364|gb|EEF39971.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 502

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 177/391 (45%), Gaps = 40/391 (10%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           +WLSD G+   K   +   V E  GL+A ++I + E +L +P  L I  D+      A +
Sbjct: 57  QWLSDQGVVSGKSPAKPGVVKEGLGLIAERDIARNEVVLEIPKKLWINPDAV----AASD 112

Query: 140 VLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
           +   CS +  W  +A +LI E   ++ S W  Y+  LP    S +YW    +  Y+    
Sbjct: 113 IGNVCSGLKPWISVALFLIREKLKKEGSTWWPYLDILPDTTNSTIYWWVLLVAFYVLVLS 172

Query: 199 IRERAIERITNVIGT----------------YNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
            + R+ E +  + GT                +  +   I   + +LFP  +  ++ F W+
Sbjct: 173 FQRRSEEELAELQGTQLLRTTLGVKEYMQREFAKVEEEILLPHKELFPSPI-TLDDFLWA 231

Query: 243 FGILFSR-LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY----- 296
           FGIL SR   RL   +  + L+P AD++NHS ++ T     +   G +F+ +  +     
Sbjct: 232 FGILRSRAFSRLRGQN--LVLIPLADLINHSPDITTEDYAYEIKGGGLFSRELLFSLRSP 289

Query: 297 ---QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
              + GEQV I Y   KSN EL L YGF+  E T   ++  L L + +SD  + +KL+  
Sbjct: 290 ISVKSGEQVLIQYDLNKSNAELALDYGFI--EKTPDRNTYTLTLQISESDPFFGDKLDIA 347

Query: 353 RKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
              G   +  F I + G PL      YL +        F  + +   N +    ++    
Sbjct: 348 ETNGSGETADFDI-VLGNPLPPAMLPYLRLVALGGTDAF-LLESIFRNTIWGHLELPISR 405

Query: 413 IDEQAL-QFILDSCESSISKYSRFLQVKELL 442
            +E+ + + + D+C+S++S Y   ++  E L
Sbjct: 406 ANEELICRVVRDACKSALSGYHTTIEEDEKL 436


>gi|332252553|ref|XP_003275417.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Nomascus leucogenys]
 gi|332252555|ref|XP_003275418.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Nomascus leucogenys]
          Length = 595

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|281338628|gb|EFB14212.1| hypothetical protein PANDA_005835 [Ailuropoda melanoleuca]
          Length = 585

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R L+ +Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTDPPVSAQLLAFLRV 387


>gi|40068481|ref|NP_115609.2| histone-lysine N-methyltransferase setd3 isoform a [Homo sapiens]
 gi|74750394|sp|Q86TU7.1|SETD3_HUMAN RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|28071092|emb|CAD61927.1| unnamed protein product [Homo sapiens]
 gi|119602070|gb|EAW81664.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602072|gb|EAW81666.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|119602073|gb|EAW81667.1| SET domain containing 3, isoform CRA_a [Homo sapiens]
 gi|194380984|dbj|BAG64060.1| unnamed protein product [Homo sapiens]
 gi|307686103|dbj|BAJ20982.1| SET domain containing 3 [synthetic construct]
          Length = 594

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|148744485|gb|AAI42996.1| SET domain containing 3 [Homo sapiens]
          Length = 594

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|332321478|sp|B1MTJ4.2|SETD3_CALMO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 595

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|297695854|ref|XP_002825140.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Pongo abelii]
 gi|395746278|ref|XP_003778419.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Pongo abelii]
          Length = 595

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|225452167|ref|XP_002264334.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1
           [Vitis vinifera]
          Length = 509

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 143/312 (45%), Gaps = 29/312 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKW 132
           L+ W+ ++GLPP K+ +++             + A ++++ G+    VP SLV+T +   
Sbjct: 78  LKSWMHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVL 137

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYW 185
                 E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W
Sbjct: 138 GNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLW 197

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETF 239
           + +EL  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E F
Sbjct: 198 SESEL-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIF 256

Query: 240 KWSFGILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQ 295
           K +F  + S +V L   S+  R ALVP    +L +    +  L   D S Q VV   DR 
Sbjct: 257 KQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVV---DRP 313

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           Y+ GE + +  G + N +LLL+YGFV  +  N  D + +  +L   D  Y++K    ++ 
Sbjct: 314 YKAGESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDKRMVAQRN 371

Query: 356 GLSASECFPIQI 367
           G    + F + +
Sbjct: 372 GKLTVQKFHVSV 383


>gi|426377975|ref|XP_004055723.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Gorilla
           gorilla gorilla]
          Length = 594

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|114654683|ref|XP_522946.2| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           troglodytes]
 gi|332843114|ref|XP_003314566.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Pan
           troglodytes]
 gi|397525919|ref|XP_003832895.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 1 [Pan
           paniscus]
 gi|397525921|ref|XP_003832896.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2 [Pan
           paniscus]
 gi|410227562|gb|JAA11000.1| SET domain containing 3 [Pan troglodytes]
 gi|410255618|gb|JAA15776.1| SET domain containing 3 [Pan troglodytes]
 gi|410289938|gb|JAA23569.1| SET domain containing 3 [Pan troglodytes]
 gi|410342147|gb|JAA40020.1| SET domain containing 3 [Pan troglodytes]
          Length = 594

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|169409575|gb|ACA57918.1| SET domain containing 3 isoform a (predicted) [Callicebus moloch]
          Length = 597

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 84  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 138

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 139 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 196

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 197 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 255

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 256 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 314

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 315 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 372

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 373 HFTEPPISAQLLAFLRV 389


>gi|332321742|sp|E2RBS6.1|SETD3_CANFA RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
          Length = 588

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HYTDPPVSAQLLAFLRV 387


>gi|301764186|ref|XP_002917505.1| PREDICTED: SET domain-containing protein 3-like [Ailuropoda
           melanoleuca]
          Length = 591

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R L+ +Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RDLQCTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTDPPVSAQLLAFLRV 387


>gi|395827792|ref|XP_003787079.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Otolemur
           garnettii]
          Length = 595

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI +LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQSLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIYDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|73964462|ref|XP_547974.2| PREDICTED: SET domain containing 3 [Canis lupus familiaris]
          Length = 589

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HYTDPPVSAQLLAFLRV 387


>gi|344273731|ref|XP_003408672.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Loxodonta
           africana]
          Length = 597

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEVVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-NPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R+L+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|328700922|ref|XP_003241429.1| PREDICTED: SET domain-containing protein 3-like [Acyrthosiphon
           pisum]
          Length = 463

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 153/313 (48%), Gaps = 19/313 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-- 136
           L KW + +G     + I + +    G+ A KNI  G+KL+ VP +L++T ++  S P   
Sbjct: 89  LTKWATKNGAILNGVEIHQFENYAYGMKANKNITVGDKLVTVPRALMMTEENIPSSPLWK 148

Query: 137 -AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
              + +   ++P+  L    L+     +K S W +Y++ LP    + +Y+  A+L+  L+
Sbjct: 149 LHSQDMMLRNMPNVALAIFILVESLRKDKKSFWHSYLTTLPVTYSTPVYFDVADLEA-LK 207

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVR 252
            S   E A++   N+   Y   + ++F    D   +  ++ F  E ++W+   L SR   
Sbjct: 208 GSPAFEAALKLNRNIARQYAYFK-KLFQLSNDPASVILKDTFTYEYYRWAVSTLMSRQNT 266

Query: 253 LPSMDG----RVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
           +PS D       AL+P  DM NH S  + T  D+ KSS   V   D  Y   EQV+I YG
Sbjct: 267 VPSSDNPSENVSALIPLWDMFNHRSGRLST--DFVKSSNVCVCYADGDYAADEQVYIFYG 324

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            ++N + L+  GFV  +  N  D+V++ L + +SD  Y  +   L+   L A   F +  
Sbjct: 325 VRTNADFLVHNGFVYPD--NEHDAVKIRLGVSRSDPLYSLRYRLLQTLSLPALAEFYLTP 382

Query: 368 TGWPLE--LMAYA 378
             +P++  L+A+ 
Sbjct: 383 GPFPVDGKLLAFV 395


>gi|332321743|sp|C1FXW2.1|SETD3_DASNO RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|226526916|gb|ACO71275.1| SET domain containing 3 isoform a (predicted) [Dasypus
           novemcinctus]
          Length = 589

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 85/313 (27%), Positives = 146/313 (46%), Gaps = 16/313 (5%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +    G
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNSM-LG 140

Query: 139 EVLKQCSVPDWP---LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            +  Q  +        LA +L+ E +   +S W  YI +LP +  + LY+   E+ RYL 
Sbjct: 141 PLYSQDRILQAMGNITLAFHLLCERA-NPNSFWQPYIQSLPGEYDTPLYFEEDEV-RYLH 198

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVR 252
           ++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +R  +
Sbjct: 199 STQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQ 257

Query: 253 LPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           +P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I YG +
Sbjct: 258 IPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIFYGTR 316

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  T 
Sbjct: 317 SNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTE 374

Query: 370 WPLELMAYAYLVV 382
            P+     A+L V
Sbjct: 375 PPISAQLLAFLRV 387


>gi|10439587|dbj|BAB15525.1| unnamed protein product [Homo sapiens]
          Length = 512

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 144/315 (45%), Gaps = 24/315 (7%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S         V
Sbjct: 2   KWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KNSV 56

Query: 141 LKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
           L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ RY
Sbjct: 57  LGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV-RY 114

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL 250
           L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +R 
Sbjct: 115 LQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQ 173

Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I YG
Sbjct: 174 NQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIFYG 232

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  
Sbjct: 233 TRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHF 290

Query: 368 TGWPLELMAYAYLVV 382
           T  P+     A+L V
Sbjct: 291 TEPPISAQLLAFLRV 305


>gi|355718753|gb|AES06373.1| SET domain containing 3 [Mustela putorius furo]
          Length = 585

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 146/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + +
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDAFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HSTEPPVSAQLLAFLRV 387


>gi|115657973|ref|XP_798530.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Strongylocentrotus purpuratus]
          Length = 682

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 88/317 (27%), Positives = 160/317 (50%), Gaps = 22/317 (6%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           T  KWL+ +G+    + + K D G  GL A ++I+  ++L+ +P  +++T  +    P  
Sbjct: 82  TFFKWLNTNGVTTDAVKMAKFDEG-YGLQATQDIKMDQELMNIPRKVMMTDQNAVDSPTI 140

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDR 192
           G++++       +P+  L A +++SE   +  S W  Y+  LP   YSL LY+T  E+ +
Sbjct: 141 GDLVRGDRLLKGMPNVSL-AIFILSE-KLKSDSFWKPYLDVLP-SSYSLPLYFTPDEI-Q 196

Query: 193 YLEASQIRERAIERITNVIGTYNDL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
             + S +    +++  N+   Y  L +L    +   L   E F  + ++W+   + +R  
Sbjct: 197 LFQGSTMYGECLKQHKNIARQYAYLFKLLNLPENSKLHIREYFTYDFYRWAVSTVMTRQN 256

Query: 252 RLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           ++P+ DG+   ++L+P  DM NH+   E   D+ +     V    R +  GEQ+FI YG+
Sbjct: 257 QIPAKDGKGMSLSLIPLWDMCNHA-NGEMKTDFIEERDSCVNMALRDFSVGEQIFICYGR 315

Query: 309 KSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCY--KEKLEALRKYGLSASECFPI 365
           +S+ +LLL  GFV P    N  D + + L L  SD+ Y  K +L ++ K G+  S+ + I
Sbjct: 316 RSSADLLLYSGFVYP---GNVYDGMAIQLGLSSSDRLYAMKAQLCSVMKLGV-PSQNYHI 371

Query: 366 QITGWPLELMAYAYLVV 382
                P+ L    +L +
Sbjct: 372 SAGKEPVTLELLTFLRI 388


>gi|66813084|ref|XP_640721.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
 gi|60468751|gb|EAL66753.1| hypothetical protein DDB_G0281543 [Dictyostelium discoideum AX4]
          Length = 1339

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 118/250 (47%), Gaps = 13/250 (5%)

Query: 79   LQKWLSDSGLPPQKMAIQK-VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
             + WL   G+   K+ I    D   RG+V  K + + E ++ VP   +I  D   + P  
Sbjct: 760  FENWLKAGGVQFPKLQIANFTDSTGRGVVTTKKVDENEAVVVVPKKYLINVDVAKAHPIL 819

Query: 138  GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
            G + ++  + D  +L  ++I E     +S W  +   LP    + ++++  EL   LE +
Sbjct: 820  GPIFEELHLNDDTILFLFVIYEKG-NANSFWRPFYDTLPSYFTTSIHYSATELLE-LEGT 877

Query: 198  QIRERAIERITNVIGTYNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
             + E  +      + ++ D      SK YPD+FPE  F+ E F W+  +L SR ++L  +
Sbjct: 878  NLFEETL-HTKQQLNSFRDYLFPELSKQYPDIFPESQFSWENFLWARSLLDSRAIQL-KI 935

Query: 257  DGRVA--LVPWADMLNHSCEV---ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
            DG +   LVP ADM+NH       E F D+D  SQ     +        Q+F+ YG   N
Sbjct: 936  DGSIKSCLVPMADMINHHTNAQISERFFDHD--SQSFKMISSCNIPANNQIFLHYGALQN 993

Query: 312  GELLLSYGFV 321
             EL L YGF+
Sbjct: 994  WELALYYGFI 1003


>gi|225561342|gb|EEH09622.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 487

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 164/357 (45%), Gaps = 25/357 (7%)

Query: 82  WLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           WL +SG +    + +    V  RG+  L+  ++GE++  +P  ++ T +  ++    G  
Sbjct: 22  WLKESGAVGLNALELANFQVIGRGVRTLRCFKEGERIFTIPADVLWTVEHAYADSLLGPA 81

Query: 141 LKQC----SVPDWPLLATYLISEASFEKSSRW-SNYISALPRQPYSLLYWTRAELDRYLE 195
           L+      SV D   LA Y++   S E       ++++ LP+   S +++T  EL+  + 
Sbjct: 82  LRSARPPLSVDD--TLAMYILFVRSRESGYDGPRSHLATLPKSYSSSIFFTDDELE--VC 137

Query: 196 ASQIRERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR-- 252
           A        +R+   I   Y  L +R+  ++ DLFP + F +E +KW+   ++SR +   
Sbjct: 138 AGSSLYALTKRLGRCIEDDYRALVVRLLVQHQDLFPLDKFTIEDYKWALCTVWSRAMDFV 197

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE-----QVFISYG 307
           LP       + P+ADMLNHS EV     YD  S  +     + Y+ G+     QVFI YG
Sbjct: 198 LPGGKSIRLMAPFADMLNHSSEVRQCHAYDPLSGNLTILAGKDYEAGDQGVFFQVFIYYG 257

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
              N  LL  YGFV     NP+DS +L L        +++K +     G  ++    I +
Sbjct: 258 SIPNNRLLRLYGFV--MPGNPNDSYDLVLETHPMAPFFEQKRKLWDLAGFDSTSTISITL 315

Query: 368 TGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
           T  PL      YL +     +    ++A+ A  ++  K +      + + LQ +++S
Sbjct: 316 TD-PLPKNVLGYLRIQ----RSDESDLASIARQRIDPKYEKISDSNEVEVLQSLIES 367


>gi|148226164|ref|NP_001079674.1| SET domain containing 4 [Xenopus laevis]
 gi|28422727|gb|AAH46855.1| MGC53706 protein [Xenopus laevis]
          Length = 456

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 89/312 (28%), Positives = 150/312 (48%), Gaps = 28/312 (8%)

Query: 79  LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           LQ+WL   G   + + A +  D G RGL+A ++++ GE ++ +P + +IT ++       
Sbjct: 36  LQRWLKGRGFQGRHLRAAEFADTG-RGLMATRDLKPGELIIALPETCLITTETVLQ-SYL 93

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-YL 194
           G+ ++       PLLA  T+LI+E      S+W  Y+  +P      +YW   EL+  +L
Sbjct: 94  GKYIRLWRPHVSPLLALCTFLIAERFAGDCSQWKPYLDVIPSTYSCPVYW---ELEIIHL 150

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLV 251
             + +R++A+E+ T V   + +  L  FS    LF + V ++ T+   +W++  + +R V
Sbjct: 151 LPAPLRKKALEQKTEVQELHTE-SLAFFSSLQPLFCDNVADIYTYDALRWAWCTVNTRTV 209

Query: 252 --------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
                   RL +     AL P+ D+LNHS EV+   ++ K  +     T+   +  +Q F
Sbjct: 210 YMKHTQQDRLLAQQDVCALAPYLDLLNHSPEVQVEAEFSKDRRCYEIRTNSGCRKHDQAF 269

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
           I YG   N  LLL YGFV     NP  SV      +   L   DK   +K   L+++   
Sbjct: 270 ICYGPHDNQRLLLEYGFVA--ANNPHRSVYVTKDAILAHLSPGDKQMPKKWALLKEHDFL 327

Query: 359 ASECFPIQITGW 370
            +  F I+   W
Sbjct: 328 VNLTFGIEGPSW 339


>gi|134254196|gb|AAI35195.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 507

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +W  ++G       + +      GL A + I+  E  L+VP  L++T +S       G
Sbjct: 7   LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 61

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 62  SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 119

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 120 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 178

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 179 RQNQIPTEDGSRVTLALIPLWDMCNHTNSLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 237

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 238 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 295

Query: 366 QITGWPLELMAYAYLVV 382
            +T  P+     A+L V
Sbjct: 296 HVTEPPISAQLLAFLRV 312


>gi|387016380|gb|AFJ50309.1| Histone-lysine N-methyltransferase setd3 [Crotalus adamanteus]
          Length = 592

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 84/319 (26%), Positives = 146/319 (45%), Gaps = 24/319 (7%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           S L KW  ++G       +   +    GL A ++I+  E  L+VP  L++T +S      
Sbjct: 80  SELIKWAGENGAFTDGFEVANFEEEGFGLKATRDIKAEELFLWVPRKLLMTVESA----- 134

Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
              +L      D  L       LA +L+ E  +  +S W  YI  LP +  + LY+   E
Sbjct: 135 KNSILGSLYSQDRILQAMGNITLAFHLLCE-RYNPNSFWLPYIQTLPNEYNTALYFEEDE 193

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
           + +YL+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   +
Sbjct: 194 V-QYLQSTQAIHDIFSQYKNTARQYAYF-YKVVQTHPNASKLPLKDSFTYDDYRWAVSSV 251

Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++
Sbjct: 252 MARQNQIPAEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLKDDRCECVALQDFKAGEQIY 310

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F
Sbjct: 311 IFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVF 368

Query: 364 PIQITGWPLELMAYAYLVV 382
            +  T  P+     A+L V
Sbjct: 369 ALHSTEPPISAQLLAFLRV 387


>gi|354483159|ref|XP_003503762.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Cricetulus
           griseus]
 gi|344254671|gb|EGW10775.1| SET domain-containing protein 3 [Cricetulus griseus]
          Length = 577

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|392349055|ref|XP_003750278.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Rattus
           norvegicus]
          Length = 416

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 21  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 75

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            +L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 76  SILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 133

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 134 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 192

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q G+Q++I 
Sbjct: 193 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 251

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 252 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 309

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 310 HFTEPPISAQLLAFLRV 326


>gi|258563540|ref|XP_002582515.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908022|gb|EEP82423.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 445

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/354 (27%), Positives = 164/354 (46%), Gaps = 38/354 (10%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL +SG +    + + +  V  RG+  L+   +GE++L +P  ++ T +  ++ P  
Sbjct: 1   MEGWLKESGAVGLDALELAEFPVIGRGVRTLRRFNEGERILTIPRDVLWTVEHAYADPLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G VL+      SV D   LATY++   S E       ++++A+P+   S +++T  EL+ 
Sbjct: 61  GPVLRSARPPLSVDD--TLATYILFVRSRESGYDGLRSHLAAVPKSYSSSIFFTEDELEV 118

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
               S      +  IT  +G       R          E+ +    +KW+   ++SR + 
Sbjct: 119 CAGTS------LYAITKQLG-------RCI--------EDDYRALVYKWALCTVWSRAMD 157

Query: 253 LPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
               DG+    L P+ADMLNHS EV     YD  S  +     + Y+ G+QVFI YG   
Sbjct: 158 FALPDGKSVRLLAPFADMLNHSSEVRQCHAYDPLSGNLSILAGKGYEAGDQVFIHYGSVP 217

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N  LL  YGFV    +NP+DS +L L        +++K +     GL ++    + +T  
Sbjct: 218 NNRLLRLYGFVIP--SNPNDSYDLVLETHPLAPFFEQKRKLWALAGLDSTSTISLTLTD- 274

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
           PL      YL +     +    ++AA A  +   K +      + +ALQF+++S
Sbjct: 275 PLPNNVLRYLRI----QRSDESDLAAVALQQADPKYEKISNSSEVEALQFLIES 324


>gi|160774366|gb|AAI55279.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score =  101 bits (251), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 81/292 (27%), Positives = 140/292 (47%), Gaps = 24/292 (8%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-------LATYL 156
           GL A K+I+  E  L++P  +++T +S         VL      D  L       LA +L
Sbjct: 107 GLKATKDIKAEELFLWIPRKMLMTVESA-----KNSVLGPLYSQDRILQAMGNVTLALHL 161

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           + E +   SS W  YI  LP +  + LY+   E+ R+L A+Q  +  + +  N    Y  
Sbjct: 162 LCERA-NPSSPWLPYIKTLPSEYDTPLYFEEEEV-RHLLATQAIQDVLSQYKNTARQYAY 219

Query: 217 LRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLN 270
              ++   +P+   L  ++ F  + ++W+   + +R  ++P+ DG    +AL+P  DM N
Sbjct: 220 F-YKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
           H+  + T   Y+           + Y+ GEQ++I YG +SN E ++  GF   +  N  D
Sbjct: 279 HTNGLIT-TGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFED--NAHD 335

Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            V++ L + KS++ Y  K E L + G+ AS  F +  +  P+     A+L V
Sbjct: 336 RVKIKLGVSKSERLYAMKAEVLARAGIPASSIFALHCSEPPISAQLLAFLRV 387


>gi|255568191|ref|XP_002525071.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
 gi|223535652|gb|EEF37318.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase, chloroplast precursor, putative
           [Ricinus communis]
          Length = 456

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 170/359 (47%), Gaps = 34/359 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A K+I+ G+ +L VP S  I +D+    PE  ++L    V     LA  L+ +   
Sbjct: 47  RSLFASKSIQTGDCILRVPYSAQIASDNL--LPELSDLLGD-EVGSVAKLAIVLLVDQKV 103

Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            + S+W+ YIS LP+  + +S ++W+++ELD   ++S  +E  I++   +   +  ++  
Sbjct: 104 GQESKWAPYISRLPQLGEMHSTIFWSKSELDMIFQSSVYKE-TIKQKAQIEKDFLTIK-P 161

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +   +P +     F  + F  ++ ++ SR     S  G V+L+P+AD LNH    E  + 
Sbjct: 162 VLEHFPQISRSITF--QDFMHAYALVKSR--AWGSTKG-VSLIPFADFLNHDGFSEAVVL 216

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLK 339
            D+  Q      DR Y P E+V I YGK SN  LLL +GF +P    N  + VE+ +++ 
Sbjct: 217 NDEDKQVSEVAADRNYAPHEEVLIRYGKFSNATLLLDFGFSLP---YNIHEQVEIQINIP 273

Query: 340 KSDKCYKEKLEALRKYGLSAS----------ECFPIQIT--------GWPLELMAYAYLV 381
             D   + K E LR + + A+          + F I+          G P  L A+A ++
Sbjct: 274 DHDTLREMKFEILRLHHIPATKDDNGFNSSWDSFLIKEVRSAGGKGKGLPQSLRAFARVL 333

Query: 382 VSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQVKE 440
                       + AA ++   +++ ++    + QA + +L      I +YS  +++ E
Sbjct: 334 CCTSHQDLNDLVLEAAQTDGRLARRALENSSREIQAHEILLSRINQVIEEYSASIKLLE 392


>gi|42565948|ref|NP_191068.2| SET domain-containing protein [Arabidopsis thaliana]
 gi|56236044|gb|AAV84478.1| At3g55080 [Arabidopsis thaliana]
 gi|59958342|gb|AAX12881.1| At3g55080 [Arabidopsis thaliana]
 gi|332645816|gb|AEE79337.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 463

 Score =  100 bits (250), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/310 (28%), Positives = 140/310 (45%), Gaps = 40/310 (12%)

Query: 54  RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
           +T+ ++  N +PW   I   +  +TL    S  G               R L A K I  
Sbjct: 37  QTQASLDNNFLPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIYA 81

Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
           G+ +L VP +  IT D     P    VL    V +  +LA  LI E    + SRW  YIS
Sbjct: 82  GDCMLKVPFNAQITPDE---LPSDIRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYIS 138

Query: 174 ALPR--QPYSLLYWTRAELDRY----LEASQIRERA-IERITNVIGTYNDLRLRIFSKYP 226
            LP+  + +S ++W   EL       +    ++++A IE+  + +         I ++ P
Sbjct: 139 RLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQAFKQHCPIVTERP 198

Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ 286
           DL        E F +++ ++ SR         R++L+P+AD +NH     + +  D+ +Q
Sbjct: 199 DL--------EDFMYAYALVGSRAW---ENSKRISLIPFADFMNHDGLSASIVLRDEDNQ 247

Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCY 345
               T DR Y PG++VFI YG+ SN  L+L +GF  P    N  D V++ + +   D   
Sbjct: 248 LSEVTADRNYSPGDEVFIKYGEFSNATLMLDFGFTFP---YNIHDEVQIQMDVPNDDPLR 304

Query: 346 KEKLEALRKY 355
             KL  L+ +
Sbjct: 305 NMKLGLLQTH 314


>gi|62857953|ref|NP_001016577.1| histone-lysine N-methyltransferase setd3 [Xenopus (Silurana)
           tropicalis]
 gi|89272100|emb|CAJ81720.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 581

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +W  ++G       + +      GL A + I+  E  L+VP  L++T +S       G
Sbjct: 81  LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 135

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 136 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 193

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 194 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 252

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 253 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 311

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 312 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 369

Query: 366 QITGWPLELMAYAYLVV 382
            +T  P+     A+L V
Sbjct: 370 HVTEPPISAQLLAFLRV 386


>gi|340720054|ref|XP_003398458.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           terrestris]
          Length = 484

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 170/363 (46%), Gaps = 24/363 (6%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           WL  +G      ++ +    + GL A +N  + E +L +P  L+ +  +  + PE   + 
Sbjct: 88  WLKQNGANVYGASVAEFPGYDLGLKAERNFLENELILRIPRELIFSIHN--AAPELVALQ 145

Query: 142 KQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
               +   P   LA  L+ E   E  S+W  Y+  LP    ++LY T A+++  L+ S  
Sbjct: 146 NDPLLQLMPQVALAIALLIEKHKE-YSKWKPYLDILPTTYTTVLYMTAADMNE-LKGSPT 203

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLPSM 256
            E A+++  N+   Y     ++F K  +       +VF  E + W+   + +R   +PS 
Sbjct: 204 LEAALKQCRNIARQYAYFN-KLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIPSK 262

Query: 257 DGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           DG +   AL+P  DM NH  + +   D++ +         R ++  EQ+FISYG ++N +
Sbjct: 263 DGSLMIHALIPMWDMCNHE-DSKITTDFNATLNCCECYALRDFKKAEQIFISYGPRTNSD 321

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
             +  GFV  +  N  D  +L L + K+D  +KE++E L K  L A   F ++    P+ 
Sbjct: 322 FFVHSGFVYMD--NEQDGFKLRLGISKADPLHKERVELLNKLDLPAVGEFLLKPGTEPIS 379

Query: 374 LMAYAYLVVSPPSMKGKFEEMAA-AASNKMTSKKDIKCP---EIDEQALQFILDSCESSI 429
               A+L V   SM+   EE+A    S+++   K + C     ++E   +F+L   +  I
Sbjct: 380 DTLLAFLRVF--SMRK--EELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQLLI 435

Query: 430 SKY 432
           + Y
Sbjct: 436 ANY 438


>gi|320169513|gb|EFW46412.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 495

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 165/362 (45%), Gaps = 43/362 (11%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
           V  RG+ AL+++  GE +L VP SL++  +   + P  G +L    + D   +A +LI E
Sbjct: 74  VAGRGVFALRDLAAGETVLRVPLSLLLNVEHASASPLGG-ILDDFRLSDAEAMAFWLIYE 132

Query: 160 ASF-EKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            +  E++S W  Y+ +LP     L +++   E+ R L+AS + E    R   +   +   
Sbjct: 133 LTRPERASPWLPYLESLPASIKQLTMFYDPFEMKR-LQASPVAEFTSRRTVKMRNKFGKY 191

Query: 218 RLRIFSKYPDL-----FPEEVFNMETFKWSFGILFSRL----VRLPSMDGR----VALVP 264
           R +I    P       FP E+  ++ F W+  + F+RL    V+ P+ DG       LVP
Sbjct: 192 REQISKHRPAHLAEIEFPVELITVDDFLWAMAVQFTRLITVQVKHPA-DGEWERTKCLVP 250

Query: 265 WADMLNHS------CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG---KKSNGELL 315
            AD+LN +       E  T LD    S      T R    G+++   YG   + SNG+L+
Sbjct: 251 LADLLNTAPADQINVECATNLD----STHFECATIRPVAEGQELLTPYGGAEQLSNGQLI 306

Query: 316 LSYGFVPREGTNPSDSVELPL-SLKKSDKCYKEKLEALRKYGLSASECFPIQI----TGW 370
           + YG   R   NPSD V LP+  L+++   Y  K+  L    L   +   + +       
Sbjct: 307 MDYGVTFR--NNPSDLVALPIPKLRETAVAYDSKMRLLMAMSLDRFDRLQLPVLDHFESI 364

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSIS 430
           P EL+A+A + VS PS     E +       M   + I  P  + +AL+ +L      I 
Sbjct: 365 PKELLAFARVYVSTPSDLSDLEHVL----ELMKEHRAIN-PSNERRALELLLQLTNEMIL 419

Query: 431 KY 432
           KY
Sbjct: 420 KY 421


>gi|116786810|gb|ABK24248.1| unknown [Picea sitchensis]
          Length = 507

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 147/320 (45%), Gaps = 27/320 (8%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE------RGLVALKNIRKGEKLLFVPPS 123
           I   E    L+ W+   GLPP ++ +++    +      + + A ++++ G+    +P S
Sbjct: 67  IRGKEEEVDLKSWMHRHGLPPCRVMLKERPSPDGKHKPIKYVAASEDLQPGDVAFSIPNS 126

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP---- 179
           L++T +         E+L    + +   LA YL+ E      S W  +I  L RQ     
Sbjct: 127 LIVTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGNQSFWRPFIRELDRQRGRGQ 186

Query: 180 ---YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFP 230
               S L W+  EL +Y   S ++E  +ER + +   Y +L         +F +YP   P
Sbjct: 187 LAVESPLLWSSEEL-KYFTGSPMKEIMLERNSGIKREYEELDTVWFMAGSLFKQYPYDIP 245

Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMD--GRVALVPWAD-MLNHSCEVETFLDYDKSSQG 287
            E F  E FK +F  + S +V L +++   R ALVP    +L++    +  L     S  
Sbjct: 246 TEAFPFEIFKQAFVAVQSCVVHLQNVNLARRFALVPLGPPLLSYKSNCKAMLKAVGDS-- 303

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
           V    DR+Y+ GE + +  G + N  LLL+YGFV  +  NP D + + +SL   D  Y++
Sbjct: 304 VQLEVDREYKAGEPIVVWCGPQPNARLLLNYGFVDED--NPHDRLIVEVSLDTKDPLYQD 361

Query: 348 KLEALRKYGLSASECFPIQI 367
           K    ++ G  + + F I I
Sbjct: 362 KRIIAQRNGKLSVQTFNIYI 381


>gi|268370088|ref|NP_082538.2| histone-lysine N-methyltransferase setd3 [Mus musculus]
 gi|81879567|sp|Q91WC0.1|SETD3_MOUSE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=Endothelial differentiation inhibitory protein D10;
           AltName: Full=SET domain-containing protein 3
 gi|16359331|gb|AAH16123.1| SET domain containing 3 [Mus musculus]
 gi|18044800|gb|AAH19973.1| Setd3 protein [Mus musculus]
 gi|26327255|dbj|BAC27371.1| unnamed protein product [Mus musculus]
 gi|74145116|dbj|BAE27425.1| unnamed protein product [Mus musculus]
 gi|74151505|dbj|BAE38861.1| unnamed protein product [Mus musculus]
 gi|148686776|gb|EDL18723.1| mCG18357, isoform CRA_a [Mus musculus]
          Length = 594

 Score =  100 bits (249), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  +E F  E ++W+   + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q G+Q++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387


>gi|332321747|sp|B7ZUF3.1|SETD3_XENTR RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|213624517|gb|AAI71209.1| LOC549331 protein [Xenopus (Silurana) tropicalis]
          Length = 582

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 147/317 (46%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +W  ++G       + +      GL A + I+  E  L+VP  L++T +S       G
Sbjct: 82  LMEWCKENGASTDGFELVEFPEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KG 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWLPYIKTLPNEYDTPLYFNEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 QYLQSTQAILDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTFDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKSGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHNGFFFE--NNLHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
            +T  P+     A+L V
Sbjct: 371 HVTEPPISAQLLAFLRV 387


>gi|148686779|gb|EDL18726.1| mCG18357, isoform CRA_d [Mus musculus]
          Length = 597

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 85  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 139

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 140 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 197

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  +E F  E ++W+   + +
Sbjct: 198 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 256

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q G+Q++I 
Sbjct: 257 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 315

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 316 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 373

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 374 HSTEPPISAQLLAFLRV 390


>gi|410928182|ref|XP_003977480.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Takifugu
           rubripes]
          Length = 598

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 146/313 (46%), Gaps = 16/313 (5%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L  W  D G   +  A+        GL A ++I+  E  L++P  +++T +S       G
Sbjct: 82  LMSWARDHGASCEGFAVTNFGAEGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSV-LG 140

Query: 139 EVLKQCSV---PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            +  Q  +    D   LA +L+ E +   +S W  YI  LP++  + L++ + E+ + L+
Sbjct: 141 PLYNQDRILQAMDNVTLALHLLCERA-NPASFWLPYIRTLPQEYDTPLFYEQDEV-QLLQ 198

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVR 252
            +Q  +  + +  N    Y     ++   +P    L  ++ F  + ++W+   + +R  +
Sbjct: 199 GTQAVQDVLSQYRNTARQYAYF-YKLIQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQ 257

Query: 253 LPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           +P+ DGR   +AL+P  DM NH   + T   Y+           + Y+  EQ++I YG +
Sbjct: 258 IPTEDGRQVTLALIPLWDMCNHRNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYGTR 316

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           SN E ++  GF  +E  N  D V++ L + KS++ Y  K E L + G+  S  F +    
Sbjct: 317 SNAEFVIHNGFFYQE--NAHDQVKIKLGISKSERLYAMKAEVLARAGIPVSSIFALYCNE 374

Query: 370 WPLELMAYAYLVV 382
            P+     A+L V
Sbjct: 375 QPISAQLLAFLRV 387


>gi|432098266|gb|ELK28072.1| Histone-lysine N-methyltransferase setd3 [Myotis davidii]
          Length = 585

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/318 (27%), Positives = 146/318 (45%), Gaps = 24/318 (7%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
            L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +    
Sbjct: 70  NLMKWASENGASVEGFEMFNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS--- 126

Query: 138 GEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
             VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+
Sbjct: 127 --VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV 183

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP--DLFP-EEVFNMETFKWSFGILF 247
            R L+++Q       +  N    Y     ++   +P  +  P ++ F  E ++W+   + 
Sbjct: 184 -RSLQSTQAVHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVM 241

Query: 248 SRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +R  ++P+ DG    +AL+P  DM NH+  + T   Y+           R ++ GEQ++I
Sbjct: 242 TRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALRDFRAGEQIYI 300

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
            YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F 
Sbjct: 301 FYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFA 358

Query: 365 IQITGWPLELMAYAYLVV 382
           +     P+     A+L V
Sbjct: 359 LHFMEPPISAQLLAFLRV 376


>gi|242007310|ref|XP_002424484.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212507902|gb|EEB11746.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 492

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 90/327 (27%), Positives = 151/327 (46%), Gaps = 31/327 (9%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           ++ S L  W+ ++G     + I+  +    GL A K++ + E +  +P ++++T D+   
Sbjct: 83  DHFSNLISWIKENGGVADNVTIKHFNEMGYGLEAAKDLEESELICAIPKNVMMTLDNVKV 142

Query: 134 CP-----EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
            P     E   +LK         LA +LI E    ++S W +YIS+LP    ++LY+   
Sbjct: 143 SPLKYLYENNPILKNMGNV---ALALFLILEHVKNENSFWHHYISSLPSDYNTVLYF--- 196

Query: 189 ELDRYLEA--SQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSF 243
           +L+ +LE   S   E A +   N+   Y      +F    D   L    VF  + ++W+ 
Sbjct: 197 DLNDFLEMKNSPTFEMATKHCKNIARQYAYFN-NLFQNSNDEASLILRNVFTYQLYRWAV 255

Query: 244 GILFSRLVRLPSM-------DGRVALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQ 295
             + +R   +PS        +G   L+P  DM NH+   + T    D+S         + 
Sbjct: 256 STVMTRQNFIPSSSTSNDVENGINGLIPLWDMCNHTNGYLSTQYKVDRSECLAC----KP 311

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           ++ GEQV I YG++SN + L+  GFV  E  NP DS  L L + KSDK +  + E L+  
Sbjct: 312 FKKGEQVLIFYGERSNSDFLVHNGFVYDE--NPHDSFRLRLGISKSDKLHGLRCELLKDL 369

Query: 356 GLSASECFPIQITGWPLELMAYAYLVV 382
           G+  S  F +     P+     A+L +
Sbjct: 370 GIPDSGDFYLYSGSEPVRENLLAFLRI 396


>gi|356534483|ref|XP_003535783.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Glycine
           max]
          Length = 463

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/269 (31%), Positives = 132/269 (49%), Gaps = 26/269 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A K I+ G+ +L VP  + ITAD+    PE   ++ +  V +   LAT ++ E   
Sbjct: 61  RSLFASKIIQTGDCILKVPYRVQITADNL--LPEIRSLIGE-EVGNIAKLATVILIEKKL 117

Query: 163 EKSSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            + S W  YIS LP+Q   ++ ++WT +EL+  +  S + +  I++ + +   +  ++  
Sbjct: 118 GQGSEWYPYISCLPQQGELHNTVFWTESELE-MIRPSSVYQETIDQKSQIEKDFLAIK-H 175

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLP-------SMDGRVALVPWADMLNHS 272
           IF      F +  +       +  +LF    V LP       S +G +AL+P+AD LNH 
Sbjct: 176 IFECSHQSFGDSTYKDFMHACTL-VLFDHFNVELPVGSRAWGSTNG-LALIPFADFLNHD 233

Query: 273 CEVETFL--DYDKSS---QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGT 326
              E  +  D DK     Q +    DR Y PGEQV I YGK SN  L+L +GF +P    
Sbjct: 234 GVSEAIVMSDDDKQCSEVQSLQIIADRDYAPGEQVLIRYGKFSNATLMLDFGFTIP---Y 290

Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           N  D V++   + K D     KLE L +Y
Sbjct: 291 NIYDQVQIQFDIPKHDPLRDMKLELLHQY 319


>gi|359488614|ref|XP_003633789.1| PREDICTED: histone-lysine N-methyltransferase setd3 isoform 2
           [Vitis vinifera]
          Length = 515

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/319 (29%), Positives = 140/319 (43%), Gaps = 37/319 (11%)

Query: 79  LQKWLSDSGLPP-------------QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
           L+ W+ ++GLPP             Q  AI  +   E  L     ++ G+    VP SLV
Sbjct: 78  LKSWMHENGLPPCKVVLKERPSHHEQHKAIHYIAASE-DLQGFLLLQAGDVAFSVPDSLV 136

Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ------- 178
           +T +         E+L    + +   LA YL+ E    K S W  YI  L RQ       
Sbjct: 137 VTLERVLGNETIAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLA 196

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEE 232
             S L W+ +EL  YL  S  +   +ER   +   YN+L         +F +YP   P E
Sbjct: 197 VESPLLWSESEL-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTE 255

Query: 233 VFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCEVETFLD-YDKSSQGV 288
            F  E FK +F  + S +V L   S+  R ALVP    +L +    +  L   D S Q V
Sbjct: 256 AFPFEIFKQAFVAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLV 315

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
           V   DR Y+ GE + +  G + N +LLL+YGFV  +  N  D + +  +L   D  Y++K
Sbjct: 316 V---DRPYKAGESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDK 370

Query: 349 LEALRKYGLSASECFPIQI 367
               ++ G    + F + +
Sbjct: 371 RMVAQRNGKLTVQKFHVSV 389


>gi|41056027|ref|NP_956348.1| histone-lysine N-methyltransferase setd3 [Danio rerio]
 gi|82187658|sp|Q7SXS7.1|SETD3_DANRE RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|32766447|gb|AAH55261.1| SET domain containing 3 [Danio rerio]
          Length = 596

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 80/292 (27%), Positives = 139/292 (47%), Gaps = 24/292 (8%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-------LATYL 156
           GL A K+I+  E  L++P  +++T +S         VL      D  L       LA +L
Sbjct: 107 GLKATKDIKAEELFLWIPRKMLMTVESA-----KNSVLGPLYSQDRILQAMGNVTLALHL 161

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           + E +   SS W  YI  LP +  + LY+   E+ R+L A+Q  +  + +  N    Y  
Sbjct: 162 LCERA-NPSSPWLPYIKTLPSEYDTPLYFEEEEV-RHLLATQAIQDVLSQYKNTARQYAY 219

Query: 217 LRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLN 270
              ++   +P+   L  ++ F  + ++W+   + +R  ++P+ DG    +AL+P  DM N
Sbjct: 220 F-YKVIHTHPNASKLPLKDAFTFDDYRWAVSSVMTRQNQIPTADGSRVTLALIPLWDMCN 278

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
           H+  + T   Y+           + Y+ GEQ++I YG +SN E ++  GF   +  N  D
Sbjct: 279 HTNGLIT-TGYNLEDDRCECVALKDYKEGEQIYIFYGTRSNAEFVIHNGFFFED--NAHD 335

Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            V++ L + K ++ Y  K E L + G+ AS  F +  +  P+     A+L V
Sbjct: 336 RVKIKLGVSKGERLYAMKAEVLARAGIPASSIFALHCSEPPISAQLLAFLRV 387


>gi|340780678|pdb|3SMT|A Chain A, Crystal Structure Of Human Set Domain-Containing Protein3
          Length = 497

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 88/319 (27%), Positives = 146/319 (45%), Gaps = 28/319 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           L KW S++G   +    + V+  E G  L A ++I+  E  L+VP  L+ T +S      
Sbjct: 81  LXKWASENGASVE--GFEXVNFKEEGFGLRATRDIKAEELFLWVPRKLLXTVESA----- 133

Query: 137 AGEVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
              VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E
Sbjct: 134 KNSVLGPLYSQDRILQAXGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDE 192

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGIL 246
           + RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   +
Sbjct: 193 V-RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSV 250

Query: 247 FSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +R  ++P+ DG    +AL+P  D  NH+  + T   Y+           + ++ GEQ++
Sbjct: 251 XTRQNQIPTEDGSRVTLALIPLWDXCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIY 309

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F
Sbjct: 310 IFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAXKAEVLARAGIPTSSVF 367

Query: 364 PIQITGWPLELMAYAYLVV 382
            +  T  P+     A+L V
Sbjct: 368 ALHFTEPPISAQLLAFLRV 386


>gi|392341246|ref|XP_002726820.2| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|392349051|ref|XP_216781.6| PREDICTED: histone-lysine N-methyltransferase setd3 [Rattus
           norvegicus]
 gi|149044195|gb|EDL97577.1| rCG27725, isoform CRA_a [Rattus norvegicus]
          Length = 596

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            +L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q G+Q++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|443730800|gb|ELU16158.1| hypothetical protein CAPTEDRAFT_140019 [Capitella teleta]
          Length = 255

 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 120/252 (47%), Gaps = 25/252 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISE 159
           RG++  + +  G+ ++ +P SL+IT  +    +  P   + L  C +     L  +L+ E
Sbjct: 6   RGVMVRRRLLTGDTIIAIPESLLITTSTVLRSYLGPVIHDFLP-CRLSPTETLVIFLLCE 64

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-----LEASQIRERAIERITNVIGTY 214
            +   SS W  Y+  LP     +L+WT  E+D         A  +R +A E    +   +
Sbjct: 65  RNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLRLKAEESFNRLCNGF 124

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL---------PSMDGRVALVPW 265
             L +R   ++        F  + FKW++  + +R V +         P  + + AL P+
Sbjct: 125 LPLLVRQMPQF-----NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEEDKSALAPF 179

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            D+LNH+ +VE    +D SS+    TT    +P +QVFI+YG  SN +LLL YGF     
Sbjct: 180 LDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLEYGFT--LP 237

Query: 326 TNPSDSVELPLS 337
            NP +++ L LS
Sbjct: 238 CNPHNNISLTLS 249


>gi|326921018|ref|XP_003206761.1| PREDICTED: LOW QUALITY PROTEIN: SET domain-containing protein
           3-like [Meleagris gallopavo]
          Length = 593

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 145/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G   +   I   +    GL A + I+  E  L+VP  L++T +S  S     
Sbjct: 82  LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESAKSS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPNEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL ++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
                P+     A+L V
Sbjct: 371 HSIEPPISAQLLAFLRV 387


>gi|348671353|gb|EGZ11174.1| hypothetical protein PHYSODRAFT_361758 [Phytophthora sojae]
          Length = 486

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 145/291 (49%), Gaps = 22/291 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +WL  +G   +K+A+Q+     RG+ + K +  GE++L +P   +IT +         
Sbjct: 45  LIQWLEGNGADTKKLALQEYAPEVRGVHSRKVLAPGERILVIPKKCLITVEMGKQTDIGR 104

Query: 139 EVLKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYL 194
           ++L +      P    L  +L+++    ++S + NY S LP    ++ ++W+  EL  +L
Sbjct: 105 KLLARNVDFVAPKHIFLMMFLLTDMERAETSFFRNYYSTLPSTLSNMPIFWSDEELG-WL 163

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           + S I ++  ER   +   Y D+  R+   +        F+++ F W+  I+ SR   L 
Sbjct: 164 KGSYIIQQIQERKAAIRKDY-DVICRVDPAFAR------FSLDRFSWARMIVCSRNFGL- 215

Query: 255 SMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
           ++DG    ALVP+ADMLNH    ET   +D+S      T+      G QV+ SYGKK N 
Sbjct: 216 TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNH 275

Query: 313 ELLLSYGF-----VPREGTNPSDSVELPLSLKKSD-KCYKEKLEALRKYGL 357
             LL+YGF        +G NP++ V +   L ++D + + +K   L + G+
Sbjct: 276 RFLLNYGFAVEDNTEEDGRNPNE-VLIDFQLSQADGQLFYDKRAYLHESGI 325


>gi|302753470|ref|XP_002960159.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
 gi|300171098|gb|EFJ37698.1| hypothetical protein SELMODRAFT_437298 [Selaginella moellendorffii]
          Length = 377

 Score = 99.4 bits (246), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 75/250 (30%), Positives = 121/250 (48%), Gaps = 14/250 (5%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL A + +R GE++L +   L+I        P+   ++   +V  W  LA  ++ E   
Sbjct: 68  RGLFASRPVRAGERVLEISLDLMIAPSD---LPDELSMVLPSTVKPWTKLALIVLMERYK 124

Query: 163 EKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
            +SS W+ YIS LP QP  L     W   EL  YL+AS +  +  ER+  +   +  ++ 
Sbjct: 125 GQSSVWAPYISCLP-QPAELDNTFLWEDTELS-YLKASPLYGKTRERLEMITTEFGQVQ- 181

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFL 279
              + +P LF +   ++E FK  +  +FSR + +   D  + ++P  D  NH+      L
Sbjct: 182 NALNVWPQLFGK--VSLEDFKHVYATVFSRSLAI-GEDSTLVMIPMLDFFNHNATSFAKL 238

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
            ++      V T DR Y   +Q++I+YG  SN EL L YGF   E  NP D  +L     
Sbjct: 239 SFNGLLNYAVVTADRAYTENDQIWINYGDLSNAELALDYGFTVPE--NPYDETDLLTQFP 296

Query: 340 KSDKCYKEKL 349
           + +   K++L
Sbjct: 297 EMNTILKDQL 306


>gi|168046556|ref|XP_001775739.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672891|gb|EDQ59422.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 524

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 117/447 (26%), Positives = 188/447 (42%), Gaps = 63/447 (14%)

Query: 26  AGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSD 85
           +GF  F  KR G R V       +N ++ TK +   N+     +++ L N      WL  
Sbjct: 40  SGFRAFTHKRFGCRWV------QSNGSTHTKES---NVSISNTKVERLRN------WLKK 84

Query: 86  SGLPPQKMAIQKVDVGERG-----LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
                  + +++   G  G           +  G  ++ VP   ++T ++   C + G +
Sbjct: 85  LNHDDCNLKLERCPQGGSGSGYGAFAGPGGVGNGSTIVKVPRKALMTEETARLCQDVGPL 144

Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEA 196
           +K+  +  W  +  +L+ E +  ++S W  YI+ LP++   +    + W++     +LE 
Sbjct: 145 VKKSDLTPWQAMCLHLLYERARGETSFWYPYIAVLPKELELIGIHPMLWSQKMRREWLEG 204

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-----EEVFNMET-FKWSFGILFSRL 250
           S + +    R+      Y  +   + +    L P     E +   ET  +W+  +L SR 
Sbjct: 205 SPMLDVTERRLAICREDYEAM---LLAGAGRLTPRGNEGEPISITETAVQWAATMLLSRS 261

Query: 251 VRL--------PS--MDGRVALVPWADMLNHSCEV--ETFLDYDKSSQGVVFTTDRQYQP 298
             L        P    +  +ALVPWADMLNHS     E+ L YD+ S        R Y  
Sbjct: 262 FSLNLQTQKLRPGSFAEDTIALVPWADMLNHSSSAGRESCLVYDQKSGVATLQAHRTYSE 321

Query: 299 GEQVFISYGKK-SNGELLLSYGFVPREGTNPSDSVELPLSLKK--SDKCYKEKLEALRKY 355
           GEQVF SYG   S   LLL YGFV  E TN   SV+LP S+    + K  +  LEA+   
Sbjct: 322 GEQVFDSYGPSCSPSRLLLDYGFVDEENTN--HSVDLPASVLGPVNSKANELLLEAM--- 376

Query: 356 GLSA-SECFPIQITGWPLELMAYAYLVVSP------PSMKGKFEEMAAAASNKMTSKKDI 408
           GL      F +   G    +MA+  + V+          K    E AA   +  T     
Sbjct: 377 GLPLDGAIFSLTSAGVDESVMAWTRVAVATRQELYDAGWKEGIRERAAGYPSAATVMFRF 436

Query: 409 KCP---EIDEQALQFILDSCESSISKY 432
             P   + + + L+ +L +CE  + KY
Sbjct: 437 STPINRDNESEVLRRLLSTCEFLLQKY 463


>gi|328872715|gb|EGG21082.1| hypothetical protein DFA_00957 [Dictyostelium fasciculatum]
          Length = 643

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 102/406 (25%), Positives = 184/406 (45%), Gaps = 59/406 (14%)

Query: 74  ENASTLQKWLSDSG--LPPQKMAIQKVDVG---ERGLVALKNIRKGEKLLFVPPSLVITA 128
           E+  + Q+WLS+    L P   +I  VD+G    R +VA  NI+K E L+ +P  +++T 
Sbjct: 207 EDLKSFQQWLSNKNTYLNP---SIDIVDLGPPFGRSMVANTNIKKDEILVEIPKGIMMTP 263

Query: 129 DS------KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
            S      ++      E+  + S  D   +A   I  +   + S W  Y+S LP+Q  + 
Sbjct: 264 KSMIKNLPRFIIDWMDEM--KISRTDQQAIA---IIYSILHEDSYWYEYVSILPKQFTTT 318

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---------------- 226
           +Y+TR E+ + L+AS +      R+  V   Y+    R+   Y                 
Sbjct: 319 VYFTREEMTQ-LQASPVHRFTEMRLNGVHRHYDTTISRLRFGYEGGEDDSTKTKTKSQLD 377

Query: 227 --DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK- 283
               F ++ + ++ FKW+ G ++SR   L   DG   +VP ADM N    +     + K 
Sbjct: 378 AMKEFKDDRYTLDQFKWALGCVWSRAFSLSEEDG--GMVPLADMFNADTVISRSKVHPKI 435

Query: 284 --SSQGVVFTTDRQYQPGEQVFISYG---KKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
             SS  +V+T  +  + GEQ+F  YG      +G++L+ YGF+  +G++   ++     +
Sbjct: 436 SASSPSLVYTASQDIEAGEQIFTPYGVYKTLGSGQMLMDYGFIHEDGSSADSTIVTVAPI 495

Query: 339 KKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAA 398
             S+  Y  K   ++  G+ + E F I       EL  +A +     S+  K E   A+A
Sbjct: 496 PPSEPLYDLKRHLMQSNGIESEE-FTITKNKLAKELFLFARI----KSINKK-ESDQASA 549

Query: 399 SNKMTSKKDIKCPEIDEQALQFI-------LDSCESSISKYSRFLQ 437
               T +  +  P  ++ AL+ +       LD+ +++I + ++ L+
Sbjct: 550 HFMSTQRHSMLNPRNEKAALRLLSNLISRHLDAYQTTIDQDNQILK 595


>gi|12848462|dbj|BAB27964.1| unnamed protein product [Mus musculus]
 gi|46241521|gb|AAS82953.1| endothelial differentiation inhibitory protein D10 [Mus musculus]
          Length = 594

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/317 (27%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           R L+++Q       +  N    Y     ++   +P    L  +E F  E ++W+   + +
Sbjct: 195 RCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +Q G+Q++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAESVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HSTEPPISAQLLAFLRV 387


>gi|350408192|ref|XP_003488333.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Bombus
           impatiens]
          Length = 484

 Score = 99.0 bits (245), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 97/366 (26%), Positives = 173/366 (47%), Gaps = 30/366 (8%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----A 137
           WL  +G      ++ +    + GL A +N  + E +L +P  L+ +  +  + PE     
Sbjct: 88  WLKQNGANVYGASVAEFPGYDLGLKAERNFLENELILRIPRELIFSIHN--AAPELVALQ 145

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
            + L Q  +P   L    LI +   ++ S+W  Y+  LP    ++LY T A+++  L+ S
Sbjct: 146 NDPLLQL-MPQVALAIALLIEK--HKEYSKWKPYLDILPTTYTTVLYMTAADMNE-LKGS 201

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFP---EEVFNMETFKWSFGILFSRLVRLP 254
              E A+++  N+   Y     ++F K  +       +VF  E + W+   + +R   +P
Sbjct: 202 PTLEAALKQCRNIARQYAYFN-KLFQKNNNAVSAILRDVFTYEKYCWAVSTVMTRQNIIP 260

Query: 255 SMDGRV---ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           S DG +   AL+P  DM NH + ++ T  D++ +         R ++  EQ+FISYG ++
Sbjct: 261 SKDGSLMIHALIPMWDMCNHENSKITT--DFNATLNCCECYALRDFKKAEQIFISYGART 318

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           N +  +  GFV  +  N  D  +L L + K+D   KE++E L K  L A   F ++    
Sbjct: 319 NSDFFVHSGFVYMD--NEQDGFKLRLGISKADPLQKERVELLNKLDLPAVGEFLLKPGTE 376

Query: 371 PLELMAYAYLVVSPPSMKGKFEEMAA-AASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
           P+     A+L V   SM+   EE+A    S+++   K + C     ++E   +F+L   +
Sbjct: 377 PISDTLLAFLRVF--SMRK--EELAHWIQSDRVNDLKHMDCALETVVEENVKKFLLTRLQ 432

Query: 427 SSISKY 432
             I+ Y
Sbjct: 433 LLIANY 438


>gi|403350379|gb|EJY74649.1| SET domain containing protein [Oxytricha trifallax]
          Length = 2165

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 81/269 (30%), Positives = 129/269 (47%), Gaps = 22/269 (8%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           TL KWL   G   +K+ I+      RG+ A ++I+KGE +L+VP   +IT +   + P  
Sbjct: 149 TLLKWLEQGGSHFEKLKIRYYTADYRGVHAARDIKKGEIILYVPKHQIITLEMAMTSPVG 208

Query: 138 GEV----LKQCSV-PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
            ++    L+Q  + P    L+TY++ E   +  S+W  YI  LP+   +   +   E   
Sbjct: 209 KKMYEKGLRQRLISPKHSFLSTYIMQEKR-KPESQWQIYIDILPKNFSNFPIFFTEEERI 267

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN---METFKWSFGILFSR 249
           +L+ S   ++ +E+I ++   Y DL  +   +Y   FP   ++   M      FGI    
Sbjct: 268 WLKGSPFLDQILEKIEDIKADY-DLICKEVPEYVQ-FPIREYSEIRMMVSSRIFGIQIEG 325

Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           +      DG VA   +ADMLNH    +T   Y    QG +       Q GEQV+ SYGKK
Sbjct: 326 V----KTDGFVA---YADMLNHKRPRQTSWTYTDEKQGFIIEAMEDIQRGEQVYDSYGKK 378

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSL 338
            N    L+YGF+       +D+ E+P+ +
Sbjct: 379 CNSRFFLNYGFINLN----NDANEVPIKV 403


>gi|395504553|ref|XP_003756612.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Sarcophilus
           harrisii]
          Length = 602

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 143/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G       +        GL A + I+  E  L+VP  L++T +S        
Sbjct: 89  LIKWAAENGASTDGFELVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 143

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   SS W  YI  LP +  + LY+   E+ 
Sbjct: 144 SVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSEYDTPLYFEEDEV- 201

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           ++L+++Q       +  N    Y     ++   +P+   L  ++ F  E ++W+   + +
Sbjct: 202 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTYEDYRWAVSSVMT 260

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +  GEQ++I 
Sbjct: 261 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFNVGEQIYIF 319

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 320 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 377

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 378 HFTEPPISAQLLAFLRV 394


>gi|348690659|gb|EGZ30473.1| hypothetical protein PHYSODRAFT_553476 [Phytophthora sojae]
          Length = 437

 Score = 98.2 bits (243), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 72/206 (34%), Positives = 108/206 (52%), Gaps = 15/206 (7%)

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR----LRI 221
           S+W+ +I  LPR  ++ LY+   EL R LE S +   A +    V   Y  L+    L +
Sbjct: 112 SKWAKHIELLPRTYHNALYFGPEEL-RALEGSNVYFIAQQMEEKVAHDYARLKESVLLEL 170

Query: 222 FSKYP-----DLFPEEVFNMETFKWSFGILFSRLVRLP-SMDGRVALVPWADMLNHSCEV 275
           F   P     DLF +E F++E +KW+   ++SR   +P +     A+VP  DMLNH  E 
Sbjct: 171 FENVPEGINVDLF-DEFFSLENYKWALSTIWSRFGDVPVAKQSFKAMVPVFDMLNHDPEA 229

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
           E    +D S+Q     + + +  G Q+FI+YG  SN +LL  YGFV     NP D+VE+ 
Sbjct: 230 EMSHFFDMSTQRFKLVSHQHWNAGAQMFINYGPLSNHKLLALYGFVII--GNPFDAVEMW 287

Query: 336 LSLKK-SDKCYKEKLEALRKYGLSAS 360
           L + + S K ++EK + L   GL  +
Sbjct: 288 LPMDEASTKFFQEKEQLLLTNGLDHA 313


>gi|57529914|ref|NP_001006486.1| histone-lysine N-methyltransferase setd3 [Gallus gallus]
 gi|363734802|ref|XP_003641459.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Gallus
           gallus]
 gi|75571462|sp|Q5ZML9.1|SETD3_CHICK RecName: Full=Histone-lysine N-methyltransferase setd3; AltName:
           Full=SET domain-containing protein 3
 gi|53127281|emb|CAG31024.1| hypothetical protein RCJMB04_1k10 [Gallus gallus]
          Length = 593

 Score = 97.8 bits (242), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G   +   I   +    GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL ++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
                P+     A+L V
Sbjct: 371 HSIEPPISAQLLAFLRV 387


>gi|281201870|gb|EFA76078.1| hypothetical protein PPL_10657 [Polysphondylium pallidum PN500]
          Length = 1234

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 74/269 (27%), Positives = 128/269 (47%), Gaps = 10/269 (3%)

Query: 79   LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
             +KWL+  G+   K+ I    D   RG+V  K + + E ++ VP   +I        P  
Sbjct: 740  FEKWLASDGVQCPKLQIANFQDSTGRGIVTTKKVEENEVIIKVPRKFLINVQVAREHPIL 799

Query: 138  GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
            G + ++ S + D  +L  ++I E     +S W  +   LP    + +++T  EL   LE 
Sbjct: 800  GRIFEEFSGLNDDTILFLFVIYEKE-NPNSFWRPFFDTLPSYFPTSIHYTSTELLE-LEG 857

Query: 197  SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
            + +    ++   ++    + L   +  +YP +FPE +F+ E F W+  +  SR ++L  +
Sbjct: 858  TNLFAETLQVKEHLQSIRDMLFPELSEQYPTIFPESLFSWENFLWARSLFDSRAIQL-KI 916

Query: 257  DGRVA--LVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
            D ++   LVP ADM+NH    +    + D++ Q     +     P  Q+F+ YG   N E
Sbjct: 917  DDKITNCLVPMADMINHHHNAQISQRFFDQTDQCFKMVSCCSVPPNAQIFLHYGALQNRE 976

Query: 314  LLLSYGFVPREGTNPSDSVELPLSLKKSD 342
            L L YGFV ++  NP DS+ +   L   D
Sbjct: 977  LALYYGFVIQD--NPYDSMLIGFDLPDED 1003


>gi|325183831|emb|CCA18289.1| conserved hypothetical protein [Albugo laibachii Nc14]
 gi|325183979|emb|CCA18437.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 561

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/276 (29%), Positives = 131/276 (47%), Gaps = 23/276 (8%)

Query: 69  EIDSLEN---ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
           ++  LEN    + L  WL + G   +K+ +Q+     RG+     +  GE++LF+P + +
Sbjct: 107 DVADLENDVVGAELIDWLQNQGAETKKLMLQQYAPEVRGVHCRNELVPGERILFIPKNCL 166

Query: 126 ITADSKWSCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
           IT +         +VL    +   P    L  YL+++   +  + +  Y S LP    ++
Sbjct: 167 ITVEMGKQTEIGQKVLAHNIEFVAPKHIFLILYLLTDMEKKDLTFFKYYYSTLPSTLKNM 226

Query: 183 -LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
            ++W+  EL  +L+ S I  +  ER   +   Y+     I    P       F++E F W
Sbjct: 227 PIFWSDQEL-SWLKGSYILHQIQERKAAIRKDYD----AICRADPSF---SRFSLERFSW 278

Query: 242 SFGILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           +  I+ SR   L ++DG    ALVP+ADMLNH    ET   +D+   G   T+      G
Sbjct: 279 ARMIVCSRNFGL-TIDGVKTAALVPFADMLNHYRPRETSWTFDQKLDGFTITSLESICSG 337

Query: 300 EQVFISYGKKSNGELLLSYGF-----VPREGTNPSD 330
            QV+ SYGKK N   LL+YGF        +G+NP++
Sbjct: 338 AQVYDSYGKKCNHRFLLNYGFAVEDNTEEDGSNPNE 373


>gi|395518633|ref|XP_003763464.1| PREDICTED: SET domain-containing protein 4 [Sarcophilus harrisii]
          Length = 440

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 86/310 (27%), Positives = 144/310 (46%), Gaps = 29/310 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL +       +   +     RGL+A+K+++ GE ++ +P   ++T D+       G
Sbjct: 36  LRKWLKERKFEDHNLRPTRFSGTGRGLMAVKSLQPGELIISLPEKCLLTTDTVIK-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           + + + + P  PLLA  T+LISE +    S W  Y+  LP+  Y+ L     ++ R L  
Sbjct: 95  DYITKWTPPISPLLALCTFLISENNAGNKSPWKPYLDILPKD-YTCLVCLEPQVVRLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+ T V   +   R   FS    LF E+V   F+   F W++  + +R V +
Sbjct: 153 KPLKIKAQEQKTQVQELFVSSR-GFFSSLQSLFTEDVKHIFHYHAFLWAWCTINTRTVYM 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                +         AL P+ D+LNHS  V+    +++ ++     T    +  E++FI 
Sbjct: 212 KHAQKKCLSAEPDVYALAPYLDLLNHSPGVQVNAAFNEKTRCYEIRTTSSCKKYEELFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSAS 360
           YG   N  LLL YGFV     NP  +V + +      L   D    +KL  L+++G S +
Sbjct: 272 YGPHDNHRLLLEYGFVAI--NNPHSAVYVSIDSLVDHLPSVDTQMNKKLSLLKEHGFSEN 329

Query: 361 ECFPIQITGW 370
             F     GW
Sbjct: 330 LTF-----GW 334


>gi|428173103|gb|EKX42007.1| hypothetical protein GUITHDRAFT_141487 [Guillardia theta CCMP2712]
          Length = 355

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 129/314 (41%), Gaps = 32/314 (10%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV---- 125
           I         +KW+    L   K+ ++ +     G  A  +I  GE  + +P  ++    
Sbjct: 40  ISDARKIDAFEKWIQSQKLAVNKLEVKSIPGFRMGTTAKDDIADGELYIAIPDHMLMGPE 99

Query: 126 --------------ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
                         I      S  E   +L + +     +L  +L+   + +K S W  Y
Sbjct: 100 RVEPGSRLDKKLMKIVKSQSISMQEQRRLLSEKN----KVLMYFLLQMYNPKKESFWKPY 155

Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
              +P    S ++W+  EL   L  S++   A      +   Y++LR RIF      F +
Sbjct: 156 FDIMPTNLTSPIFWSEDELQE-LAGSEVSNMARIEKKRLRAMYDELRERIFKHDRKTFLK 214

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLN-HSCEVETFLDYDKSSQG 287
           + F ++ + W+ G+  SR+++L    G       +P  DM+N    + +TF+ YDK  + 
Sbjct: 215 QAFTLKNWFWANGLYDSRVIQLNRQTGHGNVPTFIPLIDMVNCIESQDKTFIQYDKKLRA 274

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY-- 345
            V   DR    G QVF SYG KSN E LL  GFV  +  N    +  P S  +  K Y  
Sbjct: 275 AVMYADRAVSRGVQVFESYGNKSNYEYLLYNGFVMEDNPNDCVYISFPSSNARDAKSYLI 334

Query: 346 ---KEKLEALRKYG 356
              +EK    R++ 
Sbjct: 335 KHIEEKRRGYRRFA 348


>gi|281205954|gb|EFA80143.1| hypothetical protein PPL_06965 [Polysphondylium pallidum PN500]
          Length = 417

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 85/319 (26%), Positives = 156/319 (48%), Gaps = 25/319 (7%)

Query: 73  LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           +++  T ++W+ D G  L P    ++  D G R ++A   I++G+ L+ VP +++++   
Sbjct: 8   IDDLVTFKQWMDDEGIYLNPSLDIVKLEDYG-RSIIANTLIKEGDVLIRVPRNVMMSRTG 66

Query: 131 -KWSCPEAGEVLKQCSVPDWPLL---ATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
            +   P+    +   +  D       A YL+  +   K S W  Y S LP+Q  + +Y+ 
Sbjct: 67  IELHIPKEIRSIIDSNRDDIGSTDGQAVYLMY-SLLNKDSYWHQYTSILPKQFTTSIYFD 125

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
           + E+ + L+ S++R     R++ +   YN +  +  S   D F ++ +  E FKW+   +
Sbjct: 126 QDEM-KELQLSKLRYFTESRLSGIERHYN-VIFKKLSSLNDEFKKKEYTFELFKWALSCI 183

Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           +SR   L S DG   +VP ADM N   + ++ +  D  +  +++   +  + GEQVF  Y
Sbjct: 184 WSRAFSLSSDDG--GMVPLADMFNAIEKAKSKVRPDSRADQLIYYASKDIERGEQVFTPY 241

Query: 307 G---KKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKE-KLEALRKYGLSA 359
           G      N ++L+ YGF    P EG    D+++L L     D+ Y + K++ L +  L  
Sbjct: 242 GVYKTIGNAQMLMDYGFAFDDPSEG----DTIQLTLDNFSDDELYIDTKIDLLEQ--LDI 295

Query: 360 SECFPIQITGWPLELMAYA 378
              F ++    P EL+ YA
Sbjct: 296 VREFNLKRNQLPQELLIYA 314


>gi|301094750|ref|XP_002896479.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109454|gb|EEY67506.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/291 (28%), Positives = 143/291 (49%), Gaps = 22/291 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +WL  +G   +K+ +Q+     RG+ + K +  GE++L +P   +IT +         
Sbjct: 37  LIQWLETNGADSKKLTLQEYAPEVRGVHSRKVLVPGERILVIPKKCLITVEMGKQTDIGR 96

Query: 139 EVLKQ---CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYL 194
           ++L +      P    L  +L+++    ++S + NY S LP    ++ ++W+  EL  +L
Sbjct: 97  KLLARNVDFVAPKHIFLMMFLLTDMEHVETSFFRNYYSTLPSTLSNMPIFWSEEELS-WL 155

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           + S I ++  ER   +   Y D+  R+   +        F+++ F W+  I+ SR   L 
Sbjct: 156 KGSYIIQQIQERKAAIRKDY-DVICRVDPSFAR------FSLDRFSWARMIVCSRNFGL- 207

Query: 255 SMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
           ++DG    ALVP+ADMLNH    ET   +D+S      T+      G QV+ SYGKK N 
Sbjct: 208 TIDGVKTAALVPFADMLNHYRPRETSWTFDQSIDAFTITSLGTIGTGAQVYDSYGKKCNH 267

Query: 313 ELLLSYGF-----VPREGTNPSDSVELPLSLKKSD-KCYKEKLEALRKYGL 357
             LL+YGF        +G NP++ V +   L  +D + + +K   L + G+
Sbjct: 268 RFLLNYGFAVEDNTEEDGRNPNE-VLIDFQLSPADGQLFYDKRAYLHESGI 317


>gi|126290266|ref|XP_001367810.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Monodelphis domestica]
          Length = 595

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 142/317 (44%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW + +G       +        GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LIKWAAANGASTDGFELVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   SS W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGALYSQDRILQAMGNITLAFHLLCERA-NPSSFWLPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           ++L+++Q       +  N    Y     ++   +P+   L  ++ F  E ++W+   + +
Sbjct: 195 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + +  GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFNVGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|303288796|ref|XP_003063686.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454754|gb|EEH52059.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 538

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 140/323 (43%), Gaps = 39/323 (12%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDV--GERG--LVALKNIRKGEKLLFVPPSLVITADSKWS 133
            L  WL   G     ++   VD   G RG  LVA +++  G+  + VP +L +T ++ ++
Sbjct: 81  ALWTWLEREGADVASVSPALVDATPGGRGWGLVATRDVGGGDAAIVVPRALWMTKETAFA 140

Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELD 191
             + G  L   + P W  LA  L+ E S    SRW+ YI  LPR     + L+W+  EL 
Sbjct: 141 S-KIGTALDPETTPPWCALALQLLHEKSLGDDSRWAAYIRCLPRVEALDAPLFWSSEELA 199

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-------PEEVFNMETFKWSFG 244
             L  +Q+   A    + V GT+  L+   F ++P LF           F+   F W+FG
Sbjct: 200 E-LAGTQLLANAAGYDSYVRGTHAALKETTFKEHPALFGDAGDDDGGGAFSEREFLWAFG 258

Query: 245 ILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVF------------ 290
           +L SR   LP +D    +AL+P  DM NH          +      VF            
Sbjct: 259 VLRSRA--LPPVDQGESIALIPGIDMANHDGLCSQTWQLNNGGIAAVFGGRGGADGGGSV 316

Query: 291 ------TTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
                 T     + GE++  +YG  + + +  L YGFV    + P   V  PLS+ + D 
Sbjct: 317 LLRVEKTKAGGAKRGEEIRCNYGPANIDSQFALDYGFVDAFCSRPG-YVLGPLSIPEDDV 375

Query: 344 CYKEKLEALRKYGLSASECFPIQ 366
              +K++ L   GL  S  F I+
Sbjct: 376 NAFDKMDVLSVAGLKESPAFTIR 398


>gi|296090251|emb|CBI40070.3| unnamed protein product [Vitis vinifera]
          Length = 428

 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/308 (28%), Positives = 140/308 (45%), Gaps = 29/308 (9%)

Query: 83  LSDSGLPPQKMAIQKVDVGERG------LVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           + ++GLPP K+ +++             + A ++++ G+    VP SLV+T +       
Sbjct: 1   MHENGLPPCKVVLKERPSHHEQHKAIHYIAASEDLQAGDVAFSVPDSLVVTLERVLGNET 60

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 189
             E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W+ +E
Sbjct: 61  IAELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSESE 120

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSF 243
           L  YL  S  +   +ER   +   YN+L         +F +YP   P E F  E FK +F
Sbjct: 121 L-AYLTGSPTKAEVLERAEGIKREYNELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAF 179

Query: 244 GILFSRLVRLP--SMDGRVALVPWA-DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYQPG 299
             + S +V L   S+  R ALVP    +L +    +  L   D S Q VV   DR Y+ G
Sbjct: 180 VAIQSCVVHLQKVSLARRFALVPLGPPLLAYRSNCKAMLAAVDGSVQLVV---DRPYKAG 236

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           E + +  G + N +LLL+YGFV  +  N  D + +  +L   D  Y++K    ++ G   
Sbjct: 237 ESIVVWCGPQPNSKLLLNYGFVDED--NSYDRIVVEAALNTEDPQYQDKRMVAQRNGKLT 294

Query: 360 SECFPIQI 367
            + F + +
Sbjct: 295 VQKFHVSV 302


>gi|125578929|gb|EAZ20075.1| hypothetical protein OsJ_35675 [Oryza sativa Japonica Group]
          Length = 536

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/273 (30%), Positives = 125/273 (45%), Gaps = 21/273 (7%)

Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSN 170
           ++ G+    VP SLV+T +         E+L    + +   LA YL+ E    + S W  
Sbjct: 143 LQAGDVAFEVPMSLVVTLERVLGDESVAELLTTNKLSELACLALYLMYEKKQGQDSFWYP 202

Query: 171 YISALPRQP-------YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL---- 219
           YI  L RQ         S L WT +EL+ YL+ S I++  + R   +   YN+L      
Sbjct: 203 YIKELDRQRGRGQLAVESPLLWTESELN-YLKGSPIKDEVVARDEGIRREYNELDTLWFM 261

Query: 220 --RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--SMDGRVALVPWAD-MLNHSCE 274
              +F +YP   P E F  E FK +F  + S +V L   S+  R ALVP    +L +   
Sbjct: 262 AGSLFQQYPFDIPTEAFPFEIFKQAFVAVQSCVVHLQKVSLARRFALVPLGPPLLTYKSN 321

Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
            +  L     S  V    DR Y+ GE + +  G + N  LLL+YGF+  +  NP D + +
Sbjct: 322 CKAMLTAVGDS--VRLVVDRPYKAGEPIIVWCGPQPNSRLLLNYGFIDED--NPYDRIVI 377

Query: 335 PLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
             SL   D  ++EK    ++ G  A + F + +
Sbjct: 378 EASLNIEDPQFQEKRMVAQRNGKLAIQNFHVCV 410


>gi|348554489|ref|XP_003463058.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Cavia
           porcellus]
          Length = 789

 Score = 96.3 bits (238), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            +L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SILGPLYSQDRILQAMGNIALAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           + L+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 QCLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFRAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFF--FDNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T  P+     A+L V
Sbjct: 371 HFTEPPISAQLLAFLRV 387


>gi|224051705|ref|XP_002200601.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Taeniopygia
           guttata]
          Length = 593

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 83/317 (26%), Positives = 144/317 (45%), Gaps = 24/317 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G   +   I   +    GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +    S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPHSFWLPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           ++L+++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 QHLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
           R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + ++ GEQ++I 
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDFKAGEQIYIF 312

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +
Sbjct: 313 YGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFAL 370

Query: 366 QITGWPLELMAYAYLVV 382
             T   +     A+L V
Sbjct: 371 HSTEPAISAQLLAFLRV 387


>gi|291235388|ref|XP_002737626.1| PREDICTED: SET domain containing 4-like [Saccoglossus kowalevskii]
          Length = 353

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/284 (27%), Positives = 133/284 (46%), Gaps = 19/284 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +W+S +G     +         RGL+A K  + G++++ +P  L+IT  +  S    G
Sbjct: 34  LVRWMSRNGFKGALLKPANFKETGRGLMATKPFQIGDQVISIPEMLLITTQNVLS-SYLG 92

Query: 139 EVLKQCSVPD---WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           + +KQ + P      ++ TYLI E S +K S W NYI  LP+   + +Y+T  E++    
Sbjct: 93  DFIKQQTRPKLSPMQVICTYLICERSRQKDSFWYNYIKVLPKSYSNPVYFTNEEIN--WL 150

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETFKWSFGILFSRLVRL 253
             +I+ +  +    +   Y +L+  +FS     F   + +F    F+W++  + +R V +
Sbjct: 151 PRRIKRKVFDECEKINTAYRELK-NLFSILESTFVSFKGIFEYSAFRWAWCTVNTRSVYM 209

Query: 254 -----PSMD---GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                P +       AL P+ D+LNH+  VE    Y+  S+     T    +  +Q+FI 
Sbjct: 210 LQEQNPHLSIERDHYALAPFLDLLNHTNTVEVKASYNPVSKCYEIFTCTACKKYDQMFIY 269

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
           YG   N +L + YGFV  +  N  + VEL         C + KL
Sbjct: 270 YGPHDNVKLFIEYGFVLPQ--NQHNVVELDFEDIYCKTCEERKL 311


>gi|302820198|ref|XP_002991767.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
 gi|300140448|gb|EFJ07171.1| hypothetical protein SELMODRAFT_430007 [Selaginella moellendorffii]
          Length = 389

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 127/259 (49%), Gaps = 26/259 (10%)

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP-DWPLLATYLISEA 160
           +RGL A ++IR GE+++ +P  LV+TA+    C     V K  S   DW  L   +++E 
Sbjct: 9   KRGLFAARSIRAGEQIVRIPHDLVLTAEKLDDC-----VKKLLSTEYDWCPLTLLILAEQ 63

Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
              ++SRW+ Y+S LP     +S ++W + EL ++LE ++      ER   +   Y  ++
Sbjct: 64  HKGEASRWAPYVSCLPSFGDHHSTIFWEKEEL-KFLECTRAFRGTAERREMISDEYISVK 122

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF 278
             + S  P +F E++ ++  F  ++  + SR     ++   +++ P+ D  NH       
Sbjct: 123 -NVISSCPHVFGEDI-SLFQFAHAYATVVSRAWN-GALSSEISMRPFVDFCNHDPVSHAT 179

Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           + +D      V            VFISYGK+SN  L + YGFV     N SD  EL + +
Sbjct: 180 VSHDSCKDATV------------VFISYGKRSNAVLAVDYGFVL--PNNLSDQAELWMEI 225

Query: 339 KKSDKCYKEKLEALRKYGL 357
             +D   ++KLE +  + +
Sbjct: 226 PWNDPLREKKLELMGAFNM 244


>gi|440792294|gb|ELR13522.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 568

 Score = 95.9 bits (237), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 149/327 (45%), Gaps = 28/327 (8%)

Query: 65  PWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-----GLVALKNIRKGEKLLF 119
           P G   D LE    L+ WL  +GL  +   ++ ++         G+VA K+ +KGE  L 
Sbjct: 60  PSGPTRDDLEQ---LRVWLLKNGLDSK--WLEGIEFAANLPEGSGVVAKKDFKKGEPFLQ 114

Query: 120 VPPSLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
           VP  L+ T  +  + P  G++LK        P    LA +L+ E     SS W+ YI  L
Sbjct: 115 VPRKLMFTCQAMQNTP-LGQLLKVDKFLAQSPSL-CLALHLLVE-KHNHSSFWTPYIKTL 171

Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
           P+   + LY+T  EL+  L  S     AI+ I  V   Y  +   +F    D+     F 
Sbjct: 172 PKSYGTCLYFTLEELEG-LRGSPTFTSAIKVIATVAIQYTYIH-DLFQIRKDILHINAFT 229

Query: 236 METFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
            + F W+   + SR  ++P          AL+P  DM NH   +++TF  +D +S     
Sbjct: 230 WDEFIWAMSAVGSRQNQVPQWGHNALSEYALIPAWDMCNHDHGDLQTF--WDVNSDSTES 287

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
              R Y+ GEQV+I YG + N +LLL  GFV     N  D++ + + L    +  K+KL 
Sbjct: 288 HAMRAYKKGEQVYIFYGPRPNSDLLLHAGFVYE--NNRFDALAIRVRLAPDAEHIKDKLR 345

Query: 351 ALRKYGLSASECFPIQITGWPLELMAY 377
            L    +     + +   G  ++LMA+
Sbjct: 346 LLHLNNMKMDSQYYLYGLGLAVDLMAF 372


>gi|320163219|gb|EFW40118.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1188

 Score = 95.5 bits (236), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 146/313 (46%), Gaps = 37/313 (11%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ--------CSVPDWPLL 152
           G+RG  A  ++  G++L  +P + +I+       P    +L            +P   L+
Sbjct: 157 GDRGFFATCDLAPGDELASMPIATIISEQLASRSPVGMAMLSSPMLKRRGVTPIPGRTLI 216

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
             YLI+    +  S + +YI+ LP+     L+W  AELD +L+ + I     ER   V  
Sbjct: 217 CAYLIANRG-KLDSPFYHYINILPQTYSDPLWWNDAELD-HLDGTNIGGYIQERRNQVRN 274

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL-----------------PS 255
            + ++   +  + P LFP++VF  E + W+F    SR   L                 P 
Sbjct: 275 QFLNVFPVLSREQPALFPKDVFTYEAYLWAFSTCSSRAFPLRVTVNPTTGVESHAIGNPM 334

Query: 256 MDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
            +  V  L+P  DM+NH          D++S  V F T  + + GEQV+ +YG KSN EL
Sbjct: 335 KEPCVECLLPLLDMMNHQFGASITWFTDETS--VRFFTGAKVRKGEQVYNNYGPKSNEEL 392

Query: 315 LLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLE 373
           L+ YGF +P    N +D V++ L++  +D   + KL  LR +GLS +    +     P+E
Sbjct: 393 LMGYGFCLP---NNEADHVKIQLTV-GNDPDGEAKLAILRWHGLSLTHF--LHNRSVPVE 446

Query: 374 LMAYAYLVVSPPS 386
           L +   ++V  P+
Sbjct: 447 LFSALRVLVMTPA 459


>gi|146181028|ref|XP_001021989.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146144300|gb|EAS01744.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 590

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 81/285 (28%), Positives = 130/285 (45%), Gaps = 16/285 (5%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L N +T+  WL D       + +Q      RG+ A + +   E +LF+P S +IT +   
Sbjct: 148 LVNHNTMINWLLDGKSEFDNLKLQWYSKNYRGVHARRKVYNKETILFIPKSHLITLEMAK 207

Query: 133 SCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRA 188
               A +++        P    L+T+L+ E    K S+W  Y+  LP       ++++  
Sbjct: 208 ETDVAKKIIAAKLNLLSPKHSFLSTFLLQERK-NKESKWKPYLDILPSDYNQFPIFFSED 266

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           +L  +L+ S  + +  E+  ++   Y+D    I S  P+ F E  F  E F W+     S
Sbjct: 267 DLS-WLKGSPFQNQVREKKADIKRDYDD----ICSVAPE-FAEYTF--EDFCWARMTASS 318

Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
           R+  L   + +  A VP ADMLNH    +T   YD   +G V         GEQV+ SYG
Sbjct: 319 RVFGLQINEQKTDAFVPLADMLNHRRPKQTSWQYDDQREGFVIQALEDIPRGEQVYDSYG 378

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
           +K N    L+YGF+  +  N ++ V L L+    D   + K E +
Sbjct: 379 RKCNSRFFLNYGFINLD--NDANEVALRLTFDAEDPTIERKKEMM 421


>gi|327290197|ref|XP_003229810.1| PREDICTED: SET domain-containing protein 4-like [Anolis
           carolinensis]
          Length = 440

 Score = 95.1 bits (235), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 88/295 (29%), Positives = 142/295 (48%), Gaps = 20/295 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL + G    K+   +     RGLV  K ++ GE ++ +P   ++T D+  +     
Sbjct: 36  LKKWLKEKGCNVNKLRPAQFPETGRGLVTTKGLQVGELIISLPEKCLLTTDTVLN-SYLR 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           E + + + P  PL+A  T+LI+E   ++ S W  Y+  LP + YS       ++   L  
Sbjct: 95  EYIVKWTPPISPLIALCTFLIAEKWAQEKSPWKPYLDLLP-EIYSCPVCLEQKIVN-LFP 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             +R +A E+   V   +   +   FS  P LFP++V   FN + FKW++  + +R V +
Sbjct: 153 EPLRRKAHEQRKLVQELFISSQQFFFSLQP-LFPKDVASVFNYQAFKWAWCTINTRTVYM 211

Query: 254 P-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                   S D    AL P+ D+LNH+  V+    +++ ++    TT  Q     +VFI 
Sbjct: 212 KHSQRDCFSRDTDTYALAPYLDLLNHNPTVQVKAGFNEKTKCYEITTVTQCHHYNEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK---SDKCYKEKLEALRKYGL 357
           YG   N  LLL YGFV R+  + S  V     LK     DK   +KL  L+++ L
Sbjct: 272 YGPHDNQRLLLEYGFVSRDNPHSSVYVGTDTLLKNVFPEDKQRPKKLSILQEHKL 326


>gi|432952574|ref|XP_004085141.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Oryzias
           latipes]
          Length = 606

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 78/315 (24%), Positives = 149/315 (47%), Gaps = 16/315 (5%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           + L  W  ++G       I        GL   ++I+  E  L+VP  +++T +S  +   
Sbjct: 80  ADLMSWAQENGASCDGFTITNFGTEGYGLRTTRDIKAEELFLWVPRKMLMTVESAQNSV- 138

Query: 137 AGEVLKQCSVPDW---PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
            G +  Q  +        LA +L+ E   + +S WS YI +LP++  + LY+ + ++ + 
Sbjct: 139 LGPIYSQDRILQAMGNVTLALHLLCERG-DPASFWSPYIRSLPQEYDTPLYYQQEDV-QL 196

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRL 250
           L  +Q  +  + +  N    Y     ++   +P    L  ++ F+ + ++W+   + +R 
Sbjct: 197 LLGTQAVQDVLNQYKNTARQYAYF-YKLVQTHPAASKLPLKDGFSFDDYRWAVSSVMTRQ 255

Query: 251 VRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            ++P++DG    +AL+P  DM NH+  + T   Y+           + Y+  EQ++I YG
Sbjct: 256 NQIPTVDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYG 314

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            +SN E ++  GF  ++  N  D V++ L + KS++ Y  K E L + G+ AS  F +  
Sbjct: 315 TRSNAEFVIHNGFFFQD--NAHDRVKIKLGVSKSERLYAMKAEVLARAGIPASCVFALHC 372

Query: 368 TGWPLELMAYAYLVV 382
              P+     A+L V
Sbjct: 373 NDPPISAQLLAFLRV 387


>gi|47215092|emb|CAF98166.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 444

 Score = 95.1 bits (235), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 77/310 (24%), Positives = 145/310 (46%), Gaps = 16/310 (5%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           W  + G   +  A+        GL A ++I+  E  L++P  +++T +S       G + 
Sbjct: 3   WAQEHGASCEGFAVTNFGAEGYGLRATRDIKAEELFLWIPRKMLMTVESAKKSV-LGPLY 61

Query: 142 KQCSV---PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
            Q  +    D   LA +L+ E + + +S W  YI  LP++  + L++ + ++ + L  +Q
Sbjct: 62  TQDRILQAMDNVTLALHLLCERA-DPASFWLPYIRTLPQEYDTPLFYQQQDV-QLLHGTQ 119

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPS 255
             +  + +  N    Y     ++   +P    L  ++ F  + ++W+   + +R  ++P+
Sbjct: 120 AIQDVLSQYRNTARQYAYF-YKLVQTHPASSKLPLKDSFTFDDYRWAVSSVMTRQNQIPT 178

Query: 256 MDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
            DGR   +AL+P  DM NH   + T   Y+           + Y+  EQ++I YG +SN 
Sbjct: 179 EDGRQVTLALIPLWDMCNHRNGLIT-TGYNLEDDRCECVALQDYKKNEQIYIFYGTRSNA 237

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
           E ++  GF  +E  N  D V++ L + KS++ Y  K E L + G+  S  F +     P+
Sbjct: 238 EFVIHNGFFYQE--NAHDQVKIKLGISKSERLYAMKAEVLGRAGIPVSSVFALYCNEPPI 295

Query: 373 ELMAYAYLVV 382
                A+L V
Sbjct: 296 SAQLLAFLRV 305


>gi|198413420|ref|XP_002131202.1| PREDICTED: similar to SET domain containing 3 [Ciona intestinalis]
          Length = 577

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 88/321 (27%), Positives = 153/321 (47%), Gaps = 35/321 (10%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVIT- 127
           ID        + WL + G+    + IQ+V   E  G++AL++I     L+ +P   ++T 
Sbjct: 74  IDRTTAIPKFKSWLKEHGVEYSAIDIQEVSEEEGFGVIALQDIEIKCPLVTIPRKAMMTY 133

Query: 128 ADSKWS----CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
            D+K S      E  EVL   SV     LA YL  E  F  +S++  YI  +P++  ++L
Sbjct: 134 EDAKSSYLAGLIEGNEVL---SVMPNVCLALYLHCE-RFTLNSKYQPYIDMIPQEFNTIL 189

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPDLFPEEVF 234
           Y+   E+ +YL+ +     AI +  +++  +  L  ++F+         K P L     F
Sbjct: 190 YFKPHEM-KYLKGTAALSVAINQFKSIVRQFA-LLYQVFNGSHQKEDVEKLP-LQARNAF 246

Query: 235 NMETFKWSFGILFSRLVRLPSMDGRV----------ALVPWADMLNHSCEVETFLDYDKS 284
             +T++W    + +R  ++P+  G V          AL+P  DM NH+    +   Y+  
Sbjct: 247 TFDTYRWCASAVTTRQNKIPTHVGDVLGDLDENSTLALIPMWDMFNHAIGPLS-TAYNAL 305

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
           ++G+     + ++ GEQV I YG ++N +LL+  GFV +E  +P D V + L + + D  
Sbjct: 306 TRGIECLAMQDFKTGEQVKICYGARTNSDLLIHNGFVMKE--SPFDKVRIHLGVSQKDPL 363

Query: 345 YKEKLEALRKYGLSASECFPI 365
           Y  K + L K  +  S  F +
Sbjct: 364 YSLKAKLLEKLNVEVSGQFAV 384


>gi|357444999|ref|XP_003592777.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481825|gb|AES63028.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 451

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/254 (30%), Positives = 129/254 (50%), Gaps = 16/254 (6%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           L A K+I+ G+ +L VP SL +T D+    PE    + +  V +   LAT L+   +  +
Sbjct: 64  LFASKSIQTGDCILQVPYSLQLTPDNL--PPEIKPFISE-DVGNIAKLATVLLIHKNLGQ 120

Query: 165 SSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
            S W  YIS LP Q   ++ ++W  +EL+  +  S + +  I + + +   + +++  +F
Sbjct: 121 DSEWHPYISCLPPQAEMHNTIFWNESELE-MIRQSSVYQETIYQKSQIEKDFLEIK-PVF 178

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
             +   F +  F  + F  +  ++ SR     S  G ++L+P+AD LNH    E  +  D
Sbjct: 179 QPFCQSFGD--FTWKDFMHACTLVGSR--AWGSTKG-LSLIPFADFLNHDGISEAIVMSD 233

Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
             ++     +DR Y PGEQV I YGK SN  L+L +GF +P    N  D V++   + K 
Sbjct: 234 DDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLDFGFTIPY---NIYDQVQIQYDIPKY 290

Query: 342 DKCYKEKLEALRKY 355
           D     KLE L++Y
Sbjct: 291 DPLRHTKLELLQQY 304


>gi|356564844|ref|XP_003550657.1| PREDICTED: uncharacterized protein LOC100778605 [Glycine max]
          Length = 549

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/339 (23%), Positives = 151/339 (44%), Gaps = 27/339 (7%)

Query: 17  SHLHKAQSPAGFTDFPRKRCGH---------RIVVHCSVSTTNDASRTKTTVTQNMIPWG 67
           + L    S    TD     C H         R  +   +S   D +  K  V +N     
Sbjct: 88  NELEALNSIVLLTDISLSTCTHLHTNILQGLRQTILDLISDFGDKNSVKGVVEKN----- 142

Query: 68  CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT 127
               S +    L +W   +GL  Q + I  ++   RG +A K+++ G+  L +P S++I+
Sbjct: 143 ---HSCDQEERLLEWGESNGLMTQ-LKIAYIEGASRGAIARKDLKVGDIALEIPVSIIIS 198

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            +        G + +   +    +L  + + E  +   S++  Y   LP +  + L ++ 
Sbjct: 199 EELVHETDMYGVLKEIDGISSETILLLWSMKE-KYNCDSKFKIYFDTLPEKFNTGLSFSI 257

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
             +   L+ + + E  ++   ++   Y++L   + + +PD+FP E++  E F W+  + +
Sbjct: 258 QAI-TMLDGTLLLEEIMQARQHLHAQYDELFPALCNNFPDIFPPELYTWEKFLWACELWY 316

Query: 248 SRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           S  +++   DG  R  L+P A  LNHS    V  +   D ++  + F   R  + GE+  
Sbjct: 317 SNSMKIMYSDGKLRTCLIPLAGFLNHSLCPHVMHYGKVDPATNSLKFCLSRPCRSGEECC 376

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           +SYG  S+  L+  YGF+P +G N  D +  PL +  SD
Sbjct: 377 LSYGNFSSSHLITFYGFLP-QGDNSYDVI--PLDIDGSD 412


>gi|330806388|ref|XP_003291152.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
 gi|325078672|gb|EGC32310.1| hypothetical protein DICPUDRAFT_155733 [Dictyostelium purpureum]
          Length = 465

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 159/332 (47%), Gaps = 44/332 (13%)

Query: 69  EIDSLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
           EI+SL+     ++WL ++   + P  + I+ +D   R +VA K+I+K +KL+ +P  +++
Sbjct: 36  EIESLK---EFKEWLVNNNAYINPN-IDIELLDKYGRSIVAKKSIKKQDKLISIPKDIIM 91

Query: 127 TADSKWSCPEAGEVLKQC-SVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
           +    +      E+ +Q  S+   P  L    I  +   + S W  Y++ LP    + LY
Sbjct: 92  SNIGGYPKKIPKEIYEQVQSIGLSPTNLQAVFIMYSKLNEKSFWHPYVTVLPESFSTSLY 151

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFK 240
           ++  ELD  L+ASQ++E  I R   +   Y       FS+   L PE     ++N E F 
Sbjct: 152 FSDNELDE-LQASQLKEFTIIRKDGIERHYE----STFSRLSKLVPEFSNLALYNQELFT 206

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---------VETFLDYDKSSQGVVFT 291
           W+   ++SR   L   DG   +VP ADM N              +T LDY        + 
Sbjct: 207 WALSCVWSRAFSLAENDG--GMVPLADMFNAEDRSKSKVLPKVTDTTLDY--------YA 256

Query: 292 TDRQYQPGEQVFISYGKK---SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK-CYKE 347
           +D     GEQ+F  YG     S+ ++L+ YGF+  EGT  SD+V + + +  +D+     
Sbjct: 257 SD-DIAEGEQIFTPYGVYKPLSSSQMLMDYGFIFDEGT-VSDNVAITVPVFHNDEPNLST 314

Query: 348 KLEALRKYGLSASECFPIQITG-WPLELMAYA 378
           K E L +  +  +E F +Q T   P +L+ YA
Sbjct: 315 KQEILEENDI-INEVFLLQKTDPLPADLLLYA 345


>gi|308811012|ref|XP_003082814.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
 gi|116054692|emb|CAL56769.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplast precursor (ISS)
           [Ostreococcus tauri]
          Length = 588

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 126/283 (44%), Gaps = 25/283 (8%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD----WPLLATYLISE 159
           G+ A   +R+G + + +P  + + A       + G  L+          W  +A  L+ E
Sbjct: 76  GVRAKTTLRRGTRAMVIPREVWMDATRATEDADVGAALRDARYDAVKQPWVRVALLLLKE 135

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
                   ++ Y++ LP+   S L+W+  EL R +  +Q+ + A      V   Y +L+ 
Sbjct: 136 RERGADGEFAAYVATLPKTLDSPLFWSADEL-RDIAGTQLLDNAAGYDAYVRAVYEELKN 194

Query: 220 RIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSC---- 273
            +F +Y   F  +  F+  +F+W+FGIL SR   +  +DG  VALVP  D++NHS     
Sbjct: 195 GVFVEYASTFDVDGAFDEASFRWAFGILRSRT--MAPLDGANVALVPGLDLINHSSLSGA 252

Query: 274 ---------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPR 323
                        F     S        DR Y  G ++F++Y  +  + +  L YGF+  
Sbjct: 253 RWRVGGGGGMGGLFGGGSGSGVAAYVECDRDYDEGAEIFVNYDPEGIDSKFALDYGFI-- 310

Query: 324 EGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ 366
           +  NPS    L LS+ + D    +KL+ L   GL  +  F ++
Sbjct: 311 DVVNPSPGYALTLSIPEDDANLFDKLDVLETQGLPEAPTFTLR 353


>gi|62860180|ref|NP_001017105.1| SET domain containing 4 [Xenopus (Silurana) tropicalis]
 gi|89267009|emb|CAJ81787.1| novel protein containing a SET domain [Xenopus (Silurana)
           tropicalis]
          Length = 442

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/268 (29%), Positives = 133/268 (49%), Gaps = 21/268 (7%)

Query: 79  LQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           LQ+WL + G   + + A +  D G RGL+A ++++ GE ++ +P S +IT ++       
Sbjct: 36  LQRWLKERGFQGRHLRAAEFTDTG-RGLMATRDLQPGELIISLPDSCLITTETVLQS-YL 93

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           G+ ++  S P  PLLA  T+LI+E    + S W  Y+  LP      +YW  +E+   L 
Sbjct: 94  GKYIRTWSPPVSPLLALCTFLIAERVARERSPWKPYLDVLPSSYSCPVYW-ESEIISLLP 152

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF---KWSFGILFSRLVR 252
           A  +R++A+E+ T V   + +     F     LF   + ++ T+   +W++  + +R V 
Sbjct: 153 AP-LRQKALEQQTEVKELHTE-SWSFFVSLQPLFGGNITDIYTYGALRWAWCTVNTRTVY 210

Query: 253 L--PSMDGR------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +  P   G        A+ P+ D+LNHS  V+    +++  +     T+   +  +Q FI
Sbjct: 211 MKHPRRHGLSAQQDVYAMAPYLDLLNHSPAVQVEAAFNEERRCYEIRTNSGCRKHDQAFI 270

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV 332
            YG   N  LLL YGF+     NP  SV
Sbjct: 271 CYGPHDNQRLLLEYGFI--AANNPHRSV 296


>gi|126325439|ref|XP_001376285.1| PREDICTED: SET domain-containing protein 4-like [Monodelphis
           domestica]
          Length = 437

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 85/310 (27%), Positives = 138/310 (44%), Gaps = 29/310 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +   +     RGL+A+K+++ GE ++ +P   ++T D+       G
Sbjct: 37  LRKWLKKRKFEDHNLRPTRFSNTGRGLMAVKSLQPGELIISLPKECLLTTDTVIR-SYLG 95

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           + + +   P  PLLA   +LISE      S W  Y+  LP+  Y+ L     E+ R L  
Sbjct: 96  DYITKWMPPISPLLALCAFLISEKHAGNKSPWKPYLDVLPK-AYTCLVCLEPEVVRLL-P 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+   V   +   R   FS    LF E+V   F+   F W++  + +R V +
Sbjct: 154 RPLQMKAEEQRMQVQKLFISSR-GFFSSLQSLFTEDVKHVFHYHAFLWAWCTINTRTVYM 212

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                +         AL P+ D+LNHS  V     +++ +      T    +  E++FI 
Sbjct: 213 KHAQKQCLSAEPDVYALAPYLDLLNHSPRVWVEAAFNEETCCYEIRTTSHCKKFEELFIC 272

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSAS 360
           YG   N  LLL YGFV     NP  +V + +      L   DK   +K+  L+++G S +
Sbjct: 273 YGPHDNHRLLLEYGFVA--SNNPHSAVYIAIDSLVDHLPSVDKQMNKKISLLKEHGFSEN 330

Query: 361 ECFPIQITGW 370
             F     GW
Sbjct: 331 LTF-----GW 335


>gi|299472213|emb|CBN77183.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloropl [Ectocarpus
           siliculosus]
          Length = 460

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 146/312 (46%), Gaps = 18/312 (5%)

Query: 82  WLSDSGLPPQKMAI---QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           WL+ SG+     A+   +    GERGLVA K I  G+ +L +P SL +TA    S   A 
Sbjct: 18  WLTKSGVRLTDNAVLAGRSPLAGERGLVAAKAIETGQSVLAIPQSLGLTATGLKSSGIAQ 77

Query: 139 EVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLE 195
            V   +    +  L+A  ++ E +  + S+ + +I+ LP++      L+W  A+L    +
Sbjct: 78  YVEGFEGWTGETGLIALQVLWERAQGEGSKMAPWIAVLPKEGELEMPLFWGEADL-TLAD 136

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
           AS  R  +   + +V   +  L    F+K+P +FP + F    F+W+ G+  SR      
Sbjct: 137 ASSTRGIS-GFVADVDEDFAWLSENAFAKHPKVFPADKFGPGDFRWAVGVALSRSF---F 192

Query: 256 MDGRVALVPWADMLNHSC-----EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           +DG + L P  D  NHS      E          S+ VV    + Y+ GE+ F+SYG K 
Sbjct: 193 VDGELRLTPLVDFANHSSLRGVSEPTGGTTGLFGSKAVVLRAGKNYEEGEEFFVSYGPKG 252

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
               L   GFVP   +    + EL  S+ + DK + +K + L + GL  S  F +   G 
Sbjct: 253 AAGYLEENGFVPPV-SGSEVTCELEFSIPEDDKFFDDKEDILERAGLRTSSTFDLTAVGL 311

Query: 371 P-LELMAYAYLV 381
           P  EL+ +  L+
Sbjct: 312 PDAELVRFLRLL 323


>gi|71895277|ref|NP_001025965.1| SET domain-containing protein 4 [Gallus gallus]
 gi|53134599|emb|CAG32346.1| hypothetical protein RCJMB04_23h14 [Gallus gallus]
          Length = 439

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 143/311 (45%), Gaps = 22/311 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA 137
           L+KWL D G     +   +     RGL+  K ++ GE ++ +P   ++T  +   SC   
Sbjct: 35  LKKWLKDRGFGDSSLRPAQFWGTGRGLMTTKALQAGELVISLPEKCLVTTTTVLNSC--L 92

Query: 138 GEVLKQCSVPDWPLLAT--YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           GE + +   P  PL+A   +LI+E    + S W  Y+  LP+  YS       ++ + L 
Sbjct: 93  GEYIMKWKPPVSPLIALCPFLIAEKHAGERSLWKPYLDVLPK-TYSCPVCLEQDVVQLL- 150

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVR 252
              +R++A E+ T V   Y   +   FS    LF E    +FN    +W++  + +R + 
Sbjct: 151 PEPLRKQAQEQRTAVHELYMSSK-AFFSSLQSLFAENTATIFNYSALEWAWCTINTRTIY 209

Query: 253 LP-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           +        S++  V AL P+ D+LNHS  V+    +++ S+     T+ Q +  E+VFI
Sbjct: 210 MKHSQRECFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQSRNYEIQTNSQCKKYEEVFI 269

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASE 361
            YG   N  LLL YGFV  +  + S  V     LK     DK    KL  L+++ L  + 
Sbjct: 270 CYGPHDNQRLLLEYGFVAVDNPHSSVYVSSDTLLKYFPSLDKQKNAKLSILKEHDLLENL 329

Query: 362 CFPIQITGWPL 372
            F      W L
Sbjct: 330 TFGWDGPSWRL 340


>gi|159479580|ref|XP_001697868.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158273966|gb|EDO99751.1| rubisco large subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 475

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 81/291 (27%), Positives = 129/291 (44%), Gaps = 26/291 (8%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA-----GEVLKQCSVPDWPLLATYLIS 158
           LVA  +++ GE L+ VP       D+ W S P       G++     +  W  LA  L++
Sbjct: 71  LVASADVQPGESLIVVP-------DAAWVSVPNVAKTTVGKLASSAGLEPWLQLALVLVA 123

Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
           E      S  + Y S+LP    + L W+  E  R L  +Q+       +T    T+  L+
Sbjct: 124 ERFGSAKSELAGYASSLPEDLGTPLLWSEEE-TRALAGTQVAGTLNSYLTFFRSTFAQLQ 182

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVET 277
             +F+  P  FP  VF +  F W+   + SR    P ++G ++AL P  D+++H     T
Sbjct: 183 AGLFTANPAAFPPAVFTLPNFVWAVAAVRSR--SHPPLEGDKIALAPLVDLVSHRRAANT 240

Query: 278 FLDYDKS-----SQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDS 331
            L    S      Q  V    R  + GE + + Y   K +G +LL YG +  +  +P   
Sbjct: 241 KLSVRSSGLFGRGQVAVVEATRAIRKGEALGMDYAPGKLDGPVLLDYGVM--DTASPKPG 298

Query: 332 VELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL-ELMAYAYLV 381
             L L+L +SDK   +K + +   GL  S  + I     P  E+MA+  L+
Sbjct: 299 YSLTLTLDESDKFVDDKADIVEGAGLRPSMTYSITPDQQPGEEMMAFLRLM 349


>gi|145524453|ref|XP_001448054.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415587|emb|CAK80657.1| unnamed protein product [Paramecium tetraurelia]
          Length = 581

 Score = 92.0 bits (227), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 76/276 (27%), Positives = 134/276 (48%), Gaps = 18/276 (6%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           LE   TL +WL        K+ I+      RG+ A + I   E +LF+P S +IT +   
Sbjct: 133 LERQKTLLEWLKHGKAQFPKIKIECYSESYRGVNAKQKINAKELILFIPKSHMITLEMAK 192

Query: 133 SCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
             P A ++++       P    L+T+L+ E S   +S W  Y+  LP+  P   +++   
Sbjct: 193 ETPVAKKMIQFRLDLLSPKHSFLSTFLLQEKS-RPNSFWKPYLDILPQSYPSFPIFFNNY 251

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILF 247
           +L+ +L+ S   ++  ++++++   YND+         ++ PE   ++   F W+     
Sbjct: 252 DLE-WLQGSPFLKQINDKLSDLKKDYNDI--------CNVAPEFSQYSFYEFCWARMTAS 302

Query: 248 SRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           SR+  +     +  A VP ADMLNH     T   Y +  QG +  TD +   G+ +F SY
Sbjct: 303 SRIFGINIKGVKTDAFVPLADMLNHKRPKLTSWCYSEEKQGFIIETDEKIDRGQMIFDSY 362

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           G+K N   LL+YGFV  +  N ++ V + ++ + +D
Sbjct: 363 GRKCNSRFLLNYGFVVDD--NDANEVNVTVAAEFND 396


>gi|297820264|ref|XP_002878015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323853|gb|EFH54274.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 82/305 (26%), Positives = 139/305 (45%), Gaps = 30/305 (9%)

Query: 54  RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
           +T+ ++ ++ IPW   I   +  +TL    S  G               R L A K I  
Sbjct: 37  QTQASLDKDFIPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIHA 81

Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
           G+ +L VP ++ IT D     P+    L    V +   LA  LI E    + SRW  YIS
Sbjct: 82  GDCMLKVPFNVQITPDE--LSPDIRVSLTD-EVGNIGKLAAVLIREKKKGQKSRWVPYIS 138

Query: 174 ALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
            LP+  + +S ++W   E    +  S + +  +++   +   ++ +       YP +   
Sbjct: 139 RLPQPAEMHSTIFWGEDEFS-MIRCSAVHKETVKQKAQIEKEFSFVAQAFKQHYPMVI-- 195

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
           E   +E F +++ ++ SR          ++L+P+AD +NH     + +  D+ +Q    T
Sbjct: 196 ERPYLEDFMYAYALVGSRAWE---TSKGISLIPFADFMNHDGLSASIVLSDEDNQLSEVT 252

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
            DR Y PG++VFI YG+ SN  L+L +GF VP    N  D V++ + +   D     KL 
Sbjct: 253 ADRNYSPGDEVFIKYGEFSNATLMLDFGFTVP---YNIHDEVQIQMDVPNDDPLRDMKLG 309

Query: 351 ALRKY 355
            L+ +
Sbjct: 310 LLQTH 314


>gi|357145323|ref|XP_003573603.1| PREDICTED: SET domain-containing protein 4-like [Brachypodium
           distachyon]
          Length = 532

 Score = 91.7 bits (226), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 71/275 (25%), Positives = 130/275 (47%), Gaps = 20/275 (7%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------ 127
           +   +L KW  D G+   K+ I       RG+VA +NI  G   L +P SL+I+      
Sbjct: 148 DTEDSLLKWGEDQGVK-SKLQIAFFQGAGRGMVASENIGVGHIALEIPESLIISEELLCQ 206

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
           +D   +  +   +  +  +  W +   +         SS +  +   LP    + L +  
Sbjct: 207 SDMFLALKDLNSITTETMLLLWSMRERH-------NPSSNFKMFFETLPSNFNTGLNFGI 259

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
             L   LE + + +  ++   ++   Y++L   + +K+P++F ++++  + F W+  + +
Sbjct: 260 GAL-AALEGTLLFDELMQARQHLHQQYDELFPMLCTKFPEIFTQDIYTWDNFLWACELWY 318

Query: 248 SR--LVRLPSMDGRVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           S   +V L S      L+P A +LNHS    +  +   D++++ + F   R  + G+Q F
Sbjct: 319 SNSMMVVLSSGKLTTCLIPVAGLLNHSVYPHILNYGRVDQATKSLKFPLSRPCKAGQQCF 378

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           +SYGK S   L+  YGF+PRE  NP D V L L +
Sbjct: 379 LSYGKHSGSHLITFYGFLPRE-DNPYDVVPLDLDM 412


>gi|444909511|ref|ZP_21229702.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
 gi|444720460|gb|ELW61244.1| hypothetical protein D187_00317 [Cystobacter fuscus DSM 2262]
          Length = 445

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 168/379 (44%), Gaps = 24/379 (6%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           S +  S+L +W+   G    KM I +   GER ++A  +I +GE +L +P + + T + +
Sbjct: 8   SEQKLSSLLRWMEQGGALFPKMHIVRQADGERSVLARTDIAEGEVVLQIPTTHLFTLE-R 66

Query: 132 WSCPEAGEVLKQCSVPD--WPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
               + G  ++    PD  +  LA++L+ E      S W  ++ +LP   P+  L+++  
Sbjct: 67  AKASDIGRRIQSQLQPDNDFLYLASWLLEEKHRGADSFWKPFVDSLPEAYPHVPLFYSEQ 126

Query: 189 ELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
           E  R ++ SQ+ ER +E +  +    Y  LR     K P+    E F  E + W+   L+
Sbjct: 127 ERAR-MKGSQL-ERLVEVQRQSFEQEYAQLR----EKLPEY---ERFGFEEYVWARISLY 177

Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
           SRL  L       +LVP +DM NH    +      +  Q       R    G ++   YG
Sbjct: 178 SRLFSLKGGLQGPSLVPLSDMFNHRQPPDVLWSTSEDGQTFRMIAQRAVPAGTEIHTHYG 237

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL-SASECFPIQ 366
            KS+   LL  GFVP +G   +D V L + L   D     K +    +GL SA+   P +
Sbjct: 238 AKSSDVFLLHSGFVP-DGNEENDEVYLSVGLPPGDPLASVKQQM---FGLASATAKHPFK 293

Query: 367 ITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILD 423
           ++     L +++  V S   M     +   A SN++ S      P     +E+ L  +  
Sbjct: 294 VSRQGKYLASWS--VFSFLRMAHASPDEFLALSNRLLSGTKTIAPVSVACEERVLGTLAA 351

Query: 424 SCESSISKYSRFLQVKELL 442
           +CE  +  +   L+  E L
Sbjct: 352 ACEERLKAFPTTLEEDERL 370


>gi|298715435|emb|CBJ28046.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 719

 Score = 91.3 bits (225), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 84/293 (28%), Positives = 140/293 (47%), Gaps = 23/293 (7%)

Query: 97  KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV--LKQCSVPDWPLLAT 154
           + + G RG VA ++I  G+ ++ +P +L+++     + P+ G V  L    +     LA 
Sbjct: 43  ETESGVRGAVARRDIAPGDHMVIIPHALMMSEFHAKADPKYGHVHRLNTRLLGSDNGLAL 102

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
           Y++ E   E+ S +  Y+  LP  P +L  W R  L   L+  ++  R   R   ++  Y
Sbjct: 103 YIMQEILKEERSFYWPYLRMLP-TPCNLRNWNRESL-LLLQDHKLVRRTAARSRQLLALY 160

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLPSMDGRVALVPWADMLNHS 272
            +    + S YP+L+  + +  E F +++  + +R    RL S     ALVP+AD LNH 
Sbjct: 161 RETIEFLSSSYPELYTADRYTFELFDFAWRTIQARAFGKRLKSS----ALVPFADCLNHG 216

Query: 273 CEVETFLDYDKSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
             V+T  D+D    G      + + +Y    +V  SYG+++N  LLL YGF   +  N  
Sbjct: 217 -NVQTKYDFDVGGNGTFRLFPSGNNRYPRNSEVLNSYGRRANDNLLLDYGFAMLD--NEW 273

Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG-----WPLELMAY 377
           D+ E+  SL  S    +  L+  RK  L AS    ++I       WP EL+ +
Sbjct: 274 DAAEVICSLPPSHD--QSPLDRRRKACLRASGQHTVRILRVRRDVWPEELLRF 324


>gi|113930683|ref|NP_001039027.1| SET domain-containing protein 4 [Danio rerio]
 gi|66911144|gb|AAH96876.1| SET domain containing 4 [Danio rerio]
          Length = 440

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 174/382 (45%), Gaps = 35/382 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L++WL++ G   Q +         RGL+A + I+    ++ +P   ++T  +      A 
Sbjct: 37  LRRWLNERGFTSQSLIPVNFHDTGRGLMATQTIKAKNSVISLPEECLLTTSTVLKSYMA- 95

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
           + +K+   P  PLLA   +LISE    ++S W+ YI  LP+     LY+    ++   R 
Sbjct: 96  DYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDNVIELLPRS 155

Query: 194 LE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           L+  A+Q +E+  E  ++    ++ L+  +F++ P    EE+F+ +  +W++  + +R V
Sbjct: 156 LQKKATQQKEQFQELFSSSQTFFHSLQ-PLFNQ-P---TEELFSQDALRWAWCSVNTRTV 210

Query: 252 RLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +     +         AL P+ D+LNH   V+    ++K ++     +    +  +Q F
Sbjct: 211 YMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAF 270

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
           I+YG   N  LLL YGFV     NP   V      L + L + DK  KEKL  L+     
Sbjct: 271 INYGPHDNHRLLLEYGFVA--PCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFL 328

Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
            +  F +    W   LM    L+   P    +++ +   A+  ++  ++  C    E AL
Sbjct: 329 RNLTFGMDGPSW--RLMTALRLLSLKPQQYTRWKSVLLGAA--VSQDREDWCI---ESAL 381

Query: 419 QFILDSCESSISKYSRFLQVKE 440
           +   +  E ++    R  Q+KE
Sbjct: 382 KLCNNLTEDNVKALERLAQLKE 403


>gi|66825817|ref|XP_646263.1| SET domain-containing protein [Dictyostelium discoideum AX4]
 gi|60474297|gb|EAL72234.1| SET domain-containing protein [Dictyostelium discoideum AX4]
          Length = 567

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 90/386 (23%), Positives = 167/386 (43%), Gaps = 33/386 (8%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGER---GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           +WL   G    K  + K+D       GLVA ++I++GE  + +P +L IT    +     
Sbjct: 75  EWLKGKGFDESKCKV-KIDRNTSEGTGLVATQDIKEGEDFVEIPSNLFITTAVAFQGLGK 133

Query: 138 GEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             +L+      S+P   LL+ +L+ E S   +S W  YI  LP+Q  ++ YW   E  ++
Sbjct: 134 PPILENDRLIQSIPGI-LLSIFLVKELS-NPTSEWGPYIKLLPKQYNTVYYWGLKEFTQF 191

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
             +  + E A+  +   +  Y  L   I     ++ P   F  + F W+   + SR   +
Sbjct: 192 RGSPNL-EYAMRYVRGAMRQYCYLYSMIDRTQSNIMPISSFTWDAFVWAISTVQSRQNPV 250

Query: 254 PSMDGR---VALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            + +G    +AL+P+ D  NHS    ++ +F  Y   S  +     + ++ GEQV++ YG
Sbjct: 251 YAGNGNGSIMALIPFWDFCNHSSTGSKITSF--YHMDSNCMTSGAIKDFKKGEQVYMFYG 308

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQ- 366
            + N +LL+  GF  +   + S   EL L L+ + +   +K+  L + G+       +  
Sbjct: 309 PRDNTQLLMHAGFATKTNLHDSYPFELHL-LEGNHEIRHDKVHLLEERGIRDGVVVNLNQ 367

Query: 367 ---ITGWPLELMAYAYL---------VVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEID 414
                  PLEL+ +  +          ++PP + G+                 I   E +
Sbjct: 368 NPTSNELPLELIPFYRIYALSEQETRAIAPPQVPGEHNHHHGHQLELKPLAFKIITQENE 427

Query: 415 EQALQFILDSCESSISKYSRFLQVKE 440
           E+A   ++ + +  ++ Y   L+  E
Sbjct: 428 EKAYSNLVQALKGKLASYPTTLEEDE 453


>gi|443733230|gb|ELU17670.1| hypothetical protein CAPTEDRAFT_97123, partial [Capitella teleta]
          Length = 199

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 21/200 (10%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE-----ASQIRERAIER 206
           L  +L+ E +   SS W  Y+  LP     +L+WT  E+D   +     A  +R +A E 
Sbjct: 1   LVIFLLCERNKGCSSFWKPYVDILPSSYTDILHWTSKEMDLLPKFTKRRACDLRLKAEES 60

Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL---------PSMD 257
              +   +  L +R   ++        F  + FKW++  + +R V +         P  +
Sbjct: 61  FNRLCNGFLPLLVRQMPQF-----NGAFTWDLFKWAWSSVNTRCVYMSQPQNSVLSPDEE 115

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
            + AL P+ D+LNH+ +VE    +D SS+    TT    +P +QVFI+YG  SN +LLL 
Sbjct: 116 DKSALAPFLDLLNHTVDVEVNARFDDSSKSYKITTLTACKPYDQVFINYGPHSNEKLLLE 175

Query: 318 YGFVPREGTNPSDSVELPLS 337
           YGF      NP +++ L LS
Sbjct: 176 YGFT--LPCNPHNNISLTLS 193


>gi|33468718|emb|CAE30375.1| SI:dZ63M10.4 (novel protein similar to human chromosome 21 open
           reading frame 18 (C21orf18)) [Danio rerio]
          Length = 440

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 173/382 (45%), Gaps = 35/382 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L++WL++ G   Q +         RGL++ + I+    L+ +P   ++T  +      A 
Sbjct: 37  LRRWLNERGFTSQSLIPVNFHGNGRGLMSTQTIKAKNSLISLPEECLLTTSTVLKSYMA- 95

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
           + +K+   P  PLLA   +LISE    ++S W+ YI  LP+     LY+    ++   R 
Sbjct: 96  DYIKRWHPPISPLLALCCFLISERHHGEASEWNPYIDILPKTYTCPLYFPDNVIELLPRS 155

Query: 194 LE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           L+  A+Q +E+  E  ++    ++ L+  +F++ P    EE+F+ +  +W++  + +R V
Sbjct: 156 LQKKATQQKEQFQELFSSSQTFFHSLQ-PLFNQ-P---TEELFSQDALRWAWCSVNTRTV 210

Query: 252 RLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +     +         AL P+ D+LNH   V+    ++K ++     +    +  +Q F
Sbjct: 211 YMEHDQSKYLSREKDVYALAPYLDLLNHCPNVQVEAGFNKETRCYEIRSVNGCKKFQQAF 270

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGLS 358
           I+YG   N  LLL YGFV     NP   V      L + L + DK  KEKL  L+     
Sbjct: 271 INYGPHDNHRLLLEYGFVA--PCNPHSVVYVDLETLKVGLDEKDKQLKEKLLYLKDNDFL 328

Query: 359 ASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQAL 418
            +  F +    W   LM    L+   P     ++ +   A+  ++  ++  C    E AL
Sbjct: 329 RNLTFGMDGPSW--RLMTALRLLSLKPQQYTSWKSVLLGAA--VSQDREDWCI---ESAL 381

Query: 419 QFILDSCESSISKYSRFLQVKE 440
           +   +  E ++    R  Q+KE
Sbjct: 382 KLCNNLTEDNVKALERLAQLKE 403


>gi|297735395|emb|CBI17835.3| unnamed protein product [Vitis vinifera]
          Length = 583

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 140/303 (46%), Gaps = 20/303 (6%)

Query: 46  VSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGL 105
           +    D +R +T + +++        S+E    + +W   + +   K+ I  V+   RG 
Sbjct: 159 IQEVGDKNRLETRIVEDL--------SIEKEECILQWGERNDVR-TKLKIAYVEGAGRGA 209

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS 165
           +A ++++ G+  L +P S+VI+ +          + K   +    +L  + + E     +
Sbjct: 210 IATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKEKH-NSN 268

Query: 166 SRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           S+++ Y +ALP    + L +   E D    L  + + E  IE   ++   Y +L   +  
Sbjct: 269 SKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 325

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFL 279
            +PD+FP E +  E F W+  + +S  +++   DG  R  L+P A  LNHS    +  + 
Sbjct: 326 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHYG 385

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
             D  +  + F   +    GEQ ++SYG  S+  L+  YGF+P +G N  D++ L +   
Sbjct: 386 KVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDNP 444

Query: 340 KSD 342
           + D
Sbjct: 445 QGD 447


>gi|330797452|ref|XP_003286774.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
 gi|325083217|gb|EGC36675.1| hypothetical protein DICPUDRAFT_54488 [Dictyostelium purpureum]
          Length = 1335

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 132/289 (45%), Gaps = 9/289 (3%)

Query: 37   GHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQ 96
             H  ++  SV   ++     T  T   +P+    D +      + WL   G+   K+ I 
Sbjct: 766  SHLSIIEQSVILLSNLKNQFTKPTIKSVPFIKPTDEI--YRRFENWLKQGGVQFPKLQIA 823

Query: 97   K-VDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
               D   RG+V  K + + E ++ VP   +I  D   S P  G + ++  + D  +L  +
Sbjct: 824  NFTDSTGRGVVTTKKVDEDEVVVSVPRKYLINVDVAKSNPILGPIFEELHLNDETILFLF 883

Query: 156  LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
            +I E     ++ W  +   LP    + ++++  EL   LE + +    +     +    +
Sbjct: 884  VIYEKE-NPNTFWRPFYDTLPSYFTTSIHYSSTELLE-LEGTNLFAETLAVKQQLQAFRD 941

Query: 216  DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSC 273
             L   + ++YPD+FPE VF+ E F W+  +L SR ++L  +DG++   LVP ADM+NH  
Sbjct: 942  YLFPELSNQYPDIFPESVFSWENFLWARSLLDSRAIQL-KIDGKIKSCLVPMADMINHHT 1000

Query: 274  EVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
              + +   +D+ S      +        Q+F+ YG   N +L L YGFV
Sbjct: 1001 NAQISERHFDQDSNCFRMVSSCNIPANNQIFLHYGALQNSDLALYYGFV 1049


>gi|145549620|ref|XP_001460489.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428319|emb|CAK93092.1| unnamed protein product [Paramecium tetraurelia]
          Length = 482

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 128/279 (45%), Gaps = 17/279 (6%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
            L +WL D      K+ I+    G R L A + IR+GE +LF+P +  ++ +  K SC  
Sbjct: 43  NLIEWLKDGKAEISKVQIEVQSEGHRTLRATQFIRQGEWVLFIPRTQYLSLEEVKKSCLI 102

Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             ++++    P+   + TY ++   + +  K S W  YI  LP+       +  AE D  
Sbjct: 103 NRKMIQINYKPN--NIQTYFVNHLLQENRRKYSFWKPYIDVLPKDVSGFPTYFDAEQDAL 160

Query: 194 LEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGILFSRL 250
           L+ S      I +       Y +L+  ++ F KY   + + + F + T   SF       
Sbjct: 161 LKGSPTLFTVINQRKVFKEEYENLKEAVKEFQKYGYTYDDFIKFRILTISRSFT------ 214

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           V++   + +  LVP AD +NH         Y K + G      R  Q GE++F +YG+ S
Sbjct: 215 VQIGEKEQQQLLVPLADFINHDNNGFLKYGYSKDADGFFMQAVRNIQKGEELFYNYGQWS 274

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
           N    ++YGF     TNP +  +L + L K+D+ +  K+
Sbjct: 275 NKYFFMNYGFASL--TNPMNQFDLDICLNKNDRLFNLKI 311


>gi|328772335|gb|EGF82373.1| hypothetical protein BATDEDRAFT_86177 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 966

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/307 (27%), Positives = 145/307 (47%), Gaps = 24/307 (7%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVD----VGERGLVALKNIRKGEKLLFVPPSL 124
           ++D L +  +  +WL  +G+    ++I+KVD    VG  G+ + + I KGE L+ +P  L
Sbjct: 551 KLDQLASLESFTQWLHANGINTDGISIKKVDDSKDVG-LGIFSTRQIHKGECLVKIPLKL 609

Query: 125 VITADSKWSCPEAGEVLKQCSV--PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
           +++ D+  + P    ++K   +   D  ++    + +      S W  Y   LPR     
Sbjct: 610 ILSNDTS-AMPALNSIVKSNVLLKTDPSVILVIRLLQEYINPMSLWQPYFDLLPRVFTIP 668

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKW 241
           +  +  +L  Y   S I E  +  +  ++  Y  L+  IF   P+   P   F    F W
Sbjct: 669 VLGSAQDLAAYTGTSIIDE-VVHDMIALMRQYLYLQ-HIFKSIPEPPIPLADFTFAAFSW 726

Query: 242 SFGILFSRLVRL----PS---MDGRVALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTD 293
           +  I+ +R   +    PS   M   + L+P  DM NH      T  D  +     + + D
Sbjct: 727 ARAIVSTRQNEICYANPSTSEMQQFLCLIPLFDMFNHKPGNSTTQFDTKEYCSETIASCD 786

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEAL 352
               PGEQ+FI YGK+SN E+LL  GFV  + TN   D ++L +S+ +SD    ++++ L
Sbjct: 787 --VSPGEQIFIHYGKRSNQEMLLYSGFV--DPTNIEYDHIKLSVSIPQSDPIRNQRVQLL 842

Query: 353 RKYGLSA 359
           + + LS+
Sbjct: 843 KLFNLSS 849


>gi|225446052|ref|XP_002268920.1| PREDICTED: uncharacterized protein LOC100256524 [Vitis vinifera]
          Length = 566

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 73/303 (24%), Positives = 140/303 (46%), Gaps = 20/303 (6%)

Query: 46  VSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGL 105
           +    D +R +T + +++        S+E    + +W   + +   K+ I  V+   RG 
Sbjct: 142 IQEVGDKNRLETRIVEDL--------SIEKEECILQWGERNDVR-TKLKIAYVEGAGRGA 192

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS 165
           +A ++++ G+  L +P S+VI+ +          + K   +    +L  + + E     +
Sbjct: 193 IATEDLKVGDVALEIPMSIVISEELVHESDMFPILEKIDGISSETMLLLWSMKE-KHNSN 251

Query: 166 SRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           S+++ Y +ALP    + L +   E D    L  + + E  IE   ++   Y +L   +  
Sbjct: 252 SKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALCK 308

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFL 279
            +PD+FP E +  E F W+  + +S  +++   DG  R  L+P A  LNHS    +  + 
Sbjct: 309 DHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHYG 368

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
             D  +  + F   +    GEQ ++SYG  S+  L+  YGF+P +G N  D++ L +   
Sbjct: 369 KVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDNP 427

Query: 340 KSD 342
           + D
Sbjct: 428 QGD 430


>gi|325186836|emb|CCA21381.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 473

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 71/231 (30%), Positives = 111/231 (48%), Gaps = 32/231 (13%)

Query: 151 LLATYLISEA-SFEKSSRWSNYISALPRQPYSLLYWTRAEL------DRYLEASQIRERA 203
           LLA  L+ E    +  S+W++++  LP++  +LLY++  E+      + Y  A +++ER 
Sbjct: 118 LLAIILLFEMYVLQSESKWAHHLEILPKEHRNLLYYSSDEVKALDGTNLYYVAHEMQERL 177

Query: 204 IERI----TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL------ 253
            E      T V+     +   I S  P +    VF+   +KW+  I++SR V +      
Sbjct: 178 HEDYEFIETRVLPELKHILKHILS--PSVSATTVFSFANYKWALSIIWSRFVSIEIDQEL 235

Query: 254 ---------PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
                    P+    V A+VP  DMLNH  + E    YD +S     TT +    G Q+ 
Sbjct: 236 VSTLPFTIDPTKKHCVKAMVPVFDMLNHDPKAEMTHKYDAASGMFQLTTHQHLAAGTQLH 295

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC-YKEKLEALR 353
           I+YG  SN  LL  YGF+     NP D+VE+ L ++  +   Y+EK E LR
Sbjct: 296 INYGPLSNHALLALYGFM--HSHNPHDTVEVHLQMESDNTSFYEEKEEFLR 344


>gi|145516108|ref|XP_001443948.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411348|emb|CAK76551.1| unnamed protein product [Paramecium tetraurelia]
          Length = 572

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/283 (27%), Positives = 137/283 (48%), Gaps = 20/283 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L +WL        K+ I+      RG+ A + I   E +LF+P S +IT +       A 
Sbjct: 139 LLEWLKIGKAIFPKIKIECYSEDYRGVNAKQTINAKELILFIPKSHMITLEMAKETTVAK 198

Query: 139 EVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYL 194
           ++++       P    L+T+L+ E  F  +S W  YI  LP   P   +++  ++L+ +L
Sbjct: 199 KMMQFRLDLLSPKHSFLSTFLLQE-KFRPNSFWKPYIDILPSSYPSFPIFYNNSDLE-WL 256

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVRL 253
           + S   ++  +++ ++   YND+         ++ PE   +    F W+     SR+  +
Sbjct: 257 KGSPFLKQIKDKLADLQKDYNDI--------CNVVPEFTQYQFHEFCWARMTASSRIFGI 308

Query: 254 PSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
            +++G    A VP ADMLNH     T   Y    QG +  TD + + G+ +F SYG+K N
Sbjct: 309 -NINGVKTDAFVPLADMLNHKRPKLTSWCYSDEKQGFIIETDEKIERGQMIFDSYGRKCN 367

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
               L+YGFV  EG N ++ V L +   ++D   + K +A+++
Sbjct: 368 SRFFLNYGFVV-EG-NDANEVNLAVEADQNDPLLQLKEQAIKE 408


>gi|449464220|ref|XP_004149827.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Cucumis
           sativus]
          Length = 499

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 72/233 (30%), Positives = 116/233 (49%), Gaps = 19/233 (8%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
           ++I K  +G R L A + IR G+ +L VP ++ I+ DS    P     L    + +   L
Sbjct: 67  LSIGKSSIG-RFLFASETIRAGDCILKVPFNVQISPDS---LPLPIRDLLGNEIGNVAKL 122

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
           A  ++ E      S W+ YI  LP QP+ +   ++W  +EL+  +  S + E ++ + + 
Sbjct: 123 AVVVLLEHKLGLGSEWAPYIIRLP-QPWEMHNTIFWKESELE-MIRKSSLYEESLNQRSQ 180

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
           +   +  +R     K  + FPE +   + + F  ++ ++ SR  R  S +G V+L+P+AD
Sbjct: 181 IKREFLAIR-----KALEAFPEIIDRISCDDFMHAYALVTSRAWR--STEG-VSLIPFAD 232

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
            LNH    E  L  D   Q      DR + PGE V I YGK SN  L+L +GF
Sbjct: 233 FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEHVLIRYGKYSNATLMLDFGF 285


>gi|302818853|ref|XP_002991099.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
 gi|300141193|gb|EFJ07907.1| hypothetical protein SELMODRAFT_429412 [Selaginella moellendorffii]
          Length = 428

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 38/322 (11%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
           D G RGL   +N+ +GE +L VP + +I        PE G+VL     +    +L  YL+
Sbjct: 36  DQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILTAYLL 95

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYN 215
           SE +  +SSRW +Y+   P+  ++L +++  E +  +  +A  + + + E         +
Sbjct: 96  SEVAKSRSSRWFSYLRHNPQVHHNLPHFSAMEAEELQVEDAISMAKSSFEDTQRQWRETS 155

Query: 216 DL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
            L  RLR+        P +    + + W+   + SR + +P  D  V L P  D+ N+  
Sbjct: 156 SLLSRLRL--------PRKFTTFKAWLWAAATISSRTLHVPWDDAGV-LCPIGDLFNYDA 206

Query: 274 EVETFLD------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
            +E  L                   Y+ S     F   R Y+ G+Q  I YG+ +N ELL
Sbjct: 207 PIERTLSSRNEDDEHKFTSRLTDGGYETSISSYCFYARRSYKNGQQALICYGQYTNLELL 266

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LEL 374
             YGF+  +  NP D + +PL   +         ++ R++ L+A   + I+ +G P   L
Sbjct: 267 EHYGFLLPD--NPCDVIYIPLPSSEEFGLKSTGDKSERQHNLAA---YCIEASGKPSFSL 321

Query: 375 MAYAYLVVSPPSMKGKFEEMAA 396
           +    L   P S++     MA+
Sbjct: 322 LQQLRLRAVPASLRKSHGYMAS 343


>gi|145528147|ref|XP_001449873.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124417462|emb|CAK82476.1| unnamed protein product [Paramecium tetraurelia]
          Length = 605

 Score = 89.0 bits (219), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 81/276 (29%), Positives = 130/276 (47%), Gaps = 22/276 (7%)

Query: 53  SRTKTTVTQNMIPWGCEIDSLENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNI 111
           ++ KT   Q+ I  G     L+    L +WL S   L P K+ I+      RG+ A K I
Sbjct: 142 NQYKTQALQSSIDSG----ELDKQKRLLEWLKSGQALFP-KIKIECYAEDYRGVNARKAI 196

Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLK---QCSVPDWPLLATYLISEASFEKSSRW 168
              E +LFVP S +IT +     P A ++++       P    L+T+L+ E   +  S W
Sbjct: 197 SSKEVILFVPRSHMITLEMAKDTPVAKKIIQYRLDLLSPKHSFLSTFLLQEKKIQ-DSFW 255

Query: 169 SNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
             Y+  LP+   +  +++  ++L+ +L+ S   ++  ++IT++   Y D+          
Sbjct: 256 KPYLDVLPKSYSNFPIFFNDSDLE-WLKGSPFLKQVKDKITDLKKDYCDI--------CQ 306

Query: 228 LFPEEVFN-METFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSS 285
           + PE + N  + F W+     SR+  +     +  A VP ADMLNH     T   Y    
Sbjct: 307 VAPEFLQNSFDEFCWARMTASSRIFGINIKGVKTDAFVPLADMLNHKRPKLTSWCYSDER 366

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           QG +  TD   + G+ +F SYG K N   LL+YGFV
Sbjct: 367 QGFIIETDENIEKGQMIFDSYGSKCNSRFLLNYGFV 402


>gi|8778402|gb|AAF79410.1|AC068197_20 F16A14.25 [Arabidopsis thaliana]
          Length = 474

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 100/386 (25%), Positives = 174/386 (45%), Gaps = 42/386 (10%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKW 132
           EN     KWL D G+   K   +   V E  GLVA ++I + E +L +P  L       W
Sbjct: 47  ENVRNFWKWLRDQGVVSGKSVAEPAVVPEGLGLVARRDIGRNEVVLEIPKRL-------W 99

Query: 133 SCPE---AGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
             PE   A ++   C  +  W  +A +LI E  +E+ S W  Y+  LP+   S ++W+  
Sbjct: 100 INPETVTASKIGPLCGGLKPWVSVALFLIRE-KYEEESSWRVYLDMLPQSTDSTVFWSEE 158

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           EL   L+ +Q+    +     V   +  L   I     DLF   +  ++ F W+FGIL  
Sbjct: 159 ELAE-LKGTQLLSTTLGVKEYVENEFLKLEQEILLPNKDLFSSRI-TLDDFIWAFGIL-- 214

Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV-VFTTDRQY--------QPG 299
                 + +   ++  + + +NH+  ++T  DY    +G  +F+ D  +        + G
Sbjct: 215 ------NRESLTSMFEF-EQINHNPAIKT-EDYAYEIKGAGLFSRDLLFSLKSPVYVKAG 266

Query: 300 EQVFISYG-KKSNGELLLSYGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGL 357
           EQV+I Y   KSN EL L YGFV    +NP  +S  L + + +SD  + +KL+      +
Sbjct: 267 EQVYIQYDLNKSNAELALDYGFVE---SNPKRNSYTLTIEIPESDPFFGDKLDIAESNKM 323

Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
             +  F I + G  L      YL +        F  + +  +N +    ++     +E+ 
Sbjct: 324 GETGYFDI-VDGQTLPAGMLQYLRLVALGGPDAF-LLESIFNNTIWGHLELPVSRTNEEL 381

Query: 418 L-QFILDSCESSISKYSRFLQVKELL 442
           + + + D+C+S++S +   ++  E L
Sbjct: 382 ICRVVRDACKSALSGFDTTIEEDEKL 407


>gi|320170264|gb|EFW47163.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 938

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 86/305 (28%), Positives = 138/305 (45%), Gaps = 38/305 (12%)

Query: 98  VDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPEAGEVLKQCSVPDWP 150
           VDV +     R L+A   ++  +++  +P  L I+ D+  +   P A   +   S     
Sbjct: 55  VDVSQDWHQGRRLIADNPLKPDDRIAAIPTLLTISLDTALQVGLPRAFTTIWHESGSQDD 114

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           LLA +L+ E +    S W+ YI  LP++  +LL++   EL + L+  Q+ E+  ++ + +
Sbjct: 115 LLALFLLREKALGARSAWAPYIEILPKKLSNLLFFNDGELAQ-LQNEQLVEQVSQQKSEL 173

Query: 211 IGTYNDLRLR---IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
            G +  LR     IF    +L       +  F W+  I+ SR     ++  R  L+P+AD
Sbjct: 174 QGRFLALRQHEADIFGGKAELV------LSDFLWARAIVLSRAF---TIHARRYLIPFAD 224

Query: 268 MLNHSCEVETFLD---------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
           +LNH       LD         +D  +   + T DR     E+V   YG  SN + L  Y
Sbjct: 225 LLNHRFHPTRGLDESGEFFYRHHDFQNGMFLLTCDRPVNENEEVEDDYGNLSNAQFLQLY 284

Query: 319 GFVPREGTNPSDSVELPLS--LKKSDKCYKEKLEALRKYGLSASECF----PIQITGWPL 372
           GFVP   +NP + VE+ L+  L    +    K E   K G+    C     P  +TG  L
Sbjct: 285 GFVPE--SNPHECVEINLADLLHGEREALLLKSEYAFKLGIPHIVCIGATRPPSVTG-AL 341

Query: 373 ELMAY 377
           E +AY
Sbjct: 342 EAIAY 346


>gi|345326326|ref|XP_001512617.2| PREDICTED: SET domain-containing protein 4-like [Ornithorhynchus
           anatinus]
          Length = 499

 Score = 88.6 bits (218), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 135/297 (45%), Gaps = 24/297 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +   +     RGL+A K+++ GE ++ +P + ++T D+    P  G
Sbjct: 36  LKKWLKGRRFDGSNLRPARFPDTGRGLMATKSLKAGEMIISLPEACLLTTDTVLKSP-LG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           + + +   P  PLLA  T+LI+E      S W  Y+  LP Q Y+      A +   L  
Sbjct: 95  DYIWKWKPPVSPLLALCTFLIAEKQAGARSLWQPYLGVLP-QAYTCPVGLDAAVLSLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             +  RA E+ T V   +   R   FS    LF E+V   F ++   W++  + +R V +
Sbjct: 153 QPLGRRAREQRTAVRELFAASRA-FFSSLQPLFSEDVERVFTLDALGWAWCTVNTRTVYM 211

Query: 254 P-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                   S +  + AL P+ D+LNHS   +    ++K ++     T  + +  E+V I 
Sbjct: 212 EHAQRDCFSAEADIYALAPYLDLLNHSPGAQVEAAFNKETRCYEIRTASRCRKYEEVLIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSV-----ELPLSLKKSDKCYKEKLEALRKYGL 357
           YG   N  LLL YGFV     NP  +V      L   L   DK   +KL  L+++G 
Sbjct: 272 YGPHDNRRLLLEYGFVC--SNNPHSNVVVSPDVLVRHLPSGDKQMTKKLSLLKEHGF 326


>gi|148686777|gb|EDL18724.1| mCG18357, isoform CRA_b [Mus musculus]
          Length = 466

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 113/237 (47%), Gaps = 12/237 (5%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA +L+ E +   +S W  YI  LP +  + LY+   E+ R L+++Q       +  N  
Sbjct: 29  LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 86

Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
             Y     ++   +P    L  +E F  E ++W+   + +R  ++P+ DG    +AL+P 
Sbjct: 87  RQYAYF-YKVIQTHPHANKLPLKESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 145

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            DM NH+  + T   Y+           + +Q G+Q++I YG +SN E ++  GF     
Sbjct: 146 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 202

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P+     A+L V
Sbjct: 203 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSTEPPISAQLLAFLRV 259


>gi|58177849|gb|AAH89108.1| Setd3 protein [Rattus norvegicus]
          Length = 450

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 113/237 (47%), Gaps = 12/237 (5%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA +L+ E +   +S W  YI  LP +  + LY+   E+ R L+++Q       +  N  
Sbjct: 11  LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 68

Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
             Y     ++   +P    L  ++ F  E ++W+   + +R  ++P+ DG    +AL+P 
Sbjct: 69  RQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 127

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            DM NH+  + T   Y+           + +Q G+Q++I YG +SN E ++  GF     
Sbjct: 128 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 184

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P+     A+L V
Sbjct: 185 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLAFLRV 241


>gi|224098926|ref|XP_002311320.1| SET domain-containing protein [Populus trichocarpa]
 gi|222851140|gb|EEE88687.1| SET domain-containing protein [Populus trichocarpa]
          Length = 490

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 139/323 (43%), Gaps = 54/323 (16%)

Query: 76  ASTLQKWLSDS-----------GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
             T  +WLSD            GL PQ +          GLVA ++I + E +L +P  L
Sbjct: 50  VQTFWQWLSDQDVVSAKTPARPGLVPQGL----------GLVAQRDISRNEVVLEIPKKL 99

Query: 125 VITADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
            I  D       A E+   C  V  W  +A +LI E   ++ S W  Y+  LP    S +
Sbjct: 100 WINPD----VVAASEIGNVCGGVKPWVSVALFLIRE-KLKEDSTWRPYLDVLPESTNSTI 154

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           +W+  EL   L+ +Q+    +   + +   +  +   I   +  LFP  V  ++ F W+F
Sbjct: 155 FWSEEELAE-LQGTQLLSTTLGVKSYLRREFLKVEEEILVPHKQLFPSPV-TLDDFSWAF 212

Query: 244 GILFSR---------LVRLPSMDGRVALVPW-ADMLNHSCEVETFLD--YDKSSQGVVFT 291
           GIL SR         LV +P  D    L  W  D +NHS ++ T  D  Y+    G +F+
Sbjct: 213 GILRSRSFSRLRGQNLVLIPLADLCNFLHTWLLDQVNHSPDI-TIEDGVYEIKGAG-LFS 270

Query: 292 TDRQY--------QPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
            D  +        + GEQV I Y    SN EL + YGF+  E  +  +   L L + +SD
Sbjct: 271 RDLIFSLRSPISLKAGEQVLIQYNLNLSNAELAVDYGFI--EAKSDRNMYTLTLQISESD 328

Query: 343 KCYKEKLEALRKYGLSASECFPI 365
             + +KL+     GL     F I
Sbjct: 329 PFFGDKLDIAETNGLGEIADFDI 351


>gi|17865444|sp|P58467.1|SETD4_MOUSE RecName: Full=SET domain-containing protein 4
 gi|17061796|gb|AAK68849.1| C21orf18 [Mus musculus]
          Length = 439

 Score = 88.2 bits (217), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 25/308 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL +       +         RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 35  LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             +K+   P  PLLA  T+L+SE      S W +Y+  LP+  Y+       E+   L  
Sbjct: 94  PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
           S ++ +A E+   V   +   R   FS    LF E    VF+   F W++  + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            S             AL P+ D+LNHS  V+    +++ ++     T  + +  ++VFI 
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270

Query: 306 YGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           YG   N  LLL YGFV     +   P  +  L   L  +DK    K+  L+ +G + +  
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLT 330

Query: 363 FPIQITGW 370
           F     GW
Sbjct: 331 F-----GW 333


>gi|172073177|ref|NP_663457.2| SET domain-containing protein 4 [Mus musculus]
 gi|148671824|gb|EDL03771.1| SET domain containing 4, isoform CRA_e [Mus musculus]
          Length = 439

 Score = 88.2 bits (217), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 138/308 (44%), Gaps = 25/308 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL +       +         RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 35  LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             +K+   P  PLLA  T+L+SE      S W +Y+  LP+  Y+       E+   L  
Sbjct: 94  PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
           S ++ +A E+   V   +   R   FS    LF E    VF+   F W++  + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            S             AL P+ D+LNHS  V+    +++ ++     T  + +  ++VFI 
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFIC 270

Query: 306 YGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           YG   N  LLL YGFV     +   P  +  L   L  +DK    K+  L+ +G + +  
Sbjct: 271 YGPHDNQRLLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLT 330

Query: 363 FPIQITGW 370
           F     GW
Sbjct: 331 F-----GW 333


>gi|149044197|gb|EDL97579.1| rCG27725, isoform CRA_c [Rattus norvegicus]
          Length = 468

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 66/237 (27%), Positives = 113/237 (47%), Gaps = 12/237 (5%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA +L+ E +   +S W  YI  LP +  + LY+   E+ R L+++Q       +  N  
Sbjct: 29  LAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV-RCLQSTQAIHDVFSQYKNTA 86

Query: 212 GTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPW 265
             Y     ++   +P    L  ++ F  E ++W+   + +R  ++P+ DG    +AL+P 
Sbjct: 87  RQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPL 145

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            DM NH+  + T   Y+           + +Q G+Q++I YG +SN E ++  GF     
Sbjct: 146 WDMCNHTNGLIT-TGYNLEDDRCECVALQDFQAGDQIYIFYGTRSNAEFVIHSGFFF--D 202

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P+     A+L V
Sbjct: 203 NNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPPISAQLLAFLRV 259


>gi|302755392|ref|XP_002961120.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
 gi|300172059|gb|EFJ38659.1| hypothetical protein SELMODRAFT_402746 [Selaginella moellendorffii]
          Length = 371

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 67/258 (25%), Positives = 116/258 (44%), Gaps = 26/258 (10%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A + +  G + L +P   +IT ++    P     L   S P    L+ +L+SE   
Sbjct: 6   RALFATRRVPAGSRFLEIPRIAIITPEN---VPSQVSHLLSTSNPK-TRLSLFLLSEKHK 61

Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            + S+W+ Y+  LP+     S ++W   EL  +L+ S      +E +  +   ++ L   
Sbjct: 62  AQESQWAPYLRCLPQLGDIESTMFWKDEEL-AWLKHSPTYRETMECLKIIKSEFHVLEAN 120

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +F    D+  E             +  +  +   S D     +P+AD  NH    +T L 
Sbjct: 121 VFPWCRDVLGE-------------VSLTDFMHAYSTDQ----IPFADFFNHDHNCQTRLS 163

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           YDK     V   D+ Y+ G+++F+SYG   N  L + YGF     +NP + VE+P+ +  
Sbjct: 164 YDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAV--ASNPHEQVEVPMGVSL 221

Query: 341 SDKCYKEKLEALRKYGLS 358
           +D     KL+ L ++ +S
Sbjct: 222 TDPLRDLKLQTLSRHNMS 239


>gi|348675930|gb|EGZ15748.1| hypothetical protein PHYSODRAFT_561468 [Phytophthora sojae]
          Length = 430

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 138/303 (45%), Gaps = 22/303 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L +WL   G     + I+ +   E  G+ A + +  G+  L VP  L +  +S  +   A
Sbjct: 10  LLEWLEAHGAADSLLDIRYLGKLEGHGVFAKRALTSGQVTLQVPFKLTMNTESAATSDLA 69

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD-----R 192
             + K   +PD  +LA +L+ E S    S ++ +I+++P      ++WT AEL+      
Sbjct: 70  PVLEKYPQIPDDEVLALHLMHERSKGGESFFAPFIASMPTTFDLPVFWTEAELNELKGTN 129

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLV 251
            L  +Q+ ++ +ER       + ++   + + +PD+F       ++ + W+  +++SR  
Sbjct: 130 VLLLTQLMKQHLER------DFENIHQAVAADFPDIFASLPTLTIDDYMWAMSVIWSRAF 183

Query: 252 RLPSMDGRV--ALVPWADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            + S  G+    L P  DM NH   V    + F+ +++  Q +          G  V IS
Sbjct: 184 GV-SKGGKYLHVLCPAMDMFNHDVTVRKPLDDFVSFNEEKQMMTHHVPEDVAAGSAVHIS 242

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           YG+ SN +LL SYGFV  E  N    V+  + +  SD  +K K   L    L+  + +  
Sbjct: 243 YGQYSNAKLLYSYGFVSPE--NFRRGVDFWMKIPLSDPYFKLKQTVLDSNELTKEQTYDF 300

Query: 366 QIT 368
             T
Sbjct: 301 HGT 303


>gi|166091525|ref|NP_001107219.1| SET domain-containing protein 4 [Rattus norvegicus]
 gi|165971256|gb|AAI58670.1| Setd4 protein [Rattus norvegicus]
          Length = 439

 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 32/312 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L+KWL +       + +     G  RGL++  ++++G+ ++ +P S ++T D+       
Sbjct: 34  LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSV 92

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
           G  +K+   P  PLLA  T+L+SE      S W +Y+  LP+    P  L       L  
Sbjct: 93  GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 152

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
            L A    +RA  R+ ++  +  D     FS    LF E V   F+   F W++  + +R
Sbjct: 153 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 206

Query: 250 LVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
            V L S             AL P+ D+LNHS  V+    +++ ++     T  + +  ++
Sbjct: 207 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQE 266

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLS 358
            FI YG   N  LLL YGFV     +    V   + LK    +DK   +KL  L  +G +
Sbjct: 267 AFICYGPHDNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFT 326

Query: 359 ASECFPIQITGW 370
            +  F     GW
Sbjct: 327 GNLTF-----GW 333


>gi|296232125|ref|XP_002761462.1| PREDICTED: SET domain-containing protein 4 [Callithrix jacchus]
          Length = 440

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 81/296 (27%), Positives = 133/296 (44%), Gaps = 24/296 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +   +     RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKDRKFQDSNLVPARFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIQ-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPI 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV   G NP   V      L   L  +DK   +K+  L+ +G
Sbjct: 272 YGPHDNHRLFLEYGFV--SGHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|307108530|gb|EFN56770.1| hypothetical protein CHLNCDRAFT_8187, partial [Chlorella
           variabilis]
          Length = 398

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 75/268 (27%), Positives = 119/268 (44%), Gaps = 19/268 (7%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           LV  K + KGE+L  VP +  ITAD+     + G  L    +  W  +A +L+ E +   
Sbjct: 1   LVCSKAVNKGEQLFAVPEAAWITADTAQQS-QIGSHL--TGLESWLAIALFLLHERAMGN 57

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
           +SRW+ YI+ LP    S + W  A+L   L+ SQ+              ++ L+  +F  
Sbjct: 58  ASRWAPYIALLPADSGSPVQWEEADLAE-LQGSQVLGTVQGYRAYFQQRFDQLQAEVFGP 116

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH-----------SC 273
               F   VFN + F W+   + +R    P   G +ALVP ADM+               
Sbjct: 117 NSQAFDPIVFNFDAFLWAACTVRAR-AHPPLDGGNIALVPLADMVRSQPSWPPDSAGWQL 175

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSV 332
           +    L    S+Q +V         G+ + + +G +KS+G+LL+ +G +      P  S 
Sbjct: 176 KQTGGLFGAGSTQALVMEASGSMAAGDAIAMDFGPQKSDGQLLVDHGVIDPLVNQP--SY 233

Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSAS 360
            L L L K D+ Y +K + L    L+ S
Sbjct: 234 ALTLELSKEDRNYDDKADILELNELAES 261


>gi|302819975|ref|XP_002991656.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
 gi|300140505|gb|EFJ07227.1| hypothetical protein SELMODRAFT_236359 [Selaginella moellendorffii]
          Length = 428

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 38/322 (11%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
           D G RGL   +N+ +GE +L VP + +I        PE G+VL     +    +L  YL+
Sbjct: 36  DQGGRGLGVARNVEQGEMILRVPFAALIGVHCAREDPEFGKVLVDFAHLSSVQILTAYLL 95

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYN 215
           SE +   SSRW +Y+   P+  +SL +++  E +  +  +A  + + ++E         +
Sbjct: 96  SEVAKSCSSRWFSYLRHNPQVHHSLPHFSAMEAEELQVEDAISMAKSSLEDTQRQWRETS 155

Query: 216 DL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
            L  RLR+        P +    + + W+   + SR + +P  D  V L P  D+ N+  
Sbjct: 156 SLLSRLRL--------PRKFTTFKAWLWAAATISSRTLHVPWDDAGV-LCPIGDLFNYDA 206

Query: 274 EVETFLD------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
            +E  +                   Y+ S     F   R Y+ G+Q  I YG+ +N ELL
Sbjct: 207 PIERTMSSRNEDDELEFTNRLTDGGYETSISSYCFYARRSYKKGQQALICYGQYTNLELL 266

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LEL 374
             YGF+  +  NP D + +PL   +         ++ R++ L+A   + I+ +G P   L
Sbjct: 267 EHYGFLLPD--NPCDVIYIPLPSPEEFGLKSTGDKSERQHNLAA---YCIEASGKPSFSL 321

Query: 375 MAYAYLVVSPPSMKGKFEEMAA 396
           +    L   P S++     MA+
Sbjct: 322 LQQLRLRAVPASLRKSHGYMAS 343


>gi|149059902|gb|EDM10785.1| hypothetical protein RDA279, isoform CRA_e [Rattus norvegicus]
          Length = 475

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 85/312 (27%), Positives = 138/312 (44%), Gaps = 32/312 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L+KWL +       + +     G  RGL++  ++++G+ ++ +P S ++T D+       
Sbjct: 70  LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSV 128

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
           G  +K+   P  PLLA  T+L+SE      S W +Y+  LP+    P  L       L  
Sbjct: 129 GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 188

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
            L A    +RA  R+ ++  +  D     FS    LF E V   F+   F W++  + +R
Sbjct: 189 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 242

Query: 250 LVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
            V L S             AL P+ D+LNHS  V+    +++ ++     T  + +  ++
Sbjct: 243 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQE 302

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLS 358
            FI YG   N  LLL YGFV     +    V   + LK    +DK   +KL  L  +G +
Sbjct: 303 AFICYGPHDNQRLLLEYGFVAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFT 362

Query: 359 ASECFPIQITGW 370
            +  F     GW
Sbjct: 363 GNLTF-----GW 369


>gi|302804448|ref|XP_002983976.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
 gi|300148328|gb|EFJ14988.1| hypothetical protein SELMODRAFT_423083 [Selaginella moellendorffii]
          Length = 266

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/235 (30%), Positives = 110/235 (46%), Gaps = 18/235 (7%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE--- 159
           RGL A + +R GE++L +   L+I        P     +   +V  W  LA  ++ E   
Sbjct: 21  RGLFASRPVRAGERVLEISLDLMIAPSD---LPGELSTVLSSTVKPWTKLALIVLMERYK 77

Query: 160 -ASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
             +  +SS W+ YIS LP +P  L     W   EL  YL AS +  +  ER+  +   + 
Sbjct: 78  GQAKLQSSAWAPYISCLP-EPAELDNTFLWEDTELS-YLRASPLYGKTRERLEMITTEFG 135

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
            ++      +P LF +   ++E FK  +  +FSR + +   D  + ++P  D  NH+   
Sbjct: 136 QVQ-NALDVWPQLFGK--VSLEDFKHVYATVFSRSLAIGE-DSTLVMIPMLDFFNHNATS 191

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
              L ++      V T DR Y   +Q++I+YG  SN EL L YGF   E  NP D
Sbjct: 192 FAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDYGFAVPE--NPYD 244


>gi|302766942|ref|XP_002966891.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
 gi|300164882|gb|EFJ31490.1| hypothetical protein SELMODRAFT_408134 [Selaginella moellendorffii]
          Length = 374

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/262 (26%), Positives = 116/262 (44%), Gaps = 31/262 (11%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A + +  G + L +P   +IT ++    P     L   S P    L+ +L+SE   
Sbjct: 6   RALFATRRVPAGSRFLEIPRIAIITPEN---VPSQVSHLLSTSNPK-TRLSLFLLSEKHK 61

Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            + S+W+ Y+  LP+     S ++W   EL  +L+ S      +E +  +   ++ L L 
Sbjct: 62  AQESQWAPYLRCLPQLGDIESTMFWKAEEL-AWLKHSPTYRETMECLKIIKSEFHLLTLA 120

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
                         N + F W    L     +  +   S D     +P+AD  NH    +
Sbjct: 121 --------------NKQVFPWCRDALGEVSLTDFMHAYSTDQ----IPFADFFNHDHNCQ 162

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           T L YDK     V   D+ Y+ G+++F+SYG   N  L + YGF     +NP + VE+P+
Sbjct: 163 TRLSYDKEKDCAVAVADQDYKAGDEIFLSYGSTPNSILAVDYGFAV--ASNPHEQVEVPM 220

Query: 337 SLKKSDKCYKEKLEALRKYGLS 358
            +  +D     KL+ L ++ +S
Sbjct: 221 GVSLTDPLRDLKLQTLSRHNMS 242


>gi|389622275|ref|XP_003708791.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
 gi|351648320|gb|EHA56179.1| hypothetical protein MGG_14610 [Magnaporthe oryzae 70-15]
 gi|440464619|gb|ELQ34017.1| hypothetical protein OOU_Y34scaffold00823g1 [Magnaporthe oryzae
           Y34]
          Length = 419

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 119/234 (50%), Gaps = 15/234 (6%)

Query: 79  LQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           ++ WL ++G +    + +    +  RG+  L++ ++GEK+L +P   + T +   +    
Sbjct: 1   MENWLKETGAVGLDNLELADFPITGRGVRTLRHFKEGEKILTIPCGSLWTVEQAHADSLL 60

Query: 138 GEVLKQC----SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDR 192
           G  L+      SV D  +LATY++   S E       ++++ALP    S +++   EL+ 
Sbjct: 61  GPALRSVRPPLSVED--ILATYILFVRSRESGYDGLRSHVAALPSSYSSSIFFAEEELEV 118

Query: 193 YLEAS--QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
               S   + ++  +RI +    Y  L +R+  ++ DLFP E F +E +KW+   ++SR 
Sbjct: 119 CAGTSLYTVTKQLEQRIED---DYRALVMRLLVQHRDLFPLEQFTIEDYKWALCTVWSRA 175

Query: 251 VR--LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           +   LP  +    L P+ADMLNHS  V+    YD SS+ +     + Y+ G+Q+
Sbjct: 176 MDFVLPGGNSIRLLAPFADMLNHSDNVKQCHAYDSSSKTLSVLAGKDYEAGDQL 229


>gi|224125978|ref|XP_002329631.1| predicted protein [Populus trichocarpa]
 gi|222870512|gb|EEF07643.1| predicted protein [Populus trichocarpa]
          Length = 513

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 8/275 (2%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           S +    L+KW   +G+    + I  V+   RG +A K+++ G+  L +P S++I+ +  
Sbjct: 115 SCDKEKCLEKWGESNGVK-TSLKIACVEGAGRGAIATKDLKVGDIALEIPVSIIISEEHV 173

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
                   + K   +    +L  + + E     SS++  Y   LP +  + L +    + 
Sbjct: 174 HKSDMYHILEKIDGITSETMLLLWSMKE-RHNCSSKFKIYFDTLPEEFKTGLSFGVDAI- 231

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
             L+ + + E  ++   ++   Y++L   +   YPD+F  E++  E F W+  + +S  +
Sbjct: 232 MALDGTLLLEEIMQAKEHLRVQYDELVPPLCKNYPDVFLPELYTWEQFLWACELWYSNSM 291

Query: 252 RLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
           ++  +DG  R  L+P A  LNHS    +  +   D ++  + F   R    GEQ  +SYG
Sbjct: 292 KVMFVDGKLRTCLIPIAGFLNHSLYPHIVHYGKVDSATNTLKFPLTRPCCFGEQCCLSYG 351

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
             S+  L+  YGF+P +G NP D + L + +  +D
Sbjct: 352 NFSSSHLITFYGFMP-QGDNPCDVIPLDIDVGDAD 385


>gi|303271033|ref|XP_003054878.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226462852|gb|EEH60130.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 87/308 (28%), Positives = 132/308 (42%), Gaps = 57/308 (18%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
           RG+V  +N+ KGE L+ +P    ++  S  K +  EA + +    V    ++A +L+ E 
Sbjct: 149 RGVVTTRNVTKGETLVAIPLEKCLSTFSARKSAIGEALKTITSREVTIDAVIALHLLHEL 208

Query: 161 SFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
             ++  S W  ++S LPR   + L WT  EL + LE S +    I     V+  +   R 
Sbjct: 209 YVQREKSEWWPWVSILPRDVETPLLWTPRELAQ-LEGSNL----IGFRDAVLKGWTTQRD 263

Query: 220 RIF----SKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRVA 261
            +F     K+P LFPEE F  E + W+  I++SR   +P              S + RV 
Sbjct: 264 ALFPKLTQKFPSLFPEEHFRTERWAWAMAIVWSRAADVPVPRPEAIFPSGDDKSRELRV- 322

Query: 262 LVPWADMLNHSCEVETFL---------------------------DYDKSSQGVVFTTDR 294
           +VP  DM+NH  +                                 +D S + V      
Sbjct: 323 IVPLFDMINHGYDHAPVTPGGVKGGGGEGREKGGVGVDDSPALIPSWDPSRRMVAIRAGV 382

Query: 295 QY-QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +  P  +V  +YG K +  +LL YGFVP    NP +SVE+ +     DK    K E LR
Sbjct: 383 PFPGPNYEVRFNYGAKPSQHVLLQYGFVPM--NNPDESVEVAMHAGSRDKLKSLKSELLR 440

Query: 354 KYGLSASE 361
            + LS  E
Sbjct: 441 THELSPRE 448


>gi|148671823|gb|EDL03770.1| SET domain containing 4, isoform CRA_d [Mus musculus]
          Length = 397

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 138/300 (46%), Gaps = 31/300 (10%)

Query: 87  GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV 146
           GLP     + + + G RGL++  ++++G+ ++ +P S ++T D+       G  +K+   
Sbjct: 7   GLP-----VIRTEAG-RGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLGPYIKKWKP 59

Query: 147 PDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
           P  PLLA  T+L+SE      S W +Y+  LP+  Y+       E+   L  S ++ +A 
Sbjct: 60  PVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-PSPLKAKAE 117

Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
           E+   V   +   R   FS    LF E    VF+   F W++  + +R V L S      
Sbjct: 118 EQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECL 176

Query: 261 -------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
                  AL P+ D+LNHS  V+    +++ ++     T  + +  ++VFI YG   N  
Sbjct: 177 SAEPDTCALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQR 236

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           LLL YGFV     +    V   + +K    +DK    K+  L+ +G + +  F     GW
Sbjct: 237 LLLEYGFVSVRNPHACVPVSADMLVKFLPAADKQLHRKITILKDHGFTGNLTF-----GW 291


>gi|301119251|ref|XP_002907353.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262105865|gb|EEY63917.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 424

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 102/203 (50%), Gaps = 14/203 (6%)

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR----LRI 221
           S+W+ +I  LP+  ++ LY+   E+ + LE S +   A +    V   Y  L+      +
Sbjct: 108 SKWAKHIELLPKTYHNALYFEAGEI-KALEGSNLFFIAQQMEEKVASDYAVLKESVLFEL 166

Query: 222 FSKYP-----DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
           F         DLF +E+F+++ +KW+   ++SR V   +     A+VP  DMLNH  E E
Sbjct: 167 FENITEGITVDLF-DEIFSLDNYKWALSTIWSRFVLPVAKQSFKAMVPVFDMLNHDPEAE 225

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
               +D  +Q     + + +  G Q+FI+YG  SN +LL  YGFV     N  D+V++ L
Sbjct: 226 MSHFFDMETQCFKLVSHQHWNAGAQMFINYGALSNHKLLSLYGFVII--GNLFDAVDMWL 283

Query: 337 SLKK-SDKCYKEKLEALRKYGLS 358
            + + S K Y EK + L   GL 
Sbjct: 284 PMDEASTKFYHEKEQLLLVNGLD 306


>gi|452823683|gb|EME30691.1| hypothetical protein Gasu_19370 [Galdieria sulphuraria]
          Length = 370

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 131/280 (46%), Gaps = 17/280 (6%)

Query: 77  STLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS--KW 132
           +  ++WL    +    Q +++++ D   R  +A K I KG  LL +P  L+IT +   KW
Sbjct: 6   NQFERWLEAHQVSQWKQLLSLERYDNNYRTFLAKKPITKGSILLEIPDPLLITGNKVCKW 65

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR-AELD 191
                    +Q S     LL +  +   S +  S W  Y+  LP   Y LL+  R   L 
Sbjct: 66  LERNNWIGHQQISSVQGVLLVSIFLFFESRQSDSFWKPYLQVLP-TSYDLLFLYRDGLLL 124

Query: 192 RYLEASQIRERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
            Y+  + I +  +E +  ++  T+    +  FS   D     V   E  +W   ++ SR+
Sbjct: 125 SYVTEADIMQ-MVESVRRILRDTFQTYVIPHFSSVDDRDKWNVLFKEFVRWYCAVV-SRI 182

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLD--YDKSSQGV-VFTTDRQYQPGEQVFISYG 307
             LP  D   ALVP  D+ NH   V+T +D  Y K  +G  VF   R +  G QVF+SYG
Sbjct: 183 CYLPD-DIAGALVPLGDIFNHEA-VDTPVDILYAKWERGYYVFRAHRNFSIGTQVFVSYG 240

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYK 346
             SN EL++ YGF   +  NP D++   P  L +S K Y+
Sbjct: 241 ALSNTELMMYYGFTLND--NPWDTLSFYPHELDESIKFYE 278


>gi|391340216|ref|XP_003744440.1| PREDICTED: SET domain-containing protein 4-like [Metaseiulus
           occidentalis]
          Length = 381

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 127/287 (44%), Gaps = 36/287 (12%)

Query: 79  LQKWLSDSGLPPQK-MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L  W+   G  P   + +       RG+V L NI  G+ ++ +P +L+IT D        
Sbjct: 24  LYSWIQRLGFKPTSVLRLACTPASGRGIVCLSNIEAGDVIIDLPSTLLITPD----LVRK 79

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
              + + ++    +L  +++SE S  + S+W  YI ++P                  +  
Sbjct: 80  ELNMSKENLSAEEILTIFVLSERSLGEKSKWKPYIESIPD---------------VFDGL 124

Query: 198 QIRE--RAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRL 253
           Q R+  R   R+   I  +N  R  +FS+    F       N ETF W++  + +R + +
Sbjct: 125 QCRKSVRLPRRLAQAIDRWNAERRNVFSRLRMFFRGRGIDLNFETFSWAWSAVNTRCIYV 184

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
                   L P+ D+LNH  +    ++    +   +  ++  Y+ G +VFI YG   N  
Sbjct: 185 EGHGS--TLAPFLDLLNHHWKAS--IETSFVNNHFIIRSNVGYEAGSEVFIGYGSHDNRT 240

Query: 314 LLLSYGFVPREGTNPSDSVELPLS----LKKSDKCYK--EKLEALRK 354
           L L+YGFV  E  NP+D + + L     LK+S   ++   K+E LR+
Sbjct: 241 LFLNYGFVLDE--NPNDCITVELEHLEKLKRSRNIHEFARKIEFLRQ 285


>gi|413917183|gb|AFW57115.1| hypothetical protein ZEAMMB73_742803 [Zea mays]
          Length = 514

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 89/341 (26%), Positives = 155/341 (45%), Gaps = 25/341 (7%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           +L KW    G+   ++ I       RG++A ++I  G+  L +P SL+I+ D      E 
Sbjct: 125 SLLKWGEHLGIK-SRLQIAYFQGAGRGMIASESIGVGDIALEIPESLIIS-DELLCQSEV 182

Query: 138 GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
              LK  + +    +L  + + E  +   S++  Y   LP    + L +    L   LE 
Sbjct: 183 FLSLKDFNNITSETMLLLWSMRE-RYNLGSKFKPYFDTLPANFNTGLSFGIDAL-AALEG 240

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           + + +  I+   ++   Y++L   + + +P++F ++V   + F W+  + +S  + +   
Sbjct: 241 TLLFDEIIQARQHLRQQYDELFPLLCTNFPEIFRKDVCTWDDFLWACELWYSNSMMIVLS 300

Query: 257 DGRVA--LVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
            G+++  LVP A +LNHS    +  +   D++++ + F   R    GEQ F+SYGK    
Sbjct: 301 SGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGS 360

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--------LRKYGLSASECFP 364
            LL  YGF+PR G NP D + L L     D+    +  A        +R   LS S  FP
Sbjct: 361 HLLTFYGFLPR-GDNPYDVIPLDLDTSADDEDITAQSSATTSQTTHMVRGTWLSTSGGFP 419

Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSK 405
               G P  L+ Y        ++  + +E AA A  K + K
Sbjct: 420 TY--GLPQPLLTYLR-----AALGCEVDEPAAEADMKESDK 453


>gi|301122791|ref|XP_002909122.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099884|gb|EEY57936.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 426

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 71/300 (23%), Positives = 139/300 (46%), Gaps = 20/300 (6%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
            +TL +WL  +G     + I+ +   E  G+ A + +  G+  L +P  L +  +S    
Sbjct: 7   VTTLLEWLKANGGVDNLLDIRYLGKLEGHGVFAKQALTSGQVTLRIPFKLTMNIESAARS 66

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--- 191
             A  + K   +PD  +LA +L+ E S    S ++ +I++LP      ++W+ +EL+   
Sbjct: 67  DLARVLEKYPQIPDDEVLALHLMHERSKRSDSFFAPFIASLPTTFDLPVFWSESELNELK 126

Query: 192 --RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFS 248
               L  +Q+ ++ ++R       + ++   +   +P++F       +E + W+  +++S
Sbjct: 127 GTNVLLLTQLMKQQLQR------DFENIHQAVVEDFPEVFALLPTLTLEDYTWAMSVIWS 180

Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVETFLD----YDKSSQGVVFTTDRQYQPGEQVF 303
           R   +      +  L P  DM NH   +   LD    +D+ +Q +     ++   G  + 
Sbjct: 181 RAFGVTREKKYLRVLCPAMDMFNHDVSLRILLDDFVSFDEETQMLTHHVPKEVAAGSALQ 240

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECF 363
           ISYG+ SN +LL SYGFV +E  N   +V+  + +  +D   K K   L    L+  + +
Sbjct: 241 ISYGQYSNAKLLFSYGFVAKE--NSRRAVDFWMKIPPNDPYLKLKQTVLDSNELTRDQTY 298


>gi|297736447|emb|CBI25318.3| unnamed protein product [Vitis vinifera]
          Length = 487

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 129/259 (49%), Gaps = 21/259 (8%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
           ++I K   G R L A K+I+ G+ +L VP ++ I+ D+    P     L    V +   L
Sbjct: 61  LSIGKSTYGSRSLFASKSIQTGDCILKVPYNVQISPDN---VPSKINSLLGDEVGNIAKL 117

Query: 153 ATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           A  +  E    + S W+ YI+ LP+  + +S ++W+  EL + ++ S + +  I +   +
Sbjct: 118 AIVISVEWKMGQDSEWAPYINRLPQPGEMHSTIFWSEGEL-KMIQQSSVYQETINQKAQI 176

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
              +  ++  +     +LF +   +++ F  +  ++ SR     S  G ++L+P+AD +N
Sbjct: 177 QKDFLAIKPVLHHFSENLFKD--ISLKEFMHACALVGSR--AWGSTKG-LSLIPFADFVN 231

Query: 271 HSCEVETFL--DYDK----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPR 323
           H    ++ L  D DK    SS  +    DR Y PGEQV I YGK  N  LLL +GF +P 
Sbjct: 232 HDGFSDSVLLGDEDKQLSESSSTLEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLP- 290

Query: 324 EGTNPSDSVELPLSLKKSD 342
              N  D V++ +++   D
Sbjct: 291 --YNIYDQVQIQVNIPHHD 307


>gi|149059901|gb|EDM10784.1| hypothetical protein RDA279, isoform CRA_d [Rattus norvegicus]
          Length = 399

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 80/287 (27%), Positives = 129/287 (44%), Gaps = 31/287 (10%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEA 160
           RGL++  ++++G+ ++ +P S ++T D+       G  +K+   P  PLLA  T+L+SE 
Sbjct: 19  RGLMSKASLQEGQVIISLPESCLLTTDTVIR-SSVGPYIKKWKPPVSPLLALCTFLVSER 77

Query: 161 SFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
                S W +Y+  LP+    P  L       L   L A    +RA  R+ ++  +  D 
Sbjct: 78  HAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPGPLRAKAEEQRA--RVQDLFASSRDF 135

Query: 218 RLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWA 266
               FS    LF E V   F+   F W++  + +R V L S             AL P+ 
Sbjct: 136 ----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTCALAPFL 191

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           D+LNHS  V+    +++ ++     T  + +  ++ FI YG   N  LLL YGFV     
Sbjct: 192 DLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEAFICYGPHDNQRLLLEYGFVAFGNP 251

Query: 327 NPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           +    V   + LK    +DK   +KL  L  +G + +  F     GW
Sbjct: 252 HACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTGNLTF-----GW 293


>gi|348537527|ref|XP_003456245.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Oreochromis niloticus]
          Length = 607

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 78/327 (23%), Positives = 145/327 (44%), Gaps = 37/327 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L  W  ++G   +   +        GL A ++I+  E  L++P  +++T +S  +     
Sbjct: 82  LMSWAKENGASCECFTVANFGKEGYGLRATRDIKAEELFLWIPRKMLMTVESAQNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            +L      D  L       LA +L+ E +   +S W  YI +LP Q Y +  + + E  
Sbjct: 138 -ILGPLYSQDRILQAMGNVTLALHLLCERA-NPASFWLPYIRSLP-QEYDIPLYYQQEDV 194

Query: 192 RYLEASQIRERAIERITNVI-------------GTYNDLRLRIFSKYPDLFPEEVFNMET 238
           + L  +Q  +  + +  N               G    + LR+F+    +   ++F+   
Sbjct: 195 QLLLGTQAVQDVLSQYKNTARQYAYFYKLVQDKGMLGSVELRLFASLTPVMGGKLFD--- 251

Query: 239 FKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
            +W+   + +R  ++P+ DG    +AL+P  DM NH+  + T   Y+           + 
Sbjct: 252 -QWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCECVALQD 309

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           Y+  EQ++I YG +SN E ++  GF  ++  +  D V++ L + KS++ Y  K E L + 
Sbjct: 310 YKENEQIYIFYGTRSNAEFVIHNGFFFQD--DAHDRVKIKLGVSKSERLYAMKAEVLARA 367

Query: 356 GLSASECFPIQITGWPLELMAYAYLVV 382
           G+ AS  F +     P+     A+L V
Sbjct: 368 GIPASYVFALHCNEPPISAQLLAFLRV 394


>gi|449506720|ref|XP_004162829.1| PREDICTED: uncharacterized LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 77/324 (23%), Positives = 150/324 (46%), Gaps = 46/324 (14%)

Query: 71  DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           +S E  + L +W   +G+    + I  V+   RG +A +++  G+ +L +P +++I+   
Sbjct: 154 NSCEKGNCLLEWGESNGVR-TSLKIAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISE-- 210

Query: 131 KWSCPEAGEVLKQCSVPDWPLLA--------TYLISEASFEK---SSRWSNYISALPRQP 179
                   E++++ ++  +P+L+        T ++  +  EK    S +  Y   LP   
Sbjct: 211 --------ELVQKSTM--YPVLSKVEGMLPETMMLLWSMKEKHIVDSEFRVYFDTLPEAF 260

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
            + L +    +   L  + + +  ++   ++   YN+L   + + +PD+FPEE ++ E F
Sbjct: 261 NTGLSFGVGAMTT-LVGTLLFDELMQAKEHLRKQYNELFPALCNNHPDIFPEEFYSWEEF 319

Query: 240 KWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQ 295
            W+  + +S  +++   DG  R  LVP A  LNHS    +  +   D  +  + F   R 
Sbjct: 320 LWACELWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGKVDSDTDSLKFRLSRP 379

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK------------ 343
            + GE+ ++SYG  S   L+  YGF+P EG N +D + L +     D             
Sbjct: 380 CRAGEECYLSYGNYSGSHLVTFYGFLP-EGDNVNDVIPLDIDFGDDDNNNITSDWSTHMV 438

Query: 344 --CYKEKLEALRKYGLSAS--ECF 363
              +  K++++  YGL +   ECF
Sbjct: 439 RGTWLSKIQSIFHYGLPSPFLECF 462


>gi|346465219|gb|AEO32454.1| hypothetical protein [Amblyomma maculatum]
          Length = 353

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 126/273 (46%), Gaps = 27/273 (9%)

Query: 79  LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L +W+  +G     ++ +++     RGL  L+ +  GE  L VP  L+IT  +  S    
Sbjct: 29  LLEWMIANGFELHVQLCVREFTETGRGLATLQKVTAGETFLRVPTCLLITTTTALSSSLH 88

Query: 138 GEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           G +++    +    +L  +LI+E      S W  +I +LP    + ++     L R  E 
Sbjct: 89  GFLVRHHRQLTAIEVLTLFLINEKLRGLDSEWRFFIDSLPVSYTTPVFLGSKLLARLPET 148

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM-ETFKWSFGI---------- 245
             +  +A  +++ +  T+  +RL+I  K   L    + N+ E F W   +          
Sbjct: 149 --MCRKAEAQVSRIRRTF--VRLQILLKRALLDDSALLNLSENFTWHLFVWAWTAVNTRC 204

Query: 246 LFSRLVRLPSM--DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF--TTDRQYQPGEQ 301
           +FS+     S   D   AL P+ D LNH  +     D + + +G  F   T+  Y+P +Q
Sbjct: 205 IFSKHRTDHSFWDDDYCALAPFLDCLNHHWKA----DVETTVEGSYFEIVTNNNYEPNDQ 260

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           VFISYG   N +LLL YGFV  +  NP+D V +
Sbjct: 261 VFISYGSHDNKKLLLEYGFVLAD--NPNDVVAI 291


>gi|145553305|ref|XP_001462327.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124430166|emb|CAK94954.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 85.1 bits (209), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 77/282 (27%), Positives = 133/282 (47%), Gaps = 21/282 (7%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
            L +WL D      K++I+    G R L A + IR+GE +LFVP +  ++ +  K SC  
Sbjct: 42  NLIQWLKDGKAEVSKVSIEVKSEGYRTLRASQFIRQGEWVLFVPRTHYLSLEEVKKSCLI 101

Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             ++++   +P+   + TY ++   + +  ++S W  YI  LP+          AE D  
Sbjct: 102 NRKMIQLNYIPN--NIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTNFDAEQDAL 159

Query: 194 LEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGILFSRL 250
           L+ S      + +       Y++L+  ++ F +Y   + + V F   T   SF       
Sbjct: 160 LKGSPTLFTVMNQRKTFQEEYDNLKEAVKEFQRYGYTYNDFVKFRTLTISRSFP------ 213

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS--SQGVVFTTDRQYQPGEQVFISYGK 308
           V +   + +  LVP AD +NH  +   FL Y  S  + G      R  Q GE++F +YG+
Sbjct: 214 VYIGENEQQQLLVPLADFINH--DNNGFLQYGYSPDADGFFMQAVRNIQKGEELFYNYGQ 271

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
            SN    ++YGF     TNP +  +  + L ++D+ +K K+E
Sbjct: 272 WSNKYFFMNYGFASL--TNPMNQFDFDICLDRNDRMFKMKVE 311


>gi|348552908|ref|XP_003462269.1| PREDICTED: SET domain-containing protein 4-like [Cavia porcellus]
          Length = 440

 Score = 85.1 bits (209), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 135/310 (43%), Gaps = 29/310 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +   +     RGL++  ++R+G+ ++ +P S ++T D+        
Sbjct: 36  LKKWLKDRNFEDTNLMPARFPGTGRGLMSKTSLREGQMIISLPGSCLLTTDTVIRSSLGA 95

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
            ++K    P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 96  YIIKWKPPPS-PLLALCTFLVSEKHAGDQSVWKPYLDILPKS-YTCPVCLEPEVVNLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
             ++ +A E+  +V   +   R   FS    LF E    VF+     W++  + +R V L
Sbjct: 153 EPLKAKAEEQRMSVQQFFASSR-DFFSSLQPLFEEATDSVFSYSALLWAWCTVNTRAVYL 211

Query: 254 PSMD--------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            +             AL P+ D+LNHS  V+    +++ +      T   Y+  ++VFI 
Sbjct: 212 RTRRRDCLSLEPDTCALAPYLDLLNHSPNVQVKAAFNEETGCYEIRTASDYRKHKEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYGLSAS 360
           YG   N  LLL YGFV     NP   V      L   L  +DK   +K+  L+ +G   +
Sbjct: 272 YGPHDNHRLLLEYGFVSL--CNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFLEN 329

Query: 361 ECFPIQITGW 370
             F     GW
Sbjct: 330 LTF-----GW 334


>gi|449283795|gb|EMC90389.1| SET domain-containing protein 4 [Columba livia]
          Length = 440

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 124/270 (45%), Gaps = 24/270 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLF-VPPSLVITADSKWS-CP 135
            +KWL D G     +   +  D G RGL+  K ++    L+  +P   ++T D+  S C 
Sbjct: 35  FRKWLKDRGFEDSHLRPAEFWDTG-RGLMTTKTLQVSRDLIISLPEKCLLTTDTVLSSC- 92

Query: 136 EAGEVLKQCSVPDWPL--LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             GE + +   P  PL  L T+LI+E    + S W  Y+  LP+  YS       ++   
Sbjct: 93  -LGEYIMKWKPPVSPLTALCTFLIAEKHAGEKSLWKPYLDVLPKT-YSCPVCLEHDVVSL 150

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRL 250
           L    +R++A E+ T V   Y   +   FS    LF E    +FN    +W++  + +R 
Sbjct: 151 L-PEPLRKKAQEQRTKVHELYISSK-AFFSSLQPLFAENTETIFNYSALEWAWCTINTRT 208

Query: 251 VRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           + +     +         AL P+ D+LNHS  V+    +++ ++     T+   +  E+V
Sbjct: 209 IYMKHSQRKCFSLEPDVYALAPYLDLLNHSPNVQVKAAFNEQTRSYEIRTNSLCKKYEEV 268

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSV 332
           FI YG   N  LLL YGFV  +  NP  SV
Sbjct: 269 FICYGPHDNQRLLLEYGFVAMD--NPHSSV 296


>gi|449466129|ref|XP_004150779.1| PREDICTED: uncharacterized protein LOC101212907 [Cucumis sativus]
          Length = 559

 Score = 84.7 bits (208), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 75/319 (23%), Positives = 143/319 (44%), Gaps = 36/319 (11%)

Query: 71  DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD- 129
           +S E  + L +W   +G+    + I  V+   RG +A +++  G+ +L +P +++I+ + 
Sbjct: 154 NSCEKGNCLLEWGESNGVR-TSLKIAYVEGAGRGTIAKEDLDVGDTVLEIPLAIIISEEL 212

Query: 130 --SKWSCPEAGEV---LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
                  P   +V   L +     W +   +++        S +  Y   LP    + L 
Sbjct: 213 VQKSTMYPVLSKVEGMLPETMTLLWSMKEKHIVD-------SEFRVYFDTLPEAFNTGLS 265

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           +    +   L  + + +  ++   ++   YN+L   + + +PD+FPEE ++ E F W+  
Sbjct: 266 FGVGAMTT-LVGTLLFDELMQAKEHLRKQYNELFPALCNNHPDIFPEEFYSWEEFLWACE 324

Query: 245 ILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGE 300
           + +S  +++   DG  R  LVP A  LNHS    +  +   D  +  + F   R  + GE
Sbjct: 325 LWYSNSLKIMFPDGNVRTCLVPIAGFLNHSLHPHILHYGKVDSDTDSLKFRLSRPCRAGE 384

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK--------------CYK 346
           + ++SYG  S   L+  YGF+P EG N +D + L +     D                + 
Sbjct: 385 ECYLSYGNYSGSHLVTFYGFLP-EGDNVNDVIPLDIDFGDDDNNNITSDWSTHMVRGTWL 443

Query: 347 EKLEALRKYGLSAS--ECF 363
            K++++  YGL +   ECF
Sbjct: 444 SKIQSIFHYGLPSPFLECF 462


>gi|403271547|ref|XP_003927684.1| PREDICTED: SET domain-containing protein 4 [Saimiri boliviensis
           boliviensis]
          Length = 440

 Score = 84.7 bits (208), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 81/296 (27%), Positives = 132/296 (44%), Gaps = 24/296 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKDRKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHCYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     NP   V      L   L  +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFV--SAHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|302784522|ref|XP_002974033.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
 gi|300158365|gb|EFJ24988.1| hypothetical protein SELMODRAFT_414219 [Selaginella moellendorffii]
          Length = 527

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 110/243 (45%), Gaps = 17/243 (6%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           ++A   I KG  L  VP    ++  +   C   G +L++     W  +  +L+ E S  K
Sbjct: 65  MIASGAIDKGSVLAEVPLQAFLSEKTAERCLLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124

Query: 165 SSRWSNYISALPR-QPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            S W  YISALP  +  S+   L W    +   L+ S + +    R+      +  L   
Sbjct: 125 ESFWHPYISALPSVEELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADMLNH--SC 273
              K+  L   E  +    +W+  +L SR   L  +D       + LVPWADMLNH  S 
Sbjct: 185 GIEKF--LPGGETLSEGDVRWASAVLLSRAFSL-ELDVDDDFDTLCLVPWADMLNHCSSA 241

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG-ELLLSYGFVPREGTNPSDSV 332
             E+ L +D+ ++       + Y  G++VF SYG    G +L L YGFV  E  N + +V
Sbjct: 242 GEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE--NENYAV 299

Query: 333 ELP 335
           +LP
Sbjct: 300 DLP 302


>gi|291410015|ref|XP_002721306.1| PREDICTED: SET domain containing 4 [Oryctolagus cuniculus]
          Length = 440

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 137/316 (43%), Gaps = 41/316 (12%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           LQKWL D     + +A  +     RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LQKWLKDRKFEDKNLAPARFPGTGRGLMSTVSLQEGQMIISLPESCLLTTDTVIES-YLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-- 194
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  PYITKWKPPPSPLLALCTFLVSEKHAGDRSPWQPYLEILPKA-YTCPVCLDPEVVNLLPK 153

Query: 195 ----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILF 247
               +A + R R  E   +  G         FS    LF E    +F+     W++  + 
Sbjct: 154 PLQMKAEEQRARLWEFFASSRG--------FFSSLQPLFVEPIDSIFSYSALLWAWCTVN 205

Query: 248 SRLVRL--------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           +R V L         +     AL P+ D+LNHS  V+    +++ ++     T  +++  
Sbjct: 206 TRAVYLRRRPRECLSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASRFRKH 265

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE-----LPLSLKKSDKCYKEKLEALRK 354
           E+VFI YG   N  LLL YGFV     NP   V      L   L  +DK   +K+  L+ 
Sbjct: 266 EEVFICYGPHDNQRLLLEYGFV--SVRNPHACVYVSGEILVKYLPPTDKQLNKKVAILKD 323

Query: 355 YGLSASECFPIQITGW 370
           +G   +  F     GW
Sbjct: 324 HGFIENLTF-----GW 334


>gi|224012755|ref|XP_002295030.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969469|gb|EED87810.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 753

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/257 (31%), Positives = 124/257 (48%), Gaps = 24/257 (9%)

Query: 96  QKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK---QCSVPDW 149
           Q++D G    RG+ A  +I      + +P S +IT +   + P   ++L    +   P  
Sbjct: 115 QQLDDGSKEMRGVHAKTSIPPNTICVSIPKSCLITVEMGQATPIGRKILTSDLELDAPKH 174

Query: 150 PLLATYLISEASFE-KSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI 207
             L  Y++ +     ++S ++ Y   LP    ++ ++WTR ELD  LE S +  +  +R 
Sbjct: 175 IFLMIYILWDRKVNGETSFFAPYYKILPETLRNMPIFWTREELD-ALEGSYLLLQIADRA 233

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWA 266
             +   Y    + I S  P+    ++  +E F+W+  I+ SR   L     R  ALVP A
Sbjct: 234 EAIKEDY----ISICSIAPEF--GDIATLEEFQWARMIVCSRNFGLLINGHRTSALVPHA 287

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-----V 321
           DMLNH    ET   + + SQ    TT ++   GEQVF SYG+K N   LL+YGF     V
Sbjct: 288 DMLNHLRPRETKWTFSEESQSFTITTLQEIGMGEQVFDSYGQKCNHRFLLNYGFCVERNV 347

Query: 322 PREGTNPSDSVELPLSL 338
             +G  P+   E+PL L
Sbjct: 348 EVDGFCPN---EVPLEL 361


>gi|330822500|ref|XP_003291689.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
 gi|325078125|gb|EGC31794.1| hypothetical protein DICPUDRAFT_57488 [Dictyostelium purpureum]
          Length = 540

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 94/384 (24%), Positives = 166/384 (43%), Gaps = 37/384 (9%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER-----GLVALKNIRKGEKLLFVPPSL---VIT 127
            S   +WL +SG    K    KV +G       GLV+  +I++GE+ L +P  L   ++T
Sbjct: 70  VSNFMEWLKNSGFDETK---SKVKIGRNLAEGSGLVSTCDIKEGEEFLEIPEKLFIDIMT 126

Query: 128 ADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
           A   +       +L+   +   P  +LA YLI E++   SS  + Y+  LP+   ++ YW
Sbjct: 127 ALKSFGQSGYDILLRDNLIRRVPNLVLALYLIKESTNPDSS-IAPYLKVLPKTYSTIGYW 185

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
              +  + LE S + + A+      +  Y     ++F   P +     F  E F W+   
Sbjct: 186 GIEDF-KQLEGSPVFQTAVNYTRGSMRQYCYF-YQLFDNNPGILQTSNFTYEAFIWAVAT 243

Query: 246 LFSRLVRLPSMDGR-VALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
           + SR  + P   G+ +AL+P+ D  NHS    ++ TF+D  K    +  +  + Y+ GEQ
Sbjct: 244 VQSR--QNPVGGGQEMALIPFWDFCNHSSHGGKITTFIDPVKHV--LTCSAAKSYKKGEQ 299

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL-RKYGLSAS 360
           V++ YG + N +  L  GF  +   N   S ++ L  +       +K+  L  + GL   
Sbjct: 300 VYMYYGPRPNSQFYLFQGFSLKTNLNDDYSFDMDLDNEDDRDIAHDKIHILEERCGLRVG 359

Query: 361 ECFPIQIT----GWPLELMA-YAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK------ 409
           +   +         P E++  Y    +SP   K           +      D+K      
Sbjct: 360 QTVSLSQNPSSEKLPAEIIPFYRIAALSPEETKKLAPPQEEGHHHHHQGPMDMKPEAFNI 419

Query: 410 -CPEIDEQALQFILDSCESSISKY 432
              E +++A + +LDS ++ +S Y
Sbjct: 420 ISEENEKKAFKLLLDSLKARLSGY 443


>gi|407035166|gb|EKE37568.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Entamoeba nuttalli P19]
          Length = 791

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/316 (26%), Positives = 148/316 (46%), Gaps = 33/316 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           ++KW+  +G     + ++  D   RGL A K  +K E ++ +P S+ I   +        
Sbjct: 4   IKKWVIQNGGVIDGVDVKTFDGYGRGLCANKEFKKDEIIMSIPYSIQINRIN------LN 57

Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            +  +  +P +            L+  YL    +  K   W  YI+ LP      L +T 
Sbjct: 58  HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTI 116

Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
            EL+  ++ +++   A+E+I      V+  YN+  ++ F +Y   F +++F  +  +W+ 
Sbjct: 117 DELNL-MKGTKLY-AAVEKINAFLMKVVDYYNNKLIQQFPQYFQPF-DDLF--KRLQWAH 171

Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
              +SR  LV  P   G V +L+P+ D  NH  + +     +  ++   F T+    +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEALVKPG 231

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           EQ+F +Y  +SN +LLL YGFV  E  NP D++ L +  +  D  Y E  E L++  + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289

Query: 360 SECFPIQITGWPLELM 375
            + F       PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305


>gi|344277088|ref|XP_003410336.1| PREDICTED: SET domain-containing protein 4 [Loxodonta africana]
          Length = 440

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 134/308 (43%), Gaps = 25/308 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +   +     RGL++  +++ G+ ++ +P S +++ D+       G
Sbjct: 36  LKKWLKDRKFEDTNLIPARFPGTGRGLMSKTSLQVGQMIISLPESCLLSTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+ E      S W  Y+  LP+     + W    ++  L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVLEKHAGDQSSWKPYLETLPKTYTCPVCWEPEVVN--LLP 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK---WSFGILFSRLVRL 253
             +R +A E+ T V   +   R   FS    LF E V N+ T+    W++  + +R V L
Sbjct: 153 RPLRAKAQEQRTRVQEFFTSFR-DFFSSLQPLFSEAVENIFTYSALLWAWCTVNTRAVYL 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                R         AL P+ D+LNHS +V+    +++ ++          +  E+VFI 
Sbjct: 212 RHRQLRCFSAEPDTCALAPYLDLLNHSPDVQVKAAFNEKTRCYEIVAVSSCRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASEC 362
           YG   N  LLL YGFV     +    V   + +K    +DK   +K+  L+ +    +  
Sbjct: 272 YGPHDNHRLLLEYGFVSTRNPHACVYVSRDILVKYLPSTDKQMNKKISILKDHDFIENLT 331

Query: 363 FPIQITGW 370
           F     GW
Sbjct: 332 F-----GW 334


>gi|67484540|ref|XP_657490.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474743|gb|EAL52100.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 791

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/316 (26%), Positives = 149/316 (47%), Gaps = 33/316 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           ++KW+  +G     + ++  +   RGL A K  +K E ++ +P S+ I   +        
Sbjct: 4   IKKWVIQNGGVIDGVDVKTFEGYGRGLCANKEFKKDEVIMSIPYSIQINRIN------LN 57

Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            +  +  +P +            L+  YL    +  K   W  YI+ LP      L +T 
Sbjct: 58  HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTI 116

Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
            EL+  ++ +++   A+E+I      V+  YN+  ++ F +Y   F +++F  +  +W+ 
Sbjct: 117 DELNL-MKGTKLY-AAVEKINAFLMKVVDYYNNKLIQQFPQYFQSF-DDLF--KRLQWAH 171

Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
              +SR  LV  P   G V +L+P+ D  NH  + +     +  ++   F T+ +  +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPG 231

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           EQ+F +Y  +SN +LLL YGFV  E  NP D++ L +  +  D  Y E  E L++  + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289

Query: 360 SECFPIQITGWPLELM 375
            + F       PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305


>gi|156384284|ref|XP_001633261.1| predicted protein [Nematostella vectensis]
 gi|156220328|gb|EDO41198.1| predicted protein [Nematostella vectensis]
          Length = 403

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 133/287 (46%), Gaps = 15/287 (5%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL---VITADSKWSCPEAG 138
           W   +G   + + I        GL A  ++++ +  + VP  L   V+TA      P   
Sbjct: 1   WFKANGGTAEHVEIHDFGDQGLGLRATADLQENQVFVAVPEKLLMSVVTAKKSSLGPLIS 60

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQ 198
                 S+P   +LA +++ E   E S+ W+ Y++ LPR   + LY++  ++   L+ S 
Sbjct: 61  REHGLRSMPH-VVLALHVLCERLHEDST-WAPYLNILPRSYSTCLYFSPDDM-MALQGSP 117

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFSRL--VRL 253
               A+++   ++  Y     R+    P+   L  +  F  + F+W+   + +R   V++
Sbjct: 118 SMGEALKQFRGIVKQYVYF-FRLVQINPEASRLPLKNSFTFDDFRWAVSTVMTRQNDVKV 176

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
            S +   AL+P  DM NH C       +D S++ V     +  + G+QVFI YG+++N +
Sbjct: 177 SSNETVKALIPMWDMCNH-CNGPFTTGFDDSTKEVKSLAFKPTRAGDQVFIFYGRRNNAD 235

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
            L   GFV  E     D V + L + K+D+ Y  K + L   GL AS
Sbjct: 236 RLFHNGFVYTEAE--EDWVNIQLGVSKNDRLYAMKAQILAMVGLDAS 280


>gi|395848935|ref|XP_003797093.1| PREDICTED: SET domain-containing protein 4 [Otolemur garnettii]
          Length = 440

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 134/311 (43%), Gaps = 31/311 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++++G+ ++ +P + ++T D+       G
Sbjct: 36  LKKWLKDRKFEDTNLMPAHFPGTGRGLMSKTSLQEGQMIISLPENCLLTTDTVIE-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRY 193
             + +   P  PLLA  T+L+SE      S W  Y+  LP+    P  L       L + 
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDQSPWKPYLEILPKAYTCPVCLEPEVVNLLPKP 154

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRL 250
           L+A    +RA   +     +  D     FS    LF E V   F+     W++  + +R 
Sbjct: 155 LKAKAEEQRA--HVQEFFASSRDF----FSSLQPLFAEAVDSIFSYSALLWAWCTVNTRA 208

Query: 251 VRL--------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           V L         +     AL P+ D+LNHS  V+    +++ ++     T   ++  E+V
Sbjct: 209 VYLRHRRRECLSAEPDTCALAPYLDLLNHSPNVQVRAAFNEETRCYEIRTASSWRKHEEV 268

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSA 359
           FI YG   N  LLL YGFV  +  +    V   + +K    +DK   +K+  L+ +G   
Sbjct: 269 FICYGHHDNQRLLLEYGFVSIQNPHACVYVSREILVKYLPSTDKQMNKKISILKDHGFIE 328

Query: 360 SECFPIQITGW 370
           +  F     GW
Sbjct: 329 NLTF-----GW 334


>gi|66828265|ref|XP_647487.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
 gi|60475797|gb|EAL73732.1| hypothetical protein DDB_G0268558 [Dictyostelium discoideum AX4]
          Length = 459

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 72/280 (25%), Positives = 135/280 (48%), Gaps = 20/280 (7%)

Query: 79  LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
             KWL ++ +    K+ I+ ++   R +VA ++I+K EKL+ VP  ++++    +S    
Sbjct: 38  FNKWLINNKVYKNPKIEIKVLEKYGRSIVAKQSIKKNEKLISVPKLIIMSNMGGFSHHLP 97

Query: 138 GEVLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
            E+ +      +    L A +L+        S W  Y+S LP++  + +Y++  ELD  L
Sbjct: 98  NEIYEPSISIGISPTNLQAIFLMY-CKLNDKSFWYPYVSVLPKEFTTSIYFSEEELDE-L 155

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--------EEVFNMETFKWSFGIL 246
           ++S+++E  I R   +   YN    R+ ++    F         ++ + +E F W+   +
Sbjct: 156 QSSKLKEFTIIRKDGIERHYNSTFTRLSNRGIAEFSPTSTQTLQQKGYTLELFTWALSCV 215

Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           +SR   L   DG   +VP ADM N     ++ +    +   + +      + GEQ+F  Y
Sbjct: 216 WSRAFSLSDSDG--GMVPLADMFNAEEISKSKVQPKVTDSTLDYYASDDIEIGEQIFTPY 273

Query: 307 GKK---SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
           G     S+ ++L+ YGFV   GT PSD+V + + +   D+
Sbjct: 274 GVYKPLSSSQMLMDYGFVFDHGT-PSDNVAISVPIFHPDE 312


>gi|225448769|ref|XP_002275729.1| PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase, chloroplastic-like [Vitis
           vinifera]
          Length = 480

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/253 (28%), Positives = 126/253 (49%), Gaps = 16/253 (6%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
           ++I K   G R L A K+I+ G+ +L VP ++ I+ D+    P     L    V +   L
Sbjct: 61  LSIGKSTYG-RSLFASKSIQTGDCILKVPYNVQISPDN---VPSKINSLLGDEVGNIAKL 116

Query: 153 ATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           A  +  E    + S W+ YI+ LP+  + +S ++W+  EL + ++ S + +  I +   +
Sbjct: 117 AIVISVEWKMGQDSEWAPYINRLPQPGEMHSTIFWSEGEL-KMIQQSSVYQETINQKAQI 175

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
              +  ++  +     +LF +   +++ F  +  ++ SR     S  G ++L+P+AD +N
Sbjct: 176 QKDFLAIKPVLHHFSENLFKD--ISLKEFMHACALVGSR--AWGSTKG-LSLIPFADFVN 230

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPS 329
           H    ++ L  D+  Q      DR Y PGEQV I YGK  N  LLL +GF +P    N  
Sbjct: 231 HDGFSDSVLLGDEDKQLSEVIADRNYAPGEQVLIRYGKFPNATLLLDFGFTLP---YNIY 287

Query: 330 DSVELPLSLKKSD 342
           D V++ +++   D
Sbjct: 288 DQVQIQVNIPHHD 300


>gi|79315114|ref|NP_001030864.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|51971180|dbj|BAD44282.1| unnamed protein product [Arabidopsis thaliana]
 gi|332645817|gb|AEE79338.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 353

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 106/218 (48%), Gaps = 27/218 (12%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY----LEASQIRERA- 203
           +LA  LI E    + SRW  YIS LP+  + +S ++W   EL       +    ++++A 
Sbjct: 1   MLAAVLIREKKMGQKSRWVPYISRLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQ 60

Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
           IE+  + +         I ++ PDL        E F +++ ++ SR         R++L+
Sbjct: 61  IEKDFSFVAQAFKQHCPIVTERPDL--------EDFMYAYALVGSRAW---ENSKRISLI 109

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTT-----DRQYQPGEQVFISYGKKSNGELLLSY 318
           P+AD +NH     + +  D+ +Q   F+T     DR Y PG++VFI YG+ SN  L+L +
Sbjct: 110 PFADFMNHDGLSASIVLRDEDNQLSEFSTLQVTADRNYSPGDEVFIKYGEFSNATLMLDF 169

Query: 319 GFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           GF  P    N  D V++ + +   D     KL  L+ +
Sbjct: 170 GFTFP---YNIHDEVQIQMDVPNDDPLRNMKLGLLQTH 204


>gi|426392958|ref|XP_004062802.1| PREDICTED: SET domain-containing protein 4 [Gorilla gorilla
           gorilla]
          Length = 440

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 132/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE    + S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGRRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|167521575|ref|XP_001745126.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776740|gb|EDQ90359.1| predicted protein [Monosiga brevicollis MX1]
          Length = 390

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 124/298 (41%), Gaps = 22/298 (7%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E    L  WL   G    K+A+   +   +GL A      GE LL +P + +++ +S   
Sbjct: 25  EEYDELVDWLKQCGATVDKVAVDHFNGMGQGLKATAEAAPGETLLRIPEACMLSEESARR 84

Query: 134 CPEAGEVLKQCSVPDWP-LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
                 +     +   P +   + +     +  S W  YI+ LP      LYW   +L  
Sbjct: 85  STLGAYMDSDTMLKLMPNVTLAFHLLLELHDLDSFWRPYIACLPVSYSVPLYWDLPDL-M 143

Query: 193 YLEASQIRERAIERITNVIGTY----NDLRLR------IFSKYPDLFPEEVFNMETFKWS 242
            L  S +   AI    +V   Y    N L +R       F     L PE  F  E ++W+
Sbjct: 144 SLRGSSLFVEAIRLYKHVCRQYGYLHNKLSVRANPSCSCFPLTLGLSPE-AFTFEDWRWA 202

Query: 243 FGILFSRLVRLPSM--DGR----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
              + +R   +P    DG+    +AL+P  DM+NH+    +   +D   + + F      
Sbjct: 203 VATVMTRQNSIPQAGPDGQMKPTLALIPLWDMINHANHPMS-TQFDSERECLEFVCPAPA 261

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           +PG Q+ + YG ++NG+ LL  GF      N  D V +P SL ++D  YK K   LR 
Sbjct: 262 KPGSQITMWYGDRNNGQFLLHQGFFFAGHAN--DYVNVPFSLDETDSLYKIKALLLRN 317


>gi|195132508|ref|XP_002010685.1| GI21676 [Drosophila mojavensis]
 gi|193907473|gb|EDW06340.1| GI21676 [Drosophila mojavensis]
          Length = 593

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 80/296 (27%), Positives = 135/296 (45%), Gaps = 24/296 (8%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
           E   L       +W    G+    + I      + GL A ++I+ GE++L VP  L+   
Sbjct: 168 EQTRLSKIEAFNEWARAGGVKTDCVEIATFPGYQLGLRATRDIKAGEQVLSVPRKLIF-- 225

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
            S+   PE    L + + P    +   LI E      S W  +I  LP +  ++LY+T  
Sbjct: 226 -SEELLPEKQRQLFR-NFPTHLKVTYTLIMEKLRGADSPWQPFIDTLPSRYNTVLYFTVE 283

Query: 189 ELDRYLEASQ----IRE-RAIERITNVI--GTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
           ++ R    S     +R  R I R+   +    +  L   +     +LF +     E ++W
Sbjct: 284 QMQRLRGTSACSAAVRHCRVIARLYASMYKCAFMQLDDSVMGGMANLFTDYGLCYELYRW 343

Query: 242 SFGILFSR--LV---RLPSMDGRV---ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTT 292
           +   + +R  LV    +PS    +   AL+P+ DM NH S ++ +F  YD+++  +  T 
Sbjct: 344 AVSTVTTRQNLVPRQEIPSDAANLPISALIPYWDMANHRSGKITSF--YDQAAGQMECTA 401

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
              Y+ GEQ FI YG +SN + L+  GFV  +  NP D V++ L L  +D   +++
Sbjct: 402 QEAYKSGEQYFIYYGDRSNADRLVHNGFVDMQ--NPKDYVQIRLGLSPTDALAEQR 455


>gi|302803412|ref|XP_002983459.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
 gi|300148702|gb|EFJ15360.1| hypothetical protein SELMODRAFT_445547 [Selaginella moellendorffii]
          Length = 536

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 109/243 (44%), Gaps = 17/243 (6%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           ++A   I KG  L  VP    ++  +   C   G +L++     W  +  +L+ E S  K
Sbjct: 65  MIASGAIDKGSVLAEVPLQAFLSEKTAERCRLVGPMLRKNDFRPWLTMCAHLLVERSRGK 124

Query: 165 SSRWSNYISALPR-QPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            S W  YI+ALP     S+   L W    +   L+ S + +    R+      +  L   
Sbjct: 125 ESFWHPYIAALPSVDELSISHPLLWPAETIQELLQGSPMLDTIATRLKLCQEDHEALLTA 184

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADMLNH--SC 273
              K+  L   E  +    +W+  +L SR   L  +D       + LVPWADMLNH  S 
Sbjct: 185 GIEKF--LPGGETLSEGDVRWASAVLLSRAFSL-ELDVDDDFDTLCLVPWADMLNHCSSA 241

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG-ELLLSYGFVPREGTNPSDSV 332
             E+ L +D+ ++       + Y  G++VF SYG    G +L L YGFV  E  N + +V
Sbjct: 242 GEESCLIFDQDTKTASLEAHKSYSKGDEVFDSYGPALTGSQLFLDYGFVDDE--NENYAV 299

Query: 333 ELP 335
           +LP
Sbjct: 300 DLP 302


>gi|302768639|ref|XP_002967739.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
 gi|300164477|gb|EFJ31086.1| hypothetical protein SELMODRAFT_408995 [Selaginella moellendorffii]
          Length = 421

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/284 (27%), Positives = 121/284 (42%), Gaps = 43/284 (15%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L  WL   G      ++ K    +RGL A+++I+ GE +L V    ++TAD     P   
Sbjct: 40  LVSWLKIRG-EHDACSLLKTGPDKRGLFAVRDIKAGECILRVSRDTMMTADR---LPLEF 95

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEA 196
           + L    V +W  LA  L+ E    ++S W+ YIS LPR    +S  +W + EL   ++ 
Sbjct: 96  QQLLSSGVSEWAQLALLLLFEKRAGEASIWAPYISCLPRWGTIHSTAFWRKEELT-MIQE 154

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           S +    + R   +   +N+++   F+ + +             WS   + +        
Sbjct: 155 SSLSYETMSRRAAIREEFNEMQSVPFADFMN-----------HDWSSNAMLTY------- 196

Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT----DRQYQPGEQVFISYGKKSNG 312
                     D  N S EVE    Y       +F      D+ Y  GEQV IS+G   N 
Sbjct: 197 ----------DTDNGSTEVEEVKVYSDCLYIALFCAQLFADKNYAAGEQVTISFGPLCNA 246

Query: 313 ELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
            L L +GF VP    NP D V+L L + + D   KEKL+ L  +
Sbjct: 247 SLALDFGFTVP---YNPWDKVQLWLGISRRDSLRKEKLQYLHSH 287


>gi|301094169|ref|XP_002997928.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262109714|gb|EEY67766.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 104/234 (44%), Gaps = 27/234 (11%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           L + K +R+G    FVP +LV+      + P A +       PD  L+A      AS +K
Sbjct: 83  LASNKALREGSS--FVPSALVLGVHMLVNFPHAED-------PDGLLMAM-----ASVDK 128

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
                 Y+SALPR     LYW     DR  +  Q  E A   + +    Y+ +   +F  
Sbjct: 129 PPLDELYVSALPRYVDLPLYWD----DRKFKELQGCEEARRAVQHGARFYSQVYQHLFGT 184

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS 284
                  +  + E F W+  IL SR       +   AL+P+ D  NH+   +  +     
Sbjct: 185 N-----NQFVSAEAFFWAISILMSRATS--GQNQPFALIPFFDWFNHADNGDECVQEFDP 237

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            +G    T + Y+PGEQ++I+YG  SN  LL +YGF      NP D V LP+ +
Sbjct: 238 QKGFTVHTTKAYEPGEQLYINYGSHSNLRLLRNYGFT--TPNNPYDVVTLPMPI 289


>gi|22328112|gb|AAH36556.1| SETD4 protein [Homo sapiens]
 gi|119630166|gb|EAX09761.1| SET domain containing 4, isoform CRA_d [Homo sapiens]
 gi|167773807|gb|ABZ92338.1| SET domain containing 4 [synthetic construct]
          Length = 416

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 12  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 70

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 71  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 129

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 130 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 187

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 188 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 247

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 248 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 301


>gi|335300684|ref|XP_003358991.1| PREDICTED: SET domain-containing protein 4 [Sus scrofa]
          Length = 440

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 139/311 (44%), Gaps = 31/311 (9%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +   +     RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LKKWLKDRNFEDTNLIPARFPGTGRGLMSKTSLQEGQLVIALPESCLLTTDTVLRS-YLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  PYIAKWQPPPSPLLALCTFLVSEKHAGDQSPWKPYLEVLPKT-YTCPVCLEPEVVNLLPG 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+ T V   ++  R    S  P LFPE V   F+     W++  + +R V +
Sbjct: 154 P-LKSKAREQRTRVWEFFSSSRDFFSSLQP-LFPEAVESIFSYSALLWAWCTVNTRAVYM 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                +         AL P+ D+LNHS  V+    +++ S+     T    +  E+VFI 
Sbjct: 212 KQRPRQCFSTEPDTCALAPYLDLLNHSPAVQVKAAFNEESRCYEIRTGTSCRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFV-PREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSA 359
           YG   +  LLL YGFV PR   NP   V +P       L  +DK   +K+  L+ +    
Sbjct: 272 YGPHGSHRLLLEYGFVSPR---NPHACVYVPKDILVKYLPSTDKQMNKKISILKDHDFIE 328

Query: 360 SECFPIQITGW 370
           +  F     GW
Sbjct: 329 NLTF-----GW 334


>gi|308812738|ref|XP_003083676.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
 gi|116055557|emb|CAL58225.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
          Length = 483

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 123/256 (48%), Gaps = 30/256 (11%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE----VLKQCSVPDWPLLA-TY 155
           G+RG       RK   L+  P + V++A +    P+ GE    +++  ++PD  + A  +
Sbjct: 81  GKRGW-----FRKNRVLMTSPDAAVVSARTATMDPKLGEKYADLMRDGTLPDERVAAMVF 135

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ E    ++S W  YI ALPR   + L  +  EL+R L+ + + + A+ +   V   ++
Sbjct: 136 LMVERRKGEASAWGGYIDALPRSYDAPLSLSDVELERELKGTNVYDAAVAQRAKVREMFD 195

Query: 216 -DLR--LRIFSKYPDLFPEEVF-------NMETFKWSFGILFSRLVRLPSMD-GRV--AL 262
            ++R  +R  S+      +           ++ FKW+F   ++R + +P  D G V   +
Sbjct: 196 ENVRPAMRGLSEVAAASGDAKLATSLNNATIDEFKWAFQTFWTRALAIPVNDTGEVVEGI 255

Query: 263 VPWADMLNHS-----CEVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYGKKSNGELL 315
           VP  DM+NHS        E   D  +   GV+   +  ++   G+++FI YG+ S+  L 
Sbjct: 256 VPGIDMVNHSRTKANARWEHVDDNTRPDGGVIALVSNGKKLGHGDEIFIDYGESSSEALF 315

Query: 316 LSYGFVPREGTNPSDS 331
            ++GFVP +    SD 
Sbjct: 316 FTHGFVPEDDDTVSDG 331


>gi|146162512|ref|XP_001009518.2| SET domain containing protein [Tetrahymena thermophila]
 gi|146146406|gb|EAR89273.2| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 789

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 77/282 (27%), Positives = 132/282 (46%), Gaps = 18/282 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KWLSD+     K+ +   +   RG+ A + I+KGE +LF+P   +IT +     P   
Sbjct: 351 LLKWLSDTSSEFNKIKMVYYN-NYRGVHARQKIKKGECILFIPVDNMITLELSKELP-IC 408

Query: 139 EVLKQCSV----PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRY 193
           ++++  ++    P    L+ Y+I E    KS  W  ++  LP +  +  + +T  EL  +
Sbjct: 409 QLIESKNIRLLSPKHTFLSIYIIIEKKNHKSF-WKPFLDILPVEYTTFPILYTDEEL-FW 466

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           L+ S    +  ER   +   Y      I SK P+    ++  ++ F W+  +  SR+  L
Sbjct: 467 LKGSPFLNQVKERRECITQDYQ----AIVSKIPEF--AKLCTLDEFAWARMMAASRIYGL 520

Query: 254 PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
                R  A VP ADM NH     T   + +   G +       + G+Q++ S G+K N 
Sbjct: 521 FINKKRTDAFVPLADMFNHRRPAYTNWGFCEDKGGFMLKASEDIRRGDQIYYSCGRKCNS 580

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
             LL+YGFV +   N ++ ++L +   K D+    KL+ + K
Sbjct: 581 RFLLNYGFVVK--NNEANEIQLRVDFDKKDETLPIKLQMIGK 620


>gi|8393013|ref|NP_059134.1| SET domain-containing protein 4 isoform 1 [Homo sapiens]
 gi|12229715|sp|Q9NVD3.1|SETD4_HUMAN RecName: Full=SET domain-containing protein 4
 gi|7023055|dbj|BAA91819.1| unnamed protein product [Homo sapiens]
 gi|119630162|gb|EAX09757.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630163|gb|EAX09758.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
 gi|119630165|gb|EAX09760.1| SET domain containing 4, isoform CRA_b [Homo sapiens]
          Length = 440

 Score = 82.4 bits (202), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|397507017|ref|XP_003824008.1| PREDICTED: SET domain-containing protein 4 [Pan paniscus]
          Length = 440

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|332872029|ref|XP_001168891.2| PREDICTED: SET domain-containing protein 4 isoform 8 [Pan
           troglodytes]
 gi|410222532|gb|JAA08485.1| SET domain containing 4 [Pan troglodytes]
 gi|410259176|gb|JAA17554.1| SET domain containing 4 [Pan troglodytes]
 gi|410287500|gb|JAA22350.1| SET domain containing 4 [Pan troglodytes]
 gi|410336605|gb|JAA37249.1| SET domain containing 4 [Pan troglodytes]
          Length = 440

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|170588849|ref|XP_001899186.1| SET domain containing protein [Brugia malayi]
 gi|158593399|gb|EDP31994.1| SET domain containing protein [Brugia malayi]
          Length = 278

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 129/278 (46%), Gaps = 34/278 (12%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           + +   +W   +G     + I+   + G +GL A  + R+ E ++ +P  L+ITA     
Sbjct: 2   DCTGFMEWAVGNGAYHSGIDIRDCSNEGGKGLFATTDFRENETIISIPVGLIITAGFIAE 61

Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
            P+  +V K+  +  +  L  + + E   E++S+W+ Y+  LP+      + T A L   
Sbjct: 62  MPDYCDVFKRYCLKPFEALVYFFLVEK--EQNSKWTPYLEVLPKS-----FSTPASLHPS 114

Query: 194 LEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           L+         ++  +++        N+L++ ++ K+  +  +     + F W++ I+ +
Sbjct: 115 LKPEDFPYCLRKQWYVQK--------NELKI-MYEKFVTILADNTI-WDHFLWAWHIVNT 164

Query: 249 RLV----RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           R +    +L  +     D  +A+VP  DMLNHS + +    +D           R  + G
Sbjct: 165 RCIYRNNKLHPLIDNTEDDSLAIVPLIDMLNHSNDSQCCAIWDSKFNLYKVIVTRPIRKG 224

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           EQ+FI YG  +NG L + YGF  ++  N  D VE+ L 
Sbjct: 225 EQIFICYGSHTNGSLWIEYGFYLKD--NICDKVEISLG 260


>gi|198417784|ref|XP_002130734.1| PREDICTED: similar to SET domain-containing protein 4 [Ciona
           intestinalis]
          Length = 473

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 80/311 (25%), Positives = 134/311 (43%), Gaps = 42/311 (13%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--------SVPDWP 150
           D G RG++A   I +G+ +L +P + ++  +S ++  +  + +            +    
Sbjct: 55  DTG-RGMMAKTRICEGDVILSIPQAAMVGVNSAFNLSKFAQSISSVYHSMHDGLKLSGIQ 113

Query: 151 LLATYLISE----ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-RERAIE 205
           +L  +LI E       + SS W  Y+  LP+     LYW   E+    +  QI   + I+
Sbjct: 114 ILCIFLIEEKRKLGKNKPSSTWGYYVKVLPQTFTHPLYWEMEEIHTLPKQLQICVNKTID 173

Query: 206 RITNVIGTYNDL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-- 261
            +       N++  +L++ S   DL   E  +   ++W++  + +R V     D  +   
Sbjct: 174 CVKQQFKELNEMIKKLKLGS---DLNYHEEISWIEYRWAWCCVNTRCVYSTHDDPTIMKC 230

Query: 262 -----------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
                      LVP+ D+LNHS EV T  +++ +++     T  +++   QVFISYG  S
Sbjct: 231 CYQSSAADKYFLVPYLDLLNHSNEVNTKAEFNNTNKCFELRTHCKFKRFAQVFISYGALS 290

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK---------LEALRKYGLSASE 361
           N  LL+ YGFV +   N  D V L +    S   Y  K         L  L+KY L    
Sbjct: 291 NSTLLVEYGFVCKT-PNKHDVVALDVGHVLSFIKYSVKTAICPSNSLLNQLKKYDLDKGL 349

Query: 362 CFPIQITGWPL 372
            F I    W L
Sbjct: 350 AFTIHGPSWTL 360


>gi|332229557|ref|XP_003263953.1| PREDICTED: SET domain-containing protein 4 [Nomascus leucogenys]
          Length = 440

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 131/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQWECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|242081035|ref|XP_002445286.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
 gi|241941636|gb|EES14781.1| hypothetical protein SORBIDRAFT_07g007800 [Sorghum bicolor]
          Length = 490

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 120/244 (49%), Gaps = 9/244 (3%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPLLATYLISEAS 161
           RG+VA ++I  GE  L +P SL+I+ D      E    LK   S+    +L  + + E  
Sbjct: 180 RGMVASESIGVGEIALEIPESLIIS-DELLCQSEVFLALKDFNSITSETMLLLWSMRE-R 237

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
           +  +S++  Y   LP    + L +    L   LE + + +  ++   ++   Y++L   +
Sbjct: 238 YNLASKFKPYFDTLPANFNTGLSFGIDGL-AALEGTLLFDEIMQAKQHLRQQYDELFPLL 296

Query: 222 FSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNHSC--EVET 277
            + +P++F ++V   + F W+  + +S   +V L S      LVP A +LNHS    +  
Sbjct: 297 CTNFPEIFRKDVCTWDNFLWACELWYSNSMMVVLSSGKLSTCLVPVAGLLNHSVSPHILN 356

Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           +   D++++ + F   R    GEQ F+SYGK     L+  YGF+PR G NP D + L   
Sbjct: 357 YGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGSHLVTFYGFLPR-GDNPYDVIPLGCD 415

Query: 338 LKKS 341
           + +S
Sbjct: 416 IDES 419


>gi|145537195|ref|XP_001454314.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422069|emb|CAK86917.1| unnamed protein product [Paramecium tetraurelia]
          Length = 481

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/287 (26%), Positives = 136/287 (47%), Gaps = 31/287 (10%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPE 136
            L +WL D      K+ I+    G R L A + IR+GE +LF+P +  ++ +  K SC  
Sbjct: 42  NLIQWLKDGKAEVSKVQIEVKSEGYRTLRASQFIRQGEWVLFIPRTHYLSLEEVKKSCLI 101

Query: 137 AGEVLKQCSVPDWPLLATYLIS---EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
             ++++   +P+   + TY ++   + +  ++S W  YI  LP+       +  AE D  
Sbjct: 102 NRKMIQLNYIPN--NIQTYFVNHLLQENRRQNSFWKPYIDVLPKDVSGFPTYFDAEQDAL 159

Query: 194 LEAS-----QIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV-FNMETFKWSFGI 245
           L+ S      + +R I R       Y++L+  ++ F +Y   + + + F + T   SF  
Sbjct: 160 LKGSPTLFTVMNQRKIFR-----EEYDNLKEAVKEFQRYGYTYNDFIKFRILTISRSFP- 213

Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKS--SQGVVFTTDRQYQPGEQVF 303
                V +   + +  LVP AD +NH  +   FL Y  S  + G      R  Q GE++F
Sbjct: 214 -----VYIGENEQQQLLVPLADFVNH--DNNGFLQYGYSPDADGFFMQAVRNIQKGEELF 266

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
            +YG+ SN    ++YGF     TNP +  +  + L ++D+ +  K++
Sbjct: 267 YNYGQWSNKYFFMNYGFASL--TNPMNQFDFDVCLDRNDRLFNLKVD 311


>gi|167389227|ref|XP_001738871.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165897700|gb|EDR24782.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 791

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 82/316 (25%), Positives = 150/316 (47%), Gaps = 33/316 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           ++KW+  +G     + ++  +   RGL A K  ++ E ++ +P S+ I   +        
Sbjct: 4   IKKWVIQNGGIIDGVDVKTFEGYGRGLCANKEFKQDEIIMSIPYSIQINRIN------LN 57

Query: 139 EVLKQCSVPDWP-----------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            +  +  +P +            L+  YL    +  K   W  YI+ LP+     L +T 
Sbjct: 58  HIWPEVKLPKFNEGDDDRDDLNGLVYLYLAINKTNPKCFHWP-YINVLPKTYDCPLSYTI 116

Query: 188 AELDRYLEASQIRERAIERIT----NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
            EL+  ++ +++   A+E+I      V+  YN+  ++ F +Y   F +++F  +  +W+ 
Sbjct: 117 DELN-IMKGTKLY-VAVEKINAFLMKVVDYYNNKLIQQFPQYFQPF-DDLF--KRLQWAH 171

Query: 244 GILFSR--LVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPG 299
              +SR  LV  P   G V +L+P+ D  NH  + +     +  ++   F T+ +  +PG
Sbjct: 172 QSFWSRAFLVIYPQPFGEVGSLIPFCDFSNHCTQAKVTYISNTRTETFSFQTNEEVVKPG 231

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA 359
           EQ+F +Y  +SN +LLL YGFV  E  NP D++ L +  +  D  Y E  E L++  + +
Sbjct: 232 EQIFNNYRIRSNEKLLLGYGFV--EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKS 289

Query: 360 SECFPIQITGWPLELM 375
            + F       PLELM
Sbjct: 290 FDFFLKLDEDIPLELM 305


>gi|149742140|ref|XP_001496337.1| PREDICTED: SET domain-containing protein 4 [Equus caballus]
          Length = 440

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/308 (25%), Positives = 137/308 (44%), Gaps = 25/308 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL +       +   +     RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LKKWLKERKFEDMNLTPARFPGTGRGLMSKISLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L++E      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWQPPLSPLLALCTFLVAEKHAGDRSVWKPYLEVLPKA-YTCPVCLEPEVVDLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+ T +   +   R   FS    LF E V   F+   F W++  + +R V +
Sbjct: 153 KPLKAKAREQRTRLQAFFTSSR-DFFSSLRPLFSEAVESIFSYSAFLWAWCTVNTRAVYM 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                R         AL P+ D+LNHS +V+    +++ ++     T    +  E+VFI 
Sbjct: 212 KPRRRRCFSAEPDTYALAPYLDLLNHSPDVQVRAGFNEETRCYEIRTVSSCRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASEC 362
           YG   N  LLL YGFV     +    V   + +K    +DK  K+K+  L+ +    +  
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSKDILVKYLPSTDKQMKKKISILKDHDFIENLT 331

Query: 363 FPIQITGW 370
           F     GW
Sbjct: 332 F-----GW 334


>gi|297707870|ref|XP_002830708.1| PREDICTED: SET domain-containing protein 4 [Pongo abelii]
          Length = 440

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 132/295 (44%), Gaps = 22/295 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPQ 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 ---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                      +D   AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI
Sbjct: 212 RPRHRECLSAELDT-CALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRRHEEVFI 270

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
            YG   N  L L YGFV     +    V   + +K    +DK   +K+  L+ +G
Sbjct: 271 CYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSTDKQMDKKISILKDHG 325


>gi|190402231|gb|ACE77646.1| hypothetical protein [Sorex araneus]
          Length = 350

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 79/155 (50%), Gaps = 6/155 (3%)

Query: 231 EEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQG 287
           +E F  E ++W+   + +R  ++P+ DG    +AL+P  DM NH+  + T   Y+     
Sbjct: 11  KESFTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDR 69

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
                 + ++ GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  
Sbjct: 70  CECVALQDFRAGEQIYIFYGTRSNAEFVVHSGFFF--DNNSHDRVKIKLGVSKSDRLYAM 127

Query: 348 KLEALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
           K E L + G+  S  F + +T  P+     A+L V
Sbjct: 128 KAEVLARAGIPTSSVFALHVTELPISAQLLAFLRV 162


>gi|255080880|ref|XP_002504006.1| predicted protein [Micromonas sp. RCC299]
 gi|226519273|gb|ACO65264.1| predicted protein [Micromonas sp. RCC299]
          Length = 529

 Score = 81.3 bits (199), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 95/365 (26%), Positives = 158/365 (43%), Gaps = 55/365 (15%)

Query: 69  EIDSLENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKN-IRKGEKLLFVPPSLVI 126
           ++D+   A  L  WL+ + GLP    A + V  G+ G+  L N +R GE L+ +P +L +
Sbjct: 43  DVDT-RTARELVAWLTVEKGLP--GGAAKAVSFGDGGVAKLVNDVRAGEPLIEIPQNLAV 99

Query: 127 T----ADSKWSCPEA---GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP 179
           T    ADS      A   GE++          LA +L  E      S W+ Y++ LP   
Sbjct: 100 TSVDVADSPIVAGLAAGRGELVG---------LALWLCLERHKGPLSEWAPYVATLPSAG 150

Query: 180 YSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNM 236
               L WT  EL   L+ S +RE+A+ R+ +    Y  +  +I S   D  P+  E    
Sbjct: 151 SDHPLLWTAGELQTLLQGSPVREQAVSRLESADDEYASIADQIRSNPNDFPPDAYEFLTR 210

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADML-----------------------NHSC 273
           + F  +   + +R V L + +   A+VP  D+L                           
Sbjct: 211 DAFVDALATVLARAVWLNAANC-YAMVPLVDLLPLVGSPPPGVSPAAAAGGPAVGKPGLA 269

Query: 274 EVETFLDYDKSSQGV-VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
                +DYD +++ V V + +   Q    V +    ++ G+L L+ G V  + ++  D +
Sbjct: 270 AAAGVVDYDAATECVAVVSANDAQQTARVVCVDPLARNAGDLFLATGAV--DESHCGDYL 327

Query: 333 ELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWPLELMAYA-YLVVSPPS--MK 388
               S  ++D+ Y+ K + L   G+SA  + FP+     P++L+AY  +  V  P   M 
Sbjct: 328 AFAASCTQTDRLYEAKRQILEGMGMSADGQTFPVFADRMPMQLLAYMRFARVQDPGELMS 387

Query: 389 GKFEE 393
             FEE
Sbjct: 388 VSFEE 392


>gi|260831632|ref|XP_002610762.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
 gi|229296131|gb|EEN66772.1| hypothetical protein BRAFLDRAFT_91548 [Branchiostoma floridae]
          Length = 604

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 113/222 (50%), Gaps = 21/222 (9%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           +L+ +L+ E +  K S W  YI +LP    + +Y+T +EL+    +  ++E+A +    +
Sbjct: 238 VLSLFLLLEKNKGKDSFWYPYIRSLPNSFTTPVYFTESELNAL--SPSLQEKARDLKKEL 295

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMDGR----VA 261
           +  +NDL   + S  P+L  +  F  + F+W++ +L +R +     R P +  +      
Sbjct: 296 LHAFNDLEPFVTSCLPEL--DSTFTFDAFRWAWSVLKTRTLYQEDCRSPYLSNKEPQTST 353

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           LVP  D++NHS   +    Y+ ++          Y+  +QVFISYG + N EL+L +GF 
Sbjct: 354 LVPMLDLINHSPSAKARFGYNVNTSCYEVRVLEPYRKYDQVFISYGFEENTELMLKFGFF 413

Query: 322 PREGTNPSDSVELPLSL------KKSDKCYKEKLEALRKYGL 357
             E  NP D +++ LS       + +D+  K K++ L   GL
Sbjct: 414 VPE--NPKDFMKINLSEMLESLPQINDEERKNKVDLLFDSGL 453


>gi|357131865|ref|XP_003567554.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Brachypodium distachyon]
          Length = 316

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/187 (31%), Positives = 96/187 (51%), Gaps = 10/187 (5%)

Query: 158 SEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           S A+  K S W+ Y+ +LPR  Q +++++W   EL   +  S I + AIER    +  ++
Sbjct: 32  SAAATPKKSGWAPYVRSLPRNDQMHNMMFWDLNEL-HMVRISSICDEAIERRERAMKEFS 90

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
            ++  +   +P LF E    +E F  +  ++ SR  +       V+L+P+AD LNH    
Sbjct: 91  AVKPSL-ECFPHLFGE--IKLEDFMHASALVSSRAWQTSR---GVSLIPFADFLNHDGVS 144

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVEL 334
           ++ L YD         +DR Y  GEQV + YGK SN  L L++GF +PR   + + + E+
Sbjct: 145 DSILLYDGQKDIAEVISDRNYAVGEQVMVRYGKYSNAMLALNFGFTLPRNIYDQNGNREV 204

Query: 335 PLSLKKS 341
             S  K 
Sbjct: 205 KYSGGKG 211


>gi|114684050|ref|XP_001168792.1| PREDICTED: SET domain-containing protein 4 isoform 4 [Pan
           troglodytes]
 gi|410222534|gb|JAA08486.1| SET domain containing 4 [Pan troglodytes]
 gi|410259178|gb|JAA17555.1| SET domain containing 4 [Pan troglodytes]
 gi|410287502|gb|JAA22351.1| SET domain containing 4 [Pan troglodytes]
 gi|410336607|gb|JAA37250.1| SET domain containing 4 [Pan troglodytes]
          Length = 307

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 17/256 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFV 321
           YG   N  L L YGFV
Sbjct: 272 YGPHDNQRLFLEYGFV 287


>gi|55953063|ref|NP_001007260.1| SET domain-containing protein 4 isoform 2 [Homo sapiens]
 gi|12804091|gb|AAH02898.1| SET domain containing 4 [Homo sapiens]
 gi|119630161|gb|EAX09756.1| SET domain containing 4, isoform CRA_a [Homo sapiens]
          Length = 307

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 17/256 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGHRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFAEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPHVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFV 321
           YG   N  L L YGFV
Sbjct: 272 YGPHDNQRLFLEYGFV 287


>gi|145346652|ref|XP_001417799.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578027|gb|ABO96092.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 490

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/353 (25%), Positives = 145/353 (41%), Gaps = 52/353 (14%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KWL D+ +    + + +     RG+ A +++R GE ++ VP   V+T D+     E GE 
Sbjct: 33  KWLRDNKVELDGVEVARFARTGRGVRATRDLRVGEVVVSVPDDAVLTVDACAVKKELGEF 92

Query: 141 L----KQCSVP--DWPLLATYLISEASFEKSSRWSNYISALP---RQPYSLLYWTRAEL- 190
           +     +   P  D  LL   ++ E    KSS W  Y+  +    R  +S+L W   ++ 
Sbjct: 93  VGDGDDEAPSPRLDKELLVIAVMCEMCAGKSSAWCEYLETVHEAVRVGHSVLAWDDEQVT 152

Query: 191 -----DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFG 244
                D + +A +  +  ++        + ++    F  +P L     V  +    ++  
Sbjct: 153 ALFGTDAWRDAYENDDETLDLPMMTEEHFENVVTLFFKLFPKLASGLSVEALRELHFAAT 212

Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
            + +        D   A+VP+ DMLNH+  CE    L +D+ +  +   T R  + GE+V
Sbjct: 213 AMVAGYSFTLGDDEIQAMVPFWDMLNHAPPCEASVRLHHDQKNGCLQMITVRGVKKGEEV 272

Query: 303 FISYGKKSNGELLLSYGFV-PRE-----------------GTNPSDSVELPLSLKKSDKC 344
           F +YG   N ELL  YGFV PR                  G NP    ELPL        
Sbjct: 273 FNTYGPLRNAELLRRYGFVLPRNPHGGTTVGLAEVIQAAMGANPLVVDELPL-------- 324

Query: 345 YKEKLEALRKYGLSASEC---FPIQITGWPLE--LMAYAYLVVSPPSMKGKFE 392
              +L  L   GL+  E    F +  TG P +  L+A   L + P  M    E
Sbjct: 325 ---RLAWLESRGLADEELSTRFFVHRTGRPSDKLLIAMRLLTLKPEEMTALIE 374


>gi|443699166|gb|ELT98776.1| hypothetical protein CAPTEDRAFT_151537 [Capitella teleta]
          Length = 413

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 117/261 (44%), Gaps = 39/261 (14%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWS---CPEAGEVLKQCSVPDW-PLLATYLISEA 160
           +VA  +I +G+ +  +P SL++T  +        E  + L++ S   W PLL T +    
Sbjct: 1   MVATSDISQGDTIFEIPRSLLLTPQNSTIGVLLNEEADSLQEAS--RWVPLLITLMYEYT 58

Query: 161 SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
           S   SSRW  Y   +P   Q    ++W+  E+ R L+ + I       + N+   +NDL 
Sbjct: 59  S--PSSRWKPYFDLVPDFDQLDLPMFWSSDEVKRELKGTGIPSLVESDLLNISKEFNDLV 116

Query: 219 LRIFSKYPDLFPEEVFNMETFK--WSFGILFS---------------------RLVRLPS 255
           L    K+ ++F +E   ++ +K   +F + +S                      L+  P 
Sbjct: 117 LPFIQKHSNVFSDECKCLKFYKKMVAFVMAYSFTEPPPSPDLDDSDDLSGDEHDLMPQPM 176

Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
           M      VP AD+LNH  +    LD+ K S  +     +  Q GE++F +YG+ +N  LL
Sbjct: 177 M------VPMADILNHVAKNSARLDFPKGSSSLKMVATQDIQKGEEIFNTYGELANMNLL 230

Query: 316 LSYGFVPREGTNPSDSVELPL 336
             YGF    G N  D  E+P+
Sbjct: 231 HMYGFAEDIGCNEYDIAEIPV 251


>gi|355747383|gb|EHH51880.1| SET domain-containing protein 4 [Macaca fascicularis]
          Length = 440

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 130/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTINTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K     DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHG 325


>gi|145349778|ref|XP_001419305.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579536|gb|ABO97598.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 457

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 74/295 (25%), Positives = 124/295 (42%), Gaps = 41/295 (13%)

Query: 78  TLQKWLSDSGLPPQKM--AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
            L +W  + G     +  A++      RGL A + +R GE +L +  +  I  D+K    
Sbjct: 19  ALVRWCVERGARGSGLTVALETGAGAGRGLEATRALRAGEGVLELKLASGIVDDAKGHPE 78

Query: 136 EAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRY 193
            A + +K+     W + LA  L+ E    + S ++ Y   LP R P S +++     D  
Sbjct: 79  SARDAMKEAP---WGVRLACRLLQEKKLGEGSAYAAYARTLPERVPTSPIHY-----DEK 130

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
             A      A+  I  +          +  K P+   +  F+ E F  + G++ SR   +
Sbjct: 131 AIADVQYPPAMSEIREMQAACRKWHETLREKAPEALGDAYFDYEAFANAVGVVHSRTYGV 190

Query: 254 PSMDGRVA----LVPWADMLNHSCEVETFLDYDKSS----------------------QG 287
            S +        L+P ADMLNH  ++ T L  D+++                      +G
Sbjct: 191 ASAEDNAGYFRVLLPLADMLNHGGDIVTSLTRDETTGELTDMTTAATDNIAWSTLDAEEG 250

Query: 288 VV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
           V+ F   R  + GE+  +SYG++SN   L+ YGF P    NP D   L  +L+ +
Sbjct: 251 VIQFAATRDIEEGEEALMSYGERSNDHFLIYYGFAPD--NNPHDDCVLFSNLEHA 303


>gi|388452885|ref|NP_001253203.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|355560299|gb|EHH16985.1| SET domain-containing protein 4 [Macaca mulatta]
 gi|387541878|gb|AFJ71566.1| SET domain-containing protein 4 isoform 1 [Macaca mulatta]
          Length = 440

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 130/294 (44%), Gaps = 20/294 (6%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +A        RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LRKWLKARKFQDSNLAPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPK-AYTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 --------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                    +     AL P+ D+LNHS  V+    +++ +      T  +++  E+VFI 
Sbjct: 212 RPRQRECLSAEPDTCALAPYLDLLNHSPRVQVKAAFNEETHSYEIRTTSRWRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYG 356
           YG   N  L L YGFV     +    V   + +K     DK   +K+  L+ +G
Sbjct: 272 YGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMDKKISILKDHG 325


>gi|321462357|gb|EFX73381.1| hypothetical protein DAPPUDRAFT_58066 [Daphnia pulex]
          Length = 425

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 140/287 (48%), Gaps = 33/287 (11%)

Query: 70  IDSLENASTLQKWLSDSG--LPPQKMAIQK---VDVGERGLVALKNIRKGEKLLFVPPSL 124
           IDS      L KW+S +G     +   + K    +   RGL+A+ NI     L+ +P SL
Sbjct: 24  IDSHSEFVELCKWMSANGWNAVSKNCLVTKPALFNSTGRGLMAMSNIAPNHLLVQIPQSL 83

Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
           +IT +   +  E  ++L Q S+     L T+ I  + F  +  +S+YIS LP+  +S+  
Sbjct: 84  LITKEKVLA--EISDLL-QFSMTTAECL-TFFILNSKF--NGLYSSYISTLPK-SFSVGG 136

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
             +++    L  S ++E+ +     V+  Y     +IF+ +  ++   + ++E F+W++ 
Sbjct: 137 LCKSQEIAAL-PSFLQEKIMCNQNFVLKKYE----KIFAIWRKIYGSTL-SLELFQWAWF 190

Query: 245 ILFSRLV-------------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
            + +R V             ++  M+  +AL P+ DM NH  EV     ++K++Q     
Sbjct: 191 CVNTRAVFYQDSKQHSHGLNKVDGMENNMALAPYLDMFNHDAEVVVEAGFNKTTQCYEIR 250

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           +DR  +  +QVFI+YG   N +L L YGF+  +  N   +VE  + +
Sbjct: 251 SDRHIKKYQQVFINYGPHDNMKLFLEYGFLATK--NLHKAVEFDIDV 295


>gi|148671819|gb|EDL03766.1| SET domain containing 4, isoform CRA_a [Mus musculus]
          Length = 378

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/277 (27%), Positives = 126/277 (45%), Gaps = 25/277 (9%)

Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSR 167
           ++++G+ ++ +P S ++T D+       G  +K+   P  PLLA  T+L+SE      S 
Sbjct: 5   SLQEGQVMISLPESCLLTTDTVIR-SSLGPYIKKWKPPVSPLLALCTFLVSEKHAGCRSL 63

Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
           W +Y+  LP+  Y+       E+   L  S ++ +A E+   V   +   R   FS    
Sbjct: 64  WKSYLDILPKS-YTCPVCLEPEVVDLL-PSPLKAKAEEQRARVQDLFTSAR-GFFSTLQP 120

Query: 228 LFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCEVE 276
           LF E    VF+   F W++  + +R V L S             AL P+ D+LNHS  V+
Sbjct: 121 LFAEPVDSVFSYRAFLWAWCTVNTRAVYLRSRRQECLSAEPDTCALAPFLDLLNHSPHVQ 180

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
               +++ ++     T  + +  ++VFI YG   N  LLL YGFV     +    V   +
Sbjct: 181 VKAAFNEKTRCYEIRTASRCRKHQEVFICYGPHDNQRLLLEYGFVSVRNPHACVPVSADM 240

Query: 337 SLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
            +K    +DK    K+  L+ +G + +  F     GW
Sbjct: 241 LVKFLPAADKQLHRKITILKDHGFTGNLTF-----GW 272


>gi|226508108|ref|NP_001151788.1| SET domain containing protein [Zea mays]
 gi|195649689|gb|ACG44312.1| SET domain containing protein [Zea mays]
          Length = 536

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 143/313 (45%), Gaps = 20/313 (6%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           +L KW    G+   ++ I       RG++A ++I  G+  L +P  L+I+ D      E 
Sbjct: 156 SLLKWGEHLGIK-SRLQIAYFQGAGRGMIASESIGVGDIALEIPEFLIIS-DELLCQSEV 213

Query: 138 GEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
              LK  + +    +L  + + E  +   S++  Y   LP    + L +    L   LE 
Sbjct: 214 FLALKDFNNITSETMLLLWSMRE-RYNLGSKFKPYFDTLPANFNTGLSFGIDAL-AALEG 271

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           + + +  I+   ++   Y++L   + + +P++F ++V   + F W+  + +S  + +   
Sbjct: 272 TLLFDEIIQARQHLRQQYDELFPLLCTNFPEMFRKDVCTWDDFLWACELWYSNSMMIVLS 331

Query: 257 DGRVA--LVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
            G+++  LVP A +LNHS    +  +   D++++ + F   R    GEQ F+SYGK    
Sbjct: 332 SGKLSTCLVPVAGLLNHSVSPHILNYGRVDEATKSLKFPLSRPCDAGEQCFLSYGKHPGS 391

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--------LRKYGLSASECFP 364
            L+  YGF+PR G NP D + L L     D+    +  A        +R   LS S  FP
Sbjct: 392 HLVTFYGFLPR-GDNPYDVIPLDLDTSVDDEDIAAQSSATTSQTTHMVRGTWLSTSGGFP 450

Query: 365 IQITGWPLELMAY 377
               G P  L+ +
Sbjct: 451 TY--GLPQPLLTH 461


>gi|28393324|gb|AAO42088.1| unknown protein [Arabidopsis thaliana]
          Length = 543

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E  S L +W  D+G+   K+ I ++D   RG +A ++++ G+  L +P S +I+ +  ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKLGDVALEIPVSSIISEEYVYN 206

Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
                          +P+L T+  + SE         EK    S++  Y  +L     + 
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254

Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
           L +      ELD  L   E  Q +E   ER       Y++L + + S + ++FP E++  
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306

Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
           E + W+  + +S  +++   DG++   L+P A  LNHS    +  +   D  +  + F  
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            R    GEQ F+SYG  S+  LL  YGF+P+ G NP D + L   +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411


>gi|145344456|ref|XP_001416748.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576974|gb|ABO95041.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 515

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 164/384 (42%), Gaps = 41/384 (10%)

Query: 72  SLENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           + E+A  L  WLS D G+    +  ++   GE  +    ++  G ++L VP    +T+  
Sbjct: 45  TAEDARELAAWLSYDKGVDASGLVFKEGARGEVEVALRGDVDAGARVLAVPQDCAVTSVD 104

Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
             + P    + K    P+   LA +L +E     +S W+ Y+  L   P + L+WT AE 
Sbjct: 105 VDAHPIVSGLAK--GRPELVGLALWLCAERIKGGASDWAPYVKTLAANPDAPLFWTEAED 162

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILF 247
              L+ S I   A+ER  +    Y  + + +    P  FP E    F  E F  +   + 
Sbjct: 163 FALLKGSPIVNDAVERSRSAREEYAAI-VEVIKGDPTAFPAEAYEFFTEERFVDALATVC 221

Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCE--------------VETFLDYDKSSQGVVFTTD 293
           ++   LP+     ALVP  D++  +                     DYD  S  VV +  
Sbjct: 222 AKATWLPTASC-YALVPLLDVITIAGSPVPGVSPPSAKDGIARCAADYDVDSACVVLSAV 280

Query: 294 RQYQPGEQVF-ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
            +     +V  +   +++NGEL L+ G V ++  +P D + +   ++ SD+ +  K + L
Sbjct: 281 VKAPANSRVVQLDPLQRNNGELFLNTGRVDQK--HPGDYLYMRTEIQPSDRLFSAKKQVL 338

Query: 353 RKYGLSA-SECFPIQITGWPLELMAY-AYLVVSPPS--MKGKFEEMAAAASNKMTSKKDI 408
              G +A ++ FP+     P +L +Y  +  V  P   M   FEE      +K+ S  + 
Sbjct: 339 EGMGFTAENQYFPVYEDRMPTQLYSYLRFARVQDPGEMMAVSFEE------DKIVSVMN- 391

Query: 409 KCPEIDEQALQFILDSCESSISKY 432
                + + LQ ++  C   +S+Y
Sbjct: 392 -----EYEILQLLMGDCRELMSEY 410


>gi|79557522|ref|NP_179475.3| SET domain-containing protein [Arabidopsis thaliana]
 gi|56381987|gb|AAV85712.1| At2g18850 [Arabidopsis thaliana]
 gi|330251719|gb|AEC06813.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 543

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E  S L +W  D+G+   K+ I ++D   RG +A ++++ G+  L +P S +I+ +  ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYN 206

Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
                          +P+L T+  + SE         EK    S++  Y  +L     + 
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254

Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
           L +      ELD  L   E  Q +E   ER       Y++L + + S + ++FP E++  
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306

Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
           E + W+  + +S  +++   DG++   L+P A  LNHS    +  +   D  +  + F  
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            R    GEQ F+SYG  S+  LL  YGF+P+ G NP D + L   +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411


>gi|357615786|gb|EHJ69829.1| putative SET domain containing 3 [Danaus plexippus]
          Length = 489

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 80/312 (25%), Positives = 148/312 (47%), Gaps = 28/312 (8%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           WL + G   + + I + D    GL A K+  +G  +L VP  ++++       P+A ++ 
Sbjct: 89  WLHEHGAEFEGVEISEFDGYGFGLKATKDFSEGSLILTVPGKVMMSEKD----PKASDLS 144

Query: 142 KQCSVPDWPLL--------ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY 193
           +  ++   PLL        A +L+ E +   +S W  YI  LP +  ++LY+   EL   
Sbjct: 145 EFINID--PLLQNMPNVTLALFLLLEKN-NPNSFWKPYIDVLPEKYSTVLYFNSEELAE- 200

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFS-KYPDLFP-EEVFNMETFKWSFGILFSR-- 249
           L  S + E +++   +++  Y     +I +   P L   +++F  + ++W+   + +R  
Sbjct: 201 LRPSPVFESSLKLYRSIVRQYAYFYNKIHTIDLPVLKNLQDIFTFDNYRWAVSTVMTRQN 260

Query: 250 -LVRLPSMDGRVALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYG 307
            +V+  +     A +P  DM NH     T  D++ + ++G  +   + Y+  EQ+FI YG
Sbjct: 261 NIVQGTAFTLTNAFIPLWDMCNHKHGKIT-TDFNLELNRGECYAL-QDYRRDEQIFIFYG 318

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
            + N +L L  GFV  +  N  DS+ + L +  +D     K+  L K GLS    F +  
Sbjct: 319 ARPNSDLFLHNGFVYPD--NDYDSLSIALGISPNDALRNGKVNLLNKLGLSGVTNFSLYK 376

Query: 368 TGWPL--ELMAY 377
              P+  EL+A+
Sbjct: 377 GASPISVELLAF 388


>gi|334184301|ref|NP_001189551.1| SET domain-containing protein [Arabidopsis thaliana]
 gi|330251720|gb|AEC06814.1| SET domain-containing protein [Arabidopsis thaliana]
          Length = 536

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 134/286 (46%), Gaps = 43/286 (15%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E  S L +W  D+G+   K+ I ++D   RG +A ++++ G+  L +P S +I+ +  ++
Sbjct: 148 EKESKLVEWGQDNGVK-TKLQIAQIDGYGRGAIASEDLKFGDVALEIPVSSIISEEYVYN 206

Query: 134 CPEAGEVLKQCSVPDWPLLATY--LISEASF------EK---SSRWSNYISALPRQPYSL 182
                          +P+L T+  + SE         EK    S++  Y  +L     + 
Sbjct: 207 SDM------------YPILETFDGITSETMLLLWTMREKHNLDSKFKPYFDSLQENFCTG 254

Query: 183 LYW---TRAELDRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
           L +      ELD  L   E  Q +E   ER       Y++L + + S + ++FP E++  
Sbjct: 255 LSFGVDAIMELDGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHREVFPPELYTW 306

Query: 237 ETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSC--EVETFLDYDKSSQGVVFTT 292
           E + W+  + +S  +++   DG++   L+P A  LNHS    +  +   D  +  + F  
Sbjct: 307 EHYLWACELYYSNSMQIKFPDGKLKTCLIPVAGFLNHSIYPHIVKYGKVDIETSSLKFPV 366

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            R    GEQ F+SYG  S+  LL  YGF+P+ G NP D + L   +
Sbjct: 367 SRPCNKGEQCFLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 411


>gi|390367697|ref|XP_787519.3| PREDICTED: N-lysine methyltransferase setd6-like
           [Strongylocentrotus purpuratus]
          Length = 466

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 113/256 (44%), Gaps = 32/256 (12%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-----SVPDWPLLATYLIS 158
           G++AL +I KGE L  VP S+++   +    P   + L++      +   W  L   ++ 
Sbjct: 62  GMIALDDISKGETLFTVPRSVLLHPAT--CSPVVAQRLEEDEDSLETESGWVPLILAVMY 119

Query: 159 EASFEKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
           E +  +SSRW  Y+   P      QP   ++W    +   L  + I E     + N+   
Sbjct: 120 EHT-NRSSRWRPYLDLFPDYSELDQP---MFWDSNYMQPELRGTGIAEAVQRDLRNIDRD 175

Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS-RLVRLPSMDGRVA----------- 261
           Y+D+ L    K  DLF EE  N++ +K +   + +      P  D               
Sbjct: 176 YHDVALPFIKKNADLFSEEKHNLDLYKRTVSFIMAYSFTESPDYDEDDDDSDDDDEETHP 235

Query: 262 --LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
             +VP AD LNH  +    L + K S  +V T D   + G +VF +YG+ +N +LL  YG
Sbjct: 236 PMMVPLADALNHIAKNNAQLKFGKESLRMVATED--IKKGSEVFNTYGEIANWQLLHMYG 293

Query: 320 FVPREGTNPSDSVELP 335
           F      N  D+V++P
Sbjct: 294 FAEEYPENIYDTVDIP 309


>gi|260835045|ref|XP_002612520.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
 gi|229297897|gb|EEN68529.1| hypothetical protein BRAFLDRAFT_214305 [Branchiostoma floridae]
          Length = 287

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 114/247 (46%), Gaps = 24/247 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK-QCSVPDWPLLATYLISEAS 161
           RG++A K ++  E +L +P  L+IT D+      A  + +    +     LA +L+ E  
Sbjct: 48  RGMMATKALKHEELMLVIPQRLLITMDAIMDSYIAPYIERADPRLTPTQALAVFLMCEKY 107

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
             + S W  YI  LP +     ++T  E D  L  + +R +A  +       Y +L    
Sbjct: 108 RREKSFWRPYIDILPEEYSCPTFFT--EDDFRLLPNSLRGKAKAKKYECHKEYKEL-APF 164

Query: 222 FSKYPDLFP--EEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------------LVPW 265
           F    DLFP  E+ FN + FKW++  + +R + +P   GR +              + P 
Sbjct: 165 FKMLADLFPDQEDAFNFKDFKWAWSAIKTRALDVPI--GRESCRHLRDAEDTPTPTMFPL 222

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            D +NH+ + +    Y++ S+ +   T+  Y+   +V  SYG+  N  LLL +GFV    
Sbjct: 223 VDSINHAAQAKIRHRYNEKSRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFV--VP 280

Query: 326 TNPSDSV 332
            NP D+V
Sbjct: 281 GNPEDTV 287


>gi|260822399|ref|XP_002606589.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
 gi|229291933|gb|EEN62599.1| hypothetical protein BRAFLDRAFT_277814 [Branchiostoma floridae]
          Length = 459

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 72/274 (26%), Positives = 125/274 (45%), Gaps = 28/274 (10%)

Query: 85  DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC 144
           D  L P+    ++    + G+VA + + +GE L  V  S V++ ++     E   +LK+ 
Sbjct: 33  DFQLNPKVHVGREGSCAQYGMVAQEELEEGECLFKVDKSAVLSTETT----EIAHLLKEE 88

Query: 145 SV---------PDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRY 193
           +            W      L+ E +   +SRW  Y+  +P   Q    ++WT  E++R 
Sbjct: 89  TSLHGDSLHGDSGWVPQILALMYEYT-NPNSRWRPYLQLVPDFSQLDQPMFWTEDEIERD 147

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           L  + I E +   +T +   Y  L L    K+  +F EEV + E +K     + +     
Sbjct: 148 LCNTGIPEASSSDLTKMKLEYTSLALPFIRKHRHIFSEEVHSFELYKRMVAFIMAYSFFE 207

Query: 254 PSMDGRVA---------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
           P ++GR           +VP AD+LNH  +    L++D     +V  T R    GE+VF 
Sbjct: 208 P-VNGREDEGGKSSLPLMVPMADILNHVAKNNAQLEWDADCLRMV--TTRTVAAGEEVFN 264

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           ++G+ +N +LL  YGF      N  D+V++P+ +
Sbjct: 265 TFGQLANWQLLHMYGFAEAWPENIYDTVDIPMQV 298


>gi|440802833|gb|ELR23759.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 518

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 56/108 (51%), Gaps = 6/108 (5%)

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---- 274
           + IF  YPD+F   V   +   W+F  ++SR   L   D   A+VP ADMLNH+ E    
Sbjct: 196 ISIFKDYPDMFSPAVHTCDELMWAFATIWSRGYWLDGDDTMPAIVPLADMLNHNTEKGGE 255

Query: 275 -VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            V  +  YD  +Q     +   Y+PG+QV   YG K+NG  L  YGFV
Sbjct: 256 RVAHYF-YDADAQIFKVISKTSYEPGQQVLTHYGNKANGNFLEDYGFV 302


>gi|57899520|dbj|BAD87034.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
 gi|57899939|dbj|BAD87851.1| SET domain-containing protein-like [Oryza sativa Japonica Group]
          Length = 509

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 75/257 (29%), Positives = 110/257 (42%), Gaps = 32/257 (12%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           R L A + I++G+ ++ VP  + +T D     P+    L   +V D   LA  LI E   
Sbjct: 59  RSLFASEPIQEGDCIMQVPYHVQLTLDK---LPQKFNTLLDHAVGDTSKLAALLIMEQHL 115

Query: 163 EKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
              S W+ YI +LP   Q ++++ W   EL   ++ S I + AIE        +  L+  
Sbjct: 116 GNESGWAPYIKSLPTKDQMHNMVLWDLNEL-HAVQNSSIYDEAIEHKEQAKKEFLALKPA 174

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +   +P LF E                   V+L       AL    D LNH     + L 
Sbjct: 175 L-DHFPHLFGE-------------------VKLGDFMHASAL----DFLNHDGVFGSVLI 210

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           YD+         DR Y  GEQV I YGK SN  L L++GF      N  D   + + +  
Sbjct: 211 YDEQKDVCEIIADRNYAVGEQVMIRYGKYSNATLALNFGFT--LARNIYDQALIRIDMPV 268

Query: 341 SDKCYKEKLEALRKYGL 357
            D  YK+KL+  +K+ L
Sbjct: 269 QDPLYKKKLDIWQKHRL 285


>gi|52545671|emb|CAH56365.1| hypothetical protein [Homo sapiens]
          Length = 380

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 6/152 (3%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
           F  E ++W+   + +R  ++P+ DG    +AL+P  DM NH+  + T   Y+        
Sbjct: 25  FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCEC 83

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
              + ++ GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E
Sbjct: 84  VALQDFRAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAE 141

Query: 351 ALRKYGLSASECFPIQITGWPLELMAYAYLVV 382
            L + G+  S  F +  T  P+     A+L V
Sbjct: 142 VLARAGIPTSSVFALHFTEPPISAQLLAFLRV 173


>gi|41054567|ref|NP_955894.1| N-lysine methyltransferase setd6 [Danio rerio]
 gi|82177062|sp|Q803K4.1|SETD6_DANRE RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|27882107|gb|AAH44440.1| SET domain containing 6 [Danio rerio]
          Length = 460

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 66/254 (25%), Positives = 115/254 (45%), Gaps = 22/254 (8%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDW-PLLATYLI 157
            E G++A ++I +G  +LF  P   +         +  E  K+C  S   W PLL + + 
Sbjct: 47  AEYGMLAKEDIEEGH-VLFTIPREALLHQGTTKVKKVLEEGKKCLESASGWVPLLLSLMY 105

Query: 158 SEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
              S   +S W  Y+S  P  R     ++W+  E D+ L+ + I E  I  +  +   YN
Sbjct: 106 EYTS--STSHWKPYLSLWPDFRTLDQPMFWSEEECDKLLKGTGIPESVITDLRKLQDEYN 163

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA------------LV 263
            + L     +PDL+  E  N+E +K     + +   + P  D                +V
Sbjct: 164 SVVLPFMKSHPDLWDPEKHNLELYKSLVAFVMAYSFQEPVEDDDEDEEDDEKKPNLPMMV 223

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
           P ADMLNH  +    L+Y  + + +   + R+   GE+VF +YG+ +N +LL  YGF   
Sbjct: 224 PMADMLNHISKHNANLEY--TPECLKMVSIRRIGKGEEVFNTYGQMANWQLLHMYGFAEP 281

Query: 324 EGTNPSDSVELPLS 337
              N +++ ++ ++
Sbjct: 282 FPNNINETADIKMA 295


>gi|281207968|gb|EFA82146.1| hypothetical protein PPL_04566 [Polysphondylium pallidum PN500]
          Length = 510

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 101/217 (46%), Gaps = 25/217 (11%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKV-------DVGER-----------GLVALKNIRKGE 115
           E    + KWL D+G+      I+ V       DV  +           G++AL++++   
Sbjct: 10  EQLDIVVKWLDDNGVKINHKLIEIVCQKQSVDDVTNKNTPHEQVVEGLGVIALQDLKIDH 69

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISAL 175
            +  +P S ++T  +          LK+  + D    +  L+ EAS    S+W  YI +L
Sbjct: 70  TVAIIPKSCLLTPHT----TSISAYLKKYKIKDATATSIALLYEASIGSQSKWYGYIKSL 125

Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND-LRLRIFSKYPDLFPEEVF 234
           P      + W  A+L + L+ + I     E    V  TYN  ++ ++ + +PD+F E VF
Sbjct: 126 PLSVDLPILWNDADL-KNLKGTSIETVVYENKETVDATYNKYIKSKLIANHPDVFNEHVF 184

Query: 235 NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
           +++ FK +  ++ SR   + +  G  ++VP AD+ NH
Sbjct: 185 SLDNFKRASCLVSSRAFNIDTYHGD-SMVPLADIFNH 220


>gi|195396323|ref|XP_002056781.1| GJ16703 [Drosophila virilis]
 gi|194146548|gb|EDW62267.1| GJ16703 [Drosophila virilis]
          Length = 539

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 32/301 (10%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
           E   L   +   +W    G+    + I      + GL   +++ +GE +L VP  L+   
Sbjct: 112 EQKRLAKVAAFNEWARAGGVQTDCVEITTFPGYQLGLRVTRDLAEGELVLTVPRQLIF-- 169

Query: 129 DSKWSCPEAGEVLKQCSVPDWP--LLATY-LISEASFEKSSRWSNYISALPRQPYSLLYW 185
            S+   PEA   L      D+P  L  TY LI E     +S W  +I  LP +  ++LY+
Sbjct: 170 -SEELLPEAQRKL----FIDFPTHLNVTYMLIIEKVRGAASNWQPFIDTLPTRYNTVLYF 224

Query: 186 TRAELDRYLEASQ----IRE-RAIERITNVI--GTYNDLRLRIFSKYPDLFPEEVFNMET 238
           T  ++ R    S     +R  R I RI   +    Y      + +   +LF E     E 
Sbjct: 225 TVEQMQRLRGTSACSAAVRHCRVIARIYASMYKCAYMQPDDSVMAGMANLFTEYGLCYEL 284

Query: 239 FKWSFGILFSRLVRLPSM-----DG-----RVALVPWADMLNHSC-EVETFLDYDKSSQG 287
           ++W+   + +R   +P       DG       AL+P+ DM NH C ++ ++  Y  S+Q 
Sbjct: 285 YRWAVSTVTTRQNLVPRQLATDSDGVRNSPMSALIPFWDMANHRCGKITSY--YKPSAQQ 342

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
           +       ++ GEQ FI YG + N + L+ +GF+  +  N  D V + L L  +D   ++
Sbjct: 343 MECIAQEAFKAGEQFFIYYGDRCNADRLVHHGFL--DMNNLKDYVHIRLGLSPTDALAEQ 400

Query: 348 K 348
           +
Sbjct: 401 R 401


>gi|452825744|gb|EME32739.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 1 [Galdieria
           sulphuraria]
          Length = 487

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 79/298 (26%), Positives = 128/298 (42%), Gaps = 32/298 (10%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           WL+   +   K+ + +   G RG+VA++ I   E  L VP  L +       C  +  V 
Sbjct: 80  WLTRENVYMPKIKLDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQVTEHEECTMSEFVD 139

Query: 142 KQC-SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYS-LLYWTRAELDRYLEASQ 198
            +  S  +W +  +  + +  +    S W  YI  LP    + L+YW+ +EL      +Q
Sbjct: 140 PELWSQENWYVKLSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSEL------AQ 193

Query: 199 IRERAIERITNVIGTYND-LRLRIFSKYPDLFPEEVF----NMETFKWSFGILFSRLVRL 253
           ++ R +     +   Y + L  R+F       P  V+        F W+  ++ SR   +
Sbjct: 194 LQYRPLIEEVKINQYYREALYTRVFESLSS--PVRVWLQNEKENVFFWALDMVQSRAFGI 251

Query: 254 PSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
           P +  +  AL+P  DMLNH    +T   YD  +      T  +  PG  ++ISYG   N 
Sbjct: 252 PDVGNKTYALLPMMDMLNHRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDND 311

Query: 313 ELLLSYGFVPREGTNPSDSVE-------LPLSLKKSD------KCYKEKLEALRKYGL 357
            LL  YGF+  +  NPSD  +       L L  ++ +         +EKL  LRKY +
Sbjct: 312 HLLHFYGFL--QTNNPSDYFQVKDIFQWLHLMYEQEEWQAQPSHLLEEKLSLLRKYHI 367


>gi|260819628|ref|XP_002605138.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
 gi|229290469|gb|EEN61148.1| hypothetical protein BRAFLDRAFT_122719 [Branchiostoma floridae]
          Length = 453

 Score = 78.2 bits (191), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 68/250 (27%), Positives = 114/250 (45%), Gaps = 22/250 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS-VPDWPLLATYLISEAS 161
           RGL+A K ++  E +L +P  L+IT D+      A  + +  S +     LA +L+ E  
Sbjct: 57  RGLMATKALKHEELILVIPKRLLITIDAIMDSYLAPYIERADSQLTPSQALAVFLMCEKC 116

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
             + S W  YI  LP +     ++T  E D  L  + +R +A  +       + +L    
Sbjct: 117 RREKSFWRPYIDILPEEYTCPAFFT--EEDFRLLPNSLRGKAKAKKYECHKEFMEL-APF 173

Query: 222 FSKYPDLFP--EEVFNMETFKWSFGILFSRLVRLPSMDGRV-------------ALVPWA 266
           F    DLFP  E+ FN + FKW++  + +R   +P + G                + P  
Sbjct: 174 FKMLADLFPDQEDAFNFKDFKWAWSAIKTRAFDVP-LGGETCYRLRDSEDTSNPTMFPLV 232

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           D +NH+ + +    Y++  + +   T+  Y+   +V  SYG+  N  LLL +GFV     
Sbjct: 233 DSINHAAQAKIRHRYNEKRRCLESRTETVYRRHAEVMNSYGRADNDNLLLEFGFVV--PG 290

Query: 327 NPSDSVELPL 336
           NP+D+V   L
Sbjct: 291 NPADTVTFHL 300


>gi|422293951|gb|EKU21251.1| hypothetical protein NGA_2061300, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 452

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 154/347 (44%), Gaps = 24/347 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEAS 161
           GLVA   I++GE L  VP +L  + DS  + P  G+V+ +   ++ D  L+A  L+ EA 
Sbjct: 94  GLVATAPIKQGETLATVPLNLCFSMDSVRASP-LGKVIGEFEPALGDASLIALQLLYEAH 152

Query: 162 FEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
               S+++ YI +LPR         L+W+ AE    L  S  R      I  V   Y  +
Sbjct: 153 MGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE-QGVLAKSSTRNLGETLIDAVAEDYGWI 211

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN--HSC-- 273
           +  +          + F++  F+W+  ++ SR       +  + L P  DM N    C  
Sbjct: 212 QSALARGGISGLQADSFDLSDFEWAVAVVLSRSFF---AENGLRLAPLLDMANRGEGCTN 268

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDS 331
           E +         +G+    DR    G+++ ISYG KS  E L  +GFV  P EG N    
Sbjct: 269 EPQIGGLGIFGGKGLKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFVPPPLEG-NALVG 327

Query: 332 VELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
               L+ K S   D+ Y +K + +   GL  +  F ++  G  ++     +L       K
Sbjct: 328 GMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDG-DVDPEMLQFLRFLKLGGK 386

Query: 389 GKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSR 434
             F  + A   N++    ++   E +EQA+ + I+D+C  +++ ++ 
Sbjct: 387 DAF-LLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAE 432


>gi|148908465|gb|ABR17345.1| unknown [Picea sitchensis]
          Length = 350

 Score = 78.2 bits (191), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 106/219 (48%), Gaps = 21/219 (9%)

Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ--- 295
           F W+FGIL SR    P +   +A+VP+AD++NH   +   ++     + V    DRQ   
Sbjct: 72  FLWAFGILRSRAFP-PFIGDNLAMVPFADLVNHGFSIN--VEEPSWERKVTGLFDRQEAL 128

Query: 296 -------YQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPS--DSVELPLSLKKSDKCY 345
                  ++ GEQV + YG  KSNG+L L YGFV R   N S  D   L L + +SD  +
Sbjct: 129 TMRAPAAFRTGEQVLMQYGMNKSNGQLALDYGFVERNRKNGSNRDIFTLTLEISESDPFF 188

Query: 346 KEKLEALRKYGLSASECFPI-QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTS 404
            +KL+     G+  +  F I Q  G P  ++ +  L+    +     E +   +  +  S
Sbjct: 189 ADKLDIAELNGMETTAYFDITQGQGVPESMLTFLRLIALGGTDAFLLEPLFRDSVWEHLS 248

Query: 405 KKDIKCPEIDEQAL-QFILDSCESSISKYSRFLQVKELL 442
              +   + +E A+ + +LD C+S++S Y   ++  E L
Sbjct: 249 ---LPVSQENEAAICKVVLDGCQSTLSGYGTTIEEDEAL 284


>gi|387193935|gb|AFJ68731.1| hypothetical protein NGATSA_2061300, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 446

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 154/347 (44%), Gaps = 24/347 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEAS 161
           GLVA   I++GE L  VP +L  + DS  + P  G+V+ +   ++ D  L+A  L+ EA 
Sbjct: 88  GLVATAPIKQGETLATVPLNLCFSMDSVRASP-LGKVIGEFEPALGDASLIALQLLYEAH 146

Query: 162 FEKSSRWSNYISALPRQPYSL----LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
               S+++ YI +LPR         L+W+ AE    L  S  R      I  V   Y  +
Sbjct: 147 MGPKSKYAVYIKSLPRPGQDGFDHPLFWSTAE-QGVLAKSSTRNLGETLIDAVAEDYGWI 205

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN--HSC-- 273
           +  +          + F++  F+W+  ++ SR       +  + L P  DM N    C  
Sbjct: 206 QSALARGGISGLQADSFDLSDFEWAVAVVLSRSFF---AENGLRLAPLLDMANRGEGCTN 262

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDS 331
           E +         +G+    DR    G+++ ISYG KS  E L  +GFV  P EG N    
Sbjct: 263 EPQIGGLGIFGGKGLKVIADRDTDKGQEIVISYGPKSGIEFLEDHGFVPPPLEG-NALVG 321

Query: 332 VELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
               L+ K S   D+ Y +K + +   GL  +  F ++  G  ++     +L       K
Sbjct: 322 GMCSLTFKISPEGDRFYDDKEDVMGTLGLPMAFSFDVRSDG-DVDPEMLQFLRFLKLGGK 380

Query: 389 GKFEEMAAAASNKMTSKKDIKCPEIDEQAL-QFILDSCESSISKYSR 434
             F  + A   N++    ++   E +EQA+ + I+D+C  +++ ++ 
Sbjct: 381 DAF-LLEAVFRNEVWRFMELPVSEANEQAVDEVIIDACTQALANFAE 426


>gi|219126444|ref|XP_002183467.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405223|gb|EEC45167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 519

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/295 (27%), Positives = 133/295 (45%), Gaps = 22/295 (7%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVIT 127
           E D L +++T    + D       + +   +V E RG+ A ++I      + +P   +IT
Sbjct: 29  EYDCLNSSNTSASEVDDEEKKESPLTVSLEEVSEMRGVHARRSIPPHTTCVSIPRRCLIT 88

Query: 128 ADSKWSCPEAGEVLK---QCSVPDWPLLATYLI-SEASFEKSSRWSNYISALPRQPYSL- 182
            +   + P    +L+       P    L  YL+    +   SS +  Y   LP    ++ 
Sbjct: 89  VEMGQATPIGRAILQADLDLDAPKHIFLMIYLLWDRKTHGSSSFFHPYYEILPPTLRNMP 148

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           ++W+  EL   LE S +  +  +R   +   Y      I    P L    +  ++ FKW+
Sbjct: 149 IFWSAFELQE-LEGSHLLSQIADRGQAIQDDYE----AILEVAPSLGT--LCTLDEFKWA 201

Query: 243 FGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
              + SR   L  +DG    ALVP ADMLNH    ET   +D+ +Q    T+ +  Q G 
Sbjct: 202 RMCVCSRNFGL-QIDGHRTSALVPHADMLNHYRPRETKWTFDEVTQCFTITSLQSIQAGA 260

Query: 301 QVFISYGKKSNGELLLSYGFVPR-----EGTNPSDSVELPLSLKKSDKCYKEKLE 350
           QV+ SYG+K N   LL+YGF        +G  P++ V L L +  +D  +++KLE
Sbjct: 261 QVYDSYGQKCNHRFLLNYGFAVEDNRELDGFCPNE-VPLELYVDPADILFQDKLE 314


>gi|432862431|ref|XP_004069852.1| PREDICTED: N-lysine methyltransferase setd6-like [Oryzias latipes]
          Length = 450

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 119/256 (46%), Gaps = 22/256 (8%)

Query: 96  QKVDVGERGLVALKNIRKGEKLLFVPPSLVI----TADSKWSCPEAGEVLKQCSVPDWPL 151
           Q+  V + G++A  +I +GE L  +P S ++    TA S     EA  +  Q S    PL
Sbjct: 39  QEGTVADYGMLAKADIEEGEVLFTIPRSALLHQRTTAVSALLQKEAASL--QSSSCWVPL 96

Query: 152 LATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
           L   L    S    S W  Y+S  P  R+    ++W++ E DR L  + + E   + ++N
Sbjct: 97  LLALLYEYTS--PQSDWKPYLSLWPDLRRLDHPMFWSKEERDRLLRGTGVPEAVDKDLSN 154

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS----------MDGR 259
           +   Y D+ L   +++PDL+  +   +E +      + +   + P               
Sbjct: 155 IQREYEDVVLPFMTRHPDLWNPKTHTLELYTELVAFVMAYSFQEPQEDEDDDEEEKPPNP 214

Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
             +VP ADMLNH  +    L++  S+  +   + R+   GE+VF +YG+ +N +LL  YG
Sbjct: 215 PMMVPMADMLNHVSDHNANLEF--SADSLKMVSVRRIHAGEEVFNTYGQMANWQLLHMYG 272

Query: 320 FVPREGTNPSDSVELP 335
           F      N +++ ++P
Sbjct: 273 FTEPYPNNSNETADIP 288


>gi|354502761|ref|XP_003513450.1| PREDICTED: SET domain-containing protein 4 [Cricetulus griseus]
          Length = 440

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 120/261 (45%), Gaps = 27/261 (10%)

Query: 79  LQKWLS-----DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           L++WL      D+GL P          G RGL++   +++G+ ++ +P S ++T ++   
Sbjct: 34  LRRWLKGRKFEDTGLVPACFP----GTG-RGLMSKTALQEGQMIISLPESCLLTTNTVIR 88

Query: 134 CPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
               G  +K+   P  PLLA  T+LISE      S W +Y+  LP+  Y+       ++ 
Sbjct: 89  -SSLGPYMKKWKPPPSPLLALCTFLISERHAGGQSLWKSYLDILPKS-YTCPVCLEPDVV 146

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFS 248
             L    ++ +A E+  +V   +   R   FS    LF E V   F+   F W++  + +
Sbjct: 147 DLL-PQPLKAKAEEQRADVQDFFASSR-AFFSTLQPLFVEPVDGIFSYSAFLWAWCTVNT 204

Query: 249 RLVRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
           R V L S             AL P+ D+LNHS  V+    + + +      T  + +  E
Sbjct: 205 RAVYLRSTRQECLSAEPDTCALAPYLDLLNHSPHVQVKAAFSEKTGCYEIRTASRCRKHE 264

Query: 301 QVFISYGKKSNGELLLSYGFV 321
           QVFI YG   N  LLL YGFV
Sbjct: 265 QVFICYGPYDNQRLLLEYGFV 285


>gi|308809221|ref|XP_003081920.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116060387|emb|CAL55723.1| N-methyltransferase (ISS) [Ostreococcus tauri]
          Length = 403

 Score = 77.8 bits (190), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 67/262 (25%), Positives = 114/262 (43%), Gaps = 37/262 (14%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA  L+ + +   S+RW  Y  ALP    SL+ W+  EL+  L+ S +R+RA+ R     
Sbjct: 49  LAVALMQQTNGGASARWRAYCDALPAAVDSLMMWSDEELE-VLQGSALRQRAVFRRDLCK 107

Query: 212 GTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
             Y+ L   +    P+ F + E ++ + F+W++  + +R   LP +   +AL+P  D+ N
Sbjct: 108 REYDALFPALARADPETFGDVEAYSFDVFRWAYATVMARAFVLPDLQC-MALLPGLDIYN 166

Query: 271 H--------------SCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFISYGKKSNGELL 315
                          +CEV+    +D+S   V         Q G Q+F  Y   ++G  L
Sbjct: 167 SARDAEKCVVERDEGACEVDDSSSFDESEARVTLRVGVGGVQAGSQLFHDYADHASGGAL 226

Query: 316 LSYGFV---PREGTNPSDSVELPLS-----LKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           L +GFV    RE  +  D++++ L      L    + +    +   K+ +  S  F I  
Sbjct: 227 LEFGFVYHGERERGSGVDALDVCLKPALARLDARSRAFLVDEDVFHKFNVRKSLTFEISN 286

Query: 368 TGWPLELMAYAYLVVSPPSMKG 389
            G            V  P++KG
Sbjct: 287 VGG-----------VYKPALKG 297


>gi|259155405|ref|NP_001158764.1| N-lysine methyltransferase setd6 [Salmo salar]
 gi|325530257|sp|C0H8I2.1|SETD6_SALSA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|223647186|gb|ACN10351.1| SET domain-containing protein 6 [Salmo salar]
 gi|223673059|gb|ACN12711.1| SET domain-containing protein 6 [Salmo salar]
          Length = 449

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 69/271 (25%), Positives = 122/271 (45%), Gaps = 31/271 (11%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSK---WSCPEAGEVLKQCSVPDWPLLATYL 156
           V E G++A ++I +GE LLF  P + +         +  E G+   + +    PLL   +
Sbjct: 42  VAEYGMLAKEDIDEGE-LLFTIPRMALLHQGTTKVLAVLEEGKASLENTSGWVPLLLALM 100

Query: 157 ISEASFEKSSRWSNYIS------ALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
               S    S W  Y+S      AL       ++W++ E DR L+ + I E     +TN+
Sbjct: 101 YEYTS--PQSHWRPYLSLWSDFTALDHP----MFWSKDERDRLLKGTGIPEAVDTDLTNI 154

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------L 262
              Y D+ L   + +PDL+  E   ++ ++     + +   + P  +            +
Sbjct: 155 QKEYKDIVLPFITLHPDLWDPERHTLDLYRSLVAFVMAYSFQEPLDEEDEDEKDPNPPMM 214

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
           VP ADMLNH       L+Y  + + +   + R  + GE+VF +YG+ +N +LL  YG   
Sbjct: 215 VPIADMLNHVSNHNANLEY--TPECLKMVSVRSIRKGEEVFNTYGQMANWQLLHMYGLXE 272

Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
              +N +D+ ++P+S       YK  ++  R
Sbjct: 273 PYQSNSNDTADIPMS-----NVYKAAVQVTR 298


>gi|384246211|gb|EIE19702.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 503

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 125/263 (47%), Gaps = 23/263 (8%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           +E    L  W+   GLP +K+ ++ ++  G+  LV  K  +KG+ L+ VP S  +T    
Sbjct: 49  VETLPPLSAWVEQRGLPLKKLNVRPEIVEGDLCLVVSKPTKKGQPLVAVPSSAWLTQQVV 108

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            S    G +++   +  W  +A +L+ E S +  + W  ++ ++P  P   L+W+  EL 
Sbjct: 109 RSS-SIGSLVE--DLEPWLQIALFLLHERS-KPDAAWQGFLDSIPAAPDVPLFWSEEELS 164

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           + LE +Q+              Y +L  ++F+ + + FP +   ++ F W+   + SR V
Sbjct: 165 Q-LEGTQLLSSVQGYRQFFEAKYAELEEQLFAPHREAFPPKSHQLDDFLWAVATVRSR-V 222

Query: 252 RLPSMDGR-VALVPWADMLNH------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
             P +DG  VALVP AD++ H        +++        +Q +V    R Y  GE V +
Sbjct: 223 HSP-LDGEDVALVPLADLVQHRKLQGARWQLQLAGGLFSKAQALVVEAQRDYAEGEVVTM 281

Query: 305 SYG--------KKSNGELLLSYG 319
            +G        +K + ++LL YG
Sbjct: 282 DFGAPLTEEDQEKLDSQVLLDYG 304


>gi|302836231|ref|XP_002949676.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
 gi|300265035|gb|EFJ49228.1| Rubisco large subunit N-methyltransferase [Volvox carteri f.
           nagariensis]
          Length = 484

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/285 (24%), Positives = 131/285 (45%), Gaps = 14/285 (4%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           L+A  + ++G+ L  VP S  ++A+S                P W  +A  L+++     
Sbjct: 71  LIASTDAQQGDVLFSVPDSAWLSAESVKKAAVGKLAAAAGLEP-WLQIALQLVADRFGST 129

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
            S  S Y +++P    + L W+  EL + L+ +Q+ +     +T    T+  L+  +F+ 
Sbjct: 130 KSELSAYAASIPEDLDTPLLWSEDEL-QELQGTQVLQTLGGYLTFFRSTFQQLQSGLFTS 188

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNHSCEVETFLDYDK 283
            P  FP  +F +  F W+   + SR    P +DG ++AL P  ++++H     + L    
Sbjct: 189 NPAAFPPSIFTLPRFLWAVAAVRSR--SHPPLDGPKIALAPLTELVSHRRAANSKLSVRS 246

Query: 284 S-----SQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           +      Q +V    R  + GE + + YG  K +G +L+ YG +  + T+P     L L 
Sbjct: 247 AGLFGRGQVLVLEATRAIRKGEPLSMDYGPGKLDGPVLVDYGVM--DVTSPKPGYSLTLK 304

Query: 338 LKKSDKCYKEKLEALRKYGLSASECFPIQITGWP-LELMAYAYLV 381
           +  SD+   +KL+ L    L  S  + +     P +E++A+  L+
Sbjct: 305 MPDSDRFIDDKLDILESNDLPQSVVYNLTPDEQPTIEMLAFLRLM 349


>gi|302854198|ref|XP_002958609.1| hypothetical protein VOLCADRAFT_108207 [Volvox carteri f.
           nagariensis]
 gi|300256070|gb|EFJ40346.1| hypothetical protein VOLCADRAFT_108207 [Volvox carteri f.
           nagariensis]
          Length = 360

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/272 (26%), Positives = 114/272 (41%), Gaps = 41/272 (15%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL 141
           W+S  G    K+ I +   G RG+   +++RKGE L+++P  LV +  +  +   A  +L
Sbjct: 37  WISQEG-GEFKVTISRTSAGVRGVFTTQDVRKGELLIYIPDHLVFSVRNVPAAEGAPLLL 95

Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE------LDRYLE 195
           K+   P                  SR + Y+  LPR+   L  +   E       D  LE
Sbjct: 96  KELFTP-----------------CSRLTPYLRVLPRETQVLTGYNFPEEYIKFLADDNLE 138

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
             Q+R    +   +     ND  +   +  P+       ++  F++   +L SR   L  
Sbjct: 139 L-QVRGSFKKHCRSTFEGQNDENM--MTTIPEAIGSVNISLPYFEYVVSMLSSRTFSLRR 195

Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
               +++VP  D++NH       LD  ++ +GV     +    GE+V I+YG   + ELL
Sbjct: 196 --DALSMVPLLDLMNHDIRDINQLDSSRAYRGVRVVAGKDLAKGEEVTITYGNMRSDELL 253

Query: 316 LSYGFV------PR------EGTNPSDSVELP 335
           L YGF+      PR         NP D  ELP
Sbjct: 254 LYYGFLDTITDPPRLLAVDHRNYNPQDGAELP 285


>gi|255581713|ref|XP_002531659.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
 gi|223528717|gb|EEF30729.1| [ribulose-bisphosphate carboxylase]-lysine N-methyltransferase,
           putative [Ricinus communis]
          Length = 558

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 133/277 (48%), Gaps = 12/277 (4%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           S +   ++ +W   +G+   ++ I  V+   RG +A ++++ G+  L +P S++I+ +  
Sbjct: 156 SCDKEKSIAEWGQRNGVH-SRLEIVYVEGAGRGAIATEDLKVGDIALEIPVSIIISEELV 214

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
                   + K   +    +L  + + E     +S+   Y   LP++  + L +    +D
Sbjct: 215 RHSDMYHILEKIDGISSETMLLLWSMKE-RHNCNSKSKIYFDTLPKEFNTGLSFG---VD 270

Query: 192 RYL--EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
             +  + + + +  ++   ++   Y++L   + + YPD+FP E++  E F W+  + +S 
Sbjct: 271 AIMASDGTLLFDEIMQAKEHLRVQYDELVPALCNNYPDVFPPELYTWEQFLWACELWYSN 330

Query: 250 LVRLPSMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            +++  +DG  R  L+P A  LNHS    +  +   D  +  + F   R  + GEQ  +S
Sbjct: 331 SMKIKFLDGKLRTCLIPIAGFLNHSLHPHIIHYGKVDSITNTLKFPLSRPCRVGEQCCLS 390

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           YG  S   L+  YGF+P +G N  D + L +   ++D
Sbjct: 391 YGNFSGAHLITFYGFLP-QGDNRYDIIPLDIDAGEAD 426


>gi|147777505|emb|CAN60498.1| hypothetical protein VITISV_027869 [Vitis vinifera]
          Length = 2077

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 90/184 (48%), Gaps = 10/184 (5%)

Query: 165  SSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITNVIGTYNDLRLRIF 222
            +S+++ Y +ALP    + L +   E D    L  + + E  IE   ++   Y +L   + 
Sbjct: 1470 NSKFNTYFNALPEAFNTGLSF---EFDAIMVLAGTLLLEEIIEAKKHLNAQYEELVPALC 1526

Query: 223  SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSC--EVETF 278
              +PD+FP E +  E F W+  + +S  +++   DG  R  L+P A  LNHS    +  +
Sbjct: 1527 KDHPDIFPPEFYTQEQFLWACELWYSNGMQVMFTDGKLRTCLIPIAGFLNHSLYPHIMHY 1586

Query: 279  LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
               D  +  + F   +    GEQ ++SYG  S+  L+  YGF+P +G N  D++ L +  
Sbjct: 1587 GKVDSKTNSLKFCVSKPCNMGEQCYLSYGNFSSSHLVTFYGFIP-QGDNLYDTIPLEIDN 1645

Query: 339  KKSD 342
             + D
Sbjct: 1646 PQGD 1649


>gi|313216417|emb|CBY37730.1| unnamed protein product [Oikopleura dioica]
 gi|313234608|emb|CBY10563.1| unnamed protein product [Oikopleura dioica]
          Length = 432

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/266 (25%), Positives = 125/266 (46%), Gaps = 37/266 (13%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT--------ADSKWSCPEAGEVLKQCSVPDWPLLAT 154
           RG+ A++ ++KG+ +  +    +IT         ++K+ C  AG + +  +V  + LL  
Sbjct: 33  RGVQAVERVQKGKNIFHITSDWLITPRFVVENLKEAKYFCEAAGRIGR--AVDAFDLLIL 90

Query: 155 YLISEAS----FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIR---ERAIERI 207
           ++++E++    F + +R++ Y   LP + YS+ Y+   +  R L  +Q+R   E+ + ++
Sbjct: 91  WIVTESTRETYFGRKTRFAGYFETLPIK-YSVPYFVPEKYRRLL-TNQVRTDVEKELNKL 148

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL-------PSMDG-- 258
            +   T+  +R  I S Y      E  +   F+W+   + +R + +         +DG  
Sbjct: 149 YDRHETFEIIRKEIRSTYHQEIISEC-SWVKFRWAAATIKTRQIYIFDEKYEELKIDGLQ 207

Query: 259 ------RVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
                    L PW DMLNH     V         + G+V    R   PGEQ+ ISY ++S
Sbjct: 208 IGTPFDSSGLAPWFDMLNHGDVAHVNCKFYCTSPADGLVCEALRDILPGEQLLISYDERS 267

Query: 311 NGELLLSYGFVPREGTNPSDSVELPL 336
           + ++L+ YGF    G N    +E+ L
Sbjct: 268 DDQMLVDYGFSLGPGENQRTFLEITL 293


>gi|66806627|ref|XP_637036.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
 gi|60465490|gb|EAL63575.1| hypothetical protein DDB_G0287857 [Dictyostelium discoideum AX4]
          Length = 532

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/371 (23%), Positives = 154/371 (41%), Gaps = 94/371 (25%)

Query: 81  KWLSDSGLP-PQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           +W  ++G+   +K+++    D+G RG++A   I++ E L+ +P   +I + SK+S  +  
Sbjct: 3   EWGINNGIEWNEKLSVHDFEDIG-RGVIANHEIKQDEVLISIPEKFLIHSKSKFSLEKLN 61

Query: 139 ----------------EVLKQCSVPDWPL------------LATYLISEASFEKSSRWSN 170
                             L   S+   P             ++ +LI E   +K+S W N
Sbjct: 62  PPIIKKIKSYIKTFVENNLSPSSIFYKPFHDSVNQFNSKQRISFHLIIEKLLKKNSIWYN 121

Query: 171 YISALPRQPYSLLYWTRAELDR-----YLE-ASQIRERAIERITN----VIGTY-NDLRL 219
           Y++ LP +      +   E++      Y+E   +++   +E   +    ++  Y NDL  
Sbjct: 122 YLNDLPTEYNITSTYDDDEIEHLGYPIYVEKVLELKNEMLESFDSFKEILMDNYKNDLN- 180

Query: 220 RIFSKYPD------------------------------LFPEEVFNMETFKWSFGILFSR 249
           RI  K  D                              +  +E+ +   ++W +G + SR
Sbjct: 181 RIVIKLNDNSNDDDDDGGGGGGGGGGGGGGGGDDENITIKLKEIIDFNLYQWCWGTIQSR 240

Query: 250 -------LVRLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
                  +  LP           ALVP AD+ NHS +V T   +D+  Q     T  +++
Sbjct: 241 TYYYDRNMKELPKHLQLEDKDDCALVPLADLFNHSSDVNTETKFDEKKQCYQVITKTKFE 300

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPS------DSVE---LPLSLKKSDKCYKEK 348
              QVFISYGK SN  L+  YGF+    +N S      D++    L   +K+  K Y+ K
Sbjct: 301 KDSQVFISYGKHSNFTLMNYYGFIIENNSNDSIPLVQEDAIPDIILEKEMKQDLKSYERK 360

Query: 349 LEALRKYGLSA 359
           +  L +YGLS 
Sbjct: 361 MSILEQYGLSV 371


>gi|384484604|gb|EIE76784.1| hypothetical protein RO3G_01488 [Rhizopus delemar RA 99-880]
          Length = 400

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/359 (25%), Positives = 160/359 (44%), Gaps = 61/359 (16%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RG+ A K+I++G+ L  +P S+++   S+ +     +V +   +  W  L   ++ E   
Sbjct: 39  RGVTANKDIKEGDLLFSLPRSILL---SQLTSSLKDQVSELSELSGWSPLILCMMYEIE- 94

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           +  S W  Y   LPR+  + ++W + +L + LE + I  +  ++ +  +  +N+L   I 
Sbjct: 95  KPDSFWKPYFDVLPREFTTPMFWNQEDL-KELEGTDIISKIGKKESEEL-FHNELE-PII 151

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFS-----RLVRLPSMD-------------------- 257
            KYP+LF E+   +E F     ++ +      L + P  +                    
Sbjct: 152 KKYPNLFDEQKHTIELFHICGSLIMAYSFNDELQKAPKENNKEEEKEEEEEEEEEEEEEE 211

Query: 258 ---GRVALVPWADMLNHSC---EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
              G +++VP ADMLNH         F + D      +    +  + GEQ++ +YG   N
Sbjct: 212 EEEGLISMVPMADMLNHKTGFNNARLFHEPDSLQMRAI----KDIKEGEQIYNTYGDLCN 267

Query: 312 GELLLSYGFVPREGTNPSDSVEL--PLSL----KKSDKCYKE-KLEALRKYGLSASECFP 364
            +LL  YGFV  +  N  D VEL  PL +    +  D+  KE K++ L + G+   ECF 
Sbjct: 268 ADLLRKYGFVDEK--NDFDLVELDGPLLVEVCCEDQDEALKERKIDFLMEEGV-LDECFV 324

Query: 365 IQITG-WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFIL 422
           I      P EL+   +++ +      K EE       K+T  K+IK      + LQ IL
Sbjct: 325 IDKEHEIPPELIVSVHVLCTTADDFQKMEEKQKLPKPKLT--KEIK------EKLQIIL 375


>gi|428174289|gb|EKX43186.1| hypothetical protein GUITHDRAFT_110913 [Guillardia theta CCMP2712]
          Length = 437

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 90/188 (47%), Gaps = 21/188 (11%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWT--RAELDRYL-EASQIRERA-IER 206
           LA +L+ E+   KSS W  Y+ +LP+  P  + Y    R +L   L E  +++  A +E 
Sbjct: 168 LAVFLLLESQ-NKSSFWRPYLCSLPKHVPLPMFYSKERRQQLKEQLPEDQRVKFDALVEA 226

Query: 207 ITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---- 261
             +V+   Y  L   +F KYP LF  EVF+ E F W+  I+ SR       D  +     
Sbjct: 227 RRDVVDLHYMQLLPVLFLKYPTLFSPEVFSYEKFAWAISIIMSRTWGKTYFDSALGPRGR 286

Query: 262 ------LVPWADMLNHSCEVETFLDYDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGEL 314
                 L P ADM NH     + L+ ++  +G +     +    GEQ FISYG K + E 
Sbjct: 287 NITVHTLAPAADMPNHDS---SGLEANRDPRGRMTLNAQKNLSVGEQFFISYGSKCDAEF 343

Query: 315 LLSYGFVP 322
           L  YGFVP
Sbjct: 344 LAHYGFVP 351


>gi|308806756|ref|XP_003080689.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
 gi|116059150|emb|CAL54857.1| SET domain-containing protein-like (ISS) [Ostreococcus tauri]
          Length = 472

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/301 (25%), Positives = 126/301 (41%), Gaps = 42/301 (13%)

Query: 71  DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           D      TL +W +D G   + + +       RGL   +++R GE++L    +  I  D 
Sbjct: 31  DDASRVETLARWCADRGTYARALRVDLDQGSGRGLELSRDVRAGERVLGASLTSGIV-DE 89

Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS--LLYWTRA 188
               PE          P    LA  ++ E     +S ++ Y++ LP +  S   LY  RA
Sbjct: 90  ARGHPERTRA-AMAEAPWGVRLACRVLQERKKGGASAYAAYVATLPERVESSPALYDARA 148

Query: 189 -ELDRYLEA-SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
            E  +Y  A ++IRE  + R T       +   ++    P+   + VF+ + F  +  ++
Sbjct: 149 IEEVQYPPAMTEIRE--MRRATR------EWHEKLQKTAPEALGDAVFDYDAFVDAVSVV 200

Query: 247 FSRLVRLPSMDGRV----ALVPWADMLNHSCEVETFLDYDKSS----------------- 285
            SR   + S +       AL+P ADM+NH  ++ T L  D+ +                 
Sbjct: 201 HSRTYGIASANDNAGLFRALLPLADMINHGGDIVTGLTKDEETGAVTNVETTATDNIAWS 260

Query: 286 ----QGVV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
                GVV F   R    GE   +SYG++SN   L+ YGF P    NP D   L  +L+ 
Sbjct: 261 ELDDDGVVHFAATRDIAEGEAALMSYGERSNDHFLIYYGFAP--DNNPHDDCVLFSNLEH 318

Query: 341 S 341
           +
Sbjct: 319 A 319


>gi|345795412|ref|XP_544872.3| PREDICTED: SET domain-containing protein 4 [Canis lupus familiaris]
          Length = 440

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/312 (25%), Positives = 136/312 (43%), Gaps = 33/312 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++R+G+ ++ +P S +IT D+       G
Sbjct: 36  LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLITTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-- 194
             + +   P  PLLA  T+L+SE      S W  Y+  LP Q Y+       E+      
Sbjct: 95  TYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILP-QAYTCPVCLEPEVVNLFPK 153

Query: 195 ----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
               +A + R R  E  ++    ++ L+  +FS+      E +F+     W++  + +R 
Sbjct: 154 PLKAKAEEQRARVQEFFSSSRDFFSSLQ-PLFSEAV----ESIFSYRALLWAWCTVNTRA 208

Query: 251 VRLPSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           V +     +         AL P+ D+LNHS EV+    +++ ++     T    +  E+V
Sbjct: 209 VYVKHRQRQCFSTEPNTYALAPYLDLLNHSPEVQVKGAFNEETRCYEIRTASNCRKHEEV 268

Query: 303 FISYGKKSNGELLLSYGFV----PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           FI YG   N  LLL YGFV    P      S+ + L   L  +DK   +K+  L+ +   
Sbjct: 269 FICYGPHDNQRLLLEYGFVSIHNPHACVYVSEDI-LVKYLPTTDKQMNKKISILKDHDFI 327

Query: 359 ASECFPIQITGW 370
            +  F     GW
Sbjct: 328 ENLTF-----GW 334


>gi|347967018|ref|XP_321037.5| AGAP002018-PA [Anopheles gambiae str. PEST]
 gi|333469795|gb|EAA01259.5| AGAP002018-PA [Anopheles gambiae str. PEST]
          Length = 493

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 72/311 (23%), Positives = 131/311 (42%), Gaps = 27/311 (8%)

Query: 73  LENASTLQKWLSDSGLPPQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           +E  +   +W  + G   + + + +  + G  GL +   I  GE ++ VP S+     ++
Sbjct: 69  METVAHFMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNE 128

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
               +  E++    + +    +LA  LI E  F   S W  Y+  LP +  + LY+T  +
Sbjct: 129 PRYRQLLELMPGAMMSEQGNIMLALALIME-RFRAKSDWKPYLDLLPDRYTTPLYYTTED 187

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
           +    E       A++   ++   Y  +R  +  K  +L   + F  + F+W+   + +R
Sbjct: 188 MGELAETDAFLP-ALKLCKHIARQYGFIRRFVQEKVDEL--RDCFTYDVFRWAVSTVMTR 244

Query: 250 LVRLP-------SMDGRVALVPWADMLNHS---------CEVETFLDYDKSSQGVVFTTD 293
             ++P        MD  +AL+P  DM NH+         C  ET  +          T +
Sbjct: 245 QNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNEQLECSLTRE 304

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
                   +FI YG +++ E L+  GFV PR   NP  +V+   +L  +   YKE+   L
Sbjct: 305 VSDIASVPIFIVYGTRTDAEFLVHNGFVCPR---NPHANVQKRFTLVPAIPLYKERAHLL 361

Query: 353 RKYGLSASECF 363
              G+  +  F
Sbjct: 362 ELLGMPTTGTF 372


>gi|391342782|ref|XP_003745694.1| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Metaseiulus occidentalis]
          Length = 278

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 74/133 (55%), Gaps = 6/133 (4%)

Query: 237 ETFKWSFGILFSRLVRLPSMD-GRV--ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
           E  +W+   + +R   LPS+  GR+  ALVP  DM NH   + +  DYD +SQ +V    
Sbjct: 37  EPDEWACSTVMTRQNELPSLTPGRMQMALVPLWDMCNHDT-LRSGTDYDVASQQLVSFAT 95

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           R+Y+  EQV I YG ++N + +L  GFVP E  N  DS+ + + L K+DK ++ K     
Sbjct: 96  REYKKNEQVNIFYGNRANAQFMLHNGFVPDE--NQWDSLAIKIGLSKADKLFEMKRRLCE 153

Query: 354 KYGLSASECFPIQ 366
           +  +  S+ F ++
Sbjct: 154 QMKIPTSDVFELK 166


>gi|348684109|gb|EGZ23924.1| hypothetical protein PHYSODRAFT_296170 [Phytophthora sojae]
          Length = 452

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 76/264 (28%), Positives = 112/264 (42%), Gaps = 44/264 (16%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           L + + +R+G    FVP +LV+      + P +         P+     T L++ AS   
Sbjct: 83  LASSRVLREGSS--FVPSALVLGVHMLVNFPHSEN-------PE-----TSLMAMASMNT 128

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
                 Y++ALPR     LYW     D+  E  Q  E A   + +    Y+ +   +F  
Sbjct: 129 PPLDELYVNALPRYVDLPLYWD----DKQFEELQGCEEARRAMQHGARFYSQVYKHLFGA 184

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH----SCEVETFLD 280
                  +  N E F W+  IL SR       +   AL+P+ D  NH    S      LD
Sbjct: 185 N-----NQFVNAEAFFWAISILMSRATS--GQNQPFALIPFFDWFNHAGNGSDNCRHALD 237

Query: 281 YDKSSQ------GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVE 333
            D+  Q      G    T R Y+PGEQ+FI+YG   N  LL +YGF +P    NP D V 
Sbjct: 238 SDECVQDFDMQKGFTIHTTRSYEPGEQLFINYGSHGNLRLLRNYGFTMP---NNPYDVVN 294

Query: 334 LPLSL-----KKSDKCYKEKLEAL 352
           LP+        ++D  + +K + L
Sbjct: 295 LPMPAALQQPNEADPAFAQKRDLL 318


>gi|428163884|gb|EKX32933.1| hypothetical protein GUITHDRAFT_120884 [Guillardia theta CCMP2712]
          Length = 320

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/246 (28%), Positives = 116/246 (47%), Gaps = 24/246 (9%)

Query: 77  STLQKWLSDSGLPPQKM-AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
           S L +WL   G+    + A+ +  +G   L A K I  GE  L VP  L++    +    
Sbjct: 2   SALLRWLEGGGVQLGGVEAVWREGMGW-ALRASKRISPGETFLKVPRHLLL-GPHQLRAS 59

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
               +L+   +PD  LL   L+   S   SS +  Y+  LP    + + W++ E  + L 
Sbjct: 60  SLDRLLEGWQLPDCMLL---LLMCESVNSSSFFRPYLDLLPDTVDTPITWSKEEA-KELV 115

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
            S +  RA++    +  ++ +++ ++F KYPD FP  +F+ E ++W++ IL SR      
Sbjct: 116 GSPVLHRAVKLRHELARSFQEMKDKVFDKYPDRFPPLLFSYERYQWAYSILRSRAF---- 171

Query: 256 MDGRVALVPWADMLNHSCE---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
             G   L+P  D++NH  +     T L    S         R+Y     V+  YG+KS+ 
Sbjct: 172 --GNYTLMPLIDLMNHHPDSRLAPTLL----SDGSDALIARREY----NVWGFYGRKSDA 221

Query: 313 ELLLSY 318
           +LLL+Y
Sbjct: 222 DLLLNY 227


>gi|308802083|ref|XP_003078355.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
 gi|116056807|emb|CAL53096.1| ribulose-1,5-bisphosphate carb (ISS) [Ostreococcus tauri]
          Length = 520

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 163/385 (42%), Gaps = 46/385 (11%)

Query: 74  ENASTLQKWLS-DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           E+A  L  WLS D G+    +A ++   G   ++   +   G   L VP S  +T+    
Sbjct: 48  EDARELAAWLSYDKGVDASALAFKEDAKGGVRVILKADAEAGATALRVPQSAAVTSVDVG 107

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
             P   E+      P+   LA +L +E     +S W+ Y+  L   P + L+WT A+   
Sbjct: 108 EHPIVSEL--ASGRPELIGLALWLCAERIKGGASEWAPYVKTLRANPDAPLFWTDAKDFA 165

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET---FKWSFGILFSR 249
            L+ S +   AIER  +    Y  +   +    P  +P E +   T   F  +   + ++
Sbjct: 166 LLKGSPVAADAIERSKSARTEYASI-TEVIKSDPSSYPPEAYEFLTEARFVDALATVCAK 224

Query: 250 LVRLPSMDGRVALVPWADMLN-HSCEVETFL--------------DYDKSSQGVVFTTDR 294
              LP+     ALVP  D+++     V   L              DYD  +  VV     
Sbjct: 225 ATWLPTAQC-YALVPLLDVISIGGAPVPGVLPPSASDGVVRCGPADYDVDTASVVLRCAT 283

Query: 295 QYQPGEQVF-ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           +     +V  +   +++NGEL L+ G+V ++  +P D + +   ++ SD+ +  K + L 
Sbjct: 284 KAAANSEVIQLDALQRNNGELFLNTGYVDQK--HPGDYIYMKTDIQTSDRLFTAKKQVLE 341

Query: 354 KYGLSASE-CFPIQITGWPLELMAYAYL----VVSPPSMKG-KFEEMAAAASNKMTSKKD 407
             G +A++  FP+     P +L  Y+YL    V  P  M    FEE      +K+ S  +
Sbjct: 342 GMGFTAADQYFPVYKDRMPTQL--YSYLRFSRVQDPGEMMAVSFEE------DKIVSVMN 393

Query: 408 IKCPEIDEQALQFILDSCESSISKY 432
                 + + LQ ++  C   +++Y
Sbjct: 394 ------EYEILQILMGDCRELMAEY 412


>gi|357479689|ref|XP_003610130.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
           truncatula]
 gi|355511185|gb|AES92327.1| 3-hydroxy-3-methylglutaryl-coenzyme A reductase [Medicago
           truncatula]
          Length = 689

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 71/133 (53%), Gaps = 7/133 (5%)

Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNH 271
           Y++L   + + +PD+FP E++  E F W+  + +S  +++   DG  R  L+P A  LNH
Sbjct: 423 YDELVPALCNGFPDIFPPEIYTWENFLWACELWYSNSMKIMYSDGKLRTCLIPLAGFLNH 482

Query: 272 SC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
           S    +  +   D S+  + F   R  + GE+  +SYG  S+   +  YGF+P +G NP 
Sbjct: 483 SLCPHITHYGKVDPSTNSLKFCLSRSCRSGEECCLSYGNFSSSHFITFYGFLP-QGDNPY 541

Query: 330 DSVELPLSLKKSD 342
           D +  PL +  SD
Sbjct: 542 DVI--PLDIDSSD 552


>gi|281338852|gb|EFB14436.1| hypothetical protein PANDA_005285 [Ailuropoda melanoleuca]
          Length = 415

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 129/285 (45%), Gaps = 24/285 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++R+G+ ++ +P S ++T D+       G
Sbjct: 12  LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIR-SYLG 70

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 71  AYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPKA-YTCPVCLEPEVVN-LFP 128

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+   V G ++  R    S  P LF E V   F+     W++  + +R V +
Sbjct: 129 KPLKAKAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVESIFSYSALLWAWCTVNTRAVYV 187

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                +         AL P+ D+LNHS  V+    +++ ++     T    +  E+VFI 
Sbjct: 188 KHRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFIC 247

Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSV---ELPLSLKKSDK 343
           YG   N +LLL YGFV    P      S+ V    LPL+ K+ +K
Sbjct: 248 YGPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQMNK 292


>gi|302771638|ref|XP_002969237.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
 gi|300162713|gb|EFJ29325.1| hypothetical protein SELMODRAFT_410177 [Selaginella moellendorffii]
          Length = 336

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 80/304 (26%), Positives = 137/304 (45%), Gaps = 22/304 (7%)

Query: 82  WLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           WL   G     +A+   +    R L A + +  GE ++     LV+T + K  C E   +
Sbjct: 45  WLRSRGEDMNSIAVAIGMSKHGRALFAHRPMCAGECMIKFSQDLVLTPE-KLPC-EVIAL 102

Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
           L Q +  ++  ++  +++E    ++S W+ YI  LP   + +S ++W   EL   LE S 
Sbjct: 103 LDQAN--EFTRVSLLVMAEKRKGQNSAWAPYIECLPSFGEIHSTIFWDPKEL-ACLECSP 159

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLPSMD 257
           I     ER   +   Y +++ ++    P L+  +V ++E FK  +  + SR   + P  D
Sbjct: 160 IHRGTGERNALLQSEYREVK-KVVESCPHLYDPDV-SLEQFKHEYATVSSRAWGQGPHSD 217

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF-----ISYGKKSNG 312
             + ++P  D  NH     T   +   +  VV  + R YQ G++ F     I YG  SN 
Sbjct: 218 --MTMIPLVDFANHDPRSRTLFSHADDNCTVVVAS-RDYQTGDENFHLKVHICYGDHSNA 274

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWP 371
            L L YGFV  +  NP D  E+ L +   D   + KL+ + +  ++   +    Q  G P
Sbjct: 275 VLALDYGFVVPD--NPFDEAEIFLEIPSEDPLREIKLQYMAQNNMNTLQDSNGTQTGGRP 332

Query: 372 LELM 375
             +M
Sbjct: 333 FTIM 336


>gi|356553227|ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/262 (26%), Positives = 117/262 (44%), Gaps = 37/262 (14%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
           G RGL A++++R+GE +L VP S ++T ++     +  + V +  S+    +L   L+ E
Sbjct: 51  GGRGLGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYE 110

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL----EASQIRERAIERITNVIGTYN 215
               K+SRW  Y+  LP   Y +L     E +++     EA  + E+A+ +  +     +
Sbjct: 111 MGKGKTSRWHPYLMHLP-HTYDVLA-MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAH 168

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-- 273
            L   +      +F  + F  + + W+   + SR + +P  D    L P  D+ N+    
Sbjct: 169 SLMQDL------MFKPQFFTFKAWVWAAATISSRTLHIP-WDEAGCLCPVGDLFNYDAPG 221

Query: 274 -EVETFLDYDKSSQ------------------GVVFTTDRQYQPGEQVFISYGKKSNGEL 314
            E     D D + Q                     F     Y+ G+QV + YG  +N EL
Sbjct: 222 IEPSGIEDLDHAEQLDSHSWRLTDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLEL 281

Query: 315 LLSYGFVPREGTNPSDSVELPL 336
           L  YGF+ +E  NP+D V +PL
Sbjct: 282 LEHYGFLLQE--NPNDKVFIPL 301


>gi|229596469|ref|XP_001008992.3| SET domain containing protein [Tetrahymena thermophila]
 gi|225565279|gb|EAR88747.3| SET domain containing protein [Tetrahymena thermophila SB210]
          Length = 629

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 117/258 (45%), Gaps = 14/258 (5%)

Query: 71  DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           ++L+ +  L  W+  +      + ++ +    R +V+ + I+  E ++ +P   VIT D 
Sbjct: 145 ETLKKSENLLSWVQANKGEFSSIKLKYLSTHNRSIVSKRIIQADETVISIPQEQVITLDV 204

Query: 131 KWS---CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
             S   C    E   Q         A +L+ E   + +S +  YI +LP    S      
Sbjct: 205 ASSSDFCKILTEKNTQLVQQKHAYFALFLLQEQKKKDASHYKAYIDSLPTDLSSFPALFS 264

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
            E  +YLE +   +   E+  ++   Y  +   I    P+   E  F+ E F+W+F    
Sbjct: 265 EEELQYLEGTAALKLVQEQKEDIKTDYESISQVI----PEFKSE--FSFEQFRWAFLCSH 318

Query: 248 SRL--VRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           SR+  +++  +   V +VP ADMLNH  S + ++   +D ++        ++ Q  +Q+ 
Sbjct: 319 SRVFGIKVKGVKTSV-MVPLADMLNHKHSGQEDSEWVFDDATNCFTVKALKKIQRNQQIH 377

Query: 304 ISYGKKSNGELLLSYGFV 321
            SYG K N +L L+YGFV
Sbjct: 378 FSYGSKCNSKLFLNYGFV 395


>gi|301763371|ref|XP_002917104.1| PREDICTED: SET domain-containing protein 4-like [Ailuropoda
           melanoleuca]
          Length = 440

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 129/285 (45%), Gaps = 24/285 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++R+G+ ++ +P S ++T D+       G
Sbjct: 36  LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLREGQMIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWQPPPSPLLALCTFLVSEKHAGDQSLWKPYLEILPK-AYTCPVCLEPEVVN-LFP 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             ++ +A E+   V G ++  R    S  P LF E V   F+     W++  + +R V +
Sbjct: 153 KPLKAKAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVESIFSYSALLWAWCTVNTRAVYV 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                +         AL P+ D+LNHS  V+    +++ ++     T    +  E+VFI 
Sbjct: 212 KHRQEQCFSTEPNTCALAPYLDLLNHSPRVQVKAAFNEETRCYEIRTASGCRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSV---ELPLSLKKSDK 343
           YG   N +LLL YGFV    P      S+ V    LPL+ K+ +K
Sbjct: 272 YGPHDNQQLLLEYGFVSIQNPHACVYVSEDVLVKYLPLTDKQMNK 316


>gi|449702130|gb|EMD42824.1| Hypothetical protein EHI5A_004190 [Entamoeba histolytica KU27]
          Length = 749

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/233 (29%), Positives = 117/233 (50%), Gaps = 16/233 (6%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT-- 208
           L+  YL    +  K   W  YI+ LP      L +T  EL+  ++ +++   A+E+I   
Sbjct: 39  LVYLYLAVNKTNPKCFHWP-YINVLPETYDCPLSYTIDELNL-MKGTKLY-AAVEKINAF 95

Query: 209 --NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRV-ALV 263
              V+  YN+  ++ F +Y   F +++F  +  +W+    +SR  LV  P   G V +L+
Sbjct: 96  LMKVVDYYNNKLIQQFPQYFQSF-DDLF--KRLQWAHQSFWSRAFLVIYPQPFGEVGSLI 152

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY-QPGEQVFISYGKKSNGELLLSYGFVP 322
           P+ D  NH  + +     +  ++   F T+ +  +PGEQ+F +Y  +SN +LLL YGFV 
Sbjct: 153 PFCDFSNHCTQAKVTYISNTQTETFSFQTNEELVKPGEQIFNNYRIRSNEKLLLGYGFV- 211

Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELM 375
            E  NP D++ L +  +  D  Y E  E L++  + + + F       PLELM
Sbjct: 212 -EENNPCDNLLLRIYFEVDDNQYNEIEEILKQEEIKSFDFFLKLDEDIPLELM 263


>gi|302754340|ref|XP_002960594.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
 gi|300171533|gb|EFJ38133.1| hypothetical protein SELMODRAFT_402971 [Selaginella moellendorffii]
          Length = 403

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 80/304 (26%), Positives = 138/304 (45%), Gaps = 22/304 (7%)

Query: 82  WLSDSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           WL   G     +A+   +    R L A + +  GE ++    +LV+T + K  C E   +
Sbjct: 45  WLRRRGEDMNSIAVAIGMSKHGRALFAHRPMCAGECMIKFSQNLVLTPE-KLPC-EVIAL 102

Query: 141 LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
           L Q +  ++  ++  +++E    ++S W+ YI  LP   + +S ++W   EL   LE S 
Sbjct: 103 LDQAN--EFTRVSLLVMAEKRKGQNSAWAPYIECLPSFGEIHSTIFWDPKEL-ACLECSP 159

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL-VRLPSMD 257
           I     ER   +   Y +++ ++    P L+  +V ++E FK  +  + SR   + P  D
Sbjct: 160 IHRGTGERNALLQSEYREVK-KVVESCPHLYDPDV-SLEQFKHEYATVSSRAWGQGPHSD 217

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF-----ISYGKKSNG 312
             + ++P  D  NH     T   +   +  VV  + R YQ G++ F     I YG  SN 
Sbjct: 218 --MTMIPLVDFANHDPRSRTLFSHADDNCTVVVAS-RDYQTGDENFHLKVHICYGDHSNA 274

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGWP 371
            L L YGFV  +  NP D  E+ L +   D   + KL+ + +  ++   +    Q  G P
Sbjct: 275 VLALDYGFVVPD--NPFDEAEIFLEIPSEDPLREIKLQYMAQNNMNTLRDSNGTQTGGRP 332

Query: 372 LELM 375
             +M
Sbjct: 333 FTIM 336


>gi|426218421|ref|XP_004003445.1| PREDICTED: SET domain-containing protein 4 [Ovis aries]
          Length = 439

 Score = 75.5 bits (184), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 72/267 (26%), Positives = 126/267 (47%), Gaps = 19/267 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +   +     RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LKKWLKDRRFEDATLIPARFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKA-YTCPVCLEPEVVNLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLV-- 251
           + ++ +A E+ ++V   ++  R    S  P LF E +   F+    +W++  + +R V  
Sbjct: 153 NPLKTKAWEQRSHVQEFFSSSRGFFSSLQP-LFSEAIETIFSYRALRWAWCTVNTRAVYM 211

Query: 252 -RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
            R P +         AL P+ D+LNHS +V+    +++ ++     T  +    ++VFI 
Sbjct: 212 KRPPQLCLSPEPDTCALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFIC 271

Query: 306 YGKKSNGELLLSYGFVPREGTNPSDSV 332
           YG   N  LLL YGFV    +NP   V
Sbjct: 272 YGPHDNHRLLLEYGFV--SVSNPHACV 296


>gi|410900968|ref|XP_003963968.1| PREDICTED: SET domain-containing protein 4-like [Takifugu rubripes]
          Length = 386

 Score = 75.5 bits (184), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 74/288 (25%), Positives = 128/288 (44%), Gaps = 21/288 (7%)

Query: 80  QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           Q+  S + L P   A    D G RGL  L+N++ G+ L+ +P S ++T  +  +    G 
Sbjct: 40  QRGFSTTLLHPAAFA----DTG-RGLQVLRNVKPGDMLISLPESCLLTTSTVLNS-YLGS 93

Query: 140 VLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
            +K       PLLA   +L+ E    ++S W  YI  LP+      Y+T   +     + 
Sbjct: 94  FIKSWKPHLSPLLALCVFLVCERHRGEASDWFPYIDVLPKSYTCPAYFTDEVMALLPPSV 153

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS-- 255
           Q + R        I + N    R          E+V   E  +W++  + +R V +    
Sbjct: 154 QRKAREQREAVREIHSSNKAFFRSLQPVLTQPAEDVLTYEALRWAWCSVNTRSVFMLHSS 213

Query: 256 ---MDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
              + G+   AL P+ D+LNH  +V+    +++ ++     +  +    +Q FI+YG   
Sbjct: 214 NDFLRGQDVYALAPFLDLLNHCPDVQVKASFNEETKCYEIRSVSRMLQYQQAFINYGSHD 273

Query: 311 NGELLLSYGFVPREGTNPSDSV----ELPLSLKKSDKCYKEKLEALRK 354
           N  L+L YGFV     NP   V    +L   + + D+  ++KL+ LR+
Sbjct: 274 NQRLMLEYGFV--APCNPHSVVYVDKDLIADVLRGDQSLEQKLKFLRE 319


>gi|312098619|ref|XP_003149111.1| hypothetical protein LOAG_13557 [Loa loa]
 gi|307755724|gb|EFO14958.1| hypothetical protein LOAG_13557 [Loa loa]
          Length = 288

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 124/292 (42%), Gaps = 37/292 (12%)

Query: 77  STLQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
           +   +W+ ++G     + I+   + G +GL A  + R+ E ++ +P  ++ITA      P
Sbjct: 4   TDFMEWVIENGGEHFGVDIRDCSNEGGKGLYATTDFRENETVICIPMEIIITAGFVAEMP 63

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
              +V K+  +  +  L  + + E   EK+S+W  Y   LP+      + T A L   LE
Sbjct: 64  GYCDVFKRYRLKPFEALVYFFLVEK--EKNSKWDPYFKVLPKS-----FSTPASLHPVLE 116

Query: 196 ASQ----IRERAIERITNVIGTYNDLRL-------------RIFSKYPDLFPEEVFNMET 238
                  +R++   +   +   Y   R              R +S++  +  +       
Sbjct: 117 PEDFPYCLRKQWCIQKNELKTMYEKARFVTEGTAGEFVPHNRFYSQFVAILADNTI-WGH 175

Query: 239 FKWSFGILFSRLVR-----LPSMDG----RVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
           F W++ I+ +R +       P +D      +A+VP  DMLNHS + +    +D       
Sbjct: 176 FLWAWHIVNTRCIYRDNKPHPLIDNTEGDSLAIVPLIDMLNHSNDSQCCAIWDSKLNLYK 235

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
               R    GEQ+FI YG  +NG L + YGF  ++  N  + VE+ L    S
Sbjct: 236 AIVTRPIHEGEQIFICYGSHTNGSLWIEYGFYLKD--NICNKVEISLGWFNS 285


>gi|384254260|gb|EIE27734.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 724

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 81/307 (26%), Positives = 127/307 (41%), Gaps = 47/307 (15%)

Query: 72  SLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITAD 129
           S E     Q+W   SG+    + + +   G RG+ A  NI KGE L+ +P   +LV++  
Sbjct: 70  SGEGPLGFQEWALQSGITSPSLRLAEF-AGLRGMAAADNIAKGEVLVSLPVAAALVVSPK 128

Query: 130 SKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
            +   P         S   W + +A  L+ E     +S+ + Y++ALP    + L W+ A
Sbjct: 129 ERSQLPGTFCSSAFYSKKPWYVQMALNLLYERQLGPASKLAPYVAALPVDFSTPLSWSEA 188

Query: 189 ELDRYLEASQIRERAIER------------------IT--NVIGTYNDLRLRIFSKYPDL 228
           +L        IRE A +R                  IT  ++I     +R R FS  P  
Sbjct: 189 QLQALCYPQLIREVATQREGLKRLHAELAVSTPGTPITEQDLIWALQAVRSRAFSG-PYA 247

Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------------------LVPWADM 268
            P     ++TF     +  + +     ++G +A                    + P  D 
Sbjct: 248 GPTWRSRLKTFGALGALAAASITVAHVLNGAIAAALFNLLYDVVLSQKVKWYAMCPVVDF 307

Query: 269 LNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
           LNH   V++ ++Y+  +          +  GEQVFISYGK+SN  LL  YGFV  E   P
Sbjct: 308 LNHKSTVQSEVEYEYFADRFSVRCQSYFSKGEQVFISYGKQSNDSLLQYYGFV--EPGIP 365

Query: 329 SDSVELP 335
            D+  +P
Sbjct: 366 HDTYTIP 372


>gi|432901733|ref|XP_004076920.1| PREDICTED: SET domain-containing protein 4-like [Oryzias latipes]
          Length = 441

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 80/297 (26%), Positives = 132/297 (44%), Gaps = 26/297 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L K+L   G     +        +RGL  L+ I+ G  L+ +P S ++T  +       G
Sbjct: 36  LMKFLHGRGFTSTPLQPALFSDTDRGLQTLQPIQPGGMLVSLPESCLLTTSTVLH-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             LK        L+A   +L+ E    ++S W  YI  LP       Y+T   +   +  
Sbjct: 95  PFLKSWKPRPSSLVALCVFLVCERHRGEASDWFPYIDVLPCSYCCPPYFTDTVMA--VLP 152

Query: 197 SQIRERAIER---ITNVIGTYND--LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           S +R RA E+   + ++   + D  + L+    +P   PEEV   E  +W++  + +R V
Sbjct: 153 SGVRRRAEEQREGLQHLYAVHQDFFMSLQPVLSHP---PEEVLTYEALRWAWCSINTRSV 209

Query: 252 RL--PS---MDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
            +  PS   + G    AL P+ D+LNH  +V+    ++++S      +    Q   Q FI
Sbjct: 210 FMDRPSSSFLSGPDNYALAPFLDLLNHRPDVQVKAGFNRTSGCYEIRSISGVQRYHQAFI 269

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV----ELPLSLKKSDKCYKEKLEALRKYGL 357
           +YG   N  LLL YGFV     NP   +    +L   + + D+   EK++ LR+ G 
Sbjct: 270 NYGSHDNQRLLLEYGFV--SSCNPHSVIYVEEDLLCEVLRGDESLDEKMKFLRENGF 324


>gi|156538697|ref|XP_001607787.1| PREDICTED: histone-lysine N-methyltransferase setd3-like [Nasonia
           vitripennis]
          Length = 403

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 54/193 (27%), Positives = 96/193 (49%), Gaps = 10/193 (5%)

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNM 236
           Y L+ +   +    L+ S   E A+++  N+   Y+  + ++F    +   +   +VF  
Sbjct: 104 YGLVLYMSMDDMMELKESPALETALKQCRNIARQYSYFK-KLFHNSKNSVSKLLADVFTY 162

Query: 237 ETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
           E ++W+   + +R   +PS +      +L+P  DM NHS E +   ++++ S        
Sbjct: 163 EEYRWAVSTIMTRQNVIPSENQSAMVHSLIPMWDMCNHS-EGKITTNFNEISNCCECYAM 221

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + ++  +Q+FI YG ++N E  +  GFV  +  N  DS EL L +  SDK   EK+E L 
Sbjct: 222 KSFKTDDQIFIYYGSRTNAEFFVHSGFVYPDNAN--DSYELHLGIGSSDKLRSEKVELLS 279

Query: 354 KYGLSASECFPIQ 366
           K GL  S  FP++
Sbjct: 280 KIGLPVSNQFPLK 292


>gi|301099608|ref|XP_002898895.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104601|gb|EEY62653.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 440

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 65/255 (25%), Positives = 124/255 (48%), Gaps = 24/255 (9%)

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASF 162
           +  +N+  G  LL +P S V++ +S  +    G +L+    PD P    L  +L+ E + 
Sbjct: 37  ITTENVEVGSVLLSLPMSQVMSVESA-ARGRVGLLLEVN--PDLPSAIALGLHLLEERAL 93

Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
             +S +S++++ LP      S L+++  E+ + LE SQ++   + R   V   Y+ L   
Sbjct: 94  GAASNFSDFVATLPTIEAINSTLFYSEDEM-KGLEGSQLQRFTLGRAQAVDAFYDALVQP 152

Query: 221 IFSKY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML-------- 269
           + S+    P +F +  F ++ F+W+ G+++S   +    +  V L P  + +        
Sbjct: 153 VTSREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQFGENEDDVILAPVLNTIGICTDLNQ 212

Query: 270 --NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
             N +C  ET +  D  +Q +       Y  G++V +S   KS+ +L+LS+GF  R   +
Sbjct: 213 EGNEACP-ETSIKVDTDTQRLTVYASVAYSKGQEVRLSMPGKSSTQLMLSHGFA-RARAS 270

Query: 328 PSDSVELPLSLKKSD 342
             D ++L ++L  SD
Sbjct: 271 KLDKLDLTVTLDSSD 285


>gi|302792358|ref|XP_002977945.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
 gi|300154648|gb|EFJ21283.1| hypothetical protein SELMODRAFT_107696 [Selaginella moellendorffii]
          Length = 467

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 78/341 (22%), Positives = 137/341 (40%), Gaps = 51/341 (14%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+ AL+++  GE +  +P +  +T  +      A + +++  +     L   L+ E S  
Sbjct: 33  GVRALRDLHHGELIATIPKAACLTLLT----TAARDAIERARLGGGLGLTVALMYERSKG 88

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           K S+W  Y+  LPRQ      W+  E+D  L  +++ +   E    +   + +    +  
Sbjct: 89  KGSKWYRYLKTLPRQESVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEENIAPLTK 148

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF----- 278
           + P  FP + F  E++  +  ++ SR   + +  G   +VP AD+ NH  + E       
Sbjct: 149 EDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHG-YGMVPLADLFNHKTDAEDVHFMLN 207

Query: 279 ------------------------LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
                                   +  DKS   +V   D     G ++F +YG+  N  L
Sbjct: 208 ASDSDDDDNGLIIDDGLANGDCREISSDKSVLEMVMVKD--VAAGSEIFNTYGQLGNAAL 265

Query: 315 LLSYGFVPREGTNPSDSVELPLSL-------KKSDKCYKEKLEALRKYGLS-----ASEC 362
           L  YGF   E  NP D V L +         +   K  +++    RK G S      SE 
Sbjct: 266 LHRYGFT--EPNNPHDIVNLDMDCVLEVLLSRFQKKRVRKRGRVWRKAGFSGCESQGSEY 323

Query: 363 FPIQITGWP-LELMAYAYLVVSPPSMKGKFEEMAAAASNKM 402
           F I  +G P +EL+   +++ S        E+ AA    ++
Sbjct: 324 FEISASGKPQIELLLLLFVIQSRARDCKALEDAAAKVKGRV 364


>gi|329663327|ref|NP_001192753.1| SET domain-containing protein 4 [Bos taurus]
 gi|296490853|tpg|DAA32966.1| TPA: SET domain containing 4 [Bos taurus]
          Length = 440

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 121/257 (47%), Gaps = 19/257 (7%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 36  LKKWLKDRRFEDTTLIPAHFPGTGRGLMSKTSLQEGQTIISLPESCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWQPPPSPLLALCTFLVSEKHAGDRSPWKPYLEVLPKA-YTCPVCLEPEVVNLL-P 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           + ++ +A E+ ++V   ++  R    S  P LF E V   F+    +W++  + +R V +
Sbjct: 153 NPLKTKAWEQRSHVWEFFSSSRGFFSSLQP-LFSEAVETIFSYRALRWAWCAVNTRAVYM 211

Query: 254 ---------PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                    P  D   AL P+ D+LNHS +V+    +++ ++     T  +    ++VFI
Sbjct: 212 KRPPLLCLSPEPDT-CALAPYLDLLNHSPDVQVKAAFNEETRCYEIRTATRCGKHKEVFI 270

Query: 305 SYGKKSNGELLLSYGFV 321
            YG   N  LLL YGFV
Sbjct: 271 CYGPHDNHRLLLEYGFV 287


>gi|297738159|emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 117/265 (44%), Gaps = 41/265 (15%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP-LLATYLIS 158
            G RGL A +++ +GE +L VP S ++T+ S     +    +K+ +    P +L   L++
Sbjct: 46  AGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLA 105

Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERI----TNVIG 212
           E S  KSS W  Y+  LPR   +L  +++ E    +  +A  + ERAI +        I 
Sbjct: 106 EMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIP 165

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
              +L+L+           ++ N   + W+   + SR + +P  D    L P  D  N++
Sbjct: 166 LMEELKLK----------PQLQNFRAWLWASSTVSSRTMHIP-WDDAGCLCPVGDFYNYA 214

Query: 273 ------CEVETFLD---------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
                 C  E   D               Y +      F   + Y+ GEQV +SYG  +N
Sbjct: 215 APGEEPCGWEDLKDAEQDDVLSQRLTDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTN 274

Query: 312 GELLLSYGFVPREGTNPSDSVELPL 336
            ELL  YGF+  E  NP+D   +PL
Sbjct: 275 LELLEHYGFLLDE--NPNDKAFIPL 297


>gi|143584415|sp|Q5ZK17.2|SETD6_CHICK RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
          Length = 447

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 131/293 (44%), Gaps = 30/293 (10%)

Query: 82  WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
           W   +G  L P+    ++  V   GL+A  ++  GE L  VP S ++   S+ +C     
Sbjct: 24  WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80

Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
             +A E L+  SV  W  L   L+ E +   +SRW  Y S    Q +S L    +W   E
Sbjct: 81  LHDAQESLQSQSV--WVPLLLALLHEYT-TGTSRWRPYFSLW--QDFSSLDHPMFWPEEE 135

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
             R L+ + I E   + + N+   Y+ + L     +PD+F  E+  +E +K     + + 
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195

Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
             + P  +            +VP AD+LNH       L+Y  +   +V  T +    G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLEYAPTCLRMV--TTQPISKGQE 253

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           +F +YG+ +N +LL  YGF      N +D+ ++ +   +     + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKNEAQQQ 306


>gi|158508540|ref|NP_001025734.2| N-lysine methyltransferase SETD6 [Gallus gallus]
          Length = 447

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 131/293 (44%), Gaps = 30/293 (10%)

Query: 82  WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
           W   +G  L P+    ++  V   GL+A  ++  GE L  VP S ++   S+ +C     
Sbjct: 24  WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80

Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
             +A E L+  SV  W  L   L+ E +   +SRW  Y S    Q +S L    +W   E
Sbjct: 81  LHDAQESLQSQSV--WVPLLLALLHEYT-TGTSRWRPYFSLW--QDFSSLDHPMFWPEEE 135

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
             R L+ + I E   + + N+   Y+ + L     +PD+F  E+  +E +K     + + 
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195

Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
             + P  +            +VP AD+LNH       L+Y  +   +V  T +    G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLEYAPTCLRMV--TTQPISKGQE 253

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           +F +YG+ +N +LL  YGF      N +D+ ++ +   +     + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKSEAQQQ 306


>gi|302754812|ref|XP_002960830.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
 gi|300171769|gb|EFJ38369.1| hypothetical protein SELMODRAFT_402221 [Selaginella moellendorffii]
          Length = 393

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 14/185 (7%)

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRE 201
           +V  W  LA  ++ E  ++  + W+ YIS LP QP  L     W   EL  YL AS +  
Sbjct: 160 TVKPWTKLALIVLME-RYKGQAIWAPYISCLP-QPAELDNTFRWEDTELS-YLRASPLYG 216

Query: 202 RAIERITNVIGTY----NDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPS 255
           +A ER+  +   +    ND    +  +  D++P+     ++E  K  +  +FSR   L  
Sbjct: 217 KARERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGKVSLEDLKHVYATVFSR--SLAI 274

Query: 256 MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
            +    L+P  D  NH+      L ++      V T DR Y   +Q++I+YG  SN EL 
Sbjct: 275 GEDSTTLIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELA 334

Query: 316 LSYGF 320
           L YGF
Sbjct: 335 LDYGF 339


>gi|351701197|gb|EHB04116.1| SET domain-containing protein 3 [Heterocephalus glaber]
          Length = 705

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 47/155 (30%), Positives = 76/155 (49%), Gaps = 9/155 (5%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQGVVF 290
           F  E ++W+   + +R  ++P+ DG    +AL+P  DM NH+  + T   Y+        
Sbjct: 348 FTYEDYRWAVSSVMTRQNQIPTEDGSRVTLALIPLWDMCNHTNGLIT-TGYNLEDDRCEC 406

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
              + +Q GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E
Sbjct: 407 VALQDFQAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAE 464

Query: 351 ALRKYGLSA---SECFPIQITGWPLELMAYAYLVV 382
            L + G+     S  F +  T  P+     A+L V
Sbjct: 465 VLARAGIPTYVWSSVFALHFTEPPISAQLLAFLRV 499


>gi|414881221|tpg|DAA58352.1| TPA: hypothetical protein ZEAMMB73_556117 [Zea mays]
          Length = 270

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/55 (65%), Positives = 42/55 (76%)

Query: 375 MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSI 429
           MAYA+LVVSPP M   FEEMAAA SNK +SK  +  P ++EQ LQFILD CES+I
Sbjct: 1   MAYAFLVVSPPDMSQCFEEMAAAKSNKTSSKPGLNYPGLEEQTLQFILDCCESNI 55


>gi|307195794|gb|EFN77608.1| SET domain-containing protein 3 [Harpegnathos saltator]
          Length = 245

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/206 (31%), Positives = 99/206 (48%), Gaps = 13/206 (6%)

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVV 289
           +FN E   W+   + +R   +PS DG     AL+P  DM NH     T  D++ +S    
Sbjct: 1   MFN-EFSSWAVSTVMTRQNLVPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDRCE 58

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
               R +Q GEQ+FISYG ++N +  +  GFV  +  N  D  +L L + K+D   KE+ 
Sbjct: 59  CYALRNFQKGEQIFISYGPRTNSDFFVHSGFVYMD--NEQDGFKLRLGISKADSLQKERT 116

Query: 350 EALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIK 409
           E L K  L +   F ++    P+  M  A+L V   SM+ K E      S+K+   K + 
Sbjct: 117 ELLGKLDLPSVGEFLLKPGTEPISDMLLAFLRVF--SMR-KAELAHWLRSDKVFDLKHMD 173

Query: 410 CP---EIDEQALQFILDSCESSISKY 432
           C     ++E   +F+L   +  I+ Y
Sbjct: 174 CALETVVEENVRKFLLTRLQLLIANY 199


>gi|444915331|ref|ZP_21235465.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
 gi|444713560|gb|ELW54457.1| SET domain containing protein [Cystobacter fuscus DSM 2262]
          Length = 449

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 165/375 (44%), Gaps = 26/375 (6%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           S L +WL + G    K+ + + + GER ++A   I  GE +L VP + ++T +      +
Sbjct: 19  SNLLRWLEEGGARFPKLQLVRREDGERAVLAQAPISAGETVLQVPRTHMLTLELARES-D 77

Query: 137 AGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
            G  + +   PD     LA++L+ E   E  S W  YI +LP     +  +  +E    L
Sbjct: 78  IGRAIAEGLDPDNEDLYLASFLLQEKHRE-GSFWKPYIDSLPESYSQMPLFYGSEEHALL 136

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           +       A+  +T+   +  +  L +    P     E F    F W+   + SRL  L 
Sbjct: 137 KGC----FALTLLTHQAQSLREDYLSLCQNVPGY---ERFTPGEFVWARLSVSSRLFSLK 189

Query: 255 SMD--GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
                G+  LVP ADMLNH    +   +  +  +  V   +     G++V  SYG KSN 
Sbjct: 190 KGGFLGQT-LVPMADMLNHRRPPDVLWETTEDGESFVMKANNAVAAGDEVHDSYGAKSND 248

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI-QITGWP 371
            +LL +GFV  +  N  D   L L +   D     K   L     +A+  F I +     
Sbjct: 249 LMLLHFGFVTDD--NEHDEAFLGLRILDGDPLAATKQMLLMLPSPTAARPFKISRPYVHT 306

Query: 372 LELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCESS 428
              MA+++L ++  ++    E++    S+++ S +    P   E +E  L+ +  +C++ 
Sbjct: 307 TTRMAFSFLRIA-AAVPNDIEDI----SSRVMSGERALGPLSVENEENVLELLAATCQAR 361

Query: 429 ISKY-SRFLQVKELL 442
           +S + +   Q +ELL
Sbjct: 362 LSIFPTSLAQDEELL 376


>gi|219110715|ref|XP_002177109.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411644|gb|EEC51572.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 531

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 140/337 (41%), Gaps = 58/337 (17%)

Query: 92  KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL----------------VITADSKWSCP 135
           K+ +    V   GLVA + IRKGE  L +P  +                V++ D      
Sbjct: 73  KVTVAPSSVNRLGLVATEKIRKGEVFLAMPYDVRYELSADLARNVVFKDVLSEDYNSWTG 132

Query: 136 EAGEVL-----KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTR 187
           +AG +      + C   D  L     I + S +  +  S +++ALP  P  +   L W+ 
Sbjct: 133 DAGLIALLILNEVCLAADTGLGTKEPIRQNSLQ--AFMSAWVAALP-GPEDINHPLLWS- 188

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP-------EEV--FNMET 238
            E D+ +  S    R    + ++      L+  +F K  + FP       EE+  F++  
Sbjct: 189 -EEDQEILQSSSTNRIYRVLDDIEEDVTWLKTNVFEKDGNRFPVSIPWNGEEIPCFSLTG 247

Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV-----ETFLDYDKSSQGVVFTTD 293
           FKW+  +  SR   +   D  V L+P  D  NH+ E        F+    +++G      
Sbjct: 248 FKWAMALAQSRSFFV---DNAVRLLPLMDFCNHADEGTEEARAGFMGTFGTTKGAELVAG 304

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + Y+ GE+VFI YG KS  + LL + F P +    + S EL   +   D+ Y +KL+ L 
Sbjct: 305 QSYEVGEEVFICYGPKSAADYLLEHAFCPEQSWKTAVS-ELFFEVDPKDRFYDDKLDILE 363

Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
                AS   P+Q           ++ VVS P   G+
Sbjct: 364 FETYDASPMDPVQ-----------SFDVVSAPGRDGE 389


>gi|410962953|ref|XP_003988033.1| PREDICTED: histone-lysine N-methyltransferase setd3 [Felis catus]
          Length = 591

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 81/324 (25%), Positives = 138/324 (42%), Gaps = 48/324 (14%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E + + +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYSQDRILQAMGNITLAFHLLCERA-DPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITN----------VIGTY---NDLRLRIFSKYPDLFPEEVFNMET 238
           R L+++Q       +  N          VI T+   N L L+    Y D +   + ++  
Sbjct: 195 RDLQSTQAIHDVFSQYKNTARQYAYFYKVIQTHPHANKLPLKDAFTYED-YRLGLVSLAL 253

Query: 239 FKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
            +W+ G+     +   +  G+  +    ++ +  CE     D+               + 
Sbjct: 254 GRWALGLECGVGI---ARCGKPQITTGYNLEDDRCECVALQDF---------------RA 295

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L + G+ 
Sbjct: 296 GEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIP 353

Query: 359 ASECFPIQITGWPLELMAYAYLVV 382
            S  F +  T  P+     A+L V
Sbjct: 354 TSSVFALHFTEPPVSAQLLAFLRV 377


>gi|297836754|ref|XP_002886259.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332099|gb|EFH62518.1| hypothetical protein ARALYDRAFT_319874 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 541

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/276 (26%), Positives = 126/276 (45%), Gaps = 23/276 (8%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E  S L +W  D+G+   K+ I  +D   RG +A ++++ G+  L +P S +I+ +  ++
Sbjct: 141 EKESRLVEWGQDNGVK-TKLQIAHIDGYGRGAIASEDLKFGDVALEIPISSIISEEYVFN 199

Query: 134 CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAEL 190
                 + K   +    ++  + + E      S++  Y  +L        S       EL
Sbjct: 200 SDMYPILEKIDGITSETMVLLWTMREKH-NLDSKFKPYFDSLQENFCTGMSFGVNAIMEL 258

Query: 191 DRYL---EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
           D  L   E  Q +E   ER       Y++L + + S +  +FP E +  E + W+  + +
Sbjct: 259 DGTLLLDEIMQAKELLRER-------YDEL-IPLLSNHRHVFPPEHYTWEHYLWACELYY 310

Query: 248 SRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYDK---SSQGVVFTTDRQYQPGEQV 302
           S  +++   DG++   L+P A  LNHS      + Y K    +  + F   R    GEQ 
Sbjct: 311 SNSMQIKFPDGKLKTCLIPVAGFLNHSI-YPHIVKYGKVCVETSSLKFPVSRPCNKGEQC 369

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           F+SYG  S+  LL  YGF+P+ G NP D + L   +
Sbjct: 370 FLSYGNYSSSHLLTFYGFLPK-GDNPYDVIPLDFDV 404


>gi|402077770|gb|EJT73119.1| hypothetical protein GGTG_09969 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 377

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 101/221 (45%), Gaps = 15/221 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+ A +++++GE +L+VP  LV    S  + PE              LLA  L      E
Sbjct: 32  GMAAGRHLKEGEDILYVPTGLV---RSLHTVPEHVSRKLPSDTSIHALLAADLTVNGMTE 88

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
            +  W + +  L      + +    +L   L        A E + N +G ++    R+  
Sbjct: 89  LAL-WRDCLPTLADFSTGMPFMWHKKLQELLPKP-----ARELLENQLGNFHRDWARVTK 142

Query: 224 KYPDLFPEEVFN--METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY 281
            +PDL  E+  +  +     SF     ++   P  D R+ALVP AD+ NH+   +T    
Sbjct: 143 AFPDLQQEDYLHNWLAVSTRSFYYWTPQMELYPPAD-RLALVPIADLFNHA---DTGCGA 198

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
             +  G V +TDR+Y  G++++ISYG  +N  LL  YGFVP
Sbjct: 199 SFTPDGFVVSTDRKYHVGQEIYISYGTHTNDLLLAEYGFVP 239


>gi|299473350|emb|CBN77749.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 563

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 52/142 (36%), Positives = 71/142 (50%), Gaps = 12/142 (8%)

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETFK 240
           ++WT  E+ R L+ S +  +  ER   + G Y  +         DL+P   +V  +E FK
Sbjct: 213 IFWTEEEM-RLLQGSYLVTQVEERNQAIEGDYGVI--------CDLYPPFRDVATLEEFK 263

Query: 241 WSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           W+   + SR   L     R  ALVP+ADMLNH    ET   YD +  G   TT  +   G
Sbjct: 264 WARMCVCSRNFGLDINGLRTSALVPYADMLNHYRPRETKWTYDNNRGGFTITTLHRILGG 323

Query: 300 EQVFISYGKKSNGELLLSYGFV 321
            QV+ SYG+K N   LL+YGF 
Sbjct: 324 AQVYDSYGQKCNHRFLLNYGFA 345


>gi|145344497|ref|XP_001416768.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576994|gb|ABO95061.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 514

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 70/270 (25%), Positives = 115/270 (42%), Gaps = 24/270 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA-- 160
           RG+   +N+  GE L  VP    + A S          +         +LA +++ EA  
Sbjct: 83  RGVATTRNVSAGELLAEVPLEKCLCAASARMDARLWRAIGASGASGDAILAAHVLREAFD 142

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT-----YN 215
           +  KS+ W  ++  LPR   S + W   EL      S++    +   T  I       Y+
Sbjct: 143 AGSKSAYWP-WLRLLPRDVDSTVGWNEDEL------SELSGSNVVVFTRAIKAQWRMEYD 195

Query: 216 DLRL-RIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWAD 267
            L +  +  K+PD+F  E    +  + F W+  I++SR + L +          LVP  D
Sbjct: 196 ALDVPTLGEKFPDVFGGERAAHYTFDKFTWARFIIWSRAIDLSTESAEAPTIRVLVPLLD 255

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           M NH+   +   ++D  S  V       ++   ++  +Y  K +   LL YGF+P   TN
Sbjct: 256 MANHAPGGKLRPEWDARSNAVKVYAASAFREHTELRFNYDTKPSQYFLLQYGFIPE--TN 313

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
           P++ VE  + +   D     K E LR +GL
Sbjct: 314 PAECVEATVRVSDHDSLRDAKEELLRLHGL 343


>gi|224042477|ref|XP_002188626.1| PREDICTED: SET domain-containing protein 4 [Taeniopygia guttata]
          Length = 457

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/285 (25%), Positives = 126/285 (44%), Gaps = 37/285 (12%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL + G     +   +     RGL+  K ++ G+ ++ +P   ++T  +  S    G
Sbjct: 35  LKKWLKERGFEDSNLRPAEFWETGRGLMTTKALQAGDLIISLPEKCLLTTGTVLSSCLGG 94

Query: 139 EVLKQCSVPDWPLLA--TYLI------------------SEASFEKSSRWSNYISALPRQ 178
            + K    P  PLLA  T+LI                  +E    + S W  Y+  LP+ 
Sbjct: 95  HIEKW-KPPVSPLLALCTFLIGQNLELLECFQFLLVNGIAEKHAGQKSPWKPYLDVLPK- 152

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFN 235
            Y+       ++   L    ++++A E+   +   +   R   FS    LF E+   +FN
Sbjct: 153 AYTCPACLEPDIINLL-PKPLQKKAQEQKMLIQELFQSSR-AFFSSLQPLFAEDTGNIFN 210

Query: 236 METFKWSFGILFSRLVRLP-------SMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQG 287
               +W++  + +R + +        S++  V AL P+ D+LNHS  V+    +++ ++ 
Sbjct: 211 FSALQWAWCTVNTRTIYMKHPHRECFSLEPDVYALAPYLDLLNHSPNVQVKAGFNEQTRS 270

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
               TD Q +  ++V I YG   N  LLL YGFV  +  NP  SV
Sbjct: 271 YEIWTDSQCKKYQEVLICYGPHDNQRLLLEYGFVATD--NPHSSV 313


>gi|449662705|ref|XP_002165483.2| PREDICTED: uncharacterized protein LOC100209819 [Hydra
           magnipapillata]
          Length = 819

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 145/334 (43%), Gaps = 50/334 (14%)

Query: 32  PRKRCGHRIVVHCSVSTT-----NDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDS 86
           P+K C  R +     +T      N+    K+   QN+  W  E   L       +W   +
Sbjct: 55  PKKFCNIRDIPIMQPNTVITFSDNEKDNDKSVGGQNIEQWTVEKKLL----NFLQWCKAN 110

Query: 87  GLP-PQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADS-----------KWS 133
            L    K+ +       R G++A ++I+KGE L  VP  L++  ++           KW 
Sbjct: 111 NLNLSSKVKVDFNGTSHRYGMLATEDIKKGEVLFTVPRQLLLNQNTATLKNRLNEFEKW- 169

Query: 134 CPEAGEVLKQCSVPDW-PLLATYLISEASFEKSSRWSNYISALPR-----QPYSLLYWTR 187
               G+ L   S   W PLL T L+ E + +K S W++Y+  +P       P   L+W  
Sbjct: 170 LDTHGKSLNDSS--GWLPLLIT-LMWEFN-QKDSFWASYLLLVPEISEFGHP---LFWKE 222

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGIL 246
            E +   +   +    I    N+   Y +  L    +  DLF   E +++E FK     +
Sbjct: 223 EEYNLEFQGMPLLNDIIVDRENIETEYAEFVLLFLRRNKDLFGSLENYSLEFFKRMVAFV 282

Query: 247 ----FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
               F+     PSM      VP AD+LNH       L + KS+  ++  + R+ + GE+V
Sbjct: 283 MAYSFTEDEESPSM------VPMADILNHHSNNNAHLVFHKSNLQMI--SIRRIKKGEEV 334

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           F ++GK  N ELL  YG+V    +N  DS+ LP+
Sbjct: 335 FNTFGKLGNTELLQMYGYVEI-PSNQYDSLLLPV 367


>gi|358056251|dbj|GAA97802.1| hypothetical protein E5Q_04481 [Mixia osmundae IAM 14324]
          Length = 433

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 78/289 (26%), Positives = 129/289 (44%), Gaps = 37/289 (12%)

Query: 74  ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           E    L  W   +G  + P        D+G  G+ A  N+R   +L  +P SLV++  + 
Sbjct: 5   EGLKVLLDWFKSNGGSVQPHVEFASYPDMG-CGMRATSNLRSETELFSIPRSLVLSVHTS 63

Query: 132 ---WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
               S P+  E+    S   W  L   L+ E   + +S W  Y++++P    SL++W+  
Sbjct: 64  PLPKSLPDWSEI----STQGWVGLILCLMYE-QIDPASHWKRYLNSMPTCFDSLMFWSDD 118

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PEEVFNMETFKWSFGILF 247
           EL R L+ S + ++ I R     G+Y  + +   SK+ D+F P E +++  +     ++ 
Sbjct: 119 EL-RELQGSSVLDK-IGR-EEAEGSYYSILVPYLSKHADIFKPLEAYSLALYHRCGSLIL 175

Query: 248 SRLVRLPSMDGR-----------------VALVPWADMLN-HSCEVETFLDYDKSSQGVV 289
           SR   + + D                   V +VP AD+LN  S      L Y      +V
Sbjct: 176 SRSFHVSNQDDSASDASDDDDAAYHEVETVGMVPMADVLNAKSGSANACLVY--HPDALV 233

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            TT ++   GEQ+F +Y    N +LL  YG V  +  N +D+VE+   L
Sbjct: 234 MTTTKEIAAGEQIFNTYNDPPNADLLRRYGHV--DEVNLNDNVEISADL 280


>gi|158295743|ref|XP_001688855.1| AGAP006364-PD [Anopheles gambiae str. PEST]
 gi|347965224|ref|XP_003435732.1| AGAP013401-PA [Anopheles gambiae str. PEST]
 gi|333469389|gb|EGK97284.1| AGAP013401-PA [Anopheles gambiae str. PEST]
          Length = 451

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 69/252 (27%), Positives = 108/252 (42%), Gaps = 66/252 (26%)

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPR---QPY-----SLLYWTRAELDRYLEAS 197
           +P   LLA YL       KS+ +  Y+ +LP+    PY      L+Y +   L R +E +
Sbjct: 109 LPFQALLAFYL----CVTKSAHFDAYLQSLPQTFSNPYFCTKQELVYLSEVLLQRMVEQN 164

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
            + +  +ERI +V+               D + + V  +E FKW++ ++ +R V L  M 
Sbjct: 165 GLIKSGLERINSVL--------------RDEWKDTV-ELERFKWAYFVVNTRSVFLDPMA 209

Query: 258 GRV---------------------ALVPWADMLNHSCEVETFLDYDKSSQGVV------- 289
            ++                     AL P+ D  NH C  +T      S+  +        
Sbjct: 210 VKMINSFLPSGSLFEDFLADEPSMALAPFLDFFNHRCGAKTVNGLSLSTSQIRDCLLKER 269

Query: 290 -------FTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
                    TD  Y+ GEQ+FISYG  +N +LLL YGF +P   +NP D VEL +    +
Sbjct: 270 PLELYYNLHTDTAYRAGEQIFISYGTHNNTKLLLEYGFSIP---SNPDDFVELTIGTINA 326

Query: 342 DKCYKEKLEALR 353
              +  +L  LR
Sbjct: 327 FMKHDPELRCLR 338


>gi|255945819|ref|XP_002563677.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588412|emb|CAP86520.1| Pc20g11910 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 487

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 114/272 (41%), Gaps = 49/272 (18%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
           P  ++A  +     RG+VA  NI +GE+L  VP ++V+T  +       GE L++   P 
Sbjct: 34  PKLRLADLRATGAGRGVVAQSNISEGEELFSVPRAMVLTVQNSELRTLLGENLEEQMGP- 92

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER-----A 203
           W  L   ++ E    + SRW+ Y   LP +  +L++W+ AEL + L+AS I E+     A
Sbjct: 93  WLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSPAEL-QELQASTIVEKIGRSGA 151

Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFP--------------EEVFNMETFKWSFGILFSR 249
            E I N I         I +K PDLFP                +  +     S  + ++ 
Sbjct: 152 EESIRNSIAP-------ILAKRPDLFPPPQGLASWEGDAGDAALIQVGHIMGSLIMAYAF 204

Query: 250 LVRLPSMDGR--------------------VALVPWADMLNHSCEVETFLDYDKSSQGVV 289
            +     DG                       +VP AD+LN   +      Y +    +V
Sbjct: 205 DIEKSEDDGDEGEANDESYMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEG-ALV 263

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
               +  Q GE++F  YG+    +LL  YG+V
Sbjct: 264 MKAIKPIQQGEEIFNDYGEIPRADLLRRYGYV 295


>gi|242009061|ref|XP_002425311.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
 gi|212509085|gb|EEB12573.1| SET domain-containing protein, putative [Pediculus humanus
           corporis]
          Length = 399

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 69/232 (29%), Positives = 122/232 (52%), Gaps = 17/232 (7%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLI 157
           D G RG+   K + KG+ L+ +P +L+IT  S+    +A + L   ++ D  L L+ +L+
Sbjct: 45  DTG-RGVKCRKKLEKGDLLIALPLNLLITPTSQ---SDAYKFLNDENIVDPQLRLSIFLM 100

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYW-TRAELDRYLEASQIRERAIERITNVIGTY-- 214
            E   +  S++ NYI  LP Q YS +Y+ T +E+   L    I++  + + T++   +  
Sbjct: 101 YENHLKNDSKYFNYIQTLP-QSYSNVYFCTDSEIQ--LLPDLIKKLVVTQKTDLEFLFEK 157

Query: 215 --NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADML 269
             N+L   I S + D   ++++N   F W++  + +R V       R   +AL P+ DM 
Sbjct: 158 LQNNLNDEICS-HCDKSIKKLYNRYEFIWAWFTVNTRSVYYEDKSMRKKSLALAPFLDMF 216

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           NHS +  T +  D  ++  +  T   ++  +Q+FI YG  SN +LL+ YGF+
Sbjct: 217 NHSSDANTKMYIDFDNELYILKTLNSFRKHQQIFIKYGPHSNLKLLIEYGFI 268


>gi|268573124|ref|XP_002641539.1| C. briggsae CBR-SET-27 protein [Caenorhabditis briggsae]
          Length = 483

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 80/314 (25%), Positives = 130/314 (41%), Gaps = 65/314 (20%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA +L       + S+W +YIS LP    + L++T  +L + L+ S I E A+     + 
Sbjct: 136 LALFLAVHWLQNEKSKWHSYISILPNSFPTPLFYTEEQLLQ-LKPSPIFEEALTFYRTIA 194

Query: 212 GTY-----------------------NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
             +                       N + + IF   P  F    F    + WS G + +
Sbjct: 195 RQFCYFLLAVAKNKIYESAQRRKDARNTMDVPIFYNAP--FTVYNFTPRLYFWSVGTVTT 252

Query: 249 RLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLD----YDKSSQGVVFTTDRQ 295
           R+  +PS +G           AL+P  DM NH   V   +D    Y  + +  V T+   
Sbjct: 253 RVNMVPSENGSGDDGKAIMIPALIPLLDMANHESVVTDPVDDLVCYAPADECAVITSHCD 312

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
            + G++V I YG +S GE L+  GF+P       D  +L + + KSDK  + K + + KY
Sbjct: 313 LEAGKEVTIFYGCRSKGEHLIHNGFIPINHQK-QDFFKLKIGIPKSDKTLEAKKKLIEKY 371

Query: 356 GLSA---SECFPIQITGW-----PLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD 407
             +       F + +  +     PL+L+ +A + V P        E   AA  K      
Sbjct: 372 VQNVYCTGNIFHVDLYNYPEQPFPLDLLMFAAIFVCP--------EATDAAITK------ 417

Query: 408 IKCPEIDEQALQFI 421
              PEI ++ L+F+
Sbjct: 418 ---PEIRKKGLEFL 428


>gi|324503528|gb|ADY41532.1| SET domain-containing protein 3 [Ascaris suum]
          Length = 502

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/328 (22%), Positives = 135/328 (41%), Gaps = 51/328 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW------PLLATYLI 157
           GL A  + ++  +LL VP   +++    W       +LK+C   D        +    ++
Sbjct: 109 GLEATHSFKQDAELLRVPRKAMLS----WDQARKSAMLKKCFEQDMIVKTMDNVALALMV 164

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-----LEASQIRERAIER--ITNV 210
                   S W  Y+ ALP+   + LY++  EL +       E S I  R + R  +  +
Sbjct: 165 CCQKLSPDSSWLPYLDALPQTFSTPLYFSALELRKLSPSPAYEESLIMYRNVARQFVYFL 224

Query: 211 IGTYNDLRLR---------------IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
                  R R               +F   P  F    F  + ++W+   + +R+  +PS
Sbjct: 225 AAVQRSERSRSAKKDKNHAAVGMEPLFLNAP--FTVSNFTFDLYRWAVACVTTRINFIPS 282

Query: 256 MDGRVA---------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
              + +         L+P  DM NH  +    + +            + Y+ G++V I Y
Sbjct: 283 QYAKDSNGQPVAVPCLIPLLDMANHEFDHPLTVHFSTEGDYASIKATKDYKAGDEVTIFY 342

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPI 365
           G ++N +  L  GFVP +G N +D+ +L +   + DK  + +L+ +   G +A S  F  
Sbjct: 343 GIRTNRQFFLHNGFVP-DGENKNDTYKLKIGFPRGDKQVRARLKLMHDAGFNAESRVFVF 401

Query: 366 QITG----WPLELMAYA--YLVVSPPSM 387
           ++       PL L+ +A  +LV +P S+
Sbjct: 402 EVNASERPVPLSLLDFARVFLVENPDSV 429


>gi|189237481|ref|XP_001810520.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270006984|gb|EFA03432.1| hypothetical protein TcasGA2_TC013422 [Tribolium castaneum]
          Length = 413

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/338 (26%), Positives = 150/338 (44%), Gaps = 44/338 (13%)

Query: 70  IDSLENASTLQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
           +D  ENA  L+K+++ +G   P  + ++      RG+   +N+++ + L+ VP  L+I+ 
Sbjct: 17  LDHEENAINLRKFMARNGFNDPINLKLRNFPDTGRGVATPRNLKESDVLITVPYELMISY 76

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY---SLLYW 185
            +            +  +    LL  +L+ E   E +S W +YI +LP QP    +LL  
Sbjct: 77  TTLQKSNFLHLFTPESRLSIVDLLTAFLVIERDKE-NSFWRDYIKSLPPQPPWIPALLSQ 135

Query: 186 TRAEL---DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
            R EL   D  L A + R R +E       +++ LR  I  +        V ++ +F W 
Sbjct: 136 DRVELLPADLRLAAKKSR-RLLEE------SWSRLRKSIRRE-----ASCVIDLHSFIWG 183

Query: 243 FGILFSRLVRLPSMDGR---------------VALVPWADMLNHSCEVET--FLDYDKSS 285
           + ++ +R V +     R               +AL P+ DM NHS E +T   L  D+  
Sbjct: 184 YVLVNTRAVYVNPRIVRELCDCGSDILSDEPCMALCPFLDMFNHSHEAKTEATLMNDQGK 243

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN---PSDSVELPLSLKKSD 342
                TT    +  EQVFISYG   N +LL+ YGF     +N   P  S E+   L+ + 
Sbjct: 244 FVYQLTTLVGTRKHEQVFISYGDHDNVKLLIEYGFFIPGNSNDSIPIQSEEVFRVLEPNL 303

Query: 343 KCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYL 380
             ++ K   +R + L   +C  +   G    L A+ ++
Sbjct: 304 NDFQYKF--IRSHNL--DKCLYLTEAGASFNLKAFLFV 337


>gi|301112144|ref|XP_002905151.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262095481|gb|EEY53533.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 510

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 64/255 (25%), Positives = 124/255 (48%), Gaps = 24/255 (9%)

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASF 162
           +  +N+  G +LL +P S V++ +S  +    G +L+    PD P    L  +L+ E + 
Sbjct: 107 ITAENVEVGSELLSLPMSQVMSVESA-ARGRVGLLLEVN--PDLPSAIALGLHLLEERAL 163

Query: 163 EKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
             +S +S++++ LP      S L+++  E++  LE SQ++   + R   V   Y+ L   
Sbjct: 164 GAASNFSDFVATLPTIEAINSTLFYSEDEMNE-LEGSQLQRFTLGRAQAVEAFYDALVQP 222

Query: 221 IFSKY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML-------- 269
           + S+    P +F +  F ++ F+W+ G+++S   +    +  V L P  + +        
Sbjct: 223 VTSREAVDPPIFHKSEFTLDKFRWAMGVVWSSTFQFGENEDDVILAPVLNTIGICTDLNQ 282

Query: 270 --NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
             N +C  ET +  D  +Q +       Y   ++V +S   KS+ +L+LS+GF  R   +
Sbjct: 283 EGNEACP-ETSIKVDTDTQRLTVYASVAYSKSQEVRLSMPGKSSTQLMLSHGFA-RARAS 340

Query: 328 PSDSVELPLSLKKSD 342
             D ++L ++L  SD
Sbjct: 341 KLDKLDLTVTLDPSD 355


>gi|323447496|gb|EGB03414.1| hypothetical protein AURANDRAFT_72732 [Aureococcus anophagefferens]
          Length = 403

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 66/254 (25%), Positives = 118/254 (46%), Gaps = 14/254 (5%)

Query: 81  KWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           +WL+++G      + ++  D   RG+ A +++   E L+ VP   +IT +   +      
Sbjct: 36  QWLTENGGKFADCVELRSYDDEVRGVHATRDLETEEILVEVPLKCLITVEMGKATDVGRA 95

Query: 140 VLK---QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLE 195
           VL+   +   P    L  +++ +   + S+ ++ Y   LP    ++ ++W   EL+ +L+
Sbjct: 96  VLEAELELDAPKHVFLMLFVLLDRR-DSSTFFAPYYDILPSTLSNMPIFWQPDELE-WLK 153

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
            S +  +  ER   +   Y      I   +P     +V  +E FKW+   + SR   +  
Sbjct: 154 GSYLLTQIEERKRAIKADYE----AICGIWPSFI--DVCTLEEFKWARMCVCSRNFGVVV 207

Query: 256 MDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
              R  A+VP+ADMLNH    ET   +D S      T+ ++   G Q++ SYG+K N   
Sbjct: 208 NGARTSAMVPYADMLNHFRPRETKWTFDNSRGAFTITSLQKISVGSQIYDSYGQKCNHRF 267

Query: 315 LLSYGFVPREGTNP 328
           LL+YGF   +   P
Sbjct: 268 LLNYGFAIEDNKEP 281


>gi|428171155|gb|EKX40074.1| hypothetical protein GUITHDRAFT_113813 [Guillardia theta CCMP2712]
          Length = 353

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 109/243 (44%), Gaps = 43/243 (17%)

Query: 101 GERGL--VALKNIRKGEKLLFVPPSLVITADSKWSCP----EAGEVLKQCSVPDWPLLAT 154
           G+ GL    +K++++GE LL +P   ++  D   +CP      GE+ +  ++P +  LA 
Sbjct: 44  GDNGLELRLVKDVKRGEVLLAIPRRAILEIDDAATCPCKEYITGEMWQ--AIPSYAKLAI 101

Query: 155 YLI-SEASFEKSSR-WSNYISALPRQPYSLLYWTRAEL----DRYL-EASQIRERAIERI 207
           YL+ S    E+  R   +Y   LP+Q  S   W+   +    D Y+ E  Q R R I+R+
Sbjct: 102 YLLYSIDHAEQDPRPLRDYFDVLPKQVLSTFSWSEEAIQELQDPYMIEQIQTRRRKIQRL 161

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
            + I     L  RI               +   W+  I+ SR        G        D
Sbjct: 162 FHEI--QKGLSPRI-------------TYDRLLWAIEIVLSRAFAFSRTGG-------DD 199

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           ++     V+    YD S Q      ++ ++ G+ V ISYG KSN ELLLSYGF+  +  N
Sbjct: 200 LVFSGTSVK----YDNSKQEFQIVAEKDFKVGQSVEISYGLKSNHELLLSYGFILPD--N 253

Query: 328 PSD 330
           P D
Sbjct: 254 PED 256


>gi|308810511|ref|XP_003082564.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116061033|emb|CAL56421.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
          Length = 1472

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 117/245 (47%), Gaps = 15/245 (6%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RG   ++++++G+ LL VP     + D   +  E  E+ K C   D  ++A ++  E   
Sbjct: 692 RGHGVVRDVQRGDVLLEVPLRRGFSYDDAMADDEMREIAKACVRRD-DVVALHVCLERYR 750

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE-RAIERITNVIGTYNDL--RL 219
            K ++ + ++ ALP+       W+  EL   +  + +++ RA+  I      Y+ +  RL
Sbjct: 751 GKEAKHAAHVEALPKTFDCAFNWSEDELSELVGTTCLKDTRAL--IEETREDYDAIGRRL 808

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVET 277
               K   L  E   + E + W+   L+SR   L   DG    A++P+ D+ NHS E   
Sbjct: 809 MAMGKGGWLL-ERGVDYERYAWARQCLWSRQCDLMRPDGTRTRAMIPYFDIFNHSPEAPL 867

Query: 278 FLDYDKSSQG--VVFTTDRQYQPGEQVFISY--GKKSNGELLLSYGFVPREGTNPSDSVE 333
              +  +++   V     R Y+ GEQ FISY  G+ +N +LL  YGF      NP + ++
Sbjct: 868 GKTHKLNAERNCVTVYAGRDYKEGEQAFISYGSGEAANAKLLTWYGFCIE--NNPYEELD 925

Query: 334 LPLSL 338
           L L++
Sbjct: 926 LTLTI 930


>gi|388579878|gb|EIM20197.1| RuBisCO-cytochrome methylase [Wallemia sebi CBS 633.66]
          Length = 447

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 139/323 (43%), Gaps = 52/323 (16%)

Query: 73  LENASTLQKWLSDSGLPPQKMAI---QKVDVGERGLVALKNIRKGEKLLFVPPSLVI-TA 128
           + +++   +W + +G    K  +   + VD   RGLVA+ +I+    L  +P  +V+ T 
Sbjct: 1   MTDSAKFLEWFTTNGGEFSKDIVAIGENVDGMGRGLVAVADIKAQTSLFTIPRDIVLSTR 60

Query: 129 DSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
            S +      +V KQ    ++  W  L   +  E +   SS+W  Y   LP+Q  SL++W
Sbjct: 61  TSSFKEKVGQDVYKQLENDNIGSWTPLIMAMCWEYNQGGSSKWDAYFKILPKQFTSLMFW 120

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
           ++ EL      + + +  +E I N      D+  +  + + D+     + ++ FK    +
Sbjct: 121 SKEELSLLKGTTVVDKIGLEDIENEFERVRDIVKQNENVFGDIAN---YTLDLFKRMGSL 177

Query: 246 LFSR-------------------------LVRLPSMDGRVALVPWADMLNHSCE-VETFL 279
           + SR                          + L +    VA+VP AD+LN   + V    
Sbjct: 178 ILSRSFTVEEWKTEEEREKEEEEEEDEDEEIDLRTSVDDVAMVPMADILNSRTDSVNAHT 237

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--------PREGTNPSDS 331
           +Y+++   ++   D   + G+Q+F +Y    N +L+  YG V        P    N +D 
Sbjct: 238 EYEENCLRMISLQD--IKAGDQIFNTYNDPPNADLIRRYGHVDYSPLSQDPDFMGNKNDV 295

Query: 332 VELP------LSLKKSDKCYKEK 348
           VELP      L+L  + + +KE+
Sbjct: 296 VELPADILLELALPDAKESHKER 318


>gi|341877649|gb|EGT33584.1| CBN-SET-27 protein [Caenorhabditis brenneri]
          Length = 501

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 155/365 (42%), Gaps = 55/365 (15%)

Query: 71  DSLENASTLQ---KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT 127
           +S+ +A T++   KW  ++G+    + I         L A   I KG  +  VP + ++T
Sbjct: 66  NSVRDAETIKAFLKWSDENGIARNNVTIGPTKTSGLSLQATGPIPKGHIVARVPRNAMMT 125

Query: 128 ADSKW---SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
            D+     S  +A E  +  +  D   LA +L +     + SR+S YI+ LP    + L+
Sbjct: 126 LDNARKSNSLRKAFEKDQIVAGMDNVGLALFLATHWMQNEKSRFSPYIAILPNCFPTPLF 185

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTY-----------------------NDLRLRI 221
           +T  +L + L+ S I E A+     +   +                       N + + I
Sbjct: 186 YTEEQLLQ-LKPSPIFEEALTFYRTISRQFCYFLMAVSKNKMYEAAQRRKDARNTMEVPI 244

Query: 222 FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS---MD------GRVALVPWADMLNH- 271
           F   P  F    F    + W+ G++ +R+  +PS   +D         AL+P+ DM NH 
Sbjct: 245 FYNSP--FTVANFTSRLYFWAVGVVTTRVNMVPSETLIDKDEKPIAIPALIPFLDMANHE 302

Query: 272 ----SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
                  +E  + Y    +  V T+      G +V I YG +S GE L+  GFVP     
Sbjct: 303 NFETDGPIEDLVCYSPLEECAVITSHCDMDAGREVTIFYGCRSKGEHLIHNGFVPL-NHG 361

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSA---SECFPIQITG-----WPLELMAYAY 379
             + +++ + + K+DK    K + + KY  +       F + +       +PL+L+ +A 
Sbjct: 362 KQEIMKMKIGIPKTDKNLDVKKKLIEKYVANVFCTGNIFHVDLYNHPEHPFPLDLLMFAA 421

Query: 380 LVVSP 384
           + VSP
Sbjct: 422 IFVSP 426


>gi|255086705|ref|XP_002509319.1| predicted protein [Micromonas sp. RCC299]
 gi|226524597|gb|ACO70577.1| predicted protein [Micromonas sp. RCC299]
          Length = 784

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 89/211 (42%), Gaps = 45/211 (21%)

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           ++  S    + W  Y+  LPR+  SL+ W+  ELD  L+ S++ +RA ERI      Y++
Sbjct: 279 LAAGSAPHDAHWKAYVDLLPREVDSLIEWSENELD-ALQGSRLADRARERIALADSVYDE 337

Query: 217 LRLRIFSKYPDLF----------------------------PEEVFNMETFKWSFGILFS 248
           +  R+    P L+                             ++  + E F+W++  + +
Sbjct: 338 VFPRLNDADPTLWMSGKLGSAVAGGTGIDVTAAARKKGERARDKYTSKEAFRWAWATVLA 397

Query: 249 RLVRLPSM--DGRVALVPWADMLNHS-----CEVETFL------DYDKSSQG---VVFTT 292
           R   LP +  DG + L P  D+ NH      CEV   L      D D    G   V+   
Sbjct: 398 RAFSLPDVGEDGEMGLCPGLDLFNHGSEAEKCEVRGVLGASLELDEDDPQVGPRIVLRAG 457

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
               + GEQ+F  Y  +++G  LL +GF  R
Sbjct: 458 VGGAESGEQLFHDYADRASGGSLLEFGFTHR 488


>gi|442753255|gb|JAA68787.1| Putative set domain-containing protein [Ixodes ricinus]
          Length = 428

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 64/268 (23%), Positives = 118/268 (44%), Gaps = 21/268 (7%)

Query: 79  LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L  W+  +G     K+ ++      RG+VAL+ +  GE  L +P SL+I+  +       
Sbjct: 31  LLTWMEANGFRLHSKLGLRDFPDTGRGVVALEKLVGGETFLKLPTSLLISTRTALQSLLH 90

Query: 138 GEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +    +    +L  +++ +    ++SRW  ++ +LPR   + ++  R   +     
Sbjct: 91  SFITRYHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPRTFTTPVFLRRTVFESL--P 148

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-----EVFNMETFKWSFGILFSRLV 251
             +RE    RIT++  T+  L++ +   + +  PE       F    F W++  + +R +
Sbjct: 149 KDLREEVHTRITSIQRTFLKLKV-LLGGHVEEEPEVQSLSTGFTWNNFVWAWTAVNTRCI 207

Query: 252 RLPSMDG-------RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                +          AL P+ D LNH  +    ++     +     + + +   EQVFI
Sbjct: 208 FAQGSNSSSLWENDHCALAPFLDCLNHHWKAS--IETAMVGENFEILSHKSHDANEQVFI 265

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSV 332
           SYG  SN  L L YGFV  +  NP+D V
Sbjct: 266 SYGPHSNRRLFLDYGFVLPD--NPNDVV 291


>gi|168005531|ref|XP_001755464.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693592|gb|EDQ79944.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1033

 Score = 72.8 bits (177), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 116/265 (43%), Gaps = 43/265 (16%)

Query: 82  WLSDSGLP-PQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           W+  +G    +K++I  +  G+    RG+V LKNIR+GE L  +P  + +  +       
Sbjct: 535 WMEGNGFSISEKLSITHLLAGDGKLVRGVVVLKNIRRGETLCNLPLDMGLYDN------- 587

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             E +    V  W   A  L+ E +   SS W++YI+ LP+     +     EL      
Sbjct: 588 --ETIVAGEVDSWDRAAARLLREKAKGSSSAWASYINILPQNMTVPILLEDHELHEVQWW 645

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLP 254
             +RE               +R  I   +  L  +++   + E ++W+  ++ SR   LP
Sbjct: 646 PVLRELV------------QVRKSIRESFSLLSVDDLAGADFEEYRWAAMMVHSRAFTLP 693

Query: 255 SM-DGRVA---LVPWADMLNHSCEVETFLDYDKSSQ-----GVVFTTDRQYQPGEQVFIS 305
              D   A   ++P+ DM+NH    +     D  SQ      V     R  + GE++F S
Sbjct: 694 VFADDHYAPYVMMPYMDMINHHYHYQA----DWMSQPIWGGKVEIVARRDIKKGEELFAS 749

Query: 306 YGKKSNGELLLSYGFVPREGTNPSD 330
           +G ++N  L L YGFV ++  NP D
Sbjct: 750 FGPRANDNLFLYYGFVLKD--NPFD 772


>gi|308802149|ref|XP_003078388.1| related to histone-lysine N-methyltransferase (ISS) [Ostreococcus
           tauri]
 gi|116056840|emb|CAL53129.1| related to histone-lysine N-methyltransferase (ISS), partial
           [Ostreococcus tauri]
          Length = 446

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 86/352 (24%), Positives = 156/352 (44%), Gaps = 30/352 (8%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--AGEVLKQCSVPDWPLLATYLIS 158
            ERG+   +++ +GE L  VP    ++  S  +      G   +  +  D  +LA +++ 
Sbjct: 21  AERGVATTRDVTRGELLATVPLEKCVSTSSARADATLWRGLSARPGASLD-GILAAHVLR 79

Query: 159 EASF--EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE--RAIERITNVIGTY 214
           EA    E+S+ W  ++  LP +  + + W   EL R L+ S +    RAI++  +    Y
Sbjct: 80  EAFGLGERSAFWP-WLRLLPSETDAAVGWDEDEL-RELQGSNVVAFARAIKK--SWREEY 135

Query: 215 NDLRLRIFS-KYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRVA-----LVPW 265
           + L        +P+ F  E    +  E F W+  +++SR + L + D   A     LVP 
Sbjct: 136 DALDFAGLGVDFPEAFGGEHAAHYTFEKFTWARFVVWSRAIDLKT-DSTSAPVIRMLVPI 194

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            DM NH+   +    +D  +  V       ++   ++  +Y  K +   LL YGF+P   
Sbjct: 195 LDMANHAPSGKLLPRWDAKANAVKIYAGSAFKRNTELRFNYDTKPSQYFLLQYGFIPE-- 252

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQITGWPLELMAYAYLVVSP 384
            NP++ VE+ + L + D   + K   LR++GL  ++  F  ++ G   +L+A A ++   
Sbjct: 253 ANPAECVEVTMQLSQRDNLRERKEALLRRHGLDPTKRNFEWKVRGLDYDLLAAARIIAMD 312

Query: 385 PSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFL 436
            S       +A + S    S K+      D +    +L S  +S+  Y   L
Sbjct: 313 ESELDDDTSVALSVSGASVSAKN------DARTKAVLLKSLITSLDGYGTTL 358


>gi|440802665|gb|ELR23594.1| SET domain containing protein [Acanthamoeba castellanii str. Neff]
          Length = 984

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 71/294 (24%), Positives = 128/294 (43%), Gaps = 40/294 (13%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA- 137
            + WL  +     ++     ++G RG +A ++I  GE+L  +P  LV+T +         
Sbjct: 4   FEGWLQANEARYPRLTFAVSELGGRGGIATEDILPGEELCSIPVRLVLTTEIARKSEVGR 63

Query: 138 ----------GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
                     GE L+  +     +L  YLI + +  + + W  Y+ +LP+      +  R
Sbjct: 64  LVAAHLNAVQGERLRVSA--GRAILCAYLIHQRA-AQDAFWGPYLRSLPK------HDDR 114

Query: 188 AELD-RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
            + D ++L  + +     E+   +  +++ L   +   +P +FP ++F  + F W+F   
Sbjct: 115 PDEDIQHLAGTNLFYAMQEKQQQIRESFDLLFPALCHAHPTVFPPDLFTWDHFLWTFTAC 174

Query: 247 FSR-----LVRLPS------------MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
            SR     LV+ P+            ++    L+P  DMLNH    +     D S+  + 
Sbjct: 175 SSRSFPQTLVQQPTATTSAHADPYDLLEIDECLLPGLDMLNHQYRKKITWALDPSTGRLK 234

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
           F T+   + G + F +YG K N ELL+ YGF   +  N  D V + LS   + K
Sbjct: 235 FVTEDTVEKGTEAFNNYGPKGNEELLMGYGFCIED--NEQDYVMIRLSFSPAGK 286


>gi|302826668|ref|XP_002994755.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
 gi|300136963|gb|EFJ04180.1| hypothetical protein SELMODRAFT_432653 [Selaginella moellendorffii]
          Length = 688

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 107/237 (45%), Gaps = 31/237 (13%)

Query: 103 RGLVALKNIRKGEKLLFV-----------PPSLVITADSKW----SC-PEAGEVLKQCSV 146
           RGL A + +R GE++L +           P  L     S W    SC PE  E+    +V
Sbjct: 437 RGLFASRPVRAGERVLEISLDLMIAPTRLPDQLSTLQSSAWAPYISCLPEPAEL--DNTV 494

Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERA 203
            D+ +     +S+    +SS W+ YIS LP +P  L     W   EL  YL AS +  + 
Sbjct: 495 LDYRVF----VSQKFQLQSSAWAPYISCLP-EPAELDNTFLWEDTELS-YLRASPLYGKT 548

Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
            ER+  +   +  ++      +P LF +   ++E F   +  +FSR + +   D  + ++
Sbjct: 549 RERLEIITTEFGQVQ-NALDVWPQLFGK--VSVEDFMHVYATVFSRPLAI-GEDSTLVMI 604

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           P  D  NH+      L ++      V T DR     +Q++I+ G  SN EL L YGF
Sbjct: 605 PMLDFFNHNAASFAKLSFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGF 661


>gi|159477607|ref|XP_001696900.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
 gi|158274812|gb|EDP00592.1| rubisco small subunit N-methyltransferase [Chlamydomonas
           reinhardtii]
          Length = 411

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/353 (23%), Positives = 151/353 (42%), Gaps = 72/353 (20%)

Query: 87  GLPPQKMAIQKV--DVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ 143
           GL      +Q++  DVGER  +VA +++R  E ++ +P +L +T     S P  G +  +
Sbjct: 19  GLKADACGVQRMTGDVGERVAIVAARDVRDKETVMVIPENLAVTRVDAESHPVVGPLAAE 78

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
            S  +   L  +L++E +    S ++  ++ LP    S L W+ AEL+  +  S +   A
Sbjct: 79  AS--ELTALTLWLLAERAAGAGSNYAGLLATLPESTLSPLLWSDAELEELMAGSPVLPEA 136

Query: 204 IERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
             R   +  T+  L  ++ +  P  FP                           GR A  
Sbjct: 137 RSRKKALADTWAALAPKLAAD-PARFPA--------------------------GRRAA- 168

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
                               + +GVV         G ++ ++ G+  NGELLL+ G +  
Sbjct: 169 -------------------GARKGVVVWDG----AGSEMLLNDGR-PNGELLLATGTL-- 202

Query: 324 EGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLV-V 382
           +  N SD +  P  L  +D+ Y  K + L   G SA+E FP+     P++L+AY  L  V
Sbjct: 203 QDNNSSDFLSWPAGLVPADRYYMMKSQVLESMGYSAAEEFPVYADRMPIQLLAYLRLSRV 262

Query: 383 SPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDE-QALQFILDSCESSISKYSR 434
           + P++  K            T + D++  +++E + LQ ++  C   ++ Y++
Sbjct: 263 ADPALLAKC-----------TFEADVELSQMNEYEILQILMGDCRERLASYTK 304


>gi|428175768|gb|EKX44656.1| hypothetical protein GUITHDRAFT_109433 [Guillardia theta CCMP2712]
          Length = 591

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 65/251 (25%), Positives = 115/251 (45%), Gaps = 23/251 (9%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV-----PDWPLLATY 155
           GERG+  + +I   E++  +P  ++++  S  +   A    K   V      +  L    
Sbjct: 39  GERGVRVISDIAPCEEMFSIPEKILMSRKSCMASSIAHVFRKHKDVLFSSRDELALTLLI 98

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L  +     +S W   I  LP  P +   W+  EL    + S ++  A+  + ++  TY 
Sbjct: 99  LYEKLDQGNASFWKPMIDILPADPGAASKWSEEELQELQDES-LKAEAMIVVASMQQTYQ 157

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNH 271
            +   I  ++ D+F  + +  E F+W+   +    F R +  PS      +VP+AD+LNH
Sbjct: 158 RVLRPILVQHGDVFSVDRYTWEEFRWALLCVESRTFGRFLPHPS------IVPFADLLNH 211

Query: 272 SCEVETFLDYDKSSQGVVFTTD----RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
              V+T   +    +   +  D      ++ GE+ F+SYG +SN ELLL YGF  +  +N
Sbjct: 212 -VNVQTSYRWLPEERRAAYMCDASGEHVHRRGEEAFMSYGPRSNAELLLHYGFALQ--SN 268

Query: 328 PSDSVELPLSL 338
             ++VEL   +
Sbjct: 269 RYEAVELNFRI 279


>gi|351697762|gb|EHB00681.1| SET domain-containing protein 6 [Heterocephalus glaber]
          Length = 486

 Score = 72.4 bits (176), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 68/279 (24%), Positives = 123/279 (44%), Gaps = 19/279 (6%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV- 146
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S  +C   G + ++  V 
Sbjct: 73  LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SPHTCSIGGLLERERDVL 129

Query: 147 ---PDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
                W  L   L+ E     +S WS Y +  P   +    ++W   E  R L+ + + E
Sbjct: 130 QSQSGWVPLLMALLHELQ-APASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVPE 188

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
              + + N+ G Y  + L     +PDLF   V ++E ++    ++ +   + P  +    
Sbjct: 189 AVDKDLVNIRGEYYAIVLPFMEAHPDLFGPSVRSLELYRQLVALVMAYSFQEPLEEEEEE 248

Query: 262 -------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
                  +VP AD+LNH     + L+Y      +V T  R    G ++F +YG+ +N +L
Sbjct: 249 KEPNSPLMVPAADILNHLANHNSNLEYSADYLRMVAT--RSIPKGHEIFNTYGQMANWQL 306

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           +  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 307 IHMYGFVEPYPHNTDDTADIQMVTVREAALQGVKEEAER 345


>gi|361129824|gb|EHL01706.1| putative Ribosomal N-lysine methyltransferase 4 [Glarea lozoyensis
           74030]
          Length = 483

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 75/266 (28%), Positives = 124/266 (46%), Gaps = 34/266 (12%)

Query: 81  KWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           +WLS  G+   P   +   K +   RG+VA  +  + E +  +P + V+  ++ ++  ++
Sbjct: 15  EWLSKIGVRINPKMTLKDLKSEGRGRGVVAAADFEEDEVVFCIPRTAVLNVNNVFAGQDS 74

Query: 138 G---EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
           G   E L Q  +P+W  L   ++SE   +  SRW+ Y++ LP++  SL++W+  EL   L
Sbjct: 75  GASKEALLQ--MPNWLALTATMMSEGQ-QSDSRWAPYLAVLPQKLDSLVFWSEEELAE-L 130

Query: 195 EASQI-----RERAIERITNVI-----GTYN-DLRLRIFS---KYPDLFPEEVFNMETFK 240
           +AS +     R  A E  T  I     G +N +L  ++ S    Y    PEE    E  K
Sbjct: 131 QASSVAKKIGRSSAEEMFTKHISPLGLGEFNVELCHQVASVIMAYAFDIPEE----EPAK 186

Query: 241 WSFGILFSRLVRLPSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQ 295
              G        L S DG      ++++P ADMLN   E      Y   ++ +   T + 
Sbjct: 187 QENGGAEGETDDLVSDDGEDEKTILSMIPLADMLNADAERNNARIY-YENEDLEMRTIKP 245

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFV 321
              GE++F  YG+    +LL  YG+V
Sbjct: 246 IMAGEEIFNDYGQLPRSDLLRRYGYV 271


>gi|302835223|ref|XP_002949173.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
 gi|300265475|gb|EFJ49666.1| hypothetical protein VOLCADRAFT_120737 [Volvox carteri f.
           nagariensis]
          Length = 593

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 80/301 (26%), Positives = 132/301 (43%), Gaps = 17/301 (5%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
           RGL A   +  G+ +L VP  L+I+ ++     + G+VL      + D  +   +   E 
Sbjct: 193 RGLRADTAVAPGDVVLHVPADLLISYETAKKS-DLGKVLSALPLDLSDDSIALIWTCVE- 250

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE---RAIERITNVIGTYNDL 217
             E  +  + + +ALP   +S       E    LE + +     RA + ++    + +  
Sbjct: 251 RHEPEAPHAPFWAALPHS-FSTALSASQEDVALLEGTPLHGDAVRARQHLSEAFESSSPA 309

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNHSC-- 273
              +   YPD F  E F+ E++ W+  + +S    V+  S D R  L P+  ++NH    
Sbjct: 310 FRSLLGAYPDYFKPEWFSWESYLWAAELWYSYGIQVQFASGDIRTCLAPYLGLMNHHPLP 369

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
            V  F   D  +  +     R  + G Q+F+SYG  SN +LLL YGF  R+  NP+D VE
Sbjct: 370 HVVHFSKVDPETGCLRVRAFRPCEAGNQLFLSYGPYSNAKLLLFYGFAVRD--NPADEVE 427

Query: 334 LPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE 393
           L L +         +  AL   GLS            P  L++ A L+ +PP +  ++  
Sbjct: 428 LVLQVPPGAAATDRR--ALLAAGLSLEHRLRAGGRLAP-PLLSCARLLAAPPPLLKQWRR 484

Query: 394 M 394
           M
Sbjct: 485 M 485


>gi|412987667|emb|CCO20502.1| related to histone-lysine N-methyltransferase (ISS) [Bathycoccus
           prasinos]
          Length = 866

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 76/291 (26%), Positives = 130/291 (44%), Gaps = 40/291 (13%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW------------- 149
           RG    +++R+G+ LL +P S   + +S     +  E+L +                   
Sbjct: 25  RGNAVTEDVRRGDVLLEIPLSRCFSLESA----QKSEMLTKAMAKAAAAAAGTRFTPTHD 80

Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-YLEASQIRERAIERIT 208
             +A +++ E +  K S    +I ++P+     L+W+  E  R  L  +      +    
Sbjct: 81  QYMAMFILLEQNLGKQSSHYEHILSIPKAYDLPLFWSEEERQRSLLFGTTTYAETLALDE 140

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL----PSMDGRVALVP 264
            VI  Y  L+  +     D F E+   M+ FKW    L+SR   L    P       L+P
Sbjct: 141 EVIQDYELLKHHLGE---DFFREQNITMDRFKWVRATLWSRQCDLLRPAPETTRLRVLIP 197

Query: 265 WADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQP-GEQVFISY--GKKSNGELLLS 317
             DM NHS +V       L+Y   S+G+V        P GEQ +ISY  G+ S+ +LLL 
Sbjct: 198 EFDMFNHSSKVPLGSSHKLNY---SRGLVTAFATANVPKGEQAYISYGSGEASSSKLLLW 254

Query: 318 YGFVP-REGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           YGF P  EG NP + +++ L    + +C  ++ E L++   ++++ +  +I
Sbjct: 255 YGFAPLNEGENPFEQLDVTL----TSQCSADRAECLKQALFASAQVYLRKI 301


>gi|395839524|ref|XP_003792639.1| PREDICTED: N-lysine methyltransferase SETD6 [Otolemur garnettii]
          Length = 448

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 134/305 (43%), Gaps = 30/305 (9%)

Query: 67  GCEIDSLENASTLQKWLSDSGLP-PQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSL 124
           GC+ D +   +    W    GL    K+A+ ++  V   G+VAL++++ GE L  VP + 
Sbjct: 16  GCDADPV---AGFLSWCGQVGLELSSKVAVTRQGTVAGYGMVALESVQPGELLFAVPRAA 72

Query: 125 VITADSKWSCPEAG----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--Q 178
           ++   S+ +C  +G    E +   S   W  L   L+ E     +S W  Y +  P   +
Sbjct: 73  LL---SQHTCSISGLLEQERVALQSQSGWVPLLLALLHEVQ-APASPWRPYFALWPELGR 128

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
               ++W   E  R L+ + + E   + +TN+   Y  + L     +P+LF   V ++E 
Sbjct: 129 LEHPMFWPEEERHRLLQGTGVPEAVEKDLTNIRSEYCSIVLPFMEAHPELFSPRVRSLEL 188

Query: 239 FKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
           +     ++ +   + P  +           +VP AD+LNH       L+Y  +   +V T
Sbjct: 189 YHQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSANYLRMVAT 248

Query: 292 TDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
                QP   G ++F +YG+ +N +L+  YGFV     N  D+ ++ +   +       K
Sbjct: 249 -----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTK 303

Query: 349 LEALR 353
            EA R
Sbjct: 304 GEAER 308


>gi|330798760|ref|XP_003287418.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
 gi|325082565|gb|EGC36043.1| hypothetical protein DICPUDRAFT_32466 [Dictyostelium purpureum]
          Length = 479

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 132/329 (40%), Gaps = 83/329 (25%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG-------------------- 138
           D G RG++A  +I++ E L+ +P   +I + SK+S P                       
Sbjct: 23  DTG-RGVIANNDIKENEILISIPSKYLIHSHSKFSIPSLNIPELNNSDSSNSSSSSDDIY 81

Query: 139 ----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR-- 192
                 LK+ +      ++  LI E   +K S W NY++ LP        ++  E++   
Sbjct: 82  TPFHNCLKKLNSKQ--RISLILIIEKLIKKHSIWFNYLNELPDDYTITSTYSDEEIESLS 139

Query: 193 ---YLEASQIRERAI----------------ERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
              Y+E+S+  +  +                  +  V+   NDL++++          ++
Sbjct: 140 YPIYVESSKKLKNEMLNSFKLFCEIFQLYYGTDLDRVVIELNDLQVKL---------SDI 190

Query: 234 FNMETFKWSFGILFSR-------LVRLPSMDGR-----VALVPWADMLNHSCEVETFLDY 281
            N E + W +G + +R       + +  S +         LVP AD+ NH+  VET   +
Sbjct: 191 LNKELYIWCWGTIQTRTYFYDKNMKKNNSKENNEEKDDCTLVPLADLFNHTSNVETEALF 250

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL------- 334
           +         T   +  G QVFISYGK SN  L+  YGF+     N  DS+ L       
Sbjct: 251 NDELNCYQVKTKTPFSKGSQVFISYGKHSNFTLMNYYGFIIE--NNDQDSIPLLQSNCIP 308

Query: 335 -----PLSLKKSDKCYKEKLEALRKYGLS 358
                P +     K Y++K+  L  YGLS
Sbjct: 309 TEFAVPPTSSDEAKLYEKKIGILNNYGLS 337


>gi|145355885|ref|XP_001422177.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582417|gb|ABP00494.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 495

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 107/262 (40%), Gaps = 49/262 (18%)

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF-EK 164
           V    I +G+ ++ +P   ++ A S      A E  +   +  + LL   L+ E    + 
Sbjct: 88  VCDDGIARGDVIVAIPRDAMLDARSALG-DAAFERARARGLSSFQLLTVSLLREWRLKDT 146

Query: 165 SSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVI-GTYNDLRL-R 220
           +SRW  Y+  LP     +  L W   +++++L A+     A  R+  +I     D RL R
Sbjct: 147 TSRWKPYLDTLPEDDGRWHPLLWRDEDVEQHLPANSTHAGA--RLRGLIRACEEDTRLFR 204

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR--------------------- 259
                 ++  E   +M   +W+  I+ SR  RL  +D                       
Sbjct: 205 SIVDELNIDDENWPSMRHVRWAVSIVISRAFRLNELDDEECLREVRDDALLETLNDLDAD 264

Query: 260 -----------------VALVPWADMLNHSCEV--ETFLDYDKSSQGVVFTTDRQYQPGE 300
                            +ALVPWAD LNHS +   E  L YD  SQ       + Y  GE
Sbjct: 265 CWEGSGGDSGEDDEFSVMALVPWADGLNHSSDAGDEAILTYDTLSQTATLRAHKAYACGE 324

Query: 301 QVFISYGKK-SNGELLLSYGFV 321
           QVF SYG   S+ +L ++YGFV
Sbjct: 325 QVFDSYGSNLSDEDLFVNYGFV 346


>gi|195439104|ref|XP_002067471.1| GK16171 [Drosophila willistoni]
 gi|194163556|gb|EDW78457.1| GK16171 [Drosophila willistoni]
          Length = 511

 Score = 72.0 bits (175), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 123/285 (43%), Gaps = 24/285 (8%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L   +   +W    G+    + I      + GL A K+I   +++L VP   + + + + 
Sbjct: 84  LAKIAAFSEWAKAGGIHSDGVEIAIFPGYQMGLRATKDINADQQVLRVPRKKIFS-EEQL 142

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           S  E       C+      LA  L+ E S    S W  YI  LP +  ++LY+T  ++ R
Sbjct: 143 SKTERESF---CNFTTNFNLANALVVEKSRGADSIWKPYIDVLPSRYNTVLYFTVEQM-R 198

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIF--SKY--PD--LFPEEVFNMETFKWSFGIL 246
            L  + +   A+ +   +   Y  L    +  S Y  PD  LF +     E ++W+   +
Sbjct: 199 RLRGTSVCSSALRQCRMIARKYAKLYAFAYCDSSYLRPDTGLFTQHGLCYELYRWAVSTV 258

Query: 247 FSRLVRLPSM-----DGRV---ALVPWADMLNHS-CEVETFLDYDKSSQGVVFTTDRQYQ 297
            +R   +P       DG     AL+P  DM NH   ++ +F  YD ++  +  T     +
Sbjct: 259 MTRQNLVPREIATKDDGNSPISALIPCWDMANHRPGKITSF--YDSNAHQMECTAQEFCK 316

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
            G Q FI YG + N +LL+  GFV  +  N  D V + L L  +D
Sbjct: 317 AGNQFFIYYGDRPNADLLVHNGFV--DPNNNKDFVNIRLGLSPTD 359


>gi|302754816|ref|XP_002960832.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
 gi|300171771|gb|EFJ38371.1| hypothetical protein SELMODRAFT_437299 [Selaginella moellendorffii]
          Length = 418

 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 94/211 (44%), Gaps = 10/211 (4%)

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
           +V  W  LA  ++ E    +    ++ +    R   +   W   EL  YL AS +  +A 
Sbjct: 162 TVKPWTKLALIVLMERYKGQYGHHTSRVFLNQRSSTTRFRWEDTELS-YLRASPLYGKAR 220

Query: 205 ERITNVIGTY----NDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDG 258
           ER+  +   +    ND    +  +  D++P+     ++E  K  +  +FSR + +   D 
Sbjct: 221 ERLEMITTEFGQVQNDFCTCVLEQALDVWPQLFGKVSLEDLKHVYATVFSRSLAI-GEDS 279

Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
            + ++P  D  NH+      L ++      V T DR Y   +Q++I+YG  SN EL L Y
Sbjct: 280 TLVMIPMLDFFNHNATSFAKLSFNGLLNYAVVTADRDYAENDQIWINYGDLSNAELALDY 339

Query: 319 GFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
           GF   E  NP D  EL     + +   K++L
Sbjct: 340 GFTVPE--NPYDETELLTQFPEMNTIIKDQL 368


>gi|412991339|emb|CCO16184.1| predicted protein [Bathycoccus prasinos]
          Length = 519

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 80/317 (25%), Positives = 129/317 (40%), Gaps = 54/317 (17%)

Query: 42  VHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVG 101
           V C       A   + T+ Q+++ W            ++   S SGL  +     K    
Sbjct: 46  VKCQSKELETALIREQTIVQDLVGW-----------CIENGFSGSGLGVRPSTSGKG--- 91

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
            RGL A + + K E +L +P    I  ++K       EV+++   P    LA  L+ E  
Sbjct: 92  -RGLEATRLVEKDECVLTLPLRSGIVDEAKGHPEHTREVIEK--APWGVRLACRLLQERK 148

Query: 162 FEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGT-YNDLRL 219
               S ++ Y+  +P     S L++   E+ R        E+ IE +   +   Y+DL  
Sbjct: 149 KGAESAYAAYLELIPENVETSPLHYASEEVSRICYPPM--EKEIEEMRKAVKKWYDDLNA 206

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHSCEV 275
               +      EE      FK +  ++ SR   + S D       AL+P AD+LNH  + 
Sbjct: 207 GEGKEALAGASEE-----EFKCAVAVVHSRTYGVSSGDTGEGYFRALLPLADLLNHGGD- 260

Query: 276 ETFLDY--------------------DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
             ++D                     ++    + FT  +  +PGE+  +SYG++SN   L
Sbjct: 261 -EYIDETRSSTSTVSTETVAWSEITDEEDESEIAFTAQKTLEPGEEALMSYGERSNDHFL 319

Query: 316 LSYGFVPREGTNPSDSV 332
           L YGFVPR+  NP D V
Sbjct: 320 LYYGFVPRK--NPHDDV 334


>gi|440792461|gb|ELR13682.1| [Ribulose-bisphosphate-carboxylase]-lysine N-methyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 400

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 106/242 (43%), Gaps = 23/242 (9%)

Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSV-----PDWPLLATYL-ISEASFEKS 165
           ++GE L  VP   ++ + ++  C  A ++ K  +          +LA  L + E+     
Sbjct: 3   QEGELLAEVPARFILHSRNERVC-HAADLRKALAAHPRVASHRHMLAAVLWLLESVNCAQ 61

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
           S W  Y+S LP    ++  W + EL        + E    +   +   Y  + L    + 
Sbjct: 62  SFWQPYLSELPDAVATVDRWNQEELAEVGHTLMLYEMVEYKKKKIAADYAAILLPFLQEN 121

Query: 226 PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
             LF   + + E ++ +  +++SR      + G    +P+ D LNHS       D  K++
Sbjct: 122 TQLFGGSIPSEEEYRRALSLVYSRTFDFSELIGEHVFIPFVDFLNHSIN-----DTGKAA 176

Query: 286 QGVVFTTDR---------QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
               +  D+          Y  GE+VFISYG+K++ +LL SYGF+     N  D+V++  
Sbjct: 177 CTYSYNHDKDCFELLAGADYDEGEEVFISYGEKTSSQLLASYGFMYE--NNAEDTVDITA 234

Query: 337 SL 338
           SL
Sbjct: 235 SL 236


>gi|195565510|ref|XP_002106342.1| GD16174 [Drosophila simulans]
 gi|194203718|gb|EDX17294.1| GD16174 [Drosophila simulans]
          Length = 395

 Score = 71.6 bits (174), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 72/307 (23%), Positives = 125/307 (40%), Gaps = 31/307 (10%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L        W  D G+  + + I      + GL A + + K E +L VP  L+ + +S  
Sbjct: 25  LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKDELVLSVPRKLIFSEESNS 84

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
            C   G++ +   +     LA  L+ E    + S W  YI  LP +  ++LY+T  +++ 
Sbjct: 85  DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQME- 139

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
            L  +     A+ +   +   Y  L     +          +P      F +     E +
Sbjct: 140 LLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGERGLFFTQHGLCYELY 199

Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
           +W+   + +R   +PS             AL+P+ DM NH   ++ +F  Y    + +  
Sbjct: 200 RWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRPGKITSF--YAAVPRQLEC 257

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
           T       GEQ FI YG +SN +LL+  GFV  +  N  D V + + L  +D    ++  
Sbjct: 258 TAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV--DDNNLKDYVNIRVGLSLTDALAAKRAS 315

Query: 351 ALRKYGL 357
            L K  +
Sbjct: 316 ILDKLNI 322


>gi|410970027|ref|XP_003991492.1| PREDICTED: SET domain-containing protein 4 [Felis catus]
          Length = 440

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 81/309 (26%), Positives = 134/309 (43%), Gaps = 27/309 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL D       +         RGL++  ++++G+ ++ +P + ++T D+       G
Sbjct: 36  LKKWLKDRKFEDTNLIPACFPGTGRGLMSKTSLQEGQVIISLPETCLLTTDTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYIAKWRPPPSPLLALCTFLVSEKHAGDQSVWKPYLEILPKA-YTCPVCLEPEVVN-LFP 152

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
             +R +A E+   V   ++  R    S  P LF E V   F+     W++  + +R V +
Sbjct: 153 KPLRAKAEEQRARVREFFSSSRGFFSSLQP-LFSEAVGSIFSYRALLWAWCTVNTRAVYV 211

Query: 254 PSMDGRV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                R         AL P+ D+LNHS  V+    +++ ++     T    +  E+VFI 
Sbjct: 212 KPRRRRCFSAEPDTCALAPYLDLLNHSPHVQVEAAFNEETRCYEIRTASSCRKHEEVFIC 271

Query: 306 YGKKSNGELLLSYGFV----PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           YG   N  LLL YGFV    P      S+ + L   L  +DK   +K+  L+ +    + 
Sbjct: 272 YGPHDNQRLLLEYGFVSIHNPHACVYVSEDI-LVKYLPSTDKQMNKKISILKDHDFIENL 330

Query: 362 CFPIQITGW 370
            F     GW
Sbjct: 331 TF-----GW 334


>gi|353236313|emb|CCA68310.1| related to SET7-Regulatory protein [Piriformospora indica DSM
           11827]
          Length = 493

 Score = 71.2 bits (173), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 89/344 (25%), Positives = 138/344 (40%), Gaps = 97/344 (28%)

Query: 74  ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV------ 125
           E+     KW  DSG  L P  + I+  +   RG VAL +I+K   L  VP S++      
Sbjct: 4   ESTQEFLKWFRDSGATLHP-AVGIKDFEGVGRGAVALHDIQKDTVLFTVPRSILLSTRTA 62

Query: 126 ----ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
               I  D  WS           ++  W  L   ++ E S  K S WS Y+  LP +  +
Sbjct: 63  PLRDILGDEDWS-----------TLKGWEGLILSMMYEDSRVKDSPWSGYLQDLPTKFDT 111

Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP----EEVFNME 237
           L++WT  EL++ L+AS +R++  +  T     +++  L +  +  D+F     +  F +E
Sbjct: 112 LMFWTDEELEQ-LQASTVRDKIGKAATE--KDFHERVLPLLQRRTDVFEPALRDTFFTLE 168

Query: 238 TFKWSFGILFSR----------------------------LVRLP-SMD---GR------ 259
            F  +   + SR                            + R P +MD   GR      
Sbjct: 169 RFHINGSRILSRSFHVEEWHDEHASDDESIPSEPDHKPVEMSRDPDAMDTDEGRPEGGED 228

Query: 260 ------------VALVPWADMLN--HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
                       V +VP AD+LN  + C       Y+K    ++ T D     GEQ++ +
Sbjct: 229 DDAESDTESPDDVNMVPMADILNARYGCHNAKLF-YEKDHLNMIATKD--IPAGEQIWNT 285

Query: 306 YGKKSNGELLLSYGFV-------PREGT----NPSDSVELPLSL 338
           YG   N +LL  YG +       P  G     NP+D VE+   L
Sbjct: 286 YGDPPNADLLRQYGHIDRIPILNPEVGVYPFENPADEVEIRADL 329


>gi|444705829|gb|ELW47217.1| Histone-lysine N-methyltransferase setd3 [Tupaia chinensis]
          Length = 539

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 77/311 (24%), Positives = 125/311 (40%), Gaps = 70/311 (22%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G       +        GL A + I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASENGASVDGFEMVNFKEEGFGLRATREIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           RYL+++Q    AI  +              FS+Y +   +  +  +  + + G       
Sbjct: 195 RYLQSTQ----AIHDV--------------FSQYKNTARQYAYFYKVIQITTGY------ 230

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
                          ++ +  CE     D+               +PGEQ++I YG +SN
Sbjct: 231 ---------------NLEDDRCECVALQDF---------------RPGEQIYIFYGTRSN 260

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
            E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTDPP 318

Query: 372 LELMAYAYLVV 382
           +     A+L V
Sbjct: 319 ISAQLLAFLRV 329


>gi|425773952|gb|EKV12277.1| hypothetical protein PDIG_46020 [Penicillium digitatum PHI26]
 gi|425782378|gb|EKV20291.1| hypothetical protein PDIP_17950 [Penicillium digitatum Pd1]
          Length = 487

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 115/267 (43%), Gaps = 39/267 (14%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
           P  ++A  +     RG+VA  NI +GE+L  +P ++V+T  +        E L++   P 
Sbjct: 34  PKLRLADLRATGAGRGVVAQSNIVEGEELFSIPRTMVLTVQNSELRTLLAENLEEQMGP- 92

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
           W  L   ++ E    + SRW+ Y   LP +  +L++W+ AEL + L+AS I E+ I R +
Sbjct: 93  WLSLMLVMVYEYLQGEKSRWAPYFRVLPSRFDTLMFWSPAEL-QELQASTIVEK-IGR-S 149

Query: 209 NVIGTYNDLRLRIFSKYPDLFPE-------EVFNMETFKWSFGILFSRLVRLPSMD---- 257
           N   +  D    I +K PDLFP        E    +      G +   L+   + D    
Sbjct: 150 NAEESIRDSIAPILAKRPDLFPPPPGLASWEGIAGDAALIQVGHVMGSLIMAYAFDIEKA 209

Query: 258 ------GRV-----------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
                 G V                  +VP AD+LN   +      Y +    +V    +
Sbjct: 210 EDDDDEGEVNDESYMTDDEEEEQLPKGMVPLADLLNADADRNNARLYQEEG-ALVMKAIK 268

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFV 321
             Q G+++F  YG+    +LL  YG+V
Sbjct: 269 PIQKGDEIFNDYGEIPRADLLRRYGYV 295


>gi|328869852|gb|EGG18227.1| hypothetical protein DFA_03714 [Dictyostelium fasciculatum]
          Length = 504

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/435 (21%), Positives = 174/435 (40%), Gaps = 87/435 (20%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVD--------VGERGLVALKNIRKGEKLLFVPPSLVITA 128
           + +++WL D+ +   +  I+ VD        V   G++A ++++  E +  +P   V++ 
Sbjct: 8   TIIKQWLRDNCVVIDESKIEIVDTTTHPHVIVEGLGIIAKQDLKVDEIIAVIPKRCVLSP 67

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
            +    P    +L++  + +    +  L+ E S    S+W +YI ++P      + W + 
Sbjct: 68  KTTSIAP----ILEKYELEEAVATSIALMYETSKGVQSKWYSYIQSMPTVIDLPILWDKE 123

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
            ++ YL  + + E  IE I  +   Y +    I   +P+ F E +F +E+FK +  I+ S
Sbjct: 124 SIE-YLVGTDLEEIVIENIETLEEQYREDVEPIIKNHPETFKENIFTLESFKIASTIVSS 182

Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETF-LDYDKS----------------------- 284
           R   +    G  +LVP AD+ NH    E   ++ D +                       
Sbjct: 183 RAFNIDQYHGE-SLVPLADIFNHKTGRENVHVEADGNVCKQCGELDGCEHKKKKGGKKVV 241

Query: 285 --------------SQGVVFTTDRQYQPGEQVFIS--------------YGKKSNGELLL 316
                          +   F    +  P + +FI+              YG   N  LL 
Sbjct: 242 KGAPSLKKATPQDIEKKTTFKDRIELLPKDSLFITIVKPVNKDCEVFNTYGDHDNSLLLS 301

Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDK-------CYKEKLEALRK--------YGLSASE 361
            YGF+  E  NP D + +   L   DK        YK +  AL K        +   + E
Sbjct: 302 KYGFL--EMDNPCDLLRIDRQL--VDKLLFSYLDLYKIEQPALMKRLEFYCENFDEDSRE 357

Query: 362 CFPIQITGWPLE-LMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKD-IKCPEIDEQALQ 419
               ++ G   + L+   Y+ ++PP    ++++M  A   K+  KK  I   +  +Q  +
Sbjct: 358 HHAFELDGHGDDALITTLYIALAPPETFNQWKKMKQAQFYKIFEKKQAIDMVKEFQQVRK 417

Query: 420 FILDSCESSISKYSR 434
            ILD  +  + KY++
Sbjct: 418 AILDVIDKRLEKYNQ 432


>gi|221131915|ref|XP_002160713.1| PREDICTED: SET domain-containing protein 4-like [Hydra
           magnipapillata]
          Length = 429

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/273 (23%), Positives = 121/273 (44%), Gaps = 28/273 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L  W  ++GL  + +  +      RGL   K++  G+ ++ +P +L+IT D+     +  
Sbjct: 33  LFSWSLNNGLVLKAVTPKVFKKTGRGLKTTKSVSPGDLIIALPLNLLITFDTILENNDLN 92

Query: 139 EVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL----DRY 193
            + +   S+    L   +L+ E    ++S + +Y++ LP    +  Y ++ E+    +  
Sbjct: 93  FIFRNHPSICQKYLFILFLLIEKKKGENSYFFHYLNTLPENFSTPSYISQDEMQLCPNFI 152

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
            E + ++ R I      I   + L     S        EV      KW++ ++ +R V  
Sbjct: 153 QEETGLQNRQILNAIKHISCIHSLIANDLS----CIDSEV------KWAWNVINTRSVYF 202

Query: 252 ---------RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
                     + S++   AL P  D+LNH+        ++KS++     T+  Y PG Q+
Sbjct: 203 NAKHLKCFKNISSINVDFALAPVLDLLNHNDTANVVAGFNKSTKHYEVHTNDIYTPGSQL 262

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
           FI+YG  SN +L   YGFV     N  D++ +P
Sbjct: 263 FINYGPHSNRKLFCEYGFVLPFNMN--DTIPIP 293


>gi|307190530|gb|EFN74527.1| SET domain-containing protein 3 [Camponotus floridanus]
          Length = 232

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/186 (31%), Positives = 90/186 (48%), Gaps = 12/186 (6%)

Query: 253 LPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           +PS DG     AL+P  DM NH     T  D++ +S        R ++ GEQVFISYG +
Sbjct: 7   IPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDHCECYALRNFKKGEQVFISYGPR 65

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           +N +  +  GFV     N  D  +L L + K+D   KE++E L K GL +   F ++   
Sbjct: 66  TNSDFFVHSGFVYM--NNKQDGFKLRLGISKADSLQKERIELLSKLGLPSVGEFLLKPGT 123

Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
            P+     A+L V   SM+ K E      S+K+   K + C     ++E   +F+L   +
Sbjct: 124 EPISDTLLAFLRVF--SMR-KAELAHWLRSDKVFDLKHMDCALETVVEENVRKFLLTRLQ 180

Query: 427 SSISKY 432
             I+ Y
Sbjct: 181 LLIANY 186


>gi|159473090|ref|XP_001694672.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276484|gb|EDP02256.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 515

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 128/311 (41%), Gaps = 67/311 (21%)

Query: 76  ASTLQKWLSDSGLPPQKMA----IQKVDVGERGLVALKNIRKGEKLLFVPP--SLVITAD 129
           A   ++WL   G   +K A    ++++  G+RG+VA   I +GE LL +P   +L +  D
Sbjct: 2   AEAFEEWLDKHG--GKKHASLDLVKELPNGDRGVVATAPIAEGELLLLLPINCALYMPND 59

Query: 130 SKW-----SCPEAGEVLKQCSVPDWPLLAT--YLISEASFEKSSRWSNYISALPRQ-PYS 181
            +W     S PEA   L +      P LAT   L+SE +    S W+ Y+  LP   P  
Sbjct: 60  EEWAKRGSSFPEAVGYLHEHHRTLSPFLATTLALMSEVARGGESAWAAYVGTLPPSCPDC 119

Query: 182 LLYWTRAELDRYLEASQIRE-------RAIER-ITNVIGTYNDL---------------- 217
           LL W++ E  + LE + + E        A +R +  ++    DL                
Sbjct: 120 LLNWSKEE-KKDLEGTALEELGPDPAADAFKRHVAPILAARRDLWPLQQQGEGEGGAAAD 178

Query: 218 ----RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
                L +F +   L     F++E   W  G     + +L     +V L+P  DM+NHS 
Sbjct: 179 EAAADLALFVRVAGLVQSRAFHLEAENWVSGA--KEISKLEGGGTQVFLLPGIDMINHSH 236

Query: 274 E----------------VETFLDYDKSSQGV----VFTTDRQYQPGEQVFISYGKKSNGE 313
                                L    + +GV    V   D+    GE+V  +YG  S+ +
Sbjct: 237 NPARRNAHLQRLNVAQAAAAKLTEGGAPEGVEAFFVMRADKPIAEGEEVLHTYGNLSDAQ 296

Query: 314 LLLSYGFVPRE 324
           LL +YGF+  E
Sbjct: 297 LLQTYGFLDSE 307


>gi|241603784|ref|XP_002405757.1| SET domain-containing protein, putative [Ixodes scapularis]
 gi|215502568|gb|EEC12062.1| SET domain-containing protein, putative [Ixodes scapularis]
          Length = 429

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 65/270 (24%), Positives = 118/270 (43%), Gaps = 25/270 (9%)

Query: 79  LQKWLSDSGLPPQ-KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L  W+  +G     K+ ++      RG+VAL+ +  GE  L +P +L+I+  +       
Sbjct: 32  LLTWMEANGFRLHSKLGLRDFPDTGRGVVALEKLVGGETFLKLPATLLISTRTALQSRLH 91

Query: 138 GEVLKQ-CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             +++    +    +L  +++ +    ++SRW  ++ +LPR   + ++  R   +     
Sbjct: 92  SFIIRHHAKLTPIDVLTLFVLDQKLLGEASRWWPFVDSLPRTFTTPVFLRRKVFESL--P 149

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-----EVFNMETFKWSFGILFSRLV 251
             +RE     IT +  T+  L++ +   + +  PE       F    F W++  + +R +
Sbjct: 150 KDLREEVQTGITFIQRTFLKLKV-LLGGHVEEEPEVQCLSTGFTWNNFVWAWTAVNTRCI 208

Query: 252 RLPSM-------DGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
                       D   AL P+ D LNH     +ET +      +     + + +   EQV
Sbjct: 209 FAQGSNSSSLWEDDHCALAPFLDCLNHHWKASIETAM----VGENFEILSHKSHDANEQV 264

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSV 332
           FISYG  SN  L L YGFV  +  NP+D V
Sbjct: 265 FISYGPHSNRRLFLDYGFVLPD--NPNDVV 292


>gi|308501895|ref|XP_003113132.1| CRE-SET-27 protein [Caenorhabditis remanei]
 gi|308265433|gb|EFP09386.1| CRE-SET-27 protein [Caenorhabditis remanei]
          Length = 501

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/279 (25%), Positives = 116/279 (41%), Gaps = 51/279 (18%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI------- 204
           LA +L ++    + S+W  YIS LP    + L++T  +L + L+ S I E A+       
Sbjct: 153 LALFLATQWLLNEKSKWLPYISILPNSFPTPLFYTDEQLLQ-LKPSPIFEEALLFYRTIS 211

Query: 205 ----------------ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
                           E         N +   +F   P  F         + W+ G++ +
Sbjct: 212 RQFCYFLMAVAKNKIYESAQRRKDARNTMETPLFYNAP--FTVANLTPGLYFWAVGVVTT 269

Query: 249 RLVRLPSMDGR---------VALVPWADMLNHSC-----EVETFLDYDKSSQGVVFTTDR 294
           R+  +PS              AL+P+ DM NH        VE  + Y  + +  V T+  
Sbjct: 270 RVNMVPSEHSTDKDEKPNLIAALIPFLDMANHENVVTEDPVEDLVCYSPAEECAVITSHC 329

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
             + G QV I YG +S GE LL  GFVP       D ++L + + K+DK    K + + K
Sbjct: 330 DLEAGNQVTIFYGCRSRGEHLLHNGFVPIHHQR-QDVLKLKIGIPKTDKTLDSKTKLIEK 388

Query: 355 YGLSASEC----FPIQITGW-----PLELMAYAYLVVSP 384
           Y +   +C    F + +  +     PL+L+ +A + V P
Sbjct: 389 Y-VQNVQCNGNIFQVDLYNYPEQPFPLDLLMFAAIFVCP 426


>gi|347967016|ref|XP_003436005.1| AGAP002018-PB [Anopheles gambiae str. PEST]
 gi|333469796|gb|EGK97407.1| AGAP002018-PB [Anopheles gambiae str. PEST]
          Length = 504

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 74/320 (23%), Positives = 132/320 (41%), Gaps = 34/320 (10%)

Query: 73  LENASTLQKWLSDSGLPPQKMAI-QKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK 131
           +E  +   +W  + G   + + + +  + G  GL +   I  GE ++ VP S+     ++
Sbjct: 69  METVAHFMRWAVERGCQVENVRVAEHAEYGGLGLESCGPIPAGECIITVPRSMFFYVTNE 128

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
               +  E++    + +    +LA  LI E  F   S W  Y+  LP +  + LY+T  +
Sbjct: 129 PRYRQLLELMPGAMMSEQGNIMLALALIME-RFRAKSDWKPYLDLLPDRYTTPLYYTTED 187

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDL---FPEEVFNMETFK------ 240
           +    E       A++   ++   Y  +R  +  K  +L   F  +VF +  F       
Sbjct: 188 MGELAETDAFLP-ALKLCKHIARQYGFIRRFVQEKVDELRDCFTYDVFRLLLFSLLIPHS 246

Query: 241 WSFGILFSRLVRLP-------SMDGRVALVPWADMLNHS---------CEVETFLDYDKS 284
           W+   + +R  ++P        MD  +AL+P  DM NH+         C  ET  +    
Sbjct: 247 WAVSTVMTRQNKVPVNLAEFDGMDHTLALIPLWDMANHAFPDTANETRCVAETCYNATNE 306

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSLKKSDK 343
                 T +        +FI YG +++ E L+  GFV PR   NP  +V+   +L  +  
Sbjct: 307 QLECSLTREVSDIASVPIFIVYGTRTDAEFLVHNGFVCPR---NPHANVQKRFTLVPAIP 363

Query: 344 CYKEKLEALRKYGLSASECF 363
            YKE+   L   G+  +  F
Sbjct: 364 LYKERAHLLELLGMPTTGTF 383


>gi|156717956|ref|NP_001096520.1| N-lysine methyltransferase setd6 [Xenopus (Silurana) tropicalis]
 gi|325530258|sp|A4QNG5.1|SETD6_XENTR RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|140832737|gb|AAI35641.1| LOC100125156 protein [Xenopus (Silurana) tropicalis]
          Length = 454

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/286 (22%), Positives = 127/286 (44%), Gaps = 25/286 (8%)

Query: 71  DSLEN---ASTLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV 125
           D L+N    S    W    GL   P+     +  V + G++A +++  GE L  +P S +
Sbjct: 14  DHLQNDLPVSCFLAWCKKVGLELNPKVYISTEGTVSQYGMLAREDLSDGELLFSIPRSAI 73

Query: 126 ITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPY 180
           ++ ++   +    +  + L+ CS   W  L   L+ EA+ + SS W+ Y    P    P 
Sbjct: 74  LSQNTTRIRDLIEKEQDSLQSCS--GWVPLLISLLYEAT-DSSSHWAPYFGLWPELDPPD 130

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
             ++W+  E  + L+ + I E   + + N+   YN + L    + P+ F      ++ +K
Sbjct: 131 MPMFWSEEEQTKLLQGTGILEAVHKDLKNIEKEYNSIVLPFIRRNPEKFCPMKHTLDLYK 190

Query: 241 WSFGILFSRLVRLPSMDGRVA----------LVPWADMLNHSCEVETFLDYDKSSQGVVF 290
                + +   + P  +              +VP AD+LNH  +    L++  + + +  
Sbjct: 191 RLVAFVMAYSFQEPQEEDEEEDIEKDILPPMMVPVADLLNHVAQHNAHLEF--TPECLRM 248

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            T +    G+++F +YG+ +N +LL  YGF      N +++ ++ +
Sbjct: 249 ITTKSVCAGQELFNTYGQMANWQLLHMYGFAEPHPQNCNETADIQM 294


>gi|302804174|ref|XP_002983839.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
 gi|300148191|gb|EFJ14851.1| hypothetical protein SELMODRAFT_445692 [Selaginella moellendorffii]
          Length = 236

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 97/222 (43%), Gaps = 34/222 (15%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RGL A + +R GE++L +   L+I                   +PD          + S 
Sbjct: 38  RGLFASRPVRAGERMLEISLDLMIAP---------------SDLPD----------QLST 72

Query: 163 EKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
            +SS W+ YIS LP +P  L     W   EL  YL AS +  +  ER+  +   +  ++ 
Sbjct: 73  LQSSAWAPYISCLP-EPAGLDNTFLWEDTEL-SYLRASPLYGKTRERLEIITTEFGQVQ- 129

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFL 279
                +P LF +   ++E F   +  +FSR + +   D  + ++P  D  NH+      L
Sbjct: 130 NALDVWPQLFGK--VSVEDFMHVYATVFSRPLAI-GEDSTLVMIPMLDFFNHNAASFAKL 186

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            ++      V T DR     +Q++I+ G  SN EL L YGF 
Sbjct: 187 SFNGLLNYAVVTADRDCAENDQIWINCGDLSNAELALDYGFT 228


>gi|348679693|gb|EGZ19509.1| putative ribulose-1,5 bisphosphate carboxylase/oxygenase small
           subunit N-methyltransferase I [Phytophthora sojae]
          Length = 606

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/251 (24%), Positives = 118/251 (47%), Gaps = 21/251 (8%)

Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP---LLATYLISEASFEKS 165
           +++ +G +LL +P S V++  S       G +L+    PD P    L  +L+ E +    
Sbjct: 205 EDVEQGAELLSLPMSKVMSVASAARG-RVGLLLEVN--PDLPPAIALGLHLLEEQALGAK 261

Query: 166 SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           S +S ++S+LP      S L+++  +L + +E SQ+    + R   V   Y+ L   + S
Sbjct: 262 SNFSEFVSSLPGVEAINSTLFYSENQL-KEMEGSQLLRYTLGRAQAVEAFYDALLQPVTS 320

Query: 224 KY---PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE---- 276
                P +F E+ F ++ F+W+ G++++    +   +  V L P  D +    +V     
Sbjct: 321 PEAVDPPIFKEQDFTLDKFRWAMGVVWASAFPVGEDEADVVLAPVLDTIGICTDVADEGD 380

Query: 277 -----TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
                  ++ D+SSQ +V       + G +V +S   KS+ + +L+ GF     +   D 
Sbjct: 381 EACPPNQIEVDQSSQRLVVHASSPLEKGREVRLSMPGKSSAQFMLNNGFARDRASKKLDK 440

Query: 332 VELPLSLKKSD 342
           ++L ++L  SD
Sbjct: 441 LDLTVTLDPSD 451


>gi|24640264|ref|NP_727144.1| CG32732 [Drosophila melanogaster]
 gi|22831862|gb|AAF46222.2| CG32732 [Drosophila melanogaster]
 gi|28316927|gb|AAO39485.1| RE55639p [Drosophila melanogaster]
 gi|220957744|gb|ACL91415.1| CG32732-PA [synthetic construct]
          Length = 537

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/307 (23%), Positives = 129/307 (42%), Gaps = 31/307 (10%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L        W  D G+  + + I      + GL A + + K E +L VP  L+++ ++  
Sbjct: 115 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKDELVLSVPRKLILSEENNS 174

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
            C   G++ +   +     LA  L+ E    + S W  YI  LP +  ++LY+T  +++ 
Sbjct: 175 DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRPYIDVLPAKYNTVLYFTTKQME- 229

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
            L  +     A+ +   +   Y  L     +          +P      F +     + +
Sbjct: 230 LLRGTAAAALAMRQCRVIAKQYAFLYKYAHTMTEPSTGNRSHPGERGLFFTQHGLCYKLY 289

Query: 240 KWSFGILFSRLVRLPSM-----DG---RVALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
           +W+   + +R   +PS      DG     AL+P+ DM NH   ++ +F  Y   S+ +  
Sbjct: 290 RWAVSTVMTRQNLVPSEKQESEDGPKLISALIPYWDMANHRPGKITSF--YATVSRQLEC 347

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
           T       GEQ FI YG +SN +LL+  GFV  +  N  D V + + L  +D    ++  
Sbjct: 348 TAQEAVNTGEQFFIYYGDRSNTDLLVHNGFV--DPNNTKDYVNIRVGLSLTDALAAKRAS 405

Query: 351 ALRKYGL 357
            L K  +
Sbjct: 406 ILDKLNI 412


>gi|307107214|gb|EFN55457.1| hypothetical protein CHLNCDRAFT_52262 [Chlorella variabilis]
          Length = 478

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/371 (24%), Positives = 137/371 (36%), Gaps = 83/371 (22%)

Query: 14  PSFSHLHKAQSPAGFTDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSL 73
           P F  LH  +  A     PR+R G         S +   S+ +  VT++ + W  E   L
Sbjct: 22  PDFRQLHSCRPAAAL---PRRRHGTAAAAATPQSGSGSTSK-EAAVTKDYLSWATEAGIL 77

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSK 131
                           P+ M  Q    G RG  AL +I   E  + VP   +LV+  + +
Sbjct: 78  S---------------PKLM--QAYFGGLRGGQALSDIAADEVFVTVPRGAALVVAPNER 120

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
             CP+  +       P +  +A  L+ E    + S    YI  LP    + + W  A+L 
Sbjct: 121 CPCPDFVDPGFYKEAPWFVKMAVLLLWERRKGRGSSVWGYIEQLPSSIDTPVRWEEADLA 180

Query: 192 RYLEASQIRERAIERIT----------------------NVIGTYNDLRLRIFS------ 223
                  I+E   ++                        N +    ++R R FS      
Sbjct: 181 ELQYQPAIKEIKQQQTAWRQQYNRFCAAVRPGQGNYSWDNFLWAAENVRSRAFSGPYTGS 240

Query: 224 --------------------KYPDLFPEEVFN----METFKWSFGILFSRLVRLPSMDGR 259
                                +  L  E+V N    +  F   + +L S+ ++       
Sbjct: 241 SVGEKARTLGLLLAAGGGYAAWQQLPLEQVLNGFISVLLFNIIYDVLISKKLKW------ 294

Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
            AL P  D LNHS  VE+ + Y+      V +T   Y+ G+QVFISYG ++NG LL  Y 
Sbjct: 295 YALCPVVDALNHSSLVESDVAYEYFKDTFVLSTKSAYKAGQQVFISYGAQANGSLLQYYA 354

Query: 320 FVPREGTNPSD 330
           F   E  NP+D
Sbjct: 355 FT--EPGNPND 363


>gi|405953717|gb|EKC21325.1| SET domain-containing protein 6 [Crassostrea gigas]
          Length = 384

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/281 (23%), Positives = 111/281 (39%), Gaps = 22/281 (7%)

Query: 76  ASTLQKWLS--DSGLPPQKMAIQ-KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           A+   KW S  D      K+ I       + G+VA+ ++++GE L  +   +++   S  
Sbjct: 30  ANAFLKWFSSNDDNFFSGKVTIGPDGSCAQNGMVAIADVQEGESLFRISRKILLHPKSS- 88

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           S     E     S   W  L   ++ E +  K S W  Y   LP      ++WT  E ++
Sbjct: 89  SISALFEKDPVNSESGWSELLICMMQEYN-TKDSPWKPYFDVLPETVDLPMFWTEEEREK 147

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            L  + + E        ++  +  +      K+ D   E   ++E +K     + +    
Sbjct: 148 LLTGTGVVEAVNRDNKKILTEFQSVVSPYLKKHKDTISESCDDLELYKRMVSYVMAYSFT 207

Query: 253 LP---------------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
            P                    + +VP ADMLNH       L +      ++ T D   +
Sbjct: 208 EPPKDDDSDDFGEEDEEEEKSTIYMVPMADMLNHIANNNAHLSFKPDCLEMIATKD--IK 265

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            GE+VF +YG+ +N  LL  YGF      N  D+V++PL L
Sbjct: 266 KGEEVFNTYGELANWHLLHMYGFSEAYPANHYDTVDIPLDL 306


>gi|320170159|gb|EFW47058.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 640

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 75/267 (28%), Positives = 121/267 (45%), Gaps = 31/267 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKV-DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L  W+ ++GL     A   + DV E  L A   I     +  VP  LV+  ++     E 
Sbjct: 173 LTAWIDNAGLEINSNARPGLNDVDELYLFASNPIEAATLVATVPAPLVMF-ETYLRTLEN 231

Query: 138 GEVL------KQCSVPDWP-LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
             +L      K  SVPD    LA  L+ E S+E  S W  +IS+LP+   S ++W+  E 
Sbjct: 232 PMILAIDRRFKTMSVPDPSYALAMALLYE-SYEPKSMWREWISSLPQTLDSTVFWSAEEQ 290

Query: 191 DRYL-----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
           D          +QI ER ++++      YN    R+ + +P +F    ++ E FKW++ I
Sbjct: 291 DALQSLPLKRKTQILERHLQQL------YNATTPRLLAAFPHIFAGGNYSYEMFKWAYMI 344

Query: 246 LFSRLVRL---PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV-----FTTDRQYQ 297
           + SR +     P    ++ L P  D+L+H   V+T +      + V+       T R  +
Sbjct: 345 VDSRSLTFSTGPDTLPQIMLAPLVDLLHHD-PVQTNIQLGVHPEEVLGFEISLKTTRAIK 403

Query: 298 PGEQVFISYGKKSNGELLLSYGF-VPR 323
            GE +    G+  N +LLL +G  +PR
Sbjct: 404 KGEPLVRHIGELPNHQLLLRFGLAMPR 430


>gi|406603886|emb|CCH44637.1| hypothetical protein BN7_4206 [Wickerhamomyces ciferrii]
          Length = 477

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 133/287 (46%), Gaps = 25/287 (8%)

Query: 74  ENASTLQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           E+    Q WL +SG+   P  K+   +     RGL++L++I + E L  +P ++++  ++
Sbjct: 5   EDTHNFQNWLINSGVQISPKIKIEDLRYLSQGRGLISLQDINQDEILFKIPRNVLLNIET 64

Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
                      K  +  D W  L   ++ E S    S+W  Y   LP + +SL++W + E
Sbjct: 65  GSLSQINNNKEKLLTNYDHWEGLILTILYELSLGNESKWFQYFKILPNEFHSLMFWEKDE 124

Query: 190 LDRYLEASQI-----RERAIERITNVI-GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           L+  L+ S +     +E+A+E    +I     DL +   +   DLF +    +    +SF
Sbjct: 125 LE-LLKPSLVLDRIGQEKALETFNKLIPNALVDLGINHLNISLDLFHKVASTI--LSYSF 181

Query: 244 GILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQ 295
            +           D +V       ++V  AD+LN    +    L Y+  ++ ++  + + 
Sbjct: 182 DVERPDFNEDMEDDEQVQYDGYFKSMVTLADLLNADTNLSNANLFYE--TEFLIMKSIKP 239

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS-LKKS 341
              G+Q++ +YG   N ELL  YG+V   G+   D  ELP+S +KK+
Sbjct: 240 IPQGQQIYNTYGDHPNSELLRRYGYVEYNGS-KFDFGELPISTIKKT 285


>gi|327291705|ref|XP_003230561.1| PREDICTED: n-lysine methyltransferase SETD6-like, partial [Anolis
           carolinensis]
          Length = 324

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 78/162 (48%), Gaps = 10/162 (6%)

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-- 240
           ++W+R E  + L+ + + E   + + ++   ++ + L     +PDLF  +V N+E +K  
Sbjct: 6   MFWSREEQKQLLQGTGVPEAVEKDLASIQEEFSSVVLPFMKAHPDLFNPKVHNLELYKRL 65

Query: 241 ------WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
                 +SF  L             + +VP AD+LNH       L++  S + +     R
Sbjct: 66  VAFVMAYSFQELLDEEEEEEGKPSPLVMVPLADLLNHVANHNANLEF--SPEHLQMVATR 123

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
               G++VF +YGK SN +LL  YGF      N +D+ ++P+
Sbjct: 124 TIPKGQEVFNTYGKLSNWQLLHMYGFAEPYPGNTNDAADIPM 165


>gi|255070351|ref|XP_002507257.1| predicted protein [Micromonas sp. RCC299]
 gi|226522532|gb|ACO68515.1| predicted protein [Micromonas sp. RCC299]
          Length = 986

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 74/270 (27%), Positives = 123/270 (45%), Gaps = 32/270 (11%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK----GEKLLFVPPSLVITADSKWS 133
           TL +W++  G    K  +   D   R ++A +N+      G+ +  +P + ++T  + ++
Sbjct: 20  TLWEWVTRHGGSAPKARLS--DAYPRTVIAAENVNGAQDGGDTIFSIPITCLMTPAAAFA 77

Query: 134 CPEAGEVLK----QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
               G+V +      SV D  +L  +L  E     +S W  YI  LP    + L W+RAE
Sbjct: 78  DVTYGKVFELFAAHQSVEDRTVLVFFLAIERQRGMTSHWGPYIRELPSIFSNPLNWSRAE 137

Query: 190 L-----DRYLEASQIRERAIERITNVIGTYNDLRLR---IFSKYPDL-------FPEEVF 234
                  R   A++  + A+ ++T V        LR   I S              ++  
Sbjct: 138 TLRLAGTRLGGATKFHDCALLQLTEVCVPAFIAILRAQLILSANTKAIASGAISLAQDAL 197

Query: 235 NMETFKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHS--CEVETFLDYDKSSQGVVF 290
           + +   WS   + SR   L  ++G+  +ALVP  DML+HS   ++E   D D + Q ++ 
Sbjct: 198 SPDRLAWSHSCVSSRAFSL-FLNGQRTIALVPLGDMLDHSPDAQIEWRTD-DTAGQFLII 255

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           + DR    G  +F +YG KSN EL+L YGF
Sbjct: 256 SHDR-LPAGSIMFNNYGAKSNEELILGYGF 284


>gi|363747293|ref|XP_003643967.1| PREDICTED: N-lysine methyltransferase SETD6-like [Gallus gallus]
          Length = 447

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 73/293 (24%), Positives = 128/293 (43%), Gaps = 30/293 (10%)

Query: 82  WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP---- 135
           W   +G  L P+    ++  V   GL+A  ++  GE L  VP S ++   S+ +C     
Sbjct: 24  WCEAAGVELSPKVSISRRGTVSGYGLLAAADLEPGELLFSVPRSALL---SQHTCAIRAL 80

Query: 136 --EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL----YWTRAE 189
             +A E L+  S   W  L   L+ E +   +S W  Y S    Q +S L    +W   E
Sbjct: 81  LHDAQESLQSQS--GWVPLLLALLHEYT-TGTSHWRPYFSLW--QDFSSLDHPMFWPEEE 135

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
             R L+ + I E   + + N+   Y+ + L     +PD+F  E+  +E +K     + + 
Sbjct: 136 RVRLLQGTGIPEAVDKDLANIQLEYSSIILPFMKSHPDIFDPELHTLELYKQLVAFVMAY 195

Query: 250 LVRLPSMDGRVA--------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
             + P  +            +VP AD+LNH       L Y  +   +V  T +    G++
Sbjct: 196 SFQEPLEEEDEDEKGPNPPMMVPVADILNHVANHNASLKYAPTCLRMV--TTQPISKGQE 253

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           +F +YG+ +N +LL  YGF      N +D+ ++ +   +     + K EA ++
Sbjct: 254 IFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQMVTVRKAALQRAKSEAQQQ 306


>gi|428177750|gb|EKX46628.1| hypothetical protein GUITHDRAFT_107412 [Guillardia theta CCMP2712]
          Length = 606

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 96/208 (46%), Gaps = 26/208 (12%)

Query: 155 YLISEASFEK-SSRWSNYISALPRQPYSLLYWTRAE-----LDRYLEASQIRERAIERIT 208
           +LI E   ++ +SRW  Y   LP +  + + +   E     LD  L  +   ++  +   
Sbjct: 281 FLIHEMKTKRETSRWKTYFDFLPGKFETGICFEEEEGGGLNLDEELAGTGFVQKRWKERE 340

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLPS-MDGRVA---- 261
            V  TYN L   +  ++P +F  E F+ ++F W+ G+  +R V  + P+   G+V     
Sbjct: 341 VVEHTYNMLFPWLTEEFPQVFDREHFDFQSFMWARGVFDTRCVTVKFPAEKTGKVGVDNN 400

Query: 262 ------------LVPWADMLNHSCEVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
                       LVPWADM NH    +      D + + + F T    + G QVF++YG 
Sbjct: 401 GEGEKGTRDVTCLVPWADMCNHHPYAQLNKPSLDPTRKFLQFCTMAPIKQGSQVFLNYGP 460

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPL 336
             N +LLL YG+  ++    + ++EL L
Sbjct: 461 LDNTQLLLYYGYAEQDNPYQTYAIELEL 488


>gi|440804394|gb|ELR25271.1| rubisco lsmt substrate-binding protein [Acanthamoeba castellanii
           str. Neff]
          Length = 408

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 69/287 (24%), Positives = 123/287 (42%), Gaps = 26/287 (9%)

Query: 111 IRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-----SVPDWPLLATYLISEASFEKS 165
           +   E++L VP SL++ A +     + G V         +V +   LA +++ E   +  
Sbjct: 2   VLASERILEVPFSLLLDAGAALRAEDVGSVFAAVKPALDAVDNRLPLALFMLHELR-KPD 60

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
           S W  Y  ALP +    ++W   ++ + L  S +    + +       + +  + I  +Y
Sbjct: 61  SFWRPYFDALPSRVNLPMFWADEDM-QLLAGSPLHAAVLAQKKQARDWHTEHIVPIVRRY 119

Query: 226 PDLFP--------EEVFNMETFKWSFGILFSRLVRLPSMDG--RVALVPWADMLNHSCEV 275
           P  F         E  +++  F+W   ++ SR      +       +VP AD++NHS   
Sbjct: 120 PRPFGVSDDDSSLEPSYSLARFEWVLSMIASRAFWHFDLKDTWEPHMVPMADLINHSLTN 179

Query: 276 ETFLDY--DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
           +    Y  D  +Q  +    + Y  GEQVFI+Y   SN ELL +Y  +  +  N    + 
Sbjct: 180 DNVSKYTFDDKTQTFIVHVQQPYAEGEQVFITYCTDSNFELLKTYAMMVEDNYNKYTEIR 239

Query: 334 LPLSLKKSDKCYKE-----KLEALRKYGLSASECFPIQITGWPLELM 375
           L      +  C  E     K  AL + GL A + +P++   +PL+L+
Sbjct: 240 LD-ETTIARICPDEVERLTKTRALTQRGL-AKQTYPVKSEEFPLDLV 284


>gi|348676124|gb|EGZ15942.1| hypothetical protein PHYSODRAFT_561656 [Phytophthora sojae]
          Length = 429

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 89/183 (48%), Gaps = 13/183 (7%)

Query: 156 LISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
           L++E +  + S +  YI  LP     P+S    +R E+ R+  A  I +  +     V+ 
Sbjct: 95  LLAELARGEESGFHGYIQQLPTSISLPFSWGAESR-EMLRHTTAHLILDDKL-----VLK 148

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNH 271
            Y D    +  ++  ++P EV  +E F+W++ ++ SR  ++   DG+   L+P  DM NH
Sbjct: 149 MYADYAEPLMKEFSTIWPAEVSTLEKFQWAYSMVSSRAFKV--TDGQEPTLLPVIDMANH 206

Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
           + E          +     TT R+ +  E V ISYG  SN +LL  YGFV      PSDS
Sbjct: 207 AAENPAAHIVKTETGSFQLTTLRKVEKDESVTISYGDLSNAQLLCRYGFVLPTSV-PSDS 265

Query: 332 VEL 334
           + +
Sbjct: 266 IHI 268


>gi|325530255|sp|E1BI64.1|SETD6_BOVIN RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
          Length = 450

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 75/310 (24%), Positives = 135/310 (43%), Gaps = 34/310 (10%)

Query: 65  PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
           P G + D    AS L  W    GL   P+    ++  V   G+VA ++++ GE L  VP 
Sbjct: 13  PAGSDDDPAPVASFL-SWCQRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 71

Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALP-- 176
           + ++   S+ +C  +G + ++     S   W  L   L+ E     +S WS Y +  P  
Sbjct: 72  AALL---SQHTCSISGVLERERGALQSQSGWVPLLLALLHEMQ-APASPWSPYFALWPEL 127

Query: 177 ---RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV 233
              + P   ++W   E  R L+ + + E   + + N+   Y  + L     +PDLF   V
Sbjct: 128 GRLQHP---MFWPEEERRRLLQGTGVPEAVEKDLVNIRSEYYSIVLPFMDAHPDLFSPRV 184

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQ 286
            ++E ++    ++ +   + P  +           +VP AD+LNH       L+Y  +  
Sbjct: 185 RSLELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPTCL 244

Query: 287 GVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
            +V       QP   G ++F +YG+ +N +L+  YGF      N +D+ ++ +   +   
Sbjct: 245 RMV-----AIQPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTNDTADIQMVTVREAA 299

Query: 344 CYKEKLEALR 353
               K+EA R
Sbjct: 300 LQGTKVEAER 309


>gi|308813462|ref|XP_003084037.1| unnamed protein product [Ostreococcus tauri]
 gi|116055920|emb|CAL58453.1| unnamed protein product [Ostreococcus tauri]
          Length = 467

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/313 (24%), Positives = 117/313 (37%), Gaps = 75/313 (23%)

Query: 68  CEIDSLENASTLQKWLSD-SGLPPQKMAIQKVDVG---ERGLVALKNIRKGEKLLFVPPS 123
           C+  +L+    L  WLS+  G   + + +   + G       V    IR+GE ++ +P  
Sbjct: 46  CDAAALDR---LHAWLSNIEGFSDRGLRLTTANDGFGVRATCVRESGIRRGEVIIQIPRE 102

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP--YS 181
             +TA  +    E G         D+  L   ++ E   +++SRW  Y+  LP     + 
Sbjct: 103 AFLTAHPRM---EYG--------SDFRRLVAAVLEEMKRKEASRWWPYLQTLPHHGDWHP 151

Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR--------LRIFSKYPDLFPEEV 233
           LL+   A   R    +   +R  ER+ N     +D R          +  K  D +P E 
Sbjct: 152 LLWSDEARTSRLPTWTVASKRLTERLRNCANDAHDFRSMGLGKDVKDVSGKNDDEWPSEA 211

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV--------------------------------- 260
                 +W+  I  SR  RL   D  V                                 
Sbjct: 212 ----DVRWASAICASRAFRLEFFDDDVFENFDETDARAFAVLERLSDVDEDVWGPGPSED 267

Query: 261 -------ALVPWADMLNHSCEVE--TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK-S 310
                   LVPWAD LNHS + E  + L YD  S        + Y  G+QVF SYG   S
Sbjct: 268 DFDTSVLVLVPWADGLNHSSDAEESSILRYDAGSATATLRAHKSYARGDQVFDSYGVHLS 327

Query: 311 NGELLLSYGFVPR 323
           + ++ L +GFV R
Sbjct: 328 DVDVFLDFGFVVR 340


>gi|281207217|gb|EFA81400.1| mRNA-decapping enzyme 2 [Polysphondylium pallidum PN500]
          Length = 1078

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/282 (23%), Positives = 129/282 (45%), Gaps = 55/282 (19%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSK 131
           ++   +Q W  DS  P     +  +   E+G+ + ++I++GE+LL +P   SL +    +
Sbjct: 27  DDGINIQTWKQDSKQP-----LLSLTPNEKGIFSSRDIKEGEELLSLPWYNSLSMNKVQQ 81

Query: 132 ---WSCPEAGEVLKQCSVPDWPLLA----TYLISEASFEKSSRWSNYISALPRQPYSLLY 184
              W   +  ++  + +  D  ++A     Y + + SF+    +S + SA+P    S L+
Sbjct: 82  QLPWLFNKIQDL--ELTAEDGLVVALLYYRYCMDDLSFD----YSEWFSAMPEVLNSGLF 135

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDL---RL---RIFSKYPDLFPEEVFN--- 235
           ++ AE +               + N    Y DL   RL    +F +   LF E+ F+   
Sbjct: 136 FSDAEAE---------------LLNGSPAYIDLMNQRLDAKELFGRLKSLFKEQQFSKCA 180

Query: 236 --METFKWSFGILFSRLV--RLPSMDGR------VALVPWADMLNHSCEVETFLDYDKSS 285
              +  KW++ ++ SR +    P++D        V L P+ D  NH+ + +   D+D   
Sbjct: 181 MTYDRLKWAYSVVDSRKIYTEAPNLDANGNPFITVVLAPFLDYFNHAEDAQAAYDFDYDE 240

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKS-NGELLLSYGFVPREGT 326
             +     +  + GEQ+F++YG +  N +LL+ YGF+ +  T
Sbjct: 241 SAIKVVALQPIKKGEQIFLNYGNQDCNSDLLIHYGFIDQSST 282


>gi|255071849|ref|XP_002499599.1| predicted protein [Micromonas sp. RCC299]
 gi|226514861|gb|ACO60857.1| predicted protein [Micromonas sp. RCC299]
          Length = 588

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 77/295 (26%), Positives = 121/295 (41%), Gaps = 33/295 (11%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA-GEVLKQ-CSVPDWPLLATYLISEA 160
           RG  A  +I  G+    +P   + T       P   G+  +   ++ +  + A +LI+E 
Sbjct: 186 RGAAATTHIPAGDIAAAIPVERLFTVRHALEMPGPRGDAYRMFAALGEDTIAALWLIAER 245

Query: 161 SFEKSSRWSNYISALP--------RQPY---SLLYWTRAELDRYLEASQIRERAIERITN 209
           +  ++S W   I++LP          P    + + W R   D  L  + +   AI     
Sbjct: 246 ALGEASPWHAVIASLPWPEGGEGSASPCGGCTPVSWPREACDALLGGTPLLADAIAASEK 305

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-----W-SFGILFSRLVRLPSMDGRVALV 263
           +   +  L   +     D+FP   + ++ F+     W S+G+        P       L 
Sbjct: 306 LARQHAALFPALSEHMADVFPASAYTLDNFRRAHEAWNSYGMTVQAS---PGEPAATCLP 362

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYGKKSNGELLLSYGF-V 321
           P A + NH+      + Y +   G +     R    GE+VF+SYG KSN ELLL YGF +
Sbjct: 363 PVAMLCNHALWPH-VVRYSRLRDGTLRLPVARSVHAGEEVFVSYGAKSNAELLLFYGFAL 421

Query: 322 PREGTNPSDSVELPLSLKKSD--KCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
           P    NP D V L L L   +     K +  AL + GL+ S   P  +   PL L
Sbjct: 422 P---GNPYDDVPLSLELPGGEVADVTKAREAALARAGLTLS---PHAVRAGPLPL 470


>gi|449472508|ref|XP_002187588.2| PREDICTED: N-lysine methyltransferase SETD6 [Taeniopygia guttata]
          Length = 383

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 113/263 (42%), Gaps = 18/263 (6%)

Query: 105 LVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
           ++A + +  GE L  +P + ++   T        EA E L+  S   W  L   L+ E +
Sbjct: 1   MLAAEELEAGEVLFTIPRTALLSQHTTSIHALLQEAQESLQ--SQSGWVPLLLALLHEYT 58

Query: 162 FEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
              +S W  Y S     R     ++W + E  R L+ + I E   + + N+   YN + L
Sbjct: 59  -ASNSHWQPYFSLWQDFRSLDHPMFWPQEERTRLLQGTGIPEAVDKDLANIQLEYNSIIL 117

Query: 220 RIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
                +PD+F  ++  +E +K        +SF                  +VP AD+LNH
Sbjct: 118 PFMETHPDIFDPKLHTLELYKELVAFVMAYSFQEPLEEEEEDEKGPNPPMMVPVADILNH 177

Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
                  L+Y  S Q +   T +  + G+++F +YG+ +N +LL  YGF      N  D+
Sbjct: 178 VANHNANLEY--SPQCLRMVTTQPVRKGQEIFNTYGQMANWQLLHMYGFAEPYPGNSHDT 235

Query: 332 VELPLSLKKSDKCYKEKLEALRK 354
            ++ +   +     + K EA ++
Sbjct: 236 ADIQMVTLRRAALQRAKSEAQQQ 258


>gi|169595142|ref|XP_001790995.1| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
 gi|160701026|gb|EAT91800.2| hypothetical protein SNOG_00305 [Phaeosphaeria nodorum SN15]
          Length = 391

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 66/134 (49%), Gaps = 14/134 (10%)

Query: 214 YNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLP------SMDGRVALVPW 265
           + DL   I +   DLF     + N  TF W +  L +   RLP      + D   A+ P+
Sbjct: 122 WKDLHPHIPAISKDLFTYTWLIVNTRTFYWEYPDLPNSHPRLPKKRKQLTADDCYAMCPF 181

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            D  NHS   +   D +  S+G   T DR+Y+ GE+VF+SYG  +N  LL+ YGF+    
Sbjct: 182 MDYFNHS---DVGCDPESDSKGYSVTADREYKAGEEVFVSYGAHTNDFLLVEYGFILDSN 238

Query: 326 TN---PSDSVELPL 336
            N   P D + LPL
Sbjct: 239 RNDAIPLDHLILPL 252


>gi|116200882|ref|XP_001226253.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
 gi|88175700|gb|EAQ83168.1| hypothetical protein CHGG_10986 [Chaetomium globosum CBS 148.51]
          Length = 400

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 76/260 (29%), Positives = 119/260 (45%), Gaps = 36/260 (13%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--AGEVLKQCSVPDWPLLATYLISEAS 161
           G+VA + +++G+K+L VP  LV    S  + PE  +G +    S+    LLA  L  +  
Sbjct: 54  GMVAHRRLKRGQKILRVPTQLV---HSLHTVPERISGRLPPDMSI--HALLAANLTVDG- 107

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
               S W + +  L      L +    EL   L     RE   ++  +    +N    ++
Sbjct: 108 MAGLSTWKDSLPTLGDFNTGLPFMWHKELQELL-PKPARELLKKQQDSFHRDWN----KV 162

Query: 222 FSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS--- 272
              +PDL  ++      V N  TF ++      R  + P +D R+A+VP AD  NH+   
Sbjct: 163 AKAFPDLRQDDYLHSWFVINTRTFYYAT----PRTEKYPPVD-RLAIVPIADFFNHADTG 217

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
           CEV     +DK   G + + DR Y   ++V+ISYG  +N  LL  YGF+P    N  D V
Sbjct: 218 CEVT----FDKD--GFIVSADRDYHGDQEVYISYGAHTNDFLLAEYGFLP--AANRWDEV 269

Query: 333 ELP-LSLKKSDKCYKEKLEA 351
            +  + L K    +KE L+ 
Sbjct: 270 CVDEVILPKPSTAHKELLQG 289


>gi|255083504|ref|XP_002504738.1| predicted protein [Micromonas sp. RCC299]
 gi|226520006|gb|ACO65996.1| predicted protein [Micromonas sp. RCC299]
          Length = 453

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 79/307 (25%), Positives = 132/307 (42%), Gaps = 61/307 (19%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKWSC 134
           + + +WL  +G    + +  +  VG RG  L A +N+R GE ++ +P    IT D     
Sbjct: 43  TDIAEWLVANG---GECSAVRAGVGSRGRGLFAARNLRAGESIVRIPLKACIT-DIASPN 98

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL 194
           P  G        P    LA  +++E     SSRW+ Y+++LP++         A  D  L
Sbjct: 99  PYPG-------CPYSVTLAAAILTERDAGSSSRWAQYVASLPKEVVGY-----ANCDEAL 146

Query: 195 EASQIRERAI----ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
              +   RA     + + + + TY  L   +   +  +  +  +N   + W+   + SR 
Sbjct: 147 VGDEDVIRAAVGGDDALVDELQTYASL---VIGSHAAIV-QRGWNSRDWTWAMSQVHSRT 202

Query: 251 VRL----PSMDG-RVA-----------LVPWADMLNHS-------CE-------VETFLD 280
            R+    P+  G RV            L P+AD+LNH        CE       V   L 
Sbjct: 203 FRVDLEVPAAHGARVGNDGNRERTVRLLAPFADLLNHDSDQNEVCCEWGVEQRAVGNELG 262

Query: 281 YDKSSQGVVFT--TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            D S+ GV F     R  Q G +  +SYG++S+    + YGF+P+   NP +   L  +L
Sbjct: 263 SDLSN-GVDFVVKASRDIQEGSEALVSYGERSDPHFFMYYGFLPK--INPFNRAPLFRTL 319

Query: 339 KKSDKCY 345
           +++ + Y
Sbjct: 320 REASRWY 326


>gi|198470241|ref|XP_001355267.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
 gi|198145358|gb|EAL32324.2| GA17108 [Drosophila pseudoobscura pseudoobscura]
          Length = 568

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 132/308 (42%), Gaps = 40/308 (12%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           +   +W    G+    + I      + GL A ++I   + +L VP +L+    S+   PE
Sbjct: 137 TAFSEWAKAGGVKTDCLEIAIFPGYQLGLRATQDIAAEQPVLSVPRTLIF---SEEHLPE 193

Query: 137 AGEVLKQCSVPDWPLLATY-----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
               L      ++PLL  +     L+ E      S W  YI  LP +  ++LY++  ++ 
Sbjct: 194 TDRKL----FCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQ 249

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---------LFPEEVFNMETFKWS 242
           R L  +     A+ +   +   Y ++  +     PD         LF +     E ++W+
Sbjct: 250 R-LRGTAACTSALRQCRVIARQYANM-YKCAHIRPDASSASSMGVLFTQHGLCYELYRWA 307

Query: 243 FGILFSRLVRLP-----SMDGR-------VALVPWADMLNHS-CEVETFLDYDKSSQGVV 289
              + +R   +P     + DG         AL+P+ DM NH   ++ ++  YD     + 
Sbjct: 308 VSTVMTRQNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSY--YDSGVHQMD 365

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
            T     + GEQ FI YG +SN +LL+  GF+  +  N  D V++ L L  SD   +++ 
Sbjct: 366 CTAQEACKAGEQFFIYYGDRSNADLLVHNGFI--DVNNRKDYVKIRLGLGLSDALVEQRA 423

Query: 350 EALRKYGL 357
           + L +  +
Sbjct: 424 KILARLNI 431


>gi|195168946|ref|XP_002025291.1| GL13316 [Drosophila persimilis]
 gi|194108747|gb|EDW30790.1| GL13316 [Drosophila persimilis]
          Length = 568

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 73/308 (23%), Positives = 132/308 (42%), Gaps = 40/308 (12%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           +   +W    G+    + I      + GL A ++I   + +L VP +L+    S+   PE
Sbjct: 137 TAFSEWAKAGGVKTDCLEIAIFPGYQLGLRATQDIAAEQPVLSVPRTLIF---SEEHLPE 193

Query: 137 AGEVLKQCSVPDWPLLATY-----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
               L      ++PLL  +     L+ E      S W  YI  LP +  ++LY++  ++ 
Sbjct: 194 TDRKL----FCNFPLLTNFNLAYALVIEKVRGPDSVWRPYIDVLPARYNTVLYFSIEQMQ 249

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---------LFPEEVFNMETFKWS 242
           R L  +     A+ +   +   Y ++  +     PD         LF +     E ++W+
Sbjct: 250 R-LRGTAACTSALRQCRVIARQYANM-YKCAHIRPDASSASSMGVLFTQHGLCYELYRWA 307

Query: 243 FGILFSRLVRLP-----SMDGR-------VALVPWADMLNHS-CEVETFLDYDKSSQGVV 289
              + +R   +P     + DG         AL+P+ DM NH   ++ ++  YD     + 
Sbjct: 308 VSTVMTRQNLVPRELQANDDGDDLSQLPISALIPYWDMANHRPGKITSY--YDSGVHQMD 365

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
            T     + GEQ FI YG +SN +LL+  GF+  +  N  D V++ L L  SD   +++ 
Sbjct: 366 CTAQEACKAGEQFFIYYGDRSNADLLVHNGFI--DVNNRKDYVKIRLGLGLSDALVEQRA 423

Query: 350 EALRKYGL 357
           + L +  +
Sbjct: 424 KILARLNI 431


>gi|195480581|ref|XP_002101314.1| GE17555 [Drosophila yakuba]
 gi|194188838|gb|EDX02422.1| GE17555 [Drosophila yakuba]
          Length = 548

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 72/307 (23%), Positives = 127/307 (41%), Gaps = 31/307 (10%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L        W  D G+  + + I      + GL A + + K E +L VP  L+ + ++  
Sbjct: 119 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRANRPLAKEELVLSVPRKLIFSEENNS 178

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
            C   G++ +   +     LA  L+ E    + S W  YI  LP +  ++LY+T  +++R
Sbjct: 179 DCRLFGKMTQATHLN----LAYDLLIEKIRGEFSEWRPYIDVLPAKYSTVLYFTTKQMER 234

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
            L  +     A+ +   +   Y  L     +          +P      F +     + +
Sbjct: 235 -LRGTAACSLALRQCRVIAKQYAFLYRYAHTLAESSTGNRSHPGERGLFFTQRGLCYKLY 293

Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFLDYDKSSQGVVF 290
           +W+   + +R   +PS             AL+P+ DM NH   ++ +F  Y   S+ +  
Sbjct: 294 RWAVSTVMTRQNLVPSEKQEAQDSPKFISALIPYWDMANHRPGKITSF--YAAVSRQLEC 351

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
           T       GEQ FI YG +SN +LL+  GFV  +  N  D V + + L  +D    ++  
Sbjct: 352 TAQEAVAAGEQFFIYYGDRSNTDLLVHNGFV--DVNNLKDYVNIRVGLSPTDALAAKRAS 409

Query: 351 ALRKYGL 357
            L K  +
Sbjct: 410 ILDKLNI 416


>gi|212546319|ref|XP_002153313.1| SET domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064833|gb|EEA18928.1| SET domain protein [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 110/271 (40%), Gaps = 50/271 (18%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RG+VA  NI++GE L  +P  +V+   +        + LK      W  L   +I E S 
Sbjct: 48  RGVVARSNIQEGEDLFHLPHHIVLMVKTSRLNQILADDLKNLGP--WLSLVVVMIYEYSL 105

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
            + S W  Y   LP +  +L++W+  E  + L+AS + ++  +R         D    IF
Sbjct: 106 GEQSNWKQYFQVLPSKFDTLMFWSEEEFSQ-LQASAVVDKVGKR---------DAEEDIF 155

Query: 223 SK-------YPDLFP------------------EEVFNMETF--KWSFGILFSRLVRLPS 255
            K       +PDLFP                  E    M +    ++F I  +       
Sbjct: 156 EKVLPLVRAHPDLFPPIDGVMSYDDDTGAQALLELAHRMGSLIMAYAFDIEKAEEEESEG 215

Query: 256 MDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
            DG +          +VP AD+LN   +      + +    +V    +  + G+++F  Y
Sbjct: 216 EDGYLTDDEEQLPKGMVPLADLLNADADRNNARLFQEEG-ALVMRAIKPIKAGDEIFNDY 274

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           G+    +LL  YG+V  +     D VELPL+
Sbjct: 275 GELPRSDLLRRYGYVT-DNYAQYDVVELPLT 304


>gi|195353393|ref|XP_002043189.1| GM17489 [Drosophila sechellia]
 gi|194127287|gb|EDW49330.1| GM17489 [Drosophila sechellia]
          Length = 537

 Score = 68.9 bits (167), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 71/307 (23%), Positives = 125/307 (40%), Gaps = 31/307 (10%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           L        W  D G+  + + I      + GL + + + K E +L VP  L+ + +S  
Sbjct: 115 LAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRSTRPLAKDELVLSVPRKLIFSEESNS 174

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
            C   G++ +   +     LA  L+ E    + S W  YI  LP +  ++LY+T  +++ 
Sbjct: 175 DCRLFGKMTQATHLN----LAYDLVIEKIRGEFSEWRTYIDVLPAKYSTVLYFTTKQME- 229

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFS---------KYPD----LFPEEVFNMETF 239
            L  +     A+ +   +   Y  L     +          +P      F +     E +
Sbjct: 230 LLRGTAAASLALRQCRVIAKQYAFLYRYAHTMTEPSTGNRSHPGERGLFFTQHGLCYELY 289

Query: 240 KWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNH-SCEVETFLDYDKSSQGVVF 290
           +W+   + +R   +PS             AL+P+ DM NH   ++ +F  Y    + +  
Sbjct: 290 RWAVSTVMTRQNLVPSEKQESEDTPKLISALIPYWDMANHRQGKITSF--YAAVPRQLEC 347

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
           T       GEQ FI YG +SN +LL+  GFV  +  N  D V + + L  +D    ++  
Sbjct: 348 TAQEAVDAGEQFFIYYGDRSNTDLLVHNGFV--DDYNLKDYVNIRVGLSLTDALAAKRAS 405

Query: 351 ALRKYGL 357
            L K  +
Sbjct: 406 ILDKLNI 412


>gi|340519125|gb|EGR49364.1| predicted protein [Trichoderma reesei QM6a]
          Length = 963

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 109/264 (41%), Gaps = 32/264 (12%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE-----AGEVLKQCSVPDWPLLAT 154
           RG+VAL++I     L  VP S ++++++   K   PE     A EV  +     W  L  
Sbjct: 535 RGIVALQDIPAEAVLFTVPRSGILSSETSELKGKLPEIFQETAMEVDDKPQQDPWSTLII 594

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------------- 201
            ++ E      S+W  YI  LP    + ++W+ AELD  L+AS  R              
Sbjct: 595 VMMYEYFKGSESKWKPYIDVLPSSFETPMFWSDAELDE-LQASATRSKVGKASAEEMFQD 653

Query: 202 ------RAIERITNVIGTYNDLRL-RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
                 RA + +     TY+D  L ++  +         F+ +             V   
Sbjct: 654 KVLPVIRANQHLFPTSQTYSDDDLIQLAHRMGSTIMSYSFDFQNEDEEDEDETEEWVEER 713

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
                + +VP AD+LN   E    ++Y      +  T  R  + GE++F  YG   N EL
Sbjct: 714 EAKSTMGMVPMADILNADAEYNAHVNY--GDDALTVTALRTIKAGEEIFNYYGPHPNSEL 771

Query: 315 LLSYGFVPREGTNPSDSVELPLSL 338
           L  YG+V  + +   D VELP +L
Sbjct: 772 LRRYGYVTPKHSR-YDVVELPWTL 794


>gi|325186532|emb|CCA21071.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 441

 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 64/285 (22%), Positives = 122/285 (42%), Gaps = 19/285 (6%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
           ++L +WL    +    +  QK D  E  G+ A K+++KGE  + +P  L I+  +     
Sbjct: 10  ASLLQWLRSKSVTTDSLHFQKSDGHEGVGVYAAKSLQKGEITMEIPFHLTISKVTAMQSD 69

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
               +  +  +    ++A +L+ E      S +  +I +LP Q    ++W  ++    LE
Sbjct: 70  LRQILQDKNELDQDEIVALFLMIERFKSSDSFFEPFIQSLPSQFDLPIFWNDSDFAE-LE 128

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET-------FKWSFGILFS 248
            + +   A      +   +  + + +   Y     EE  N+ T       ++W+  I+++
Sbjct: 129 GTNVALLAKIMRKQIEADFQAIHIPLLRAY-----EERLNLRTSEISISDYEWALSIIWT 183

Query: 249 RLVRLPSMDGRV-ALVPWADMLNHSCEVET----FLDYDKSSQGVVFTTDRQYQPGEQVF 303
           R   +      +  L P  DM NHS  V+     F+ YD     +      +    +  +
Sbjct: 184 RAFGITRYGEYLRVLCPALDMFNHSVLVQEPLDEFIKYDHMKDVLAHCVVMETSANDPFY 243

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
           ISYG  S+ +LL SYGFV     N  + ++L + +  +D  +K K
Sbjct: 244 ISYGSYSDAKLLYSYGFVSLNEKNRFNGIDLWMRVPVTDPNFKLK 288


>gi|323449371|gb|EGB05259.1| hypothetical protein AURANDRAFT_66448 [Aureococcus anophagefferens]
          Length = 762

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 67/252 (26%), Positives = 102/252 (40%), Gaps = 27/252 (10%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           S    WL   G    K+AI+   +G RG+VA      GE LL VP +L++T D      +
Sbjct: 18  SEFVAWLRAGGASFDKLAIKHTALG-RGVVATAAYEPGETLLSVPEALLLTVDKASRRAD 76

Query: 137 AGEVLKQCSVPDWPL------LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAE 189
               L                LA +L  +    +S  W  Y + + R    L  +W  A+
Sbjct: 77  VAASLGAARARGVDANGGNLALALFLAGD----RSEAWRPYRNVISRSVSHLPCFWPTAD 132

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
            +  L  S + E  + R   +      L L             V + + F ++   + SR
Sbjct: 133 -EALLAGSPLGEDVVRRRDEIRRDCRSLGL-----------TAVEDRQAFAFAEAQVLSR 180

Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
                + +G  A+VP+AD++N +   E  +D+       V    R+   GE V  SYG K
Sbjct: 181 AF---AFNGTRAMVPFADLMNTARHHERHVDFAFERGAFVMRAVRRGAAGEPVTDSYGPK 237

Query: 310 SNGELLLSYGFV 321
           SN   LL+YGF 
Sbjct: 238 SNARYLLNYGFA 249


>gi|156064409|ref|XP_001598126.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980]
 gi|154691074|gb|EDN90812.1| hypothetical protein SS1G_00212 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 470

 Score = 68.6 bits (166), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 79/320 (24%), Positives = 149/320 (46%), Gaps = 46/320 (14%)

Query: 69  EIDSLE-NASTLQKWLSDSGLPPQ-KMAIQKVDVGE----RGLVALKNIRKGEKLLFVPP 122
           E+D  E   +T   WL + G+    KMA+  VD+ +    RG+VA  +I   E +  +P 
Sbjct: 2   EVDDFEARTATFSSWLKEMGVRTNPKMAL--VDLRQEGRGRGVVATGDIDDDEIIFSIPR 59

Query: 123 SLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
           + V+ A +    P +  + ++  +P W +L + L++EA  E +S+W+ Y++ LP +  SL
Sbjct: 60  NAVLNAQNVAPLPVSRRLFEK--MPSWLVLTSILMTEAQME-NSKWAPYLAVLPERLDSL 116

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTY------NDLRLRIFSKYPDLFPEEVFNM 236
           ++W+ +EL     ++ +++   +   ++  +Y            +  K   +     F++
Sbjct: 117 VFWSDSELAELQASAVVKKIGKKDAEDMFKSYIAPQGLKHSSTEMCHKVASVIMAYAFDI 176

Query: 237 ------ETFKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGV 288
                  T     G     LV     D +  ++++P ADMLN         D D+++  +
Sbjct: 177 PDPSDAPTSGGKGGEAGDDLVSDDGEDEKTILSMIPLADMLN--------ADADRNNARL 228

Query: 289 VFTTD----RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSD----SVELPLS 337
           +   +    R  +P   GE++F  YG+    +LL  YG+V  +G +  D    S EL +S
Sbjct: 229 ICDNEELEMRAIKPISKGEEIFNDYGQLPRSDLLRRYGYVT-DGYSAYDVAEISAELIVS 287

Query: 338 LKKSDKCYKEKLEALRKYGL 357
           L ++ K +   L  L + GL
Sbjct: 288 LFRNGKVHP-SLHKLTQDGL 306


>gi|332020870|gb|EGI61268.1| SET domain-containing protein 3 [Acromyrmex echinatior]
          Length = 232

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 90/186 (48%), Gaps = 12/186 (6%)

Query: 253 LPSMDGRV---ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           +PS DG     AL+P  DM NH     T  D++ +S        R ++ GEQVFISYG +
Sbjct: 7   VPSPDGSRMIHALIPMWDMCNHENGRIT-TDFNATSDRCECYALRDFKKGEQVFISYGPR 65

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           +N +  +  GFV  +  N  D  +L L + K+D   KE++E L K  L +   F ++   
Sbjct: 66  TNSDFFVHSGFVCMD--NEQDGFKLRLGISKADSLQKERIELLSKLDLPSVGEFLLKPGT 123

Query: 370 WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP---EIDEQALQFILDSCE 426
            P+     A+L V   SM+ K E      S+K+   K + C     ++E   +F+L   +
Sbjct: 124 EPISDTLLAFLRVF--SMR-KAELTHWLRSDKVFDLKHVDCALETVVEENVRKFLLTRLQ 180

Query: 427 SSISKY 432
             I+ Y
Sbjct: 181 LLIANY 186


>gi|452825745|gb|EME32740.1| ribulose-1,5 bisphosphate carboxylase oxygenase large subunit
           N-methyltransferase, putative isoform 2 [Galdieria
           sulphuraria]
          Length = 331

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 74/295 (25%), Positives = 124/295 (42%), Gaps = 25/295 (8%)

Query: 34  KRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKM 93
           +RC H+ ++  +V T   +SR        ++   C    L+       WL+   +   K+
Sbjct: 40  RRCFHKPIL--NVPTLRPSSRLCKPSRLFLV---CSSGRLD---LFYHWLTRENVYMPKI 91

Query: 94  AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPLL 152
            + +   G RG+VA++ I   E  L VP  L +       C  +  V  +  S  +W + 
Sbjct: 92  KLDQNKDGLRGVVAVEGIECDESFLKVPRDLSLQVTEHEECTMSEFVDPELWSQENWYVK 151

Query: 153 ATYLISEASFEKS-SRWSNYISALPRQPYS-LLYWTRAELDRYLEASQIRERAIERITNV 210
            +  + +  +    S W  YI  LP    + L+YW+ +EL      +Q++ R +     +
Sbjct: 152 LSLKLLKEKYLGKLSLWKPYIDILPHALNTGLVYWSSSEL------AQLQYRPLIEEVKI 205

Query: 211 IGTYND-LRLRIFSKYPDLFPEEVF----NMETFKWSFGILFSRLVRLPSMDGRV-ALVP 264
              Y + L  R+F       P  V+        F W+  ++ SR   +P +  +  AL+P
Sbjct: 206 NQYYREALYTRVFESLSS--PVRVWLQNEKENVFFWALDMVQSRAFGIPDVGNKTYALLP 263

Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
             DMLNH    +T   YD  +      T  +  PG  ++ISYG   N  LL  YG
Sbjct: 264 MMDMLNHRVNSQTHFLYDSIANQYEMKTYSKLSPGTDIYISYGPLDNDHLLHFYG 318


>gi|194896580|ref|XP_001978500.1| GG17647 [Drosophila erecta]
 gi|190650149|gb|EDV47427.1| GG17647 [Drosophila erecta]
          Length = 544

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 134/318 (42%), Gaps = 45/318 (14%)

Query: 71  DSLENASTLQK------WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
           DS ++ S L K      W  D G+  + + I      + GL A + + K E +L VP  L
Sbjct: 109 DSPDDQSRLAKVEAFSAWAKDGGVHSEGLEIAIFPGYQLGLRATRPLAKEELVLTVPRKL 168

Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS----SRWSNYISALPRQPY 180
           + + ++   C   G++ +          AT+ + +   EK     S W  YI  LP +  
Sbjct: 169 IFSEENNSDCRLFGKMPQ----------ATHWVYDLVIEKIRGEFSEWRPYIDILPAKYS 218

Query: 181 SLLYWTRAELDRY-------LEASQIRERA-----IERITNVIGTYNDLRLRIFSKYPDL 228
           ++LY+T  +++R        L   Q R  A     + R  + +   +D       +    
Sbjct: 219 TVLYFTIKQMERLRGTAACSLALRQCRVIAKQYAFLYRYAHTLTEPSDGNRSHPGERGLF 278

Query: 229 FPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHS-CEVETFL 279
           F +     + ++W+   + +R   +PS             AL+P+ DM NH   ++ +F 
Sbjct: 279 FTQHGLCYKLYRWAVSTVMTRQNLVPSEKQESQDSPKFISALIPYWDMANHKPGKITSF- 337

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
            Y   S+ +  T     + GEQ FI YG +SN +LL+  GFV  +  N  D V + + L 
Sbjct: 338 -YAAVSRQLECTAQEAVEAGEQFFIYYGDRSNTDLLVHNGFV--DVNNLKDYVNIRVGLS 394

Query: 340 KSDKCYKEKLEALRKYGL 357
            +D    ++   L K  +
Sbjct: 395 PTDALAAKRASILDKLNI 412


>gi|343470335|emb|CCD16940.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 593

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 10/152 (6%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--------FNMETF 239
           A L ++L   + RE+ +E    VI  +      I + Y  +  E          F +E F
Sbjct: 206 AYLQQFLCFRRHREKVLEEQDCVIEEFRTFLSLISTYYSHVCCEASKTRLETFSFTLEQF 265

Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
            W++  L SR     SM    A++PW D  NHS      + +DK     VF T      G
Sbjct: 266 TWAYNTLMSRAFAYDSM--VWAVMPWVDYFNHSTLNNATMRFDKRLNCYVFVTVVPIAKG 323

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
           EQ+F+ YG  ++ ELLL YGF       PS S
Sbjct: 324 EQIFLQYGSYTDAELLLWYGFTVTPSLFPSYS 355


>gi|260807503|ref|XP_002598548.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
 gi|229283821|gb|EEN54560.1| hypothetical protein BRAFLDRAFT_118329 [Branchiostoma floridae]
          Length = 448

 Score = 68.2 bits (165), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 108/244 (44%), Gaps = 27/244 (11%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC---PEAGEVLKQCSVPDWPLLATY 155
           D G RGL+  + I++G+ ++ +P  ++++  +       P       Q +      + T+
Sbjct: 54  DTG-RGLMVPRKIKRGQTMIKMPQHMILSTKTVLDSVLGPYIESAEPQLTTIQ--AITTF 110

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           LI +    ++S W  Y+  LP +    +Y+   E D       +R     +    I +Y 
Sbjct: 111 LIYQKHIGETSFWKPYLDILPNEYTHPVYF--GEEDFLYLPHSLRANIKAKKQECIKSYE 168

Query: 216 DLRLRIFSKYPDLFP--EEVFNMETFKWSFGILFSRLVR--------LPSMD----GRVA 261
           +L+   F     L P  E +F  + ++W++  + +R +         L ++D    G  +
Sbjct: 169 ELK-PFFPSLEPLLPNWEGIFTFDAYRWAWSTVKTRSLYVDDKGSTVLRNLDKSGLGVTS 227

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
           LVP  D+LNHS    T L   KS +        T +  Y+ G+QV   Y +  N  LLL+
Sbjct: 228 LVPMVDLLNHSHSARTGLLIKKSCKNGDYFYTVTAEDDYKRGDQVLFCYRRADNQTLLLN 287

Query: 318 YGFV 321
           YGFV
Sbjct: 288 YGFV 291


>gi|189190580|ref|XP_001931629.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973235|gb|EDU40734.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 372

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 121/265 (45%), Gaps = 35/265 (13%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           ++    L  W ++ G+    +  Q +     G++A ++I+ GE +LFVP  L  T     
Sbjct: 1   MDTYEELLSWATERGIKLSGIKPQNILSRGTGIIATRDIQAGETILFVPFKLFRTLKH-- 58

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP---YSLLYWTRAE 189
             P+A       ++    LLATYL    S +K+  ++     LP        + +   AE
Sbjct: 59  -VPKAISRRLPRNMSLHALLATYL----SLDKTDTFAIPNKTLPDLSSFEAGMPFLWPAE 113

Query: 190 LDRYLEASQI-----RERAIERITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           L  +L    +     ++R+ +R  +++   Y+++     S+   L    + N  +F  + 
Sbjct: 114 LHPFLPKPALDLLMKQQRSFKRDWDIVSKAYSNI-----SQDQYLHAWLLVNTRSFYCTT 168

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGE 300
            I+     RLP  D R+A++P AD+ NH+   CE         +S+   F  DR Y+ GE
Sbjct: 169 PIM----ERLPH-DDRLAILPVADLFNHADVGCEARF------ASENYSFIADRDYRTGE 217

Query: 301 QVFISYGKKSNGELLLSYGFVPREG 325
           ++ ISYG  S   LL  YGFVP E 
Sbjct: 218 ELHISYGSHSTDFLLTEYGFVPTEN 242


>gi|452000836|gb|EMD93296.1| hypothetical protein COCHEDRAFT_1170833 [Cochliobolus
           heterostrophus C5]
          Length = 643

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 105/237 (44%), Gaps = 20/237 (8%)

Query: 141 LKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
           L+QC   +PD  L    L+ +      S WS Y++ LP  R   + L++   +   +L  
Sbjct: 88  LQQCRDKIPDHILAYLLLLEQRDKGNDSPWSAYLACLPGPRDMTTPLWFDDVDF-AFLAG 146

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR---- 252
           + +   A ER   +   +      I  K+ DL   +V ++E+ +W+  I  SR       
Sbjct: 147 TSLAPAAKERKAELRQQWEHALQVI--KHLDLHLADVISLESLQWAATIFTSRAFISTHI 204

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
           LP  +    L P  D+LNHS   +   D++   S  +        +PGE++F +Y  K N
Sbjct: 205 LPGRETIPMLFPVIDILNHSVTAKVEWDFEPHRSFALKCLQADSVKPGEELFNNYAPKQN 264

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQIT 368
            ELLL YGF   +  NP +   L L+ +   + Y  +L      GL   +  P ++T
Sbjct: 265 DELLLGYGFCLED--NPIEQFALKLAFQPQLQQYAHQL------GLLDGKNVPFEMT 313


>gi|440302460|gb|ELP94773.1| hypothetical protein EIN_341910 [Entamoeba invadens IP1]
          Length = 823

 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 69/273 (25%), Positives = 123/273 (45%), Gaps = 20/273 (7%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS----------K 131
           W+ + G     + ++   V   GL + K   +G+ LL +P  L +               
Sbjct: 7   WVKEHGGHIDGVYVKNFPVYGNGLCSSKEFHEGDTLLSIPYHLQLNTIELHNVFESMVPG 66

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
           +  P  GE  K     +  ++  YL    + EK   +  YI+ LP      L ++  EL 
Sbjct: 67  FEVPRLGEGAKNRD-DENSVVYLYLAMNKTNEKCFHFP-YINTLPTTFSCPLSYSENEL- 123

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR-- 249
           + L+ +++    +E+    +   +D    +  +YP  F +     +   W+  + +SR  
Sbjct: 124 KMLKGTKLL-VTVEKTKTFLKKLSDYYETLTHQYPTRFQQFDDFYQRLVWAHQVFWSRAF 182

Query: 250 LVRLPSMDGRVA-LVPWADMLNHSCEVE-TFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
           LV  P   G VA L+P+AD  NH+ E + T++   ++    + T ++    GEQ+F +Y 
Sbjct: 183 LVIYPDPIGDVASLIPFADFSNHNTETKVTYVSNRQTQTFSLQTNEKVLHCGEQIFNNYR 242

Query: 308 KKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
            + N ++LL YGFV  E  NP D V L ++ K+
Sbjct: 243 IRPNEKMLLGYGFVISE--NPYDEVLLRINFKE 273


>gi|195040205|ref|XP_001991024.1| GH12451 [Drosophila grimshawi]
 gi|193900782|gb|EDV99648.1| GH12451 [Drosophila grimshawi]
          Length = 573

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 74/320 (23%), Positives = 132/320 (41%), Gaps = 49/320 (15%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITA 128
           E   L   +   +W    G+    + I      + GL A ++I   E +L VP  L+   
Sbjct: 120 EKTRLAKVAAFNEWARAGGVQSDCVEITTFPGYQLGLRAKRDIAAEELVLSVPRKLIF-- 177

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
            S+   PE    L + + P    +   LI E     +S W  +I  LP +  ++LY+T  
Sbjct: 178 -SEELLPEWKRELFR-NFPTHLNVTYTLIIEKVRGAASAWQPFIDTLPTRYSTVLYFTVD 235

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRL--------RIFSKYPDLFPEEVFNMETFK 240
           ++ R L  +     A+     +   Y  +           + +   +LF E     E ++
Sbjct: 236 QMQR-LRGTSACSAAMRHCLVIARLYASMYKCAYIQPGDNVMAAKANLFTEYGLCYELYR 294

Query: 241 WSFGILFSRLVRLP---SMDGRV----------------------------ALVPWADML 269
           W+   + +R   +P   S  G V                            AL+P+ DM 
Sbjct: 295 WAVSTVTTRQNLVPRELSTVGEVDQVCQLGGFEGTEIKRDAETGARNAPISALIPYWDMT 354

Query: 270 NHSC-EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
           NH C ++ ++  YD+++Q +  T    ++ GEQ FI YG +SN + L+ +GF+  +  N 
Sbjct: 355 NHRCGKITSY--YDRAAQQMECTAQEAFKAGEQFFIYYGDRSNADRLVHHGFL--DMHNL 410

Query: 329 SDSVELPLSLKKSDKCYKEK 348
            D V++ L L  +D   +++
Sbjct: 411 KDYVQIRLGLSPTDPLVEQR 430


>gi|357504157|ref|XP_003622367.1| SET domain-containing protein [Medicago truncatula]
 gi|355497382|gb|AES78585.1| SET domain-containing protein [Medicago truncatula]
          Length = 497

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 70/310 (22%), Positives = 123/310 (39%), Gaps = 57/310 (18%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           L++W+  +G      A+Q VD  E G+   AL  I  G+ +  +P    +T  +  +C  
Sbjct: 9   LKRWMKSNGFEWSS-ALQFVDTPEEGISVKALCEINAGDVVAKMPKKACLTIKTSGAC-- 65

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             E+++   +  +  LA  ++ E S  + S W  Y+  LP+Q    L W+  E+D+ L  
Sbjct: 66  --EIIENACLGGYLGLAVAIMYERSLAEESPWEGYLQLLPQQECLPLVWSVEEVDQLLCG 123

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           +++ +   E    V   + +  L +    P       F +E +  +  ++ SR   +   
Sbjct: 124 TELHQTVQEDKALVYEDWRENILPLLDSEPSKLNPAFFGVEQYFAAKSLISSRSFEIDDY 183

Query: 257 DGRVALVPWADMLNHSCEVET-------------------FLD---------YDKSSQGV 288
            G   +VP AD+ NH    E                     +D          DK+ +GV
Sbjct: 184 HG-FGMVPLADLFNHKTGAEDVHFTALSSNNESEDDTDDEIVDEEALAQNSSMDKTEKGV 242

Query: 289 VFTTDRQY-------------------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
              +D +Y                     G +VF +YG   N  LL  YGF  ++ T   
Sbjct: 243 --DSDMEYSSITEDDTSMLEMVMIKDVSSGAEVFNTYGILGNAALLHRYGFTEQDNTYDI 300

Query: 330 DSVELPLSLK 339
            +++L L L+
Sbjct: 301 VNIDLELVLQ 310


>gi|343475275|emb|CCD13287.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 593

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 67/152 (44%), Gaps = 10/152 (6%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV--------FNMETF 239
           A L ++L   + RE+ +E    VI  +      I + Y  +  E          F +E F
Sbjct: 206 AYLQQFLCFRRHREKVLEEQDCVIEEFRTFLSLISTYYSHVSCEASKTRLETFSFTLEQF 265

Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
            W++  L SR     SM    A++PW D  NHS      + +DK     VF T      G
Sbjct: 266 TWAYNTLMSRAFAYDSM--VWAVMPWVDYFNHSTLNNATMRFDKRLNCYVFVTVVPIAKG 323

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
           EQ+F+ YG  ++ ELLL YGF       PS S
Sbjct: 324 EQIFLQYGSYTDAELLLWYGFTVTPSLFPSYS 355


>gi|241955755|ref|XP_002420598.1| SET domain-containing protein, putative; lysine methyltransferase,
           putative [Candida dubliniensis CD36]
 gi|223643940|emb|CAX41679.1| SET domain-containing protein, putative [Candida dubliniensis CD36]
          Length = 542

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 133/316 (42%), Gaps = 68/316 (21%)

Query: 79  LQKWLSDSGLPPQ-KMAIQKV-DVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
            Q WL  + +    K+AI    D  + RG++AL+NI   E +  +P S+V+  D+     
Sbjct: 11  FQDWLIKNNVEISPKIAIHDYRDTNQGRGIIALQNINPDEMIFKLPRSIVLNIDNNSLIK 70

Query: 136 EAGEVLKQCSVPD-WPLLATYLISEASFE---------KSSRWSNYISALPRQPYSLLYW 185
           +    LK+  + D W  L   L  E  F+          +S W  Y++ LP     L+YW
Sbjct: 71  QYPSALKKLRLLDQWIGLIIVLSFEMKFKFNPSDNDDNNNSFWYEYLNILPNDFNQLIYW 130

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
              EL+ +L+ S I +R        IG  N+L +  +++   +  +++  +E FK S  +
Sbjct: 131 NDEELN-HLQPSCILDR--------IGKENNLNM--YNQIISIINQDLSIIEEFKSS-PL 178

Query: 246 LFSRLVRLP------SMDGRV---------------------------------ALVPWA 266
            F    ++       S D  V                                 ++VP+A
Sbjct: 179 TFEEYNKVATIIMSYSFDVEVPKSKPNKEMTENGTNEEDDEDDYEEEEDHEYYKSMVPFA 238

Query: 267 DMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
           D LN    +    L Y  S+  ++ T  +    GEQV+ +Y    N ELL  YG+V   G
Sbjct: 239 DTLNADTHLNNAILIY--STDQLIMTCIKVIAKGEQVYNTYSDHPNSELLRRYGYVELNG 296

Query: 326 TNPSDSVELPLSLKKS 341
           +   D  E+PLS+ K+
Sbjct: 297 S-KYDFGEIPLSIIKT 311


>gi|302754814|ref|XP_002960831.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
 gi|300171770|gb|EFJ38370.1| hypothetical protein SELMODRAFT_402223 [Selaginella moellendorffii]
          Length = 486

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 77/164 (46%), Gaps = 11/164 (6%)

Query: 173 SALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY----NDLRLRIFSKYPDL 228
           S LP Q  S   W   EL  YL AS +  +A ER+  +   +    ND    +  +  D+
Sbjct: 306 SDLPDQ-LSTFRWEDTELS-YLRASPLYGKARERLEMITTEFGQVQNDFCTCVLEQALDV 363

Query: 229 FPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ 286
           +P+     ++E  K  +  +FSR + +   D  + ++P  D  NH+      L ++    
Sbjct: 364 WPQLFGKVSLEDLKHVYATVFSRSLAIGE-DSTLVMIPMLDFFNHNATSFAKLSFNGLLN 422

Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
             V T DR Y   +Q++I+YG  SN EL L YGF   E  NP D
Sbjct: 423 YAVVTADRDYAENDQIWINYGDLSNAELALDYGFTVPE--NPYD 464


>gi|412992279|emb|CCO19992.1| predicted protein [Bathycoccus prasinos]
          Length = 640

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 86/325 (26%), Positives = 135/325 (41%), Gaps = 43/325 (13%)

Query: 68  CEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPS 123
           C+I++   A  L KW +  G       +    +      RGL A ++IR GE +L +P  
Sbjct: 195 CKIETKGYAVALTKWAASQGKDANVSKVAPCLLSSMNDARGLCATEDIRAGENILEIPRR 254

Query: 124 LVITADSKWSCPEAG------EVLKQCSVPDWPLLATYLISEASFEKSSR---WSNYISA 174
           +++ A +     E G       +L++C      ++  +++ E    K+ +   WS Y  +
Sbjct: 255 MLLDAGT-ICISEQGPFGDLLRILERCGAD--TIMTLWIMKERMKMKTKQETFWSLYFLS 311

Query: 175 LP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           LP   Q  + L W    +   L  + I E  +     V   Y+ L   + +  P+ F   
Sbjct: 312 LPDGSQKLTPLSWPEDIVRVGLGNTPIFETVMHERQKVRNGYDALLPSLLANCPESFEG- 370

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV------------ALVPWADMLNHSC--EVETF 278
             N E F WS+    S L    S    V             L P A   NH        +
Sbjct: 371 --NQEEF-WSYDQYISALELWMSYAMTVKPVHNSDSGTIDVLSPVAFFCNHGIYPHCVHY 427

Query: 279 LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL-- 336
                S + +VF   R  +  E++ +SYG KSNGELLL YGF   +  NP DS+++ L  
Sbjct: 428 SQLRLSDECLVFPAMRDIEKNEEIMLSYGAKSNGELLLFYGFCIDD--NPYDSIDITLDF 485

Query: 337 -SLKKSDK--CYKEKLEALRKYGLS 358
            SL   +K    K + E L K+ L+
Sbjct: 486 DSLNGVEKPEVRKRREELLVKHDLT 510


>gi|145353540|ref|XP_001421068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581304|gb|ABO99361.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 813

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 80/285 (28%), Positives = 131/285 (45%), Gaps = 20/285 (7%)

Query: 85  DSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC 144
           D+ L  +  A    D G RG  AL++  +GE LL +P     T  +         V   C
Sbjct: 5   DATLARRVEAHDFADTG-RGQRALRDCARGEVLLEIPLERGFTLAAALEDDAVKRVASCC 63

Query: 145 SVPDWPLLATYLISEA-SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
           +  D  ++A ++ +E    EK++R + +++ LPR   +  +W+  EL      + +RE  
Sbjct: 64  ARHD-DVVALHVCAERFRGEKATR-AAHVATLPRSFDTAFFWSEEELRELTGTTCLRETM 121

Query: 204 IERITNVIGTYNDLRLRIFS-KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-- 260
             R       Y  L  ++ +        E   + E + W+   L+SR   L   DG+   
Sbjct: 122 NLR-EETKNDYETLTKKMEAIGEGGWMREHEVDYERYAWARSNLWSRQCDLLMPDGKRTR 180

Query: 261 ALVPWADMLNHSCEV----ETFLDYDKSSQGVVFTTDRQYQPGEQVFISY--GKKSNGEL 314
           A+VP  D+ NHS +        L+ +K+   V    D  Y+ GEQ FISY  G+ +N +L
Sbjct: 181 AMVPTFDIFNHSAKAPLGKTHKLNAEKNCVTVYAADD--YKAGEQAFISYGSGEAANSKL 238

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE-ALRKYGLS 358
           L  YGF   +  NP + +++ L++   DK  K  LE ALR   ++
Sbjct: 239 LTWYGFCIDD--NPYEELDVTLTI-TVDKLRKTVLETALRASAVA 280


>gi|148237199|ref|NP_001085404.1| N-lysine methyltransferase setd6 [Xenopus laevis]
 gi|82184826|sp|Q6INM2.1|SETD6_XENLA RecName: Full=N-lysine methyltransferase setd6; AltName: Full=SET
           domain-containing protein 6
 gi|48734800|gb|AAH72257.1| MGC82362 protein [Xenopus laevis]
          Length = 455

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/295 (22%), Positives = 128/295 (43%), Gaps = 40/295 (13%)

Query: 70  IDSLENASTLQK---WLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
           +D L+N   + +   W    GL   P+     +  V + G++A ++I  GE L  VP S 
Sbjct: 13  VDHLQNGFPVTRFLAWCEKVGLELNPKVYISTEGTVSQYGMLAREDIADGELLFTVPRSA 72

Query: 125 VITADSKWSCPEAGEVLKQ-----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR-- 177
           +++ ++        E+L++      S   W  L   L+ EA+ + SS W+ Y    P   
Sbjct: 73  ILSQNTT----RIQELLEKEQESLQSTSGWVPLLISLLYEAT-DSSSLWAPYFGLWPELD 127

Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
            P   ++W+  E  + L+ + + E     + N+   YN + L   ++ P+ F      ++
Sbjct: 128 PPDMPMFWSEEEQTKLLQGTGVLEAIRNDLKNIEEEYNSIVLPFITRNPEKFCPMKHTLD 187

Query: 238 TFKWSFGILFSRLVR----------------LPSMDGRVALVPWADMLNHSCEVETFLDY 281
            +K     + +   +                LP M     +VP AD+LNH       L++
Sbjct: 188 LYKRLVAFVMAYSFQEPLEENDEEDEDEKDILPPM-----MVPVADLLNHVAHHNAHLEF 242

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
             + + +   T +    G+++F +YG+ +N +LL  YGF      N +++ ++ +
Sbjct: 243 --TPECLRMVTTKSVHAGQELFNTYGEMANWQLLHMYGFAEPHPQNSNETADIQM 295


>gi|255584095|ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 510

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 77/288 (26%), Positives = 121/288 (42%), Gaps = 59/288 (20%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL----KQCSVPDWPLLAT 154
           D G RGL A ++++KGE +L VP S ++T D   S  + G +L       ++     L  
Sbjct: 51  DAGGRGLGAARDLKKGELVLRVPKSALLTKD---SFLKDGLLLSAINNHSALSPTQTLTV 107

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
            L+ E S  +SS W  Y+  LPR  Y +L  T +E ++  +A Q+ + AI      I   
Sbjct: 108 CLLYEMSKGQSSFWYPYLMHLPRS-YEILA-TFSEFEK--QALQV-DDAIWTAEKAISKA 162

Query: 215 NDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH-- 271
              R   +S   +L    +   +  + W+   + SR + +P  D    L P  D  N+  
Sbjct: 163 ELDRKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIP-WDEAGCLCPVGDFFNYAA 221

Query: 272 -----------------SC----------------------EVETFLD--YDKSSQGVVF 290
                            SC                      ++++  D  +D+      F
Sbjct: 222 PGEESSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSLTDGGFDEDKAAYCF 281

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
              + Y+ G QV +SYG  +N ELL  YGF+  E  NP+D V +PL L
Sbjct: 282 YARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNE--NPNDKVFIPLEL 327


>gi|428182191|gb|EKX51052.1| hypothetical protein GUITHDRAFT_134587 [Guillardia theta CCMP2712]
          Length = 365

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 116/271 (42%), Gaps = 32/271 (11%)

Query: 86  SGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC- 144
           +G+   K+ +++ + G RG+ A K I +GE ++ VPPSL+ + ++      AG  LK   
Sbjct: 2   AGVKFPKLEVRR-EGGVRGMYATKKIDRGEVIVSVPPSLLFSYET------AGGALKDVW 54

Query: 145 -SVPDWPLLATYLISEASFEKS--SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQI 199
               D   L    +    F     SRW  ++  +P   +    + W+  +L+   E  + 
Sbjct: 55  KRTKDMQELDRLTLLLLYFSSKVRSRWDFFLCGIPGMNELGPAVLWSPKKLNETCEREEY 114

Query: 200 RERAIERITNVIGTYNDL-RLRIF---SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
                  + N    Y  L R  +     K+P +F ++      + W+   + SR+  +  
Sbjct: 115 SS-LCSFVENRRSMYKRLWRTEVAPLPRKFPHIFSQQDTGYSNYLWAIAAVLSRMWLMRR 173

Query: 256 MD-------------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
            +              +  + P A++LNH           +    +   +D  Y+PGEQV
Sbjct: 174 FEEPEFYPNGTWIGPAKWVMAPVAELLNHKPRAGHIRWGSQRRPHLEVVSDVSYRPGEQV 233

Query: 303 FISYGKKSNGELLLSYGF-VPREGTNPSDSV 332
           F+SYG K N ELLL YGF +P   T  +D +
Sbjct: 234 FVSYGNKCNLELLLEYGFEIPGNPTKCADEI 264


>gi|303284022|ref|XP_003061302.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226457653|gb|EEH54952.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 536

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 124/335 (37%), Gaps = 55/335 (16%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW--PL-----LA 153
           G RG+VA   I  G  LL VP +L+++  +  +       L +    D   PL     LA
Sbjct: 50  GWRGVVATAPIAAGATLLRVPTALLMSGRTAAADDVLARALSEHHERDGEPPLTPTDRLA 109

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
            +L+ E S    S W  Y+  LPR       WT AE +    A Q+   A++        
Sbjct: 110 VHLLRELSRGAESFWHLYLRQLPRSYALTCGWTAAERN----ALQL-PHAVDAADRSAAA 164

Query: 214 YNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
             D   R       +  P    +   + W+   + SR V +P  D   AL P  D+ N+ 
Sbjct: 165 CRDAWARATPVMEKIGLPATYRSFGAWAWAAATISSRTVFVP-FDAAGALCPVGDLFNYV 223

Query: 273 CE--------VETFLD----------------------------YDKSSQGVVFTTDRQY 296
                     V T L+                            + ++S   VF   R Y
Sbjct: 224 PPTPPHVPKVVGTPLEGPSDERDDEEDDENDSYFLRRGVGGDGAWHEASDAWVFVARRDY 283

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           + GE++ + YG+ +N  LL  YGF    G N  D   L +  K  D  +   +  +  Y 
Sbjct: 284 RKGEEISLCYGQHTNLGLLTHYGFTMSHGENAHDEAPLVVDAKTLDVDFPAAVGDVGSYR 343

Query: 357 LSASEC-FPIQITGWP----LELMAYAYLVVSPPS 386
            +   C FP     W     L L A+A LV  PP 
Sbjct: 344 NAHEICVFPRGGVSWSTLERLRLAAHATLVGKPPG 378


>gi|322707769|gb|EFY99347.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 467

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 125/291 (42%), Gaps = 54/291 (18%)

Query: 74  ENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLV---ALKNIRKGEKLLFVPPSLVIT- 127
           +  STL +W S  G  L P     Q    G    V   A   ++  E ++ +P SL ++ 
Sbjct: 5   DTISTLVEWASSHGATLHPSIEVYQDPQTGLSFRVKPSAKSPVQPYEPIVQLPTSLTLSY 64

Query: 128 ----ADSKWSCPEA--GEVLKQCS---VPDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
               ++   + P+A  GE L + +   V    L+  YL  + SF     W  YI ALP Q
Sbjct: 65  FNAVSEQGTASPDAFRGEFLARAAPHVVGRLFLIKEYLKRDKSF-----WWPYIRALP-Q 118

Query: 179 P-------YSLL-YWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKY 225
           P       ++L  +W   E +  LE + + E  I++I N +    DL+     LR+    
Sbjct: 119 PGQGNKSQWALAPFWDDDEAE-LLEGTNV-EVGIDKIRNDV--RRDLQEAQELLRLHGDA 174

Query: 226 PDLFPEEVFNMETFKWSFGILFSRLVR------------LP---SMDGRVALVPWADMLN 270
              F  +    E ++W++ I  SR  R            LP   +MD    L+P  D+ N
Sbjct: 175 DGAF-GKALTTELYQWAYCIFSSRSFRPSLVLSDEQRRSLPRGVTMDDFSVLLPLFDIGN 233

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           H    E   D D   Q       R + PG+QVF +Y  K+N ELLL YGF+
Sbjct: 234 HDMTTEIRWDLDDDRQTCELRVGRTHMPGQQVFNNYSMKTNAELLLGYGFM 284


>gi|145344075|ref|XP_001416564.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576790|gb|ABO94857.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 398

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 110/247 (44%), Gaps = 24/247 (9%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           L+   L         S ++ Y   +P +  +  +W+  +  + L  S   E  I     +
Sbjct: 45  LIVCLLYERYELGDRSAFAEYFRTMPGEFDTPTHWS-DDTAKELRGSDTYEVDIVDEYKL 103

Query: 211 IGT-YNDLRLRIFSKYPDLF-PEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADM 268
           + T +N LR+R+F  Y D+F  +   ++   +W++ +  +R  R+   DG +ALVP  DM
Sbjct: 104 LNTVWNALRVRVFDVYTDVFVGKAARSLYALRWAWTVAHARATRVSGKDG-LALVPVIDM 162

Query: 269 LNHS-----------CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
           +               + E F  YD  +  VV    R Y PGE++   +G  +NGE +  
Sbjct: 163 IRECGKDVDADKTDIVDDEGFAVYDPHADEVVVYAKRDYAPGEELCERFGGWNNGESVQH 222

Query: 318 YGFVPREGTNPS-DSVELPLSLKKSDKCYKEKLEALRKYGLSA--SECFPIQITGWPLEL 374
            G++P   TN + + V + L+ +K     K   E +RK G       C P   +   L++
Sbjct: 223 LGYLPDVHTNSTRNCVLMVLTPEK-----KRNEEKVRKAGFDVPWRVCVPSAASESSLDM 277

Query: 375 M-AYAYL 380
           + AYA L
Sbjct: 278 LSAYAEL 284


>gi|367042232|ref|XP_003651496.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
 gi|346998758|gb|AEO65160.1| hypothetical protein THITE_2111880 [Thielavia terrestris NRRL 8126]
          Length = 377

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/252 (26%), Positives = 116/252 (46%), Gaps = 13/252 (5%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           +E    L +W  D G+    +A +++     G+VA K ++  E+LL VP S + + ++  
Sbjct: 1   MEAYDELLRWAQDRGVEIHGIAPREIPGKGVGMVATKPLKANERLLHVPTSALRSLET-- 58

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELD 191
             P+  + L   +     L A   + + + +K + W+  + +      +L L W+   L 
Sbjct: 59  IRPKVKKALPADTRVHALLAADLALDKPTTKKYAPWNAIVPSAADLATALPLAWSSPVLH 118

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN--METFKWSFGILFSR 249
            YL       RA+ R       +      + + +P L P+   +  + T   +F    +R
Sbjct: 119 NYLPPPA---RALLRAQQ--AKFARDWAAVSAAFPALAPDAFRHAWLLTNTRTFYHETAR 173

Query: 250 LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
             RLP  D R+ L P AD+ NH+ +    + +  +S  +  T DR Y  GE+V I YG+ 
Sbjct: 174 TARLPH-DDRMVLQPVADLFNHAADGGCEVAFTPASFAI--TADRAYAEGEEVLICYGRH 230

Query: 310 SNGELLLSYGFV 321
           SN  LL+ YGFV
Sbjct: 231 SNDFLLVEYGFV 242


>gi|66819805|ref|XP_643561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
 gi|60471605|gb|EAL69561.1| hypothetical protein DDB_G0275621 [Dictyostelium discoideum AX4]
          Length = 526

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/225 (24%), Positives = 106/225 (47%), Gaps = 35/225 (15%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGER---------------------------GLVAL 108
           A  + KWL D+G+      I+ V +G+                            G+++L
Sbjct: 13  AEIINKWLIDNGVRINNKLIKIVYLGKENNFEQTENTTATTTTSERINDSIVSGLGVISL 72

Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRW 168
           K ++  + +  +P S++++  +         +L++  + +    +  LI EAS  + S+W
Sbjct: 73  KELKVDDIVAKIPKSIILSIHT----SSISNILEKYKIENNIGTSIALIHEASLGEKSKW 128

Query: 169 SNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK-YPD 227
             YIS+LPR+    + W  +E  + L+ + I +   +    +   Y D+   I SK +P+
Sbjct: 129 YGYISSLPRKVDVPILWD-SESRKLLKGTAIEDVLNDDDILINQVYADVIESILSKNHPE 187

Query: 228 LFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
           +F + E++++E FK +  I+ SR   + S  G  +LVP AD+ NH
Sbjct: 188 IFGDKELYSIENFKIANSIISSRAFCVDSYHGD-SLVPLADIFNH 231


>gi|348519120|ref|XP_003447079.1| PREDICTED: N-lysine methyltransferase setd6-like [Oreochromis
           niloticus]
          Length = 460

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/304 (25%), Positives = 133/304 (43%), Gaps = 29/304 (9%)

Query: 52  ASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLP-PQKMAIQKVD-VGERGLVALK 109
           A+R K    ++    G E+  L+   +  +W    GL    K+ + K   V E G++A  
Sbjct: 2   AARAKRAKVED----GSELSPLQ---SFLQWCDGVGLELSSKVCVSKEGIVAEYGMLAKD 54

Query: 110 NIRKGEKLLFVPPS-LVITADSKWSCPEAGEVLKQCSVPDW-PLLATYLISEASFEKSSR 167
           +I +GE L  +P S L+    +K S     E     S   W PLL   L    S +  S 
Sbjct: 55  DIEEGEVLFTIPRSALLHQGTTKVSTLLEKEQSSLQSSSGWVPLLLALLYEYTSSQ--SH 112

Query: 168 WSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY 225
           W  Y+S     +     ++W++ E  R L+ + + E     + N+   Y D+ L   +++
Sbjct: 113 WKAYLSLWTDFKTLDHPMFWSKEERGRLLKGTGVPEAVDRDLANIQREYTDVVLPFMTRH 172

Query: 226 PDLFPEEVFNMETFKW--SFGILFS----------RLVRLPSMDGRVALVPWADMLNHSC 273
           PDL+      ++ +    +F + +S                       +VP ADMLNH  
Sbjct: 173 PDLWNPGTHTLDLYTQLVAFVMAYSFQEPQDEEDEEEEEEEKPPNPPMMVPMADMLNHVS 232

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
                L++ + S  +V    R    G++VF +YG+ +N +LL  YGF     +N +D+ +
Sbjct: 233 NHNANLEFTQDSLKMVCV--RPIHKGQEVFNTYGQIANWQLLHMYGFTEPCSSNSNDTAD 290

Query: 334 LPLS 337
           +P++
Sbjct: 291 IPVA 294


>gi|159471213|ref|XP_001693751.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
 gi|158283254|gb|EDP09005.1| transcription factor, E2F and DP-related [Chlamydomonas
           reinhardtii]
          Length = 656

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 112/245 (45%), Gaps = 21/245 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
           RGL A   +  G+ +L VP  L+++ ++     + G+VL      + D  +   +   E 
Sbjct: 75  RGLRADTGVGPGDVVLHVPVELLMSYETAKES-DLGKVLTALPLGLGDDSVALIWTCVER 133

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL-------EASQIRERAIERITNVIGT 213
              ++   + + S LP++  ++L  + A++D  L       EA Q R    E        
Sbjct: 134 REPEAPAAAWWAS-LPQRFSTVLSASDADVDAALAGSPLAAEAGQARRHLAEAFAASQPA 192

Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSMDGRVALVPWADMLNH 271
           +  L       YPD F    F+ E++ W+  + +S    V++ + D R  LVP+  ++NH
Sbjct: 193 FESL----LKAYPDYFQPHWFSWESYLWAAELWYSYGIQVQVAAGDIRTCLVPYLGLMNH 248

Query: 272 SC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
                V  F   D +S+G+     R    G Q+F+SYG   N +LLL YGF   +  NP 
Sbjct: 249 HPLPHVVHFSKVDPASRGLRVRAFRPCARGRQLFLSYGPYPNSKLLLFYGFALPD--NPV 306

Query: 330 DSVEL 334
           D VEL
Sbjct: 307 DEVEL 311


>gi|118353077|ref|XP_001009809.1| hypothetical protein TTHERM_00160790 [Tetrahymena thermophila]
 gi|89291576|gb|EAR89564.1| hypothetical protein TTHERM_00160790 [Tetrahymena thermophila
           SB210]
          Length = 409

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 74/298 (24%), Positives = 126/298 (42%), Gaps = 44/298 (14%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDV-------GERGLVALKNIRKGEKLLFVPPSLV--- 125
              L KWL+++G       + +V++       G  G  A    +KG+ L  +P   +   
Sbjct: 14  VDNLFKWLNENG----ATGLDQVEIKPSPECNGSIGCFAKIEFKKGDILAKIPKKCILGL 69

Query: 126 --------ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
                   I+  ++++  E G+ L +    +  +L  Y       EK + W  Y+ +LP 
Sbjct: 70  GQAVKSPLISKLNEYAQQEYGKKLDKQVFSNEFMLWIYEGQCLIEEKDNHWKAYLESLPS 129

Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
           +   +  W    L +  + +     A+E+   +     +    I SK+PDL   E+   +
Sbjct: 130 ESPIVCSWDNNILQKISKTN--LGSAVEKELAIFQKQIEFLQSIQSKFPDLLHPEI--TK 185

Query: 238 TFKWSFGILFSR-LVRLPSMDGRVA-----------LVPWADMLNHSCEVETFLDYDKSS 285
             +WS G   SR  V   ++DG  +           +VP+ D+LNH  + +    +D+  
Sbjct: 186 YIEWSKGNYLSRRFVGKLAIDGEGSGLEQYGGKMGCMVPFFDLLNHKNDHKVNFQHDEEY 245

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
             V +  +   + GE+VF +Y K SN ELL +YGF      N  D   LPL L   DK
Sbjct: 246 --VWYVCEYDIKAGEEVFNNYCKASNEELLFTYGFAVE--NNQLDV--LPLKLMACDK 297


>gi|313234004|emb|CBY19580.1| unnamed protein product [Oikopleura dioica]
          Length = 791

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 106/221 (47%), Gaps = 42/221 (19%)

Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL--EASQI 199
           ++ +VP   +   +L +E   +K+SR + +IS+LPR+ Y+L +   A+L + +  ++ Q+
Sbjct: 137 EKVAVPAEDVFIHFLCTEEKQKKASRVAGWISSLPRESYNLPFNWPADLQKCVCDDSLQL 196

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM--- 256
             +A       +  +N L  R F  Y  +  +  F+ +T  W++ +  +RL  LP     
Sbjct: 197 AIKA------QVADFNTLIAR-FEHYSTVIDDVKFSRDTLAWAYSVWSARLFELPQYPES 249

Query: 257 ----------------DGR-VALVPWADMLNHSCEVETFLD------YDKSS-----QGV 288
                           D R  A +P  D+LNHS     FLD      +DK+      +  
Sbjct: 250 SSNSVNLPKWLDNDPNDLRSFAFLPIFDLLNHSSTPNVFLDIREKHVWDKTKKIHPEEKF 309

Query: 289 VFTTDRQYQ--PGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           V + + + +   GE++ +SYG  S+ +L L YGFV ++G N
Sbjct: 310 VLSLEAKTKIAKGEELRMSYGNLSDRDLFLKYGFVLKKGEN 350


>gi|255714603|ref|XP_002553583.1| KLTH0E02156p [Lachancea thermotolerans]
 gi|238934965|emb|CAR23146.1| KLTH0E02156p [Lachancea thermotolerans CBS 6340]
          Length = 498

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/334 (27%), Positives = 141/334 (42%), Gaps = 64/334 (19%)

Query: 58  TVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGE 115
           T T N + W  +   L        W+SD      K+ +     G++G  +VA + I KGE
Sbjct: 8   TKTNNFLRWASKDAKL--------WISD------KVKLLGTRDGDQGRYMVATQEIIKGE 53

Query: 116 KLLFVPPSLVITADSKWSCPEAGEVLKQCS--VPDWPLLATYLISE-ASFEKSSRWSNYI 172
           KL  VP    +   +        +V K+ +  V  W  L   ++ E     ++S+W  Y 
Sbjct: 54  KLFEVPRGSALNVATLSLSMRDKQVYKKLTTEVGHWEGLVIAILYEFKVMNQNSKWWPYF 113

Query: 173 SALPR--QPYSLLYWTRAELDRYLEASQIRER-----AIERITNVIGTYNDLRLRIFSKY 225
             LP   +  SL+YWT AEL+ YL+ S + ER     A E    V+    DL++   ++ 
Sbjct: 114 EVLPEPARLNSLMYWTGAELE-YLKPSGVYERVDREGAEEMYARVMKCAEDLKI---TEL 169

Query: 226 PDLFPEEVFNMETFKWSFGILFSR-------------------LVRLPSMDGRV-ALVPW 265
            ++  EE  ++ +   ++     R                          DG   ++VP 
Sbjct: 170 TNITWEEFMHVASIIMAYSFDMERPDYEDSDEEVEESDEEEEEEKNTVWNDGYFKSMVPM 229

Query: 266 ADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
           ADMLN   H C     L Y  S + ++         GEQ++ +YG+  N ELL  YG+V 
Sbjct: 230 ADMLNSDTHKCNAN--LTY--SPEALIMVAVADIPSGEQIYNNYGEYPNSELLRRYGYVE 285

Query: 323 REGTNPSDSVELPLS--LKKSDKCYKEKLEALRK 354
             G+   D  E+PL   LK  + C    LEA R+
Sbjct: 286 WSGSK-FDCGEMPLETLLKAIEVC----LEAPRR 314


>gi|303288325|ref|XP_003063451.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455283|gb|EEH52587.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 478

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 90/193 (46%), Gaps = 24/193 (12%)

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-RERA-IERITNVIGTY 214
           IS A +  S   +  ++ALP      + W+  +++R L    + R RA + RI  V    
Sbjct: 113 ISRARYVLSLPGAELVNALP------IGWSDEDIERRLRGDLLSRTRATLARIHAVADAI 166

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--VRLP----SMDGRV-ALVPWAD 267
              R + F    D F   VF+++  KW+  + +SR   +R P    +  G V ALVP  D
Sbjct: 167 ARSRSKAFGDADDAFF--VFSVDDLKWAHAVFWSRAMTLRFPRKGFTGGGDVDALVPLVD 224

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQ---YQPGEQVFISYGKKSNGELLLSYGFV-PR 323
           M NH       L+  +   G  F   R     + G++VFI+YG K N ELL  +GFV P 
Sbjct: 225 MCNHRAGSTATLEIVEDDAGDAFYELRAGVATKAGDEVFINYGAKGNEELLRCHGFVIP- 283

Query: 324 EGTNPSDSVELPL 336
              NP D + + L
Sbjct: 284 --NNPCDVLAVDL 294


>gi|359476494|ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 119/287 (41%), Gaps = 63/287 (21%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP-LLATYLIS 158
            G RGL A +++ +GE +L VP S ++T+ S     +    +K+ +    P +L   L++
Sbjct: 46  AGGRGLAAARDLSQGELILTVPKSALMTSQSLLKDEKLSVAVKRHTSLSSPQILTICLLA 105

Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERI----TNVIG 212
           E S  KSS W  Y+  LPR   +L  +++ E    +  +A  + ERAI +        I 
Sbjct: 106 EMSKGKSSWWHPYLMQLPRSYDTLANFSQFEKQALQVDDAIWVTERAILKAELEWKKAIP 165

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS 272
              +L+L+           ++ N   + W+   + SR + +P  D    L P  D  N++
Sbjct: 166 LMEELKLK----------PQLQNFRAWLWASSTVSSRTMHIP-WDDAGCLCPVGDFYNYA 214

Query: 273 CEVE--------------------TFLDYDKSSQ-----------------------GVV 289
              E                    +F + D +S                           
Sbjct: 215 APGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRLTDGGYKEDLAAYC 274

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           F   + Y+ GEQV +SYG  +N ELL  YGF+  E  NP+D   +PL
Sbjct: 275 FYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDE--NPNDKAFIPL 319


>gi|71995786|ref|NP_497604.2| Protein SET-27 [Caenorhabditis elegans]
 gi|373220599|emb|CCD73865.1| Protein SET-27 [Caenorhabditis elegans]
          Length = 502

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 151/384 (39%), Gaps = 69/384 (17%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           ++  T   W    G+    + I         L A   I K   +  VP   +IT D    
Sbjct: 72  DSIKTFLAWADGVGIARNNVTIGSTKTAGLSLQATGPIPKSHIVARVPRHAMITLD---- 127

Query: 134 CPEAGEVLKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
             +   +LK+    D P+        LA +L  +    + S+W +YIS LP    + L++
Sbjct: 128 LAKKSSLLKKAFEKD-PIVGGMDNVGLALFLACQWIQNEKSKWKSYISILPTTFPTPLFY 186

Query: 186 TRAELDRYLEASQIRERAI-----------------------ERITNVIGTYNDLRLRIF 222
           +  +L + L+ S I E AI                       E         N +   IF
Sbjct: 187 SEEQLLQ-LKPSPIFEEAILFYRTISRQFCYFLLAIAKNKIYEAAQRRKDARNAMETPIF 245

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-----DGRV----ALVPWADMLNH-- 271
              P  F    F  + + W+ G++ +R+  +PS      DG      AL+P  DM NH  
Sbjct: 246 YNVP--FNVANFTPKLYFWAVGVVTTRVNMVPSENQVGEDGNPVIIPALIPVLDMANHEN 303

Query: 272 ------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
                 +  +E  + Y    +  V T+    + G +V I YG +S GE LL  GFVP   
Sbjct: 304 VLTDVLTEPIEDLVCYSPEEECAVITSHCDVKAGNEVTIFYGCRSKGEHLLHNGFVPIYH 363

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKY---GLSASECFPIQITGW-----PLELMAY 377
               D ++L + + K+DK    K + ++K+      A   F + +  +     PL+L+ +
Sbjct: 364 -GKFDVLKLKIGIPKTDKTLDAKKKLIQKFVKKVYCAGNIFHVDLYNYHEQPFPLDLLMF 422

Query: 378 AYLVVSPPSMKGKFEEMAAAASNK 401
           A + VS        EE  +A  N+
Sbjct: 423 AAIFVSTTPT----EEAVSAPENR 442


>gi|18041979|gb|AAL57769.1|AF388528_1 hypothetical protein RDA279 [Rattus norvegicus]
          Length = 328

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/233 (27%), Positives = 98/233 (42%), Gaps = 28/233 (12%)

Query: 155 YLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           +L+SE      S W +Y+  LP+    P  L       L   L A    +RA  R+ ++ 
Sbjct: 1   FLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPGPLRAKAEEQRA--RVQDLF 58

Query: 212 GTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGRV-------- 260
            +  D     FS    LF E V   F+   F W++  + +R V L S             
Sbjct: 59  ASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTRAVYLKSRRQECLSSEPDTC 114

Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           AL P+ D+LNHS  V+    +++ ++     T  + +  ++ FI YG   N  LLL YGF
Sbjct: 115 ALAPFLDLLNHSPHVQVKAAFNEKTRCYEIRTASRCRKHQEAFICYGPHDNQRLLLEYGF 174

Query: 321 VPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQITGW 370
           V     +    V   + LK    +DK   +KL  L  +G + +  F     GW
Sbjct: 175 VAFGNPHACVPVSGEMLLKYLPPADKQVHKKLSILEDHGFTGNLTF-----GW 222


>gi|302805649|ref|XP_002984575.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
 gi|300147557|gb|EFJ14220.1| hypothetical protein SELMODRAFT_42811 [Selaginella moellendorffii]
          Length = 530

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 133/362 (36%), Gaps = 77/362 (21%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPP---------- 122
           E      +W  + G+  +  AI++V   E  GL    +  +G+ L F  P          
Sbjct: 3   ERLERFSRWSQEHGIQFRGCAIKRVSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLV 62

Query: 123 ----SLVITADSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISAL 175
                L +T  +    P  G V ++     + D  L+  +LI E +  ++S W+ Y+  L
Sbjct: 63  VTPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEML 122

Query: 176 PRQPYSLLYWTRAEL-----DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKY---PD 227
           P    + L++   EL         EA++ ++R +  +   IGT    +L + S Y   PD
Sbjct: 123 PSGFGTPLWFEDEELMELDGTTLFEATKAQQRCLPSV--YIGTLC-CQLFLVSLYLFRPD 179

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRV------------- 260
              +     + F W+  I ++R + +P                DG               
Sbjct: 180 ---DRELEFQEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISC 236

Query: 261 -----------ALVPWADMLNHSCEVETFLDYDKSS-------QGVVFTTDRQYQPGEQV 302
                       LVP  D  NH+       + D S          +    D  + PG +V
Sbjct: 237 SSKDVSTIWIEGLVPGIDFCNHTRRASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEV 296

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
            I+YG K N ELL  YGFV  + +N    V  P      D     KL+ LR+  LS    
Sbjct: 297 LINYGDKGNEELLFLYGFVEEDNSNDYVMVHFPKMFLDEDNTMDFKLQLLRELDLSLKWL 356

Query: 363 FP 364
            P
Sbjct: 357 LP 358


>gi|302810436|ref|XP_002986909.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
 gi|300145314|gb|EFJ11991.1| hypothetical protein SELMODRAFT_235145 [Selaginella moellendorffii]
          Length = 447

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/343 (22%), Positives = 132/343 (38%), Gaps = 53/343 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+ AL+++  GE +  +P +  +T  +  +            +     L   ++ E S  
Sbjct: 9   GVRALRDLHHGELIATIPKAACLTLLTTAARDAIARARLGGGLG----LTVAVMYERSKG 64

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           K S+W  Y+  LP Q      W+  E+D  L  +++ +   E    +   + +    +  
Sbjct: 65  KGSKWYRYLKTLPCQESVPFLWSEEEIDGLLLGTELHKALKEDKLLMKEDWEENIAPLTK 124

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETF----- 278
           + P  FP + F  E++  +  ++ SR   + +  G   +VP AD+ NH  + E       
Sbjct: 125 EDPLEFPAQDFTFESYLAAKSLVSSRSFEIDAEHG-YGMVPLADLFNHKTDAEDVHFMLN 183

Query: 279 --------------------------LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
                                     +  DKS   +V   D     G ++F +YG+  N 
Sbjct: 184 ASDSDDDDDNNGLIIDDGLANGDCREISSDKSVLEMVMVKD--VAAGSEIFNTYGQLGNA 241

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSL-------KKSDKCYKEKLEALRKYGLS-----AS 360
            LL  YGF   E  NP D V L +         +   K  +++    RK G S      S
Sbjct: 242 ALLHRYGFT--EPNNPHDIVNLDMDCLLEVLLSRFQKKRVRKRGRVWRKAGFSGCESQGS 299

Query: 361 ECFPIQITGWP-LELMAYAYLVVSPPSMKGKFEEMAAAASNKM 402
           E F I   G P +EL+   +++ SP       E+ AA    ++
Sbjct: 300 EYFEISAAGKPQIELLLLLFVIQSPARDCEALEDAAAKVKGRV 342


>gi|238882716|gb|EEQ46354.1| hypothetical protein CAWG_04701 [Candida albicans WO-1]
          Length = 549

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 85/327 (25%), Positives = 136/327 (41%), Gaps = 73/327 (22%)

Query: 74  ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
           E +   Q WL  + +    K+AI    D  + RG++AL++I   E +  +P S+V+  D+
Sbjct: 6   EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65

Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
                    VLK+  V D W  L   L  E  F+            S W  Y++ LP Q 
Sbjct: 66  NSLIKLYPSVLKKLRVLDQWIGLIIVLGFEMKFKFNPNNNNDNNNKSFWYEYLNILPDQF 125

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
             L+YW   EL+ +L+ S I +R        IG  N+L +  +++   +  +++  +E F
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDR--------IGKENNLNM--YNQIISIINQDLSGVEEF 174

Query: 240 KWS------------FGILFSRLVRLP-----SMDGR----------------------- 259
           K S              + +S  V +P     + +G                        
Sbjct: 175 KSSPLTFEEYNKVATIIMSYSFDVEVPKSKKVTKNGTNEKGNDEDKEDDGDDDDDDEEED 234

Query: 260 ----VALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
                ++VP+AD LN    +    L Y  S+  ++ T  +    GEQV+ +Y    N EL
Sbjct: 235 NEYYKSMVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSEL 292

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
           L  YG+V   G+   D  E+PLS  KS
Sbjct: 293 LRRYGYVELNGS-KYDFGEIPLSTIKS 318


>gi|452841392|gb|EME43329.1| hypothetical protein DOTSEDRAFT_131367 [Dothistroma septosporum
           NZE10]
          Length = 445

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/303 (26%), Positives = 131/303 (43%), Gaps = 52/303 (17%)

Query: 71  DSLENASTLQKWLSDSGLP-PQKMAIQKV---DVGERGLVALKNIRKGEKLLFVPPSLVI 126
           D  E +     WL D+G     K+ +  +   + G RG+VA++++ + E+L  VP S ++
Sbjct: 6   DFQERSRAFVNWLRDNGASISAKITLDDLRQQNAG-RGIVAVEDLDEDEELFSVPRSTML 64

Query: 127 TADSKWSCPEAGE-VLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
           T ++  +    GE VL++   P W  L   +  E      SRW  Y   LP    +L++W
Sbjct: 65  TTETSRN----GEAVLQEVDDP-WLSLIVVMALEYLDGSQSRWKPYFDVLPVSFDNLMFW 119

Query: 186 TRAELDRYLEASQI-------------RER---AIERITNVIGTYNDLRLRIFSKYPDLF 229
           +  EL R+LE S +             RE+    IERI+      N+  LR+  +     
Sbjct: 120 SDREL-RHLEGSTVVGKIGKEAADATFREQLIPVIERISKAKAADNEELLRMCHRMGSTI 178

Query: 230 PEEVFNMETF---------KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE---VET 277
               F++ET          +W           LP       +VP ADMLN   +    + 
Sbjct: 179 MAYGFDLETSSDQAKNDGEEWEEDSDAGET--LPK-----GMVPLADMLNADADRNNAKL 231

Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           F + DK    VV  T +  + GE+++  +G     +LL  YG++  +     D VE+P  
Sbjct: 232 FYEDDK----VVMKTIKPVKAGEELYNDFGSLPRADLLRRYGYLT-DNYAQYDVVEIPAD 286

Query: 338 LKK 340
           L K
Sbjct: 287 LIK 289


>gi|400602527|gb|EJP70129.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 493

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 53/105 (50%), Gaps = 17/105 (16%)

Query: 232 EVFNMETFKWSFGILFSRLVR------------LP---SMDGRVALVPWADMLNHSCEVE 276
           E F  E ++W+F I  SR  R            LP   ++D    L+P  D+ NH   V 
Sbjct: 193 EQFRPELYRWAFAIFSSRSFRPSLVLSDEQARLLPPGVAIDDFSVLLPLFDIGNHDMTVP 252

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
             + + +   G    T R +QPGEQVF +YG K+N ELLL YGF+
Sbjct: 253 --VRWQRDGDGCALRTGRAHQPGEQVFNNYGLKTNAELLLGYGFM 295


>gi|428177025|gb|EKX45907.1| hypothetical protein GUITHDRAFT_138732 [Guillardia theta CCMP2712]
          Length = 505

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 97/201 (48%), Gaps = 12/201 (5%)

Query: 166 SRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
           S +  YI  LP    Y + +    E  + L + +I + ++   + +   ++ ++  +  +
Sbjct: 184 SEFKPYIDLLPEYHDYEMTWLWSVEEQQDLLSGKILKDSMSITSQIEREHHTIK-EVLGR 242

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLP--------SMDGRVALVPWADMLNHSCEVE 276
           + D      F++E++KW+   + SR   L           +  + LVP  DM+NHS +  
Sbjct: 243 FQDCAEFGEFSLESYKWAQATIMSRAFDLDEGQETARRQGEQNLLLVPLCDMVNHSPDAS 302

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
             +D D +    +F ++  Y+ G++V I+YG  SN +LLLS+GFV  EG   +   E+ L
Sbjct: 303 FSIDCDAAGNVNLFASE-NYKAGQEVHINYGSSSNEQLLLSFGFV-LEGGWQAQETEITL 360

Query: 337 SLKKSDKCYKEKLEALRKYGL 357
            + +  + ++ K   L   GL
Sbjct: 361 EVPQDVEGFEIKRNLLFNGGL 381


>gi|313242187|emb|CBY34355.1| unnamed protein product [Oikopleura dioica]
          Length = 563

 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 106/221 (47%), Gaps = 42/221 (19%)

Query: 142 KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL--EASQI 199
           ++ +VP   +   +L +E   +K+SR + +IS+LPR+ Y+L +   A+L + +  ++ Q+
Sbjct: 32  EKVAVPAEDVFIHFLCTEEKQKKASRVAGWISSLPRESYNLPFNWPADLQKCVCDDSLQL 91

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM--- 256
             +A       +  +N L  R F  Y  +  +  F+ +T  W++ +  +RL  LP     
Sbjct: 92  AIKA------QVADFNTLIAR-FEHYSTVIDDVKFSRDTLAWAYSVWSARLFELPQYPES 144

Query: 257 ----------------DGR-VALVPWADMLNHSCEVETFLD------YDKSS-----QGV 288
                           D R  A +P  D+LNHS     FLD      +DK+      +  
Sbjct: 145 SSNSVNLPKWLDNDPNDLRSFAFLPIFDLLNHSSTPNVFLDIREKHVWDKTKKIHPEEKF 204

Query: 289 VFTTDRQYQ--PGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           V + + + +   GE++ +SYG  S+ +L L YGFV ++G N
Sbjct: 205 VLSLEAKTKIAKGEELRMSYGNLSDRDLFLKYGFVLKKGEN 245


>gi|255089515|ref|XP_002506679.1| predicted protein [Micromonas sp. RCC299]
 gi|226521952|gb|ACO67937.1| predicted protein [Micromonas sp. RCC299]
          Length = 584

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/340 (23%), Positives = 130/340 (38%), Gaps = 87/340 (25%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD 129
           +D  +  +   +W    G+   +   + V  G RG+VA ++I +   +L VP  ++++A 
Sbjct: 9   LDGPDADADFWRWARARGVVAVRCEARDVAEGWRGIVATEDIERDAVVLRVPGDILMSAR 68

Query: 130 SKWSCPEAGEVLKQC--------SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
           S     E  E L+          S+     LA +L+ EAS  + SRW  YIS LPR    
Sbjct: 69  SM----ERDEQLRDALRTHGETRSMTPADKLAVHLLLEASRGRGSRWHEYISRLPRAYNL 124

Query: 182 LLYWTRAE---------LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           L  WTR E         +     A     ++ ER  +V+     L L + +++       
Sbjct: 125 LCCWTRRERAMLQDPAAIAVARRARDATRQSWERARDVLAA---LGLTLANRWG------ 175

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE------------------ 274
             +++ ++W+   + SR V +P  D   AL P  D+ N++                    
Sbjct: 176 --SIDAWRWARCTVSSRTVYVP-YDAAGALCPVGDLFNYAPPPPPHRHAIVGTPLEGGDE 232

Query: 275 ----------------------------VETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
                                       V     +D++S+  VF   R Y  GEQ+ + Y
Sbjct: 233 GKDDDEGKDDDKRGEDDDKGGEEGKRWTVSGDGAWDEASREYVFRARRPYVAGEQIMLCY 292

Query: 307 GKKSNGELLLSYGFV--------PREGTNPSDSVELPLSL 338
           G+ +N  LL  YGF+          E  NP D+    L  
Sbjct: 293 GRHTNLGLLEHYGFLLDEKESETGDEPGNPHDAAAFALGF 332


>gi|358392567|gb|EHK41971.1| hypothetical protein TRIATDRAFT_251278, partial [Trichoderma
           atroviride IMI 206040]
          Length = 956

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 108/265 (40%), Gaps = 38/265 (14%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPE-------AGEVLKQCSVPDWPLL 152
           RG+VAL++I     L  VP S ++   T++ +   P+       A EV  +     W  L
Sbjct: 529 RGIVALQDIPADTVLFTVPRSAIVNIETSELRAKLPDVFLNQDTAMEVDNKPQQDPWSTL 588

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
              LI E      S W  Y+  LP    + ++W+ AE+D  L+AS  R +  +  TN   
Sbjct: 589 IIVLIYEYFKGDQSSWKPYLDVLPASFETPMFWSDAEVDE-LQASATRSKIGK--TNAEE 645

Query: 213 TYNDLRLRIFSKYPDLF-------PEEVFN----METFKWSFGILFSRLVRLP------- 254
            ++   L +    PD+F        EE+      M +   S+   F              
Sbjct: 646 MFHAKILPVIRGNPDIFQTSQAKSDEELIQLAHRMGSTIMSYAFDFQNEDEEEEDDSEEW 705

Query: 255 ----SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
                    + +VP AD+LN   E    ++Y      +   T R  + GE++   YG   
Sbjct: 706 VEDREAKSTMGMVPMADILNADAEYNAHVNY--GDDALTVATLRTIKAGEEILNYYGPHP 763

Query: 311 NGELLLSYGFVPREGTNPSDSVELP 335
           N ELL  YG+V  + +   D VELP
Sbjct: 764 NSELLRRYGYVTPKHSR-YDVVELP 787


>gi|291390222|ref|XP_002711632.1| PREDICTED: SET domain containing 6 [Oryctolagus cuniculus]
          Length = 817

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 69/292 (23%), Positives = 129/292 (44%), Gaps = 29/292 (9%)

Query: 81  KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           KW    GL   P+    ++  V   G+VA +++++GE L  VP + ++   S+ +C   G
Sbjct: 394 KWGLQVGLELSPKVAVSRQGTVAGYGMVARESVQRGELLFAVPRAAIL---SQHTC-SIG 449

Query: 139 EVLKQ-----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELD 191
           ++L++      S   W +     +       +S WS Y +  P   +    ++W   E  
Sbjct: 450 DLLERERGALQSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERR 508

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           R L+ + + E   + + ++   Y  + L     +PDLF  +V ++E +     ++ +   
Sbjct: 509 RLLQGTGVPEAVEKDLASIRSEYYSIVLPFMEAHPDLFSPKVHSLELYHQLVALVMAYSF 568

Query: 252 RLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQ 301
           + P  +           +VP AD+LNH       L+Y      +V T     QP   G +
Sbjct: 569 QEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSADYLRMVAT-----QPIPKGHE 623

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           +F +YG+ +N +L+  YGFV     N  D+ ++ +   +     K K+EA R
Sbjct: 624 IFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQKAKVEAER 675


>gi|298711968|emb|CBJ32910.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 247

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 78/167 (46%), Gaps = 13/167 (7%)

Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
            ++LV+L     R AL P  D++NH   +++ + Y+        TT R +  GEQV ISY
Sbjct: 1   MTKLVQLK----RYALTPVVDLINHQSGIDSDVSYNYFYGYFAVTTQRGWTAGEQVLISY 56

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK-SDKCYKEKLEALRKYGLSASECFPI 365
           G +SN  LL  YGFV ++  NP+D   +   + K SD   K+ +  LR+ G        +
Sbjct: 57  GPRSNDHLLRRYGFVEQD--NPNDVYRITGLIDKLSDVLGKDSVRVLRESG------GKL 108

Query: 366 QITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPE 412
             TG   E      + V    + G+ EE       ++   KD + PE
Sbjct: 109 GTTGDNAEGRPVESVTVGRSGLLGEKEEGRVMPVFRLAVVKDDQLPE 155


>gi|85099007|ref|XP_960703.1| hypothetical protein NCU06658 [Neurospora crassa OR74A]
 gi|28922220|gb|EAA31467.1| predicted protein [Neurospora crassa OR74A]
 gi|28950107|emb|CAD70887.1| conserved hypothetical protein [Neurospora crassa]
          Length = 469

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 26/261 (9%)

Query: 111 IRKGE-----KLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEASFE 163
           +R GE      L+ VP SLV+   +     +  +  KQ   +V    +LA+    +A   
Sbjct: 38  VRDGELQPEVPLMTVPNSLVLNVQAVDEYAKEDKNFKQLLGAVGHHLVLASK-THQAPVG 96

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIF 222
            S+ W+ YI  LP+       WT  E  R L      E A+  ++T +   ++ +R    
Sbjct: 97  VSNPWTEYIKFLPKTVLVPTLWTEDE--RLLLRGTSLESAVNAKMTAITAEFDAVR-EAA 153

Query: 223 SKYPD----LFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWADMLNHSCE 274
           S  P     L+P E  N      S+ +L     SR++ LP      ++VP  DM+NHS  
Sbjct: 154 SSLPSWNDVLWPYEDGNSSASLRSWILLDALYRSRVLELPK--SGESMVPCIDMINHSTR 211

Query: 275 VETFLDYDKSSQGVVF-TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSV 332
              + D +   + V+    D    PGE+V ISYG  K   E+L SYGF+  E T   +S+
Sbjct: 212 ASAYYDENAKDEVVLLPRPDSSISPGEEVTISYGDAKPAAEMLFSYGFIDPEAT--VESL 269

Query: 333 ELPLSLKKSDKCYKEKLEALR 353
            LPL   + D   K KL A +
Sbjct: 270 VLPLEPFEDDPLAKAKLFAFK 290


>gi|358386801|gb|EHK24396.1| hypothetical protein TRIVIDRAFT_168260 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 20/151 (13%)

Query: 222 FSKYPDLFPEEVFNMETFKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS--- 272
           ++ + D FP+  +   T+ W      SF       ++ P  D R+AL+P AD+ NHS   
Sbjct: 130 WNAFKDAFPDVPYEEYTYAWMIVNTRSFYNETPETLKYPWED-RLALIPVADLFNHSDDG 188

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS--- 329
           C+V        S+ G     DR+Y+ GE++FISY   SN  +LL YGF+P E  +     
Sbjct: 189 CKVYY------SADGYHIVADREYKKGEELFISYSSHSNDYILLEYGFIPDESLDDDVYI 242

Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           D    P  L +  K   EK + L +Y L +S
Sbjct: 243 DDAVFP-KLSEGQKEELEKRDLLGEYPLESS 272


>gi|291000152|ref|XP_002682643.1| predicted protein [Naegleria gruberi]
 gi|284096271|gb|EFC49899.1| predicted protein [Naegleria gruberi]
          Length = 619

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 89/197 (45%), Gaps = 22/197 (11%)

Query: 155 YLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAEL------DRYLEASQIRERAIERI 207
           +LI E   EK  S    Y++ LPR+  + LY+   E+      + Y     IR+      
Sbjct: 112 FLIYELHVEKEKSTHFPYLNLLPREFTTALYFDEDEMAALRSTNLYKSVQSIRQ------ 165

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--VRLPSMDGRVA---- 261
            N+   Y      + +KYP  F  +VF+ E F W+F  ++SR+  +  P+ +G       
Sbjct: 166 -NLKQIYETKVEYLMNKYPQKFDRQVFSYENFMWAFSAVWSRVFPIEYPAENGEGVEIVP 224

Query: 262 -LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
            L+P  D+LNH    +    +  S +     T    + G+ V  +YG KSN   LLSYGF
Sbjct: 225 TLLPTVDILNHKFNAKITY-FTGSDRRFYLKTRESLKSGDYVCNNYGAKSNDSFLLSYGF 283

Query: 321 VPREGTNPSDSVELPLS 337
           V    +  +  V+  +S
Sbjct: 284 VIPNNSEDTLYVQFGIS 300


>gi|303279242|ref|XP_003058914.1| set domain protein [Micromonas pusilla CCMP1545]
 gi|226460074|gb|EEH57369.1| set domain protein [Micromonas pusilla CCMP1545]
          Length = 457

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 99/402 (24%), Positives = 159/402 (39%), Gaps = 75/402 (18%)

Query: 79  LQKWLSDSGLPPQKMAIQ---KVDVG------ERGLVALKNIRKGEKLLFVPPSLVITAD 129
            + WL  +G    + AI+   ++D G        G+ A ++I  GE +  +P S   T +
Sbjct: 4   FKTWLRSNGFWWNEDAIELGSRIDEGGGEDAPRVGVKAKRDIEIGESVARIPSSACFTCE 63

Query: 130 SKWSCPEAGEVLK---QCSVPDW-PLLATYLISEASFEKSSRWSNYISALPR-QPYSLLY 184
           +   C  A  V K        +W   L T L+ E +   SSRW+ Y+ +LP  +P  ++ 
Sbjct: 64  N---CAHADAVRKVKLSAGEDEWLASLGTALVLERTLGSSSRWNAYLDSLPHSEPDVVMM 120

Query: 185 WTR-AELDRYLEASQIR-----ERAIER------ITNVIGTYNDLRLRIFSKYPDLFPEE 232
           W+   E  RYL  + I      ERA  R      +  V+ T     LR  +K        
Sbjct: 121 WSEDGERRRYLCGTDIEQSLRDERAAARTEWTRHVKPVLDT-----LRGAAKD------- 168

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
               + F  +  +  SR   +    G   LVP AD+ NH          D      V   
Sbjct: 169 -VGFDDFLAARSVASSRAFTVNPRVG-AGLVPIADLFNHRTGGHHVYLSDARGTAAVSER 226

Query: 293 D-------------RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
           D             +  + GE+VF +YGK  N +LL SYGF   +  NP+D V + +   
Sbjct: 227 DEGSDDDALFVRVVKASKAGEEVFNTYGKLGNAKLLCSYGFAQLD--NPADKVTIGVPAL 284

Query: 340 K--------SDKCYKEKLEALRKYGLSASE-CFPIQITGWPLELMAYAYLVVSPP----- 385
           +        S      +L      GL   E  F +++   P +++     V++       
Sbjct: 285 RAAAALRGVSGAQIATRLAWCDAIGLCDDETTFELRLGADPPDVLLLVSWVLASTDDRFD 344

Query: 386 ---SMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDS 424
              ++K   +E + AA+ K T +   +    DE AL  IL++
Sbjct: 345 AVRAVKTGGDETSIAAALKETIESGARGGLKDESALGIILET 386


>gi|321470773|gb|EFX81748.1| hypothetical protein DAPPUDRAFT_317395 [Daphnia pulex]
          Length = 495

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 81/357 (22%), Positives = 151/357 (42%), Gaps = 46/357 (12%)

Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEK 164
           A K +   E L  +P  L+++ ++  S      +     +   P   LA ++++E  ++ 
Sbjct: 108 ATKQVSTDELLFSIPQKLMLSNETANSSTIGHFINNDPILSQMPNVALAFHVLNEL-YDP 166

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
            S W  Y+ ALP    +++Y+T  E+   L+ S   + A+    N+         R +S 
Sbjct: 167 KSFWKPYLDALPSSYDTVMYFTPDEITE-LKGSPAFDDALRMCRNIA--------RQYSY 217

Query: 225 YPDLFPEEV----------FNMETFKWSFGILFSRLVRLPSMDGRV-----------ALV 263
           +  L  + V          F    ++W+   + +R   +PS +              AL+
Sbjct: 218 FYSLLQKNVDPALSNLRANFTYNDYRWAVSTVMTRQNLIPSQEEISGNDKDQLPPVNALI 277

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
           P  D  NH  + +   ++   S+  V    R + PGEQVFI YG ++  E  +  GFV  
Sbjct: 278 PLWDFCNHQ-DGQFSTEFQLESRRTVCQAGRDFGPGEQVFIFYGTRTCAEQFIHNGFV-- 334

Query: 324 EGTNPSDSVELPLSLKKSDKCYKE------KLEALRKYGLSASECFPIQITGWPLE--LM 375
           +  N  D++ L + L KSD    +      KL  L    +S    F ++    P++  L+
Sbjct: 335 DINNAHDALTLKVGLSKSDPLAGQRATLLCKLRILSDEKISGPIAFQLKAGPQPVDGKLL 394

Query: 376 AYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKY 432
           A+  L         ++ + +  ASN M  +  I+  E+D+++  F+   C+  +  Y
Sbjct: 395 AFLRLFCMTKDSLDRWLQ-SDNASNLMHEECGIET-EVDDKSWSFLKARCQLLLQLY 449


>gi|336472467|gb|EGO60627.1| hypothetical protein NEUTE1DRAFT_75928 [Neurospora tetrasperma FGSC
           2508]
 gi|350294307|gb|EGZ75392.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 469

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 79/290 (27%), Positives = 128/290 (44%), Gaps = 23/290 (7%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
            L  W   +G+    + +  ++    G+V    ++    L+ VP SLV+   +     + 
Sbjct: 10  ALPAWALLNGIAFPHVKVANIEGKGFGVVRDGELKPEVPLMTVPNSLVLNVQTVDEYAKE 69

Query: 138 GEVLKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            +  KQ   +V    +LA+    +A    S+ W+ YI  LP+       WT  E  R L 
Sbjct: 70  DKNFKQLLGAVGHHLVLASK-THQAPVGVSNPWTEYIKFLPKTVLVPTLWTEDE--RLLL 126

Query: 196 ASQIRERAIE-RITNVIGTYNDLR-----LRIFSKYPDLFPEEVFNMETF--KWSF--GI 245
                E A+  ++T +   ++ +R     L I++    L+P E  N      +W     +
Sbjct: 127 RGTSLESAVNAKMTAITAEFDAVREAASSLPIWNDI--LWPYEDGNSSASLRRWILLDAL 184

Query: 246 LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVF-TTDRQYQPGEQVFI 304
             SR++ LP      ++VP  DM+NHS     + D +   + V+    D    PGE+V I
Sbjct: 185 YRSRVLELPK--SGESMVPCIDMINHSTRASAYYDENAKDEVVLLPRPDSSISPGEEVTI 242

Query: 305 SYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           SYG  K   E+L SYGF+  E T   +S+ LPL   + D   K KL A +
Sbjct: 243 SYGDAKPAAEMLFSYGFIDPEAT--VESLVLPLEPFEDDPLAKAKLFAFK 290


>gi|428177623|gb|EKX46502.1| hypothetical protein GUITHDRAFT_138238 [Guillardia theta CCMP2712]
          Length = 486

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 124/272 (45%), Gaps = 34/272 (12%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E    L +W  + G+  +K+++Q+ + G   + A +++ + E +  VP S+ I  +S W 
Sbjct: 168 ERREKLLEWAREHGIGFEKISLQEDEFGGTAMFASEDLEEDEVIGVVPFSISIGRESLWR 227

Query: 134 CPEAGEVLKQC-----SVPDWPLLATYLISEASFEKSSRWSNYISALPR----------Q 178
               GE+L Q      + PD  L++   +       SS +  Y+  LP            
Sbjct: 228 S-RHGELLGQLYEDERTPPD--LISCIFLLLERRSSSSFFRPYLDMLPTPSGVSNVFHWD 284

Query: 179 PYSLLYWTRAELDRYLEASQIR--ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
            ++L  ++  E  R L A+ +   ER  +R   V+    + + R F K+     +E+F+ 
Sbjct: 285 AHALSAFSPHEEARSLAAAHLSLFERTYQRYFTVVNKNEEFQ-RQFGKH-----QEIFSR 338

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH--SCEVETFLDYD-KSSQGVVF 290
           +   W++ +L SR    P  + R +   ++P AD+ NH  S     ++  + ++ QG VF
Sbjct: 339 DQVLWAYSLLISRAWEHPDYNYRTSFHRMLPIADIANHKMSPTGSGWMSVEFRNQQGAVF 398

Query: 291 TTDR--QYQPGEQVFISYGKKSNGELLLSYGF 320
              R    + G+++  SY    N  LL+ YGF
Sbjct: 399 LVTRGGAIRRGQEIVTSYSNAGNALLLVQYGF 430


>gi|384251065|gb|EIE24543.1| hypothetical protein COCSUDRAFT_40909 [Coccomyxa subellipsoidea
           C-169]
          Length = 685

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 102/224 (45%), Gaps = 11/224 (4%)

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
           G+ L    +   PLL  Y + +   +K S ++ + ++LP    + L  T  E+   LE +
Sbjct: 18  GQALAALGIEKDPLLLLYTMID-RHDKDSDFAPFWASLPEVFMTGLSATEEEVS-MLEGT 75

Query: 198 QIRERAIERITNVIGTYNDLR---LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
                 +E   ++   Y   +     + + YPD    ++   + F W+  + +S  + + 
Sbjct: 76  PAHTTFVEARQHIREQYRAAQPVLQALTAAYPDDITPDLVTEDKFIWACELWYSYAIEVE 135

Query: 255 SMDG--RVALVPWADMLNHSC--EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
            +DG  R  LVP A +LNHS    +  +   D ++  +     R    GEQ F+SYG   
Sbjct: 136 YVDGAVRQTLVPIAHLLNHSPWPHIVRYGRLDAATDSLRLRAFRHCAAGEQCFLSYGPLP 195

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           N +LLL YGF   +  NP D+V +    +K++    + LEA  K
Sbjct: 196 NLKLLLFYGFALPD--NPHDTVPITFEAEKNEGDVTDMLEACLK 237


>gi|378728064|gb|EHY54523.1| SET domain-containing protein 6 [Exophiala dermatitidis NIH/UT8656]
          Length = 495

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 120/312 (38%), Gaps = 80/312 (25%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA--GEVLKQCSVPDWPLLATYLISEA 160
           RG VA+ +I   E+L  +P SLV+T  +  S P +   E+  + +   WP L   +I E 
Sbjct: 48  RGAVAIADIASDEELFAIPRSLVLTTATS-SIPRSVLKELEDKGATGAWPPLIVTIIYEY 106

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN 215
              +SS W  Y   LP    +L++W  AEL   L+AS +     R +A E   N I    
Sbjct: 107 LRGESSPWHPYFKILPTTFNTLMFWNDAELAE-LQASAVVDKIGRRQAEEEWQNTI---- 161

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------------- 260
              +   + +PDLFP             G   ++L+ L  M G +               
Sbjct: 162 ---IPTMADHPDLFP------------VGGSSAKLIELAHMAGSLIMAYAFDIDRDDMED 206

Query: 261 --------------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
                                      +VP+ADMLN   +      + +    ++    +
Sbjct: 207 DNDNDKDGADSADDEFEEDDEDEPFKGMVPFADMLNADADKNNARLFQEPDY-LIMKATK 265

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL---------KKSDKCY 345
               GEQ+F  YG     +LL  YG+V  +     D VE    L         K  D+ +
Sbjct: 266 PISAGEQIFNDYGPLPRSDLLRMYGYV-TDNYAQYDVVEFSHDLLLEVAGKHSKSKDQVW 324

Query: 346 KEKLEALRKYGL 357
           +E+ + L + G+
Sbjct: 325 REREQQLDELGV 336


>gi|322703179|gb|EFY94792.1| UV-endonuclease UVE-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1118

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 69/281 (24%), Positives = 115/281 (40%), Gaps = 51/281 (18%)

Query: 94  AIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE----AGEVLK 142
           AI+ VD+      RG+VAL++I     L  +P   +I +D+   +   PE     G+  +
Sbjct: 679 AIEIVDLRSRDAGRGIVALRDIPADTTLFTIPRDAIINSDTSSLREKLPELFESQGDEDE 738

Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
           Q ++  W  L   ++ E      S+W  YI  LP    + ++W+  EL  YL+AS     
Sbjct: 739 QQALDSWSALILIMMYEFFLGHQSKWKPYIDVLPLTFDTPMFWSEEELS-YLQASA---- 793

Query: 203 AIERITNVIGTYND---LRLR----------IFSKYPDLFPEEVFNME------TFKWSF 243
                 N IG  +     R R          +F+   D   E++  +          ++F
Sbjct: 794 ----TVNKIGKADAEEMFRTRLIPAIRGNPSVFASSGDCSDEDLIGLAHRMGSTIMAYAF 849

Query: 244 GILFSRLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
            +           DG V          +V  AD+LN   E    +++    + +  T+ R
Sbjct: 850 DLENEEAENDDESDGWVEDREGKSMMGMVAMADILNADAEFNAHVNH--GDEELTVTSIR 907

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
             + GE++   YG   N ELL  YG++  E  +  D VE+P
Sbjct: 908 DIKAGEEILNYYGPHPNSELLRRYGYIT-EKHSRYDVVEIP 947


>gi|311257193|ref|XP_003127001.1| PREDICTED: N-lysine methyltransferase SETD6 [Sus scrofa]
 gi|335289289|ref|XP_003355838.1| PREDICTED: N-lysine methyltransferase SETD6-like [Sus scrofa]
          Length = 448

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 122/278 (43%), Gaps = 18/278 (6%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + V+   S+ +C  +G + ++    
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQPGELLFVVPRAAVL---SQHTCSISGLLERERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +S WS Y +  P   +    ++W   E  R L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEDEK 211

Query: 262 ------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
                 +VP AD+LNH       L+Y  +   +V T  +    G ++F +YG+ +N +L+
Sbjct: 212 EPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT--QSIPKGHEIFNTYGQMANWQLI 269

Query: 316 LSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
             YGFV     N  D+ ++ +   +       K+EA R
Sbjct: 270 HMYGFVEPYPDNKDDTADIQMVTVREAALQGTKIEAER 307


>gi|344301751|gb|EGW32056.1| hypothetical protein SPAPADRAFT_138237 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 483

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/260 (25%), Positives = 116/260 (44%), Gaps = 26/260 (10%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSK---WSCPEAGEVLKQCSVPDWPLLATYLISE 159
           RG++AL++I   E L  +P +++I + +       PE  + L      +W  L   L+ E
Sbjct: 38  RGIIALEDIEIDETLFTIPRTVLINSLNNSLVQDQPELADKLAGLE-NEWDALILVLLYE 96

Query: 160 ASFEKSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRER--------AIERI 207
               K S+W++Y + LP     + + LL+W   +L   L+ S + +R          ER+
Sbjct: 97  YK-RKESKWTDYFNVLPDLDTFEFHELLFWNDEQLSD-LKPSLVLDRIGKDKTVEMYERL 154

Query: 208 TNVIGTYN-----DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
             ++  +N      + +  F+K   +     F++                   ++   ++
Sbjct: 155 VAIVNQWNLEELKGMTMEEFTKIATIIMSYSFDVAQGTEDEDEDEDDEEEEEEVEYIKSM 214

Query: 263 VPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           VP AD LN    +    L Y+K+ Q +V T  +  + GEQV+ +Y    N E+L  YG+V
Sbjct: 215 VPLADTLNADTHLNNAILTYNKN-QDLVMTCIKPIKKGEQVYNTYSDHPNCEILRRYGYV 273

Query: 322 PREGTNPSDSVELPLSLKKS 341
              G+   D  E+PL+L  S
Sbjct: 274 ETTGS-KYDFGEIPLTLITS 292


>gi|390477743|ref|XP_003735352.1| PREDICTED: N-lysine methyltransferase SETD6 [Callithrix jacchus]
          Length = 449

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/283 (22%), Positives = 121/283 (42%), Gaps = 27/283 (9%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+++C   G + ++    
Sbjct: 36  LSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAIL---SQYTCSIGGLLERERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +S W  Y +  P   +    ++W   E  R L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASHWRPYFALWPELGHLKHPMFWPEEERRRLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF--------KWSFGILFSRLVRL 253
              + + ++   Y+ + L     +PDLF   V ++E +         +SF          
Sbjct: 152 AVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLVALVMAYSFQEPLEEEEDE 211

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
              +  V +VP AD+LNH       L+Y      +V T     QP   G ++F +YG+ +
Sbjct: 212 KEPNSPV-MVPAADILNHLANHNANLEYSADCLRMVAT-----QPIPKGHEIFNTYGQMA 265

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           N +L+  YGFV     N  D+ ++ + + +       K EA R
Sbjct: 266 NWQLIHMYGFVEPYPDNTDDTADIQMVIVREAALQGTKTEAER 308


>gi|115472017|ref|NP_001059607.1| Os07g0471100 [Oryza sativa Japonica Group]
 gi|22093661|dbj|BAC06955.1| SET-domain transcriptional regulator family-like protein [Oryza
           sativa Japonica Group]
 gi|50510036|dbj|BAD30661.1| SET-domain transcriptional regulator family-like protein [Oryza
           sativa Japonica Group]
 gi|113611143|dbj|BAF21521.1| Os07g0471100 [Oryza sativa Japonica Group]
 gi|218199573|gb|EEC82000.1| hypothetical protein OsI_25940 [Oryza sativa Indica Group]
          Length = 479

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 72/279 (25%), Positives = 121/279 (43%), Gaps = 51/279 (18%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITA------DSKWSCPEAGEVLKQCSVPDWPLL 152
           D G RGL A +++R+GE +L  P + ++T+      D + +   A  + +  SV     L
Sbjct: 37  DAGGRGLAAARDLRRGELVLRAPRAALLTSGRVMDDDPRIASSVASHLPRLSSVQT---L 93

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
              L+SE    KSS W  Y+S LP   Y++L    A  + +  EA Q+ E        + 
Sbjct: 94  IICLLSEVGKGKSSNWYLYLSQLPSY-YTIL----ATFNDFETEALQVDEAIWVAQKALR 148

Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
           G  +D            F  ++   +++ W+F  + SR + + + D    L P  D+ N+
Sbjct: 149 GIRSDWEEATPLMKGLGFKPKLLMFKSWIWAFATVSSRTLHI-AWDDAGCLCPIGDLFNY 207

Query: 272 SC------------------EVETFLD--------------YDKSSQGVVFTTDRQYQPG 299
           +                   E    LD              Y+  ++  ++   R Y+ G
Sbjct: 208 AAPNDDNSSTDEDRDDMMHQETNKMLDQTDFDSSEKLTDGGYEDVNEYRLYARKR-YRKG 266

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           EQV ++YG  +N ELL  YGF+   G NP++ + +PL L
Sbjct: 267 EQVLLAYGTYTNLELLEHYGFLL--GENPNEKIYIPLDL 303


>gi|10437194|dbj|BAB15011.1| unnamed protein product [Homo sapiens]
          Length = 449

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 118/282 (41%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
                W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QGQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP------- 254
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   ++P       
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQVPLEEEEDE 211

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|432119027|gb|ELK38252.1| SET domain-containing protein 4 [Myotis davidii]
          Length = 339

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 101/225 (44%), Gaps = 21/225 (9%)

Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLA--TYLISEASFEKSSR 167
             R+G+ ++ +P S ++T D+       G  + +   P  PLLA  T+L++E      S 
Sbjct: 29  GAREGQVIISLPESCLLTTDTVIRS-YLGAYIAKWQPPPSPLLALCTFLVAEKHAGDRSP 87

Query: 168 WSNYISALPRQ---PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
           W  Y+  LP+    P  L     A L R LEA     +A E+ T V   +   R R  S 
Sbjct: 88  WKPYLEVLPKAYTCPVCLEPEVVALLPRPLEA-----KAREQRTRVRELFTSSRGRFSSL 142

Query: 225 YPDL--FPEEVFNMETFKWSFGILFSRLV--------RLPSMDGRVALVPWADMLNHSCE 274
            P L      VF+   F+W++  + +R V         L +     AL P+ D+LN+S  
Sbjct: 143 QPLLSEAAASVFSYRAFRWAWCTVNTRAVYMERGRRQGLSAEPDTCALAPYLDLLNNSPA 202

Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           V+    +++ ++     T    +  E+VFI YG   +  LLL YG
Sbjct: 203 VQVKAAFNEETRCYEIRTGSGCRRHEEVFICYGPHDSRRLLLEYG 247


>gi|158254422|dbj|BAF83184.1| unnamed protein product [Homo sapiens]
          Length = 473

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)

Query: 89  PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           PP ++A+ +   V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 60  PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHACSIGGLLERERVAL 116

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 176 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 235

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332


>gi|238550107|ref|NP_001153777.1| N-lysine methyltransferase SETD6 isoform a [Homo sapiens]
 gi|308153495|sp|Q8TBK2.2|SETD6_HUMAN RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
 gi|119603387|gb|EAW82981.1| SET domain containing 6, isoform CRA_b [Homo sapiens]
          Length = 473

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)

Query: 89  PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           PP ++A+ +   V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 60  PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 116

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 176 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 235

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332


>gi|255553959|ref|XP_002518020.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223543002|gb|EEF44538.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 471

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 75/335 (22%), Positives = 138/335 (41%), Gaps = 51/335 (15%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLV---ALKNIRKGEKLLFVPPSLVITADSKWSCP 135
            ++W+   G+     A++ +D  ++  +   AL+ +++GE +  +P +  +T+ +     
Sbjct: 12  FKRWMKSQGISWCSDALELIDAPDQDGIFVKALRALKEGEVVASIPKAACLTSRT----S 67

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            A  +++  S      L+  L+ E S    S W++Y+  LP      L WT  E+D +L 
Sbjct: 68  GARHIIEATSFTGCLGLSFALMYEISLGHLSPWASYLHLLPDSECLPLVWTLDEVDYFLS 127

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
            +++ +   E    +   + +  L +   +  L P+  F    +  +  ++ SR  ++  
Sbjct: 128 GTELHKIVKEDKALIYDDWKECILPLVDVHH-LNPQ-YFGAHQYFAARTLIASRSFQIDD 185

Query: 256 MDGRVALVPWADMLNHSCEVETFL------------------DYDKSSQGVVFTTDRQ-- 295
             G + +VP AD+ NH    E                      + +++   V + DR+  
Sbjct: 186 YHG-IGMVPLADLFNHKTGAEDVHFTCGSSDSDSDDNSNGNHSFTENTVDEVPSDDREIL 244

Query: 296 -------YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK-------KS 341
                   + G +VF +YG   N  LL  YGF   E  NP D V + L L         S
Sbjct: 245 EMIMVKDVKSGAEVFNTYGSAGNAGLLHRYGFT--EPDNPYDIVNIDLDLVFKWSSSLFS 302

Query: 342 DKCYKEKLEALRKYGLSA-----SECFPIQITGWP 371
           D+  + +L   RK G S      +E F I   G P
Sbjct: 303 DRYTRARLSLWRKLGYSGCVSENAEYFEISFDGDP 337


>gi|170067683|ref|XP_001868579.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167863782|gb|EDS27165.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 269

 Score = 65.9 bits (159), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 66/109 (60%), Gaps = 5/109 (4%)

Query: 262 LVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           L+P  DM NH + ++ T   Y++++Q V       Y+ GEQ+FI YG ++N + L+  GF
Sbjct: 41  LIPLWDMANHVNGQITT--GYNEAAQQVESLALGDYRKGEQIFIYYGNRTNADFLVHNGF 98

Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           V  +  N + +V +PLSL  +++ ++++ + L K GL++S  F +Q  G
Sbjct: 99  VYPD--NANSAVAIPLSLNPTEEQFEQRKQLLEKLGLASSGDFNVQRGG 145


>gi|412991387|emb|CCO16232.1| predicted protein [Bathycoccus prasinos]
          Length = 622

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 28/174 (16%)

Query: 111 IRKGEKLLFVPPSLVITA-----DSKWSCPEAGEVLKQCSV-----PDWPLLATYLISEA 160
           I+K E +L V  S+ +TA     D+   C   G++LK+ +      P W  L+ YL+ E 
Sbjct: 156 IKKNESILKVGDSVWMTAEKAREDADGKC---GKILKRLAAQGEAAPAWVELSVYLVCEL 212

Query: 161 SFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
              +SS ++ Y+S L       S L+W+  +++  +  SQ+ + A    + V GTY  L 
Sbjct: 213 EKGESSFYAPYLSYLREATVLESPLFWSTEDVNA-IAGSQLLDDAAGYDSYVRGTYESLN 271

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-RVALVPWADMLNH 271
           L       D  PE+     TF W+FGIL SR  + P  DG  V LVP  DMLNH
Sbjct: 272 LS-----NDGVPED-----TFLWAFGILRSR-AQQPMRDGSEVTLVPGLDMLNH 314


>gi|126305181|ref|XP_001376097.1| PREDICTED: n-lysine methyltransferase SETD6-like [Monodelphis
           domestica]
          Length = 453

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 59/257 (22%), Positives = 111/257 (43%), Gaps = 33/257 (12%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVI----TADSKWSCPEAGEVLKQCS-VPDWPLLAT 154
           V   G+VAL+++++GE L  VP ++++    TA       E G +  Q   VP       
Sbjct: 50  VAGYGMVALEDVQRGELLFVVPRAVLLSQKTTAIRDLLEKEHGALQSQSGWVP-----LL 104

Query: 155 YLISEASFEKSSRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAIERITN 209
             +      + S WS Y S  P     + P   ++W+  EL + L+ + + E     + N
Sbjct: 105 LALLYEYLAEDSPWSCYFSLWPDLGSLQHP---MFWSEGELRQLLQGTGVPEAVQRDLAN 161

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS-------RLVRLPSMDGRVAL 262
           +   Y+ +       +P++FP +  ++E ++    ++ +                    +
Sbjct: 162 ISQEYDAIVQPFLEAHPEIFPPQARSLELYRRLVAMVMAYSFQEPLEEEEDEKEPNPPMM 221

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYG 319
           VP AD+LNH       L+Y      +V T     QP   G+++F +YG+ +N +L+  YG
Sbjct: 222 VPAADILNHVANHNANLEYSPEYLRMVAT-----QPILKGQEIFNTYGQMANWQLVHMYG 276

Query: 320 FVPREGTNPSDSVELPL 336
           F      N  D+ ++ +
Sbjct: 277 FAEPYPGNTDDTADIQM 293


>gi|268535512|ref|XP_002632889.1| C. briggsae CBR-SET-29 protein [Caenorhabditis briggsae]
          Length = 319

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 83/173 (47%), Gaps = 19/173 (10%)

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------RAIERITNVIGTYNDL 217
           ++S WS Y+  LP++ +    +   + D       IR+      + I  I+  +G + ++
Sbjct: 84  ETSAWSPYLKVLPKE-FDTPAFKGIDYDVNTLPLSIRKFWVDQKKEISEISEKVGDHYEV 142

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADM 268
           R +I  +   LFPE     +   W++ ++ +R +         + + DG  +A++P+ DM
Sbjct: 143 RKKIVFQLRRLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDM 200

Query: 269 LNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           LNH  +  +    ++K +   V    RQ   GEQVF+ YG   N  LL+ YGF
Sbjct: 201 LNHDPQKYQGVAIHEKRNGRYVVQAKRQIMEGEQVFVCYGAHDNARLLVEYGF 253


>gi|150864441|ref|XP_001383253.2| hypothetical protein PICST_42613 [Scheffersomyces stipitis CBS
           6054]
 gi|149385697|gb|ABN65224.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 453

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 54/282 (19%)

Query: 92  KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------------------ 127
           K+ I++ +   RG+ A  +I++ +++L +P S ++                         
Sbjct: 34  KIEIKQFEQSGRGIAAKDDIKRSQQILRIPHSFLLNFTTVVSHITRHNSNIKLKEPYYLG 93

Query: 128 ----------ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR 177
                      D   +  ++ E+    ++  + LL+ YL  E     SS W  ++  LP 
Sbjct: 94  IYVPLESTNNNDKFTNIYKSLELQDLLALTSFQLLSLYLCFERQRIHSSFWKPFLEMLPD 153

Query: 178 -QPYSL--LYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDL- 228
              +SL  L W   ++D++ E  Q      + RA +     +  Y  +R  +     DL 
Sbjct: 154 ISDFSLNPLIWQVLQVDQWEELIQFLPESAKRRAEDVYERFLEDYVVVRALVSRILDDLK 213

Query: 229 ----FPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-----ALVPWADMLNHSCEVETFL 279
                 +E   ++ F W++  + SR + +    G+       + P+ D LNHSC  E  +
Sbjct: 214 LSESSADEYIPVDLFLWAWMCINSRCLYMTIPQGKTNADNFTMAPYVDFLNHSCNDECSI 273

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
             D +   V  TT   Y PG+Q+F+SYG   N  LL  YGFV
Sbjct: 274 LIDTTGFHVRTTT--PYMPGDQLFLSYGPHCNEFLLCEYGFV 313


>gi|149044196|gb|EDL97578.1| rCG27725, isoform CRA_b [Rattus norvegicus]
          Length = 538

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 74/311 (23%), Positives = 128/311 (41%), Gaps = 70/311 (22%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            +L      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -ILGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           R L+++Q    AI  +              FS+Y +   +  +            F +++
Sbjct: 195 RCLQSTQ----AIHDV--------------FSQYKNTARQYAY------------FYKVI 224

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
           ++ +           ++ +  CE     D+               Q G+Q++I YG +SN
Sbjct: 225 QITT---------GYNLEDDRCECVALQDF---------------QAGDQIYIFYGTRSN 260

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
            E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHFTEPP 318

Query: 372 LELMAYAYLVV 382
           +     A+L V
Sbjct: 319 ISAQLLAFLRV 329


>gi|109128729|ref|XP_001102146.1| PREDICTED: SET domain-containing protein 6-like isoform 1 [Macaca
           mulatta]
          Length = 481

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 124/282 (43%), Gaps = 26/282 (9%)

Query: 89  PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           PP ++A+ +   V   G+VA ++++ GE L  VP + ++   S+++C   G + ++    
Sbjct: 61  PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQYTCSIGGLLERERGAL 117

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   +    L+ + + E
Sbjct: 118 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPE 176

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y+ + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 177 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 236

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 237 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 291

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 292 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 333


>gi|408390178|gb|EKJ69586.1| hypothetical protein FPSE_10234 [Fusarium pseudograminearum CS3096]
          Length = 456

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/280 (26%), Positives = 120/280 (42%), Gaps = 42/280 (15%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSK 131
           E A+ L +W + +G      ++Q   + E GL   A       + ++ +PPSL ++  + 
Sbjct: 8   ERAAALVQWATSNGATINP-SVQVSHLPETGLSFCATAPTSPFDTIVSIPPSLTLSYLNT 66

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--- 186
               +  +      +   P  ++  +++ +    + S W+ YI ALP QP  +  W+   
Sbjct: 67  LPGRDDPKPFSSNFLAKTPPHVIGRFVLIKHFLLRESFWTPYIQALP-QPNDVDSWSLPP 125

Query: 187 -----RAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPE--EVFNM 236
                 AEL    E + I         NV+  +    DL  R      D  PE  + F +
Sbjct: 126 FWPDEDAEL---FEGTNIEVGVANIKANVMREFRAGCDLLDR-----DDWEPELLKQFTL 177

Query: 237 ETFKWSFGILFSR-----LV-------RLPS---MDGRVALVPWADMLNHSCEVETFLDY 281
             ++W++ I  SR     LV       RLP    +D    L+P  D+ NH    +   + 
Sbjct: 178 PLYQWAYSIFSSRSFRPSLVLGLEDQQRLPENVKLDDFSVLMPLFDVGNHDMTTQVRWER 237

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           D+ S        + YQPGEQ+F +Y  K+N ELLL YGF+
Sbjct: 238 DEKSNDCSLKVGKAYQPGEQIFNNYSMKTNAELLLGYGFM 277


>gi|299115166|emb|CBN75532.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 524

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 73/325 (22%), Positives = 135/325 (41%), Gaps = 58/325 (17%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADSKWSCPE 136
           L  W  + G    K+ ++ +  GE  L  L  + + KGE ++ +P SL +T DS      
Sbjct: 31  LLSWFVEHGGSMTKLCLEDLG-GEMSLSLLTGQALNKGEVVMSIPISLCMTVDS------ 83

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD---RY 193
                         +LA +L++E      S W  Y+  LP    + L W   + +   R 
Sbjct: 84  --------------VLALHLMAERRKGDGSFWKQYLRTLPDDVDTPLRWLVEQAEEEFRL 129

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
           L+ + +   +    + V   + +  L +   +P++     F  E + W+   ++SR    
Sbjct: 130 LDGTMVGLLSRMMHSQVRKDWEEFHLPLVEAHPEILGGVTF--EDYLWAMSSIWSRSFDY 187

Query: 252 RLPSMD----GRVALVPWADMLNH----SCEVETFLDYDKSSQGVVF------------- 290
           + P  D     R A+VP  +  NH    +  +   +++     G+               
Sbjct: 188 QEPGPDDSPCSRRAMVPVINAANHDPSAADSLSEMIEFQAQEGGLSMGIGEPGRARGTLR 247

Query: 291 -TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
            +  R Y   EQ FI YG+ SN +LL SYGFV    +NP   ++  + + ++D  +  K 
Sbjct: 248 VSAGRDYAAREQFFILYGRYSNAKLLYSYGFVL--ASNPYGGLDYWVRVPQTDPGFAWKQ 305

Query: 350 EALRKYGLSASECF----PIQITGW 370
             L ++ L+A++ +     ++  GW
Sbjct: 306 ALLDEHPLTAAQAYDFSGTVRAGGW 330


>gi|328772383|gb|EGF82421.1| hypothetical protein BATDEDRAFT_86633 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 648

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 121/305 (39%), Gaps = 68/305 (22%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL---------------KQCSVP 147
           RG  A K+I    ++ F+P S ++ ++S     E G+ +                +   P
Sbjct: 65  RGAYASKDIPPNSEICFIP-STILLSESDVRASEIGKAILTYIDEHQDAKQKISDKIKHP 123

Query: 148 DWPLL---ATYLISEASFEKS-SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERA 203
              +L   A +++ + S   + S W  Y+++LP+     L WTR  +   L  + +    
Sbjct: 124 HAEILLAMAAFIVHQVSLPTADSHWLPYLASLPKNYALPLMWTRDRIQNLLGGTSLLYMM 183

Query: 204 IER----------ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-- 251
           IER          + N  G Y              FP     +++ +W+   ++SR    
Sbjct: 184 IERLEWIQNSTKVVENACGHY--------------FPTGALTVQSMQWATCSIWSRAFPK 229

Query: 252 RLPSMD---------------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
             PS+D                 + L P  DM NH       +++  + +GV F T    
Sbjct: 230 AKPSLDLQDGSHQDVQDWIGLSEICLFPILDMFNHKRGYR--VEWRMTEKGVSFITPDGI 287

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK---LEALR 353
             G ++  +YG K N  LL +YGFV     NP D  ++ L L++ D  Y  K   LE + 
Sbjct: 288 CKGSELLNNYGPKGNENLLSNYGFVIE--NNPEDYFKVFLGLQQEDPLYTAKKAVLEVVS 345

Query: 354 KYGLS 358
           +  LS
Sbjct: 346 ENDLS 350


>gi|18490888|gb|AAH22451.1| SETD6 protein [Homo sapiens]
          Length = 473

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)

Query: 89  PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           PP ++A+ +   V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 60  PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 116

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 117 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 175

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 176 AVEKDLANISSEYQSIVLPFMEAHPDLFSLGVRSLELYHQLVALVMAYSFQEPLEEEEDE 235

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 236 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 290

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 291 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 332


>gi|320170797|gb|EFW47696.1| hypothetical protein CAOG_05634 [Capsaspora owczarzaki ATCC 30864]
          Length = 903

 Score = 65.5 bits (158), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 74/285 (25%), Positives = 122/285 (42%), Gaps = 30/285 (10%)

Query: 70  IDSLENASTLQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
           ++S +    L +WL ++G+       ++I   +   RG++A + I  G ++L +P  L+I
Sbjct: 381 LESRKIGDNLLQWLHNAGMTSIAENHLSIADFEHTGRGVLANERIEAGVEVLHLPQHLLI 440

Query: 127 TADSKW--SCPEAGEVLKQC--SVPDWPLLATYLISEASFEKS-SRWSNYISALPRQPYS 181
                   S P  G VL        D  LL  Y++ E     S SRW+ +   LP    S
Sbjct: 441 NIHVALDESHP-IGRVLSDLRDEYDDDTLLLLYVLHEKLVAGSASRWAPFFETLPATYNS 499

Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW 241
            L +   EL   LE +++ +   E I + +   ++    +   YP LFP + F  E   W
Sbjct: 500 PLLFHVTEL-LELEGTRLIDETFE-IKDGLRVLHESLGPLAEAYPALFPTDAFTYENLLW 557

Query: 242 SFGILFSRLVRLPSMDGRVA-----------------LVPWADMLNHSCEVE-TFLDYDK 283
              ++ SR ++LP      A                 L+P+ DM+NH      +   YD 
Sbjct: 558 VRAMIDSRAMKLPVPAAAAAVAAAAPEDATETPFVANLIPFVDMINHEEHSHISVRRYDT 617

Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
           S++ +V TT      G Q+ + Y    + + LL YG +  E  NP
Sbjct: 618 SAKALVLTTLGACAAGTQLSLHYSTLPSWQQLLYYGMLSTE-LNP 661


>gi|345565943|gb|EGX48890.1| hypothetical protein AOL_s00079g111 [Arthrobotrys oligospora ATCC
           24927]
          Length = 445

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 116/244 (47%), Gaps = 30/244 (12%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+VA + ++K E++ F+P SL++        P +  +     V     LA Y+ S+  F 
Sbjct: 52  GIVASERVKKNEEITFIPKSLLVNLHD-IPFPNSSPIDHPTKVH--SSLAAYIASQ--FH 106

Query: 164 KSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
           KS     +IS LP     +    L+W+   LD    +  +R  AI++   +   Y     
Sbjct: 107 KSDNNDPFISILPSFSSFKSSMPLFWSNEVLDNC--SPWVRSFAIKQQEKLKDDY----A 160

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDGRVALVPWADMLNH 271
                + +   E  F+ E ++W++  + +R +        ++P+ D  + + P+ D  NH
Sbjct: 161 HALKMHGERGVE--FSKEEYEWAWAAVNTRTIYYRPKKWYKVPAEDC-MTMCPFIDYYNH 217

Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSD 330
             + +       S  G+  TT ++Y  GE++F++YG+ +N  LL+ YGF +P+   N +D
Sbjct: 218 DAKGDESCTVSFSIDGLRVTTQKEYSVGEEIFVTYGEYNNDHLLVEYGFTLPK---NQAD 274

Query: 331 SVEL 334
           ++ +
Sbjct: 275 NMNI 278


>gi|322700433|gb|EFY92188.1| hypothetical protein MAC_01789 [Metarhizium acridum CQMa 102]
          Length = 469

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 85/197 (43%), Gaps = 36/197 (18%)

Query: 155 YLISEASFEKSSRWSNYISALPRQP-------YSLL-YWTRAELDRYLEASQIRERAIER 206
           +LI E      S W  YI ALP QP       ++L  +W   E +  LE + + E  I++
Sbjct: 96  FLIKEYLKRDKSFWHPYIQALP-QPGQGNKSQWALAPFWDDDEAE-LLEGTNV-EVGIDK 152

Query: 207 ITNVIGTYNDLR-----LRIFSKYPDLFP-EEVFNMETFKWSFGILFSRLVRLPSM---- 256
           I N +    DL+     LR+     D     E    E ++W++ I  SR  R PS+    
Sbjct: 153 IRNDVK--RDLKEARELLRLHGDGADGGAFNEALTTELYQWAYCIFSSRSFR-PSLVLSD 209

Query: 257 ------------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                       D    L+P  D+ NH    E   D D   Q       + + PG+QVF 
Sbjct: 210 KQRRMLPEGVDTDDFSVLLPLFDIGNHDMTTEVRWDLDNERQNCELRVGKTHMPGQQVFN 269

Query: 305 SYGKKSNGELLLSYGFV 321
           +Y  K+N ELLL YGF+
Sbjct: 270 NYSMKTNAELLLGYGFM 286


>gi|384248321|gb|EIE21805.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 74/275 (26%), Positives = 116/275 (42%), Gaps = 19/275 (6%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC 134
           + S +  WL  +G+     +I   D   RG+VA K+I  GE ++ VP   V+  ++  SC
Sbjct: 2   DVSKVLAWLRLAGIECSCCSIDVFDGSGRGVVATKDISCGEVVVHVPDESVLMPEN-CSC 60

Query: 135 PEAGEVLKQCSVP-DWPL----LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRA 188
            EA E     +   D  +    L   L++E    KSS+W  Y+  LP+  P   L+W   
Sbjct: 61  SEALEDAGLTNASGDAEMESIGLILALMTEKKLGKSSKWKGYLDFLPKSIPGMPLFWDSE 120

Query: 189 ELDRYLEASQIRERAI------ERITNVIGTYNDLRLRIFSKYPDL-FPEEVFNM-ETFK 240
           +L + LE + + E+        +R       +N + L        L  P    +    + 
Sbjct: 121 QL-QSLEGTSLIEKMNGCKAMPDRPLEPPCKFNSVVLPFLQSNAHLKLPHNAASTRRLYV 179

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPG 299
           W+  ++ +    +   D   A+VP  D LNH +      L +      +          G
Sbjct: 180 WATAMVSAYSFTI-GEDRFQAMVPMWDALNHITGHANVRLHHCARKGALRMIATCLITKG 238

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           EQV  SYG   N ELL  YGFV  +  NP D +E+
Sbjct: 239 EQVINSYGDLPNSELLRRYGFVETD-PNPHDCLEV 272


>gi|340966944|gb|EGS22451.1| hypothetical protein CTHT_0019870 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 499

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 74/251 (29%), Positives = 107/251 (42%), Gaps = 42/251 (16%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA----ELD-RYLEASQIRERAIE 205
           L+  YL  E SF     W  YI+ LP QP  +  WT      E D ++LE +       E
Sbjct: 113 LIKEYLKGENSF-----WWPYIATLP-QPEQVNSWTLPAFWPEDDIQFLEGTNAHVAIGE 166

Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR------------ 252
              N+   Y   R ++  +  + FP  + ++   +KW+F I  SR  R            
Sbjct: 167 IQANIKREYKQAR-KVLKE--ENFPNWKEYSQMLYKWAFSIFTSRSFRPSLILSQSVKDY 223

Query: 253 ----LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
               LPS   +D    L P  D+ NHS       D               Y+PG+QVF +
Sbjct: 224 VSTLLPSAREIDDFSILQPLFDIANHSMTATYTWDTTSDPNCCQLICQDSYRPGDQVFNN 283

Query: 306 YGKKSNGELLLSYGFV-PREGTNPSDSVEL-------PLSLKKSDKCYKEKLEALRKYGL 357
           YG K+N ELLL+YGF+ P   T  +D V +         +  KSD   K+ L +LR    
Sbjct: 284 YGFKTNSELLLAYGFILPETDTLHNDYVHVRKRQQPEGENASKSDDQPKDFLISLRSMND 343

Query: 358 SASECFPIQIT 368
           ++S    I++T
Sbjct: 344 ASSLAGKIRLT 354


>gi|50557134|ref|XP_505975.1| YALI0F28061p [Yarrowia lipolytica]
 gi|49651845|emb|CAG78787.1| YALI0F28061p [Yarrowia lipolytica CLIB122]
          Length = 454

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 71/266 (26%), Positives = 117/266 (43%), Gaps = 49/266 (18%)

Query: 92  KMAIQ--KVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK----WSCPEAGEVLKQCS 145
           K+AI   + D   RG++A ++I + E L  +P S  ++ ++        PEA ++     
Sbjct: 27  KIAIHDYRSDHQGRGVIASEDIEEDEVLFKIPRSSFLSVENDPDFIKQVPEAKKL----- 81

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
              W  L  Y++   S    ++W  Y   LP Q  SL+ WT  EL+  L+ S I    ++
Sbjct: 82  -NSWLQLILYMMKAGSM---TKWKPYFDVLPTQLDSLMMWTDDELEG-LKGSMI----VK 132

Query: 206 RITNVIGTYNDLRLR---IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS------- 255
           +I    G   D + +   I   +P+ F +   ++E+F    G++ +     P        
Sbjct: 133 KIGKA-GAEEDYQEKLKPIIDAHPEYFKDCDTSLESFHRMGGLIMAYSFDAPDSFSEDEE 191

Query: 256 -----------MDGRV-ALVPWADMLN-HS--CEVETFLDYDKSSQGVVFTTDRQYQPGE 300
                       +G V A+VP AD LN H+  C      + D    G   T  +  + GE
Sbjct: 192 DDEDIEHDDLYNEGLVKAMVPLADTLNAHTRFCNANLIAEDD---GGFSMTAIQPIKKGE 248

Query: 301 QVFISYGKKSNGELLLSYGFVPREGT 326
           QV+ +YG+  N + L  YG+V  EGT
Sbjct: 249 QVYNTYGELPNCDFLRRYGYVENEGT 274


>gi|341883062|gb|EGT38997.1| CBN-SET-29 protein [Caenorhabditis brenneri]
          Length = 414

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 101/228 (44%), Gaps = 23/228 (10%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
           G+ A    R G+ ++ +P   +I +      P   + L + +    P+ + T   S   F
Sbjct: 30  GIYATTGFRTGKPIITLPEHDMINSALVVDLPFYKKKLAKINEKMKPMEILTMFFSFEDF 89

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           E+S+ WS Y+  LP+  +    +   + D       IR+  I++   +      LR    
Sbjct: 90  EQSA-WSPYLKVLPKT-FDTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISEKLR---- 143

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNHSC 273
                LFPE     +   W++ ++ +R +         + + DG  +A++P+ DMLNH  
Sbjct: 144 ----HLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNHDP 197

Query: 274 E-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           E  +    ++K +   V    RQ Q GEQ+F+ YG   N  LL+ YGF
Sbjct: 198 EKYQGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 245


>gi|148686778|gb|EDL18725.1| mCG18357, isoform CRA_c [Mus musculus]
          Length = 536

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 125/311 (40%), Gaps = 70/311 (22%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S  +     
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESAKNS---- 137

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 138 -VLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEEEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
           R L+++Q    AI  +              FS+Y +   +  +  +  + + G       
Sbjct: 195 RCLQSTQ----AIHDV--------------FSQYKNTARQYAYFYKVIQITTGY------ 230

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
                          ++ +  CE     D+               Q G+Q++I YG +SN
Sbjct: 231 ---------------NLEDDRCECVALQDF---------------QAGDQIYIFYGTRSN 260

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWP 371
            E ++  GF      N  D V++ L + KSD+ Y  K E L + G+  S  F +  T  P
Sbjct: 261 AEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLARAGIPTSSVFALHSTEPP 318

Query: 372 LELMAYAYLVV 382
           +     A+L V
Sbjct: 319 ISAQLLAFLRV 329


>gi|121719466|ref|XP_001276432.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119404630|gb|EAW15006.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 426

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 93/406 (22%), Positives = 168/406 (41%), Gaps = 68/406 (16%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           E  +   +W    G+    +    +     G++A + I++GE ++ VP +++IT D   S
Sbjct: 8   EEHTQFMQWAKSHGVKINGITPAHIPGRGAGMIATRCIQEGEVMISVPLNIMITID---S 64

Query: 134 CPEAGEVLKQCSVPDWPLLATYL-ISEASFEKSSRWSNYISALP-RQPY--SLLYWTRAE 189
            P +             +LA +L + +  F K  +W ++    P R+ +  S+       
Sbjct: 65  IPASFIKRFPSGTSIHGILAAFLTVGDQKFLK--KWDSWRKVWPSRKDFEESMPILWPGH 122

Query: 190 LDRYLEASQIRERAIER-------ITNVIGTYNDLR-----------------LRIFSKY 225
           L R    S+ + + +ER        + +  T+++++                  R+   +
Sbjct: 123 LRR--SNSRFQAQPLERPYLLPQPASGIWNTFDNIQRDSTSVPKCQSLLSQQETRLQGAW 180

Query: 226 PDL---FPEEVFNMETFKW----SFGILFSRLVRLP--SMDGRVALVPWADMLNHSCEVE 276
            ++   FP   ++  +F W    S    + +  R P    +  + LVP+AD  NH+ + +
Sbjct: 181 RNVLAVFPNMDWDAFSFHWLILNSRSFYYVKPGRQPPDEWNDAIGLVPFADYFNHADDAD 240

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-- 334
           T + +D   +   FT  RQ++ GE++F+SYG  SN  L + YGF      N SD + L  
Sbjct: 241 TEVVFD--GRKYTFTATRQFEKGEEIFMSYGAHSNDFLFVEYGFFLDH--NESDVIFLDD 296

Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFE 392
                L + ++   E  + L  Y ++ +   P  +T   L+ M+              F 
Sbjct: 297 IISKELSEDERKELESQQGLEDYQVTMAGICPRTLTAACLKYMSTE-----------DFR 345

Query: 393 EMAAAASNK-MTSKKDIKC----PEIDEQALQFILDSCESSISKYS 433
           E A   S K   SKK  K      E+ +Q  Q  LDS E  + K S
Sbjct: 346 EYAHGHSTKAFDSKKTWKIIHEWVELYQQECQATLDSLEGILKKRS 391


>gi|332846060|ref|XP_003315172.1| PREDICTED: N-lysine methyltransferase SETD6 [Pan troglodytes]
          Length = 474

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 66/282 (23%), Positives = 121/282 (42%), Gaps = 26/282 (9%)

Query: 89  PPQKMAIQKVD-VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           PP ++A+ +   V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 61  PPAQVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 117

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 118 QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 176

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 177 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 236

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 237 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 291

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 292 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 333


>gi|320170563|gb|EFW47462.1| hypothetical protein CAOG_05400 [Capsaspora owczarzaki ATCC 30864]
          Length = 479

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/314 (23%), Positives = 128/314 (40%), Gaps = 29/314 (9%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLIS---- 158
           RGLVA + I        +P + +I  +      + G V+    + D  +++ +L      
Sbjct: 70  RGLVAKQAIPPKTVFARIPLTALINIEHAM-VSDLGPVIDASDLSDQEIMSVFLWHQLHG 128

Query: 159 ----EASFEKSSRWSNYISALP-RQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIG 212
               E      S W  ++  LP RQ   L + WT  +L  +L+ S +R+ +  RI  +  
Sbjct: 129 CGQVEDGGVAESNWQPFLDTLPDRQEMHLTMLWTPEQL-AHLDGSLLRDFSERRIQVLEA 187

Query: 213 TYNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRVA------LVP 264
           ++   +   F K+P     +   F +E F W   I +SR   +   DG  A      LVP
Sbjct: 188 SFKRHQQSTFGKFPSAESCDWTKFTLEDFLWGMAIGWSRTHAVRVRDGEGAWQTANCLVP 247

Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFT--TDRQYQPGEQVFISYGKKS----NGELLLSY 318
            AD+LN     +   +   + +   F   T  Q    E++   Y   S    N  LL+ Y
Sbjct: 248 VADLLNTDIASKVNAECYTNDESTHFECRTRHQLAQSEELLAQYNADSASIDNHHLLMDY 307

Query: 319 GFVPREGT-NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAY 377
           GFV  + +   + ++  P+ L   D+   ++L  L+++ +      P+  T + L L  Y
Sbjct: 308 GFVLNDDSARRAATIGRPIPLDDPDRA--KRLSVLKQHKMQMGMSLPLSFTDFELPLTYY 365

Query: 378 AYLVVSPPSMKGKF 391
             L      +  K+
Sbjct: 366 RILCARSQVLNSKW 379


>gi|145349216|ref|XP_001419036.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579266|gb|ABO97329.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 476

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 112/244 (45%), Gaps = 22/244 (9%)

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSR----WSNYISALPRQPYSLLYWTRAELDRYL 194
           EVLKQ +     ++ T  ++ A   + SR    W+  + ALPR P + L W    +   +
Sbjct: 118 EVLKQLTAMGDQIIMTIWLAAAMSGQDSRLYEAWAPTLRALPRAPCTALAWDVDTMRLVM 177

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRL 253
           +  Q+ ER I+    V   Y+ L   +  + P+ FP  VF +   F  ++ I  S  +++
Sbjct: 178 DHDQV-ERLIDYQRKVRVQYDALFPALCEQVPEAFPASVFGDYSRFALAYDIWTSYAMKV 236

Query: 254 P---SMDGRVALVPWADMLNHSCEVET--FLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
               ++     +VP   + NH+    +  +   ++ ++       R  +PG+ + ISYG+
Sbjct: 237 QDPQTLQIYEVIVPGVFLCNHALYAHSVRYTSLERGTRAFRLELARGARPGDAITISYGR 296

Query: 309 KSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
             N +L+  YGF +P   +NP D V L      + +  +E+  AL +    ASE   + +
Sbjct: 297 LDNADLMAYYGFTLP---SNPYDRVVLS---SLASQANEEQTAALAR----ASEMCGVDL 346

Query: 368 TGWP 371
           T  P
Sbjct: 347 TELP 350


>gi|145354720|ref|XP_001421625.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581863|gb|ABO99918.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 375

 Score = 65.1 bits (157), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 119/277 (42%), Gaps = 32/277 (11%)

Query: 118 LFVPPSLVITADSKWSCPEAGE----VLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
           + +P + V+  ++  + P  G     +L++  V +   +  + + E    + S W  YI 
Sbjct: 1   MTIPDAAVVNWNTAAAHPTLGSTFESLLRRGVVDERLAVMCFFMIERRRGEESAWKEYID 60

Query: 174 ALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL---RLRIFSKYPD--- 227
           +LPR   + L ++  EL+R L  + +      +  +V   + +     +R  ++  +   
Sbjct: 61  SLPRAYDAPLSFSDEELERELSGTTVYAPVKAQKAHVKKMFEECVRPAMRELTQADNAAG 120

Query: 228 ----LFPEEVFNMETFKWSFGILFSRLVRLP-SMDGRV---ALVPWADMLNHSCEV---- 275
               + P+   + + F W+F   +SR + +P    G V   ++VP  DM+NH+       
Sbjct: 121 SSLHMLPD--VSEKEFAWAFQTFWSRALAIPVGAGGSVTVDSVVPGVDMVNHAPRARANA 178

Query: 276 --ETFLDYDKSSQGVVFTT----DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
             E   D  +   G V       +R  + GE++FI+YG KSN ELL +YGF  ++     
Sbjct: 179 RWEHVEDSSRPDGGYVALVSAPPNRTMKDGEEIFINYGDKSNEELLFTYGFALKDNAVEE 238

Query: 330 DSVELP--LSLKKSDKCYKEKLEALRKYGLSASECFP 364
             V  P      +  +    ++E LR  GL      P
Sbjct: 239 RMVFFPPWAGDAEHSEDVTRRIELLRAKGLPQHVVLP 275


>gi|307107162|gb|EFN55406.1| hypothetical protein CHLNCDRAFT_134525 [Chlorella variabilis]
          Length = 705

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 49/203 (24%), Positives = 91/203 (44%), Gaps = 12/203 (5%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGER------GLVALKNIRKGEKLLFVPPSLVITADSKW 132
            Q W+   G+ P   AI+ VD  +       G+ A++++ +GE+L  +P +  ++  +  
Sbjct: 9   FQAWMESVGIEPNSDAIELVDAAQGCSGLALGVRAVRDVAEGERLCAIPKAACLSIRTT- 67

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
              +  +V++   +     L   ++ E S    SRW  Y +ALP + Y  L+W+ A+L R
Sbjct: 68  ---QLADVIEAEELGGGLGLVLAVMHEMSLGAESRWHGYFAALPPREYLPLFWSDAQL-R 123

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            L  +++   A          + +  L +  KYP         ++ F+ +   + SR   
Sbjct: 124 LLAGTELEGSAESDREASAEDFEEHVLPLLHKYPGRLRPAACTLDRFRVAASFVGSRAFC 183

Query: 253 LPSMDGRVALVPWADMLNHSCEV 275
           +    G  A+VP AD+ NH   V
Sbjct: 184 VDEWHGD-AMVPLADIFNHKASV 205


>gi|397614297|gb|EJK62711.1| hypothetical protein THAOC_16665 [Thalassiosira oceanica]
          Length = 467

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 68/276 (24%), Positives = 118/276 (42%), Gaps = 51/276 (18%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKVD----VGERGL----VALKNIRKGEKLLFVPPSLVI 126
           N + L+ W + +G       IQKVD      E GL    +   +I  G  ++F+P  + +
Sbjct: 54  NMAQLEDWGAQNG-------IQKVDGLELYSEDGLDWQYITTVDIPAGTTIMFIPAGVCL 106

Query: 127 TADS-------------KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
            + +             + +  + G +  Q SV D+ L    L++E   E +S +  +I 
Sbjct: 107 ASSAVEAELKAASNGGMQAAIDQLGRIGGQNSVADFNLFVK-LLAEYEQEDNSPFLPWID 165

Query: 174 ALPRQPYSLLYWTRAELD----RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF 229
           +LPR  Y+ +  T    +         S++ +   +    V+G     ++ I S Y    
Sbjct: 166 SLPRLFYNAVSMTDFCYECLPPLVFSLSRLEKVKFDNFKQVLG-----KVDIVSDYVKQ- 219

Query: 230 PEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWADMLNHSCEVETFLDYDKSSQ 286
                N E  KW+F  +++R       DG+   V L+P  DM NH  E E  + +D+   
Sbjct: 220 -----NDEVLKWAFNSVYTR--AYADKDGQGSDVTLIPMGDMFNHGTETEIEVYFDEGGN 272

Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLS-YGFV 321
           G+V+T+         + ISYG  +N   L + YGF+
Sbjct: 273 GMVYTS-ADVAANSPLRISYGCPTNPSFLFARYGFL 307


>gi|428162643|gb|EKX31766.1| hypothetical protein GUITHDRAFT_149078 [Guillardia theta CCMP2712]
          Length = 581

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 118/264 (44%), Gaps = 28/264 (10%)

Query: 82  WLSDSGLPPQKMAIQKVDVGERGL-VALKNIRKGEKLLFVPPSLVITA---DSKWSCPEA 137
           W+ + G     + +++ + G+ G+  +   I KG +L+ VP   ++     D +    + 
Sbjct: 32  WVRERGGEVGPIVLREGEGGDCGVFTSSAKINKGHELVKVPTCCLLLGRQEDIEGMKLKL 91

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP-YSLL--YWTRAELDRYL 194
               + C       LA  L+   + ++SS +  YIS LP Q  ++ L  +W+R E +  L
Sbjct: 92  NRGERDCERD--VALALALLHHRNLKESSAFHAYISTLPPQDLFTSLPAWWSREEREELL 149

Query: 195 EASQIRERAIERITNVIGTYNDLRL--RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            +S++ + A    +N    Y +L+   R+ S   +           F W+   + SR   
Sbjct: 150 GSSELADAATTMASNADQDYEELKAAGRMSSSKGE-----------FLWALACVSSRSFD 198

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDK----SSQGVVFTTDRQYQPGEQVFISYGK 308
              + G V +VP  D  NH    +T   Y +    +  G V T+ R     E+V+I+YG 
Sbjct: 199 ADEL-GEV-MVPILDCFNHKRPRDTAYSYRREEAPARAGFVLTSLRDLGEEEEVYIAYGA 256

Query: 309 KSNGELLLSYGFVPREGTNPSDSV 332
           K + ELLL+YGF   +   P  S+
Sbjct: 257 KGSRELLLNYGFCVMDNVEPDGSM 280


>gi|255088291|ref|XP_002506068.1| set domain protein [Micromonas sp. RCC299]
 gi|226521339|gb|ACO67326.1| set domain protein [Micromonas sp. RCC299]
          Length = 513

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 122/292 (41%), Gaps = 30/292 (10%)

Query: 72  SLENA--STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-- 127
           +L+NA  S+ Q+W    G     +++ +      GL A +++  GE+L+ +P  L +T  
Sbjct: 41  ALQNAMLSSFQRWFEAYGGTTSGISMVQTPSIGWGLTASRDVDVGERLILLPRVLQMTYS 100

Query: 128 ---ADSKWSCPEAGEVLKQ------------CSVPD--WPL-LATYLISEASFEKSSRWS 169
               +S  S  +A   L +              +PD  W + L   L+ E +    S + 
Sbjct: 101 LQDRESTSSSDQATAELDREPDTPLYLKELIAQIPDELWSVRLGLALLHERALGGKSPFF 160

Query: 170 NYISALPRQPYSL-LYWTRAELD--RYLE-ASQIRERAIERITNVIGTYNDLRLRIFSKY 225
            YIS LP     L L++    +D  +YL    Q++ R+   I    G   ++      + 
Sbjct: 161 QYISLLPAMHRGLPLFFGPEAVDALQYLPLVVQVKRRSRFLIDYSSGPLKNVTAGKNGET 220

Query: 226 PDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV--ETFLDYD 282
             + F       +   W+F    SR  R+       A++P  D+ NHS E   E      
Sbjct: 221 ESVPFNGYSVGADALGWAFACASSRAFRVAGEGKPAAMLPLIDVANHSFEASAEVRAAMG 280

Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           +    +     R  + G++V ++YG  SN   LL YGFVP +G N  D+  L
Sbjct: 281 EGPGAIEMVASRPLRAGDEVTLNYGNLSNDHFLLDYGFVP-QGINKHDTASL 331


>gi|354495008|ref|XP_003509624.1| PREDICTED: N-lysine methyltransferase SETD6-like [Cricetulus
           griseus]
          Length = 492

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 70/306 (22%), Positives = 129/306 (42%), Gaps = 29/306 (9%)

Query: 65  PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
           P G + D    A  L +W +  GL   P+    ++  V   G+VA ++++ GE L  VP 
Sbjct: 50  PRGGQSDGDAVAGFL-RWCAGVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 108

Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
           S ++   S  +C  +G + ++     S+  W +     +       +S WS Y +  P  
Sbjct: 109 SALL---SPHTCSISGLLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPEL 164

Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
            +    ++W   E  R L+ + + E   + + N+   Y+ + L     + DLF   V ++
Sbjct: 165 GRLEHPMFWPEEERRRLLQGTGVPEAVEKDLVNITSEYHSIVLPFMEAHSDLFSPTVRSL 224

Query: 237 ETFKWSFGILFSRLVRLP--------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
           E ++    ++ +   + P               +VP AD+LNH       L+Y      +
Sbjct: 225 ELYRQLVALVMAYSFQEPLEEEDDDEKEPNSPLMVPAADLLNHIANHNANLEYSADYLRM 284

Query: 289 VFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCY 345
           V T     QP   G ++F +YG+ +N +L+  YGF      N  D+ ++ +   +     
Sbjct: 285 VAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVRDAALQ 339

Query: 346 KEKLEA 351
             K EA
Sbjct: 340 GTKDEA 345


>gi|224077384|ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
 gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
          Length = 518

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 112/284 (39%), Gaps = 70/284 (24%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---CSVPDWPLLATY 155
           D G RGL A+++++KGE +L VP S++IT DS     +    +      S+    +LA  
Sbjct: 78  DAGGRGLAAVRDLKKGELVLRVPKSVLITRDSLLKDEKLCSFVNNNTYSSLSPTQILAVC 137

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ E    KSS W  Y+  LPR  Y +L    A   + +  ++   +    + +      
Sbjct: 138 LLYEMGKGKSSWWYPYLMHLPRS-YDVL----ASFKKAVSKAKSEWKEANSLMDA----- 187

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
            L+L+           ++     + W+   + SR + +P  D    L P  D+ N++   
Sbjct: 188 -LKLK----------PQLLTFRAWIWASATISSRALHIP-WDEAGCLCPVGDLFNYAAPG 235

Query: 276 ETFLD-------------------------------------------YDKSSQGVVFTT 292
           E   D                                           +D++     F  
Sbjct: 236 EESNDLENVVHWMNASSLEDSSLSNGETTDDFIGDQPDIGLERLTDGGFDENMAAYCFYA 295

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            + Y+ G QV + YG  +N ELL  YGF+  E  NP+D V +PL
Sbjct: 296 RKNYKKGTQVLLGYGTYTNLELLEHYGFLLNE--NPNDKVFIPL 337


>gi|449453201|ref|XP_004144347.1| PREDICTED: N-lysine methyltransferase setd6-like [Cucumis sativus]
          Length = 500

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/200 (22%), Positives = 95/200 (47%), Gaps = 8/200 (4%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
            ++W++  G+     A+Q  D  + G+   AL ++R+G+ +  VP    +T  +      
Sbjct: 9   FKRWMTSQGIQCSD-ALQFTDTPDNGISVKALYDLREGDVVANVPKLACLTVKT----TS 63

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
           A  ++++  +  +  L+  L+ E S  ++S W+ Y+  LP +    L W+  ++D++L  
Sbjct: 64  ASSIIEEVGLGGYLGLSVALMYERSLGENSNWAGYLQLLPDKECVPLLWSLQDVDQFLCG 123

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           +++ +   E  T +   + +  L +    P +F  E F +E +  +  ++ SR   +   
Sbjct: 124 TELHKTVKEDKTLMYEDWKENILPLMMSAPLMFSPEFFGIEQYFSARSLISSRSFDIDDF 183

Query: 257 DGRVALVPWADMLNHSCEVE 276
            G   +VP AD+ NH    E
Sbjct: 184 HG-FGMVPLADLFNHKTNAE 202


>gi|422293679|gb|EKU20979.1| set domain containing protein, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 193

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 239 FKWSFGILFSRLVRLPSMDGRVA-LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
           FKW+   + SR   L     R A LVP+ADMLNH    ET   +D + Q    TT +   
Sbjct: 77  FKWARMCVCSRNFGLEVNRIRTAALVPYADMLNHQRPRETKWTFDNARQAFTITTLQPIA 136

Query: 298 PGEQVFISYGKKSNGELLLSYGFV 321
           PG QV+ SYG+K N   LL+YGF 
Sbjct: 137 PGAQVYDSYGQKCNHRFLLNYGFA 160


>gi|72389450|ref|XP_845020.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62176703|gb|AAX70803.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801554|gb|AAZ11461.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 586

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
           F M+ F W++  L SR     S D  V A++PW D  NHS      + +D+     +F T
Sbjct: 255 FTMQQFIWAYNTLMSRGF---SYDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIFVT 311

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
                 G+QVF+ YG  ++ EL+L YGF+    T PS
Sbjct: 312 TAPVSKGDQVFLQYGSYTDAELVLWYGFI----TTPS 344


>gi|302417794|ref|XP_003006728.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261354330|gb|EEY16758.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 457

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 120/294 (40%), Gaps = 43/294 (14%)

Query: 79  LQKWLSDSG--LPPQKMAIQKVDVGERGLVAL-KNIRKGEKLLFVPPSLVIT----ADSK 131
           L  W S  G  L P        D G    V + + +R GE ++  P SL ++     D K
Sbjct: 4   LTSWASSHGAELHPAIEIFNDNDTGNSFRVKVGQQLRPGETIVTCPFSLTLSFLNALDLK 63

Query: 132 WSCPEAGEVLKQC------SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYW 185
               E+ +  +        +VP   +   +LI +    + S W  YI  LP QP  L  W
Sbjct: 64  SHGHESHDDTQPLPREFVETVPPHIVARFFLIKQYLLGRESFWYPYICTLP-QPDQLSSW 122

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP--EEVFNMETF---- 239
           +   L    +   + +      TN+     +++ R+ ++Y    P  E + N   +    
Sbjct: 123 SLPPLWPSDDIELLED------TNIHTAVAEIKARLKAEYKQATPLLEALPNANDYTRLL 176

Query: 240 -KWSFGILFSRLVR------------LP---SMDGRVALVPWADMLNHSCEVETFLDYDK 283
             W++ I  SR  R            LP   ++D    L+P  D+ NHS   +   D   
Sbjct: 177 YHWAYSIFTSRSFRPSRVVPDHESLPLPEGCAIDDFHILMPLFDVGNHSHSAKISWDIAP 236

Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPL 336
            +   V  T   Y  G QVF +YG K+N EL+L+YGF +P   T  +D V L L
Sbjct: 237 GTSTTVLKTLDAYDSGAQVFNNYGSKTNAELMLAYGFLIPESPTVHNDFVHLQL 290


>gi|297698886|ref|XP_002826530.1| PREDICTED: N-lysine methyltransferase SETD6 isoform 2 [Pongo
           abelii]
          Length = 449

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 120/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFMVPRAALL---SQHTCSIGGLLERERVAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E ++    ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYRQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|410983655|ref|XP_003998153.1| PREDICTED: N-lysine methyltransferase SETD6, partial [Felis catus]
          Length = 417

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 67/283 (23%), Positives = 123/283 (43%), Gaps = 27/283 (9%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G +L+Q    
Sbjct: 4   LSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SQHTC-SIGGLLEQERGA 59

Query: 144 -CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
             S   W +     +       +S WS Y +  P   +    ++W   E  R L+ + + 
Sbjct: 60  LQSQSGW-VPLLLALLHELQAPASPWSPYFAMWPELGRLEHPMFWPEEERRRLLQGTGVP 118

Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
           E   + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +   
Sbjct: 119 EAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEED 178

Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
                   +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +
Sbjct: 179 EKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPKGHEIFNTYGQMA 233

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           N +L+  YGFV     N  D+ ++ +   +       K+EA R
Sbjct: 234 NWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALLGTKVEAER 276


>gi|426243560|ref|XP_004015620.1| PREDICTED: N-lysine methyltransferase SETD6 [Ovis aries]
          Length = 450

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 72/307 (23%), Positives = 132/307 (42%), Gaps = 28/307 (9%)

Query: 65  PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
           P G + D    AS L  W    GL   P+    ++  V   G+VA ++++ GE L  VP 
Sbjct: 13  PAGSDEDPAPVASFL-SWCQRVGLELSPKVAVSRQGTVAGYGMVARESVQPGELLFAVPR 71

Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
           + ++   S+ +C  +G + ++     S   W  L   L+ E     +S W  Y +  P  
Sbjct: 72  AALL---SQHTCSISGVLERERGALQSQSGWVPLLLALLHEMQ-APASLWRPYFALWPEL 127

Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
            +    ++W   E  R L+ + + E   + + N+   Y  + L     + DLF   V ++
Sbjct: 128 GRLEHPMFWPEEERRRLLQGTGVPEAVEKDLANIRSEYYSIVLPFMDAHADLFSPRVRSL 187

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVV 289
           E ++    ++ +   + P  +           +VP AD+LNH       L+Y  +   +V
Sbjct: 188 ELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPTCLRMV 247

Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYK 346
                  QP   G ++F +YG+ +N +L+  YGF      N +D+ ++ +   +      
Sbjct: 248 -----AIQPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTNDTADIQMVTVREAALQG 302

Query: 347 EKLEALR 353
            K+EA R
Sbjct: 303 TKVEAER 309


>gi|299115489|emb|CBN75653.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 451

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 66/275 (24%), Positives = 113/275 (41%), Gaps = 24/275 (8%)

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS 161
           ERG+   +NI     ++ VP   ++T DS    P  G +++  +  D  L    L     
Sbjct: 38  ERGVFCEENIPAETIVVSVPWEALMTVDSAKGTPFEG-LMEAGAREDDVLCLLLLYHRHI 96

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
            ++ S    ++  LPR+ +  ++++  EL+       +R  ++  +T       D   R 
Sbjct: 97  LKERSPLKGHMDVLPREYHQTIFYSDDELE------LLRGTSLHAVTVQWKAQVDTDFRE 150

Query: 222 FSKYPDLFP--------------EEVFNMETFKWSFGILFSRLVRLPSMD-GRVALVPWA 266
               P   P              E     E + W+ G ++SR V +     G  A+ P  
Sbjct: 151 LEALPLPSPRSEEGGSSTARDALEGFLTKEEYLWALGTVWSRFVTVERAGRGLKAMAPVF 210

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           DM NH     T   Y +S+  +   T + +  G +V  SYG   N  LLL +GF   +  
Sbjct: 211 DMFNHGPLSSTVHGYQESNDCLHLVTLQDWASGSEVKFSYGPLPNSRLLLLHGFCLPD-- 268

Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           NP +SVEL   ++     + EK + +   G+  S+
Sbjct: 269 NPFESVELWAMMEPGAPGFAEKNKIMLDNGVDPSK 303


>gi|320166344|gb|EFW43243.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 514

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 111/265 (41%), Gaps = 24/265 (9%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC----------SVPDWPLLA 153
           GLV     R+GE ++ +PP       +++  P     L+            ++     LA
Sbjct: 88  GLVLNAPARRGEAIVTLPPR------ARFRVPAFDSALRSLIDEFNEQHDNAIDPMTALA 141

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
             L+ E S    S W  ++  LP    S+L W   EL   +E   ++E   ERI N+   
Sbjct: 142 LGLMYERS-RADSPWRAWLRMLPDPIESMLEWNDVEL-WPVEQLYVKELREERIRNLEAV 199

Query: 214 YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC 273
           Y  +       Y        F +E F W+  I  +R +     +G ++L+P  DM+NH  
Sbjct: 200 YESVITPFIDTYESDLVGVDFTIEAFVWAAVIAQTRGLHESEKNG-LSLLPIVDMINHHR 258

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
           E    +    S   ++  T    + GE++ I Y + S+  LLL YGFV  E +   D   
Sbjct: 259 EPNAVV--VASGPNILVRTKTSLKAGEEITIDY-EMSSHVLLLLYGFV--EMSENLDFYP 313

Query: 334 LPLSLKKSDKCYKEKLEALRKYGLS 358
           + LS +  D  Y  +L  L   GLS
Sbjct: 314 IRLSWESKDIDYPRRLRLLEGRGLS 338


>gi|46117158|ref|XP_384597.1| hypothetical protein FG04421.1 [Gibberella zeae PH-1]
          Length = 456

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 72/278 (25%), Positives = 118/278 (42%), Gaps = 38/278 (13%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSK 131
           E AS L +W + +G      ++Q   + E GL   A       + ++ +PP+L ++    
Sbjct: 8   ERASALVQWATSNGATINP-SVQVSHLPETGLSFCATAPTSPFDTIVSIPPTLTLSYLDT 66

Query: 132 WSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--- 186
               +  +      +   P  ++  +++ +    + S W+ YI ALP QP  +  W+   
Sbjct: 67  LPGRDDPKPFSSNFLVKTPPHVIGRFVLIKHFLLRESFWTPYIQALP-QPNDVDSWSLPP 125

Query: 187 -----RAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMET 238
                 AEL    E + I         NV+  +    DL  R     P L  +  F +  
Sbjct: 126 FWPDEDAEL---FEGTNIEVGVANIKANVMREFRAGCDLLDRD-DWEPQLLKQ--FTLPL 179

Query: 239 FKWSFGILFSR-----LV-------RLPS---MDGRVALVPWADMLNHSCEVETFLDYDK 283
           ++W++ I  SR     LV       RLP    +D    L+P  D+ NH    +   + D+
Sbjct: 180 YQWAYSIFSSRSFRPSLVLGPEDQQRLPEGVKLDDFSVLMPLFDVGNHDMTTQVRWERDE 239

Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            S        + YQPGEQ+F +Y  K+N ELLL YGF+
Sbjct: 240 KSSDCSLKVGKAYQPGEQIFNNYSMKTNAELLLGYGFM 277


>gi|261328372|emb|CBH11349.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 586

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 50/97 (51%), Gaps = 8/97 (8%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
           F M+ F W++  L SR     S D  V A++PW D  NHS      + +D+     +F T
Sbjct: 255 FTMQQFIWAYNTLMSRGF---SYDPEVWAVIPWVDYFNHSLTNNATMRFDRCMGAYIFET 311

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS 329
                 G+QVF+ YG  ++ EL+L YGF+    T PS
Sbjct: 312 TAPVSKGDQVFLQYGSYTDAELVLWYGFI----TTPS 344


>gi|344290687|ref|XP_003417069.1| PREDICTED: N-lysine methyltransferase SETD6-like [Loxodonta
           africana]
          Length = 452

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/290 (23%), Positives = 125/290 (43%), Gaps = 27/290 (9%)

Query: 82  WLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           W    GL   P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G 
Sbjct: 31  WCGGVGLELSPKVAVSRQGTVAGYGMVAQESVQPGELLFAVPRAAIL---SQHTCCIGGL 87

Query: 140 VLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY 193
           + ++     S   W +     +       SS WS Y +  P   +    ++W   E  R 
Sbjct: 88  LERERGALQSQSGW-VPLLLALLHELQAPSSPWSPYFALWPELSRLEHPMFWPEEEWRRL 146

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL 253
           L+ + + E   + + N+   Y  + L     +P+LF   V ++E ++    ++ +   + 
Sbjct: 147 LQGTGVPEAVEKDLANIRSEYYSIVLPFMEAHPELFSPCVRSLELYQQLVALVMAYSFQE 206

Query: 254 PSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVF 303
           P  +           +VP AD+LNH       L+Y      +V T     QP   G+++F
Sbjct: 207 PLEEEEDEKEPNSPLMVPAADILNHLANHNAHLEYSPDCLRMVAT-----QPIPKGQEIF 261

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +YG+ +N +L+  YGFV     N  D+ ++ +   +       K+EA R
Sbjct: 262 NTYGQMANWQLIHMYGFVEPYPGNTDDTADIQMVTVREAALQGTKVEAER 311


>gi|367016539|ref|XP_003682768.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
 gi|359750431|emb|CCE93557.1| hypothetical protein TDEL_0G01900 [Torulaspora delbrueckii]
          Length = 573

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/336 (25%), Positives = 141/336 (41%), Gaps = 49/336 (14%)

Query: 72  SLENASTLQKWLSDSG-LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           +L+   T  +W  D G +   ++  +         +A   I+  E L+ VP +L+IT + 
Sbjct: 3   NLDQLKTCVEWCKDHGAIIDDRLEFKVTQAAGVTAIAKSVIKTTEPLISVPANLLITKE- 61

Query: 131 KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPR---QPYSLLYW 185
             +  E G      S  +   L     ++  F+ S+R     Y   LP    QPY   +W
Sbjct: 62  -LAEKEFGSASGAVSSENPNALVQLFTAKMKFDPSARPFHKPYFDILPTKLDQPY---FW 117

Query: 186 TRAELDRYLEASQIRERAIERITNVIGTYNDL--RLRIFSKYPDLFPE-EVFNMETFK-- 240
              E++  L+ + I     + +  ++  ++ L  +L++  +  +L+ + E  + +  K  
Sbjct: 118 KLQEVE-LLKGTDIYLLMKQNLRKIVKEWHVLLDQLKLKPEDGELYEQSEAQDFDILKYI 176

Query: 241 -------------------WSFGILFSR------LVRLPSMDGRVALVPWADMLNHSCEV 275
                              W+ GI  SR      L    S      L P  D+LNH  + 
Sbjct: 177 CEYREQHKSISWKSFVGYLWATGIFTSRAFPKLILEEKCSSINEAFLYPLVDLLNHKND- 235

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
            T + +  ++  V F +    + GE+VF +YG+KSN +LLLSYGFV  +  NP D   L 
Sbjct: 236 -TKVKWTFTNDNVCFVSQEIMKEGEEVFNNYGEKSNEDLLLSYGFV--QDQNPYDLTRLT 292

Query: 336 LSLKKS--DKCYKEKLEALRKYGLSASECFPIQITG 369
           L L K   D+    +L    K  + A +C   QIT 
Sbjct: 293 LRLTKEMIDEALNAELGFSEKNKV-ADDCVQFQITA 327


>gi|294659704|ref|XP_462118.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
 gi|199434171|emb|CAG90604.2| DEHA2G13354p [Debaryomyces hansenii CBS767]
          Length = 480

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/233 (28%), Positives = 103/233 (44%), Gaps = 32/233 (13%)

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPR-QPYSL--LYWTRAELDRYLEASQI----RE 201
           + LL+ Y+  E     SS W  +I  LP    + L  L W   ++D Y E  ++     +
Sbjct: 146 FQLLSFYICFEKQRGSSSFWKPFIDMLPETSDFDLAPLVWKVLKVDHYEELLKLLPNSTK 205

Query: 202 RAIERITNVIGT-YNDLRLRIFSKYPDLFPEEVFN-----------METFKWSFGILFSR 249
           R +++I +   T YN ++  I  K  ++   E  N           +E + WS+  + SR
Sbjct: 206 RHMDKIYDRFQTDYNVVKDLISIKLKEISDNERSNDLTDAIRHLVPIELYLWSWMCINSR 265

Query: 250 LVRLPSMDGRVA-----LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
            + +     + A     + P+ D LNHSC+ +  L  D +  G    T   Y P EQ+F+
Sbjct: 266 CLYMEIPQSKNAADNFTMAPYVDFLNHSCDDQCGLKIDGT--GFQVYTTCSYNPDEQLFL 323

Query: 305 SYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYG 356
           SYG  SN  LL  YGF +P    N  D  +  L L K      +++E L  Y 
Sbjct: 324 SYGPHSNEFLLCEYGFTLPENKWNDLDVSDYILPLMKP-----QQIEFLNDYS 371


>gi|302845036|ref|XP_002954057.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
           nagariensis]
 gi|300260556|gb|EFJ44774.1| hypothetical protein VOLCADRAFT_94881 [Volvox carteri f.
           nagariensis]
          Length = 598

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 45/143 (31%), Positives = 66/143 (46%), Gaps = 30/143 (20%)

Query: 239 FKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNHSC----------------EVET-- 277
           F+W+  ++ SR     +  G V    LVP  DMLNH                  EV T  
Sbjct: 173 FRWALSVVHSRTFANAAPGGGVGVRMLVPLVDMLNHGGDTAAQGSLGLVGPGGGEVATDN 232

Query: 278 ----FLDYDKSSQG---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
                L  D+SS G   +  +  R   PG+++ +SYG++ N +  L YGFVPR   NP D
Sbjct: 233 VRWDLLPPDRSSAGGWSMAVSATRDIHPGQELLLSYGERPNDDFFLHYGFVPR--ANPHD 290

Query: 331 SVELPLSLKKSDKCYKEKLEALR 353
              L   L+ + + + E+L AL+
Sbjct: 291 DAVLWPDLEAALEWHYERLGALQ 313


>gi|449520517|ref|XP_004167280.1| PREDICTED: LOW QUALITY PROTEIN: sulfate transporter 4.1,
           chloroplastic-like, partial [Cucumis sativus]
          Length = 923

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 103/214 (48%), Gaps = 19/214 (8%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLL 152
           ++I K  +G R L A + IR G+ +L VP ++ I+ DS    P     L    + +   L
Sbjct: 724 LSIGKSSIG-RFLFASETIRAGDCILKVPFNVQISPDS---LPLPIRDLLGNEIGNVAKL 779

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
           A  ++ E      S W+ YI  LP QP+ +   ++W  +EL+  +  S + E ++ + + 
Sbjct: 780 AVVVLLEHKLGLGSEWAPYIIRLP-QPWEMHNTIFWKESELEM-IRKSSLYEESLNQRSQ 837

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWAD 267
           +   +  +R     K  + FPE +   + + F  ++ ++ SR  R  S +G V+L+P+AD
Sbjct: 838 IKREFLAIR-----KALEAFPEIIDRISCDDFMHAYALVTSRAWR--STEG-VSLIPFAD 889

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
            LNH    E  L  D   Q      DR + PGE 
Sbjct: 890 FLNHDGASEAMLLNDDDKQLSEVVADRDFAPGEH 923


>gi|332227974|ref|XP_003263165.1| PREDICTED: N-lysine methyltransferase SETD6 [Nomascus leucogenys]
          Length = 449

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 121/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE LL VP + ++   S+ +C   G + ++    
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLLVVPRAALL---SQHTCSIGGLLDRERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|168016200|ref|XP_001760637.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687997|gb|EDQ74376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 450

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 71/286 (24%), Positives = 113/286 (39%), Gaps = 42/286 (14%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGER---GLVALKNIRKGEKLLFVPPSLVITADSKWSCP 135
            + W+  +G+  +   I+    GE    GL A K+  +G  L+  P  L IT  +    P
Sbjct: 18  FRDWMQINGVQSRFCEIRPSSNGENAGFGLFATKDNAQG-VLMVTPLLLAITPMTVLQDP 76

Query: 136 EAG----EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
           E G    +++++  V D  L+  +L+ E +  + S W+ Y+  LP +  + L ++  EL 
Sbjct: 77  ELGGHYCKLMEEGEVDDRLLIMLFLVIERARGRFSFWAPYLEILPFKFGTPLSFSEEELS 136

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM---ETFKWSFGILFS 248
                   +    +  T +I     LR  +  +   +F     N+    +F   F +   
Sbjct: 137 ELKGTHLFQATQQQSTTGLI-----LRCPVLDRANSVFWTRALNIPCPHSFNNRFAVDLD 191

Query: 249 RL---------------VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
                            V++PS      LVP  D  NH  +     + D     V    +
Sbjct: 192 STTHKKPEESSAADTDDVKIPSSVWVEGLVPGIDFCNHDLKAVALWEVDGPEGSVTGVPN 251

Query: 294 RQY---------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
             Y           G ++FISYG KSN ELL  YGFV  E  NP D
Sbjct: 252 SMYLVTGLDVVISNGSEIFISYGNKSNEELLYLYGFVLVE--NPDD 295


>gi|118357514|ref|XP_001012006.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila]
 gi|89293773|gb|EAR91761.1| hypothetical protein TTHERM_00808050 [Tetrahymena thermophila
           SB210]
          Length = 454

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 28/235 (11%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWS-CPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           LVA +++ +GE+LL +P +L IT     +  P   EV    +V    +LA +L  E   +
Sbjct: 39  LVAKQSVNEGEELLRIPETLFITLSVAITKLPILREVKSNLNVQKKSILAFFLFKEKK-D 97

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELD--------RYLEASQIRERAIERITNVIGTYN 215
            SS +  Y++++P+Q  + + W   + +        ++ +  Q      + I N I +  
Sbjct: 98  ASSFYHCYLNSIPKQYTNTITWQEIQFNLLRDELKTKHQKKQQKLLSEFDAIKNYISSNK 157

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNH 271
           D        Y  +F  E  N   F     ++ SR +   +         A++P+ D+ NH
Sbjct: 158 D--------YSHIF--EGINEAEFLQLVAMIESRTLFFKNEQDSTSEVGAMIPFYDLANH 207

Query: 272 S----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP 322
           +     +   +  +D+ S+  V    + +   EQ+FI+YG  +N   L  YGF+P
Sbjct: 208 TFMEGIDHFKYFYFDQISKEYVMRAYKHFVAEEQIFITYGNYNNEHFLDYYGFIP 262


>gi|358397725|gb|EHK47093.1| hypothetical protein TRIATDRAFT_298882 [Trichoderma atroviride IMI
           206040]
          Length = 481

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 56/194 (28%), Positives = 89/194 (45%), Gaps = 33/194 (17%)

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRERAIERI 207
           LI E    + S W  YI ALP QP  +  W          AEL   LE + + E  +++I
Sbjct: 92  LIKELLRGEESFWWPYIQALP-QPEDVDDWALPPFWPEEEAEL---LEGTNV-EVGLDKI 146

Query: 208 TNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR------------ 252
            + +   + + +  + +   D   +  E+   E + W++ I  SR  R            
Sbjct: 147 RDDLKREFREAKAMLLASQKDAEDDFSELLTRELYNWAYCIFSSRSFRASLVMTEAQQQA 206

Query: 253 LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV--FTTDRQYQPGEQVFISYG 307
           LP   S+D    L+P  D+ NH   V+   + D ++ G        R++QPG+Q+F +Y 
Sbjct: 207 LPEDVSVDDFSVLLPLFDIGNHDMAVDVRWELDAANSGAACQLRVGREHQPGQQIFNNYS 266

Query: 308 KKSNGELLLSYGFV 321
            K+N ELLL YGF+
Sbjct: 267 PKTNAELLLGYGFM 280


>gi|238550105|ref|NP_079136.2| N-lysine methyltransferase SETD6 isoform b [Homo sapiens]
 gi|333944471|pdb|3QXY|A Chain A, Human Setd6 In Complex With Rela Lys310
 gi|333944473|pdb|3QXY|B Chain B, Human Setd6 In Complex With Rela Lys310
 gi|333944524|pdb|3RC0|A Chain A, Human Setd6 In Complex With Rela Lys310 Peptide
 gi|333944526|pdb|3RC0|B Chain B, Human Setd6 In Complex With Rela Lys310 Peptide
 gi|119603386|gb|EAW82980.1| SET domain containing 6, isoform CRA_a [Homo sapiens]
 gi|307686123|dbj|BAJ20992.1| SET domain containing 6 [synthetic construct]
          Length = 449

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERVAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|323452617|gb|EGB08490.1| hypothetical protein AURANDRAFT_71532 [Aureococcus anophagefferens]
          Length = 1114

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 75/273 (27%), Positives = 114/273 (41%), Gaps = 35/273 (12%)

Query: 116 KLLFVPPSLVITADSKWSCP---EA------GEVLKQCSVPDWPLLATYLISEASFEKSS 166
           +L+FVP   ++ A S   C    EA      G +L + +  D   LA  L+ E      S
Sbjct: 57  ELVFVPFDAMLHARSPLVCSGEREANDARALGALLGKVTRED-DALALRLLYERRKGAKS 115

Query: 167 RWSNYISALPRQP-YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-------- 217
           RW  +I+ LP  P ++LL W+ AEL   L  S   E A    + V   ++++        
Sbjct: 116 RWGPHIALLPATPPHALLRWSEAELAE-LAGSDALELANRWRSQVSSDFSEIVDKSRAAV 174

Query: 218 -------RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR---VALVPWAD 267
                  +L    K    FP    ++E F W+  +++SR V + S  G     A +P  D
Sbjct: 175 EESDPGKQLSAAVKASLRFP--WLDLEGFSWAVSMIWSRCVSV-SRKGAPPIKAFLPVVD 231

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           M NH         +D +  G V       + G+++ + Y    N  LLL YGF      +
Sbjct: 232 MHNHDPGAPENHGFDDARDGFVLRRTGNAKKGDELKLCYDGLPNAWLLLLYGFALDHAAH 291

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
               +  PLS +     Y+ K  AL K GL A+
Sbjct: 292 AGRDLYAPLSPEAPH--YEAKRAALEKLGLGAT 322


>gi|307173810|gb|EFN64588.1| SET domain-containing protein 4 [Camponotus floridanus]
          Length = 376

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/296 (22%), Positives = 134/296 (45%), Gaps = 29/296 (9%)

Query: 74  ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           E+  +L+ WL +++ L  + +  +   +  RGL  LK+I   E L+ +P  ++IT D+  
Sbjct: 2   ESLISLKSWLLNENCLSIRHLIPEYFPLTGRGLKTLKHIECNEVLIQLPFRMLITTDTLL 61

Query: 133 SCPEAGEVLKQC-SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE-- 189
                   L    S     +LAT+L+ E      S+W  Y+  LP+   +  + +  E  
Sbjct: 62  QSNIRFLFLNTTDSFNPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEKR 121

Query: 190 -LDRYLEASQIRERAIERITNVIGT---YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGI 245
            L  ++  S  +   +E   +++     + D+  +    + +L   ++   E +KW++ +
Sbjct: 122 ILPSFILNSLHQAHRLESNFSLLMKAVKHLDIINKNHCSHCNLHLRKIITFEKYKWAYYV 181

Query: 246 LFSRLVRLPSMDGR------------VALVPWADMLNHSCEVE---TFLDYDKSSQGVVF 290
           + +R V + +   R            +AL P+ D+ NH+ +     + +  +  +Q    
Sbjct: 182 VNTRAVYIDTKLLREKNIFNIKQPNNLALAPFLDLFNHNVDTAVKVSIITDNNQNQFYQI 241

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYG-FVPREGTNPSDSVELPLSLKKSDKCY 345
            T + +    QVFI+YG  +N +L + YG F+P    NP D  E+   + +  +C+
Sbjct: 242 ITLKPFDRESQVFINYGAHNNLKLYIDYGFFIP---CNPLD--EIYFDILEIQRCF 292


>gi|320167148|gb|EFW44047.1| hypothetical protein CAOG_02072 [Capsaspora owczarzaki ATCC 30864]
          Length = 533

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 116/269 (43%), Gaps = 39/269 (14%)

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND----- 216
           F+  S W  +    PR+     +W    L   L+ + IR+ AI ++  +I    D     
Sbjct: 159 FDPDSFWQPWFQLFPRELDCAGFWDDLLL-MELDNTSIRD-AIRQLEALIEYEYDQLDLP 216

Query: 217 -LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR-VALVPWADMLNHSCE 274
            LRLR    +PD F  + F+ + FKW+F +L SR + +   +     ++P+ D  NH+  
Sbjct: 217 ALRLR----FPDSFVADRFSYDDFKWAFMVLASRGLTMSVNNAPCTVMIPFVDFFNHNGA 272

Query: 275 VETFL---------------DYDKSSQGV---VFTTDRQYQPGEQVFISYGKKSNGELLL 316
                               +YD S + +   V + +  + PGEQ+F++Y   SN  LLL
Sbjct: 273 KSIAFSYTRRAGDASDVSSGNYDDSVENLNCAVISGNETFLPGEQMFLNYKAHSNEVLLL 332

Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDK---CYKEKLEALRKYGLSASECFPIQITGWPLE 373
            YGF      + +  V L    +K++      +E L  LR  G+  +  F ++  G  ++
Sbjct: 333 HYGFALPHNEHDTFLVRLHFDREKTNDPLMDLREHLLELR--GIQENHPFLLRWHGDIID 390

Query: 374 ---LMAYAYLVVSPPSMKGKFEEMAAAAS 399
              L A   ++ +   +    EE  A+ S
Sbjct: 391 PDILFALRVMIATKDQLDFLLEEGIASNS 419


>gi|367048695|ref|XP_003654727.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
 gi|347001990|gb|AEO68391.1| hypothetical protein THITE_2117893 [Thielavia terrestris NRRL 8126]
          Length = 481

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 56/192 (29%), Positives = 76/192 (39%), Gaps = 29/192 (15%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRA-----ELDRYLEASQIRERAIERITN 209
           +LI E    + S W+ YI+ LP QP  +  W        E   YL  +       E   N
Sbjct: 108 FLIKEYLKGRDSFWAPYIATLP-QPEHVSAWALPAFWPEEDIAYLAGTNAHVAIAEIQAN 166

Query: 210 VIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR---------------- 252
           V   +   R  + +     FP  + +    +KW+F I  SR  R                
Sbjct: 167 VKSEFKQARKALKAAG---FPAWQDYTQMLYKWAFCIFTSRSFRPSLVLSEPAKQQMAEL 223

Query: 253 LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           LP    +D    L P  D+ NHS       D               YQPGEQV+ +YG K
Sbjct: 224 LPPGCQLDDFSILQPLFDIANHSMTARYAWDVASDPASCQLVCHDAYQPGEQVYNNYGLK 283

Query: 310 SNGELLLSYGFV 321
           +N ELLL+YGF+
Sbjct: 284 TNSELLLAYGFI 295


>gi|109128727|ref|XP_001102235.1| PREDICTED: SET domain-containing protein 6-like isoform 2 [Macaca
           mulatta]
          Length = 456

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 122/282 (43%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+++C   G + ++    
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQYTCSIGGLLERERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   +    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y+ + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|297807745|ref|XP_002871756.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297317593|gb|EFH48015.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 116/273 (42%), Gaps = 49/273 (17%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA--GEVLKQCSVPDWPLLATYLI 157
            G RGL A++ ++KGE +L VP + ++T +S  +        V+   S+    +L+  L+
Sbjct: 49  AGGRGLGAVRELKKGELVLKVPRNALMTTESMIAKDRKLNDAVILHGSLSSTQILSVCLL 108

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            E    K S W  Y+  LPR  Y LL  T  E ++  +A Q+ E A+      I      
Sbjct: 109 YEMGKGKRSFWYPYLVHLPRD-YDLLA-TFGEFEK--QALQV-EDAVWATEKAIA----- 158

Query: 218 RLRIFSKYPDLFPEEVFNMETFK------WSFGILFSRLVRLPSMDGRVALVPWADMLN- 270
           + +   K   L  EE+     F+      W+   + SR + +P  D    L P  D+ N 
Sbjct: 159 KCQFEWKEVGLLMEELELKSKFRSFQAWLWASATISSRTLHVP-WDSAGCLCPVGDLFNY 217

Query: 271 -------HSCE--------------VETFLD------YDKSSQGVVFTTDRQYQPGEQVF 303
                  H+ E              VET  +      +++          R YQ GEQV 
Sbjct: 218 DAPGDDLHTLEGPESANDVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVL 277

Query: 304 ISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           + YG  +N ELL  YGF+  E +N  D V +PL
Sbjct: 278 LCYGTYTNLELLEHYGFMLEENSN--DKVFIPL 308


>gi|133902101|ref|NP_490849.4| Protein SET-29 [Caenorhabditis elegans]
 gi|373219869|emb|CCD70787.1| Protein SET-29 [Caenorhabditis elegans]
          Length = 401

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 59/235 (25%), Positives = 101/235 (42%), Gaps = 36/235 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE--------AGEVLKQCSVPDWPLLATY 155
           G+ A    R G+  + +P + +I A      P          GE LK   +     L  +
Sbjct: 31  GIYATTGFRTGKAFITLPETDMINAALVVDLPVYRKKLAKIGGEKLKPMEI-----LTMF 85

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
              E +  + S WS Y+  LP++ ++   +   + D       IR+  I++   +     
Sbjct: 86  FAFEDT--EHSAWSPYLKVLPKE-FNTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISE 142

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWA 266
            LR         LFPE   + +   W++ ++ +R +         + + DG  +A++P+ 
Sbjct: 143 KLR--------RLFPE--LSHDKILWAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYV 192

Query: 267 DMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           DMLNH  E  +    ++K +   V    RQ Q GEQ+F+ YG   N  LL+ YGF
Sbjct: 193 DMLNHDPEKYQGLALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 247


>gi|68488193|ref|XP_712057.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
 gi|46433419|gb|EAK92860.1| hypothetical protein CaO19.10177 [Candida albicans SC5314]
          Length = 552

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 58/321 (18%)

Query: 74  ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
           E +   Q WL  + +    K+AI    D  + RG++AL++I   E +  +P S+V+  D+
Sbjct: 6   EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65

Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
                    VLK+  V D W  L   L  E  F+            S W  Y++ LP Q 
Sbjct: 66  NSLIKSYPSVLKKLRVLDQWIGLIIVLGFEIKFKFNPSDNNDNHNRSFWYEYLNILPDQF 125

Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN-DLR-LRIFSKYPDLFPEE 232
             L+YW   EL+ +L+ S I     +E  +     +I   N DL  +  F   P  F EE
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDRIGKENNLNMYNQIISIINQDLSGVEEFKSSPLTF-EE 183

Query: 233 VFNMETF--KWSFGILFSRLVRLP----SMDGRV-------------------------A 261
              + T    +SF +   +  ++     +  G                           +
Sbjct: 184 YNKVATIIMSYSFDVEVPKSKKMTKNGTNEKGNDEEDEEEDEDKEDDDDDEEEDNEYYKS 243

Query: 262 LVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           +VP+AD LN    +    L Y  S+  ++ T  +    GEQV+ +Y    N ELL  YG+
Sbjct: 244 MVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSELLRRYGY 301

Query: 321 VPREGTNPSDSVELPLSLKKS 341
           V   G+   D  E+PLS  KS
Sbjct: 302 VELNGS-KYDFGEIPLSTIKS 321


>gi|403306046|ref|XP_003943557.1| PREDICTED: N-lysine methyltransferase SETD6 [Saimiri boliviensis
           boliviensis]
          Length = 449

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 64/283 (22%), Positives = 118/283 (41%), Gaps = 27/283 (9%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S  +C   G + ++    
Sbjct: 36  LSPKVEVSRQGTVAGYGMVARESVQAGELLFVVPRAAIL---SPHTCSIGGLLERERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +S W  Y +  P   +    ++W   E  R L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAAASHWRPYFALWPELGHLEHPMFWPEEERRRLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF--------KWSFGILFSRLVRL 253
              + + ++   Y+ + L     +PDLF   V ++E +         +SF          
Sbjct: 152 AVEKDLDSIRSEYHSIVLPFMEAHPDLFSLRVHSLELYLQLVALVMAYSFQEPLEEEEDE 211

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
              +  + +VP AD+LNH       L+Y      +V T     QP   G ++F +YG+ +
Sbjct: 212 KEPNSPI-MVPAADILNHLANHNANLEYSADCLRMVAT-----QPIPKGHEIFNTYGQMA 265

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           N +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 266 NWQLIHMYGFVEPYPNNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|358335378|dbj|GAA53907.1| histone-lysine N-methyltransferase setd3 [Clonorchis sinensis]
          Length = 254

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/190 (27%), Positives = 90/190 (47%), Gaps = 12/190 (6%)

Query: 258 GRVA--LVPWADMLNHSC-EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
           G VA  LVP  D++NH   +V T  D+D  S  ++F +        Q+ + YGK+++ E 
Sbjct: 13  GAVAMCLVPIWDLINHKLGQVTT--DFDPESGELIFYSMEFTPKNTQILMDYGKRTSAEF 70

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLEL 374
           L+  GFVP   TNP ++V + L + KSD+   ++ + L    L +     + ITG    L
Sbjct: 71  LMFSGFVP--ATNPHNNVRIVLGVSKSDQLSSKREQLLELIALQSP--LILHITGDLSSL 126

Query: 375 ---MAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISK 431
              +A+A + V           M  +A + + +        ID+QA+ F++   E  +S 
Sbjct: 127 SDAIAFARVFVMDSDQLDAHLSMTTSALHALRTSPLCPGDPIDDQAIAFLIMRFELLVSA 186

Query: 432 YSRFLQVKEL 441
           Y   +   E+
Sbjct: 187 YGPMVSEDEV 196


>gi|322802325|gb|EFZ22721.1| hypothetical protein SINV_12919 [Solenopsis invicta]
          Length = 435

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 72/326 (22%), Positives = 137/326 (42%), Gaps = 49/326 (15%)

Query: 74  ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           E+   L+ WL S++ +       +   +  RGL  LK I K E L+ +P  ++IT D   
Sbjct: 25  ESLICLKSWLLSENCMSISYFIPEHFPLSGRGLKTLKRIEKNEVLIQLPLRMLITTDILM 84

Query: 133 SCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE- 189
                   L   +    P  +LAT+L+ E      S+W  Y+  LP+   +  + +  E 
Sbjct: 85  QSDVKTLFLYSTTDSFSPQCMLATFLVYETHLGIKSKWYLYLKTLPQSFTNPDFCSNKEK 144

Query: 190 -------LDRYLEASQIRE------RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
                  L    +A ++++      +A++R+        D+  R    + +   +++   
Sbjct: 145 AILPDFILHPLHQAHKLQKDFSLLMKAVKRL--------DINSRNSCPHCNACLQKIITF 196

Query: 237 ETFKWSFGILFSRLVRLPS-----------MDGRVALVPWADMLNH----SCEVETFLDY 281
             +KW++ ++ +R V + +               +AL P+ D+ NH    + +V      
Sbjct: 197 AKYKWAYYVVNTRAVYIDNGVCKENVFNIKQPNNLALAPFLDLFNHDINTAVKVSIVTVS 256

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG-FVPREGTNPSDSVELPLSLKK 340
           D  ++     T + +  G QVFI+YG   + +L + YG F+P    NP D  E+   +  
Sbjct: 257 DCQNKFYQIVTLKPFDKGSQVFINYGAHDSLKLYIDYGFFIPH---NPLD--EIKFDIFD 311

Query: 341 SDKCY---KEKLEALRKYGLSASECF 363
             +C+   + KL+ +   G   S  F
Sbjct: 312 IQRCFDVSRNKLDFIMLNGFHKSMSF 337


>gi|307103410|gb|EFN51670.1| hypothetical protein CHLNCDRAFT_139898 [Chlorella variabilis]
          Length = 543

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 113/291 (38%), Gaps = 39/291 (13%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-- 136
           L  W+  SG     +A+++ + G  GL A ++   G  L+ +P    +T D   S P   
Sbjct: 50  LVAWVESSGGSAAGVAVRRNEAGF-GLAASRDCGAGSTLVSLPQRCHLTYDDS-SDPRLL 107

Query: 137 --AGEV----LKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL 190
              G+V    L+  + P  P ++  L+  A         + ++AL   P +     R   
Sbjct: 108 ALIGQVVAHRLQGATSPFAPYISNLLLGVAGLPMFF-GGDALAALQYPPVTEQVKRRC-- 164

Query: 191 DRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
            R+L A   RE A  R                    D F     +     W+  ++ SR 
Sbjct: 165 -RWLLAFAQRELAAARRGG----------------GDPFGGADVDANALGWALAVVTSRA 207

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
            R    D   A++P  DM NHS +             +     R  Q GE V ISYG  S
Sbjct: 208 FRTRGPDQPAAMLPLIDMANHSFQAANAKIAPGPGGSMCMVATRALQAGEPVLISYGALS 267

Query: 311 NGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS 360
           N  LL+ YGF VP    NP D+V+L     + D+   E  +A+   G +  
Sbjct: 268 NDFLLMDYGFIVP---GNPHDTVQL-----RFDRGLIEAAKAVAGVGCTGG 310


>gi|428163078|gb|EKX32170.1| hypothetical protein GUITHDRAFT_121664 [Guillardia theta CCMP2712]
          Length = 449

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 83/395 (21%), Positives = 163/395 (41%), Gaps = 49/395 (12%)

Query: 75  NASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNIRKGEKLLFVPPSLVITADSKW 132
           + S + +W + +G    K+ ++  D GE G  L A ++I  GE +L +P +L+       
Sbjct: 25  DGSDVYEWAAANGANVSKVVLR--DDGEAGPILHAKEDIEAGEVILSLPANLLFPTRVSD 82

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
             P    +++  ++     +  YLISE + + SS W  ++ +LP + +  L ++  ++  
Sbjct: 83  HSPVV-HMIENTTIGRITAICLYLISERA-DSSSHWKPWLQSLPPRFFHALSYSEDDM-L 139

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP------------EEVFNMETFK 240
           + +AS  +E    +  NV   Y      +  K P   P             E F  E F+
Sbjct: 140 HFQASSFKELRDRKKKNVRQEYEQTVAPLLHKLPAFDPLLAAVDKPQNVTREDFTYEAFE 199

Query: 241 WSFGILFSRLVRLPSMDGR---------VALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
           W++ ++ +R +  P + G          + L P AD   H       + YD      VF+
Sbjct: 200 WAYSVVTTRGI-FPGLLGEEEREGEVPLLVLGPLADSFIHGAS-GVKISYDAQEHRCVFS 257

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
              +      + I  G  SN ELL + GF+ +   N ++ V +   L ++   +    E+
Sbjct: 258 ALHKVAKNSPISIGVGMSSNMELLANRGFMMQ--NNGNNFVLMKFQLDRNSDMHASARES 315

Query: 352 -LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
            +++  LS    + ++    P  L+A   +    P   G +              K +  
Sbjct: 316 MMKQLNLSNPMTYVVRYGEMPQGLLASLRIQSLSPVEFGSY-------------GKALAT 362

Query: 411 P---EIDEQALQFILDSCESSISKYSRFLQVKELL 442
           P   E + +A + ++ SC S ++ Y   ++  E++
Sbjct: 363 PVTLENEWRAYRLLISSCNSILAMYPTTIEEDEIV 397


>gi|58261130|ref|XP_567975.1| nucleus protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|134115865|ref|XP_773415.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256040|gb|EAL18768.1| hypothetical protein CNBI2600 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57230057|gb|AAW46458.1| nucleus protein, putative [Cryptococcus neoformans var. neoformans
           JEC21]
          Length = 495

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 106/264 (40%), Gaps = 63/264 (23%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----------DSKWSCPEAGEVLKQCSVPDWPLLA 153
           G VA+K+I +G  L  VP  L+++A           S+W     G          W  L 
Sbjct: 44  GAVAVKDIEEGTPLFHVPDDLILSAYTSDLKDHLDASEWDQLNKG----------WAQLI 93

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWT---RAELDRYLEASQI-RERAIERITN 209
             ++ E      SRW+ Y++ +P    + ++WT   R +L     A +I RE A    T+
Sbjct: 94  LVMMWETIKGSKSRWAGYLANMPVLFETPMFWTERQREQLSGTDIADRIGREDAEAEYTS 153

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLP-----------SM 256
           V+  +          +PDLFP +     M+ F      + SR   +P             
Sbjct: 154 VLAPF-------IKAHPDLFPVDSPHITMDAFHIQGSRILSRSFTVPLHRFGRSHSQSRS 206

Query: 257 DGR-------------VALVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQ 297
           DG              V ++P+ADMLN +        ++D D      +GVV  + +  +
Sbjct: 207 DGNSEKESDDEDEEEMVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTQLVK 266

Query: 298 PGEQVFISYGKKSNGELLLSYGFV 321
             EQ++ +Y    N ELL  YG V
Sbjct: 267 QSEQIYNTYDSPPNSELLRKYGHV 290


>gi|145511243|ref|XP_001441549.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408799|emb|CAK74152.1| unnamed protein product [Paramecium tetraurelia]
          Length = 731

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 86/201 (42%), Gaps = 15/201 (7%)

Query: 92  KMAIQKVDVGER--GLVALKNIRKGEKLLFVPPSLVITADSKWSCP------EAGEVLKQ 143
           K AI K   G +  GLVA + I   E L+ VP  L++T    +  P      +  +    
Sbjct: 50  KYAIFKTKNGLKYPGLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFLDHPQYFSN 109

Query: 144 CSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
              P W    L  +++ E      S W   IS LPR    L++W   E +  L+  Q+ +
Sbjct: 110 QFYPSWEDHQLMAFILYEYQRGPESEWHLLISNLPRDIDYLVFWNPEEQE-LLDDQQLVK 168

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
            A ++    +  Y  L+  I  KYP LF  E    E  +W +  L +R          V 
Sbjct: 169 LARKQYQEFVIEYETLKC-ITDKYPQLFKPETVTFENARWVYTHLVTRC--FGKYLAYVT 225

Query: 262 LVPWADMLNHSCEVETFLDYD 282
           +VP+ ++ NH C  + F D++
Sbjct: 226 MVPFCELFNHEC-TDVFYDFE 245



 Score = 41.6 bits (96), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 27/154 (17%)

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA--- 351
           Q++ G QV+  YG+ SN  +L+ YG       N  D   L +   K    Y + +EA   
Sbjct: 516 QFEKGAQVYFCYGRLSNRMMLMRYGMTLE--YNKYDHAHLRIDYLK----YVQNIEAVWL 569

Query: 352 LRKYGLSASECFPIQITGWPLELMAYA---YLVVSPPSMKGKFEEMAAAASNKMTSKKDI 408
           + KY LS  + F ++ T +P++ + +    Y   +  S+   F+               I
Sbjct: 570 VHKYQLSKYKRFKLKHTTFPIDFIVFCKSIYWTFNVHSLDTFFK---------------I 614

Query: 409 KCPEIDEQALQFILDSCESSISKYSRFLQVKELL 442
           +  +++ +ALQ  L+     ISK++  L+  E L
Sbjct: 615 QDLKLERKALQLALEILVEEISKFTDKLEDNEKL 648


>gi|326927087|ref|XP_003209726.1| PREDICTED: n-lysine methyltransferase SETD6-like [Meleagris
           gallopavo]
          Length = 410

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 61/241 (25%), Positives = 108/241 (44%), Gaps = 28/241 (11%)

Query: 114 GEKLLFVPPSLVITADSKWSCP------EAGEVLKQCSVPDWPLLATYLISEASFEKSSR 167
           GE L  VP S ++   S+ +C       +A E L+  S   W  L   L+ E +   +SR
Sbjct: 12  GELLFSVPRSALL---SQHTCAIRALLHDAQESLQSQS--GWVPLLLALLHEYT-TSTSR 65

Query: 168 WSNYISALPRQPYSLL----YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           W  Y S    Q +S L    +W   E  + L+ + I E   + + N+   Y+ + L    
Sbjct: 66  WQPYFSLW--QDFSSLDHPMFWPEEERTKLLQGTGIPEAVDKDLANIQLEYSSIILPFMK 123

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA--------LVPWADMLNHSCEV 275
            +PD+F  E+  +E +K     + +   + P  +            +VP AD+LNH    
Sbjct: 124 SHPDIFDPELHTLELYKQLVAFVMAYSFQEPLEEEDEDEKGPNPPMMVPVADILNHVANH 183

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
              L+Y  + + +   T +    G+++F +YG+ +N +LL  YGF      N +D+ ++ 
Sbjct: 184 NASLEY--APRCLRMVTTQPISKGQEIFNTYGQMANWQLLHMYGFAEPYPGNTNDTADIQ 241

Query: 336 L 336
           +
Sbjct: 242 M 242


>gi|68488236|ref|XP_712036.1| hypothetical protein CaO19.2654 [Candida albicans SC5314]
 gi|46433396|gb|EAK92838.1| hypothetical protein CaO19.2654 [Candida albicans SC5314]
          Length = 552

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 86/321 (26%), Positives = 132/321 (41%), Gaps = 58/321 (18%)

Query: 74  ENASTLQKWLSDSGLPPQ-KMAIQK-VDVGE-RGLVALKNIRKGEKLLFVPPSLVITADS 130
           E +   Q WL  + +    K+AI    D  + RG++AL++I   E +  +P S+V+  D+
Sbjct: 6   EKSKLFQDWLIKNNVEISPKIAIHDYCDTNQGRGIIALEDINPDEMIFKLPRSIVLNIDN 65

Query: 131 KWSCPEAGEVLKQCSVPD-WPLLATYLISEASFE----------KSSRWSNYISALPRQP 179
                    VLK+  V D W  L   L  E  F+            S W  Y++ LP Q 
Sbjct: 66  NSLIKSYPSVLKKLRVLDQWIGLIIVLGFEIKFKFNPSDNNDNHNRSFWYEYLNILPDQF 125

Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYN-DLR-LRIFSKYPDLFPEE 232
             L+YW   EL+ +L+ S I     +E  +     +I   N DL  +  F   P  F EE
Sbjct: 126 NQLIYWNDEELN-HLQPSCILDRIGKENNLNMYNQIISIINQDLSGVEEFKSSPLTF-EE 183

Query: 233 VFNMETF--KWSFGILFSRLVRLP----SMDGRV-------------------------A 261
              + T    +SF +   +  ++     +  G                           +
Sbjct: 184 YNKIATIIMSYSFDVEVPKSKKVTENGTNEKGNDEEDDDEDEDKEDDDDDEEEDNEYYKS 243

Query: 262 LVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           +VP+AD LN    +    L Y  S+  ++ T  +    GEQV+ +Y    N ELL  YG+
Sbjct: 244 MVPFADTLNADTHLNNAILIY--STDQLIMTCIKPIAKGEQVYNTYSDHPNSELLRRYGY 301

Query: 321 VPREGTNPSDSVELPLSLKKS 341
           V   G+   D  E+PLS  KS
Sbjct: 302 VELNGS-KYDFGEIPLSTIKS 321


>gi|145545977|ref|XP_001458672.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426493|emb|CAK91275.1| unnamed protein product [Paramecium tetraurelia]
          Length = 666

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 48/176 (27%), Positives = 82/176 (46%), Gaps = 12/176 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSVPDWP--LLATY 155
           GL  ++ I     L+ VP  L++T      +D +       +   Q     W   +L TY
Sbjct: 69  GLKTIEKIESDSILVSVPRELMLTTKIAYFSDIQEIFDAYPQFFSQHCAGGWQDRILLTY 128

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ ++   + S+W + I+ LPR    L++W+  EL + L   ++  +A   + + +    
Sbjct: 129 LLYQSQLGRQSQWYHLIANLPRDIDYLIFWSDEEL-KLLNDEKLVLKAKRELQDFLLIQK 187

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
            L   I  +YP  F +E +++E  KW F  L SR     S   +VA VP+ +M NH
Sbjct: 188 TLT-HILDQYPQHFKKETYSLENIKWIFIHLVSRC--FGSTLEQVAFVPFCEMFNH 240


>gi|159490820|ref|XP_001703371.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280295|gb|EDP06053.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 339

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 65/238 (27%), Positives = 109/238 (45%), Gaps = 21/238 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPEAGEVLKQCSVPDWPLLATYLISE 159
           R LVA +NI+ GE ++ VP   V+ A++   +    E G  + + S  +  L    L+  
Sbjct: 66  RALVASRNIKMGEVVVEVPDDAVLMAENCGLRDVLEEEG--MTKDSADEEILEVQGLVIA 123

Query: 160 ASFEK----SSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI------- 207
             +E+     SRW+ Y++ LP     + LYW R E  R L  +   ++ + R        
Sbjct: 124 VMWERWRGPESRWAPYLALLPDDMTHMPLYWKRREF-RELRGTAAYDKMLGRAQHPSDAP 182

Query: 208 TNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWA 266
           T V   ++++     +++P+L  P      E ++W+   + S    L   D   A+VP  
Sbjct: 183 TQVPLLWSEVVGPFIAEHPELGLPGGERGYELYRWATAAVASYSFILGD-DKYQAMVPVW 241

Query: 267 DMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
           D+LNH + +V   L +      +     R    G ++  +YG+ SN ELL  YGFV R
Sbjct: 242 DLLNHITGDVNVRLHHCSKRHVLQMIAMRDIVAGSELVNNYGELSNAELLRGYGFVER 299


>gi|402908594|ref|XP_003917022.1| PREDICTED: N-lysine methyltransferase SETD6 [Papio anubis]
          Length = 456

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 121/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G + ++    
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIGGLLERERGAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y+ + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYHSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|428182808|gb|EKX51668.1| hypothetical protein GUITHDRAFT_102933 [Guillardia theta CCMP2712]
          Length = 436

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 79/322 (24%), Positives = 131/322 (40%), Gaps = 33/322 (10%)

Query: 68  CEIDSLENASTLQKWLSDS-GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
            E+D     + L+ WL +  G+   K+ +Q+  +   G+ A + +  GE L  +P S  I
Sbjct: 18  VEVDGGLRGNALRIWLEEEHGVDMSKVDLQRSPLEGLGVFANRRLEPGETLFMIPKSCCI 77

Query: 127 TADSKWSCPEAGEVLKQCSVP-----DWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
             +  +   + G+ +++ +       +   LAT+L  E      S +  +I  LP     
Sbjct: 78  YPELVFEDRQLGKSMQKLASAAGEGIEVVALATFLAREKMKGSESSYKPFIDVLPWDSLH 137

Query: 182 LLYWTRAELD------RYLEASQIRER---AIERITNVIGTYNDLRLRIFSKYPDLFPEE 232
            L WT  E+D       + E    RE+   A E    V+      +     +   + PEE
Sbjct: 138 PLLWTDEEVDLLEGTYAHREILAFREQVEVATELFEPVLNPKGWKQFFQTIETEKMTPEE 197

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRVAL-----VPWADMLNH-----SCEVETFLDYD 282
              M   + +F  + SR        G   L     +P  D+ NH     S   +T L+ D
Sbjct: 198 FGFM--MRGAFASVLSRAFDSKIGRGDKGLEERVVIPLLDIFNHGSYGPSITFDTALERD 255

Query: 283 KSSQGVVFTTDR--QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPS-DSVELPLSLK 339
                 V   D+    + GE++F  YG K N  +L +YGFV     NP      L +S+ 
Sbjct: 256 NEKGFPVRVADKGKSIEEGEELFGFYGDKPNWNMLTTYGFV---SPNPKCQETTLSVSID 312

Query: 340 KSDKCYKEKLEALRKYGLSASE 361
           + D  + +K E L+  G+ A E
Sbjct: 313 EKDPYFAQKEEILKARGMVAVE 334


>gi|422293007|gb|EKU20308.1| ribulose- -bisphosphate carboxylase oxygenase small subunit
           n-methyltransferase i [Nannochloropsis gaditana CCMP526]
          Length = 385

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 107/251 (42%), Gaps = 35/251 (13%)

Query: 82  WLSDS---GLPPQKMAIQKVDVGE-------RGLVALKNIRKGEKLLFVPPSLVITADSK 131
           W+ D    G+PP  + +    + E       RGL+    I  G  L  +P S+VI  +  
Sbjct: 121 WMQDKSGWGVPPHPLLLSSRTIDEIELEDSGRGLICKYPINMGNALFQLPLSIVIDKEKS 180

Query: 132 WSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAE 189
            +  +         + ++  +A  LI E +   SS W+ YI  LP   +    L W   +
Sbjct: 181 LAAFDGA---LPADINEYFAIALMLIKERALGPSSFWAPYIDVLPTTEEVNPTLVWPEGD 237

Query: 190 LDRYLEASQI--RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
           L   LEAS +    R+++R   +   +  L  +      D+F   VF  E + W+F  +F
Sbjct: 238 L-ALLEASPLVAATRSLKR--KLAAEFALLEEQYMRARSDVFDPSVFTFEAYLWAFINIF 294

Query: 248 SRLVRLPSMDGR---------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
           SR +R+     R         + + P+AD++NH+    T++  +K  +  +F   R    
Sbjct: 295 SRAIRVKIGGKRGPSGEEEESIIMCPYADLINHNPFANTYIVAEKPFK--MFNPIR---- 348

Query: 299 GEQVFISYGKK 309
           GE+V   Y  K
Sbjct: 349 GEEVITIYADK 359


>gi|365989356|ref|XP_003671508.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
 gi|343770281|emb|CCD26265.1| hypothetical protein NDAI_0H00910 [Naumovozyma dairenensis CBS 421]
          Length = 540

 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 77/313 (24%), Positives = 123/313 (39%), Gaps = 64/313 (20%)

Query: 71  DSLENAST--LQKWLSDSGLPPQKMAIQKVDVGERG----LVALKNIRKGEKLLFVPPSL 124
           DSL N  T     WL+  G       I+  D+   G    ++A K+I   E L  +P S 
Sbjct: 50  DSLFNEQTESFLSWLTTDGKVTVSSKIKIEDLRSEGQGRCIIASKDIDTDELLFEIPRSS 109

Query: 125 VITADSKWSCPEAGEVL-KQCSVPDWPLLATYLISEAS-FEKSSRWSNYISALPRQP--Y 180
           ++   +   C +   +  K   +  W  L   ++ E    +  SRWS+Y + LP      
Sbjct: 110 ILNVTTSQLCVDFPHITGKLMELSQWDSLIICMMYEMKVLQHESRWSSYFNVLPSSESLN 169

Query: 181 SLLYWTRAELDRYLEASQIRERA--------IERITNVIGTYNDLRLRIFSKYPDLFPEE 232
           +L+YW   EL  +L  S +  R           RI + I  +N+          D+  E+
Sbjct: 170 TLMYWNDKELS-FLTPSLVVNRVGKGDAETMYRRILDTINEFNE----------DILTEK 218

Query: 233 VFNMETFKWSFGILFSRLVRLPSMDGRV------------------------ALVPWADM 268
              + +  W   +    ++   S D  +                        +++P AD 
Sbjct: 219 ---LGSISWEEFLYIPSIIMAYSFDVEIKNDDDENEGDEEFDEKEEEPELLKSMIPLADT 275

Query: 269 LN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
           LN   H C     L YDK S  ++    +  + GEQV+ +YG+  N ELL  YG+V   G
Sbjct: 276 LNADTHKCNAN--LTYDKDSLKMLAI--KPIKKGEQVYNTYGELPNSELLRKYGYVEWGG 331

Query: 326 TNPSDSVELPLSL 338
           +   D  E+P  L
Sbjct: 332 SQ-FDYGEVPFDL 343


>gi|116206234|ref|XP_001228926.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
 gi|88183007|gb|EAQ90475.1| hypothetical protein CHGG_02410 [Chaetomium globosum CBS 148.51]
          Length = 442

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 59/203 (29%), Positives = 83/203 (40%), Gaps = 31/203 (15%)

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL------LYWTRAELDRYLEASQ 198
           SVP   L   +LI E    K S W  Y++ LP  P  +       +W   ++  YLE + 
Sbjct: 104 SVPPHVLGRFFLIKEYLKGKDSFWWPYLATLP-SPDQVNAWVLPAFWPEDDI-AYLECTN 161

Query: 199 IRERAIERITNVIGTYNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVR----- 252
                 E   NV G +   R  + ++ +PD+     +    +KW+F I  SR  R     
Sbjct: 162 AHVAIQEIQANVKGEFKQARKILKNENFPDV---AAYTSLMYKWAFTIFTSRSFRPSLIL 218

Query: 253 -----------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
                      LP    +D    L P  D+ NHS       D    +         +Y P
Sbjct: 219 SDTTKRHISTLLPQSVELDDFSILQPLLDIANHSPTAVYSWDTTSPADACTLVCGDRYPP 278

Query: 299 GEQVFISYGKKSNGELLLSYGFV 321
           G QVF +YG K+N ELLL YGF+
Sbjct: 279 GAQVFNNYGLKTNSELLLGYGFI 301


>gi|410215066|gb|JAA04752.1| SET domain containing 6 [Pan troglodytes]
          Length = 449

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|410257726|gb|JAA16830.1| SET domain containing 6 [Pan troglodytes]
 gi|410351697|gb|JAA42452.1| SET domain containing 6 [Pan troglodytes]
          Length = 449

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 119/282 (42%), Gaps = 25/282 (8%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG----EVLKQ 143
           L P+    ++  V   G+VA ++++ GE L  VP + ++   S+ +C   G    E +  
Sbjct: 36  LSPKVAVSRQGTVAGYGMVARESVQAGELLFVVPRAALL---SQHTCSIRGLLERERVAL 92

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRE 201
            S   W +     +       +SRW  Y +  P   +    ++W   E    L+ + + E
Sbjct: 93  QSQSGW-VPLLLALLHELQAPASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPE 151

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
              + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +    
Sbjct: 152 AVEKDLANIRSEYQSIVLPFMEAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDE 211

Query: 261 ------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSN 311
                  +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N
Sbjct: 212 KEPNSPVMVPAADILNHLANHNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMAN 266

Query: 312 GELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +L+  YGFV     N  D+ ++ +   +       K EA R
Sbjct: 267 WQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKTEAER 308


>gi|302834219|ref|XP_002948672.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
 gi|300265863|gb|EFJ50052.1| hypothetical protein VOLCADRAFT_104004 [Volvox carteri f.
           nagariensis]
          Length = 510

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 65/133 (48%), Gaps = 6/133 (4%)

Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           A+ P  D+ NHS  V++ + Y+           R+++ GEQVFISYG +SN  L+  YGF
Sbjct: 294 AICPLIDLFNHSSAVQSEVAYNYFGDSYSVVASREFKKGEQVFISYGAQSNDSLMQYYGF 353

Query: 321 VPREGTNPSD---SVELPLSLKKSDKCYKEKLEALRKYGLSAS-ECFPIQITGWPLELMA 376
              E  NP D     ++   L       + +L+AL+   L+ S +   IQ  G+P E + 
Sbjct: 354 A--EANNPQDVYVMTDMLRWLTAVRSVGQSRLDALKGSPLANSLQQVAIQRAGFPSETLQ 411

Query: 377 YAYLVVSPPSMKG 389
               +++  S  G
Sbjct: 412 AVRFLLAADSEAG 424


>gi|308498155|ref|XP_003111264.1| CRE-SET-29 protein [Caenorhabditis remanei]
 gi|308240812|gb|EFO84764.1| CRE-SET-29 protein [Caenorhabditis remanei]
          Length = 401

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 101/228 (44%), Gaps = 23/228 (10%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
           G+ A ++ R G  ++ +P   +I +      P   + +   +    P+ + T       F
Sbjct: 30  GIYATRSFRSGLPIITLPEYDMINSALVLDLPFYRKKMANVNEKLKPMEILTMFFCFEDF 89

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           E+S+ WS Y+  LP++ +    + R + D       IR+  I++   +      LR    
Sbjct: 90  EQSA-WSPYLKILPKE-FDTPAFKRIDYDVNTLPLSIRKYWIDQKKEISEISEKLR---- 143

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNHSC 273
                LFPE     +   W++ ++ +R +         + + DG  +A++P+ DMLNH  
Sbjct: 144 ----RLFPE--LTHDKILWAWHVVNTRCIFVENEEHDNVDNTDGDTIAVIPYVDMLNHDP 197

Query: 274 E-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
           E  +    ++K +   V    RQ   GEQVF+ YG   N  LL+ YGF
Sbjct: 198 EKYQGVALHEKRNGRYVVQARRQILEGEQVFVCYGAHDNARLLVEYGF 245


>gi|348572449|ref|XP_003472005.1| PREDICTED: LOW QUALITY PROTEIN: N-lysine methyltransferase
           SETD6-like [Cavia porcellus]
          Length = 466

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 112/249 (44%), Gaps = 23/249 (9%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+ +  ++  V   G+VA ++++ GE L  VP + ++   S  +C   G++L++    
Sbjct: 77  LSPKVVVSKQGTVAGYGMVARESVQPGELLFAVPRAALL---SPHTC-SIGDLLERERSA 132

Query: 144 -CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
             S   W +     +       +S WS Y +  P   +    ++W   E  R L+ + + 
Sbjct: 133 LQSQSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVP 191

Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
           E   + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +   
Sbjct: 192 EAVDKDLANIRSEYYAIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEEE 251

Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP-GEQVFISYGKKSNG 312
                   +VP AD+LNH       L+Y      +V T   Q+ P G ++F +YG+ +N 
Sbjct: 252 EKDPNSPLMVPGADILNHLANHNANLEYSADYLRMVAT---QFIPKGHEIFNTYGQMANW 308

Query: 313 ELLLSYGFV 321
           +L+  YGFV
Sbjct: 309 QLIHMYGFV 317


>gi|326913214|ref|XP_003202935.1| PREDICTED: SET domain-containing protein 4-like, partial [Meleagris
           gallopavo]
          Length = 241

 Score = 62.4 bits (150), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 102/212 (48%), Gaps = 19/212 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW-SCPEA 137
           L+KWL D G     +   +     RGL+  + ++ GE ++ +P   ++T ++   SC   
Sbjct: 35  LKKWLKDRGFGDSSLRPAQFWGTGRGLMTTRALQAGELVISLPEKCLVTTNTVLNSC--L 92

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
           GE + +   P  PL+A  T+LI+E    + S W  Y+  LP+  YS       ++ + L 
Sbjct: 93  GEYIMKWKPPVSPLIALCTFLIAEKHAGEKSLWKPYLDVLPKT-YSCPVCLEQDVIQ-LF 150

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVR 252
              +R++A E+ T V   Y   +   FS    LF E    +FN    +W++  + +R + 
Sbjct: 151 PEPLRKQAQEQRTTVHELYMSSK-AFFSSLQSLFAENTATIFNHSALEWAWCTINTRTIY 209

Query: 253 LP-------SMDGRV-ALVPWADMLNHSCEVE 276
           +        S++  V AL P+ D+LNHS  V+
Sbjct: 210 MKHSQRECFSLEPDVYALAPYLDLLNHSPNVQ 241


>gi|428178458|gb|EKX47333.1| hypothetical protein GUITHDRAFT_152084 [Guillardia theta CCMP2712]
          Length = 294

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 107/262 (40%), Gaps = 40/262 (15%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL-KQCSVPDWPLLATYLISEAS 161
           RG+  L+ ++  + ++ VP S  ++  S    PE  ++  ++ S+  +  L   L+ EA+
Sbjct: 17  RGVAVLQEMKSDDVIVEVPASSFLSIWSVKDVPELSKIFGEEKSIDSFTGLMILLLHEAN 76

Query: 162 FEKSSRWSNYISALPRQPYSLLYWTRAEL-DRYLEASQIRERAIERITNVIGTYNDLRLR 220
            E S+ W  Y+ +LP        W+ A++   ++   ++ E     +     +YN     
Sbjct: 77  KETSA-WRKYLCSLPLYMPLPFMWSDADIPADFMRMPEVVEERKMLLEYTSLSYNSTIAP 135

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA------------------- 261
           +  KYP +FPE+ F    + W+  I+ SR + +    G +                    
Sbjct: 136 LILKYPQVFPEDRFTKSKWAWALSIVVSRSIAMKRTGGVLGYSWSLADPEVLDVANVLEA 195

Query: 262 -------------LVPWADMLNHSCEVETFLDYDKSSQGVVFTT--DRQYQPGEQVFISY 306
                        LVP  DM+NH           + + G +  T  D   Q G +V I+Y
Sbjct: 196 LKSGKSDAHVAPVLVPVVDMMNHDSNSSLACKMKQKTDGTIIVTAADEGLQRGYEVAINY 255

Query: 307 GKKSNGELLLS-YGFV--PREG 325
             K  G   L+ +GFV  P EG
Sbjct: 256 SPKLCGNKPLNRWGFVLPPCEG 277


>gi|431912319|gb|ELK14453.1| SET domain-containing protein 6, partial [Pteropus alecto]
          Length = 847

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 74/337 (21%), Positives = 138/337 (40%), Gaps = 39/337 (11%)

Query: 44  CSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVG 101
           C+ S +++         +   P G     L+  +    W    GL   P+    ++  V 
Sbjct: 382 CTHSESSEGLGQVNHTMEVAGPVGARDPDLDPVAGFLSWCRRVGLELSPKVAVSRQGTVA 441

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLI 157
             G+VA ++++ GE L  VP ++++   S+ +C  +G + ++     S   W +     +
Sbjct: 442 GYGMVARESVQPGELLFAVPRAVLL---SQHTCSISGLLERERGALQSQSGW-VPLLLAL 497

Query: 158 SEASFEKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
                  +S W+ Y +  P       P   ++W   E  R L+ + + E   + + N+  
Sbjct: 498 LHELQAPASPWTPYFALWPELGSLEHP---MFWPEEERRRLLQGTGVPEAVEKDLANIRS 554

Query: 213 TYNDLRLRIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVA--- 261
            Y  + L     +PDLF   V ++E +         +S  +  S    L   D       
Sbjct: 555 EYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSQALYGSFQEPLEEEDDEKEPNS 614

Query: 262 --LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLL 316
             +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+ +N +L+ 
Sbjct: 615 PLMVPAADILNHLASHNANLEYSPNYLRMVAT-----QPIPKGHEIFNTYGQMANWQLIH 669

Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            YGFV     N  D+ ++ +   +       ++EA R
Sbjct: 670 MYGFVEPYPNNTDDTADIQMVTVREAALQGTEVEAER 706


>gi|328772032|gb|EGF82071.1| hypothetical protein BATDEDRAFT_23340 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 419

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 76/335 (22%), Positives = 134/335 (40%), Gaps = 29/335 (8%)

Query: 67  GCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
           G  I+  E  +  QKWL  +      + +       RGL+A  + + G+ ++ +P  L++
Sbjct: 9   GTIIEDNECWALFQKWLVLNNCSISSLVLAHFSDTGRGLMATSDFQIGDPVVRIPARLLL 68

Query: 127 T---ADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL 183
                   ++   A   LKQ      P +A + I+      +  WS YI  LPR   ++ 
Sbjct: 69  VPRRTHKLFNNHPAIVALKQH-----PSIALF-IAWQKIHPTPEWSPYIDILPRSFDTMP 122

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
                +L   L    I+E A  + + +   Y  +   +     ++ P+++F     KW++
Sbjct: 123 LCIDLKLLAML-PYDIQEIAKNQQSKLDTDYAFVCTALAVSGYEMIPKDIF-----KWAW 176

Query: 244 GILFSRLVRL-------PSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGVVFT 291
            ++ +R + +       P +        + L P+ D LNH+        YD   +  +  
Sbjct: 177 IVVNTRCITMNTNAISKPQLSHIHQQPIITLAPFLDCLNHTSTARISAGYDTVEKAYIIR 236

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
           T   Y+ G QVFI+YG   N  LL  YGF   +  NP + V L   +    + +    + 
Sbjct: 237 TLVPYKKGSQVFINYGPHDNNFLLAEYGFAILK--NPFNHVVLDREVDFMMQHFGTVSDL 294

Query: 352 LRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
           L+  GL           G+ L      Y+ VS  S
Sbjct: 295 LKSEGLYGEFIIANDDLGYRLMNAMRLYVAVSQGS 329


>gi|242823770|ref|XP_002488126.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713047|gb|EED12472.1| SET domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 480

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 152/385 (39%), Gaps = 83/385 (21%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RG+VA  +I++GE L  +P  +V+   +        + LK  ++  W  L   +I E S 
Sbjct: 48  RGVVARSDIQEGEDLFHLPQRVVLMVKTSPLNEILADELK--NLGPWLSLVVVMIYEYSL 105

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
            + S W+ Y   LP +  +L++W+  EL      SQ++  A   + + IG   D    IF
Sbjct: 106 GERSNWNQYFQVLPTKFDTLMFWSGEEL------SQLQASA---VIHKIGK-KDAEEDIF 155

Query: 223 SK-------YPDLFP------------------EEVFNMETF--KWSFGI---------- 245
            K       +PDLFP                  E    M +    ++F I          
Sbjct: 156 EKIIPLVRSHPDLFPPVNGVMSYDDDAGAQALLELAHRMGSLIMAYAFDIEKGEEEESEG 215

Query: 246 ----LFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
               L     +LP       +VP AD+LN   +      + +    +V    +  + G++
Sbjct: 216 EDGYLTDDEEQLPK-----GMVPLADLLNADADRNNARLFQEDG-ALVMRAIKPIKTGDE 269

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           +F  YG+    +LL  YG+V  +     D VELPL    +  C+   L+ +       S+
Sbjct: 270 IFNDYGELPRSDLLRRYGYVT-DNYAQYDVVELPL----TGICHAAGLDNIE------SQ 318

Query: 362 CFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAAS-------------NKMTSKKDI 408
            +P       LE++   Y ++ P +     + +                   ++ SK+  
Sbjct: 319 EYPHLKLLHELEILEDGYCILRPSAEDSLTDILPDELLALLKSLTLEREELQRLQSKQKP 378

Query: 409 KCPEIDEQALQFILDSCESSISKYS 433
             P +  +  + +LDS +S +S+Y 
Sbjct: 379 PKPILAAREARILLDSVKSKLSQYG 403


>gi|321257099|ref|XP_003193469.1| nucleus protein [Cryptococcus gattii WM276]
 gi|317459939|gb|ADV21682.1| nucleus protein, putative [Cryptococcus gattii WM276]
          Length = 491

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 107/259 (41%), Gaps = 54/259 (20%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVI---TAD-------SKWSCPEAGEVLKQCSVPDWPLLA 153
           G VA+K+I +G  L  V  +L++   T+D       S+W     G          W  L 
Sbjct: 43  GAVAVKDIEEGTPLFHVTDNLILSPYTSDLKDHLDASEWDQLNKG----------WAQLI 92

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
             ++ E      SRW+ Y++ +P    + ++WT  + D+ L  + I +R I R  +    
Sbjct: 93  LVMMWETIKGSKSRWAGYLTNMPVMFETPMFWTEQQRDQ-LSGTDIADR-IGR-EDAEAE 149

Query: 214 YNDLRLRIFSKYPDLFPEE-------VFNME---TFKWSFGILFSRLVRLPSM---DGRV 260
           Y  L       +PDLFP +        F+++       SF +   R  R  S    DG  
Sbjct: 150 YTSLLAPFIKAHPDLFPVDSPHTTIDAFHIQGSRILSRSFTVPLHRFGRSQSQSQSDGNE 209

Query: 261 A------------LVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQPGEQV 302
                        ++P+ADMLN +        ++D D      +GVV  + R  +  EQ+
Sbjct: 210 TESDDEEEEEVVVMIPFADMLNAAWGKDNAHLYVDEDTIEGFDEGVVMKSTRLVKQSEQI 269

Query: 303 FISYGKKSNGELLLSYGFV 321
           + +Y    N ELL  YG V
Sbjct: 270 YNTYDSPPNSELLRKYGHV 288


>gi|255536985|ref|XP_002509559.1| conserved hypothetical protein [Ricinus communis]
 gi|223549458|gb|EEF50946.1| conserved hypothetical protein [Ricinus communis]
          Length = 348

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 84/190 (44%), Gaps = 20/190 (10%)

Query: 83  LSDSGLPPQKMAIQKVDVGERGL------VALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           +  +GLPP K+ +++    +  L       A ++++ G+    VP SLV+T +       
Sbjct: 1   MHKNGLPPCKVVLKERPSHDAKLRPIHYVAASEDLQTGDVAFSVPNSLVVTLERVLGNET 60

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-------PYSLLYWTRAE 189
             E+L    + +   LA YL+ E    K S W  YI  L RQ         S L W+ AE
Sbjct: 61  VVELLTTNKLSELACLALYLMYEKKQGKKSFWYPYIRELDRQRGRGQLAVESPLLWSEAE 120

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRL------RIFSKYPDLFPEEVFNMETFKWSF 243
           L  YL  S  +   +ER   +   Y++L         +F +YP   P E F  E FK +F
Sbjct: 121 L-AYLTGSPTKAEVLERADGIKREYDELDTVWFMAGSLFQQYPYDIPTEAFPFEIFKQAF 179

Query: 244 GILFSRLVRL 253
             + S +V L
Sbjct: 180 VAIQSCVVHL 189


>gi|46136815|ref|XP_390099.1| hypothetical protein FG09923.1 [Gibberella zeae PH-1]
          Length = 484

 Score = 62.0 bits (149), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 11/190 (5%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDL--RLRI 221
           S+ W+ Y+  LPR       W+  E++R L      E A+E +  ++   + DL  +  +
Sbjct: 132 STPWTEYLKFLPRDVPVPTMWS--EVERALLQGTSLEAALEAKFASLSKEFEDLTDKSSV 189

Query: 222 FSKYPDLFPEE-VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
              +  LF E+    ++ +        SR + LP   G  A+VP  DM NHS     + D
Sbjct: 190 LPFWNSLFWEKGTVTIQDWILVDAWYRSRCLELPR--GGDAMVPGLDMANHSHHPTAYYD 247

Query: 281 YDKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
            D     V+    +     GE+V ISYG K+  E+L SYGF+  E T   + + LP+ + 
Sbjct: 248 EDDKDDVVLLVRPETTVSAGEEVNISYGDKNPAEMLFSYGFIDNEST--VEGLNLPVKVL 305

Query: 340 KSDKCYKEKL 349
             D   K KL
Sbjct: 306 PDDPLGKAKL 315


>gi|73950321|ref|XP_544379.2| PREDICTED: N-lysine methyltransferase SETD6 isoform 2 [Canis lupus
           familiaris]
          Length = 453

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 128/303 (42%), Gaps = 35/303 (11%)

Query: 73  LENASTLQKWLSDSGLP--PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           L+  +    W    GL   P+    ++  V   G+VA ++++ GE L  VP + ++   S
Sbjct: 23  LDPVAGFLSWCPQVGLELIPKVTVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---S 79

Query: 131 KWSCP-------EAGEVLKQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPR--QPY 180
           + +C        E G +  Q   VP    L   L + AS      WS Y +  P   +  
Sbjct: 80  QHTCSIGGLLERERGALQSQSGWVPLLLALLHELQTPASL-----WSPYFALWPELGRLE 134

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
             ++W   E  + L+ + + E   + + N+   Y  + L     +PDLF   V +++ ++
Sbjct: 135 HPMFWPEEERRQLLQGTGVPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLDLYR 194

Query: 241 WSFGILFSRLVRLPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
               ++ +   + P  +           +VP AD+LNH       L+Y  +   +V T  
Sbjct: 195 QLVALVMAYSFQEPLEEEDDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-- 252

Query: 294 RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLE 350
              QP   G ++F +YG+ +N +L+  YGF      N  D+ ++ +   +       K+E
Sbjct: 253 ---QPIPKGHEIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVREAALQGTKVE 309

Query: 351 ALR 353
           A R
Sbjct: 310 AER 312


>gi|68467835|ref|XP_722076.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|68468152|ref|XP_721915.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46443858|gb|EAL03137.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
 gi|46444024|gb|EAL03302.1| potential protein lysine methyltransferase [Candida albicans
           SC5314]
          Length = 433

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 65/276 (23%), Positives = 111/276 (40%), Gaps = 36/276 (13%)

Query: 80  QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------ 127
           +K +S+      K+ ++ V    RG+ A++ ++KGE +L +P S ++             
Sbjct: 22  EKKISNHTYISPKIDVKDVRSSGRGIYAVEPLKKGELILNIPHSFLLNFTTVMAHIAKYN 81

Query: 128 ---ADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYI 172
               DS    P      E  E+ +  +      +  + LL+ YL  E      S W  ++
Sbjct: 82  GMAIDSHIHVPFDKSEDEYTEIYRTLTKEEILELSSFQLLSLYLTFERKRSHKSFWKPFL 141

Query: 173 SALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPD- 227
             LP    + L+   W +        ++++R + +  R  N      +L      K  D 
Sbjct: 142 DMLPSMDDFELMPIDWPQEVCTLLPSSTEVRNKKVRSRFDNDYQVICELIKTKIDKDGDV 201

Query: 228 --LFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
             L P +   +     +   L+  L    +      + P+ D +NHSC+    L  D   
Sbjct: 202 TTLLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDDHCTLKID--G 259

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           +G    T  QY  G+QV++SYG  SN  LL  YGFV
Sbjct: 260 KGFQVRTTSQYNTGDQVYLSYGPHSNDFLLCEYGFV 295


>gi|345794208|ref|XP_003433871.1| PREDICTED: N-lysine methyltransferase SETD6 isoform 1 [Canis lupus
           familiaris]
          Length = 476

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 67/285 (23%), Positives = 122/285 (42%), Gaps = 33/285 (11%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP-------EAGEVL 141
           P Q    ++  V   G+VA ++++ GE L  VP + ++   S+ +C        E G + 
Sbjct: 64  PVQVTVSRQGTVAGYGMVARESVQPGELLFAVPRAALL---SQHTCSIGGLLERERGALQ 120

Query: 142 KQCS-VPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQ 198
            Q   VP    L   L + AS      WS Y +  P   +    ++W   E  + L+ + 
Sbjct: 121 SQSGWVPLLLALLHELQTPASL-----WSPYFALWPELGRLEHPMFWPEEERRQLLQGTG 175

Query: 199 IRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD- 257
           + E   + + N+   Y  + L     +PDLF   V +++ ++    ++ +   + P  + 
Sbjct: 176 VPEAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLDLYRQLVALVMAYSFQEPLEEE 235

Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGK 308
                     +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YG+
Sbjct: 236 DDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPKGHEIFNTYGQ 290

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
            +N +L+  YGF      N  D+ ++ +   +       K+EA R
Sbjct: 291 MANWQLIHMYGFAEPYPDNTDDTADIQMVTVREAALQGTKVEAER 335


>gi|393245275|gb|EJD52786.1| SET domain-containing protein [Auricularia delicata TFB-10046 SS5]
          Length = 519

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 80/160 (50%), Gaps = 20/160 (12%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY----- 155
           G RGLVA+K I+ GE L  VP +L+++       P   ++ +     DW  L  +     
Sbjct: 29  GGRGLVAVKEIQVGETLFAVPRTLLLS-------PRTCQLPQLIGAQDWKRLNLHKGWSG 81

Query: 156 ----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
               ++ E +   +S+W+ Y +A+P +  +L++WT  EL+  L+ S I E+  +   +V 
Sbjct: 82  LILCMLWEEAQGPASQWAGYFAAMPTEFSTLMFWTPEELED-LKGSSITEKIGKE--DVE 138

Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
             Y+D  L      PDLFP E  +  T +  F I  SR++
Sbjct: 139 SEYHDRVLPAVKARPDLFPPEQADRYTLE-RFHIAGSRIL 177


>gi|302900929|ref|XP_003048357.1| hypothetical protein NECHADRAFT_106330 [Nectria haematococca mpVI
           77-13-4]
 gi|256729290|gb|EEU42644.1| hypothetical protein NECHADRAFT_106330 [Nectria haematococca mpVI
           77-13-4]
          Length = 460

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 86/196 (43%), Gaps = 36/196 (18%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRER 202
           L+  YL+ + SF     WS YI ALP QP     W+         AEL    E + I E 
Sbjct: 91  LVKHYLLGDESF-----WSPYIRALP-QPEDQDSWSLPPFWPDDDAEL---FEGTNI-EV 140

Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSR-----LV---- 251
            + RI   +       L   +   D  PE  + F +  ++W++ I  SR     LV    
Sbjct: 141 GVGRIKADVKRDFKAALDALTA-EDWEPELRKGFTLGLYQWAYSIFSSRSFRPSLVLGPE 199

Query: 252 ---RLPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFIS 305
              RLP    +D    L+P  D+ NH    E   + D+  +       + Y+ GEQVF +
Sbjct: 200 DQKRLPEGVKIDDFSVLMPLFDVGNHDMRTEVRWELDEEKKHCSLKVSKAYEAGEQVFNN 259

Query: 306 YGKKSNGELLLSYGFV 321
           Y  K+N ELLL YGF+
Sbjct: 260 YSMKTNAELLLGYGFM 275


>gi|417410782|gb|JAA51857.1| Hypothetical protein, partial [Desmodus rotundus]
          Length = 447

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 68/283 (24%), Positives = 125/283 (44%), Gaps = 27/283 (9%)

Query: 88  LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           L P+    ++  V   G+VA + ++ G+ L  VP + ++   S+++C  +G + ++    
Sbjct: 34  LSPKVAVSRQGTVAGYGMVAREYVQPGDLLFAVPRAALL---SQYTCSISGLLERERGAL 90

Query: 144 CSVPDW-PLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIR 200
            S   W PLL   L    +   +S WS Y +  P   +    ++W   E  R L+ + + 
Sbjct: 91  QSQSGWVPLLLALLHELQA--PASPWSPYFALWPELGRLEHPMFWPEEERRRLLQGTGVP 148

Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
           E   + + N+   Y  + L     +PDLF   V ++E +     ++ +   + P  +   
Sbjct: 149 EAVEKDLANIRSEYYSIVLPFMEAHPDLFSPRVRSLELYHQLVALVMAYSFQEPLEEEED 208

Query: 261 A-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKS 310
                   +VP AD+LNH       L+Y  +   +V T     QP   G ++F +YGK +
Sbjct: 209 EKEPNPPLMVPAADILNHLANHNANLEYSSNCLRMVAT-----QPIPKGHEIFNTYGKMA 263

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           N +L+  YGFV     N  D+ ++ +   +       K +A R
Sbjct: 264 NWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKGDAER 306


>gi|320584053|gb|EFW98265.1| Nuclear protein that contains a SET-domain [Ogataea parapolymorpha
           DL-1]
          Length = 499

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 119/268 (44%), Gaps = 39/268 (14%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
           RGL A  +I+K   L  +    ++   TA      P   EVL+  ++  W  L   L  E
Sbjct: 38  RGLRARNDIQKDTVLFRLARDHILNIRTAALGKLKPGNQEVLE--TLNQWEALILCLAYE 95

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTY 214
               + SRWS+Y++ LP +  SL++W+  EL++ L+ S +     RE+A +  + ++  Y
Sbjct: 96  MMLGEESRWSSYLAVLPEKFNSLMFWSSEELEK-LKPSNVLQRIGREQAEQMYSKLVPEY 154

Query: 215 NDLRLRIFSK----YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-------------- 256
               LR+ SK    Y  +    V       +SF +                         
Sbjct: 155 C---LRLGSKKLVEYLTIDRFHVVASIIMSYSFDVDDPEDDPEDDEDEEEDFDEIEQECI 211

Query: 257 --DGRV-ALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
             DG + ++VP AD LN +   V   L Y+  +  +V T  +  + GEQ++  YG+  N 
Sbjct: 212 KYDGYLKSMVPLADTLNSNTNLVNANLSYE--NDALVMTATKDIKKGEQIYNIYGELPNS 269

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKK 340
           E+L  YG+V    +   +  ELPL++ K
Sbjct: 270 EILRKYGYVELPAS-KYEFAELPLTVIK 296


>gi|303272215|ref|XP_003055469.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463443|gb|EEH60721.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 468

 Score = 61.6 bits (148), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 102/249 (40%), Gaps = 44/249 (17%)

Query: 130 SKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSL-LYWTR 187
           S  +C  A E L+   +     L   ++ E +    SRW +Y + LP R   +L ++WT 
Sbjct: 80  SARTCSVAKE-LRDARLGGGLALNVAVMVERALGSESRWRDYFAVLPSRGERTLPMFWTE 138

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNMETFKWSFGIL 246
           A L+  L+ + +     E   N+   Y++  +  +   +P+ F  E    E +  +  + 
Sbjct: 139 ARLE-ALKGTDLATHVREDAENLRADYDEEVVNGLCVAHPEKFRREELTFERYLEAASLS 197

Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK----------------------- 283
            SR   +    G  ALVPWADM NH  + ET                             
Sbjct: 198 ASRAFYIGEECGE-ALVPWADMFNHRTDDETVRVLGADEEEEEDEEEEEEEEEDEEEDDE 256

Query: 284 --------------SSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
                         + QG +V  T +    G+++F ++G+++N  LL  YGF  R   + 
Sbjct: 257 EEDDDAAAPPPPLTTPQGALVIHTHKAVSKGDELFNTFGQQNNASLLHKYGFCERGNAHA 316

Query: 329 SDSVELPLS 337
           + +V+L L+
Sbjct: 317 TIAVDLALA 325


>gi|302307608|ref|NP_984333.2| ADR237Cp [Ashbya gossypii ATCC 10895]
 gi|299789080|gb|AAS52157.2| ADR237Cp [Ashbya gossypii ATCC 10895]
 gi|374107548|gb|AEY96456.1| FADR237Cp [Ashbya gossypii FDAG1]
          Length = 574

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 86/182 (47%), Gaps = 32/182 (17%)

Query: 171 YISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL----RLRIFSK 224
           Y+S LP  ++ ++  +WT +EL   L    I  +A + +  ++  +++L     LR  +K
Sbjct: 108 YLSVLPTHKEMHTPYFWTNSEL-LLLRGMDIYLKAKKNLRQLVNEWHELVTAGELRNDTK 166

Query: 225 YPDLF-PEEVFN-------------------METFKWSFGILFSR----LVRLPSMDGRV 260
           + DLF   E F+                      + W+  I  SR    L+   + D   
Sbjct: 167 FYDLFNSSENFDAGEYISNQLADPTTTDWTDFPAYLWASSIFSSRAFPTLILGTTTDLNE 226

Query: 261 ALV-PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           A + P  D+LNHS        Y++    V F+T +  + G++++ +YG KSN ELLL+YG
Sbjct: 227 AFLNPIIDLLNHSAGTNVTWSYNEQVAAVTFSTAQTLETGDELYNNYGDKSNDELLLNYG 286

Query: 320 FV 321
           FV
Sbjct: 287 FV 288


>gi|395508683|ref|XP_003758639.1| PREDICTED: N-lysine methyltransferase SETD6 [Sarcophilus harrisii]
          Length = 396

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 85/189 (44%), Gaps = 26/189 (13%)

Query: 166 SRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAIER-ITNVIGTYNDLRL 219
           S W  Y S  P     R P   ++W+  E  + L+ + + E A+ER + ++   Y  + L
Sbjct: 59  SPWKGYFSLWPELGSLRHP---MFWSEEERKQLLQGTGVPE-AVERDLASISYEYGTIVL 114

Query: 220 RIFSKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
                +PD+FP +  ++E ++        +SF                  +VP AD+LNH
Sbjct: 115 PFLEAHPDVFPLQAQSLELYRQLVAMVMAYSFQEPLEEEEEEEEEPNPPMMVPAADILNH 174

Query: 272 SCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNP 328
                  L+Y      +V T     QP   G+++F +YG+ +N +L+  YGF      N 
Sbjct: 175 VANHNANLEYSPECLKMVAT-----QPIPKGQEIFNTYGQMANWQLIHMYGFAEPYPGNT 229

Query: 329 SDSVELPLS 337
           +DS ++ ++
Sbjct: 230 NDSADIQMA 238


>gi|46130858|ref|XP_389160.1| hypothetical protein FG08984.1 [Gibberella zeae PH-1]
          Length = 1000

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 64/267 (23%), Positives = 114/267 (42%), Gaps = 34/267 (12%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKW---SCPEAGEVLK--QCSVP---DWPLLAT 154
           RG++ALK+I     L  +P   +I  ++       P+  ++ K  +  VP    W  L  
Sbjct: 576 RGIIALKDIPAETTLFTIPRKGIINTETSELPKKIPDVFDLDKPDEDDVPGLDSWSSLIL 635

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
            +I E     SS+W +Y   LP    + ++W+  ELD+ L+AS +R +  +   +    +
Sbjct: 636 IMIYEYLQGDSSQWKSYFDVLPSSFDTPMFWSENELDQ-LQASHMRHKIGK--ADAEDMF 692

Query: 215 NDLRLRIFSKYPDLFPEE------------VFNMETFKWSFGILFSRLVRLP------SM 256
               + I    P +F  E                    ++F +                 
Sbjct: 693 KKTLVPIIRSNPSIFNAENRSDYELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDR 752

Query: 257 DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
           +G+  + +VP AD+LN   E    +++++ S  +  T+ R  + GE++   YG   N EL
Sbjct: 753 EGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGEEILNYYGPHPNSEL 810

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
           L  YG+V  E  +  D VE+P  + +S
Sbjct: 811 LRRYGYV-TEKHSRYDVVEIPWDIVES 836


>gi|358332734|dbj|GAA51355.1| SET domain-containing protein 4 [Clonorchis sinensis]
          Length = 493

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%)

Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           R+  +P      AL+P+ D LNH   V++ L+ D++ + +     + + PGEQV I+YG 
Sbjct: 234 RIKLIPDRYSDTALIPFFDFLNHCPLVDSRLEVDRTGKAIQLFVQQSFGPGEQVLINYGP 293

Query: 309 KSNGELLLSYGFVPREGTNPSDSV 332
             N  L + YGF      NP ++V
Sbjct: 294 HDNLTLFIEYGFSLLPSENPHNAV 317


>gi|428176276|gb|EKX45161.1| hypothetical protein GUITHDRAFT_139093 [Guillardia theta CCMP2712]
          Length = 281

 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 69/255 (27%), Positives = 109/255 (42%), Gaps = 21/255 (8%)

Query: 78  TLQKWLSDSG--LPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADSKWS 133
            LQ W+S +G  + P   A Q  D+   GL     K++R+GE ++ +PP L ++ +    
Sbjct: 27  ALQTWISSNGGSIHPSVCAKQAGDMQGVGLFVKEGKSVRRGEVMVSIPPKLHLSYEKVVG 86

Query: 134 CPEAGEVLKQCSVPDWPL----LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRA 188
                 + K+     W +        + S AS  +  +W  Y+ +LP+   +L +++  A
Sbjct: 87  KDLNTLIDKEVPAEKWDVKLALALLSVASSASAAEGQQWGPYLESLPQTLNNLPIFYKGA 146

Query: 189 ELDRYLEASQIRERAIERITN-VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILF 247
            L         +E     I++ V+G    L+    S        E  ++    W++GI  
Sbjct: 147 ALKE-------KEETYPGISSEVVGRAALLKTVSSSLANAHACLEGLSVRRLAWAYGIAT 199

Query: 248 SRLVRL-PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQ-GVVFTTDRQYQPGEQVFIS 305
           SR VRL    DG   L+P+ D  NH  E    +    SS         R     EQ+ I 
Sbjct: 200 SRSVRLDKKRDG--LLLPFVDFANHDFEPNAQIRRSGSSSPSAELVAQRDLSASEQITIC 257

Query: 306 YGKKSNGELLLSYGF 320
           YG   N ELLL+YGF
Sbjct: 258 YGNLGNQELLLNYGF 272


>gi|440464611|gb|ELQ34010.1| hypothetical protein OOU_Y34scaffold00824g3 [Magnaporthe oryzae
           Y34]
          Length = 373

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 82/303 (27%), Positives = 128/303 (42%), Gaps = 45/303 (14%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLV--ITADSKWSC 134
           S L  W S  GL    +    ++   +G+VA + ++  E +L  P   +  IT  S+   
Sbjct: 5   SELLGWASAEGLILNGIQPAWINGCGKGIVACRELKAEEAILIAPIQAIRSITTVSR--- 61

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSR---WSNYISALPRQPYSLLYWTRAELD 191
               +++K+   P  PL    L +E +   +S    W   + A+     +L +    EL 
Sbjct: 62  ----DLIKRLP-PSLPLHGI-LAAELALTDTSTPSPWQKSLPAMADITATLPFMWPKELQ 115

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL- 250
           + L  S     A   + N    YN     +    P +      + E F++ + I+ +R  
Sbjct: 116 KLLPTS-----ARVFLENQQTKYNHEWNTVSQAMPSI------SEERFQYYWHIVNTRTF 164

Query: 251 ------VRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                     S + R+ALVP AD+ NH+   C V    ++       V TTDR Y+ GE+
Sbjct: 165 LYEVSETECYSWEDRLALVPLADIFNHADEGCRVSYMPEH------YVITTDRAYEAGEE 218

Query: 302 VFISYGKKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           +FISYG  SN  LL  YGF+    R      D V LP  L +S K    + + L  Y L 
Sbjct: 219 LFISYGDHSNDCLLTEYGFLLPKNRWDIICIDEVVLP-RLDESAKELLRQRDLLGDYTLH 277

Query: 359 ASE 361
           A +
Sbjct: 278 AEK 280


>gi|255719552|ref|XP_002556056.1| KLTH0H04004p [Lachancea thermotolerans]
 gi|238942022|emb|CAR30194.1| KLTH0H04004p [Lachancea thermotolerans CBS 6340]
          Length = 585

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 83/315 (26%), Positives = 127/315 (40%), Gaps = 69/315 (21%)

Query: 74  ENASTLQKWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           E    L  W  D+G+  P    ++ V+VG +G   +      E    +P SL+I +    
Sbjct: 4   EKLKVLLDWGLDNGVKCPDD--VEFVNVGGKGFACIAKSDITEAEFIIPESLIIKSSLAV 61

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSS----------RWSNYISALPRQPYSL 182
           S  +      Q S   W  L   LI++  F+KSS          +++ YI ALP +  S 
Sbjct: 62  SFFKVNS--NQTS---WLKL---LIAKLKFDKSSTTVDDENLKAKFAPYIDALPDEIDSP 113

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME----- 237
           L W  +ELD  L  + +R     ++ ++   +  L +    K+ +    E+ N+E     
Sbjct: 114 LVWNPSELD-LLGNTNLRSSLRIKLYSIFNEWK-LIMETLKKHRNEVQAEILNIEETLGQ 171

Query: 238 --------------------------TFKWSFGILFSR----LVRLPSMD-GRVALVPWA 266
                                      F WS  +  SR     V  PS D   V L+P  
Sbjct: 172 SEDHVYRNITSKVFQHSSETDWWSFPAFLWSHMMFLSRAFPEYVINPSTDPSNVVLLPII 231

Query: 267 DMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPR 323
           D+LNH    +   +    + GV     R+ +    GE++F +YG K N ELL  YGFV  
Sbjct: 232 DLLNHDYRSKVEWNQRDGAFGV-----RKLETVLRGEEIFNNYGGKGNEELLSGYGFVLE 286

Query: 324 EGTNPSDSVELPLSL 338
           E  N  D+V L + L
Sbjct: 287 E--NIFDTVALKIQL 299


>gi|146419922|ref|XP_001485920.1| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 592

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 89/311 (28%), Positives = 129/311 (41%), Gaps = 57/311 (18%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEK-LLFVPPSLVITADSKW-S 133
            + L +W    GL   +  I+   +GE    A      GEK  + +P  L IT DS   S
Sbjct: 2   VAELVQWAKTQGLELNE-GIEFRGIGENNTGAFYTTNNGEKPYIRLPVELAITVDSALRS 60

Query: 134 CPEAGEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELD 191
             +  E L+ QC   +  +L   L  E S  K+S    Y+  LP  Q  +  Y   AE  
Sbjct: 61  FGQDLEALRDQCDSSN-TVLKLCLARERSRLKNSTIKKYLECLPTLQQMNTPYCWDAETK 119

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF--PEEVF-NME----------- 237
           RYL+ + +     E I  ++  +     +I +  PDL   PE+ F NM+           
Sbjct: 120 RYLQGTNLGSSLKENIGVLVEEW----WKIINLLPDLVQKPEQHFVNMKYYYESKFYTDD 175

Query: 238 -------------------TFKWSFGILFSR----LVRLPSMDGRV-----ALVPWADML 269
                               F W+  IL SR     +   ++D  V      L+P  D+L
Sbjct: 176 DAYAYFVTNEDPANWTSFPNFLWASIILKSRSFPAYLIADAVDWDVKRHDTMLLPVIDLL 235

Query: 270 NH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           NH  S  VE  L+  +S    VF +D   + G Q+F +YG K N ELLL+YGF   +  N
Sbjct: 236 NHLPSAHVEWGLERKESKSYFVFKSD-DVKSGSQLFNNYGMKGNEELLLAYGFCLED--N 292

Query: 328 PSDSVELPLSL 338
            SD   L + +
Sbjct: 293 SSDVSALKIKV 303


>gi|449456212|ref|XP_004145844.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Cucumis sativus]
          Length = 483

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 77/279 (27%), Positives = 122/279 (43%), Gaps = 53/279 (18%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV-LKQC-SVPDWPLLATYL 156
           D G RGL A++ ++KGE +L  P S+++T  S     E  ++ LK+  S+     L   L
Sbjct: 43  DTGGRGLAAVRQLKKGELVLRAPKSILLTTQSLSLEDEKLDMALKRYPSLSSTQKLTFCL 102

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           + E S   SS W  Y+  LP Q Y +L  T  E ++  +A Q+           + +  D
Sbjct: 103 LYEISKGPSSWWFPYLKHLP-QSYDILA-TFGEFEK--QALQVDYAIWATEKAALKSRTD 158

Query: 217 LRLRIFSKYPDLFPEEVF--NMETFK---WSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
            R         L  E      ++TFK   W+   + SR + +P  D    L P  D+ N+
Sbjct: 159 WR-----GVEGLMQESNIKSQLQTFKAWLWASATISSRTLYVP-WDEAGCLCPVGDLFNY 212

Query: 272 SC-EVETF--LD-------------------------------YDKSSQGVVFTTDRQYQ 297
           +  E E+F  +D                               +++++    F     Y+
Sbjct: 213 AAPEGESFNAVDVLSFPSHASLNDELELLEEQRDSQWALTDGGFEENASAYCFYARESYR 272

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            GEQV +SYG  +N ELL  YGF+ +E  NP+D V +P+
Sbjct: 273 KGEQVLLSYGTYTNLELLEYYGFLLQE--NPNDKVFIPI 309


>gi|358399747|gb|EHK49084.1| hypothetical protein TRIATDRAFT_213818 [Trichoderma atroviride IMI
           206040]
          Length = 378

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/115 (39%), Positives = 60/115 (52%), Gaps = 16/115 (13%)

Query: 257 DGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           + R+AL+P AD+ NH+   C V        S +G     DR Y+ GE+++ISY   SN  
Sbjct: 177 EDRLALIPVADLFNHADAGCRVYY------SPEGYHIVADRDYKRGEELYISYSSHSNDY 230

Query: 314 LLLSYGFVPREGTNPSDSVELP----LSLKKSDKCYKEKLEALRKYGLS-ASECF 363
            L+ YGFVP E  NPSD V +       L +S K   EK + L  Y L  A+E F
Sbjct: 231 NLVEYGFVPDE--NPSDDVYIDDVIFPKLSESQKADLEKRDLLGVYPLGEATEEF 283


>gi|452986759|gb|EME86515.1| hypothetical protein MYCFIDRAFT_131111 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 391

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 71/292 (24%), Positives = 122/292 (41%), Gaps = 46/292 (15%)

Query: 55  TKTTVTQNMIPWGCEI-DSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--LVALKNI 111
           TK   T + +    EI  S +      +W  D G+  Q  +++   +  RG  LV   NI
Sbjct: 8   TKRRRTSSGVAVDAEIRHSQDEHEHFTQWAKDRGV--QIGSVKPAHIAGRGVGLVTTANI 65

Query: 112 RKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP--LLATYLISEASFEKSSR-- 167
           ++ EKL+F  P + +T            ++ +C  P+ P  +  +       FE S    
Sbjct: 66  KQDEKLIFASPHVQLTL----------SIMAECEEPESPYNVWRSTWPGPQDFESSMPLF 115

Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
           WS+ +  L   P SL      +LD + + ++ R   +           +LR     K  D
Sbjct: 116 WSHKLRDL--LPPSLQQPLDRQLDDWRKDAEFRRTIVA----------NLRDNSARKEQD 163

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLP---SMDGRVALVPWADMLNHSCEVETFLDYDKS 284
                    + FK+ + I+ SR        +  G + L P+ D +NH     T ++  ++
Sbjct: 164 ---------DVFKYYWAIVNSRSFHFKPPGAKPGFMVLCPFIDYMNHGPS-GTGVNVRQT 213

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV--PREGTNPSDSVEL 334
           ++G   T +R Y  GE+V  +YG   N +LL+ YGF+   + G    D + L
Sbjct: 214 AKGYEVTANRDYVAGEEVLATYGAHPNDKLLVHYGFINSSKPGAPSDDDIRL 265


>gi|223994225|ref|XP_002286796.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220978111|gb|EED96437.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 346

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 102/246 (41%), Gaps = 58/246 (23%)

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
            G R +   +++  G+ LL +P S  ITAD+  S                  LA  L  E
Sbjct: 30  AGGRYVTCRQDVTAGDDLLQIPLSSCITADNLES------------------LAERLAYE 71

Query: 160 ASFEKSSRWSNYISALPR-QPYSLL----YWTRAELDRYLEASQIRERAIERITNVIGTY 214
                 S+++ YI+ LP  +  SLL    +W  + LD   +  Q+  R            
Sbjct: 72  RELGSKSKFTAYINVLPTLESKSLLELPRFWKGSRLDLVTDGGQLEAR------------ 119

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
                        +  +E  +++  +W+   + SR   L   D   ++ P  DM+NH   
Sbjct: 120 -------------MSKDERKDLD--QWALACVDSRANFLG--DEGYSMTPMLDMINHDAS 162

Query: 275 VETF--LDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNP 328
           V+T   ++ DK   G    +  T+ + Y  GE+ FISYG  +N + L  YGFV  +    
Sbjct: 163 VQTRARIEEDKGFAGDGDVLHLTSGKSYSKGEEAFISYGNLANLDTLADYGFVTEKNPCN 222

Query: 329 SDSVEL 334
            +S+E+
Sbjct: 223 VESIEV 228


>gi|42567909|ref|NP_197226.2| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
 gi|75271674|sp|Q6NQJ8.1|SDG40_ARATH RecName: Full=Protein SET DOMAIN GROUP 40
 gi|34222078|gb|AAQ62875.1| At5g17240 [Arabidopsis thaliana]
 gi|51969984|dbj|BAD43684.1| unknown protein [Arabidopsis thaliana]
 gi|332005020|gb|AED92403.1| protein SET DOMAIN GROUP 40 [Arabidopsis thaliana]
          Length = 491

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 87/318 (27%), Positives = 129/318 (40%), Gaps = 63/318 (19%)

Query: 61  QNMIPWGCEI---DSLENA----STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
           +  + W  EI   DS++++    S L   LS S  P         D G RGL A + ++K
Sbjct: 9   ETFLRWAAEIGISDSIDSSRFRDSCLGHSLSVSDFP---------DAGGRGLGAARELKK 59

Query: 114 GEKLLFVPPSLVITADSKWS--CPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNY 171
           GE +L VP   ++T +S  +     +  V    S+    +L+  L+ E S EK S W  Y
Sbjct: 60  GELVLKVPRKALMTTESIIAKDLKLSDAVNLHNSLSSTQILSVCLLYEMSKEKKSFWYPY 119

Query: 172 ISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE 231
           +  +PR  Y LL    A    + + +   E A+                  S   +L  +
Sbjct: 120 LFHIPRD-YDLL----ATFGNFEKQALQVEDAVWATEKATAKCQSEWKEAGSLMKELELK 174

Query: 232 EVF-NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG--- 287
             F + + + W+   + SR + +P  D    L P  D+ N+        DY  + QG   
Sbjct: 175 PKFRSFQAWLWASATISSRTLHVP-WDSAGCLCPVGDLFNYDAPG----DYSNTPQGPES 229

Query: 288 ---------VVFT-----TD---------------RQYQPGEQVFISYGKKSNGELLLSY 318
                    VV T     TD               R YQ GEQV + YG  +N ELL  Y
Sbjct: 230 ANNVEEAGLVVETHSERLTDGGFEEDVNAYCLYARRNYQLGEQVLLCYGTYTNLELLEHY 289

Query: 319 GFVPREGTNPSDSVELPL 336
           GF+  E +N  D V +PL
Sbjct: 290 GFMLEENSN--DKVFIPL 305


>gi|241956097|ref|XP_002420769.1| ribosomal N-lysine methyltransferase, putative [Candida
           dubliniensis CD36]
 gi|223644111|emb|CAX41854.1| ribosomal N-lysine methyltransferase, putative [Candida
           dubliniensis CD36]
          Length = 435

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 117/290 (40%), Gaps = 37/290 (12%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI--- 126
           +D ++N    +K  + + + PQ + ++ V    RG+ A++ ++K E +L +P S ++   
Sbjct: 13  LDWVKNTDDEKKISNHTYISPQ-IDVKDVRSSGRGIYAVRPLKKAELILNIPHSFLLNFT 71

Query: 127 ------------TADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASF 162
                       T DS    P      E  E+ +  +      +  + LL+ YL  E   
Sbjct: 72  TVMAHIAKYNGMTIDSHIHVPFDKHKDEYTEIYRMLTKEEILDLSSFQLLSLYLTFERRR 131

Query: 163 EKSSRWSNYISALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLR 218
              S W  ++  LP  + + L+   W          ++ +R R +  R  N      +L 
Sbjct: 132 SSKSFWKPFLDMLPSMEDFELMPIDWPHEIYTLLPSSTGVRNRKVRSRFENDYRVICELI 191

Query: 219 LRIFSKYPD---LFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
                K  D   L P +   +     +   L+  L    +      + P+ D +NHSC+ 
Sbjct: 192 KTKIDKAGDVTTLLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDD 251

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
              L  D   +G    T  QY  G+QV++SYG  SN  LL  YGFV  E 
Sbjct: 252 HCTLKID--GKGFQVRTTSQYNIGDQVYLSYGPHSNEFLLCEYGFVIPEN 299


>gi|255637489|gb|ACU19071.1| unknown [Glycine max]
          Length = 497

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/284 (22%), Positives = 119/284 (41%), Gaps = 59/284 (20%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
           G RGL A++++R+GE +L VP S ++T ++     +  + V +  S+    +L   L+ E
Sbjct: 51  GGRGLGAVRDLRRGEIVLRVPKSALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYE 110

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL----EASQIRERAIERITNVIGTYN 215
               K+SRW  Y+  LP   Y +L     E +++     EA  + E+A+ +  +     +
Sbjct: 111 MGKGKTSRWHPYLMHLP-HTYDVLA-MFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAH 168

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC-- 273
            L   +      +F  + F  + +  +   + SR + +P  D    L P  D+ N+    
Sbjct: 169 SLMQDL------MFKPQFFTFKAWVRAAATISSRTLHIP-WDEAGCLCPVGDLFNYDAPG 221

Query: 274 -------EVETFLD----------------------------------YDKSSQGVVFTT 292
                  +++  L                                   +++ +    F  
Sbjct: 222 IEPSGIEDLDRLLSNTSIPDTIVLNGDKNIVVDAEQLDSHSWRLTDGGFEEDANAYCFYA 281

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
              Y+ G+QV + YG  +N ELL  YGF+ +E  NP+D V +PL
Sbjct: 282 REHYKKGDQVLLCYGTYTNLELLEHYGFLLQE--NPNDKVFIPL 323


>gi|255720552|ref|XP_002556556.1| KLTH0H16126p [Lachancea thermotolerans]
 gi|238942522|emb|CAR30694.1| KLTH0H16126p [Lachancea thermotolerans CBS 6340]
          Length = 571

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 51/97 (52%), Gaps = 8/97 (8%)

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L P  D LNH    +     +K   GV F++  Q + G+++F +YG KSN ELLL+YGF 
Sbjct: 224 LYPIVDFLNHHSGQKVQWQLNKDRNGVSFSSGNQIEKGQEIFNNYGDKSNEELLLNYGFA 283

Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLS 358
            +   N S ++ L L           +LE+L+ Y L+
Sbjct: 284 IQNNMNDSSTLTLRLP--------PGQLESLKSYDLT 312


>gi|330924929|ref|XP_003300837.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
 gi|311324820|gb|EFQ91062.1| hypothetical protein PTT_12198 [Pyrenophora teres f. teres 0-1]
          Length = 372

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/77 (42%), Positives = 45/77 (58%), Gaps = 10/77 (12%)

Query: 252 RLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
           RLP  D R+A++P AD+ NH+   CE +       +S+   F  DR Y+ GE+++ISYG 
Sbjct: 173 RLPH-DDRLAILPVADLFNHADVGCEAQF------ASENYSFIADRTYRAGEELYISYGT 225

Query: 309 KSNGELLLSYGFVPREG 325
            S   LL  YGFVP E 
Sbjct: 226 HSTDFLLAEYGFVPAEN 242


>gi|301122457|ref|XP_002908955.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099717|gb|EEY57769.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 423

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/166 (27%), Positives = 79/166 (47%), Gaps = 4/166 (2%)

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L++E + + +S +  YI  LP      L W   +  + L+ +       +++  V+  Y 
Sbjct: 94  LLAELARKDTSDFHGYIQQLPTAISLPLSWDENQ-RKMLKDTTAFPILDDKL--VLKLYE 150

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
           D  +   +++P ++P EV  ++ F+W++ I+ SR  ++ +      L+P  DM NHS E 
Sbjct: 151 DYAVPFANEFPVIWPTEVSTLKKFQWAYSIVSSRAFKVAN-GLEPTLLPVIDMANHSAEN 209

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
                    S        R+ +  E V ISYG  SN +LL  YGFV
Sbjct: 210 PAAHIVKTESGSFQLVALREVEKKEPVTISYGDLSNAQLLCRYGFV 255


>gi|195018080|ref|XP_001984717.1| GH16622 [Drosophila grimshawi]
 gi|193898199|gb|EDV97065.1| GH16622 [Drosophila grimshawi]
          Length = 455

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 72/275 (26%), Positives = 119/275 (43%), Gaps = 49/275 (17%)

Query: 103 RGLVA-LKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW----------PL 151
           RGL +  +  R G++L+ +P   +I+  +     E+ E  K    PD            L
Sbjct: 51  RGLCSKTQCFRAGDELIRLPAGCLISIAT----LESDEEFKALFDPDLFDKDSRISFQAL 106

Query: 152 LATYLISEASFEKS---SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
           +A YL+      ++   S ++ Y+ +LPR   +  +   +EL    EA  I ER + +  
Sbjct: 107 IACYLLHLQHLHEARQESPFAAYLDSLPRSYTTPYFCAVSELQCLPEA--ILERTVSQNR 164

Query: 209 NVIGTYNDLRLRIFSKYPDL----FPEEVFNMETFKWSFGILFSRLVRLPSMD---GR-- 259
            +   Y  L+  + +++       + EE++ +  ++ ++  + SR V L S     GR  
Sbjct: 165 QIRDCYQVLKSLVGAQHCQCCGQRYCEEIWTLAEYRRAYFAVNSRSVYLSSRQLYTGRSH 224

Query: 260 ----------VALVPWADMLNHSCEVETFLDYD----KSSQGVVFTTDR----QYQPGEQ 301
                     +AL P+ D+ NHS  V+T  +         Q  V T D     Q +P EQ
Sbjct: 225 FQELLSGTNNLALAPFLDLFNHSDTVQTTAELQLLASSKCQDYVLTLDSLAAAQLKPYEQ 284

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           +FISYG   N +LL  YGF  +   N  D  E  L
Sbjct: 285 LFISYGALPNLKLLTEYGFYLKR--NAHDYFEFSL 317


>gi|78097104|ref|NP_001030295.1| N-lysine methyltransferase SETD6 [Mus musculus]
 gi|81904260|sp|Q9CWY3.1|SETD6_MOUSE RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
 gi|12845648|dbj|BAB26837.1| unnamed protein product [Mus musculus]
 gi|74198625|dbj|BAE39788.1| unnamed protein product [Mus musculus]
 gi|148679234|gb|EDL11181.1| RIKEN cDNA 0610039J04 [Mus musculus]
 gi|187951385|gb|AAI39199.1| SET domain containing 6 [Mus musculus]
 gi|187952351|gb|AAI39200.1| SET domain containing 6 [Mus musculus]
          Length = 473

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 118/274 (43%), Gaps = 27/274 (9%)

Query: 81  KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           +W    GL   P+    ++  V   G+VA +++R GE L  VP S ++   S  +C  +G
Sbjct: 51  RWCRRVGLELSPKVTVSRQGTVAGYGMVARESVRAGELLFAVPRSALL---SPHTCSISG 107

Query: 139 EVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
            + ++     S+  W +     +       +S WS Y +  P   +    ++W   E  R
Sbjct: 108 LLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERLR 166

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            L+ + + E   + + N+   Y  + L     + DLF   V ++E ++    ++ +   +
Sbjct: 167 LLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPSVRSLELYQQLVALVMAYSFQ 226

Query: 253 LPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQV 302
            P  +           +VP AD+LNH       L+Y      +V T     QP   G ++
Sbjct: 227 EPLEEDDDEKEPNSPLMVPAADILNHIANHNANLEYSADYLRMVAT-----QPILEGHEI 281

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           F +YG+ +N +L+  YGF      N  D+ ++ +
Sbjct: 282 FNTYGQMANWQLIHMYGFAEPYPNNTDDTADIQM 315


>gi|448092000|ref|XP_004197467.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|448096594|ref|XP_004198498.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|359378889|emb|CCE85148.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
 gi|359379920|emb|CCE84117.1| Piso0_004720 [Millerozyma farinosa CBS 7064]
          Length = 595

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 70/230 (30%), Positives = 94/230 (40%), Gaps = 45/230 (19%)

Query: 152 LATYLISEASFEKSSRWSN--YISALPRQPYSLLYWTRAE------------LDRYL--- 194
           L T LIS+  F+KS   S   YI  LP +     YW   E            LDR     
Sbjct: 80  LTTLLISKLKFDKSCEHSFGPYIDILPDKLSLPFYWNHQERSLVEDTDLKVILDRNFQKL 139

Query: 195 --EASQIRERAIERITNV---IGTYNDLRLR---IFSKYPDLFPEEVFN--------MET 238
             E   + E  I++  ++    G   DL      I  KY +    E  N           
Sbjct: 140 VEEWHSLVESLIDKEKHLSFEAGLKADLNFYEEYITGKYDEYRLYEYLNKKIQSWTSFSA 199

Query: 239 FKWSFGILFSRLV--RLPSMDG------RVALVPWADMLNHSCE--VETFLDYDKSSQGV 288
           + WS  IL SR     L + D       +  L+P  D+LNH     +      +  +  V
Sbjct: 200 YVWSRSILMSRGFPYLLVAEDNSKPNLTKACLIPLFDILNHKSNSPIRWTPVMESGTGNV 259

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           +F  +R  + GEQ+F +YG KSN ELLLSYGF   E  NP DS  + L +
Sbjct: 260 IFQLERGVKKGEQLFNNYGNKSNCELLLSYGFA--EEKNPHDSASITLKI 307


>gi|294948379|ref|XP_002785721.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
 gi|239899769|gb|EER17517.1| hypothetical protein Pmar_PMAR008080 [Perkinsus marinus ATCC 50983]
          Length = 353

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 103/230 (44%), Gaps = 32/230 (13%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
           G  G  A  +I +GE+LLFVP S  +T       P   + L +  V    +LA  L+   
Sbjct: 41  GMIGCTATADICQGERLLFVPHSACVT-------PSGVQGLYEPQV----MLAASLVKHR 89

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
           + + +S + +Y+ +LP +    L W+  EL   L+ + + E             + L L 
Sbjct: 90  T-DPNSPFHDYLQSLPSEFEHPLEWSADEL-VCLKGTTVWE------------MHQLSLE 135

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +     +L P     M   +W+  ++ SR     S    + ++P AD  NHS   +    
Sbjct: 136 VVDSVAELCPNSPRAM--IRWAVEVMMSRA--FESEVCGLCVIPLADQFNHS-STKWHTR 190

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
             +  +G     ++  + GE++F +YG  +N  LLL++GF+  E  NP D
Sbjct: 191 VREVEEGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGFI--EFDNPHD 238


>gi|355718756|gb|AES06374.1| SET domain containing 4 [Mustela putorius furo]
          Length = 256

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 107/243 (44%), Gaps = 27/243 (11%)

Query: 147 PDWPLLA-----TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
           P  PLLA     T+L+SE      S W  Y+  LP+     +      ++ + E   ++ 
Sbjct: 3   PPSPLLALCTLCTFLVSEKHAGDQSLWKPYLDILPKAYTCPVCLEPKVVNLFPEP--LKA 60

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK---WSFGILFSRLVRLPSMDG 258
           +A E+   V G ++  R    S  P LF E V N+ ++    W++  + +R V +     
Sbjct: 61  KAEEQRARVQGFFSSSRDFFSSLQP-LFSEAVENIFSYSALLWAWCTVNTRAVYMKHGQR 119

Query: 259 RV--------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           +         AL P+ D+LNHS +V+    +++ ++     T    +  EQVFI YG   
Sbjct: 120 KCFSPEPDTYALAPYLDLLNHSPDVQVKAAFNEETRCYEVRTASGCRKHEQVFICYGPHD 179

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYKEKLEALRKYGLSASECFPIQI 367
           N  LLL YGFV  +  +    V   L +K    +DK   +K+  L+ +    +  F    
Sbjct: 180 NQRLLLEYGFVSIQNPHACVYVSADLLVKYLPSTDKQMNKKISILKDHDFIENLTF---- 235

Query: 368 TGW 370
            GW
Sbjct: 236 -GW 237


>gi|302793745|ref|XP_002978637.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
 gi|300153446|gb|EFJ20084.1| hypothetical protein SELMODRAFT_52721 [Selaginella moellendorffii]
          Length = 523

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 78/354 (22%), Positives = 127/354 (35%), Gaps = 68/354 (19%)

Query: 74  ENASTLQKWLSDSGLPPQKMAIQK-VDVGERGLVALKNIRKGEKLLFVPP---------- 122
           E      +W  + G+  +  AI++  D    GL    +  +G+ L F  P          
Sbjct: 3   ERLERFSRWSQEHGIQFRGCAIKRGSDAEGFGLYTQNDSARGDFLSFCAPLSTDFADVLV 62

Query: 123 ----SLVITADSKWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISAL 175
                L +T  +    P  G V ++     + D  L+  +LI E +  ++S W+ Y+  L
Sbjct: 63  VTPLDLALTPVTIVKDPVLGNVYREMLGNEIDDRLLVMIFLIIERARGRASFWAPYLEML 122

Query: 176 PRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN 235
           P    + L++   EL   L+ + + E A +       T+    + ++   PD   +    
Sbjct: 123 PSGFGTPLWFEDEEL-MELDGTTLFE-ATKAQVFFPSTFVSTCMSLYLFRPD---DRELE 177

Query: 236 METFKWSFGILFSRLVRLP--------------SMDGRV--------------------- 260
            + F W+  I ++R + +P                DG                       
Sbjct: 178 FQEFLWANCIFWTRALNIPCPASFVTSSSPEVAKDDGNRLVIYVLPHPFISCSAKDVSTI 237

Query: 261 ---ALVPWADMLNHSCEVETFLDYDKSS-------QGVVFTTDRQYQPGEQVFISYGKKS 310
               LVP  D  NH+       + D S          +    D  + PG +V I+YG K 
Sbjct: 238 WIEGLVPGIDFCNHTRRASGLWEIDGSDGSTSGVPHSMYLIADVVFPPGSEVLINYGDKG 297

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
           N ELL  YGFV  + +N    V  P      D     KL+ LR+  LS     P
Sbjct: 298 NEELLFLYGFVEEDNSNDYVMVHFPKMFLDEDNTMDFKLQLLRELDLSLQWLLP 351


>gi|303272869|ref|XP_003055796.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463770|gb|EEH61048.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 677

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 62/135 (45%), Gaps = 22/135 (16%)

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP-----SMDGRV-ALVPWADML 269
           DLR R  +K P   P     M+ F W++   +SR + LP        G V A+VP  D  
Sbjct: 327 DLRRRRATK-PSAKP---ITMDEFLWAYATFWSRALALPIGPDPEASGAVEAIVPGIDFA 382

Query: 270 NHSC-------EVETFLDYDKSSQG---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           NHSC        V      + ++ G   V         PGE+V ISYG K N ELL  +G
Sbjct: 383 NHSCARPNARWAVANASGREGATAGEPTVTLECLSVPGPGEEVLISYGDKPNEELLFVHG 442

Query: 320 FVPREGTNPSDSVEL 334
           F  RE  NP D++ L
Sbjct: 443 FAERE--NPHDALVL 455


>gi|440640494|gb|ELR10413.1| hypothetical protein GMDG_00825 [Geomyces destructans 20631-21]
          Length = 492

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 112/268 (41%), Gaps = 42/268 (15%)

Query: 81  KWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           KWL+  G+      ++   + D   R LVA  +  + E +  VP +  ++   K + PE 
Sbjct: 14  KWLNHVGVRISAKAELTCLRADGRGRALVAKGDFAEDELIFSVPRTSTLSV--KAALPEM 71

Query: 138 GEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
               +  S      +P W  L   +ISE      S+W+ Y + LP +  SL++W+  EL 
Sbjct: 72  LSGRQDISPEDIESMPGWAALTAVIISEG-LRPESKWAPYFNVLPTKLDSLVFWSPEELA 130

Query: 192 RYLEASQI-----RERAIERITNVI------GTYNDLRLRIFS-------KYPDLFPEEV 233
             L+AS +     +++A E     I      GT  D+  R+ S         PD+  E+ 
Sbjct: 131 E-LQASAVLKKVGKDKAEEIFHQSISKVTPEGTDVDIFHRVASTIMAYAFDIPDIEQED- 188

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
                     G     LV        +A++P ADMLN   +    L YD   + +   T 
Sbjct: 189 --------EEGANEDDLVDDDEQKTSLAMIPLADMLNADADNNARLHYD--GEELEMRTI 238

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFV 321
              + GE++   YG+    +LL  YG+V
Sbjct: 239 NPIKTGEEILNDYGQLPRSDLLRRYGYV 266


>gi|452982650|gb|EME82409.1| hypothetical protein MYCFIDRAFT_40308 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 449

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/284 (23%), Positives = 125/284 (44%), Gaps = 37/284 (13%)

Query: 69  EIDSLENAST-LQKWLSDSGL---PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSL 124
           +ID  ++ S     WL ++G    P  ++A  +     RG+VA  ++   E++  +P + 
Sbjct: 2   DIDDFQSMSDKFLTWLKNTGATISPKIQLADLRDRAAGRGVVATSDLTSDEEIFRIPRTS 61

Query: 125 VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLY 184
           ++T ++    P+  E+L+Q + P W  L   +I E     +SR+  Y+  LP    +L++
Sbjct: 62  ILTTETT-DLPQ--EILQQLTDP-WLSLILAMIFEYLLGTNSRFKPYLDILPESFNTLMF 117

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF----- 239
           WT  EL +YL+ S I  +  +   +   T+++  L I +K P++F     N +       
Sbjct: 118 WTDNEL-QYLQGSAILSKIGKEEAD--NTFSEQLLPIITKNPEIFKIGTCNNQDLLALCH 174

Query: 240 -------KWSFGILFSRLVRLPSMDGRV-------------ALVPWADMLNHSCEVETFL 279
                   ++F +         S +                AL+P ADMLN + ++ T  
Sbjct: 175 RMGSIIMSYAFDLDPPPTTTTSSSEEWESDSDSENEKISPKALIPLADMLNANGDL-TNS 233

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
               SS   +  T +    GE++   +G     +LL  YGFV +
Sbjct: 234 KLFFSSDSFIMKTLQPVAAGEELLNDFGPLPPADLLRRYGFVTK 277


>gi|408393455|gb|EKJ72719.1| hypothetical protein FPSE_07119 [Fusarium pseudograminearum CS3096]
          Length = 465

 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 65/267 (24%), Positives = 115/267 (43%), Gaps = 34/267 (12%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI---TADSKWSCPEAGEVLK--QCSVP---DWPLLAT 154
           RG++AL++I     L  +P    I   T++     P+  ++ K  +  VP    W  L  
Sbjct: 41  RGIIALRDIPAETTLFTIPRKGSINIETSELPQKIPDVFDLDKPDEDDVPGLDSWSSLIL 100

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
            +I E     SS+W +Y   LP    + ++W+  ELD+ L+AS +R +  +   +    +
Sbjct: 101 IMIYEYLRGDSSQWKSYFDVLPSSFDTPMFWSENELDQ-LQASHMRHKIGK--ADAENMF 157

Query: 215 NDLRLRIFSKYPDLFPEEV------------FNMETFKWSFGILFSRLVRLPSM------ 256
               + I    P +F  E                    ++F +         +       
Sbjct: 158 KKTLVPIIRSNPSIFNAENRSDSELVEIAHRMGSTIMAYAFDLENDEEEEEETEEWVEDR 217

Query: 257 DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
           DG+  + +VP AD+LN   E    +++++ S  +  T+ R  + GE++   YG   N EL
Sbjct: 218 DGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGEEILNYYGPHPNSEL 275

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKS 341
           L  YG+V  E  +  D VE+P  + +S
Sbjct: 276 LRRYGYVT-EKHSRYDVVEIPWDIVES 301


>gi|238882888|gb|EEQ46526.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 433

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 111/276 (40%), Gaps = 36/276 (13%)

Query: 80  QKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------------ 127
           +K +S+      K+ ++ V    RG+ A++ ++KGE +L +P S ++             
Sbjct: 22  EKKISNHTYISPKIDVKDVRSSGRGIYAVEPLKKGELILNIPHSFLLNFTTVMAHIAKYN 81

Query: 128 ---ADSKWSCP------EAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYI 172
               DS    P      E  E+ +  +      +  + LL+ YL  E      S W  ++
Sbjct: 82  GMAIDSHIHVPFDKSEDEYTEIYRTLTKEEILELSSFQLLSLYLTFERKRSHKSFWKPFL 141

Query: 173 SALPR-QPYSLL--YWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFSKYPDL 228
             LP    + L+   W +        ++++R + +  R  N      +L      K  D+
Sbjct: 142 DMLPSMDDFELMPIDWPQEVCTLLPSSTEVRNKKVRSRFDNDYQVICELIKTKIDKDGDV 201

Query: 229 ---FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSS 285
               P +   +     +   L+  L    +      + P+ D +NHSC+    L  D   
Sbjct: 202 TTFLPRQEVLLSWLCINSRCLYMDLPTSKNSADNFTMAPYVDFMNHSCDDHCTLKID--G 259

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           +G    T  QY  G+QV++SYG  SN  LL  YGFV
Sbjct: 260 KGFQVRTTSQYNTGDQVYLSYGPHSNDFLLCEYGFV 295


>gi|171679805|ref|XP_001904849.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939528|emb|CAP64756.1| unnamed protein product [Podospora anserina S mat+]
          Length = 468

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 65/218 (29%), Positives = 94/218 (43%), Gaps = 34/218 (15%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL-----DRYLEASQIRERAIERITN 209
           +L+ E   EK S W  YIS LP QP  +  W    +        LE +       E   N
Sbjct: 105 FLVKEYLKEKDSYWWPYISTLP-QPDRVDTWALPAVWPEDDIECLEETNAHVAVREIQAN 163

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM-------DGRVAL 262
           +   Y   R  +  K  D    + +    +KW+F I  SR  R PS+       D  + L
Sbjct: 164 IKKEYKHARKLL--KEVDFPGWQEYTQLLYKWAFCIFTSRSFR-PSLILSQETQDHVLGL 220

Query: 263 VPWADMLNHSCEVETFLD---YDKSSQ---------GVVFTTDRQYQPGEQVFISYGKKS 310
            P    ++    ++  LD   +D +SQ               +  YQPG+QVF +YG KS
Sbjct: 221 TPHGTKVDDFSILQPLLDIGNHDPTSQYQWNLEVDGTCQLICNNAYQPGQQVFNNYGLKS 280

Query: 311 NGELLLSYGFV-PREGTNPSDSVEL-----PLSLKKSD 342
           N ELLL YGF+ P   T  +D V +     P +L+K++
Sbjct: 281 NSELLLGYGFILPVTDTLHNDYVHVKSRRPPSTLQKNE 318


>gi|412989087|emb|CCO15678.1| predicted protein [Bathycoccus prasinos]
          Length = 640

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/89 (37%), Positives = 48/89 (53%), Gaps = 4/89 (4%)

Query: 261 ALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
           ALVP+ DMLNH+        L +D S+  +     R  + G+++F +YG+ S+GELL  Y
Sbjct: 392 ALVPFWDMLNHAHPALASVKLSHDASTNRLNMIAVRDIRKGDEIFNTYGELSDGELLRRY 451

Query: 319 GFVPREGTNPSDSVELPLS--LKKSDKCY 345
           GF+P    NP +SV +         DK Y
Sbjct: 452 GFLPTSSRNPHNSVTISFKELFAACDKVY 480


>gi|408397548|gb|EKJ76689.1| hypothetical protein FPSE_03100 [Fusarium pseudograminearum CS3096]
          Length = 467

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/197 (28%), Positives = 87/197 (44%), Gaps = 11/197 (5%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDL--RLRI 221
           S+ W+ Y+  LPR       W+  E++R L      E A+E +  ++   + DL  +  +
Sbjct: 132 STPWTEYLKFLPRDVPVPTMWS--EVERALLQGTSLEAALEAKFASLSKEFEDLTEKSSV 189

Query: 222 FSKYPDLFPEE-VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
              +  LF E+    ++ +        SR + LP   G   +VP  DM NHS     + D
Sbjct: 190 LPFWNSLFWEKGTVAIQDWILVDAWYRSRCLELPR--GGDVMVPGLDMANHSHHPTAYYD 247

Query: 281 YDKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
            D     V+      +   GE+V ISYG K+  E+L SYGF+  E T   + + LP+ + 
Sbjct: 248 EDDKDDVVLLVRPGTKVSAGEEVNISYGDKNPAEMLFSYGFIDNEST--VEGLNLPVKVL 305

Query: 340 KSDKCYKEKLEALRKYG 356
             D   K KL      G
Sbjct: 306 PDDPLGKAKLHIFGSSG 322


>gi|303272707|ref|XP_003055715.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463689|gb|EEH60967.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 647

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 110/284 (38%), Gaps = 57/284 (20%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL---------KQCSVP--DW 149
           G RGLVA+  I  G  L  VPPS  +TA +     E G+ L         K+  +P  D 
Sbjct: 101 GGRGLVAVAPIPAGAILFRVPPSRTLTAAAALES-ETGKALLAAIDASPTKKTRLPVGDL 159

Query: 150 PLLATYLISEASF-------EKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIR 200
            L     +S  +F       + +S +  Y + L  +P   S ++W   E +R L  S + 
Sbjct: 160 ALAVRVALSTRAFGGKHVCVDDASPFRGYFALLADEPLDESPVWWDEDERERRLRGSMLL 219

Query: 201 ERAIERITNVIGTYNDLR---LRIFSKYPDLFPEEV-----------------FNMETFK 240
             A+    +V   Y  +    +R   K P     +V                  N E F+
Sbjct: 220 HDAVALAADVRADYEGVVEGVMRDAEKTPARRDVDVDAAGNRRAAERFTRTKTGNYERFR 279

Query: 241 WSFGILFSRLVRLPSMDGRV----------ALVPWADMLNHSC---EVETFLDYDKSSQG 287
            +   ++SR   +  + G+           ALVP  D  NH     E    +D D     
Sbjct: 280 RALAAIWSRSFNVGGVRGQTQTDEDESFAGALVPLLDCANHHRKPRECSWTIDEDGC--- 336

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
           V+    R +  G  + I+YG + N +L+L YGF   + T P  S
Sbjct: 337 VLVVAIRAFDAGGAIRIAYGARGNHDLMLRYGFAVEDNTEPDGS 380


>gi|213407234|ref|XP_002174388.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
 gi|212002435|gb|EEB08095.1| lysine methyltransferase [Schizosaccharomyces japonicus yFS275]
          Length = 537

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 147/344 (42%), Gaps = 49/344 (14%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVAL--KNIRKGEKLLFVPPSLVITADS---- 130
           S LQ+  ++       +  Q+ D  E   +A+  K+I   + L+  P S +IT       
Sbjct: 3   SFLQEAFNNGCYLHPGIQFQRSDNVEGTFIAIASKDIDGDQVLISCPESYIITLQKAKNE 62

Query: 131 --KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
             + S   A E +         ++ T+   E    + S+W+ YI  LP+   + LY+T  
Sbjct: 63  LCRLSPKFADEKMHT-------IVCTFFALERLKGEKSQWAKYIEYLPKTFDTPLYFTDD 115

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF----G 244
           EL           +++E  TN+    ND R RI+ +      + + N + F W+      
Sbjct: 116 EL-----------KSLEH-TNIFYGCND-RKRIWKEEHATAAKLLDNPDNFSWNMYLWAA 162

Query: 245 ILFSR-------LVRLPSMDGRVALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQ 295
            +FS        L    + D    L+P  D LNH   C +  +    K S  V   + + 
Sbjct: 163 TVFSSRCFSSALLGEEDTDDAAPILIPLVDSLNHKPRCPI-IWNKVTKESHAVQLVSVKP 221

Query: 296 YQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
              G QV+ +YG K N ELL+ YGF +P    N  ++  L LSL K+    ++K   L  
Sbjct: 222 ISSGGQVYNNYGPKGNEELLMGYGFCLPN---NEFETFALRLSLDKAVYNSEKKRSILAS 278

Query: 355 YGLSASECF-PIQITGWPLELMAYAYLVV--SPPSMKGKFEEMA 395
           +GLS    + P Q+    L+ +  A +V+  SP  +K   E +A
Sbjct: 279 HGLSKLNFWIPKQVDFSHLQNILDALVVITASPFELKTLEEHLA 322


>gi|313234617|emb|CBY10572.1| unnamed protein product [Oikopleura dioica]
          Length = 395

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 126/293 (43%), Gaps = 40/293 (13%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQ---CSVPDWPLLATYLISE 159
           RG+ +  +    E ++  P   ++T   K  C E   + +Q     + +  +L  YL   
Sbjct: 34  RGMKSDVDFEANEFIISFPNEALLTG--KRMCQELPHLEEQRVKHHLTNELVLVFYL--- 88

Query: 160 ASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRER-AIE--RITNVIGTYN 215
            +F     +  Y ++LP++ P   LYW++ + D   E   +R +  +E  R  +V+G   
Sbjct: 89  -AFYCEEHFPEYYASLPKEFPNYWLYWSKEDWDSLYEDRVLRRKIKVEKRRYHSVMG--- 144

Query: 216 DLRLRIFSKYPDLFPEE---VFNMETFKWSFGILFSRLVRLPSMDGRV------------ 260
             R+ + +K   +  +    +  ME FK ++  L +R V L      V            
Sbjct: 145 --RVEMSNKEAIVMDQNHHFLLFMEPFKLAWAQLNTRTVYLSDQWHDVGKNENNDDSTLN 202

Query: 261 -ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
            AL P+ D  NH  + +T +  D  S+      ++++  GEQ+FISYG+  +  LLL YG
Sbjct: 203 WALAPFLDQFNHHHDAKTVIHDD--SEKFAIKVEKKHGKGEQLFISYGEHPDVFLLLEYG 260

Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPL 372
           F+    +NP + +EL  +    D   K   +    +G        +Q   W L
Sbjct: 261 FIIG-NSNPHNLIELTNTTVLRDPVTK---KMAIDFGCGDGHAISLQGASWSL 309


>gi|367029027|ref|XP_003663797.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
 gi|347011067|gb|AEO58552.1| hypothetical protein MYCTH_2080826, partial [Myceliophthora
           thermophila ATCC 42464]
          Length = 357

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 78/202 (38%), Gaps = 29/202 (14%)

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL--DR---YLEASQI 199
           SVP   L   +L+ E    K S W  YI+ LP  P  +  W       D    YLE +  
Sbjct: 108 SVPPHVLGRFFLVKEYLKGKDSFWWPYIATLP-PPEQVAVWALPPFWPDHDIAYLEGTNA 166

Query: 200 RERAIERITNVIGTYNDLR-LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR------ 252
                E   NV   +   R L     +PDL     +    +KW+F I  SR  R      
Sbjct: 167 HVAIQEIQENVKREFKQARKLLKEEDFPDL---PAYTQLLYKWAFCIFTSRSFRPSLVLS 223

Query: 253 ----------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
                     LP    +D    L P  D+ NHS       D               YQPG
Sbjct: 224 DATKRRLSALLPQGVQLDDFSVLQPLLDIANHSPTARYTWDTTSVPDTCRLICHDPYQPG 283

Query: 300 EQVFISYGKKSNGELLLSYGFV 321
            QV+ +YG K+N ELLL+YGF+
Sbjct: 284 TQVYNNYGLKTNSELLLAYGFI 305


>gi|145354549|ref|XP_001421544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581782|gb|ABO99837.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 488

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 110/252 (43%), Gaps = 26/252 (10%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPD--WP-LLATYLISE 159
           L A  +I  GE+L+ +PP L++  DS     +  E LK     VP+  W   +   L+ E
Sbjct: 84  LEADGDIADGERLVSLPPKLMLRCDSD----DVSEPLKNVVDRVPNEFWSSKVGLVLLRE 139

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY--NDL 217
                 S ++ YI+ LP        +   +  R LE + I ++ I +    +GT+  N L
Sbjct: 140 RVAGAHSAFAPYITLLPAVHEGSPTFFPPDAVRALEYAPIVQQ-INKRARFLGTFAGNAL 198

Query: 218 RLRIFSKYPD-LFP-----EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
            +     Y D   P     E   +     W+     SR  ++   +   A++P  D+ NH
Sbjct: 199 TVDDGESYVDEAHPGRQRVEMTIDANALGWATACASSRAFKV-GANSAPAMLPVIDICNH 257

Query: 272 S----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           S      V    + D +  GV     R    GE + +SYG  SN ELLL YGF+ ++  N
Sbjct: 258 SFNPSVSVRAIEEGDNAG-GVELIARRALTSGEPIELSYGNLSNDELLLDYGFIVKD--N 314

Query: 328 PSDSVELPLSLK 339
           P D V+L   LK
Sbjct: 315 PFDCVKLRWDLK 326


>gi|308802351|ref|XP_003078489.1| unnamed protein product [Ostreococcus tauri]
 gi|116056941|emb|CAL53230.1| unnamed protein product [Ostreococcus tauri]
          Length = 433

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/116 (31%), Positives = 63/116 (54%), Gaps = 7/116 (6%)

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA----LVPWADMLNHS-CEVETFLDYDKSSQGVVFT 291
           + ++W+  ++ SR  R+    GR A    L+  AD+LNHS    E   D+  +    V T
Sbjct: 197 DEWRWALSMVHSRTFRIEDEYGRRATRRALIAAADLLNHSSVRGEVNCDWSANDDYFVVT 256

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
           T R  + GE++ ISYG++ +    L YGF+P +  NP + V+L  + +++   Y+E
Sbjct: 257 TTRDVRAGEELCISYGEQCDRHFALFYGFLPSQ--NPFNRVKLFFNGREALDWYQE 310


>gi|366992371|ref|XP_003675951.1| hypothetical protein NCAS_0C05970 [Naumovozyma castellii CBS 4309]
 gi|342301816|emb|CCC69587.1| hypothetical protein NCAS_0C05970 [Naumovozyma castellii CBS 4309]
          Length = 580

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 50/101 (49%), Gaps = 1/101 (0%)

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L P  D+LNH  +     +++K  +   F  +   +  +++F +YG KS  ELLL YGF+
Sbjct: 225 LYPVVDLLNHKNDTNVKWEFEKDEERADFIFNETLKANDELFNNYGDKSKEELLLGYGFI 284

Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC 362
           P E  NP D+  L L L ++       L  L    L+A  C
Sbjct: 285 P-EDINPYDTSSLTLRLDENHISQARLLAKLPDVNLAADNC 324


>gi|242045610|ref|XP_002460676.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
 gi|241924053|gb|EER97197.1| hypothetical protein SORBIDRAFT_02g032970 [Sorghum bicolor]
          Length = 489

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 114/282 (40%), Gaps = 59/282 (20%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP------LL 152
           + G RGL A +++R+GE +L  P + ++T+D           +  C     P      +L
Sbjct: 51  NAGGRGLAAARDLRRGELVLRAPRAALLTSDR---VTADDPRIAACVSAHRPRLSSVQIL 107

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
              L++E    ++S W  Y+S LP   Y++L    A  D + +EA Q+         + I
Sbjct: 108 IVCLLAEVGKGRNSVWYPYLSQLPSY-YTIL----ATFDDFEVEALQV--------DDAI 154

Query: 212 GTYNDLRLRIFSKYPDL--------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
                 +  I S + D+        F  ++   +++ W+F  + SR + + + D    L 
Sbjct: 155 WVAQKAKSAIKSDWEDVTPLMKELEFKPKLLMFKSWLWAFATVSSRTLHI-AWDEAGCLC 213

Query: 264 PWADMLNHSC----------EVETFLDYDK-----------------SSQGVVFTTDRQY 296
           P  D+ N++           +     +Y +                  S        + Y
Sbjct: 214 PVGDLFNYAAPDDDTSLEAEDTAELTNYQQKNEMINSSERLTDGGYEDSNAYCLYARKNY 273

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           + GEQV + YG  +N ELL  YGF+  E  N    +EL L +
Sbjct: 274 KQGEQVLLGYGTYTNLELLEHYGFLLGENPNEKTFIELDLDI 315


>gi|346978889|gb|EGY22341.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 457

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 87/209 (41%), Gaps = 35/209 (16%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           L+  YL+   SF     W  YI  LP QP  L  W+   L    +   + +      TN+
Sbjct: 94  LMKQYLLGRDSF-----WYPYICTLP-QPDQLSSWSLPPLWPSDDIELLED------TNI 141

Query: 211 IGTYNDLRLRIFSKYPDLFP-------EEVFNMETFKWSFGILFSRLVR----------- 252
                +++ R+ ++Y    P          +    + W++ I  SR  R           
Sbjct: 142 HTAVAEIKARLKAEYKQATPLLAALPNANDYTRLLYNWAYSIFTSRSFRPSRVVPDHESL 201

Query: 253 -LP---SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
            LP   ++D    L+P  D+ NHS       D    +   V  T   Y+ G QVF +YG 
Sbjct: 202 PLPEGCAIDDFHILMPLFDIGNHSHSAGISWDIAPGTSTTVLKTLDAYESGAQVFNNYGS 261

Query: 309 KSNGELLLSYGF-VPREGTNPSDSVELPL 336
           K+N EL+L+YGF +P   T  +D V L L
Sbjct: 262 KTNAELMLAYGFLIPESPTLHNDFVHLQL 290


>gi|238494116|ref|XP_002378294.1| SET domain protein [Aspergillus flavus NRRL3357]
 gi|317148877|ref|XP_001822982.2| SET domain protein [Aspergillus oryzae RIB40]
 gi|220694944|gb|EED51287.1| SET domain protein [Aspergillus flavus NRRL3357]
          Length = 478

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 73/292 (25%), Positives = 120/292 (41%), Gaps = 60/292 (20%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI-TADSKWSCPEAGEVLKQC--S 145
           P  ++A  +     RG+VA  +I +GE+L  +P   V+ T +SK       ++L Q    
Sbjct: 34  PKIRLADLRSRAAGRGVVAQSDIAEGEELFTIPREHVLSTQNSKLK-----DLLSQDVEE 88

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----R 200
           +  W  L   +I E      S W++Y   LPR+  +L++W+ +EL + L+ S I     +
Sbjct: 89  LGPWLSLMLVMIYEYLLGDQSAWASYFKILPRKFDTLMFWSPSEL-QELQGSAIVDRIGK 147

Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV 260
           E A E I  +I         I    P LFP  V  + ++    G     L+ L  + G +
Sbjct: 148 EGAEESILEMIAP-------IVRANPSLFP-PVDGLASYDGDAGT--QALLNLAHVMGSL 197

Query: 261 ----------------------------------ALVPWADMLNHSCEVETFLDYDKSSQ 286
                                              +VP AD+LN   +      + + + 
Sbjct: 198 IMAYAFDIEKPEDEDDEGDDESGYVTDDEEQLSKGMVPLADLLNADADQNNARLFQEET- 256

Query: 287 GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           G+V    +    G ++F  YG+    +LL  YG+V  +  +P D VEL L L
Sbjct: 257 GLVMKAIKPISAGAEIFNDYGEIPRADLLRRYGYV-TDNYSPYDVVELSLEL 307


>gi|145501218|ref|XP_001436591.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403732|emb|CAK69194.1| unnamed protein product [Paramecium tetraurelia]
          Length = 716

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 85/193 (44%), Gaps = 15/193 (7%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK---------QCSVPDWPLLAT 154
           GLVA + I   E L+ VP  L++T    +  P     +          Q S  D  L+A 
Sbjct: 64  GLVASEKILSNETLVSVPRDLLLTTRHAFESPLKQMFIDHPQYFSNQFQSSWEDHQLMA- 122

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
           +++ E      S W   IS LPR    L++W+  E    L+  ++ + A ++ +  +  Y
Sbjct: 123 FILYEYQRGPESEWHLLISNLPRDIDYLVFWSHEE-QELLDDEKLIKLARKQYSEFLLEY 181

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
             L+  I  KYP  F  E   +E  +W +  L +R          V +VP+ ++ NH C 
Sbjct: 182 ETLKC-ITDKYPQHFKPETVTLENARWVYTHLVTRC--FGKYLAYVTMVPFCELFNHEC- 237

Query: 275 VETFLDYDKSSQG 287
            + F D++ ++  
Sbjct: 238 TDVFYDFEYNADN 250



 Score = 41.6 bits (96), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 67/151 (44%), Gaps = 21/151 (13%)

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           Q+  G QV+  YG+ SN  +L+ YG       N  D V L +   K  +   E +  + K
Sbjct: 501 QFDKGAQVYFCYGRLSNRMMLMRYGMSLE--YNKYDHVHLRIEYLKYLQS-NEAIWLVHK 557

Query: 355 YGLSASECFPIQITGWPLELMAYA---YLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCP 411
           Y +S  + F ++ T +PL+ + +    Y   +  S+   F+               I+  
Sbjct: 558 YQISKFKKFKLKHTTFPLDFIVFCKSIYWTFNVHSLDSFFK---------------IQDL 602

Query: 412 EIDEQALQFILDSCESSISKYSRFLQVKELL 442
           +++ +ALQ  L+     ISK+S  L+  E L
Sbjct: 603 KLERKALQLALEILVEEISKFSDKLEDNEKL 633


>gi|242813336|ref|XP_002486146.1| hypothetical protein TSTA_101480 [Talaromyces stipitatus ATCC
           10500]
 gi|218714485|gb|EED13908.1| hypothetical protein TSTA_101480 [Talaromyces stipitatus ATCC
           10500]
          Length = 426

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 83/343 (24%), Positives = 141/343 (41%), Gaps = 64/343 (18%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
           +   +W  D G+    +   ++     G++A ++I++ E L+ VP S +++ DS  S  +
Sbjct: 12  TAFMQWAIDEGVKVNGVEPARITGRGLGMIATRDIQEHEMLIDVPLSAMLSVDSVPS--D 69

Query: 137 AGEVLKQCSVPDWPLLATYLI--SEASFEKSSRW-------SNYISALP----------- 176
              +    S+    LLA YL        +K   W       S++   +P           
Sbjct: 70  FVNLFSGISIQG--LLAAYLTHGDPRCLKKYDLWKATWPTYSDFEEGMPILWPKELGGSG 127

Query: 177 -RQPYSLLYWTRAELDRYLEAS------QIRERAI----ERITNVIGTYNDLRLR----- 220
            + P S    T    D  L  S       IR++A+    E     I    + RL+     
Sbjct: 128 LKHPISPTATTHHPPDGKLPPSISGSWTTIRKKALVEEYETKHQNILFQQEKRLQDAWRD 187

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVR--LPSM------DGRVALVPWADMLNHS 272
           + + +PD       + ETF + + +L +R     +P        +  +A+VP+AD  NH+
Sbjct: 188 VLAVFPDT------DWETFSYHWLVLNTRCFYYVMPGTEPPEDTNDAIAMVPFADYFNHT 241

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
            E E  + +D   +   F   R Y+ GE++++SYG   N  L + YGF      N SDS+
Sbjct: 242 DETECDVKFD--GKNYTFRAMRAYKKGEEIYMSYGPHPNDFLFVEYGFYLDH--NKSDSL 297

Query: 333 EL-PLSLKKSDKCYKEKLEALRKYG-----LSASECFPIQITG 369
            L  +  K      KE+L   R YG     L +  CF  ++  
Sbjct: 298 FLDDIIFKDFTVAEKEELIHHRYYGNYQITLESGPCFRTEVAA 340


>gi|384246167|gb|EIE19658.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 523

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 109/270 (40%), Gaps = 18/270 (6%)

Query: 72  SLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITAD 129
           S  +A  L +W+S SG  + P         V   GL A K  R GE L+ +P S  ++ D
Sbjct: 37  SASSADRLVQWVSSSGGTVSPTVHVSPPDSVMGAGLRASKACRSGELLVSLPRSCQLSYD 96

Query: 130 SKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
              + P   +++ +     W   LA  ++ E      S + +YI  LP     +  +   
Sbjct: 97  GS-TEPNLLQLISKVPEELWGAKLALRVLKERIMGPDSPFHSYIDNLPMGVPGIPMFFSP 155

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP-DLFPEEVFNMETFKWSFGILF 247
           +  R LE        +++    + +++   L      P D F     +     W+  +  
Sbjct: 156 DAIRALEQYPPLSEQVKKRCRWLLSFSSEHLSALPGSPADPFLGTPVDANILGWALAMTT 215

Query: 248 SRLVRLPSMDGRVALVPWADMLNHS----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
           SR  R+       AL+P  DM NHS    CEV+           V     R  +  E + 
Sbjct: 216 SRAFRVQGPQHPAALLPLIDMSNHSFAPNCEVKP-----GPGGSVEMVASRDIRAEEDLL 270

Query: 304 ISYGKKSNGELLLSYGF-VPREGTNPSDSV 332
           +SYGK  N  LLL YGF VP    NP D+V
Sbjct: 271 LSYGKLDNTFLLLDYGFMVP---GNPHDTV 297


>gi|396469509|ref|XP_003838423.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
 gi|312214991|emb|CBX94944.1| similar to SET domain-containing protein [Leptosphaeria maculans
           JN3]
          Length = 415

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 64/132 (48%), Gaps = 17/132 (12%)

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRV------ 260
           N+   ++D++  I S   DLF     V N  TF W +  L +   RLP    ++      
Sbjct: 138 NLEQDWSDVKADIPSIDKDLFTYVWLVVNTRTFYWDYPDLSNAHPRLPKRRAKLTSADCY 197

Query: 261 ALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
           A+ P+ D  NHS   CE +       ++ G     DR Y+ GE+V++SYG  +N  LL+ 
Sbjct: 198 AMCPFMDYFNHSDSGCEPQ------HNAHGYSVLADRAYRAGEEVYVSYGPHTNDFLLVE 251

Query: 318 YGFVPREGTNPS 329
           YGF+    +N S
Sbjct: 252 YGFLLDANSNDS 263


>gi|302896454|ref|XP_003047107.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
 gi|256728035|gb|EEU41394.1| SET domain protein [Nectria haematococca mpVI 77-13-4]
          Length = 1037

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 67/281 (23%), Positives = 119/281 (42%), Gaps = 37/281 (13%)

Query: 94  AIQKVDVGER----GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS---- 145
           AI+ VD+ +R    G++AL++I     L  +P   +I  ++     +  +V         
Sbjct: 595 AIKIVDLRDRNAGRGIIALQDIPAETTLFTIPRKGIINVETSELPKKLPDVFDLDKPIDD 654

Query: 146 ------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI 199
                 +  W  L   L+ E    + S+W  Y   LP    + ++W+ +ELD+ L+AS +
Sbjct: 655 DDEAPRLDSWSSLILVLMYEYLQGEKSQWKPYFDVLPSSFDTPMFWSESELDQ-LQASHM 713

Query: 200 RERA----------------IERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMETFK 240
           R +                 I + ++V G  N   D  + I  +         F++E  +
Sbjct: 714 RHKIGKADAESMFRKTLLPIIRKNSSVFGGENRSDDDLVEIAHRMGSTIMAYAFDLENDE 773

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
                     V        + +VP AD+LN   E    +++++ S  +  T+ R  + GE
Sbjct: 774 DEEEEETDGWVEDREGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTSLRPIKAGE 831

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
           ++F  YG   N ELL  YG+V  E  +  D VE+P  + +S
Sbjct: 832 EIFNYYGPHPNSELLRRYGYVT-ERHSRYDVVEIPWDVVES 871


>gi|313228180|emb|CBY23330.1| unnamed protein product [Oikopleura dioica]
          Length = 421

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/256 (21%), Positives = 109/256 (42%), Gaps = 29/256 (11%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT----ADSKWSCPEAGEVLKQ-CSVPDWPLLATYLI 157
           RG +A K+I +G+ +L +P + VIT     ++  SC     VL +   +    L+  +L+
Sbjct: 115 RGFIAKKSINRGQMVLEIPKAAVITPNWIYNNAISCVSNIVVLNEDFKIDSTDLMIIWLV 174

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            E      S   +YI+++P     L  +    L   L   Q+R++   +   ++  +  L
Sbjct: 175 KEKQKGMQSPVRDYITSMPSLCTPLFNYRPRHLK--LFPKQLRQKVENQKKELLARFEYL 232

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM----------------DGRVA 261
             + F ++         +   F+W+  ++ +R ++   +                   + 
Sbjct: 233 D-KCFRRHG-----RGISFHEFQWAASMVLTRSIQAKGLTLCTELFEAPWFNNDSSHEIG 286

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L P+ D+LNHS E     D++  +  V        +  +Q++I Y +  +  +L++YGF 
Sbjct: 287 LCPFFDLLNHSSENNCDWDFNPVTGSVWVEAVGDIKNSDQLYIDYDQGCDDYMLMNYGFC 346

Query: 322 PREGTNPSDSVELPLS 337
                NP  ++EL  S
Sbjct: 347 METAANPKTALELSWS 362


>gi|340522118|gb|EGR52351.1| predicted protein [Trichoderma reesei QM6a]
          Length = 377

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 74/301 (24%), Positives = 128/301 (42%), Gaps = 44/301 (14%)

Query: 77  STLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE 136
             L  W    G+    +   K+     G++A + I+  E++L VPP ++        C E
Sbjct: 5   QNLMTWAKAQGVAINGIQPSKIPGRGTGILATRKIKAQEEILRVPPRVL-------RCLE 57

Query: 137 AGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR----QPYSLLYWTRAELDR 192
           +  +  +  +P    +   L ++   ++S+    + + LP+    +    + W R EL +
Sbjct: 58  SVPLRVREKLPADSTIQALLAADLVLDRSANSKPWKAVLPKMADFEAGMPMLWPR-ELKQ 116

Query: 193 YL--EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKW------SFG 244
            L  E+    ER   R       ++D +        + FP+   +  T+ W      +F 
Sbjct: 117 LLPLESQVTLER---REKEFQDNWDDFK--------EAFPDVPRDDYTYAWLVVNTRTFY 165

Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                 ++ P  D R+AL+P AD+ NH+   C V     Y           DR Y+ GE+
Sbjct: 166 HETPETLKYPWED-RLALIPVADLFNHAAGGCRV-----YYSPEGCYHVVADRAYKKGEE 219

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPS---DSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           +FISY   SN   LL YGF+P E +      D V +P  L +S K   ++ + L +Y L 
Sbjct: 220 LFISYSSHSNDYNLLEYGFIPDENSLDDVYIDDVVMP-KLSESHKAELQRRDLLGEYPLG 278

Query: 359 A 359
           +
Sbjct: 279 S 279


>gi|302840199|ref|XP_002951655.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
 gi|300262903|gb|EFJ47106.1| hypothetical protein VOLCADRAFT_105180 [Volvox carteri f.
           nagariensis]
          Length = 517

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 91/197 (46%), Gaps = 14/197 (7%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERI--- 207
           L   ++ E S  + SRW+ Y++ +P     + LYW   E  + L  +   ++ + ++   
Sbjct: 187 LIIAVMYEKSRGRQSRWAPYLNLIPDDMTHMPLYWKHREF-KELRGTAAYDKMMGKVQCP 245

Query: 208 ----TNVIGTYNDLRLRIFSKYPDL-FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
               T V   ++++      ++P+L  PE     + ++W+   + S    L   D   A+
Sbjct: 246 ADAPTQVPVLWSEVVEPFIQEHPELELPEGKAGYDLYRWATCAVASYSFILGD-DKYQAM 304

Query: 263 VPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           VP  D+LNH +  V   L +      +     R    GE++  +YG+ SN ELL  YGFV
Sbjct: 305 VPVWDLLNHITGRVNVRLHHCAKRHVLHMIATRDILRGEELVNNYGELSNAELLRGYGFV 364

Query: 322 PREGTNPSDSVELPLSL 338
             E  N ++ V++PL  
Sbjct: 365 --EARNRNNHVQVPLGF 379


>gi|342875304|gb|EGU77102.1| hypothetical protein FOXB_12400 [Fusarium oxysporum Fo5176]
          Length = 371

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 70/266 (26%), Positives = 116/266 (43%), Gaps = 39/266 (14%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPE-AGEVLKQCSVPDWPLLATYLISEASF 162
           G+VA ++I+  E +L VP   + T D+    P+   E L+  SV         L +E + 
Sbjct: 32  GIVATRDIKPNETILSVPMKALRTIDT---VPKNITEALQGVSV------HGILAAEIAL 82

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDR---YLEASQIRERAIERITNVIGTYNDLRL 219
           +KS  +S + + LP         TR +L+     +  S+++    +R  +++   N    
Sbjct: 83  DKSDDFSVWKTVLP---------TREDLEAGVPMMWPSELQALLPKRAKDILDNQNTTFR 133

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-----RVALVPWADMLNHS-- 272
           R        FP+   +   + W      +    +P M       R+  +P AD+ NH+  
Sbjct: 134 RECEIVLKAFPKLTRDEYLYSWVLINTRTFYNSMPKMKSYAHVDRLVCMPTADLFNHADQ 193

Query: 273 -CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
            C++        S+ G     DR Y  GE+V++SYG  SN  LL  YGF+    TN  D 
Sbjct: 194 GCKLAY------SALGYSVQADRVYHQGEEVYVSYGPHSNDFLLSEYGFIL--DTNRWDE 245

Query: 332 VELP-LSLKKSDKCYKEKLEALRKYG 356
           V L  + L K +K  +  LE++   G
Sbjct: 246 VYLDEVILPKLNKTQRADLESINFLG 271


>gi|241712095|ref|XP_002413441.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507255|gb|EEC16749.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 227

 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 96/218 (44%), Gaps = 34/218 (15%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           KW  D+G     + +Q +   E G  A ++I+ G   L VP  +++T        + G +
Sbjct: 10  KWCLDNGATINGITLQALPDDEYGFAAEQDIQVGPVFLGVPLGMMMTTIGARK-SKLGAL 68

Query: 141 LKQCSVPDWPL--------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR 192
           LK     D P+        L+ +LI E     +S W  YIS LPR   ++LY++  EL +
Sbjct: 69  LK-----DDPIMKSMENVALSMFLILELCAGSASFWHPYISILPRSFNTVLYFSVDEL-Q 122

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSR 249
            L  S + + A++   ++   Y     +IF  +P    L  ++ F  + ++W+   + +R
Sbjct: 123 LLTGSSVLDEALKLHRSIARQYAYFH-KIFRTHPLAKSLPYKDCFTYDLYRWAVSAVMTR 181

Query: 250 LVRLP---------------SMDGRVALVPWADMLNHS 272
              +P               S  G  ALVP  D+ NHS
Sbjct: 182 QNAVPRAVVCGGADDACARGSGSGVAALVPLFDLCNHS 219


>gi|44890428|gb|AAH66931.1| SETD3 protein [Homo sapiens]
          Length = 292

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
           R  ++P+ DG    +AL+P  DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280


>gi|342879010|gb|EGU80287.1| hypothetical protein FOXB_09214 [Fusarium oxysporum Fo5176]
          Length = 530

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 82/191 (42%), Gaps = 11/191 (5%)

Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP- 226
           W+ Y+  LPR       W+  EL+R L      E A++   + +    D  +   S  P 
Sbjct: 158 WTEYLKFLPRDIPVPTMWS--ELERALLQGTSLEVALDAKLSALNKEFDELIERSSALPF 215

Query: 227 --DLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK 283
               F E E   ++ +        SR + LP      A+VP  DM NHS     + D D 
Sbjct: 216 WNSFFWEREAVTIDDWVLVDAWYRSRCLELPRSGH--AMVPVLDMANHSHSQTAYYDEDD 273

Query: 284 SSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
               V+      +   G++V ISYG+KS  E++ SYGF+ RE T   + + LPL     D
Sbjct: 274 EDNVVLLPRPGMEISIGDEVTISYGEKSPAEMIFSYGFIDREST--VEGLTLPLESLADD 331

Query: 343 KCYKEKLEALR 353
              K KL   R
Sbjct: 332 PLGKAKLHIFR 342


>gi|384483765|gb|EIE75945.1| hypothetical protein RO3G_00649 [Rhizopus delemar RA 99-880]
          Length = 376

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/236 (25%), Positives = 103/236 (43%), Gaps = 37/236 (15%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLIS------ 158
           ++A ++I  GE ++ VP + +IT +S                    L  T+ +S      
Sbjct: 1   MMATEDIEAGEVIVSVPRNFLITNESLTK-----------------LYGTHSLSPHQLLA 43

Query: 159 ----EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
                 + +K S W  Y   LP    ++     +EL  +L  S +++  +++  N+   Y
Sbjct: 44  LHLVLLTRDKQSWWKPYTDLLPMHFNTMPVNYPSELLSHLPNS-LKQETMQQKDNIHTDY 102

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD-----GRVALVPWADML 269
             +    F K   L P+++   E FKW++  + +R + +   D       +AL P  D L
Sbjct: 103 --VTCLKFCKSKQL-PQDI-TAEEFKWAWLCVNTRCIHMTVPDYLAKGENIALAPMLDFL 158

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
           NH+ E +    ++  +Q     T   Y+ GEQV+I+YG   N  +L  YGFV  E 
Sbjct: 159 NHTTEAKIESGFNIRTQRFEIKTLTAYKKGEQVYINYGPHDNLAMLKEYGFVLNEN 214


>gi|159131477|gb|EDP56590.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 490

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 6/98 (6%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG---VVF 290
           F  + +K+   +  SR+V LP      A+VP  DM NH+CE      YD+   G   +  
Sbjct: 198 FTFDDWKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDSVKAKYDEEGAGNAVLQL 255

Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
            T ++ + GE+V ISYG +K   E++ SYGFV  E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293


>gi|111306423|gb|AAI20969.1| SETD3 protein [Homo sapiens]
          Length = 284

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
           R  ++P+ DG    +AL+P  DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280


>gi|451852073|gb|EMD65368.1| hypothetical protein COCSADRAFT_159025 [Cochliobolus sativus
           ND90Pr]
          Length = 408

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 74/284 (26%), Positives = 113/284 (39%), Gaps = 40/284 (14%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP-PSLVITA 128
           +D   N +    W   +G+    +A  +      G+VA ++I+KG+KL+ V   SLV  A
Sbjct: 5   LDPGTNHTDFVAWAKSNGVEINGIAPARFVGRGMGIVAAQDIKKGDKLVHVSNKSLVHVA 64

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR-------WSN-------YISA 174
                 P     ++   +PD   +   L    +   + R       W N       + S 
Sbjct: 65  -----LPS----IRSLKLPDTITVHGKLALSLALWYTGRKDHDYTLWQNVWPTSSDFKST 115

Query: 175 LPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF 234
           +P      L        R L   Q++   +ER    I  +N    +    Y  L    + 
Sbjct: 116 MPLYYPPSLQPLLPPAARTLLTKQLQN--LERDWTSIAPHNPGITKETYTYTWL----II 169

Query: 235 NMETFKWSFGILFSRLVRLP------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
           N  TF WS+  L +    LP      + D    + P+ D  NHS   ++  D   S  G 
Sbjct: 170 NTRTFYWSYPDLPNASALLPKRRAKLTADDCYCMCPFTDYFNHS---DSGCDPQMSPSGY 226

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
             T DR Y  GE+VF++YG  +N  LL  YGF+ +E  N  D V
Sbjct: 227 TVTADRAYVAGEEVFVTYGPHTNDFLLTEYGFILQE-KNRHDGV 269


>gi|336261436|ref|XP_003345507.1| hypothetical protein SMAC_07495 [Sordaria macrospora k-hell]
 gi|380088183|emb|CCC13858.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 499

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 80/200 (40%), Gaps = 45/200 (22%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE--RITNVIG 212
           YLI +    KSS W+ YIS L            ++LD++       E  IE  R TN   
Sbjct: 108 YLIQQYLKGKSSLWAPYISTLT---------DPSQLDKWALPPFWTEHDIELLRGTNAYV 158

Query: 213 TYNDLRLRIFSKY------------PDLFPEEVFNMETFKWSFGILFSRLVR-------- 252
              +++  + S+Y            PD      +    + W++ +  SR  R        
Sbjct: 159 AIQEIQDNVKSEYKQARKILKQEGSPDY---RAYTQVLYNWAYCMFTSRSFRPSLILSES 215

Query: 253 --------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                   LP    +D    L P  D+ NHS E E   +               Y+PG+Q
Sbjct: 216 AREYVERLLPEGAKIDDFSILQPLYDIGNHSPEAEYSWNLTSEPSACELICRNSYEPGQQ 275

Query: 302 VFISYGKKSNGELLLSYGFV 321
           VF +YGKK+N ELLL YGFV
Sbjct: 276 VFNNYGKKTNSELLLGYGFV 295


>gi|242804795|ref|XP_002484448.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218717793|gb|EED17214.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 409

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 74/303 (24%), Positives = 125/303 (41%), Gaps = 38/303 (12%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           +W    G+    ++   +     G++A + I  GE ++ VP   ++T DS    P +   
Sbjct: 12  RWCESQGIKIHGVSPAGIPGRRLGMIATRRISAGETIVTVPLVAMLTIDS---VPPSFVR 68

Query: 141 LKQCSVPDWPLLATYLI--SEASFEKSSRW-------SNYISALPRQPYSLLYWTRAELD 191
           +   + P   +LA +         EK   W        ++  +LP     +L    + L 
Sbjct: 69  MFSKATPLHAILAAFFTHGDPVLLEKWEYWRRVWPLRHDFEKSLPLFWSEMLPANESILP 128

Query: 192 RYLEAS-QIRERAIE------RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
             +  S   R +  E      R TN++          +S+   +FP   +N  ++ W   
Sbjct: 129 PSVSGSWSFRNKKPEDIEYGSRYTNILSHQKKRLQDAWSEVLLVFPHTDWNFFSYNWLIL 188

Query: 245 ILFSRLVRLPSMD------GRVALVPWADMLNHS----CEVETFLDYDKSSQGVVFTTDR 294
              S     P  D        +ALVP+AD  NH     CEV    +Y        F   R
Sbjct: 189 NTRSFFYVSPEKDEPEDWNDAIALVPFADYFNHDDKAPCEVNFNGEY------YTFKASR 242

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALR 353
           +++ GE++FISYG  SN  LL+ YGF+  +  N SD++ L  + L +     K++L + +
Sbjct: 243 RFEKGEELFISYGSHSNDFLLVEYGFLLDD--NKSDAIFLDDIVLPELATANKKELLSRQ 300

Query: 354 KYG 356
            YG
Sbjct: 301 LYG 303


>gi|85090666|ref|XP_958526.1| hypothetical protein NCU09827 [Neurospora crassa OR74A]
 gi|28919896|gb|EAA29290.1| predicted protein [Neurospora crassa OR74A]
          Length = 532

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 93/233 (39%), Gaps = 39/233 (16%)

Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
           PS    AD +   PE  E     S   W LL   L+ E   + SS WS Y+S LP Q  +
Sbjct: 99  PSHTADADDEPPSPENDEEDDSQSQDSWTLLILILMHE-YLQGSSNWSPYLSILPTQFDT 157

Query: 182 LLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
            ++WT AEL   L+AS +     +E A + I   I         +F  YP   P+     
Sbjct: 158 PMFWTEAELSE-LQASALVAKVGKEEADKMIRTKIVKVVQEHEEVF--YPADTPKTQRLE 214

Query: 237 ETFKWSFGILFSRLV----------------------------RLPSMDGRVALVPWADM 268
           E      G      +                            ++  M+  + +VP ADM
Sbjct: 215 EGELLKLGHRMGSAIMAYAFDLANDDEDEDEEEEEEEDGWVEDKIAGMNDSMGMVPMADM 274

Query: 269 LNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           LN        +++ ++   +  T+ R+ + GE++   YG  S+ ELL  YG+V
Sbjct: 275 LNADAVFNAHINHGEAC--LTATSLREIKEGEEILNYYGPLSSAELLRRYGYV 325


>gi|209489216|gb|ACI49001.1| hypothetical protein Cbre_JD01.008 [Caenorhabditis brenneri]
          Length = 333

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 82/170 (48%), Gaps = 22/170 (12%)

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
            FE+S+ WS Y+  LP+  +    +   + D       IR+  I++   +      LR R
Sbjct: 7   DFEQSA-WSPYLKVLPK-TFDTPAFKGIDYDVNTLPLSIRKYWIDQKKEISEISEKLR-R 63

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLV--------RLPSMDG-RVALVPWADMLNH 271
           +F   P+L  ++V       W++ ++ +R +         + + DG  +A++P+ DMLNH
Sbjct: 64  LF---PELTHDKVL------WAWHVVNTRCIFVENEEHDNVDNSDGDTIAVIPYVDMLNH 114

Query: 272 SCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
             E  +    ++K +   V    RQ Q GEQ+F+ YG   N  LL+ YGF
Sbjct: 115 DPEKYQGVALHEKRNGRYVVQAKRQIQEGEQIFVCYGAHDNARLLVEYGF 164


>gi|357445001|ref|XP_003592778.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
 gi|355481826|gb|AES63029.1| Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit
           N-methyltransferase [Medicago truncatula]
          Length = 243

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 51/99 (51%), Gaps = 14/99 (14%)

Query: 268 MLNHSCEVETFLDYDKSSQGVVFT----------TDRQYQPGEQVFISYGKKSNGELLLS 317
           M  H   +  FL++D  S+ +V +          +DR Y PGEQV I YGK SN  L+L 
Sbjct: 1   MGKHKGFISDFLNHDGISEAIVMSDDDNKCSEVFSDRDYVPGEQVLIRYGKFSNATLMLD 60

Query: 318 YGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           +GF +P    N  D V++   + K D     KLE L++Y
Sbjct: 61  FGFTIP---YNIYDQVQIQYDIPKYDPLRHTKLELLQQY 96


>gi|303277863|ref|XP_003058225.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460882|gb|EEH58176.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 612

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 131/325 (40%), Gaps = 41/325 (12%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA-GEVLKQ-CSVPDWPLLATYLISEA 160
           RG  A  ++  G   L +P S ++T+      P A G+  +    + +  L+  +L+ E 
Sbjct: 193 RGAAASTDLPAGADALTIPSSALLTSRVALEDPTARGDAYRTFAGLGEDTLMTLWLVYEK 252

Query: 161 -SFEKSSRWSNYISALP----------RQPYSLLYWTRAE-----LDRYLEASQIRERAI 204
            +    S W+  +++LP          R     L  T A       D  L  + + + A+
Sbjct: 253 YALGDRSPWAPLLASLPMDDGGGDDGDRTAAGALGLTPASWPAEVTDALLRGAPLLDDAV 312

Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK-----W-SFGILF-SRLVRLPSMD 257
           +        +  L   +   +P++FP E++ +  F+     W ++G+   +  V   S  
Sbjct: 313 KARETTARQHAALFPALGEHFPEVFPTELYTLRRFRIASEAWNAYGMTVQAETVGGASGG 372

Query: 258 GR-------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
           G          L P A + NH+               +     R  + GE++F+SYG KS
Sbjct: 373 GEHHPPAPTTCLPPIALLCNHATWPHAVRYSRLRDDALHLPIARGVRAGEEIFVSYGAKS 432

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD----KCYKEKLEALRKYGLSASECFPIQ 366
           N ELLL YGF  R+  NP D V L L L + +       +E++    K  LS      ++
Sbjct: 433 NAELLLFYGFGVRD--NPYDDVPLSLELPQGEVRDVSALRERVLHRAKLSLSPHS---VR 487

Query: 367 ITGWPLELMAYAYLVVSPPSMKGKF 391
               PL L+    ++ +  S  G +
Sbjct: 488 CGALPLPLVGTLRVLTADASTLGTY 512


>gi|211826273|gb|AAH09054.2| SETD3 protein [Homo sapiens]
          Length = 228

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 14  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 68

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 69  SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 126

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 127 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 185

Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
           R  ++P+ DG    +AL+P  DM NH+
Sbjct: 186 RQNQIPTEDGSRVTLALIPLWDMCNHT 212


>gi|50557274|ref|XP_506045.1| YALI0F30327p [Yarrowia lipolytica]
 gi|49651915|emb|CAG78858.1| YALI0F30327p [Yarrowia lipolytica CLIB122]
          Length = 430

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 86/187 (45%), Gaps = 21/187 (11%)

Query: 163 EKSSRWSNYISALPR-----QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
           + SS  S+Y+S+LP+     QP+S   W   E+   L+ + +    + +  +V   Y   
Sbjct: 93  QSSSGMSDYVSSLPKSTEMDQPWS---WPETEIFDSLKGTSLLMACVHKKMHVQAKY--- 146

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR----VALVPWADMLNHSC 273
            L I  K      E++ + + F  S   + SR + +P   G     + +VP  D +NHS 
Sbjct: 147 -LAIVGKKGG---EKLVSEQQFYLSEQWVVSRSLEIPESPGSETLALTMVPVLDYVNHSP 202

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLSYGFV-PREGTNPSDS 331
           +     + D     ++   D   + G++VFI+YG  KS  E L  YGF+    G   S +
Sbjct: 203 KANCRFEVDSGEVVLIVNEDVLIKAGDEVFINYGPDKSAAEFLFCYGFIDAAHGVTKSIT 262

Query: 332 VELPLSL 338
           +E PL L
Sbjct: 263 LETPLML 269


>gi|40068483|ref|NP_954574.1| histone-lysine N-methyltransferase setd3 isoform b [Homo sapiens]
 gi|28071060|emb|CAD61911.1| unnamed protein product [Homo sapiens]
 gi|111309143|gb|AAI20968.1| SET domain containing 3 [Homo sapiens]
 gi|118341365|gb|AAI27625.1| SET domain containing 3 [Homo sapiens]
 gi|118341638|gb|AAI27626.1| SET domain containing 3 [Homo sapiens]
 gi|119602071|gb|EAW81665.1| SET domain containing 3, isoform CRA_b [Homo sapiens]
 gi|156138972|gb|AAI48252.1| SET domain containing 3 [Homo sapiens]
          Length = 296

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 94/207 (45%), Gaps = 21/207 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW S++G   +   +        GL A ++I+  E  L+VP  L++T +S        
Sbjct: 82  LMKWASENGASVEGFEMVNFKEEGFGLRATRDIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGPLYSQDRILQAMGNIALAFHLLCERA-SPNSFWQPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           RYL+++Q       +  N    Y     ++   +P    L  ++ F  E ++W+   + +
Sbjct: 195 RYLQSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPHANKLPLKDSFTYEDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
           R  ++P+ DG    +AL+P  DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280


>gi|320586350|gb|EFW99029.1| set domain containing protein [Grosmannia clavigera kw1407]
          Length = 537

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 92/238 (38%), Gaps = 63/238 (26%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL-----------DRYLEASQIRERA 203
           +L+ +    + S W  YI++LP QP  +  W    L              LE +     A
Sbjct: 113 FLMQQYLMGQQSHWHAYIASLP-QPEHVSSWNLPALWPRNGEDAESGPALLEGTNAGIAA 171

Query: 204 IERITNVIGTYNDLRLRIFS-KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM------ 256
            E   NV   Y   R  + S  YP L     F    + W+F I  SR  R PS+      
Sbjct: 172 AEMQANVAQEYRQARRLLKSVAYPALAE---FTRLLYHWAFCIFTSRSFR-PSLVLSAEA 227

Query: 257 ------DGRVA-------------------LVPWADMLNH------SCEVETFLDYDKSS 285
                 DG  +                   L+P  D+ NH      + E  +    D S+
Sbjct: 228 QTELGTDGEASGGRSVPRLSSGCKTDDFSILLPVLDLANHDPTARATWEATSAHLADGSA 287

Query: 286 QG--------VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV-PREGTNPSDSVEL 334
            G        V F   ++Y PG+QVF +YG K+N ELLL YGFV P   T  +D V L
Sbjct: 288 SGHEAASVTGVSFRVQQRYAPGQQVFNNYGMKTNSELLLGYGFVLPPTATLHNDYVHL 345


>gi|70995934|ref|XP_752722.1| SET domain protein [Aspergillus fumigatus Af293]
 gi|66850357|gb|EAL90684.1| SET domain protein [Aspergillus fumigatus Af293]
          Length = 490

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 53/98 (54%), Gaps = 6/98 (6%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQG---VVF 290
           F  + +K+   +  SR+V LP      A+VP  DM NH+CE      YD+   G   +  
Sbjct: 198 FTFDDWKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDSVKARYDEEGAGNAVLQL 255

Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
            T ++ + GE+V ISYG +K   E++ SYGFV  E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293


>gi|358374896|dbj|GAA91484.1| ribosomal N-lysine methyltransferase [Aspergillus kawachii IFO
           4308]
          Length = 445

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 70/145 (48%), Gaps = 23/145 (15%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
           ++PE  + M  + W   I+ SR     S         +  + +VP+AD  NH  +    +
Sbjct: 197 VYPETEWKMFAYYWC--IINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACDV 254

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSL 338
           ++D   +   F   R+Y+ GE+V++SYG  SN  LL+ YGF +P   TNPSDS+ L    
Sbjct: 255 NFD--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLVEYGFTLP---TNPSDSIYL---- 305

Query: 339 KKSDKCYKEKLEALRKYGLSASECF 363
              D    + L   +K  L+  E F
Sbjct: 306 ---DDIIFQDLSISQKQELAKQEIF 327


>gi|50556556|ref|XP_505686.1| YALI0F20944p [Yarrowia lipolytica]
 gi|49651556|emb|CAG78495.1| YALI0F20944p [Yarrowia lipolytica CLIB122]
          Length = 402

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 132/311 (42%), Gaps = 41/311 (13%)

Query: 73  LENAST-LQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS 130
           +EN+   L +W+ S+ G    K+ I+++    RGLV    +   E+L+ +    ++   S
Sbjct: 1   MENSHVQLIEWITSNGGYISPKLEIRELPGRGRGLVVNDRVYPNERLIHLKLRQLLNYSS 60

Query: 131 KWSCPEAGEVLKQC---SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL---- 183
             +    G  L +    S+    +LA +L+ + S    S W  ++  LP +    L    
Sbjct: 61  IHNAMVDGGHLSETEYRSMSAHQVLALFLVIQQSLGSKSDWKAFMGLLPDRKEGFLDVPL 120

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTYN---DLRLRIFSKYPDLFPEEVFNMETFK 240
            W++ + D       +    I  +   + T+    D      +KY D  P + +      
Sbjct: 121 QWSKEDQD------SLTPEGIVVLKKTLDTFEADYDKTKTFVAKY-DSDPRDAY-----L 168

Query: 241 WSFGILFSRLV--RLPSMDGR---------VALVPWADMLNHSCEVE-TFLDYDKSSQGV 288
           W++  + SR +   L    G+         + L P+ D++NHS E   T      SS G 
Sbjct: 169 WAWLCVNSRCLYFDLTLTTGKKDAQEVPDNITLAPYVDLINHSVESGPTHCQLKTSSIGF 228

Query: 289 -VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS--DKCY 345
            +    R Y   E++F+ YG +SN  LL  YGF   E  NP D V++  +L+ +   K +
Sbjct: 229 EILCGQRGYTADEEIFLCYGPRSNSVLLCEYGFTVPE--NPWDDVDISDALENTFLTKQH 286

Query: 346 KEKLEALRKYG 356
           +  L  +  YG
Sbjct: 287 ETVLREMGYYG 297


>gi|412990233|emb|CCO19551.1| predicted protein [Bathycoccus prasinos]
          Length = 417

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 72/297 (24%), Positives = 125/297 (42%), Gaps = 36/297 (12%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERG--------LVALKNIRKGEKLLFVP 121
           ++ +E  S    W   +G+  Q +   +VD    G        +V+ +++   EKLL +P
Sbjct: 11  LEVIERLSKFLSWSVSNGI--QVLDAVRVDARWDGVNKKYTLCIVSTRHLHCFEKLLSIP 68

Query: 122 PSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPY 180
            ++ + A +   C  + E++    +     L   +  E +    S W +Y+  LP +   
Sbjct: 69  KTVCLGAKT---CSISKELV-AVGLGGGLALNFAIAQELALGPDSCWFDYLCILPSKGEQ 124

Query: 181 SL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
           SL ++W++ E  + L+ + +    I    +    Y +   ++  KY + F  +  N E +
Sbjct: 125 SLPMFWSKQERKK-LKGTSLYSHIIMDDQSFADDY-EFGFKLLQKYIN-FKSDRVNFELY 181

Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY-DKSSQGVVFTTD----- 293
           K +  I  SR   +    G   L+PWAD+ NHS        Y  KSS    F  D     
Sbjct: 182 KKAVSIAASRAFYIDEYFGE-CLIPWADLFNHSTHNMHVKVYCSKSSSRNTFDMDENEVL 240

Query: 294 -------RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
                  R+YQ   ++F ++G +SN  LL  YGF      N   S+  P    + DK
Sbjct: 241 IRSVQSVRKYQ---ELFNTFGLQSNSSLLHKYGFCELSNKNGFVSIYSPFGKLRRDK 294


>gi|159468798|ref|XP_001692561.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278274|gb|EDP04039.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 724

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 82/183 (44%), Gaps = 15/183 (8%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           L   ++ EA+   +S+W  Y+ +LP + Y  ++W+  +L + L  + + ++A E   ++ 
Sbjct: 93  LVAAVMYEAARGPASKWHGYLRSLPAREYLPVFWSARQLQQ-LAGTDLADKAEEDRASMA 151

Query: 212 GTYNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
             ++     + S+YP         +++E F  +   + SR   +    G  ALVP AD+ 
Sbjct: 152 ADFSTHLAPLLSRYPGRLGHLAAGWSLEAFMHAASWVASRAFYVDDTHGD-ALVPLADVF 210

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQY--------QP---GEQVFISYGKKSNGELLLSY 318
           NH        +    S G V               QP   G +V+ +YG+ SN EL+  Y
Sbjct: 211 NHKAARVDLGEGSGWSAGFVVAEQEGVELLDIVAAQPLAGGTEVYNTYGEHSNAELVNKY 270

Query: 319 GFV 321
           GF 
Sbjct: 271 GFA 273


>gi|327259114|ref|XP_003214383.1| PREDICTED: SET domain-containing protein 3-like, partial [Anolis
           carolinensis]
          Length = 311

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + ++ GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L 
Sbjct: 18  QDFKAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 75

Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
           + G+  S  F +  T  P+     A+L V
Sbjct: 76  RAGIPTSSVFALHATEPPISAQLLAFLRV 104


>gi|313230936|emb|CBY18934.1| unnamed protein product [Oikopleura dioica]
          Length = 303

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 39/62 (62%)

Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
           R AL PW DMLNHS       ++D ++  ++ T     + GEQ+FISYG +++ +LLL Y
Sbjct: 66  RAALAPWFDMLNHSRSNNAEFEFDFTTGRLLVTCVSPIKAGEQLFISYGARADDDLLLEY 125

Query: 319 GF 320
           GF
Sbjct: 126 GF 127


>gi|357122881|ref|XP_003563142.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Brachypodium
           distachyon]
          Length = 480

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 67/282 (23%), Positives = 117/282 (41%), Gaps = 53/282 (18%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS-CPEAGEVLKQCSVPDWPLLATY-- 155
           D G RG  A +++R+GE +L VP + ++T+D   +  PE    +  C     P L++   
Sbjct: 38  DAGGRGFAAARDLRRGELVLRVPRAALLTSDRVMADDPE----IASCIAARHPRLSSVQR 93

Query: 156 ----LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD--RYLEASQIRERAIERITN 209
               L++E    KSS W  Y+S LP     L  +   E++  +  +A  I ++++  I +
Sbjct: 94  LIVCLLAEVGKGKSSSWYLYLSQLPSYYTVLATFNDFEIEALQVDDAIWIAQKSLSAIRS 153

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADML 269
                  L   +       F  ++   +T+ W+F  + SR + + + D    L P  D+ 
Sbjct: 154 EWEDATPLMQGL------KFKPKLLIFKTWLWAFATVSSRTLHV-AWDDAGCLCPVGDLF 206

Query: 270 NHSC----------------------EVETFLDYDKS-----------SQGVVFTTDRQY 296
           N++                       E+   + + +S           S+       + Y
Sbjct: 207 NYAAPDDDISSEEENREEVTKCQQKNEMLEEVKFGRSSERLSDGGYEDSEAYCLYARKCY 266

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
             GEQV + YG  +N ELL  YGF+  E  N    ++L L L
Sbjct: 267 TKGEQVLLGYGTYTNLELLEHYGFLLAENPNEKTYIQLDLDL 308


>gi|26344391|dbj|BAC35846.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 62/272 (22%), Positives = 116/272 (42%), Gaps = 27/272 (9%)

Query: 81  KWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           +W    GL   P+    ++  V   G+VA +++R G  L  VP S ++   S  +C  +G
Sbjct: 51  RWCRRVGLELSPKVTVSRQGTVAGYGMVARESVRAGTLLFAVPRSALL---SPHTCSISG 107

Query: 139 EVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
            + ++     S+  W +     +       +S WS Y +  P   +    ++W   E  R
Sbjct: 108 LLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPELGRLEHPMFWPEEERLR 166

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
            L+ + + E   + + N+   Y  + L     + DLF   V ++E ++    ++ +   +
Sbjct: 167 LLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPSVRSLELYQQLVALVMAYSFQ 226

Query: 253 LPSMD-------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP---GEQV 302
            P  +           +VP AD+LNH       L+Y      +V T     QP   G ++
Sbjct: 227 EPLEEDDDEKEPNSPLMVPAADILNHIANHNANLEYSADYLRMVAT-----QPILEGHEI 281

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           F +YG+ +N +L+  YGF      N  D+ ++
Sbjct: 282 FNTYGQMANWQLIHMYGFAEPYPNNTDDTADI 313


>gi|322694827|gb|EFY86647.1| SET domain protein [Metarhizium acridum CQMa 102]
          Length = 467

 Score = 58.9 bits (141), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 65/281 (23%), Positives = 113/281 (40%), Gaps = 51/281 (18%)

Query: 94  AIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE----AGEVLK 142
           AI+ VD+      RG+ AL++I     L  +P   +I +++   +   P+     G+  +
Sbjct: 28  AIEIVDLRSRDAGRGITALRDIPADTTLFTIPRDAIINSETSSLRKKLPDLFESQGDEDE 87

Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRER 202
           + ++  W  L   ++ E      S+W  YI  LP    + ++W+  EL  YL+AS     
Sbjct: 88  EQALDSWSALILIMMYEFFLGDESKWKPYIDVLPLTFDTPMFWSEEEL-SYLQASA---- 142

Query: 203 AIERITNVIGTYND---LRLR----------IFSKYPDLFPEEVFNME------TFKWSF 243
                 N IG  +     R R          +F    D   E++  +          ++F
Sbjct: 143 ----TVNKIGKADAEEMFRTRLIPAIRGNPSVFVSSGDCSDEDLIGLAHRMGSTIMAYAF 198

Query: 244 GILFSRLVRLPSMDGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
            +           DG V          +V  AD+LN   E    +++    + +  T+ R
Sbjct: 199 DLENEEAENDEESDGWVEDREGKSMMGMVAMADILNADAEFNAHVNH--GDEELTVTSIR 256

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
             + GE++   YG   N ELL  YG++  E  +  D VE+P
Sbjct: 257 DIKAGEEILNYYGPHPNSELLRRYGYIT-EKHSRYDVVEIP 296


>gi|313216036|emb|CBY37421.1| unnamed protein product [Oikopleura dioica]
 gi|313219606|emb|CBY30528.1| unnamed protein product [Oikopleura dioica]
          Length = 346

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 79/332 (23%), Positives = 131/332 (39%), Gaps = 65/332 (19%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI-- 126
           E++  +  S L+ ++ ++G    ++ + K    + G++A  +I++ E L+ +P S  I  
Sbjct: 2   EVEKDKAISHLRSFIEENGFVDPRIQVCKTAF-DLGIIASASIKEDEVLIRIPRSCQINL 60

Query: 127 -TADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----PY 180
            + DS  +        K   +    +LA YL  E     S     +I  LP        Y
Sbjct: 61  HSFDSSQTSISKFIRQKNIKLTCHQILALYLALERRNPLSPAMKKFIPTLPSSCCLPLNY 120

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFK 240
           S         + YL +  +++   E +   +G    L                   E   
Sbjct: 121 SPKAMANLPCEVYLLSVALQKNVTE-LCFALGKQIGL-----------------TKEDLT 162

Query: 241 WSFGILFSRLVRLPSMDGR--------------VALVPWADMLNHSCEVETFLDYDKSSQ 286
           W+F ++ SR   LP  D                  L P+ D++NHS     + + D  + 
Sbjct: 163 WAFSMVLSRTFSLPKYDKSSDFDYCSQVDSSKSAFLCPFMDLINHSSAPNCYYETDSETG 222

Query: 287 GVVFTTDRQYQPGEQVFISY-GKKSNGELLLSYGFVPREGTN-------------PSD-- 330
             V   DR+ Q  E++FI+Y G KS+  LL  YGF    G N             PS   
Sbjct: 223 DFVLRADRELQQKEELFITYGGSKSDHVLLAFYGFCLPPGVNRNSYIVFSPNFIGPSSHS 282

Query: 331 ---SVELPLSLKKSDKCYKE----KLEALRKY 355
              + +  L+ KK+ KC+KE    KLE+ RK+
Sbjct: 283 KFTAFKFFLNSKKA-KCFKESLNKKLESWRKF 313


>gi|387191841|gb|AFJ68625.1| set domain-containing protein [Nannochloropsis gaditana CCMP526]
          Length = 736

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 11/192 (5%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           LA  L++E      S W  Y+  LP +  +  +++  +E       S +R   ++    +
Sbjct: 260 LAVLLVAERMKGPQSFWWPYLRNLPEKYAHMPIFYNNSEFGSIQIPSLMR--TVQSRCRM 317

Query: 211 IGTYNDLRLRIFSK---YPDLFPEEVFNMETFKWSFGILFSRLVR-LPSMDGRVALVPWA 266
           +   +D  LR  S      + F ++V +     W      SR +R +P +     +VP  
Sbjct: 318 LVNISDGYLRQLSHGGPAENPFLDDV-HANDMGWGLCAASSRALRNIPGLGSTPLMVPVI 376

Query: 267 DMLNHSCEVETFL-DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
           D   H+     ++ DY KS   +     R  QPG+ + ISYG  +N +LLL YGF   + 
Sbjct: 377 DFCEHAVSPTCYIKDYRKSGGSIQLVAGRDLQPGDALTISYGNLTNPQLLLDYGFTLSD- 435

Query: 326 TNPSDSVELPLS 337
            NP D  E+ LS
Sbjct: 436 -NPHDRFEVTLS 446


>gi|190345582|gb|EDK37493.2| hypothetical protein PGUG_01591 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 592

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 128/311 (41%), Gaps = 57/311 (18%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEK-LLFVPPSLVITADSKW-S 133
            + L +W    GL   +  I+   +GE    A      GEK  + +P  L IT DS   S
Sbjct: 2   VAELVQWAKTQGLELNE-GIEFRGIGENNTGAFYTTNNGEKPYIRLPVELAITVDSALRS 60

Query: 134 CPEAGEVLK-QCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELD 191
             +  E L+ QC   +  +L   L  E S  K+S    Y+  LP  Q  +  Y   AE  
Sbjct: 61  FGQDLEALRDQCDSSN-TVLKLCLARERSRLKNSTIKKYLECLPTLQQMNTPYCWDAETK 119

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF--PEEVF-NME----------- 237
           RYL+ + +     E I  ++  +     +I +  PD    PE+ F NM+           
Sbjct: 120 RYLQGTNLGSSLKENIGVLVEEW----WKIINLLPDSVQKPEQHFVNMKYYYESKFYTDD 175

Query: 238 -------------------TFKWSFGILFSR----LVRLPSMDGRV-----ALVPWADML 269
                               F W+  IL SR     +   ++D  V      L+P  D+L
Sbjct: 176 DAYAYFVTNEDPANWTSFPNFLWASIILKSRSFPAYLIADAVDWDVKRHDTMLLPVIDLL 235

Query: 270 NHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           NHS    VE  L+  +S    VF +D   + G Q+F +YG K N ELLL+YGF   +  N
Sbjct: 236 NHSPSAHVEWGLERKESKSYFVFKSD-DVKSGSQLFNNYGMKGNEELLLAYGFCLED--N 292

Query: 328 PSDSVELPLSL 338
            SD   L + +
Sbjct: 293 SSDVSALKIKV 303


>gi|380490713|emb|CCF35823.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 403

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/119 (35%), Positives = 61/119 (51%), Gaps = 14/119 (11%)

Query: 257 DGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           D  + L P AD+LNH+   C V     +D  S  ++   DR Y PG+++ I YG+ SN  
Sbjct: 206 DDHMVLQPVADLLNHAPRGCSVA----FDARSFTIL--ADRDYSPGDEIHICYGRHSNDF 259

Query: 314 LLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEA---LRKYGLSA-SECFPIQI 367
           LL+ YGFV   G N  D   L  + L +    ++ +LE    L KY L A + C+  Q+
Sbjct: 260 LLVEYGFVMAPGENDWDEACLDDVLLPRLSDAHRRRLEERGFLGKYMLDAETVCYRTQV 318


>gi|145524165|ref|XP_001447910.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415443|emb|CAK80513.1| unnamed protein product [Paramecium tetraurelia]
          Length = 717

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 83/176 (47%), Gaps = 11/176 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCP------EAGEVLKQCSVPDWP--LLATY 155
           GL+A + I   + L+ +P   ++T    +  P      E  +      +P W   ++ ++
Sbjct: 64  GLIATEKIVPNDNLVILPRETLLTTRQAFESPLKPMFLEFPQFFSPKFMPSWQYHIILSF 123

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ E      S+W   I+  P+     ++W   +L+  L+  ++ + AI++   +I T+ 
Sbjct: 124 LLYEYQKGAESKWHLLINNFPKDIDYAVFWKSEDLE-LLQDKKMAKHAIQKNRYLITTFQ 182

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
            L+  I SK+PDLF  EV  +E   W +  + +R      +   V +VP+ ++ NH
Sbjct: 183 TLQY-ITSKFPDLFKPEVVTLENIIWIYTSIVTRCFGGQGL-KYVTMVPFCELFNH 236


>gi|357491725|ref|XP_003616150.1| Protein SET DOMAIN GROUP [Medicago truncatula]
 gi|355517485|gb|AES99108.1| Protein SET DOMAIN GROUP [Medicago truncatula]
          Length = 532

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 72/319 (22%), Positives = 127/319 (39%), Gaps = 94/319 (29%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADS-----KWSC------------------PEA 137
           G RGL A++++++GE +L VP S ++T++S     K  C                  P+ 
Sbjct: 51  GGRGLGAVRDLKRGEIILRVPKSALMTSESVIMEDKKLCLAVNRHSSLSSVQRNTPNPKR 110

Query: 138 GEVLKQ----------CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
             V ++          C      +L   L+ E    K+SRW  Y+  LP Q Y LL    
Sbjct: 111 CHVTERSRVQVLETASCVKQGKAILTVCLLYEVGKGKTSRWHPYLVHLP-QSYDLLA-MF 168

Query: 188 AELDRYL----EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
            E ++      EA  + E+A+++  +     + L   +      +F  ++   + + W+ 
Sbjct: 169 GEFEKQALQVDEAMWVTEKAVQKAKSEWKEAHALMEDL------MFKPQLLTFKAWVWAA 222

Query: 244 -------------GILFSRLVRLPSMDGRVALVPWADMLNHSC---------EVETFLD- 280
                        G++ SR + +P  D    L P  D+ N+           +V+ FL  
Sbjct: 223 ATGRTVPETFHLPGLISSRTLHIP-WDEAGCLCPVGDLFNYDAPGEELSGVEDVDHFLSN 281

Query: 281 -----------------------YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
                                  +++ +    F     Y+ G+QV + YG  +N ELL  
Sbjct: 282 GDMNVVIDEGQIDFNSQRLTDGGFEEDANAYCFYARTNYKKGDQVLLCYGTYTNLELLEH 341

Query: 318 YGFVPREGTNPSDSVELPL 336
           YGF+ +E  NP+D + +PL
Sbjct: 342 YGFLLQE--NPNDKIFIPL 358


>gi|345325919|ref|XP_001512656.2| PREDICTED: histone-lysine N-methyltransferase setd3-like
           [Ornithorhynchus anatinus]
          Length = 345

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + +  GEQ++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L 
Sbjct: 51  QDFTAGEQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 108

Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
           + G+  S  F +  T  P+     A+L V
Sbjct: 109 RAGIPTSSVFALHFTEPPISAQLLAFLRV 137


>gi|159476096|ref|XP_001696150.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
 gi|158275321|gb|EDP01099.1| protein N-methyltransferase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 89/407 (21%), Positives = 154/407 (37%), Gaps = 54/407 (13%)

Query: 29  TDFPRKRCGHRIVVHCSVSTTNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGL 88
           +D P +R    +    SV+ T  A +        ++       +  N +     +  +G+
Sbjct: 22  SDAPARRRAPVVAARASVTPTEPAPQAPQQQPPAVLLDDSRTQTFMNWARGPASIRFAGV 81

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP--PSLVITADSKWSCPEAGEVLKQCSV 146
            P   A      G RGL A  +I     ++ VP   ++V+    + SCP         + 
Sbjct: 82  KPSTFA------GIRGLAASSDIANDALIVEVPRHSAVVLAPKQRNSCPGMVNDEWWKNA 135

Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDR----YLEASQIRER 202
           P +  +   L+        S  + +I+ LP      L W+  +L      YL A Q++E+
Sbjct: 136 PWFAKMGAMLLWHKRQGSQSPLAPWIAQLPADTGVPLNWSDKQLAALQYPYLVA-QVKEQ 194

Query: 203 AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG---- 258
             E  T +  T     +   +  P        + E F W+ G++ SR    P +      
Sbjct: 195 QRE-WTALYDTLRGSGMAAGAAPP--------SREEFWWAMGVVRSRTFSGPYIGSTLSD 245

Query: 259 --------------------RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
                               + A+ P  D+ NH+   ++ + Y+           R ++ 
Sbjct: 246 RLRLAGLVAALVVILSRSLKQYAICPLIDLFNHTSAAQSEVSYNYFGDSYSVVASRDFKK 305

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV---ELPLSLKKSDKCYKEKLEALRKY 355
           GEQVFI+YG +SN  L+  YGF   E  NP D+    ++   L+        +++AL+  
Sbjct: 306 GEQVFITYGAQSNDSLMQYYGFA--EADNPQDTYVISDVLRWLQGFRPLPPGRVQALQGS 363

Query: 356 GLSAS--ECFPIQITGWPLE-LMAYAYLVVSPPSMKGKFEEMAAAAS 399
            L A+      +Q  G+P E L A  +L+ S           A A S
Sbjct: 364 SLGAACLSNVAVQRAGFPAEALQALRFLLASDAEAAAGVSAFAKAGS 410


>gi|212544736|ref|XP_002152522.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
 gi|210065491|gb|EEA19585.1| hypothetical protein PMAA_003730 [Talaromyces marneffei ATCC 18224]
          Length = 429

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 16/154 (10%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSMD------GRVALVPWADMLNHSCEVETFLDY 281
           +FP+  +   ++ W      S    +P  D        +A+VP+AD  NH+ + E  + +
Sbjct: 194 VFPDTDWEKFSYHWLIVNTRSFYYLMPGQDPPEDTNDAMAMVPFADYFNHTDDAECEVHF 253

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKK 340
           D  S    F   R Y+ GE++++SYG   N  L + YGF     TN SD++ L  +  K 
Sbjct: 254 DGKS--YTFRATRLYKKGEEIYMSYGPHPNDFLFVEYGFYLE--TNESDAIFLDDIIFKD 309

Query: 341 SDKCYKEKLEALRKYG-----LSASECFPIQITG 369
                KE+L   R YG     L +  CF  ++  
Sbjct: 310 FTVAEKEELIRQRYYGNYQITLESGPCFRTEVAA 343


>gi|444314545|ref|XP_004177930.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
 gi|387510969|emb|CCH58411.1| hypothetical protein TBLA_0A06190 [Tetrapisispora blattae CBS 6284]
          Length = 550

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 82/184 (44%), Gaps = 19/184 (10%)

Query: 167 RWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP 226
           R+  YI+ALP    S L W   EL + L+ + +     ER  NV   +     +   KY 
Sbjct: 100 RFEPYINALPEIIDSPLNWNEDEL-KLLQNTNLGNCLKERFQNVYDEW----FKFLEKYQ 154

Query: 227 DLFPEEV------FNMETFKWSFGILFSR-----LVRLPSMDGRVALVPWADMLNHSCEV 275
           +    E       +N   F W+  I+ SR     ++        V L+P  D+LNHS   
Sbjct: 155 NYQEFETQSETSWYNFSNFLWAHLIITSRSFPEYIINPNCPRDSVMLLPVLDLLNHSNYS 214

Query: 276 ETFLDYDKSSQGVVFTTDRQ-YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           +   D +K    +    D Q  + G++++ +YG K N ELL  YGFV  +  N  DSV L
Sbjct: 215 KVEWDGNKGGNFIYKKLDLQEIEIGDEIYNNYGGKGNEELLNGYGFVIED--NLFDSVLL 272

Query: 335 PLSL 338
            + +
Sbjct: 273 KIKI 276


>gi|407852222|gb|EKG05847.1| hypothetical protein TCSYLVIO_003073 [Trypanosoma cruzi]
          Length = 565

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 106/245 (43%), Gaps = 27/245 (11%)

Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEA 196
           D PLL   LI E    ++S W+  + + P + P    +W   +L          D   + 
Sbjct: 197 DEPLLVLSLIYERYVAETSHWNELLLSCPGEYPNVPSFWDWEDLAELEGLDVLDDVLAKK 256

Query: 197 SQIRERAIERITNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
           +Q+ +   E +  +   +  L     F K   L   E F++E   W+     SR   L +
Sbjct: 257 AQLAQFQTETMAVLPFIHEALAGGCRFGKDEFL---ECFSIEAMMWARATFDSRAFNL-N 312

Query: 256 MDGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKK 309
           +DGRV  ALVP ADM+NH    +  +   + + G     +  +      G ++++SYG  
Sbjct: 313 VDGRVVIALVPVADMINHHNRSDVLVRRVEPNGGDFVMQIGASLTAQDIGREIWMSYGPL 372

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKS---DKCYKEKLEALRKYGLSASECFPIQ 366
            N ELL  YGFV  EG N  D +  PL   ++   D+    +   + KYGL  + C  I 
Sbjct: 373 QNWELLQFYGFV-LEG-NEHDRLPFPLDFPEAAVGDEWDGRRAALVAKYGLHLAGCCWIC 430

Query: 367 ITGWP 371
             G P
Sbjct: 431 HDGRP 435


>gi|330800139|ref|XP_003288096.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
 gi|325081857|gb|EGC35358.1| hypothetical protein DICPUDRAFT_152307 [Dictyostelium purpureum]
          Length = 525

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 86/179 (48%), Gaps = 9/179 (5%)

Query: 95  IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLAT 154
           + K  +   G+++ K+++    +  +P  ++++  +         +L + ++      A 
Sbjct: 72  LNKTIISGLGIISNKDLKVNNIVAKIPKDIILSIHT----SSISNILTKYTMERNIATAI 127

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG-T 213
            LI EAS  + S+W  YIS+LP +    + W +    + L    + E  I+    +I   
Sbjct: 128 ALIYEASIGEKSKWYGYISSLPLKVDIPILWDKES--QQLLNGTVMEDVIQDDNILINHA 185

Query: 214 YNDLRLRIFSK-YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
           Y D+   +  K +P+ F +E+F+ E FK +  I+ SR   + S  G  +LVP AD+ NH
Sbjct: 186 YADIVESLLIKNHPEYFSKEIFSFENFKIANSIVSSRAFCIDSYHGD-SLVPLADIFNH 243


>gi|310794069|gb|EFQ29530.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 375

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 41/116 (35%), Positives = 59/116 (50%), Gaps = 8/116 (6%)

Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
           D  + L P AD+LNH+    +    D+S        +R Y PGE++ I YG+ SN  LL+
Sbjct: 176 DDHMILQPVADLLNHASRGCSVAFDDRS---FTIKAERDYAPGEEMHICYGRHSNDFLLV 232

Query: 317 SYGFVPREGTNPSDSVELPLS-LKKSDKCYKEKLEA---LRKYGLSA-SECFPIQI 367
            YGFV  +G N  D   L  + L + D   + +LE    L KY L A + C+  Q+
Sbjct: 233 EYGFVMAQGENEWDEACLDDAILPRLDAACRRRLEERGFLGKYMLDAETVCYRTQV 288


>gi|119495234|ref|XP_001264406.1| SET domain protein [Neosartorya fischeri NRRL 181]
 gi|119412568|gb|EAW22509.1| SET domain protein [Neosartorya fischeri NRRL 181]
          Length = 492

 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 37/98 (37%), Positives = 54/98 (55%), Gaps = 6/98 (6%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE--VETFLDYDKSSQGVV-F 290
           F  +  K+   +  SR+V LP      A+VP  DM NH+CE  V+   D D +   V+  
Sbjct: 198 FTFDDLKYVDAVYRSRVVDLPRSGH--AIVPCVDMANHACEDLVKARYDEDGAGNAVLQL 255

Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
            T ++ + GE+V ISYG +K   E++ SYGFV  E T+
Sbjct: 256 RTGKKLRVGEEVTISYGDEKPASEMVFSYGFVENERTD 293


>gi|294896472|ref|XP_002775574.1| Protein SET DOMAIN GROUP, putative [Perkinsus marinus ATCC 50983]
 gi|239881797|gb|EER07390.1| Protein SET DOMAIN GROUP, putative [Perkinsus marinus ATCC 50983]
          Length = 416

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 61/266 (22%), Positives = 104/266 (39%), Gaps = 50/266 (18%)

Query: 87  GLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSC-PEAGEVLKQCS 145
           G    ++A+++     RG+     I+ G+  + +P   +ITA+    C  +  EV ++  
Sbjct: 18  GAELNRVAVEEFSGAGRGVRVKTAIQSGQVAIGIPHKFIITANVSSPCLSDDSEVYRK-- 75

Query: 146 VPDWPLLATYLISEASFEKS---------------SRW-SNYISALPRQPYSLLYWTRAE 189
              W +    ++S A                    S W S Y+  LP +  ++ YW+ A+
Sbjct: 76  ---WIMKMGKILSGAELLSLILLRLLERSRSNLSPSDWRSLYLRTLPLEYTTISYWSEAD 132

Query: 190 LDRYLEASQIRERAIERITNVIG-TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
              +  AS +    +E        +   +R     +Y          ME  +W++  + +
Sbjct: 133 KKMFSAASSVLAEELETAERTCKISCEKIRQATGGRYA---------MEDIEWAYWTIET 183

Query: 249 RLV--RLPSMDGRVALVPWADMLNHSCEV------------ETFLDYDKSSQGVVFTTDR 294
           R    RL    G V LVP  DM+NHS E                 +Y+       FT  R
Sbjct: 184 RGCYHRL----GGVCLVPLGDMVNHSAEAYSTENCGKCGMQNRCGNYNVREHRYEFTAMR 239

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGF 320
            Y   EQ F+ Y   ++ +LL+ YGF
Sbjct: 240 DYNENEQFFVVYSGCASTDLLMRYGF 265


>gi|149059900|gb|EDM10783.1| hypothetical protein RDA279, isoform CRA_c [Rattus norvegicus]
          Length = 314

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 60/216 (27%), Positives = 97/216 (44%), Gaps = 24/216 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           L+KWL +       + +     G  RGL++  ++++G+ ++ +P S ++T D+       
Sbjct: 70  LRKWLKERKFEDTGLLVPACFPGTGRGLMSKASLQEGQVIISLPESCLLTTDTVIRS-SV 128

Query: 138 GEVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQ---PYSLLYWTRAELDR 192
           G  +K+   P  PLLA  T+L+SE      S W +Y+  LP+    P  L       L  
Sbjct: 129 GPYIKKWKPPVSPLLALCTFLVSERHAGSHSLWKSYLDILPKSYTCPVCLEPEVVDLLPG 188

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSR 249
            L A    +RA  R+ ++  +  D     FS    LF E V   F+   F W++  + +R
Sbjct: 189 PLRAKAEEQRA--RVQDLFASSRDF----FSTLQPLFAESVDSIFSYHAFLWAWCTVNTR 242

Query: 250 LVRLPSMDGR--------VALVPWADMLNHSCEVET 277
            V L S             AL P+ D+LNHS  V+ 
Sbjct: 243 AVYLKSRRQECLSSEPDTCALAPFLDLLNHSPHVQV 278


>gi|400602586|gb|EJP70188.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 797

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/105 (39%), Positives = 52/105 (49%), Gaps = 20/105 (19%)

Query: 226 PDLFPEEVFNMETFKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVE 276
           PDL  +E      + W      SF       +  PS D R+AL+P AD+LNH+   C V 
Sbjct: 559 PDLEKQEYL----YSWFLVGTRSFYYEIEETLSYPSHD-RLALLPVADVLNHANAGCSVA 613

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
                  S++    T DR YQ GE+V+ SYG  SN  LL  YGFV
Sbjct: 614 F------STEAYDITADRAYQAGEEVYTSYGAHSNDFLLAEYGFV 652


>gi|300124011|emb|CBK25282.2| unnamed protein product [Blastocystis hominis]
          Length = 366

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 20/227 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVP----DWPLLATYLIS 158
           RG+   ++I++G K++ +P  L+++ D    C E    L++ +V          A +L+ 
Sbjct: 2   RGVYLNRDIQRGTKIIKIPKKLIMSCDMGRDC-ELSANLREQNVDFEKYKHVYFANFLLE 60

Query: 159 EASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
           +   E S  +  Y   LP    ++ + WT +E+++ L  S        R+  +   Y  +
Sbjct: 61  DMENEDSF-YKPYYDTLPEDISNIPVIWTNSEINQ-LHGSYFSICIRSRVVEIYRDYQKM 118

Query: 218 --RLRIFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
                 F +YP  F + +   +     +FG  F+ L           LVP ADMLNH+  
Sbjct: 119 CDVNSFFCRYP--FDQYLRVRLLIGSRNFGSFFNSL-------NNGILVPLADMLNHTRP 169

Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            +T  +YD   +  V T+    + G QV  SYG++ N  LL SYGFV
Sbjct: 170 RQTTWEYDDKEKAFVITSLLNLRQGAQVMDSYGRRDNRRLLFSYGFV 216


>gi|294868786|ref|XP_002765694.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
 gi|239865773|gb|EEQ98411.1| hypothetical protein Pmar_PMAR013760 [Perkinsus marinus ATCC 50983]
          Length = 330

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 102/230 (44%), Gaps = 32/230 (13%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
           G  G  A  +I +GE+LL+VP S  +T       P   + L +  V    +LA  L+   
Sbjct: 41  GMIGCTATADICQGERLLYVPHSACVT-------PSGVQGLYEPQV----MLAASLVKHR 89

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
           + + +S + +Y+ +LP +    L W+  EL   L+ + + E             + L L 
Sbjct: 90  T-DPNSPFHDYLQSLPSEFDHPLEWSADEL-VCLKGTTVWE------------MHQLSLE 135

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
           +     +L P     M   +W+  ++ SR     S    + ++P AD  NHS   +    
Sbjct: 136 VVDSVVELCPNSPRAM--IRWAVEVMMSRA--FESEVCGLCVIPLADQFNHS-STKWHTR 190

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
             +   G     ++  + GE++F +YG  +N  LLL++GF+  E  NP D
Sbjct: 191 VREVEGGFQMLAEKPVKKGEEIFNNYGLYTNEMLLLTHGFI--EFDNPHD 238


>gi|145551877|ref|XP_001461615.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429450|emb|CAK94242.1| unnamed protein product [Paramecium tetraurelia]
          Length = 666

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 81/176 (46%), Gaps = 12/176 (6%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSVPDWP--LLATY 155
           GL  ++ I     L+ VP  L++T      +D +       +   Q     W   +L TY
Sbjct: 69  GLQTIQKIETDSILVSVPRELMLTTKIAYFSDIQEIFDAYPQFFCQHCAGGWQDRILLTY 128

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           ++ ++   + S+W + IS LP+    L++W+  EL + L   ++  +A   + + +    
Sbjct: 129 ILYQSQLGRQSQWYHLISNLPKDIDYLIFWSEQEL-KLLNDEKLILKAKRDLQDFLLIQK 187

Query: 216 DLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
            L   I  ++P  F +E ++ E  KW F  L SR     S   +VA VP+ +M NH
Sbjct: 188 TLT-HILDQFPQHFQKETYSFENIKWIFIHLVSRC--FGSTLEQVAFVPFCEMFNH 240


>gi|302815683|ref|XP_002989522.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
 gi|300142700|gb|EFJ09398.1| hypothetical protein SELMODRAFT_129980 [Selaginella moellendorffii]
          Length = 464

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 59/248 (23%), Positives = 118/248 (47%), Gaps = 21/248 (8%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----DSKWSCPEAGEVLKQCS-VPD--WPL-LATY 155
           GLVA +++ +G  ++ +P  + +      ++    P  G + +  + VP+  W + L   
Sbjct: 49  GLVATQDLPQGSTIITLPRRIPMPMPDPENAAVLAPSEGVICEIANRVPEELWAMRLGLK 108

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ E + +K S W  YIS LP      ++++  +++  ++ + +  +  +R   ++   +
Sbjct: 109 LLYERA-QKGSYWWPYISMLPHSFTLPIFFSGVDIES-IDYAPVTHQVKKRCRFLLQFSS 166

Query: 216 DL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH 271
           +L +L    +    F  +  +     W+   + SR  R+  +  ++    ++P  DM NH
Sbjct: 167 ELAKLESLPEEIHPFAGQFVDSGALGWAMAAVSSRAFRIHGVTNKLCSAMMLPLIDMCNH 226

Query: 272 SCEVETFLDYD--KSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           S +    ++ D  + +Q V F    T R  + G  + ++YG  SN  LLL YGFV  +  
Sbjct: 227 SFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPD-- 284

Query: 327 NPSDSVEL 334
           NP D +EL
Sbjct: 285 NPHDRIEL 292


>gi|320168265|gb|EFW45164.1| hypothetical protein CAOG_03170 [Capsaspora owczarzaki ATCC 30864]
          Length = 464

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 70/300 (23%), Positives = 127/300 (42%), Gaps = 52/300 (17%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCP----------EAGEVLKQCSVPDWPLLA 153
           G++A ++I  G+  + VP +L++TA+                ++ E+    +  D  LL 
Sbjct: 169 GVIARRDIPAGQTFINVPEALMMTAEKARKSETFQLITSGALDSTELSPAMAKLDNFLLR 228

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
            +LI E     +S WS YI  LP++    LY+T AEL+  L+ S   + A  ++ NV+  
Sbjct: 229 MFLIVERRRGGNSYWSPYIDLLPQRFRLPLYFTEAELE-LLKPSPALQEAFVQLRNVVRQ 287

Query: 214 Y----------------------NDLRLRIFSK------YPDLFPEEVFNMETFKWSFGI 245
           Y                       D   +I  +       P  + E  +++  F W+   
Sbjct: 288 YAAWKQYLMMLELARAAELPSGSGDAHQKILDQRRRAQAMPVRYNELTYDL--FCWASSA 345

Query: 246 LFSRLVRLPSMDGR--------VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ 297
           + +R  ++   + R        +AL+P  DM NH+    +   YD  ++ +         
Sbjct: 346 VATRQNQIVVGEVRANQAPELSLALIPGWDMCNHAFGGASSF-YDTQTRSLECVAVAPIA 404

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
            GE V + YG +S+     +  FVP +  +P+D   + L++ K D  +K K   L+  G+
Sbjct: 405 KGEPVLLHYGDRSSMAYFGNSEFVPAD--HPTDQYLILLAVGKQDPLFKSKSTILQALGV 462


>gi|397589374|gb|EJK54638.1| hypothetical protein THAOC_25717 [Thalassiosira oceanica]
          Length = 468

 Score = 58.2 bits (139), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 74/316 (23%), Positives = 134/316 (42%), Gaps = 46/316 (14%)

Query: 45  SVSTTNDASRTKTTVTQNMI----PWGC------EIDSLENASTLQKWLSDSGLPPQK-- 92
            VS  + A RT+TT     +    P         +I +  +  ++++W +  G+  QK  
Sbjct: 28  GVSVFDSARRTRTTANAAALNQYAPQAAGLVDVSDIYAQRDVYSMEEWAAQFGM--QKAP 85

Query: 93  -MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS------ 145
            + I   D  +  L A + I  G+ +++VP  +V+ + S     E GE L Q        
Sbjct: 86  GVEIASEDGVDYSLQATQPISTGQSVVYVPSDIVLNSASIQQ--EFGESLAQAEAVLVQG 143

Query: 146 --------------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
                         +P + L+A  L+     ++S+ +  ++++LPRQ ++ +  T+A   
Sbjct: 144 IKVRVRVKEGINYRLPLFRLMAKILVEYEKGQESAFYP-WLNSLPRQFFNGVSMTKAC-- 200

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
                S +   A    +N    Y      +   +  L  E V N E  KW++ + F+R  
Sbjct: 201 ----TSCLPPYAGWLTSNEKINYARFAQALRQGWVPLSQETVSNEEVVKWAYNVAFTRFH 256

Query: 252 RLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
            +   +    + P ADMLNH+ +    ++ D S   +     R+   G  + ISYG  +N
Sbjct: 257 EVWQPERAKLIGPMADMLNHAADPNCAIEVDYSG-NINVVALREIPAGSALTISYGDPTN 315

Query: 312 -GELLLSYGFVPREGT 326
              L   YGF+P++ T
Sbjct: 316 PTPLFAQYGFLPQDCT 331


>gi|328866266|gb|EGG14651.1| hypothetical protein DFA_10909 [Dictyostelium fasciculatum]
          Length = 581

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 257 DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
           +    LVP AD+ NH+  V+T   Y  + +     TD +++ GEQVFISYG  +N  LL 
Sbjct: 352 NDNACLVPLADLFNHNPNVKTMASYCAADRCYRVYTDTRFEKGEQVFISYGLHNNATLLH 411

Query: 317 SYGFVPREGTNPSDSVEL 334
            YGFV     N  D +E+
Sbjct: 412 YYGFVI--DNNHLDGIEI 427



 Score = 40.0 bits (92), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 91/198 (45%), Gaps = 21/198 (10%)

Query: 63  MIPWGCEIDSLENASTLQ-KWLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKL 117
           MI    EI + E+   L  +W   +G+   +  ++  D G+    RG++A + I  G+ L
Sbjct: 1   MISNNKEISNQEHQERLMIEWGKKNGVKWDEEMMEIHDFGDSGGGRGVIAKRTIESGDLL 60

Query: 118 LFVPPSLVITADSKWSC-PEAGEVLKQCSVPDWPLLATY-LISEASFEKSSRWSNYISAL 175
           + VP SL+I +    S  P    +     + D      + LI E      SRW  Y+  +
Sbjct: 61  VEVPLSLLIHSLPILSVVPPFEHIETVLKLLDSKQTICFQLIYERLIRNRSRWYGYLDCI 120

Query: 176 PRQPYSLLYWTRAELDRYL------EASQIRERAIE---RITNVIGTYNDLRLRIFSKYP 226
           P++  + + +T AE+          EA+++R+  ++   +   ++ ++  L L++ S + 
Sbjct: 121 PKEYNTTVSYTDAEIGELSYPYYKNEATKLRKEMLDSHKQYKEILQSH--LTLKVLSSHS 178

Query: 227 DLFPEEVFNMETFKWSFG 244
           +L   +  N  TFK S G
Sbjct: 179 EL---DNNNNSTFKSSGG 193


>gi|298708218|emb|CBJ30557.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 493

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 74/273 (27%), Positives = 117/273 (42%), Gaps = 33/273 (12%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVI----TADSK--W------SCPEAGEVLK-QCSVP 147
           G RG+VA K+I +   L+ +  S  +    T +SK  W      S  +A +  + + S P
Sbjct: 66  GYRGVVATKDIPRDAVLVRIARSCCLGPETTDESKNSWTKAMSTSAVDATKTTQGESSKP 125

Query: 148 DWP------LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE 201
             P      L    L+ E    +SS + +Y+S LP+     L WT AE+      S    
Sbjct: 126 PAPRLTRACLTVLRLLHERGLGESSPFHSYLSVLPQDHRLPLEWTEAEVGLLQGTSAEPL 185

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
                + +    +      + +++P ++   V     F      + SR     ++ G   
Sbjct: 186 VGAGSLDSQFEAFQS----VVAQHPTVWEPSVCTKAAFAKGVNWVRSRGF---TVMGDPH 238

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           ++P ADM NH    ++        +  V  T +  + GE+VF S+G  SN +LL SYGFV
Sbjct: 239 MIPGADMFNHDPNKQSVQIGTDGEEHFVMKTVQPVKAGEEVFSSFGHISNAQLLNSYGFV 298

Query: 322 -PREGTNPSDSVELPLSLKKSDKCYKE--KLEA 351
            P    N  D+V +P  L   + CY    KLEA
Sbjct: 299 LP---GNSFDTVLIPTQL-VVNTCYATFVKLEA 327


>gi|397569514|gb|EJK46791.1| hypothetical protein THAOC_34522 [Thalassiosira oceanica]
          Length = 702

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 60/264 (22%), Positives = 109/264 (41%), Gaps = 25/264 (9%)

Query: 78  TLQKWLSDSGLPP-QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSK----W 132
           ++++W   +G+   + + +   D  + GL    ++  G  +++VP S VI++D       
Sbjct: 99  SMEEWCVQNGVERIEGIQLYTEDGADYGLSTQVDLPAGSTIVYVPSSTVISSDQVAEDLG 158

Query: 133 SCPEAGE----VLKQCSVPDWPLLATYLISEASFEKSSRWSNY--ISALPRQPYSLLYWT 186
              EA E     ++  +    PL    +     +EK     +Y  + +LPRQ Y+ +  T
Sbjct: 159 GSLEAAENALIQMETLTARRIPLFRLMVFVLKEYEKGVNSKHYAWLQSLPRQYYNGVSMT 218

Query: 187 R---AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSF 243
                 L  Y       ER           YN+    +   Y D+  E V +     W++
Sbjct: 219 EDCFGVLPPYAAKLAKSERE---------NYNNFVAGLREGYVDIADEIVDDDTIVNWAY 269

Query: 244 GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVF 303
            +  +R + +   + +  +VP  DM+NHS E    + +D     +V TT      G  + 
Sbjct: 270 NVALTRFIEVWQPNRQKKIVPMVDMINHSSEPNVDISFDDDGNCLV-TTLYDIPAGSALT 328

Query: 304 ISYGKKSN-GELLLSYGFVPREGT 326
           IS G  +N   +   YGF+P + T
Sbjct: 329 ISLGDPTNPTPIFAQYGFLPLDCT 352


>gi|392864101|gb|EJB10745.1| hypothetical protein CIMG_00433 [Coccidioides immitis RS]
          Length = 435

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 94/421 (22%), Positives = 157/421 (37%), Gaps = 71/421 (16%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
             +W  D G+    +A  +      G+ AL+ I  GE ++ VP S ++T D     P   
Sbjct: 16  FTQWAKDQGIQINGVAAVRFPGRGIGIAALRGIDAGETIVSVPTSSLLTLDK---IPSTF 72

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----------PYSLLYWTR 187
               Q   P   + A YL  +   +  SR++ + +  P             P  L+    
Sbjct: 73  REKFQGDTPVQGIFAAYLACDD--DARSRYAPWRATWPTMRDFEDSIPLLWPKYLIGTPG 130

Query: 188 AELDRYLEASQIRERAIERIT--NVIGTYNDLRLRIFSKYPDLFPEEV------------ 233
            EL    E +   +     +   ++ G +N L  R+     D  P+              
Sbjct: 131 DELKGQGETTGRGQEVFPSLLPPSISGHFN-LSNRVGRFSGDYTPDHQNLLENQRSRFRK 189

Query: 234 -----------FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCE 274
                       N+E F + +    +R     + D  V        AL P+AD  NHS  
Sbjct: 190 AFSRVKLACPGINLEIFTYYWFATHTRCFFYVAKDSEVPEDRNDAMALCPFADYFNHSSN 249

Query: 275 ---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
               +   D D    G  FT  + Y  GE+VF+ YG  ++  LL  YGFVP E  N  D+
Sbjct: 250 DPGCKASFDGD----GYTFTATKSYAKGEEVFVCYGNHTSDVLLTDYGFVPDE--NKWDA 303

Query: 332 VELP----LSLKKSDKCYKEKLEALRKYGLS-ASECFPIQITGWPLELMAYA----YLVV 382
           + L       + K  + Y ++   L  Y ++ A  CF  ++    L  M+      Y+  
Sbjct: 304 IFLDDIVLQDINKIKRRYLKEDNYLGNYQVTRAGPCFRTEVAA-SLTYMSVDDWSLYVGG 362

Query: 383 SPPSM--KGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQVKE 440
           + PS     K + + A+   K   + D+    ++      I ++     S   R+ Q++E
Sbjct: 363 TIPSSFDSEKTDRIVASWIRKYRDEADLAIARVEGMVRSGIYETANRLKSIILRWKQIRE 422

Query: 441 L 441
           L
Sbjct: 423 L 423


>gi|358380690|gb|EHK18367.1| hypothetical protein TRIVIDRAFT_47382 [Trichoderma virens Gv29-8]
          Length = 479

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 86/205 (41%), Gaps = 43/205 (20%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRERAIER 206
           +LI E    + S W  YI ALP QP     W          AEL   LE + + E  +E+
Sbjct: 92  FLIKELLRGQESFWYPYIQALP-QPEDFDDWALPPFWPEEEAEL---LEGTNV-EIGLEK 146

Query: 207 ITNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR----------- 252
           I   +G  + D R  + +   D   +  +    E ++W++ I  SR  R           
Sbjct: 147 IREDLGREFRDARNLLIASQKDAEDDFSDHLTRELYQWAYCIFSSRSFRPSLVLSEEQQQ 206

Query: 253 -LP---SMDGRVALVPWADMLNHSCEVETFLDYDK------------SSQGVVFTTDRQY 296
            LP   S++    L+P  D+ NH   V    D               S   V     R++
Sbjct: 207 SLPDGVSVNDFSVLLPLFDIGNHDMTVHVRWDLAAGDEAAAGAGVRGSGAAVQLKVGREH 266

Query: 297 QPGEQVFISYGKKSNGELLLSYGFV 321
           +PG+Q+F +Y  K+N ELLL YGF+
Sbjct: 267 KPGQQIFNNYSPKTNAELLLGYGFM 291


>gi|345328941|ref|XP_001507526.2| PREDICTED: LOW QUALITY PROTEIN: N-lysine methyltransferase
           SETD6-like [Ornithorhynchus anatinus]
          Length = 495

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/198 (23%), Positives = 86/198 (43%), Gaps = 11/198 (5%)

Query: 166 SRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFS 223
           S W +Y S  P        ++W + E  R L+ + + E   + + N+   Y+ + L    
Sbjct: 160 SPWHHYFSLWPDLNDLDHPMFWPKEERGRLLQGTGVPEAVEKDLANISHEYSSIVLPFTE 219

Query: 224 KYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVE 276
            +PDLFP    ++E +     ++ +   + P  +           +VP AD+LNH     
Sbjct: 220 AHPDLFPAGSCSLELYCRLVAVVMAYSFQEPLEEEEEDEEPNPPLMVPVADILNHVANHN 279

Query: 277 TFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
             L+Y      +V T  R    G ++F +YG+ +N +L+  YGF      N  D+ ++ +
Sbjct: 280 ANLEYAPECLRMVAT--RPIPKGHEIFNTYGQMANWQLVHMYGFAEPYPGNTDDTADIQM 337

Query: 337 SLKKSDKCYKEKLEALRK 354
              ++      + EA R+
Sbjct: 338 VTVRAAALQGAETEAERQ 355


>gi|320580679|gb|EFW94901.1| hypothetical protein HPODL_3273 [Ogataea parapolymorpha DL-1]
          Length = 423

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 90/383 (23%)

Query: 71  DSLENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKN-IRKGEKLLFVPPSLVIT 127
           +SLEN   L KW    G      + AI + D G  G   LKN +   E L+ VP +L IT
Sbjct: 5   NSLEN---LLKWSKSHGAYFDNVEFAIDR-DKGVHG--TLKNEVTFDEPLIIVPEALFIT 58

Query: 128 ADSKWSCPEAGEVLKQCSVPDWPLLATY--------------LISEASFEKSSRWSNYIS 173
                      E+ K     D P+                  LI   S   S+ +  YI+
Sbjct: 59  P----------ELAKGVFGLDNPIDDLSLLLLAKLKFDKKETLIDGNSL--STMYEPYIA 106

Query: 174 ALPRQPYSL---LYWTRAE------LDRYLEASQIRERAIERITNVIGTYNDLR-LRIFS 223
            LP     +   L+WT  E       D Y    + RE   ER ++++   N+ + L +  
Sbjct: 107 FLPDSCLEVGLPLFWTDHEQELLKGTDAYPRLKRTREELFERWSSLMSLLNEQKKLDLVM 166

Query: 224 KYPDLFPEEVF--NMETFKWSFGILFSRLVRLPSMDGRVA--------LVPWADMLNH-- 271
           K   L  + +   + E F W++ I  +R    P+   + +        L P  D+LNH  
Sbjct: 167 KEAPLCKDSLSWKSFEAFSWAYSIYCTR--AFPNFLRKQSERSLNIGFLCPIVDLLNHKN 224

Query: 272 ------SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
                 +CE  +F         V   + ++ + GE+++ +YG KSN +LLL+YGF+  + 
Sbjct: 225 GEKVTWTCEDNSF---------VFKASAKRIRAGEEIYNNYGNKSNTDLLLNYGFILND- 274

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE-------CFPIQ-ITGWPLELMAY 377
            N S++  L L +++S       +EA  K+GL   E       CF +  +T  P +L+ +
Sbjct: 275 -NESETTTLTLKVEES------VIEAGTKFGLKLPEGTSANGICFNLSLVTPLPKDLLRF 327

Query: 378 AYLVVSPPSMKGKFEEMAAAASN 400
              +   P+       ++ A  N
Sbjct: 328 MGFLHQLPAGDSPLRFISDAYEN 350


>gi|317035930|ref|XP_001397212.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
          Length = 434

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 58/115 (50%), Gaps = 14/115 (12%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
           ++PE  + M  + W   I+ SR     S         +  + +VP+AD  NH  +    +
Sbjct: 186 VYPETEWKMFAYYWC--IINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACEV 243

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           ++D   +   F   R+Y+ GE+V++SYG  SN  LL+ YGF     TNPSD + L
Sbjct: 244 NFD--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLIEYGFTL--STNPSDCIYL 294


>gi|299470104|emb|CBN78133.1| protein N-methyltransferase [Ectocarpus siliculosus]
          Length = 482

 Score = 58.2 bits (139), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 62/241 (25%), Positives = 94/241 (39%), Gaps = 58/241 (24%)

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRY--- 193
           E  ++C  P W  L   L+ E +  + SR  +Y+  LPR  +  + + W+  +LDR    
Sbjct: 60  EAWQRC--PWWVRLGVRLLKERADGEGSRLQDYVGMLPRPGETGAPVNWSAEQLDRLHYP 117

Query: 194 --LEASQIRERAIE-----------RITNVIGTY-NDLRLRIFSKYPDLFPEEVFNMETF 239
             L   +++ R  E           R  N   T   D   R+ S   D           F
Sbjct: 118 RLLSQIKLQRRLFEGFRKFLLADARRGDNAPSTREGDGVNRLVSALAD--------PAMF 169

Query: 240 KWSFGILFSRLVRLPSMDG--------------------------RVALVPWADMLNHSC 273
            W+   + SR  +LP                              R+AL+P  D +NH  
Sbjct: 170 SWALECVLSRAFQLPPRSAAALVVEEGDDVPVKAPEVTPPAPDEMRMALLPLIDSINHYS 229

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE 333
            + T + Y ++   +  +    + PG+  F SYG  SN +LL  YGFV  E  NPSD+  
Sbjct: 230 RMPTHM-YWEADGALSLSVGAAFDPGDHAFASYGPVSNDDLLQYYGFV--EQDNPSDTYV 286

Query: 334 L 334
           L
Sbjct: 287 L 287


>gi|397506651|ref|XP_003823836.1| PREDICTED: N-lysine methyltransferase SETD6 [Pan paniscus]
          Length = 386

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/201 (23%), Positives = 86/201 (42%), Gaps = 17/201 (8%)

Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           +SRW  Y +  P   +    ++W   E    L+ + + E   + + N+   Y  + L   
Sbjct: 50  ASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFM 109

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
             +PDLF   V ++E +     ++ +   + P  +           +VP AD+LNH    
Sbjct: 110 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 169

Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
              L+Y  +   +V T     QP   G ++F +YG+ +N +L+  YGFV     N  D+ 
Sbjct: 170 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 224

Query: 333 ELPLSLKKSDKCYKEKLEALR 353
           ++ +   +       K EA R
Sbjct: 225 DIQMVTVREAALQGTKTEAER 245


>gi|148671822|gb|EDL03769.1| SET domain containing 4, isoform CRA_c [Mus musculus]
          Length = 269

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 96/212 (45%), Gaps = 17/212 (8%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL +       +         RGL++  ++++G+ ++ +P S ++T D+       G
Sbjct: 35  LRKWLKERKFEDTDLVPASFPGTGRGLMSKASLQEGQVMISLPESCLLTTDTVIR-SSLG 93

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             +K+   P  PLLA  T+L+SE      S W +Y+  LP+  Y+       E+   L  
Sbjct: 94  PYIKKWKPPVSPLLALCTFLVSEKHAGCRSLWKSYLDILPKS-YTCPVCLEPEVVDLL-P 151

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRL 253
           S ++ +A E+   V   +   R   FS    LF E    VF+   F W++  + +R V L
Sbjct: 152 SPLKAKAEEQRARVQDLFTSAR-GFFSTLQPLFAEPVDSVFSYRAFLWAWCTVNTRAVYL 210

Query: 254 PSMDGR--------VALVPWADMLNHSCEVET 277
            S             AL P+ D+LNHS  V+ 
Sbjct: 211 RSRRQECLSAEPDTCALAPFLDLLNHSPHVQV 242


>gi|350636529|gb|EHA24889.1| hypothetical protein ASPNIDRAFT_40813 [Aspergillus niger ATCC 1015]
          Length = 437

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/142 (29%), Positives = 64/142 (45%), Gaps = 17/142 (11%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSMD------GRVALVPWADMLNHSCEVETFLDY 281
           ++PE  + M  + W      S     P  D        + +VP+AD  NH  +    +++
Sbjct: 197 VYPETEWKMFAYYWCIINSRSFYYVSPGKDEPEDWNDAIGMVPFADYFNHVDDAACEVNF 256

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
           D   +   F   R+Y+ GE+V++SYG  SN  LL+ YGF     TNPSD + L       
Sbjct: 257 D--GKKYTFRATRRYEKGEEVYMSYGNHSNDFLLIEYGFTL--STNPSDCIYL------- 305

Query: 342 DKCYKEKLEALRKYGLSASECF 363
           D    + L   +K  L+  E F
Sbjct: 306 DDIIFQDLSISQKQELAKQEIF 327


>gi|426382401|ref|XP_004057794.1| PREDICTED: N-lysine methyltransferase SETD6 [Gorilla gorilla
           gorilla]
          Length = 541

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 19/202 (9%)

Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           +SRW  Y +  P   +    ++W   E    L+ + + E   + + N+   Y  + L   
Sbjct: 205 ASRWRPYFALWPELGRLEHPMFWPEEERRCLLQGTGVPEAVEKDLANIRSEYQSIVLPFM 264

Query: 223 SKYPDLFPEEVFNMETFK--------WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
             +PDLF   V ++E +         +SF             +  V +VP AD+LNH   
Sbjct: 265 EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPV-MVPAADILNHLAN 323

Query: 275 VETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDS 331
               L+Y  +   +V T     QP   G ++F +YG+ +N +L+  YGFV     N  D+
Sbjct: 324 HNANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDT 378

Query: 332 VELPLSLKKSDKCYKEKLEALR 353
            ++ +   +       K EA R
Sbjct: 379 ADIQMVTVREAALQGTKTEAER 400


>gi|34784341|gb|AAH57968.1| Setd3 protein [Mus musculus]
          Length = 408

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + +Q G+Q++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L 
Sbjct: 115 QDFQAGDQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 172

Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
           + G+  S  F +  T  P+     A+L V
Sbjct: 173 RAGIPTSSVFALHSTEPPISAQLLAFLRV 201


>gi|317033156|ref|XP_001394952.2| ribosomal N-lysine methyltransferase [Aspergillus niger CBS 513.88]
          Length = 415

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 88/337 (26%), Positives = 139/337 (41%), Gaps = 59/337 (17%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G++A + I K   L+ VP S ++T       P           P   L A YL + AS  
Sbjct: 39  GMIATRKIEKDSILVKVPHSAMLTPSK---LPSTFTSRFPADTPTHTLYAAYL-TNASPS 94

Query: 164 KSSRWSN-------YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN- 215
               W N       + S++P     +L+ + + L    + S+I++     I+N   T   
Sbjct: 95  HLKPWRNTWPTMEDFTSSMP-----ILWSSTSPLTPNSKTSKIQDLLPPSISNTWSTITP 149

Query: 216 ------------------DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
                             + RLR  +     +FPE    + T+ W      S    LP  
Sbjct: 150 GKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPETDKELFTYHWVIVNTRSFFYLLPGA 209

Query: 257 ------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
                 +  +ALVP+AD  NHS +V   + +D   +  VF   ++Y  GE++++SYG  S
Sbjct: 210 EMPEDRNDAMALVPFADYFNHS-DVACNVKFD--GEEYVFRAAKEYNEGEEIYMSYGPHS 266

Query: 311 NGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG----LSASECFPI 365
           N  L   YGF     TN S+++ L  + L+  +   +E+LE  + YG     S   C+  
Sbjct: 267 NDFLFTEYGFY--LDTNASETLYLDEIILQDLNASKQEELEFHQYYGNYQLTSDGVCYRT 324

Query: 366 QI----TGWPLELMAYAYLVVSPPSMKGKFEEMAAAA 398
           +I    T  PL L     L     S +G  E+M+AA 
Sbjct: 325 EIAAGLTYMPLRLWQDYVLGY---STEGVDEKMSAAV 358


>gi|407396203|gb|EKF27396.1| hypothetical protein MOQ_008884 [Trypanosoma cruzi marinkellei]
          Length = 572

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 63/140 (45%), Gaps = 9/140 (6%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
           A L  YL+  + R + +    NV   +   +  L      P    EE      +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANVEAEFQLCKSVLSFLQTMPHSNGEERSMPVTVEQFLWA 256

Query: 243 FGILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
           +  L +R     + D  V +L+PW D  NH+      + YD+  +  +F      + GEQ
Sbjct: 257 YNTLMTRGF---AYDPEVWSLMPWVDYFNHALTSNATMKYDERRRAYIFEALFPIETGEQ 313

Query: 302 VFISYGKKSNGELLLSYGFV 321
           +F+ YG  ++ ELLL YGF 
Sbjct: 314 IFLPYGAYTDMELLLWYGFT 333


>gi|149238199|ref|XP_001524976.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451573|gb|EDK45829.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 488

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/95 (36%), Positives = 48/95 (50%), Gaps = 9/95 (9%)

Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
            + P+ D +NHSC+    L  D  ++G   TT   Y+PG+Q+++SYG  SN  LL  YGF
Sbjct: 285 TMAPYIDFINHSCDDHCTLKID--AKGFQITTTTAYKPGDQLYLSYGPHSNEFLLCEYGF 342

Query: 321 V---PREGTNPSD----SVELPLSLKKSDKCYKEK 348
           V   P E    +D    S  LP+       C KE 
Sbjct: 343 VVTLPEEENRWNDLDISSYLLPMFNANQIDCLKEN 377


>gi|255078794|ref|XP_002502977.1| set domain protein [Micromonas sp. RCC299]
 gi|226518243|gb|ACO64235.1| set domain protein [Micromonas sp. RCC299]
          Length = 536

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/259 (25%), Positives = 107/259 (41%), Gaps = 51/259 (19%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEAS 161
           RGL A ++I  GE +L +P  + I         EA EV+       W + LA  L+ E +
Sbjct: 107 RGLEASRDIENGEPVLRLPLEMGICDYQDGHPAEAWEVMSNAP---WGVRLACRLLQERA 163

Query: 162 FEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR 220
             + S ++ YI+ +P   P S L WT  E+        + E     + + + T+      
Sbjct: 164 KGEDSDYAPYIALIPESVPGSPLMWTDDEVASLQYPPAVAE--AREMRDAVATW------ 215

Query: 221 IFSKYPDLFPEEV--FNMETFKWSFGILFSR---LVRLPSMDGRV-ALVPWADMLNHSCE 274
            F K     P  +   +++ FK +  ++ SR   +    S +G   AL+P AD+LNH  +
Sbjct: 216 -FRKLSAEAPVALAGADLDAFKSAVSVVHSRTYGVASSASGEGYFRALLPLADLLNHGGD 274

Query: 275 ------------------------------VETFLDYDKSSQGVV-FTTDRQYQPGEQVF 303
                                          +     + S +GV+ F   R   P E+  
Sbjct: 275 EYPESASSPANRGGKANKSPASPKWPPAGCSDNIAWSELSDEGVIEFAATRAIAPHEEAA 334

Query: 304 ISYGKKSNGELLLSYGFVP 322
           +SYG++SN   L+ YGFVP
Sbjct: 335 MSYGERSNDHFLVYYGFVP 353


>gi|300122775|emb|CBK23792.2| unnamed protein product [Blastocystis hominis]
          Length = 854

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/256 (24%), Positives = 119/256 (46%), Gaps = 35/256 (13%)

Query: 102 ERGLVALKNIRKGEKLL------FVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATY 155
           +RGLVA++ I+    L+       + PS V+      S PE+ + L   ++ D  +LA +
Sbjct: 59  DRGLVAVEEIKPNSTLIELDLDDVIYPSTVLK-----SVPESEKNLF-LAMSDDLMLAAF 112

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           LI E    ++SRW  ++  LP+ P     +T++E+  + + + I+   I+R ++   TY 
Sbjct: 113 LIQERIKGRASRWYPWLQTLPKHPTVPSSFTQSEIKEFEDPAIIQRLNIQR-SDYYSTYF 171

Query: 216 DLRLRIFSKYPDL---FPEEVF--NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
                + + +  +   F + ++  +   F+W + ++ +R V     + R+ L+P  D  N
Sbjct: 172 AFTRHMCTYFLQVDAPFRDRLWACSYSGFEWGYTMVITRTV----TENRL-LIPLMDYRN 226

Query: 271 HSCEVETFLDYDKSSQGVVF----------TTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
                  F   D+S +   F           TD++ + G QV++ Y    +   L  +GF
Sbjct: 227 FISTDSPFEAVDRSHERTHFIINEQNQLRVVTDKRVKRGRQVYLDYEAFPSHYYLQHFGF 286

Query: 321 VPREGTNPSDSVELPL 336
           VP   +N  D + +PL
Sbjct: 287 VP--ISNIHDCLLIPL 300


>gi|224001788|ref|XP_002290566.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973988|gb|EED92318.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 595

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/159 (26%), Positives = 73/159 (45%), Gaps = 31/159 (19%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK---------- 283
           F+ E F+++  ++ SR   +   DG + L+P+ D  NH        D+D           
Sbjct: 296 FSQEGFRYAVSLVRSRSFFV---DGSLRLLPYLDFANHD-------DFDSLELVGGGIGT 345

Query: 284 ---SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP-----REGTNPSDSVELP 335
              S++G +  + +  + G+++ ISYG K   + LL +GFVP       GT  + + EL 
Sbjct: 346 LWGSAKGALMKSGKALEVGDEIRISYGPKGPADYLLDHGFVPPMCQTTSGTGGAITAELS 405

Query: 336 LSLKKSDKCYKEKLEALRKYGLSASECFPIQ---ITGWP 371
             +  SD+   +KL+ L       +   P+Q   +TG P
Sbjct: 406 FEIDDSDRFRDDKLDVLEYETYDLAPMEPLQVFDVTGGP 444


>gi|219126019|ref|XP_002183265.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405540|gb|EEC45483.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 344

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 51/101 (50%), Gaps = 7/101 (6%)

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY-----DKSSQGVVFTTDRQ 295
           W+   + SR   L  M    ++ P  DMLNH C V T         D+  + +    ++ 
Sbjct: 132 WAMACVCSRSNFLNDMS--YSMTPLLDMLNHDCTVRTSAKVSKNKLDEDDKWLSLQIEQC 189

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           Y+ G+QVFISYG  SN E L  YGFV R  +   +S+++ +
Sbjct: 190 YRAGDQVFISYGSLSNLETLCDYGFVDRSNSCNFESIQVQM 230


>gi|322706860|gb|EFY98439.1| SET domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 595

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 140/351 (39%), Gaps = 51/351 (14%)

Query: 38  HRIVVHCSVST----------TNDASRTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSG 87
           H+  +H +V T          T  ASR    V ++M P    + S     T   W   + 
Sbjct: 66  HKAPLHTAVCTDTAGTPSRKQTEFASRASPEVNKHM-PSKPSMSSQLPIDTFPAWAHLND 124

Query: 88  LPPQKMAIQKVDVGER-GLVALKNIRKGEK------LLFVPPSLVITADSKWSCPEAGE- 139
           +    + +Q V  G+  GLVA  ++   E        + +P  LV++A++     +    
Sbjct: 125 VQFTHVNLQDVGEGKGFGLVAHADLESAEADGTSKGPVTIPHDLVLSAEAVEDFAKVDHN 184

Query: 140 -------VLKQCSVPDWPLLATYLISEASFEKSSR--------WSNYISALPRQPYSLLY 184
                  V +Q +  D   +  YL+S+  F +SSR        W+ YI  LPR       
Sbjct: 185 FKQLLEAVGRQSTRGD---IMLYLVSQ--FAQSSRPKGLSPTPWTEYIRLLPRPIPVPTM 239

Query: 185 WTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD----LFPEEVFNMETFK 240
           WT  E  R L      E A+E     +G   D    +   +P     L+  E  ++E + 
Sbjct: 240 WTEPE--RLLLNGTSLEAALEAKLLSLGKEFDTLREVSEDFPFWNEFLWSGEEVSLEDWV 297

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR-QYQPG 299
                  SR + LP      A+VP  DM+NHS +   + + D     V+        + G
Sbjct: 298 LVDAWYRSRCLELPR--SGTAMVPGLDMVNHSSKATAYYEEDDHDNVVLLIRPGCPVRSG 355

Query: 300 EQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
           E+V ISYG  K   E+L SYGF+  +  N  D + L L     D   + KL
Sbjct: 356 EEVTISYGDAKPASEMLFSYGFI--DPNNIVDKLTLRLDPFPDDPLARAKL 404


>gi|134079652|emb|CAK97078.1| unnamed protein product [Aspergillus niger]
          Length = 443

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/291 (25%), Positives = 121/291 (41%), Gaps = 48/291 (16%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G++A + I K   L+ VP S ++T       P           P   L A YL + AS  
Sbjct: 39  GMIATRKIEKDSILVKVPHSAMLTPSK---LPSTFTSRFPADTPTHTLYAAYL-TNASPS 94

Query: 164 KSSRWSN-------YISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN- 215
               W N       + S++P     +L+ + + L    + S+I++     I+N   T   
Sbjct: 95  HLKPWRNTWPTMEDFTSSMP-----ILWSSTSPLTPNSKTSKIQDLLPPSISNTWSTITP 149

Query: 216 ------------------DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
                             + RLR  +     +FPE    + T+ W      S    LP  
Sbjct: 150 GKRKHKSDTRHQNLLKAQETRLRKAWDIVVRVFPETDKELFTYHWVIVNTRSFFYLLPGA 209

Query: 257 ------DGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS 310
                 +  +ALVP+AD  NHS +V   + +D   +  VF   ++Y  GE++++SYG  S
Sbjct: 210 EMPEDRNDAMALVPFADYFNHS-DVACNVKFD--GEEYVFRAAKEYNEGEEIYMSYGPHS 266

Query: 311 NGELLLSYGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYGLSAS 360
           N  L   YGF     TN S+++ L  + L+  +   +E+LE  + YG   S
Sbjct: 267 NDFLFTEYGFY--LDTNASETLYLDEIILQDLNASKQEELEFHQYYGYVTS 315


>gi|336467028|gb|EGO55192.1| hypothetical protein NEUTE1DRAFT_147775 [Neurospora tetrasperma
           FGSC 2508]
 gi|350288355|gb|EGZ69591.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 504

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 83/201 (41%), Gaps = 33/201 (16%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIRERAIERI-TN 209
           YLI +    KSS W+ YIS L   P  L  W      AE D  L        AI+ I +N
Sbjct: 108 YLIQQYLKGKSSFWAPYISTLA-DPSQLDKWALPPFWAEDDIELLKGTNAYVAIQEIQSN 166

Query: 210 VIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR---------------- 252
           V   Y   R +I  K  + FP+   +    + W++ +  SR  R                
Sbjct: 167 VKSEYKQAR-KILKK--EGFPDYRDYTQVLYNWAYCMFTSRSFRPSLVLSESAREYVERL 223

Query: 253 LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
           LP    +D    L P  D+ NHS +     +            +  Y PG+QVF +YG K
Sbjct: 224 LPEGSKIDDFSILQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQQVFNNYGFK 283

Query: 310 SNGELLLSYGFVPREGTNPSD 330
           +N ELLL YGF+     NP D
Sbjct: 284 TNSELLLGYGFI----INPKD 300


>gi|303270905|ref|XP_003054814.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462788|gb|EEH60066.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 58/122 (47%), Gaps = 16/122 (13%)

Query: 239 FKWSFGILFSRLVRL-------PSMDGRVALVPWADMLNHS-------CEVETFLDYDKS 284
           ++W+  ++ SR  RL          + R  +VP+ D+LNH        CE +   D    
Sbjct: 222 WRWAMSMVHSRTFRLEEPAAGVAGFETRRVMVPYVDLLNHDSRADVWQCEWDCEWDLGGG 281

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKC 344
               V T  R  + GE+V +SYG++ +    L +GF+P    NP ++V L  + +++   
Sbjct: 282 GGTFVVTATRDVRAGEEVLVSYGERCDRHFFLFFGFLP--APNPHNTVALFANAREAAAW 339

Query: 345 YK 346
           Y+
Sbjct: 340 YE 341


>gi|313225781|emb|CBY07255.1| unnamed protein product [Oikopleura dioica]
          Length = 346

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 68/156 (43%), Gaps = 38/156 (24%)

Query: 237 ETFKWSFGILFSRLVRLPSMDGR--------------VALVPWADMLNHSCEVETFLDYD 282
           E   W+F ++ SR   LP  D                  L P+ D++NHS     + + D
Sbjct: 159 EDLTWAFSMVLSRTFSLPKYDKSSDFDYCSQVDSSKSAFLCPFMDLINHSSAPNCYYETD 218

Query: 283 KSSQGVVFTTDRQYQPGEQVFISY-GKKSNGELLLSYGFVPREGTN-------------P 328
             +   V   DR+ Q  E++FI+Y G KS+  LL  YGF    G N             P
Sbjct: 219 SETGDFVLRADRELQQKEELFITYGGSKSDHVLLAFYGFCLPPGVNRNSYIVFSPNFIGP 278

Query: 329 SD-----SVELPLSLKKSDKCYKE----KLEALRKY 355
           S      + +  L+ KK+ KC+KE    KLE+ RK+
Sbjct: 279 SSHSKFTAFKFFLNSKKA-KCFKESLNKKLESWRKF 313


>gi|148686780|gb|EDL18727.1| mCG18357, isoform CRA_e [Mus musculus]
          Length = 458

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR 353
           + +Q G+Q++I YG +SN E ++  GF      N  D V++ L + KSD+ Y  K E L 
Sbjct: 165 QDFQAGDQIYIFYGTRSNAEFVIHSGFFF--DNNSHDRVKIKLGVSKSDRLYAMKAEVLA 222

Query: 354 KYGLSASECFPIQITGWPLELMAYAYLVV 382
           + G+  S  F +  T  P+     A+L V
Sbjct: 223 RAGIPTSSVFALHSTEPPISAQLLAFLRV 251


>gi|428165190|gb|EKX34191.1| hypothetical protein GUITHDRAFT_147375 [Guillardia theta CCMP2712]
          Length = 681

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 73/300 (24%), Positives = 123/300 (41%), Gaps = 27/300 (9%)

Query: 42  VHCSVSTTNDASRT---KTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKV 98
           V   VS  ND  R+   +      M P           + L++W   +G+    + I+++
Sbjct: 39  VVSGVSKLNDRGRSECPRACPRTVMAPLMASGTGARREAMLEEWARANGIFCM-LNIKRM 97

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS----VPDWPL-LA 153
             GER  VA   +  GE+++ VP  +        + P     + Q +       W   LA
Sbjct: 98  ADGERKAVAANALAGGERVVRVPREVSFVTFQGDASPLPPSFVDQDTWQQLDEHWNAKLA 157

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWT---RAELDRYLEASQIR-----ERAIE 205
             L+ E    +   W++   A  + P  +   +   R+     +EAS+ R     + A E
Sbjct: 158 LMLLHE--MRRGVHWTDEELAELQNPRLVAAASDSKRSHAGLTIEASEWRYFDRMQAAGE 215

Query: 206 RITNVIG--TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGR--VA 261
           + T+++   T+     R+        P    ++   +W+     SR   + + +G     
Sbjct: 216 QETSMLDQETFQRDLHRLLCDRMTAPP----SLAELRWAMDCAQSRSFGVSTTEGVKCFC 271

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L P ADMLNH        D+D ++      T R +  GE+V ISYG+ SN +LL  YGFV
Sbjct: 272 LCPLADMLNHDPSSPALFDFDPATSCFAIRTSRAWSEGEEVTISYGELSNEDLLQFYGFV 331


>gi|388578758|gb|EIM19096.1| SET domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 413

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 122/292 (41%), Gaps = 39/292 (13%)

Query: 106 VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE------ 159
           +A  +I +  +++ V  ++ IT  +   C EA + L    +P+  L+A Y+         
Sbjct: 38  LARNDIPEDAEIVSVNKNICITEST---CKEAFKNLNNEGLPEKLLIAVYISLHYIYDQL 94

Query: 160 -ASFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
             S +       Y+  LP   Q  + LYWT  EL+ Y + + +     ER       Y  
Sbjct: 95  PESLKSKLHHRRYVDILPEIGQTLTTLYWTDDELE-YTKPTSLFNATKEREIQWKSDY-- 151

Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS---MDGRVA---LVPWADMLN 270
               +  K+      EVF  + FK S  ++ SR    PS    D  ++   LVP  D+ N
Sbjct: 152 ---EVVKKWSRANDVEVFTWDVFKHSLTMISSR--AFPSKLIQDDEISSPMLVPLWDIGN 206

Query: 271 HSCE---VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTN 327
           H  +   V T + Y   +  +     +  Q   +VF +YG K   ELLL+YGF      +
Sbjct: 207 HKSQSAIVWTDVKY-TGTDNIGMKLPQGAQKDNEVFNNYGGKPTNELLLAYGF----AVD 261

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGLSASEC----FPIQIT-GWPLEL 374
             +   +P  +       + K + L+K+GL   +C    F I +  G PL L
Sbjct: 262 NINYDVVPFRIGAGVSLSESKKDILKKHGLLNEDCTLKTFNINLNEGLPLGL 313


>gi|171684553|ref|XP_001907218.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942237|emb|CAP67889.1| unnamed protein product [Podospora anserina S mat+]
          Length = 396

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 5/71 (7%)

Query: 255 SMDGRVALVPWADMLNHSCE-VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           + D ++AL P AD+LNHS E  E   D    +     + DR+Y+ GE+V+I YG  SN  
Sbjct: 197 TKDDKMALQPVADLLNHSDEGCEVVFD----TGCYTISADREYKQGEEVYICYGTHSNDF 252

Query: 314 LLLSYGFVPRE 324
           L++ YGF P E
Sbjct: 253 LMVEYGFCPEE 263


>gi|154290554|ref|XP_001545870.1| hypothetical protein BC1G_15621 [Botryotinia fuckeliana B05.10]
          Length = 336

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 71/157 (45%), Gaps = 22/157 (14%)

Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
           FPE     + F +++ I+ SR               PS + R+AL P+AD +NHS E   
Sbjct: 101 FPEPPITYDEFIYNYSIVNSRTFYYLSPTIKPSKPQPSKENRLALNPFADYINHSSE--P 158

Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--- 334
            +D   S  G   T  +  + G +V ISYG  +N  LL+ YGF+  +  N  D V L   
Sbjct: 159 TVDATLSRAGYTLTASQPIKQGSEVHISYGSHNNDFLLVEYGFILED--NRWDEVTLDPW 216

Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQIT 368
             PL L    K + E+   L KY L     C+  Q+ 
Sbjct: 217 ITPL-LSVEQKEHLEETGFLGKYLLDRDTICYRTQVV 252


>gi|159474448|ref|XP_001695337.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275820|gb|EDP01595.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 360

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/236 (23%), Positives = 101/236 (42%), Gaps = 31/236 (13%)

Query: 92  KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL 151
           ++ I + + G RGL   + ++KGE ++ +P  +V++  +  +   + ++LK+   P    
Sbjct: 45  RVTISRDEAGVRGLYTTQPVKKGEVIVSIPQHIVLSVKNVAAAEASPQLLKEIHSP---- 100

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLL--YWTRAELDRYLEASQIRERAIE---- 205
                         SR   Y+  LP  P  +L  Y    E  +YL    + E+       
Sbjct: 101 -------------CSRLRPYLDTLP-GPDGVLTAYNWPEEYIKYLADPAMEEQLKNSFKL 146

Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPW 265
              N    +ND  + +    P+    +   ++ ++    +L SR   +    G ++LVP 
Sbjct: 147 HARNTWLGHNDDEMEV--TIPEAIGRKNITLKEWEHVVSLLSSRTFSI--RKGALSLVPV 202

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            D++NH       ++   +S  V     +    GEQV I+YG   N ELL+ YGFV
Sbjct: 203 LDLVNHDVR---DINQLGNSSTVDLVAGKDLAAGEQVTITYGSMRNDELLMYYGFV 255


>gi|451997605|gb|EMD90070.1| hypothetical protein COCHEDRAFT_1022164 [Cochliobolus
           heterostrophus C5]
          Length = 408

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 116/279 (41%), Gaps = 30/279 (10%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVP-PSLVITA 128
           +D   N +    W   +G+    +A  +      G+VA ++I+KG+KL+ V   SLV  A
Sbjct: 5   LDPGTNHTDFVSWAKSNGVEINGIAPARFVGRGMGIVAAQDIKKGDKLVHVSNKSLVHVA 64

Query: 129 ---DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS---SRW---SNYISALPRQP 179
                    P+   V  + ++     LA +       + +   + W   S++ S +P   
Sbjct: 65  LPSIHSLKLPDTITVHGKLALA----LALWYTGRKDHDYTLWQNVWPTASDFKSTMPFYY 120

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
            S L        R L   Q++   +ER    I  +N    +    Y  L    + N  TF
Sbjct: 121 PSPLQSLLPPAARTLLTKQLQN--LERDWTSITPHNPGITKETYTYTWL----IVNTRTF 174

Query: 240 KWSFGILFSRLVRLP------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
            WS+  L +    LP      + D    + P+ D  NHS   ++  D   S  G   T D
Sbjct: 175 YWSYPDLPNASPLLPKRRAKLTADDCYCMCPFTDYFNHS---DSGCDPQMSPSGYTVTAD 231

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
           R Y  GE+VF++YG  +N  LL  YGF+ +E  N  D V
Sbjct: 232 RAYAAGEEVFVTYGPHTNDFLLTEYGFILQE-KNRHDGV 269


>gi|46129354|ref|XP_389038.1| hypothetical protein FG08862.1 [Gibberella zeae PH-1]
          Length = 478

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/270 (25%), Positives = 116/270 (42%), Gaps = 43/270 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+VA+ +IR  + +L VP   V T D+        + L   SV         L +E + +
Sbjct: 32  GIVAVCDIRANQTILSVPTRAVRTIDTVPK--HIKDALHGVSV------HGILAAEIALD 83

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
            S  ++ + + LP         TR +L+  +     S+++    +R  +++   N    R
Sbjct: 84  DSDDFAIWRTVLP---------TREDLEGGMPMMWPSELQALLPKRAKDLLDNQNTTFRR 134

Query: 221 ----IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
               +   +P L  +E      + N  TF  S   +     ++ +   R+  +P AD+ N
Sbjct: 135 ECDIVLKAFPTLTRDEYMLSWVLINTRTFYNSMPKM-----KIYAHSDRLVCMPVADLFN 189

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGTN 327
           H    +       S+ G    TDR Y+ GE+V++SYG  SN  LL  YGF+    R    
Sbjct: 190 HDQGCKLVY----SALGYSVQTDRVYKQGEEVYVSYGPHSNDFLLTEYGFILDTNRWDEV 245

Query: 328 PSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
             D V LPL L K+ +   E +  L +Y L
Sbjct: 246 YLDEVILPL-LNKTQRAELESVGFLGRYTL 274


>gi|261190993|ref|XP_002621905.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239590949|gb|EEQ73530.1| SET domain-containing protein [Ajellomyces dermatitidis SLH14081]
 gi|239613147|gb|EEQ90134.1| SET domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327354785|gb|EGE83642.1| SET domain-containing protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 481

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/273 (26%), Positives = 113/273 (41%), Gaps = 57/273 (20%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGEVLKQCSVP 147
           P  K+A  + +   RG+VAL NI + E+L  +P +LV++  +SK       + L   S  
Sbjct: 34  PKIKIADLRSEGAGRGIVALSNINEDEELFAIPQNLVLSFQNSKL------KDLLHISEK 87

Query: 148 D---WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
           D   W  L   +I E     +S WS Y   LP +  +L++WT  EL         RE + 
Sbjct: 88  DLGPWLCLILVMIYEYLQGGASPWSRYFQVLPTEFDTLMFWTDEEL---------RELSG 138

Query: 205 ERITNVIGTYND----LR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM 256
             + N IG  +     LR    I S  P LFP  +  + ++    G   L S   R+ S+
Sbjct: 139 SAVLNKIGKSDAEAAILRDIFPIVSTNPHLFP-PISGLGSYDSPDGRATLLSLAHRMGSL 197

Query: 257 -------------------DGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGV 288
                              DG +          +VP AD+LN   +      + +    +
Sbjct: 198 IMAYAFDIEKGEDEEGEVQDGYITDEGEELTKGMVPLADLLNADADRNNARLFQEDGY-L 256

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
              + +  + GE++F  YG+    +LL  YG+V
Sbjct: 257 AMKSIKPIRNGEEIFNDYGELPRADLLRRYGYV 289


>gi|168021415|ref|XP_001763237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685720|gb|EDQ72114.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 489

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/275 (23%), Positives = 111/275 (40%), Gaps = 44/275 (16%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLK-QCSVPDWPLLATYLI 157
           D G RGL A ++++ GE +L VP   ++   S     E    L    S+    +L  +L+
Sbjct: 51  DAGGRGLAAARDLKLGELILRVPEKALMNGRSARLDAELTRALALYPSLSHVQVLCVHLL 110

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            E +  ++S    Y+  LPR  ++  +++  E     +A Q+++ A+     V+    + 
Sbjct: 111 REIAKGRTSERFPYLVHLPRYYHTASFYSPFE----AQALQVKD-AVSMAEGVVQNSREE 165

Query: 218 RLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC--- 273
            L+       L     F  ++ + W+F  + SR + +P  D    L P  D  N++C   
Sbjct: 166 WLQARPVLEKLGLGRRFCTLQGWLWAFATISSRTLYVP-WDEAGTLCPVGDFFNYACPGV 224

Query: 274 --------------EVETFLDYDKSSQGVVFTTDR-------------------QYQPGE 300
                         E +   + D  + G +   DR                    YQ G+
Sbjct: 225 PYNLPPTAQDTQMREGDLISEEDVDTSGGIEIRDRLRDGGFEDERGEYCFYARQDYQEGQ 284

Query: 301 QVFISYGKKSNGELLLSYGFVPREGTNPSDSVELP 335
           QV + YG  +N ELL  YGF+     N    +ELP
Sbjct: 285 QVLLCYGTYTNLELLEHYGFLLPFNPNDKVHIELP 319


>gi|118383229|ref|XP_001024769.1| hypothetical protein TTHERM_00237390 [Tetrahymena thermophila]
 gi|89306536|gb|EAS04524.1| hypothetical protein TTHERM_00237390 [Tetrahymena thermophila
           SB210]
          Length = 840

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/177 (28%), Positives = 84/177 (47%), Gaps = 14/177 (7%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVL--KQCSVPDWPLLATY 155
           G  A + I   + L+ VP  L++       +D +    E       K+  + +  ++  Y
Sbjct: 81  GFRATEEINPSDILIKVPRKLILNTRTCMFSDIQKVVKENLNFFTAKKGGLVEDHIMLVY 140

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ +    K+S W + IS L R   ++ +W   E  +YL+    RER   R+      Y 
Sbjct: 141 LLRQYQLGKASPWYHLISNLSRYIDTVDFWEDEEY-KYLDDPIFRERI--RLQRNYFNYI 197

Query: 216 DLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNH 271
              LR I  KYPDLF E+ ++ E  +W +  ++SR     SMD  +++VP  +M+NH
Sbjct: 198 AKNLREILPKYPDLFEEQTYSEENIRWIYIHIWSRSFG-GSMD-YISMVPIVEMMNH 252


>gi|356574815|ref|XP_003555540.1| PREDICTED: ribosomal N-lysine methyltransferase 3-like [Glycine
           max]
          Length = 506

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/360 (20%), Positives = 137/360 (38%), Gaps = 72/360 (20%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERGLV--ALKNIRKGEKLLFVPPSLVITADSKWSCP 135
             ++W+   GL     A++ VD  E G+   AL  +++G+ +  +P    +T  +     
Sbjct: 8   AFKRWMKSKGLEWSD-ALEFVDTPEEGVEVRALCQLKEGDVVAKMPKEACLTTKTSG--- 63

Query: 136 EAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLE 195
            A +++++  +     LA  ++ E S +  S ++ Y+  LP Q    + WT  E++  L 
Sbjct: 64  -ARKIIEEAGLDGHLGLAFAIMYERSLDGDSPFAGYLQLLPHQECVPIVWTLDEVNELLC 122

Query: 196 ASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS 255
            +++ +   E    +   + +  L +    P     + F +E +  +  ++ SR   +  
Sbjct: 123 GTELHQTVQEDKALIYDDWKENILPLLDLAPLKLNPKFFGVEQYFAAKSLISSRSFEIDD 182

Query: 256 MDGRVALVPWADMLNHSC--------------EVETFLDYDKSSQGVV------------ 289
             G   +VP AD+ NH                E +T +D     +G+V            
Sbjct: 183 YHG-FGMVPLADLFNHKTGAEDVHFTAMSSNDESDTDVDGCNDDEGIVKEETLAQNSSID 241

Query: 290 ----------------------------FTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
                                           +    G +VF +YG   N  LL  YGF 
Sbjct: 242 MTVLNNGNCNVSDSDSSSVSDGDTSMLEMIMIKDVSSGTEVFNTYGLLGNAALLHRYGFT 301

Query: 322 PREGTNPSDSVELPLSLK-----KSDKCYKEKLEALRKYGLSA-----SECFPIQITGWP 371
            ++ +    ++++ L L+      SD+  + ++   RK G SA     SE F I   G P
Sbjct: 302 EQDNSYDIVNIDMELVLQWCTSVFSDRHSRARVSLWRKLGYSACGSQNSEYFEISFDGEP 361


>gi|414886518|tpg|DAA62532.1| TPA: hypothetical protein ZEAMMB73_960129 [Zea mays]
          Length = 483

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/282 (23%), Positives = 115/282 (40%), Gaps = 59/282 (20%)

Query: 99  DVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWP------LL 152
           D G RGL A +++R+GE +L +P + ++T+D           +  C     P      +L
Sbjct: 45  DAGGRGLAAARDLRRGELVLRLPRAALLTSDR---VTADDPRIAACVSAHKPRLSSVQIL 101

Query: 153 ATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY-LEASQIRERAIERITNVI 211
              L++E     +S W  Y+  LP   Y++L    A  + + +EA Q+         + I
Sbjct: 102 IVCLLAEVGKGSNSVWYPYLCQLPSY-YTIL----ATFNDFEVEALQV--------DDAI 148

Query: 212 GTYNDLRLRIFSKYPDL--------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALV 263
                 +  I S + D         F  ++   +++ W+F  + SR + + + D    L 
Sbjct: 149 WVAQKAKSAIKSDWEDATPLMKELEFKPKLLMFKSWLWAFATVSSRTLHI-AWDEAGCLC 207

Query: 264 PWADMLNHSC-------------EVETFLDYDKSSQGVVFTTD--------------RQY 296
           P  D+ N++              E+  +   +  +      TD              + Y
Sbjct: 208 PVGDLFNYAAPDDDTLLEDEDTAELTNYQQKNGMTNSSERLTDGGYEDCNAYCLYARKNY 267

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           + GEQV ++YG  +N ELL  YGF+  E  N    +EL L +
Sbjct: 268 KKGEQVLLAYGTYTNLELLEHYGFLLGENPNEKTFIELDLDI 309


>gi|347836900|emb|CCD51472.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 470

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 141/308 (45%), Gaps = 45/308 (14%)

Query: 69  EIDSLE-NASTLQKWLSDSGLPPQ-KMAIQKVDVGE----RGLVALKNIRKGEKLLFVPP 122
           ++D  E   +T   WL + G+    KMA+  VD+ +    RG+VA+++I   E +  +P 
Sbjct: 2   DVDDFEARTATFSAWLQEMGIRTNPKMAL--VDLRQEGRGRGVVAIEDIDDDEIIFSIPR 59

Query: 123 SLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL 182
           S V+ A +  + P A        +P W  L + L++E   +  S+W+ Y++ LP Q  SL
Sbjct: 60  SAVLNAQN--AKPLAISKRLAEKMPSWLALTSILMAEGQVD-DSKWAPYLAILPEQLNSL 116

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTY------NDLRLRIFSKYPDLFPEEVFNM 236
           ++W+ +EL     ++ +++   +   ++  TY            +  K   +     F++
Sbjct: 117 VFWSDSELAELQASAVVKKIGKQGAEDMFKTYITPQGLQHSSTEMCHKVASVIMAYAFDI 176

Query: 237 ---ETFKWSFGILFSRLVRLPSMDGR-----VALVPWADMLNHSCEVETFLDYDKSSQGV 288
                   S G        L S DG      ++++P ADMLN         D D+++  +
Sbjct: 177 PDPSEGPTSGGKGEEAADDLVSDDGEDEKTILSMIPLADMLNA--------DADRNNARL 228

Query: 289 VFTTD----RQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSD----SVELPLS 337
           +   +    R  +P   GE++F  YG+    +LL  YG+V  +G +  D    S EL +S
Sbjct: 229 ICDNEDLEMRAIKPIAKGEEIFNDYGQLPRSDLLRRYGYVT-DGYSAYDVAEISAELIVS 287

Query: 338 LKKSDKCY 345
           L ++ K +
Sbjct: 288 LFRNGKVH 295


>gi|121701277|ref|XP_001268903.1| SET domain protein [Aspergillus clavatus NRRL 1]
 gi|119397046|gb|EAW07477.1| SET domain protein [Aspergillus clavatus NRRL 1]
          Length = 498

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 81/174 (46%), Gaps = 16/174 (9%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI--- 221
           S+ W+ YI  +P       ++T AEL+  L  + +R     ++ ++   +  LR      
Sbjct: 131 SNPWTEYIRFMPPSIRLPTFYTEAELE-LLRGTSLRTAVFAKLASLEKEFERLRQSTEGI 189

Query: 222 --FSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCEVET 277
               KY   + E+   +    W +   +  SR+V LP  +   A+VP  DM NH+ E   
Sbjct: 190 PWCQKY--WWDEDTGRLTFDDWKYVDAVYRSRVVELP--ESGHAIVPCVDMANHASEDSV 245

Query: 278 FLDYDKSSQGVVFTTDRQYQ---PGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
              YD+SS        RQ +    GE+V ISYG +K   E++ SYGFV  E T+
Sbjct: 246 KARYDESSTEDALLQLRQGRRICSGEEVTISYGSEKPASEMVFSYGFVENERTD 299


>gi|340053796|emb|CCC48089.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 587

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 64/138 (46%), Gaps = 10/138 (7%)

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKYPDLFPEEVFN-METFKWSF 243
           L ++L  S+ R++ I    NV   ++        LR   ++ D      F+ +E F W++
Sbjct: 209 LQQFLHFSRHRKKVILEQENVKRKFDHCLSVLSVLRFLLRFDDENKLSRFSSLEKFVWAY 268

Query: 244 GILFSRLVRLPSMDGRV-ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
             L SR     S    V  L+PW D  NHS      + YD   +  VF +      GEQ+
Sbjct: 269 NTLMSRGF---SYHTEVWVLMPWVDYFNHSSVNNATMRYDSCRRSYVFESRLAISKGEQI 325

Query: 303 FISYGKKSNGELLLSYGF 320
           ++ YG  ++ ELLL YGF
Sbjct: 326 WLQYGSYNDIELLLWYGF 343


>gi|302762396|ref|XP_002964620.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
 gi|300168349|gb|EFJ34953.1| hypothetical protein SELMODRAFT_81798 [Selaginella moellendorffii]
          Length = 464

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/248 (23%), Positives = 117/248 (47%), Gaps = 21/248 (8%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITA----DSKWSCPEAGEVLKQCS-VPD--WPL-LATY 155
           GLVA +++ +G  ++ +P  + +      ++    P  G + +  + VP+  W + L   
Sbjct: 49  GLVATQDLPQGSTIITLPRRVPMPMPDPENAAVLAPSEGVICEIANRVPEELWAMRLGLK 108

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYN 215
           L+ E + +K S W  YIS LP      ++++  +++  ++ + +  +  +R   ++    
Sbjct: 109 LLYERA-QKGSYWWPYISMLPHSFTLPIFFSGVDIES-IDYAPVTHQVKKRCRFLLQFSA 166

Query: 216 DL-RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA---LVPWADMLNH 271
           +L +L    +    F  +  +     W+   + SR  R+  +  ++    ++P  DM NH
Sbjct: 167 ELAKLESLPEEVHPFAGQSVDSGALGWAMAAVSSRAFRIHGVTNKLCSAMMLPLIDMCNH 226

Query: 272 SCEVETFLDYD--KSSQGVVF---TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           S +    ++ D  + +Q V F    T R  + G  + ++YG  SN  LLL YGFV  +  
Sbjct: 227 SFQPNAHIEEDLSRDAQDVSFLKVVTKRNLEKGSAITLNYGPLSNDLLLLDYGFVIPD-- 284

Query: 327 NPSDSVEL 334
           NP D +EL
Sbjct: 285 NPHDRIEL 292


>gi|85093434|ref|XP_959692.1| hypothetical protein NCU09581 [Neurospora crassa OR74A]
 gi|28921141|gb|EAA30456.1| predicted protein [Neurospora crassa OR74A]
          Length = 504

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 82/201 (40%), Gaps = 27/201 (13%)

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIR 200
           S+P   +   YLI +    KSS W+ YIS L   P  L  W      AE D  L      
Sbjct: 98  SLPPHVIGRFYLIQQYLKGKSSFWAPYISTLA-DPSQLDKWALPPFWAEDDIELLQGTNA 156

Query: 201 ERAIERITNVIGTYNDLRLRIFSKYPDLFPE-EVFNMETFKWSFGILFSRLVR------- 252
             AI+ I N + +      +I  K  + FP+   +    + W++ +  SR  R       
Sbjct: 157 YIAIQEIQNNVKSEYKQARKILKK--EGFPDYREYTQVLYNWAYCMFTSRSFRPSLVLSE 214

Query: 253 ---------LPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE 300
                    LP    +D    L P  D+ NHS +     +            +  Y PG+
Sbjct: 215 SAREYVERLLPEGTKIDDFSVLQPLYDIGNHSWDASYTWNLTSEPSACELICNDSYGPGQ 274

Query: 301 QVFISYGKKSNGELLLSYGFV 321
           QVF +YG K+N ELLL YGF+
Sbjct: 275 QVFNNYGFKTNSELLLGYGFI 295


>gi|347841961|emb|CCD56533.1| similar to SET domain-containing protein [Botryotinia fuckeliana]
          Length = 377

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/157 (31%), Positives = 71/157 (45%), Gaps = 22/157 (14%)

Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
           FPE     + F +++ I+ SR               PS + R+AL P+AD +NHS E   
Sbjct: 142 FPEPPITYDGFIYNYSIVNSRTFYYLSPTIKPSKPQPSKENRLALNPFADYINHSSE--P 199

Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--- 334
            +D   S  G   T  +  + G +V ISYG  +N  LL+ YGF+  +  N  D V L   
Sbjct: 200 TVDATLSRAGYTLTASQPIKQGSEVHISYGSHNNDFLLVEYGFILED--NRWDEVTLDPW 257

Query: 335 --PLSLKKSDKCYKEKLEALRKYGLSASE-CFPIQIT 368
             PL L    K + E+   L KY L     C+  Q+ 
Sbjct: 258 ITPL-LSVEQKEHLEETGFLGKYLLDRDTICYRTQVV 293


>gi|358369129|dbj|GAA85744.1| SET domain-containing protein [Aspergillus kawachii IFO 4308]
          Length = 416

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 86/186 (46%), Gaps = 23/186 (12%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPSM------DGRVALVPWADMLNHSCEVETFLDY 281
           +FPE    + T+ W      S    LP        +  +ALVP+AD  NHS +V   + +
Sbjct: 183 VFPETDKELFTYHWVIVNTRSFFYLLPGAEMPEDRNDAMALVPFADYFNHS-DVACNVKF 241

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL-PLSLKK 340
           D   +  VF   ++Y  GE++++SYG  SN  L   YGF     TN S+++ L  + L+ 
Sbjct: 242 D--GEEYVFRAAKEYNEGEEIYMSYGPHSNDFLFTEYGFY--LDTNASETLYLDEIILQD 297

Query: 341 SDKCYKEKLEALRKYG----LSASECFPIQI----TGWPLELMAYAYLVVSPPSMKGKFE 392
            +   +E+LE  + YG     S   C+  +I    T  PL L     L     S  G  E
Sbjct: 298 LNASKQEELEFHQYYGNYQLTSEGVCYRTEIAAGLTYMPLRLWQDYVLGY---STDGVDE 354

Query: 393 EMAAAA 398
           +M+AA 
Sbjct: 355 KMSAAV 360


>gi|367023575|ref|XP_003661072.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
 gi|347008340|gb|AEO55827.1| hypothetical protein MYCTH_2300057 [Myceliophthora thermophila ATCC
           42464]
          Length = 496

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/278 (25%), Positives = 108/278 (38%), Gaps = 47/278 (16%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-----------------TADSKWSCPEAGEVLKQCS 145
           RG+VA  +I     L  +P S +I                   D   +    GE     S
Sbjct: 47  RGIVARTDIAADTVLFTIPRSSIICTATSALKNEIPGIFDLEGDEDGNSDSGGEDGTSSS 106

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----R 200
              W LL   LI E     +S+W  Y+  LP    + ++W+  EL   L+AS +     R
Sbjct: 107 QDSWTLLILILIYEYLQGDASQWKPYLDVLPSAFDTPMFWSPTEL-AELQASALVTKVGR 165

Query: 201 ERAIERITN----VIGTYN-------------DLRLRIFSKYPDLFPEEVFNMETFKWSF 243
           E A   I +    VI  ++             D    +  +         F++E    + 
Sbjct: 166 EEADRMIRSKILPVIRGHDHVFFPHGRQRLDDDQLFELAHRMGSAIMAYAFDLEKDDDAN 225

Query: 244 GILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                +   +   +GR  + +VP ADMLN   E   ++++   S  +  T  R  + GE+
Sbjct: 226 EEASEQDEWVDDREGRTMLGMVPMADMLNADAEFNAYINHGADS--LTATALRTIKAGEE 283

Query: 302 VFISYGKKSNGELLLSYGFV-PREGTNPSDSVELPLSL 338
           +   YG   NGELL  YG+V P+      D VELP  L
Sbjct: 284 ILNYYGPLPNGELLRRYGYVTPKHAR--YDVVELPWDL 319


>gi|340516784|gb|EGR47031.1| predicted protein [Trichoderma reesei QM6a]
          Length = 483

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 89/210 (42%), Gaps = 49/210 (23%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT--------RAELDRYLEASQIRER 202
           L+  YL    SF     W  YI ALP QP  +  W          AEL   LE + + E 
Sbjct: 92  LIKEYLRGAESF-----WHPYIQALP-QPEDVDDWALPPLWPEEEAEL---LEGTNV-EI 141

Query: 203 AIERITNVIG-TYNDLRLRIFSKYPDLFPE--EVFNMETFKWSFGILFSRLVR------- 252
            +++I   +G  + D +  + +   D   +   +   E ++W++ I  SR  R       
Sbjct: 142 GLDKIREDLGREFRDAQKLLLASNGDAEDDFSSLLTRELYQWAYCIFSSRSFRPSLVLSR 201

Query: 253 ------LP---SMDGRVALVPWADMLNHSCEVETFLDY------DKSSQG------VVFT 291
                 LP   S +    L+P  D+ NH   V    D       D SS G      V   
Sbjct: 202 EQQEALLPPGVSANDFSVLLPVFDIGNHDMTVHVRWDVTSGGQADASSTGGSAAAAVQLK 261

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
             R+++PGEQ+F +Y  K+N ELLL YGF+
Sbjct: 262 VGREHKPGEQIFNNYSPKTNAELLLGYGFM 291


>gi|302842147|ref|XP_002952617.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
           nagariensis]
 gi|300261961|gb|EFJ46170.1| hypothetical protein VOLCADRAFT_118106 [Volvox carteri f.
           nagariensis]
          Length = 713

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 75/320 (23%), Positives = 123/320 (38%), Gaps = 83/320 (25%)

Query: 82  WLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP--SLVITADSKW----- 132
           WL   G  + P+      +  G+RG++A  +I +GE LL +P   ++ I  D ++     
Sbjct: 6   WLRSRGGRIHPELDLFHTLPSGDRGVIARSDIAEGELLLLLPIDCAIYIPTDEEFKKHPN 65

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLI--SEASFEKSSRWSNYISALPRQ-PYSLLYWTRAE 189
             P+A   L++      P LAT L+  SE +    S W+ Y++ LP   P  LL WT  E
Sbjct: 66  DFPDAVRYLREAHPGLSPFLATTLVLMSEMTRGSVSPWAAYVATLPASCPDCLLNWTEEE 125

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFP------------------- 230
               L+ + + +   +   +V   Y      I +   DL+P                   
Sbjct: 126 -KLELKGTSLEQSGPDPAVDV---YRRHVAPILACRTDLWPGLAAKEPPAAEATGATLDA 181

Query: 231 -------------EEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS----- 272
                           F++E   W  G     +  L +   +V L+P  DM+NHS     
Sbjct: 182 GLAAFARAAGLVQSRAFHLEAENWVSGA--KEIAHLENGGTQVFLLPGIDMINHSHNPDR 239

Query: 273 --CEVETF-----------------LDYDKSSQGV---------VFTTDRQYQPGEQVFI 304
               +E                    D  + ++GV         V   D+  + GE+V  
Sbjct: 240 RNAHLERLNVAQAAAAKLLEREPGEEDAREGAKGVGVRGVEAFFVMRADKPIKAGEEVLH 299

Query: 305 SYGKKSNGELLLSYGFVPRE 324
           +YG  S+ +LL +YGF+  E
Sbjct: 300 TYGNLSDAQLLQTYGFLDSE 319


>gi|326427686|gb|EGD73256.1| hypothetical protein PTSG_04969 [Salpingoeca sp. ATCC 50818]
          Length = 455

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 74/325 (22%), Positives = 135/325 (41%), Gaps = 56/325 (17%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC-SVPDWPL 151
           +AI+   +   G+ A  +++    +  VP SL++  +     P+ G +      + D  +
Sbjct: 50  VAIRTSPLTGNGVYATADMKAHTTVFAVPFSLMMNVEHALVDPDLGRLWDMLPDLSDLEV 109

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LA +L  E S   +  W  Y+++L   P +   +++ E+D    ++ + + A +R  ++ 
Sbjct: 110 LAGFLAFE-SLRGTGFWQPYLASLGPPPTTPTLFSQEEMDLLAPSAAVFDIAQQRHLDLS 168

Query: 212 GTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG------RV 260
             Y DL ++     +  K  + + +       + W+  +L SR  +L   D        +
Sbjct: 169 EVY-DLIVKAATNGLNKKEKEAWQQLGMRKSDYLWAHVVLRSRSHKLSIKDAVGQWHDAM 227

Query: 261 ALVPWADMLN-----HSCEVETFLDYDKSSQGVVF--TTDRQYQPGEQVFISY----GKK 309
            LVP AD+ N     ++  V  +   +       F   T R     E++ + Y     ++
Sbjct: 228 CLVPLADLFNTDLRNNTANVACYTGEEGHGTASTFYCETTRDINHSEELLVEYIGDAMRR 287

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLS-----------------------LKKSDKCYK 346
           S+G+LLL YGFVP   T+ SDSV L L                        L+ SD C K
Sbjct: 288 SSGKLLLDYGFVPT--THDSDSVLLHLPKLSETAEQRLKHFHFREYEPLPWLENSDACLK 345

Query: 347 EKL------EALRKYGLSASECFPI 365
            +L      + L K G   S+ F +
Sbjct: 346 LRLVIYFAIDVLDKNGFDISDDFKV 370


>gi|310798181|gb|EFQ33074.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 485

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 88/226 (38%), Gaps = 45/226 (19%)

Query: 144 CSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------D 191
            SVP   +   +LI +    K+S W  YIS LP QP  L  W    L            +
Sbjct: 96  ASVPPHVIGRFFLIHQYLLGKASFWHPYISTLP-QPEHLQSWILPPLWPSDDVELLEDTN 154

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV 251
            ++  ++ + R      + +    D        Y  L          + W++ I  SR  
Sbjct: 155 VHVAVAETKARLKAEFKHAVAALGDDTAAARRGYTRLL---------YHWAYCIFASRSF 205

Query: 252 R--------------LPS---MDGRVALVPWADMLNHSCEV----ETFLDYDKSSQGVVF 290
           R              LP+   +D    L+P  D+ NH+       +T  D D +S     
Sbjct: 206 RPSLVIPAARKATLALPAGCAVDDFSLLMPLLDVGNHAPTAAVAWDTDADGDGASNSCAL 265

Query: 291 TTDRQYQPGEQVFISYG-KKSNGELLLSYGF-VPREGTNPSDSVEL 334
            T   Y PG QVF +YG  K+N EL+L+YGF VP      +D V +
Sbjct: 266 RTLDPYAPGAQVFNNYGTSKTNAELMLAYGFCVPESAGLHNDYVHV 311


>gi|449302028|gb|EMC98037.1| hypothetical protein BAUCODRAFT_67154 [Baudoinia compniacensis UAMH
           10762]
          Length = 381

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 74/321 (23%), Positives = 127/321 (39%), Gaps = 52/321 (16%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-- 130
           ++      +W    G+   K+   ++     GLV   +++K +++LFVP   +   DS  
Sbjct: 1   MQRHEAFTEWAQARGVEIGKVKPARLPGRGLGLVTTASVKKNQRILFVPEKAMFKPDSAL 60

Query: 131 ------KWSCPEAG---EVLKQCS--VPDWPLLATYLISEASFEKSS--RWSNYISALPR 177
                 + + P+A     VL  C+  V    L      ++  FE+    RW   +  L  
Sbjct: 61  LKQHNLQHASPQAQLAVSVLAACATQVSAIALWEATWPTDTDFEQGMPMRWDGCLQDL-- 118

Query: 178 QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNME 237
            P S+    + + D Y++             +V   +  LR    ++    +   + N  
Sbjct: 119 LPPSVQQPLQRQQDDYMK-------------DVGSVHTFLRHARVTEQRFRYYWSIVNSR 165

Query: 238 TFKWSFGILFSRLVRLP-SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
           +F W          + P    G + + P+ D  NH     T  +  + S G     +R Y
Sbjct: 166 SFHW----------KPPKGKAGSMVMCPFIDYTNHG-PTGTGCNVSQRSNGYEMLANRDY 214

Query: 297 QPGEQVFISYGKKSNGELLLSYGFV---PREGTNPSDSVELP-LSLKKSDKCYKEKLEA- 351
             GE+V  +YG  SN +LL+ YGF+   P    N  D + L  L + K D   ++KL+  
Sbjct: 215 DAGEEVLFTYGAHSNDKLLVHYGFICESPPGLRNKDDDIRLDHLLIPKLDSHVRDKLQDV 274

Query: 352 --LRKYGLSASE---CFPIQI 367
             L  Y L  +E   CF  Q+
Sbjct: 275 GFLGAYALLPAENELCFKTQV 295


>gi|355710254|gb|EHH31718.1| hypothetical protein EGK_12845 [Macaca mulatta]
          Length = 379

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 17/201 (8%)

Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           +SRW  Y +  P   +    ++W   +    L+ + + E   + + N+   Y+ + L   
Sbjct: 36  ASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPEAVEKDLANIRSEYHSIVLPFM 95

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
             +PDLF   V ++E +     ++ +   + P  +           +VP AD+LNH    
Sbjct: 96  EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 155

Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
              L+Y  +   +V T     QP   G ++F +YG+ +N +L+  YGFV     N  D+ 
Sbjct: 156 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 210

Query: 333 ELPLSLKKSDKCYKEKLEALR 353
           ++ +   +       K EA R
Sbjct: 211 DIQMVTVREAALQGTKTEAER 231


>gi|409045252|gb|EKM54733.1| hypothetical protein PHACADRAFT_97093 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 513

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 89/197 (45%), Gaps = 26/197 (13%)

Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           VA+VP ADMLN     ET   +      +   T  + + GEQ++ +YG   N +LL  YG
Sbjct: 270 VAMVPMADMLNGRFNTETARLFYDDEHVLRMMTVHEIKAGEQIWNTYGDPPNSDLLRRYG 329

Query: 320 FV-------PREGT-NPSDSVELPLSL--KKSDKCYKEKLEALRKYGLSASECFPIQITG 369
           F+       P  G  NP+D VE+P +L  + + K    K +    + L  +E   + + G
Sbjct: 330 FIDVTKLESPLSGAGNPADIVEIPANLVVEAATKHTTSKTQDRVDWWLEEAED-DVFVVG 388

Query: 370 ----WPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQALQFILDSC 425
                P E+++ A L++ P   K ++E        K  +K  +  P +D       +D  
Sbjct: 389 TDCELPPEMVSLARLLLQP---KAEWE--------KTKAKGKVPKPTMDTTIAAIAMDVL 437

Query: 426 ESSISKYSRFLQVKELL 442
           +S + +Y   ++  E L
Sbjct: 438 QSRLKEYPTSVEEDERL 454


>gi|19112238|ref|NP_595446.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces pombe
           972h-]
 gi|74626910|sp|O74738.1|SET10_SCHPO RecName: Full=Ribosomal N-lysine methyltransferase set10
 gi|3738151|emb|CAA21252.1| ribosomal lysine methyltransferase Set10 [Schizosaccharomyces
           pombe]
          Length = 547

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/192 (27%), Positives = 85/192 (44%), Gaps = 14/192 (7%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           L T+L  E+     S+W  YI  LP+   + LY+   + + +L ++     A ER+    
Sbjct: 82  LCTFLALESLKGIQSKWYGYIEYLPKTFNTPLYFNEND-NAFLISTNAYSAAQERLHIWK 140

Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL----FSRLVRLPSMDGRVALVPWAD 267
             Y +      S +P   P E F  + + WS  +     FS  +     +    L+P  D
Sbjct: 141 HEYQE----ALSLHPS--PTERFTFDLYIWSATVFSSRCFSSNLIYKDSESTPILLPLID 194

Query: 268 MLNHSCEVETFLDYD-KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
            LNH  +     + D +  + V   +      G Q+F +YG K N ELL+ YGF   +  
Sbjct: 195 SLNHKPKQPILWNSDFQDEKSVQLISQELVAKGNQLFNNYGPKGNEELLMGYGFCLPD-- 252

Query: 327 NPSDSVELPLSL 338
           NP D+V L +++
Sbjct: 253 NPFDTVTLKVAI 264


>gi|384248108|gb|EIE21593.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 229

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 84/194 (43%), Gaps = 24/194 (12%)

Query: 81  KWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEV 140
           +WL   G     +    V  G RG++A  NI +G  L+ VP  L+++A S        E 
Sbjct: 8   EWLQKGGALIADIEPGAVAEGFRGVIAKANIEEGTLLVAVPERLLLSAHSAKKDRAFAEA 67

Query: 141 L---KQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
           L    + S+    +LA +L+ EAS  + S W  Y++ LPRQ Y+ L         Y    
Sbjct: 68  LLATNKQSIGSSQVLAAHLLHEASKGQESFWRPYLATLPRQ-YTCL--------SYFSPE 118

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
            IRE  +E   +           I S   +    +  +++    +   + SR + LP  D
Sbjct: 119 DIRELQVEYAMD-----------IASSVVEALRSDHTSVKPLLNALATVASRTMYLPD-D 166

Query: 258 GRVALVPWADMLNH 271
              AL+P+ D+ NH
Sbjct: 167 AAGALMPFGDLHNH 180


>gi|194864902|ref|XP_001971164.1| GG14807 [Drosophila erecta]
 gi|190652947|gb|EDV50190.1| GG14807 [Drosophila erecta]
          Length = 1183

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 35/214 (16%)

Query: 151 LLATYLISEASFEK------SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
           L+A Y++     +K      SS +S Y+  LPR   +  + +  EL    E+  + ER +
Sbjct: 154 LVACYVLYHKHLQKCSLGTRSSPYSAYLDTLPRGYTTPYFCSIPELQCLPES--LLERTV 211

Query: 205 ERITNVIGTYNDLRLRIFSK--YPDLFPEEVFNMETFKWSFGILFSRLVRLPSM------ 256
            +   + G +  ++  + +       + +E++ +  FKW++  + +R V L S       
Sbjct: 212 AQNRQIRGYFEIIKNLVLNCDCCAKSYGQEIWTLADFKWAYFTVNTRSVHLSSRFLKKQS 271

Query: 257 ---------DGRVALVPWADMLNHSCEVETFL-----DYDKSSQGVVFTTDRQYQPGEQV 302
                    D  +AL P+ D+ NHS  V+        DY  + + + F+  + Y   +Q+
Sbjct: 272 NYFQPLISGDTNMALAPFLDLFNHSDSVQITAEIEGPDYVLTLKSLPFSKTKPY---DQL 328

Query: 303 FISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           FISYG  SN +LL  YGF  +E  N  D  E+ L
Sbjct: 329 FISYGALSNFKLLTEYGFWLQE--NKHDYFEVSL 360


>gi|412987807|emb|CCO19203.1| predicted protein [Bathycoccus prasinos]
          Length = 638

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 63/222 (28%), Positives = 91/222 (40%), Gaps = 48/222 (21%)

Query: 163 EKSSR------WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           E+SSR      W  Y+  LP    SLL W+  EL  +L+ S+I ERA  R   V   +  
Sbjct: 234 EESSRDDYSPHWQAYVDLLPTNVDSLLLWSTEEL-AFLQNSRIGERAKRRKKLVAEEFQV 292

Query: 217 L-----------RLRI---------FSKYPDLFPEEVFNMET----FKWSFGILFSRLVR 252
           L           RL +           +   L P   F  ET    F+W++  + +R   
Sbjct: 293 LFSNASLREEFERLMMPQTKQTQTQTRQRKRLTP---FRFETLREWFEWAYATVLARAFT 349

Query: 253 LPSM-DGRVALVPWADMLNHS-----CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           LP + +G + L P  D+ N +     CEV      D S    V      Y+ G Q F  Y
Sbjct: 350 LPEIENGALLLCPGLDIFNSARDAAKCEVRLSPHDDISLHATVGG----YRAGTQAFHDY 405

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
             KS+G  LL +GF+     +  + +  PL +   D+  K K
Sbjct: 406 ADKSSGGALLEFGFI----YDDDERLNFPLFMDDDDEKEKAK 443


>gi|159490102|ref|XP_001703025.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158270838|gb|EDO96670.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 471

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/229 (22%), Positives = 103/229 (44%), Gaps = 7/229 (3%)

Query: 109 KNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASFEKSSR 167
           +++++G +L+ +P +  +T  +    P    ++++     W   LA  LI++      S+
Sbjct: 29  RDVQQGHRLITLPNAAHLTYGAN-DDPRLLALIEKVPSELWGAKLALQLIAQRLQGGESQ 87

Query: 168 WSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP 226
           +++Y++ LP+  P   +++ R  LD  ++     ++  +R   +     ++  R+     
Sbjct: 88  FASYVAELPKGFPGIPVFFPRTALD-MIDYPPCSQQVKKRCKWLYEFSTEVLARLPGSPE 146

Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET-FLDYDKSS 285
           D F     ++    W+   + SR  R        A++P  DM NH+       L  +   
Sbjct: 147 DPFGGVAVDINALGWAMAAVSSRAFRTRGPTQPAAMLPLIDMANHTFSPNAEVLPLEGGG 206

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
             V     R    GE + +SYG+ SN  L + YGF+  +  NP DSV+L
Sbjct: 207 GAVGLFARRAITEGEPLLLSYGQLSNDFLFMDYGFIVED--NPYDSVQL 253


>gi|157818191|ref|NP_001099637.1| N-lysine methyltransferase SETD6 [Rattus norvegicus]
 gi|325530256|sp|D3ZSK5.1|SETD6_RAT RecName: Full=N-lysine methyltransferase SETD6; AltName: Full=SET
           domain-containing protein 6
 gi|149032380|gb|EDL87271.1| similar to hypothetical protein FLJ21148 (predicted) [Rattus
           norvegicus]
          Length = 474

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/288 (23%), Positives = 125/288 (43%), Gaps = 28/288 (9%)

Query: 65  PWGCEIDSLENASTLQKWLSDSGL--PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPP 122
           P G E +S   A  L +W +  GL   P+ +  ++  V   G+VA ++++ GE L  VP 
Sbjct: 37  PSGHEPESDAVAGFL-RWCTRVGLELSPKVLVSRQGTVAGYGMVARESVQPGELLFAVPR 95

Query: 123 SLVITADSKWSCPEAGEVLKQ----CSVPDWPLLATYLISEASFEKSSRWSNYISALPR- 177
           S ++   S  +C  +  + ++     S+  W +     +       +S WS Y +  P  
Sbjct: 96  SALL---SPHTCSISDLLERERGALQSLSGW-VPLLLALLHELQAPASPWSPYFALWPEL 151

Query: 178 -QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNM 236
            +    ++W   E  R L+ + + E   + + N+   Y  + L     + DLF   V ++
Sbjct: 152 GRLEHPMFWPEEERLRLLKGTGVPEAVEKDLVNIRSEYYSIVLPFMEAHSDLFSPTVRSL 211

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVV 289
           E ++    ++ +   + P  +           +VP AD+LNH       L+Y      +V
Sbjct: 212 ELYRQLVALVMAYSFQEPLEEEEDEKEPNSPLMVPAADILNHIANHNANLEYSAEYLRMV 271

Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
            T     QP   G ++F +YG+ +N +L+  YGF      N  D+ ++
Sbjct: 272 AT-----QPILKGHEIFNTYGQMANWQLIHMYGFAEPYPNNTDDTADI 314


>gi|429851283|gb|ELA26485.1| set domain-containing protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 196

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/80 (42%), Positives = 42/80 (52%), Gaps = 9/80 (11%)

Query: 262 LVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
           L P AD+LNH+   C V     +D  S       DR Y PGE++ I YG+ SN  LL+ Y
Sbjct: 3   LQPVADLLNHASRGCNVA----FDTES--FTIRADRDYSPGEEIHICYGRHSNDFLLVEY 56

Query: 319 GFVPREGTNPSDSVELPLSL 338
           GFV  EG N  D   L  +L
Sbjct: 57  GFVMGEGENEWDEACLDEAL 76


>gi|408392258|gb|EKJ71616.1| hypothetical protein FPSE_08255 [Fusarium pseudograminearum CS3096]
          Length = 527

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 135/322 (41%), Gaps = 52/322 (16%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+VA+ +IR  + +L VP   V T D+        + L   SV         L +E + +
Sbjct: 32  GIVAVYDIRANQTILSVPTRAVRTIDTVPK--HIKDALHGVSV------HGILAAEIALD 83

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
            S  ++ + + LP         TR +L+  +     S+++    +R  +++   N    R
Sbjct: 84  DSDDFAIWRTVLP---------TREDLEGGMPMMWPSELQALLPKRAKDLLDNQNTTFRR 134

Query: 221 ----IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
               +   +P L  +E      + N  TF  S   + S      +   R+  +P  D+ N
Sbjct: 135 ECDIVLKAFPTLTRDEYMLSWVLINTRTFYNSMPKMKSY-----AHSDRLVCMPVLDLFN 189

Query: 271 HSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGT 326
           H  + +   L Y  S+ G    TDR Y+ GE+VF+SYG  SN  LL  YGF+    R   
Sbjct: 190 HEDQSQGCKLVY--SALGYSVQTDRAYKQGEEVFVSYGPHSNDFLLTEYGFILDTNRWDE 247

Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPS 386
              D V LPL L K+ +   E +  L +Y L        Q  G     +A   L  SP  
Sbjct: 248 VYLDEVILPL-LNKTQRAELESVGFLGRYTLDD------QTPGCHRTQVALRMLCCSP-- 298

Query: 387 MKGKFEEMAAAASNKMTSKKDI 408
             G+++    A  +  +S+ ++
Sbjct: 299 --GQWQRFFDACEDGRSSQVEV 318


>gi|355756831|gb|EHH60439.1| SET domain-containing protein 6, partial [Macaca fascicularis]
          Length = 371

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 87/201 (43%), Gaps = 17/201 (8%)

Query: 165 SSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF 222
           +SRW  Y +  P   +    ++W   +    L+ + + E   + + N+   Y+ + L   
Sbjct: 36  ASRWRPYFALWPELGRLEHPMFWPEEQRRCLLQGTGVPEAVEKDLANIRSEYHSIVLPFM 95

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEV 275
             +PDLF   V ++E +     ++ +   + P  +           +VP AD+LNH    
Sbjct: 96  EAHPDLFSLRVRSLELYHQLVALVMAYSFQEPLEEEEDEKEPNSPVMVPAADILNHLANH 155

Query: 276 ETFLDYDKSSQGVVFTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
              L+Y  +   +V T     QP   G ++F +YG+ +N +L+  YGFV     N  D+ 
Sbjct: 156 NANLEYSANCLRMVAT-----QPIPKGHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTA 210

Query: 333 ELPLSLKKSDKCYKEKLEALR 353
           ++ +   +       K EA R
Sbjct: 211 DIQMVTVREAALQGTKTEAER 231


>gi|68479052|ref|XP_716460.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
 gi|46438129|gb|EAK97465.1| hypothetical protein CaO19.7326 [Candida albicans SC5314]
          Length = 579

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 80/332 (24%), Positives = 143/332 (43%), Gaps = 71/332 (21%)

Query: 73  LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL-------LFVPPS 123
           +E+ + L KW   +G  + P       V+  E     +  I KG K+       + +P  
Sbjct: 4   IESINKLLKWAESNGAQISPD------VEFKEISKNYIGAIYKGNKVPDSPFCPISIPSK 57

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----Q 178
           L+IT  + +   E  + LK   + D  +L  +L  E     +S +  Y++ LP       
Sbjct: 58  LIITPQTAFK--EFSKSLKNTDINDNSILKLHLCHE-RLNGNSFFYPYLNLLPSLSEIDS 114

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVI--------GTYNDL--------RLRIF 222
           PY+   W+ A    YL+ + +     E +  ++          ++DL         ++ +
Sbjct: 115 PYT---WS-ANDKSYLQGTNLGNSLKENLVTLVEEWWKAINALHDDLPKPEQHYINMKFY 170

Query: 223 SKYP---------DLFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPW 265
            +Y           L  E + N  +F    W+  IL SR     L+   +      L+P 
Sbjct: 171 YEYKFYTDDDLNKYLNDENIENWTSFPNYLWASLILKSRSFPAYLIDKNNKQDSAMLLPV 230

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            D+LNH+ + +  + +D S     F+++    PG+++F +YG K N ELLL+YGF     
Sbjct: 231 VDLLNHNSKSK--VHWDVSDNYFKFSSE-SIVPGKEIFNNYGLKGNEELLLAYGFCIE-- 285

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
            N  DSV L + +       +EK++A+ +YG+
Sbjct: 286 NNSQDSVALKIKMP------EEKIKAIEEYGI 311


>gi|302848348|ref|XP_002955706.1| hypothetical protein VOLCADRAFT_106928 [Volvox carteri f.
           nagariensis]
 gi|300258899|gb|EFJ43131.1| hypothetical protein VOLCADRAFT_106928 [Volvox carteri f.
           nagariensis]
          Length = 542

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/238 (23%), Positives = 111/238 (46%), Gaps = 17/238 (7%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPL-LATYLISEASF 162
           GL A +++  G +L+ +P +  +T  +K   P    ++++     W   LA  L+S+   
Sbjct: 101 GLQASQDLEPGRRLIVLPAACHLTYGAK-DDPRLLALIEKVPNELWGAKLALQLLSQRLR 159

Query: 163 EKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI 221
              S ++ YIS LPR  P   +++++  LD  ++   + ++ +++    + T++    ++
Sbjct: 160 GADSLFAAYISNLPRGIPGIPMFFSKRALD-LIDYPPVTQQ-VQKRCRWLHTFSQ---QV 214

Query: 222 FSKYP----DLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVET 277
            +K P    D F     ++    W+   + SR  R        A++P  DM NH+     
Sbjct: 215 MAKLPGSPEDPFGGVTVDINALGWALACVTSRAFRTRGPAHPAAMLPLIDMANHTFTPNA 274

Query: 278 -FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
             L       G+   +  +   GE + +SYGK +N  L + YGF+  +  NP D+V+L
Sbjct: 275 EVLPLPGGDMGLFAKS--KVATGEPLLLSYGKLNNDFLFMDYGFIVPD--NPYDTVQL 328


>gi|342877200|gb|EGU78693.1| hypothetical protein FOXB_10798 [Fusarium oxysporum Fo5176]
          Length = 456

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/194 (29%), Positives = 81/194 (41%), Gaps = 32/194 (16%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNV 210
           L+  +L+   SF     W+ YI ALP QP     W+        +A       IE    V
Sbjct: 91  LIKHFLLKGESF-----WAPYIQALP-QPDDHDSWSLPPFWPDEDAELFEGTNIE--VGV 142

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFK--------WSFGILFSR-----LV------ 251
                +++ R F    DL   E + +E  K        W++ I  SR     LV      
Sbjct: 143 TSIRANVK-REFKTAHDLLAAESWELELLKQFTLPLYQWAYSIFSSRSFRPSLVLGPEDQ 201

Query: 252 -RLPS---MDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG 307
            RLP    +D    L+P  D+ NH    +     ++   G     ++ YQ GEQVF +Y 
Sbjct: 202 QRLPEGVKLDDFSVLMPLFDVGNHDMTTKVEWVRNERINGCSLKVEKAYQAGEQVFNNYS 261

Query: 308 KKSNGELLLSYGFV 321
            K+N ELLL YGF+
Sbjct: 262 MKTNAELLLGYGFM 275


>gi|449019745|dbj|BAM83147.1| similar to protein N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 576

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 9/97 (9%)

Query: 243 FGILFSRLVRLPSMDG----RVALVPWADMLNHSCEVETFLDYDKSSQGVVFT-TDRQYQ 297
           F  L+ R+ +  SM G    +  + P+ D+ NHS  V++ + Y+        + ++R   
Sbjct: 384 FDALYPRIYQ--SMQGAPLKKYVIAPFIDLFNHSSRVQSKVAYEYFYDAFSLSISNRDTH 441

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
            G+QVFISYG  +N ELL  YGFV  E  NP D+ +L
Sbjct: 442 AGDQVFISYGTLTNDELLALYGFV--EEDNPHDTYKL 476


>gi|363747032|ref|XP_003643892.1| PREDICTED: histone-lysine N-methyltransferase setd3-like, partial
           [Gallus gallus]
          Length = 283

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 94/207 (45%), Gaps = 21/207 (10%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L KW +++G   +   I   +    GL A + I+  E  L+VP  L++T +S        
Sbjct: 82  LIKWATENGASTEGFEIANFEEEGFGLKATREIKAEELFLWVPRKLLMTVESA-----KN 136

Query: 139 EVLKQCSVPDWPL-------LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
            VL      D  L       LA +L+ E +   +S W  YI  LP +  + LY+   E+ 
Sbjct: 137 SVLGSLYSQDRILQAMGNITLAFHLLCERA-NPNSFWLPYIQTLPSEYDTPLYFEEDEV- 194

Query: 192 RYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD---LFPEEVFNMETFKWSFGILFS 248
           +YL ++Q       +  N    Y     ++   +P+   L  ++ F  + ++W+   + +
Sbjct: 195 QYLRSTQAIHDVFSQYKNTARQYAYF-YKVIQTHPNASKLPLKDSFTYDDYRWAVSSVMT 253

Query: 249 RLVRLPSMDGR---VALVPWADMLNHS 272
           R  ++P+ DG    +AL+P  DM NH+
Sbjct: 254 RQNQIPTEDGSRVTLALIPLWDMCNHT 280


>gi|313239023|emb|CBY14007.1| unnamed protein product [Oikopleura dioica]
          Length = 299

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 11/107 (10%)

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
           GR  L P  D+LNH+ E    ++ D++   +     R+ + GE++F S+G+ SN  LL  
Sbjct: 74  GRFML-PVGDLLNHTAENNARIELDENDHTI--NAIREIESGEEIFNSFGEMSNSRLLHM 130

Query: 318 YGFVPREGTNPSDSVELPLSLKKSDKCYKE-------KLEALRKYGL 357
           YGF  +E  +  D+   P S+KK  + +K+       KL+ L K+G 
Sbjct: 131 YGFAEKE-NDADDAFLDPNSIKKGIEEFKDEIPAIQAKLKLLEKHGF 176


>gi|258567286|ref|XP_002584387.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237905833|gb|EEP80234.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 706

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 102/227 (44%), Gaps = 28/227 (12%)

Query: 151 LLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
           L+  YL+ + SF     W+ YI +LP   Q   L Y+T  +L ++LE + + +   + + 
Sbjct: 130 LMDQYLLGDESF-----WAPYIQSLPDDSQFTRLEYYTGDDL-KWLEGTNLLKLREKLLE 183

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL--------------VRLP 254
            +   Y +  LR+  ++P+    + +  E F W+  I+ SR                R+ 
Sbjct: 184 RLKAKY-ETGLRLLKEFPNKNTPK-YTWERFLWASSIILSRAFSSEVLKDYIKGTPTRVK 241

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGEL 314
            ++    LVP  D+ NH    +  +++  S + +     +   PGE+V  +YG +SN  L
Sbjct: 242 PLEDFSVLVPLVDISNHQPLAQ--VEWATSLEKIGLIVHKTLLPGEEVPNNYGPRSNERL 299

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE 361
           +++YGF  R   N  D  E+ L           K E   ++G S S+
Sbjct: 300 MMNYGFCIR--GNVCDYREMNLRAPPDSPLAIAKQEQQTRFGASKSK 344


>gi|255080174|ref|XP_002503667.1| set domain protein [Micromonas sp. RCC299]
 gi|226518934|gb|ACO64925.1| set domain protein [Micromonas sp. RCC299]
          Length = 401

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 75/160 (46%), Gaps = 10/160 (6%)

Query: 125 VITADSKWSCPEA-----GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQ 178
           V+TA  K +C  A      E L++  +     L   ++ E S  + SRW+ Y + LP R 
Sbjct: 22  VVTAIPKAACLSARTCSVAETLREARLGGGLALNIAIMHERSLGEGSRWAGYFAVLPARG 81

Query: 179 PYSL-LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNM 236
             +L ++WT A+L+ +L  + +     E   ++   +N+  +  +   +P  FP     +
Sbjct: 82  ERTLPMFWTSAQLE-HLRGTDLLRHVTEDAESMRLDFNENVVDGLCVTHPVAFPPGKHTL 140

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
           E +  +  +  SR   +    G  ALVPWADM NH  + E
Sbjct: 141 EAYMEAASLAASRAFYIGEECGE-ALVPWADMFNHKTDGE 179


>gi|328867430|gb|EGG15812.1| hypothetical protein DFA_09480 [Dictyostelium fasciculatum]
          Length = 466

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/250 (22%), Positives = 112/250 (44%), Gaps = 33/250 (13%)

Query: 102 ERGLVALKNIRKGEKLLFVPPSLVITADS-KWSCPEAGEVLKQCSVPDWPLLATYLI--S 158
           ERG+   ++I++G  ++ +    + + D  K   P   ++L++  V     LA  L+   
Sbjct: 75  ERGIFVNEDIKEGSDIVNLQWINIFSLDRIKHETPHLYQLLQENDVSAEIGLAISLMYYR 134

Query: 159 EASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR 218
               + +S +  ++++LP +  + LY+   E+ + L+ S      + +I      Y  L 
Sbjct: 135 YCKDDTTSEYYQWMNSLPTELNTGLYFNPDEI-QLLKGSPAFVHLMLQIDQTREMYQKLN 193

Query: 219 LRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--RLP---------------------- 254
             +F     +F     N + +KW+  I+ SR +   LP                      
Sbjct: 194 GEMFK--DKIFDGCAINWDRYKWAVSIVGSRGIYTELPIDKELEKKKETKEKEEKEEEEE 251

Query: 255 SMDGRVALV--PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKS-N 311
           S+  R+ +V  P+ D  NH+ + +   D+D  S  +  +  +  + GEQ+F++YG ++ +
Sbjct: 252 SLQQRLTIVLAPFIDYFNHNNDAQATYDFDHESSSIRVSLLKSVKSGEQIFLNYGNQNCD 311

Query: 312 GELLLSYGFV 321
            + L+ YGFV
Sbjct: 312 SDFLIDYGFV 321


>gi|407417214|gb|EKF38012.1| hypothetical protein MOQ_001785 [Trypanosoma cruzi marinkellei]
          Length = 578

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 113/269 (42%), Gaps = 26/269 (9%)

Query: 115 EKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISA 174
           EK+ F+  ++V   D       +GE+    S  D PLL   LI E    ++S W++ + +
Sbjct: 166 EKMFFID-TVVKYCDLGRVVHASGELSSMIS-GDEPLLVLSLIYERYVAETSHWNDLLCS 223

Query: 175 LPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPE-- 231
            P   P    +W   +L   LE   + +  + +   +   + +    +   Y  L     
Sbjct: 224 CPVDYPNVPSFWDWEDLAE-LEGLDVLDDVLAKKAQLAQFHTETMAVLPFIYEALAGSCR 282

Query: 232 -------EVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYD 282
                  E F++E   W+     SR   L ++DGRV  ALVP ADM+NH    +  +   
Sbjct: 283 LGKDEFLECFSIEAMMWARATFDSRAFNL-NVDGRVVIALVPVADMINHHNRSDVLVRKV 341

Query: 283 KSSQG----VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           + + G     +  +      G ++++SYG   N ELL  YGFV  E  N  D +  P   
Sbjct: 342 EPNGGDFVMQIGASLTAQDIGRELWMSYGPLQNWELLQFYGFVVEE--NEHDRLPFPFDF 399

Query: 339 KK---SDKCYKEKLEALRKYGLS-ASECF 363
            +    D+  + +   +  YGL  A  C+
Sbjct: 400 PEGVAGDEWDRRRATLVATYGLHLAGRCW 428


>gi|330924024|ref|XP_003300479.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
 gi|311325361|gb|EFQ91406.1| hypothetical protein PTT_11726 [Pyrenophora teres f. teres 0-1]
          Length = 642

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 91/214 (42%), Gaps = 28/214 (13%)

Query: 141 LKQCS--VPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
           L+QC   +PD  L    LI +    ++S W  YI+ LP  R   + L++   ++  +L  
Sbjct: 84  LRQCQGRIPDHILTYLLLIEQRDRGQASPWHAYIACLPSPRDMTTPLWFNEGDM-AFLAG 142

Query: 197 SQIRERAIERITNV-------IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
           + +   A ER   +       +    +L + +          +  ++E+  W+  +  SR
Sbjct: 143 TSLAPAAKERRAELQQQWERAVAVMEELSIPL---------AKGIDIESLLWAATVFTSR 193

Query: 250 LVR----LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFI 304
                  LP  +    L P  D+LNHS   +   D+       +   D   +QP +++F 
Sbjct: 194 AFISTHILPEKETVPILFPVVDILNHSVSAKVEWDFQPRQSFALKCLDGHSFQPRQELFN 253

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           +Y  K N ELLL YGF   +  NP +   L L+ 
Sbjct: 254 NYAPKQNDELLLGYGFCLED--NPIEQFALKLAF 285


>gi|397574384|gb|EJK49180.1| hypothetical protein THAOC_31979 [Thalassiosira oceanica]
          Length = 462

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/259 (22%), Positives = 115/259 (44%), Gaps = 23/259 (8%)

Query: 79  LQKWLSDSGL-PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-----KW 132
           +++W    GL     + +   D  +  LVA   +  G+ +L+VP  ++++++        
Sbjct: 75  MEQWAQQYGLQKADGVELYSDDGADYQLVAQNGVAAGQMILYVPSDIILSSNGVAEEFGG 134

Query: 133 SCPEAGEVLKQCS------VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
           +  +A +VL Q        +P + L+A  +++E    ++S +  ++++LPRQ Y+ +  T
Sbjct: 135 AIAQAEQVLVQMDQGTQKRLPLFRLMAK-ILNEYDQGENSPFYPWLASLPRQYYNGVSMT 193

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGIL 246
            A        + +   A    +N    Y+     +   Y  L    V + +   W++ + 
Sbjct: 194 DACF------ACLPPYAGWLTSNERNNYHFFVAALRKGYVQL--NSVHDEKAVMWAYNVA 245

Query: 247 FSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
           F+R   + S   +  + P ADM+NHS E    + +D      V T      PG  +  SY
Sbjct: 246 FTRFEEVWSPSRQKLIAPMADMINHSAEPNCQISFDDMGNCQV-TALYDIPPGTPITKSY 304

Query: 307 GKKSN-GELLLSYGFVPRE 324
           G  +N   +   YGF+P++
Sbjct: 305 GDPTNPTPIFAQYGFLPQD 323


>gi|302914506|ref|XP_003051150.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
           77-13-4]
 gi|256732088|gb|EEU45437.1| hypothetical protein NECHADRAFT_106131 [Nectria haematococca mpVI
           77-13-4]
          Length = 499

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 52/193 (26%), Positives = 86/193 (44%), Gaps = 9/193 (4%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRI--F 222
           S+ W+ Y+  LPR       W+  E    L+ + +      ++  +   +++LR +    
Sbjct: 134 STPWTEYLKFLPRHVPVPTMWSEVER-ALLQGTSLEAALEAKLAALNNEFDELREKSSGL 192

Query: 223 SKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDY 281
           + +  LF E E   ++ +     +  SR + LP      A+VP  DM NHS +   + + 
Sbjct: 193 AFWNSLFWEKETATIQDWILIDALYRSRCLELPRAGD--AMVPGLDMANHSHDPTAYYEE 250

Query: 282 DKSSQGVVFTT-DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           D     V+      +   GE+V ISYG KS  E+L SYGF+ R+    +  + LPL    
Sbjct: 251 DDKDDVVLLLRLGVEVTGGEEVSISYGDKSPAEMLFSYGFIDRDSA--AHDLTLPLEALP 308

Query: 341 SDKCYKEKLEALR 353
            D   K KL   +
Sbjct: 309 DDPLGKAKLHIFK 321


>gi|358384957|gb|EHK22554.1| hypothetical protein TRIVIDRAFT_83966 [Trichoderma virens Gv29-8]
          Length = 378

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/143 (32%), Positives = 73/143 (51%), Gaps = 14/143 (9%)

Query: 221 IFSKYPDLFPEE------VFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE 274
           + + YP+L  ++      + N  TF ++         +LP  D R+AL P AD+ NH+ E
Sbjct: 140 VTAAYPELHKDDYLYSWLLINTRTFYYTD----RGTDKLPRED-RMALQPVADLFNHTPE 194

Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
                ++D   +   FTT R +QPGE+VFI YG  +N +LL+ YGF      NP D   L
Sbjct: 195 GYCVANFD--DKFFTFTTTRTHQPGEEVFIRYGPHANDKLLVEYGFTLPSSVNPWDETCL 252

Query: 335 PLSLKKSDKC-YKEKLEALRKYG 356
              +  S     +++LEA+  +G
Sbjct: 253 DSYICPSMTVDQRDRLEAVGFWG 275


>gi|428179206|gb|EKX48078.1| hypothetical protein GUITHDRAFT_106158 [Guillardia theta CCMP2712]
          Length = 410

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 93/228 (40%), Gaps = 27/228 (11%)

Query: 134 CPEAGEVLKQCSVPD-----WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
           C    EV+K  S  D       LL+ +L+ E    + SRW  +I ++P    +L  W+  
Sbjct: 3   CRFVREVVKAASQEDEQLCERQLLSLHLLVEKWKAERSRWWRFIRSIPPSYDTLENWSEQ 62

Query: 189 ELDR-----YLEASQIRERAI-ERITNVIGTYNDLRLRIFSKYPDLFPE-------EVFN 235
            + R     +L  +  R+R + +  + +    +  + R +++                F+
Sbjct: 63  SVARLQYKPFLAIAARRKRVVNDEFSQLQRLLSRCKKRSWNEPEAAEEAERIQLGFSSFS 122

Query: 236 METFKWSFGILFSRLVRLPSMDGRV-------ALVPWADMLNHSCEVETFLDYDKSSQGV 288
            E + W+ G + +R        G          LVP  D LNHS        + K +   
Sbjct: 123 REDYLWAAGTVSTRSCHYERKSGYSLRGETVGCLVPVLDFLNHSTAPVAACGFCKDAMVY 182

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
             T  R Y+ GEQV I YG  SN  LL  YGFV  +  NP DS  L L
Sbjct: 183 RVTCLRSYEEGEQVMIHYGNWSNAGLLEHYGFVLED--NPLDSCMLWL 228


>gi|258569485|ref|XP_002543546.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903816|gb|EEP78217.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 480

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 90/407 (22%), Positives = 156/407 (38%), Gaps = 85/407 (20%)

Query: 92  KMAIQKVDVGERGL--VALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW 149
           ++A  + +   RG+  VA + I + E+L  +P +LV++  +           K+     W
Sbjct: 42  RIADLRSNAAGRGVETVACEEIAQDEELFAIPENLVLSVQNSKLKDHLNFTDKELD--SW 99

Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAI 204
             L   +I E     +SRWS+Y + LP    +L++W++ EL R L+ S +     R+ A 
Sbjct: 100 LSLIVTMIYEYLHGGASRWSSYFAVLPTDFDTLMFWSQDEL-RELQGSSVLSKIGRQEAD 158

Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFKWSFGILFSRLVRL------- 253
           E I   +         +   YP LFP       FN +  K +   L  R+  L       
Sbjct: 159 EMIMGKV-------YPLILDYPGLFPTPKELSSFNSQQGKEAILHLAHRMGTLIMAYAFD 211

Query: 254 ------------PSMDGRV---------ALVPWADMLN---HSCEVETFLDYDKSSQGVV 289
                          DG +          +VP ADMLN   H      F    +     +
Sbjct: 212 IENEMDREEEDQDGEDGYITDNEQETAKGMVPLADMLNADAHRNNARLF----QEDGYFI 267

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
             +       E++F  YG+    +LL  YG++  E  +P D VE+              L
Sbjct: 268 MKSIVPISMEEEIFNDYGELPRADLLRRYGYIT-ENYSPYDVVEI-------------SL 313

Query: 350 EALRKYGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE-------------MAA 396
           EA+ K       C  +++     E++   Y ++ P +     E              M  
Sbjct: 314 EAICKVAGVEKNCPQLELLETA-EILEDGYSLLRPETDANLVEAISPELIVLLRTLTMTP 372

Query: 397 AASNKMTSKKDIKCPEIDEQALQFILDSCESSISKY-SRFLQVKELL 442
              N+M  K  +  P++D+ + + +++  +S  + Y +   Q  ELL
Sbjct: 373 DNLNQMRVKGKLPKPQLDQASAKLLIEVLQSRQNDYPTTIAQDDELL 419


>gi|380477696|emb|CCF44010.1| SET domain-containing protein, partial [Colletotrichum
           higginsianum]
          Length = 448

 Score = 56.2 bits (134), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 93/197 (47%), Gaps = 16/197 (8%)

Query: 165 SSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR--LRI 221
           S+ W+ Y+  LP R P   L WT  E D  L+ + +      +I  +   +++LR     
Sbjct: 118 STPWTEYVKYLPPRVPVPTL-WTEQERD-MLQGTSLESATAAKIVALTDEFDELRETSST 175

Query: 222 FSKYPDLFPE-EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
            + + +LF E E  ++  +        SR + LP      A+VP  D+ NHS E   +  
Sbjct: 176 LTFWNELFWESEKISLIDWVRVDAWFRSRCLELPK--SGEAMVPVLDLANHSSEANAY-- 231

Query: 281 YDKSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPL 336
           Y+++ +  V    R   +   GE++ ISYG  KS  E+L SYGF+  +  + +D + LPL
Sbjct: 232 YEENGKDEVVLLLRPGCRVSSGEEMTISYGDAKSGAEMLFSYGFI--DPVSAADRMTLPL 289

Query: 337 SLKKSDKCYKEKLEALR 353
              + D   K KL   +
Sbjct: 290 MPLEDDPLGKAKLHIFK 306


>gi|428182388|gb|EKX51249.1| hypothetical protein GUITHDRAFT_103166 [Guillardia theta CCMP2712]
          Length = 386

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 62/195 (31%), Positives = 84/195 (43%), Gaps = 29/195 (14%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDW----PLLATYLIS 158
           RGL A   I   E L  +PP L+I   +    P   EV K  S+P +      LA YLI 
Sbjct: 148 RGLKANGTIHDHEILFRIPPKLLINVGTVSQDPNFAEVWK--SIPQFHKGLSGLAVYLIH 205

Query: 159 EASFEKSSRWSNYISALPRQ-PYSLLYWTRAE--------LDRYLEASQIRERAIERITN 209
           E S  KSS W  Y+ ALPR  P  + Y  R          L    E     +  IER  +
Sbjct: 206 E-SLNKSSFWRPYLCALPRHVPLPIFYSERKFERERREKILKPLPEQVTRFDDLIERARD 264

Query: 210 VIGT-YNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR----------LVRLPSMDG 258
           V+   Y +L  ++FS++P  F    +    + W+  I+ SR          L ++   + 
Sbjct: 265 VLEVHYVELMPKLFSQFPLKFSPADYTYARWAWACSIIMSRTWGRKFKDDVLGKMTGENV 324

Query: 259 RVA--LVPWADMLNH 271
            V   LVP ADM NH
Sbjct: 325 SVVHTLVPAADMPNH 339


>gi|405119695|gb|AFR94467.1| nuclear protein [Cryptococcus neoformans var. grubii H99]
          Length = 495

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 108/260 (41%), Gaps = 55/260 (21%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVI---TAD-------SKWSCPEAGEVLKQCSVPDWPLLA 153
           G VA+K+I +G  L  +P  L++   T+D       S+W     G          W  L 
Sbjct: 44  GAVAVKDIEEGTPLFHIPDDLILSPYTSDLKDHLDASEWDQLNKG----------WAQLI 93

Query: 154 TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGT 213
             L+ E      SRW+ Y++ +P    + ++WT  + ++ L  + I +R I R  +    
Sbjct: 94  LVLMWETIKGSKSRWAGYLANMPVTFETPMFWTEQQREQ-LAGTDIADR-IGR-EDAEAE 150

Query: 214 YNDLRLRIFSKYPDLFPEEV--FNMETFKWSFGILFSRLVRLPSMD-GR----------- 259
           Y  +       +PDLFP +     ++ F      + SR   +P    GR           
Sbjct: 151 YTSVLAPFIKAHPDLFPIDSPHITIDAFHIQGSRILSRSFTVPLHRFGRSQSQTRSDSNS 210

Query: 260 ------------VALVPWADMLNHSC---EVETFLDYDKS---SQGVVFTTDRQYQPGEQ 301
                       V ++P+ADMLN +        ++D D +   ++GVV  + +  +  EQ
Sbjct: 211 EMEGDGEEEEEVVVMIPFADMLNAAWGKDNAHLYIDEDTNEGFNEGVVMKSIQLVKQSEQ 270

Query: 302 VFISYGKKSNGELLLSYGFV 321
           ++ +Y    N ELL  YG +
Sbjct: 271 IYNTYDSPPNSELLRKYGHI 290


>gi|336262426|ref|XP_003345997.1| hypothetical protein SMAC_06551 [Sordaria macrospora k-hell]
 gi|380089590|emb|CCC12472.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 482

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 90/200 (45%), Gaps = 18/200 (9%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFS 223
           S+ W+ YI  LP+       WT  E  R L      E A+  ++T +   ++ +R    S
Sbjct: 111 SNPWTEYIKFLPKTVLVPTLWTEDE--RLLLRGTSLESAVNAKMTALTAEFDAVR-EAAS 167

Query: 224 KYPD----LFPEEVFN----METFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV 275
             P     L+P E  N    +  +     +  SR++ LP      ++VP  DM+NHS   
Sbjct: 168 SLPTWNDVLWPFENGNSPASLRNWILLDALYRSRVLELPK--SGESMVPCIDMINHSTCA 225

Query: 276 ETFLDYDKSSQGVVF-TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
             + D +   + ++    DR    G++V ISYG  K   E+L SYGF+  E T   +S+ 
Sbjct: 226 SAYYDENTKDEVILLPRPDRTISSGKEVTISYGDAKPAAEMLFSYGFIDPETT--VESLV 283

Query: 334 LPLSLKKSDKCYKEKLEALR 353
           LPL     D   K KL A +
Sbjct: 284 LPLEPFGDDPLEKAKLFAFK 303


>gi|119493213|ref|XP_001263813.1| hypothetical protein NFIA_070870 [Neosartorya fischeri NRRL 181]
 gi|119411973|gb|EAW21916.1| hypothetical protein NFIA_070870 [Neosartorya fischeri NRRL 181]
          Length = 362

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 19/122 (15%)

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------DGRVALVPWADMLNHS 272
           + S +PD+      + ET+ +++ I+ +R     +P        +  +AL+P+AD  NHS
Sbjct: 124 VVSVFPDV------DWETYSYNWLIVNTRSFYYLMPGQKPPEDRNDAMALLPFADYFNHS 177

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
            +VE  + +D   Q  VF   R+Y  GE++++SYG   N  LL+ YGF   E  N SD++
Sbjct: 178 -DVEDDVKFD--GQKYVFRATRRYDSGEEIYMSYGPHPNDFLLVEYGFYLDE--NGSDAI 232

Query: 333 EL 334
            L
Sbjct: 233 YL 234


>gi|444320075|ref|XP_004180694.1| hypothetical protein TBLA_0E01160 [Tetrapisispora blattae CBS 6284]
 gi|387513737|emb|CCH61175.1| hypothetical protein TBLA_0E01160 [Tetrapisispora blattae CBS 6284]
          Length = 615

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 8/109 (7%)

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQG-VVFTTDRQYQPGEQVFISYGKKSNGELLL 316
             V L P  D+LNHS       + + + +  + F T    +  +++F +YG KS  + LL
Sbjct: 244 NSVFLFPIVDLLNHSNNSNVIWNLNPNDKNSICFNTIDPIEKSQELFNNYGNKSTEDFLL 303

Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           SYGF+ +E T P D   L L L KS       ++ L+ +GL  ++ F +
Sbjct: 304 SYGFILKEET-PFDYASLTLRLDKS------IIQNLKNFGLGLNDNFIV 345


>gi|402862437|ref|XP_003895567.1| PREDICTED: SET domain-containing protein 4 [Papio anubis]
          Length = 456

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 72/310 (23%), Positives = 125/310 (40%), Gaps = 36/310 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
           L+KWL         +         RGL++  ++++G+ ++ +P S ++TAD+       G
Sbjct: 36  LRKWLKARKFQDSHLVPACFPGTGRGLMSQTSLQEGQMIISLPESCLLTADTVIR-SYLG 94

Query: 139 EVLKQCSVPDWPLLA--TYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEA 196
             + +   P  PLLA  T+L+SE      S W  Y+  LP+  Y+       E+   L  
Sbjct: 95  AYITKWKPPPSPLLALCTFLVSEKHAGDRSLWKPYLEILPKA-YTCPVCLEPEVVNLLPK 153

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRL 253
           S ++ +A E+  +V   +   R   FS    LF E V   F+     W++  + +R V L
Sbjct: 154 S-LKAKAEEQRAHVQEFFASSR-DFFSSLQPLFVEAVDSIFSYSALLWAWCTVNTRAVYL 211

Query: 254 -PSMDGRVALVPWADMLNHSC-----------------------EVETFLDYDKSSQGVV 289
            P     ++  P    L  +C                         +    +++ +    
Sbjct: 212 RPRQRECLSAEPDTCALAPACLCPFSPLCSSGLQNLMLHPLSGLSQQVKAAFNEETHSYE 271

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK---KSDKCYK 346
             T  +++  E+VFI YG   N  L L YGFV     +    V   + +K     DK   
Sbjct: 272 IRTTSRWRKHEEVFICYGPHDNQRLFLEYGFVSVHNPHACVYVSREILVKYLPSRDKQMD 331

Query: 347 EKLEALRKYG 356
           +K+  L+ +G
Sbjct: 332 KKISILKDHG 341


>gi|189236574|ref|XP_975615.2| PREDICTED: similar to SET domain containing 3 [Tribolium castaneum]
          Length = 667

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 9/145 (6%)

Query: 241 WSFGILFSRLVRLPSMDGRVALVPWADMLNHS-CEVETFLD--YDKSSQGVVFTTDRQYQ 297
           W+   + +R   +P  +   AL+P  DM NH+   + T  +   D+S    V    + ++
Sbjct: 432 WAVSTVMTRQNTIPFQEDYYALIPLWDMCNHTNGTISTAYNPVLDRSECLAV----KNFK 487

Query: 298 PGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
            GEQ+FI YG +SN +L +  GFV     N  D   + L + KSD   +++   L K  +
Sbjct: 488 AGEQLFIFYGSRSNADLFVHNGFVFE--NNDYDVYWIRLGISKSDPLQQKRGHLLGKLSI 545

Query: 358 SASECFPIQITGWPLELMAYAYLVV 382
           +++  F I+    P++    A+L V
Sbjct: 546 ASTCDFSIRKGASPIDGQLLAFLRV 570


>gi|115492035|ref|XP_001210645.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114197505|gb|EAU39205.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 514

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 61/119 (51%), Gaps = 7/119 (5%)

Query: 234 FNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD 293
            +++ +K+   +  SRLV LP      A+VP  DM NH+ +      Y++ ++G      
Sbjct: 196 LDIDDWKYVDALYRSRLVDLPRSGH--AMVPCVDMANHASDDTVKALYEEDAEGNALLQL 253

Query: 294 RQYQ---PGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEK 348
           R+ Q   PG++V ISYG +K   E+L SYGF+P E    +  V L LS+   D     K
Sbjct: 254 REGQVLHPGDEVTISYGSEKPAAEMLFSYGFLP-EDKEDAGQVFLDLSIPDDDPLRNHK 311


>gi|400597281|gb|EJP65016.1| SET domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 484

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 63/280 (22%), Positives = 108/280 (38%), Gaps = 47/280 (16%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT--------------ADSKWSCPEAG----EVLKQC 144
           RG+VAL++I     L  +P   +I               +D +   P+ G    + L   
Sbjct: 40  RGIVALRDIAPETVLFTIPRQSIINVETSGLRSQLPQLFSDEEGLAPQHGVADDDPLSSS 99

Query: 145 SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
            +  W  L   L+ E     +S W  Y+  LP    + ++WT AEL      +   +   
Sbjct: 100 PLDAWGALILVLLYEHLRGAASAWRPYLDVLPATFETPMFWTGAELGALQAGATAGKVGR 159

Query: 205 ERITNVIGTYNDLRLRIFSKYPDLF------PEEVFNMETFKWSFGIL------------ 246
           E   +   T+  + L +   +PD+F       +E       +    I+            
Sbjct: 160 ESAED---TFRGILLPVVRAHPDVFQGSAALSDEALVALAHRMGSTIMAYAFDLENDEER 216

Query: 247 ---FSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                    +   DG+  + +VP AD+LN   E    +++  +   +  T  R  + GE+
Sbjct: 217 EDEEDEDGWVEDRDGKAMMGMVPMADILNADAEFNAHVNHGDNE--LTVTALRPIKAGEE 274

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
           +   YG   N ELL  YG+V  E  +  D VE+P  L ++
Sbjct: 275 ILNYYGPHPNSELLRRYGYV-TERHSRYDVVEIPWELVEA 313


>gi|45552859|ref|NP_995955.1| CG33230 [Drosophila melanogaster]
 gi|45445739|gb|AAS64931.1| CG33230 [Drosophila melanogaster]
 gi|223364426|gb|ACM86246.1| MIP03820p [Drosophila melanogaster]
          Length = 446

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 98/215 (45%), Gaps = 37/215 (17%)

Query: 151 LLATYLISEASFEK------SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAI 204
           L+A Y++     ++      SS ++ Y+  LPR   +  + +  EL    E+  + ER +
Sbjct: 105 LIACYILYHKHLQECTLGTHSSTYAAYLDTLPRCYSTPYFCSIPELQCLPES--LLERTV 162

Query: 205 ERITNVIGTYNDLRLRIFSKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPSM----- 256
            +   + G +  ++  I  K       + +E++ +  FKW++  + +R V L S      
Sbjct: 163 AQNRQIRGYFEIIK-NIVHKCDCCGKSYGQEIWTLADFKWAYFSVNTRSVHLSSRFLKKQ 221

Query: 257 ----------DGRVALVPWADMLNHSCEVETFL-----DYDKSSQGVVFTTDRQYQPGEQ 301
                     D  +AL P+ D+ NHS +VE        DY  + + + F+  +   P +Q
Sbjct: 222 SNYFQPLISGDTNLALAPFLDLFNHSDQVEITAGIEGPDYVLTLKSLPFSETK---PYDQ 278

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           +FISYG   N +LL  YGF  RE  N  D  E+ L
Sbjct: 279 LFISYGALPNFKLLTEYGFWLRE--NKHDYFEVSL 311


>gi|213408453|ref|XP_002174997.1| SET domain-containing protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212003044|gb|EEB08704.1| SET domain-containing protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 441

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 65/267 (24%), Positives = 102/267 (38%), Gaps = 57/267 (21%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCS----VPDWPLLATYLISE 159
           G+VA+ NI+  E ++F P   V+          +G  L+       +P+W  L   +++E
Sbjct: 39  GIVAVDNIKADETVVFFPKDSVMKV--------SGSYLQHLEGIEELPNWAALLLLMMNE 90

Query: 160 ASFEKSSRWSNYISALPRQPY--SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
            +    S W  YIS  P +    SL YW  AE  + L  S + E   +R + V   + + 
Sbjct: 91  KN-NPESFWKPYISVFPTKERITSLFYWD-AEKQKRLLKSTVLENMQDR-SEVKTVWKET 147

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFS-----RLVRLPSMDGRVA----------- 261
            L    K       E   +E F+    ++ S     + ++  + D + A           
Sbjct: 148 VLPFIDKNKSKL-REGLTLEDFEHMAAVMSSYSFDVKRIKTENNDSQKASKQMDVDNSEH 206

Query: 262 ----------------------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
                                 + P ADM N   E+     YD    G      R  + G
Sbjct: 207 SENNEDDSDLESEYDPEVFEKAMCPIADMFNGDDELCNVRMYD-LEDGYHMMVTRDIEKG 265

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGT 326
           EQ++ +YG   NGELL  YGF   +GT
Sbjct: 266 EQLWNTYGDIDNGELLRKYGFTKPDGT 292


>gi|413950742|gb|AFW83391.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
 gi|413950743|gb|AFW83392.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 252

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           S+  R ALVP    +L +    +  L  D  S  V    DR Y+ GE + I  G ++N  
Sbjct: 17  SLARRFALVPLGPPLLTYKSNCKAMLTVDGES--VRLVVDRPYKAGEPIIIWCGPQTNSR 74

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           L+L+YGFV  +  NP D + +  SL   D  Y+EK    ++ G  A + F + +
Sbjct: 75  LVLNYGFV--DENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYV 126


>gi|366987955|ref|XP_003673744.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
 gi|342299607|emb|CCC67363.1| hypothetical protein NCAS_0A08050 [Naumovozyma castellii CBS 4309]
          Length = 499

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 126/333 (37%), Gaps = 68/333 (20%)

Query: 73  LENASTLQKWLSDS---GLPPQKMAIQKVDVGE-RGLVALKNIRKGEKLLFVPPSL---V 125
           L+N      WL++S    L P+       D  + R ++A ++I+  E L  +P      V
Sbjct: 7   LKNTENFHSWLTNSVGYKLSPKIKIADGRDTNQGRFILATEDIKTDELLFEIPRESILNV 66

Query: 126 ITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKS-SRWSNYISALPRQP--YSL 182
           +T+      P    +L    V  W  L   ++ E   +K+ S+W+ Y   LP      SL
Sbjct: 67  LTSSLVSEYPAWENILLDGDVGHWEGLIICMLFEIKVKKNMSKWAPYFDVLPESTDLNSL 126

Query: 183 LYWTRAEL---------DRYLE--ASQIRERAIERITNVIGTYNDLRLRIFSKYPDL-FP 230
           +YWT  EL         DR     A Q+ E+ +E I            R F K   +   
Sbjct: 127 MYWTAEELEALKPSLVLDRIGNDGAHQMHEKVMELI------------RTFEKDHSVDLS 174

Query: 231 EEVFNMETFKWSFGIL--FSRLVRLPSMDGRV----------------------ALVPWA 266
                 E F +   I+  +S  V LP                            +++P A
Sbjct: 175 FGTITWEDFLYVASIIMSYSFDVELPPTSADENEEDDEVEEDVEQTVRNEGSLKSMIPLA 234

Query: 267 DMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPR 323
           D LN   + C      D D      +       + GEQV+  YG   N E+L  YG+V  
Sbjct: 235 DTLNSDTNKCNAHLIYDEDSLKMRAI----SNIKAGEQVYNIYGNHPNAEILRRYGYVEW 290

Query: 324 EGTNPSDSVELPLS--LKKSDKCYKEKLEALRK 354
           EG+   D  ELPL   ++   + Y   +E +RK
Sbjct: 291 EGSK-YDFGELPLEVIIETLHEQYDIPIEKIRK 322


>gi|298712711|emb|CBJ48736.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 1030

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 111/267 (41%), Gaps = 34/267 (12%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGER-GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
              W      P +K+    +  G R GLVA + I +G+  L VP S+++ A    + PE 
Sbjct: 486 FNAWTRTVDFPIRKVEAGLIGNGMRLGLVATEAIPQGQTYLSVPGSIILDASKARTDPEL 545

Query: 138 GEVLKQCSVPDWPL---------LATYLISEASFEKS--SRWSNYISALPR----QPYSL 182
           G  L +      PL         L   LI+E +F ++  S W+ Y+  LP     + Y  
Sbjct: 546 GPPLARLEASLGPLGLWDQDTDSLRVLLIAE-TFVRADLSPWAPYLRLLPTLSEMEAYHP 604

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYND---LRLRIFSKYPDLFPEEVFNMETF 239
           L++  A +  + E S ++    ER  + +  +     +      +  D+       +E +
Sbjct: 605 LFFDNATIASF-EGSDVQASLRERRDSEMAGFTTKFAVEGSAGRELQDVLGVGWITLERY 663

Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEV------ETFLDYDKSSQGVVFTTD 293
            W+  I+ SR +      GR  LVP  D++N + +       ET  D D S+   V    
Sbjct: 664 LWAAAIVDSRCIW---WGGRKHLVPLLDLVNDARDSPLDFVHETLQDSDGSA---VTAAA 717

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGF 320
           R    G+QV   YG   N  L+  +GF
Sbjct: 718 RNVDKGDQVMEDYGHP-NHVLIFEHGF 743


>gi|440636605|gb|ELR06524.1| hypothetical protein GMDG_02159 [Geomyces destructans 20631-21]
          Length = 682

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 81/185 (43%), Gaps = 33/185 (17%)

Query: 155 YLISEASFEKSSRWSNYISALPR--QPYSL---LYWTRAELDRYLEASQIRERAIERITN 209
           +LI +     SS W+ YI +LP+  +P+ L   LY+   E  R+L  + +     +R   
Sbjct: 137 FLIQQYLRGSSSHWAPYIRSLPQPDEPHKLATPLYYPE-EARRWLGGTNLPAAIAQREGM 195

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-------L 262
             G +        SK   L P+EV+          +L   +   PS   ++A       L
Sbjct: 196 WRGDF-------VSK---LIPDEVYG--------DVLDQPVDGYPSWREKIAEEGPYPVL 237

Query: 263 VPWADMLNHSCE--VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
            P  D+ NH  +  VE F++     +     TD     GEQVF +Y  K N ELLL YGF
Sbjct: 238 FPLLDIANHDAKAWVEWFVNAQGPVKDFSIITDAAIGEGEQVFNNYAPKGNTELLLGYGF 297

Query: 321 VPREG 325
           + R G
Sbjct: 298 LRRRG 302


>gi|432119396|gb|ELK38474.1| N-lysine methyltransferase SETD6 [Myotis davidii]
          Length = 322

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 84/187 (44%), Gaps = 21/187 (11%)

Query: 183 LYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWS 242
           ++W   E  R L+ + + E   + + N+   Y+ + L     +PDLF   V ++E ++  
Sbjct: 1   MFWPEGERRRLLQGTGVPEAVEKDLANIRSEYHSVVLPFMEAHPDLFSPRVRSLELYQQL 60

Query: 243 FGIL--FSRLVR------LPSMDGRVA-----LVPWADMLNHSCEVETFLDYDKSSQGVV 289
             ++  +S+++       L   D         +VP AD+LNH  +    L+Y  +   +V
Sbjct: 61  VALVMAYSQVLSGSFQEPLEEEDDEKEPNPPLMVPAADILNHVAKHNANLEYSPNCLQMV 120

Query: 290 FTTDRQYQP---GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYK 346
            T     QP   G ++F +YG+ +N +L+  YGF      N  D+ ++ +   +      
Sbjct: 121 AT-----QPIPKGREIFNTYGQMANWQLIHMYGFAEPYPDNTDDTADIQMVTVRKAALQG 175

Query: 347 EKLEALR 353
            K EA R
Sbjct: 176 TKDEAER 182


>gi|428167603|gb|EKX36559.1| hypothetical protein GUITHDRAFT_155193 [Guillardia theta CCMP2712]
          Length = 321

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 60/260 (23%), Positives = 111/260 (42%), Gaps = 32/260 (12%)

Query: 74  ENASTLQKWL-SDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           E+ +   KW  S+ G+   K+ + KV  G +G+   + +R+GE ++  P +L +   +  
Sbjct: 69  EDWTAFVKWFRSNGGIISSKLTV-KVRNGRQGVYFKERMRRGETIVSFPRNLRLDEKTAM 127

Query: 133 SCPEAGEVLKQCS----VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRA 188
              +AG V ++       PD  ++  +++ E    K S W  Y   L RQ  ++++ T  
Sbjct: 128 KG-KAGHVFQRLKQDKCYPDL-MVILHVVHEDKLGKDSFWFPYFKLLRRQYNNIMFLTEP 185

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF---------PEEVFNMETF 239
           ++   L     R    E   N+      +  R F+ + + +         PE  F  +  
Sbjct: 186 QMKTLL-----RRPGCENTYNL----GVMMRRTFNNFYEWYKKNIEPWAPPEFQFTRDEI 236

Query: 240 KWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
            W F  L +      + +G   +VP++D+ NH  E        K+S     +  ++YQ G
Sbjct: 237 LWGFNTLVTCGWGQQNGNGDKLMVPFSDIPNHRRESA-----QKASNRGFISAAKEYQAG 291

Query: 300 EQVFISYGKKSNGELLLSYG 319
           E++   YG   N  +L  YG
Sbjct: 292 EELTFDYG-LLNDAVLAYYG 310


>gi|308806489|ref|XP_003080556.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
 gi|116059016|emb|CAL54723.1| SET-domain transcriptional regulator-like protein (ISS)
           [Ostreococcus tauri]
          Length = 394

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 66/268 (24%), Positives = 117/268 (43%), Gaps = 22/268 (8%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS--KWSCPE-AGEVLKQCS-VPDWPLLATYLIS 158
           R + A++ +  GE +  VP   ++  +   + S P    E+LKQ + + D  ++  +L +
Sbjct: 2   REVRAVERVEAGECVARVPWDALLGVEQTVETSSPSPTSEILKQLTRMGDQIIMVIWLTA 61

Query: 159 EA-SFE-----KSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIG 212
              +FE         W+  + ALP +  S L W   +L   +    +  R  E   +V  
Sbjct: 62  ALDAFECGDASAYEEWAPALRALPTRASSSLAWNADDLG-AVAGEDLANRLREYRRSVKV 120

Query: 213 TYNDLRLRIFSKYPDLFPEEVF-NMETFKWSFGILFSRLVRLPSMDG---RVALVPWADM 268
            Y+ L   +  + P+ FP   F +   F+ ++ I  S  +++   D    R  +VP   +
Sbjct: 121 QYDALFPALCEQVPEAFPARAFGDYAKFERAYDIWTSYAMKVQDPDSLQIREVIVPGVFL 180

Query: 269 LNHSCEVET--FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
            NHS    +  +   ++ ++       R    GE + ISYG+  N +LL+ YGF      
Sbjct: 181 CNHSLSAHSVRYTSLERGTKAFRLELSRGCVEGEAITISYGRLDNADLLMFYGFSLE--N 238

Query: 327 NPSDSVELPLSLKKSDKCYKEKLEALRK 354
           NP D V L      +++    +LEALR 
Sbjct: 239 NPYDRVSLHSITGDANET---QLEALRH 263


>gi|451999637|gb|EMD92099.1| hypothetical protein COCHEDRAFT_1134267 [Cochliobolus
           heterostrophus C5]
          Length = 476

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 74/293 (25%), Positives = 117/293 (39%), Gaps = 44/293 (15%)

Query: 82  WLSDSG--LPP--QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           WL  +G  + P  Q   ++  D G RG+ A ++I + E L  +P S +++ ++     E 
Sbjct: 14  WLKHTGAQINPKIQLEDLRAKDAG-RGVAAKQDIAEHELLFSIPRSSILSVENSILSTEI 72

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
                    P W  L   ++ E     +S W+ Y + LP    +L++WT  EL   L+AS
Sbjct: 73  PPTTFALLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELTE-LQAS 130

Query: 198 QI---------RERAIERITNVIGTYNDLRLRIFSKYPDLFPE----EVFNMETFKWSFG 244
            +          E  IE++  VI  + D+      +  DL  E    E   +     S  
Sbjct: 131 AVVNKIGKEGANEVFIEQLLPVIEEFADVIFSGDERAKDLAKEMRAPENLELMHKMGSLI 190

Query: 245 ILFSRLVRLPSMDGRV----------------ALVPWADMLNHS---CEVETFLDYDKSS 285
           + ++  V     D  V                 +VP ADMLN     C    F + D   
Sbjct: 191 MAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADADRCNARLFYEKD--- 247

Query: 286 QGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            G+     +  Q GE++F  YG     +LL  YG++  E     D VE+P  L
Sbjct: 248 -GLEMKALKPIQAGEEIFNDYGPLPRSDLLRRYGYIT-ENYAQYDVVEIPADL 298


>gi|308804211|ref|XP_003079418.1| N-methyltransferase (ISS) [Ostreococcus tauri]
 gi|116057873|emb|CAL54076.1| N-methyltransferase (ISS), partial [Ostreococcus tauri]
          Length = 305

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 60/123 (48%), Gaps = 13/123 (10%)

Query: 261 ALVPWADMLNHS--CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
            +VP+ DMLNH+  C     L++D   +G+   T R+ + GE+VF +YG   N ELL  Y
Sbjct: 171 GMVPFWDMLNHAPPCAASVRLNHD-PKRGLQMITVREVKKGEEVFNTYGPLRNAELLRRY 229

Query: 319 GFV----PREGTNPSDSVELPLSLKKSDKCYKE---KLEALRKYGLSASEC---FPIQIT 368
           GFV    P  GT       +  ++  +   Y+E   +L  L   GL+  E    F +  T
Sbjct: 230 GFVLARNPHGGTTVGLDEVIEAAMMANPDLYEELPLRLAWLESRGLADEELSTRFFVHQT 289

Query: 369 GWP 371
           G P
Sbjct: 290 GRP 292


>gi|413950744|gb|AFW83393.1| hypothetical protein ZEAMMB73_866859 [Zea mays]
          Length = 281

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/114 (32%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           S+  R ALVP    +L +    +  L  D  S  V    DR Y+ GE + I  G ++N  
Sbjct: 17  SLARRFALVPLGPPLLTYKSNCKAMLTVDGES--VRLVVDRPYKAGEPIIIWCGPQTNSR 74

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           L+L+YGFV  +  NP D + +  SL   D  Y+EK    ++ G  A + F + +
Sbjct: 75  LVLNYGFV--DENNPFDRISIEASLNTEDPQYQEKRMVAQRNGKHAIQNFNVYV 126


>gi|384501024|gb|EIE91515.1| hypothetical protein RO3G_16226 [Rhizopus delemar RA 99-880]
          Length = 354

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 65/254 (25%), Positives = 115/254 (45%), Gaps = 41/254 (16%)

Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPR----QPYSLLYWTRAELDRYLEASQIRER- 202
           D  +L  +LI    F ++++W  Y+  LP     Q   +L+         LE S IR + 
Sbjct: 6   DRTILCLFLIYYRFFNENTKWKPYMDILPTLEFFQKTHVLFNPGTVKGTCLENS-IRSKI 64

Query: 203 -AIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA 261
            ++ER    I  Y   R+                ++ + W+   ++SR+V +   +  +A
Sbjct: 65  SSLERELEEINQYWPTRIE---------------LDMYLWADCTVWSRVVGIT--ETEIA 107

Query: 262 LVPWADMLNHSCEVETFLDYD-KSSQGVVFTTDRQYQP-GEQVFISYGKKSNGELLLSYG 319
           LVP+ D+ NHS   E+ + ++    +G++  T +  +   E++ + YG KSN ELL  +G
Sbjct: 108 LVPYFDLANHSLN-ESNIKWELTDDEGLMLVTTKDIKSQDEELTLFYGSKSNQELLFLHG 166

Query: 320 FVPREGTNPSDS-VELPLS--LKKSDKCYKEKLEALRKYGLS---------ASECFPIQI 367
           F  ++  NP  S + +PL   L  SD     K++ L+  G            S   P+  
Sbjct: 167 FCIQD--NPETSRITIPLMPFLDLSDPVDISKIQWLKSVGAKPILTLMGSRTSNLDPLVA 224

Query: 368 TGWPLELMAYAYLV 381
            GW ++ +A  YL+
Sbjct: 225 DGWTVDSVAALYLI 238


>gi|281208586|gb|EFA82762.1| hypothetical protein PPL_04457 [Polysphondylium pallidum PN500]
          Length = 534

 Score = 55.5 bits (132), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 66/310 (21%), Positives = 126/310 (40%), Gaps = 63/310 (20%)

Query: 81  KWLSDSGLP-PQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE 139
           +W  ++G+     + I + +   + + A+ +I  G  L+ VP SL+I  +     P + +
Sbjct: 3   EWSKENGIVWNDHLEIYQDNTSGQSVRAINDIAAGSLLVSVPESLLIHINK----PVSEQ 58

Query: 140 VLK-QCSVPDWPLLATY--LISEASFEKS----SRWSNYISALPRQPYSLLYWTRAELDR 192
           +L   C      LL     LI   + EK+    S+W  Y++ +P+Q  +   +T  E++ 
Sbjct: 59  LLSLSCIDKVLDLLDNVQRLIFHINVEKAIGEKSKWYRYLNDIPQQYDTSSMYTDEEIED 118

Query: 193 YLEASQIRERAIERITNVIGTYNDL----------------------------RLRIFSK 224
            L     +E A +    ++ +YN                               +  F+ 
Sbjct: 119 -LTYPYYKEEAYKLKHELLQSYNSFIDIIDNHFNIDNNNNSNISNNNNNNNNNSIDSFTL 177

Query: 225 YPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL-------------------VPW 265
             ++  + + N++ FKW++ ++ +R     + +                        +P 
Sbjct: 178 LKEI-KQNLTNLDNFKWAWAVIQTRTYYFNNNNYVNNNNNNNKRKYSNSNNNNNNSSIPM 236

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
           AD+ NH  +V T   ++   +     T  +++  EQV+ISYG  SN  LL  YGF     
Sbjct: 237 ADLFNHRYDVVTRAVFNDEERCFQVFTGTEFKKNEQVYISYGNHSNATLLHFYGFAI--D 294

Query: 326 TNPSDSVELP 335
            NP DS+ +P
Sbjct: 295 NNPLDSIVIP 304


>gi|322696758|gb|EFY88546.1| 2-hydroxyacid dehydrogenase, putative [Metarhizium acridum CQMa
           102]
          Length = 1025

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 73/278 (26%), Positives = 109/278 (39%), Gaps = 44/278 (15%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           +E    L  W    G+    +A Q++     G VA ++I+ G+ L+ +P   ++  DS  
Sbjct: 427 MEGLEPLINWARTRGVELDGVAPQQMPGRGIGAVATRSIKAGQVLMTIPARAILRLDSVL 486

Query: 133 SC-----PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
           +      P A  +          LLA  L + +  E + R     S       + L+W R
Sbjct: 487 ASISSRLPSASSIHG--------LLAAQLAASSDAETTLRRDAMPSLQSFAATTPLFWHR 538

Query: 188 AELDRYLEASQIR-----ERAIERITNVI-----GTYNDLRLRI-FSKYPDLFPEEVFNM 236
             L   L A   R     E A+ER          G   D  LR  F      F  E    
Sbjct: 539 -RLQDLLPAGARRLVDRQEAALERDWAAFHEAFPGVARDAYLRCWFLVGTRAFYHETDAT 597

Query: 237 ETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
             + W               + R+AL+P ADM NH+      + +  S +    T  R  
Sbjct: 598 LLYPW---------------EDRLALLPVADMFNHAGVPGCSVAF--SPEAYTVTATRAC 640

Query: 297 QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
             G++VF+SYG+ SN  LL  YGF+  +  N  DSV+L
Sbjct: 641 ARGDEVFLSYGEHSNDFLLAEYGFLLDD--NQWDSVDL 676


>gi|255723006|ref|XP_002546437.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240130954|gb|EER30516.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 578

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 81/327 (24%), Positives = 136/327 (41%), Gaps = 60/327 (18%)

Query: 73  LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL--LFVPPSLVITA 128
           LE  ++L KW   +G  L P  +  +++     G +    I   +    + +P  L+IT 
Sbjct: 4   LEKINSLVKWAESNGAELSPD-VQFKEITTNNIGAIYDGKIAPSDNGYPISIPFKLIITT 62

Query: 129 DSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----QPYSLL 183
            +  +  E G+ LK     +   +  + +        S +  Y+  LP       PY+  
Sbjct: 63  QN--AITEFGKYLKSTEDKNSNAILKFYLCHERINADSFYHPYLKLLPSLAAIDSPYT-- 118

Query: 184 YWTRAELDRYLEASQIRERAIERITNVIGTY----NDLRLRI---------------FSK 224
            W+ A+   YL+ + +     E + +++  +    N L+  +               F  
Sbjct: 119 -WS-AQDKTYLKGTNLGNSLKENLGSLVEEWWEVINLLKDEVSKPEQHFINMKFYYDFKF 176

Query: 225 YPD------LFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPWADMLN 270
           Y D      L  E++ N  +F    W+  IL SR     L+          L+P  D+LN
Sbjct: 177 YTDDDLDKYLNEEDINNWTSFPNYLWASLILKSRSFPAYLIDKSCNKNDAMLLPVVDLLN 236

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
           H+ + +   D    S G         QPG+++F +YG K N ELLL+YGF   EG NP D
Sbjct: 237 HNPQAKVNWD---VSDGFFRFKSESIQPGKEIFNNYGLKGNEELLLAYGFCI-EG-NPRD 291

Query: 331 SVELPLSLKKSDKCYKEKLEALRKYGL 357
           SV L +      K  +EKL+ + + G+
Sbjct: 292 SVALKI------KMPEEKLKEIEEQGI 312


>gi|313239201|emb|CBY14158.1| unnamed protein product [Oikopleura dioica]
          Length = 393

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 60/270 (22%), Positives = 114/270 (42%), Gaps = 39/270 (14%)

Query: 92  KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCS-- 145
           K+ I   D G RG+ +   I + E L+ VP   ++T        E    A +VL+  S  
Sbjct: 22  KLKISDGDCG-RGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTC 80

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
           +    LL   L  E   E++S+WS ++S++P Q ++       EL+      ++  +  +
Sbjct: 81  LNGTDLLVCALFLELENEENSKWSAFLSSIPNQLWNPFMLDEKELNLLTAKCRLPSKCFK 140

Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMD--- 257
           +           +++I +++      E+ N E   W F ++ SR       R  + +   
Sbjct: 141 Q-----------KIKISTEFLKALGFEI-NEEILNWCFSVVLSRSFGGSSERCETRNHFK 188

Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKS 310
                     L P  D++NH  E      +++        + ++   G+++F++YG  KS
Sbjct: 189 IEIDNSANFCLCPAIDLINHEKEYNCEYRWNEDKTAFQVFSRKKILQGQELFVNYGTTKS 248

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKK 340
             E+   YGFV      PSD  ++   L++
Sbjct: 249 EYEIYNFYGFVL-----PSDDFQVEFELQR 273


>gi|428170888|gb|EKX39809.1| hypothetical protein GUITHDRAFT_114060 [Guillardia theta CCMP2712]
          Length = 476

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 68/254 (26%), Positives = 104/254 (40%), Gaps = 42/254 (16%)

Query: 103 RGLVALKNIRKGEKLLFVP--PSLVITAD---SKWSCPEAGEVLK--QCSVPDWPLLATY 155
           RG+ AL+ + +GE ++ +    S  I+ D      S  E  + L+  +  + D  LL  Y
Sbjct: 28  RGIWALEQVEEGEVVMRLANSASFRISGDFIRQSSSLAEGVQALEASEGRLADDLLLTLY 87

Query: 156 LISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY 214
           L S    E S  +S Y+ +LP  +P   ++W  AEL      +     A+         Y
Sbjct: 88  LASSRKKEGSFPFSEYVRSLPLEKPDLPIFWDSAELQTLPRMTMTLVEAMRE------EY 141

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA----------LVP 264
            D   +I  K  +    +V + E  KW + ++ SR +R   + G               P
Sbjct: 142 EDYSSKI-RKVTEALQVQVDD-EDVKWGYAMVKSRALREKVVKGSSGQDVPETLERYCFP 199

Query: 265 WADMLNHSCEV----------ETFLDYDKSSQGVV------FTTDRQYQPGEQVFISYGK 308
            ADM NH              +  +    S +G V      FT  R    G +V  +YG+
Sbjct: 200 LADMFNHEPSAVPPPASDLLRQADVHRGPSVRGSVVGDHFEFTATRAIPAGSEVSWTYGQ 259

Query: 309 KSNGELLLSYGFVP 322
            +N ELLL YGFVP
Sbjct: 260 LTNEELLLRYGFVP 273


>gi|328873307|gb|EGG21674.1| SET domain-containing protein [Dictyostelium fasciculatum]
          Length = 514

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 76/158 (48%), Gaps = 15/158 (9%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGE-----RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           +  KWLSD+G    K A  KV +        GLVAL +I +G++ + VP  L +T ++  
Sbjct: 65  SFTKWLSDNGC---KEAFDKVKIVRGLTEGSGLVALGDIGEGDEFIAVPSKLFMTQET-- 119

Query: 133 SCPEAGE-VLKQCSVPDWP--LLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAE 189
           +    GE V+++      P  LL  +LI E      S W+ YI  LPR   ++L +T  +
Sbjct: 120 AIKSIGEKVIREPLFRYIPSLLLTVHLIQEQLIMPKSFWAPYIRMLPRTYRTILQFTMDD 179

Query: 190 LDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPD 227
             R L  S + E AI    N +  Y  L    F+K PD
Sbjct: 180 F-RALLGSAVLEEAISTYRNTLRQYCFL-YDFFNKTPD 215


>gi|312101598|ref|XP_003149686.1| hypothetical protein LOAG_14135 [Loa loa]
          Length = 314

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 70/275 (25%), Positives = 108/275 (39%), Gaps = 43/275 (15%)

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
           S W  YI  LP    + L++T  +L + L  S + E ++    NV   +    L I    
Sbjct: 25  SHWQPYIKVLPECFDTPLFFTVEQL-QCLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 83

Query: 223 ----------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR----- 259
                      K  +L P  V        F    ++WS   + +R+  +PS   +     
Sbjct: 84  EFRHRKKKSKDKISELEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVWKDDIGQ 143

Query: 260 ----VALVPWADMLNHSCEVETF---LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
                 L+P+ DM NHS     F   + +            R Y+P E V I YG +SN 
Sbjct: 144 PRMIPGLIPFLDMANHSYTESAFHEAVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNR 203

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGW- 370
           + LL  GF+P E  N  D  +L + L KS +    ++      G  A S  F  +I+   
Sbjct: 204 DFLLHNGFIPLE-KNIRDIYKLKIGLPKSKR-EDARMRLFHALGFVAESTIFAFEISVCE 261

Query: 371 -----PLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
                 L   A  Y++   PS   + EE A+++ N
Sbjct: 262 PYFHDSLFRFAQIYILDEVPSAAEQVEEAASSSDN 296


>gi|302422352|ref|XP_003009006.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261352152|gb|EEY14580.1| SET domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 485

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 53/196 (27%), Positives = 89/196 (45%), Gaps = 20/196 (10%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
           S+ W+ Y+  LPR+      W+  E D +L+ + +      +I  +   +  LR     K
Sbjct: 116 STPWTEYVKFLPREVPVPTMWSEQERD-FLQGTSLELAVSAKIQALTNEFEALR----EK 170

Query: 225 YPDL-FPEEVF----NMETFKWSFGILF--SRLVRLPSMDGRVALVPWADMLNHSCEVET 277
             DL F   +F    N+    W     +  SR + LP +   V++VP  D+ NH+     
Sbjct: 171 SSDLPFWNAIFWDKNNVILADWFLVDAWYRSRSLELPGVG--VSMVPVLDLANHAPTPNA 228

Query: 278 FLDYDKSSQG---VVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
           + +     +G   ++         G++V ISYG  KS  E+L SYGF+  +    +D+V 
Sbjct: 229 YYEESARREGDVELLLRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFI--DPARSTDTVA 286

Query: 334 LPLSLKKSDKCYKEKL 349
           LPL+  + D   K K+
Sbjct: 287 LPLAPLEDDPLSKAKV 302


>gi|71652808|ref|XP_815053.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70880079|gb|EAN93202.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 572

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/139 (27%), Positives = 59/139 (42%), Gaps = 7/139 (5%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
           A L  YL+  + R + +    N    +   +  L  F   P    EE      +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSTLSFFQTMPHSDCEERSMPITLEQFLWA 256

Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           +  L +R           +L+PW D  N++      + YD+     VF      + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDERRGAYVFEVLFPIESGEQI 314

Query: 303 FISYGKKSNGELLLSYGFV 321
           F+ YG  ++ ELLL YGF 
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333


>gi|310800174|gb|EFQ35067.1| SET domain-containing protein [Glomerella graminicola M1.001]
          Length = 485

 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 56/191 (29%), Positives = 88/191 (46%), Gaps = 18/191 (9%)

Query: 168 WSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYP- 226
           W+ YI  LP Q      WT  E  + L  + +      +I  +   +++LR  + S  P 
Sbjct: 121 WTEYIKYLPPQVPVTTLWTVRE-RQMLNGTSLESATAAKIVALSDEFDELR-EVSSSLPL 178

Query: 227 --DLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
             +LF E    +    W        SR + LP      A+VP  D+ NHS +   +  Y+
Sbjct: 179 WNELFWES-GKVSLIDWVRVDAWFRSRCLELPK--SGEAMVPVLDLANHSSKANAY--YE 233

Query: 283 KSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           ++S+  V    R   +   GE++ ISYG  KS  E+L SYGF+  +  + +D + LPL+ 
Sbjct: 234 QNSKDEVVLLLRPGCRVSSGEEMTISYGDAKSGAEMLFSYGFI--DPASAADRITLPLTP 291

Query: 339 KKSDKCYKEKL 349
            + D   K KL
Sbjct: 292 LEDDPLGKAKL 302


>gi|238880307|gb|EEQ43945.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 579

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 79/332 (23%), Positives = 143/332 (43%), Gaps = 71/332 (21%)

Query: 73  LENASTLQKWLSDSG--LPPQKMAIQKVDVGERGLVALKNIRKGEKL-------LFVPPS 123
           +E+ + L KW   +G  + P       V+  E     +  I KG K+       + +P  
Sbjct: 4   IESINKLLKWAESNGAQISPD------VEFKEISKNYIGAIYKGNKVPDSPFCPISIPSK 57

Query: 124 LVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR-----Q 178
           L+IT  + +   E  + LK   + D  +L  +L  E     +S +  Y++ LP       
Sbjct: 58  LIITPQTAFK--EFSKSLKNTDINDNSILKLHLCHE-RLNGNSFFYPYLNLLPSLSEIDS 114

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVI--------GTYNDL--------RLRIF 222
           PY+   W+ A    YL+ + +     + +  ++          ++DL         ++ +
Sbjct: 115 PYT---WS-ANDKSYLQGTNLGNSLKKNLVTLVEEWWKAINALHDDLPKPEQHYINMKFY 170

Query: 223 SKYP---------DLFPEEVFNMETFK---WSFGILFSR-----LVRLPSMDGRVALVPW 265
            +Y           L  E + N  +F    W+  IL SR     L+   +      L+P 
Sbjct: 171 YEYKFYTDDDLNKYLNDENIENWTSFPNYLWASLILKSRSFPAYLIDKNNKQDSAMLLPV 230

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREG 325
            D+LNH+ + +  + +D S     F+++    PG+++F +YG K N ELLL+YGF     
Sbjct: 231 VDLLNHNSKSK--VHWDVSDNYFKFSSE-SIVPGKEIFNNYGLKGNEELLLAYGFCIE-- 285

Query: 326 TNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
            N  DSV L + +       +EK++A+ +YG+
Sbjct: 286 NNSQDSVALKIKMP------EEKIKAIEEYGI 311


>gi|145485580|ref|XP_001428798.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124395886|emb|CAK61400.1| unnamed protein product [Paramecium tetraurelia]
          Length = 331

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 56/256 (21%), Positives = 114/256 (44%), Gaps = 30/256 (11%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADS-------- 130
           L +W    G+  + + I ++  G  G+VA + I   + ++ +P  L I ++         
Sbjct: 4   LLQWFESEGIHTESIKIAELTHGCNGVVATQPIPSDQIVIKIPLHLCIFSEDLLKNHYQR 63

Query: 131 -KWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSR--WSNYISALPRQPYSLLYWTR 187
            K   P    +       ++  L  Y++ +   E S    + +Y+    + P ++L WT+
Sbjct: 64  YKKFYPHIFNI-NLNEDAEFNSLVLYILQQRDNEMSLHKPYFDYV----KDPQNILSWTQ 118

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFG 244
            +++  ++ +   ++ I+R+   +G    L+L  F ++   F E+     N + F +++ 
Sbjct: 119 EQVNTIMDEN--LKKTIQRMR--VG----LQLN-FVRFVTFFKEQFKKGLNYDQFLYAYQ 169

Query: 245 ILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
            + +R            LVP+ DMLNH  + +T          +VF T +Q Q  E+++ 
Sbjct: 170 FVMTRCFGGDDHLQSPCLVPFGDMLNHHDKCQT--KQKIIGTDLVFITTKQIQENEEIYN 227

Query: 305 SYGKKSNGELLLSYGF 320
            +G+  N  LL  YGF
Sbjct: 228 FFGEHGNSFLLCWYGF 243


>gi|367036287|ref|XP_003648524.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
 gi|346995785|gb|AEO62188.1| hypothetical protein THITE_2106073 [Thielavia terrestris NRRL 8126]
          Length = 496

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 68/283 (24%), Positives = 111/283 (39%), Gaps = 46/283 (16%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITA-----------------DSKWSCPEAGE---VLK 142
           RG++A  +I     L  +P S ++ A                 D      ++G+      
Sbjct: 40  RGIIAKADIPADTVLFTIPRSAILCAATSALRDKIPDVFDLEGDHGAGHSDSGDEDGAAS 99

Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT---------------- 186
             S   W LL   LI E    ++SRW  Y+  LP    + ++W+                
Sbjct: 100 SSSQDSWTLLILVLIYEHLQGEASRWRPYLDVLPPTFDTPMFWSPTELSELQASALVAKV 159

Query: 187 -RAELDRYLEASQIRE-RAIERITNVIG--TYNDLRL-RIFSKYPDLFPEEVFNMETFKW 241
            RAE DR +EA  +   RA E +    G    +D +L  +  +         F++E    
Sbjct: 160 GRAEADRMIEAKVLPVIRAHEEVFFPPGRAKLDDAQLFELAHRMGSTIMAYAFDLENDDS 219

Query: 242 SFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
                      +   +GR  + +VP ADMLN   E    +++      +  T  R  + G
Sbjct: 220 DNDEADEDDEWVEDREGRTMLGMVPMADMLNADAEFNAHINH--GDDALTATALRPIRAG 277

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           +++   YG   NGELL  YG+V  + +   D VELP  L +++
Sbjct: 278 DEILNYYGPLPNGELLRRYGYVTPKHSR-YDVVELPWELVEAE 319


>gi|380482827|emb|CCF40997.1| SET domain-containing protein [Colletotrichum higginsianum]
          Length = 472

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 64/258 (24%), Positives = 105/258 (40%), Gaps = 34/258 (13%)

Query: 110 NIRKGEKLLFVPPSLVITADSKWSCPEAG-------------EVLKQCSVPDWPLLATYL 156
            +  GE ++  P  L ++  +  S P  G                   SVP   +   +L
Sbjct: 39  GVEPGETIVTCPLDLTLSYLNAASTPSPGFHHEGAAPSSSPFPPSFLASVPPHVIGRFFL 98

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------DRYLEASQIRERAI 204
           I +    K S W  YI  LP QP+ L  W    L            + ++  ++I+ R  
Sbjct: 99  IHQYLLGKESFWYPYIKTLP-QPHHLQSWILPPLWPADDLELLEDTNVHVAVAEIKSRLK 157

Query: 205 ERITNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLP---SMDG 258
               + I ++ D   R  +++    +   +F   +F+ S  I  +R   + LP   ++D 
Sbjct: 158 AEFKHAIASFADDPARHDYTRLLYNWAYCIFTSRSFRPSLVIPAARQPTLSLPEGCAIDX 217

Query: 259 RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELLLS 317
              L+P  D+ NH+       D D  +      T   Y PG QVF +YG  K+N EL+L+
Sbjct: 218 FSLLLPLFDVGNHAPTAAIAWDADADTNKCTLRTLHPYVPGAQVFNNYGTTKTNAELMLA 277

Query: 318 YGF-VPREGTNPSDSVEL 334
           YGF +P      +D V +
Sbjct: 278 YGFCIPESAHLHNDYVHV 295


>gi|392580059|gb|EIW73186.1| hypothetical protein TREMEDRAFT_59348 [Tremella mesenterica DSM
           1558]
          Length = 503

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 108/285 (37%), Gaps = 84/285 (29%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVIT----------ADSKWSCPEAGEVLKQCSVPDWPLLA 153
           G VAL +I     L  +P +L++T          AD +W   + G          W  L 
Sbjct: 44  GAVALIDIPVDTPLFHIPSNLLLTPYTSRLASLLADEEWDKLDIG----------WARLI 93

Query: 154 TYLISEASFEKSSRWSNYISAL--PRQPYSL--------------LYWT---RAEL-DRY 193
             ++ E S  + S+W  Y+S L  P +P  L              ++W    RAEL    
Sbjct: 94  LVMMYETSLGQKSKWYQYLSKLFLPCRPTWLIAETESMPTKFDTPMFWDETRRAELIGTD 153

Query: 194 LEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFN--METFKWSFGILFSRLV 251
           LE    RE A ++       Y ++ L I   +PD+FP    +  +E +      + SR  
Sbjct: 154 LEGRIGREDADKQ-------YFEILLPIIQAHPDIFPPNSIDTSLEAYHLQGSRILSRSF 206

Query: 252 RLP----------------------SMDGRVALVPW-ADMLNHSCEVETFLDYDKSSQ-- 286
            +P                        +G VA + W ADMLN + E++    YD S    
Sbjct: 207 TIPRSKAGGPAPHVVESDDSDSDSDEEEGGVAAMVWMADMLNAAYELDNARLYDTSENTS 266

Query: 287 ----------GVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
                     G    + +  + GEQ++ +Y    N ELL  YG V
Sbjct: 267 ATSTEPWDRPGYTMRSTKLIKAGEQIYNTYDSPPNSELLRKYGHV 311


>gi|387197713|gb|AFJ68815.1| set domain protein, partial [Nannochloropsis gaditana CCMP526]
          Length = 327

 Score = 55.1 bits (131), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 49/194 (25%), Positives = 88/194 (45%), Gaps = 23/194 (11%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEA 160
           RG VAL +I   E ++ +P  L++T D     P+ G+V +       D  +L   L+ E 
Sbjct: 98  RGAVALDDINSNEDMVSIPEPLLLTPDVALKDPDIGKVFEDNLEDFSDEDMLLILLMHER 157

Query: 161 SFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL-RL 219
              ++S +  Y++ LPR P +LL W    L  +L+   +      R + +   Y  L   
Sbjct: 158 GKGETSFFYPYLATLPRLPDTLLNWNEEGLS-WLQDEGLSLEVFLRESQLTAHYTRLVEE 216

Query: 220 RIFSKYPDLFPE-------------EVFNMETFKWSFGILFSRLV--RLPSMDGRVALVP 264
           ++ + +P LF E             + +++E F++++  + +R    RLP      AL+P
Sbjct: 217 KLKAGWPGLFGEAPDDASDSESKGADPYSLENFRFAWLTIQARAFGRRLPY----SALIP 272

Query: 265 WADMLNHSCEVETF 278
             D  NH+    T+
Sbjct: 273 LCDSFNHANVAVTY 286


>gi|393904017|gb|EJD73630.1| SET domain-containing protein 3 [Loa loa]
          Length = 444

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 71/275 (25%), Positives = 108/275 (39%), Gaps = 43/275 (15%)

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
           S W  YI  LP    + L++T  +L + L  S + E ++    NV   +    L I    
Sbjct: 19  SHWQPYIKVLPECFDTPLFFTVEQL-QCLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 77

Query: 223 ----------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR----- 259
                      K  +L P  V        F    ++WS   + +R+  +PS   +     
Sbjct: 78  EFRHRKKKSKDKISELEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVWKDDIGQ 137

Query: 260 ----VALVPWADMLNHSCEVETF---LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNG 312
                 L+P+ DM NHS     F   + +            R Y+P E V I YG +SN 
Sbjct: 138 PRMIPGLIPFLDMANHSYTESAFHEAVHFSDEFDCAEVIAVRDYKPLEPVNIFYGWRSNR 197

Query: 313 ELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSA-SECFPIQITGW- 370
           + LL  GF+P E  N  D  +L + L KS K    ++      G  A S  F  +I+   
Sbjct: 198 DFLLHNGFIPLE-KNIRDIYKLKIGLPKS-KREDARMRLFHALGFVAESTIFAFEISVCE 255

Query: 371 -----PLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
                 L   A  Y++   PS   + EE A+++ N
Sbjct: 256 PYFHDSLFRFAQIYILDEVPSAAEQVEEAASSSDN 290


>gi|302921343|ref|XP_003053266.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
           77-13-4]
 gi|256734206|gb|EEU47553.1| hypothetical protein NECHADRAFT_105995 [Nectria haematococca mpVI
           77-13-4]
          Length = 371

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 66/269 (24%), Positives = 113/269 (42%), Gaps = 38/269 (14%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+VA ++IR+ E +L VP   + T D+     +  + L   SV         L +E + +
Sbjct: 32  GVVATRDIRENEAILTVPMKALRTIDT--VPKQISKALHGVSV------HGILAAEIALD 83

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDRYLEA---SQIRERAIERITNVIGTYNDLRLR 220
           KS  ++ + + LP         T+ +L+  +     S+++     R  N++        R
Sbjct: 84  KSDDFAVWKTVLP---------TKEDLESGMPMMWPSELQLLLPRRAKNLLDKQTTTFRR 134

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDG-----RVALVPWADMLNHS--- 272
                 + FP    +   + W      +    +P M       R+  +P AD+ NH+   
Sbjct: 135 ECEIVLNAFPNLTRDDYLYAWVLINTRTFYNSMPKMKAYAQADRLVCMPAADLFNHADQG 194

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---PREGTNPS 329
           C++        S  G     DR Y+ GE+V++SYG  SN  LL  YGF+    R      
Sbjct: 195 CQLSF------SPLGYTIKADRVYRQGEEVYVSYGPHSNDFLLTEYGFILGPNRWDEVYL 248

Query: 330 DSVELPLSLKKSDKCYKEKLEALRKYGLS 358
           D V LPL L K+ +     ++ L ++ L 
Sbjct: 249 DDVILPL-LNKTQRAELASVDFLGRFTLD 276


>gi|350629837|gb|EHA18210.1| hypothetical protein ASPNIDRAFT_38188 [Aspergillus niger ATCC 1015]
          Length = 480

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 73/277 (26%), Positives = 120/277 (43%), Gaps = 33/277 (11%)

Query: 78  TLQKWLSDSGLPPQKMAIQKV---DVGERG--LVALKNIRKG---------EKLLFVPPS 123
           TL  W+  +G+    +A +K+   D  ++G  +V  +    G         E LL VP  
Sbjct: 10  TLPSWIKLNGVSVNGIAFRKLQADDGTDKGSAIVGTEVKSTGNAEGSEVEPEVLLRVPTD 69

Query: 124 LVITADSKWSCPEAGEVLKQC--SVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYS 181
           LV++ D      ++   L++   +V D+  +  Y         SS W+ Y+  +P     
Sbjct: 70  LVLSFDFVEEYSKSDRQLREVLEAVGDFGRV-DYASERHQIGLSSPWTEYMKYMPPAISL 128

Query: 182 LLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----LRIFSKYPDLFPEEVFNM 236
             +++  EL+  L  S +R     +I ++   +  LR     L    KY   + E+   +
Sbjct: 129 PTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGLDWCEKY--WWDEDTGKL 185

Query: 237 ETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--VETFLDYDKSSQGVV-FT 291
               W +   +  SR+V LP      A+VP  DM NH+ E  V+   D D     V+   
Sbjct: 186 TFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTVKALYDEDADGNAVLQLR 243

Query: 292 TDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
             R  +  E+V ISYG +KS  EL+ SYGF+    T+
Sbjct: 244 EGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 280


>gi|189193345|ref|XP_001933011.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187978575|gb|EDU45201.1| predicted protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 642

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/214 (25%), Positives = 89/214 (41%), Gaps = 28/214 (13%)

Query: 141 LKQCS--VPDWPLLATYLISEASFEKSSRWSNYISALP--RQPYSLLYWTRAELDRYLEA 196
           L+QC   +PD  L    LI +    ++S W  YI+ LP  R   + L++   ++  +L  
Sbjct: 84  LRQCRGRIPDHILTYLLLIEQRDKGQASPWHAYIACLPNSRDMTTPLWFDEGDM-AFLAG 142

Query: 197 SQIRERAIERITNV-------IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR 249
           + +   A ER   +       I    +L + +          +  + E+  W+  I  SR
Sbjct: 143 TSLVPAAKERKAELQQQWEGAIAVMEELSIPL---------AKGIDTESLLWAATIFTSR 193

Query: 250 LVR----LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-RQYQPGEQVFI 304
                  LP  +    L P  D+LNHS   +   D+       +   D   +QP +++F 
Sbjct: 194 AFISTHILPERETVPILFPVVDILNHSVSAKVEWDFQPGQSFALKCLDGDSFQPEQELFN 253

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           +Y  K N ELLL YGF      NP +   L L+ 
Sbjct: 254 NYAPKQNDELLLGYGFCLE--NNPIEQFALKLAF 285


>gi|392863014|gb|EAS36291.2| hypothetical protein CIMG_01513 [Coccidioides immitis RS]
          Length = 746

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 32/205 (15%)

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
           PE    +K+     + L+  YL+ + SF     W+ YI +LP   Q   L Y++  +L+ 
Sbjct: 132 PEFLPAVKEKGASAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 185

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
           +LE + + +     +  +  TY ++ L++  + P+   +  +  E F W+  I+ SR   
Sbjct: 186 WLEGTNLLKLRENMLIKLKTTY-EVGLQMLKESPNKNTKN-YTWERFLWASSIIISRAFS 243

Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
              +   V               LVP  DM NH    +VE      ++SQGVV     + 
Sbjct: 244 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 298

Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
             PG++V  +YG ++N  L+L+YGF
Sbjct: 299 LLPGQEVPNNYGPRNNERLMLNYGF 323


>gi|212721460|ref|NP_001132025.1| uncharacterized protein LOC100193433 [Zea mays]
 gi|194693232|gb|ACF80700.1| unknown [Zea mays]
 gi|414881264|tpg|DAA58395.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
 gi|414881265|tpg|DAA58396.1| TPA: hypothetical protein ZEAMMB73_027665 [Zea mays]
          Length = 252

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 5/114 (4%)

Query: 255 SMDGRVALVPWAD-MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           S+  R ALVP    +L +    +  L  D  S  V    DR Y+ GE + I  G ++N  
Sbjct: 17  SLARRFALVPLGPPLLTYRSNCKAMLTADGDS--VRLVVDRPYKAGEPIIIWCGPQTNSR 74

Query: 314 LLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQI 367
           L+L+YGFV  +  NP D V +  SL   D  Y+EK    ++ G  A + F + +
Sbjct: 75  LVLNYGFVDED--NPFDRVAIEASLNTEDPQYQEKRMVAQRNGKLAIQNFNVYV 126


>gi|453088140|gb|EMF16181.1| SET domain-containing protein, partial [Mycosphaerella populorum
           SO2202]
          Length = 307

 Score = 54.7 bits (130), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 56/297 (18%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           GL+    I K ++++FVP + + T  +  S  +        S      LA  ++SE    
Sbjct: 31  GLLTTAKISKDDQIIFVPKNAMFTPKTAESHTKKPSPSPSPSPSPQAHLAISIMSECLSP 90

Query: 164 KS------SRW---SNYISALPRQPYSLLYWTRAELDRYLEAS------QIRERAIERIT 208
            S        W   S++ S +P      L+W+  EL  +L  S      ++RE   + +T
Sbjct: 91  SSPYLTWKKTWPTLSDFESGMP------LFWS-PELCHHLPESVKQPLERMREDYEKDLT 143

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMD-GRVALVPW 265
            ++    D              ++ +  E FK+ + I+ SR      P +  G + L P+
Sbjct: 144 YMLSLNCD--------------DQTWKEEDFKYYWAIVNSRCFHFKPPGLKPGFMVLCPF 189

Query: 266 ADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV---- 321
            D +NH     T +   +S +G     DR Y+P  ++  +YG   N +LL+ YGF     
Sbjct: 190 IDYMNHG-PTGTGVKVSQSPKGYEVVADRDYEPNTEILATYGSHPNDKLLVHYGFCLSYK 248

Query: 322 PREGTNPS---DSVELPLSLKKSDKCYKEKLEALRKYGL--------SASECFPIQI 367
           P E ++     D + LP ++  + K   + +  L  Y L         A  CF  Q+
Sbjct: 249 PNEPSDDDIRLDHILLP-AMSANTKSQLQDVGMLGSYALLPPSAQRQQAELCFKTQV 304


>gi|169606334|ref|XP_001796587.1| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
 gi|160706968|gb|EAT86035.2| hypothetical protein SNOG_06204 [Phaeosphaeria nodorum SN15]
          Length = 634

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 87/213 (40%), Gaps = 10/213 (4%)

Query: 143 QCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSLLYWTRAELDRYLEASQIRE 201
           Q  +PD  L    LI + +  K S W  YI+ LP  +  +   W   E   +L  + +  
Sbjct: 94  QGKIPDHILTYLLLIEQRNKGKESPWHAYIACLPGAESMTTPLWFDDEDMAFLAGTSLAP 153

Query: 202 RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR----LPSMD 257
            A ER +     +    L I         +EV + E+  W+  I  SR       LP  +
Sbjct: 154 AAKERKSLYYQQWEQ-ALGIMKDAGVALADEV-DFESLLWAATIFTSRAFISTHILPDHE 211

Query: 258 GRVALVPWADMLNHSCEVETFLDYDK-SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
               L P  D+LNHS   +   ++   +S  +       +  G+++F +Y  K N ELLL
Sbjct: 212 TVPLLFPIVDILNHSVSAKVEWEFQPLASFSLKLLEGDTFTAGQELFNNYAPKQNDELLL 271

Query: 317 SYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
            YGF      NP +   L L+     + Y E +
Sbjct: 272 GYGFCLEH--NPIEQFPLKLAFPPMLQEYAEAM 302


>gi|402224283|gb|EJU04346.1| hypothetical protein DACRYDRAFT_114691 [Dacryopinax sp. DJM-731 SS1]
          Length = 1313

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/212 (26%), Positives = 95/212 (44%), Gaps = 14/212 (6%)

Query: 81   KWLSDSGLPPQKMAIQKVDVGE--RGLVALKNIRKGEKLLFVPPSLVITADSKWSCP-EA 137
             W + +G      AI   D+ E  RG VAL++I +GEKL  +P SL+++  +  S P   
Sbjct: 819  NWFTSAGGTFDSSAIGIEDLPETGRGAVALRDIYEGEKLFTIPRSLLLSTRTS-SLPFLL 877

Query: 138  GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
            GE         W  L   ++ E +  + S W  Y+ ++P +  +L++WT  EL   L+ S
Sbjct: 878  GEEDWNALGDGWAGLILCMMWEEARAEESPWRGYLESMPTEFSTLMFWTDEELG-LLKGS 936

Query: 198  QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
             + ++ I R       YN+  L +  K  DLF   +F       ++ I  SR+     + 
Sbjct: 937  LVLDK-IGR-AGAEKDYNEKVLPLLQKRTDLFAPSLFQTRYTLQNYHIQGSRI-----LS 989

Query: 258  GRVALVPWADML--NHSCEVETFLDYDKSSQG 287
                + PW+  +  N   E    +D   +S G
Sbjct: 990  RSFTVSPWSGAVPENDEDEAPELVDTSMASAG 1021



 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 8/81 (9%)

Query: 260  VALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSY 318
            VA+VP ADMLN  C      L Y +    ++ T  +    GEQ++ +YG   N +LL  Y
Sbjct: 1059 VAMVPMADMLNARCGCNNAKLFYTRDDLQMMAT--KPIAKGEQIWNTYGDPPNSDLLRRY 1116

Query: 319  GFV-----PREGTNPSDSVEL 334
            G+V     P    +PSD VE+
Sbjct: 1117 GYVDALTLPDGVGSPSDVVEI 1137


>gi|367021574|ref|XP_003660072.1| hypothetical protein MYCTH_2297882 [Myceliophthora thermophila ATCC
           42464]
 gi|347007339|gb|AEO54827.1| hypothetical protein MYCTH_2297882 [Myceliophthora thermophila ATCC
           42464]
          Length = 426

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 255 SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT--DRQYQPGEQVFISYGKKSNG 312
           + D R+ L P AD+LNH+        +D +     FT   DR Y PGE+V I YG+  N 
Sbjct: 221 ARDDRMVLQPVADLLNHAAAGYATAGFDGAGGIGWFTVAADRAYAPGEEVHICYGRHHND 280

Query: 313 ELLLSYGFV 321
            LL+ YGF+
Sbjct: 281 LLLVEYGFL 289


>gi|7329638|emb|CAB82703.1| putative protein [Arabidopsis thaliana]
          Length = 486

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 71/285 (24%), Positives = 115/285 (40%), Gaps = 48/285 (16%)

Query: 54  RTKTTVTQNMIPWGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRK 113
           +T+ ++  N +PW   I   +  +TL    S  G               R L A K I  
Sbjct: 37  QTQASLDNNFLPWLERIAGAKITNTLSIGKSTYG---------------RSLFASKVIYA 81

Query: 114 GEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYIS 173
           G+ +L VP +  IT D     P    VL    V +  +LA  LI E    + SRW  YIS
Sbjct: 82  GDCMLKVPFNAQITPDE---LPSDIRVLLSNEVGNIGMLAAVLIREKKMGQKSRWVPYIS 138

Query: 174 ALPR--QPYSLLYWTRAELDRY----LEASQIRERA-IERITNVIGTYNDLRLRIFSKYP 226
            LP+  + +S ++W   EL       +    ++++A IE+  + +         I ++ P
Sbjct: 139 RLPQPAEMHSSIFWGEDELSMIRCSAVHQETVKQKAQIEKDFSFVAQAFKQHCPIVTERP 198

Query: 227 DLFPEEVFNMETFKWSFGILFSRLVRLP--------------SMDGRVALVPWADMLNHS 272
           DL        E F +++ +    L  +               S  G++      D +NH 
Sbjct: 199 DL--------EDFMYAYALGEKVLCIVLFLLNLDNLLLDSGISCVGKLK-THITDFMNHD 249

Query: 273 CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
               + +  D+ +Q    T DR Y PG++V +S   K  G L L+
Sbjct: 250 GLSASIVLRDEDNQLSEVTADRNYSPGDEVDLSDWLKLMGLLKLT 294


>gi|412994115|emb|CCO14626.1| unnamed protein product [Bathycoccus prasinos]
          Length = 390

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 62/125 (49%), Gaps = 16/125 (12%)

Query: 225 YPDLFPEE--VFNMETFKWSFGILFSRLVRLPSMDGRV-----ALVPWADMLNH----SC 273
           +  +F EE    + E F W+   + SR + + S +  +     + +P  D+LNH    +C
Sbjct: 172 HAGIFGEENKAVSYEMFAWAISTVLSRALSVSSENKNIDSLFYSFIPGVDLLNHDANANC 231

Query: 274 EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGF-VPREGTNPSDS 331
           E+    + + +S  +     R  +  E+  ISYG  +SN ELL  YGF VP    N +DS
Sbjct: 232 EIRLVSNKNNASTSIEVYAIRDIENDEECTISYGNHRSNDELLRKYGFCVP---NNRNDS 288

Query: 332 VELPL 336
           +++ L
Sbjct: 289 IDVRL 293


>gi|71406326|ref|XP_805712.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70869221|gb|EAN83861.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 572

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 7/139 (5%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
           A L  YL+  + R + +    N    +   +  L  F   P    EE      +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSALSFFQTMPHSDCEERSMPVTLEQFLWA 256

Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           +  L +R           +L+PW D  N++      + YD+     +F      + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDERRGAYIFEVLFPIESGEQI 314

Query: 303 FISYGKKSNGELLLSYGFV 321
           F+ YG  ++ ELLL YGF 
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333


>gi|424512980|emb|CCO66564.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/304 (25%), Positives = 117/304 (38%), Gaps = 59/304 (19%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCSVPDWPL-LATYLI 157
           RG+ A  +I   + L+ +P    +        P     + E+  +     W L +A  L+
Sbjct: 107 RGVKATSDIASEDDLVRLPREATMLVVEGQENPHEEYISNELWAKAGDERWALRVALVLL 166

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAEL--------DRYLEASQI-RERAIERIT 208
            E S    S++  YI  LP+   +L  WT  E+        +++ +  ++  E+A E I 
Sbjct: 167 YEKSLGSRSKFYEYIEQLPKSFENLGTWTEEEVRELQYSVGEKFAKEQRLENEKACELIQ 226

Query: 209 N--------------VIGTYNDLRLRIFS-KYPD-------LFPEEVFNMETF------- 239
                          VI   + +R R+FS K  D       L P  +     F       
Sbjct: 227 EYARDGGLKTIEREEVIWALDVVRSRVFSGKIADQEALQRKLLPRALSVGTVFASFLTAQ 286

Query: 240 ----KWSFGILFSRLVRLPSM-------DGRVALVPWADMLNHSCEVETFLDYDKSSQGV 288
               KW        LV   S        D    L+P  D  NH   ++T  ++  S    
Sbjct: 287 TTELKWLCVFALLALVVFDSTKENDVKTDTAYVLMPLIDAFNHQTMLKTEFEFTNSE--F 344

Query: 289 VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE--LPLSLKKSDKCYK 346
              + + Y+ GE+V ISYG   N ELLL YGFV  +    +   E  LP  L ++D   K
Sbjct: 345 ALKSPKSYKKGEEVLISYGLMPNDELLLRYGFVDDQNVADTYQFEGLLPY-LTQNDPTLK 403

Query: 347 EKLE 350
           E LE
Sbjct: 404 ENLE 407


>gi|412986734|emb|CCO15160.1| predicted protein [Bathycoccus prasinos]
          Length = 450

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 9/101 (8%)

Query: 239 FKWSFGILFSRLVRLPSMDGRVA----LVPWADMLNHSC---EVETFLDYDKSSQGVVFT 291
           + W+   +FSR  R+    GR A    ++P  D+LNHS    EV       +     +  
Sbjct: 207 YGWALSQVFSRTFRIEDARGRRAPRRVMIPIVDLLNHSSVEEEVNVTWRVKEDLSAFIVE 266

Query: 292 TDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSV 332
             R     E++ +SYG++++   LL YGF+P    NP +SV
Sbjct: 267 AKRNVGKDEELILSYGERNDQHFLLFYGFLP--SMNPCNSV 305


>gi|410082051|ref|XP_003958604.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
 gi|372465193|emb|CCF59469.1| hypothetical protein KAFR_0H00600 [Kazachstania africana CBS 2517]
          Length = 508

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/251 (25%), Positives = 108/251 (43%), Gaps = 37/251 (14%)

Query: 103 RGLVALKNIRKGEKLLFVPPSL---VITADSKWSCPEAGEVLKQCSVPDWPLLATYLISE 159
           RG++A+K+I +GE L  +P      V+T+       +  E L+  S+  W  L   L+ E
Sbjct: 39  RGVIAVKDIAEGEVLFEIPRDSILNVLTSSLSSDFSDLEETLQ--SIGSWEGLILCLLYE 96

Query: 160 ASFEKS-SRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
              +K  S+W  Y + LP       L+YW   EL+ +L  S + +R  ++    +  +  
Sbjct: 97  WKGKKEKSKWWKYFNVLPSSNAMNGLMYWNEQELE-HLRPSLVLDRIGKKSAKNM-YHKV 154

Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV---------------- 260
           L L   SK+P++        E F ++  ++ +    + + + +                 
Sbjct: 155 LTLVKESKFPEVLCN--VEWEDFVYAASVIMAYSFDVENGESQTLNEEDDDQDEEENTGY 212

Query: 261 --ALVPWADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
             +++P AD LN   H C      D DK  +       +  + GEQVF  YG   N E+L
Sbjct: 213 IKSMIPLADTLNSDTHQCNANLMYD-DKFLKMYAI---KPIKKGEQVFNIYGNHPNAEIL 268

Query: 316 LSYGFVPREGT 326
             YG+V   G+
Sbjct: 269 RRYGYVEWSGS 279


>gi|410082986|ref|XP_003959071.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
 gi|372465661|emb|CCF59936.1| hypothetical protein KAFR_0I01550 [Kazachstania africana CBS 2517]
          Length = 584

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 52/108 (48%), Gaps = 16/108 (14%)

Query: 241 WSFGILFSR----LVRLP---SMDGRVALVPWADMLNHSCEVE---TFLDYDKSSQGVVF 290
           WSFGI  SR    ++  P   S   +  L P  D+LNH        TF D     Q   F
Sbjct: 202 WSFGIFTSRAFPEILINPDNCSNVNQAFLYPIVDLLNHKNGTSVKWTFED----DQAHFF 257

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           T ++  +   ++F +YG KSN ELLL YGFV  +  N  D  +L L L
Sbjct: 258 TNEKNLKKHTELFNNYGDKSNEELLLGYGFV--QSNNAHDDTKLTLKL 303


>gi|384249602|gb|EIE23083.1| SET domain-containing protein, partial [Coccomyxa subellipsoidea
           C-169]
          Length = 306

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/178 (23%), Positives = 84/178 (47%), Gaps = 8/178 (4%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G+ A++++ +G++L  +P + V++  +        ++L+Q  +     L   ++ E S  
Sbjct: 3   GVFAVQDLCEGQRLCEIPKTAVLSVQNTG----IADILEQHRIRGGLGLIIAIMYELSIG 58

Query: 164 KSSRWSNYISALPRQPYSLLYWTRAELDR-YLEASQIRERAIERITNVIGTYNDLRLRIF 222
           K S W  Y+  L ++ Y  L+W  AE +R  L+ ++   R  E        +      + 
Sbjct: 59  KESFWHGYLEELHKREYLPLFW--AEQERSLLQGTEAEHRPQEDEELTQEDFETHVPPLV 116

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLD 280
            ++ D    + F +E+F+ +   + SR   + S  G +++VP AD+ NH   +  F D
Sbjct: 117 EQHADRLRADSFTLESFRVAASWVASRAFGVDSFHG-MSMVPLADIFNHKAAIVQFSD 173


>gi|406602709|emb|CCH45757.1| hypothetical protein BN7_5343 [Wickerhamomyces ciferrii]
          Length = 569

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/134 (31%), Positives = 65/134 (48%), Gaps = 15/134 (11%)

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQG--VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           LVP  D+LNH  E     D   SS G   +F T+++ + G++++ SYG K+N EL+  YG
Sbjct: 231 LVPIFDLLNHDNEANVKWDSLDSSNGKNFIFKTEQKLKNGDEIYNSYGPKTNQELMFGYG 290

Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASE---CFPIQITG-WPLELM 375
           F      N  D   L L + +++      +E+   +GL  +     +PI      P  L+
Sbjct: 291 FAIE--NNKEDRATLALRIPEAN------IESANTFGLKLTTNEVSYPITKENPLPTPLI 342

Query: 376 -AYAYLVVSPPSMK 388
             +AYLV S    K
Sbjct: 343 DLFAYLVKSDEETK 356


>gi|115386294|ref|XP_001209688.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114190686|gb|EAU32386.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 486

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 68/279 (24%), Positives = 111/279 (39%), Gaps = 61/279 (21%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVI-TADSKWSCPEAGEVLKQC--SVPDWPLLATYLISE 159
           RG+VA  +I + E+L  +P  LV+ T +SK       ++L Q    +  W  L   ++ E
Sbjct: 48  RGVVAQTDIPENEELFTIPRDLVLSTQNSKLK-----DLLSQDLEELGPWLSLMLVMMYE 102

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTY 214
                 S W+ Y   LPR+  +L++WT +EL   L+ S +     R+ A E I  +I   
Sbjct: 103 YLLGDQSTWAAYFKVLPRKFDTLMFWTPSEL-LELQGSAVIDKIGRQGADESILEMIAP- 160

Query: 215 NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV-------------- 260
                 I   +P LFP  V  + ++    G     L+ L    G +              
Sbjct: 161 ------IVRAHPSLFP-PVDGLPSYDGDAGT--QALLHLAHTMGSLIMAYAFDIEKPEDE 211

Query: 261 ---------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPG 299
                                 +VP AD+LN   +      + +    +V    +    G
Sbjct: 212 DEEGDGEGGYMTDEEEEQLSKGMVPLADLLNADADRNNARLF-QDENALVMKAIKPIAKG 270

Query: 300 EQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
           E++F  YG+    +LL  YG+V  +   P D VE+ L +
Sbjct: 271 EEIFNDYGEIPRADLLRRYGYV-TDNYAPYDVVEVSLDV 308


>gi|145349891|ref|XP_001419360.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579591|gb|ABO97653.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 465

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 108/260 (41%), Gaps = 30/260 (11%)

Query: 93  MAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVIT------ADSKWSCPEAGEVLKQCSV 146
            A+QK +  +RG+ A + + +G  L  +P    +T      AD   S  EA  ++K  + 
Sbjct: 36  FAVQKAN-KDRGVTAKRALERGAILAVIPFEACLTLKTCSRADVAASVEEA--LVKTKTE 92

Query: 147 PDWPL-LATYLISEASFEKSSRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAI 204
             W   L   L  E S    SR+  Y   LPR +   +  W   E   YL  +++     
Sbjct: 93  ASWLCGLTAALCVERSLGLKSRYFAYDRVLPRCEANVVCAWNDGERS-YLAGTEVETSLR 151

Query: 205 ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVP 264
           +        +  +   +F ++     E  F  E F  +  ++ SR   L    G V LVP
Sbjct: 152 DEAAAAKNEWERVVAPVFKEHG---VECSF--EQFIEARTVVSSRAFTLSPNAG-VGLVP 205

Query: 265 WADMLNH--SCEVETFLDYD---KSSQG-----VVFTTDRQYQPGEQVFISYGKKSNGEL 314
            AD  NH          D D   +S  G     V  T ++  + G+++F +YG   N +L
Sbjct: 206 IADAFNHLTGNHHVNVGDGDAVVRSETGGEALCVKVTNEQGVRRGDEIFNTYGFHGNAKL 265

Query: 315 LLSYGFVPREGTNPSDSVEL 334
           L SYGF   +  NP+D V L
Sbjct: 266 LNSYGFTQND--NPADEVRL 283


>gi|145345009|ref|XP_001417016.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577242|gb|ABO95309.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 390

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 38/60 (63%), Gaps = 2/60 (3%)

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L+P  D LNH   ++T  +++ S    V  + R+Y+ GE+VFISYG  +N EL+  YGFV
Sbjct: 178 LMPLIDALNHKTMLKT--EFEFSGGAFVLRSPREYKTGEEVFISYGVLNNDELITRYGFV 235


>gi|154272535|ref|XP_001537120.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150409107|gb|EDN04563.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 485

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/274 (26%), Positives = 113/274 (41%), Gaps = 59/274 (21%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVL--KQCSV 146
           P  K+A  + +   RG+VA  +I + E+L  +P SLV++    +      ++L   +   
Sbjct: 34  PKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQSLVLS----FQNSRLKDLLDFNERDF 89

Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIER 206
             W  L   +I E     +S WS Y   LP    +L++WT  EL R L  S +    + +
Sbjct: 90  DPWLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL-RELSGSAV----LNK 144

Query: 207 ITNVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM------ 256
           I       N  R  L + S  P LFP  +  + +F    G   L S   R+ S+      
Sbjct: 145 IGRSDAEANIFRNILPLVSGNPSLFP-PMSGVASFDSPEGKAALLSLAHRMGSLVMAYAF 203

Query: 257 -------DGR---------------VALVPWADMLNHSCEVETFLDYDKSS----QGVVF 290
                  DGR                 +VP AD+LN         D D+++    Q   +
Sbjct: 204 DIEKGENDGREGQDGYVTDDEEELSKGMVPLADLLNA--------DADRNNARLFQEDCY 255

Query: 291 TTDRQYQP---GEQVFISYGKKSNGELLLSYGFV 321
            + R  +P   GE++F  YG+    +LL  YG+V
Sbjct: 256 LSMRSIKPIRKGEEIFNDYGELPRADLLRRYGYV 289


>gi|171692069|ref|XP_001910959.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945983|emb|CAP72784.1| unnamed protein product [Podospora anserina S mat+]
          Length = 454

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/109 (36%), Positives = 56/109 (51%), Gaps = 6/109 (5%)

Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTT-DRQYQPGEQVFISY 306
           SR + LP      ++VP  DM+NHS +   + D +   + V+          G++V ISY
Sbjct: 178 SRCLELPK--SGESMVPCIDMINHSSDPSAYYDQNSDYEAVLLLRPGASMSKGQEVTISY 235

Query: 307 GK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
           G  KS  E+L SYGF+  E T  S+S+ LPL+    D   K KL A  K
Sbjct: 236 GDTKSAAEMLFSYGFIDPEST--SESLVLPLAPFPDDPLAKAKLVAFGK 282


>gi|303313087|ref|XP_003066555.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
 gi|240106217|gb|EER24410.1| SET domain containing protein [Coccidioides posadasii C735 delta
           SOWgp]
          Length = 329

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 66/289 (22%), Positives = 100/289 (34%), Gaps = 49/289 (16%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG 138
             +W  D G+    +A  +      G+ AL+ I  GE ++ VP S ++T D   S     
Sbjct: 16  FTQWAKDQGIQINGVAAVRFPGRGMGIAALRGIDAGETIVSVPTSSLLTLDKIRSTFRE- 74

Query: 139 EVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQ-----------PYSLLYWTR 187
               Q   P   + A YL  +   +  SR++ + +  P             P  L+    
Sbjct: 75  --KFQGDTPVQGIFAAYLACDD--DARSRYAPWRATWPTMRDFEDSIPLLWPKYLIGTPG 130

Query: 188 AELDRYLEASQIRERAI---------------ERITNVIGTYNDLRLRIFSKYPDLFPEE 232
            EL    E +   E                   R+    G Y      +       F + 
Sbjct: 131 DELKGQGETTARGEEVFASLLPPSISGHFTLSNRVGRFSGDYTPDHQNLLENQRSRFRKA 190

Query: 233 V---------FNMETFKWSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCEV 275
                      N+E F + +    +R     + D  V        AL P+AD  NHS   
Sbjct: 191 FSRVKLACPGINLEIFTYYWFATHTRCFFYVAKDSEVPEDRNDAMALCPFADYFNHSSN- 249

Query: 276 ETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPRE 324
           +          G  FT  + Y  GE+VF+ YG  ++  LL  YGFVP E
Sbjct: 250 DPGCKASFDGGGYTFTATKSYAKGEEVFVCYGNHTSDVLLTDYGFVPDE 298


>gi|449016030|dbj|BAM79432.1| similar to ribulose-1,5 bisphosphate carboxylase/oxygenase large
           subunit N-methyltransferase [Cyanidioschyzon merolae
           strain 10D]
          Length = 458

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 84/353 (23%), Positives = 141/353 (39%), Gaps = 55/353 (15%)

Query: 82  WLSDSGLPPQKMAIQKVDVGE----RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           +L ++  P   +A   VD       RGLVA   IR GE +  +P  L I   S+   P  
Sbjct: 139 YLREAHFPKVALAEVPVDGASSLKMRGLVATAAIRAGEVICRIPRRLAICLGSEGENP-- 196

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLL---YWTRAELDRYL 194
                   +P   LL      EA      ++  Y   LPR     +   ++   EL +  
Sbjct: 197 -------GLPALHLLRMMTDGEAV----HKYKAYFDVLPRPEMCQMTTDFYNDEELGQIA 245

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKY--PDLFPEEVFNMETFK---WSFGILFSR 249
               + E    R   +  T+    LR  + Y  P +  + + +M  F+   W+  ++ SR
Sbjct: 246 HTPTVEE-TRRRRQQLRDTFLQEFLRTGADYLHPQVAAQNLDHMPEFQRYLWAVHLVVSR 304

Query: 250 LVRLPSMD-GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
            + + + D  +  L+P  DM+N   + +  L Y  ++   V       +  E++ I YG 
Sbjct: 305 ALAVRTGDEAQRYLIPLLDMINCRMDSKHELRYRIATDEFVLIAGESVRRSEEIRIPYGG 364

Query: 309 K--SNGELLLSYGFVPREGTNPSDSVE-LPLS-LKKSDKCYKEKLEALRKYGLSASECFP 364
              SN  L+  YGF+     NP+D +  LP   ++++D    E+ E +R+      EC  
Sbjct: 365 GFVSNDRLIQDYGFIVER--NPADLLLFLPRHCVQRADLLSSEERENVRQ------ECAA 416

Query: 365 IQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
           +         +A    V+S PS + +       A N+       KC E+ E A
Sbjct: 417 V---------LARQQQVLSVPSERIR-------AFNRAVVDAARKCIELLETA 453


>gi|407832777|gb|EKF98587.1| hypothetical protein TCSYLVIO_010514 [Trypanosoma cruzi]
          Length = 572

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 7/139 (5%)

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLR--LRIFSKYPDLFPEEV---FNMETFKWS 242
           A L  YL+  + R + +    N    +   +  L  F   P    EE      +E F W+
Sbjct: 197 AYLRPYLQFERHRHKVLREQANAEAEFQLCKSTLSFFQTMPHSDCEERSMPITLEHFLWA 256

Query: 243 FGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQV 302
           +  L +R           +L+PW D  N++      + YD+     +F      + GEQ+
Sbjct: 257 YNTLMTR--GFAYYSEVWSLMPWVDYFNYALNSNATMKYDELRGAYIFEVLFPIESGEQI 314

Query: 303 FISYGKKSNGELLLSYGFV 321
           F+ YG  ++ ELLL YGF 
Sbjct: 315 FLQYGAYTDMELLLWYGFT 333


>gi|261328667|emb|CBH11645.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 583

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 91/212 (42%), Gaps = 28/212 (13%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEASQI 199
           LL   LI E    ++S W + + + P   P    YW+  +L          D   +  ++
Sbjct: 202 LLVLALIYERFVARTSHWKDLLLSCPTDFPTVPSYWSWNDLSGLYGLDVLDDVLAKQERL 261

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSM 256
           R+   E +T+V+    D           +  EE    F +E   W+  +  SR   L ++
Sbjct: 262 RQFHTE-VTSVLPLIYD----ALEGCSGIEREEFMGHFTIENIMWARAVFDSRAFNL-NV 315

Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
           DGRV  ALVP ADM+NHS   +  +   +   G     V     +   G ++ +SYG   
Sbjct: 316 DGRVVLALVPCADMINHSNHPDVLIRRVEPCGGDFVMQVGAGLAREDVGRELGMSYGPLQ 375

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           N ELL  YGFV  +  N  D +  P  + ++D
Sbjct: 376 NWELLQHYGFVLDD--NEHDKLPFPFDVHEAD 405


>gi|425766115|gb|EKV04742.1| hypothetical protein PDIG_87340 [Penicillium digitatum PHI26]
 gi|425778867|gb|EKV16969.1| hypothetical protein PDIP_33360 [Penicillium digitatum Pd1]
          Length = 679

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 93/204 (45%), Gaps = 29/204 (14%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQPYSL---LYWTRAELDRYLEASQIRERAIERI 207
           L+A YL     F     W  Y+  LP QP  L   L++   ++D +++ + I E A+ERI
Sbjct: 110 LMAQYLRGPEGF-----WYPYLRTLP-QPGQLTTPLFFGEEDVD-WIQGTGIPEAAVERI 162

Query: 208 TNVIGTYNDLRLRI-FSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV------ 260
                 Y+   L++  + +PD    E +  E + W+  I+ SR      + G V      
Sbjct: 163 KIWEEKYDSGYLQLGATGFPDC---ETYTWELYLWASTIITSRAFSAKVLSGAVQPGDLP 219

Query: 261 -----ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELL 315
                AL+P  D+ NH    +          G++   D  +  G+++  +YG ++N +LL
Sbjct: 220 EDGVSALLPLIDLPNHRPMAKVEWRAGDKDIGLLVLED--HSAGQEISNNYGPRNNEQLL 277

Query: 316 LSYGFVPREGTNPSDSVELPLSLK 339
           ++YGF      NP+D   + L +K
Sbjct: 278 INYGFC--IAGNPTDYRIVHLGVK 299


>gi|119194277|ref|XP_001247742.1| hypothetical protein CIMG_01513 [Coccidioides immitis RS]
          Length = 718

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/205 (28%), Positives = 90/205 (43%), Gaps = 46/205 (22%)

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
           PE    +K+     + L+  YL+ + SF     W+ YI +LP   Q   L Y++  +L+ 
Sbjct: 118 PEFLPAVKEKGASAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 171

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
           +LE            TN++     LR  +  K     P ++F  E F W+  I+ SR   
Sbjct: 172 WLEG-----------TNLL----KLRENMLIKLKTTVPNKIFR-ERFLWASSIIISRAFS 215

Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
              +   V               LVP  DM NH    +VE      ++SQGVV     + 
Sbjct: 216 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 270

Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
             PG++V  +YG ++N  L+L+YGF
Sbjct: 271 LLPGQEVPNNYGPRNNERLMLNYGF 295


>gi|428179814|gb|EKX48683.1| hypothetical protein GUITHDRAFT_136380 [Guillardia theta CCMP2712]
          Length = 335

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)

Query: 76  ASTLQKWLSDSGLPPQKMAIQKVDVGE---RGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           +  L +WL  +G   + +AI++ D G    RG+ A  +IR+G++LL VP SL +    K 
Sbjct: 11  SDQLLEWLQRNGGQAESIAIRQFDHGGEKVRGVGASSSIRRGQELLRVPRSLFLAPGKK- 69

Query: 133 SCPEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP-RQPYSLL---YWTRA 188
               AG  L+Q  V     LA  +  +       ++  Y+ ALP R+    L   Y +  
Sbjct: 70  ----AG--LEQQEVT----LAAEIAKQFQLGSDGQYERYLQALPGREELDGLHPFYASNE 119

Query: 189 ELDRYLE---ASQIR-ERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFG 244
           +L  + +    S IR +R + R   +     D     F    D      F+   F  +F 
Sbjct: 120 DLVLFSDIPYVSAIRKKRTVLRDAWLKHKNGDSASSSFDWEGD------FDWPEFLHAFI 173

Query: 245 ILFSRLVR--LPSMDG-----RVALVPWADMLN-----HSCEVETFLDYDKSSQGVVFTT 292
           +  SR +R  LP+  G      +ALVP  DM+N      +  VE  L  D  +  +V  +
Sbjct: 174 LQLSRRMRIVLPAEKGGTSEETIALVPIIDMINFCGSKEAANVELKLVDDGDA--LVVVS 231

Query: 293 DRQYQPGEQVFI-----SYGKKSNGELLLSYGFVPREGTNPSDSVE 333
            R    GE++ +        KK NG L+  YG +  +   PS  +E
Sbjct: 232 KRSINEGEELLLYAAAAEQKKKENGALVYQYGVMMGDNDVPSQQLE 277


>gi|313214063|emb|CBY42615.1| unnamed protein product [Oikopleura dioica]
          Length = 393

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/273 (21%), Positives = 118/273 (43%), Gaps = 39/273 (14%)

Query: 92  KMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPE----AGEVLKQCS-- 145
           K+ I   D G RG+ +   I + E L+ VP   ++T        E    A +VL+  S  
Sbjct: 22  KLKISDGDCG-RGVFSSAVIEQSELLISVPIDALLTTRKAQHVVESHKSARQVLQNFSTC 80

Query: 146 VPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE 205
           +    LL   L  E    ++S+W+ ++S++P+Q ++       EL+      ++  + ++
Sbjct: 81  LNGTDLLVCALFLELETGENSKWTAFLSSIPKQLWNPFMLDEKELNLLTAKCRLPSKCLK 140

Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV-----RLPSMD--- 257
           +           +++I +++      E+ N E   W F ++ SR       R  + +   
Sbjct: 141 Q-----------KIKISTEFLKALGFEI-NEEILSWCFSVVLSRSFGGSPERCQTRNHFK 188

Query: 258 ------GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYG-KKS 310
                     L P  D++NH  E      ++++       + ++   G+++F++YG  KS
Sbjct: 189 IEVDNSANFCLCPAIDLINHEKEYNCEYRWNENKTAFQVFSRKKILQGQELFVNYGTTKS 248

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
             E+   YGF+      PSD+ ++   L++  K
Sbjct: 249 EYEIYSFYGFIL-----PSDNFQVEFELQRIRK 276


>gi|403370373|gb|EJY85047.1| hypothetical protein OXYTRI_17100 [Oxytricha trifallax]
          Length = 777

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 95/235 (40%), Gaps = 43/235 (18%)

Query: 90  PQKMAIQKVDVGER----GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAG------E 139
           PQ+ A Q  ++ E     G+ A K I   E  L++P  L+I  D  +    A        
Sbjct: 93  PQQFADQTENLSEIPYLIGVAAKKFIGPNEAYLYIPNKLIINEDKLYKSEYAQIFIDHPN 152

Query: 140 VLKQCSVPDWPLLATYLISEASFEKSSRWSNYI-----SALPRQPYSLLYWTRAELDRYL 194
             K     D   L  ++  E    + S W  Y      S LP+      +W    +D  L
Sbjct: 153 EFKNTEKSDQTSLIFFVALELLKGEESYWHPYFETAQDSDLPQ------FWEDQNIDE-L 205

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLV--R 252
           E + I+        + IG Y ++   I + YPDL   E F +E +K ++ I+ +R     
Sbjct: 206 EDALIKAELQMHQVDFIGDY-EIAHGIANHYPDLVHAEKFTIEIYKRAYNIVTTRCFGWS 264

Query: 253 LPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTD-----RQYQPGEQV 302
            PS      LVP+AD  NH         ++  +Q  +F +D     R Y P ++V
Sbjct: 265 CPS----TCLVPFADCFNH---------FNLDNQYEIFNSDLHFKLRDYDPKKKV 306


>gi|451854554|gb|EMD67847.1| hypothetical protein COCSADRAFT_34629 [Cochliobolus sativus ND90Pr]
          Length = 476

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 96/411 (23%), Positives = 168/411 (40%), Gaps = 81/411 (19%)

Query: 82  WLSDSG--LPP--QKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           WL  +G  + P  Q   ++  D G RG+VA ++I + E L  +P S ++  ++     E 
Sbjct: 14  WLKHTGAQINPKIQLEDLRAKDAG-RGVVAKQDIAEHELLFSIPRSSILGVENSILSTEI 72

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
                    P W  L   ++ E     +S W+ Y + LP    +L++WT  EL   L+AS
Sbjct: 73  PPATFAHLGP-WLSLILIMLYEYHNGSASNWAPYFAVLPTDFDTLMFWTEDELAE-LQAS 130

Query: 198 QI---------RERAIERITNVIGTYNDLRLRIFS-------KYPDLFPEEVFNMETFKW 241
            +          E  IE++  VI  + D+   IFS       K  ++   E   +     
Sbjct: 131 AVVNKIGKEGANEVFIEQLLPVIEEFADV---IFSGDERAKHKAKEMRAPENLELMHKMG 187

Query: 242 SFGILFSRLVRLPSMDGRV----------------ALVPWADMLNHS---CEVETFLDYD 282
           S  + ++  V     D  V                 +VP ADMLN     C    F + D
Sbjct: 188 SLIMAYAFDVEPAISDKEVDEEGFAEEEEDAALPKGMVPLADMLNADGDRCNARLFYEKD 247

Query: 283 KSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL-----S 337
               G+     +  Q G+++F  YG     +LL  YG++  +     D VE+P+     +
Sbjct: 248 ----GLEMKALKPIQAGDEIFNDYGPLPRSDLLRRYGYIT-DNYAQYDVVEIPVDLVSQT 302

Query: 338 LKKSDKCYKEKLEALRK-------YGLSASECFPIQITGWPLELMAYAYLVVSPPSMKGK 390
           L      ++E++E L +       Y ++AS  F ++ +  P EL+     ++ P   + +
Sbjct: 303 LAHDGLWHEERIEYLDEQEIVDTGYDIAASIPFSLEESLSP-ELVILVETMLLP---REE 358

Query: 391 FEEMAAAA----SNKMTSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
           FE + +      + KMT K           A +F+    ++ I++Y   L+
Sbjct: 359 FERLQSKGRLPKAEKMTGK-----------AAKFLYKIVQARIAQYPTTLE 398


>gi|340054011|emb|CCC48305.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 572

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 63/228 (27%), Positives = 96/228 (42%), Gaps = 25/228 (10%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAELDRYLEASQIRERAIERITN 209
           LL   LI E      S W   + A P + P    YW   +L   L    + +  + +   
Sbjct: 200 LLILALIYERFVVDLSHWHELLVACPSEYPTVPSYWEFDDLSE-LHGLDVLDDVLTKRAR 258

Query: 210 VIGTYNDLRL------RIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSMDGR- 259
           V   Y+++ L       + +    L  EE    F++E   W+     SR   L ++DGR 
Sbjct: 259 VHDFYSEIMLVLPVIHSLVAGSSGLEREEFLRRFSVENIMWARATFDSRAFNL-NVDGRT 317

Query: 260 -VALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKSNGEL 314
            +ALVP ADM+NHS   +  +   +   G     +     Q   G ++ +SYG   N EL
Sbjct: 318 LLALVPNADMVNHSNRADVLVRMVEPDGGDFVMRIGAGLTQEDIGRELSMSYGPLQNWEL 377

Query: 315 LLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALR-----KYGL 357
           L  YGFV  +  N  D +  PL L  +    +++ +A R     KY L
Sbjct: 378 LQHYGFVLED--NEHDKLPFPLDLPGTADEDRDEWDARRAVLIEKYAL 423


>gi|255723423|ref|XP_002546645.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
 gi|240130776|gb|EER30339.1| hypothetical protein CTRG_06123 [Candida tropicalis MYA-3404]
          Length = 428

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 63/291 (21%), Positives = 119/291 (40%), Gaps = 48/291 (16%)

Query: 70  IDSLENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI--- 126
           +D ++N    +K  + S + P K+ ++      RG+ A  +++KGE ++ +P S ++   
Sbjct: 12  LDWIKNTQDEKKPSTHSYISP-KIDVKDARSSGRGIYATNSLKKGELIMNIPHSFLLNFT 70

Query: 127 TADSKWSCPEAGEVLKQCSVP-------------------------DWPLLATYLISEAS 161
           T  +  S     +      VP                          + LL+ YL  E  
Sbjct: 71  TVMAHISRYNGMQDESHIYVPFDNSDGDQFTNIYSKLTREEILELSSFQLLSIYLTFEKQ 130

Query: 162 FEKSSRWSNYISALPR-QPYSLL--YWTRAELDRYLEASQ-----IRER---AIERITNV 210
              +S W  ++  LP  + ++L+  +W+        +++Q     +R+R     + I ++
Sbjct: 131 RGTNSFWKPFLDMLPSMEDFALMPIHWSDETCKLAPDSTQKSSLKVRDRFENDYKLICDL 190

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLN 270
           I T  DL +        L P +   +     +   L+  L    +      + P+ D +N
Sbjct: 191 IQTKTDLDVTT------LLPRQDVLLSWLCINSRCLYMNLPTSKNTADNFTMAPYVDFMN 244

Query: 271 HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           HSC+    L  D   +G   ++   Y   +QV++SYG  SN  LL  YGF+
Sbjct: 245 HSCDDHCTLKID--GKGFQVSSTCSYNIDDQVYLSYGPHSNDFLLCEYGFI 293


>gi|294882647|ref|XP_002769782.1| hypothetical protein Pmar_PMAR004863 [Perkinsus marinus ATCC 50983]
 gi|239873531|gb|EER02500.1| hypothetical protein Pmar_PMAR004863 [Perkinsus marinus ATCC 50983]
          Length = 433

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 82/351 (23%), Positives = 147/351 (41%), Gaps = 57/351 (16%)

Query: 99  DVGERG--LVALKNIRKGEKLLFVP--PSLVITADSKWSCPEAGEV---LKQCSVPDWPL 151
           D+G +G  LV   +I+ G  ++ +P    ++I  D+    P+ G+V   LK   + +   
Sbjct: 51  DMGTKGMGLVVSTDIKAGTAMITIPRKSKVLINIDTACDDPDFGKVICYLKGAGLDERGC 110

Query: 152 LATYLI-----SEASFEKSSRWSNYISALPR----QPYSLLY---WTRAELDRYLEAS-- 197
           LA +L+     S  S +  +RW  Y + LP     + + LL       A  +  L AS  
Sbjct: 111 LAFWLVLQKLASTRSSKVKTRWCPYAAMLPTAQKLRDHPLLLDDSGMSAIANTALHASVT 170

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYP-DLFPEEVFNMETFKWSFGILFSR----LVR 252
            ++E  + ++ +++G  + L ++   + P D F       + + W+  +L SR    L  
Sbjct: 171 SMKENTLRQLGHLLGILSALEVK--DEGPVDTFIRHTKIKQLWLWAHAVLLSRSGFGLSG 228

Query: 253 LPSMDGRVA-----LVPWADMLNH---SCEVETFLD------YDKSSQGVVFTTDRQYQP 298
            P   G +A     ++P  D  NH       E  +       +  SS+ +     R  + 
Sbjct: 229 EPGDSGVMAGEGLLMIPLVDFANHDSSGGNAEIRIQHASTGWFGGSSESISLVAKRDIKA 288

Query: 299 GEQVFISYGKKSNGELL------LSYGF-VPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
           GE++ ISY   + G++L       +YGF + R G    D   +P   K      ++ +  
Sbjct: 289 GEEILISY---TGGDILSAEQSIFTYGFRMDRLGA--PDKFAVPTVSKPDGSDMRDVIRR 343

Query: 352 LRKYGLSASEC--FPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASN 400
           L    ++  E     IQ   +P  L+AY  + +     +GK E +  A SN
Sbjct: 344 LVHMDVAKDEADVITIQKDTYPEALVAYMCIDILAEK-RGKLESLCQAYSN 393


>gi|72389967|ref|XP_845278.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62359268|gb|AAX79710.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801813|gb|AAZ11719.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 583

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 90/212 (42%), Gaps = 28/212 (13%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEASQI 199
           LL   LI E    ++S W + + + P   P    YW   +L          D   +  ++
Sbjct: 202 LLVLALIYERFVARTSHWKDLLLSCPTDFPTVPSYWNWNDLSGLYGLDVLDDVLAKQERL 261

Query: 200 RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEV---FNMETFKWSFGILFSRLVRLPSM 256
           R+   E +T+V+    D           +  EE    F +E   W+  +  SR   L ++
Sbjct: 262 RQFHTE-VTSVLPLIYD----ALEGCSGIEREEFMGHFTIENIMWARAVFDSRAFNL-NV 315

Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
           DGRV  ALVP ADM+NHS   +  +   +   G     V     +   G ++ +SYG   
Sbjct: 316 DGRVVLALVPCADMINHSNHPDVLIRRVEPCGGDFVMQVGAGLTREDVGRELGMSYGPLQ 375

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           N ELL  YGFV  +  N  D +  P  + ++D
Sbjct: 376 NWELLQHYGFVLDD--NEHDKLPFPFDVHEAD 405


>gi|346327621|gb|EGX97217.1| SET domain-containing protein, putative [Cordyceps militaris CM01]
          Length = 371

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 122/308 (39%), Gaps = 62/308 (20%)

Query: 73  LENASTLQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKW 132
           ++   +L +W +D G+    +   ++     G+VA ++I KGE L+ VP   +    S+ 
Sbjct: 1   MDTIDSLLQWAADQGVVLDGVRPSRIPGRGLGMVATRHIHKGEVLIAVPTPAI---RSRH 57

Query: 133 SCPEA--GEVLKQCSV------------PDWPLLATYLISEASFEKSSRWSNYISALPRQ 178
           + P++  G+     ++            PD     T + S A FE S+ +          
Sbjct: 58  TLPKSLMGKAPTNMTLHGLLAADLLLHPPDVAAWGTLVPSLADFESSTPFF--------W 109

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
           P +L      E  + L   Q R R                 R +S     FP        
Sbjct: 110 PETLQDLLPPEAKKLLRTQQQRFR-----------------RDWSHAHAGFPSVAEQDYL 152

Query: 239 FKW------SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVV 289
           + W      SF       +  P  D R+AL+P ADM NH+   C V        S++   
Sbjct: 153 YAWFLVGTRSFYYQVDETLPYPWHD-RLALLPVADMFNHASVGCAVAF------STEVYD 205

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKL 349
            T DR Y+  E+++ SYG  SN  LL  YGF+ ++  NP D + L   L    +   E  
Sbjct: 206 VTADRDYEADEELYTSYGAHSNDFLLAEYGFMLQD--NPHDQLCLDAVLLA--RLSAEHK 261

Query: 350 EALRKYGL 357
            AL + GL
Sbjct: 262 AALLQRGL 269


>gi|358395796|gb|EHK45183.1| hypothetical protein TRIATDRAFT_39811 [Trichoderma atroviride IMI
           206040]
          Length = 484

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 82/166 (49%), Gaps = 12/166 (7%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL--RLRIF 222
           S+ W+ YI  LPR       WT  E + +L+ + +      +++ +   Y++L  +    
Sbjct: 119 STPWTEYIKFLPRSISVPTMWTSEERE-FLQGTSLESSVNAKLSVLSREYDELSEKASTL 177

Query: 223 SKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYD 282
             + DL  E    +E +  +  +  SR + LP     +A+VP  DM NHS +      YD
Sbjct: 178 PFWNDLLSESGM-LEDWILADALYRSRCLELPH--AGLAMVPGLDMANHSPKY--LARYD 232

Query: 283 KSSQGVVF---TTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPRE 324
           ++ +G V    ++      GE++ ISYG+ KS  E+L SYGF+ +E
Sbjct: 233 ETPEGDVVLLPSSGSGVSSGEEITISYGEAKSAAEMLFSYGFIDQE 278


>gi|303311395|ref|XP_003065709.1| hypothetical protein CPC735_049340 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105371|gb|EER23564.1| hypothetical protein CPC735_049340 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320039566|gb|EFW21500.1| hypothetical protein CPSG_01657 [Coccidioides posadasii str.
           Silveira]
          Length = 636

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 32/205 (15%)

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPR--QPYSLLYWTRAELDR 192
           PE    +K+     + L+  YL+ + SF     W+ YI +LP   Q   L Y++  +L+ 
Sbjct: 22  PEFLPAVKEKGALAFLLMDQYLLGDESF-----WAPYIRSLPEDSQLTRLEYYSDEDLE- 75

Query: 193 YLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVR 252
           +LE + + +     +  +  TY ++ L++  + P+   +  +  E F W+  I+ SR   
Sbjct: 76  WLEGTNLLKLRENMLIKLKTTY-EVGLQMLKESPNKNTKN-YTWERFLWASSIIISRAFS 133

Query: 253 LPSMDGRV--------------ALVPWADMLNHS--CEVETFLDYDKSSQGVV-FTTDRQ 295
              +   V               LVP  DM NH    +VE      ++SQGVV     + 
Sbjct: 134 SEVLKDYVKNSKSINVTGGEFSVLVPLLDMTNHQPLAQVEW-----RTSQGVVGLIVHKT 188

Query: 296 YQPGEQVFISYGKKSNGELLLSYGF 320
             PG++V  +YG ++N  L+L+YGF
Sbjct: 189 LLPGQEVPNNYGPRNNERLMLNYGF 213


>gi|303284299|ref|XP_003061440.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456770|gb|EEH54070.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 644

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 79/204 (38%), Gaps = 39/204 (19%)

Query: 151 LLATYLISEASFEKS---SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERI 207
           LLA  ++ E++  +    S W+ Y++ LPR+  SL+ W   EL   L+ S+   RA ERI
Sbjct: 214 LLALGVLHESAHSRDDPPSHWATYVTLLPRRVGSLMEWDERELS-ALQGSRHATRARERI 272

Query: 208 TNVIGTYNDLRLRIFSKY----PDLFPEEVFNMET------FKWSFGILFSRLVRLPSMD 257
                 ++D R + F         LF E+            ++W+   + +R    P  +
Sbjct: 273 A----LFDDARAKCFPALLRADESLFGEDEATRRAHESPRAWRWAVATVLARAFYFPDAN 328

Query: 258 GRVALVPWADMLNHSCEVETFLDYD-------------------KSSQGVVFTTDRQYQP 298
               L P  D+ NH  E E  +  D                   K +  V         P
Sbjct: 329 -EHGLCPGLDLFNHCSEAEKCVVEDGTADDEGDEGDEGEGKYAHKEAPRVTLRAGVGGVP 387

Query: 299 -GEQVFISYGKKSNGELLLSYGFV 321
            G Q+F  Y   + G  LL +GF 
Sbjct: 388 AGTQIFHDYADHARGGCLLEFGFT 411


>gi|406868331|gb|EKD21368.1| SET domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 480

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 92/208 (44%), Gaps = 26/208 (12%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIE-RITNVIGTYNDLRLRIFS 223
           S+ W+ Y+  LP        W+  E  R +      E A+  +  ++I    DLR     
Sbjct: 117 SNPWTEYVRMLPESIPVPTMWSEEE--RVMLTGTSLETAVSAKCASLISEIEDLR----G 170

Query: 224 KYPDL-------FPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVE 276
           K  ++       + EE    E   WS    + R   L   +   +++P  DM+NH+ E  
Sbjct: 171 KTAEIAWCQKCWWEEESLRYEN--WSLLDAWYRSRSLEVPNAGESMIPCVDMVNHAAEAN 228

Query: 277 TFLDYDKSSQ---GVVFTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSV 332
           ++  Y+++S     ++   D Q +   +V ISYG  KS  E+L SYGF+  +GT+   ++
Sbjct: 229 SY--YERTSDNNIALLLRPDTQLEAESEVTISYGSSKSEAEMLFSYGFIDEQGTSKGLTL 286

Query: 333 ELPLS----LKKSDKCYKEKLEALRKYG 356
            +  S    L K+      K + LR +G
Sbjct: 287 NIDPSPDDPLGKAKAAAFSKSKTLRIFG 314


>gi|50287013|ref|XP_445936.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49525242|emb|CAG58855.1| unnamed protein product [Candida glabrata]
          Length = 599

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/216 (24%), Positives = 96/216 (44%), Gaps = 40/216 (18%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTY-------NDL 217
           S ++  YI ALP+  +S L W  +EL+  L  + +     E++ +VI  +        DL
Sbjct: 101 SKKFHPYIQALPKLIHSPLVWNPSELETLLVGTNLGGSVKEKLCSVIKEWIVLIESREDL 160

Query: 218 RLRI-------FSKYPDLFPEEVFNM-----------------ETFKWSFGILFSR---- 249
           + ++       F  Y DL  E+++N+                   F +S  +  SR    
Sbjct: 161 KSKVDGKYLINFENYNDLVYEDIYNIFVKPVEFEFADLVWLSFPAFLYSHLVFTSRAFPE 220

Query: 250 -LVRLPSMDGRVALVPWADMLNH--SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISY 306
            ++   + +  V L+P  D++NH  + +V+ F    ++     +      + G+++  +Y
Sbjct: 221 YVIDKNANEFSVILLPILDLMNHNYNSKVQWFPKEHQNGTSFCYQCLADMKAGDELDNNY 280

Query: 307 GKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSD 342
           G K N ELL  YGFV  +  N  D+V L + L + +
Sbjct: 281 GGKGNEELLNGYGFVIDD--NIFDTVALRIKLSEEE 314


>gi|156374449|ref|XP_001629819.1| predicted protein [Nematostella vectensis]
 gi|156216828|gb|EDO37756.1| predicted protein [Nematostella vectensis]
          Length = 281

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 73/281 (25%), Positives = 123/281 (43%), Gaps = 25/281 (8%)

Query: 69  EIDSLENASTLQKWLSDSGLPPQKM--AIQKVDVGERGLVALKNIRKGEKLLFVPPSLVI 126
           E+DS    S+   W  D+ L       ++QK      G+VA+++I   E L  VP  L++
Sbjct: 11  ELDSA--ISSFLLWCHDNDLKLNNKVSSMQKGSCHRYGMVAMEDISPDECLFKVPRGLLL 68

Query: 127 TADS-KWSCPEAGEVLKQ--CSVPDWPLLATYLISEASFEKSSRWSNYISALP-----RQ 178
              +   S    G+V++        W  L   L+ E +   +S W  Y+  +P      Q
Sbjct: 69  EPKTCGISKILTGKVIQNMLSQHEGWVPLLLALMYEYT-NPTSLWKPYMDIVPGIDILDQ 127

Query: 179 PYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
           P   ++W        L+ +   +   +    +   Y  + + I  K+   F  +  ++  
Sbjct: 128 P---MFWPDETRQSLLQGTGFEDDVEDDKQRIERQYFTVAVPIMKKFKKFFDLKRHSLSL 184

Query: 239 FKW--SFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
           +K   +F + +S     PS  G     +VP AD+LNH       L++ +    +V T  +
Sbjct: 185 YKHMAAFIMAYSFTEDSPSFHGNNVPVMVPMADILNHHSNNNARLEFGEEELSMVST--Q 242

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREG-TNPSDSVEL 334
               G +VF +YG+ +N  LL SYGFV  EG  NP+D+V L
Sbjct: 243 HILKGGEVFNTYGQLANCHLLQSYGFV--EGPDNPNDTVSL 281


>gi|397563943|gb|EJK44003.1| hypothetical protein THAOC_37500 [Thalassiosira oceanica]
          Length = 595

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 42/155 (27%), Positives = 73/155 (47%), Gaps = 31/155 (20%)

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDK-------- 283
           + F ++ F+++  ++ SR   +   DG + L+P+ D  NH        DYD         
Sbjct: 296 QCFTVDGFRYAVAVVRSRSFFV---DGALRLLPYVDYANHD-------DYDSNELVGGGI 345

Query: 284 -----SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVP----REGTNPSDSVEL 334
                SS+GV+  + +  + G++V ISYG K   + +L +GFVP          + + EL
Sbjct: 346 GMLWGSSKGVLLKSGKALRVGDEVRISYGPKGPADYILDHGFVPPMCQLSTQGGAITAEL 405

Query: 335 PLSLKKSDKCYKEKLEAL--RKYGLSASECFPIQI 367
              + +SD+   +KL+ L    Y L+  E  P Q+
Sbjct: 406 SFEVDESDRFRDDKLDILEFETYDLAPME--PAQV 438


>gi|402581480|gb|EJW75428.1| hypothetical protein WUBG_13665, partial [Wuchereria bancrofti]
          Length = 118

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 11/107 (10%)

Query: 239 FKWSFGILFSRLV----RLPSM-----DGRVALVPWADMLNHSCEVETFLDYDKSSQGVV 289
           F W++ I+ +R +    +L  +     D  +A+VP  DMLNHS + +    +D       
Sbjct: 14  FLWAWHIVNTRCIYRNNKLHPLIDNTEDDSLAIVPLIDMLNHSNDSQCCAIWDGKLNLCK 73

Query: 290 FTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
               R  + GEQ+FI YG  +NG L + YGF  ++  N  + VE+ L
Sbjct: 74  VIVTRPIRKGEQIFICYGSHTNGSLWIEYGFYLKD--NICNKVEISL 118


>gi|365989204|ref|XP_003671432.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
 gi|343770205|emb|CCD26189.1| hypothetical protein NDAI_0H00150 [Naumovozyma dairenensis CBS 421]
          Length = 589

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 114/272 (41%), Gaps = 58/272 (21%)

Query: 117 LLFVPPSLVITADS-----KWSCPEAGEVLKQC--SVPDWPLLATYLISEASFEKSSRWS 169
           L+ VP +L++T  +      WS  + G +      ++    L    L  EA   K S+W 
Sbjct: 51  LISVPTNLLLTYQNATDFFNWSIAKGGNLPTNNPNAITQLYLSHLKLNPEA---KKSQWD 107

Query: 170 NYISALP---RQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL--------- 217
            Y+  L     QPY   +WT  +L + L+ + +  +  +    +I  Y +L         
Sbjct: 108 KYVEILSLDLNQPY---FWTVDQLQQ-LKGTDLYIKIQQDFATIIQEYIELLQILKVDIL 163

Query: 218 ---RLR-------IFSKYPDLFPEEV--FNMETFKWSFGILFSR------LVRLPSMDGR 259
              +L+       I S  P L   ++   +  ++ WS  I  SR      L    S  G 
Sbjct: 164 DQEKLQTATISHYINSHLPTLLDGKLPWNHFVSYLWSHCIFKSRAFPQLLLNNAGSDVGN 223

Query: 260 VALV---PWADMLNHSCEVETFLDYDKSSQG--------VVFTTDRQYQPGEQVFISYGK 308
           + L    P  D+LNH  +V   + ++ S+          + F T      G+Q+F +YG 
Sbjct: 224 INLAFLFPIVDLLNHKNDV--VVKWESSNDINNKNDNKVLTFITQETLHVGDQIFNNYGN 281

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           KSN ELLL YGF+ +E  N  D  EL L L +
Sbjct: 282 KSNEELLLGYGFI-QENNNNYDYSELTLKLNE 312


>gi|348679311|gb|EGZ19127.1| hypothetical protein PHYSODRAFT_493969 [Phytophthora sojae]
          Length = 776

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/202 (25%), Positives = 87/202 (43%), Gaps = 27/202 (13%)

Query: 151 LLATYLISEASFEKSSRWSNYISALP-----RQPYSLLYWTRAELDRYLEASQIRERAI- 204
           LL  +L+ E      SRW+ Y+S LP         S L+++  E+   L+  ++ + A  
Sbjct: 11  LLTAFLLWEELSGHESRWTPYLSLLPPLSSRDDVVSPLFFSSDEVVEALQDERMVKTARA 70

Query: 205 --ERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVAL 262
             +R     G +     R+F  +P L   +      + W+  ++ SR     S+ G+  L
Sbjct: 71  ERQRAKKAHGRFK----RLFRSFPAL---KALEWPQYAWARFLVNSRAF---SIQGQRVL 120

Query: 263 VPWADMLNHSCEVET--------FLD-YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           VP+ D+ N   + +         FL  +D  S G+    DR    G+Q+F  YG  SN  
Sbjct: 121 VPFGDIFNGKPDDDAREHDNGQRFLQLHDLQSMGMTIRADRGAAKGKQLFEDYGDNSNYV 180

Query: 314 LLLSYGFVPREGTNPSDSVELP 335
             L +GF+  +G     +  LP
Sbjct: 181 YFLHHGFLMGDGCFDCAAFRLP 202


>gi|294657576|ref|XP_459875.2| DEHA2E13090p [Debaryomyces hansenii CBS767]
 gi|199432797|emb|CAG88116.2| DEHA2E13090p [Debaryomyces hansenii CBS767]
          Length = 505

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 76/298 (25%), Positives = 127/298 (42%), Gaps = 53/298 (17%)

Query: 81  KWLSDSGLP-PQKMAIQKV--DVGERGLVALKNIRKGEKLLFVPPSLVITADS---KWSC 134
           +WLS+  +    K+ ++ +  D   RG+VA ++I + E+L  +P   +I  D+     + 
Sbjct: 13  EWLSEENVTISSKLVVKDLRKDNQGRGMVANEDIEEDEELFSIPRETIINIDNCSLTKTN 72

Query: 135 PEAGEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALP---RQPY---SLLYWTRA 188
            +A + L   S+  W  L   L+ E      S+WS Y + LP    Q Y    L++W+  
Sbjct: 73  SKARDGL--LSLNQWEALIIVLLYELKVNGKSKWSAYFNTLPIKDTQNYKFNQLMFWSHE 130

Query: 189 ELDRYLEASQIRERAIERI--TNVIGTYNDLRLRIFS--KYPDLFP---EEVFNMETFKW 241
           +L   L  S I    I+RI        YN L  ++      P+LF    EE   + +   
Sbjct: 131 QLAD-LSPSLI----IDRIGKDEAEAMYNKLFPKVVEDLNIPELFKVTLEEYHKVASLIM 185

Query: 242 SFGILFSR-------------------LVRLPSMDGRV--ALVPWADMLNHSCEVETF-L 279
           S+     R                         ++G    ++VP AD+LN   ++    L
Sbjct: 186 SYSFDVERPEFNQVEDDEAEDDEEEDDEGDGTILNGNYYKSMVPLADILNADTKLHNASL 245

Query: 280 DYDKSSQGV-VFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            Y   + GV V  + +  + GEQ++ +Y    N E+L  YG+V   G+   D  E+PL
Sbjct: 246 VY---TPGVLVMKSVKPIKKGEQIYNTYSDHPNSEILRRYGYVETNGS-ELDFGEIPL 299


>gi|302410103|ref|XP_003002885.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
 gi|261357909|gb|EEY20337.1| SET domain-containing protein RMS1 [Verticillium albo-atrum
           VaMs.102]
          Length = 469

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 124/296 (41%), Gaps = 41/296 (13%)

Query: 81  KWLSDSGLPPQKMAIQKVDV----GERGLVALKNIRKGEKLLFVPPSLVI---TADSKWS 133
           +W   +G   +   +Q VD+      RG++A ++I +   L  +P   +I   T++    
Sbjct: 13  QWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPEETTLFTIPRQAIINVLTSELPQK 72

Query: 134 CPEA--GEV--LKQCSVP--DWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTR 187
            P+   G +  +   + P   W  L   ++ E     SSRW  Y   LP+Q  + ++W+ 
Sbjct: 73  LPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDSSRWKPYFDILPQQFDTPIFWSD 132

Query: 188 AELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PE--------EVFNME- 237
            EL   L+ + +    I ++ +    +    L I    P +F PE        E+ ++  
Sbjct: 133 GEL-LELQGTSLTAEKIGKVES-DAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLAH 190

Query: 238 -----TFKWSFGILFSRLVR------LPSMDGR--VALVPWADMLNHSCEVETFLDYDKS 284
                   ++F +             +   +GR  + +VP AD LN + E    +++ +S
Sbjct: 191 RMGSTIMAYAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGES 250

Query: 285 SQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
            +      D   + G+QV   YG     ELL  YG+V  E +   D VE+P +L K
Sbjct: 251 LEATAIRAD--IRAGDQVLNYYGPLPTSELLRRYGYVTPEHSR-YDVVEVPWTLVK 303


>gi|346970168|gb|EGY13620.1| SET domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 485

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 88/199 (44%), Gaps = 20/199 (10%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSK 224
           S+ W+ Y+  LPR+      W+  E +  L+ + +      +I  +   +  LR     K
Sbjct: 116 STPWTEYVKFLPREVPVPTMWSEQERE-LLQGTSLELAVSAKIQALTSEFEALR----EK 170

Query: 225 YPDL-FPEEVF----NMETFKWSFGILF--SRLVRLPSMDGRVALVPWADMLNHSCEVET 277
             DL F   +F    N+    W     +  SR + LPS    V++VP  D+ NH+     
Sbjct: 171 SSDLPFWHAIFWDTNNVSLADWFLVDAWYRSRSLELPS--AGVSMVPVLDLANHAPAPSA 228

Query: 278 FLDYDKSSQGVV---FTTDRQYQPGEQVFISYGK-KSNGELLLSYGFVPREGTNPSDSVE 333
           + +     +G V            G++V ISYG  KS  E+L SYGF+  +    +D+V 
Sbjct: 229 YYEESARREGDVELRLRPGSTLAAGDEVTISYGAGKSGAEMLFSYGFI--DPARSTDTVA 286

Query: 334 LPLSLKKSDKCYKEKLEAL 352
           LPL+  + D   K K+ + 
Sbjct: 287 LPLAPLEDDPLGKAKVHSF 305


>gi|302839507|ref|XP_002951310.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
           nagariensis]
 gi|300263285|gb|EFJ47486.1| hypothetical protein VOLCADRAFT_91853 [Volvox carteri f.
           nagariensis]
          Length = 730

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL--PLSLK 339
           D +S G         +PGE+++ISYG+KSN ELL+ YGF   +  NP D + L  PL  +
Sbjct: 374 DSASLGATLHGGAHVRPGEELYISYGEKSNEELLMLYGFALED--NPHDHLMLYCPLPPR 431

Query: 340 KS-DKCYKEKLEALRKYGL 357
              D     ++E L+ YGL
Sbjct: 432 AEWDDVMYARMELLQAYGL 450


>gi|67900706|ref|XP_680609.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
 gi|40742521|gb|EAA61711.1| hypothetical protein AN7340.2 [Aspergillus nidulans FGSC A4]
 gi|259483305|tpe|CBF78584.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
          Length = 441

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/144 (29%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 228 LFPEEVFNMETFKWSFGILFSRLVRLPS--------MDGRVALVPWADMLNHSCEVETFL 279
           +FPE  +    + W+  I+ SR     S         +  + +VP+AD  NH  +    +
Sbjct: 212 VFPETDWKAVAYNWA--IINSRSFYYVSPGKNEPSDWNDAIGMVPFADYFNHRDDASCEV 269

Query: 280 DYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLK 339
            +D+ S   +F  ++    GE++++SYG  SN  LL+ YGF   +  NPSD V L     
Sbjct: 270 TFDRDS--YIFRAEK----GEEIYMSYGPHSNDFLLVEYGFYLDD--NPSDRVYL----- 316

Query: 340 KSDKCYKEKLEALRKYGLSASECF 363
             D     KL    K  L+  ECF
Sbjct: 317 --DDIILPKLTRSEKKELAERECF 338


>gi|294950065|ref|XP_002786443.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239900735|gb|EER18239.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 551

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 44/86 (51%), Gaps = 2/86 (2%)

Query: 253 LPSMDGRVALVPWADMLNHSCEVET-FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSN 311
           LP       +VP AD+LNH    +  F  +DK S+  V T +     G ++FI+YG   N
Sbjct: 328 LPPDQSITCVVPGADLLNHHQRGQCGFPRFDKKSRSFVITAEANVPAGSELFINYGGLQN 387

Query: 312 GELLLSYGFVPREGTNPSDSVELPLS 337
            E L+ YGF      NP DSV L L+
Sbjct: 388 WEQLMYYGFC-EFAQNPYDSVTLDLA 412


>gi|354548388|emb|CCE45124.1| hypothetical protein CPAR2_701280 [Candida parapsilosis]
          Length = 565

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/96 (34%), Positives = 54/96 (56%), Gaps = 11/96 (11%)

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           L+P  D+LNH+ +  T + +  +  G +F +D     GE++F +YG+K N ELLL+YGF 
Sbjct: 216 LLPVVDLLNHNPK--TKVQWSGTDGGFLFQSDDA-SSGEELFNNYGQKGNEELLLAYGFA 272

Query: 322 PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGL 357
                NP+DS  L + +  S      KL+ ++  G+
Sbjct: 273 IE--NNPADSAALKIKIPDS------KLQVVKDLGI 300


>gi|340509072|gb|EGR34645.1| SET domain protein [Ichthyophthirius multifiliis]
          Length = 326

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 78/174 (44%), Gaps = 14/174 (8%)

Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSL-LYWTRAELDRYLEASQIRERAIE 205
           P   ++  Y+  E      S WS YI+ LP       +++   +L+ +L  S    + +E
Sbjct: 157 PIVKIIEEYIHEEKFINPDSLWSIYINILPSDYNQYPIFFPEEDLE-WLSGSPFLNQVLE 215

Query: 206 RITNVIGTYNDLRLRIFSKYPDLFPEEVFN-METFKWSFGILFSRLVRLPSMDGRV--AL 262
           +  ++   Y+D+          + PE   N  + F W+     SR+  L  +DG+   A 
Sbjct: 216 KKADIKRDYDDI--------CSIAPEFAINTFQDFCWARITASSRVFGL-QIDGQKTDAF 266

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
           VP ADMLNH    +T   YD + QG +         G+QV+ SYG+K N    L
Sbjct: 267 VPLADMLNHRRPKQTSWQYDDNRQGFIIEALEDIPRGDQVYDSYGQKCNSRFSL 320


>gi|317151155|ref|XP_001824477.2| ribosomal N-lysine methyltransferase [Aspergillus oryzae RIB40]
 gi|391868702|gb|EIT77912.1| hypothetical protein Ao3042_05894 [Aspergillus oryzae 3.042]
          Length = 407

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 123/280 (43%), Gaps = 43/280 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G++A +NI + E ++ VP   ++T++     P           P   L A +L +  + E
Sbjct: 39  GMIATRNIEEDEAIVTVPLKAMLTSER---IPSYFTSKFPDGTPTHALYAAFL-TNGNAE 94

Query: 164 KSSRWSNYISALP-RQPYSL---LYWTRAELDRYLEAS--------QIRER-AIERITNV 210
               ++ +    P RQ +     + W+ + L  YL  S        Q R++   E     
Sbjct: 95  DLEEFNAWRKTWPSRQDFEDSMPILWSES-LRNYLPPSISSHWHSIQSRDKLQYETTHQN 153

Query: 211 IGTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------D 257
           +    + RLR     + S +PD       + ETF + + I+ +R     +P        +
Sbjct: 154 LLAQQEQRLRTAWDIVVSVFPDT------DWETFSYHWLIVNTRSFFYLMPGQEPPEDRN 207

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
             +AL+P+AD  NHS +V   + +D   +  VF   + Y  GE++++SYG   N  L   
Sbjct: 208 DAMALLPFADYFNHS-DVACNVKFD--GENYVFRATKHYDEGEEIYMSYGPHPNDFLFAE 264

Query: 318 YGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG 356
           YGF   E  N S+++ L  + LK      +E+LE  + YG
Sbjct: 265 YGFYLDE--NESETLYLDDIILKDLSTSLQEELEFQQYYG 302


>gi|189189204|ref|XP_001930941.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187972547|gb|EDU40046.1| SET domain-containing protein RMS1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 476

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 72/298 (24%), Positives = 115/298 (38%), Gaps = 54/298 (18%)

Query: 82  WLSDSGL---PPQKMA-IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           WL  SG    P  K+  ++  D G RG+VA ++I + E L  +P + +++ ++     E 
Sbjct: 14  WLRQSGAEISPKIKLEDLRNKDAG-RGVVASQDIAEHELLFRIPRASILSVENSILSTEI 72

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
                    P W  L   ++ E     +S W+ Y + LP +  +L++WT  EL   L+AS
Sbjct: 73  PAATLSLLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAE-LQAS 130

Query: 198 QI---------RERAIERITNVIGTYNDL-------------------RLRIFSKYPDLF 229
            +          E  +E++  VI  + D+                    L +  K   L 
Sbjct: 131 AVVGKVGKESADEAFLEQLLPVIEEFADIVFSGDERAKDKAKEMRSLENLELMHKMGSLI 190

Query: 230 PEEVFNME------TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLD 280
               F++E                     LP       +VP ADMLN     C    F +
Sbjct: 191 MAYAFDVEPATPTKEVDEEGFAEEEEDAALPK-----GMVPLADMLNADADRCNARLFYE 245

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSL 338
            D      +    +  Q GE++F  YG     +LL  YG+V  +     D VE+P  L
Sbjct: 246 KDCLEMKAL----KPIQAGEEIFNDYGPLPRSDLLRRYGYVT-DNYAQYDVVEIPTDL 298


>gi|71425330|ref|XP_813082.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877934|gb|EAN91231.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 31/224 (13%)

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRV--ALVPWADMLNHSCEVETFLDYDKSSQG-- 287
           E F++E   W+     SR   L ++DGRV  ALVP ADM+NH    +  +   + + G  
Sbjct: 290 ECFSIEAMMWARATFDSRAFNL-NVDGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDF 348

Query: 288 --VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK---SD 342
              +  +      G ++++SYG   N ELL  YGFV  EG N  D +  P    +    D
Sbjct: 349 VMQIGASLTAQDIGREIWMSYGPLQNWELLQFYGFV-LEG-NEHDRLPFPFDFPEGVVGD 406

Query: 343 KCYKEKLEALRKYGLS-ASECFPIQITGWPLELMAYAYLVVSPP------SMKGKFEEMA 395
           +    +   +  YGL  A  C+       P  L+A   + ++          KG F  + 
Sbjct: 407 EWDGRRAALVATYGLHLAGRCWICHDGRPPPALVALLRVHLAEAEEFDTMERKGPFASLG 466

Query: 396 AAASNKM--TSKKDIKCPEIDEQALQFILDSCESSISKYSRFLQ 437
           A    ++  T    I+C          ILD   +S+ +  R L+
Sbjct: 467 AGTEARVVATIADTIRC----------ILDLFSTSLEEDERLLE 500


>gi|260946533|ref|XP_002617564.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
 gi|238849418|gb|EEQ38882.1| hypothetical protein CLUG_03008 [Clavispora lusitaniae ATCC 42720]
          Length = 430

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 65/270 (24%), Positives = 109/270 (40%), Gaps = 22/270 (8%)

Query: 151 LLATYLISEASFEKSSRWSNYISALPR-QPYSL--LYWTRAELDRYLEASQIRER-AIER 206
           LLA YL+ E     +S W  +I  LP  +  SL  + W   ++    +  ++  R A + 
Sbjct: 122 LLAIYLVLEKERGAASFWKPFIDMLPSIEELSLAPVVWKVLQVPHCDDLWRMLSRSARKH 181

Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP-----SMDGRVA 261
             +V+  +     + ++   DL     F   +F W++  + SR + +             
Sbjct: 182 AESVVARFE----KDYAVVCDLPSVPAFERSSFLWAWMCINSRCLYMSMPQAKDTSDNFT 237

Query: 262 LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF- 320
           + P+ D LNHS E +  +  D    G    T   Y+P E+++ SYG  SN  LL  YGF 
Sbjct: 238 MAPYVDFLNHSNEDQCGIKIDP--HGFHVLTSSAYKPQEELYFSYGPHSNEFLLCEYGFT 295

Query: 321 VPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPIQITGWPLELMAYAYL 380
           +P    N  D  +  L L +      E++  L+  G         +   +  E+ A A L
Sbjct: 296 LPHNKWNYIDITDFILPLLR-----PEQVSFLKDMGYYGDYTVNTEGMSFRTEI-ALATL 349

Query: 381 VVSPPSMKGKFEEMAAAASNKMTSKKDIKC 410
             S P    K + +    S+    +K  K 
Sbjct: 350 QESEPQQSRKLKALVEGMSDGAVFEKQSKV 379


>gi|156035929|ref|XP_001586076.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980]
 gi|154698573|gb|EDN98311.1| hypothetical protein SS1G_13169 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 291

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)

Query: 229 FPEEVFNMETFKWSFGILFSRLVRL-----------PSMDGRVALVPWADMLNHSCEVET 277
           FPE   + + F +++ I+ SR               P  + R+AL P+AD +NHS +   
Sbjct: 140 FPEPPISYDEFMYNYSIVNSRTFYYLSPTIKSSKLQPPKEDRLALNPFADYMNHSSQ--P 197

Query: 278 FLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD 330
            ++   S  G   T  +  + G +V ISYG  +N  LL+ YG   R G N  D
Sbjct: 198 TVNATLSRAGYTLTASQPIKEGSEVHISYGSHNNDFLLIEYG---RHGNNSED 247


>gi|392569623|gb|EIW62796.1| SET domain-containing protein [Trametes versicolor FP-101664 SS1]
          Length = 509

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 90/205 (43%), Gaps = 39/205 (19%)

Query: 258 GRVALVPWADMLNHSCEVETF-LDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
           G VA+VP ADMLN   E E   L YD+    +V T  +  + GEQ++ +YG   N +LL 
Sbjct: 261 GDVAMVPMADMLNARFESENAKLFYDERELKMVST--KPIKAGEQIWNTYGDPPNSDLLR 318

Query: 317 SYGFV-------PREG-TNPSDSVELPLSL------KKSDKCYKEKL-----EALRKYGL 357
            YG V       P  G  NP D VE+   L      KK     KE++     EA     +
Sbjct: 319 RYGHVDLVPLSAPLSGLGNPGDVVEVRADLIVSVAAKKVKHDLKERVDWWLEEADDDVFV 378

Query: 358 SASECFPIQITGWPLELMAYAYLVVSPPSMKGKFEEMAAAASNKMTSKKDIKCPEIDEQA 417
             ++C   +      EL+++  L++ P   K ++E        K   K  +  P++D+  
Sbjct: 379 LRTDCELAE------ELVSFVRLLLLP---KDEWE--------KAAQKSKLPKPKLDKDV 421

Query: 418 LQFILDSCESSISKYSRFLQVKELL 442
           L   +D  E  +  Y   L+  E L
Sbjct: 422 LTIAVDVLEKRLKDYPTTLEEDEAL 446



 Score = 38.9 bits (89), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 3/119 (2%)

Query: 76  ASTLQKW--LSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWS 133
           AS    W  L    L  +K+ I +     RG +AL++I +   L  +P  L ++  +   
Sbjct: 6   ASEFVHWFQLQHGNLDTEKVGIVEFPEHGRGAIALQDIPEDYTLFTIPRELTLSTRTCSL 65

Query: 134 CPEAGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELD 191
               G+  K+  + + W  L   +I E S    S+WS Y++ LP    + ++W + +L+
Sbjct: 66  PTLMGQAWKEHGLHEGWAGLILCMIWEESRGSDSKWSGYLATLPSSFDTPMFWGQEDLN 124


>gi|167534011|ref|XP_001748684.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772925|gb|EDQ86571.1| predicted protein [Monosiga brevicollis MX1]
          Length = 945

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 67/286 (23%), Positives = 119/286 (41%), Gaps = 46/286 (16%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITAD------------SKWSCPEAGEVLKQCSVPDWPL 151
           G+VA  ++ +G+ L  +P S +IT +            + W+  E            W  
Sbjct: 523 GMVAATSLAEGDVLFEIPRSALITVNNSQINQQLSEMAAAWAEEEDEPEDGDGDPRQWTQ 582

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSL--LYWTRAELDRYLEASQIRERAIERITN 209
           L   ++ E + + +SR+  Y+  LP        + WT AE D+ L   ++ +     +  
Sbjct: 583 LVCAMMVENT-DPASRFRPYLDFLPDHTTLAHPMLWTSAERDQLLAGLRLAQDVENDLEM 641

Query: 210 VIGTYNDLRLRIFSKYPDLFPEEV-FNMETFKWSFGILFSRLVRLPSMD----GRVALVP 264
           +   + +L L    ++   FP     + E  + +F + F+ +V   S      G V +VP
Sbjct: 642 INSHFQELALPFLRRHA--FPALAELSDEDLRRNF-MAFAAVVMAYSFTDDTTGEVCMVP 698

Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPRE 324
            AD+LNH           K +  + +  D       Q+F +YG   N +LL  +GFV   
Sbjct: 699 VADILNHVT--------GKCNAKLYYAKD-----ALQIFNTYGSLDNQQLLQKHGFVEPT 745

Query: 325 GTNPSDSV----ELPLSLKKS------DKCYKEKLEALRKYGLSAS 360
           GT   +S+    EL  +L+ S      D   + KL+ L + G ++ 
Sbjct: 746 GTPFDESILPVEELVAALRPSFEGVLDDAAVERKLDLLLERGFASG 791


>gi|429850390|gb|ELA25672.1| set domain-containing protein, partial [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 443

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 91/204 (44%), Gaps = 30/204 (14%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAEL---------DRYLEASQIRERAIERITNVIGTYN 215
           S+ W+ Y+  LP Q      WT  E          D  L +SQ    A  +I  +   ++
Sbjct: 105 STPWTEYVKFLPPQVPVTTLWTEQEREMLVGTSLEDHVLTSSQSATAA--KIVTLTDEFD 162

Query: 216 DLRLRIFSKYPDLFPEEVF----NMETFKWSF--GILFSRLVRLPSMDGRVALVPWADML 269
           +LR       P  F  E+F     +    W+       SR + LP      A+VP  D+ 
Sbjct: 163 ELR-ETSEALP--FWNELFWESDKVSLIDWARVDAWFRSRCLELPK--SGEAMVPVLDLA 217

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFISYGK-KSNGELLLSYGFVPREG 325
           NHS +   +  Y+++S+  V    R   +   G+++ ISYG  KS  E+L SYGF+  + 
Sbjct: 218 NHSAQANAY--YEENSKDEVVLLLRPGCRVLSGDEMTISYGDAKSGAEMLFSYGFI--DP 273

Query: 326 TNPSDSVELPLSLKKSDKCYKEKL 349
            + +D + LPL+  + D   K KL
Sbjct: 274 ASAADRITLPLAPLEDDPLGKAKL 297


>gi|225554758|gb|EEH03053.1| SET domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 485

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 114/268 (42%), Gaps = 47/268 (17%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
           P  K+A  + +   RG+VA  +I + E+L  +P +LV++  +  S  +  +  ++   P 
Sbjct: 34  PKIKIADLRSEGAGRGIVADDDIGEDEELFAIPQNLVLSFQNS-SLKDLLDFNERDFDP- 91

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
           W  L   +I E     +S WS Y   LP    +L++WT  EL R L  S +    + +I 
Sbjct: 92  WLCLIVVMIYEYLQGGASTWSRYFQLLPTNFDTLMFWTDEEL-RELSGSAV----LNKIG 146

Query: 209 NVIGTYNDLR--LRIFSKYPDLFPEEVFNMETFKWSFG--ILFSRLVRLPSM-------- 256
                 N LR  L + S  P  FP  +  + +F    G   L S   R+ S+        
Sbjct: 147 RSDAEANILRNILPLVSGNPSHFP-PMSGVASFDSPEGKAALLSLAHRMGSLIMAYAFDI 205

Query: 257 -----------DGRV---------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQY 296
                      DG V          +VP AD+LN     +T  +  +  Q   + + R  
Sbjct: 206 EKGENDGGEGQDGYVTDDEEELSKGMVPLADLLN----ADTDRNNARLFQEDCYLSMRSI 261

Query: 297 QP---GEQVFISYGKKSNGELLLSYGFV 321
           +P   GE++F  YG+    +LL  YG+V
Sbjct: 262 KPIRKGEEIFNDYGELPRADLLRRYGYV 289


>gi|320167915|gb|EFW44814.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 614

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 71/317 (22%), Positives = 131/317 (41%), Gaps = 45/317 (14%)

Query: 79  LQKWLSDSGLPPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCP--E 136
           LQ + +++ +   K  I+     + GL A  +I +G++++  P +L I +      P  E
Sbjct: 106 LQDFYTENSIELIKANIRYSPETDFGLYATADIDQGDEIVRAPVTLTIASQYLEDSPLTE 165

Query: 137 AGEVLKQCSVPD-WPLLATYLISEASFEKSSRWSNYIS---------------ALPRQPY 180
             + L     PD    +A +++ E   +  S +S +I                    +  
Sbjct: 166 EMQRLFGDQQPDELTAIALHILHEKVHKSQSFYSRWIHIGAHNCSMISNGFDCVAVEELN 225

Query: 181 SLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF---------SKYPDLFPE 231
           S + W   E++      QI E  +  + +++    +   R F         + +  +   
Sbjct: 226 STVMWDFNEVNEL----QISEEFVAMMQSLVDHMQEQYHRYFEPVSKARALAGFLSIMDG 281

Query: 232 EVFNMETFKWSFGILFSRLVRLPSMDGRVA--LVPWADMLNHSCEVETFLDYDKSSQG-- 287
            +   E F+W++    +R V + S  G V+  +VP  D +NH+ +    LD+  S QG  
Sbjct: 282 IIVKPEVFQWAYLTAIARGVPMKSKTGDVSYGIVPGIDWVNHAYDNNAHLDF--SMQGRM 339

Query: 288 ---VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSD--SVELPLSLKKSD 342
              +     R    GEQ+  +Y    N +LLL +GF  R+  NP D  SV L  ++  + 
Sbjct: 340 LGSMTLRATRDIAAGEQIVRNYVPMPNNQLLLRFGFAIRD--NPHDFVSVFLDQAVGATQ 397

Query: 343 KCYKEKLEALRKYGLSA 359
              + K   LR++ L A
Sbjct: 398 MAARRK-AILRRHQLDA 413


>gi|336473420|gb|EGO61580.1| hypothetical protein NEUTE1DRAFT_58975 [Neurospora tetrasperma FGSC
           2508]
 gi|350293291|gb|EGZ74376.1| SET domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 533

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 92/234 (39%), Gaps = 39/234 (16%)

Query: 122 PSLVITADSKWSCPEAGE--VLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQP 179
           PS    AD +   PE  +       S   W LL   L+ E     SS WS Y+S LP Q 
Sbjct: 99  PSHTADADDEPPSPENDDDDAEDSQSQDSWTLLILILMHEYLQGSSSNWSPYLSILPHQF 158

Query: 180 YSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVF 234
            + ++WT AEL   L+AS +     ++ A + I   I         +F  YP   P+   
Sbjct: 159 DTPMFWTEAELAE-LQASALVAKVGKDEADKMIRTKIVKVVQENEDVF--YPAGTPKTQR 215

Query: 235 NMETFKWSFGILFSRLV---------------------------RLPSMDGRVALVPWAD 267
             E      G      +                           ++  M+  + +VP AD
Sbjct: 216 LDEGELLKLGHRMGSAIMAYAFDLAKEEDDDEDEEEEEDGWVEDKIGGMNDTMGMVPMAD 275

Query: 268 MLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           MLN        +++ ++   +  T+ R+ + GE++   YG  S+ ELL  YG+V
Sbjct: 276 MLNADAVFNAHINHGEAC--LTATSLREIKEGEEILNYYGPLSSAELLRRYGYV 327


>gi|389639446|ref|XP_003717356.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
 gi|351643175|gb|EHA51037.1| hypothetical protein MGG_06237 [Magnaporthe oryzae 70-15]
 gi|440465360|gb|ELQ34683.1| hypothetical protein OOU_Y34scaffold00748g2 [Magnaporthe oryzae
           Y34]
 gi|440490993|gb|ELQ70482.1| hypothetical protein OOW_P131scaffold00027g18 [Magnaporthe oryzae
           P131]
          Length = 500

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 55/199 (27%), Positives = 84/199 (42%), Gaps = 36/199 (18%)

Query: 155 YLISEASFEKSSRWSNYISALPRQPYSLLYWTR----AELDRYLEASQIRERAIERITNV 210
           +LI +      S W  YI++LP QP  L  W       E D  + A+     A + +  +
Sbjct: 110 FLIQQYLLGPKSHWHPYIASLP-QPEHLASWNLPPFWPEEDAAVLAATNAGVAAKEMAGI 168

Query: 211 IGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPS--------------- 255
            G  +    R   +   L     ++   ++W++ I  SR  R PS               
Sbjct: 169 AGRESKQGRRAL-RESGLENWREYSPLLYRWAYCIFTSRSFR-PSLVVPPAVWESLRNEH 226

Query: 256 --------MDGRVALVPWADMLNHSCEVETFLDYDKSSQG-----VVFTTDRQYQPGEQV 302
                   MD    L+P  D+ NH   V+   D  +S  G     V   + + Y+PGEQ+
Sbjct: 227 AKYLDGCEMDDFSILMPLFDIANHDPLVQATWD-SESVPGECRLLVNGRSGQGYRPGEQI 285

Query: 303 FISYGKKSNGELLLSYGFV 321
           F +YG K+N ELL++YGFV
Sbjct: 286 FNNYGLKTNSELLVAYGFV 304


>gi|156849027|ref|XP_001647394.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156118080|gb|EDO19536.1| hypothetical protein Kpol_1018p68 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 494

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 115/272 (42%), Gaps = 56/272 (20%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKW---SCPEAGEVLKQCSVPDWPLLATYLISE 159
           R +VA++++ +GE L  +P   ++  ++       P  G       + +W  L   ++ E
Sbjct: 41  RAMVAVEDVAEGETLFEIPRGSILNVNTSALTRDYPSFG----TSQLGEWEELILCMLYE 96

Query: 160 A-SFEKSSRWSNYISALPR--QPYSLLYWTRAELDRYLEASQIRER-----AIERITNVI 211
                ++SRW  Y + LP   +  SL+YW+  EL   L+ S + ER     + E  + V+
Sbjct: 97  MFVLGENSRWYPYFNVLPSSAELNSLIYWSDRELG-LLKPSFVIERIGRGKSQEMFSKVL 155

Query: 212 G--TYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV--------- 260
                 D  L + +KY           E F +   I+ S    +  ++ +          
Sbjct: 156 SYIENQDSDLSLIAKY--------LTWENFVYVASIIMSYSFDVEDLNPQSDEDDEIEDD 207

Query: 261 -------------ALVPWADMLN---HSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                        +++P AD LN   H C     L YDK +  +  T  +  + GE+VF 
Sbjct: 208 DNDSEMSPDKSIKSMIPLADTLNSDTHLCNAN--LMYDKET--LKMTAIKPIRAGEEVFN 263

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
            YG+  N E+L  YG+V  +G+   D  ELPL
Sbjct: 264 IYGEHPNSEILRRYGYVEWKGS-KYDFAELPL 294


>gi|356499056|ref|XP_003518360.1| PREDICTED: LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40-like
           [Glycine max]
          Length = 497

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 64/281 (22%), Positives = 116/281 (41%), Gaps = 56/281 (19%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGE-VLKQCSVPDWPLLATYLISE 159
           G RGL A +++ +GE +L VP S ++T +S     +  + V +  S+    +L   L+ E
Sbjct: 64  GRRGLGAARDLGRGEIVLRVPKSALMTRESVMEDEKLCDAVNRHSSLSPAQMLIVCLLYE 123

Query: 160 ASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYL---EASQIRERAIERITNVIGTYND 216
              + +SRW  Y+  +P Q Y +L        R L   EA  + E+A+ +  +     + 
Sbjct: 124 MG-KXTSRWHPYLVHMP-QTYDILAMFGEFEKRALQVDEAMWVTEKAMLKAKSEWKEAHA 181

Query: 217 LRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSC--- 273
           L   +      +F  +    + + W+   + S+ + +P  D    L    D+ N+     
Sbjct: 182 LMEDL------MFKPQFLTFKAWVWAAATISSQTMHIP-WDEAGCLCLVGDLFNYDAPGM 234

Query: 274 ------EVETFLD--------------------------------YDKSSQGVVFTTDRQ 295
                 ++E FL                                 +++      F +   
Sbjct: 235 EPSGIEDLEHFLSNSSIHDTSLLNGDNNIMMQTTLILTQRLTDGWFEEDVNAYCFYSRAH 294

Query: 296 YQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           Y+ G+QV + YG  +N EL+  YGF+ +E  NP+D V +PL
Sbjct: 295 YKKGDQVLLCYGIYTNLELVEHYGFLLQE--NPNDKVFIPL 333


>gi|402584499|gb|EJW78440.1| hypothetical protein WUBG_10651 [Wuchereria bancrofti]
          Length = 362

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 88/214 (41%), Gaps = 42/214 (19%)

Query: 166 SRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIF--- 222
           S W  YI  LP    + L++T  +L ++L  S + E ++    NV   +    L I    
Sbjct: 19  SHWQPYIKVLPENFNTPLFFTVEQL-QFLRPSPLFEESLLLYRNVSRQFIHFLLEIIRSD 77

Query: 223 ---------SKYPDLFPEEV--------FNMETFKWSFGILFSRLVRLPSMDGR------ 259
                     +   L P  V        F    ++WS   + +R+  +PS   R      
Sbjct: 78  QFRHRKKKSKEMSKLEPIYVNSPLTAANFTFNLYRWSVACISTRINMIPSEVLRDDIGQP 137

Query: 260 ---VALVPWADMLNHSC-------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK 309
                L+P+ DM NHS         V   +++D +    V    R Y+P E V I YG +
Sbjct: 138 RLIPGLIPFLDMANHSYIEGAFHESVHFSVEFDCAEIIAV----RDYKPLEPVNIFYGWR 193

Query: 310 SNGELLLSYGFVPREGTNPSDSVELPLSLKKSDK 343
           SN + LL  GFVP E  N  D  +L + L KS +
Sbjct: 194 SNRDFLLHNGFVPSE-KNIRDIYKLKIGLPKSKR 226


>gi|238505934|ref|XP_002384169.1| SET domain-containing protein, putative [Aspergillus flavus
           NRRL3357]
 gi|220690283|gb|EED46633.1| SET domain-containing protein, putative [Aspergillus flavus
           NRRL3357]
          Length = 418

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 71/280 (25%), Positives = 123/280 (43%), Gaps = 43/280 (15%)

Query: 104 GLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFE 163
           G++A +NI + E ++ VP   ++T++     P           P   L A +L +  + E
Sbjct: 39  GMIATRNIEEDEAIVTVPLKAMLTSER---IPSYFTSKFPDGTPTHALYAAFL-TNGNAE 94

Query: 164 KSSRWSNYISALP-RQPYSL---LYWTRAELDRYLEAS--------QIRER-AIERITNV 210
               ++ +    P RQ +     + W+ + L  YL  S        Q R++   E     
Sbjct: 95  DLEEFNAWRKTWPSRQDFEDSMPILWSES-LRNYLPPSISSHWHSIQSRDKLQYETTHQN 153

Query: 211 IGTYNDLRLR-----IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLPSM------D 257
           +    + RLR     + S +PD       + ETF + + I+ +R     +P        +
Sbjct: 154 LLAQQEQRLRTAWDIVVSIFPDT------DWETFSYHWLIVNTRSFFYLMPGQEPPEDRN 207

Query: 258 GRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
             +AL+P+AD  NHS +V   + +D   +  VF   + Y  GE++++SYG   N  L   
Sbjct: 208 DAMALLPFADYFNHS-DVACNVKFD--GENYVFRATKHYDEGEEIYMSYGPHPNDFLFAE 264

Query: 318 YGFVPREGTNPSDSVEL-PLSLKKSDKCYKEKLEALRKYG 356
           YGF   E  N S+++ L  + LK      +E+LE  + YG
Sbjct: 265 YGFYLDE--NESETLYLDDIILKDLSTSLQEELEFQQYYG 302


>gi|223995875|ref|XP_002287611.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976727|gb|EED95054.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 538

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 72/311 (23%), Positives = 124/311 (39%), Gaps = 64/311 (20%)

Query: 43  HCSVSTTNDASRTKTTVTQNMIP---WGCEIDSLENASTLQKWLSDSGLPPQKMAIQKVD 99
           H + +T   A  T     +N+ P   W  E+  ++    L  +  D G+    MAI  VD
Sbjct: 58  HLTTTTALAAISTDENTPRNIPPFQSWCAEM-GVQQMDGLDLYTQDGGV--DYMAITTVD 114

Query: 100 VGERGLVALKNIRKGEKLLFVPPSLVITAD---------SKWSCPEA----GEVLKQCSV 146
                      I  G  +L+VP  +V+++D         S     +A    G +    SV
Sbjct: 115 -----------IPAGTTILYVPSGMVLSSDRVAEELNAISNGGVADAVNQLGRIGGGSSV 163

Query: 147 PDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRY---------LEAS 197
           P + L    L+ E   +++S +  ++ +LPR     LY+    +  +            S
Sbjct: 164 PKFYLFIKMLM-EYENQENSPFYPWLDSLPR-----LYFNAVSMTDFCYECLPPLVFNLS 217

Query: 198 QIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMD 257
           +  +   +    VI      ++ + S Y         N E  KW+F  +++R       D
Sbjct: 218 RTEKVKFDNFLQVIK-----KVDVVSDYVKS------NEEVLKWAFNSVYTR--TYCHKD 264

Query: 258 GR----VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           G+    VAL P+AD  NH  + E  + +D+    + +TT         + ISYG  +N  
Sbjct: 265 GQNEDDVALTPFADYFNHGTDTEVEVCFDEEGNCMAYTT-TDVAANSPLRISYGCPTNPS 323

Query: 314 LLLS-YGFVPR 323
            L + YGF+ +
Sbjct: 324 FLFARYGFLDQ 334


>gi|358388339|gb|EHK25932.1| hypothetical protein TRIVIDRAFT_82204 [Trichoderma virens Gv29-8]
          Length = 915

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 57/232 (24%), Positives = 91/232 (39%), Gaps = 29/232 (12%)

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRE------- 201
           W +L   ++ E      S+W  Y+  LP    + ++W+ AELD  L+AS  R        
Sbjct: 542 WSILIIIMMFEYFKGDESKWKPYMDVLPASFETPMFWSGAELDE-LQASATRTKVGKADA 600

Query: 202 ------------RAIERITNVIGTYNDLRL-RIFSKYPDLFPEEVFNMETFKWSFGILFS 248
                       RA   I     +Y+D  L ++  +         F+ +           
Sbjct: 601 EEMFHAKVLPVIRANHEIFPSSQSYSDDELVQLAHRMGSTIMSYAFDFQNEDEEDEEDEE 660

Query: 249 RLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
             V        + +VP AD+LN   E    ++Y      +  T  R  + GE++   YG 
Sbjct: 661 EWVEDRESKSTMGMVPMADILNADAEYNAHVNY--GDDALTVTALRTIKAGEEILNYYGP 718

Query: 309 KSNGELLLSYGFVPREGTNPSDSVELPL-----SLKKSDKCYKEKLEALRKY 355
             N ELL  YG+V  + +   D VELP      SL  S    +++L+  R+Y
Sbjct: 719 HPNSELLRRYGYVTPKHSR-YDVVELPWKLVENSLAASLGLSEQQLDNAREY 769


>gi|146415322|ref|XP_001483631.1| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/299 (22%), Positives = 120/299 (40%), Gaps = 62/299 (20%)

Query: 79  LQKWLSD-----SGLPPQKMAIQKVDV-----GERGLVALKNIRKGEKLLFVPPSLVITA 128
           L +WLS      S  P       KV+V       RG+ A +N+   E L+ +P S ++  
Sbjct: 37  LVQWLSSPPAPFSAAPNVTHISSKVNVLSNETSGRGVYATQNVSAKETLVRIPHSFLMNT 96

Query: 129 DS----------KWSCPEAG---------------EVLKQCSVPDWPLL------ATYLI 157
           ++          K S P+ G               E+  +  +  W  L      A Y+ 
Sbjct: 97  NTIIKHISRFNGKESVPDLGYSVLLPSEYTTDQWTELYAKIPISKWLQLTAFQRTALYIC 156

Query: 158 SEASFEKSSRWSNYISALPRQP---YSLLYWTRAELDRYLEASQIRE-------RAIERI 207
            E   +++S W  +IS+LP+     ++ + W   E++  L  S+  +        +    
Sbjct: 157 LEKKRKENSFWCAFISSLPKLEELDFAPIVW---EVESELTGSKAADFFELLPRSSRNHA 213

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMDGRV---AL 262
             V+  +N+    + S++      E  N   F W++  + SR + +  PS         L
Sbjct: 214 KKVLVRFNEDYTAV-SEFLTAAKSEPLNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTL 272

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
            P+ D LNH C+ +  +  D  S+G +  +   +  G+++  SYG  SN  LL  Y F 
Sbjct: 273 APYVDFLNHDCDEKCAIKID--SRGFLVISCVDHAAGQELLFSYGPHSNEFLLCEYAFT 329


>gi|425767698|gb|EKV06264.1| hypothetical protein PDIG_78250 [Penicillium digitatum PHI26]
 gi|425780393|gb|EKV18400.1| hypothetical protein PDIP_26670 [Penicillium digitatum Pd1]
          Length = 494

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 128/305 (41%), Gaps = 44/305 (14%)

Query: 78  TLQKWLSDSGLPPQKMAIQKVDVGERG------LVALK------NIRKGEKLLFVPPSLV 125
           TL  W   +G+  Q +++ K+   + G      L+A +      N  K + LL VPP LV
Sbjct: 10  TLPAWQRLNGIVTQGISVHKIGSDQHGADKGSALIATETQMSSENDAKPKILLQVPPELV 69

Query: 126 ITADSKWSCPEAGEVLKQC--SVPDWPLLA-----TYLISEASFEKS------------S 166
           ++ ++  +  +    L+    ++ D+   A      +LI + S   S            +
Sbjct: 70  LSLETVQNQAKTDGHLRDVLEAIGDFGRTARGAILIFLIIQISHSSSDLHPKHETIGISN 129

Query: 167 RWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLR---IFS 223
            W+ Y+  LP   + L  +  AE    L  + + E  + ++  +   +  LR     I  
Sbjct: 130 PWTEYVKFLP-PSFPLPTFYTAEEQELLRGTSLAEPLVAKLAFLEREFEQLRQATGGIAW 188

Query: 224 KYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNH--SCEVETFL 279
                + E    +    W +      SRL+ LP     +A+VP  DM NH  S  V+   
Sbjct: 189 CQRSWWHERTGALTIDDWKYVDAAYRSRLLDLPGSG--LAMVPCIDMANHVSSDGVKALY 246

Query: 280 DYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTNPSDSVELPLS 337
           D D     V+     +  QPGE++ ISYG +KS  E++ SYGF+   G   +  + L L 
Sbjct: 247 DTDSEGNAVLQLRWGKTIQPGEEITISYGNEKSASEMIFSYGFL-ESGITQAREMFLNLE 305

Query: 338 LKKSD 342
           + + D
Sbjct: 306 IPEDD 310


>gi|444727366|gb|ELW67864.1| SET domain-containing protein 4 [Tupaia chinensis]
          Length = 268

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 11/107 (10%)

Query: 226 PDLFPEEVFNMETFK---WSFGILFSRLVRLPSMDGRV--------ALVPWADMLNHSCE 274
           P  FP+ V N+ ++    W++  + +R V L               AL P+ D+LNHS  
Sbjct: 116 PACFPDAVDNVFSYSALLWAWCTVNTRAVYLRHRQRECLSAEPDTYALAPYLDLLNHSPN 175

Query: 275 VETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           V+    +++ ++         Y+  E+VFI YG   N  LLL YGFV
Sbjct: 176 VQVKAAFNEETRCYEIQAASNYRKYEEVFICYGPHDNQRLLLEYGFV 222


>gi|145344581|ref|XP_001416808.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577034|gb|ABO95101.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 303

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 51/96 (53%), Gaps = 6/96 (6%)

Query: 237 ETFKWSFGILFSRLVRLPSMDGRV----ALVPWADMLNHSCEVETF-LDYDKSSQGVVF- 290
           + ++W+  ++ SR  RL    GR     ALV  AD++NHS   E    D+  +   V F 
Sbjct: 66  DEWRWAVSLVHSRTFRLEDERGRRPTRRALVAGADLINHSSVPEDVNCDWVANDADVFFI 125

Query: 291 TTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           +  +  + GE+ F+SYG++ +    L YGF+PR  +
Sbjct: 126 SATKDVRKGEEFFLSYGEQCDRHFALFYGFLPRRNS 161


>gi|428163640|gb|EKX32701.1| hypothetical protein GUITHDRAFT_121141 [Guillardia theta CCMP2712]
          Length = 135

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 37/110 (33%), Positives = 53/110 (48%), Gaps = 12/110 (10%)

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRL----VRLPSMDGRVALVPWADMLNHSCEV 275
           + F++ P LFP  + ++  + W+  I+ SR           D    LVP ADM+NH  + 
Sbjct: 11  KFFAENPGLFPSPI-DVREWMWASAIIMSRSWGQKAERAGNDKMHILVPLADMVNHDAKA 69

Query: 276 ETFLDYDKSSQG--VVFTTDRQYQPGEQVFISYGKKSNGELLLSYG-FVP 322
                    S+G  +V    R    GE+V I+YG K N EL+  YG FVP
Sbjct: 70  RKV----GISEGGAIVIYAGRNLAAGEEVCITYGDKCNMELMAHYGFFVP 115


>gi|342881738|gb|EGU82570.1| hypothetical protein FOXB_06936 [Fusarium oxysporum Fo5176]
          Length = 467

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 119/289 (41%), Gaps = 52/289 (17%)

Query: 94  AIQKVDVGERG--------LVALKNIRKGEKLLFVPPSLVITADSKW---SCPEA----- 137
           AI+ VD+ +R          VAL++I     L  +P   +I  ++       P+A     
Sbjct: 28  AIKIVDLRDRNAGRGEVNKTVALEDIPAETTLFTIPRKGIINVETSELPKKIPDAFDLDK 87

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
            +      +  W  L   +I E    ++S+W  Y   LP    + ++W+  ELD+ L+AS
Sbjct: 88  PDDDDAPGLDSWSSLILIMIYEYLQGENSKWKPYFDVLPSSFDTPMFWSDNELDQ-LQAS 146

Query: 198 QIRE-------------------RAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMET 238
            +R                    R+   I N     +   + I  +         F++E 
Sbjct: 147 HMRHKIGKADAENMFQKTLLPIIRSNAEIFNAGNKTDAELIEIAHRMGSTIMAYAFDLEN 206

Query: 239 ----FKWSFGILFSRLVRLPSMDGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTT 292
                + + G +  R       DG+  + +VP AD+LN   E    +++++ S  +  T+
Sbjct: 207 DEEEEEEADGWVEDR-------DGKSMMGMVPMADILNADAEFNAHVNHEEES--LTVTS 257

Query: 293 DRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKS 341
            R  + GE++   YG   N ELL  YG+V  E  +  D VE+P  + +S
Sbjct: 258 LRPIKAGEEILNYYGPHPNSELLRRYGYVT-EKHSRYDVVEIPWDIVES 305


>gi|429857094|gb|ELA31976.1| set domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 466

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 105/245 (42%), Gaps = 34/245 (13%)

Query: 107 ALKNIRKGEKLLFVPPSLVITADSKWSCPEAG-----------EVLKQCSVPDWPLLATY 155
           A   ++ G+ ++  P  L ++  +  + P  G             LK  SVP   +   +
Sbjct: 36  ASAGVKPGDTVVTCPLGLTLSYLNATTTPNPGFHHEDTPPFPPSFLK--SVPPHVIGRFF 93

Query: 156 LISEASFEKSSRWSNYISALPRQPYSLLYWTRAEL------------DRYLEASQIRERA 203
           LI++    K S W  YI  LP QP  L  W    L            + ++  ++I+ R 
Sbjct: 94  LINQYLLGKDSFWYPYIRTLP-QPEHLQSWALPPLWPSDDIELLEDTNIHVAITEIKARL 152

Query: 204 IERITNVIGTYNDLRLR-IFSKYPDLFPEEVFNMETFKWSFGILFSR--LVRLP---SMD 257
                  I  + +  +R  +++    +   +F   +F+ S  I  SR   + LP   ++D
Sbjct: 153 KSEYKQAIAAFGEDPVRKDYTRLLYNWAYCIFTSRSFRPSLVIPASRQHTLSLPEGCAID 212

Query: 258 GRVALVPWADMLNHSCEVETFLDY-DKSSQGVVFTTDRQYQPGEQVFISYG-KKSNGELL 315
               L+P  D+ NHS   +   D+ + +       T   Y PG+QV+ +YG  K+N EL+
Sbjct: 213 DFSLLLPLFDVGNHSTLAKISWDHPEDAVDTCALRTLDAYGPGDQVYNNYGTNKTNAELM 272

Query: 316 LSYGF 320
           L+YGF
Sbjct: 273 LAYGF 277


>gi|351695156|gb|EHA98074.1| SET domain-containing protein 4 [Heterocephalus glaber]
          Length = 449

 Score = 52.8 bits (125), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 65/154 (42%), Gaps = 22/154 (14%)

Query: 231 EEVFNMETFKWSFGILFSRLVRL---------PSMDGRVALVPWADMLNHSCEVETFLDY 281
           + VF+     W++  + +R V L         P  D   AL P+ D+LNHS  V+    +
Sbjct: 198 DRVFSYSALLWAWCTVNTRAVYLRTRRRDCLSPEPDT-CALAPYLDLLNHSPHVQVKAAF 256

Query: 282 DKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVE-----LPL 336
           ++ +      T    +  E+VFI YG   N  LLL YGFV     NP   V      L  
Sbjct: 257 NEETGCYEIRTASSCRKHEEVFICYGPHDNHRLLLEYGFVSLR--NPHACVYVSREILVR 314

Query: 337 SLKKSDKCYKEKLEALRKYGLSASECFPIQITGW 370
            L  +DK    K+  L+ +G   +  F     GW
Sbjct: 315 YLPSTDKQMNRKIAILKDHGFMENLTF-----GW 343


>gi|320034953|gb|EFW16895.1| conserved hypothetical protein [Coccidioides posadasii str.
           Silveira]
          Length = 469

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 80/330 (24%), Positives = 130/330 (39%), Gaps = 64/330 (19%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVIT-ADSKWSCPEAGEVLKQCSVPD-----WPLLATYL 156
           RG+VA + I + E+L  +P  LV++ A+SK        +  + +  D     W  L   +
Sbjct: 44  RGVVACEEIVQDEELFAIPEDLVLSVANSK--------IKDRINFADENFDTWLSLIVTM 95

Query: 157 ISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYND 216
           I E      S+WS Y   LP    +L++WT  EL R L+ S + ++  ++ T+ +    D
Sbjct: 96  IFEYLQGGVSKWSPYFGVLPTDFDTLMFWTENEL-RELQGSSVLDKIGKQETDQV--ILD 152

Query: 217 LRLRIFSKYPDLFP-------------EEV----------------FNMETFKWSFGILF 247
             L +  ++PDLFP             +EV                F++E  +       
Sbjct: 153 KVLPVVLEHPDLFPPVNGLASFDSPSGKEVVLQLAHRMGTLIMAYAFDIEMDQDEDQDGE 212

Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGE---QVFI 304
              V     +    +VP AD+LN          + +    ++    R   P     ++F 
Sbjct: 213 DGYVTDDEQEKAKGMVPLADLLNADAHRNNARLFQEDGYFIM----RSIAPISIEMEIFN 268

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE---KLEALRK------- 354
            YG+    +LL  YG++  E   P D VE+ L    +    +E   +LE L         
Sbjct: 269 DYGELPRSDLLRRYGYIT-ENYAPYDVVEISLEAICNIAGVEEGCCQLELLEDAGVLEDG 327

Query: 355 YGLSASECFPIQITGWPLELMAYAYLVVSP 384
           Y LS  E   I     P EL+     + SP
Sbjct: 328 YALSRPEGDAITTEAIPAELLILLRTLRSP 357


>gi|367036851|ref|XP_003648806.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
 gi|346996067|gb|AEO62470.1| hypothetical protein THITE_2106671 [Thielavia terrestris NRRL 8126]
          Length = 479

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 43/109 (39%), Positives = 57/109 (52%), Gaps = 10/109 (9%)

Query: 248 SRLVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFI 304
           SR + LP + G  ++VP  DMLNHS     +  YD++ Q  V    R       G+++ I
Sbjct: 202 SRCLELP-VHGE-SMVPCIDMLNHSATPSAY--YDENPQDDVVLLLRPGISLAEGDEITI 257

Query: 305 SYGK-KSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
           SYG  KS  E+L SYGF+    T  +DS+ LPLS    D   K KL A 
Sbjct: 258 SYGDAKSAAEMLFSYGFIDPRST--ADSLVLPLSPFPDDPLAKAKLVAF 304


>gi|190347905|gb|EDK40262.2| hypothetical protein PGUG_04360 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 466

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 67/298 (22%), Positives = 115/298 (38%), Gaps = 60/298 (20%)

Query: 79  LQKWLSD-----SGLPPQKMAIQKVDV-----GERGLVALKNIRKGEKLLFVPPSLVITA 128
           L +WLS      S  P       KV+V       RG+ A +N+   E L+ +P S ++  
Sbjct: 37  LVQWLSSPPAPFSAAPNVTHISSKVNVLSNETSGRGVYATQNVSAKETLVRIPHSFLMNT 96

Query: 129 DS----------KWSCPEAG---------------EVLKQCSVPDWPLL------ATYLI 157
           ++          K S P+ G               E+  +  +  W  L      A Y+ 
Sbjct: 97  NTIIKHISRFNGKESVPDLGYSVSLPSEYTTDQWTELYAKIPISKWLQLTAFQRTALYIC 156

Query: 158 SEASFEKSSRWSNYISALPRQP---YSLLYWTRAELDRYLEASQIRE------RAIERIT 208
            E   +++S W  +IS+LP+     ++ + W   E++  L  S+  +      R+     
Sbjct: 157 LEKKRKENSFWCAFISSLPKLEELDFAPIVW---EVESELTGSKAADFFELLPRSSRNHA 213

Query: 209 NVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRL--PSMDGRV---ALV 263
             +    +      S++      E  N   F W++  + SR + +  PS         L 
Sbjct: 214 KKVSVRFNEDYTAVSEFLTAAKSEPLNKMEFLWAWMCINSRCLYMSFPSSKAEADNFTLA 273

Query: 264 PWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFV 321
           P+ D LNH C+ +  +  D     V+   D  +  G+++  SYG  SN  LL  Y F 
Sbjct: 274 PYVDFLNHDCDEKCAIKIDSRGFSVISCVD--HAAGQELLFSYGPHSNEFLLCEYAFT 329


>gi|358371988|dbj|GAA88594.1| SET domain protein [Aspergillus kawachii IFO 4308]
          Length = 497

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 81/174 (46%), Gaps = 16/174 (9%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
           SS WS Y+  LP       +++  EL+  L  S +R     +I ++   +  LR     L
Sbjct: 130 SSPWSEYMKYLPSSIPLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRQSTEGL 188

Query: 220 RIFSKY--PDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
               +Y   D   +  FN   +K+   +  SR+V LP      A+VP  DM NH+ E  V
Sbjct: 189 DWCKRYWWDDDTGKLTFN--DWKYVDALYRSRMVDLPQHGH--AMVPCIDMANHAPEGTV 244

Query: 276 ETFLDYDKSSQGVVFTTD-RQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
           +   D D     V+   D R  +  E+V ISYG +KS  E++ SYGF+    T+
Sbjct: 245 KALYDEDADGNAVLQLRDGRSLRADEEVTISYGDEKSASEMIFSYGFLDEHTTD 298


>gi|317030555|ref|XP_001392774.2| SET domain protein [Aspergillus niger CBS 513.88]
          Length = 473

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 79/174 (45%), Gaps = 16/174 (9%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
           SS W+ Y+  +P       +++  EL+  L  S +R     +I ++   +  LR     L
Sbjct: 130 SSPWTEYMKYMPPAISLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGL 188

Query: 220 RIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
               KY   + E+   +    W +   +  SR+V LP      A+VP  DM NH+ E  V
Sbjct: 189 DWCEKY--WWDEDTGKLTFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTV 244

Query: 276 ETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
           +   D D     V+     R  +  E+V ISYG +KS  EL+ SYGF+    T+
Sbjct: 245 KALYDEDADGNAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 298


>gi|320586761|gb|EFW99424.1| set domain containing protein [Grosmannia clavigera kw1407]
          Length = 437

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 254 PSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGE 313
           P  D  +A+ P AD+ NH+ +    + +  +  G  F  DR+Y  G +V ISYG  ++  
Sbjct: 208 PQRDDHMAMQPVADLFNHTADGGCRVAFGPA--GFTFVADRRYDAGVEVPISYGAHADDA 265

Query: 314 LLLSYGFV 321
           LL+ YGFV
Sbjct: 266 LLVEYGFV 273


>gi|330933580|ref|XP_003304225.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
 gi|311319308|gb|EFQ87682.1| hypothetical protein PTT_16721 [Pyrenophora teres f. teres 0-1]
          Length = 476

 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 96/410 (23%), Positives = 157/410 (38%), Gaps = 79/410 (19%)

Query: 82  WLSDSGL---PPQKMA-IQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEA 137
           WL  SG    P  K+  ++  D G RG+VA + I + E L  +P + +++ ++     E 
Sbjct: 14  WLRKSGAEISPKIKLEDLRNKDAG-RGVVASQEIAEHELLFRIPRTSILSVENSILSTEI 72

Query: 138 GEVLKQCSVPDWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEAS 197
                    P W  L   ++ E     +S W+ Y + LP +  +L++WT  EL   L+AS
Sbjct: 73  PAATLSLLGP-WLSLILVMLYEYHNGSASNWAPYFAVLPTEFNTLMFWTEDELAE-LQAS 130

Query: 198 QI---------RERAIERITNVIGTYNDL-------------------RLRIFSKYPDLF 229
            +          E  +E++  VI  + D+                    L +  K   L 
Sbjct: 131 AVVGKIGKESADEAFLEQLLPVIEEFADIVFSGDEKAKDKAKEMRSPKNLELMHKMGSLI 190

Query: 230 PEEVFNME------TFKWSFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLD 280
               F++E                     LP       +VP ADMLN     C    F +
Sbjct: 191 MAYAFDVEPATPTKEVDEEGFAEEEEDAALPK-----GMVPLADMLNADADRCNARLFYE 245

Query: 281 YDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
            D      +    +  Q GE++F  YG     +LL  YG+V  +     D VE+P  L  
Sbjct: 246 KDCLEMKAL----KPIQAGEEIFNDYGPLPRSDLLRRYGYV-TDNYAQYDVVEIPTDLVS 300

Query: 341 SDKCY-----KEKLEALRK-------YGLSASECFPIQITGWPLELMAYAYLVVSPPSMK 388
               +     + +LE L +       Y ++AS  F +Q +  P        LVV   SM 
Sbjct: 301 EVLVHEGVWQENRLEYLDEQEVLDTGYDIAASTPFTLQESLSP-------ELVVLVESML 353

Query: 389 GKFEEMAAAASNKMTSKKDIKCPE-IDEQALQFILDSCESSISKYSRFLQ 437
              E+       ++ SK  +  PE I  Q    +    ++ I++Y+  L+
Sbjct: 354 LSDEDF-----ERLKSKGKLPKPEKITAQGADMLHKMLQARIAQYATTLE 398


>gi|389646769|ref|XP_003721016.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|86196443|gb|EAQ71081.1| hypothetical protein MGCH7_ch7g488 [Magnaporthe oryzae 70-15]
 gi|351638408|gb|EHA46273.1| hypothetical protein MGG_02740 [Magnaporthe oryzae 70-15]
 gi|440466942|gb|ELQ36183.1| hypothetical protein OOU_Y34scaffold00666g44 [Magnaporthe oryzae
           Y34]
 gi|440488101|gb|ELQ67845.1| hypothetical protein OOW_P131scaffold00283g3 [Magnaporthe oryzae
           P131]
          Length = 390

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 59/134 (44%), Gaps = 15/134 (11%)

Query: 242 SFGILFSRLVRLPSMDGRVALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQP 298
           +F  +  R  RL   D R+ L P AD+ NH+   C V  F D D          DR Y  
Sbjct: 164 TFYFVCPRTERLGKED-RMVLQPVADLFNHADAGCAV-AFNDED-----FTIRADRDYDA 216

Query: 299 GEQVFISYGKKSNGELLLSYGFV---PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           GE+V I YG  SN  LL  YGFV    R      D   LPL  K   +  KE+   L  Y
Sbjct: 217 GEEVLICYGNHSNDFLLAEYGFVLAANRWDEVCIDDAILPLLTKAQRELLKER-NFLGNY 275

Query: 356 GLSASE-CFPIQIT 368
            L A+  C+  ++ 
Sbjct: 276 MLDAATVCYRTEVA 289


>gi|440636170|gb|ELR06089.1| hypothetical protein GMDG_07800 [Geomyces destructans 20631-21]
          Length = 373

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 4/80 (5%)

Query: 260 VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
           +AL P+AD  NH+ +V + + +  S+ G    TDR+ + GE+++ISYG  SN  LL  YG
Sbjct: 180 LALAPFADCFNHA-DVASKVTF--STSGYDICTDRRIEKGEEIYISYGNHSNDFLLAEYG 236

Query: 320 FVPREGTNPSDSV-ELPLSL 338
           F+  E      S+ E+ LSL
Sbjct: 237 FILDENKWDEISIDEVILSL 256


>gi|312377428|gb|EFR24260.1| hypothetical protein AND_11267 [Anopheles darlingi]
          Length = 273

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 63/141 (44%), Gaps = 28/141 (19%)

Query: 241 WSFGILFSRLVRLP-------SMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVV-FTT 292
           W+   + +R  ++P        +D  +AL+P  DM NH    +   D  +SS G     T
Sbjct: 5   WAVSTVMTRQNKVPVNLSTFEELDFTLALIPLWDMANHITPEQR--DGHRSSNGSSPLVT 62

Query: 293 DRQY----------------QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           D  Y                + GE VFI+YGK+++ E L+  GF    G NP+  +    
Sbjct: 63  DTTYCSKLEKLESILQADCTKAGEPVFINYGKRTDAEFLVHNGF--SFGKNPNTRITKLF 120

Query: 337 SLKKSDKCYKEKLEALRKYGL 357
           +L ++D  YK++   L   G+
Sbjct: 121 ALNRTDSLYKKRARLLELLGV 141


>gi|255071473|ref|XP_002499410.1| predicted protein [Micromonas sp. RCC299]
 gi|226514673|gb|ACO60669.1| predicted protein [Micromonas sp. RCC299]
          Length = 323

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/119 (33%), Positives = 55/119 (46%), Gaps = 23/119 (19%)

Query: 237 ETFKWSFGILFSRLVRLP-----SMDGRVALVPWADMLNHSC-------EVETFLD---- 280
           + F+W++   +SR + LP     S     A+VP  D  NHSC       EV         
Sbjct: 116 DEFRWAYSAYWSRALSLPIGADPSAPTVEAIVPGIDFANHSCGAPNARWEVRGVRGGAPD 175

Query: 281 -YDKSSQGVVFTTDRQY----QPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
             D S  G       ++     PGE+V ISYG K+N ELL  +GF  R+  NP D++ L
Sbjct: 176 PNDPSGSGPRVELLGEFGSLPAPGEEVVISYGDKTNEELLFVHGFADRD--NPHDALVL 232


>gi|428182869|gb|EKX51728.1| hypothetical protein GUITHDRAFT_102333 [Guillardia theta CCMP2712]
          Length = 398

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 4/93 (4%)

Query: 244 GILFSRLVRLPSMDG--RVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
           G + +    LP+  G     + P  DM+NH  + ++ + Y          +   +  G+Q
Sbjct: 203 GGILANTFLLPNFLGLTHYVIAPMIDMINHDGQSKSIVTYQALQAAFEVQSSSNFNVGDQ 262

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVEL 334
           VFISYG +SN +LL  YGFV  E  N  D  E+
Sbjct: 263 VFISYGDRSNDQLLQYYGFV--EMDNVHDLYEI 293


>gi|355718768|gb|AES06378.1| SET domain containing 6 [Mustela putorius furo]
          Length = 216

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 15/173 (8%)

Query: 189 ELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFS 248
           E  R L+ + + E   + + N+   Y  + L     +PDLF   V ++E ++    ++ +
Sbjct: 4   ERRRLLQGTGVPEAVEKDLANIRSEYYSIVLPFMETHPDLFSPRVRSLELYRQLVALVMA 63

Query: 249 RLVRLPSMDGRVA-------LVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP--- 298
              + P  +           +VP AD+LNH       L+Y  +   +V T     QP   
Sbjct: 64  YSFQEPLEEEEDEKEPNSPLMVPAADILNHLANHNANLEYSPNCLRMVAT-----QPIPK 118

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
           G ++F +YG+ +N +L+  YGFV     N  D+ ++ +   +       K EA
Sbjct: 119 GHEIFNTYGQMANWQLIHMYGFVEPYPDNTDDTADIQMVTVREAALQGTKAEA 171


>gi|256079856|ref|XP_002576200.1| hypothetical protein [Schistosoma mansoni]
          Length = 330

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 42/72 (58%), Gaps = 2/72 (2%)

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKE 347
           ++F T   Y+PG+Q+F+ YG +SN +  +  GF+P+   N ++ + + L +  SD     
Sbjct: 117 LIFCTMEAYKPGDQIFMDYGNRSNDDFFMFSGFIPQ--VNLNNKLTITLGISSSDSLALT 174

Query: 348 KLEALRKYGLSA 359
           + + L+ +GLS 
Sbjct: 175 RKQLLQTFGLSV 186


>gi|241959368|ref|XP_002422403.1| ribosomal N-lysine methyltransferase, putative [Candida
           dubliniensis CD36]
 gi|223645748|emb|CAX40410.1| ribosomal N-lysine methyltransferase, putative [Candida
           dubliniensis CD36]
          Length = 579

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 48/183 (26%), Positives = 83/183 (45%), Gaps = 47/183 (25%)

Query: 180 YSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETF 239
           Y   ++T  +LD+YL                    ND  +  ++ +P+           +
Sbjct: 171 YEYKFYTDDDLDKYL--------------------NDENIENWTSFPN-----------Y 199

Query: 240 KWSFGILFSR-----LVRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR 294
            W+  IL SR     L+   +      L+P  D+LNH+ + +  + +D       F ++ 
Sbjct: 200 LWASLILKSRSFPAYLIDKNNKQDSAMLLPVVDLLNHNSKSK--VHWDIFENHFKFGSE- 256

Query: 295 QYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRK 354
             +PG+++F +YG K N ELLL+YGF      N  DSV L +      K  +EK++A+ +
Sbjct: 257 SIEPGKEIFNNYGLKGNEELLLAYGFCIE--NNLQDSVALKI------KMPEEKIKAIEE 308

Query: 355 YGL 357
           YG+
Sbjct: 309 YGV 311


>gi|270005261|gb|EFA01709.1| hypothetical protein TcasGA2_TC007289 [Tribolium castaneum]
          Length = 230

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 68/138 (49%), Gaps = 9/138 (6%)

Query: 248 SRLVRLPSMDGRVALVPWADMLNHS-CEVETFLD--YDKSSQGVVFTTDRQYQPGEQVFI 304
           +R   +P  +   AL+P  DM NH+   + T  +   D+S    V    + ++ GEQ+FI
Sbjct: 2   TRQNTIPFQEDYYALIPLWDMCNHTNGTISTAYNPVLDRSECLAV----KNFKAGEQLFI 57

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFP 364
            YG +SN +L +  GFV     N  D   + L + KSD   +++   L K  ++++  F 
Sbjct: 58  FYGSRSNADLFVHNGFVFE--NNDYDVYWIRLGISKSDPLQQKRGHLLGKLSIASTCDFS 115

Query: 365 IQITGWPLELMAYAYLVV 382
           I+    P++    A+L V
Sbjct: 116 IRKGASPIDGQLLAFLRV 133


>gi|145343084|ref|XP_001416296.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576521|gb|ABO94589.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1280

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 67/263 (25%), Positives = 105/263 (39%), Gaps = 48/263 (18%)

Query: 101 GERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEA 160
           G RG+ A++++  GE LL VP  LV+ + S     +  +  +  S      +A  L+ E 
Sbjct: 76  GTRGVEAVRDLAPGETLLRVPWGLVVESASASGDDDGDDDARWSSA-----MAMTLLEEL 130

Query: 161 SFEKSSRWSNYISALPR-QPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLRL 219
           S  +S+  + +++ LPR  P +        LD   EA  +RE   E + +          
Sbjct: 131 SEGESNERAAWLTRLPRPAPKT------PALDFDDEA--LREIEDESVVDEALAVRRAHE 182

Query: 220 RIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRVA-----LVPWADMLNHSC- 273
           R    Y +        +E  KW+  ++ SR+      DG        LVP  DM NH   
Sbjct: 183 RAREAYGERLAAIGATVEDLKWATAVVHSRV--FTRRDGAAERATRLLVPGVDMCNHDAA 240

Query: 274 -------------------------EVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGK 308
                                    E+      DK  + VV       + GE++ ISYG 
Sbjct: 241 RFNAIVRVVTSPETCQGAAATEEIAEIAPSSTMDKFFELVVDPDGETVEAGEEILISYGS 300

Query: 309 KSNGELLLSYGFVPREGTNPSDS 331
             N   L+ +GF+PR GTN +D+
Sbjct: 301 FPNDVWLMYFGFIPR-GTNVNDT 322


>gi|71656153|ref|XP_816628.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70881769|gb|EAN94777.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 565

 Score = 52.4 bits (124), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 101/244 (41%), Gaps = 25/244 (10%)

Query: 148 DWPLLATYLISEASFEKSSRWSNYISALPRQ-PYSLLYWTRAEL----------DRYLEA 196
           D PLL   LI E    ++S W+  + + P + P    +W   +L          D   + 
Sbjct: 197 DEPLLVLSLIYERYVAETSHWNELLFSCPGEYPNVPTFWDWEDLAELEGLDVLDDVLAKK 256

Query: 197 SQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSM 256
           +Q+ +   E +  +   +  L         +    E F++E   W+     SR   L ++
Sbjct: 257 AQLAQFQTETMAVLPFIHEALAGSCRLGKDEFL--ECFSIEAMMWARTTFDSRAFNL-NV 313

Query: 257 DGRV--ALVPWADMLNHSCEVETFLDYDKSSQG----VVFTTDRQYQPGEQVFISYGKKS 310
           DGRV  ALVP ADM+NH    +  +   + + G     +  +      G ++++SYG   
Sbjct: 314 DGRVVIALVPVADMINHHNRSDVLVRKVEPNGGDFVMQIGASLTAQDIGREIWMSYGPLQ 373

Query: 311 NGELLLSYGFVPREGTNPSDSVELPLSLKK---SDKCYKEKLEALRKYGLSASECFPIQI 367
           N ELL  YGFV  EG N  + +  P    +    D+    +   +  YGL  + C  I  
Sbjct: 374 NWELLQFYGFVV-EG-NEHERLPFPFDFPEGAVGDEWDGRRAALVATYGLHLAGCCWICH 431

Query: 368 TGWP 371
            G P
Sbjct: 432 DGRP 435


>gi|440633283|gb|ELR03202.1| hypothetical protein GMDG_01185 [Geomyces destructans 20631-21]
          Length = 372

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 38/65 (58%), Gaps = 9/65 (13%)

Query: 260 VALVPWADMLNHS---CEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLL 316
           +AL P+AD  NH+   C VE         +G   TT++ Y  GE+++ISYGK SN  LL 
Sbjct: 180 MALNPFADYFNHASQGCTVEF------GPEGFEITTNKVYGEGEEIYISYGKHSNDFLLA 233

Query: 317 SYGFV 321
            YGF+
Sbjct: 234 EYGFI 238


>gi|340517549|gb|EGR47793.1| hypothetical protein TRIREDRAFT_122428 [Trichoderma reesei QM6a]
          Length = 482

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 67/260 (25%), Positives = 111/260 (42%), Gaps = 34/260 (13%)

Query: 95  IQKVDVGERGLVALKNIRKGEK-------LLFVPPSLVITADSKWSCPEAGEVLKQ---- 143
           I+ ++    GLVA  +I    +       +L +P  LV++A++     +  +  KQ    
Sbjct: 25  IRNIEGKGFGLVAKHDITDESRDASGPATILRIPRDLVLSAEAVEEYAKVDQNFKQLLDV 84

Query: 144 ----CSVPDWPL-LATYLISEASFEKSSR------WSNYISALPRQPYSLLYWTRAELDR 192
                +  D  L L T+L+   +    +R      W+ YI  LPR       WT  E + 
Sbjct: 85  AGHQSTRGDIMLYLLTHLVQSKATSPGTRAFASTPWTEYIRFLPRPIPVPTMWTNDERE- 143

Query: 193 YLEASQIRERAIERITNVIGTYNDL--RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRL 250
            L+ + +      +++ +   Y+ L       S +  L  E    +E +  +     SR 
Sbjct: 144 LLKGTSLEAAVSAKLSALSSEYDKLCEEASALSFWSTLLSESA-TLEDWVLADAWYRSRC 202

Query: 251 VRLPSMDGRVALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDR---QYQPGEQVFISYG 307
           + LP      A+VP  DM NHS     +  YD+SS G V    R   +   G ++ ISYG
Sbjct: 203 LELPRAGH--AMVPGLDMANHSQSHSAY--YDESSDGDVVLLPRPGSKIPAGAEITISYG 258

Query: 308 K-KSNGELLLSYGFVPREGT 326
           + K   E+L SYGF+ ++ T
Sbjct: 259 EAKPAAEMLFSYGFIDKDST 278


>gi|219122993|ref|XP_002181819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407095|gb|EEC47033.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 579

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 52/112 (46%), Gaps = 12/112 (10%)

Query: 259 RVALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLS 317
           R  + P  DM NH S +    + ++  +      TD+    G++V+ISYG +SN +LL  
Sbjct: 333 RYVICPMIDMANHQSVKFAGQVSFEYFANAYSLATDQAIPSGDEVYISYGPRSNDQLLQY 392

Query: 318 YGFVPREGTNPSDSVELP---------LSLKKSDKCYKEKLEALRKYGLSAS 360
           YGFV R   NP+D   +P         L      K    +LE L + GL  S
Sbjct: 393 YGFVER--NNPNDVYVMPPLREWDIEALERATDRKFAVGRLEKLNRAGLLGS 442


>gi|134077289|emb|CAK45629.1| unnamed protein product [Aspergillus niger]
          Length = 498

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 79/174 (45%), Gaps = 16/174 (9%)

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDLR-----L 219
           SS W+ Y+  +P       +++  EL+  L  S +R     +I ++   +  LR     L
Sbjct: 130 SSPWTEYMKYMPPAISLPTFYSEEELE-LLRGSSLRLAVHAKIASLEKEFEHLRRSTEGL 188

Query: 220 RIFSKYPDLFPEEVFNMETFKWSF--GILFSRLVRLPSMDGRVALVPWADMLNHSCE--V 275
               KY   + E+   +    W +   +  SR+V LP      A+VP  DM NH+ E  V
Sbjct: 189 DWCEKY--WWDEDTGKLTFNDWKYVDALYRSRMVDLPRHGH--AMVPCIDMANHASEGTV 244

Query: 276 ETFLDYDKSSQGVV-FTTDRQYQPGEQVFISYG-KKSNGELLLSYGFVPREGTN 327
           +   D D     V+     R  +  E+V ISYG +KS  EL+ SYGF+    T+
Sbjct: 245 KALYDEDADGNAVLQLREGRSLRADEEVTISYGDEKSASELIFSYGFLDEHTTD 298


>gi|413925566|gb|AFW65498.1| hypothetical protein ZEAMMB73_874532 [Zea mays]
          Length = 450

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 27/34 (79%)

Query: 360 SECFPIQITGWPLELMAYAYLVVSPPSMKGKFEE 393
           SE FP+ +TGW +ELMAYA+LVVSPP M   FE+
Sbjct: 201 SESFPLWVTGWSVELMAYAFLVVSPPDMSQCFED 234


>gi|384249279|gb|EIE22761.1| SET domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 438

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 72/165 (43%), Gaps = 36/165 (21%)

Query: 221 IFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP--------SMDGR-VALVPWADMLNH 271
           +  + PDL+P      + F  + G++ SR   +         + DG  + L+P  DM+NH
Sbjct: 35  VLEQRPDLWPPASCGYDAFVHAAGMVQSRAFHMKKENWITGENEDGEELYLIPGIDMINH 94

Query: 272 SCEV-ETFLDYDKSSQGVVF------------------TTDRQYQPGEQVFISYGKKSNG 312
           S  + E     ++SS GV F                     R+   GEQ+  +YG  S+ 
Sbjct: 95  SSRLQERNTALEQSSDGVTFRRKPDLPPEEYNGGLFVMKAGRKVAAGEQILHTYGDLSDA 154

Query: 313 ELLLSYGFV--PREGTNPSDSVELPLSLKKSDKCYKEKLEALRKY 355
           +LL +YGFV  P      + +V LP S        ++ + ALR++
Sbjct: 155 QLLQTYGFVEDPARPNRHNRNVRLPTS------DLQKGIRALRRH 193


>gi|116180202|ref|XP_001219950.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
 gi|88185026|gb|EAQ92494.1| hypothetical protein CHGG_00729 [Chaetomium globosum CBS 148.51]
          Length = 510

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 94/210 (44%), Gaps = 29/210 (13%)

Query: 158 SEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVIGTYNDL 217
           S A+   S+ W+ Y+  LP        WT  E  R L      E A++   + +    DL
Sbjct: 110 SHANVGLSNPWTEYLKFLPETVLVPTLWTEDE--RLLLRGTSLEAAVDAKISALDAEFDL 167

Query: 218 RLRIFSKYPDLFP-EEVFNMETFKWSF-------GILFSRLVRLPSMDGRVALVPWADML 269
              +  K  D+    ++  ME    SF        +  SR + LP+     ++VP  DM+
Sbjct: 168 ---VREKSSDIIAWNDLLWMEGVPVSFTDWIRLDALYRSRCLELPT--SGESMVPCIDMI 222

Query: 270 NHSCEVETFLDYDKSSQGVVFTTDRQYQPG------EQVFISYGK-KSNGELLLSYGFVP 322
           NHS     +  YD++++  V    R   PG      +++ ISYG  KS  E+L SYGF+ 
Sbjct: 223 NHSATPSAY--YDENTKDEVVLLPR---PGVGISKDDEITISYGDAKSAAEMLFSYGFI- 276

Query: 323 REGTNPSDSVELPLSLKKSDKCYKEKLEAL 352
            +  + ++S+ LPL+    D   K KL A 
Sbjct: 277 -DPASSAESLVLPLNPFPDDPLAKAKLVAF 305


>gi|326427099|gb|EGD72669.1| hypothetical protein PTSG_04400 [Salpingoeca sp. ATCC 50818]
          Length = 1063

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 53/235 (22%), Positives = 90/235 (38%), Gaps = 54/235 (22%)

Query: 152 LATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERITNVI 211
           LAT ++  AS   +S+W  Y+S+LP+   + + +    L   L  + +    I+R     
Sbjct: 158 LATAIVFHAS-NPTSKWHGYLSSLPKHNLTTMTFDERAL-HLLRGTNLHHATIDRRNATA 215

Query: 212 GTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSR--------------LVRLPSMD 257
            T   +   +  K+P       F ++ + W+   + SR              L+ L  +D
Sbjct: 216 RTAATICRWLQHKWPQ--HAAAFTLDAYVWAAETISSRALSGRVSQPDTVIHLLHLGIVD 273

Query: 258 GRV------------------------------ALVPWADMLNHSCEVETFLDYDKSSQG 287
           G                                 L+P  D+ +H  + +  + +  +   
Sbjct: 274 GDTPTSASDEQSSTRTSKSVVTVAPFPVLAHTPCLLPLLDLFDHDPQAD--VTWRNTGTH 331

Query: 288 VVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF-VPREGTNPSDSVELPLSLKKS 341
           V   T     PGE VF +YG K N EL+L+YGF +P    N  D + + L L  S
Sbjct: 332 VRLITREAVAPGEPVFNNYGGKGNEELMLAYGFALP---NNKHDDMHVMLGLPAS 383


>gi|6320463|ref|NP_010543.1| Rkm4p [Saccharomyces cerevisiae S288c]
 gi|46577338|sp|Q12504.1|RKM4_YEAST RecName: Full=Ribosomal N-lysine methyltransferase 4; AltName:
           Full=SET domain-containing protein 7
 gi|1136212|emb|CAA92714.1| unknown [Saccharomyces cerevisiae]
 gi|1226033|emb|CAA94096.1| unknown [Saccharomyces cerevisiae]
 gi|51830266|gb|AAU09704.1| YDR257C [Saccharomyces cerevisiae]
 gi|190404795|gb|EDV08062.1| hypothetical protein SCRG_00269 [Saccharomyces cerevisiae RM11-1a]
 gi|259145494|emb|CAY78758.1| Set7p [Saccharomyces cerevisiae EC1118]
 gi|285811273|tpg|DAA12097.1| TPA: Rkm4p [Saccharomyces cerevisiae S288c]
 gi|323349272|gb|EGA83501.1| Set7p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365766338|gb|EHN07836.1| Set7p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392300372|gb|EIW11463.1| Rkm4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 494

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  V                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|157167893|ref|XP_001662890.1| hypothetical protein AaeL_AAEL002998 [Aedes aegypti]
 gi|108881507|gb|EAT45732.1| AAEL002998-PA [Aedes aegypti]
          Length = 259

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 55/106 (51%), Gaps = 5/106 (4%)

Query: 261 ALVPWADMLNH-SCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYG 319
            L+P  DM NH + ++ T   Y++  Q V   T + +  GEQ+FI YG ++N + L+  G
Sbjct: 33  VLIPLWDMANHINGQITT--GYNEELQRVESQTLKAFAKGEQIFIHYGNRTNADFLVHNG 90

Query: 320 FVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSASECFPI 365
           FV  + +N    V + L+L   +  + ++ E L K  +  +  F +
Sbjct: 91  FVFPDNSNT--EVTIQLALNSGEDLFDQRKELLEKLNVPIAGEFTV 134


>gi|256270722|gb|EEU05884.1| Set7p [Saccharomyces cerevisiae JAY291]
          Length = 494

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  V                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|151942233|gb|EDN60589.1| SET domain-containing protein [Saccharomyces cerevisiae YJM789]
          Length = 494

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 62/273 (22%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  +                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSINENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|346980096|gb|EGY23548.1| SET domain-containing protein RMS1 [Verticillium dahliae VdLs.17]
          Length = 469

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 69/297 (23%), Positives = 126/297 (42%), Gaps = 43/297 (14%)

Query: 81  KWLSDSGLPPQKMAIQKVDV----GERGLVALKNIRKGEKLLFVPPSL----VITADSKW 132
           +W   +G   +   +Q VD+      RG++A ++I + E +LF  P      V+T++   
Sbjct: 13  QWFKAAGGEFRDDLLQIVDLRPQAAGRGIIATRDIPE-ETILFTIPRQAIINVLTSELPQ 71

Query: 133 SCPEA--GEV--LKQCSVP--DWPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWT 186
             P+   G +  +   + P   W  L   ++ E     +SRW  Y   LP+Q  + ++W+
Sbjct: 72  KLPQVFDGSIDEMDDNAEPLDSWGQLILVMLYEVLQGDASRWKPYFDILPQQFDTPIFWS 131

Query: 187 RAELDRYLEASQIRERAIERITNVIGTYNDLRLRIFSKYPDLF-PE--------EVFNME 237
             EL   L+ + +    I ++ +    +    L I    P +F PE        E+ ++ 
Sbjct: 132 DGEL-LELQGTSLTAEKIGKVES-DAMFRSKILPIVQANPAIFYPEGAAQPTEDELLHLA 189

Query: 238 ------TFKWSFGILFSRLVR------LPSMDGR--VALVPWADMLNHSCEVETFLDYDK 283
                    ++F +             +   +GR  + +VP AD LN + E    +++ +
Sbjct: 190 HRMGSTIMAYAFDLENDDENENEEDGWVEDREGRTMLGMVPMADTLNANAEFNAHINHGE 249

Query: 284 SSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKK 340
           S +      D   + G+Q+   YG     ELL  YG+V  E +   D VE+P +L K
Sbjct: 250 SLEATAIRAD--IKAGDQILNYYGPLPTSELLRRYGYVTPEHSR-YDVVEVPWTLVK 303


>gi|349577313|dbj|GAA22482.1| K7_Set7p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 494

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 62/273 (22%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  +                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSINENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|323355591|gb|EGA87411.1| Set7p [Saccharomyces cerevisiae VL3]
          Length = 515

 Score = 51.6 bits (122), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISA--LPRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  V                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|323334121|gb|EGA75505.1| Set7p [Saccharomyces cerevisiae AWRI796]
          Length = 515

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 106/273 (38%), Gaps = 76/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++    R+ +          F  + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFS----RVATS---------FEFDNFAYIASIILSYSFDLE 193

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  V                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 194 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 251

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 252 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 284


>gi|296810368|ref|XP_002845522.1| SET domain-containing protein [Arthroderma otae CBS 113480]
 gi|238842910|gb|EEQ32572.1| SET domain-containing protein [Arthroderma otae CBS 113480]
          Length = 491

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 52/241 (21%), Positives = 94/241 (39%), Gaps = 27/241 (11%)

Query: 105 LVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASFEK 164
           L A+++I + E+L  +P  L+++ ++  +    G  L +  +  W  L   +I E    +
Sbjct: 61  LGAVRDIAEDEELFVIPEDLILSVENSKAREALG--LNETQLGPWLSLIIVMIYEYYQGE 118

Query: 165 SSRWSNYISALPRQPYSLLYWTRAELDRY--------LEASQIRERAIERITNVIGTY-- 214
            SRW  Y   LP    +L++WT A+L           +  S   E  ++++  +I     
Sbjct: 119 QSRWEPYFHILPTSFDTLMFWTEAQLQELQGCAVVDKIGKSAADEAILQKVVPLIQANPH 178

Query: 215 -------------NDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV- 260
                        ND  L +  +   L     F++E  + +              D    
Sbjct: 179 HFPARSGMPPLDSNDALLCLAHRMGSLIMAYAFDIEKTEGADDDAAEDGYMTDDEDEPAK 238

Query: 261 ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKKSNGELLLSYGF 320
            +VP AD+ N   +      + +     V    R  Q GE++F  YG+    +LL  YG+
Sbjct: 239 GMVPLADIFNADAQRNNARLFQEEG-SFVMKAIRNIQAGEEIFNDYGELPRADLLRRYGY 297

Query: 321 V 321
           V
Sbjct: 298 V 298


>gi|308798945|ref|XP_003074252.1| unnamed protein product [Ostreococcus tauri]
 gi|116000424|emb|CAL50104.1| unnamed protein product [Ostreococcus tauri]
          Length = 405

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 70/165 (42%), Gaps = 25/165 (15%)

Query: 207 ITNVIGTYNDLRLRIFSKYPDLFPEEVFNM-ETFKWSFGILFSRLVR---LPSMDGRVAL 262
           +T + GT    RLR    +          M +   W+ G + S  ++   +P      AL
Sbjct: 202 LTALDGTVTAARLRKRGDFVRALASSTGLMVKDVSWAIGAVSSHAMKSEIVP-----YAL 256

Query: 263 VPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFISYGKK-SNGELLLSYGFV 321
           VP  D+L+HS      +  D+++  V   + R   PGE++ ISYGK   N   L  YGF 
Sbjct: 257 VPGCDLLDHSTTPNCVVRRDETTNDVFCASTRDVAPGEKLTISYGKSLCNDRALRMYGFA 316

Query: 322 PRE--------------GTNPSDSVELPLSLKKSDKCYKEKLEAL 352
            RE              G +PS+      S+ +S+K +  +  AL
Sbjct: 317 SRELYSNDARVLPGGFRGVHPSNEA-FDASVDESEKVFGGRKAAL 360


>gi|1150596|emb|CAA86307.1| putative transcription regulator [Saccharomyces cerevisiae]
          Length = 496

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 105/273 (38%), Gaps = 74/273 (27%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEAS- 161
           R +VA + I+K E L  +P S V++  +        +++K     D+P L    ++E   
Sbjct: 39  RAVVATQKIKKDETLFKIPRSSVLSVTT-------SQLIK-----DYPSLKDKFLNETGS 86

Query: 162 --------------FEKSSRWSNYISAL--PRQPYSLLYWTRAELDRYL----------- 194
                          ++ SRW+ Y      P    +L++W   EL               
Sbjct: 87  WEGLIICILYEMEVLQERSRWAPYFKVWNKPSDMNALIFWDDNELQLLKPSLVLERIGKK 146

Query: 195 EASQIRERAIERITNVIGTYNDLRLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLP 254
           EA ++ ER I+ I  + G ++       SK+           + F +   I+ S    L 
Sbjct: 147 EAKEMHERIIKSIKQIGGEFSTCVANCPSKF-----------DNFAYIASIILSYSFDLE 195

Query: 255 SMDGRV--------------------ALVPWADMLN-HSCEVETFLDYDKSSQGVVFTTD 293
             D  V                    +++P ADMLN  + +    L YD +   +V    
Sbjct: 196 MQDSSVNENEEEETSEEELENERYLKSMIPLADMLNADTSKCNANLTYDSNCLKMVAL-- 253

Query: 294 RQYQPGEQVFISYGKKSNGELLLSYGFVPREGT 326
           R  +  EQV+  YG+  N ELL  YG+V  +G+
Sbjct: 254 RDIEKNEQVYNIYGEHPNSELLRRYGYVEWDGS 286


>gi|159122413|gb|EDP47534.1| SET domain protein [Aspergillus fumigatus A1163]
          Length = 492

 Score = 51.6 bits (122), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 126/325 (38%), Gaps = 70/325 (21%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPDWPLLATYLISEASF 162
           RG+VA  +I  GE+L  +P  LV++A +        + L++     W  L   ++ E   
Sbjct: 48  RGVVARSDIFDGEELFSIPRGLVLSAQNSKLKDLLSQDLEELGP--WLSLILVMMYEYLL 105

Query: 163 EKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERAIERITNVIGTYNDL 217
            + S W+ Y   LP+   +L++W+ +EL R L+ S I     +E A + I  +I      
Sbjct: 106 GEQSAWAPYFKILPKSFDTLMFWSPSEL-RELQGSAIVSKIGKEGAEDSIMQMIAP---- 160

Query: 218 RLRIFSKYPDLFPEEVFNMETFKWSFGILFSRLVRLPSMDGRV----------------- 260
              +    P LFP  V  + ++    G     L+RL  + G +                 
Sbjct: 161 ---VVRANPSLFP-SVDGLASWDGEAG--SHALLRLAHIMGSLIMAYAFDIEKVEDEDDE 214

Query: 261 -------------------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQ 301
                               +VP AD+LN   +      + +    +V    +  + GE+
Sbjct: 215 NNDEEDGYVTDDEQDQSSKGMVPLADILNADADRNNARLF-QEDDSLVMKAIKPIRVGEE 273

Query: 302 VFISYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEALRKYGLSAS- 360
           +F  YG+    +LL  YG+V  +     D VEL L             +  R  GL  + 
Sbjct: 274 IFNDYGELPRADLLRRYGYV-TDNYAQYDVVELSLD------------QICRSAGLQNAD 320

Query: 361 -ECFPIQITGWPLELMAYAYLVVSP 384
            E +P       LEL+   Y++  P
Sbjct: 321 IESYPPLAFLEDLELLDDGYVIPRP 345


>gi|326435209|gb|EGD80779.1| hypothetical protein PTSG_01368 [Salpingoeca sp. ATCC 50818]
          Length = 627

 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 44/185 (23%), Positives = 79/185 (42%), Gaps = 24/185 (12%)

Query: 150 PLLATYLISEASFEKSSRWSNYISALPRQP--YSLLYWTRAELDRYLEASQIRERAIERI 207
           PLL   ++   + E +S ++ Y + LP     +    WT  E    L+ S+++E     +
Sbjct: 179 PLLLAMMLDMDAGE-ASEFAPYFNILPEDDELHHPHVWTDRERSTLLKDSRLQEDVARDL 237

Query: 208 TNVIGTYNDLRLRIFSKYPDLFPE---EVFNMETFKWSFGILFSRLVRLPSMDGRVALVP 264
           T +   Y+ +      ++P +FP+   + F+   +     I+          DGRV LVP
Sbjct: 238 TLMKREYDTIAKPFMIRHPKIFPQPGKKAFSFRKYAQCAAIVMGYSF-TDEEDGRVCLVP 296

Query: 265 WADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQ--------PGEQVFISYGKKSNGELLL 316
            AD+LNH             +   +F +D+  Q         G ++F +YG   N +L+ 
Sbjct: 297 VADILNH---------VTGKNNARLFFSDKTLQMRSIKRIPAGAEIFNTYGDLDNLQLVQ 347

Query: 317 SYGFV 321
            +GF 
Sbjct: 348 QHGFA 352


>gi|226294776|gb|EEH50196.1| SET domain-containing protein [Paracoccidioides brasiliensis Pb18]
          Length = 488

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 112/278 (40%), Gaps = 37/278 (13%)

Query: 89  PPQKMAIQKVDVGERGLVALKNIRKGEKLLFVPPSLVITADSKWSCPEAGEVLKQCSVPD 148
           P  K+A  + +   RG+VA  +I + E+L  +P  LV++  +     +  E+  +  +  
Sbjct: 34  PKIKIADLRSEGAGRGIVAYDDINEEEELFAIPQGLVLSFQNS-KLKDLMEI-NERDLGQ 91

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQIRERAIERIT 208
           W  L   +I E     +S W+ Y   LP    +L++WT AEL   L+ S +  R I + T
Sbjct: 92  WLCLILVMIYEYLQGAASPWAPYFKVLPTDFDTLMFWTDAEL-LELKGSAVLGR-IGKST 149

Query: 209 NVIGTYNDLRLRIFSKYPDLFP--------------------EEVFNMETFKWSFGILFS 248
                  DL L + SK  +LFP                              ++F +   
Sbjct: 150 AEEVFLRDL-LPLVSKNSELFPLTGGLLSYNSPDGKAALLSLAHRMGSLIMSYAFDVEND 208

Query: 249 RLVRLPSMDGRV----------ALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQP 298
               +   DG V           ++P AD+LN   +      + +    +   + +  + 
Sbjct: 209 EAEEVEGEDGYVTDDEERQLPKGMIPLADLLNADADRNNARLFQEDGY-LSMKSIKSIRK 267

Query: 299 GEQVFISYGKKSNGELLLSYGFVPREGTNPSDSVELPL 336
           GE++F  YG+    ELL  YG+V  +     D  E+P+
Sbjct: 268 GEEIFNDYGELPRAELLRRYGYV-TDSYAQYDEAEVPI 304


>gi|171676308|ref|XP_001903107.1| hypothetical protein [Podospora anserina S mat+]
 gi|170936220|emb|CAP60879.1| unnamed protein product [Podospora anserina S mat+]
          Length = 495

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 121/287 (42%), Gaps = 44/287 (15%)

Query: 103 RGLVALKNIRKGEKLLFVPPSLVITADS---KWSCPE-----------AGEVLKQCSVPD 148
           RG+VA  +I     L  +P + ++ A +   K   PE           +G+     +   
Sbjct: 46  RGIVAQADIAADTVLFTIPRNSILCAATSPLKDILPEIFDLDNDDEDESGDESDGDNQNS 105

Query: 149 WPLLATYLISEASFEKSSRWSNYISALPRQPYSLLYWTRAELDRYLEASQI-----RERA 203
           W LL   LI E     SS+W  Y+  LP    + ++WT ++L  +L+AS +     +E A
Sbjct: 106 WTLLILILIHEYLQGSSSQWKPYLDVLPSTFNTPMFWTPSQL-SFLQASAVTSKIGQEEA 164

Query: 204 IERITNVIGTYNDLRLRIF---SKYP---DLFPEEVFNMETFKWSFGILFSRLVRLPSM- 256
            + I + I        +IF   S  P   D   +    M +   S+     + + +P   
Sbjct: 165 DKMIASKILPVIRSHPQIFFPSSATPLSDDQLIQLAHRMGSTIMSYAFDLEQDMEIPEQL 224

Query: 257 ----------DGR--VALVPWADMLNHSCEVETFLDYDKSSQGVVFTTDRQYQPGEQVFI 304
                     +G+  + +VP AD+LN   E    +++  +   +   + R  + GE++  
Sbjct: 225 ENDDEWEEDREGKTMLGMVPMADILNADAEFNAHINH--AEDALTAVSLRPIRKGEEILN 282

Query: 305 SYGKKSNGELLLSYGFVPREGTNPSDSVELPLSLKKSDKCYKEKLEA 351
            YG  S+ ELL  YG+V  E     D VEL  SL  S    + KL+A
Sbjct: 283 FYGPLSSAELLRRYGYVT-EKHARWDVVELSWSLISS--ALQSKLQA 326


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.133    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,702,432,964
Number of Sequences: 23463169
Number of extensions: 270060229
Number of successful extensions: 625139
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 460
Number of HSP's successfully gapped in prelim test: 1116
Number of HSP's that attempted gapping in prelim test: 622485
Number of HSP's gapped (non-prelim): 1997
length of query: 442
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 296
effective length of database: 8,933,572,693
effective search space: 2644337517128
effective search space used: 2644337517128
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)